BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 014995
(415 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 152/398 (38%), Positives = 226/398 (56%), Gaps = 37/398 (9%)
Query: 41 LPPNVCNRTRTALPQGPD-KASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLK 99
+P + C+ + P+G D +ASLEVV K+GPCS+L ++PS +IL QD+ R+
Sbjct: 1 MPSSACSPS----PKGHDQRASLEVVHKHGPCSKLRPH-KANSPSHTQILAQDESRVASI 55
Query: 100 NSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWT 158
SR + + T P+ T+ Y+V V +G PK+ ++ + DTGSD+TWT
Sbjct: 56 QSRLAKNLAGGSNLKASKATLPSKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWT 115
Query: 159 QCKPCI-HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGN---CNSKECPFNIQ 214
QC+PC+ +C+QQR+ F S S ++ + C+S SC L + GN C+S C + I+
Sbjct: 116 QCEPCVGYCYQQREHIFDPSTSLSYSNVSCDSPSCEKLESAT--GNSPGCSSSTCLYGIR 173
Query: 215 YADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSI 274
Y DGS S GF+A +++++ S F + F GC N+ G G +G++GL R+P+S+
Sbjct: 174 YGDGSYSIGFFAREKLSL---TSTDVFNNFQF--GCGQNNRGLFGGTAGLLGLARNPLSL 228
Query: 275 ITRTNTSY---FSYCLPSPYGSTGYITFGKTD---------------TVNTLRSAFHKRM 316
+++T Y FSYCLPS STGY++FG D ++++ F + M
Sbjct: 229 VSQTAQKYGKVFSYCLPSSSSSTGYLSFGSGDGDSKAVKFTPRLPPTVYSSVQKVFRELM 288
Query: 317 KKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGF 376
Y + KG+ +LDTCYDLS Y+TV VPKI ++F GG +++L G + V VSQVCL F
Sbjct: 289 SDYPRVKGVS-ILDTCYDLSKYKTVKVPKIILYFSGGAEMDLAPEGIIYVLKVSQVCLAF 347
Query: 377 ATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
A D +GNVQQ+ V YD A R+GF P C
Sbjct: 348 AGNSDDDEVAIIGNVQQKTIHVVYDDAEGRVGFAPSGC 385
>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
Length = 376
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 150/417 (35%), Positives = 218/417 (52%), Gaps = 64/417 (15%)
Query: 8 FLLFICLLCSSNNGAYADDNDL----SHSHIVSVSSLLPPNVCNRTRTALPQGPDK-ASL 62
FLL+ LL S A+ S H V ++SL+P +VC+ + P+G DK ASL
Sbjct: 13 FLLYSALLSSKRGLAFQGRKTALSTPSTLHNVHITSLMPSSVCSPS----PKGDDKRASL 68
Query: 63 EVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPA 122
EV+ K+GPCS+L+Q +PS ++L QD+ R++ SR + P + T P+
Sbjct: 69 EVIHKHGPCSKLSQD-KGRSPSRTQMLDQDESRVNSIRSRLAKNPADGGKLKGSKVTLPS 127
Query: 123 NINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCI-HCFQQRDPFFYASKSK 180
T+ Y+V V +G PK+ ++ + DTGSD+TWTQC+PC +C+ Q++P F SKS
Sbjct: 128 KSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPSKST 187
Query: 181 TFFKIPCNSTSCRILRESFPFGN---CNSKECPFNIQYADGSGSGGFWATDRITIQEANS 237
++ I C+S +C L+ GN C++ C + IQY D S S GF+A D++ + S
Sbjct: 188 SYTNISCSSPTCDELKSG--TGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKLAL---TS 242
Query: 238 NGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYI 297
F FL GC N+ G G +G++GL R+ +S+
Sbjct: 243 TDVFNN--FLFGCGQNNRGLFVGVAGLIGLGRNALSL----------------------- 277
Query: 298 TFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLE 357
M KY KA +LDTCYD S Y+TV VPKI ++F G +++
Sbjct: 278 ------------------MSKYPKA-APASILDTCYDFSQYDTVDVPKINLYFSDGAEMD 318
Query: 358 LDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
LD G + ++SQVCL FA + LGNVQQ+ +V YDVAG R+GF PG C
Sbjct: 319 LDPSGIFYILNISQVCLAFAGNSDATDIAILGNVQQKTFDVVYDVAGGRIGFAPGGC 375
>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 169/456 (37%), Positives = 238/456 (52%), Gaps = 81/456 (17%)
Query: 31 HSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILR 90
HSH + VSSLLP C + L +KASL+VV K+GPCS+L+Q ++ AP+ EIL
Sbjct: 45 HSHSIEVSSLLPSASCKPSTKVLSNNDNKASLKVVHKHGPCSKLSQDEASAAPTHTEILL 104
Query: 91 QDQQRLHLKNSR--RLRKPFPEFLKRTEAFTFPANINDTVADEYYIV-VAIGEPKQYVSL 147
QDQ R+ +SR + + +K T++ T PA TV YIV V +G PK+ +SL
Sbjct: 105 QDQSRVKSIHSRLSNSKTSGGKDVKVTDSTTIPAKDGSTVGSGNYIVTVGLGTPKKDLSL 164
Query: 148 LLDTGSDVTWTQCKPCIH-CFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGN--- 203
+ DTGSD+TWTQC+PC C++Q++ F S+S ++ I C+S+ C L + GN
Sbjct: 165 IFDTGSDITWTQCQPCARSCYKQKEQIFDPSQSTSYTNISCSSSICNSLTSAT--GNTPG 222
Query: 204 CNSKECPFNIQYADGSGSGGFWATDRITIQ--EANSNGYFTRYPFLLGCINNSSGDKSGA 261
C S C + IQY D S S GF+ T+++T+ +A +N YF GC N+ G G+
Sbjct: 223 CASSACVYGIQYGDSSFSVGFFGTEKLTLTSTDAFNNIYF-------GCGQNNQGLFGGS 275
Query: 262 SGIMGLDRSPVSIITRTNTSY---FSYCL------------------------------- 287
+G++GL R +S++++T Y FSYCL
Sbjct: 276 AGLLGLGRDKLSVVSQTAQKYNKIFSYCLPSSSSSTGFLTFGGSASKNAKFTPLSTISAG 335
Query: 288 PSPYGS--TGYITFGK-----------------TDTVNT---------LRSAFHKRMKKY 319
PS YG TG GK + TV T LR++F M KY
Sbjct: 336 PSFYGLDFTGISVGGKKLAISASVFSTAGAIIDSGTVITRLPPAAYSALRASFRNLMSKY 395
Query: 320 KKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATY 379
K L +LDTCYD S+Y T+ VPKI F G+++++D G L +S+SQVCL FA
Sbjct: 396 PMTKALS-ILDTCYDFSSYTTISVPKIGFSFSSGIEVDIDATGILYASSLSQVCLAFAGN 454
Query: 380 PPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ GNVQQ+ EV YD + ++GF PG CS
Sbjct: 455 SDATDVFIFGNVQQKTLEVFYDGSAGKVGFAPGGCS 490
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 164/475 (34%), Positives = 239/475 (50%), Gaps = 68/475 (14%)
Query: 3 ILSKAFLLFICLLCSSNNGAYADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASL 62
+L+ +L +CL N GA + D SH+ VS + C + A K+SL
Sbjct: 7 LLNIIIILCVCLNLGCNEGAQEREIDDSHTIQVSSLFPASSSSCVLSPRA---STTKSSL 63
Query: 63 EVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPA 122
V ++G CSRLN G +T +P EILR DQ R++ +S+ +K + ++++ PA
Sbjct: 64 HVTHRHGTCSRLNNGKAT-SPDHVEILRLDQARVNSIHSKLSKKLTTNHVSQSQSTDLPA 122
Query: 123 NINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIH-CFQQRDPFFYASKSK 180
T+ YIV V +G PK +SL+ DTGSD+TWTQC+PC+ C+ Q++P F SKS
Sbjct: 123 KDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKST 182
Query: 181 TFFKIPCNSTSCRILRESFP-FGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA---- 235
+++ + C+S +C L + G+C++ C + IQY D S S GF A D+ T+ +
Sbjct: 183 SYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKDKFTLTSSDVFD 242
Query: 236 --------NSNGYFTRYPFLLG----------------------CINNSSGDKSGAS-GI 264
N+ G FT LLG C+ +S+ + G
Sbjct: 243 GVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPSSASYTGHLTFGS 302
Query: 265 MGLDRS----PVSIITRTNTSYFSYCLPS--------PYGSTGYITFGK---TDTVNT-- 307
G+ RS P+S IT TS++ + + P ST + T G + TV T
Sbjct: 303 AGISRSVKFTPISTITD-GTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITRL 361
Query: 308 -------LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDV 360
LRS+F +M KY G+ +LDTC+DLS ++TV +PK+A F GG +EL
Sbjct: 362 PPKAYAALRSSFKAKMSKYPTTSGVS-ILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGS 420
Query: 361 RGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+G +SQVCL FA D N+ GNVQQ+ EV YD AG R+GF P CS
Sbjct: 421 KGIFYAFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 475
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 223 bits (569), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 156/476 (32%), Positives = 235/476 (49%), Gaps = 77/476 (16%)
Query: 8 FLLFICLLCSSNNGAYADDNDL----SHSHIVSVSSLLPPNVCNRTRTALPQGPDK-ASL 62
FLL+ LL S A+ S H V ++SL+P +VC+ + P+G DK ASL
Sbjct: 13 FLLYSALLSSKRGLAFQGRKTALSTPSTLHNVHITSLMPSSVCSPS----PKGDDKRASL 68
Query: 63 EVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPA 122
EV+ K+GPCS+L+Q +PS ++L QD+ R++ SR + P + T P+
Sbjct: 69 EVIHKHGPCSKLSQD-KGRSPSRTQMLDQDESRVNSIRSRLAKNPADGGKLKGSKVTLPS 127
Query: 123 NINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCI-HCFQQRDPFFYASKSK 180
T+ Y+V V +G PK+ ++ + DTGSD+TWTQC+PC +C+ Q++P F SKS
Sbjct: 128 KSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPSKST 187
Query: 181 TFFKIPCNSTSCRILRESFPFGN---CNSKECPFNIQYADGSGSGGFWATDRITIQEA-- 235
++ I C+S +C L+ GN C++ C + IQY D S S GF+A D++ +
Sbjct: 188 SYTNISCSSPTCDELKSGT--GNSPSCSASTCVYGIQYGDQSYSVGFFAQDKLALTSTDV 245
Query: 236 ----------NSNGYFTRYPFLLG----------------------CINNSSGDK----- 258
N+ G F L+G C+ ++S
Sbjct: 246 FNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLVSQTAQKYGKLFSYCLPSTSSSTGYLTF 305
Query: 259 -SGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYG----STGYITFGKTDTV-------- 305
SG + +P + ++ + YF + G ST F T+
Sbjct: 306 GSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVFSTAGTIIDSGTVIS 365
Query: 306 -------NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLEL 358
+ LR++F ++M KY KA +LDTCYD S Y+TV VPKI ++F G +++L
Sbjct: 366 RLPPTAYSDLRASFQQQMSKYPKA-APASILDTCYDFSQYDTVDVPKINLYFSDGAEMDL 424
Query: 359 DVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
D G + ++SQVCL FA + LGNVQQ+ +V YDVAG R+GF PG C
Sbjct: 425 DPSGIFYILNISQVCLAFAGNSDATDIAILGNVQQKTFDVVYDVAGGRIGFAPGGC 480
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 150/419 (35%), Positives = 218/419 (52%), Gaps = 65/419 (15%)
Query: 59 KASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAF 118
K+SL V ++G CSRLN G +T +P EILR DQ R++ +S+ +K + + +++
Sbjct: 59 KSSLHVTHRHGTCSRLNNGKAT-SPDHVEILRLDQARVNSIHSKLSKKLATDHVSESKST 117
Query: 119 TFPANINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIH-CFQQRDPFFYA 176
PA T+ YIV V +G PK +SL+ DTGSD+TWTQC+PC+ C+ Q++P F
Sbjct: 118 DLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNP 177
Query: 177 SKSKTFFKIPCNSTSCRILRESFP-FGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA 235
SKS +++ + C+S +C L + G+C++ C + IQY D S S GF A ++ T+ +
Sbjct: 178 SKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTNS 237
Query: 236 ------------NSNGYFTRYPFLLG----------------------CINNSSGDKSGA 261
N+ G FT LLG C+ +S+
Sbjct: 238 DVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPSSASYTGHL 297
Query: 262 S-GIMGLDRS----PVSIITRTNTSYFSYCLPS--------PYGSTGYITFGK---TDTV 305
+ G G+ RS P+S IT TS++ + + P ST + T G + TV
Sbjct: 298 TFGSAGISRSVKFTPISTITD-GTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTV 356
Query: 306 NT---------LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
T LRS+F +M KY G+ +LDTC+DLS ++TV +PK+A F GG +
Sbjct: 357 ITRLPPKAYAALRSSFKAKMSKYPTTSGVS-ILDTCFDLSGFKTVTIPKVAFSFSGGAVV 415
Query: 357 ELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
EL +G V +SQVCL FA D N+ GNVQQ+ EV YD AG R+GF P CS
Sbjct: 416 ELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 474
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 220 bits (561), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 149/419 (35%), Positives = 218/419 (52%), Gaps = 65/419 (15%)
Query: 59 KASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAF 118
++SL V ++G CSRLN G +T +P EILR DQ R++ +S+ +K + + +++
Sbjct: 31 ESSLHVTHRHGTCSRLNNGKAT-SPDHVEILRLDQARVNSIHSKLSKKLATDHVSESKST 89
Query: 119 TFPANINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIH-CFQQRDPFFYA 176
PA T+ YIV V +G PK +SL+ DTGSD+TWTQC+PC+ C+ Q++P F
Sbjct: 90 DLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNP 149
Query: 177 SKSKTFFKIPCNSTSCRILRESFP-FGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA 235
SKS +++ + C+S +C L + G+C++ C + IQY D S S GF A ++ T+ +
Sbjct: 150 SKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTNS 209
Query: 236 ------------NSNGYFTRYPFLLG----------------------CINNSSGDKSGA 261
N+ G FT LLG C+ +S+
Sbjct: 210 DVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPSSASYTGHL 269
Query: 262 S-GIMGLDRS----PVSIITRTNTSYFSYCLPS--------PYGSTGYITFGK---TDTV 305
+ G G+ RS P+S IT TS++ + + P ST + T G + TV
Sbjct: 270 TFGSAGISRSVKFTPISTITD-GTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTV 328
Query: 306 NT---------LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
T LRS+F +M KY G+ +LDTC+DLS ++TV +PK+A F GG +
Sbjct: 329 ITRLPPKAYAALRSSFKAKMSKYPTTSGVS-ILDTCFDLSGFKTVTIPKVAFSFSGGAVV 387
Query: 357 ELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
EL +G V +SQVCL FA D N+ GNVQQ+ EV YD AG R+GF P CS
Sbjct: 388 ELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 446
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 156/473 (32%), Positives = 227/473 (47%), Gaps = 77/473 (16%)
Query: 7 AFLLFICLLCSSNNGAYADDNDLSHSHI--VSVSSLLPPNVCNRTRTALPQGPDKASLEV 64
FL+ +C LCS G + + + ++I V V+SLLP NVC+++ L + +SL+V
Sbjct: 17 VFLVLLCPLCSLKKGLTVEGKETTKNYIRTVRVNSLLPSNVCSQSTRVLNRA---SSLKV 73
Query: 65 VSKYGPCSRLNQGIST-HAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPAN 123
V+KYGPC + T + PS E L QDQ R+ R P K + T PA+
Sbjct: 74 VNKYGPCIPVTGAPKTINVPSTAEFLLQDQLRVKSFQVRLSMNPSSGVFKEMQT-TIPAS 132
Query: 124 INDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIH-CFQQRDPFFYASKSKTF 182
I T Y + V +G PK+ +L DTGSD+TWTQC+PC+ CF Q P F + S ++
Sbjct: 133 IVPT-GGAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGGCFPQNQPKFDPTTSTSY 191
Query: 183 FKIPCNSTSCRILRE-SFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
+ C+S C+++ E ++P +C S C + IQY G + GF AT+ + I ++ F
Sbjct: 192 KNVSCSSEFCKLIAEGNYPAQDCISNTCLYGIQYGSGY-TIGFLATETLAIASSD---VF 247
Query: 242 TRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL---PSPYGSTG 295
FL GC S G +G +G++GL RSP+++ ++T Y FSYCL PS G
Sbjct: 248 KN--FLFGCSEESRGTFNGTTGLLGLGRSPIALPSQTTNKYKNLFSYCLPASPSSTGHLS 305
Query: 296 Y---------------------------------------------------ITFGKTDT 304
+ TF + T
Sbjct: 306 FGVEVSQAAKSTPISPKLKQLYGLNTVGISVRGRELPINGSISRTIIDSGTTFTFLPSPT 365
Query: 305 VNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLS--AYETVVVPKIAIHFLGGVDLELDVRG 362
+ L SAF + M Y G CYD S T+ +P I+I F GGV++E+DV G
Sbjct: 366 YSALGSAFREMMANYTLTNGTSSF-QPCYDFSNIGNGTLTIPGISIFFEGGVEVEIDVSG 424
Query: 363 TLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
++ V + +VCL FA D + GN QQ+ +EV YDVA +GF P C
Sbjct: 425 IMIPVNGLKEVCLAFADTGSDSDFAIFGNYQQKTYEVIYDVAKGMVGFAPKGC 477
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 149/419 (35%), Positives = 207/419 (49%), Gaps = 77/419 (18%)
Query: 55 QGPD-KASLEVVSKYGPCSRLNQ--GISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEF 111
+GP KASLEVV K+GPCS+LN G + EIL QD++R+ NSR + +
Sbjct: 63 KGPKRKASLEVVHKHGPCSQLNNHDGKAKSKTPHSEILNQDKERVKYINSRISKNLGQDS 122
Query: 112 -LKRTEAFTFPANINDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIH-CFQ 168
+ ++ T PA + Y++VV +G PK+ +SL+ DTGSD+TWTQC+PC C++
Sbjct: 123 SVSELDSVTLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYK 182
Query: 169 QRDPFFYASKSKTFFKIPCNSTSCRILRESF---PFGNCNSKECPFNIQYADGSGSGGFW 225
Q+D F SKS ++ I C ST C L + P + ++K C + IQY D S S G++
Sbjct: 183 QQDAIFDPSKSTSYSNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYF 242
Query: 226 ATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITR-------- 277
+ +R+++ + FL GC N+ G G++G++GL R P+S + +
Sbjct: 243 SRERLSVTATD-----IVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAVYRKI 297
Query: 278 -------------------TNTSYFSYCLPS-----------------------PYGSTG 295
T TSY Y S P S+
Sbjct: 298 FSYCLPATSSSTGRLSFGTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSST 357
Query: 296 YITFGK---TDTVNT---------LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVV 343
+ T G + TV T LRSAF + M KY A L +LDTCYDLS YE +
Sbjct: 358 FSTGGAIIDSGTVITRLPPTAYTALRSAFRQGMSKYPSAGELS-ILDTCYDLSGYEVFSI 416
Query: 344 PKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDV 402
PKI F GGV ++L +G L VAS QVCL FA D + GNVQQ+ EV YDV
Sbjct: 417 PKIDFSFAGGVTVQLPPQGILYVASAKQVCLAFAANGDDSDVTIYGNVQQKTIEVVYDV 475
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 163/484 (33%), Positives = 231/484 (47%), Gaps = 81/484 (16%)
Query: 4 LSKAFLLFICLLCSSNNGAY---ADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPD-K 59
L +F L C+ + A+ + N+L H V ++SL P + + ++ +GP K
Sbjct: 5 LLASFALLFCISTLEKSFAFQATKESNNLRQYHFVHLNSLFP----SSSCSSSAKGPKRK 60
Query: 60 ASLEVVSKYGPCSRLNQ-GISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPE-FLKRTEA 117
ASLEVV K+GPCS+LN G + S +I+ D +R+ SR + E +K ++
Sbjct: 61 ASLEVVHKHGPCSQLNHNGKAKTTISHTDIMNLDNERVKYIQSRLSKNLGRENSVKELDS 120
Query: 118 FTFPANINDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCI-HCFQQRDPFFY 175
T PA + Y++VV +G PK+ +SL+ DTGSD+TWTQC+PC C++Q+D F
Sbjct: 121 TTLPAKSGSLIGSANYFVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQDAIFD 180
Query: 176 ASKSKTFFKIPCNSTSCRILRESFPFGNCNSK--ECPFNIQYADGSGSGGFWATDRITIQ 233
SKS ++ I C S+ C L + C+S C + IQY D S S GF + +R+TI
Sbjct: 181 PSKSSSYINITCTSSLCTQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGFLSQERLTIT 240
Query: 234 EAN------------SNGYFTRYPFLLG----------------------CINNSSGDKS 259
+ + G F+ L+G C+ ++S
Sbjct: 241 ATDIVDDFLFGCGQDNEGLFSGSAGLIGLGRHPISFVQQTSSIYNKIFSYCLPSTSSSLG 300
Query: 260 ----GASGIMG--LDRSPVSIITRTNTSY-------------FSYCLPSPYGSTGYITFG 300
GAS L +P+S I+ NT Y S + + G I
Sbjct: 301 HLTFGASAATNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTFSAGGSII-- 358
Query: 301 KTDTVNT---------LRSAFHKRMKKYKKAKGLED-LLDTCYDLSAYETVVVPKIAIHF 350
+ TV T LRSAF + M+KY A ED L DTCYD S Y+ + VPKI F
Sbjct: 359 DSGTVITRLAPTAYAALRSAFRQGMEKYPVAN--EDGLFDTCYDFSGYKEISVPKIDFEF 416
Query: 351 LGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFG 410
GGV +EL + G L+ S QVCL FA D + GNVQQ+ EV YDV G R+GFG
Sbjct: 417 AGGVTVELPLVGILIGRSAQQVCLAFAANGNDNDITIFGNVQQKTLEVVYDVEGGRIGFG 476
Query: 411 PGNC 414
C
Sbjct: 477 AAGC 480
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 204 bits (520), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 154/474 (32%), Positives = 227/474 (47%), Gaps = 93/474 (19%)
Query: 13 CLLCSSNNGAYADDNDLSHSHI--VSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGP 70
C LCS G N+++ + V+V+SLLP +VC+ + L + +SL+VVSKYGP
Sbjct: 19 CPLCSLKKGHTVAANEITKGYFRNVNVNSLLPSSVCDHSNKVLNKA---SSLKVVSKYGP 75
Query: 71 CSRLNQGISTHAPSLEEILRQDQQRL------HLKNSRRLRKPFPEFLKRTEAFTFPANI 124
C+ G PS EILR+DQ R+ H NS F E R F
Sbjct: 76 CTV--TGDPKTFPSAAEILRRDQLRVKSIRAKHSMNSST-TGVFNEMKTRVPTTHFGGG- 131
Query: 125 NDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIH-CFQQRDPFFYASKSKTFF 183
Y + V +G PK+ SLL DTGSD+TWTQC+PC CF Q D F +KS ++
Sbjct: 132 -------YAVTVGLGTPKKDFSLLFDTGSDLTWTQCEPCSGGCFPQNDEKFDPTKSTSYK 184
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
+ C+S C+ + + G +S C + ++Y G + GF AT+ +TI ++ F
Sbjct: 185 NLSCSSEPCKSIGKESAQGCSSSNSCLYGVKYGTGY-TVGFLATETLTITPSD---VFEN 240
Query: 244 YPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL------------- 287
F++GC + G SG +G++GL RSPV++ ++T+++Y FSYCL
Sbjct: 241 --FVIGCGERNGGRFSGTAGLLGLGRSPVALPSQTSSTYKNLFSYCLPASSSSTGHLSFG 298
Query: 288 -------------------------------------PSPYGSTGYI-------TFGKTD 303
PS + + G I T+ +
Sbjct: 299 GGVSQAAKFTPITSKIPELYGLDVSGISVGGRKLPIDPSVFRTAGTIIDSGTTLTYLPST 358
Query: 304 TVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLS--AYETVVVPKIAIHFLGGVDLELDVR 361
+ L SAF + M Y KG L CYD S A + + +P+I+I F GGV++++D
Sbjct: 359 AHSALSSAFQEMMTNYTLTKGTSGL-QPCYDFSKHANDNITIPQISIFFEGGVEVDIDDS 417
Query: 362 GTLVVAS-VSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
G + A+ + +VCL F D + GNVQQ+ +EV YDVA +GF PG C
Sbjct: 418 GIFIAANGLEEVCLAFKDNGNDTDVAIFGNVQQKTYEVVYDVAKGMVGFAPGGC 471
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 201 bits (512), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 152/478 (31%), Positives = 224/478 (46%), Gaps = 79/478 (16%)
Query: 8 FLLFICLLCSSNNGAYADDNDLSHSHI------VSVSSLLPPNVCNRTRTALPQGPD-KA 60
FLL+ LL + A H+ V ++SL+P + C+ + P+G D +A
Sbjct: 20 FLLYASLLSLKSGFAIEGRESAESHHVQPIHHNVHITSLMPSSACSPS----PKGHDQRA 75
Query: 61 SLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTF 120
SLEVV K+GPCS+L ++PS +IL QD+ R+ SR + + T
Sbjct: 76 SLEVVHKHGPCSKLRPH-KANSPSHTQILAQDESRVASIQSRLAKNLAGGSNLKASKATL 134
Query: 121 PANINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCI-HCFQQRDPFFYASK 178
P+ T+ Y+V V +G PK+ ++ + DTGSD+TWTQC+PC+ +C+QQR+ F S
Sbjct: 135 PSKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPST 194
Query: 179 SKTFFKIPCNSTSCRILRESFPFGN---CNSKECPFNIQYADGSGSGGFWATDRITIQEA 235
S ++ + C+S SC L + GN C+S C + I+Y DGS S GF+A +++++
Sbjct: 195 SLSYSNVSCDSPSCEKLESAT--GNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTST 252
Query: 236 ------------NSNGYFTRYPFLLGCINN------------------------------ 253
N+ G F LLG N
Sbjct: 253 DVFNNFQFGCGQNNRGLFGGTAGLLGLARNPLSLVSQTAQKYGKVFSYCLPSSSSSTGYL 312
Query: 254 SSGDKSGASGIMGLDRSPVS-------IITRTNTSYFSYCLPSP---YGSTGYITFGKT- 302
S G G S + S V+ + S LP P + + G I T
Sbjct: 313 SFGSGDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFSTAGTIIDSGTV 372
Query: 303 ------DTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
++++ F + M Y + KG+ +LDTCYDLS Y+TV VPKI ++F GG ++
Sbjct: 373 ISRLPPTVYSSVQKVFRELMSDYPRVKGVS-ILDTCYDLSKYKTVKVPKIILYFSGGAEM 431
Query: 357 ELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+L G + V VSQVCL FA D +GNVQQ+ V YD A R+GF P C
Sbjct: 432 DLAPEGIIYVLKVSQVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGRVGFAPSGC 489
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 197 bits (501), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 158/474 (33%), Positives = 231/474 (48%), Gaps = 82/474 (17%)
Query: 8 FLLFICLLCSSNNGAYADDND--LSHSHIVSVSSLLPPNVCNRTRTALPQGPDKAS-LEV 64
FLLF+C LCS G + N+ + H + V+SLL + C+++ + DKAS L+V
Sbjct: 17 FLLFLCPLCSLKKGYAVEANEHIKKYVHTLEVNSLLASDSCDQSSKVI----DKASSLQV 72
Query: 65 VSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANI 124
+ KYGPC ++ + S E L QDQ R+ +R L K + PA
Sbjct: 73 LHKYGPCMQV-----LNDRSHVEFLLQDQLRVDSIQAR-LSKISGHGIFEEMVTKLPAQS 126
Query: 125 NDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCI-HCFQQRDPFFYASKSKTF 182
+ Y+V V +G PK+ +L+ DTGS +TWTQC+PC+ C+ Q++ F +KS ++
Sbjct: 127 GIAIGTGNYVVTVGLGTPKEDFTLVFDTGSGITWTQCQPCLGSCYPQKEQKFDPTKSTSY 186
Query: 183 FKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFT 242
+ C+S SC +L S + ++ C + I Y D S S GF+AT+ +TI +S+ FT
Sbjct: 187 NNVSCSSASCNLLPTSERGCSASNSTCLYQIIYGDQSYSQGFFATETLTI---SSSDVFT 243
Query: 243 RYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL---PSPYG---- 292
FL GC +++G A+G++GL S VS+ ++T Y FSYCL PS G
Sbjct: 244 N--FLFGCGQSNNGLFGQAAGLLGLSSSSVSLPSQTAEKYQKQFSYCLPSTPSSTGYLNF 301
Query: 293 -------------STGYITFGKTDTV---------------------------------- 305
S + +F D V
Sbjct: 302 GGKVSQTAGFTPISPAFSSFYGIDIVGISVAGSQLPIDPSIFTTSGAIIDSGTVITRLPP 361
Query: 306 ---NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRG 362
L+ AF ++M Y K G ++LLDTCYD S Y TV PK+++ F GGV++++D G
Sbjct: 362 TAYKALKEAFDEKMSNYPKTNG-DELLDTCYDFSNYTTVSFPKVSVSFKGGVEVDIDASG 420
Query: 363 TL-VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
L +V V VCL FA D GN QQ+ +EV YD A +GF G CS
Sbjct: 421 ILYLVNGVKMVCLAFAANKDDSEFGIFGNHQQKTYEVVYDGAKGMIGFAAGACS 474
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 155/479 (32%), Positives = 229/479 (47%), Gaps = 80/479 (16%)
Query: 1 MWILSKAFLLFICLLCS-SNNGAY-------ADDNDLSHSHIVSVSSLLPPNVCNRTRTA 52
M ++S + LL +CL+ S S A+ A +N L H + +S+LLP C + T
Sbjct: 1 MALISFSHLLCLCLVISLSTTYAFGFEGRKIAQENHLQLIHAIEISNLLPSADCEHS-TK 59
Query: 53 LPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFL 112
+ Q +KASL+VV K+GPCS+LNQ + +AP+L EIL +DQ R+ +S + +
Sbjct: 60 VAQ--NKASLKVVHKHGPCSQLNQQ-NGNAPNLVEILLEDQSRV---DSIHAKLSDHSGV 113
Query: 113 KRTEAFTFPANINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRD 171
K T+A P ++ YIV + +G PK+ + L+ DTGSD+TW +C + D
Sbjct: 114 KETDAAKLPTKSGMSLGTGNYIVSIGLGSPKKDLMLIFDTGSDLTWARCSAA----ETFD 169
Query: 172 PFFYASKSKTFFKIPCNSTSCR-ILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRI 230
P +KS ++ + C++ C ++ + C + C + IQY DGS S GF +R+
Sbjct: 170 P----TKSTSYANVSCSTPLCSSVISATGNPSRCAASTCVYGIQYGDGSYSIGFLGKERL 225
Query: 231 TIQEAN------------SNGYFTRYPFLLG----------------------CINNSSG 256
TI + +G F + LLG C+ +SS
Sbjct: 226 TIGSTDIFNNFYFGCGQDVDGLFGKAAGLLGLGRDKLSVVSQTAPKYNQLFSYCLPSSSS 285
Query: 257 --------DKSGASGIMGLDRSPVSI----ITRTNTSYFSYCLP-SPYGSTGYITFGKTD 303
+S ++ L P S +T +P S + + G I T
Sbjct: 286 TGFLSFGSSQSKSAKFTPLSSGPSSFYNLDLTGITVGGQKLAIPLSVFSTAGTIIDSGTV 345
Query: 304 TV-------NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+ LRSAF K M Y K L +LDTCYD S Y+T+ VPKI I F GGVD+
Sbjct: 346 VTRLPPAAYSALRSAFRKAMASYPMGKPLS-ILDTCYDFSKYKTIKVPKIVISFSGGVDV 404
Query: 357 ELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
++D G V + QVCL FA ++ GN QQR EV YDV+G ++GF P +CS
Sbjct: 405 DVDQAGIFVANGLKQVCLAFAGNTGARDTAIFGNTQQRNFEVVYDVSGGKVGFAPASCS 463
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 147/485 (30%), Positives = 221/485 (45%), Gaps = 86/485 (17%)
Query: 1 MW-ILSKAFLLFICLLCSSNNGAYADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDK 59
+W IL A L+ C+ D H+VSV+SLLP C + + +
Sbjct: 16 VWLILIAAALVGPCVSAPDAAERRTSRPDHQDWHVVSVASLLPAAACKAPKASAS---NS 72
Query: 60 ASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQR---LHLKNSRRLRKPFPEFLKRTE 116
++L VV + GPCS L P E+L DQ R +H K + P + + +
Sbjct: 73 SALNVVHRQGPCSPLQA--RGAPPPHAELLNDDQARVDSIHRKIAAAA-SPVLDQARGKK 129
Query: 117 AFTFPANINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFY 175
T PA ++ Y+V + +G P + ++++ DTGSD++W QC PC C++Q+DP F
Sbjct: 130 GVTLPAQRGISLGTGNYVVSMGLGTPARDMTVVFDTGSDLSWVQCTPCSDCYEQKDPLFD 189
Query: 176 ASKSKTFFKIPCNSTSCRILRESFPFGNCN-SKECPFNIQYADGSGSGGFWATDRITIQE 234
++S T+ +PC S C+ L +C+ K+C + + Y D S + G A D +T+ +
Sbjct: 190 PARSSTYSAVPCASPECQGLDSR----SCSRDKKCRYEVVYGDQSQTDGALARDTLTLTQ 245
Query: 235 ANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPY 291
++ F+ GC +G A G++GL R VS+ ++ + Y FSYCLPS
Sbjct: 246 SD-----VLPGFVFGCGEQDTGLFGRADGLVGLGREKVSLSSQAASKYGAGFSYCLPSSP 300
Query: 292 GSTGY--------------------------------------------ITFGKTDTV-- 305
+ GY I F TV
Sbjct: 301 SAAGYLSLGGPAPANARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVFSAAGTVID 360
Query: 306 -------------NTLRSAFHKRMKKY--KKAKGLEDLLDTCYDLSAYETVVVPKIAIHF 350
LRSAF + M +Y K+A L +LDTCYD + + TV +P +A+ F
Sbjct: 361 SGTVITRLPPRVYAALRSAFARSMGRYGYKRAPALS-ILDTCYDFTGHTTVRIPSVALVF 419
Query: 351 LGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFG 410
GG + LD G L VA VSQ CL FA ++ +GN QQ+ V YDVA +++GFG
Sbjct: 420 AGGAAVGLDFSGVLYVAKVSQACLAFAPNGDGADAGIIGNTQQKTLAVVYDVARQKIGFG 479
Query: 411 PGNCS 415
CS
Sbjct: 480 ANGCS 484
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 146/417 (35%), Positives = 208/417 (49%), Gaps = 72/417 (17%)
Query: 55 QGPD-KASLEVVSKYGPCSRLNQ--GISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPE- 110
+GP KASLEVV K+GPCS+LN G + +IL QD++R+ NSR L K +
Sbjct: 64 KGPKTKASLEVVHKHGPCSQLNDHDGKAKSTTPHSDILNQDKERVKYINSR-LSKNLGQD 122
Query: 111 -FLKRTEAFTFPANINDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIH-CF 167
++ ++ T PA + Y++VV +G PK+ +SL+ DTGSD+TWTQC+PC C+
Sbjct: 123 SSVEELDSATLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCY 182
Query: 168 QQRDPFFYASKSKTFFKIPCNSTSCRILRESF---PFGNCNSKECPFNIQYADGSGSGGF 224
+Q+D F SKS ++ I C S C L + P + ++K C + IQY D S S G+
Sbjct: 183 KQQDVIFDPSKSTSYSNITCTSALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSVGY 242
Query: 225 WATDRITIQEA------------NSNGYFT---------RYP----------------FL 247
++ +R+T+ N+ G F R+P +
Sbjct: 243 FSRERLTVTATDVVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAKYRKIFSYC 302
Query: 248 LGCINNSSGDKSGASGIMG--LDRSPVSIITRTNTSYFSYCLPS--------PYGSTGYI 297
L ++S+G S G L +P S I+R +S++ + + P S+ +
Sbjct: 303 LPSTSSSTGHLSFGPAATGRYLKYTPFSTISR-GSSFYGLDITAIAVGGVKLPVSSSTFS 361
Query: 298 TFGK---TDTVNT---------LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPK 345
T G + TV T LRSAF + M KY A L +LDTCYDLS Y+ +P
Sbjct: 362 TGGAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELS-ILDTCYDLSGYKVFSIPT 420
Query: 346 IAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDV 402
I F GGV ++L +G L VAS QVCL FA D + GNVQQR EV YDV
Sbjct: 421 IEFSFAGGVTVKLPPQGILFVASTKQVCLAFAANGDDSDVTIYGNVQQRTIEVVYDV 477
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 144/460 (31%), Positives = 215/460 (46%), Gaps = 79/460 (17%)
Query: 25 DDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPS 84
D+ + H+VSV++LLP VC R A ++L VV ++GPCS L PS
Sbjct: 32 DEGSGPNWHVVSVAALLPDAVCTPKRAAASN---SSALSVVHRHGPCSPLQA--RGGEPS 86
Query: 85 LEEILRQDQQR---LHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIV-VAIGE 140
EIL +DQ R +H + R + ++ + PA + YIV V +G
Sbjct: 87 HAEILDRDQDRVDSIHRLAAARPSSTADDPSSASKGVSLPARRGVPLGTANYIVSVGLGT 146
Query: 141 PKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFP 200
PK+ + ++ DTGSD++W QCKPC C+QQ DP F S+S T+ +PC + CR L
Sbjct: 147 PKRDLLVVFDTGSDLSWVQCKPCDGCYQQHDPLFDPSQSTTYSAVPCGAQECRRLDS--- 203
Query: 201 FGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRY-PFLLGCINNSSGDKS 259
G+C+S +C + + Y D S + G A D +T+ ++S+ + F+ GC ++ +G
Sbjct: 204 -GSCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSDQLQEFVFGCGDDDTGLFG 262
Query: 260 GASGIMGLDRSPVSI-----------------------------------------ITRT 278
A G+ GL R VS+ +TR+
Sbjct: 263 KADGLFGLGRDRVSLASQAAAKYGAGFSYCLPSSSTAEGYLSLGSAAPPNARFTAMVTRS 322
Query: 279 NTSYFSY--------------CLPSPYGSTG-------YITFGKTDTVNTLRSAFHKRMK 317
+T F Y P+ + + G IT + LRS+F M+
Sbjct: 323 DTPSFYYLNLVGIKVAGRTVRVSPAVFRTPGTVIDSGTVITRLPSRAYAALRSSFAGLMR 382
Query: 318 KY--KKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLG 375
+Y K+A L +LDTCYD + V +P +A+ F GG L L L VA+ SQ CL
Sbjct: 383 RYSYKRAPALS-ILDTCYDFTGRNKVQIPSVALLFDGGATLNLGFGEVLYVANKSQACLA 441
Query: 376 FATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
FA+ D + LGN+QQ+ V YDVA +++GFG CS
Sbjct: 442 FASNGDDTSIAILGNMQQKTFAVVYDVANQKIGFGAKGCS 481
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 184 bits (468), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 121/332 (36%), Positives = 179/332 (53%), Gaps = 34/332 (10%)
Query: 2 WILSKAFLLFICLL---CSSNNGAYADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPD 58
++L +F + LL ++ A + SH H + ++SLLP + CN +G
Sbjct: 12 FLLFSSFTFLLILLSFPVEKSHALEAKETIESHFHTLQLTSLLPSSSCNTATKGKRRG-- 69
Query: 59 KASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAF 118
ASLEVV++ GPC++LNQ AP+L EIL DQ R+ +R + + F K+ +
Sbjct: 70 -ASLEVVNRQGPCTQLNQK-GAKAPTLTEILAHDQARVDSIQARVTDQSYDLFKKKDKKS 127
Query: 119 ------------TFPANINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIH 165
PA + YIV V +G PK+ +SL+ DTGSD+TWTQC+PC+
Sbjct: 128 SNKKKSVKDSKANLPAQSGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVK 187
Query: 166 -CFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGN---CNSKECPFNIQYADGSGS 221
C+ Q+ P F S SKT+ I C ST+C L+ + GN C+S C + IQY D S +
Sbjct: 188 SCYAQQQPIFDPSASKTYSNISCTSTACSGLKSA--TGNSPGCSSSNCVYGIQYGDSSFT 245
Query: 222 GGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNT- 280
GF+A D +T+ + N F F+ GC N+ G +G++GL R P+SI+ +T
Sbjct: 246 VGFFAKDTLTLTQ---NDVFDG--FMFGCGQNNRGLFGKTAGLIGLGRDPLSIVQQTAQK 300
Query: 281 --SYFSYCLPSPYGSTGYITFGKTDTVNTLRS 310
YFSYCLP+ GS G++TFG + V T ++
Sbjct: 301 FGKYFSYCLPTSRGSNGHLTFGNGNGVKTSKA 332
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 51/109 (46%), Positives = 67/109 (61%), Gaps = 1/109 (0%)
Query: 307 TLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVV 366
+L+S F + M KY A L LLDTCYDLS Y ++ +PKI+ +F G +++L+ G L+
Sbjct: 395 SLKSTFKQFMSKYPTAPALS-LLDTCYDLSNYTSISIPKISFNFNGNANVDLEPNGILIT 453
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
SQVCL FA D GN+QQ+ EV YDVAG +LGFG CS
Sbjct: 454 NGASQVCLAFAGNGDDDTIGIFGNIQQQTLEVVYDVAGGQLGFGYKGCS 502
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 181 bits (460), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 130/437 (29%), Positives = 197/437 (45%), Gaps = 84/437 (19%)
Query: 48 RTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAP-SLEEILRQDQQRLHLKNSR--RL 104
R A P+ A L + ++GPC+ + + +P S + LR DQ+R R
Sbjct: 53 RVSAASPRNGTSAVLRLTHRHGPCAPAGKASALGSPPSFLDTLRADQRRAEYIQRRVSGA 112
Query: 105 RKPFPEF-LKRTEAFTFPANINDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKP 162
P L ++A T PAN+ ++ +Y + V++G P +L +DTGSDV+W QCKP
Sbjct: 113 AAAAPGMQLAGSKAATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKP 172
Query: 163 CIH--CFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSG 220
C C+ QRDP F ++S ++ +PC + SC L + C+ +C + + Y DGS
Sbjct: 173 CPSPPCYSQRDPLFDPTRSSSYSAVPCAAASCSQL--ALYSNGCSGGQCGYVVSYGDGST 230
Query: 221 SGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNT 280
+ G +++D +T+ +N+ FL GC + G +G G++GL R S++++ ++
Sbjct: 231 TTGVYSSDTLTLTGSNA-----LKGFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASS 285
Query: 281 SY---FSYC------------LPSPYGSTGYIT--------------------------- 298
+Y FSYC L P + G+ T
Sbjct: 286 TYGGVFSYCLPPTQNSVGYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPL 345
Query: 299 ------FGKTDTVNT--------------LRSAFHKRMKKYK-KAKGLEDLLDTCYDLSA 337
F V+T LRSAF M Y + +LDTCYD +
Sbjct: 346 SIDASVFASGAVVDTGTVVTRLPPTAYSALRSAFRAAMAPYGYPSAPATGILDTCYDFTR 405
Query: 338 YETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHE 397
Y TV +P I+I F GG ++L G L CL FA D + LGNVQQR E
Sbjct: 406 YGTVTLPTISIAFGGGAAMDLGTSGILTSG-----CLAFAPTGGDSQASILGNVQQRSFE 460
Query: 398 VHYDVAGRRLGFGPGNC 414
V +D G +GF P +C
Sbjct: 461 VRFD--GSTVGFMPASC 475
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 143/454 (31%), Positives = 214/454 (47%), Gaps = 78/454 (17%)
Query: 30 SHSHIVSVSSLLPPNVCNRTRTALPQGP--DKASLEVVSKYGPCSRLNQGISTHAPSLEE 87
SH V ++ L P C R + +++SLEV+ ++GPC ++AP+ E
Sbjct: 29 SHFLTVDLAGLFPSASCTRRSPQVHTSSLGEQSSLEVIHRHGPCGDE----VSNAPTAAE 84
Query: 88 ILRQDQQR---LHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIV-VAIGEPKQ 143
+L +DQ R +H K + L + L+ ++A PA T+ YIV V +G PK+
Sbjct: 85 MLVKDQSRVDFIHSKIAGELESV--DRLRGSKATKIPAKSGATIGSGNYIVSVGLGTPKK 142
Query: 144 YVSLLLDTGSDVTWTQCKPCI-HCFQQRDPFFYASKSKTFFKIPC-------------NS 189
Y+SL+ DTGSD+TWTQC+PC +C+ Q+DP F S+S T+ I C N
Sbjct: 143 YLSLIFDTGSDLTWTQCQPCARYCYNQKDPVFVPSQSTTYSNISCSSPDCSQLESGTGNQ 202
Query: 190 TSCRILR----------ESFPFG-------NCNSKECPFNIQYADGSGSGGFWAT----- 227
C R +SF G S + N + G + G + +
Sbjct: 203 PGCSAARACIYGIQYGDQSFSVGYFAKETLTLTSTDVIENFLFGCGQNNRGLFGSAAGLI 262
Query: 228 ----DRITIQEANSNGYFTRYPFLLGCINNSSG-------------------DKSGASGI 264
D+I+I + + Y + + L ++S+G G +
Sbjct: 263 GLGQDKISIVKQTAQKYGQVFSYCLPKTSSSTGYLTFGGGGGGGALKYTPITKAHGVANF 322
Query: 265 MGLDRSPVSI---ITRTNTSYFSYCLPSPYGSTGYITFGKTDTVNTLRSAFHKRMKKYKK 321
G+D + + ++S FS + S IT D + L+SAF K M KY K
Sbjct: 323 YGVDIVGMKVGGTQIPISSSVFSTS-GAIIDSGTVITRLPPDAYSALKSAFEKGMAKYPK 381
Query: 322 AKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPP 381
A L +LDTCYDLS Y T+ +PK+ F GG +L+LD G + AS SQVCL FA
Sbjct: 382 APELS-ILDTCYDLSKYSTIQIPKVGFVFKGGEELDLDGIGIMYGASTSQVCLAFAG-NQ 439
Query: 382 DPNSIT-LGNVQQRGHEVHYDVAGRRLGFGPGNC 414
DP+++ +GNVQQ+ +V YDV G ++GFG C
Sbjct: 440 DPSTVAIIGNVQQKTLQVVYDVGGGKIGFGYNGC 473
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 130/437 (29%), Positives = 197/437 (45%), Gaps = 84/437 (19%)
Query: 48 RTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAP-SLEEILRQDQQRLHLKNSR--RL 104
R A P+ A L + ++GPC+ + + +P S + LR DQ+R R
Sbjct: 42 RVSAASPRNGTSAVLRLTHRHGPCAPAGKASALGSPPSFLDTLRADQRRAEYIQRRVSGA 101
Query: 105 RKPFPEF-LKRTEAFTFPANINDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKP 162
P L ++A T PAN+ ++ +Y + V++G P +L +DTGSDV+W QCKP
Sbjct: 102 AAAAPGMQLAGSKAATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKP 161
Query: 163 CIH--CFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSG 220
C C+ QRDP F ++S ++ +PC + SC L + C+ +C + + Y DGS
Sbjct: 162 CPSPPCYSQRDPLFDPTRSSSYSAVPCAAASCSQL--ALYSNGCSGGQCGYVVSYGDGST 219
Query: 221 SGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNT 280
+ G +++D +T+ +N+ FL GC + G +G G++GL R S++++ ++
Sbjct: 220 TTGVYSSDTLTLTGSNA-----LKGFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASS 274
Query: 281 SY---FSYC------------LPSPYGSTGYIT--------------------------- 298
+Y FSYC L P + G+ T
Sbjct: 275 TYGGVFSYCLPPTQNSVGYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPL 334
Query: 299 ------FGKTDTVNT--------------LRSAFHKRMKKYK-KAKGLEDLLDTCYDLSA 337
F V+T LRSAF M Y + +LDTCYD +
Sbjct: 335 SIDASVFASGAVVDTGTVVTRLPPTAYSALRSAFRAAMAPYGYPSAPATGILDTCYDFTR 394
Query: 338 YETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHE 397
Y TV +P I+I F GG ++L G L CL FA D + LGNVQQR E
Sbjct: 395 YGTVTLPTISIAFGGGAAMDLGTSGILTSG-----CLAFAPTGGDSQASILGNVQQRSFE 449
Query: 398 VHYDVAGRRLGFGPGNC 414
V +D G +GF P +C
Sbjct: 450 VRFD--GSTVGFMPASC 464
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 136/476 (28%), Positives = 213/476 (44%), Gaps = 93/476 (19%)
Query: 10 LFIC-LLCSSNNGAYADDNDLSHSHIVSVSSLL--PPNVCNRTRTA-LPQGPDKASLEVV 65
L +C +LC+ N+ A+ N+ H + +S P C+ +R L +G + S+ +V
Sbjct: 6 LLVCFILCTYNSLAHGG-NEEEHVLVAVPTSRYSEPAATCSTSRVRWLDEGSNTVSVPLV 64
Query: 66 SKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANIN 125
++GPC+ + S+ PSL E LR+ + R SR + + P ++
Sbjct: 65 HRHGPCAPSTR--SSDEPSLSERLRRSRARSKYIMSRASKS----------NVSIPTHLG 112
Query: 126 DTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC--IHCFQQRDPFFYASKSKTF 182
+V EY + V +G P LL+DTGSD++W QC PC C+ Q+DP F S+S T+
Sbjct: 113 GSVDSLEYVVTVGLGTPAVSQVLLIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSRSSTY 172
Query: 183 FKIPCNSTSCRILRESFPFGNCNS-----KECPFNIQYADGSGSGGFWATDRITIQEANS 237
IPCN+ +CR L +C S +C + I Y DGS + G ++ + +T+
Sbjct: 173 APIPCNTDACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGVYSNETLTMAPG-- 230
Query: 238 NGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGST 294
T F GC ++ G G++GL +P S++ +T++ Y FSYCLP+
Sbjct: 231 ---VTVKDFHFGCGHDQDGPNDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCLPAANDQA 287
Query: 295 GYITFGK-------------------------------TDTVNTLRSAFHKRM------- 316
G++ G + ++ SAF M
Sbjct: 288 GFLALGAPVNDASGFVFTPMVREQQTFYVVNMTGITVGGEPIDVPPSAFSGGMIIDSGTV 347
Query: 317 -------------KKYKKAKGLEDL-----LDTCYDLSAYETVVVPKIAIHFLGGVDLEL 358
++KA L LDTCY+ + + V VP++A+ F GG ++L
Sbjct: 348 VTELQHTAYAALQAAFRKAMAAYPLLPNGELDTCYNFTGHSNVTVPRVALTFSGGATVDL 407
Query: 359 DVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
DV +++ + CL F PD LGNV QR EV YDV R+GFG C
Sbjct: 408 DVPDGILLDN----CLAFQEAGPDNQPGILGNVNQRTLEVLYDVGHGRVGFGADAC 459
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 110/301 (36%), Positives = 171/301 (56%), Gaps = 22/301 (7%)
Query: 8 FLLFICLLCSSNNGAYADDNDLSHS--HIVSVSSLLPPNVCNRTRTALPQGPDKASLEVV 65
F+ LLC N G +++++ HI+ V SLLP CN+T + SLEVV
Sbjct: 13 FVNAFLLLCYLNKGHAVGEDEITKGYLHIIKVKSLLPSTACNQTFKV----SNSLSLEVV 68
Query: 66 SKYGPCSR-LNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANI 124
+ GPC + LNQ + +APS EIL QD+ R+ +S R + +A T P
Sbjct: 69 HRSGPCIQVLNQEKAANAPSNMEILLQDRHRV---DSIHARLSSHGVFQEKQA-TLPVQS 124
Query: 125 NDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIH-CFQQRDPFFYASKSKTF 182
++ +Y + V +G PK+ +L+ DTGSD+TWTQC+PC C++Q++P +KS ++
Sbjct: 125 GASIGSGDYAVTVGLGTPKKEFTLIFDTGSDLTWTQCEPCAKTCYKQKEPRLDPTKSTSY 184
Query: 183 FKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFT 242
I C+S C++L ++ +C+S C + +QY DGS S GF+AT+ +T+ +N
Sbjct: 185 KNISCSSAFCKLL-DTEGGESCSSPTCLYQVQYGDGSYSIGFFATETLTLSSSN-----V 238
Query: 243 RYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITF 299
FL GC +SG GA+G++GL R+ +S+ ++T Y FSYCLP+ S GY++F
Sbjct: 239 FKNFLFGCGQQNSGLFRGAAGLLGLGRTKLSLPSQTAQKYKKLFSYCLPASSSSKGYLSF 298
Query: 300 G 300
G
Sbjct: 299 G 299
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 46/110 (41%), Positives = 63/110 (57%), Gaps = 2/110 (1%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+ L SAF K M Y G + DTCYD S ET+ +PK+ + F GGV++++DV G L
Sbjct: 364 SALSSAFQKLMTDYPSTDGYS-IFDTCYDFSKNETIKIPKVGVSFKGGVEMDIDVSGILY 422
Query: 366 -VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V + +VCL FA D + GN QQ+ ++V YD A R+GF P C
Sbjct: 423 PVNGLKKVCLAFAGNGDDVKAAIFGNTQQKTYQVVYDDAKGRVGFAPSGC 472
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 177 bits (450), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 138/449 (30%), Positives = 204/449 (45%), Gaps = 85/449 (18%)
Query: 35 VSVSSLLPPNVCNRTRTALPQGPDKAS-LEVVSKYGPCSRLNQGISTHAPSLEEILRQDQ 93
VS +S P + C+ + PQ D + L + ++GPC+ L + S APS+ + LR DQ
Sbjct: 38 VSAASFAPSSTCSASDPVAPQQNDTFTVLRLTHRHGPCAPL-RASSLAAPSVADTLRADQ 96
Query: 94 QRLHLKNSRRLRKPFPEFLK-RTEAFTFPANINDTVADEYYIVVA-IGEPKQYVSLLLDT 151
+R R + P+ + A T PAN + Y+V A +G P +L +DT
Sbjct: 97 RRAEHILRRVSGRGAPQLWDYKAAAATVPANWGYDIGTSNYVVTASLGTPGMAQTLEVDT 156
Query: 152 GSDVTWTQCKPCI--HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKEC 209
GSD++W QCKPC C++Q+DP F ++S ++ +PC ++C L C++ +C
Sbjct: 157 GSDLSWVQCKPCAAPSCYRQKDPLFDPAQSSSYAAVPCGRSACAGL--GIYASACSAAQC 214
Query: 210 PFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDK-SGASGIMGLD 268
+ + Y DGS + G +++D +T+ AN+ T FL GC + SG +G G++G
Sbjct: 215 GYVVSYGDGSNTTGVYSSDTLTL-AANA----TVQGFLFGCGHAQSGGLFTGIDGLLGFG 269
Query: 269 RSPVSIITRTNTSY---FSYCLP----------------------------SPYGSTGYI 297
R S++ +T +Y FSYCLP SP T Y+
Sbjct: 270 REQPSLVQQTAGAYGGVFSYCLPTKSSTTGYLTLGGPSGVAPGFSTTQLLPSPNAPTYYV 329
Query: 298 -----------------------TFGKTDTVNT---------LRSAFHKRMKKYKKAKGL 325
T T TV T LRSAF M Y A +
Sbjct: 330 VMLTGISVGGQPLSVPASAFAAGTVVDTGTVITRLPPAAYAALRSAFRSGMASYPSAPPI 389
Query: 326 EDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNS 385
+LDTCY + Y TV + +A+ F G + L G + S CL FA+ D +
Sbjct: 390 -GILDTCYSFAGYGTVNLTSVALTFSSGATMTLGADGIM-----SFGCLAFASSGSDGSM 443
Query: 386 ITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
LGNVQQR EV D G +GF P +C
Sbjct: 444 AILGNVQQRSFEVRID--GSSVGFRPSSC 470
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 137/459 (29%), Positives = 210/459 (45%), Gaps = 83/459 (18%)
Query: 24 ADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAP 83
A D ++S+ SL +VC+ ++ A+ A++ + ++GPCS L + P
Sbjct: 23 AHAGDHGSYKVLSLGSLRTKSVCSESK-AVKSSTGAATVPLHHRHGPCSPLP---TKKMP 78
Query: 84 SLEEILRQDQ------QRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVAD-EYYIVV 136
+LEE L +DQ QR + ++++ A T P + ++ EY I V
Sbjct: 79 TLEERLHRDQLRAAYIQRKFSGGGVNGSRGGAGDVQQSHA-TVPTTLGTSLDTLEYLITV 137
Query: 137 AIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILR 196
+G P + ++L+DTGSDV+W QCKPC C Q DP F S S T+ C+S +C L
Sbjct: 138 RLGSPGKSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCSSAACAQLG 197
Query: 197 ESFPFGN-CNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSS 255
+ GN C+S +C + + Y DGS + G +++D + + G F GC N S
Sbjct: 198 QE---GNGCSSSQCQYTVTYGDGSSTTGTYSSDTLAL------GSNAVRKFQFGCSNVES 248
Query: 256 GDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSP----------YGSTGYI----- 297
G G+MGL S++++T ++ FSYCLP+ G++G++
Sbjct: 249 GFNDQTDGLMGLGGGAQSLVSQTAGTFGAAFSYCLPATSSSSGFLTLGAGTSGFVKTPML 308
Query: 298 ---------------------------------TFGKTDTVNT---------LRSAFHKR 315
T + TV T L SAF
Sbjct: 309 RSSQVPTFYGVRIQAIRVGGRQLSIPTSVFSAGTIMDSGTVLTRLPPTAYSALSSAFKAG 368
Query: 316 MKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLG 375
MK+Y A +LDTC+D S +V +P +A+ F GG +++ G ++ S S +CL
Sbjct: 369 MKQYPSAP-PSGILDTCFDFSGQSSVSIPTVALVFSGGAVVDIASDGIMLQTSNSILCLA 427
Query: 376 FATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
FA D + +GNVQQR EV YDV G +GF G C
Sbjct: 428 FAANSDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 466
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 109/296 (36%), Positives = 167/296 (56%), Gaps = 24/296 (8%)
Query: 22 AYADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPD-KASLEVVSKYGPCSRLNQ-GIS 79
A + N+L H V ++SL P + + ++ +GP KASLEVV K+GPCS+LN G +
Sbjct: 30 ATKESNNLRQYHFVHLNSLFP----SSSCSSSAKGPKRKASLEVVHKHGPCSQLNHSGKA 85
Query: 80 THAPSLEEILRQDQQRLHLKNSRRLRKPFPE-FLKRTEAFTFPANINDTVAD-EYYIVVA 137
S +I+ D +R+ SR + E +K ++ T PA + +YY+VV
Sbjct: 86 EATISHNDIMNLDNERVKYIQSRLSKNLGGENRVKELDSTTLPAKSGRLIGSADYYVVVG 145
Query: 138 IGEPKQYVSLLLDTGSDVTWTQCKPCI-HCFQQRDPFFYASKSKTFFKIPCNSTSCRILR 196
+G PK+ +SL+ DTGS +TWTQC+PC C++Q+DP F SKS ++ I C S+ C R
Sbjct: 146 LGTPKRDLSLIFDTGSYLTWTQCEPCAGSCYKQQDPIFDPSKSSSYTNIKCTSSLCTQFR 205
Query: 197 ESFPFGNCNSK---ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINN 253
+ C+S C ++++Y D S S GF + +R+TI + + FL GC +
Sbjct: 206 SA----GCSSSTDASCIYDVKYGDNSISRGFLSQERLTITATD-----IVHDFLFGCGQD 256
Query: 254 SSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDTVN 306
+ G G +G+MGL R P+S + +T++ Y FSYCLPS S G++TFG + N
Sbjct: 257 NEGLFRGTAGLMGLSRHPISFVQQTSSIYNKIFSYCLPSTPSSLGHLTFGASAATN 312
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 52/108 (48%), Positives = 63/108 (58%), Gaps = 1/108 (0%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
LRSAF + M KY A G LLDTCYD S Y+ + VP+I F GGV +EL + G L
Sbjct: 376 LRSAFRQFMMKYPVAYGTR-LLDTCYDFSGYKEISVPRIDFEFAGGVKVELPLVGILYGE 434
Query: 368 SVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
S Q+CL FA + GNVQQ+ EV YDV G R+GFG C+
Sbjct: 435 SAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVEGGRIGFGAAGCN 482
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 126/417 (30%), Positives = 193/417 (46%), Gaps = 79/417 (18%)
Query: 64 VVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKR-TEAFTFPA 122
VV ++GPCS L PS EIL +DQ R+ + R P+ ++ + PA
Sbjct: 121 VVHRHGPCSPLLA--RGGEPSHAEILDRDQDRVDSIH-RMTAGPWTAGQSSASKGVSLPA 177
Query: 123 NINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKT 181
+ + YIV V +G P++ + ++ DTGSD++W QCKPC +C++Q DP F S+S T
Sbjct: 178 HRGLRLGTANYIVSVGLGTPRRDLLVVFDTGSDLSWVQCKPCNNCYKQHDPLFDPSQSTT 237
Query: 182 FFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
+ +PC + C G C+S +C + + Y D S + G A D +T+ ++
Sbjct: 238 YSAVPCGAQEC------LDSGTCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSDQ--- 288
Query: 242 TRYPFLLGCINNSSGDKSGASGIMGLDRSPVSI--------------------------- 274
F+ GC ++ +G A G+ GL R VS+
Sbjct: 289 -LQGFVFGCGDDDTGLFGRADGLFGLGRDRVSLASQAAARYGAGFSYCLPSSWRAEGYLS 347
Query: 275 ---------------ITRTNTSYFSY--------------CLPSPYGSTG-------YIT 298
+TR++T F Y P+ + + G IT
Sbjct: 348 LGSAAAPPHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKAPGTVIDSGTVIT 407
Query: 299 FGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLEL 358
+ + LRS+F M++YK+A L +LDTCYD + V +P +A+ F GG L L
Sbjct: 408 RLPSRAYSALRSSFAGFMRRYKRAPALS-ILDTCYDFTGRTKVQIPSVALLFDGGATLNL 466
Query: 359 DVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
G L VA+ SQ CL FA+ D + LGN+QQ+ V YD+A +++GFG CS
Sbjct: 467 GFGGVLYVANRSQACLAFASNGDDTSVGILGNMQQKTFAVVYDLANQKIGFGAKGCS 523
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 174 bits (441), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 136/468 (29%), Positives = 212/468 (45%), Gaps = 94/468 (20%)
Query: 10 LFICLLCSSNNGAYADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYG 69
+F+C S+ +GA +D+ ++ V SS P +VC+ Q + +V ++G
Sbjct: 9 IFLCFYLSTVHGA-GEDSFVT----VPSSSFEPESVCSGEFVKPEQNGSTVYVPLVHRHG 63
Query: 70 PCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVA 129
PC+ +ST S +I R+ + R P ++ R + + PA++ +V
Sbjct: 64 PCAPAPS-LSTDTRSFADIFRRSRAR-------------PSYIVRGKKVSVPAHLGTSVM 109
Query: 130 D-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCI--HCFQQRDPFFYASKSKTFFKIP 186
EY + V+ G P +++DTGSDV+W QCKPC CF Q+DP + S S T+ +P
Sbjct: 110 SLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVP 169
Query: 187 CNSTSCRIL-RESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEAN--SNGYFTR 243
C S C+ L +++ G + K+C F I YADG+ + G ++ D++T+ N YF
Sbjct: 170 CASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTLAPGAIVQNFYF-- 227
Query: 244 YPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFG--- 300
GC + + G++GL R S+ R FSYCLPS G++ G
Sbjct: 228 -----GCGHGKHAVRGLFDGVLGLGRLRESLGARYG-GVFSYCLPSVSSKPGFLALGAGK 281
Query: 301 --------------------------------KTD----------------TVNTLRSAF 312
K D + L+S
Sbjct: 282 NPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIVDSGTVITGLQSTA 341
Query: 313 HKRMKK-YKKAKGLEDL-----LDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVV 366
++ ++ ++KA L LDTCY+L+ Y+ VVVPKIA+ F GG + LDV ++V
Sbjct: 342 YRALRSAFRKAMEAYRLLPNGDLDTCYNLTGYKNVVVPKIALTFTGGATINLDVPNGILV 401
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
CL FA PD ++ LGNV QR EV +D + + GF C
Sbjct: 402 NG----CLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 445
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 145/471 (30%), Positives = 215/471 (45%), Gaps = 98/471 (20%)
Query: 35 VSVSSLLPPNV--CNRTRTALPQGPDKAS-LEVVSKYGPCSRLNQGISTHAPSLEEILRQ 91
+ V SLLP C + QG + + VV ++GPCS L + APS EIL
Sbjct: 36 LDVESLLPSAAAPCPTPQAEQKQGAAPPTRMPVVHQHGPCSPLADNRNGKAPSHAEILAA 95
Query: 92 DQQR---LHLK------NSRRLRKPFPEFLK---------------RTEAFTFPANINDT 127
DQ+R +H + +RR ++ P L+ T PA+
Sbjct: 96 DQRRAEYIHRRVAETTGRARRRKQGAPVELRPGTPPSSIVVPSSSSATSTTDLPASYGVA 155
Query: 128 VADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCI-HCFQQRDPFFYASKSKTFFKI 185
+ Y+V V +G P + +++ DTGSD TW QC+PC+ +C++Q++P F +KS T+ I
Sbjct: 156 LGTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATYANI 215
Query: 186 PCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQ-----------E 234
C+S+ C L S C+ C + IQY DGS + GF+A D +T+
Sbjct: 216 SCSSSYCSDLYVS----GCSGGHCLYGIQYGDGSYTIGFYAQDTLTLAYDTIKNFRFGCG 271
Query: 235 ANSNGYFTRYPFLLGCINNSSG------DKSG----------ASGIMGLDRSPVS----- 273
+ G F R LLG + DK G ++G LD P +
Sbjct: 272 EKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPATSAGTGFLDLGPGAPAANA 331
Query: 274 -----IITRTNTSYF---------SYCLPSP---YGSTGYITFGKTDTVNT--------- 307
++ R T Y+ + LP P + + G T + TV T
Sbjct: 332 RLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAG--TLVDSGTVITRLPPSAYAP 389
Query: 308 LRSAFHKRMKK--YKKAKGLEDLLDTCYDLSAYE--TVVVPKIAIHFLGGVDLELDVRGT 363
LRSAF K M+ Y A +LDTCYDL+ ++ ++ +P +++ F GG L++D G
Sbjct: 390 LRSAFSKAMQGLGYSAAPAFS-ILDTCYDLTGHKGGSIALPAVSLVFQGGACLDVDASGI 448
Query: 364 LVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L VA VSQ CL FA D + +GN QQ+ H V YD+ + +GF PG C
Sbjct: 449 LYVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 499
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 171 bits (434), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 137/441 (31%), Positives = 204/441 (46%), Gaps = 95/441 (21%)
Query: 62 LEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQR---LHLK------NSRRLRKPFPEFL 112
+ VV ++GPCS L + APS EIL DQ+R +H + +RR ++ P L
Sbjct: 1 MPVVHQHGPCSPLADNRNGKAPSHAEILAADQRRAEYIHRRVAETTGRARRRKQGAPVEL 60
Query: 113 K---------------RTEAFTFPANINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVT 156
+ T PA+ + Y+V V +G P + +++ DTGSD T
Sbjct: 61 RPGTPPSSIVVPSSSSATSTTDLPASYGVALGTGNYVVPVRLGTPAERFTVVFDTGSDTT 120
Query: 157 WTQCKPCI-HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQY 215
W QC+PC+ +C++Q++P F +KS T+ I C+S+ C L S C+ C + IQY
Sbjct: 121 WVQCQPCVAYCYRQKEPLFDPTKSATYANISCSSSYCSDLYVS----GCSGGHCLYGIQY 176
Query: 216 ADGSGSGGFWATDRITIQ-----------EANSNGYFTRYPFLLGCINNSSG------DK 258
DGS + GF+A D +T+ + G F R LLG + DK
Sbjct: 177 GDGSYTIGFYAQDTLTLAYDTIKNFRFGCGEKNRGLFGRAAGLLGLGRGKTSLPVQAYDK 236
Query: 259 SG----------ASGIMGLDRSPVS----------IITRTNTSYF---------SYCLPS 289
G ++G LD P + ++ R T Y+ + LP
Sbjct: 237 YGGVFAYCLPATSAGTGFLDLGPGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPI 296
Query: 290 P---YGSTGYITFGKTDTVNT---------LRSAFHKRMKK--YKKAKGLEDLLDTCYDL 335
P + + G T + TV T LRSAF K M+ Y A +LDTCYDL
Sbjct: 297 PGSVFSTAG--TLVDSGTVITRLPPSAYAPLRSAFSKAMQGLGYSAAPAFS-ILDTCYDL 353
Query: 336 SAYE--TVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQ 393
+ ++ ++ +P +++ F GG L++D G L VA VSQ CL FA D + +GN QQ
Sbjct: 354 TGHKGGSIALPAVSLVFQGGACLDVDASGILYVADVSQACLAFAPNADDTDVAIVGNTQQ 413
Query: 394 RGHEVHYDVAGRRLGFGPGNC 414
+ H V YD+ + +GF PG C
Sbjct: 414 KTHGVLYDIGKKIVGFAPGAC 434
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 142/462 (30%), Positives = 208/462 (45%), Gaps = 85/462 (18%)
Query: 26 DNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKA--SLEVVSKYGPCSRLNQGISTHAP 83
D ++ H+VSV+SLLP VC T+ GP A SL VV ++GPCS L + + AP
Sbjct: 40 DGSETNWHVVSVNSLLPNTVCTSTK-----GPAAAPSSLTVVHRHGPCSPL-RSRGSGAP 93
Query: 84 SLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIV-VAIGEPK 142
S EILR+DQ R+ RK K + AN +++ Y+ + +G P
Sbjct: 94 SHTEILRRDQDRVDAIR----RKVTASSNKPKGGVSLLANWGKSLSTTNYVASLRLGTPA 149
Query: 143 QYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRIL---RESF 199
+ + LDTGSD +W QCKPC C++QRDP F + S T+ +PC + C+ L S
Sbjct: 150 TELVVELDTGSDQSWVQCKPCADCYEQRDPVFDPTASSTYSAVPCGARECQELASSSSSR 209
Query: 200 PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLGCINNSSGDK 258
+ N+K CP+ + Y D S + G A D +T+ + S P F+ GC ++++G
Sbjct: 210 NCSSDNNKNCPYEVSYDDDSHTVGDLARDTLTLSPSPSPSPADTVPGFVFGCGHSNAGTF 269
Query: 259 SGASGIMGLDRSPVSIITRTNTSY---FSYCL---PSPYGS------------------- 293
G++GL S+ ++ Y FSYCL PS G
Sbjct: 270 GEVDGLLGLGLGKASLPSQVAARYGAAFSYCLPSSPSAAGYLSFGGAAARANAQFTEMVT 329
Query: 294 -----------TGYITFGKTDTV---------------------------NTLRSAFHKR 315
TG + G+ V LRS+F
Sbjct: 330 GQDPTSYYLNLTGIVVAGRAIKVPASAFATAAGTIIDSGTAFSRLPPSAYAALRSSFRSA 389
Query: 316 MKKYKKAKGLED-LLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVAS-VSQVC 373
M +Y+ + + DTCYD + +ETV +P + + F G + L G L + V+Q C
Sbjct: 390 MGRYRYKRAPSSPIFDTCYDFTGHETVRIPAVELVFADGATVHLHPSGVLYTWNDVAQTC 449
Query: 374 LGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
L F P+ + LGN QQR V YDV +R+GFG C+
Sbjct: 450 LAFV---PNHDLGILGNTQQRTLAVIYDVGSQRIGFGRKGCA 488
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 121/327 (37%), Positives = 174/327 (53%), Gaps = 32/327 (9%)
Query: 5 SKAFLLFICLLCSSNNGAYADDNDL-SHSHIVSVSSLLPPNVCNRTRTALPQGPDKASLE 63
S AFLL + + A + SH H + +SSLLP + CN +G ASLE
Sbjct: 17 SSAFLLILLSFSVEKSHALETRETIESHFHTLQLSSLLPSSSCNPATKGKRRG---ASLE 73
Query: 64 VVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAF----- 118
VV++ GPC+ LNQ AP+L EIL DQ R+ +R + + F K+ +
Sbjct: 74 VVNRQGPCTLLNQK-GAKAPTLTEILAHDQARVDSIQARITDQSYDLFKKKDKKSSNKKK 132
Query: 119 -------TFPANINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIH-CFQQ 169
PA + YIV V +G PK+ +SL+ DTGSD+TWTQC+PC+ C+ Q
Sbjct: 133 SVKDSKANLPAQSGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQ 192
Query: 170 RDPFFYASKSKTFFKIPCNSTSCRILRESFPFGN---CNSKECPFNIQYADGSGSGGFWA 226
+ P F S SKT+ I C S +C L+ + GN C+S C + IQY D S + GF+A
Sbjct: 193 QQPIFDPSTSKTYSNISCTSAACSSLKSA--TGNSPGCSSSNCVYGIQYGDSSFTIGFFA 250
Query: 227 TDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNT---SYF 283
D++T+ + N F F+ GC N+ G +G++GL R P+SI+ +T YF
Sbjct: 251 KDKLTLTQ---NDVFDG--FMFGCGQNNKGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYF 305
Query: 284 SYCLPSPYGSTGYITFGKTDTVNTLRS 310
SYCLP+ GS G++TFG + V ++
Sbjct: 306 SYCLPTSRGSNGHLTFGNGNGVKASKA 332
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 54/109 (49%), Positives = 69/109 (63%), Gaps = 1/109 (0%)
Query: 307 TLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVV 366
+L+SAF + M KY A L LLDTCYDLS Y ++ +PKI+ +F G ++ELD G L+
Sbjct: 395 SLKSAFKQFMSKYPTAPALS-LLDTCYDLSNYTSISIPKISFNFNGNANVELDPNGILIT 453
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
SQVCL FA D + GN+QQ+ EV YDVAG +LGFG CS
Sbjct: 454 NGASQVCLAFAGNGDDDSIGIFGNIQQQTLEVVYDVAGGQLGFGYKGCS 502
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 147/481 (30%), Positives = 218/481 (45%), Gaps = 89/481 (18%)
Query: 4 LSKAFLLFICLLCSSNNGAYADDNDLSHSHIV----SVSSLLPPNVCNRTRTALPQGPDK 59
+ + FL I +LC N +A+ + S S V ++ + + K
Sbjct: 3 IMRNFLSMIIMLCVCLNWCFAEGAEKSDSGKVLDSYTIQVSSLFPSSSSCVPSSKASNTK 62
Query: 60 ASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFT 119
+SL VV +G CS L+ S +EI+R+DQ R+ S+ L K + ++
Sbjct: 63 SSLRVVHMHGACSHLS---SDARVDHDEIIRRDQARVESIYSK-LSKNSANEVSEAKSTE 118
Query: 120 FPANINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCI-HCFQQRDPFFYAS 177
PA T+ YIV + IG PK +SL+ DTGSD+TWTQC+PC+ C+ Q++P F S
Sbjct: 119 LPAKSGITLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPS 178
Query: 178 KSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEAN- 236
S T+ + C+S C +C++ C ++I Y D S + GF A ++ T+ ++
Sbjct: 179 SSSTYQNVSCSSPMCEDAE------SCSASNCVYSIGYGDKSFTQGFLAKEKFTLTNSDV 232
Query: 237 -SNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS-PY 291
+ YF GC N+ G G +G++GL +S+ +T T+Y FSYCLPS
Sbjct: 233 LEDVYF-------GCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTS 285
Query: 292 GSTGYITFG--------KTDTVNTLRSAFHKRMKKYKKAKGLEDL--------------- 328
STG++TFG K +++ SAF+ + + G ++L
Sbjct: 286 NSTGHLTFGSAGISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIID 345
Query: 329 -----------------------------------LDTCYDLSAYETVVVPKIAIHFLGG 353
DTCYD + +TV P IA F GG
Sbjct: 346 SGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGG 405
Query: 354 VDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
+ELD G + +SQVCL FA P GNVQQ +V YDVAG R+GF P
Sbjct: 406 TVVELDGSGISLPIKISQVCLAFAGNDDLP--AIFGNVQQTTLDVVYDVAGGRVGFAPNG 463
Query: 414 C 414
C
Sbjct: 464 C 464
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 130/392 (33%), Positives = 180/392 (45%), Gaps = 67/392 (17%)
Query: 89 LRQDQQRLHLKNSRRLRKPFPE-FLKRTEAFTFPANINDTVAD-EYYIVVAIGEPKQYVS 146
+ D +R+ SR + E +K ++ T PA + Y +VV +G PK+ +S
Sbjct: 1 MNLDNERVKYIQSRLSKNLGRENTVKDLDSTTLPAESGSLIGSANYVVVVGLGTPKRDLS 60
Query: 147 LLLDTGSDVTWTQCKPCI-HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCN 205
L+ DTGSD+TWTQC+PC C++Q+D F SKS ++ I C S+ C L C+
Sbjct: 61 LVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYTNITCTSSLCTQLTSDGIKSECS 120
Query: 206 SK---ECPFNIQYADGSGSGGFWATDR--------------------------------- 229
S C ++ +Y D S S GF + +R
Sbjct: 121 SSTDASCIYDAKYGDNSTSVGFLSQERLTITATDIVDDFLFGCGQDNEGLFNGSAGLMGL 180
Query: 230 ----ITIQEANSNGYFTRYPFLLGCINNSSGDKS-GASGIM--GLDRSPVSIITRTNTSY 282
I+I + S+ Y + + L ++S G + GAS L +P+S I+ N+ Y
Sbjct: 181 GRHPISIVQQTSSNYNKIFSYCLPATSSSLGHLTFGASAATNASLIYTPLSTISGDNSFY 240
Query: 283 F---------SYCLPSPYGST----------GYITFGKTDTV-NTLRSAFHKRMKKYKKA 322
LP+ ST G + TV LRSAF + M+KY A
Sbjct: 241 GLDIVSISVGGTKLPAVSSSTFSAGGSIIDSGTVITRLAPTVYAALRSAFRRXMEKYPVA 300
Query: 323 KGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPD 382
LLDTCYDLS Y+ + VP+I F GGV +EL RG L V S QVCL FA D
Sbjct: 301 N-EAGLLDTCYDLSGYKEISVPRIDFEFSGGVTVELXHRGILXVESEQQVCLAFAANGSD 359
Query: 383 PNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ GNVQQ+ EV YDV G R+GFG C
Sbjct: 360 NDITVFGNVQQKTLEVVYDVKGGRIGFGAAGC 391
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 142/464 (30%), Positives = 203/464 (43%), Gaps = 96/464 (20%)
Query: 33 HIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQD 92
H+VSV+ LLP VC ++ A ++ V+ ++GPCS L APS ++L QD
Sbjct: 61 HVVSVADLLPAAVCTASQAASNSS-SASAFSVMHRHGPCSPLQ--TPGDAPSDADLLDQD 117
Query: 93 QQRLH-----LKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIV-VAIGEPKQYVS 146
Q R+ + N P + PA +V Y+V V +G P + ++
Sbjct: 118 QARVDSILGMITNETSAVGP---------GVSLPAERGISVGTGNYVVSVGLGTPARDLT 168
Query: 147 LLLDTGSDVTWTQCKPCIH--CFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNC 204
++ DTGSD++W QC PC C++Q+DP F S S TF + C + CR R+S G+
Sbjct: 169 VVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPSDSSTFSAVRCGARECRA-RQSC-GGSP 226
Query: 205 NSKECPFNIQYADGSGSGGFWATDRITI---QEANSNGYF-TRYP-FLLGCINNSSGDKS 259
CP+ + Y D S + G D +T+ AN++ + P F+ GC N++G
Sbjct: 227 GDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAENDNKLPGFVFGCGENNTGLFG 286
Query: 260 GASGIMGLDRSPVSI--------------------------------------------I 275
A G+ GL R VS+ +
Sbjct: 287 QADGLFGLGRGKVSLSSQAAGKFGEGFSYCLPSSSSSAPGYLSLGTPVPAPAHAQFTPML 346
Query: 276 TRTNTSYFSY--------------------CLPSPYGSTGYITFGKTDTVNTLRSAFHKR 315
RT T F Y LP S IT LR+AF
Sbjct: 347 NRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALPLIVDSGTVITRLAPRAYRALRAAFLSA 406
Query: 316 MKKY--KKAKGLEDLLDTCYDLSAYE--TVVVPKIAIHFLGGVDLELDVRGTLVVASVSQ 371
M KY K+A L +LDTCYD +A+ TV +P +A+ F GG + +D G L VA V+Q
Sbjct: 407 MGKYGYKRAPRLS-ILDTCYDFTAHANATVSIPAVALVFAGGATISVDFSGVLYVAKVAQ 465
Query: 372 VCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
CL FA ++ LGN QQR V YDVA +++GF CS
Sbjct: 466 ACLAFAPNGDGRSAGILGNTQQRTLAVVYDVARQKIGFAAKGCS 509
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 146/481 (30%), Positives = 217/481 (45%), Gaps = 89/481 (18%)
Query: 4 LSKAFLLFICLLCSSNNGAYADDNDLSHSHIV----SVSSLLPPNVCNRTRTALPQGPDK 59
+ + FL I +LC N +A+ + S S V ++ + + K
Sbjct: 3 IMRNFLSMIIMLCVCLNWCFAEGAEKSDSGKVLDSYTIQVSSLFPSSSSCVPSSKASNTK 62
Query: 60 ASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFT 119
+SL VV +G CS L+ S +EI+R+DQ R+ S+ L K + ++
Sbjct: 63 SSLRVVHMHGACSHLS---SDARVDHDEIIRRDQARVESIYSK-LSKNSANEVSEAKSTE 118
Query: 120 FPANINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCI-HCFQQRDPFFYAS 177
PA T+ YIV + IG PK +SL+ DTGSD+TWTQC+PC+ C+ Q++P F S
Sbjct: 119 LPAKSGITLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPS 178
Query: 178 KSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEAN- 236
S T+ + C+S C +C++ C ++I Y D S + GF A ++ T+ ++
Sbjct: 179 SSSTYQNVSCSSPMCEDAE------SCSASNCVYSIVYGDKSFTQGFLAKEKFTLTNSDV 232
Query: 237 -SNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS-PY 291
+ YF GC N+ G G +G++GL +S+ +T T+Y FSYCLPS
Sbjct: 233 LEDVYF-------GCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTS 285
Query: 292 GSTGYITFG--------KTDTVNTLRSAFHKRMKKYKKAKGLEDL--------------- 328
STG++TFG K +++ SAF+ + + G ++L
Sbjct: 286 NSTGHLTFGSAGISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIID 345
Query: 329 -----------------------------------LDTCYDLSAYETVVVPKIAIHFLGG 353
DTCYD + +TV P IA F G
Sbjct: 346 SGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGS 405
Query: 354 VDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
+ELD G + +SQVCL FA P GNVQQ +V YDVAG R+GF P
Sbjct: 406 TVVELDGSGISLPIKISQVCLAFAGNDDLP--AIFGNVQQTTLDVVYDVAGGRVGFAPNG 463
Query: 414 C 414
C
Sbjct: 464 C 464
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 123/416 (29%), Positives = 189/416 (45%), Gaps = 89/416 (21%)
Query: 62 LEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFP 121
+ +V ++GPC+ +ST S +I R+ + R P ++ R + + P
Sbjct: 22 VPLVHRHGPCAPAPS-LSTDTRSFADIFRRSRAR-------------PSYIVRGKKVSVP 67
Query: 122 ANINDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCI--HCFQQRDPFFYASK 178
A++ +V EY + V+ G P +++DTGSDV+W QCKPC CF Q+DP + S
Sbjct: 68 AHLGTSVMSLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSH 127
Query: 179 SKTFFKIPCNSTSCRIL-RESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEAN- 236
S T+ +PC S C+ L +++ G + K+C F I YADG+ + G ++ D++T+
Sbjct: 128 SSTYSAVPCASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTLAPGAI 187
Query: 237 -SNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTG 295
N YF GC + + G++GL R S+ R FSYCLPS G
Sbjct: 188 VQNFYF-------GCGHGKHAVRGLFDGVLGLGRLRESLGARYG-GVFSYCLPSVSSKPG 239
Query: 296 YITFG-----------------------------------KTD----------------T 304
++ G K D
Sbjct: 240 FLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIVDSGTV 299
Query: 305 VNTLRSAFHKRMKK-YKKAKGLEDL-----LDTCYDLSAYETVVVPKIAIHFLGGVDLEL 358
+ L+S ++ ++ ++KA L LDTCY+L+ Y+ VVVPKIA+ F GG + L
Sbjct: 300 ITGLQSTAYRALRSAFRKAMEAYRLLPNGDLDTCYNLTGYKNVVVPKIALTFTGGATINL 359
Query: 359 DVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
DV ++V CL FA PD ++ LGNV QR EV +D + + GF C
Sbjct: 360 DVPNGILVNG----CLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 411
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 131/428 (30%), Positives = 191/428 (44%), Gaps = 89/428 (20%)
Query: 59 KASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFL-KRTEA 117
A L + K+GPC+ ++ S PS+ + LR DQ+R R + P+ + EA
Sbjct: 64 SAVLRLTHKHGPCAP-SRASSLATPSVADTLRADQRRAEYILRRVSGRGTPQLWDSKAEA 122
Query: 118 FT--FPANINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIH--CFQQRDP 172
T PAN + Y+V V++G P +L +DTGSD++W QC PC C+ Q+DP
Sbjct: 123 ATATVPANWGFNIGTLNYVVTVSLGTPGVAQTLEVDTGSDLSWVQCTPCAAPACYSQKDP 182
Query: 173 FFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITI 232
F ++S ++ +PC C L +C++ +C + + Y DGS + G +++D +T+
Sbjct: 183 LFDPAQSSSYAAVPCGGPVCGGL--GIYASSCSAAQCGYVVSYGDGSKTTGVYSSDTLTL 240
Query: 233 QEANS-NGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLP 288
++ G+F GC + SG +G G++GL R S++ +T +Y FSYCLP
Sbjct: 241 SPNDAVRGFF------FGCGHAQSG-FTGNDGLLGLGREEASLVEQTAGTYGGVFSYCLP 293
Query: 289 -----------------------------SPYGSTGYI---------------------- 297
SP +T Y+
Sbjct: 294 TRPSTTGYLTLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVFAG 353
Query: 298 -TFGKTDTVNT---------LRSAFHKRMKKYK-KAKGLEDLLDTCYDLSAYETVVVPKI 346
T T TV T LRSAF M Y + +LDTCY+ S Y TV +P +
Sbjct: 354 GTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPSAPATGILDTCYNFSGYGTVTLPNV 413
Query: 347 AIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRR 406
A+ F GG + L G L S CL FA D LGNVQQR EV D G
Sbjct: 414 ALTFSGGATVTLGADGIL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTS 466
Query: 407 LGFGPGNC 414
+GF P +C
Sbjct: 467 VGFKPSSC 474
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 109/306 (35%), Positives = 169/306 (55%), Gaps = 32/306 (10%)
Query: 7 AFLLFICLLCSSNNGAY--ADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEV 64
+F+++ LL S N AD+ ++ H + +SSL VC + AL +G +SL++
Sbjct: 8 SFVIYGFLLLSPCNSLKDNADEGTRAYFHTLKISSLPSTEVCKESSKALNEG--SSSLKL 65
Query: 65 VSKYGPCSRLNQGISTHAPSLEEILRQDQQR----LHLKNSRRLRKPFPEFLKRTEAFTF 120
V ++GPC+ ++ + A S EILR+D+ R + + S L E +K + F
Sbjct: 66 VHRFGPCNP-HRTSTAPASSFNEILRRDKLRVDSIIQARRSMNLTSSV-EHMKSSVPF-- 121
Query: 121 PANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSK 180
++ A +Y + V IG PK+ + L+ DTGS + WTQCKPC C+ + P F +KS
Sbjct: 122 -YGLSKITASDYIVNVGIGTPKKEMPLIFDTGSGLIWTQCKPCKACYPKV-PVFDPTKSA 179
Query: 181 TFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGY 240
+F +PC+S C+ +R+ C+S +C + Y D S S G AT+ I+
Sbjct: 180 SFKGLPCSSKLCQSIRQ-----GCSSPKCTYLTAYVDNSSSTGTLATETISFSH------ 228
Query: 241 FTRYPF---LLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGST 294
+Y F L+GC + SG+ G SGIMGL+RSP+S+ ++T Y FSYC+PS GST
Sbjct: 229 -LKYDFKNILIGCSDQVSGESLGESGIMGLNRSPISLASQTANIYDKLFSYCIPSTPGST 287
Query: 295 GYITFG 300
G++TFG
Sbjct: 288 GHLTFG 293
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 108/297 (36%), Positives = 169/297 (56%), Gaps = 23/297 (7%)
Query: 14 LLCSSNNGAYADDNDLSHS--HIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPC 71
LL S G ++N+ + S HI+ V+SLLP CN + + SLEVV ++GPC
Sbjct: 4 LLFSLEKGYAVEENEATKSYLHIIKVNSLLPTTACNHSSKV----SNSLSLEVVHRHGPC 59
Query: 72 -SRLNQGISTHAPSLEEILRQDQQRLHLKNSR-RLRKPFPEFLKRTEAFTFPANINDTV- 128
+NQ APS EI +DQ R+ ++R R FPE +A T P ++
Sbjct: 60 IGIVNQEKGADAPSNMEIFLRDQNRVDSIHARLSSRGMFPE----KQATTLPVQSGASIG 115
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIH-CFQQRDPFFYASKSKTFFKIPC 187
A +Y + V +G PK+ +L+ DTGSD+TWTQC+PC+ C++Q++P S S ++ I C
Sbjct: 116 AGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISC 175
Query: 188 NSTSCRILRESFPFG-NCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPF 246
+S C+++ F +C+S C + +QY DGS S GF+AT+ +T+ +N F
Sbjct: 176 SSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSN-----VFKNF 230
Query: 247 LLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFG 300
L GC ++G GA+G++GL R+ +++ ++T +Y FSYCLP+ S GY++ G
Sbjct: 231 LFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPASSSSKGYLSLG 287
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 50/109 (45%), Positives = 66/109 (60%), Gaps = 2/109 (1%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-V 366
L SAF M Y G + DTCYD S Y+TV +PK+ + F GGV++++DV G L V
Sbjct: 353 LSSAFQNLMTDYPSTSGYS-IFDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPV 411
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ +VCL FA D ++ GNVQQR ++V YD A R+GF PG CS
Sbjct: 412 NGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 460
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 164 bits (414), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 131/468 (27%), Positives = 195/468 (41%), Gaps = 98/468 (20%)
Query: 34 IVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQ 93
++SV+SL P C T P A + +V ++GPCS L P+ +EIL DQ
Sbjct: 43 LLSVASLFPGPACPATAEHGPSAAASARMRIVHQHGPCSPLAD-AHGKPPAHDEILAADQ 101
Query: 94 QRLHLKNSR--------RLRK---PFPEFLKRTEAF-----------TFPANINDTVADE 131
R+ R +L K P K++ + PA V+
Sbjct: 102 NRVESIQRRVSATTGRDKLTKHAAPVQPGPKKSPGIHPGHSASSSTPSLPATSGRAVSTG 161
Query: 132 YYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNS 189
Y+V V +G P +++ DTGSD TW QC+PC + C++Q++P F +KS T+ + C
Sbjct: 162 NYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEPLFDPAKSSTYANVSCTD 221
Query: 190 TSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
++C L + C C + +QY DGS + GF+A D +TI G F G
Sbjct: 222 SACADLDTN----GCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHDAIKG------FRFG 271
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDTVN 306
C ++G +G+MGL R S+ + Y F+YCLP+ TGY+ FG N
Sbjct: 272 CGEKNNGLFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTGTGYLDFGPGSAGN 331
Query: 307 TLR-------------------------------SAFHKRMKKYKKAKGLEDLLDTCYDL 335
R S F + L T Y
Sbjct: 332 NARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPATAYTA 391
Query: 336 --SAYETVVV-------PKIAI-----HFLGGVDLEL---------------DVRGTLVV 366
SA++ V++ P +I F G D+EL DV G +
Sbjct: 392 LSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVYA 451
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
S +QVCL FA+ D + +GN QQ+ + V YD+ + +GF PG+C
Sbjct: 452 ISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 131/468 (27%), Positives = 194/468 (41%), Gaps = 98/468 (20%)
Query: 34 IVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQ 93
++SV+SL P C T P A + +V ++GPCS L P+ +EIL DQ
Sbjct: 43 LLSVASLFPGPACPATAEHGPSAAASARMRIVHQHGPCSPLAD-AHGKPPAHDEILAADQ 101
Query: 94 QRLHLKNSR--------RLRK---PFPEFLKRTEAF-----------TFPANINDTVADE 131
R+ R +L K P K++ + PA V+
Sbjct: 102 NRVESIQRRVSATTGRDKLTKHAAPVQPGPKKSPGIHPGHSASSSTPSLPATSGRAVSTG 161
Query: 132 YYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNS 189
Y+V V +G P +++ DTGSD TW QC+PC + C++Q+ P F +KS T+ + C
Sbjct: 162 NYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKGPLFDPAKSSTYANVSCTD 221
Query: 190 TSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
++C L + C C + +QY DGS + GF+A D +TI G F G
Sbjct: 222 SACADLDTN----GCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHDAIKG------FRFG 271
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDTVN 306
C ++G +G+MGL R S+ + Y F+YCLP+ TGY+ FG N
Sbjct: 272 CGEKNNGLFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTGTGYLDFGPGSAGN 331
Query: 307 TLR-------------------------------SAFHKRMKKYKKAKGLEDLLDTCYDL 335
R S F + L T Y
Sbjct: 332 NARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPATAYTA 391
Query: 336 --SAYETVVV-------PKIAI-----HFLGGVDLEL---------------DVRGTLVV 366
SA++ V++ P +I F G D+EL DV G +
Sbjct: 392 LSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVYA 451
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
S +QVCL FA+ D + +GN QQ+ + V YD+ + +GF PG+C
Sbjct: 452 ISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 105/290 (36%), Positives = 166/290 (57%), Gaps = 23/290 (7%)
Query: 21 GAYADDNDLSHS--HIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPC-SRLNQG 77
G ++N+ + S HI+ V+SLLP CN + + SLEVV ++GPC +NQ
Sbjct: 23 GYAVEENEATKSYLHIIKVNSLLPTTACNHSSKV----SNSLSLEVVHRHGPCIGIVNQE 78
Query: 78 ISTHAPSLEEILRQDQQRLHLKNSR-RLRKPFPEFLKRTEAFTFPANINDTV-ADEYYIV 135
APS EI +DQ R+ ++R R FPE +A T P ++ A +Y +
Sbjct: 79 KGADAPSNMEIFLRDQNRVDSIHARLSSRGMFPE----KQATTLPVQSGASIGAGDYVVT 134
Query: 136 VAIGEPKQYVSLLLDTGSDVTWTQCKPCIH-CFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
V +G PK+ +L+ DTGSD+TWTQC+PC+ C++Q++P S S ++ I C+S C++
Sbjct: 135 VGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSALCKL 194
Query: 195 LRESFPFG-NCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINN 253
+ F +C+S C + +QY DGS S GF+AT+ +T+ +N FL GC
Sbjct: 195 VASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSN-----VFKNFLFGCGQQ 249
Query: 254 SSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFG 300
++G GA+G++GL R+ +++ ++T +Y FSYCLP+ S GY++ G
Sbjct: 250 NNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPASSSSKGYLSLG 299
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 50/109 (45%), Positives = 66/109 (60%), Gaps = 2/109 (1%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-V 366
L SAF M Y G + DTCYD S Y+TV +PK+ + F GGV++++DV G L V
Sbjct: 365 LSSAFQNLMTDYPSTSGYS-IFDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPV 423
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ +VCL FA D ++ GNVQQR ++V YD A R+GF PG CS
Sbjct: 424 NGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 472
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 124/443 (27%), Positives = 192/443 (43%), Gaps = 101/443 (22%)
Query: 62 LEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQR---LHLKNS-------RRLRKPFPEF 111
+ +V ++GPCS L PS +EIL DQ R +H + S + R+P P
Sbjct: 90 MTIVHRHGPCSPLADA-HGKPPSHDEILAADQNRVESIHHRVSTTATVRGKPKRRPSPSR 148
Query: 112 LKRTEAFTFPANINDTV-------------ADEYYIVVAIGEPKQYVSLLLDTGSDVTWT 158
++ + PA + Y + + +G P +++ DTGSD TW
Sbjct: 149 RQQQPSAPAPAASLSSSTASLPASSGRALGTGNYVVTIGLGTPASRYTVVFDTGSDTTWV 208
Query: 159 QCKPCIH-CFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYAD 217
QC+PC+ C++Q++ F ++S T+ + C + +C L C+ C +++QY D
Sbjct: 209 QCQPCVVVCYKQQEKLFDPARSSTYANVSCAAPACSDLYTR----GCSGGHCLYSVQYGD 264
Query: 218 GSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDR-------- 269
GS S GF+A D +T+ Y F GC + G A+G++GL R
Sbjct: 265 GSYSIGFFAMDTLTLSS-----YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQ 319
Query: 270 ----------------------------SPVSIITRTNTSYFSYCLPSPY--GSTGYITF 299
SP ++ R T + P+ Y G TG
Sbjct: 320 TYDKYGGVFAHCLPARSSGTGYLDFGPGSPAAVGARQTTPMLTDNGPTFYYVGMTGIRVG 379
Query: 300 GKTDTV--------------------------NTLRSAFHKRM--KKYKKAKGLEDLLDT 331
G+ ++ ++LRSAF M + YKKA L LLDT
Sbjct: 380 GQLLSIPQSVFSTAGTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPALS-LLDT 438
Query: 332 CYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNV 391
CYD + V +PK+++ F GG L+++ G + AS+SQVCLGFA D + +GN
Sbjct: 439 CYDFTGMSEVAIPKVSLLFQGGAYLDVNASGIMYAASLSQVCLGFAANEDDDDVGIVGNT 498
Query: 392 QQRGHEVHYDVAGRRLGFGPGNC 414
Q + V YD+ + +GF PG C
Sbjct: 499 QLKTFGVVYDIGKKTVGFSPGAC 521
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 105/277 (37%), Positives = 155/277 (55%), Gaps = 16/277 (5%)
Query: 31 HSHI-VSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEIL 89
H+H + ++SLLP C + T +P +KA L+VV K+GPCS L QG H + IL
Sbjct: 54 HTHTTIHLTSLLPAASC-KPSTQVPSIENKAFLKVVHKHGPCSDLRQG---HKAEAQYIL 109
Query: 90 RQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVAD-EYYIVVAIGEPKQYVSLL 148
QDQ R+ +S+ + +K T A T PA + Y++ V +G PK+ SL+
Sbjct: 110 LQDQSRVDSIHSKLSKDSGLSDVKATAATTLPAKDGSIIGSGNYFVTVGLGTPKKDFSLI 169
Query: 149 LDTGSDVTWTQCKPCIH-CFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFP-FGNCNS 206
DTGSD+TWTQC+PC+ C+ Q++ F S+S ++ I C ST C L + NC S
Sbjct: 170 FDTGSDLTWTQCEPCVKSCYNQKEAIFNPSQSTSYANISCGSTLCDSLASATGNIFNCAS 229
Query: 207 KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMG 266
C + IQY D S S GF+ +++++ + F GC N+ G GA+G++G
Sbjct: 230 STCVYGIQYGDSSFSIGFFGKEKLSLTATD-----VFNDFYFGCGQNNKGLFGGAAGLLG 284
Query: 267 LDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFG 300
L R +S++++T Y FSYCLPS STG++TFG
Sbjct: 285 LGRDKLSLVSQTAQRYNKIFSYCLPSSSSSTGFLTFG 321
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 61/164 (37%), Positives = 82/164 (50%), Gaps = 13/164 (7%)
Query: 259 SGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFGKT-------DTVNTLRSA 311
SG S GLD + +S+ R PS + + G I T + L S
Sbjct: 337 SGGSSFYGLDLTGISVGGRKLA-----ISPSVFSTAGTIIDSGTVITRLPPAAYSALSST 391
Query: 312 FHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQ 371
F K M +Y A L +LDTC+D S ++T+ VPKI + F GGV +++D G V ++Q
Sbjct: 392 FRKLMSQYPAAPALS-ILDTCFDFSNHDTISVPKIGLFFSGGVVVDIDKTGIFYVNDLTQ 450
Query: 372 VCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
VCL FA + GNVQQ+ EV YD A R+GF P CS
Sbjct: 451 VCLAFAGNSDASDVAIFGNVQQKTLEVVYDGAAGRVGFAPAGCS 494
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 136/480 (28%), Positives = 207/480 (43%), Gaps = 89/480 (18%)
Query: 9 LLFICLLCSSNNGAYADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKY 68
LL + +LCS N+ + +V S VC+ ++ L S+ +V +Y
Sbjct: 5 LLLLVVLCSYCCYIALGGNEHGFA-VVQRRSYDSETVCSASKVNLEPSSATVSMSLVHRY 63
Query: 69 GPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRT-----EAFTFPAN 123
GPC+ +Q + PS+ E LR+ + R + S+ K + T A T P
Sbjct: 64 GPCAP-SQYSNVPTPSISETLRRSRARTNYIMSQA-SKSMGMGMASTPDDDDAAVTIPTR 121
Query: 124 INDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC--IHCFQQRDPFFYASKSK 180
+ V EY + + G P LL+DTGSDV+W QC PC C+ Q+DP F SKS
Sbjct: 122 LGGFVDSLEYVVTLGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTKCYPQKDPLFDPSKSS 181
Query: 181 TFFKIPCNSTSCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSN 238
T+ I CN+ +CR L + + G C S +C ++++YADGS S G ++ + +T+
Sbjct: 182 TYAPIACNTDACRKLGDHYHNG-CTSGGTQCGYSVEYADGSHSRGVYSNETLTLAPG--- 237
Query: 239 GYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTG 295
T F GC + G G++GL +PVS++ +T++ Y FSYCLP+ G
Sbjct: 238 --ITVEDFHFGCGRDQRGPSDKYDGLLGLGGAPVSLVVQTSSVYGGAFSYCLPALNSEAG 295
Query: 296 YITFGKTDTVNTLRSAF-----------------------------HKRMKKYKKAKGLE 326
++ G + N +SAF H ++ ++
Sbjct: 296 FLVLGSPPSGN--KSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQSAFRGGMIID 353
Query: 327 ------DLLDTCYD---------LSAYETV-----------------VVPKIAIHFLGGV 354
+L +T Y+ L AY V VP++A F GG
Sbjct: 354 SGTVDTELPETAYNALEAALRKALKAYPLVPSDDFDTCYNFTGYSNITVPRVAFTFSGGA 413
Query: 355 DLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
++LDV ++V CL F PD +GNV QR EV YD +GF G C
Sbjct: 414 TIDLDVPNGILVND----CLAFQESGPDDGLGIIGNVNQRTLEVLYDAGRGNVGFRAGAC 469
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 124/439 (28%), Positives = 184/439 (41%), Gaps = 99/439 (22%)
Query: 62 LEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRL----HLKNSRRLRKPFPEFLKRTEA 117
+ +V ++GPCS L S PS +EIL DQ R H ++ + P+ +R +
Sbjct: 91 MTIVHRHGPCSPLAAAHS-KPPSHDEILAADQNRAESIQHRVSTTATSRGQPKRSRRQQP 149
Query: 118 FTF-----------------PANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQC 160
+ P T Y + V +G P +++ DTGSD TW QC
Sbjct: 150 SSAPAPAASLSSSTASLPASPGRALGT--GNYVVTVGLGTPASRYTVVFDTGSDTTWVQC 207
Query: 161 KPCIH-CFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGS 219
+PC+ C++QR+ F ++S T+ + C + +C L C+ C + +QY DGS
Sbjct: 208 QPCVVVCYEQREKLFDPARSSTYANVSCAAPACSDLDTR----GCSGGHCLYGVQYGDGS 263
Query: 220 GSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRT- 278
S GF+A D +T+ Y F GC + G A+G++GL R S+ +T
Sbjct: 264 YSIGFFAMDTLTLSS-----YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTY 318
Query: 279 ----------------NTSYFSYCLPSP-------------------YGSTG-------- 295
T Y + SP G TG
Sbjct: 319 DKYGGVFAHCLPARSTGTGYLDFGAGSPAARLTTTPMLVDNGPTFYYVGLTGIRVGGRLL 378
Query: 296 YI---TFGKTDTV---------------NTLRSAFHKRM--KKYKKAKGLEDLLDTCYDL 335
YI F T+ ++LRSAF M + YKKA + LLDTCYD
Sbjct: 379 YIPQSVFATAGTIVDSGTVITRLPPAAYSSLRSAFAAAMSARGYKKAPAVS-LLDTCYDF 437
Query: 336 SAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRG 395
+ V +P +++ F GG L++D G + AS SQVCL FA + +GN Q +
Sbjct: 438 AGMSQVAIPTVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKT 497
Query: 396 HEVHYDVAGRRLGFGPGNC 414
V YD+ + + F PG C
Sbjct: 498 FGVAYDIGKKVVSFSPGAC 516
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 125/424 (29%), Positives = 187/424 (44%), Gaps = 77/424 (18%)
Query: 59 KASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRL-----HLKNSRRLRKPFPEFLK 113
+ + +V ++GPCS L PS EEIL DQ R + + + + P K
Sbjct: 86 RTRMPIVHRHGPCSPLADAHDGKLPSHEEILAADQNRAKSIQRRVSTTTTVSRGKP---K 142
Query: 114 RTEAFTFPANINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIH-CFQQRD 171
R + PA+ + Y+V + +G P +++ DTGSD TW QC+PC+ C++Q++
Sbjct: 143 RNRP-SLPASSGSALGTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQE 201
Query: 172 PFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRIT 231
F ++S T+ I C + +C L C+ C + +QY DGS S GF+A D +T
Sbjct: 202 KLFDPARSSTYANISCAAPACSDLY----IKGCSGGHCLYGVQYGDGSYSIGFFAMDTLT 257
Query: 232 IQEANS------------NGYFTRYPFLLGCINNSSG------DKSG----------ASG 263
+ ++ G + LLG + DK G +SG
Sbjct: 258 LSSYDAIKGFRFGCGERNEGLYGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSG 317
Query: 264 IMGLDRSPVS---IITRTNTSYFSYCLPSPY--GSTGYITFGKTDTV------------- 305
LD P S + + T P+ Y G TG GK ++
Sbjct: 318 TGYLDFGPGSLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFTTSGTIVD 377
Query: 306 -------------NTLRSAFHKRM--KKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHF 350
++LRSAF M + YKKA L LLDTCYD + V +P +++ F
Sbjct: 378 SGTVITRLPPAAYSSLRSAFASAMAERGYKKAPALS-LLDTCYDFTGMSEVAIPTVSLLF 436
Query: 351 LGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFG 410
GG L++ G + ASVSQ CLGFA D + +GN Q + V YD+ + +GF
Sbjct: 437 QGGASLDVHASGIIYAASVSQACLGFAGNKEDDDVGIVGNTQLKTFGVVYDIGKKVVGFC 496
Query: 411 PGNC 414
PG C
Sbjct: 497 PGAC 500
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 135/456 (29%), Positives = 201/456 (44%), Gaps = 92/456 (20%)
Query: 35 VSVSSLLPPNVCNRTRTALPQ--GPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQD 92
VS +S +P + C+ PQ A L + ++GPC+ ++ S APS+ + LR D
Sbjct: 39 VSAASFVPSSTCSSPDPVPPQRRNGTSAVLRLTHRHGPCAP-SRASSLAAPSVADTLRAD 97
Query: 93 QQRLHLKNSRRLRKPFPEFLKR---TEAFTFPANINDTVADEYYIVVA-IGEPKQYVSLL 148
Q+R RR+ P+ A T PA+ + Y+V A +G P ++
Sbjct: 98 QRRAEYIL-RRVSGRAPQLWDSKAAAAAATVPASWGYDIGTLNYVVTASLGTPGVAQTME 156
Query: 149 LDTGSDVTWTQCKPCI---HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCN 205
+DTGSD++W QCKPC C+ Q+DP F ++S ++ +PC C L + C+
Sbjct: 157 VDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGI-YAASACS 215
Query: 206 SKECPFNIQYADGSGSGGFWATDRITIQEANS-NGYFTRYPFLLGCINNSSGDKSGASGI 264
+ +C + + Y DGS + G +++D +T+ +++ G+F GC + SG +G G+
Sbjct: 216 AAQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFF------FGCGHAQSGLFNGVDGL 269
Query: 265 MGLDRSPVSIITRTNTSY---FSYCL------------------------------PSPY 291
+GL R S++ +T +Y FSYCL PSP
Sbjct: 270 LGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGLGGPSGAAPGFSTTQLLPSPN 329
Query: 292 GSTGYI-----------------------TFGKTDTVNT---------LRSAFHKRMKKY 319
T Y+ T T TV T LRSAF M Y
Sbjct: 330 APTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVITRLPPTAYAALRSAFRSGMASY 389
Query: 320 KKAKGLED-LLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFAT 378
+ +LDTCY+ + Y TV +P +A+ F G + L G L S CL FA
Sbjct: 390 GYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVMLGADGIL-----SFGCLAFAP 444
Query: 379 YPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
D LGNVQQR EV D G +GF P +C
Sbjct: 445 SGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 478
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 133/469 (28%), Positives = 202/469 (43%), Gaps = 96/469 (20%)
Query: 10 LFICLLCSSNNGAYADDNDLSHSHIVSV--SSLLPPNVCNRTRTALPQGPDKASLEVVSK 67
+F+C S NGA + V+V SS +P VC+ Q + ++ +
Sbjct: 9 IFLCFYLSIVNGA-------GNGSFVTVPSSSFVPDTVCSGALVKPEQNGSAVYVPLLHR 61
Query: 68 YGPCSRLNQGISTHAP-SLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANIND 126
+GPC+ +ST P S+ E+ R+ RL ++ + + PA++
Sbjct: 62 HGPCA---PSLSTDTPPSMSEMFRRSHARL-------------SYIVSGKKVSVPAHLGT 105
Query: 127 TVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCI--HCFQQRDPFFYASKSKTFF 183
+V EY V+ G P +++DTGSD+TW QCKPC C Q+DP F S S T+
Sbjct: 106 SVKSLEYVATVSFGTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKDPLFDPSHSSTYS 165
Query: 184 KIPCNSTSCRIL-RESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFT 242
+PC S C+ L +++ G N + C F I Y DG+ + G + D++T+ G
Sbjct: 166 AVPCASGECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGVYGKDKLTLAP----GAIV 221
Query: 243 RYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITR-TNTSYFSYCLPSPYGSTGYITFG- 300
+ F GC ++ S G++GL R S+ + FSYCLP+ G++ FG
Sbjct: 222 K-DFYFGCGHSKSSLPGLFDGLLGLGRLSESLGAQYGGGGGFSYCLPAVNSKPGFLAFGA 280
Query: 301 ----------------------------------KTD----------------TVNTLRS 310
K D V L+S
Sbjct: 281 GRNPSGFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAFSGGMIVDSGTVVTVLQS 340
Query: 311 AFHKRMK-----KYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
++ ++ K + + LDTCYDL+ Y+ VVVPKIA+ F GG + LDV ++
Sbjct: 341 TVYRALRAAFREAMKAYRLVHGDLDTCYDLTGYKNVVVPKIALTFSGGATINLDVPNGIL 400
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V CL FA D + LGNV QR EV +D + + GF C
Sbjct: 401 VNG----CLAFAETGKDGTAGVLGNVNQRTFEVLFDTSASKFGFRAKAC 445
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 154 bits (390), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 131/456 (28%), Positives = 198/456 (43%), Gaps = 92/456 (20%)
Query: 35 VSVSSLLPPNVCNRTRTALPQ--GPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQD 92
VS +S +P + C+ PQ A L + ++GPC+ ++ S APS+ + LR D
Sbjct: 39 VSAASFVPSSTCSSPDPVPPQRRNGTSAVLRLTHRHGPCAP-SRASSLAAPSVADTLRAD 97
Query: 93 QQRLHLKNSRRLRKPFPEFLKR---TEAFTFPANINDTVADEYYIVVA-IGEPKQYVSLL 148
Q+R RR+ P+ A T PA+ + Y+V A +G P ++
Sbjct: 98 QRRAEYIL-RRVSGRAPQLWDSKAAAAAATVPASWGYDIGTLNYVVTASLGTPGVAQTME 156
Query: 149 LDTGSDVTWTQCKPCI---HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCN 205
+DTGSD++W QCKPC C+ Q+DP F ++S ++ +PC C L + C+
Sbjct: 157 VDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGI-YAASACS 215
Query: 206 SKECPFNIQYADGSGSGGFWATDRITIQEANS-NGYFTRYPFLLGCINNSSGDKSGASGI 264
+ +C + + Y DGS + G +++D +T+ +++ G+F GC + SG +G G+
Sbjct: 216 AAQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFF------FGCGHAQSGLFNGVDGL 269
Query: 265 MGLDRSPVSIITRTNTSY---FSYC------------------------------LPSPY 291
+GL R S++ +T +Y FSYC LPSP
Sbjct: 270 LGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPN 329
Query: 292 GSTGYITFGKTDTVN--------------------------------TLRSAFHKRMKKY 319
T Y+ +V LRSAF M Y
Sbjct: 330 APTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPTAYAALRSAFRSGMASY 389
Query: 320 KKAKGLED-LLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFAT 378
+ +LDTCY+ + Y TV +P +A+ F G + L G L S CL FA
Sbjct: 390 GYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL-----SFGCLAFAP 444
Query: 379 YPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
D LGNVQQR EV D G +GF P +C
Sbjct: 445 SGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 478
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 154 bits (389), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 106/307 (34%), Positives = 154/307 (50%), Gaps = 26/307 (8%)
Query: 2 WILSKAFLLFICLLCSSNNGAYADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKAS 61
W+L+ A L+ L GA A + + H+VSV+SLLP VC T+ A P ++
Sbjct: 10 WLLA-ASLVLATLASPHRLGAAAGEGSETKWHVVSVNSLLPSTVCTPTKAA----PSSSA 64
Query: 62 LEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFP 121
L VV +GPCS Q APS EIL +DQ R+ RR ++ P
Sbjct: 65 LTVVHGHGPCSP--QESRRGAPSHTEILGRDQDRVDAI--RRKVAAVTTAASSSKPKGVP 120
Query: 122 ANIN-----DTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYA 176
+ DT Y+ + +G P + + LDTGSD +W QCKPC C++Q + F
Sbjct: 121 LQVGWGKYLDTT--NYFTSLRLGTPATDLLVELDTGSDQSWIQCKPCPDCYEQHEALFDP 178
Query: 177 SKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEAN 236
SKS T+ I C+S C+ L S + K+CP+ I YAD S + G A D +T+ +
Sbjct: 179 SKSSTYSDITCSSRECQELGSSHKHNCSSDKKCPYEITYADDSYTVGNLARDTLTLSPTD 238
Query: 237 SNGYFTRYP-FLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYG 292
+ P F+ GC +N++G G++GL R S+ ++ Y FSYCLPS
Sbjct: 239 A------VPGFVFGCGHNNAGSFGEIDGLLGLGRGKASLSSQVAARYGAGFSYCLPSSPS 292
Query: 293 STGYITF 299
+TGY++F
Sbjct: 293 ATGYLSF 299
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 46/109 (42%), Positives = 61/109 (55%), Gaps = 2/109 (1%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
LRS+ M +YK+A + DTCYDL+ +ETV +P +A+ F G + L G L
Sbjct: 370 LRSSVRSAMGRYKRAPS-STIFDTCYDLTGHETVRIPSVALVFADGATVHLHPSGVLYTW 428
Query: 368 S-VSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
S VSQ CL F P D + LGN QQR V YDV +++GFG C+
Sbjct: 429 SNVSQTCLAFLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGFGANGCA 477
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 128/455 (28%), Positives = 192/455 (42%), Gaps = 83/455 (18%)
Query: 27 NDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLE 86
+D +V+ SSL P VC+ + + + A+L +V ++GPCS + +S PS E
Sbjct: 28 DDAQRYMVVASSSLEPSEVCSGQK--VTSSKNGATLPLVHRHGPCSPV---MSKEKPSHE 82
Query: 87 EILRQDQQR---LHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVAD-EYYIVVAIGEPK 142
E L +DQ R +H K S E + T P + ++ EY I V++G P
Sbjct: 83 ETLGRDQLRAANIHAKLSSPRNSSAKEL--QQSGVTIPTSSGYSLGTPEYVITVSLGTPA 140
Query: 143 QYVSLLLDTGSDVTWTQCKPCI--HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFP 200
+ +DTGSDV+W QC PC C Q+D F +KS T+ C+S C L
Sbjct: 141 VTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAKSATYSAFSCSSAQCAQLGGE-- 198
Query: 201 FGN-CNSKECPFNIQYADGSGSGGFWATDRITIQEAN------------SNGYFTRYPFL 247
GN C + C + ++Y D S + G + +D + + ++ +NG+ + L
Sbjct: 199 -GNGCLNSHCQYIVKYVDHSNTTGTYGSDTLGLTTSDAVKNFQFGCSHRANGFVGQLDGL 257
Query: 248 LGCINNS---------------------SGDKSGASGIMGLDRSPVSIITRTNTSYFSYC 286
+G ++ S +G +G S + T +
Sbjct: 258 MGLGGDTESLVSQTAATYGKAFSYCLPPSSSSAGGFLTLGAAAGGTSSSRYSRTPLVRFN 317
Query: 287 LPSPYG---------------------------STGYITFGKTDTVNTLRSAFHKRMKKY 319
+P+ YG S IT LR+AF K MK Y
Sbjct: 318 VPTFYGVFLQAITVAGTKLNVPASVFSGASVVDSGTVITQLPPTAYQALRTAFKKEMKAY 377
Query: 320 KKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATY 379
A + +LDTC+D S +TV VP + + F G ++LDV G CL F
Sbjct: 378 PSAAPV-GILDTCFDFSGIKTVRVPVVTLTFSRGAVMDLDVSGIFYAG-----CLAFTAT 431
Query: 380 PPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
D ++ LGNVQQR E+ +DV G LGF PG C
Sbjct: 432 AQDGDTGILGNVQQRTFEMLFDVGGSTLGFRPGAC 466
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 120/418 (28%), Positives = 180/418 (43%), Gaps = 102/418 (24%)
Query: 88 ILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVAD-EYYIVVAIGEPKQYVS 146
ILR+D+ R+ R + + T T PA + EY + + IG P + +
Sbjct: 82 ILRRDRHRV-----RSIYRRLTAAETTTTTTTIPARLGLAFQSLEYVVTIGIGTPPRNFT 136
Query: 147 LLLDTGSDVTWTQCKPC--IHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNC 204
+L DTGSD+TW QC PC C+ Q++P F SKS T+ +PC++ C I C
Sbjct: 137 VLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSSTYVDVPCSAPECHI--GGVQQTRC 194
Query: 205 NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC------INNSSGDK 258
+ C ++++Y D S + G A + T+ + + GC + N +G
Sbjct: 195 GATSCEYSVKYGDESETHGSLAEETFTLSPPSPLAP-AATGVVFGCSHEYISVFNDTG-- 251
Query: 259 SGASGIMGLDRSPVSIITRTNTS------YFSYCLPSPYGSTGYITFGK----------- 301
G +G++GL R SI+++T S FSYCLP STGY+T G
Sbjct: 252 MGVAGLLGLGRGDSSILSQTRRSINSGGGVFSYCLPPRGSSTGYLTIGGGAAAPQQQYSN 311
Query: 302 ------TDTVNTLRSA-------------------------------------------- 311
T++ LRSA
Sbjct: 312 LSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSLGAVIDSGTVVTHMPAAAYYP 371
Query: 312 ----FHKRMKKYKK-AKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVV 366
F M YK +G LLDTCYD++ + V P++A+ F GG +++D G L+V
Sbjct: 372 LRDEFRLHMGSYKMLPEGSMKLLDTCYDVTGQDVVTAPRVALEFGGGARIDVDASGILLV 431
Query: 367 --------ASVSQVCLGFATYPPDPNSITL-GNVQQRGHEVHYDVAGRRLGFGPGNCS 415
S++ CL F P + + + GN+QQR + V +DV G R+GFGP CS
Sbjct: 432 LPAEDGSGQSLTLACLAF--LPTNSAGLVIVGNMQQRAYNVVFDVDGGRIGFGPNGCS 487
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 127/439 (28%), Positives = 190/439 (43%), Gaps = 92/439 (20%)
Query: 62 LEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRL----HLKNSRRLRKPFPEFLKRTEA 117
+ +V ++GPCS L PS E+IL DQ R H ++ + P+ +R +
Sbjct: 87 MTIVHRHGPCSPLAD-AHGKPPSHEDILAADQNRAESIQHRVSTTATGRGNPKRSRRAPS 145
Query: 118 -------------------FTFPANINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTW 157
+ PA+ + Y+V V +G P +++ DTGSD TW
Sbjct: 146 RRQQPSSAPAPAASLSSSTASLPASSGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTW 205
Query: 158 TQCKPCIH-CFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYA 216
QC+PC+ C++QR+ F ++S T+ I C + +C L C+ C + +QY
Sbjct: 206 VQCQPCVVVCYEQREKLFDPARSSTYANISCAAPACSDLDTR----GCSGGNCLYGVQYG 261
Query: 217 DGSGSGGFWATDRITIQEANS------------NGYFTRYPFLLGCINNSSG------DK 258
DGS S GF+A D +T+ ++ G F LLG + DK
Sbjct: 262 DGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDK 321
Query: 259 SG----------ASGIMGLD---RSPVSIITRTNTSYFSYCLPSPY--GSTGYITFGKTD 303
G +SG LD SP + R T + P+ Y G TG G+
Sbjct: 322 YGGVFAHCLPARSSGTGYLDFGPGSPAAAGARLTTPMLTDNGPTFYYVGMTGIRVGGQLL 381
Query: 304 TV--------------------------NTLRSAFHKRM--KKYKKAKGLEDLLDTCYDL 335
++ ++LRSAF M + YKKA + LLDTCYD
Sbjct: 382 SIPQSVFTTAGTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPAVS-LLDTCYDF 440
Query: 336 SAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRG 395
+ V +P +++ F GG L++D G + ASVSQVCLGFA + +GN Q +
Sbjct: 441 TGMSQVAIPTVSLLFQGGARLDVDASGIMYAASVSQVCLGFAANEDGGDVGIVGNTQLKT 500
Query: 396 HEVHYDVAGRRLGFGPGNC 414
V YD+ + +GF PG C
Sbjct: 501 FGVAYDIGKKVVGFSPGAC 519
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 107/356 (30%), Positives = 159/356 (44%), Gaps = 85/356 (23%)
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
+V +G Q +S+++DTGSD+TW QC+PC C+ Q P F S S ++ I CNST+C
Sbjct: 123 IVTMGLGSQNMSVIVDTGSDLTWVQCEPCRSCYNQNGPLFKPSTSPSYQPILCNSTTC-- 180
Query: 195 LRESFPFGNCN-----SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
+S G C S C + + Y DGS + G +++ G + F+ G
Sbjct: 181 --QSLELGACGSDPSTSATCDYVVNYGDGSYTSGELGIEKLGF------GGISVSNFVFG 232
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS--PYGSTGYITFGKTDT 304
C N+ G GASG+MGL RS +S+I++TN ++ FSYCLPS G++G + G
Sbjct: 233 CGRNNKGLFGGASGLMGLGRSELSMISQTNATFGGVFSYCLPSTDQAGASGSLVMGNQSG 292
Query: 305 V----------------------------------------------------NTLRSAF 312
V T+ S
Sbjct: 293 VFKNVTPIAYTRMLPNLQLSNFYILNLTGIDVGGVSLHVQASSFGNGGVILDSGTVISRL 352
Query: 313 HKRMKKYKKAKGLED-----------LLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVR 361
+ K KAK LE +LDTC++L+ Y+ V +P I+++F G +L +D
Sbjct: 353 APSVYKALKAKFLEQFSGFPSAPGFSILDTCFNLTGYDQVNIPTISMYFEGNAELNVDAT 412
Query: 362 GT--LVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
G LV S+VCL A+ + +GN QQR V YD ++GF C+
Sbjct: 413 GIFYLVKEDASRVCLALASLSDEYEMGIIGNYQQRNQRVLYDAKLSQVGFAKEPCT 468
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 123/442 (27%), Positives = 184/442 (41%), Gaps = 95/442 (21%)
Query: 62 LEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSR------------RLRKPFP 109
+ +V ++GPCS L PS EEIL DQ R R + +P P
Sbjct: 90 MPIVHRHGPCSPLADAHGGKPPSHEEILDADQNRAESIQRRVSTTTTAARGKPKRNRPSP 149
Query: 110 EFLKRTEAFTFPANINDTV---------------ADEYYIVVAIGEPKQYVSLLLDTGSD 154
++ + + PA Y + + +G P +++ DTGSD
Sbjct: 150 S-RRQQPSSSAPAPGASLSSSAASLPASSGRALGTGNYVVTIGLGTPAGRYTVVFDTGSD 208
Query: 155 VTWTQCKPCIH-CFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNI 213
TW QC+PC+ C++Q++ F ++S T I C + +C L C+ C + +
Sbjct: 209 TTWVQCEPCVVVCYEQQEKLFDPARSSTDANISCAAPACSDLYTK----GCSGGHCLYGV 264
Query: 214 QYADGSGSGGFWATDRITIQEANS------------NGYFTRYPFLLGCINNSSG----- 256
QY DGS S GF+A D +T+ ++ G F LLG +
Sbjct: 265 QYGDGSYSIGFFAMDTLTLSSYDAIKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQA 324
Query: 257 -DKSG----------ASGIMGLDRSPVS-----------IITRTNTSYFSYCL------- 287
DK G +SG LD P S ++ +++ L
Sbjct: 325 YDKYGGVFAHCFPARSSGTGYLDFGPGSSPAVSTKLTTPMLVDNGLTFYYVGLTGIRVGG 384
Query: 288 ------PSPYGSTGYITFGKT-------DTVNTLRSAFHKRM--KKYKKAKGLEDLLDTC 332
PS + + G I T ++LRSAF + + YKKA L LLDTC
Sbjct: 385 KLLSIPPSVFTTAGTIVDSGTVITRLPPAAYSSLRSAFASAIAARGYKKAPALS-LLDTC 443
Query: 333 YDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQ 392
YD + V +P +++ F GG L++D G + ASVSQ CLGFA D + +GN Q
Sbjct: 444 YDFTGMSQVAIPTVSLLFQGGASLDVDASGIIYAASVSQACLGFAANEEDDDVGIVGNTQ 503
Query: 393 QRGHEVHYDVAGRRLGFGPGNC 414
+ V YD+ + +GF PG C
Sbjct: 504 LKTFGVVYDIGKKVVGFSPGAC 525
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 150 bits (380), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 106/354 (29%), Positives = 168/354 (47%), Gaps = 82/354 (23%)
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
+V +G +++++DTGSD+TW QC+PC+ C+ Q+ P F S S ++ + CNS++C+
Sbjct: 66 IVTMGLGSTNMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQS 125
Query: 195 LRESFPFGNC-----NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
L+ F GN N C + + Y DGS + G ++++ G + F+ G
Sbjct: 126 LQ--FATGNTGACGSNPSTCNYVVNYGDGSYTNGELGVEQLSF------GGVSVSDFVFG 177
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL------------------- 287
C N+ G G SG+MGL RS +S++++TN ++ FSYCL
Sbjct: 178 CGRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTESGASGSLVMGNESSV 237
Query: 288 -------------PSPYGSTGYI----------------TFGK------TDTVNT----- 307
P+P S YI +FG + TV T
Sbjct: 238 FKNVTPITYTRMLPNPQLSNFYILNLTGIDVDGVALQVPSFGNGGVLIDSGTVITRLPSS 297
Query: 308 ----LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGT 363
L++ F K+ + A G +LDTC++L+ Y+ V +P I++HF G +L++D GT
Sbjct: 298 VYKALKALFLKQFTGFPSAPGFS-ILDTCFNLTGYDEVSIPTISMHFEGNAELKVDATGT 356
Query: 364 LVVAS--VSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
V SQVCL A+ ++ +GN QQR V YD ++GF +CS
Sbjct: 357 FYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKVGFAEESCS 410
>gi|147794033|emb|CAN68918.1| hypothetical protein VITISV_035156 [Vitis vinifera]
Length = 398
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 123/400 (30%), Positives = 182/400 (45%), Gaps = 61/400 (15%)
Query: 33 HIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQD 92
H VSSLLP N C + QG L + KYGPCS + PS +EI +D
Sbjct: 42 HSTPVSSLLPKNKCLASARGGSQG-----LPITQKYGPCSGSGH---SQPPSPQEIXGRD 93
Query: 93 QQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADE---YYIVVAIGEPKQYVSLLL 149
+ R+ NS+ + N+ + DE + + VA G P Q L+L
Sbjct: 94 ESRVSFINSKCNQYTSGNLKNHAH--------NNNLFDEDGNFLVDVAFGTPPQXFXLIL 145
Query: 150 DTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKEC 209
DTGS +TWTQCK C++C Q +F S S T+ C P + E
Sbjct: 146 DTGSSITWTQCKACVNCLQDSXRYFBXSASSTYSXGSC-----------IP----XTVEN 190
Query: 210 PFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD-KSGASGIMGLD 268
+N+ Y D S S G + +T++ ++ F ++ F G N+ GD SGA G++GL
Sbjct: 191 NYNMTYGDDSTSVGNYGCXTMTLEPSD---VFQKFQFGXG--RNNKGDFGSGADGMLGLG 245
Query: 269 RSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDTVNTLRSAFHKRMKKYKKAKGL 325
+ +S +++T + + FSYCLP S G + FG+ T + F + GL
Sbjct: 246 QGQLSTVSQTASKFXKVFSYCLPEE-DSIGSLLFGEKATSQSSSLKFTSLVNG-PGTSGL 303
Query: 326 ED-------LLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFAT 378
+ LLD D V++P+I +HF GG D+ L+ + + S++CL FA
Sbjct: 304 XESGYYFVKLLDISVD------VLLPEIVLHFGGGADVRLNGTNIVWGSDASRLCLAFAG 357
Query: 379 YPP---DPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+P +GN QQ V YD+ G R+GF CS
Sbjct: 358 NSKSTMNPELTIIGNRQQLSLTVLYDIQGGRIGFRSNGCS 397
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 111/375 (29%), Positives = 166/375 (44%), Gaps = 60/375 (16%)
Query: 88 ILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSL 147
+ R + + HL+ +RL +L ++D + EY++ V +G P L
Sbjct: 89 VARDNARVEHLE--KRLVASTSPYLPEDLVSEVVPGVDDG-SGEYFVRVGVGSPPTDQYL 145
Query: 148 LLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSK 207
++D+GSDV W QC+PC C+ Q DP F + S +F + C S CR L + G ++
Sbjct: 146 VVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAG 205
Query: 208 ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGL 267
+C +++ Y DGS + G A + +T+ G +GC + +SG GA+G++GL
Sbjct: 206 KCDYSVTYGDGSYTKGELALETLTLGGTAVQG------VAIGCGHRNSGLFVGAAGLLGL 259
Query: 268 DRSPVSIITRTNTS---YFSYCLPSP--------------YGSTGYITFGK--------- 301
+S++ + + FSYCL S G TG G+
Sbjct: 260 GWGAMSLVGQLGGAAGGVFSYCLASRGAGGAGSLASSFYYVGLTGIGVGGERLPLQDSLF 319
Query: 302 -------------TDTVNT---------LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYE 339
T T T LR AF M ++ + LLDTCYDLS Y
Sbjct: 320 QLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVS-LLDTCYDLSGYA 378
Query: 340 TVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVH 399
+V VP ++ +F G L L R LV + CL FA P LGN+QQ G ++
Sbjct: 379 SVRVPTVSFYFDQGAVLTLPARNLLVEVGGAVFCLAFA--PSSSGISILGNIQQEGIQIT 436
Query: 400 YDVAGRRLGFGPGNC 414
D A +GFGP C
Sbjct: 437 VDSANGYVGFGPNTC 451
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 126/441 (28%), Positives = 192/441 (43%), Gaps = 96/441 (21%)
Query: 62 LEVVSKYGPCSRLNQGISTHA--PSLEEILRQDQQRL----HLKNSRRLRKPFPEFLKRT 115
+ +V ++GPCS L + H PS E+IL DQ R H ++ + P+ +R
Sbjct: 86 MTIVHRHGPCSPL---AAAHGKPPSHEDILAADQNRAESIQHRVSTTATARGNPKRSRRA 142
Query: 116 EA-------------------FTFPANINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDV 155
+ + PA+ + Y+V V +G P +++ DTGSD
Sbjct: 143 PSRRQQPSSAPAPAASLSSSTASLPASSGRALGTGNYVVTVGLGTPASRYTVVFDTGSDT 202
Query: 156 TWTQCKPCIH-CFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQ 214
TW QC+PC+ C++Q++ F ++S T+ + C + +C L C+ C + +Q
Sbjct: 203 TWVQCQPCVVVCYEQQEKLFDPARSSTYANVSCAAPACFDLDTR----GCSGGHCLYGVQ 258
Query: 215 YADGSGSGGFWATDRITIQEANS------------NGYFTRYPFLLGCINNSSG------ 256
Y DGS S GF+A D +T+ ++ G F LLG +
Sbjct: 259 YGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTY 318
Query: 257 DKSG----------ASGIMGLD---RSPVSIITRTNTSYFSYCLPSPY--GSTGYITFGK 301
DK G +SG LD SP + R T + P+ Y G TG G+
Sbjct: 319 DKYGGVFAHCLPARSSGTGYLDFGPGSPAAAGARLTTPMLTDNGPTFYYVGMTGIRVGGQ 378
Query: 302 TDTV--------------------------NTLRSAFHKRM--KKYKKAKGLEDLLDTCY 333
++ ++LRSAF M + YKKA + LLDTCY
Sbjct: 379 LLSIPQSVFATAGTIVDSGTVITRLPPPAYSSLRSAFVSAMAARGYKKAPAVS-LLDTCY 437
Query: 334 DLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQ 393
D + V +P +++ F GG L++D G + ASVSQVCLGFA + +GN Q
Sbjct: 438 DFTGMSQVAIPTVSLLFQGGAILDVDASGIMYAASVSQVCLGFAANEDGGDVGIVGNTQL 497
Query: 394 RGHEVHYDVAGRRLGFGPGNC 414
+ V YD+ + +GF PG C
Sbjct: 498 KTFGVAYDIGKKVVGFSPGAC 518
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 111/387 (28%), Positives = 171/387 (44%), Gaps = 71/387 (18%)
Query: 88 ILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSL 147
+ R + + HL+ +RL +L ++D + EY++ V +G P L
Sbjct: 89 VARDNARVEHLE--KRLVASTSPYLPEDLVSEVVPGVDDG-SGEYFVRVGVGSPPTDQYL 145
Query: 148 LLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSK 207
++D+GSDV W QC+PC C+ Q DP F + S +F + C S CR L + G ++
Sbjct: 146 VVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAG 205
Query: 208 ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGL 267
+C +++ Y DGS + G A + +T+ G +GC + +SG GA+G++GL
Sbjct: 206 KCDYSVTYGDGSYTKGELALETLTLGGTAVQG------VAIGCGHRNSGLFVGAAGLLGL 259
Query: 268 DRSPVSIITRTNTS---YFSYCLPS-PYGSTGYITFGKTDTVNTLR--SAFH-------- 313
+S++ + + FSYCL S G G + G+T+ V R S+F+
Sbjct: 260 GWGAMSLVGQLGGAAGGVFSYCLASRGAGGAGSLVLGRTEAVPRGRRASSFYYVGLTGIG 319
Query: 314 ---KRM------------------------------KKYKKAKGLED------------- 327
+R+ + Y +G D
Sbjct: 320 VGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVS 379
Query: 328 LLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSIT 387
LLDTCYDLS Y +V VP ++ +F G L L R LV + CL FA P
Sbjct: 380 LLDTCYDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVGGAVFCLAFA--PSSSGISI 437
Query: 388 LGNVQQRGHEVHYDVAGRRLGFGPGNC 414
LGN+QQ G ++ D A +GFGP C
Sbjct: 438 LGNIQQEGIQITVDSANGYVGFGPNTC 464
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 110/360 (30%), Positives = 154/360 (42%), Gaps = 73/360 (20%)
Query: 118 FTFPANINDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFY 175
+ PA I + Y I V G P + +++ DTGSDV W QCKPC + C+ Q++P F
Sbjct: 1 ISIPARIGLFIGSGNYVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFD 60
Query: 176 ASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA 235
S S T+ + C +C L C+S C + + Y DGS + GF A D + A
Sbjct: 61 PSLSSTYRNVSCTEPACVGLSTR----GCSSSTCLYGVFYGDGSSTIGFLAMDTFMLTPA 116
Query: 236 NSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPV----SIITRTNTSYFSYCLPSPY 291
F F+ GC N++G G +G++GL RS S + + + FSYCLPS
Sbjct: 117 QK---FKN--FIFGCGQNNTGLFQGTAGLVGLGRSSTYSLNSQVAPSLGNVFSYCLPSTS 171
Query: 292 GSTGYITFGK-----------TDT------------------------------------ 304
+TGY+ G TDT
Sbjct: 172 SATGYLNIGNPQNTPGYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQSVGTIIDS 231
Query: 305 ---VNTLRSAFHKRMKKYKKAKGLE-------DLLDTCYDLSAYETVVVPKIAIHFLGGV 354
+ L + +K +A + +LDTCYD S +VV P I +HF G+
Sbjct: 232 GTVITRLPPTAYSALKTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIVLHF-AGL 290
Query: 355 DLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
D+ + G V + SQVCL FA +GNVQQ EV YD +R+GF G C
Sbjct: 291 DVRIPATGVFFVFNSSQVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRIGFSAGAC 350
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 126/456 (27%), Positives = 193/456 (42%), Gaps = 92/456 (20%)
Query: 35 VSVSSLLPPNVCNRTRTALP--QGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQD 92
VS +S +P + C+ P + A L + ++GPC+ ++ S APS+ + LR D
Sbjct: 39 VSAASFVPSSTCSSPDRVPPHRRNGTSAVLRLTHRHGPCAP-SRASSLAAPSVADTLRAD 97
Query: 93 QQRLHLKNSRRLRKPFPEFLKRTEAFTFPANIN----DTVADEYYIVVAIGEPKQYVSLL 148
Q+R RR+ P+ A D Y + ++G P ++
Sbjct: 98 QRRAEYIL-RRVSGRAPQLWDSKAAAAVATVPASWGYDIGTLNYVVTASLGTPGVAQTME 156
Query: 149 LDTGSDVTWTQCKPCI---HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCN 205
+DTGSD++W QCKPC C+ Q+DP F ++S ++ +PC C L + C+
Sbjct: 157 VDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGI-YAASACS 215
Query: 206 SKECPFNIQYADGSGSGGFWATDRITIQEANS-NGYFTRYPFLLGCINNSSGDKSGASGI 264
+ +C + + Y DGS + G +++D +T+ +++ G+F GC + SG +G G+
Sbjct: 216 AAQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFF------FGCGHAQSGLFNGVDGL 269
Query: 265 MGLDRSPVSIITRTNTSY---FSYC------------------------------LPSPY 291
+GL R S++ +T +Y FSYC LPSP
Sbjct: 270 LGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPN 329
Query: 292 GSTGYITFGKTDTVN--------------------------------TLRSAFHKRMKKY 319
T Y+ +V LRSAF M Y
Sbjct: 330 APTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPTAYAALRSAFRSGMASY 389
Query: 320 KKAKGLED-LLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFAT 378
+ +LDTCY+ + Y TV +P +A+ F G + L G L S CL FA
Sbjct: 390 GYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL-----SFGCLAFAP 444
Query: 379 YPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
D LGNVQQR EV D G +GF P +C
Sbjct: 445 SGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 478
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 126/443 (28%), Positives = 202/443 (45%), Gaps = 73/443 (16%)
Query: 34 IVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAP-SLEEILRQD 92
++SV SL C+ + P ++ + ++GPCS + S P SLEE L++D
Sbjct: 35 VLSVGSLKSAATCSEPKATPPSTSGGITVPLHHRHGPCSPVP---SNKMPASLEERLQRD 91
Query: 93 QQRL-HLKNSRRLRKPFPEFLKRTEAFTFPANINDTVAD-EYYIVVAIGEPKQYVSLLLD 150
Q R ++K R+ +++++A T P + +++ EY I V IG P ++ +D
Sbjct: 92 QLRAAYIK--RKFSGAKGGDVEQSDAATVPTTLGTSLSTLEYVITVGIGSPAVTQTMSMD 149
Query: 151 TGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECP 210
TGSDV+W QCKPC C + D F S S T+ C+S +C L +S C+S +C
Sbjct: 150 TGSDVSWVQCKPCSQCHSEVDSLFDPSASSTYSPFSCSSAACVQLSQSQQGNGCSSSQCQ 209
Query: 211 FNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGAS-GIMGLDR 269
+ + Y DGS + G +++D +T+ G F GC + SG S + G+MGL
Sbjct: 210 YIVSYVDGSSTTGTYSSDTLTLGSNAIKG------FQFGCSQSESGGFSDQTDGLMGLGG 263
Query: 270 SPVSIITRTNTSY---FSYCLPSPYGSTGYITFG---KTDTVNT--LRS----AFHKRMK 317
S++++T ++ FSYCLP GS+G++T G ++ V T LRS ++ +
Sbjct: 264 DAQSLVSQTAGTFGKAFSYCLPPTPGSSGFLTLGAASRSGFVKTPMLRSTQIPTYYGVLL 323
Query: 318 KYKKAKGLE-----------DLLDTCYDLSAYETVVVPKIAIHFLGGV------------ 354
+ + G + ++D+ ++ ++ F G+
Sbjct: 324 EAIRVGGQQLNIPTSVFSAGSVMDSGTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGIL 383
Query: 355 DLELDVRG--TLVVASVSQVCLGFATYPPDPNSITL---------------------GNV 391
D D G ++ + SV+ V G A D N I L GNV
Sbjct: 384 DTCFDFSGQSSVSIPSVALVFSGGAVVNLDFNGIMLELDNWCLAFAANSDDSSLGFIGNV 443
Query: 392 QQRGHEVHYDVAGRRLGFGPGNC 414
QQR EV YDV G +GF G C
Sbjct: 444 QQRTFEVLYDVGGGAVGFRAGAC 466
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 121/440 (27%), Positives = 180/440 (40%), Gaps = 100/440 (22%)
Query: 62 LEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRL----HLKNSRRLRKPFPEFLKRTE- 116
+ +V ++GPCS L PS EIL DQ R H ++ + P+ + +
Sbjct: 93 MTIVHRHGPCSPLAA-AHGEPPSHGEILAADQSRAESIQHRVSTTTTGRVNPKRRRHRQQ 151
Query: 117 ------------------AFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWT 158
P T Y + V +G P +++ DTGSD TW
Sbjct: 152 QPPSAPAPAASLSSSTASLPASPGRALGT--GNYVVTVGLGTPASRYTVVFDTGSDTTWV 209
Query: 159 QCKPCI-HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYAD 217
QC+PC+ C++QR+ F + S T+ + C + +C L S C+ C + +QY D
Sbjct: 210 QCQPCVVACYEQREKLFDPASSSTYANVSCAAPACSDLDVS----GCSGGHCLYGVQYGD 265
Query: 218 GSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITR 277
GS S GF+A D +T+ Y F GC + G A+G++GL R S+ +
Sbjct: 266 GSYSIGFFAMDTLTLSS-----YDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQ 320
Query: 278 -----------------TNTSYFSYCLPSP------------------YGSTGYITFGK- 301
T T Y + SP G TG G+
Sbjct: 321 TYGKYGGVFAHCLPARSTGTGYLDFGAGSPPATTTTPMLTGNGPTFYYVGMTGIRVGGRL 380
Query: 302 ----------------TDTV---------NTLRSAFHKRM--KKYKKAKGLEDLLDTCYD 334
+ TV ++LRSAF M + Y+KA + LLDTCYD
Sbjct: 381 LPIAPSVFAAAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVS-LLDTCYD 439
Query: 335 LSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQR 394
+ V +P +++ F GG L++D G + S SQVCL FA + +GN Q +
Sbjct: 440 FTGMSQVAIPTVSLLFQGGAALDVDASGIMYTVSASQVCLAFAGNEDGGDVGIVGNTQLK 499
Query: 395 GHEVHYDVAGRRLGFGPGNC 414
V YD+ + +GF PG C
Sbjct: 500 TFGVAYDIGKKVVGFSPGAC 519
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 93/248 (37%), Positives = 146/248 (58%), Gaps = 17/248 (6%)
Query: 61 SLEVVSKYGPC-SRLNQGISTHAPSLEEILRQDQQRLHLKNSR-RLRKPFPEFLKRTEAF 118
SLEVV ++GPC +NQ APS EI +DQ R+ ++R R FPE +A
Sbjct: 1 SLEVVHRHGPCIGIVNQEKGADAPSNMEIFLRDQNRVDSIHARLSSRGMFPE----KQAT 56
Query: 119 TFPANINDTV-ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIH-CFQQRDPFFYA 176
T P ++ A +Y + V +G PK+ +L+ DTGSD+TWTQC+PC+ C++Q++P
Sbjct: 57 TLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNP 116
Query: 177 SKSKTFFKIPCNSTSCRILRESFPFG-NCNSKECPFNIQYADGSGSGGFWATDRITIQEA 235
S S ++ I C+S C+++ F +C+S C + +QY DGS S GF+AT+ +T+ +
Sbjct: 117 STSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSS 176
Query: 236 NSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYG 292
N FL GC ++G GA+G++GL R+ +++ ++T +Y FSYCLP+
Sbjct: 177 N-----VFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPASSS 231
Query: 293 STGYITFG 300
S GY++ G
Sbjct: 232 SKGYLSLG 239
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 50/109 (45%), Positives = 66/109 (60%), Gaps = 2/109 (1%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-V 366
L SAF M Y G + DTCYD S Y+TV +PK+ + F GGV++++DV G L V
Sbjct: 305 LSSAFQNLMTDYPSTSGYS-IFDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPV 363
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ +VCL FA D ++ GNVQQR ++V YD A R+GF PG CS
Sbjct: 364 NGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 412
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 123/440 (27%), Positives = 182/440 (41%), Gaps = 100/440 (22%)
Query: 62 LEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQR---------------LHLKNSRRLRK 106
+ +V ++GPCS L PS EIL DQ R ++ K SR ++
Sbjct: 89 MTIVHRHGPCSPL-AAAHGEPPSHGEILAADQSRAESIQHRVSTTTTDRVNPKRSRHRQQ 147
Query: 107 PFPEFLKRTEAFTF--------PANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWT 158
P + + P T Y + V +G P +++ DTGSD TW
Sbjct: 148 QPPSAPAPAASLSSSTASLPASPGRALGT--GNYVVTVGLGTPASRYTVVFDTGSDTTWV 205
Query: 159 QCKPCI-HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYAD 217
QC+PC+ C++QR+ F + S T+ + C + +C L S C+ C + +QY D
Sbjct: 206 QCQPCVVACYEQREKLFDPASSSTYANVSCAAPACSDLDVS----GCSGGHCLYGVQYGD 261
Query: 218 GSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITR 277
GS S GF+A D +T+ Y F GC + G A+G++GL R S+ +
Sbjct: 262 GSYSIGFFAMDTLTLSS-----YDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQ 316
Query: 278 -----------------TNTSYFSYCLPSP------------------YGSTGYITFGK- 301
T T Y + SP G TG G+
Sbjct: 317 TYGKYGGVFAHCLPARSTGTGYLDFGAGSPPATTTTPMLTGNGPTFYYVGMTGIRVGGRL 376
Query: 302 ----------------TDTV---------NTLRSAFHKRM--KKYKKAKGLEDLLDTCYD 334
+ TV ++LRSAF M + Y+KA + LLDTCYD
Sbjct: 377 LPIAPSVFAAAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVS-LLDTCYD 435
Query: 335 LSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQR 394
+ V +P +++ F GG L++D G + S SQVCL FA + +GN Q +
Sbjct: 436 FTGMSQVAIPTVSLLFQGGAALDVDASGIMYTVSASQVCLAFAGNEDGGDVGIVGNTQLK 495
Query: 395 GHEVHYDVAGRRLGFGPGNC 414
V YD+ + +GF PG C
Sbjct: 496 TFGVAYDIGKKVVGFSPGAC 515
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 123/440 (27%), Positives = 182/440 (41%), Gaps = 100/440 (22%)
Query: 62 LEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQR---------------LHLKNSRRLRK 106
+ +V ++GPCS L PS EIL DQ R ++ K SR ++
Sbjct: 90 MTIVHRHGPCSPL-AAAHGEPPSHGEILAADQSRAESIQHRVSTTTTGRVNPKRSRHRQQ 148
Query: 107 PFPEFLKRTEAFTF--------PANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWT 158
P + + P T Y + V +G P +++ DTGSD TW
Sbjct: 149 QPPSAPAPAASLSSSTASLPASPGRALGT--GNYVVTVGLGTPASRYTVVFDTGSDTTWV 206
Query: 159 QCKPCI-HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYAD 217
QC+PC+ C++QR+ F + S T+ + C + +C L S C+ C + +QY D
Sbjct: 207 QCQPCVVACYEQREKLFDPASSSTYANVSCAAPACSDLDVS----GCSGGHCLYGVQYGD 262
Query: 218 GSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITR 277
GS S GF+A D +T+ Y F GC + G A+G++GL R S+ +
Sbjct: 263 GSYSIGFFAMDTLTLSS-----YDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQ 317
Query: 278 -----------------TNTSYFSYCLPSP------------------YGSTGYITFGK- 301
T T Y + SP G TG G+
Sbjct: 318 TYGKYGGVFAHCLPPRSTGTGYLDFGAGSPPATTTTPMLTGNGPTFYYVGMTGIRVGGRL 377
Query: 302 ----------------TDTV---------NTLRSAFHKRM--KKYKKAKGLEDLLDTCYD 334
+ TV ++LRSAF M + Y+KA + LLDTCYD
Sbjct: 378 LPIAPSVFAAAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVS-LLDTCYD 436
Query: 335 LSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQR 394
+ V +P +++ F GG L++D G + S SQVCL FA + +GN Q +
Sbjct: 437 FTGMSQVAIPTVSLLFQGGAALDVDASGIMYTVSASQVCLAFAGNEDGGDVGIVGNTQLK 496
Query: 395 GHEVHYDVAGRRLGFGPGNC 414
V YD+ + +GF PG C
Sbjct: 497 TFGVAYDIGKKVVGFSPGAC 516
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 129/434 (29%), Positives = 185/434 (42%), Gaps = 94/434 (21%)
Query: 64 VVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPAN 123
V+ ++GPCS L APS ++L DQ R+ + R E + + PA
Sbjct: 22 VMHRHGPCSPLQ--TPDDAPSDADLLEHDQARVDSIH----RMIANETAVVGQDVSLPAE 75
Query: 124 INDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIH--CFQQRDPFFYASKSK 180
+V Y+V V +G P + ++++ DTGSD++W QC PC C+ Q+DP F S S
Sbjct: 76 RGISVGTGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSSS 135
Query: 181 TFFKIPCNSTSCRILRESFPFGNCNSK----ECPFNIQYADGSGSGGFWATDRITI---- 232
TF + C C R+S C+S CP+ + Y D S + G D +T+
Sbjct: 136 TFSAVRCGEPECPRARQS-----CSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTP 190
Query: 233 ----QEANSN--------------GYFTRYPFLLG----------------------CIN 252
E NSN G F + L G C+
Sbjct: 191 STNASENNSNKLPGFVFGCGENNTGLFGKADGLFGLGRGKVSLSSQAAGKYGEGFSYCLP 250
Query: 253 NSSGDKSGASGIMGLDRSPVS-----IITRTNTSYFSYC----------------LPS-- 289
+SS + G + +P ++ R+NT F Y P+
Sbjct: 251 SSSSNAHGYLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALW 310
Query: 290 PYG----STGYITFGKTDTVNTLRSAFHKRMKKY--KKAKGLEDLLDTCYDLSAYE--TV 341
P G S IT + LR+AF M KY K+A L +LDTCYD +A+ TV
Sbjct: 311 PAGLIVDSGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLS-ILDTCYDFTAHANATV 369
Query: 342 VVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYD 401
+P +A+ F GG + +D G L VA V+Q CL FA ++ LGN QQR V YD
Sbjct: 370 SIPAVALVFAGGATISVDFSGVLYVAKVAQACLAFAPNGNGRSAGILGNTQQRTVAVVYD 429
Query: 402 VAGRRLGFGPGNCS 415
V +++GF CS
Sbjct: 430 VGRQKIGFAAKGCS 443
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 111/355 (31%), Positives = 156/355 (43%), Gaps = 64/355 (18%)
Query: 118 FTFPANINDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCI-HCFQQRDPFFY 175
+ PA I + Y I V G PK+ +++ DTGS+V W QCKPC+ C+ Q++P F
Sbjct: 1 ISIPARIGLYIGTANYVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFD 60
Query: 176 ASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA 235
+ S T+ I C S +C L C+ C + + Y DGS + GF AT+ T+
Sbjct: 61 PTLSSTYRNISCTSAACTGLSSR----GCSGSTCVYGVTYGDGSSTVGFLATETFTLAAG 116
Query: 236 N------------SNGYFTRYPFLLGC----------INNSSGD--------KSGASGIM 265
N + G FT L+G + S G+ S A+G +
Sbjct: 117 NVFNNFIFGCGQNNQGLFTGAAGLIGLGRSPYSLNSQLATSLGNIFSYCLPSTSSATGYL 176
Query: 266 GLD---RSP--VSIIT--RTNTSYFSYCLPSPYGST----GYITFGKTDTV--------- 305
+ R+P +++T R T YF + G T F T+
Sbjct: 177 NIGNPLRTPGYTAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQSVGTIIDSGTVITR 236
Query: 306 ------NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELD 359
LR+AF M +Y +A +LDTCYD S TV P I +H+ G +D+ +
Sbjct: 237 LPPTAYGALRTAFRAAMTQYTRAAA-ASILDTCYDFSRTTTVTFPTIKLHYTG-LDVTIP 294
Query: 360 VRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
G V S SQVCL FA +GNVQQR EV YD A +R+GF G C
Sbjct: 295 GAGVFYVISSSQVCLAFAGNSDSTQIGIIGNVQQRTMEVTYDNALKRIGFAAGAC 349
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 110/354 (31%), Positives = 160/354 (45%), Gaps = 82/354 (23%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCI-HCFQQRDPFFYASKSKTFFKIPCNS 189
E+ +VV G P Q +++LDTGSD++W QCKPC HC++Q DP F +KS ++ +PC +
Sbjct: 136 EFVVVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDPDFDPAKSSSYAAVPCGT 195
Query: 190 TSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
C G CN C + +QY DGS + G + D +T NS+ FT + F G
Sbjct: 196 PVCAAAG-----GMCNGTTCLYGVQYGDGSSTTGVLSRDTLTF---NSSSKFTGFTF--G 245
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGST------------ 294
C + GD G++GL R +S+ ++ S+ FSYCLPS Y +T
Sbjct: 246 CGEKNIGDFGEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPS-YNTTPGYLNIGATKPT 304
Query: 295 ------------------------------GYI------TFGKTDTV------------- 305
GYI F KT T+
Sbjct: 305 STVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVFTKTGTLLDSGTILTYLPPP 364
Query: 306 --NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGT 363
+LR F M+ K A E L DTCYD + +V+P ++ +F G +LD G
Sbjct: 365 AYTSLRDRFKFTMQGNKPAPPYEPL-DTCYDFTGQGAIVIPAVSFNFSDGAVFDLDFYGI 423
Query: 364 LVVASVSQV---CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
++ ++ CL F + P +GN QQR EV YDV +++GF P +C
Sbjct: 424 MIFPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEVIYDVPSQKIGFIPISC 477
>gi|125602787|gb|EAZ42112.1| hypothetical protein OsJ_26672 [Oryza sativa Japonica Group]
Length = 477
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 91/306 (29%), Positives = 148/306 (48%), Gaps = 41/306 (13%)
Query: 145 VSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI-LRESFPF-G 202
+++++DTGSD+TW QCKPC C+ QRDP F S S ++ +PCN+++C L+ + G
Sbjct: 176 LTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPG 235
Query: 203 NC----------NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCIN 252
+C S+ C +++ Y DGS S G ATD + + A+ +G F+ GC
Sbjct: 236 SCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGASVDG------FVFGCGL 289
Query: 253 NSSGDKSGASGIMGLDRS------------PVSIITRTNTSYFSYCLPSPYGSTGYITFG 300
++ G G +G+MGL P + T S + + +
Sbjct: 290 SNRGLFGGTAGLMGLGPDGALAGLPDGAPPPFYFMNVTGASVGGAAVAAAGLGAANVLLD 349
Query: 301 KTDTVNTLRSAFHKRMK-KYKKAKGLE--------DLLDTCYDLSAYETVVVPKIAIHFL 351
+ L + ++ ++ ++ + G E LLD CY+L+ ++ V VP + +
Sbjct: 350 SGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLE 409
Query: 352 GGVDLELDVRGTLVVASV--SQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGF 409
GG D+ +D G L +A SQVCL A+ + + +GN QQ+ V YD G RLGF
Sbjct: 410 GGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGF 469
Query: 410 GPGNCS 415
+CS
Sbjct: 470 ADEDCS 475
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 109/359 (30%), Positives = 146/359 (40%), Gaps = 76/359 (21%)
Query: 126 DTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKI 185
D + EY++ V IG P L++D+GSDV W QCKPC+ C+ Q DP F + S TF +
Sbjct: 119 DEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPASSATFSAV 178
Query: 186 PCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA---------- 235
C S CR LR S G +S C + + Y DGS + G A + +T+
Sbjct: 179 SCGSAICRTLRTS---GCGDSGGCEYEVSYGDGSYTKGTLALETLTLGGTAVEGVAIGCG 235
Query: 236 -NSNGYFTRYPFLLG----------------------CINNSSGDKSGASGIMG---LDR 269
+ G F LLG C+ + G SGA+ G L R
Sbjct: 236 HRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGGSGSGAADAAGSLVLGR 295
Query: 270 SP--------VSIITRTNTSYFSYCLPSPYG-------------------------STG- 295
S V ++ F Y S G TG
Sbjct: 296 SEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLTEDGGGGVVMDTGT 355
Query: 296 YITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVD 355
+T + LR AF + +A G+ LLDTCYDLS Y +V VP ++ +F G
Sbjct: 356 AVTRLPQEAYAALRDAFVGAVGALPRAPGVS-LLDTCYDLSGYTSVRVPTVSFYFDGAAT 414
Query: 356 LELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L L R L+ CL FA P LGN+QQ G ++ D A +GFGP C
Sbjct: 415 LTLPARNLLLEVDGGIYCLAFA--PSSSGLSILGNIQQEGIQITVDSANGYIGFGPATC 471
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 113/405 (27%), Positives = 173/405 (42%), Gaps = 87/405 (21%)
Query: 87 EILRQDQQRLHLKNSRRLRKPF---------PEFLKRTEAFTFPANINDTVAD-EYYIVV 136
+IL +D++ + +SR +K L + P N ++ YY+ +
Sbjct: 65 DILSRDEEHVKFLSSRLRKKDVQGASFSRHKSGHLLEPNSANIPLNPGLSIGSGNYYLKL 124
Query: 137 AIGEPKQYVSLLLDTGSDVTWTQCKPCI-HCFQQRDPFFYASKSKTFFKIPCNSTSCRIL 195
+G P +Y +++LDTGS ++W QCKPC+ +C Q DP F S S T+ + C+S+ C +L
Sbjct: 125 GLGSPPKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTYRPLYCSSSECSLL 184
Query: 196 RESF---PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCIN 252
+ + P S C + Y D S S G+ + D +T+ + + FT GC
Sbjct: 185 KAATLNDPLCTA-SGVCVYTASYGDASYSMGYLSRDLLTLTPSQTLPSFT-----YGCGQ 238
Query: 253 NSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL-----------------PSPYG 292
++ G A+GI+GL R +S++ + + Y FSYCL PS Y
Sbjct: 239 DNEGLFGKAAGIVGLARDKLSMLAQLSPKYGYAFSYCLPTSTSSGGGFLSIGKISPSSYK 298
Query: 293 STGYI---------------------------------TFGKTDTVNT---------LRS 310
T I T + TV T LR
Sbjct: 299 FTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVPTIIDSGTVVTRLPISIYAALRE 358
Query: 311 AFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVS 370
AF K M + + +LDTC+ S P+I + F GG DL L L+ A
Sbjct: 359 AFVKIMSRRYEQAPAYSILDTCFKGSLKSMSGAPEIRMIFQGGADLSLRAPNILIEADKG 418
Query: 371 QVCLGFATYPPDPNSIT-LGNVQQRGHEVHYDVAGRRLGFGPGNC 414
CL FA+ N I +GN QQ+ + + YDV+ ++GF PG C
Sbjct: 419 IACLAFAS----SNQIAIIGNHQQQTYNIAYDVSASKIGFAPGGC 459
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 96/295 (32%), Positives = 145/295 (49%), Gaps = 31/295 (10%)
Query: 33 HIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCS------RLNQGISTHAPSLE 86
H+ SVSSLLP + C TA + ++L VV ++GPCS R G THA
Sbjct: 46 HVFSVSSLLPSSAC----TASKAASNSSALGVVHRHGPCSPVQARPRGGGGAVTHA---- 97
Query: 87 EILRQDQQR---LHLKNSRRLRKPFPEFLKRT--EAFTFPANINDTVADEYYIV-VAIGE 140
EIL +DQ R +H K + P R + + PA ++ Y+V V +G
Sbjct: 98 EILERDQARVDSIHRKVAGAGGAPSVVDPARASEQGVSLPAQRGISLGTGNYVVSVGLGT 157
Query: 141 PKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFP 200
P + +++ DTGSD++W QCKPC C++Q+DP F S S T+ + C + C+ L S
Sbjct: 158 PAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPECQELDAS-- 215
Query: 201 FGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSG 260
G + C + +QY D S + G D +T+ ++ T F+ GC + ++G
Sbjct: 216 -GCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD-----TLPGFVFGCGDQNAGLFGQ 269
Query: 261 ASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDTVNTLRSAF 312
G+ GL R VS+ ++ SY F+YCLPS GY++ G N +A
Sbjct: 270 VDGLFGLGREKVSLPSQGAPSYGPGFTYCLPSSSSGRGYLSLGGAPPANAQFTAL 324
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 65/108 (60%), Gaps = 1/108 (0%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
LR+AF + M +YKKA L +LDTCYD + + T +P + + F GG + LD G L V+
Sbjct: 379 LRAAFARSMAQYKKAPALS-ILDTCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVLYVS 437
Query: 368 SVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
VSQ CL FA D + LGN QQ+ V YDVA +R+GFG CS
Sbjct: 438 KVSQACLAFAPNADDSSIAILGNTQQKTFAVAYDVANQRIGFGAKGCS 485
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 138 bits (347), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 96/295 (32%), Positives = 145/295 (49%), Gaps = 31/295 (10%)
Query: 33 HIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCS------RLNQGISTHAPSLE 86
H+ SVSSLLP + C TA + ++L VV ++GPCS R G THA
Sbjct: 46 HVFSVSSLLPSSAC----TASKAASNSSALGVVHRHGPCSPVQARRRGGGGAVTHA---- 97
Query: 87 EILRQDQQR---LHLKNSRRLRKPFPEFLKRT--EAFTFPANINDTVADEYYIV-VAIGE 140
EIL +DQ R +H K + P R + + PA ++ Y+V V +G
Sbjct: 98 EILERDQARVDSIHRKVAGAGGAPSVVDPARASEQGVSLPAQRGISLGTGNYVVSVGLGT 157
Query: 141 PKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFP 200
P + +++ DTGSD++W QCKPC C++Q+DP F S S T+ + C + C+ L S
Sbjct: 158 PAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPECQELDAS-- 215
Query: 201 FGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSG 260
G + C + +QY D S + G D +T+ ++ T F+ GC + ++G
Sbjct: 216 -GCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD-----TLPGFVFGCGDQNAGLFGQ 269
Query: 261 ASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDTVNTLRSAF 312
G+ GL R VS+ ++ SY F+YCLPS GY++ G N +A
Sbjct: 270 VDGLFGLGREKVSLPSQGAPSYGPGFTYCLPSSSSGRGYLSLGGAPPANAQFTAL 324
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 50/108 (46%), Positives = 65/108 (60%), Gaps = 1/108 (0%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
LR+AF + M +YKKA L +LDTCYD + + T +P + + F GG + LD G L V+
Sbjct: 379 LRAAFARSMAQYKKAPALS-ILDTCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVLYVS 437
Query: 368 SVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
VSQ CL FA D + LGN QQ+ V YDVA +R+GFG CS
Sbjct: 438 KVSQACLAFAPNADDSSIAILGNTQQKTFAVTYDVANQRIGFGAKGCS 485
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 126/455 (27%), Positives = 203/455 (44%), Gaps = 80/455 (17%)
Query: 24 ADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAP 83
A D ++S+ SL +VC+ ++ A+ ++ + ++GPCS L + P
Sbjct: 22 AHAGDHGSYKVLSIGSLRTKSVCSESK-AVRSSSGATTVPLHHRHGPCSPLP---TKKMP 77
Query: 84 SLEEILRQDQQRLHLKNSRRLRKPFPEFLKR---------TEAFTFPANINDTVAD-EYY 133
SLE+ L +DQ R + +++ F +K+ T P + ++ EY
Sbjct: 78 SLEDRLHRDQLR-----AAYIKRKFSGDVKKDGQGAGGVEQSHVTVPTTLGTSLNTLEYL 132
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCR 193
I V +G P + ++L+D+GSDV+W QCKPC+ C Q DP F S S T+ C+S +C
Sbjct: 133 ITVRLGSPAKTQTVLIDSGSDVSWVQCKPCLQCHSQVDPLFDPSLSSTYSPFSCSSAACA 192
Query: 194 ILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINN 253
L + G +S +C + ++YADGS + G +++D + + G T F GC +
Sbjct: 193 QLGQDGN-GCSSSSQCQYIVRYADGSSTTGTYSSDTLAL------GSNTISNFQFGCSHV 245
Query: 254 SSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDT--VNT- 307
SG G+MGL S+ ++T ++ FSYCLP S+G++T G + V T
Sbjct: 246 ESGFNDLTDGLMGLGGGAPSLASQTAGTFGTAFSYCLPPTPSSSGFLTLGAGTSGFVKTP 305
Query: 308 -LRSA----FHKRMKKYKKAKGLE-----------DLLDTCYDLSAYETVVVPKIAIHFL 351
LRS+ F+ + + G + ++D+ ++ ++ F
Sbjct: 306 MLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVFSAGMVMDSGTIITRLPRTAYSALSSAFK 365
Query: 352 GG------------VDLELDVRG--TLVVASVSQVCLGFATYPPDPNSITL--------- 388
G +D D G ++ + SV+ V G A D N I L
Sbjct: 366 AGMKQYRPAPPRSIMDTCFDFSGQSSVRLPSVALVFSGGAVVNLDANGIILGNCLAFAAN 425
Query: 389 ---------GNVQQRGHEVHYDVAGRRLGFGPGNC 414
GNVQQR EV YDV G +GF G C
Sbjct: 426 SDDSSPGIVGNVQQRTFEVLYDVGGGAVGFKAGAC 460
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 137 bits (346), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 112/409 (27%), Positives = 174/409 (42%), Gaps = 97/409 (23%)
Query: 90 RQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADE-------------YYIVV 136
++ Q+RL + N +LR R + NI+D+V + Y + V
Sbjct: 16 KKLQKRLIMDN-FQLR----SLQSRIKNIILSGNIDDSVDTQIPLTSGIRLQSLNYIVTV 70
Query: 137 AIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILR 196
+G K +++++DTGSD++W QC+PC C+ Q+DP F SKS ++ + CNS +CR L+
Sbjct: 71 ELGGRK--MTVIVDTGSDLSWVQCQPCNRCYNQQDPVFNPSKSPSYRTVLCNSLTCRSLQ 128
Query: 197 -ESFPFGNCNSK--ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINN 253
+ G C S C + + Y DGS + G + + + N F+ GC
Sbjct: 129 LATGNSGVCGSNPPTCNYVVNYGDGSYTSGEVGMEHLNLGNTTVNN------FIFGCGRK 182
Query: 254 SSGDKSGASGIMGLDR---------SPV------------------SIITRTNTSYFSYC 286
+ G GASG++GL R SP+ S++ N+S +
Sbjct: 183 NQGLFGGASGLVGLGRTDLSLISQISPMFGGVFSYCLPTTEAEASGSLVMGGNSSVYKNT 242
Query: 287 LPSPYGS--------------TGYI---------TFGKTDTV---------------NTL 308
P Y TG +FGK + L
Sbjct: 243 TPISYTRMIHNPLLPFYFLNLTGITVGGVEVQAPSFGKDRMIIDSGTVISRLPPSIYQAL 302
Query: 309 RSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL--VV 366
++ F K+ Y A +LD+C++LS Y+ V +P I ++F G +L +DV G V
Sbjct: 303 KAEFVKQFSGYPSAPSFM-ILDSCFNLSGYQEVKIPDIKMYFEGSAELNVDVTGVFYSVK 361
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
SQVCL A+ P + +GN QQ+ + YD G LGF CS
Sbjct: 362 TDASQVCLAIASLPYEDEVGIIGNYQQKNQRIIYDTKGSMLGFAEEACS 410
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 121/436 (27%), Positives = 189/436 (43%), Gaps = 86/436 (19%)
Query: 57 PDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLH--LKNSRRLRKPFPEFLKR 114
P++AS+ +V ++GPC+ S PSL E LR+D+ R + + + R
Sbjct: 14 PNRASVPLVHRHGPCAP--SAASGGKPSLAERLRRDRARTNYIVTKATGGRTAATALSDA 71
Query: 115 TEAFT-FPANINDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCI--HCFQQR 170
T P + D+V EY + + IG P ++L+DTGSD++W QCKPC C+ Q+
Sbjct: 72 AGGGTSIPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQK 131
Query: 171 DPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCN------SKECPFNIQYADGSGSGGF 224
DP F S S ++ +PC+S +CR L C + C + I+Y + + + G
Sbjct: 132 DPLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGV 191
Query: 225 WATDRITIQEA------------NSNGYFTRYPFLLG----------------------C 250
++T+ +T++ + +G + ++ LLG C
Sbjct: 192 YSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYC 251
Query: 251 INNSSG-----------DKSGASGIMGLDRSPVS---------IITRTNTSYFSYCL--- 287
+ +SG + S ++ GL +P+ I+T T S L
Sbjct: 252 LPPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIP 311
Query: 288 PSPYGSTGYITFGKTDT------VNTLRSAFHKRMKKYK---KAKGLEDLLDTCYDLSAY 338
PS + S I G T LRSAF M +Y+ + G +LDTCYD + +
Sbjct: 312 PSAFSSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNG--GVLDTCYDFTGH 369
Query: 339 ETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEV 398
V VP I++ F GG ++L ++V CL FA D +GNV QR EV
Sbjct: 370 ANVTVPTISLTFSGGATIDLAAPAGVLV----DGCLAFAGAGTDNAIGIIGNVNQRTFEV 425
Query: 399 HYDVAGRRLGFGPGNC 414
YD +GF G C
Sbjct: 426 LYDSGKGTVGFRAGAC 441
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 126/463 (27%), Positives = 198/463 (42%), Gaps = 86/463 (18%)
Query: 28 DLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEE 87
+L++ +V SS P C+ + P++AS+ +V ++GPC+ S PSL E
Sbjct: 13 NLNNFAVVPASSFEPEAACSTSSAN--SDPNRASVPLVHRHGPCAP--SAASGGKPSLAE 68
Query: 88 ILRQDQQRLHLKNSRRLRKPFPEFLKRTE----AFTFPANINDTVAD-EYYIVVAIGEPK 142
LR+D+ R + ++ + P + D+V EY + + IG P
Sbjct: 69 RLRRDRARANYIVTKAAGGRTAATAVSDAVGGGGTSIPTFLGDSVDSLEYVVTLGIGTPA 128
Query: 143 QYVSLLLDTGSDVTWTQCKPCI--HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFP 200
+L+DTGSD++W QCKPC C+ Q+DP F S S ++ +PC+S +CR L
Sbjct: 129 VQQIVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDSDACRKLAAGAY 188
Query: 201 FGNCNSKE---CPFNIQYADGSGSGGFWATDRITIQEA------------NSNGYFTRYP 245
C S C + I+Y + + + G ++T+ +T++ + +G + ++
Sbjct: 189 GHGCTSGAAALCEYGIEYGNRATTTGVYSTETLTLKPGVVVADFGFGCGDHQHGPYEKFD 248
Query: 246 FLLG----------------------CINNSSGDK--------------SGASGIM--GL 267
LLG C+ +SG + A+G + +
Sbjct: 249 GLLGLGGAPESLVSQTSSQFGGPFSYCLPPTSGGAGFLALGAPNSSSSSTAAAGFLFTPM 308
Query: 268 DRSP----VSIITRTNTSYFSYCL---PSPYGSTGYITFGKTDT------VNTLRSAFHK 314
R P ++T T S L PS + S I G T LRSAF
Sbjct: 309 RRIPSVPTFYVVTLTGISVGGAPLAVPPSAFSSGMVIDSGTVITGLPATAYAALRSAFRS 368
Query: 315 RMKKYK---KAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQ 371
M +Y+ + G +LDTCYD + + V VP IA+ F GG ++L ++V
Sbjct: 369 AMSEYRLLPPSNGA--VLDTCYDFTGHTNVTVPTIALTFSGGATIDLATPAGVLV----D 422
Query: 372 VCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
CL FA D +GNV QR EV YD +GF G C
Sbjct: 423 GCLAFAGAGTDDTIGIIGNVNQRTFEVLYDSGKGTVGFRAGAC 465
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 121/436 (27%), Positives = 189/436 (43%), Gaps = 86/436 (19%)
Query: 57 PDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLH--LKNSRRLRKPFPEFLKR 114
P++AS+ +V ++GPC+ S PSL E LR+D+ R + + + R
Sbjct: 94 PNRASVPLVHRHGPCAP--SAASGGKPSLAERLRRDRARTNYIVTKATGGRTAATALSDA 151
Query: 115 TEAFT-FPANINDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCI--HCFQQR 170
T P + D+V EY + + IG P ++L+DTGSD++W QCKPC C+ Q+
Sbjct: 152 AGGGTSIPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQK 211
Query: 171 DPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCN------SKECPFNIQYADGSGSGGF 224
DP F S S ++ +PC+S +CR L C + C + I+Y + + + G
Sbjct: 212 DPLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGV 271
Query: 225 WATDRITIQEA------------NSNGYFTRYPFLLG----------------------C 250
++T+ +T++ + +G + ++ LLG C
Sbjct: 272 YSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYC 331
Query: 251 INNSSG-----------DKSGASGIMGLDRSPVS---------IITRTNTSYFSYCL--- 287
+ +SG + S ++ GL +P+ I+T T S L
Sbjct: 332 LPPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIP 391
Query: 288 PSPYGSTGYITFGKTDT------VNTLRSAFHKRMKKYK---KAKGLEDLLDTCYDLSAY 338
PS + S I G T LRSAF M +Y+ + G +LDTCYD + +
Sbjct: 392 PSAFSSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNG--GVLDTCYDFTGH 449
Query: 339 ETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEV 398
V VP I++ F GG ++L ++V CL FA D +GNV QR EV
Sbjct: 450 ANVTVPTISLTFSGGATIDLAAPAGVLV----DGCLAFAGAGTDNAIGIIGNVNQRTFEV 505
Query: 399 HYDVAGRRLGFGPGNC 414
YD +GF G C
Sbjct: 506 LYDSGKGTVGFRAGAC 521
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 93/320 (29%), Positives = 153/320 (47%), Gaps = 24/320 (7%)
Query: 7 AFLLFICLLCSSNNGAYADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVS 66
AF L +C+L S N+ +V SS +P C+ P +AS+ +
Sbjct: 2 AFPLLLCVLVCSYCSVALGGNEHGFV-VVPTSSFVPAAACSTPIGVGNPDPTRASVPLAH 60
Query: 67 KYGPCS-RLNQGISTHAPSLEEILRQDQQRL-HLKNSRRLRKPFPEFLKRTEAFTFPANI 124
++GPC+ + + PS E LR D+ R H+ R+ + + P +
Sbjct: 61 RHGPCAPKGSSATDKKKPSFAERLRSDRARADHILRKASGRR----MMSEGGGASIPTYL 116
Query: 125 NDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC--IHCFQQRDPFFYASKSKT 181
V EY + + IG P ++L+DTGSD++W QCKPC C+ Q+DP F SKS T
Sbjct: 117 GGFVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDPLFDPSKSST 176
Query: 182 FFKIPCNSTSCRILR-ESFPFGNCNSK-----ECPFNIQYADGSGSGGFWATDRITIQEA 235
F IPC S +C+ L + + G N+ +C + I+Y +G+ + G ++T+ + + +
Sbjct: 177 FATIPCASDACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETLALGSS 236
Query: 236 NSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYG 292
F GC ++ G G++GL +P S++++T + Y FSYCLP
Sbjct: 237 A-----VVKSFRFGCGSDQHGPYDKFDGLLGLGGAPESLVSQTASVYGGAFSYCLPPLNS 291
Query: 293 STGYITFGKTDTVNTLRSAF 312
G++T G ++ N S F
Sbjct: 292 GAGFLTLGAPNSTNNSNSGF 311
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 61/118 (51%), Gaps = 5/118 (4%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
IT T LR+AF M +Y + LDTCY+ + + TV VPK+A+ F+GG +
Sbjct: 360 ITGIPTTAYKALRTAFRSAMAEYPLLPPADSALDTCYNFTGHGTVTVPKVALTFVGGATV 419
Query: 357 ELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+LDV ++V + CL FA D + +GNV R EV YD LGF G C
Sbjct: 420 DLDVPSGVLV----EDCLAFADA-GDGSFGIIGNVNTRTIEVLYDSGKGHLGFRAGAC 472
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 99/318 (31%), Positives = 147/318 (46%), Gaps = 21/318 (6%)
Query: 4 LSKAFLLFICLLCSSNNGAYADDNDLSHSHIV-SVSSLLPPNVCNRTRTALPQGPDKASL 62
++ LLF+ +LCS + Y D H +V S P VC+ + L S+
Sbjct: 1 MASPLLLFV-VLCSYCS--YISHADNEHGFVVVPRRSYEPKAVCSASSVNLEPSSATLSV 57
Query: 63 EVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPA 122
+V +YGPC+ +Q PS E LR + R + SR A T P
Sbjct: 58 PLVHRYGPCAA-SQYSDMPTPSFSETLRHSRARTNYIKSRASTGM--ASTPDDAAVTVPT 114
Query: 123 NINDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC--IHCFQQRDPFFYASKS 179
+ V EY + + G P LL+DTGSDV+W QC PC C+ Q+DP F SKS
Sbjct: 115 RLGGFVDSLEYMVTLGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYPQKDPLFDPSKS 174
Query: 180 KTFFKIPCNSTSCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANS 237
T+ I C + +C L + + G C S +C + ++Y DGS + G ++ + IT
Sbjct: 175 STYAPIACGADACNKLGDHYRNG-CTSGGTQCGYRVEYGDGSSTRGVYSNETITFAPG-- 231
Query: 238 NGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGST 294
T F GC ++ G G++GL +P S++ +T + Y FSYCLP+
Sbjct: 232 ---ITVKDFHFGCGHDQRGPSDKFDGLLGLGGAPESLVVQTASVYGGAFSYCLPALNSEA 288
Query: 295 GYITFGKTDTVNTLRSAF 312
G++ G + T SAF
Sbjct: 289 GFLALGVRPSAATNTSAF 306
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 55/109 (50%), Gaps = 6/109 (5%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
N L +A K Y ED DTCY+ + Y V VP++A+ F GG ++LDV ++
Sbjct: 363 NALNAALRKAFAAYPMVAS-EDF-DTCYNFTGYSNVTVPRVALTFSGGATIDLDVPNGIL 420
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V + CL F PD +GNV QR EV YD ++GF G C
Sbjct: 421 V----KDCLAFRESGPDVGLGIIGNVNQRTLEVLYDAGHGKVGFRAGAC 465
>gi|194699094|gb|ACF83631.1| unknown [Zea mays]
gi|413938606|gb|AFW73157.1| hypothetical protein ZEAMMB73_333672 [Zea mays]
Length = 452
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 125/455 (27%), Positives = 180/455 (39%), Gaps = 116/455 (25%)
Query: 35 VSVSSLLPPNVCNRTRTALPQ--GPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQD 92
VS +S +P + C+ PQ A L + ++GPC+ ++ S APS+ + LR D
Sbjct: 39 VSAASFVPSSTCSSPDPVPPQRRNGTSAVLRLTHRHGPCAP-SRASSLAAPSVADTLRAD 97
Query: 93 QQRLHLKNSRRLRKPFPEFLKR---TEAFTFPANINDTVADEYYIVVA-IGEPKQYVSLL 148
Q+R RR+ P+ A T PA+ + Y+V A +G P ++
Sbjct: 98 QRRAEYIL-RRVSGRAPQLWDSKAAAAAATVPASWGYDIGTLNYVVTASLGTPGVAQTME 156
Query: 149 LDTGSDVTWTQCKPCI---HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCN 205
+DTGSD++W QCKPC C+ Q+DP F ++S ++ +PC C
Sbjct: 157 VDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVC------------- 203
Query: 206 SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIM 265
+G G + A+ Q G+F GC + SG +G G++
Sbjct: 204 -------------AGLGIYAASACSAAQCGAVQGFF------FGCGHAQSGLFNGVDGLL 244
Query: 266 GLDRSPVSIITRTNTSY---FSYCL------------------------------PSPYG 292
GL R S++ +T +Y FSYCL PSP
Sbjct: 245 GLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNA 304
Query: 293 STGYITFGKTDTVN--------------------------------TLRSAFHKRMKKYK 320
T Y+ +V LRSAF M Y
Sbjct: 305 PTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPTAYAALRSAFRSGMASYG 364
Query: 321 KAKGLED-LLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATY 379
+ +LDTCY+ + Y TV +P +A+ F G + L G L S CL FA
Sbjct: 365 YPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL-----SFGCLAFAPS 419
Query: 380 PPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
D LGNVQQR EV D G +GF P +C
Sbjct: 420 GSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 452
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 108/354 (30%), Positives = 161/354 (45%), Gaps = 78/354 (22%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y + V +G K +++++DTGSD++W QC+PC C+ Q+DP F S S ++ + C+S +
Sbjct: 135 YIVTVELGGRK--MTVIVDTGSDLSWVQCQPCKRCYNQQDPVFNPSTSPSYRTVLCSSPT 192
Query: 192 CRILRESFP-FGNCNSK--ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
C+ L+ + G C S C + + Y DGS + G T+ + + NS F+
Sbjct: 193 CQSLQSATGNLGVCGSNPPSCNYVVNYGDGSYTRGELGTEHLDL--GNSTAVNN---FIF 247
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSI-----------------ITRT------------- 278
GC N+ G GASG++GL RS +S+ IT T
Sbjct: 248 GCGRNNQGLFGGASGLVGLGRSSLSLISQTSAMFGGVFSYCLPITETEASGSLVMGGNSS 307
Query: 279 ---NTSYFSYC--LPSP-------------YGSTGYI--TFGK------TDTVNT----- 307
NT+ SY +P+P GS +FGK + TV T
Sbjct: 308 VYKNTTPISYTRMIPNPQLPFYFLNLTGITVGSVAVQAPSFGKDGMMIDSGTVITRLPPS 367
Query: 308 ----LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGT 363
L+ F K+ + A +LDTC++LS Y+ V +P I +HF G +L +DV G
Sbjct: 368 IYQALKDEFVKQFSGFPSAPAFM-ILDTCFNLSGYQEVEIPNIKMHFEGNAELNVDVTGV 426
Query: 364 --LVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
V SQVCL A+ + +GN QQ+ V YD G LGF C+
Sbjct: 427 FYFVKTDASQVCLAIASLSYENEVGIIGNYQQKNQRVIYDTKGSMLGFAAEACT 480
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 115/419 (27%), Positives = 182/419 (43%), Gaps = 91/419 (21%)
Query: 67 KYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRL--RKPFPEFLKRTEAFTFPANI 124
++GPCS ST P++ E+LR+DQ R ++ + ++++ A T P +
Sbjct: 60 RHGPCS---PAPSTVEPTMAELLRRDQLRAKYIQAKLSVNSGSGTDGVQQSAAITLPTTL 116
Query: 125 N---DTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKT 181
DT+A Y I V+IG P ++++DTGSDV+W C FF KS T
Sbjct: 117 GSALDTLA--YVITVSIGTPAMTQAVMIDTGSDVSWVHCH--ARAGAGSSLFFDPGKSST 172
Query: 182 FFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
+ C+S +C L E G + C + ++Y DGS + G + +D + + NS
Sbjct: 173 YTPFSCSSAACTRL-EGRDNGCSLNSTCQYTVRYGDGSNTTGTYGSDTLAL---NSTEKV 228
Query: 242 TRYPFLLGCINNSSG----DKSGASGIMGLDRSPVSIITRTNTSY---FSYC-------- 286
+ F GC S D+ G+MGL S++++T +Y FSYC
Sbjct: 229 ENFQF--GCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCLPATTRSS 286
Query: 287 ----LPSPYGSTGYIT---------------------------------FGKTDTVNT-- 307
L + G++G++T F +++
Sbjct: 287 GFLTLGASTGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFAAGSIMDSGT 346
Query: 308 ------------LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVD 355
L +AF M++Y +A+ +LDTC+D + + V +P + + F GG
Sbjct: 347 IITRLPPRAYSALSAAFRAGMRRYPRARAFS-ILDTCFDFTGQDNVSIPAVELVFSGGAV 405
Query: 356 LELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
++LD G + + CL FA SI +GNVQQR EV +DV LGF PG C
Sbjct: 406 VDLDADGIMYGS-----CLAFAPATGGIGSI-IGNVQQRTFEVLHDVGQSVLGFRPGAC 458
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 108/405 (26%), Positives = 168/405 (41%), Gaps = 77/405 (19%)
Query: 77 GISTHAPSLEEILRQDQQRLHLKNSR-------RLRKPFPEFLKRTEAFTFPANINDTVA 129
G ST + S E+ R D+QR+ R + + + + T P +
Sbjct: 82 GPSTASASFAEVQRADEQRVEYIQRRVSGGGARGAKGALQQLATGSRSATVPTTMG-VGT 140
Query: 130 DEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCI--HCFQQRDPFFYASKSKTFFKIPC 187
+Y + V++G P ++ +DTGSDV+W QCKPC C QRD F +KS T+ +PC
Sbjct: 141 FQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPC 200
Query: 188 NSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
+ +C LR C+ +C + + Y DGS + G + +D + + N+ G FL
Sbjct: 201 GADACSELR--IYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLALAPGNTVGT-----FL 253
Query: 248 LGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDT 304
GC + +G +G G++ L R +S+ ++ +Y FSYCLPS + GY+T G +
Sbjct: 254 FGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLTLGGPSS 313
Query: 305 VNTLRS----------AFHKRMKKYKKAKG---------------------LEDLLDTCY 333
+ + F+ M G + L T Y
Sbjct: 314 ASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVDTGTVITRLPPTAY 373
Query: 334 DL--SAYETVVVP-------------------KIAIHFLGGVDLELDVRGTLVVAS---V 369
SA+ + P + + L V L TL + + +
Sbjct: 374 AALRSAFRGAIAPCGYPSAPANGILDTCYDFSRYGVVTLPTVALTFSGGATLALEAPGIL 433
Query: 370 SQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
S CL FA D ++ LGNVQQR V +D G +GF PG C
Sbjct: 434 SSGCLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 88/291 (30%), Positives = 134/291 (46%), Gaps = 35/291 (12%)
Query: 34 IVSVSSLLP-PNVCNRT--RTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILR 90
++ V SL P P+ C T R + A + +V ++GPCS L + PS EIL
Sbjct: 44 LLRVDSLFPGPSSCTSTQERKPITATSSAARVPIVHRHGPCSPLAGAHAGKPPSHAEILA 103
Query: 91 QDQQRLH------------LKNSRRLRKPFPEFLKRTEAFTFPANINDTVAD------EY 132
DQ R+ L R +K P + + ++ + Y
Sbjct: 104 ADQNRVESLHHRVSSTTTGLGGKPRTKKKTPGHSSVPASSSSSSSSVPASSGLSLGTANY 163
Query: 133 YIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNSTS 191
+ + +G P +++ DTGSD TW QC+PC + C++Q+D F +KS T+ + C +
Sbjct: 164 VVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSSTYANVSCADPA 223
Query: 192 CRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCI 251
C L S CN+ C + IQY DGS + GF+A D + + + G F GC
Sbjct: 224 CADLDAS----GCNAGHCLYGIQYGDGSYTVGFFAKDTLAVAQDAIKG------FKFGCG 273
Query: 252 NNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITF 299
+ G +G++GL R P SI + Y FSYCLP+ +TGY+ F
Sbjct: 274 EKNRGLFGQTAGLLGLGRGPTSITVQAYEKYGGSFSYCLPASSAATGYLEF 324
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 43/96 (44%), Positives = 58/96 (60%), Gaps = 1/96 (1%)
Query: 319 YKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFAT 378
YKKA +LDTCYD + V +P +++ F GG L+LD G + S SQVCLGFA+
Sbjct: 409 YKKAAAYS-ILDTCYDFTGLSQVSLPTVSLVFQGGACLDLDASGIVYAISQSQVCLGFAS 467
Query: 379 YPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
D + +GN QQR + V YDV+ + +GF PG C
Sbjct: 468 NGDDESVGIVGNTQQRTYGVLYDVSKKVVGFAPGAC 503
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 108/405 (26%), Positives = 168/405 (41%), Gaps = 77/405 (19%)
Query: 77 GISTHAPSLEEILRQDQQRLHLKNSR-------RLRKPFPEFLKRTEAFTFPANINDTVA 129
G ST + S E+ R D+QR+ R + + + + T P +
Sbjct: 82 GPSTASASFAEVQRADEQRVEYIQRRVSGGGARGAKGALQQLATGSRSATVPTTMG-VGT 140
Query: 130 DEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCI--HCFQQRDPFFYASKSKTFFKIPC 187
+Y + V++G P ++ +DTGSDV+W QCKPC C QRD F +KS T+ +PC
Sbjct: 141 FQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPC 200
Query: 188 NSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
+ +C LR C+ +C + + Y DGS + G + +D + + N+ G FL
Sbjct: 201 GADACSELR--IYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLALAPGNTVG-----TFL 253
Query: 248 LGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDT 304
GC + +G +G G++ L R +S+ ++ +Y FSYCLPS + GY+T G +
Sbjct: 254 FGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLTLGGPTS 313
Query: 305 VNTLRS----------AFHKRMKKYKKAKG---------------------LEDLLDTCY 333
+ + F+ M G + L T Y
Sbjct: 314 ASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVDTGTVITRLPPTAY 373
Query: 334 DL--SAYETVVVP-------------------KIAIHFLGGVDLELDVRGTLVVAS---V 369
SA+ + P + + L V L TL + + +
Sbjct: 374 AALRSAFRGAIAPYGYPSAPANGILDTCYDFSRYGVVTLPTVALTFSGGATLALEAPGIL 433
Query: 370 SQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
S CL FA D ++ LGNVQQR V +D G +GF PG C
Sbjct: 434 SSGCLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 95/306 (31%), Positives = 149/306 (48%), Gaps = 27/306 (8%)
Query: 25 DDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPS 84
D D +V+ SSL P VC+ + + + ++L + ++GPCS + IS PS
Sbjct: 25 DGADAQRYIVVATSSLKPSEVCSGHKVTPSK--NGSTLALSHRHGPCSPV---ISKEKPS 79
Query: 85 LEEILRQDQQR---LHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVAD-EYYIVVAIGE 140
EE LR+DQ R + K S R E + A T P + ++ EY I V IG
Sbjct: 80 HEETLRRDQLRAAYIQAKVSSRYNNVAKEL--QQSAVTIPTSSGYSLGTTEYVITVTIGT 137
Query: 141 PKQYVSLLLDTGSDVTWTQCKPCI--HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRES 198
P + +DTGSDV+W QC PC C Q+D F + S T+ C S C L +
Sbjct: 138 PAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAMSATYSAFSCGSAQCAQLGDE 197
Query: 199 FPFGN-CNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD 257
GN C +C + ++Y DGS + G + +D +++ +++ F GC + ++G
Sbjct: 198 ---GNGCLKSQCQYIVKYGDGSNTAGTYGSDTLSLTSSDA-----VKSFQFGCSHRAAGF 249
Query: 258 KSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGS-TGYITFGKTDTVNTLRSAFH 313
G+MGL S++++T +Y FSYCLP P S G++T G ++ R + H
Sbjct: 250 VGELDGLMGLGGDTESLVSQTAATYGKAFSYCLPPPSSSGGGFLTLGAAGGASSSRYS-H 308
Query: 314 KRMKKY 319
M ++
Sbjct: 309 TPMVRF 314
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 44/109 (40%), Positives = 59/109 (54%), Gaps = 6/109 (5%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
LR+AF K MK Y A + L DTC+D S + T+ VP + + F G ++LD+ G L
Sbjct: 362 QALRTAFKKEMKAYPSAAPVGSL-DTCFDFSGFNTITVPTVTLTFSRGAAMDLDISGILY 420
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
CL F D ++ LGNVQQR E+ +DV GR +GF G C
Sbjct: 421 AG-----CLAFTATAHDGDTGILGNVQQRTFEMLFDVGGRTIGFRSGAC 464
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 131/492 (26%), Positives = 211/492 (42%), Gaps = 100/492 (20%)
Query: 3 ILSKAFLLFICLLCSSNNGAYADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQ---GPDK 59
+ S LL + LLCS + A N+ H +V ++ N + PQ P++
Sbjct: 1 MASSHMLLCVLLLCSYSLTALGGGNE-QHGFVVVPTTTGTSTSSNPACSPAPQVTSDPNR 59
Query: 60 ASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRL-HLKNSRRLRKPFPEFLKRTEAF 118
AS+ + ++GPC+ ++ PSL E LR+D+ R H+ + RT
Sbjct: 60 ASMPLAHRHGPCA---PATTSSWPSLAERLRRDRARRDHITRKAKASG-------RTTTL 109
Query: 119 T---FPANINDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC--IHCFQQRDP 172
+ P ++ V EY + + IG P ++L+DTGSD++W QCKPC C+ Q+DP
Sbjct: 110 SDVSIPTSLGAAVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNSSSCYPQKDP 169
Query: 173 FFYASKSKTFFKIPCNSTSCR-ILRESFPFGNCNSKE---CPFNIQYADGSGSGGFWATD 228
+ + S T+ +PC+S +C+ ++ +++ G NS C + I+Y + + G ++T+
Sbjct: 170 LYDPTASSTYAPVPCDSKACKDLVPDAYDHGCTNSSGTSLCQYGIEYGNRDTTVGVYSTE 229
Query: 229 RITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSY 285
+T+ S F GC G G++GL +P S++++T +Y FSY
Sbjct: 230 TLTLSPQVS-----VKDFGFGCGLVQQGTFDLFDGLLGLGGAPESLVSQTAETYGGAFSY 284
Query: 286 CLPSPYGSTGYITFG----KTDTVNTLRSAFHKRMKK----------------------- 318
CLP +TG++ G DT L + H ++
Sbjct: 285 CLPPGNSTTGFLALGAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIPPT 344
Query: 319 -------YKKAKGLEDLLDTCYD---------LSAY--------------------ETVV 342
+ L DT Y +SAY V
Sbjct: 345 VLSGGMIIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCYNFTGIANVT 404
Query: 343 VPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDV 402
VP +A+ F GG ++LDV +++ Q CL FA D + +GNV QR EV YD
Sbjct: 405 VPTVALTFDGGATIDLDVPSGVLI----QDCLAFAGGASDGDVGIIGNVNQRTFEVLYDS 460
Query: 403 AGRRLGFGPGNC 414
+GF PG C
Sbjct: 461 GRGHVGFRPGAC 472
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 106/367 (28%), Positives = 158/367 (43%), Gaps = 85/367 (23%)
Query: 119 TFPANINDTVADEYYIVVA-IGEPKQYVSLLLDTGSDVTWTQCKPCI---HCFQQRDPFF 174
T PA+ + Y+V A +G P ++ +DTGSD++W QCKPC C+ Q+DP F
Sbjct: 34 TVPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLF 93
Query: 175 YASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQE 234
++S ++ +PC C L + C++ +C + + Y DGS + G +++D +T+
Sbjct: 94 DPAQSSSYAAVPCGGPVCAGLGI-YAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSA 152
Query: 235 ANS-NGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYC---- 286
+++ G+F GC + SG +G G++GL R S++ +T +Y FSYC
Sbjct: 153 SSAVQGFF------FGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTK 206
Query: 287 --------------------------LPSPYGSTGYITFGKTDTVN-------------- 306
LPSP T Y+ +V
Sbjct: 207 PSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGG 266
Query: 307 ------------------TLRSAFHKRMKKYKKAKGLED-LLDTCYDLSAYETVVVPKIA 347
LRSAF M Y + +LDTCY+ + Y TV +P +A
Sbjct: 267 TVVDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVA 326
Query: 348 IHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRL 407
+ F G + L G L S CL FA D LGNVQQR EV D G +
Sbjct: 327 LTFGSGATVTLGADGIL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSV 379
Query: 408 GFGPGNC 414
GF P +C
Sbjct: 380 GFKPSSC 386
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 85/291 (29%), Positives = 142/291 (48%), Gaps = 22/291 (7%)
Query: 10 LFICLLCSSNNGAYADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYG 69
L +C++ + + A + V ++ P VC+ + L G + S+ +V ++G
Sbjct: 6 LLVCIILCTYEYSLAHGGNEHGFVAVPTTASEPEPVCSTSGVTLDPGSNTVSVPLVHRHG 65
Query: 70 PCSRLNQGISTHAPS-LEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTV 128
PC+ +S+ PS + LR+++ R SR + + + P ++ +V
Sbjct: 66 PCAPTQ--LSSDKPSSFTDRLRRNRARSKYIMSRVSKG----MMGDDADVSIPTHLGGSV 119
Query: 129 AD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC--IHCFQQRDPFFYASKSKTFFKI 185
EY + V +G P LL+DTGSD++W QC+PC C+ Q+DP F SKS T+ I
Sbjct: 120 DSLEYVVTVGLGTPSVSQVLLIDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTYAPI 179
Query: 186 PCNSTSCRILRESFPFGNCNS----KECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
PCN+ +CR L + G C S +C F I Y DGS + G ++ + + +
Sbjct: 180 PCNTDACRDLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNETLALAPG-----V 234
Query: 242 TRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
F GC ++ G G++GL +P S++ +T + Y FSYCLP+
Sbjct: 235 AVKDFRFGCGHDQDGANDKYDGLLGLGGAPESLVVQTASVYGGAFSYCLPA 285
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 45/109 (41%), Positives = 57/109 (52%), Gaps = 6/109 (5%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
N L++AF K M Y + E LDTCYD S Y V +PK+A+ F GG ++LDV ++
Sbjct: 366 NALQAAFRKAMAAYPLVRNGE--LDTCYDFSGYSNVTLPKVALTFSGGATIDLDVPNGIL 423
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ CL F PD LGNV QR EV YD R+GF C
Sbjct: 424 LDD----CLAFQESGPDDQPGILGNVNQRTLEVLYDAGRGRVGFRAAVC 468
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 130 bits (328), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 103/355 (29%), Positives = 153/355 (43%), Gaps = 82/355 (23%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC---IHCFQQRDPFFYASKSKTFFKIPC 187
E+ + V +G P Q +L+ DTGSD++W QC+PC HC Q+DP F SKS T+ + C
Sbjct: 148 EFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHC 207
Query: 188 NSTSCRILRESFPFGNC--NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C + G C ++ C + + Y DGS + G + D + + + + +P
Sbjct: 208 GEPQC-----AAAGGLCSEDNTTCLYLVHYGDGSSTTGVLSRDTLALTSSRA---LAGFP 259
Query: 246 FLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKT 302
F GC + GD G++GL R +S+ ++ S+ FSYCLPS +TGY+T G T
Sbjct: 260 F--GCGTRNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNSTTGYLTIGAT 317
Query: 303 DTVNTLRSAFHKRMKK------------------------------------------YK 320
+T + + ++K Y
Sbjct: 318 PATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFTRGGTLLDSGTVLTYL 377
Query: 321 KAKGLEDLLD-------------------TCYDLSAYETVVVPKIAIHFLGGVDLELDVR 361
A+ E L D CYD + V+VP ++ F G ELD
Sbjct: 378 PAQAYELLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVIVPAVSFRFGDGAVFELDFF 437
Query: 362 GTLVVASVSQVCLGFATYPPD--PNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
G ++ + CL FA P SI +GN QQR EV YDVA ++GF P +C
Sbjct: 438 GVMIFLDENVGCLAFAAMDAGGLPLSI-IGNTQQRSAEVIYDVAAEKIGFVPASC 491
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 104/351 (29%), Positives = 154/351 (43%), Gaps = 76/351 (21%)
Query: 139 GEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCR-ILRE 197
G P +++++DTGSD+TW QCKPC C+ QRDP F + S T+ + CN+++C LR
Sbjct: 155 GSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACADSLRA 214
Query: 198 SFPF-GNCNS-----KECPFNIQYADGSGSGGFWATDRITIQEANSNGY----------- 240
+ G+C S ++C + + Y DGS S G ATD + + A+ G+
Sbjct: 215 ATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGASLGGFVFGCGLSNRGL 274
Query: 241 FTRYPFLLG----------------------CI-NNSSGDKSGASGIMGLDRSPVSIITR 277
F L+G C+ +SGD SG+ + G D + S
Sbjct: 275 FGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSLSLGGGDDAASSYRNT 334
Query: 278 TNTSY-------------FSYCLPSPYGSTGYITFG--------KTDTVNT--------- 307
T +Y F + G T G + TV T
Sbjct: 335 TPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNVLIDSGTVITRLAPSVYRA 394
Query: 308 LRSAFHKRMKK--YKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+R+ F ++ Y A G +LDTCYDL+ ++ V VP + + GG D+ +D G L
Sbjct: 395 VRAEFMRQFGAAGYPAAPGFS-ILDTCYDLTGHDEVKVPLLTLRLEGGADVTVDAAGMLF 453
Query: 366 VA--SVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V SQVCL A+ + + +GN QQ+ V YD G RLGF +C
Sbjct: 454 VVRKDGSQVCLAMASLSYEDETPIIGNYQQKNKRVVYDTLGSRLGFADEDC 504
>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
Length = 393
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 111/401 (27%), Positives = 172/401 (42%), Gaps = 75/401 (18%)
Query: 43 PNVCNRTRTALPQGPDKASLEVVS-KYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNS 101
P+ + +P D S +S +YGPCS + P+ EE+LR+DQ R
Sbjct: 13 PSARGKWLATIPSSSDGTSSVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADY--- 69
Query: 102 RRLRKPFPEF-------LKRTEAFTFPANINDTVAD-EYYIVVAIGEPKQYVSLLLDTGS 153
+R+ F ++ + P + ++ EY I V +G P +++DTGS
Sbjct: 70 --IRRKFSGSNGTAAGEDGQSSKVSVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGS 127
Query: 154 DVTWTQCKPCIH---CFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSK-EC 209
DV+W QC+PC C F + S T+ C++ +C L +S C++K C
Sbjct: 128 DVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRC 187
Query: 210 PFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSG----DKSGASGIM 265
+ ++Y DGS + G F GC + G DK+ G++
Sbjct: 188 QYIVKYGDGSNTTGTG--------------------FQFGCSHAELGAGMDDKT--DGLI 225
Query: 266 GLDRSPVSIITRTN-------TSYFSY------------CLPSPYGSTGYITFGKTDT-- 304
GL S++++T T YF+ PS + + + G T
Sbjct: 226 GLGGDAQSLVSQTAARSKKVPTYYFAALEDIAVGGKKLGLSPSVFAAGSLVDSGTVITRL 285
Query: 305 ----VNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDV 360
L SAF M +Y +A+ L +LDTC++ + + V +P +A+ F GG ++LD
Sbjct: 286 PPAAYAALSSAFRAGMTRYARAEPL-GILDTCFNFTGLDKVSIPTVALVFAGGAVVDLDA 344
Query: 361 RGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYD 401
G VS CL FA D T+GNVQQR EV YD
Sbjct: 345 HGI-----VSGGCLAFAPTRDDKAFGTIGNVQQRTFEVLYD 380
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 100/358 (27%), Positives = 152/358 (42%), Gaps = 66/358 (18%)
Query: 120 FPANINDTVADEYYIVVA-IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASK 178
P V YIV A G P + L++DTGSDVTW QCKPC C+ Q DP F +
Sbjct: 125 LPLQPGSKVGTGNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPCSDCYSQVDPIFEPQQ 184
Query: 179 SKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEAN-- 236
S ++ + C S++C L +C C + I Y DGS S G ++ + +T+ +
Sbjct: 185 SSSYKHLSCLSSACTELTT---MNHCRLGGCVYEINYGDGSRSQGDFSQETLTLGSDSFP 241
Query: 237 ---------SNGYFTRYPFLLG----------------------CI-----NNSSGDKSG 260
+ G F LLG C+ + S+G S
Sbjct: 242 SFAFGCGHTNTGLFKGSAGLLGLGRTALSFPSQTKSKYGGQFSYCLPDFVSSTSTGSFSV 301
Query: 261 ASGIMGLDRSPVSIITRTNTSYFSYC--------------LPSPYGSTGYITFGKT---- 302
G + + V +++ +N F + P+ G G I T
Sbjct: 302 GQGSIPATATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVLGRGGTIVDSGTVITR 361
Query: 303 ---DTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELD 359
+ L+++F + + AK +LDTCYDLS+Y V +P I HF D+ +
Sbjct: 362 LVPQAYDALKTSFRSKTRNLPSAKPFS-ILDTCYDLSSYSQVRIPTITFHFQNNADVAVS 420
Query: 360 VRGTL--VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
G L + + SQVCL FA+ ++ +GN QQ+ V +D R+GF PG+C+
Sbjct: 421 AVGILFTIQSDGSQVCLAFASASQSISTNIIGNFQQQRMRVAFDTGAGRIGFAPGSCA 478
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 106/391 (27%), Positives = 164/391 (41%), Gaps = 70/391 (17%)
Query: 88 ILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSL 147
+ R + + HL+ +RL +L ++D + EY++ V +G P L
Sbjct: 89 VARDNARVEHLE--KRLVASTSPYLPEDLVSEVVPGVDDG-SGEYFVRVGVGSPPTDQYL 145
Query: 148 LLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSK 207
++D+GSDV W QC+PC C+ Q DP F + S +F + C S CR L + G ++
Sbjct: 146 VVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAG 205
Query: 208 ECPFNIQYADGSGSGGFWATDRITIQEA-----------NSNGYFTRYPFLLGC------ 250
+C +++ Y DGS + G A + +T+ ++G F LLG
Sbjct: 206 KCDYSVTYGDGSYTKGELALETLTLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMS 265
Query: 251 ----INNSSGD---------KSGASGIMGLDRSP--------VSIITRTNTSYFSYC--- 286
+ ++G +G +G + L R+ V ++ S F Y
Sbjct: 266 LVGQLGGAAGGVFSYCLASRGAGGAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLT 325
Query: 287 --------LP--------SPYGSTGYITFGKT-------DTVNTLRSAFHKRMKKYKKAK 323
LP + G+ G + T + LR AF M ++
Sbjct: 326 GIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSP 385
Query: 324 GLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDP 383
+ LLDTCYDLS Y +V VP ++ +F G L L R LV + CL FA P
Sbjct: 386 AVS-LLDTCYDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVGGAVFCLAFA--PSSS 442
Query: 384 NSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
LGN+QQ G ++ D A +GFGP C
Sbjct: 443 GISILGNIQQEGIQITVDSANGYVGFGPNTC 473
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 100/351 (28%), Positives = 154/351 (43%), Gaps = 78/351 (22%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNS 189
Y + +G P ++++DTGS +TW QC PC + C +Q P F S T+ + C++
Sbjct: 133 NYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYASVRCSA 192
Query: 190 TSCRILRESF--PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-F 246
+ C L+ + P S C + Y D S S G +TD ++ TRYP F
Sbjct: 193 SQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGSLSTDTVSFGS-------TRYPSF 245
Query: 247 LLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTD 303
GC ++ G ++G++GL R+ +S++ + S FSYCLP+ STGY++ G +
Sbjct: 246 YYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTA-ASTGYLSIGPYN 304
Query: 304 T----------------------------------------------------VNTLRSA 311
T + L +A
Sbjct: 305 TGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITRLPTA 364
Query: 312 FHKRMKK--YKKAKGLE-----DLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL 364
H + K + G + +LDTC++ A + + VP +A+ F GG ++L R L
Sbjct: 365 VHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQ-LRVPTVAMAFAGGASMKLTTRNVL 423
Query: 365 VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ S CL FA P D +I +GN QQ+ V YDVA R+GF G CS
Sbjct: 424 IDVDDSTTCLAFA--PTDSTAI-IGNTQQQTFSVIYDVAQSRIGFSAGGCS 471
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 108/365 (29%), Positives = 147/365 (40%), Gaps = 82/365 (22%)
Query: 126 DTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKI 185
D + EY + V++G P L++D+GSDV W QCKPC+ C+ Q DP F + S TF +
Sbjct: 165 DEGSGEYLVRVSVGSPPTEQYLVVDSGSDVMWVQCKPCLECYVQADPLFDPATSATFSGV 224
Query: 186 PCNSTSCRILRESFPFGNCNSKE---CPFNIQYADGSGSGGFWATDRITIQEA------- 235
C S CRIL P C E C + + YADGS + G A + +T+
Sbjct: 225 SCGSAICRIL----PTSACGDGELGGCEYEVSYADGSYTKGALALETLTLGGTAVEGVVI 280
Query: 236 ----NSNGYFTRYPFLLG----------------------CINNSSGDKSGAS----GIM 265
+ G F L+G C+ + G SGA+ G +
Sbjct: 281 GCGHRNRGLFVGAAGLMGLGWGPMSLVGQLGGEVGGAFSYCLASRGGYGSGAADDDAGWL 340
Query: 266 GLDRSP--------VSIITRTNTSYFSYC-----------LPSPYGSTGYITFGKTDTV- 305
L RS V ++ F Y LP G G D V
Sbjct: 341 VLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLFQLTEDGAGDVVM 400
Query: 306 --------------NTLRSAFHKRMK-KYKKAKGLED-LLDTCYDLSAYETVVVPKIAIH 349
LR AF + +A+G+ +LDTCYDLS Y +V VP ++
Sbjct: 401 DTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCYDLSGYASVRVPTVSFC 460
Query: 350 FLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGF 409
F G L L R L+ + CL FA P +GN QQ G ++ D A +GF
Sbjct: 461 FDGDARLILAARNVLLEVDMGIYCLAFA--PSSSGLSIMGNTQQAGIQITVDSANGYIGF 518
Query: 410 GPGNC 414
GP NC
Sbjct: 519 GPANC 523
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 106/391 (27%), Positives = 164/391 (41%), Gaps = 70/391 (17%)
Query: 88 ILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSL 147
+ R + + HL+ +RL +L ++D + EY++ V +G P L
Sbjct: 89 VARDNARVEHLE--KRLVASTSPYLPEDLVSEVVPGVDDG-SGEYFVRVGVGSPPTDQYL 145
Query: 148 LLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSK 207
++D+GSDV W QC+PC C+ Q DP F + S +F + C S CR L + G ++
Sbjct: 146 VVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAG 205
Query: 208 ECPFNIQYADGSGSGGFWATDRITIQEA-----------NSNGYFTRYPFLLGC------ 250
+C +++ Y DGS + G A + +T+ ++G F LLG
Sbjct: 206 KCDYSVTYGDGSYTKGELALETLTLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMS 265
Query: 251 ----INNSSGD---------KSGASGIMGLDRSP--------VSIITRTNTSYFSYC--- 286
+ ++G +G +G + L R+ V ++ S F Y
Sbjct: 266 LIGQLGGAAGGVFSYCLASRGAGGAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLT 325
Query: 287 --------LP--------SPYGSTGYITFGKT-------DTVNTLRSAFHKRMKKYKKAK 323
LP + G+ G + T + LR AF M ++
Sbjct: 326 GIGVGGERLPLQDGLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSP 385
Query: 324 GLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDP 383
+ LLDTCYDLS Y +V VP ++ +F G L L R LV + CL FA P
Sbjct: 386 AVS-LLDTCYDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVGGAVFCLAFA--PSSS 442
Query: 384 NSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
LGN+QQ G ++ D A +GFGP C
Sbjct: 443 GISILGNIQQEGIQITVDSANGYVGFGPNTC 473
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 104/352 (29%), Positives = 146/352 (41%), Gaps = 72/352 (20%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ V +G P L++D+GSDV W QC+PC C+QQ DP F + S +F +PC+S
Sbjct: 132 EYFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAECYQQADPLFDPAASASFTAVPCDSG 191
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEAN------------SN 238
CR L G +S C + + Y DGS + G A + +T ++ +
Sbjct: 192 VCRTL-PGGSSGCADSGACRYQVSYGDGSYTQGVLAMETLTFGDSTPVQGVAIGCGHRNR 250
Query: 239 GYFTRYPFLLG----------------------CINNSSGDKSGASGIMGLDRS-PVS-- 273
G F LLG C+ + D S + G D + PV
Sbjct: 251 GLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGADAGAGSLVFGRDDAMPVGAV 310
Query: 274 ---IITRTNTSYFSYC-----------LP--------SPYGSTGYITFGKT-------DT 304
++ F Y LP + G G + T D
Sbjct: 311 WVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLTEDGGGGVVMDTGTAVTRLPPDA 370
Query: 305 VNTLRSAFHKRMK-KYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHF-LGGVDLELDVRG 362
LR AF + +A G+ LLDTCYDLS Y +V VP +A++F G L L R
Sbjct: 371 YAALRDAFASTIGGDLPRAPGVS-LLDTCYDLSGYASVRVPTVALYFGRDGAALTLPARN 429
Query: 363 TLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
LV CL FA + LGN+QQ+G ++ D A +GFGP C
Sbjct: 430 LLVEMGGGVYCLAFAASASGLS--ILGNIQQQGIQITVDSANGYVGFGPSTC 479
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 101/354 (28%), Positives = 150/354 (42%), Gaps = 82/354 (23%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V IG P + + ++LDTGSDVTW QC+PC C+QQ DP F S S ++ + C+S
Sbjct: 165 EYFSRVGIGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQ 224
Query: 191 SCRILRESFPFGNCN--SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
CR L + C + C + + Y DGS + G +AT+ +T+ ++ G +
Sbjct: 225 RCRDLDTA----ACRNATGACLYEVAYGDGSYTVGDFATETLTLGDSTPVGNVA-----I 275
Query: 249 GCINNSSG----------------------------------DKSGASGIM--------G 266
GC +++ G D AS + G
Sbjct: 276 GCGHDNEGLFVGAAGLLALGGGPLSFPSQISASTFSYCLVDRDSPAASTLQFGDGAAEAG 335
Query: 267 LDRSPVSIITRTNTSYF------------------SYCLPSPYGSTGYITFGKTDTVN-- 306
+P+ RT+T Y+ ++ + + GS G I T
Sbjct: 336 TVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQ 395
Query: 307 -----TLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVR 361
LR AF + + G+ L DTCYDLS +V VP +++ F GG L L +
Sbjct: 396 SAAYAALRDAFVQGAPSLPRTSGVS-LFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAK 454
Query: 362 GTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L+ V CL FA P + +GNVQQ+G V +D A +GF P C
Sbjct: 455 NYLIPVDGAGTYCLAFA--PTNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 506
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 90/299 (30%), Positives = 137/299 (45%), Gaps = 44/299 (14%)
Query: 35 VSVSSLLPPNVCNRTRT--ALPQGPDKASLEVVSKYGPCSRL-NQGISTHAPSLEEILRQ 91
+ SLLP T P+ + +V ++GPCS L + APS EIL
Sbjct: 38 LDAESLLPSAAAASCHTPEQRPEAGTATRMPIVHQHGPCSPLADDKHGKKAPSHTEILVA 97
Query: 92 DQQRLHLKNSR------RLRK-----PFPEF---------------LKRTEAFTFPANIN 125
DQ+R+ + R R+R+ P E + PA
Sbjct: 98 DQRRVEYIHRRVSETTGRVRRQKHSAPVVELRPGTPSSTRSSSSSLSSSATSTNLPAKSG 157
Query: 126 DTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCI-HCFQQRDPFFYASKSKTFF 183
++ Y+V + +G P +++ DTGSD TW QC+PC+ +C+QQ++P F +KS T+
Sbjct: 158 LSLNTGNYVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQKEPLFTPTKSATYA 217
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
I C S+ C L C+ C + +QY DGS + GF+A D +T+ GY T
Sbjct: 218 NISCTSSYCSDLDTR----GCSGGHCLYAVQYGDGSYTVGFYAQDTLTL------GYDTV 267
Query: 244 YPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITF 299
F GC + G A+G+MGL R S+ + Y F+YC+P+ TG++ F
Sbjct: 268 KDFRFGCGEKNRGLFGKAAGLMGLGRGKTSVPVQAYDKYSGVFAYCIPATSSGTGFLDF 326
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 48/110 (43%), Positives = 67/110 (60%), Gaps = 4/110 (3%)
Query: 308 LRSAFHKRMKK--YKKAKGLEDLLDTCYDLSAYE-TVVVPKIAIHFLGGVDLELDVRGTL 364
LRSAF K M+ YK A +LDTCYDL+ Y+ ++ +P +++ F GG L++D G L
Sbjct: 395 LRSAFAKGMEGLGYKTAPAFS-ILDTCYDLTGYQGSIALPAVSLVFQGGACLDVDASGIL 453
Query: 365 VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
VA VSQ CL FA D + +GN QQ+ + V YD+ + +GF PG C
Sbjct: 454 YVADVSQACLAFAANDDDTDMTIVGNTQQKTYSVLYDLGKKVVGFAPGAC 503
>gi|413938616|gb|AFW73167.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 452
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 120/455 (26%), Positives = 175/455 (38%), Gaps = 116/455 (25%)
Query: 35 VSVSSLLPPNVCNRTRTALP--QGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQD 92
VS +S +P + C+ P + A L + ++GPC+ ++ S APS+ + LR D
Sbjct: 39 VSAASFVPSSTCSSPDRVPPHRRNGTSAVLRLTHRHGPCAP-SRASSLAAPSVADTLRAD 97
Query: 93 QQRLHLKNSRRLRKPFPEFLKRTEAFTFPANIN----DTVADEYYIVVAIGEPKQYVSLL 148
Q+R RR+ P+ A D Y + ++G P ++
Sbjct: 98 QRRAEYIL-RRVSGRAPQLWDSKAAAAVATVPASWGYDIGTLNYVVTASLGTPGVAQTME 156
Query: 149 LDTGSDVTWTQCKPCI---HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCN 205
+DTGSD++W QCKPC C+ Q+DP F ++S ++ +PC C
Sbjct: 157 VDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVC------------- 203
Query: 206 SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIM 265
+G G + A+ Q G+F GC + SG +G G++
Sbjct: 204 -------------AGLGIYAASACSAAQCGAVQGFF------FGCGHAQSGLFNGVDGLL 244
Query: 266 GLDRSPVSIITRTNTSY---FSYCL------------------------------PSPYG 292
GL R S++ +T +Y FSYCL PSP
Sbjct: 245 GLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNA 304
Query: 293 STGYITFGKTDTVN--------------------------------TLRSAFHKRMKKYK 320
T Y+ +V LRSAF M Y
Sbjct: 305 PTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPTAYAALRSAFRSGMASYG 364
Query: 321 KAKGLED-LLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATY 379
+ +LDTCY+ + Y TV +P +A+ F G + L G L S CL FA
Sbjct: 365 YPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL-----SFGCLAFAPS 419
Query: 380 PPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
D LGNVQQR EV D G +GF P +C
Sbjct: 420 GSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 452
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 96/351 (27%), Positives = 154/351 (43%), Gaps = 71/351 (20%)
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
+V +G Q +L++DTGSD+TW QC PC C+ Q++P F S S +F +PCNS +C
Sbjct: 146 IVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVA 205
Query: 195 LR----ESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA-----------NSNG 239
L+ S N NS C + I Y DGS S G +++T+ + N+ G
Sbjct: 206 LQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDNFIFGCGRNNKG 265
Query: 240 YFTRYPFLLG----------------------CINNSSGDKSGASGIMGLD------RSP 271
F L+G C+ + SG+ + G D SP
Sbjct: 266 LFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISP 325
Query: 272 VS---IITRTNTSYFSYC-------------LPSPYGSTGYITFGKTDTVNT-------- 307
+S +I S F + +P + G ++ + TV T
Sbjct: 326 ISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYK 385
Query: 308 -LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL-- 364
++ F K+ Y+ G +L+TC++L+ YE V +P + F G ++ +DV G
Sbjct: 386 AFKAEFEKQFSGYRTTPGF-SILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYF 444
Query: 365 VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
V + SQ+CL FA+ + ++ +GN QQ+ V Y+ ++GF CS
Sbjct: 445 VKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCS 495
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 96/351 (27%), Positives = 154/351 (43%), Gaps = 71/351 (20%)
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
+V +G Q +L++DTGSD+TW QC PC C+ Q++P F S S +F +PCNS +C
Sbjct: 67 IVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVA 126
Query: 195 LR----ESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA-----------NSNG 239
L+ S N NS C + I Y DGS S G +++T+ + N+ G
Sbjct: 127 LQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDNFIFGCGRNNKG 186
Query: 240 YFTRYPFLLG----------------------CINNSSGDKSGASGIMGLD------RSP 271
F L+G C+ + SG+ + G D SP
Sbjct: 187 LFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISP 246
Query: 272 VS---IITRTNTSYFSYC-------------LPSPYGSTGYITFGKTDTVNT-------- 307
+S +I S F + +P + G ++ + TV T
Sbjct: 247 ISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYK 306
Query: 308 -LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL-- 364
++ F K+ Y+ G +L+TC++L+ YE V +P + F G ++ +DV G
Sbjct: 307 AFKAEFEKQFSGYRTTPGFS-ILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYF 365
Query: 365 VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
V + SQ+CL FA+ + ++ +GN QQ+ V Y+ ++GF CS
Sbjct: 366 VKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCS 416
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 112/427 (26%), Positives = 175/427 (40%), Gaps = 86/427 (20%)
Query: 58 DKASLEVVSKYGPCSRL-NQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTE 116
++ S+ + + GPCS + +G A E+LR+D++R R R +
Sbjct: 59 NRVSVPLAHRNGPCSPVRGKGELPRA----EMLRRDRERTEYIIRRASRSR--RLQDNND 112
Query: 117 AFTFPANINDTV-ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC--IHCFQQRDPF 173
A + P + + + EY V +G P +L+LDTGS +TW QCKPC C+ QR P
Sbjct: 113 AVSVPTQLGSSYDSQEYVATVGLGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPL 172
Query: 174 FYASKSKTFFKIPCNSTSCRILRESFPFGNCNSK---ECPFNIQYADGSGSGGFWATDRI 230
F + S ++ +PC+S CR L C S C + I Y G+ G ++TD +
Sbjct: 173 FDPNTSSSYSPVPCDSQECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDAL 232
Query: 231 TIQEANSNGYFTRYPFLLGCINNSS-GDKSGASGIMGLDRSPVSII----TRTNTSYFSY 285
T+ R+ F GC ++ G A G++GL R P S+ R FS+
Sbjct: 233 TL---GPGAIVKRFHF--GCGHHQQRGKFDMADGVLGLGRLPQSLAWQASARRGGGVFSH 287
Query: 286 CLPSPYGSTGYITFGKTDTVNTL----------RSAFHKRMKKYKKAKG----------- 324
CLP STG++ G + + F++ M G
Sbjct: 288 CLPPTGVSTGFLALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVFR 347
Query: 325 ----------LEDLLDTCY---------------------------DLSAYETVVVPKIA 347
L L +T Y + + Y+ V VP ++
Sbjct: 348 EGVITDSGTVLSALQETAYTALRTAFRSAMAEYPLAPPVGHLDTCFNFTGYDNVTVPTVS 407
Query: 348 IHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRL 407
+ F GG + LD +++ CL F + D + +G+V QR EV YD+ GR++
Sbjct: 408 LTFRGGATVHLDASSGVLMDG----CLAFWSS-GDEYTGLIGSVSQRTIEVLYDMPGRKV 462
Query: 408 GFGPGNC 414
GF G C
Sbjct: 463 GFRTGAC 469
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 93/358 (25%), Positives = 151/358 (42%), Gaps = 85/358 (23%)
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
V +G ++++DT S++TW QC+PC C Q+DP F S S ++ +PCNS+SC
Sbjct: 121 VATVGLGAAEATVVVDTASELTWVQCQPCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDA 180
Query: 195 LRESFPFG-------NCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
LR + G N C + + Y DGS S G A D++ + + G F+
Sbjct: 181 LRVAMAAGTSPCADDNEQQPACSYALSYRDGSYSRGVLARDKLRLAGQDIEG------FV 234
Query: 248 LGCINNSSGD-------------------------------------KSGASG--IMGLD 268
GC ++ G +SG+SG ++G D
Sbjct: 235 FGCGTSNQGAPFGGTSGLMGLGRSHVSLVSQTMDQFGGVFSYCLPMRESGSSGSLVLGDD 294
Query: 269 RS----------------------PVSIITRTNTSYFSYCLPSPYGSTGYITFGKTDTVN 306
S P + T + + SP+ S G + +
Sbjct: 295 SSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQEVESPWFSAGRVIIDSGTIIT 354
Query: 307 TL--------RSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLEL 358
TL R+ F ++ +Y +A +LDTC++L+ + V VP + F G V++E+
Sbjct: 355 TLVPSVYNAVRAEFLSQLAEYPQAPAFS-ILDTCFNLTGLKEVQVPSLKFVFEGSVEVEV 413
Query: 359 DVRGTL--VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
D +G L V + SQVCL A+ + ++ +GN QQ+ V +D G ++GF C
Sbjct: 414 DSKGVLYFVSSDASQVCLALASLKSEYDTSIIGNYQQKNLRVIFDTLGSQIGFAQETC 471
>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 475
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 129/490 (26%), Positives = 193/490 (39%), Gaps = 100/490 (20%)
Query: 7 AFLLFICLLCSSNNGAYADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVS 66
+ ++ + L SS+ ++ H+V+ S L P ++C+ + A D + +
Sbjct: 4 SLVVILLLSISSSVASHGAGAGSQRYHVVATSHLEPESLCSGLKVA--PSADGTWVPLHR 61
Query: 67 KYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRR-------LRKPFPEFLKRTEAFT 119
+GPCS APSL E+LR DQ R + L P L F
Sbjct: 62 PFGPCS--PSAGRAPAPSLLEMLRWDQVRTEYVRRKASGGAEDVLNPAKPRVLMSQTDFA 119
Query: 120 FPANINDTVADEYYIVV-AIGEPK--QYVSLLLDTGSDVTWTQCKPCI--HCFQQRDPFF 174
+ + A G+P ++ +DT DV W QC PC C+ QRDP F
Sbjct: 120 VRSPFGVGSGSGSSAWIDADGDPTVVSQQTMAIDTTVDVPWIQCAPCPIPQCYPQRDPLF 179
Query: 175 YASKSKTFFKIPCNSTSCRILRESFPFGN-CNSK----ECPFNIQYADGSGSGGFWATDR 229
+ S T + C S +CR L P+GN C+++ EC + I+Y+D + G + TD
Sbjct: 180 DPTTSSTAAAVRCRSPACRSLG---PYGNGCSNRSANAECRYLIEYSDDRATAGTYMTDT 236
Query: 230 ITIQEANSNGYFTRYPFLLGCINNSSGDKSG-ASGIMGLDRSPVSIITRTNTSY---FSY 285
+TI +G F GC + G S +G M L S++ +T S FSY
Sbjct: 237 LTI-----SGTTAVRNFRFGCSHAVRGRFSDLTAGTMSLGGGAQSLLAQTARSLGNAFSY 291
Query: 286 CLPSPYGSTGYITFGKTDTVNT---------LRSAFHKRM-------------------- 316
C+P S G+++ G T N+ +RSA + +
Sbjct: 292 CVPQASAS-GFLSIGGPATTNSTTVFATTPLVRSAINPSLYLVRLQGIVVAGRRLGIPPV 350
Query: 317 --------------------------KKYKKA------KGLEDLLDTCYDLSAYETVVVP 344
+ ++ A G LDTCYD V VP
Sbjct: 351 AFSAGAVMDSSAVITQLPPTAYRALRRAFRNAMRAYPRSGATGTLDTCYDFLGLTNVRVP 410
Query: 345 KIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAG 404
+++ F GG + LD ++ CL F D +GNVQQ+ HEV YDVA
Sbjct: 411 AVSLVFGGGAVVVLDPPAVMIGG-----CLAFTATSSDLALGFIGNVQQQTHEVLYDVAA 465
Query: 405 RRLGFGPGNC 414
+GF G C
Sbjct: 466 GGVGFRRGAC 475
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 101/346 (29%), Positives = 155/346 (44%), Gaps = 72/346 (20%)
Query: 139 GEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSC--RILR 196
G P +++++DTGSD+TW QCKPC C+ QRDP F + S T+ + CN+++C +
Sbjct: 197 GSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACAASLKA 256
Query: 197 ESFPFGNCN--SKECPFNIQYADGSGSGGFWATDRITIQEANSNGY-----------FTR 243
+ G+C ++ C + + Y DGS S G ATD + + A+ +G+ F
Sbjct: 257 ATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVALGGASLDGFVFGCGLSNRGLFGG 316
Query: 244 YPFLLG----------------------CI-NNSSGDKSGASGIMG-----LDRSPVSII 275
L+G C+ +SGD SG+ + G + +PV+
Sbjct: 317 TAGLMGLGRTELSLVSQTALRYGGVFSYCLPATTSGDASGSLSLGGDASSYRNTTPVAY- 375
Query: 276 TRTNTS------YFSYCLPSPYGSTGYITFG--------KTDTVNT---------LRSAF 312
TR YF + G T G + TV T +R+ F
Sbjct: 376 TRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNVLIDSGTVITRLAPSVYRGVRAEF 435
Query: 313 HKRMKK--YKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASV- 369
++ Y A G +LDTCYDL+ ++ V VP + + GG ++ +D G L V
Sbjct: 436 TRQFAAAGYPTAPGFS-ILDTCYDLTGHDEVKVPLLTLRLEGGAEVTVDAAGMLFVVRKD 494
Query: 370 -SQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
SQVCL A+ + + +GN QQ+ V YD G RLGF +C
Sbjct: 495 GSQVCLAMASLSYEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 540
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 102/349 (29%), Positives = 152/349 (43%), Gaps = 72/349 (20%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V IG P + + ++LDTGSDVTW QC+PC C+QQ DP F S S ++ + C+S
Sbjct: 168 EYFSRVGIGSPARELYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSP 227
Query: 191 SCRILRESFPFGNCN--SKECPFNIQYADGSGSGGFWATDRITIQEA------------N 236
CR L + C + C + + Y DGS + G +AT+ +T+ ++ +
Sbjct: 228 RCRDLDTA----ACRNATGACLYEVAYGDGSYTVGDFATETLTLGDSTPVTNVAIGCGHD 283
Query: 237 SNGYFT-------------RYPFLLGCINNS----SGDKSGASGIM-GLDRSPVSIIT-- 276
+ G F +P + S D AS + G D + +T
Sbjct: 284 NEGLFVGAAGLLALGGGPLSFPSQISASTFSYCLVDRDSPAASTLQFGADGAEADTVTAP 343
Query: 277 -----RTNTSYF------------------SYCLPSPYGSTGYI-------TFGKTDTVN 306
RT T Y+ ++ + + GS G I T ++
Sbjct: 344 LVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSGSGGVIVDSGTAVTRLQSSAYA 403
Query: 307 TLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV- 365
LR AF + + G+ L DTCYDLS +V VP +++ F GG L L + L+
Sbjct: 404 ALRDAFVRGTPSLPRTSGVS-LFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIP 462
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V CL FA P + +GNVQQ+G V +D A +GF P C
Sbjct: 463 VDGAGTYCLAFA--PTNAAVSIIGNVQQQGTRVSFDTAKGVVGFTPNKC 509
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 124/457 (27%), Positives = 197/457 (43%), Gaps = 71/457 (15%)
Query: 18 SNNGAYADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQG 77
S+ A D ++S+ S +VC++++ A++ + ++GPCS L
Sbjct: 16 SHRSPIARAGDDGSYKVLSMGSPRTDSVCSQSKAVPSSSAGAATVPLHHRHGPCSPLP-- 73
Query: 78 ISTHAPSLEEILRQDQQRL-HLKNSRRLRKPFPEFLKRTEAFTFPANINDTVAD-EYYIV 135
+ P+LEE L +DQ R +++ ++R++A T P + ++ EY I
Sbjct: 74 -TKKMPTLEETLHRDQLRAAYIQRKFSGGGGAGGDVQRSDA-TVPTALGTSLNTLEYLIT 131
Query: 136 VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRIL 195
V +G P ++L+DTGSDV+W QCKPC C Q DP F S S T+ C S +C L
Sbjct: 132 VGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSAACAQL 191
Query: 196 RESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSS 255
+ G +S +C + + Y DGS + G +++D + + G F GC N S
Sbjct: 192 GQEG-NGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL------GSSAVKSFQFGCSNVES 244
Query: 256 GDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDT------VN 306
G G+MGL S++++T + FSYCLP S+G++T G V
Sbjct: 245 GFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGSGTSGFVK 304
Query: 307 T--LRSA----FHKRMKKYKKAKGLE-----------DLLDTCYDLSAYETVVVPKIAIH 349
T LRS+ F+ + + G + ++D+ ++ ++
Sbjct: 305 TPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVMDSGTVITRLPPTAYSALSSA 364
Query: 350 FLGGV------------DLELDVRG--TLVVASVSQVCLGFATYPPDPNSITL------- 388
F G+ D D G ++ + SV+ V G A D + I L
Sbjct: 365 FKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIILSNCLAFA 424
Query: 389 -----------GNVQQRGHEVHYDVAGRRLGFGPGNC 414
GNVQQR EV YDV +GF G C
Sbjct: 425 ANSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 461
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 98/351 (27%), Positives = 153/351 (43%), Gaps = 78/351 (22%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNS 189
Y + +G P ++++DTGS +TW QC PC + C +Q P F S T+ + C++
Sbjct: 133 NYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYTSVRCSA 192
Query: 190 TSCRILRESF--PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-F 246
+ C L+ + P S C + Y D S S G+ +TD ++ T YP F
Sbjct: 193 SQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGYLSTDTVSFGS-------TSYPSF 245
Query: 247 LLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTD 303
GC ++ G ++G++GL R+ +S++ + S FSYCLP+ STGY++ G +
Sbjct: 246 YYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTA-ASTGYLSIGPYN 304
Query: 304 T----------------------------------------------------VNTLRSA 311
T + L +A
Sbjct: 305 TGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITRLPTA 364
Query: 312 FHKRMKK--YKKAKGLE-----DLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL 364
H + K + G + +LDTC++ A + + VP + + F GG ++L R L
Sbjct: 365 VHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQ-LRVPTVVMAFAGGASMKLTTRNVL 423
Query: 365 VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ S CL FA P D +I +GN QQ+ V YDVA R+GF G CS
Sbjct: 424 IDVDDSTTCLAFA--PTDSTAI-IGNTQQQTFSVIYDVAQSRIGFSAGGCS 471
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 129/452 (28%), Positives = 200/452 (44%), Gaps = 75/452 (16%)
Query: 22 AYADDNDLSHSHIVSVSSLLPPNV-CNRTRTALPQGPDKASLEVVSKYGPCSRLNQGIST 80
A+A D DL ++ V SL V C+ + A G L ++GPCS + ST
Sbjct: 21 AHAGD-DLRSYKVLPVGSLKSAAVSCSLPKVAPSSGVVTVPLH--HRHGPCSTVP---ST 74
Query: 81 HAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVAD-EYYIVVAIG 139
+AP+LE++LR+DQ R + T P + ++ EY I V +G
Sbjct: 75 NAPTLEDMLRRDQLRAAYITRKYSGVNGSAGDVEGSDVTVPTTLGTSLDTLEYLITVGMG 134
Query: 140 EPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESF 199
P ++L+DTGSDV+W QCKPC C Q D F S S T+ C S +C LR+
Sbjct: 135 SPAVAQTMLIDTGSDVSWVQCKPCSQCHSQADSLFDPSSSSTYSAFSCTSAACAQLRQR- 193
Query: 200 PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD-- 257
C+S +C + ++Y DGS G +++D + + G T F GC + SG+
Sbjct: 194 ---GCSSSQCQYTVKYGDGSTGSGTYSSDTLAL------GSSTVENFQFGCSQSESGNLL 244
Query: 258 KSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDT---VNT--LR 309
+ +G+MGL S+ T+T ++ FSYCLP GS+G++T G + + V T LR
Sbjct: 245 QDQTAGLMGLGGGAESLATQTAGTFGKAFSYCLPPTPGSSGFLTLGASTSGFVVKTPMLR 304
Query: 310 S----AFHKRMKKYKKAKGLE-----------DLLDTCYDLSAYETVVVPKIAIHFLGGV 354
S +++ + + + G + ++D+ ++ ++ F G+
Sbjct: 305 STQVPSYYGVLLQAIRVGGRQLNIPASAFSAGSIMDSGTIITRLPRTAYSALSSAFKAGM 364
Query: 355 ------------DLELDVRG--TLVVASVSQVCLGFATYPPDPNSITLG----------- 389
D D G ++ + +V+ V G A + I LG
Sbjct: 365 KQYPPAQPMGIFDTCFDFSGQSSVSIPTVALVFSGGAVVDLASDGIILGSCLAFAANSDD 424
Query: 390 -------NVQQRGHEVHYDVAGRRLGFGPGNC 414
NVQQR EV YDV G +GF G C
Sbjct: 425 TSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 456
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 124/457 (27%), Positives = 196/457 (42%), Gaps = 71/457 (15%)
Query: 18 SNNGAYADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQG 77
S+ A D ++S+ S +VC++++ A++ + ++GPCS L
Sbjct: 16 SHRSPIARAGDDGSYKVLSMGSPRTDSVCSQSKAVPSSSAGAATVPLHHRHGPCSPLP-- 73
Query: 78 ISTHAPSLEEILRQDQQRL-HLKNSRRLRKPFPEFLKRTEAFTFPANINDTVAD-EYYIV 135
+ P+LEE L +DQ R +++ ++R++A T P + ++ EY I
Sbjct: 74 -TKKMPTLEETLHRDQLRAAYIQRKFSGGGGAGGDVQRSDA-TVPTALGTSLNTLEYLIT 131
Query: 136 VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRIL 195
V +G P ++L+DTGSDV+W QCKPC C Q DP F S S T+ C S C L
Sbjct: 132 VGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSADCAQL 191
Query: 196 RESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSS 255
+ G +S +C + + Y DGS + G +++D + + G F GC N S
Sbjct: 192 GQEG-NGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL------GSSAVRSFQFGCSNVES 244
Query: 256 GDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDT------VN 306
G G+MGL S++++T + FSYCLP S+G++T G V
Sbjct: 245 GFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGSGTSGFVK 304
Query: 307 T--LRSA----FHKRMKKYKKAKGLE-----------DLLDTCYDLSAYETVVVPKIAIH 349
T LRS+ F+ + + G + ++D+ ++ ++
Sbjct: 305 TPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVMDSGTVITRLPPTAYSALSSA 364
Query: 350 FLGGV------------DLELDVRG--TLVVASVSQVCLGFATYPPDPNSITL------- 388
F G+ D D G ++ + SV+ V G A D + I L
Sbjct: 365 FKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIILSNCLAFA 424
Query: 389 -----------GNVQQRGHEVHYDVAGRRLGFGPGNC 414
GNVQQR EV YDV +GF G C
Sbjct: 425 GNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 461
>gi|125595855|gb|EAZ35635.1| hypothetical protein OsJ_19925 [Oryza sativa Japonica Group]
Length = 335
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 88/276 (31%), Positives = 130/276 (47%), Gaps = 29/276 (10%)
Query: 146 SLLLDTGSDVTWTQCKPC--IHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGN 203
++++D+GSDV W QC+PC + C QRDP F + S T+ +PC+S +C L + G
Sbjct: 82 TVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLGP-YRRGC 140
Query: 204 CNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD--KSGA 261
+ +C F I YA+G+ + G +++D +T+ Y FL GC + G
Sbjct: 141 LANSQCQFGITYANGATATGTYSSDDLTLGP-----YDVVRGFLFGCAHADQGSTFSYDV 195
Query: 262 SGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDTVNTLRSAFHKRMKK 318
+G + L S + +T + Y FSYC+P S G+I FG L F
Sbjct: 196 AGTLALGGGSQSFVQQTASQYSRVFSYCVPPSTSSFGFIMFGVPPQRAALVPTFVS---- 251
Query: 319 YKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFAT 378
LL + + ++ +P IA+ F GG + LD G L+ Q CL FA
Sbjct: 252 -------TPLLSSSTMSPTFYSITLPSIALVFDGGATVNLDAAGILL-----QGCLAFAP 299
Query: 379 YPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
D +GNVQQR EV YDV G+ + F C
Sbjct: 300 TASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 335
>gi|218200805|gb|EEC83232.1| hypothetical protein OsI_28526 [Oryza sativa Indica Group]
Length = 450
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 94/334 (28%), Positives = 150/334 (44%), Gaps = 70/334 (20%)
Query: 145 VSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI-LRESFPF-G 202
+++++DTGSD+TW QCKPC C+ QRDP F S S ++ +PCN+++C L+ + G
Sbjct: 122 LTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPG 181
Query: 203 NC----------NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC-I 251
+C S+ C +++ Y DGS S G ATD + + A+ +G F+ GC +
Sbjct: 182 SCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGASVDG------FVFGCGL 235
Query: 252 NN-------------------SSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPY- 291
+N +SGD +G+ + G S + + T + P+
Sbjct: 236 SNRGLRRPGSAASSPTASPPGTSGDAAGSLSLGGDTSSYRNATPVSYTRMIADPAQPPFY 295
Query: 292 --------------------------GSTGYITFGKTDTVNTLRSAFHKRM--KKYKKAK 323
S IT +R+ F ++ ++Y A
Sbjct: 296 FMNVTGASVGGAAVAAAGLGAANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAP 355
Query: 324 GLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASV--SQVCLGFATYPP 381
LLD CY+L+ ++ V VP + + G D+ +D G L +A SQVCL A+
Sbjct: 356 PFS-LLDACYNLTGHDEVKVPLLTLRLEAGADMTVDAAGMLFMARKDGSQVCLAMASLSF 414
Query: 382 DPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ + +GN QQ+ V YD G RLGF +CS
Sbjct: 415 EDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 448
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 105/349 (30%), Positives = 158/349 (45%), Gaps = 75/349 (21%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V +G P + + ++LDTGSDVTW QC+PC C+ Q DP + S S ++ + C+S
Sbjct: 162 EYFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADCYAQSDPVYDPSVSTSYATVGCDSP 221
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA------------NSN 238
CR L + + S C + + Y DGS + G +AT+ +T+ ++ ++
Sbjct: 222 RCRDLDAAACRNSTGS--CLYEVAYGDGSYTVGDFATETLTLGDSAPVSNVAIGCGHDNE 279
Query: 239 GYF----------------------TRYPFLLGCINNSS------GDKSGASGIMGLDRS 270
G F T + + L ++ S GD + L RS
Sbjct: 280 GLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDRDSPSSSTLQFGDSEQPAVTAPLIRS 339
Query: 271 PVSIITRTNTSYF-----------SYCLPSPY------GSTGYI-------TFGKTDTVN 306
P RTNT Y+ + +PS GS G I T ++
Sbjct: 340 P-----RTNTFYYVALSGISVGGEALSIPSSAFAMDDAGSGGVIVDSGTAVTRLQSGAYG 394
Query: 307 TLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV- 365
LR AF + + +A G+ L DTCYDL+ +V VP +A+ F GG +L+L + L+
Sbjct: 395 ALREAFVQGTQSLPRASGVS-LFDTCYDLAGRSSVQVPAVALWFEGGGELKLPAKNYLIP 453
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V + CL FA P SI +GNVQQ+G V +D A +GF C
Sbjct: 454 VDAAGTYCLAFAGT-SGPVSI-IGNVQQQGVRVSFDTAKNTVGFTADKC 500
>gi|222634868|gb|EEE65000.1| hypothetical protein OsJ_19937 [Oryza sativa Japonica Group]
Length = 402
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 112/409 (27%), Positives = 166/409 (40%), Gaps = 49/409 (11%)
Query: 31 HSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILR 90
H +V SSLL P A+P + + + YGPCS S L ++LR
Sbjct: 18 HYIVVETSSLLKPKAICSGLKAMPSS-NGTWVALHRPYGPCSPSPTTTSPPL--LVDMLR 74
Query: 91 QDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVV-------------- 136
D +LH RR + + + +D + +
Sbjct: 75 WD--KLHTDAIRRKATAGGDVVLEPDKPIVDVQQSDYQMQASFGIGTGGRSGSSSSSSSR 132
Query: 137 -----AIGEPKQYVSLLLDTGSDVTWTQCKPCI--HCFQQRDPFFYASKSKTFFKIPCNS 189
AI +P + +DT D+ W QC PC C+ Q++ F +S+T +PC S
Sbjct: 133 ISRPSAIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGS 192
Query: 190 TSCRILRESFPFG-NCNSKECPFNIQYADGSGSGG--FWATDRITIQEANSNGYFTRYPF 246
+C L +G C++ +C + + Y DG + G +W + N F
Sbjct: 193 AACGELGR---YGAGCSNNQCQYFVDYGDGRATSGRTWWTPSTLNPSTVVMN-------F 242
Query: 247 LLGCINNSSGDKSGA-SGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFGKTDTV 305
GC + G+ S + SG MG++ R N + + S+ IT
Sbjct: 243 RFGCSHAVRGNFSASTSGTMGIEVGG----RRLNVPPVVFAGGAVMDSSVIITQLPPTAY 298
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
LR AF M Y + G LDTCYD + +V VP +++ F GG + LD G +V
Sbjct: 299 RALRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV 358
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ CL F P D +GNVQQ+ HEV YDV G +GF G C
Sbjct: 359 -----EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 402
>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
distachyon]
Length = 836
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 77/254 (30%), Positives = 135/254 (53%), Gaps = 21/254 (8%)
Query: 60 ASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKR----- 114
A L + ++GPC+ ++ S APS E+LR D++R R P L++
Sbjct: 423 AVLRLTHRHGPCAGPSR--SASAPSFAEVLRADERRAEYIQRRMSGAKGPGGLQQFTAAS 480
Query: 115 -TEAFTFPANINDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQ--CKPCIHCFQQR 170
+++ T PANI ++ +Y + V++G P ++ +DTGSDV+W Q C+ Q+
Sbjct: 481 SSKSVTIPANIGHSIGTLQYVVTVSLGTPGVAQTVEVDTGSDVSWVQCAPCAAPACYAQK 540
Query: 171 DPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRI 230
D F +KS ++ +PC + +C L ++ G +C + + Y DGS + G + +D +
Sbjct: 541 DQLFDPAKSSSYSAVPCAADACSEL-STYGHGCAAGSQCGYVVSYGDGSNTTGVYGSDTL 599
Query: 231 TIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY----FSYC 286
T+ +A++ FL GC + +G +G G++ L R +S+ ++T+ +Y FSYC
Sbjct: 600 TLTDADAV-----TGFLFGCGHAQAGLFAGIDGLLALGRKGMSLTSQTSGAYGGGVFSYC 654
Query: 287 LPSPYGSTGYITFG 300
LP STG++T G
Sbjct: 655 LPPSPSSTGFLTLG 668
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/87 (43%), Positives = 50/87 (57%), Gaps = 7/87 (8%)
Query: 328 LLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSIT 387
+LDTCY+ + Y TV +P +++ F GG L+LD G L S CL FAT D +
Sbjct: 757 ILDTCYNFTDYGTVTLPTVSLTFSGGATLKLDAPGFL-----SSGCLAFATNSGDGDPAI 811
Query: 388 LGNVQQRGHEVHYDVAGRRLGFGPGNC 414
LGNVQQR V +D G +GF P +C
Sbjct: 812 LGNVQQRSFAVRFD--GSSVGFMPHSC 836
>gi|115466078|ref|NP_001056638.1| Os06g0121500 [Oryza sativa Japonica Group]
gi|113594678|dbj|BAF18552.1| Os06g0121500 [Oryza sativa Japonica Group]
Length = 442
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 132/300 (44%), Gaps = 35/300 (11%)
Query: 137 AIGEPKQYVSLLLDTGSDVTWTQCKPCI--HCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
AI +P + +DT D+ W QC PC C+ Q++ F +S+T +PC S +C
Sbjct: 156 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 215
Query: 195 LRESFPFG-NCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINN 253
L +G C++ +C + + Y DG + G + D +T+ + F GC +
Sbjct: 216 LGR---YGAGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPST-----VVMNFRFGCSHA 267
Query: 254 SSGDKSGASGIMGLDRSPV----SII---------------TRTNTSYFSYCLPSPYGST 294
G+ S ++ R+P+ SII R N + + S+
Sbjct: 268 VRGNFSASTSGTMFARTPLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSS 327
Query: 295 GYITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGV 354
IT LR AF M Y + G LDTCYD + +V VP +++ F GG
Sbjct: 328 VIITQLPPTAYRALRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGA 387
Query: 355 DLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ LD G +V + CL F P D +GNVQQ+ HEV YDV G +GF G C
Sbjct: 388 VVRLDAMGVMV-----EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 442
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 116/415 (27%), Positives = 179/415 (43%), Gaps = 82/415 (19%)
Query: 79 STHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTV---------A 129
ST S +++ +D++R+ +SR K T+ ++ T +
Sbjct: 51 STSPFSFSDMITKDEERVRFLHSRLTNKESVRNSATTDKLRGGPSLVSTTPLKSGLSIGS 110
Query: 130 DEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCN 188
YY+ + +G P +Y S+++DTGS ++W QC+PC I+C Q DP F S SKT+ +PC+
Sbjct: 111 GNYYVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSTSKTYKALPCS 170
Query: 189 STSCRILRE---SFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA---------- 235
S+ C L+ + P + + C + Y D S S G+ + D +T+ +
Sbjct: 171 SSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPSEAPSSGFVYG 230
Query: 236 ---NSNGYFTRYPFLLGCINNS-------------------------------SGDKS-G 260
++ G F R ++G N+ SG S G
Sbjct: 231 CGQDNQGLFGRSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAPNSSSLSGFLSIG 290
Query: 261 ASGIMG-------------------LDRSPVSIITR-TNTSYFSYCLPSPYGSTGYITFG 300
AS + LD + +++ + S SY +P+ S IT
Sbjct: 291 ASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVPTIIDSGTVITRL 350
Query: 301 KTDTVNTLRSAFHKRM-KKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELD 359
N L+ +F M KKY +A G +LDTC+ S E VP+I I F GG LEL
Sbjct: 351 PVAVYNALKKSFVLIMSKKYAQAPGFS-ILDTCFKGSVKEMSTVPEIQIIFRGGAGLELK 409
Query: 360 VRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+LV CL A +P SI +GN QQ+ +V YDVA ++GF PG C
Sbjct: 410 AHNSLVEIEKGTTCLAIAA-SSNPISI-IGNYQQQTFKVAYDVANFKIGFAPGGC 462
>gi|55296886|dbj|BAD68338.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|55296941|dbj|BAD68392.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
Length = 424
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 132/300 (44%), Gaps = 35/300 (11%)
Query: 137 AIGEPKQYVSLLLDTGSDVTWTQCKPCI--HCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
AI +P + +DT D+ W QC PC C+ Q++ F +S+T +PC S +C
Sbjct: 138 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 197
Query: 195 LRESFPFG-NCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINN 253
L +G C++ +C + + Y DG + G + D +T+ + F GC +
Sbjct: 198 LGR---YGAGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPST-----VVMNFRFGCSHA 249
Query: 254 SSGDKSGASGIMGLDRSPV----SII---------------TRTNTSYFSYCLPSPYGST 294
G+ S ++ R+P+ SII R N + + S+
Sbjct: 250 VRGNFSASTSGTMFARTPLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSS 309
Query: 295 GYITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGV 354
IT LR AF M Y + G LDTCYD + +V VP +++ F GG
Sbjct: 310 VIITQLPPTAYRALRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGA 369
Query: 355 DLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ LD G +V + CL F P D +GNVQQ+ HEV YDV G +GF G C
Sbjct: 370 VVRLDAMGVMV-----EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 424
>gi|55296937|dbj|BAD68388.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|218197467|gb|EEC79894.1| hypothetical protein OsI_21421 [Oryza sativa Indica Group]
Length = 424
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 132/300 (44%), Gaps = 35/300 (11%)
Query: 137 AIGEPKQYVSLLLDTGSDVTWTQCKPCI--HCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
AI +P + +DT D+ W QC PC C+ Q++ F +S+T +PC S +C
Sbjct: 138 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 197
Query: 195 LRESFPFG-NCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINN 253
L +G C++ +C + + Y DG + G + D +T+ + F GC +
Sbjct: 198 LGR---YGAGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPST-----VVMNFRFGCSHA 249
Query: 254 SSGDKSGASGIMGLDRSPV----SII---------------TRTNTSYFSYCLPSPYGST 294
G+ S ++ R+P+ SII R N + + S+
Sbjct: 250 VRGNFSASTSGTMFARTPLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSS 309
Query: 295 GYITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGV 354
IT LR AF M Y + G LDTCYD + +V VP +++ F GG
Sbjct: 310 VIITQLPPTAYRALRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGA 369
Query: 355 DLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ LD G +V + CL F P D +GNVQQ+ HEV YDV G +GF G C
Sbjct: 370 VVRLDAMGVMV-----EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 424
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 96/352 (27%), Positives = 145/352 (41%), Gaps = 76/352 (21%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCI---HCFQQRDPFFYASKSKTFFKIPC 187
E+ + V +G P Q +L+ DTGSD++W QC+PC HC Q+DP F SKS T+ + C
Sbjct: 143 EFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHC 202
Query: 188 NSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C + + ++ C + ++Y DGS + G + D + + + + T +PF
Sbjct: 203 GEPQCAAAGD---LCSEDNTTCLYLVRYGDGSSTTGVLSRDTLALTSSRA---LTGFPFG 256
Query: 248 LG-------------------------------------CI---NNSSG---------DK 258
G C+ N+++G
Sbjct: 257 CGTRNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNSTTGYLTIGATPATD 316
Query: 259 SGASGIMGLDRSP----VSIITRTNTSYFSYCLPSP----------YGSTGYITFGKTDT 304
+GA+ + R P + + Y LP P S +T+
Sbjct: 317 TGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFTRGGTLLDSGTVLTYLPAQA 376
Query: 305 VNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL 364
LR F M++Y A D+LD CYD + VVVP ++ F G ELD G +
Sbjct: 377 YALLRDRFRLTMERYTPAPP-NDVLDACYDFAGESEVVVPAVSFRFGDGAVFELDFFGVM 435
Query: 365 VVASVSQVCLGFATYPPD--PNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ + CL FA P SI +GN QQR EV YDVA ++GF P +C
Sbjct: 436 IFLDENVGCLAFAAMDTGGLPLSI-IGNTQQRSAEVIYDVAAEKIGFVPASC 486
>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 79/248 (31%), Positives = 129/248 (52%), Gaps = 25/248 (10%)
Query: 84 SLEEILRQDQQRLHLKNSRRLRKP--FPEFLKRTEAFTFPANINDTV-------ADEYYI 134
S ++L D R+ NSR RK FP+ + + FP +++ + + YY+
Sbjct: 61 SFSDVLAWDDARVKTLNSRLTRKDTRFPKSVLTKKDIRFPKSVSVPLNPGASIGSGNYYV 120
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNSTSCR 193
V G P +Y S+++DTGS ++W QCKPC ++C Q DP F S SKT+ + C S+ C
Sbjct: 121 KVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSSQCS 180
Query: 194 ILRESF---PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
L ++ P +S C + Y D S S G+ + D +T+ + T F+ GC
Sbjct: 181 SLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQ-----TLPGFVYGC 235
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDTVNT 307
+S G A+GI+GL R+ +S++ + ++ + FSYCLP+ G G+++ GK
Sbjct: 236 GQDSDGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPT-RGGGGFLSIGKASLAG- 293
Query: 308 LRSAFHKR 315
SA++ R
Sbjct: 294 --SAYNGR 299
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 99/350 (28%), Positives = 155/350 (44%), Gaps = 77/350 (22%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNS 189
Y + +G P ++++DTGS +TW QC PC + C +Q P + S T+ +PC++
Sbjct: 133 NYVTELGLGTPATSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLYDPRASSTYATVPCSA 192
Query: 190 TSCRILRESF--PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-F 246
+ C L+ + P C + Y D S S G+ + D ++ + YP F
Sbjct: 193 SQCDELQAATLNPSACSVRNVCIYQASYGDSSFSVGYLSRDTVSFGSGS-------YPNF 245
Query: 247 LLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFG--- 300
GC ++ G ++G++GL R+ +S++ + S FSYCLP+P STGY++ G
Sbjct: 246 YYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTP-ASTGYLSIGPYT 304
Query: 301 -----------------------------------------------KTDTVNT-LRSAF 312
+ TV T L +A
Sbjct: 305 SGHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAVSPAEYSSLPTIIDSGTVITRLPTAV 364
Query: 313 HKRMKKYKKAK--GLE-----DLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+ + K A G++ +LDTC+ A + + VP +A+ F GG L+L + L+
Sbjct: 365 YTALSKAVAAAMVGVQSAPAFSILDTCFQGQASQ-LRVPAVAMAFAGGATLKLATQNVLI 423
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
S CL FA P D +I +GN QQ+ V YDVA R+GF G CS
Sbjct: 424 DVDDSTTCLAFA--PTDSTTI-IGNTQQQTFSVVYDVAQSRIGFAAGGCS 470
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 84/271 (30%), Positives = 136/271 (50%), Gaps = 16/271 (5%)
Query: 34 IVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQ 93
++S+ S +VC++++ A++ + ++GPCS L + P+LEE L +DQ
Sbjct: 102 VLSMGSPRTDSVCSQSKAVPSSSAGAATVPLHHRHGPCSPLP---TKKMPTLEETLHRDQ 158
Query: 94 QRL-HLKNSRRLRKPFPEFLKRTEAFTFPANINDTVAD-EYYIVVAIGEPKQYVSLLLDT 151
R +++ ++R++A T P + ++ EY I V +G P ++L+DT
Sbjct: 159 LRAAYIQRKFSGGGGAGGDVQRSDA-TVPTALGTSLNTLEYLITVGLGSPATSQTMLIDT 217
Query: 152 GSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPF 211
GSDV+W QCKPC C Q DP F S S T+ C S C L + G +S +C +
Sbjct: 218 GSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSADCAQLGQE-GNGCSSSSQCQY 276
Query: 212 NIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSP 271
+ Y DGS + G +++D + + G F GC N SG G+MGL
Sbjct: 277 IVTYGDGSSTTGTYSSDTLAL------GSSAVRSFQFGCSNVESGFNDQTDGLMGLGGGA 330
Query: 272 VSIITRTNTSY---FSYCLPSPYGSTGYITF 299
S++++T + FSYCLP S+G++T
Sbjct: 331 QSLVSQTAGTLGRAFSYCLPPTPSSSGFLTL 361
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/109 (39%), Positives = 58/109 (53%), Gaps = 6/109 (5%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+ L SAF MK+Y A+ +LDTC+D S +V +P +A+ F GG + LD G ++
Sbjct: 429 SALSSAFKAGMKQYPPAQP-SGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL 487
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
CL FA D + +GNVQQR EV YDV +GF G C
Sbjct: 488 -----SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 531
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 76/234 (32%), Positives = 124/234 (52%), Gaps = 22/234 (9%)
Query: 84 SLEEILRQDQQRLHLKNSRRLRKP--FPEFLKRTEAFTFPANINDTV-------ADEYYI 134
S ++L D R+ NSR RK FP+ + + FP +++ + + YY+
Sbjct: 61 SFSDVLAWDDARVKTLNSRLTRKDTRFPKSVLTKKDIRFPKSVSVPLNPGASIGSGNYYV 120
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNSTSCR 193
V G P +Y S+++DTGS ++W QCKPC ++C Q DP F S SKT+ + C S+ C
Sbjct: 121 KVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSSQCS 180
Query: 194 ILRESF---PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
L ++ P +S C + Y D S S G+ + D +T+ + T F+ GC
Sbjct: 181 SLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQ-----TLPGFVYGC 235
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGK 301
+S G A+GI+GL R+ +S++ + ++ + FSYCLP+ G G+++ GK
Sbjct: 236 GQDSDGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPT-RGGGGFLSIGK 288
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 101/351 (28%), Positives = 155/351 (44%), Gaps = 79/351 (22%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V +G P + + ++LDTGSDVTW QC+PC C+QQ DP F S S ++ + C++
Sbjct: 166 EYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSTSYASVACDNP 225
Query: 191 SCRILRESFPFGNC--NSKECPFNIQYADGSGSGGFWATDRITIQEA------------N 236
C L + C ++ C + + Y DGS + G +AT+ +T+ ++ +
Sbjct: 226 RCHDLDAA----ACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDSAPVSSVAIGCGHD 281
Query: 237 SNGYF----------------------TRYPFLLGCINNSS------GDKSGASGIMGLD 268
+ G F T + + L ++ S GD + A L
Sbjct: 282 NEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDRDSPSSSTLQFGDAADAEVTAPLI 341
Query: 269 RSPVSIITRTNTSYFS------------YCLPSPYG--STGY----------ITFGKTDT 304
RSP RT+T Y+ PS + STG +T ++
Sbjct: 342 RSP-----RTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIVDSGTAVTRLQSSA 396
Query: 305 VNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL 364
LR AF + + + G+ L DTCYDLS +V VP +++ F GG +L L + L
Sbjct: 397 YAALRDAFVRGTQSLPRTSGVS-LFDTCYDLSDRTSVEVPAVSLRFAGGGELRLPAKNYL 455
Query: 365 V-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ V CL FA P + +GNVQQ+G V +D A +GF C
Sbjct: 456 IPVDGAGTYCLAFA--PTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTTNKC 504
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 99/351 (28%), Positives = 153/351 (43%), Gaps = 79/351 (22%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V +G P + + ++LDTGSDVTW QC+PC C+QQ DP F S S ++ + C++
Sbjct: 162 EYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSTSYASVACDNP 221
Query: 191 SCRILRESFPFGNC--NSKECPFNIQYADGSGSGGFWATDRITIQEA------------N 236
C L + C ++ C + + Y DGS + G +AT+ +T+ ++ +
Sbjct: 222 RCHDLDAA----ACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDSAPVSSVAIGCGHD 277
Query: 237 SNGYF----------------------TRYPFLLGCINNSS------GDKSGASGIMGLD 268
+ G F T + + L ++ S GD + A L
Sbjct: 278 NEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDRDSPSSSTLQFGDAADAEVTAPLI 337
Query: 269 RSPVSIITRTNTSYFS------------YCLPSPYGSTG------------YITFGKTDT 304
RSP RT+T Y+ PS + G +T ++
Sbjct: 338 RSP-----RTSTFYYVGLSGISVGGQILSIPPSAFAMDGTGAGGVIVDSGTAVTRLQSSA 392
Query: 305 VNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL 364
LR AF + + + G+ L DTCYDLS +V VP +++ F GG +L L + L
Sbjct: 393 YAALRDAFVRGTQSLPRTSGVS-LFDTCYDLSDRTSVEVPAVSLRFAGGGELRLPAKNYL 451
Query: 365 V-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ V CL FA P + +GNVQQ+G V +D A +GF C
Sbjct: 452 IPVDGAGTYCLAFA--PTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTSNKC 500
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 94/279 (33%), Positives = 138/279 (49%), Gaps = 44/279 (15%)
Query: 33 HIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQD 92
H VSSLLP N C+ + QG L + KYGPCS + PS +EI +D
Sbjct: 76 HSTPVSSLLPKNKCSASARGGSQG-----LPITQKYGPCSGSGH---SQPPSPQEIFGRD 127
Query: 93 QQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADE---YYIVVAIGEPKQYVSLLL 149
+ R+ NS+ + PE LK N+ + DE + + VA G P Q +L+L
Sbjct: 128 ESRVSFINSK-FNQYAPENLKDHTP-------NNKLFDEDGNFLVDVAFGTPPQKFTLIL 179
Query: 150 DTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKEC 209
DTGS +TWTQCKPC+ C + F S S T+ C ++ GN
Sbjct: 180 DTGSSITWTQCKPCVRCLKASRRHFDPSASLTYSLGSCIPST---------VGNT----- 225
Query: 210 PFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD-KSGASGIMGLD 268
+N+ Y D S S G + D +T++ ++ F ++ F GC N+ GD SGA G++GL
Sbjct: 226 -YNMTYGDKSTSVGNYGCDTMTLEHSD---VFPKFQF--GCGRNNEGDFGSGADGMLGLG 279
Query: 269 RSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDT 304
+ +S +++T + + FSYCLP S G + FG+ T
Sbjct: 280 QGQLSTVSQTASKFKKVFSYCLPEE-DSIGSLLFGEKAT 317
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 48/83 (57%), Gaps = 5/83 (6%)
Query: 320 KKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATY 379
++ KG D+LDTCY+LS + V++P+I +HF G D+ L+ + + S++CL FA
Sbjct: 405 RRKKG--DILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKRVIWGNDASRLCLAFAG- 461
Query: 380 PPDPNSITLGNVQQRGHEVHYDV 402
+ +GN QQ V YD+
Sbjct: 462 --NSELTIIGNRQQVSLTVLYDI 482
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 114/413 (27%), Positives = 179/413 (43%), Gaps = 80/413 (19%)
Query: 79 STHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTV-------ADE 131
ST S +++ +D++R+ +SR K T+ P+ ++ + +
Sbjct: 47 STSPFSFSDMITKDEERVRFLHSRLTNKESASNSATTDKLGGPSLVSTPLKSGLSIGSGN 106
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNST 190
YY+ + +G P +Y S+++DTGS ++W QC+PC I+C Q DP F S SKT+ + C+S+
Sbjct: 107 YYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSVSKTYKALSCSSS 166
Query: 191 SCRILRE---SFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA------------ 235
C L+ + P + + C + Y D S S G+ + D +T+ +
Sbjct: 167 QCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPSAAPSSGFVYGCG 226
Query: 236 -NSNGYFTRYPFLLGCINN------------------------SSGDKSGASGIMGLDR- 269
++ G F R ++G N+ S+ S SG + +
Sbjct: 227 QDNQGLFGRSAGIIGLANDKLSMLGQLSNKYGNAFSYCLPSSFSAQPNSSVSGFLSIGAS 286
Query: 270 ---------SPVSIITRTNTSYF------------------SYCLPSPYGSTGYITFGKT 302
+P+ + + YF SY +P+ S IT
Sbjct: 287 SLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSYNVPTIIDSGTVITRLPV 346
Query: 303 DTVNTLRSAFHKRM-KKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVR 361
N L+ +F M KKY +A G +LDTC+ S E VP+I I F GG LEL V
Sbjct: 347 AIYNALKKSFVMIMSKKYAQAPGFS-ILDTCFKGSVKEMSTVPEIRIIFRGGAGLELKVH 405
Query: 362 GTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+LV CL A +P SI +GN QQ+ V YDVA ++GF PG C
Sbjct: 406 NSLVEIEKGTTCLAIAA-SSNPISI-IGNYQQQTFTVAYDVANSKIGFAPGGC 456
>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 324
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 98/336 (29%), Positives = 144/336 (42%), Gaps = 84/336 (25%)
Query: 149 LDTGSDVTWTQCKPCI---HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCN 205
+DTGSD++W QCKPC C+ Q+DP F ++S ++ +PC C L + C+
Sbjct: 3 VDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGI-YAASACS 61
Query: 206 SKECPFNIQYADGSGSGGFWATDRITIQEANS-NGYFTRYPFLLGCINNSSGDKSGASGI 264
+ +C + + Y DGS + G +++D +T+ +++ G+F GC + SG +G G+
Sbjct: 62 AAQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFF------FGCGHAQSGLFNGVDGL 115
Query: 265 MGLDRSPVSIITRTNTSY---FSYC------------------------------LPSPY 291
+GL R S++ +T +Y FSYC LPSP
Sbjct: 116 LGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPN 175
Query: 292 GSTGYITFGKTDTVN--------------------------------TLRSAFHKRMKKY 319
T Y+ +V LRSAF M Y
Sbjct: 176 APTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPTAYAALRSAFRSGMASY 235
Query: 320 KKAKGLED-LLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFAT 378
+ +LDTCY+ + Y TV +P +A+ F G + L G L S CL FA
Sbjct: 236 GYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL-----SFGCLAFAP 290
Query: 379 YPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
D LGNVQQR EV D G +GF P +C
Sbjct: 291 SGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 324
>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 460
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 89/281 (31%), Positives = 137/281 (48%), Gaps = 48/281 (17%)
Query: 33 HIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQD 92
H VSSLLP N C+ + QG L + KYGPCS + PS +EI +D
Sbjct: 42 HSTPVSSLLPKNKCSASARGGSQG-----LPITQKYGPCSGSGH---SQPPSPQEIFGRD 93
Query: 93 QQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADE---YYIVVAIGEPKQYVSLLL 149
+ R+ NS+ + N+ + DE + + VA G P + L+L
Sbjct: 94 ESRVSFINSKCNQYTSGNLKNHAH--------NNNLFDEDGNFLVDVAFGTPXTEIXLIL 145
Query: 150 DTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNC--NSK 207
DTGS +TWTQCK C++C Q + +F +S S T + FG+C ++
Sbjct: 146 DTGSSITWTQCKACVNCLQDSNRYFDSSASST-----------------YSFGSCIPSTV 188
Query: 208 ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD-KSGASGIMG 266
E +N+ Y D S S G + D +T++ ++ F ++ F GC N+ GD SG G++G
Sbjct: 189 ENNYNMTYGDDSTSVGNYGCDTMTLEPSD---VFQKFQF--GCGRNNKGDFGSGVDGMLG 243
Query: 267 LDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDT 304
L + +S +++T + + FSYCLP S G + FG+ T
Sbjct: 244 LGQGQLSTVSQTASKFNKVFSYCLPEE-DSIGSLLFGEKAT 283
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 39/96 (40%), Positives = 57/96 (59%), Gaps = 5/96 (5%)
Query: 320 KKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATY 379
++ KG D+LDTCY+LS + V++P+I +HF GG D+ L+ + + S++CL FA
Sbjct: 369 RRKKG--DILDTCYNLSGRKDVLLPEIVLHFGGGADVRLNGTNIVWGSDASRLCLAFAG- 425
Query: 380 PPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+GN QQ V YD+ GRR+GFG CS
Sbjct: 426 --TSELTIIGNRQQLSLTVLYDIQGRRIGFGGNGCS 459
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 106/401 (26%), Positives = 164/401 (40%), Gaps = 74/401 (18%)
Query: 81 HAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANI---NDTVADEYYIVVA 137
H L +R+D R+ R K P R E F ++I D + EY++ +
Sbjct: 77 HHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGSDIVSGMDQGSGEYFVRIG 136
Query: 138 IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRE 197
+G P + +++D+GSD+ W QC+PC C++Q DP F +KS ++ + C S+ C +
Sbjct: 137 VGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIEN 196
Query: 198 SFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQE------ANSNGYFTRYPFL---- 247
S C+S C + + Y DGS + G A + +T + A G+ R F+
Sbjct: 197 S----GCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMFIGAAG 252
Query: 248 --------LGCINNSSGDKSGASG--------------IMGLDRSPVS-----IITRTNT 280
+ + SG GA G + G + PV ++
Sbjct: 253 LLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVFGREALPVGASWVPLVRNPRA 312
Query: 281 SYFSYC-----------LPSPYG-------STGYITFGKTDTVNTLRSA--------FHK 314
F Y +P P G G + V L +A F
Sbjct: 313 PSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKS 372
Query: 315 RMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-VASVSQVC 373
+ +A G+ + DTCYDLS + +V VP ++ +F G L L R L+ V C
Sbjct: 373 QTANLPRASGVS-IFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYC 431
Query: 374 LGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
FA P + I GN+QQ G +V +D A +GFGP C
Sbjct: 432 FAFAASPTGLSII--GNIQQEGIQVSFDGANGFVGFGPNVC 470
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 71/200 (35%), Positives = 108/200 (54%), Gaps = 10/200 (5%)
Query: 117 AFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYA 176
FT P EYY+ + +G P V L++DTGSDV+W QC PC C P F
Sbjct: 124 GFTSPVVTLGQAGLEYYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNP 183
Query: 177 SKSKTFFKIPCNSTSCRILRESF-PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA 235
S +FFK+PC S++C + + PF + + + C F+IQY DGS S G A + I
Sbjct: 184 RHSSSFFKLPCASSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTP 243
Query: 236 N-SNGYFTRYP-FLLGCIN-NSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
N +G + LGC + + G +GASG++G+DR P+S ++ ++ Y FS+C P
Sbjct: 244 NFGDGEPVKLSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPD 303
Query: 290 PYG---STGYITFGKTDTVN 306
S+G + FG++D ++
Sbjct: 304 KIAHLNSSGLVFFGESDIIS 323
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 57/126 (45%), Gaps = 10/126 (7%)
Query: 298 TFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLS----AYETVVVPKIAIHFLGG 353
T+ K +R F R K CY+++ A E+ ++P I +HF GG
Sbjct: 386 TYLKKPAFQAMRREFLARTSHLAKVDD-NSGFTPCYNITSGTAALESTILPSITLHFRGG 444
Query: 354 VDLELDVRGTLVVASVSQ----VCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGF 409
+D+ L L+ S S+ +CL F P +I +GN QQ+ V YD+ RLG
Sbjct: 445 LDVVLPKNSILIPVSSSEEQTTLCLAFLMSGDIPFNI-IGNYQQQNLWVEYDLEKLRLGI 503
Query: 410 GPGNCS 415
P C+
Sbjct: 504 APAQCA 509
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 71/200 (35%), Positives = 108/200 (54%), Gaps = 10/200 (5%)
Query: 117 AFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYA 176
FT P EYY+ + +G P V L++DTGSDV+W QC PC C P F
Sbjct: 123 GFTSPVVTLGQAGLEYYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNP 182
Query: 177 SKSKTFFKIPCNSTSCRILRESF-PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA 235
S +FFK+PC S++C + + PF + + + C F+IQY DGS S G A + I
Sbjct: 183 RHSSSFFKLPCASSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTP 242
Query: 236 N-SNGYFTRYP-FLLGCIN-NSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
N +G + LGC + + G +GASG++G+DR P+S ++ ++ Y FS+C P
Sbjct: 243 NFGDGEPVKLSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPD 302
Query: 290 PYG---STGYITFGKTDTVN 306
S+G + FG++D ++
Sbjct: 303 KIAHLNSSGLVFFGESDIIS 322
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 57/126 (45%), Gaps = 10/126 (7%)
Query: 298 TFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDL----SAYETVVVPKIAIHFLGG 353
T+ K +R F R K CY++ +A E+ ++P I +HF GG
Sbjct: 385 TYLKKPAFQAMRREFLARTSHLAKVDD-NSGFTPCYNITSGTAALESTILPSITLHFRGG 443
Query: 354 VDLELDVRGTLVVASVSQ----VCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGF 409
+D+ L L+ S S+ +CL F P +I +GN QQ+ V YD+ RLG
Sbjct: 444 LDVVLPKNSILIPVSSSEEQTTLCLAFQMSGDIPFNI-IGNYQQQNLWVEYDLEKLRLGI 502
Query: 410 GPGNCS 415
P C+
Sbjct: 503 APAQCA 508
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 105/393 (26%), Positives = 165/393 (41%), Gaps = 86/393 (21%)
Query: 98 LKNSRRLRKPFPEFLKRTEAFTFPANINDTV---------ADEYYIVVAIGEPKQYVSLL 148
L + RL F L R+ A A + V + EY + V+IG P +
Sbjct: 49 LSHYDRLANAFRRSLSRSAALLNRAATSGAVGLQSSIGPGSGEYLMSVSIGTPPVDYLGI 108
Query: 149 LDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKE 208
DTGSD+TW QC PC+ C+QQ P F KS +F +PCN+ +C + + G+C +
Sbjct: 109 ADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDD----GHCGVQG 164
Query: 209 -CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGL 267
C ++ Y D + S G ++ITI ++ ++GC + SSG ASG++GL
Sbjct: 165 VCDYSYTYGDRTYSKGDLGFEKITIGSSSVKS-------VIGCGHASSGGFGFASGVIGL 217
Query: 268 DRSPVSIITRTNTS-----YFSYCLPSPYG-STGYITFGKTDTV---------------- 305
+S++++ + + FSYCLP+ + G I FG+ V
Sbjct: 218 GGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGENAVVSGPGVVSTPLISKNTV 277
Query: 306 -------------NTLRSAFHKR-------------------------MKKYKKAKGLED 327
N AF K+ + K KAK ++D
Sbjct: 278 TYYYITLEAISIGNERHMAFAKQGNVIIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKD 337
Query: 328 ---LLDTCYD--LSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPD 382
LD C+D ++A ++ +P I HF GG ++ L T + + CL P
Sbjct: 338 PHGSLDLCFDDGINAAASLGIPVITAHFSGGANVNLLPINTFRKVADNVNCLTLKAASPT 397
Query: 383 PNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+GN+ Q + YD+ +RL F P C+
Sbjct: 398 TEFGIIGNLAQANFLIGYDLEAKRLSFKPTVCA 430
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 113/433 (26%), Positives = 184/433 (42%), Gaps = 70/433 (16%)
Query: 45 VCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRL-HLKNSRR 103
VC+ R A+ ++ + ++GPCS + S P+ EE+L++DQ R H++
Sbjct: 38 VCSE-RNAISSSLSGTTVALNHRHGPCSPVPS--SKKRPTEEELLKRDQLRAEHIQRKFA 94
Query: 104 LRKPFP---EFLKRTEAFTFPANINDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQ 159
+ + + + + P + ++ EY I V +G P ++ +DTGSDV+W Q
Sbjct: 95 MNAAVDGAGDLQQSKVSSSVPTKLGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQ 154
Query: 160 CKPCIH--CFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYAD 217
C PC + C+ Q F +KS T+ + C + C L + + EC + +QY D
Sbjct: 155 CNPCPNPPCYAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYGD 214
Query: 218 GSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITR 277
GS + G ++ D +T+ A+ F GC + SG G+MGL S++++
Sbjct: 215 GSTTNGTYSRDTLTLSGASD----AVKGFQFGCSHVESGFSDQTDGLMGLGGGAQSLVSQ 270
Query: 278 TNTSY---FSYCLPSPYGSTGYITFG--------------KTDTVNTLRSAFHKRMKKYK 320
T +Y FSYCLP GS+G++T G ++ + T A + +
Sbjct: 271 TAAAYGNSFSYCLPPTSGSSGFLTLGGGGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGG 330
Query: 321 KAKGLE-------DLLDTCYDLSAYETVVVPKIAIHFLGGV------------DLELDVR 361
K GL ++D+ ++ ++ F G+ D D
Sbjct: 331 KQLGLSPSVFAAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFA 390
Query: 362 G--TLVVASVSQVCLGFATYPPDPNSIT------------------LGNVQQRGHEVHYD 401
G + + +V+ V G A DPN I +GNVQQR EV YD
Sbjct: 391 GQTQISIPTVALVFSGGAAIDLDPNGIMYGNCLAFAATGDDGTTGIIGNVQQRTFEVLYD 450
Query: 402 VAGRRLGFGPGNC 414
V LGF G C
Sbjct: 451 VGSSTLGFRSGAC 463
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 116/433 (26%), Positives = 184/433 (42%), Gaps = 70/433 (16%)
Query: 45 VCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRL-HLKNSRR 103
VC+ R A+ ++ + ++GPCS + S P+ EE+L++DQ R H++
Sbjct: 38 VCSE-RNAISSSLSGTTVALNHRHGPCSPVPS--SKKRPTEEELLKRDQLRAEHIQRKFA 94
Query: 104 LRKPFP---EFLKRTEAFTFPANINDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQ 159
+ + + + + P + ++ EY I V +G P ++ +DTGSDV+W Q
Sbjct: 95 MNAAVDGAGDLQQSKVSSSVPTKLGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQ 154
Query: 160 CKPCIH--CFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYAD 217
C PC + C Q F +KS T+ + C + C L + + EC + +QY D
Sbjct: 155 CNPCPNPPCHAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYGD 214
Query: 218 GSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITR 277
GS + G ++ D +T+ A+ F GC + SG G+MGL S++++
Sbjct: 215 GSTTNGTYSRDTLTLSGASD----AVKGFQFGCSHLESGFSDQTDGLMGLGGGAQSLVSQ 270
Query: 278 TNTSY---FSYCLPSPYGSTGYITFGKTD------TVNTLRSA-----FHKRMKKYK--- 320
T +Y FSYCLP GS+G++T G T LRS + R++
Sbjct: 271 TAAAYGNSFSYCLPPTSGSSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAVGG 330
Query: 321 KAKGLE-------DLLDTCYDLSAYETVVVPKIAIHFLGGV------------DLELDVR 361
K GL ++D+ ++ ++ F G+ D D
Sbjct: 331 KQLGLSPSVFAAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFA 390
Query: 362 G--TLVVASVSQVCLGFATYPPDPNSIT------------------LGNVQQRGHEVHYD 401
G + + +V+ V G A DPN I +GNVQQR EV YD
Sbjct: 391 GQTQISIPTVALVFSGGAAIDLDPNGIMYGNCLAFAATGDDGTTGIIGNVQQRTFEVLYD 450
Query: 402 VAGRRLGFGPGNC 414
V LGF G C
Sbjct: 451 VGSSTLGFRSGAC 463
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 127/468 (27%), Positives = 201/468 (42%), Gaps = 80/468 (17%)
Query: 11 FICLLCSSNNGAYADDNDLSHSHIVSVSSLLPPN-VCNRTRTALPQGPDKASLEVVSKYG 69
F+ L S + A D ++SV SL+ + C+ + P ++ + +Y
Sbjct: 7 FLLALLFSYHTLIAHAADDRRHKVLSVGSLMKSSTACSEPKVTPPS--TGVTVPLHHRYD 64
Query: 70 PCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEF--LKRTEAFTFPANINDT 127
PCS + S P+LEE LR+DQ R + +++ F +++++A T P + +
Sbjct: 65 PCSPVP---SKKVPTLEERLRRDQLR-----AAYIKRKFSGAGDIEQSDAATVPTTLGTS 116
Query: 128 VAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIP 186
++ EY I V IG P ++ +DTGSDV+W QCKPC C + D F S S T+
Sbjct: 117 LSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSSSSTYSPFS 176
Query: 187 CNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPF 246
C+S C L +S C S +C + + Y D S + G +++D +T+ G F
Sbjct: 177 CSSAPCAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTGTYSSDTLTL------GSSAMTDF 230
Query: 247 LLGCINNSSGD-KSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFG-- 300
GC + SG G+MGL S+ ++T ++ FSYCLP GS+G++T G
Sbjct: 231 QFGCSQSESGGFNDQTDGLMGLGGGAQSLASQTAGTFGTAFSYCLPPTSGSSGFLTLGTG 290
Query: 301 ----------KTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVV--VPKIAI 348
++ + T + +K + L + + L T++ +P A
Sbjct: 291 SSGFVKTPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSVFSAGSLMDSGTIITRLPPTAY 350
Query: 349 HFL-----------------GGVDLELDVRG---------TLVVASVSQVCLGF------ 376
L G +D D G TLV + + V L F
Sbjct: 351 SALSSAFKAGMQQYPPATPSGILDTCFDFSGQSSISIPTVTLVFSGGAAVDLAFDGIMLE 410
Query: 377 -------ATYPPDPNSITL---GNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ P+ + +L GNVQQR EV YDV G +GF G C
Sbjct: 411 ISSSIRCLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 458
>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
Length = 293
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 82/238 (34%), Positives = 126/238 (52%), Gaps = 28/238 (11%)
Query: 59 KASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAF 118
K+SL VV +G CS L+ +EILR+D+ R+ +S+ L K + + + ++
Sbjct: 62 KSSLRVVHMHGACSHLSSNKDARLDH-DEILRRDEARVESIHSK-LSKNIADEVSKAKST 119
Query: 119 TFPANINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCI-HCFQQRDPFFYA 176
PA + YIV + IG PK +SL+ DTGSD+TWTQC+PC+ C+ Q++P F
Sbjct: 120 KLPAKNGIILGSPNYIVTIGIGTPKHDISLMFDTGSDLTWTQCEPCLGSCYSQKEPKFNP 179
Query: 177 SKSKTFFKIPCNSTSCRILRESFPFGN---CNSKECPFNIQYADGSGSGGFWATDRITIQ 233
S S ++ + C+S C GN C++ C + I Y DGS + GF A ++ T+
Sbjct: 180 SSSSSYHNVSCSSPMC---------GNPESCSASNCLYGIGYGDGSVTVGFLAKEKFTLT 230
Query: 234 EAN--SNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYC 286
++ + YF GC N+ G G++GI+GL S +T T+Y FSYC
Sbjct: 231 NSDVLDDIYF-------GCGENNKGVFIGSAGILGLGPGKFSFPLQTTTTYNNIFSYC 281
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 101/406 (24%), Positives = 169/406 (41%), Gaps = 58/406 (14%)
Query: 55 QGPDKASLEVVSKYGPCSRLNQGIST-HAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLK 113
+G +K ++VV + +L+ G S H L+ L++D +R+ RRL +
Sbjct: 128 EGGEKWMMKVVHR----DQLSFGNSDDHRHRLDGRLKRDAKRVA-SLIRRLSSGGGGSYR 182
Query: 114 RTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPF 173
+ T + + + EY++ + +G P + +++D+GSD+ W QC+PC C+ Q DP
Sbjct: 183 VDDFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPV 242
Query: 174 FYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQ 233
F + S +F + C+S+ C L + C++ C + + Y DGS + G A + +T
Sbjct: 243 FDPADSASFTGVSCSSSVCDRLENA----GCHAGRCRYEVSYGDGSYTKGTLALETLTFG 298
Query: 234 E------ANSNGYFTRYPFL------------LGCINNSSGDKSGASGIMGLDRSPVSII 275
A G+ R F+ + + G GA + + V ++
Sbjct: 299 RTMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSAAWVPLV 358
Query: 276 TRTNTSYFSYCLPSPYGSTGY--------------------------ITFGKTDTVNTLR 309
F Y + G G +T T R
Sbjct: 359 RNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFR 418
Query: 310 SAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-VAS 368
AF + +A G+ + DTCYDL + +V VP ++ +F GG L L R L+ +
Sbjct: 419 DAFLAQTANLPRATGVA-IFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDD 477
Query: 369 VSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
C FA P LGN+QQ G ++ +D A +GFGP C
Sbjct: 478 AGTFCFAFA--PSTSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 521
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 72/220 (32%), Positives = 121/220 (55%), Gaps = 19/220 (8%)
Query: 96 LHLKNSR-RLRKPFPEFLKRTEAFTFP--ANINDTVADEYYIVVAIGEPKQYVSLLLDTG 152
LH+++ + RLRK P + +N + Y + + +G Q +++++DTG
Sbjct: 107 LHVRSMQNRLRKMVSSHSVEVSQIQIPLASGVNFQTLN-YIVTMELG--GQDMTVIIDTG 163
Query: 153 SDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILR-ESFPFGNC--NSKEC 209
SD+TW QC+PC+ C+ Q+ P F S S ++ IPCNS++C+ L+ + G C N C
Sbjct: 164 SDLTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNC 223
Query: 210 PFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDR 269
+ + Y DGS + G + ++ G + F+ GC N+ G G SG+MGL R
Sbjct: 224 SYAVNYGDGSYTNGELGAEHLSF------GGISVSNFVFGCGKNNKGLFGGVSGLMGLGR 277
Query: 270 SPVSIITRTNTSY---FSYCL-PSPYGSTGYITFGKTDTV 305
S +S+I++TN+++ FSYCL P+ G++G + G +V
Sbjct: 278 SNLSLISQTNSTFGGVFSYCLPPTDAGASGSLAMGNESSV 317
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 100/349 (28%), Positives = 157/349 (44%), Gaps = 72/349 (20%)
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
+V IG Q +++++DTGSD+TW QC PC+ C+ Q+ P F S S ++ + CNS++C+
Sbjct: 134 IVTIGLGNQNMTVIIDTGSDLTWVQCDPCMSCYSQQGPVFNPSNSSSYNSLLCNSSTCQN 193
Query: 195 LRESFPFGNC------NSKECPFNIQYADGSGS-----------GGFWATDRITIQEANS 237
L+ F GN N C + Y DGS + GG ++ + N+
Sbjct: 194 LQ--FTTGNTEACESNNPSSCNHTVSYGDGSFTDGELGVEHLSFGGISVSNFVFGCGRNN 251
Query: 238 NGYFTRYPFLLGC----------INNSSGD---------KSGASG--IMGLDRS------ 270
G F ++G N + G SGASG ++G + S
Sbjct: 252 KGLFGGVSGIMGLGRSNLSMISQTNTTFGGVFSYCLPTTDSGASGSLVIGNESSLFKNLT 311
Query: 271 PV---SIITRTNTSYFSYCL-------------PSPYGSTGYITFGKT-------DTVNT 307
P+ S+++ S F Y L + +G+ G + T N
Sbjct: 312 PIAYTSMVSNPQLSNF-YVLNLTGIDVGGVAIQDTSFGNGGILIDSGTVITRLAPSLYNA 370
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVV- 366
L++ F K+ Y A L +LDTC++L+ E V +P +++HF VDL +D G L +
Sbjct: 371 LKAEFLKQFSGYPIAPALS-ILDTCFNLTGIEEVSIPTLSMHFENNVDLNVDAVGILYMP 429
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
SQVCL A+ + + +GN QQR V YD ++GF +CS
Sbjct: 430 KDGSQVCLALASLSDENDMAIIGNYQQRNQRVIYDAKQSKIGFAREDCS 478
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 86/295 (29%), Positives = 143/295 (48%), Gaps = 29/295 (9%)
Query: 34 IVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQ 93
+V S+ P N + P +AS+ ++ ++GPC+ + +T+ PS E+LR+D+
Sbjct: 30 VVQTSTSSPSNAACSPAAQVTSDPSRASMPLMYRHGPCAPASAA-ATNRPSPAEMLRRDR 88
Query: 94 QR----LHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVAD-EYYIVVAIGEPKQYVSLL 148
R L + RR+ T + P ++ V +Y + + G P LL
Sbjct: 89 ARRNHILRKASGRRI----------TLGVSIPTSLGAFVDSLQYVVTLGFGTPAVPQVLL 138
Query: 149 LDTGSDVTWTQCKPC--IHCFQQRDPFFYASKSKTFFKIPCNSTSCRILR-ESFPFGNCN 205
+DTGSD++W QC+PC C+ Q+DP F S S T+ +PC S +CR L +S+ G N
Sbjct: 139 IDTGSDLSWVQCQPCNSSTCYPQKDPVFDPSASSTYAPVPCGSEACRDLDPDSYANGCTN 198
Query: 206 SKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGA 261
S C + IQY +G + G ++T+ +T+ + F GC G
Sbjct: 199 SSSGASLCQYGIQYGNGDTTVGVYSTETLTLSPEAAT---VVNNFSFGCGLVQKGVFDLF 255
Query: 262 SGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDTVNTLRSAFH 313
G++GL +P S++++T +Y FSYCLP+ + G++ G T + F
Sbjct: 256 DGLLGLGGAPESLVSQTTGTYGGAFSYCLPAGNSTAGFLALGAPATGGNNTAGFQ 310
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 46/123 (37%), Positives = 63/123 (51%), Gaps = 6/123 (4%)
Query: 294 TGYITFGKTDTV-NTLRSAFHKRMKKYKKAKGLEDL-LDTCYDLSAYETVVVPKIAIHFL 351
+G I G +T + LR+AF M Y +D LDTCYD + V VP +A+ F
Sbjct: 351 SGTIVTGLPETAYSALRTAFRSAMSAYPLLPPNDDEDLDTCYDFTGNTNVTVPTVALTFE 410
Query: 352 GGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGP 411
GGV ++LDV +++ CL F D ++ +GNV QR EV YD A +GF
Sbjct: 411 GGVTIDLDVPSGVLLDG----CLAFVAGASDGDTGIIGNVNQRTFEVLYDSARGHVGFRA 466
Query: 412 GNC 414
G C
Sbjct: 467 GAC 469
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 95/354 (26%), Positives = 154/354 (43%), Gaps = 80/354 (22%)
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
+V +G + +++++DTGSD+TW QC+PC+ C+ Q+ P F S S ++ + CNS++C+
Sbjct: 66 IVTMGLGSKNMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQS 125
Query: 195 LR----ESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
L+ + G+ N C + + Y DGS + G EA S G + F+ GC
Sbjct: 126 LQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGV------EALSFGGVSVSDFVFGC 179
Query: 251 INNSSG------------------------------------DKSGASG--IMGLDRS-- 270
N+ G ++G+SG +MG + S
Sbjct: 180 GRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTEAGSSGSLVMGNESSVF 239
Query: 271 ----PVS--------------IITRTNTSYFSYCLPSP--YGSTG-------YITFGKTD 303
P++ I+ T L +P +G+ G IT +
Sbjct: 240 KNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALKAPLSFGNGGILIDSGTVITRLPSS 299
Query: 304 TVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGT 363
L++ F K+ + A G +LDTC++L+ Y+ V +P I++ F G L +D GT
Sbjct: 300 VYKALKAEFLKKFTGFPSAPGFS-ILDTCFNLTGYDEVSIPTISLRFEGNAQLNVDATGT 358
Query: 364 LVVAS--VSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
V SQVCL A+ ++ +GN QQR V YD ++GF CS
Sbjct: 359 FYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKVGFAEEPCS 412
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 89/281 (31%), Positives = 135/281 (48%), Gaps = 48/281 (17%)
Query: 33 HIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQD 92
H +VSSLLP N C+ + QG L + KYGPCS + PS +EI +D
Sbjct: 41 HSTTVSSLLPKNKCSASARGGSQG-----LPITQKYGPCSGSGH---SQPPSPQEIFGRD 92
Query: 93 QQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADE---YYIVVAIGEPKQYVSLLL 149
+ R+ NS+ + N+ + DE + + VA G P Q L+L
Sbjct: 93 ESRVSFINSKCNQYTSGNLKNHAH--------NNNLFDEDGNFLVDVAFGTPPQKFKLIL 144
Query: 150 DTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNC--NSK 207
DTGS +TWTQCK C+HC + F + S T + FG+C ++
Sbjct: 145 DTGSSITWTQCKACVHCLKDSHRHFDSLASST-----------------YSFGSCIPSTV 187
Query: 208 ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD-KSGASGIMG 266
+N+ Y D S S G + D +T++ ++ F ++ F GC N+ GD SGA G++G
Sbjct: 188 GNTYNMTYGDKSTSVGNYGCDTMTLEPSD---VFQKFQF--GCGRNNEGDFGSGADGMLG 242
Query: 267 LDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDT 304
L + +S +++T + + FSYCLP S G + FG+ T
Sbjct: 243 LGQGQLSTVSQTASKFKKVFSYCLPEE-NSIGSLLFGEKAT 282
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 53/92 (57%), Gaps = 3/92 (3%)
Query: 327 DLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPP---DP 383
D+LDTCY+LS + V++P+ +HF G D+ L+ + + S++CL FA +P
Sbjct: 375 DMLDTCYNLSGRKDVLLPEXVLHFGDGADVRLNGKRVVWGNDASRLCLAFAGNSKSTMNP 434
Query: 384 NSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+GN QQ V YD+ GRR+GFG CS
Sbjct: 435 ELTIIGNRQQVSLTVLYDIRGRRIGFGGNGCS 466
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 104/357 (29%), Positives = 156/357 (43%), Gaps = 76/357 (21%)
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPC 187
+ YY+ + +G P +Y ++++DTGS +W QC+PC I+C Q DP F S SKT+ +PC
Sbjct: 100 SGNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPC 159
Query: 188 NSTSCRILRESF---PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA--------- 235
+S+ C L+ + P + S C + Y D S S G+ + D +T+ +
Sbjct: 160 SSSQCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPSQTLSSFVYG 219
Query: 236 ---NSNGYFTRYPFLLGCINNSSGDKSGASGIMG-------------------------- 266
++ G F R ++G NN S SG G
Sbjct: 220 CGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSIGT 279
Query: 267 -------------------------LDRSPVSIITR-TNTSYFSYCLPSPYGSTGYITFG 300
+D +++ R + SY +P+ S IT
Sbjct: 280 SSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPTIIDSGTVITRL 339
Query: 301 KTDTVNTLRSAFHKRM-KKYKKAKGLEDLLDTCYDLS-AYETVVVPKIAIHFLGGVDLEL 358
T TL++A+ + KKY++A G+ LLDTC+ S A + V P I I F GG DL+L
Sbjct: 340 PTPVYTTLKNAYVTILSKKYQQAPGIS-LLDTCFKGSLAGISEVAPDIRIIFKGGADLQL 398
Query: 359 DVRGTLVVASVSQVCLGFATYPPDPNSITL-GNVQQRGHEVHYDVAGRRLGFGPGNC 414
+LV CL A +SI + GN QQ+ +V YDV R+GF PG C
Sbjct: 399 KGHNSLVELETGITCLAMA----GSSSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 104/357 (29%), Positives = 156/357 (43%), Gaps = 76/357 (21%)
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPC 187
+ YY+ + +G P +Y ++++DTGS +W QC+PC I+C Q DP F S SKT+ +PC
Sbjct: 100 SGNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPC 159
Query: 188 NSTSCRILRESF---PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA--------- 235
+S+ C L+ + P + S C + Y D S S G+ + D +T+ +
Sbjct: 160 SSSQCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPSQTLSSFVYG 219
Query: 236 ---NSNGYFTRYPFLLGCINNSSGDKSGASGIMG-------------------------- 266
++ G F R ++G NN S SG G
Sbjct: 220 CGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSIGT 279
Query: 267 -------------------------LDRSPVSIITR-TNTSYFSYCLPSPYGSTGYITFG 300
+D +++ R + SY +P+ S IT
Sbjct: 280 SSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPTIIDSGTVITRL 339
Query: 301 KTDTVNTLRSAFHKRM-KKYKKAKGLEDLLDTCYDLS-AYETVVVPKIAIHFLGGVDLEL 358
T TL++A+ + KKY++A G+ LLDTC+ S A + V P I I F GG DL+L
Sbjct: 340 PTPVYTTLKNAYVTILSKKYQQAPGIS-LLDTCFKGSLAGISEVAPDIRIIFKGGADLQL 398
Query: 359 DVRGTLVVASVSQVCLGFATYPPDPNSITL-GNVQQRGHEVHYDVAGRRLGFGPGNC 414
+LV CL A +SI + GN QQ+ +V YDV R+GF PG C
Sbjct: 399 KGHNSLVELETGITCLAMA----GSSSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 65/183 (35%), Positives = 96/183 (52%), Gaps = 14/183 (7%)
Query: 126 DTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKI 185
D + EY++ V IG P L++D+GSDV W QCKPC+ C+ Q DP F + S TF +
Sbjct: 121 DEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPATSATFSAV 180
Query: 186 PCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
PC S CR LR S G +S C + + Y DGS + G A + +T+ G
Sbjct: 181 PCGSAVCRTLRTS---GCGDSGGCDYEVSYGDGSYTKGALALETLTLGGTAVEG------ 231
Query: 246 FLLGCINNSSGDKSGASGIMGLDRSPVSIITR---TNTSYFSYCLPSPYGSTGYITFGKT 302
+GC + + G GA+G++GL P+S++ + FSYCL S G + G++
Sbjct: 232 VAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASR--GAGSLVLGRS 289
Query: 303 DTV 305
+ V
Sbjct: 290 EAV 292
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/108 (39%), Positives = 56/108 (51%), Gaps = 3/108 (2%)
Query: 307 TLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVV 366
LR AF + +A G+ LLDTCYDLS Y +V VP ++ +F G L L R L+
Sbjct: 360 ALRDAFVAAVGALPRAPGVS-LLDTCYDLSGYTSVRVPTVSFYFDGAATLTLPARNLLLE 418
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
CL FA P+ LGN+QQ G ++ D A +GFGP C
Sbjct: 419 VDGGIYCLAFAPSSSGPS--ILGNIQQEGIQITVDSANGYIGFGPTTC 464
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 100/353 (28%), Positives = 156/353 (44%), Gaps = 74/353 (20%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
Y + V IG + +++++DTGSD+TW QC+PC C+ Q+DP F S S ++ I CNS+
Sbjct: 66 NYIVTVEIG--GRNMTVIVDTGSDLTWVQCQPCRLCYNQQDPLFNPSGSPSYQTILCNSS 123
Query: 191 SCRILR-ESFPFGNC--NSKECPFNIQYADGSGSGGFWATDRITIQEA-----------N 236
+C+ L+ + G C N+ C + + Y DGS + G +++ + N
Sbjct: 124 TCQSLQYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLNLGTTHVSNFIFGCGRN 183
Query: 237 SNGYFTRYPFLLG----------------------CINNSSGDKSGASGIMGLDRS---- 270
+ G F L+G C+ ++ D SG S I+G + S
Sbjct: 184 NKGLFGGASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTAADASG-SLILGGNSSVYKN 242
Query: 271 --PVS--------------IITRTNTSYFSYCLPSP-YGSTGYITFGKTDTVNT------ 307
P+S + T S L +P Y +G + + TV T
Sbjct: 243 TTPISYTRMIANPQLPTFYFLNLTGISIGGVALQAPNYRQSGILI--DSGTVITRLPPPV 300
Query: 308 ---LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGT- 363
L++ F K+ + A +LDTC++L+ Y+ V +P I + F G +L +DV G
Sbjct: 301 YRDLKAEFLKQFSGFPSAPPFS-ILDTCFNLNGYDEVDIPTIRMQFEGNAELTVDVTGIF 359
Query: 364 -LVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
V SQVCL A+ D +GN QQR V Y+ +LGF CS
Sbjct: 360 YFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYNTKESKLGFAAEACS 412
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 122/476 (25%), Positives = 199/476 (41%), Gaps = 87/476 (18%)
Query: 8 FLLFICLLCSSNNGAYADDNDLSHSHIVSVSSLLPPNVCN--RTRTALPQGPDKASLEVV 65
LL C++ + + A D ++S SSL P VC + R + G A++ +
Sbjct: 7 LLLLPCIIMITYHALVARAGDEKSYKVLSASSLKPGAVCAEPKVRDSSSSG---ATVPLN 63
Query: 66 SKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEF--LKRTEAFTFPAN 123
++GPCS + G P+ E+LR+DQ R + + + +P L+++EA T P
Sbjct: 64 HRHGPCSPVPSGKKKQ-PTFTELLRRDQLRANYIQRQFSDEHYPRTGGLQQSEA-TVPIA 121
Query: 124 INDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTF 182
+ + EY I V+IG P ++ +DTGSDV+W +CK ++ DP S T+
Sbjct: 122 LGSLLNTLEYVITVSIGSPAVAXTMFIDTGSDVSWLRCKSRLY-----DP----GTSSTY 172
Query: 183 FKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFT 242
C++ +C L G + C ++++Y DGS + G + +D +T+ S +
Sbjct: 173 APFSCSAPACAQLGRRGT-GCSSGSTCVYSVKYGDGSNTTGTYGSDTLTL-AGTSEPLIS 230
Query: 243 RYPFLLGCINNSSG-DKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYIT 298
+ F GC G ++ G+MGL S +++T +Y FSYCLP + S+G++T
Sbjct: 231 GFQF--GCSAVEHGFEEDNTDGLMGLGGDAQSFVSQTAATYGSAFSYCLPPTWNSSGFLT 288
Query: 299 FGKTDTVNTLRSAFHKRMKKYKKA--------------KGLE---------DLLDTCYDL 335
G + + + ++ + A K LE ++D+ +
Sbjct: 289 LGAPSSSTSAAFSTTPMLRSKQAATFYGLLLRGISVGGKTLEIPSSVFSAGSIVDSGTVI 348
Query: 336 SAYETVVVPKIAIHFLGGV--------------DLELDVRG-----TLVVASVSQVCLGF 376
+ ++ F G+ D D G V SV+ V G
Sbjct: 349 TRLPPTAYGALSAAFRDGMARYQYQPAAPRGLLDTCFDFTGHGEGNNFTVPSVALVLDGG 408
Query: 377 ATYPPDPNSIT------------------LGNVQQRGHEVHYDVAGRRLGFGPGNC 414
A PN I +GNVQQR EV YDV GF PG C
Sbjct: 409 AVVDLHPNGIVQDGCLAFAATDDDGRTGIIGNVQQRTFEVLYDVGQSVFGFRPGAC 464
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 67/166 (40%), Positives = 92/166 (55%), Gaps = 10/166 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY I +AIG P Q VS LLDTGSD+ WTQC PC C Q DP F + S ++ + C+
Sbjct: 102 EYLIDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPAASSSYVPMRCSGQ 161
Query: 191 SCR-ILRESFPFGNCNSKE-CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
C IL S C + C + Y DG+ + G +AT+R T A+S+G P
Sbjct: 162 LCNDILHHS-----CQRPDTCTYRYNYGDGTTTLGVYATERFTF--ASSSGEKLSVPLGF 214
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGST 294
GC + G + SGI+G R P+S++++ + FSYCL +PY ST
Sbjct: 215 GCGTMNVGSLNNGSGIVGFGRDPLSLVSQLSIRRFSYCL-TPYTST 259
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 95/338 (28%), Positives = 141/338 (41%), Gaps = 82/338 (24%)
Query: 147 LLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCN- 205
++LDTGSDVTW QC+PC C+QQ DP F S S ++ + C+S CR L + C
Sbjct: 1 MVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTA----ACRN 56
Query: 206 -SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSG-------- 256
+ C + + Y DGS + G +AT+ +T+ ++ G +GC +++ G
Sbjct: 57 ATGACLYEVAYGDGSYTVGDFATETLTLGDSTPVGNVA-----IGCGHDNEGLFVGAAGL 111
Query: 257 --------------------------DKSGASGIM--------GLDRSPVSIITRTNTSY 282
D AS + G +P+ RT+T Y
Sbjct: 112 LALGGGPLSFPSQISASTFSYCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTFY 171
Query: 283 F------------------SYCLPSPYGSTGYITFGKTDTVN-------TLRSAFHKRMK 317
+ ++ + + GS G I T LR AF +
Sbjct: 172 YVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAP 231
Query: 318 KYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-VASVSQVCLGF 376
+ G+ L DTCYDLS +V VP +++ F GG L L + L+ V CL F
Sbjct: 232 SLPRTSGVS-LFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAF 290
Query: 377 ATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
A P + +GNVQQ+G V +D A +GF P C
Sbjct: 291 A--PTNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 326
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 78/262 (29%), Positives = 122/262 (46%), Gaps = 33/262 (12%)
Query: 62 LEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRL----HLKNSRRLRKPFPEFLKRTEA 117
+ +V ++GPCS L PS EIL DQ R H ++ + P+ +R +
Sbjct: 92 MTIVHRHGPCSPL-AAAHRKPPSHGEILAADQNRAESIQHRVSTTATGRGKPKRSRRQQP 150
Query: 118 FTFPANINDTVA---------------DEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKP 162
+ PA + Y + V +G P +++ DTGSD TW QC+P
Sbjct: 151 SSAPAPAASLSSSTASLPASSGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQP 210
Query: 163 C-IHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGS 221
C + C++QR+ F ++S T+ + C + +C L C+ C + +QY DGS S
Sbjct: 211 CVVVCYEQREKLFDPARSSTYANVSCAAPACSDLN----IHGCSGGHCLYGVQYGDGSYS 266
Query: 222 GGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTS 281
GF+A D +T+ Y F GC + G A+G++GL R S+ +T
Sbjct: 267 IGFFAMDTLTLSS-----YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDK 321
Query: 282 Y---FSYCLPSPYGSTGYITFG 300
Y F++CLP+ TGY+ FG
Sbjct: 322 YGGVFAHCLPARSTGTGYLDFG 343
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 39/98 (39%), Positives = 55/98 (56%), Gaps = 1/98 (1%)
Query: 317 KKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGF 376
+ YKKA + LLDTCYD + V +P +++ F GG L++D G + AS SQVCL F
Sbjct: 423 RGYKKAPAVS-LLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASASQVCLAF 481
Query: 377 ATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
A + +GN Q + V YD+ + +GF PG C
Sbjct: 482 AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 101/358 (28%), Positives = 154/358 (43%), Gaps = 83/358 (23%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+I V++G P + + L++DTGSD+ W QC PC+ C+ Q D F KS T+ + CNS
Sbjct: 36 EYFIRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCYHQCDEVFDPYKSSTYSTLGCNSR 95
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGS----------------GGFWATDRITIQE 234
C L G C +C + + Y DGS S GG ++I +
Sbjct: 96 QCLNLD----VGGCVGNKCLYQVDYGDGSFSTGEFATDAVSLNSTSGGGQVVLNKIPLGC 151
Query: 235 ANSN-GYFT-------------------------RYPFLLGCINNSSGDKS----GASGI 264
+ N GYF R+ + L + S ++S G + +
Sbjct: 152 GHDNEGYFVGAAGLLGLGKGPLSFPNQINSENGGRFSYCLTGRDTDSTERSSLIFGDAAV 211
Query: 265 --MGLDRSPVSIITRTNTSYFSYCLPSPYGST----------------GYITFGKTDTVN 306
G+ +P + R +T Y+ G + G + +V
Sbjct: 212 PPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQLDSLGNGGVIIDSGTSVT 271
Query: 307 TLRSAFHKRMKKYKKAKGLEDLL--------DTCYDLSAYETVVVPKIAIHFLGGVDLEL 358
L++A + +++ +A G DL+ DTCY+LS +V VP + +HF GG DL+L
Sbjct: 272 RLQNAAYASLREAFRA-GTSDLVLTTEFSLFDTCYNLSDLSSVDVPTVTLHFQGGADLKL 330
Query: 359 DVRGTLV-VASVSQVCLGFA-TYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
LV V + S CL FA T P +GN+QQ+G V YD ++GF P C
Sbjct: 331 PASNYLVPVDNSSTFCLAFAGTTGPS----IIGNIQQQGFRVIYDNLHNQVGFVPSQC 384
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 110/392 (28%), Positives = 167/392 (42%), Gaps = 68/392 (17%)
Query: 83 PSLEEILRQDQQRL-HLKNSRRLRKPFPEFLKRTEAFTFPANINDTVAD-EYYIVVAIGE 140
P+LEE L +DQ R +++ ++R++A T P + ++ EY I V +G
Sbjct: 2 PTLEETLHRDQLRAAYIQRKFSGGGGAGGDVQRSDA-TVPTALGTSLNTLEYLITVGLGS 60
Query: 141 PKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFP 200
P ++L+DTGSDV+W QCKPC C Q DP F S S T+ C S C L +
Sbjct: 61 PATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSADCAQLGQE-G 119
Query: 201 FGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSG 260
G +S +C + + Y DGS + G +++D + + G F GC N SG
Sbjct: 120 NGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL------GSSAVRSFQFGCSNVESGFNDQ 173
Query: 261 ASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDTVNT--------LR 309
G+MGL S++++T + FSYCLP S+G++T G T LR
Sbjct: 174 TDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGSGTSGFVKTPMLR 233
Query: 310 SA----FHKRMKKYKKAKGLE-----------DLLDTCYDLSAYETVVVPKIAIHFLGGV 354
S+ F+ + + G + ++D+ ++ ++ F G+
Sbjct: 234 SSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVMDSGTVITRLPPTAYSALSSAFKAGM 293
Query: 355 ------------DLELDVRG--TLVVASVSQVCLGFATYPPDPNSITL------------ 388
D D G ++ + SV+ V G A D + I L
Sbjct: 294 KQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIILSNCLAFAGNSDD 353
Query: 389 ------GNVQQRGHEVHYDVAGRRLGFGPGNC 414
GNVQQR EV YDV +GF G C
Sbjct: 354 SSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 385
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 68/204 (33%), Positives = 107/204 (52%), Gaps = 19/204 (9%)
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPC 187
A Y++++++G P ++DTGSD+TWTQC PC CF Q P + ++S TF K+PC
Sbjct: 93 AGAYHMILSVGTPPLAFPAIIDTGSDLTWTQCAPCTTACFAQPTPLYDPARSSTFSKLPC 152
Query: 188 NSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITI----QEANSNGYFTR 243
S C+ L +F CN+ C ++ +YA G + G+ A D + I + +++ F
Sbjct: 153 ASPLCQALPSAFR--ACNATGCVYDYRYAVGF-TAGYLAADTLAIGDGDGDGDASSSFAG 209
Query: 244 YPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL---------PSPYGST 294
F GC + GD GASGI+GL RS +S++++ FSYCL P +G+
Sbjct: 210 VAF--GCSTANGGDMDGASGIVGLGRSALSLLSQIGVGRFSYCLRSDADAGASPILFGAL 267
Query: 295 GYITFGKTDTVNTLRSAFHKRMKK 318
+T K + LR+ R +
Sbjct: 268 ANVTGDKVQSTALLRNPVAARRRA 291
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 69/201 (34%), Positives = 100/201 (49%), Gaps = 14/201 (6%)
Query: 94 QRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGS 153
+R + RR+R L+ + P D EY + VAIG P S ++DTGS
Sbjct: 62 KRAIKRGERRMRS-INAMLQSSSGIETPVYAGD---GEYLMNVAIGTPDSSFSAIMDTGS 117
Query: 154 DVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNI 213
D+ WTQC+PC CF Q P F S +F +PC S C+ L P CN+ EC +
Sbjct: 118 DLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDL----PSETCNNNECQYTY 173
Query: 214 QYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVS 273
Y DGS + G+ AT+ T + ++ F G N G +GA G++G+ P+S
Sbjct: 174 GYGDGSTTQGYMATETFTFETSS----VPNIAFGCGEDNQGFGQGNGA-GLIGMGWGPLS 228
Query: 274 IITRTNTSYFSYCLPSPYGST 294
+ ++ FSYC+ S YGS+
Sbjct: 229 LPSQLGVGQFSYCMTS-YGSS 248
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 95/348 (27%), Positives = 147/348 (42%), Gaps = 66/348 (18%)
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCI-HCFQQRDPFFYASKSKTFFKIPC 187
+ YY+ V +G P +Y S+++DTGS ++W QCKPC+ +C Q DP F S SKT+ + C
Sbjct: 10 SGNYYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSC 69
Query: 188 NSTSCRILRESF---PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA--------- 235
S+ C L ++ P +S C + Y D S S G+ + D +T+ +
Sbjct: 70 TSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQTLPGFVYG 129
Query: 236 ---NSNGYFTRYPFLLGCINNSS---GDKS--------------GASGIMGLDRS----- 270
+S G F R +LG N G S G G + + ++
Sbjct: 130 CGQDSEGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGGGFLSIGKASLAGS 189
Query: 271 -----PVSIITRTNTSYF------------------SYCLPSPYGSTGYITFGKTDTVNT 307
P++ + YF Y +P+ S IT
Sbjct: 190 AYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTIIDSGTVITRLPMSVYTP 249
Query: 308 LRSAFHKRMK-KYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVV 366
+ AF K M KY +A G +LDTC+ + + VP++ + F GG DL L L+
Sbjct: 250 FQQAFVKIMSSKYARAPGFS-ILDTCFKGNLKDMQSVPEVRLIFQGGADLNLRPVNVLLQ 308
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
CL FA + +GN QQ+ +V +D++ R+GF G C
Sbjct: 309 VDEGLTCLAFAG---NNGVAIIGNHQQQTFKVAHDISTARIGFATGGC 353
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 120/464 (25%), Positives = 187/464 (40%), Gaps = 83/464 (17%)
Query: 10 LFICLLCSSNNGAYADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYG 69
L + LLC +G +D +++V SL VC+ T P ++ + +YG
Sbjct: 17 LLLVLLCGYYSGVAFAADDARTYKVLAVGSLKAEVVCSVT----PASSSGTTVPLNHRYG 72
Query: 70 PCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVA 129
PCS S P++ E+L DQ R + + L T T + + DT+
Sbjct: 73 PCS---PAPSAKVPTILELLEHDQLRAKYIQRKLSGTDGLQPLDLTVPTTLGSAL-DTM- 127
Query: 130 DEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNS 189
EY I V IG P ++++DTGSDV+W +C F SKS T+ C+S
Sbjct: 128 -EYVITVGIGSPAVTQTMMIDTGSDVSWVRCNS-----TDGLTLFDPSKSTTYAPFSCSS 181
Query: 190 TSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
+C L + C++ C + +QY DGS + G +++D + + ++ T F G
Sbjct: 182 AACAQLGNNGD--GCSNSGCQYRVQYGDGSNTTGTYSSDTLALSASD-----TVTDFHFG 234
Query: 250 CINNSSG-DKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDTV 305
C ++ D G+MGL S++++T +Y FSYCLP ++G++TFG +
Sbjct: 235 CSHHEEDFDGEKIDGLMGLGGDAQSLVSQTAATYGKSFSYCLPPTNRTSGFLTFGAPN-- 292
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHF--------------- 350
T M ++ KA L +L + + P + +
Sbjct: 293 GTSGGFVTTPMLRWPKAPTLYGVLLQDISVGGTPLGIQPSVLSNGSVMDSGTVITWLPRR 352
Query: 351 -----------------------LGGVDLELDVRGTLVVA--SVSQVCLGFATYPPDPNS 385
LG +D D G + V+ +VS V G A D N
Sbjct: 353 AYSALSSAFRSSMTRLRHQRAAPLGILDTCYDFTGLVNVSIPAVSLVLDGGAVVDLDGNG 412
Query: 386 IT---------------LGNVQQRGHEVHYDVAGRRLGFGPGNC 414
I +GNVQQR EV +DV GF G C
Sbjct: 413 IMIQDCLAFAATSGDSIIGNVQQRTFEVLHDVGQGVFGFRSGAC 456
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 93/352 (26%), Positives = 154/352 (43%), Gaps = 73/352 (20%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y + V +G + +SL++DTGSD+TW QC+PC C+ Q+ P + S S ++ + CNS++
Sbjct: 138 YIVTVELG--GKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSST 195
Query: 192 CRIL----RESFPFGNCNS---KECPFNIQYADGSGSGGFWATDRITIQEA--------- 235
C+ L S P G N C + + Y DGS + G A++ I + +
Sbjct: 196 CQDLVAATGNSGPCGGFNGVVKTTCEYVVSYGDGSYTRGDLASESIVLGDTKLENLVFGC 255
Query: 236 --NSNGYFTRYPFLLGCINNSSGDKS-------------------GASGIMGLDRSPVSI 274
N+ G F L+G +S S GASG + S+
Sbjct: 256 GRNNKGLFGGASGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGTLSFGND-FSV 314
Query: 275 ITRTNTSYFSYCLPSPYGSTGYI----------------TFGKTDTVNT----------- 307
+ + +++ + +P + YI +FG+ +++
Sbjct: 315 YKNSTSVFYTPLVQNPQLRSFYILNLTGASIGGVELKTLSFGRGILIDSGTVITRLPPSI 374
Query: 308 ---LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGT- 363
+++ F K+ + A G +LDTC++L++YE + +P I + F G +LE+DV G
Sbjct: 375 YKAVKTEFLKQFSGFPSAPGY-SILDTCFNLTSYEDISIPTIKMIFEGNAELEVDVTGVF 433
Query: 364 -LVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V S VCL A+ + +GN QQ+ V YD RLG NC
Sbjct: 434 YFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIAGENC 485
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 61/161 (37%), Positives = 90/161 (55%), Gaps = 12/161 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
E+ + V+IG P S ++DTGSD+ WTQCKPC+ CF+Q P F S S T+ +PC+S
Sbjct: 104 EFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSA 163
Query: 191 SCRILRESFPFGNCNS-KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
SC L P C S +C + Y D S + G AT+ T+ ++ G + G
Sbjct: 164 SCSDL----PTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPG------VVFG 213
Query: 250 CINNSSGDK-SGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
C + + GD S +G++GL R P+S++++ FSYCL S
Sbjct: 214 CGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTS 254
Score = 47.0 bits (110), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 53/121 (43%), Gaps = 7/121 (5%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSA--YETVVVPKIAIHFLGGV 354
IT+ + L+ AF +M A G LD C+ A + V VP++ HF GG
Sbjct: 335 ITYLEVQGYRALKKAFAAQMA-LPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGA 393
Query: 355 DLELDVRGTLVVASVS-QVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
DL+L +V+ S +CL T +GN QQ+ + YDV L F P
Sbjct: 394 DLDLPAENYMVLDGGSGALCL---TVMGSRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQ 450
Query: 414 C 414
C
Sbjct: 451 C 451
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 60/172 (34%), Positives = 96/172 (55%), Gaps = 8/172 (4%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ + +G P ++V L+LDTGSD++W QC PC CF+Q +Y S T+ I C
Sbjct: 170 EYFLDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGSHYYPKDSSTYRNISCYDP 229
Query: 191 SCRILRESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNG---YFTRYP 245
C+++ S P +C ++ CP+ YADGS + G +A++ T+ NG +
Sbjct: 230 RCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQVVD 289
Query: 246 FLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGST 294
+ GC + + G GASG++GL R P+S ++ + Y FSYCL + +T
Sbjct: 290 VMFGCGHWNKGFFYGASGLLGLGRGPISFPSQIQSIYGHSFSYCLTDLFSNT 341
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 35/126 (27%), Positives = 56/126 (44%), Gaps = 5/126 (3%)
Query: 292 GSTGYITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLS-AYETVVVPKIAIHF 350
GST +TF + ++ AF K++K + A + ++ CY++S A V +P IHF
Sbjct: 419 GST--LTFFPDSAYDIIKEAFEKKIKLQQIAAD-DFVMSPCYNVSGAMMQVELPDFGIHF 475
Query: 351 LGGVDLELDVRGTLVVASVSQV-CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGF 409
G +V CL P + +GN+ Q+ + YDV RLG+
Sbjct: 476 ADGGVWNFPAENYFYQYEPDEVICLAIMKTPNHSHLTIIGNLLQQNFHILYDVKRSRLGY 535
Query: 410 GPGNCS 415
P C+
Sbjct: 536 SPRRCA 541
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 76/235 (32%), Positives = 113/235 (48%), Gaps = 25/235 (10%)
Query: 78 ISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVAD-EYYIVV 136
+ H P ILR+D R+ + RRL A T PA++ EY + +
Sbjct: 78 VPDHHPHYTGILRRDHNRVRSIH-RRLTG------AGDTAATIPASLGLAFHSLEYVVTI 130
Query: 137 AIGEPKQYVSLLLDTGSDVTWTQCKPCI-HCFQQRDPFFYASKSKTFFKIPCNSTSCRIL 195
IG P + ++L DTGSD+TW QCKPC C+QQ++P F SKS T+ +PC + C+I
Sbjct: 131 GIGTPARNFTVLFDTGSDLTWVQCKPCTDSCYQQQEPLFDPSKSSTYVDVPCGTPQCKIG 190
Query: 196 RESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSS 255
C C ++++Y D S + G A + T+ + + GC + S
Sbjct: 191 GGQDL--TCGGTTCEYSVKYGDQSVTRGNLAQEAFTLSPSAPPA----AGVVFGCSHEYS 244
Query: 256 GDKSGA------SGIMGLDRSPVSIITRT----NTSYFSYCLPSPYGSTGYITFG 300
GA +G++GL R SI+++T + FSYCLP S GY+T G
Sbjct: 245 SGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYCLPPRGSSAGYLTIG 299
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 66/117 (56%), Gaps = 11/117 (9%)
Query: 308 LRSAFHKRMKKYKK-AKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVV 366
LR F + M Y +G + LDTCYD++ ++ V P +A+ F GG +++D G L+V
Sbjct: 368 LRDEFRRHMGGYTMLPEGHVESLDTCYDVTGHDVVTAPPVALEFGGGARIDVDASGILLV 427
Query: 367 -------ASVSQVCLGFATYPPD-PNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
S++ CL F P + P + +GN+QQR + V +DV GRR+GFG CS
Sbjct: 428 FAVDASGQSLTLACLAFV--PTNLPGFVIIGNMQQRAYNVVFDVEGRRIGFGANGCS 482
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 61/161 (37%), Positives = 90/161 (55%), Gaps = 12/161 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
E+ + V+IG P S ++DTGSD+ WTQCKPC+ CF+Q P F S S T+ +PC+S
Sbjct: 94 EFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSA 153
Query: 191 SCRILRESFPFGNCNS-KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
SC L P C S +C + Y D S + G AT+ T+ ++ G + G
Sbjct: 154 SCSDL----PTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPG------VVFG 203
Query: 250 CINNSSGDK-SGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
C + + GD S +G++GL R P+S++++ FSYCL S
Sbjct: 204 CGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTS 244
Score = 47.0 bits (110), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 53/121 (43%), Gaps = 7/121 (5%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSA--YETVVVPKIAIHFLGGV 354
IT+ + L+ AF +M A G LD C+ A + V VP++ HF GG
Sbjct: 325 ITYLEVQGYRALKKAFAAQMA-LPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGA 383
Query: 355 DLELDVRGTLVVASVS-QVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
DL+L +V+ S +CL T +GN QQ+ + YDV L F P
Sbjct: 384 DLDLPAENYMVLDGGSGALCL---TVMGSRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQ 440
Query: 414 C 414
C
Sbjct: 441 C 441
>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
Length = 328
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/177 (36%), Positives = 103/177 (58%), Gaps = 18/177 (10%)
Query: 139 GEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCR-ILRE 197
G P +++++DTGSD+TW QCKPC C+ QRDP F + S T+ + CN+++C LR
Sbjct: 103 GSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACADSLRA 162
Query: 198 SFPF-GNC-----NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCI 251
+ G+C S++C + + Y DGS S G ATD + + A+ G F+ GC
Sbjct: 163 ATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGASLGG------FVFGCG 216
Query: 252 NNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYG--STGYITFGKTD 303
++ G G +G+MGL R+ +S++++T + Y FSYCLP+ ++G ++ G D
Sbjct: 217 LSNRGLFGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSLSLGGGD 273
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 94/340 (27%), Positives = 142/340 (41%), Gaps = 79/340 (23%)
Query: 141 PKQYVSLLLDTGSDVTWTQCKPCI--HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRES 198
P +++LD+ SDV W QC PC C Q D F+ S+S T C+S +C L
Sbjct: 25 PGVIQTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTALG-- 82
Query: 199 FPFGN-CNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD 257
P+ N C + +C + ++Y DGS + G + D +T+ N+ F GC + G
Sbjct: 83 -PYANGCANNQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFK-----FGCSHAEQGS 136
Query: 258 -KSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFG------------- 300
+ A+GIM L P S++++T + Y FSYC+P+ +G+ T G
Sbjct: 137 FDARAAGIMALGGGPESLLSQTASRYGNAFSYCIPATASDSGFFTLGVPRRASSRYVVTP 196
Query: 301 ---------------KTDTVNTLR-------------------------SAFHKRMKKYK 320
+T TV R +A+ ++
Sbjct: 197 MVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAITRLPPTAYQALRAAFR 256
Query: 321 KAKGL------EDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCL 374
+ + + LDTCYD + + +PKI++ F L LD G L CL
Sbjct: 257 SSMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILF-----NDCL 311
Query: 375 GFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
F + D LG+VQQ+ EV YDV G +GF G C
Sbjct: 312 AFTSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 351
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 61/161 (37%), Positives = 90/161 (55%), Gaps = 12/161 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
E+ + V+IG P S ++DTGSD+ WTQCKPC+ CF+Q P F S S T+ +PC+S
Sbjct: 73 EFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSA 132
Query: 191 SCRILRESFPFGNCNS-KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
SC L P C S +C + Y D S + G AT+ T+ ++ G + G
Sbjct: 133 SCSDL----PTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPG------VVFG 182
Query: 250 CINNSSGDK-SGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
C + + GD S +G++GL R P+S++++ FSYCL S
Sbjct: 183 CGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTS 223
Score = 47.4 bits (111), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 53/121 (43%), Gaps = 7/121 (5%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSA--YETVVVPKIAIHFLGGV 354
IT+ + L+ AF +M A G LD C+ A + V VP++ HF GG
Sbjct: 304 ITYLEVQGYRALKKAFAAQMA-LPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGA 362
Query: 355 DLELDVRGTLVVASVS-QVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
DL+L +V+ S +CL T +GN QQ+ + YDV L F P
Sbjct: 363 DLDLPAENYMVLDGGSGALCL---TVMGSRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQ 419
Query: 414 C 414
C
Sbjct: 420 C 420
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 77/261 (29%), Positives = 120/261 (45%), Gaps = 33/261 (12%)
Query: 62 LEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRL----HLKNSRRLRKPFPEFLKRTEA 117
+ +V ++GPCS L PS EIL DQ R H ++ + P+ +R +
Sbjct: 92 MTIVHRHGPCSPL-AAAHRKPPSHGEILAADQNRAESIQHRVSTTATGRGKPKRSRRQQP 150
Query: 118 FTFPANINDTV---------------ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKP 162
+ PA Y + V +G P +++ DTGSD TW QC+P
Sbjct: 151 SSAPAPAASLSSSTASLPASSGRALGTGNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQP 210
Query: 163 C-IHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGS 221
C + C++QR+ F ++S T+ + C + +C L C+ C + +QY DGS S
Sbjct: 211 CVVVCYEQREKLFDPARSSTYANVSCAAPACSDLN----IHGCSGGHCLYGVQYGDGSYS 266
Query: 222 GGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTS 281
GF+A D +T+ Y F GC + G A+G++GL R S+ +T
Sbjct: 267 IGFFAMDTLTLSS-----YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDK 321
Query: 282 Y---FSYCLPSPYGSTGYITF 299
Y F++CLP+ TGY+ F
Sbjct: 322 YGGVFAHCLPARSTGTGYLDF 342
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 39/98 (39%), Positives = 55/98 (56%), Gaps = 1/98 (1%)
Query: 317 KKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGF 376
+ YKKA + LLDTCYD + V +P +++ F GG L++D G + AS SQVCL F
Sbjct: 423 RGYKKAPAVS-LLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASASQVCLAF 481
Query: 377 ATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
A + +GN Q + V YD+ + +GF PG C
Sbjct: 482 AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 73/201 (36%), Positives = 116/201 (57%), Gaps = 16/201 (7%)
Query: 116 EAFTFPANINDTVAD---EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDP 172
++F P + TV EY I ++G P V +LDTGSD+ W QC+PC C++Q P
Sbjct: 70 QSFVSPNSPETTVISALGEYLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTTP 129
Query: 173 FFYASKSKTFFKIPCNSTSCRILRESFPFGNCNS-KECPFNIQYADGSGSGGFWATDRIT 231
F +SKS+T+ +PC S +C+ ++ +F C+S K C ++I Y DGS S G + + +T
Sbjct: 130 IFDSSKSQTYKTLPCPSNTCQSVQGTF----CSSRKHCLYSIHYVDGSQSLGDLSVETLT 185
Query: 232 IQEANSNGYFTRYP-FLLGCIN-NSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYC 286
+ ++NG ++P ++GC N+ G + SGI+GL R P+S+IT+ + S FSYC
Sbjct: 186 L--GSTNGSPVQFPGTVIGCGRYNAIGIEEKNSGIVGLGRGPMSLITQLSPSTGGKFSYC 243
Query: 287 L-PSPYGSTGYITFGKTDTVN 306
L P ++ + FG V+
Sbjct: 244 LVPGLSTASSKLNFGNAAVVS 264
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/161 (37%), Positives = 89/161 (55%), Gaps = 7/161 (4%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ VV +G P++ + L++DTGSD+TW QC PC +C++Q+D F S S +F + C+S+
Sbjct: 15 EYFAVVGVGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDALFNPSSSSSFKVLDCSSS 74
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C L C S +C + Y DGS + G TD + + +A G LGC
Sbjct: 75 LCLNLDVM----GCLSNKCLYQADYGDGSFTMGELVTDNVVLDDAFGPGQVVLTNIPLGC 130
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTS---YFSYCLP 288
+++ G A+GI+GL R P+S + S FSYCLP
Sbjct: 131 GHDNEGTFGTAAGILGLGRGPLSFPNNLDASTRNIFSYCLP 171
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 49/108 (45%), Gaps = 5/108 (4%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
+R AF A + + DTCYD + ++ VP + HF G VD+ L +V
Sbjct: 264 VRDAFRAATMHLTSAADFK-IFDTCYDFTGMNSISVPTVTFHFQGDVDMRLPPSNYIVPV 322
Query: 368 SVSQV-CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
S + + C FA +GNVQQ+ V YD +++G P C
Sbjct: 323 SNNNIFCFAFAA---SMGPSVIGNVQQQSFRVIYDNVHKQIGLLPDQC 367
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 58/161 (36%), Positives = 89/161 (55%), Gaps = 13/161 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
E+ + ++IG P + ++DTGSD+ WTQCKPC+ CF Q P F S S T+ +PC+ST
Sbjct: 101 EFLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYAALPCSST 160
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLG 249
C L P C S +C + Y D S + G A + T+ + T+ P G
Sbjct: 161 LCSDL----PSSKCTSAKCGYTYTYGDSSSTQGVLAAETFTLAK-------TKLPDVAFG 209
Query: 250 CINNSSGDK-SGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
C + + GD + +G++GL R P+S++++ + FSYCL S
Sbjct: 210 CGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLNKFSYCLTS 250
Score = 48.5 bits (114), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 41/122 (33%), Positives = 57/122 (46%), Gaps = 8/122 (6%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDL--SAYETVVVPKIAIHFLGGV 354
IT+ + L+ AF +MK A G LDTC++ S + V VPK+ H L G
Sbjct: 331 ITYLELQGYRALKKAFAAQMK-LPAADGSGIGLDTCFEAPASGVDQVEVPKLVFH-LDGA 388
Query: 355 DLELDVRGTLVVASVS-QVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
DL+L +V+ S S +CL T +GN QQ+ + YDV L F P
Sbjct: 389 DLDLPAENYMVLDSGSGALCL---TVMGSRGLSIIGNFQQQNIQFVYDVGENTLSFAPVQ 445
Query: 414 CS 415
C+
Sbjct: 446 CA 447
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/160 (37%), Positives = 88/160 (55%), Gaps = 7/160 (4%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + +AIG P Y + ++DTGSD+ WTQC PC+ C Q P+F KS T+ +PC S+
Sbjct: 88 EYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSS 147
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGY-FTRYPFLLG 249
C L +C K C + Y D + + G A + T ANS T F G
Sbjct: 148 RCASLSSP----SCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAF--G 201
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
C + ++GD + +SG++G R P+S++++ S FSYCL S
Sbjct: 202 CGSLNAGDLANSSGMVGFGRGPLSLVSQLGPSRFSYCLTS 241
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/160 (37%), Positives = 88/160 (55%), Gaps = 7/160 (4%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + +AIG P Y + ++DTGSD+ WTQC PC+ C Q P+F KS T+ +PC S+
Sbjct: 88 EYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSS 147
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGY-FTRYPFLLG 249
C L +C K C + Y D + + G A + T ANS T F G
Sbjct: 148 RCASLSSP----SCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAF--G 201
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
C + ++GD + +SG++G R P+S++++ S FSYCL S
Sbjct: 202 CGSLNAGDLANSSGMVGFGRGPLSLVSQLGPSRFSYCLTS 241
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 92/329 (27%), Positives = 142/329 (43%), Gaps = 49/329 (14%)
Query: 126 DTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKI 185
D A Y + ++IG P S+L DTGS + WTQC PC C + P F + S TF K+
Sbjct: 84 DNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKL 143
Query: 186 PCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
PC S+ C+ L P+ CN+ C + Y G + G+ AT+ + + A+ G
Sbjct: 144 PCASSLCQFLTS--PYRTCNATGCVYYYPYGMGF-TAGYLATETLHVGGASFPG------ 194
Query: 246 FLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL---------PSPYGSTGY 296
GC + +G + +SGI+GL RSP+S++++ + FSYCL P +GS
Sbjct: 195 VTFGC-STENGVGNSSSGIVGLGRSPLSLVSQVGVARFSYCLRSNADAGDSPILFGSLAK 253
Query: 297 ITFGKTDTVNTLR------SAFH---------------KRMKKYKKAKGLEDLLDTCYD- 334
+T G + L S+++ M G D C+D
Sbjct: 254 VTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPMAMANLTTVNGTRFGFDLCFDA 313
Query: 335 --LSAYETVVVPKIAIHFLGGVDLELDVR---GTLVVASVSQVCLGFATYPPDPNSIT-- 387
V VP + + F GG + + R G + V S + + P ++
Sbjct: 314 TAAGGGGGVPVPTLVLRFAGGAEYAVRRRSYFGVVEVDSQGRAAVECLLVLPASEKLSIS 373
Query: 388 -LGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+GNV Q V YD+ G F P +C+
Sbjct: 374 IIGNVMQMDLHVLYDLDGGMFSFAPADCA 402
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 65/182 (35%), Positives = 100/182 (54%), Gaps = 16/182 (8%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y V IG + ++++DT S++TW QC+PC C Q++P F S S ++ +PCNS+S
Sbjct: 113 YVATVGIGGGE--ATVIVDTASELTWVQCEPCDACHDQQEPLFDPSSSPSYAAVPCNSSS 170
Query: 192 CRILRESFPFGN--CNSK--ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C LR + C+ + C + + Y DGS S G A DR+++ + G F+
Sbjct: 171 CDALRVATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAHDRLSLAGEDIQG------FV 224
Query: 248 LGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL-PSPYGSTGYITFGKTD 303
GC ++ G G SG+MGL RS +S+I++T + FSYCL P GS+G + G
Sbjct: 225 FGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPPKESGSSGSLVLGDDA 284
Query: 304 TV 305
+V
Sbjct: 285 SV 286
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 37/110 (33%), Positives = 61/110 (55%), Gaps = 3/110 (2%)
Query: 307 TLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL-- 364
+R+ F ++ +Y +A +LDTC+DL+ V VP + + F GG ++E+D +G L
Sbjct: 353 AVRAEFVSQLAEYPQAAPFS-ILDTCFDLTGLREVQVPSLKLVFDGGAEVEVDSKGVLYV 411
Query: 365 VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V SQVCL A+ + ++ +GN QQ+ V +D G ++GF C
Sbjct: 412 VTGDASQVCLALASLKSEYDTPIIGNYQQKNLRVIFDTVGSQIGFAQETC 461
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 73/210 (34%), Positives = 105/210 (50%), Gaps = 26/210 (12%)
Query: 97 HLKNS---RRLRKPFPEF---LKRTEAFTFPANINDTVAD-----------EYYIVVAIG 139
H+KN RLR+ L R A A N TV D E+ + +AIG
Sbjct: 60 HVKNLTRFERLRRGVARGKNRLHRLNAMVLAA-ANATVGDQVKAPVVAGNGEFLMKLAIG 118
Query: 140 EPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESF 199
P + S ++DTGSD+ WTQCKPC CF Q P F +S +F+KI C+S C L
Sbjct: 119 SPPRSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGAL---- 174
Query: 200 PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL-LGCINNSSGDK 258
P C+S C + Y D S + G A + T ++ + P L GC N+++GD
Sbjct: 175 PTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQI--SIPGLGFGCGNDNNGDG 232
Query: 259 -SGASGIMGLDRSPVSIITRTNTSYFSYCL 287
S +G++GL R P+S++++ F+YCL
Sbjct: 233 FSQGAGLVGLGRGPLSLVSQLKEQKFAYCL 262
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 52/120 (43%), Gaps = 7/120 (5%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSA-YETVVVPKIAIHFLGGVD 355
IT+ + +L++ F +M G L D C++L A V VPK+ HF G D
Sbjct: 344 ITYVENSAFTSLKNEFIAQMNLPVDDSGTGGL-DLCFNLPAGTNQVEVPKLTFHF-KGAD 401
Query: 356 LELDVRGTLVVAS-VSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
LEL ++ S +CL + GN+QQ+ V +D+ L F P C
Sbjct: 402 LELPGENYMIGDSKAGLLCLAIGS---SRGMSIFGNLQQQNFMVVHDLQEETLSFLPTQC 458
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 93/177 (52%), Gaps = 15/177 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCI-HCFQQRDPFFYASKSKTFFKIPCNS 189
E+ + V G P Q +++ DTGSDV+W QC PC HC++Q DP F +KS T+ +PC
Sbjct: 134 EFVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSVVPCGH 193
Query: 190 TSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLL 248
C S C++ C + ++Y DGS S G + + +++ + P F
Sbjct: 194 PQCAAADGS----KCSNGTCLYKVEYGDGSSSAGVLSHETLSLTSTRA------LPGFAF 243
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKT 302
GC + GD G++GL R +S+ ++ S+ FSYCLPS + GY+T G T
Sbjct: 244 GCGQTNLGDFGDVDGLIGLGRGQLSLSSQAAASFGGTFSYCLPSDNTTHGYLTIGPT 300
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 57/121 (47%), Gaps = 4/121 (3%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T+ + LR F M +YK A D DTCYD + + +P ++ F G
Sbjct: 357 LTYLPPEAYTALRDRFKFTMTQYKPAPAY-DPFDTCYDFTGQSAIFIPAVSFKFSDGSVF 415
Query: 357 ELDVRGTLVV---ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
+L G L+ + + CLGF P +GN+QQR EV YDVA ++GF +
Sbjct: 416 DLSFFGILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASAS 475
Query: 414 C 414
C
Sbjct: 476 C 476
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 73/210 (34%), Positives = 105/210 (50%), Gaps = 26/210 (12%)
Query: 97 HLKNS---RRLRKPFPE---FLKRTEAFTFPANINDTVAD-----------EYYIVVAIG 139
H+KN RLR+ L R A A N TV D E+ + +AIG
Sbjct: 315 HVKNLTRFERLRRGVARGKNRLHRLNAMVLAA-ANATVGDQVKAPVVAGNGEFLMKLAIG 373
Query: 140 EPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESF 199
P + S ++DTGSD+ WTQCKPC CF Q P F +S +F+KI C+S C L
Sbjct: 374 SPPRSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGAL---- 429
Query: 200 PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL-LGCINNSSGDK 258
P C+S C + Y D S + G A + T ++ + P L GC N+++GD
Sbjct: 430 PTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQ--ISIPGLGFGCGNDNNGDG 487
Query: 259 -SGASGIMGLDRSPVSIITRTNTSYFSYCL 287
S +G++GL R P+S++++ F+YCL
Sbjct: 488 FSQGAGLVGLGRGPLSLVSQLKEQKFAYCL 517
Score = 39.7 bits (91), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 52/120 (43%), Gaps = 7/120 (5%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSA-YETVVVPKIAIHFLGGVD 355
IT+ + +L++ F +M G L D C++L A V VPK+ HF G D
Sbjct: 599 ITYVENSAFTSLKNEFIAQMNLPVDDSGTGGL-DLCFNLPAGTNQVEVPKLTFHF-KGAD 656
Query: 356 LELDVRGTLVVAS-VSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
LEL ++ S +CL + GN+QQ+ V +D+ L F P C
Sbjct: 657 LELPGENYMIGDSKAGLLCLAIGS---SRGMSIFGNLQQQNFMVVHDLQEETLSFLPTQC 713
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 74/211 (35%), Positives = 105/211 (49%), Gaps = 21/211 (9%)
Query: 85 LEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQY 144
L+ +++ + RL +RL F EA N E+ + +AIG P +
Sbjct: 61 LQRAMKRGKLRL-----QRLSAKTASFESSVEAPVHAGN------GEFLMKLAIGTPAET 109
Query: 145 VSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNC 204
S ++DTGSD+ WTQCKPC CF Q P F KS +F K+PC+S C L P +C
Sbjct: 110 YSAIMDTGSDLIWTQCKPCKDCFDQPTPIFDPKKSSSFSKLPCSSDLCAAL----PISSC 165
Query: 205 NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGI 264
S C + Y D S + G AT+ +A+ ++ F G N+ SG GA G+
Sbjct: 166 -SDGCEYLYSYGDYSSTQGVLATETFAFGDAS----VSKIGFGCGEDNDGSGFSQGA-GL 219
Query: 265 MGLDRSPVSIITRTNTSYFSYCLPSPYGSTG 295
+GL R P+S+I++ FSYCL S S G
Sbjct: 220 VGLGRGPLSLISQLGEPKFSYCLTSMDDSKG 250
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 61/163 (37%), Positives = 87/163 (53%), Gaps = 5/163 (3%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + +AIG P Q VS LLDTGSD+ WTQC PC C Q DP F +S ++ + C
Sbjct: 101 EYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPGESASYEPMRCAGQ 160
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C + G C + Y DG+ + G +AT+R T + + T P GC
Sbjct: 161 LC---SDILHHGCEMPDTCTYRYNYGDGTMTMGVYATERFTFTSSGGDRLMT-VPLGFGC 216
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGS 293
+ + G + SGI+G R+P+S++++ + FSYCL S YGS
Sbjct: 217 GSMNVGSLNNGSGIVGFGRNPLSLVSQLSIRRFSYCLTS-YGS 258
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 76/261 (29%), Positives = 119/261 (45%), Gaps = 33/261 (12%)
Query: 62 LEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRL----HLKNSRRLRKPFPEFLKRTEA 117
+ +V ++GPCS L PS EIL DQ R H ++ + P+ +R +
Sbjct: 90 MTIVHRHGPCSPL-AAAHRKPPSHGEILAADQNRAESIQHRVSTTATGRGKPKRSRRQQP 148
Query: 118 FTFPANINDTV---------------ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKP 162
+ PA Y + V +G P +++ DTGSD TW QC+P
Sbjct: 149 SSAPAPAASLSSSTASLPASSGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQP 208
Query: 163 C-IHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGS 221
C + C++Q++ F +S T+ + C + +C L C+ C + +QY DGS S
Sbjct: 209 CVVVCYEQQEKLFDPVRSSTYANVSCAAPACSDLN----IHGCSGGHCLYGVQYGDGSYS 264
Query: 222 GGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTS 281
GF+A D +T+ Y F GC + G A+G++GL R S+ +T
Sbjct: 265 IGFFAMDTLTLSS-----YDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDK 319
Query: 282 Y---FSYCLPSPYGSTGYITF 299
Y F++CLP+ TGY+ F
Sbjct: 320 YGGVFAHCLPARSTGTGYLDF 340
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 39/98 (39%), Positives = 55/98 (56%), Gaps = 1/98 (1%)
Query: 317 KKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGF 376
+ YKKA + LLDTCYD + V +P +++ F GG L++D G + AS SQVCL F
Sbjct: 421 RGYKKAPAVS-LLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASASQVCLAF 479
Query: 377 ATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
A + +GN Q + V YD+ + +GF PG C
Sbjct: 480 AANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGVC 517
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 102/202 (50%), Gaps = 16/202 (7%)
Query: 94 QRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGS 153
+R + SRRL++ L P D EY + ++IG P Q S ++DTGS
Sbjct: 61 ERAVERGSRRLQR-LEAMLNGPSGVETPVYAGD---GEYLMNLSIGTPAQPFSAIMDTGS 116
Query: 154 DVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNI 213
D+ WTQC+PC CF Q P F S +F +PC+S C+ L+ C++ C +
Sbjct: 117 DLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALQSP----TCSNNSCQYTY 172
Query: 214 QYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSG-ASGIMGLDRSPV 272
Y DGS + G T+ +T G + GC N+ G G +G++G+ R P+
Sbjct: 173 GYGDGSETQGSMGTETLTF------GSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPL 226
Query: 273 SIITRTNTSYFSYCLPSPYGST 294
S+ ++ + + FSYC+ +P GS+
Sbjct: 227 SLPSQLDVTKFSYCM-TPIGSS 247
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 102/202 (50%), Gaps = 16/202 (7%)
Query: 94 QRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGS 153
+R + SRRL++ L P D EY + ++IG P Q S ++DTGS
Sbjct: 61 ERAVERGSRRLQR-LEAMLNGPSGVETPVYAGD---GEYLMNLSIGTPAQPFSAIMDTGS 116
Query: 154 DVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNI 213
D+ WTQC+PC CF Q P F S +F +PC+S C+ L+ C++ C +
Sbjct: 117 DLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALQSP----TCSNNSCQYTY 172
Query: 214 QYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSG-ASGIMGLDRSPV 272
Y DGS + G T+ +T G + GC N+ G G +G++G+ R P+
Sbjct: 173 GYGDGSETQGSMGTETLTF------GSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPL 226
Query: 273 SIITRTNTSYFSYCLPSPYGST 294
S+ ++ + + FSYC+ +P GS+
Sbjct: 227 SLPSQLDVTKFSYCM-TPIGSS 247
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 111 bits (277), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 88/328 (26%), Positives = 143/328 (43%), Gaps = 66/328 (20%)
Query: 147 LLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNS 206
LL+DTGSD+TW QC PC C++Q+D F + S T+ +PCNST C+ L +SF +C +
Sbjct: 3 LLIDTGSDITWIQCDPCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQL-QSFSH-SCLN 60
Query: 207 KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLGCINNSSGDKSGASGIM 265
C + + Y D S + G +A + +T++ ++ P F GC + + G +GA+G+M
Sbjct: 61 SSCNYMVSYGDKSTTRGDFALETLTLRSDDT--ILVSVPNFAFGCGHANKGLFNGAAGLM 118
Query: 266 GLDRSPVSIITRTNTSY---FSYCLPSPYGS--TGYITFGKTDTVN-------------- 306
GL +S + +T+ ++ FSYCLPS + +G + FG+ ++
Sbjct: 119 GLGKSSIGFPAQTSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDYDVRFTPLVDSSSG 178
Query: 307 -----------------------------TLRSAFHKRMKKYKKAKGLEDL--LDTCYDL 335
T+ S F + + + + L L T +
Sbjct: 179 PSQYFVSMTGINVGDELLPISATVMVDSGTVISRFEQSAYERLRDAFTQILPGLQTAVSV 238
Query: 336 SAYET---------VVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSI 386
+ ++T + +P I +HF +L L L +C FA P
Sbjct: 239 APFDTCFRVSTVDDINIPLITLHFRDDAELRLSPVHILYPVDDGVMCFAFA--PSSSGRS 296
Query: 387 TLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
LGN QQ+ YD+ RLG C
Sbjct: 297 VLGNFQQQNLRFVYDIPKSRLGISAFEC 324
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 151/357 (42%), Gaps = 83/357 (23%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
E+ + + IG P V + DTGSD+TWTQC PC CF Q P F +S ++ K+ C S
Sbjct: 89 EFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASD 148
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
+CR L ES+ G + + C + Y D S + G A+D+ITI G F ++GC
Sbjct: 149 TCRSL-ESYHCGP-DLQSCSYGYSYGDRSFTYGDLASDQITI------GSFKLPKTVIGC 200
Query: 251 INNSSGDKSGAS-GIMGLDRSPVSIITRTNT-----SYFSYCLPSPYGS---TGYITFGK 301
+ + G G + GI+GL +S++++ T FSYCLP+ + + TG I+FG+
Sbjct: 201 GHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGR 260
Query: 302 TDTVNTLR-------------------SAFHKRMKKYKKAKGLEDL-------------- 328
V+ + A K++K A G+ +
Sbjct: 261 KAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSGTTL 320
Query: 329 ------------------------------LDTCYDLSAYETVVVPKIAIHFLGGVDLEL 358
L+ CY + + +P I HF GG D++L
Sbjct: 321 TLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGADVKL 380
Query: 359 DVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
T + + CL FA P GN+ Q EV YD+ +RL F P C+
Sbjct: 381 LPVNTFAPVADNVTCLTFA---PATQVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA 434
>gi|359476199|ref|XP_003631804.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 421
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 89/279 (31%), Positives = 132/279 (47%), Gaps = 44/279 (15%)
Query: 33 HIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQD 92
H VSSLLP N C+ + QG L + KYGPCS + PS +EI +D
Sbjct: 42 HSTPVSSLLPKNKCSASARGGSQG-----LPITQKYGPCSGSGH---SQPPSPQEIFGRD 93
Query: 93 QQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADE---YYIVVAIGEPKQYVSLLL 149
+ R+ NS+ + N+ + DE + + VA G P Q L+L
Sbjct: 94 ESRVSFINSKCNQYTSGNLKNHAH--------NNNLFDEDGNFLVDVAFGTPPQNFMLIL 145
Query: 150 DTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKEC 209
DTGS +TWTQCK C++C Q +F S S T+ C P + E
Sbjct: 146 DTGSSITWTQCKACVNCLQDSHRYFNWSASSTYSSGSC-----------IP----GTVEN 190
Query: 210 PFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD-KSGASGIMGLD 268
+N+ Y D S S G + D +T++ ++ F ++ F GC N+ GD SG G++GL
Sbjct: 191 NYNMTYGDDSTSVGNYGCDTMTLEPSD---VFQKFQF--GCGRNNKGDFGSGVDGMLGLG 245
Query: 269 RSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDT 304
+ +S +++T + + FSYCLP S G + FG+ T
Sbjct: 246 QGQLSTVSQTASKFNKVFSYCLPEE-DSIGSLLFGEKAT 283
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 64/180 (35%), Positives = 95/180 (52%), Gaps = 11/180 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + + IG P +Y S +LDTGSD+ WTQC PC+ C Q PFF ++S ++ K+PCNS
Sbjct: 88 EYLMSMGIGTPPRYYSAILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSYAKLPCNSP 147
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C L +P C C + Y D + + G + + T ++ R F GC
Sbjct: 148 MCNALY--YPL--CYRNVCVYQYFYGDSANTAGVLSNETFTFGTNDTRVTVPRIAF--GC 201
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLP---SPYGSTGYITFGKTDTVNT 307
N ++G SG++G R P+S++++ + FSYCL SP S Y FG T+N+
Sbjct: 202 GNLNAGSLFNGSGMVGFGRGPLSLVSQLGSPRFSYCLTSFMSPVPSRLY--FGAYATLNS 259
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 61/178 (34%), Positives = 93/178 (52%), Gaps = 7/178 (3%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + V IG P +Y S ++DTGSD+ WTQC PC+ C +Q P+F +KS ++ +PC+S
Sbjct: 87 EYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSSA 146
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C L F N C + Y D + S G A + T ++ R F GC
Sbjct: 147 MCNALYSPLCFQN----ACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSF--GC 200
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYG-STGYITFGKTDTVNT 307
N ++G SG++G R +S++++ + FSYCL S +T + FG T+N+
Sbjct: 201 GNMNAGTLFNGSGMVGFGRGALSLVSQLGSPRFSYCLTSFMSPATSRLYFGAYATLNS 258
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 58/160 (36%), Positives = 88/160 (55%), Gaps = 6/160 (3%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + +AIG P + ++DTGSD+ WTQC PC+ C Q P+F ++S T+ +PC S
Sbjct: 91 EYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSP 150
Query: 191 SCRILRESFPFGNCNSKE-CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
C L P+ C + C + Y D + + G A++ T ANS+ G
Sbjct: 151 LCAAL----PYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVS-DVAFG 205
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
C N +SG + +SG++GL R P+S++++ S FSYCL S
Sbjct: 206 CGNINSGQLANSSGMVGLGRGPLSLVSQLGPSRFSYCLTS 245
Score = 43.5 bits (101), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 27/122 (22%), Positives = 54/122 (44%), Gaps = 6/122 (4%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVV--PKIAIHFLGGV 354
+T+ + D + +R ++ E L+TC+ +V V P + +HF GG
Sbjct: 328 LTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETCFPWPPPPSVAVTVPDMELHFDGGA 387
Query: 355 DLELDVRGTLVV-ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
++ + +++ + +CL ++ +GN QQ+ + YD+A L F P
Sbjct: 388 NMTVPPENYMLIDGATGFLCLAMIR---SGDATIIGNYQQQNMHILYDIANSLLSFVPAP 444
Query: 414 CS 415
C+
Sbjct: 445 CN 446
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 61/178 (34%), Positives = 93/178 (52%), Gaps = 7/178 (3%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + V IG P +Y S ++DTGSD+ WTQC PC+ C +Q P+F +KS ++ +PC+S
Sbjct: 84 EYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSSA 143
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C L F N C + Y D + S G A + T ++ R F GC
Sbjct: 144 MCNALYSPLCFQN----ACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSF--GC 197
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYG-STGYITFGKTDTVNT 307
N ++G SG++G R +S++++ + FSYCL S +T + FG T+N+
Sbjct: 198 GNMNAGTLFNGSGMVGFGRGALSLVSQLGSPRFSYCLTSFMSPATSRLYFGAYATLNS 255
>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
Length = 453
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 57/182 (31%), Positives = 98/182 (53%), Gaps = 9/182 (4%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + + IG P+ Y S +DT SD+ W QC+PC+ C++Q DP F S ++ +PC+S
Sbjct: 87 EYLVKLGIGTPQHYFSAAIDTASDLVWLQCQPCVSCYRQLDPIFNPRLSSSYAVVPCSSD 146
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
+C L + + + C +N +Y+ + + G A D++ + G + +LGC
Sbjct: 147 TCSQL-DGHRCDEDDDQACRYNYKYSGNAVTNGTLAIDKLAV------GGNVFHAVVLGC 199
Query: 251 INNS-SGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGST-GYITFGKTDTVNTL 308
++S G ASG++GL R P+S++++ + F YCLP P T G + G + +
Sbjct: 200 SDSSVGGPPPQASGLVGLARGPLSLLSQLSVRRFMYCLPPPMSRTPGKLVLGAGAGADAV 259
Query: 309 RS 310
R+
Sbjct: 260 RN 261
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 68/214 (31%), Positives = 110/214 (51%), Gaps = 15/214 (7%)
Query: 86 EEILRQDQQRLHLKN--SRRLRKPFPEFLKRTEAFTFPANINDTVAD-EYYIVVAIGEPK 142
EE ++ RL K S + P L + + P N ++ YY+ + +G P
Sbjct: 76 EEHVKALSDRLANKGLGSGSAKPPKSGHLLEPNSASIPLNPGLSIGSGNYYVKLGLGTPP 135
Query: 143 QYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESF-- 199
+Y +++LDTGS ++W QC+PC ++C Q DP + S SKT+ K+ C S C L+ +
Sbjct: 136 KYYAMILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLN 195
Query: 200 -PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDK 258
P +S C + Y D S S G+ + D +T+ + + FT GC ++ G
Sbjct: 196 DPLCETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQTLPQFT-----YGCGQDNQGLF 250
Query: 259 SGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
A+GI+GL R +S++ + +T Y FSYCLP+
Sbjct: 251 GRAAGIIGLARDKLSMLAQLSTKYGHAFSYCLPT 284
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 48/131 (36%), Positives = 62/131 (47%), Gaps = 2/131 (1%)
Query: 285 YCLPSPYGSTGYITFGKTDTVNTLRSAFHKRMK-KYKKAKGLEDLLDTCYDLSAYETVVV 343
Y +P+ S IT LR AF K M KY KA +LDTC+ S V
Sbjct: 341 YRVPTLIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYS-ILDTCFKGSLKSISAV 399
Query: 344 PKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVA 403
P+I + F GG DL L L+ A CL FA +GN QQ+ + + YDV+
Sbjct: 400 PEIKMIFQGGADLTLRAPSILIEADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVS 459
Query: 404 GRRLGFGPGNC 414
R+GF PG+C
Sbjct: 460 TSRIGFAPGSC 470
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 58/160 (36%), Positives = 88/160 (55%), Gaps = 6/160 (3%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + +AIG P + ++DTGSD+ WTQC PC+ C Q P+F ++S T+ +PC S
Sbjct: 91 EYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSP 150
Query: 191 SCRILRESFPFGNCNSKE-CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
C L P+ C + C + Y D + + G A++ T ANS+ G
Sbjct: 151 LCAAL----PYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVS-DVAFG 205
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
C N +SG + +SG++GL R P+S++++ S FSYCL S
Sbjct: 206 CGNINSGQLANSSGMVGLGRGPLSLVSQLGPSRFSYCLTS 245
Score = 43.5 bits (101), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 27/122 (22%), Positives = 54/122 (44%), Gaps = 6/122 (4%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVV--PKIAIHFLGGV 354
+T+ + D + +R ++ E L+TC+ +V V P + +HF GG
Sbjct: 328 LTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETCFPWPPPPSVAVTVPDMELHFDGGA 387
Query: 355 DLELDVRGTLVV-ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
++ + +++ + +CL ++ +GN QQ+ + YD+A L F P
Sbjct: 388 NMTVPPENYMLIDGATGFLCLAMIR---SGDATIIGNYQQQNMHILYDIANSLLSFVPAP 444
Query: 414 CS 415
C+
Sbjct: 445 CN 446
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/161 (38%), Positives = 88/161 (54%), Gaps = 10/161 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY V +G P++ S+++DTGSD+TW QC PC C+ Q D F + S +F K+ C S
Sbjct: 12 EYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQNDALFLPNTSTSFTKLACGSA 71
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLG 249
C L PF CN C + Y DGS + G + D IT+ NG + P F G
Sbjct: 72 LCNGL----PFPMCNQTTCVYWYSYGDGSLTTGDFVYDTITMD--GINGQKQQVPNFAFG 125
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
C +++ G +GA GI+GL + P+S ++ + Y FSYCL
Sbjct: 126 CGHDNEGSFAGADGILGLGQGPLSFHSQLKSVYNGKFSYCL 166
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 63/134 (47%), Gaps = 17/134 (12%)
Query: 292 GSTGYITFGKTDTVNTLRSAFHKRMKKYKKA------KGLEDL--LDTCYDLSAY---ET 340
G G I F TV L A +K + A + ++D+ LD C LS + +
Sbjct: 234 GGAGTI-FDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKIDDISRLDLC--LSGFPKDQL 290
Query: 341 VVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHY 400
VP + HF GG D+ L + SQ T PD N I G+VQQ+ +V+Y
Sbjct: 291 PTVPAMTFHFEGG-DMVLPPSNYFIYLESSQSYCFAMTSSPDVNII--GSVQQQNFQVYY 347
Query: 401 DVAGRRLGFGPGNC 414
D AGR+LGF P +C
Sbjct: 348 DTAGRKLGFVPKDC 361
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 64/181 (35%), Positives = 92/181 (50%), Gaps = 16/181 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ +V +G P L++DTGSD+ W QC PC C+ QR F +S T+ ++PC+S
Sbjct: 85 EYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSP 144
Query: 191 SCRILRESFP---FGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
CR LR FP G C + + Y DGS S G ATD++ T
Sbjct: 145 QCRALR--FPGCDSGGAAGGGCRYMVAYGDGSSSTGELATDKLAFANDTYVNNVT----- 197
Query: 248 LGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL---PSPYGSTGYITFGK 301
LGC ++ G A+G++G+ R +SI T+ +Y F YCL S + Y+ FG+
Sbjct: 198 LGCGRDNEGLFDSAAGLLGVARGKISISTQVAPAYGSVFEYCLGDRTSRSTRSSYLVFGR 257
Query: 302 T 302
T
Sbjct: 258 T 258
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 36/99 (36%), Positives = 46/99 (46%), Gaps = 9/99 (9%)
Query: 324 GLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELD-------VRGTLVVASVSQVCLGF 376
G + D CYDL P I +HF GG D+ L V G A+ + CLGF
Sbjct: 349 GEHSVFDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGF 408
Query: 377 ATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
D +GNVQQ+G V +DV R+GF P C+
Sbjct: 409 EA--ADDGLSVIGNVQQQGFRVVFDVEKERIGFAPKGCT 445
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 64/181 (35%), Positives = 92/181 (50%), Gaps = 16/181 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ +V +G P L++DTGSD+ W QC PC C+ QR F +S T+ ++PC+S
Sbjct: 85 EYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSP 144
Query: 191 SCRILRESFP---FGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
CR LR FP G C + + Y DGS S G ATD++ T
Sbjct: 145 QCRALR--FPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFANDTYVNNVT----- 197
Query: 248 LGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL---PSPYGSTGYITFGK 301
LGC ++ G A+G++G+ R +SI T+ +Y F YCL S + Y+ FG+
Sbjct: 198 LGCGRDNEGLFDSAAGLLGVGRGKISISTQVAPAYGSVFEYCLGDRTSRSTRSSYLVFGR 257
Query: 302 T 302
T
Sbjct: 258 T 258
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 36/99 (36%), Positives = 46/99 (46%), Gaps = 9/99 (9%)
Query: 324 GLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELD-------VRGTLVVASVSQVCLGF 376
G + D CYDL P I +HF GG D+ L V G A+ + CLGF
Sbjct: 349 GEHSVFDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGF 408
Query: 377 ATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
D +GNVQQ+G V +DV R+GF P C+
Sbjct: 409 EA--ADDGLSVIGNVQQQGFRVVFDVEKERIGFAPKGCT 445
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 59/154 (38%), Positives = 85/154 (55%), Gaps = 12/154 (7%)
Query: 138 IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRE 197
IG P S ++DTGSD+ WTQCKPC+ CF+Q P F S S T+ +PC+S SC L
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDL-- 230
Query: 198 SFPFGNCNS-KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSG 256
P C S +C + Y D S + G AT+ T+ ++ G + GC + + G
Sbjct: 231 --PTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPG------VVFGCGDTNEG 282
Query: 257 DK-SGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
D S +G++GL R P+S++++ FSYCL S
Sbjct: 283 DGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTS 316
Score = 46.6 bits (109), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 53/121 (43%), Gaps = 7/121 (5%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSA--YETVVVPKIAIHFLGGV 354
IT+ + L+ AF +M A G LD C+ A + V VP++ HF GG
Sbjct: 397 ITYLEVQGYRALKKAFAAQMA-LPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGA 455
Query: 355 DLELDVRGTLVVASVS-QVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
DL+L +V+ S +CL T +GN QQ+ + YDV L F P
Sbjct: 456 DLDLPAENYMVLDGGSGALCL---TVMGSRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQ 512
Query: 414 C 414
C
Sbjct: 513 C 513
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 61/165 (36%), Positives = 90/165 (54%), Gaps = 8/165 (4%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + +A+G P Q ++ LLDTGSD+ WTQC C C +Q DP F S ++ + C
Sbjct: 97 EYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQ 156
Query: 191 SC-RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
C IL S C + Y DG+ + G++AT+R T A+S+G P G
Sbjct: 157 LCGDILHHS----CVRPDTCTYRYSYGDGTTTLGYYATERFTF--ASSSGETQSVPLGFG 210
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGST 294
C + G + ASGI+G R P+S++++ + FSYCL +PY S+
Sbjct: 211 CGTMNVGSLNNASGIVGFGRDPLSLVSQLSIRRFSYCL-TPYASS 254
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 61/165 (36%), Positives = 90/165 (54%), Gaps = 8/165 (4%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + +A+G P Q ++ LLDTGSD+ WTQC C C +Q DP F S ++ + C
Sbjct: 97 EYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQ 156
Query: 191 SC-RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
C IL S C + Y DG+ + G++AT+R T A+S+G P G
Sbjct: 157 LCGDILHHS----CVRPDTCTYRYSYGDGTTTLGYYATERFTF--ASSSGETQSVPLGFG 210
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGST 294
C + G + ASGI+G R P+S++++ + FSYCL +PY S+
Sbjct: 211 CGTMNVGSLNNASGIVGFGRDPLSLVSQLSIRRFSYCL-TPYASS 254
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 97/180 (53%), Gaps = 15/180 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V+ +G+P + +++DTGSD+ W QC PC C++Q P + SKT +IPC S
Sbjct: 91 EYFAVIGVGDPPTHALVVIDTGSDLIWLQCLPCRRCYRQVTPLYDPRNSKTHRRIPCASP 150
Query: 191 SCR-ILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
CR +LR +P + + C + + Y DGS S G ATD + + + T LG
Sbjct: 151 QCRGVLR--YPGCDARTGGCVYMVVYGDGSASSGDLATDTLVLPDDTRVHNVT-----LG 203
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL----PSPYGSTGYITFGKT 302
C +++ G + A+G++G R +S T+ +Y FSYCL S+ Y+ FG+T
Sbjct: 204 CGHDNEGLLASAAGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRMSRARNSSSYLVFGRT 263
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 52/122 (42%), Gaps = 11/122 (9%)
Query: 303 DTVNTLRSAF--HKRMKKYKKAKGLEDLLDTCYDLSAY---ETVVVPKIAIHFLGGVDLE 357
D +R AF H ++ + + DTCYD+ V VP I +HF D+
Sbjct: 331 DAYAAVRDAFVSHAAAAGMRRLRNKFSVFDTCYDVHGNGPGTGVRVPSIVLHFAAAADMA 390
Query: 358 LDVRGTLVVA----SVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
L L+ + CLG D LGNVQQ+G V +DV R+GF P
Sbjct: 391 LPQANYLIPVVGGDRRTYFCLGLQAA--DDGLNVLGNVQQQGFGVVFDVERGRIGFTPNG 448
Query: 414 CS 415
CS
Sbjct: 449 CS 450
>gi|110740049|dbj|BAF01928.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
Length = 183
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 53/108 (49%), Positives = 68/108 (62%), Gaps = 1/108 (0%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
LRS+F +M KY G+ +LDTC+DLS ++TV +PK+A F GG +EL +G V
Sbjct: 77 LRSSFKAKMSKYPTTSGVS-ILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVF 135
Query: 368 SVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+SQVCL FA D N+ GNVQQ+ EV YD AG R+GF P CS
Sbjct: 136 KISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 183
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 70/209 (33%), Positives = 106/209 (50%), Gaps = 11/209 (5%)
Query: 85 LEEILRQDQQRLHLKNSRRLRKPFP-EFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQ 143
+ +R+ + R ++ R R F + ++T A P + + EY + +AIG P Q
Sbjct: 50 IRRAMRRSKARAAALSAVRNRARFSGKNEQQTPAGVLPVRPSGDL--EYVVDLAIGTPPQ 107
Query: 144 YVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCR-ILRESFPFG 202
VS LLDTGSD+ WTQC PC C Q DP F +S ++ + C T C IL S
Sbjct: 108 PVSALLDTGSDLIWTQCAPCASCLSQPDPLFAPGQSASYEPMRCAGTLCSDILHHS---- 163
Query: 203 NCNSKE-CPFNIQYADGSGSGGFWATDRITIQEA-NSNGYFTRYPFLLGCINNSSGDKSG 260
C + C + Y DG+ + G +AT+R T + T P GC + + G +
Sbjct: 164 -CERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLGFGCGSVNVGSLNN 222
Query: 261 ASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
SGI+G R+P+S++++ + FSYCL S
Sbjct: 223 GSGIVGFGRNPLSLVSQLSIRRFSYCLTS 251
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 62/161 (38%), Positives = 87/161 (54%), Gaps = 10/161 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
E+ + ++IG P S ++DTGSD+ WTQCKPC CF Q P F KS ++ K+ C+S
Sbjct: 106 EFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSG 165
Query: 191 SCRILRESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
C L P NCN + C + Y D S + G AT+ T ++ NS + F
Sbjct: 166 LCNAL----PRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENS---ISGIGFGC 218
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
G N G G SG++GL R P+S+I++ + FSYCL S
Sbjct: 219 GVENEGDGFSQG-SGLVGLGRGPLSLISQLKETKFSYCLTS 258
Score = 45.8 bits (107), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 55/121 (45%), Gaps = 9/121 (7%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDL-SAYETVVVPKIAIHFLGGVD 355
IT+ + L+ F RM G L D C+ L A + + VPK+ HF G D
Sbjct: 344 ITYLEETAFKVLKEEFTSRMSLPVDDSGSTGL-DLCFKLPDAAKNIAVPKMIFHF-KGAD 401
Query: 356 LELDVRGTLVVASVSQV-CLGFATYPPDPNSITL-GNVQQRGHEVHYDVAGRRLGFGPGN 413
LEL +V S + V CL + N +++ GNVQQ+ V +D+ + F P
Sbjct: 402 LELPGENYMVADSSTGVLCLAMGS----SNGMSIFGNVQQQNFNVLHDLEKETVSFVPTE 457
Query: 414 C 414
C
Sbjct: 458 C 458
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 58/160 (36%), Positives = 95/160 (59%), Gaps = 21/160 (13%)
Query: 145 VSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI-LRESFPF-G 202
+++++DTGSD+TW QCKPC C+ QRDP F S S ++ +PCN+++C L+ + G
Sbjct: 177 LTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPG 236
Query: 203 NC----------NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCIN 252
+C S+ C +++ Y DGS S G ATD + + A+ +G F+ GC
Sbjct: 237 SCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGASVDG------FVFGCGL 290
Query: 253 NSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
++ G G +G+MGL R+ +S++++T + FSYCLP+
Sbjct: 291 SNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPA 330
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 51/90 (56%), Gaps = 2/90 (2%)
Query: 328 LLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASV--SQVCLGFATYPPDPNS 385
LLD CY+L+ ++ V VP + + GG D+ +D G L +A SQVCL A+ + +
Sbjct: 434 LLDACYNLTGHDEVKVPLLTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQT 493
Query: 386 ITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+GN QQ+ V YD G RLGF +CS
Sbjct: 494 PIIGNYQQKNKRVVYDTVGSRLGFADEDCS 523
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 58/163 (35%), Positives = 89/163 (54%), Gaps = 15/163 (9%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
E+ + ++IG P + ++DTGSD+ WTQCKPC+ CF Q P F S S T+ +PC+S+
Sbjct: 117 EFLMDMSIGTPALAYAAIVDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYSTLPCSSS 176
Query: 191 SCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FL 247
C L P C S K+C + Y D S + G A + T+ + T+ P
Sbjct: 177 LCSDL----PTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTLAK-------TKLPGVA 225
Query: 248 LGCINNSSGDK-SGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
GC + + GD + +G++GL R P+S++++ FSYCL S
Sbjct: 226 FGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLGKFSYCLTS 268
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 58/160 (36%), Positives = 95/160 (59%), Gaps = 21/160 (13%)
Query: 145 VSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI-LRESFPF-G 202
+++++DTGSD+TW QCKPC C+ QRDP F S S ++ +PCN+++C L+ + G
Sbjct: 176 LTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPG 235
Query: 203 NC----------NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCIN 252
+C S+ C +++ Y DGS S G ATD + + A+ +G F+ GC
Sbjct: 236 SCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGASVDG------FVFGCGL 289
Query: 253 NSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
++ G G +G+MGL R+ +S++++T + FSYCLP+
Sbjct: 290 SNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPA 329
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 51/90 (56%), Gaps = 2/90 (2%)
Query: 328 LLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASV--SQVCLGFATYPPDPNS 385
LLD CY+L+ ++ V VP + + GG D+ +D G L +A SQVCL A+ + +
Sbjct: 433 LLDACYNLTGHDEVKVPLLTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQT 492
Query: 386 ITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+GN QQ+ V YD G RLGF +CS
Sbjct: 493 PIIGNYQQKNKRVVYDTVGSRLGFADEDCS 522
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 64/167 (38%), Positives = 90/167 (53%), Gaps = 10/167 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + +A+G P Q VS LLDTGSD+ WTQC PC C Q DP F S ++ + C
Sbjct: 103 EYLVDLAVGTPPQPVSALLDTGSDLIWTQCAPCASCLPQPDPIFSPGASSSYEPMRCAGE 162
Query: 191 SCR-ILRESFPFGNCNSKE-CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRY--PF 246
C IL S C + C + Y DG+ + G +AT+R T ++S G T+ P
Sbjct: 163 LCNDILHHS-----CQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPL 217
Query: 247 LLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGS 293
GC + G + SGI+G R+P+S++++ FSYCL +PY S
Sbjct: 218 GFGCGTMNKGSLNNGSGIVGFGRAPLSLVSQLAIRRFSYCL-TPYAS 263
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 77/226 (34%), Positives = 111/226 (49%), Gaps = 25/226 (11%)
Query: 85 LEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQY 144
L+ +++ + RL +RL F EA N E+ + +AIG P +
Sbjct: 61 LQRAVKRGRLRL-----QRLSAKTASFEPSVEAPVHAGN------GEFLMNLAIGTPAET 109
Query: 145 VSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNC 204
S ++DTGSD+ WTQCKPC CF Q P F KS +F K+PC+S C L P +C
Sbjct: 110 YSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLCVAL----PISSC 165
Query: 205 NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDK-SGASG 263
S C + Y D S + G AT+ T +A+ ++ F GC ++ G S +G
Sbjct: 166 -SDGCEYRYSYGDHSSTQGVLATETFTFGDAS----VSKIGF--GCGEDNRGRAYSQGAG 218
Query: 264 IMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITF--GKTDTVNT 307
++GL R P+S+I++ FSYCL S S G T G TV +
Sbjct: 219 LVGLGRGPLSLISQLGVPKFSYCLTSIDDSKGISTLLVGSEATVKS 264
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 65/207 (31%), Positives = 107/207 (51%), Gaps = 16/207 (7%)
Query: 87 EILRQDQQRLHLKNSRRL-RKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYV 145
++L++ +R H + SR + R + + P + + E+ + VAIG P
Sbjct: 57 QLLQRAARRSHHRMSRLVARATGVKAVAGGGDLQVPVHAGN---GEFLMDVAIGTPALSY 113
Query: 146 SLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCN 205
+ ++DTGSD+ WTQCKPC+ CF+Q P F S S T+ +PC+S C L P C
Sbjct: 114 AAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSALCSDL----PTSTCT 169
Query: 206 S-KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL-LGCINNSSGDK-SGAS 262
S +C + Y D S + G A++ T+ + + P + GC + + GD + +
Sbjct: 170 SASKCGYTYTYGDASSTQGVLASETFTLGKEKK-----KLPGVAFGCGDTNEGDGFTQGA 224
Query: 263 GIMGLDRSPVSIITRTNTSYFSYCLPS 289
G++GL R P+S++++ FSYCL S
Sbjct: 225 GLVGLGRGPLSLVSQLGLDKFSYCLTS 251
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/122 (35%), Positives = 59/122 (48%), Gaps = 7/122 (5%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSA--YETVVVPKIAIHFLGGV 354
IT+ + L+ AF +M G E LD C+ A + V VPK+ +HF GG
Sbjct: 333 ITYLELQGYRALKKAFVAQMA-LPTVDGSEIGLDLCFQGPAKGVDEVQVPKLVLHFDGGA 391
Query: 355 DLELDVRGTLVVASVS-QVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
DL+L +V+ S S +CL T P +GN QQ+ + YDVAG L F P
Sbjct: 392 DLDLPAENYMVLDSASGALCL---TVAPSRGLSIIGNFQQQNFQFVYDVAGDTLSFAPVQ 448
Query: 414 CS 415
C+
Sbjct: 449 CN 450
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 77/226 (34%), Positives = 111/226 (49%), Gaps = 25/226 (11%)
Query: 85 LEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQY 144
L+ +++ + RL +RL F EA N E+ + +AIG P +
Sbjct: 61 LQRAVKRGRLRL-----QRLSAKTASFEPSVEAPVHAGN------GEFLMNLAIGTPAET 109
Query: 145 VSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNC 204
S ++DTGSD+ WTQCKPC CF Q P F KS +F K+PC+S C L P +C
Sbjct: 110 YSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLCVAL----PISSC 165
Query: 205 NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDK-SGASG 263
S C + Y D S + G AT+ T +A+ ++ F GC ++ G S +G
Sbjct: 166 -SDGCEYRYSYGDHSSTQGVLATETFTFGDAS----VSKIGF--GCGEDNRGRAYSQGAG 218
Query: 264 IMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITF--GKTDTVNT 307
++GL R P+S+I++ FSYCL S S G T G TV +
Sbjct: 219 LVGLGRGPLSLISQLGVPKFSYCLTSIDDSKGISTLLVGSEATVKS 264
>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
Length = 493
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 87/277 (31%), Positives = 143/277 (51%), Gaps = 26/277 (9%)
Query: 45 VCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQR---LHLKNS 101
VC+ +R A++ + ++GPCS L P+LEE L +D+ R +H K S
Sbjct: 51 VCSESRAPAVH----ATVPLHHRHGPCSPLPN---KKMPTLEERLHRDKLRAAYIHRKLS 103
Query: 102 RRLRKPFPE-----FLKRTEAFTFPANINDTVAD-EYYIVVAIGEP-KQYVSLLLDTGSD 154
R ++ ++++ A T P + ++ EY I V +G P + ++L+DTGSD
Sbjct: 104 RGKKQGGGGAGGDVVVQQSHAMTVPTTLGTSLDTLEYVITVRLGSPPGKSQTMLIDTGSD 163
Query: 155 VTWTQCKPCI-HCFQQRDPFFYASKSKTFFKIPCNSTSC-RILRESFPFGNCNSKECPFN 212
++W +CKPC C Q DP F S S T+ C+S +C ++ +E G +S +C +
Sbjct: 164 ISWVRCKPCWQQCRPQVDPLFDPSLSSTYSPFSCSSAACAQLFQEGNANGCSSSGQCQYI 223
Query: 213 IQYADGS-GSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSP 271
Y DGS G+ G +++D + + +NSN F GC + +G +G+MGL
Sbjct: 224 AMYGDGSVGTTGTYSSDTLAL-GSNSNTVVVSK-FRFGCSHAETGITGLTAGLMGLGGGA 281
Query: 272 VSIITRT----NTSYFSYCLPSPYGSTGYITFGKTDT 304
S++++T T+ FSYCLP S+G++T G T
Sbjct: 282 QSLVSQTAGTFGTTAFSYCLPPTPSSSGFLTLGAAGT 318
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 45/114 (39%), Positives = 62/114 (54%), Gaps = 5/114 (4%)
Query: 306 NTLRSAFHKRMKKYKKAKGLED--LLDTCYDLSAYETVVVPKIAIHF--LGGVDLELDVR 361
++L SAF MK+Y A LDTC+D+S +V +P +A+ F GG + LD
Sbjct: 380 SSLSSAFKAGMKQYPPAPSSAGGGFLDTCFDMSGQSSVSMPTVALVFSGAGGAVVNLDAS 439
Query: 362 GTLVVASVSQV-CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
G L+ S + CL F D ++ +GNVQQR +V YDVAG +GF G C
Sbjct: 440 GILLQMETSSIFCLAFVATSDDGSTGIIGNVQQRTFQVLYDVAGGAVGFKAGAC 493
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 95/180 (52%), Gaps = 15/180 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V+ +G+P +++DTGSD+ W QC PC HC++Q P + S T +IPC S
Sbjct: 87 EYFAVINVGDPPTRALVVIDTGSDLIWLQCVPCRHCYRQVTPLYDPRSSSTHRRIPCASP 146
Query: 191 SCR-ILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
CR +LR +P + + C + + Y DGS S G ATDR+ + T LG
Sbjct: 147 RCRDVLR--YPGCDARTGGCVYMVVYGDGSASSGDLATDRLVFPDDTHVHNVT-----LG 199
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYC----LPSPYGSTGYITFGKT 302
C +++ G A+G++G+ R +S T+ +Y FSYC L + Y+ FG+T
Sbjct: 200 CGHDNVGLLESAAGLLGVGRGQLSFPTQLAPAYGHVFSYCLGDRLSRAQNGSSYLVFGRT 259
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 52/124 (41%), Gaps = 13/124 (10%)
Query: 303 DTVNTLRSAFHKRMKKYKKAKGLE---DLLDTCYDL----SAYETVVVPKIAIHFLGGVD 355
D +R AF + L + D CYDL + V VP I +HF GG D
Sbjct: 327 DAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGAPAAAVRVPSIVLHFAGGAD 386
Query: 356 LELDVRGTLVVAS----VSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGP 411
+ L L+ + CLG D LGNVQQ+G + +DV R+GF P
Sbjct: 387 MALPQANYLIPVQGGDRRTYFCLGLQAA--DDGLNVLGNVQQQGFGLVFDVERGRIGFTP 444
Query: 412 GNCS 415
CS
Sbjct: 445 NGCS 448
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 61/161 (37%), Positives = 87/161 (54%), Gaps = 10/161 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
E+ + ++IG P + ++DTGSD+ WTQCKPC CF Q P F KS ++ K+ C+S
Sbjct: 107 EFLMELSIGNPAVKYAAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSG 166
Query: 191 SCRILRESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
C L P NCN + C + Y D S + G AT+ T ++ NS + F
Sbjct: 167 LCNAL----PRSNCNEDKDSCEYLYTYGDYSSTRGLLATETFTFEDENS---ISGIGFGC 219
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
G N G G SG++GL R P+S+I++ + FSYCL S
Sbjct: 220 GVENEGDGFSQG-SGLVGLGRGPLSLISQLKETKFSYCLTS 259
Score = 45.8 bits (107), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 56/121 (46%), Gaps = 9/121 (7%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDL-SAYETVVVPKIAIHFLGGVD 355
IT+ + L+ F RM G L D C+ L +A + + VPK+ HF G D
Sbjct: 345 ITYLEETAFKVLKEEFTSRMSLPVDDSGSTGL-DLCFKLPNAAKNIAVPKLIFHF-KGAD 402
Query: 356 LELDVRGTLVVASVSQV-CLGFATYPPDPNSITL-GNVQQRGHEVHYDVAGRRLGFGPGN 413
LEL +V S + V CL + N +++ GNVQQ+ V +D+ + F P
Sbjct: 403 LELPGENYMVADSSTGVLCLAMGS----SNGMSIFGNVQQQNFNVLHDLEKETVTFVPTE 458
Query: 414 C 414
C
Sbjct: 459 C 459
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 61/171 (35%), Positives = 90/171 (52%), Gaps = 7/171 (4%)
Query: 127 TVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIP 186
+V EY + +AIG P L DTGSD+TWTQC+PC CF Q P + S S TF +P
Sbjct: 72 SVQVEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVP 131
Query: 187 CNSTSCR-ILRESFPFGNCN--SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
C+S +C +LR NC+ S C + Y+DG+ S G T+ +T+ + +
Sbjct: 132 CSSATCLPVLRSR----NCSTPSSLCRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVSV 187
Query: 244 YPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGST 294
GC ++ GD ++G +GL R +S++ + FSYCL + ST
Sbjct: 188 SDVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCLTDFFNST 238
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 67/206 (32%), Positives = 105/206 (50%), Gaps = 24/206 (11%)
Query: 94 QRLHLKNSRRLRKPFPEFLKRTEA-FTFPANINDTV---ADEYYIVVAIGEPKQYVSLLL 149
+R + SRRL +R EA P+ + +V EY + ++IG P Q S ++
Sbjct: 61 ERAIERGSRRL--------QRLEAMLNGPSGVETSVYAGDGEYLMNLSIGTPAQPFSAIM 112
Query: 150 DTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKEC 209
DTGSD+ WTQC+PC CF Q P F S +F +PC+S C+ L C++ C
Sbjct: 113 DTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALSSP----TCSNNFC 168
Query: 210 PFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSG-ASGIMGLD 268
+ Y DGS + G T+ +T G + GC N+ G G +G++G+
Sbjct: 169 QYTYGYGDGSETQGSMGTETLTF------GSVSIPNITFGCGENNQGFGQGNGAGLVGMG 222
Query: 269 RSPVSIITRTNTSYFSYCLPSPYGST 294
R P+S+ ++ + + FSYC+ +P GS+
Sbjct: 223 RGPLSLPSQLDVTKFSYCM-TPIGSS 247
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 55/159 (34%), Positives = 86/159 (54%), Gaps = 5/159 (3%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + +AIG P Y + ++DTGSD+ WTQC PC+ C Q P+F +S T+ +PC S+
Sbjct: 88 EYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCAAQPTPYFDVKRSATYRALPCRSS 147
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C L +C K C + Y D + + G A + T A+S GC
Sbjct: 148 RCAALSSP----SCFKKMCVYQYYYGDTASTAGVLANETFTFGAASSTKVRAAN-ISFGC 202
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
+ ++G+ + +SG++G R P+S++++ S FSYCL S
Sbjct: 203 GSLNAGELANSSGMVGFGRGPLSLVSQLGPSRFSYCLTS 241
Score = 42.4 bits (98), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 41/89 (46%), Gaps = 7/89 (7%)
Query: 329 LDTCYDLSAYE--TVVVPKIAIHFLGGVDLELDVRGTLVVASVS-QVCLGFATYPPDPNS 385
LDTC+ TV VP HF G ++ L +++AS + +CL A P
Sbjct: 354 LDTCFQWPPPPNVTVTVPDFVFHF-DGANMTLPPENYMLIASTTGYLCLAMA---PTSVG 409
Query: 386 ITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+GN QQ+ + YD+A L F P C
Sbjct: 410 TIIGNYQQQNLHLLYDIANSFLSFVPAPC 438
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 74/259 (28%), Positives = 126/259 (48%), Gaps = 31/259 (11%)
Query: 60 ASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPF-------PEFL 112
+S+ + +YGPCS + P+ EE+LR+DQ R +R+ F
Sbjct: 60 SSVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADY-----IRRKFSGSNGTAAGED 114
Query: 113 KRTEAFTFPANINDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIH---CFQ 168
++ + P + ++ EY I V +G P +++DTGSDV+W QC+PC C
Sbjct: 115 GQSSKVSVPTTLGSSLDTLEYVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHA 174
Query: 169 QRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSK-ECPFNIQYADGSGSGGFWAT 227
F + S T+ C++ +C L +S C++K C + ++Y DGS + G +++
Sbjct: 175 HAGALFDPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSS 234
Query: 228 DRITIQEANSNGYFTRYPFLLGCINNSSG----DKSGASGIMGLDRSPVSIITRTNTSY- 282
D +T+ +G F GC + G DK+ G++GL S++++T Y
Sbjct: 235 DVLTL-----SGSDVVRGFQFGCSHAELGAGMDDKT--DGLIGLGGDAQSLVSQTAARYG 287
Query: 283 --FSYCLPSPYGSTGYITF 299
FSYCLP+ S+G++T
Sbjct: 288 KSFSYCLPATPASSGFLTL 306
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 45/107 (42%), Positives = 59/107 (55%), Gaps = 6/107 (5%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
L SAF M +Y +A+ L +LDTC++ + + V +P +A+ F GG ++LD G
Sbjct: 379 LSSAFRAGMTRYARAEPL-GILDTCFNFTGLDKVSIPTVALVFAGGAVVDLDAHGI---- 433
Query: 368 SVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
VS CL FA D T+GNVQQR EV YDV G GF G C
Sbjct: 434 -VSGGCLAFAPTRDDKAFGTIGNVQQRTFEVLYDVGGGVFGFRAGAC 479
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 75/222 (33%), Positives = 115/222 (51%), Gaps = 21/222 (9%)
Query: 101 SRRLRKPFPEFLKRTEAFTFPANINDTV-------ADEYYIVVAIGEPKQYVSLLLDTGS 153
+ RL F R F A +D + A EY + ++IG P V ++DTGS
Sbjct: 54 TERLTDAFHRSASRVGRFRQSAMTSDGIQSRLVPSAGEYIMNLSIGTPPVPVIAIVDTGS 113
Query: 154 DVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNC-NSKECPFN 212
D+TWTQC+PC HC++Q PFF S T+ C ++ C L +C N K+C F
Sbjct: 114 DLTWTQCRPCTHCYKQVVPFFDPKNSSTYRDSSCGTSFCLALGND---RSCRNGKKCTFM 170
Query: 213 IQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLGCINNSSGD-KSGASGIMGLDRS 270
YADGS +GG A + +T+ A++ G +P F GC++ S G +SGI+GL +
Sbjct: 171 YSYADGSFTGGNLAVETLTV--ASTAGKPVSFPGFAFGCVHRSGGIFDEHSSGIVGLGVA 228
Query: 271 PVSIITRTNTSY---FSYCLPSPYGSTGY---ITFGKTDTVN 306
+S+I++ ++ FSYCL + + I FG++ V+
Sbjct: 229 ELSMISQLKSTINGRFSYCLLPVFTDSSMSSRINFGRSGIVS 270
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 61/161 (37%), Positives = 88/161 (54%), Gaps = 10/161 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY V +G P++ S+++DTGSD+TW QC PC C+ Q D F + S +F K+ C +
Sbjct: 2 EYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQNDSLFIPNTSTSFTKLACGTE 61
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLG 249
C L P+ CN C + Y DGS S G + D IT+ NG + P F G
Sbjct: 62 LCNGL----PYPMCNQTTCVYWYSYGDGSLSTGDFVYDTITMD--GINGQKQQVPNFAFG 115
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
C +++ G +GA GI+GL + P+S ++ T + FSYCL
Sbjct: 116 CGHDNEGSFAGADGILGLGQGPLSFPSQLKTVFNGKFSYCL 156
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 6/109 (5%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYD-LSAYETVVVPKIAIHFLGGVDLELDVRGTLVV 366
+ +A + Y + LD C + + VP + HF GG D+EL +
Sbjct: 247 VLAAMNASTMDYPRKSDDSSGLDLCLGGFAEGQLPTVPSMTFHFEGG-DMELPPSNYFIF 305
Query: 367 ASVSQ-VCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
SQ C + P+ +G++QQ+ +V+YD GR++GF P +C
Sbjct: 306 LESSQSYCFSMVS---SPDVTIIGSIQQQNFQVYYDTVGRKIGFVPKSC 351
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/178 (34%), Positives = 91/178 (51%), Gaps = 15/178 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V +G P L++DTGSDV W QCKPC+HC++Q P + S T+ + PC+
Sbjct: 98 EYFASVGVGTPPTPALLVIDTGSDVVWLQCKPCVHCYRQLSPLYDPRGSSTYAQTPCSPP 157
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
CR + + + C + I Y D S + G ATDR+ S G T LGC
Sbjct: 158 QCRNPQTC----DGTTGGCGYRIVYGDASSTSGNLATDRLVFSNDTSVGNVT-----LGC 208
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTS---YFSYCLPS---PYGSTGYITFGKT 302
+++ G A+G++G+ R S T+ S YF+YCL S+ Y+ FG+T
Sbjct: 209 GHDNEGLFGSAAGLLGVARGNNSFATQVADSYGRYFAYCLGDRTRSGSSSSYLVFGRT 266
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 52/122 (42%), Gaps = 6/122 (4%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKY---KKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGG 353
IT D LR AF R K K +G+ + D CYDL P + +HF GG
Sbjct: 329 ITRFARDAYGALRDAFDARAAKVGMRKVGRGIS-VFDACYDLRGVAVADAPGVVLHFAGG 387
Query: 354 VDLELDVRGTLVVASVSQV-CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPG 412
D+ L LV + C D S+ +GNV Q+ V +DV R+GF P
Sbjct: 388 ADVALPPENYLVPEESGRYHCFALEAAGHDGLSV-IGNVLQQRFRVVFDVENERVGFEPN 446
Query: 413 NC 414
C
Sbjct: 447 GC 448
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 99/384 (25%), Positives = 156/384 (40%), Gaps = 80/384 (20%)
Query: 98 LKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTW 157
L + RL F L R+ A A + V + I IG P + DTGSD+TW
Sbjct: 49 LSHYDRLANAFRRSLSRSAALLNRAATSGAVGLQSSI---IGTPPVDYLGIADTGSDLTW 105
Query: 158 TQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKE-CPFNIQYA 216
QC PC+ C+QQ P F KS +F +PCN+ +C + + G+C + C ++ Y
Sbjct: 106 AQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDD----GHCGVQGVCDYSYTYG 161
Query: 217 DGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIIT 276
D + S G ++ITI ++ ++GC + SSG ASG++GL +S+++
Sbjct: 162 DRTYSKGDLGFEKITIGSSSVKS-------VIGCGHASSGGFGFASGVIGLGGGQLSLVS 214
Query: 277 RTNTS-----YFSYCLPSPYG-STGYITFGKTDTV------------------------- 305
+ + + FSYCLP+ + G I FG+ V
Sbjct: 215 QMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNTVTYYYITLEA 274
Query: 306 ----NTLRSAFHKR-------------------------MKKYKKAKGLED---LLDTCY 333
N AF K+ + K KAK ++D D C+
Sbjct: 275 ISIGNERHMAFAKQGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCF 334
Query: 334 D--LSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNV 391
D ++ + +P I F GG ++ L T + + CL P +GN+
Sbjct: 335 DDGINVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNL 394
Query: 392 QQRGHEVHYDVAGRRLGFGPGNCS 415
+ YD+ +RL F P C+
Sbjct: 395 ALANFLIGYDLEAKRLSFKPTVCT 418
>gi|297811183|ref|XP_002873475.1| hypothetical protein ARALYDRAFT_909036 [Arabidopsis lyrata subsp.
lyrata]
gi|297319312|gb|EFH49734.1| hypothetical protein ARALYDRAFT_909036 [Arabidopsis lyrata subsp.
lyrata]
Length = 292
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 79/242 (32%), Positives = 114/242 (47%), Gaps = 41/242 (16%)
Query: 202 GNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDK-SG 260
G+C+ C +++ Y D S S GF A ++ T+ ++ +F F GC N++GD G
Sbjct: 64 GSCSDSTCGYSVGYGDTSTSQGFVAKEKFTLMSSD---FFDGVNF--GCGENNTGDYYEG 118
Query: 261 ASGIMG---------------------LDRSPVSIITRTNTSYFSYC-----LPSPYGST 294
+G++G + SP N + C +PS ST
Sbjct: 119 VAGLLGNTSGHLTFGSTGISKSVKFTPVSSSPSKDFYYLNIEGITVCDKQLEIPSIESST 178
Query: 295 GYITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGV 354
L+SAF ++M KY + LDTCYD + +TV + KIA F GG
Sbjct: 179 -------PRAYAALKSAFKEKMSKYTITSSGDSELDTCYDFTGLKTVTITKIAFSFSGGT 231
Query: 355 DLELDVRGTLVVASV-SQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
+ELD +G L +S S++CL FA Y PD N G+VQQ+ +V YD G R+GF P
Sbjct: 232 VVELDPKGILYSSSERSKLCLAFAEY-PDDNVAIFGSVQQQTLQVVYDGVGGRVGFAPNG 290
Query: 414 CS 415
CS
Sbjct: 291 CS 292
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 81/244 (33%), Positives = 116/244 (47%), Gaps = 26/244 (10%)
Query: 58 DKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEA 117
D A+L + + + R G+ST E+LR+ R +++R L + A
Sbjct: 50 DAAALRLHATHADAGR---GLST-----RELLRRMAARSKARSARLLSG------RAASA 95
Query: 118 FTFPANINDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYA 176
P + D V D EY + +AIG P Q V L+LDTGSD+TWTQC PC+ CF+Q P F
Sbjct: 96 RMDPGSYTDGVPDTEYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNP 155
Query: 177 SKSKTFFKIPCNSTSCRILRESFPFGNCNSKE-----CPFNIQYADGSGSGGFWATDRIT 231
S+S TF +PC+ CR L S +C + C + YAD S + G +D +
Sbjct: 156 SRSMTFSVLPCDLRICRDLTWS----SCGEQSWGNGICVYAYAYADHSITTGHLDSDTFS 211
Query: 232 IQEANSNGYFTRYPFL-LGC-INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
A+ P L GC + N+ S +GI G R +S+ + FSYC +
Sbjct: 212 FASADHAIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTA 271
Query: 290 PYGS 293
GS
Sbjct: 272 ITGS 275
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 64/181 (35%), Positives = 94/181 (51%), Gaps = 15/181 (8%)
Query: 130 DEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNS 189
+EY + +A+G P++ V+L LDTGSD+ WTQC PC CF Q P + S T+ +PC +
Sbjct: 82 NEYLVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAASSTYAALPCGA 141
Query: 190 TSCRILRESFPFGNC------NSKECPFNIQYADGSGSGGFWATDRITIQEANSNG---Y 240
CR L PF +C N + C + Y D S + G ATDR T ++ +G +
Sbjct: 142 ARCRAL----PFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLH 197
Query: 241 FTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGS-TGYITF 299
R F G +N +S +GI G R S+ ++ N + FSYC S + S + +T
Sbjct: 198 TRRLTFGCGHLNKGV-FQSNETGIAGFGRGRWSLPSQLNVTSFSYCFTSMFESKSSLVTL 256
Query: 300 G 300
G
Sbjct: 257 G 257
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 81/244 (33%), Positives = 116/244 (47%), Gaps = 26/244 (10%)
Query: 58 DKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEA 117
D A+L + + + R G+ST E+LR+ R +++R L + A
Sbjct: 24 DAAALRLHATHADAGR---GLST-----RELLRRMAARSKARSARLLSG------RAASA 69
Query: 118 FTFPANINDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYA 176
P + D V D EY + +AIG P Q V L+LDTGSD+TWTQC PC+ CF+Q P F
Sbjct: 70 RMDPGSYTDGVPDTEYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNP 129
Query: 177 SKSKTFFKIPCNSTSCRILRESFPFGNCNSKE-----CPFNIQYADGSGSGGFWATDRIT 231
S+S TF +PC+ CR L S +C + C + YAD S + G +D +
Sbjct: 130 SRSMTFSVLPCDLRICRDLTWS----SCGEQSWGNGICVYAYAYADHSITTGHLDSDTFS 185
Query: 232 IQEANSNGYFTRYPFL-LGC-INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
A+ P L GC + N+ S +GI G R +S+ + FSYC +
Sbjct: 186 FASADHAIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTA 245
Query: 290 PYGS 293
GS
Sbjct: 246 ITGS 249
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 91/330 (27%), Positives = 140/330 (42%), Gaps = 59/330 (17%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
Y + +AIG P ++ +LDTGSD+ WTQC PC CF Q P + ++S T+ + C S
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151
Query: 191 SCRILRESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
C+ L+ P+ C+ + C + Y DG+ + G AT+ T+ S+ F
Sbjct: 152 MCQALQS--PWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTL---GSDTAVRGVAF-- 204
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITR----------------------TNTSYFS-- 284
GC + G +SG++G+ R P+S++++ T TS
Sbjct: 205 GCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRPRRSCRARAAARGGGAPTTTSPLEGI 264
Query: 285 ----YCLP--------SPYGSTGYITFGKTDTVNTLRSAFHKRMK------KYKKAKGLE 326
LP +P G G I T AF + + A G
Sbjct: 265 TVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALEERAFVALARALASRVRLPLASGAH 324
Query: 327 DLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA--SVSQVCLGFATYPPDPN 384
L C+ ++ E V VP++ +HF G D+EL R + VV S CLG +
Sbjct: 325 LGLSLCFAAASPEAVEVPRLVLHF-DGADMELR-RESYVVEDRSAGVACLGMVSA---RG 379
Query: 385 SITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
LG++QQ+ + YD+ L F P C
Sbjct: 380 MSVLGSMQQQNTHILYDLERGILSFEPAKC 409
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 102/367 (27%), Positives = 158/367 (43%), Gaps = 72/367 (19%)
Query: 100 NSRRLRKPFPEFLKRTEAFTFPANINDTV-------ADEYYIVVAIGEPKQYVSLLLDTG 152
+ RL F + R F A +D + A EY + + IG P V ++DTG
Sbjct: 53 QAERLTDAFRRSVSRVGRFRPTAMTSDGIQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTG 112
Query: 153 SDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCN-SKECPF 211
SD+TWTQC+PC HC++Q P F S T+ C ++ C L + +C+ K+C F
Sbjct: 113 SDLTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCGTSFCLALGKD---RSCSKEKKCTF 169
Query: 212 NIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLGCINNSSG--DKSGASGIMGLD 268
YADGS +GG A++ +T+ ++ G +P F GC ++S G DKS +SGI+GL
Sbjct: 170 RYSYADGSFTGGNLASETLTVD--STAGKPVSFPGFAFGCGHSSGGIFDKS-SSGIVGLG 226
Query: 269 RSPVSIITRTNTS---YFSYCL--------------------PSPYGST----------- 294
+S+I++ ++ FSYCL S YG+
Sbjct: 227 GGELSLISQLKSTINGLFSYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPLRLPYKGY 286
Query: 295 --------GYITFGKTDTVNTLRSAFHKRMKKY-------KKAKGLEDLLDTCYDLSAYE 339
G I T L F+ +++K K+ + + CY+ +A
Sbjct: 287 SKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTA-- 344
Query: 340 TVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVH 399
+ P I HF ++EL T + VC T P + LGN+ Q V
Sbjct: 345 EINAPIITAHF-KDANVELQPLNTFMRMQEDLVCF---TVAPTSDIGVLGNLAQVNFLVG 400
Query: 400 YDVAGRR 406
+D+ +R
Sbjct: 401 FDLRKKR 407
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 96/373 (25%), Positives = 157/373 (42%), Gaps = 87/373 (23%)
Query: 116 EAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFY 175
E++ FP + E+ + + +G P Q +++DTGSD+TW Q +PC CF+Q DP F
Sbjct: 12 ESYEFPESAG---YGEFLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPCRACFEQADPIFD 68
Query: 176 ASKSKTFFKIPCNSTSCRILRESFPFGNCN-SKECPFNIQYADGSGSGGFWATDRITIQE 234
SKS T+ KI C+S++C L + C+ + C + Y DGS + G+++ + IT +
Sbjct: 69 PSKSSTYNKIACSSSACADLLGT---QTCSAAANCIYAYGYGDGSVTRGYFSKETITATD 125
Query: 235 ANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNT---SYFSYCL---- 287
F N + +G GI+GL + PVS+ ++ + + FSYCL
Sbjct: 126 TAGE----EVKFGASVYNTGTFGDTGGEGILGLGQGPVSMPSQLGSVLGNKFSYCLVDWL 181
Query: 288 ------PSPYGSTGYITFGKTDTVNTLRSAFHKRMKKYKKAKGLE---DLLDTCYDLSAY 338
+ Y + G+ + +A H Y +G+ LLD D S Y
Sbjct: 182 SAGSETSTMYFGDAAVPSGEVQYTPIVPNADHPTY-YYIAVQGISVGGSLLDI--DQSVY 238
Query: 339 E--------TVVVPKIAIHFL------------------------GGVDLELDVRG---- 362
E T++ I +L G+DL + RG
Sbjct: 239 EIDSGGSGGTIIDSGTTITYLQQEVFNALVAAYTSQVRYPTTTSATGLDLCFNTRGTGSP 298
Query: 363 --------------------TLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDV 402
T + + +CL FA+ P +I GN+QQ+ ++ YD+
Sbjct: 299 VFPAMTIHLDGVHLELPTANTFISLETNIICLAFASALDFPIAI-FGNIQQQNFDIVYDL 357
Query: 403 AGRRLGFGPGNCS 415
R+GF P +C+
Sbjct: 358 DNMRIGFAPADCA 370
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 101/391 (25%), Positives = 154/391 (39%), Gaps = 81/391 (20%)
Query: 94 QRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGS 153
QR H + + K P+ E F P + EY + + +G P Q +++DTGS
Sbjct: 5 QRSHERVAFYTLKLSPDAFGSQE-FQSPVKAGN---GEYLMTLTLGSPPQSFDVIVDTGS 60
Query: 154 DVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNI 213
D+ W QC PC C+QQ P F SKS++F K C C + + P C + C +
Sbjct: 61 DLNWVQCLPCRVCYQQPGPKFDPSKSRSFRKAACTDNLCNV--SALPLKACAANVCQYQY 118
Query: 214 QYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMG------- 266
Y D S + G A + I++ N G + F GC + G +GA+G++G
Sbjct: 119 TYGDQSNTNGDLAFETISLN--NGAGTQSVPNFAFGCGTQNLGTFAGAAGLVGLGQGPLS 176
Query: 267 ----------------------LDRSPV---SIITRTNTSYFSYCL-------------- 287
L SP+ SI N Y S +
Sbjct: 177 LNSQLSHTFANKFSYCLVSLNSLSASPLTFGSIAAAANIQYTSIVVNARHPTYYYVQLNS 236
Query: 288 -----------PSPYG---STG----YITFGKTDTVNTLRSAFHKRMKKYK------KAK 323
PS + STG I G T T+ TL A+ ++ Y+ +
Sbjct: 237 IEVGGQPLNLAPSVFAIDQSTGRGGTIIDSGTTITMLTL-PAYSAVLRAYESFVNYPRLD 295
Query: 324 GLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDP 383
G LD C++++ VP + F G D ++ V+ S L A
Sbjct: 296 GSAYGLDLCFNIAGVSNPSVPDMVFKFQG-ADFQMRGENLFVLVDTSATTLCLAMGGSQG 354
Query: 384 NSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
SI +GN+QQ+ H V YD+ +++GF +C
Sbjct: 355 FSI-IGNIQQQNHLVVYDLEAKKIGFATADC 384
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 68/202 (33%), Positives = 103/202 (50%), Gaps = 14/202 (6%)
Query: 90 RQDQQRLHLKNSRRLRKPFPEFLKRTE-AFTFPANINDTV-ADEYYIVVAIGEPKQYVSL 147
R+ +R+ L++ R P L + A P +D V EY + +AIG P Q V L
Sbjct: 51 RELMRRMALRSKARA----PRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQL 106
Query: 148 LLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSK 207
LDTGSD+ WTQC+PC CF Q P++ AS+S TF C+ST C++ N +
Sbjct: 107 TLDTGSDLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQ 166
Query: 208 ECPFNIQYADGSGSGGFWATDRIT-IQEANSNGYFTRYPFLLGC-INNSSGDKSGASGIM 265
C F+ Y D S + GF + ++ + A+ G + GC +NN+ +S +GI
Sbjct: 167 TCAFSYSYGDKSATIGFLDVETVSFVAGASVPG------VVFGCGLNNTGIFRSNETGIA 220
Query: 266 GLDRSPVSIITRTNTSYFSYCL 287
G R P+S+ ++ FS+C
Sbjct: 221 GFGRGPLSLPSQLKVGNFSHCF 242
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 78/241 (32%), Positives = 121/241 (50%), Gaps = 28/241 (11%)
Query: 61 SLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTF 120
++E++ + P S + TH + LR+ R N+ L EA F
Sbjct: 28 TVELIHRDSPKSPMYNSSETHFDRIVNALRRSSHR----NTVVLES------DTAEAPIF 77
Query: 121 PANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSK 180
EY + +++G P + + DTGSDV WTQCKPC +C+QQ P F SKS
Sbjct: 78 ------NNGGEYLVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQNAPMFDPSKST 131
Query: 181 TFFKIPCNSTSCRILRESFPFGNC-NSKECPFNIQYADGSGSGGFWATDRITIQEANSNG 239
T+ + C+S C + +C + EC ++I Y D S S G A D +T+Q +++G
Sbjct: 132 TYKNVACSSPVCSYSGDG---SSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQ--STSG 186
Query: 240 YFTRYP-FLLGCINNSSGD-KSGASGIMGLDRSPVSIITRTNTSY---FSYCL-PSPYGS 293
+P ++GC ++++G + SGI+GL R P S++T+ + FSYCL P GS
Sbjct: 187 RPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLIPIGTGS 246
Query: 294 T 294
T
Sbjct: 247 T 247
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/120 (28%), Positives = 55/120 (45%), Gaps = 8/120 (6%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSA--YETVVVPKIAIHFLGGV 354
+T+ + +N+ SA + M A+ + LD C+ + YE +P + +HF G
Sbjct: 318 LTYLPSALLNSFGSAISQSMS-LPHAQDPSEFLDYCFATTTDDYE---MPPVTMHF-EGA 372
Query: 355 DLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
D+ L V S +CL F ++P D N GN+ Q V YD+ + F P +C
Sbjct: 373 DVPLQRENLFVRLSDDTICLAFGSFPDD-NIFIYGNIAQSNFLVGYDIKNLAVSFQPAHC 431
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 67/204 (32%), Positives = 103/204 (50%), Gaps = 14/204 (6%)
Query: 86 EEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYV 145
E +R+D R+ + + + +F A + + V Y + +++G P
Sbjct: 44 SEAVRRDSHRIAFLSDATAAG---KATTTNSSVSFQALLENGVGG-YNMNISVGTPLLTF 99
Query: 146 SLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCN 205
S++ DTGSD+ WTQC PC CFQQ P F + S TF K+PC S+ C+ L S CN
Sbjct: 100 SVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSFCQFLPNSIR--TCN 157
Query: 206 SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIM 265
+ C +N +Y G + G+ AT+ + + +A+ F F GC + +G + SGI
Sbjct: 158 ATGCVYNYKYGSGY-TAGYLATETLKVGDAS----FPSVAF--GC-STENGVGNSTSGIA 209
Query: 266 GLDRSPVSIITRTNTSYFSYCLPS 289
GL R +S+I + FSYCL S
Sbjct: 210 GLGRGALSLIPQLGVGRFSYCLRS 233
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 77/256 (30%), Positives = 122/256 (47%), Gaps = 29/256 (11%)
Query: 63 EVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPA 122
E+V + P S L TH + +R+ R+H +RT A P
Sbjct: 34 ELVHRDSPKSPLYNSQQTHLQRWNKAMRRSVSRVH-------------HFQRTAATVSPK 80
Query: 123 NINDTV---ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKS 179
+ + EY + +++G P + + DTGSD+ WTQC PC C++Q P F S
Sbjct: 81 EVESEIIANGGEYLMSLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIAPLFDPKSS 140
Query: 180 KTFFKIPCNSTSCRILRESFPFGNCNSKE-CPFNIQYADGSGSGGFWATDRITIQEANSN 238
KT+ + C++ C+ L ES +C+S++ C ++ Y D S + G A D +T+ N
Sbjct: 141 KTYRDLSCDTRQCQNLGES---SSCSSEQLCQYSYYYGDRSFTNGNLAVDTVTLPSTNGG 197
Query: 239 G-YFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL----PSP 290
YF + G NN + DK SGI+GL P+S+I++ +S FSYCL
Sbjct: 198 PVYFPKTVIGCGRRNNGTFDKKD-SGIIGLGGGPMSLISQMGSSVGGKFSYCLVPFSSES 256
Query: 291 YGSTGYITFGKTDTVN 306
G++ + FG+ V+
Sbjct: 257 AGNSSKLHFGRNAVVS 272
Score = 42.4 bits (98), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 27/96 (28%), Positives = 44/96 (45%), Gaps = 6/96 (6%)
Query: 320 KKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATY 379
++ + LL CY + + VP I HF G D+ L T ++ S +CL F +
Sbjct: 349 ERTQDASGLLSHCYRPT--PDLKVPVITAHF-NGADVVLQTLNTFILISDDVLCLAFNS- 404
Query: 380 PPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ GNV Q + YD+ G+ + F P +C+
Sbjct: 405 --TQSGAIFGNVAQMNFLIGYDIQGKSVSFKPTDCT 438
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/156 (39%), Positives = 84/156 (53%), Gaps = 10/156 (6%)
Query: 136 VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRIL 195
++IG P S ++DTGSD+ WTQCKPC CF Q P F KS ++ K+ C+S C L
Sbjct: 3 LSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNAL 62
Query: 196 RESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINN 253
P NCN + C + Y D S + G AT+ T ++ NS + F G N
Sbjct: 63 ----PRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENS---ISGIGFGCGVENE 115
Query: 254 SSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
G G SG++GL R P+S+I++ + FSYCL S
Sbjct: 116 GDGFSQG-SGLVGLGRGPLSLISQLKETKFSYCLTS 150
Score = 46.2 bits (108), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 55/121 (45%), Gaps = 9/121 (7%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDL-SAYETVVVPKIAIHFLGGVD 355
IT+ + L+ F RM G L D C+ L A + + VPK+ HF G D
Sbjct: 236 ITYLEETAFKVLKEEFTSRMSLPVDDSGSTGL-DLCFKLPDAAKNIAVPKMIFHF-KGAD 293
Query: 356 LELDVRGTLVVASVSQV-CLGFATYPPDPNSITL-GNVQQRGHEVHYDVAGRRLGFGPGN 413
LEL +V S + V CL + N +++ GNVQQ+ V +D+ + F P
Sbjct: 294 LELPGENYMVADSSTGVLCLAMGS----SNGMSIFGNVQQQNFNVLHDLEKETVSFVPTE 349
Query: 414 C 414
C
Sbjct: 350 C 350
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 69/236 (29%), Positives = 113/236 (47%), Gaps = 20/236 (8%)
Query: 87 EILRQDQQRLHLKNSRRLRKPFPEFL---KRTEAFTFPANINDTVADEYYIVVAIGEPKQ 143
E+LR+ QR + RL P L R + A + + EY + + +G P+
Sbjct: 45 ELLRRAIQR----SRDRLASIAPRLLPTSSRNKVVVAEAPVL-SAGGEYLVKLGLGTPQH 99
Query: 144 YVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRIL---RESFP 200
+ +DT SD+ WTQC+PC+ C++Q DP F S ++ +PCNS +C L R +
Sbjct: 100 CFTAAIDTASDLIWTQCQPCVKCYKQLDPVFNPVASTSYAVVPCNSDTCDELDTHRCARD 159
Query: 201 FGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNS-SGDKS 259
+ + C + Y + + G A DR+ I + G + GC ++S G
Sbjct: 160 GDSDDEDACQYTYSYGGNATTRGILAVDRLAIGDDVFRG------VVFGCSSSSVGGPPP 213
Query: 260 GASGIMGLDRSPVSIITRTNTSYFSYCLPSPYG-STGYITFGKTDTVNTLRSAFHK 314
SG++GL R +S++++ + F YCLP P S G + G D T+R+A +
Sbjct: 214 QVSGVVGLGRGALSLVSQLSVRRFMYCLPPPVSRSAGRLVLG-ADAAATVRNASER 268
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 80/244 (32%), Positives = 115/244 (47%), Gaps = 26/244 (10%)
Query: 58 DKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEA 117
D A+L + + + R G+ST E+L + R +++R L + A
Sbjct: 50 DAAALRLHATHADAGR---GLST-----RELLHRMAARSKARSARLLSG------RAASA 95
Query: 118 FTFPANINDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYA 176
P + D V D EY + +AIG P Q V L+LDTGSD+TWTQC PC+ CF+Q P F
Sbjct: 96 RVDPGSYTDGVPDTEYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNP 155
Query: 177 SKSKTFFKIPCNSTSCRILRESFPFGNCNSKE-----CPFNIQYADGSGSGGFWATDRIT 231
S+S TF +PC+ CR L S +C + C + YAD S + G +D +
Sbjct: 156 SRSMTFSVLPCDLRICRDLTWS----SCGEQSWGNGICVYAYAYADHSITTGHLDSDTFS 211
Query: 232 IQEANSNGYFTRYPFL-LGC-INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
A+ P L GC + N+ S +GI G R +S+ + FSYC +
Sbjct: 212 FASADHAIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTA 271
Query: 290 PYGS 293
GS
Sbjct: 272 ITGS 275
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 68/225 (30%), Positives = 104/225 (46%), Gaps = 25/225 (11%)
Query: 74 LNQGISTHAPSLEEILRQDQQRLHL-----------KNSRRLRKPFPEFLKRTEAFTFPA 122
L+ G P L +L Q ++L + RR+R L+ + P
Sbjct: 31 LHHGQKRPQPGLRVVLEQVDSGMNLTKYELIKRAIKRGERRMRS-INAMLQSSSGIETPV 89
Query: 123 NINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTF 182
+ EY + VAIG P +S ++DTGSD+ WTQC+PC CF Q P F S +F
Sbjct: 90 YAG---SGEYLMNVAIGTPASSLSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSF 146
Query: 183 FKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFT 242
+PC S C+ L P +C +C + Y DGS + G+ AT+ T + ++
Sbjct: 147 STLPCESQYCQDL----PSESCY-NDCQYTYGYGDGSSTQGYMATETFTFETSS----VP 197
Query: 243 RYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
F G N G +GA G++G+ P+S+ ++ FSYC+
Sbjct: 198 NIAFGCGEDNQGFGQGNGA-GLIGMGWGPLSLPSQLGVGQFSYCM 241
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 54/119 (45%), Gaps = 4/119 (3%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDL-SAYETVVVPKIAIHFLGGVD 355
+T+ D N + AF ++ L TC+ L S TV VP+I++ F GGV
Sbjct: 319 LTYLPQDAYNAVAQAFTDQIN-LSPVDESSSGLSTCFQLPSDGSTVQVPEISMQFDGGV- 376
Query: 356 LELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L L L+ + +CL + SI GN+QQ+ +V YD+ + F P C
Sbjct: 377 LNLGEENVLISPAEGVICLAMGSSSQQGISI-FGNIQQQETQVLYDLQNLAVSFVPTQC 434
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 66/186 (35%), Positives = 94/186 (50%), Gaps = 12/186 (6%)
Query: 122 ANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKT 181
A +EY + +A+G P + V+L LDTGSD+ WTQC PC CF Q P + S T
Sbjct: 76 AAAGGIATNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFDQGIPLLDPAASST 135
Query: 182 FFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQE---ANSN 238
+ +PC + CR L PF +C + C + Y D S + G ATDR T + N +
Sbjct: 136 YAALPCGAPRCRAL----PFTSCGGRSCVYVYHYGDKSVTVGKIATDRFTFGDNGRRNGD 191
Query: 239 GYF--TRYPFLLGCINNSSGD-KSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGS-T 294
G TR GC + + G +S +GI G R S+ ++ N + FSYC S + S +
Sbjct: 192 GSLPATRR-LTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNATSFSYCFTSMFDSKS 250
Query: 295 GYITFG 300
+T G
Sbjct: 251 SIVTLG 256
>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
Length = 367
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 84/290 (28%), Positives = 134/290 (46%), Gaps = 20/290 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + + IG P + +DT SD+ WTQC+PC C+ Q DP F S T+ +PC+S
Sbjct: 88 EYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSD 147
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
+C L + G+ + + C + Y+ + + G A D++ I E G GC
Sbjct: 148 TCDEL-DVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRG------VAFGC 200
Query: 251 INNSSGDK--SGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFGKTDTVNTL 308
+S+G ASG++GL R P+S++++ + + + ST ITF + + L
Sbjct: 201 STSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRYGMII--DIAST--ITFLEASLYDEL 256
Query: 309 RSAFHKRMKKYKKAKGLEDLLDTCY---DLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+ + + + G LD C+ D A++ V VP +A+ F G L LD
Sbjct: 257 VNDLEVEI-RLPRGTGSSLGLDLCFILPDGVAFDRVYVPAVALAF-DGRWLRLDKARLFA 314
Query: 366 VASVS-QVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
S +CL SI LGN QQ+ +V Y++ R+ F C
Sbjct: 315 EDRESGMMCLMVGRAEAGSVSI-LGNFQQQNMQVLYNLRRGRVTFVQSPC 363
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 101/379 (26%), Positives = 145/379 (38%), Gaps = 55/379 (14%)
Query: 87 EILRQDQQRLHLKNSRRLRKPFPEFLKRT-EAFTFPANINDTVA-DEYYIVVAIGEPKQY 144
E+LR+ QR + + L R+ A P +D EY + +A G P Q
Sbjct: 41 ELLRRMAQRSKARATHLLSAQDQSGRGRSASAPVNPGAYDDGFPFTEYLVHLAAGTPPQE 100
Query: 145 VSLLLDTGSDVTWTQCK--PCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFG 202
V L LDTGSD+TWTQCK P CF Q P F S S +F +PC+S +C
Sbjct: 101 VQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCSSPACETTPPCGGGN 160
Query: 203 NCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL-GCINNSSGD-KSG 260
+ S+ C ++I Y DGS S G + T G P L+ GC + + G S
Sbjct: 161 DATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAVPGLVFGCGHANRGVFTSN 220
Query: 261 ASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGS-TGYITFGKTDTVNTLRSAFHKRMKKY 319
+GI G R +S+ ++ FS+C + GS T + G S +R Y
Sbjct: 221 ETGIAGFGRGSLSLPSQLKVGNFSHCFTTITGSKTSAVLLGLPGVAPPSASPLGRRRGSY 280
Query: 320 K---------KAKGLEDLLDTCYDLSAYETVV---------------------------- 342
+ + L Y E
Sbjct: 281 RCRSTPRSSNSGTSITSLPPRTYRAVREEFAAQVKLPVVPGNATDPFTCFSAPLRGPKPD 340
Query: 343 VPKIAIHFLGGV------DLELDVRGTLVVASVSQ-VCLGFATYPPDPNSITLGNVQQRG 395
VP +A+HF G + +V + S+ +CL + I LGN+QQ+
Sbjct: 341 VPTMALHFEGATMRLPQENYVFEVVDDDDAGNSSRIICLAVI----EGGEIILGNIQQQN 396
Query: 396 HEVHYDVAGRRLGFGPGNC 414
V YD+ +L F P C
Sbjct: 397 MHVLYDLQNSKLSFVPAQC 415
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 67/185 (36%), Positives = 92/185 (49%), Gaps = 23/185 (12%)
Query: 126 DTVAD--EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
DTV D EY + + IG P + +LDTGS+ WTQC PC+HC+ Q P F SKS TF
Sbjct: 57 DTVFDTYEYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFK 116
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
+I C++ + CP+ + Y S + G T+ +TI + S F
Sbjct: 117 EIRCDT---------------HDHSCPYELVYGGKSYTKGTLVTETVTIH-STSGQPFVM 160
Query: 244 YPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFG 300
++GC N+SG K G +G++GLDR P S+IT+ Y SYC T I FG
Sbjct: 161 PETIIGCGRNNSGFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGK--GTSKINFG 218
Query: 301 KTDTV 305
V
Sbjct: 219 ANAIV 223
Score = 42.0 bits (97), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 34/75 (45%), Gaps = 2/75 (2%)
Query: 342 VVPKIAIHFLGGVDLELDVRGTLVVASVSQV-CLGFATYPPDPNSITLGNVQQRGHEVHY 400
+ P I +HF GG DL LD V ++ V CL P +I GN Q V Y
Sbjct: 318 IFPVITMHFSGGADLVLDKYNMYVASNTGGVFCLAIICNSPIEEAI-FGNRAQNNFLVGY 376
Query: 401 DVAGRRLGFGPGNCS 415
D + + F P NCS
Sbjct: 377 DSSSLLVSFKPTNCS 391
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 69/209 (33%), Positives = 103/209 (49%), Gaps = 18/209 (8%)
Query: 102 RRLRKPFPEFLKRTEA--FTFPANINDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWT 158
R + +P + EA T P + ++ E+ + V G P Q +L+ DTGSDV+W
Sbjct: 87 RGIPISYPPTIPPAEAPAVTIPDSTGTSLGTLEFVVTVGFGTPAQTYTLMFDTGSDVSWI 146
Query: 159 QCKPCI-HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSK-ECPFNIQYA 216
QC PC HC++Q DP F +KS T+ +PC C G C+S C + +QY
Sbjct: 147 QCLPCSGHCYKQHDPIFDPTKSATYSAVPCGHPQCAAAG-----GKCSSNGTCLYKVQYG 201
Query: 217 DGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSI-- 274
DGS + G + + +++ A + F GC + GD G++GL R +S+
Sbjct: 202 DGSSTAGVLSHETLSLTSARALPGFA-----FGCGETNLGDFGDVDGLIGLGRGQLSLSS 256
Query: 275 -ITRTNTSYFSYCLPSPYGSTGYITFGKT 302
+ + FSYCLPS S GY+T G T
Sbjct: 257 QAAASFGAAFSYCLPSYNTSHGYLTIGTT 285
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 58/121 (47%), Gaps = 4/121 (3%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T+ + LR F M +YK A D DTCYD + + +P ++ F G
Sbjct: 343 LTYLPPEAYTALRDRFKFTMTQYKPAPAY-DPFDTCYDFAGQNAIFMPLVSFKFSDGSSF 401
Query: 357 ELDVRGTLVVASVSQVCLGFATYPPDPNSI---TLGNVQQRGHEVHYDVAGRRLGFGPGN 413
+L G L+ + G + P P+++ +GN QQR E+ YDVA ++GF G+
Sbjct: 402 DLSPFGVLIFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFVSGS 461
Query: 414 C 414
C
Sbjct: 462 C 462
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 101/358 (28%), Positives = 156/358 (43%), Gaps = 85/358 (23%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y + V +G + +SL++DTGSD+TW QC+PC C+ Q+ P + S S ++ + CNS++
Sbjct: 87 YIVTVELG--GKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSST 144
Query: 192 CRIL----RESFPFGNCN---SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRY 244
C+ L S P G N C + + Y DGS + G A++ I + +
Sbjct: 145 CQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLEN----- 199
Query: 245 PFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS-PYGSTGYITFG 300
F+ GC N+ G G+SG+MGL RS VS++++T ++ FSYCLPS G++G ++FG
Sbjct: 200 -FVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFG 258
Query: 301 KTDTVNT---------------LRSAF----------HKRMKKYKKAKGLEDLLDTCYDL 335
+V T LRS + +K +G+ L+D+ +
Sbjct: 259 NDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFGRGI--LIDSGTVI 316
Query: 336 SAYETVVVPKIAIHFL-------------------------------------GGVDLEL 358
+ + + I FL G +LE+
Sbjct: 317 TRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEV 376
Query: 359 DVRGT--LVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
DV G V S VCL A+ + +GN QQ+ V YD RLG NC
Sbjct: 377 DVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 434
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 67/185 (36%), Positives = 92/185 (49%), Gaps = 23/185 (12%)
Query: 126 DTVAD--EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
DTV D EY + + IG P + +LDTGS+ WTQC PC+HC+ Q P F SKS TF
Sbjct: 51 DTVFDTYEYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFK 110
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
+I C++ + CP+ + Y S + G T+ +TI + S F
Sbjct: 111 EIRCDT---------------HDHSCPYELVYGGKSYTKGTLVTETVTIH-STSGQPFVM 154
Query: 244 YPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFG 300
++GC N+SG K G +G++GLDR P S+IT+ Y SYC T I FG
Sbjct: 155 PETIIGCGRNNSGFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGK--GTSKINFG 212
Query: 301 KTDTV 305
V
Sbjct: 213 ANAIV 217
Score = 41.6 bits (96), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 27/75 (36%), Positives = 34/75 (45%), Gaps = 2/75 (2%)
Query: 342 VVPKIAIHFLGGVDLELDVRGTLVVASVSQV-CLGFATYPPDPNSITLGNVQQRGHEVHY 400
+ P I +HF GG DL LD V ++ V CL P +I GN Q V Y
Sbjct: 312 IFPVITMHFSGGADLVLDKYNMYVASNTGGVFCLAIICNSPIEEAI-FGNRAQNNFLVGY 370
Query: 401 DVAGRRLGFGPGNCS 415
D + + F P NCS
Sbjct: 371 DSSSLLVSFKPTNCS 385
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 68/198 (34%), Positives = 97/198 (48%), Gaps = 20/198 (10%)
Query: 128 VADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPC 187
V +EY + +A+G P + V+L LDTGSD+ WTQC PC CF Q P + S T+ +PC
Sbjct: 88 VTNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQGLPLLDPAASSTYAALPC 147
Query: 188 NSTSCRILRESFPFGNC----------NSKECPFNIQYADGSGSGGFWATDRITIQEANS 237
+ CR L PF +C ++ C + Y D S + G ATDR T N
Sbjct: 148 GAPRCRAL----PFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNG 203
Query: 238 NGYFTRYP---FLLGCINNSSGD-KSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGS 293
+G +R P GC + + G +S +GI G R S+ ++ N + FSYC S + S
Sbjct: 204 DGD-SRLPTRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNVTTFSYCFTSMFES 262
Query: 294 -TGYITFGKTDTVNTLRS 310
+ +T G L S
Sbjct: 263 KSSLVTLGGAPAAALLYS 280
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 92/359 (25%), Positives = 144/359 (40%), Gaps = 80/359 (22%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
+Y++ +G P Q SL++D+GSD+ W QC PC+ C+ Q P + S S TF +PC S
Sbjct: 64 QYFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPCLQCYAQDTPLYAPSNSSTFNPVPCLSP 123
Query: 191 SCRIL--RESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
C ++ E FP C + +YAD S S G +A + T+ + +
Sbjct: 124 ECLLIPATEGFPCDFHYPGACAYEYRYADTSLSKGVFAYESATVDDVRID------KVAF 177
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS---PYGSTGYITFGK- 301
GC ++ G + A G++GL + P+S ++ +Y F+YCL + P + ++ FG
Sbjct: 178 GCGRDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSSWLIFGDE 237
Query: 302 ----------TDTVNTLRSAFHKRMKKYKKAKGLEDL--------LD------TCYDLSA 337
T V+ R+ ++ K G E L LD + +D
Sbjct: 238 LISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLGNGGSIFDSGT 297
Query: 338 YETVVVPKIAIHFLG---------------GVDLELDVRGT------------------- 363
T +P + L G+DL +DV G
Sbjct: 298 TVTYWLPPAYRNILAAFDKNVRYPRAASVQGLDLCVDVTGVDQPSFPSFTIVLGGGAVFQ 357
Query: 364 ------LVVASVSQVCLGFATYPPDPNSI-TLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
V + + CL A P T+GN+ Q+ V YD R+GF P CS
Sbjct: 358 PQQGNYFVDVAPNVQCLAMAGLPSSVGGFNTIGNLLQQNFLVQYDREENRIGFAPAKCS 416
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 54/162 (33%), Positives = 86/162 (53%), Gaps = 9/162 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + + IG P + +DT SD+ WTQC+PC C+ Q DP F S T+ +PC+S
Sbjct: 88 EYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSD 147
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
+C L + G+ + + C + Y+ + + G A D++ I E G GC
Sbjct: 148 TCDEL-DVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRG------VAFGC 200
Query: 251 INNSSGDK--SGASGIMGLDRSPVSIITRTNTSYFSYCLPSP 290
+S+G ASG++GL R P+S++++ + F+YCLP P
Sbjct: 201 STSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRFAYCLPPP 242
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/170 (35%), Positives = 88/170 (51%), Gaps = 5/170 (2%)
Query: 127 TVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIP 186
+V EY + +AIG P L DTGSD+TWTQC+PC CF Q P + S S TF +P
Sbjct: 61 SVQVEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVP 120
Query: 187 CNSTSCRILRESFPFGNCN--SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRY 244
C+S +C S NC+ S C + Y+DG+ S G T+ +TI + +
Sbjct: 121 CSSATCLPTWRSR---NCSNPSSPCRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVG 177
Query: 245 PFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGST 294
GC ++ GD ++G +GL R +S++ + FSYCL + ST
Sbjct: 178 SVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCLTDFFNST 227
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 54/162 (33%), Positives = 86/162 (53%), Gaps = 9/162 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + + IG P + +DT SD+ WTQC+PC C+ Q DP F S T+ +PC+S
Sbjct: 88 EYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSD 147
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
+C L + G+ + + C + Y+ + + G A D++ I E G GC
Sbjct: 148 TCDEL-DVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRG------VAFGC 200
Query: 251 INNSSGDK--SGASGIMGLDRSPVSIITRTNTSYFSYCLPSP 290
+S+G ASG++GL R P+S++++ + F+YCLP P
Sbjct: 201 STSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRFAYCLPPP 242
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 66/204 (32%), Positives = 102/204 (50%), Gaps = 14/204 (6%)
Query: 86 EEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYV 145
E +R+D R+ + + + +F A + + V Y + +++G P
Sbjct: 44 SEAVRRDSHRIAFLSDATAAG---KATTTNSSVSFQALLENGVGG-YNMNISVGTPLLTF 99
Query: 146 SLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCN 205
++ DTGSD+ WTQC PC CFQQ P F + S TF K+PC S+ C+ L S CN
Sbjct: 100 PVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSFCQFLPNSIR--TCN 157
Query: 206 SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIM 265
+ C +N +Y G + G+ AT+ + + +A+ F F GC + +G + SGI
Sbjct: 158 ATGCVYNYKYGSGY-TAGYLATETLKVGDAS----FPSVAF--GC-STENGVGNSTSGIA 209
Query: 266 GLDRSPVSIITRTNTSYFSYCLPS 289
GL R +S+I + FSYCL S
Sbjct: 210 GLGRGALSLIPQLGVGRFSYCLRS 233
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 103/194 (53%), Gaps = 18/194 (9%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + +++G P Q S ++DTGSD+ W QC PC CF+Q DP F S ++ C +
Sbjct: 7 EYVLQISLGTPPQQFSAIVDTGSDLCWVQCAPCARCFEQPDPLFIPLASSSYSNASCTDS 66
Query: 191 SCRILRESFPFGNCNSKE-CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
C ++ P C+ + C ++ Y DGS + G +A + +T+ + R F G
Sbjct: 67 LC----DALPRPTCSMRNTCTYSYSYGDGSNTRGDFAFETVTLNGST----LARIGF--G 116
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL--PSPYGSTGYITFGKTDT 304
C +N G +GA G++GL + P+S+ ++ N+S+ FSYCL S G+ ITFG +
Sbjct: 117 CGHNQEGTFAGADGLIGLGQGPLSLPSQLNSSFTHIFSYCLVDQSTTGTFSPITFG--NA 174
Query: 305 VNTLRSAFHKRMKK 318
R++F ++
Sbjct: 175 AENSRASFTPLLQN 188
Score = 47.0 bits (110), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 54/106 (50%), Gaps = 8/106 (7%)
Query: 314 KRMKKYKKAKGLEDLLDTCYDLSAYE--TVVVPKIAIHFLGGVDLELDVRGTLVVASV-- 369
+R Y +A L+ CYD+S+ ++ +P + +H L VD E+ V V+
Sbjct: 250 RRQISYPEADPTPYGLNLCYDISSVSASSLTLPSMTVH-LTNVDFEIPVSNLWVLVDNFG 308
Query: 370 SQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
VC +T D SI +GNVQQ+ + + DVA R+GF +CS
Sbjct: 309 ETVCTAMST--SDQFSI-IGNVQQQNNLIVTDVANSRVGFLATDCS 351
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 86/162 (53%), Gaps = 7/162 (4%)
Query: 127 TVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIP 186
+V EY + +AIG+P L DTGSD+TWTQC+PC CF Q P + S S TF +P
Sbjct: 66 SVQVEYLMELAIGKPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPLP 125
Query: 187 CNSTSCRILRESFPFGNCN-SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C+S +C + NC S C + Y DG+ S G T+ +T+ +++
Sbjct: 126 CSSATCLPIWSR----NCTPSSLCRYRYAYGDGAYSAGILGTETLTLGPSSAPVSVGGVA 181
Query: 246 FLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
F GC ++ GD ++G +GL R +S++ + FSYCL
Sbjct: 182 F--GCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCL 221
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 36/130 (27%), Positives = 56/130 (43%), Gaps = 7/130 (5%)
Query: 292 GSTGYITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDL----LDT-CYDLSAYETVVVPKI 346
G+ G I T S F + + + + G + LD C+ A E +P +
Sbjct: 290 GTGGMIVDSGTTFTILAESGFREVVGRVARVLGQPPVNASSLDAPCFPAPAGEPPYMPDL 349
Query: 347 AIHFLGGVDLELDVRGTLVV-ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGR 405
+HF GG D+ L + S CL A P+ S+ LGN QQ+ ++ +D
Sbjct: 350 VLHFAGGADMRLYRDNYMSYNEEDSSFCLNIAGTTPESTSV-LGNFQQQNIQMLFDTTVG 408
Query: 406 RLGFGPGNCS 415
+L F P +CS
Sbjct: 409 QLSFLPTDCS 418
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 82/220 (37%), Positives = 124/220 (56%), Gaps = 14/220 (6%)
Query: 88 ILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSL 147
+L QDQ R+ ++R K K +A + A Y + +A+G PK +SL
Sbjct: 1 MLLQDQLRVKSMHARFSNKNAGSHFKEMQADIPVQSGIPLGAGNYLVKMALGTPKLSLSL 60
Query: 148 LLDTGSDVTWTQCKPCI-HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNS 206
LDTGSD+TWTQC+PC+ C++Q F KS ++ + C+S+SCRI+ +S C S
Sbjct: 61 ALDTGSDITWTQCEPCVGSCYRQAQTKFDPRKSSSYKNVSCSSSSCRIITDSGGARGCVS 120
Query: 207 KECPFNIQYADGSGSGGFWATDRITIQEAN--SNGYFTRYPFLLGCINNSSGDKSGASGI 264
C + +QY DGS S GF+AT+++TI ++ SN FL GC ++G +G+
Sbjct: 121 STCIYKVQYGDGSYSVGFFATEKLTISPSDVISN-------FLFGCGQQNAGRFGRIAGL 173
Query: 265 MGLDRSPVSIITRTNTSY---FSYCLPS-PYGSTGYITFG 300
+GL R +S+ +T+ Y F+YCLPS STG++T G
Sbjct: 174 LGLGRGKLSLALQTSEKYNNLFTYCLPSFSSSSTGHLTLG 213
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 44/111 (39%), Positives = 67/111 (60%), Gaps = 2/111 (1%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL- 364
+ L S F + MK Y K G +LDTCYD S E++ VP+I+ F GGV++++ G L
Sbjct: 278 SALSSKFQQLMKDYPKTDGFS-ILDTCYDFSGNESISVPRISFFFKGGVEVDIKFFGILT 336
Query: 365 VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
V+ + +VCL FA D + + GN QQ+ ++V +D+A R+GF P C+
Sbjct: 337 VINAWDKVCLAFAPNDDDGDFVVFGNSQQQTYDVVHDLAKGRIGFAPSGCN 387
>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
Length = 417
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 54/162 (33%), Positives = 86/162 (53%), Gaps = 9/162 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + + IG P + +DT SD+ WTQC+PC C+ Q DP F S T+ +PC+S
Sbjct: 88 EYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSD 147
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
+C L + G+ + + C + Y+ + + G A D++ I E G GC
Sbjct: 148 TCDEL-DVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRG------VAFGC 200
Query: 251 INNSSGDK--SGASGIMGLDRSPVSIITRTNTSYFSYCLPSP 290
+S+G ASG++GL R P+S++++ + F+YCLP P
Sbjct: 201 STSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRFAYCLPPP 242
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 74/257 (28%), Positives = 123/257 (47%), Gaps = 27/257 (10%)
Query: 60 ASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEF-------L 112
+S+ + +YGPCS + P+ EE+LR+DQ R + +R+ F
Sbjct: 33 SSVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLR-----ADYIRRKFSGSNGTAAGED 87
Query: 113 KRTEAFTFPANINDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIH---CFQ 168
++ + P + ++ EY I V +G P +++DTGSDV+W QC+PC C
Sbjct: 88 GQSSKVSVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHA 147
Query: 169 QRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSK-ECPFNIQYADGSGSGGFWAT 227
F + S T+ C++ +C L +S C++K C + ++Y DGS + G +++
Sbjct: 148 HAGALFDPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSS 207
Query: 228 DRITIQEANSNGYFTRYPFLLGCINNSSG----DKS-GASGIMGLDRSPVSIITRTNTSY 282
D +T+ +G F GC + G DK+ G G+ G +SPVS
Sbjct: 208 DVLTL-----SGSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYGKS 262
Query: 283 FSYCLPSPYGSTGYITF 299
F YCLP+ S+G++T
Sbjct: 263 FFYCLPATPASSGFLTL 279
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 39/94 (41%), Positives = 53/94 (56%), Gaps = 6/94 (6%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
L SAF M +Y +A+ L +LDTC++ + + V +P +A+ F GG ++LD G
Sbjct: 352 LSSAFRAGMTRYARAEPL-GILDTCFNFTGLDKVSIPTVALVFAGGAVVDLDAHGI---- 406
Query: 368 SVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYD 401
VS CL FA D T+GNVQQR EV YD
Sbjct: 407 -VSGGCLAFAPTRDDKAFGTIGNVQQRTFEVLYD 439
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 63/179 (35%), Positives = 100/179 (55%), Gaps = 10/179 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + IG P + + ++LDTGSDVTW QC PC C+ Q DP F + S ++ +PC+S
Sbjct: 195 EYFSRIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSDPLFDPALSSSYATVPCDSP 254
Query: 191 SCRILRESFPFGNC--NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
CR L S N + C + + Y DGS + G +AT+ +T+ +G + +
Sbjct: 255 HCRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTL---GGDGSAAVHDVAI 311
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL---PSPYGSTGYITFGKTDT 304
GC +++ G GA+G++ L P+S ++ + + FSYCL SP ST + FG +D+
Sbjct: 312 GCGHDNEGLFVGAAGLLALGGGPLSFPSQISATEFSYCLVDRDSPSAST--LQFGASDS 368
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/120 (34%), Positives = 65/120 (54%), Gaps = 6/120 (5%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T ++ + LR AF + + +A G+ L DTCYDL+ +V VP +++ F GG +L
Sbjct: 425 VTRLQSSAYSALRDAFVRGTQALPRASGVS-LFDTCYDLAGRSSVQVPAVSLRFEGGGEL 483
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSITL-GNVQQRGHEVHYDVAGRRLGFGPGNC 414
+L + L+ V CL FA ++++ GNVQQ+G V +D A +GF P C
Sbjct: 484 KLPAKNYLIPVDGAGTYCLAFAA---TGGAVSIVGNVQQQGIRVSFDTAKNTVGFSPNKC 540
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 60/184 (32%), Positives = 96/184 (52%), Gaps = 11/184 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + + IG P ++ S +LDTGSD+ WTQC PC+ C Q P+F + S T+ + C++
Sbjct: 91 EYLMEMGIGTPARFYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPANSSTYRSLGCSAP 150
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
+C L +P C K C + Y D + + G A + T ++ R F GC
Sbjct: 151 ACNALY--YPL--CYQKTCVYQYFYGDSASTAGVLANETFTFGTNDTRVTLPRISF--GC 204
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLP---SPYGSTGYITFGKTDTVNT 307
N ++G + SG++G R +S++++ + FSYCL SP S Y FG T+N+
Sbjct: 205 GNLNAGSLANGSGMVGFGRGSLSLVSQLGSPRFSYCLTSFLSPVRSRLY--FGAYATLNS 262
Query: 308 LRSA 311
++
Sbjct: 263 TNAS 266
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 64/192 (33%), Positives = 102/192 (53%), Gaps = 14/192 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCI-HCFQQRDPFFYASKSKTFFKIPCNS 189
E+ +VV G P Q + + DTGSD++W QC+PC HC++Q DP F +KS ++ +PC +
Sbjct: 111 EFVVVVGFGSPAQTSATMFDTGSDLSWIQCQPCSGHCYKQHDPVFDPAKSSSYAVVPCGT 170
Query: 190 TSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
T C G CN C + ++Y DGS + G A + +T +S+ FT F+ G
Sbjct: 171 TECAAAG-----GECNGTTCVYGVEYGDGSSTTGVLARETLTF---SSSSEFTG--FIFG 220
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDTVN 306
C + GD G++GL R +S+ ++ ++ FSYCLPS + GY++ G T
Sbjct: 221 CGETNLGDFGEVDGLLGLGRGSLSLSSQAAPAFGGIFSYCLPSYNTTPGYLSIGATPVTG 280
Query: 307 TLRSAFHKRMKK 318
+ + + K
Sbjct: 281 QIPVQYTAMVNK 292
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 62/172 (36%), Positives = 85/172 (49%), Gaps = 8/172 (4%)
Query: 125 NDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFK 184
N EY + +AIG P Q V L LDTGSD+ WTQCKPC+ CF Q P+F S+S T
Sbjct: 28 NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSCFDQPLPYFDTSRSSTNAL 87
Query: 185 IPCNSTSCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFT 242
+PC ST C++ N + C + Y D S + G A D+ T S T
Sbjct: 88 LPCESTQCKLDPTVTVCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKFTFVAGTSLPGVT 147
Query: 243 RYPFLLGC-INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGS 293
GC +NN+ S +GI G R P+S+ ++ FS+C + G+
Sbjct: 148 -----FGCGLNNTGVFNSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGA 194
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 64/189 (33%), Positives = 100/189 (52%), Gaps = 19/189 (10%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + ++G P + ++DTGSD+ W QC+PC C+ Q P F SKS ++ IPC S
Sbjct: 86 EYLMTYSVGTPPFKLYGIVDTGSDIVWLQCEPCQECYNQTTPMFNPSKSSSYKNIPCPSK 145
Query: 191 SCRILRESFPFGNCNSKE-CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLL 248
C+ + ++ +CN K C ++ Y D S SGG + D +T++ ++NG +P ++
Sbjct: 146 LCQSMEDT----SCNDKNYCEYSTYYGDNSHSGGDLSVDTLTLE--STNGLTVSFPNIVI 199
Query: 249 GC-INNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPY-------GSTGYI 297
GC NN + +SGI+G P S IT+ +S FSYCL + +T +
Sbjct: 200 GCGTNNILSYEGASSGIVGFGSGPASFITQLGSSTGGKFSYCLTPLFSVTNIQSNATSKL 259
Query: 298 TFGKTDTVN 306
FG TV+
Sbjct: 260 NFGDAATVS 268
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 66/191 (34%), Positives = 103/191 (53%), Gaps = 17/191 (8%)
Query: 103 RLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKP 162
R + F + L T T N EY + ++G P V ++DTGSD+ W QCKP
Sbjct: 63 RANRLFKDSLSNTPESTVYVN-----GGEYLMTYSVGTPPFNVYGVVDTGSDIVWLQCKP 117
Query: 163 CIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKE-CPFNIQYADGSGS 221
C C++Q P F SKS ++ IPC+S C+ +R + +CN + C + I ++D S S
Sbjct: 118 CEQCYKQTTPIFNPSKSSSYKNIPCSSNLCQSVR----YTSCNKQNSCEYTINFSDQSYS 173
Query: 222 GGFWATDRITIQEANSNGYFTRYP-FLLGCINNSSGDKSG-ASGIMGLDRSPVSIITRTN 279
G + + +T+ ++ G+ +P ++GC +N+ G G SGI+GL PVS+ T+
Sbjct: 174 QGELSVETLTLD--STTGHSVSFPKTVIGCGHNNRGMFQGETSGIVGLGIGPVSLTTQLK 231
Query: 280 TSY---FSYCL 287
+S FSYCL
Sbjct: 232 SSIGGKFSYCL 242
>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 58/145 (40%), Positives = 83/145 (57%), Gaps = 13/145 (8%)
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCN 188
+ EY+ + +G P +YV ++LDTGSDV W QC PC C+ Q DP F KS +F I C
Sbjct: 171 SGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCR 230
Query: 189 STSCRILRESFPFGNCNSKE-CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-F 246
S C LR P CNS++ C + + Y DGS + G ++T+ +T + TR P
Sbjct: 231 SPLC--LRLDSP--GCNSRQSCLYQVAYGDGSFTFGEFSTETLTFRG-------TRVPKV 279
Query: 247 LLGCINNSSGDKSGASGIMGLDRSP 271
LGC +++ G GA+G++GL R P
Sbjct: 280 ALGCGHDNEGLFVGAAGLLGLGRQP 304
>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 252
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 61/174 (35%), Positives = 100/174 (57%), Gaps = 14/174 (8%)
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
+V +G + +++++DT SD+TW QC+PC+ C+ Q+ P F S S ++ + CNS++C+
Sbjct: 66 IVTMGLGSKNMTVIIDTRSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQS 125
Query: 195 LR----ESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
L+ + G+ N C + + Y DGS + G EA S G + F+ GC
Sbjct: 126 LQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGDLGV------EALSFGGVSVSDFVFGC 179
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLP-SPYGSTGYITFG 300
N+ G G SG+MGL RS +S++++TN ++ FSYCLP + GS+G + G
Sbjct: 180 GRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTEAGSSGSLVMG 233
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 64/193 (33%), Positives = 96/193 (49%), Gaps = 19/193 (9%)
Query: 126 DTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKI 185
D A Y + ++IG P S+L DTGS + WTQC PC C + P F + S TF K+
Sbjct: 84 DNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKL 143
Query: 186 PCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
PC S+ C+ L P+ CN+ C + Y G + G+ AT+ + + A+ G
Sbjct: 144 PCASSLCQFLTS--PYLTCNATGCVYYYPYGMGF-TAGYLATETLHVGGASFPG------ 194
Query: 246 FLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL---------PSPYGSTGY 296
GC + +G + +SGI+GL RSP+S++++ FSYCL P +GS
Sbjct: 195 VAFGC-STENGVGNSSSGIVGLGRSPLSLVSQVGVGRFSYCLRSDADAGDSPILFGSLAK 253
Query: 297 ITFGKTDTVNTLR 309
+T G + L
Sbjct: 254 VTGGNVQSTPLLE 266
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 71/202 (35%), Positives = 108/202 (53%), Gaps = 20/202 (9%)
Query: 100 NSRRLRKPFPEFLKRTEAFTFPANINDTV-------ADEYYIVVAIGEPKQYVSLLLDTG 152
+ RL F + R F A +D + A EY + + IG P V ++DTG
Sbjct: 53 QAERLTDAFRRSVSRVGRFRPTAMTSDGIQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTG 112
Query: 153 SDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCN-SKECPF 211
SD+TWTQC+PC HC++Q P F S T+ C ++ C L + +C+ K+C F
Sbjct: 113 SDLTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCGTSFCLALGKD---RSCSKEKKCTF 169
Query: 212 NIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLGCINNSSG--DKSGASGIMGLD 268
YADGS +GG A++ +T+ ++ G +P F GC ++S G DKS +SGI+GL
Sbjct: 170 RYSYADGSFTGGNLASETLTVD--STAGKPVSFPGFAFGCGHSSGGIFDKS-SSGIVGLG 226
Query: 269 RSPVSIITRTNTS---YFSYCL 287
+S+I++ ++ FSYCL
Sbjct: 227 GGELSLISQLKSTINGLFSYCL 248
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 63/184 (34%), Positives = 103/184 (55%), Gaps = 21/184 (11%)
Query: 132 YYIVVAIGEP-KQYVSLLLDTGSDVTWTQCKPC--IHCFQQRDPFFYASKSKTFFKIPCN 188
Y +A+G + +++++DTGSD+TW QC+PC C+ QRDP F + S TF +PC
Sbjct: 180 YVTTIALGGGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDPLFDPAASPTFAAVPCG 239
Query: 189 STSCRI-LRESFPF-GNC------NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGY 240
S +C L+++ G+C + + C + + Y DGS S G A D + + G
Sbjct: 240 SPACAASLKDATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLAQDTLGL------GT 293
Query: 241 FTRYP-FLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGY 296
T+ F+ GC ++ G G +G+MGL R+ +S++++T + FSYCLP+ STG
Sbjct: 294 TTKLDGFVFGCGLSNRGLFGGTAGLMGLGRTDLSLVSQTAARFGGVFSYCLPATTTSTGS 353
Query: 297 ITFG 300
++ G
Sbjct: 354 LSLG 357
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 59/112 (52%), Gaps = 4/112 (3%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+R+ F +R + Y A G +LD CYDL+ + V VP + + GG + +D G L
Sbjct: 424 KAVRAEFARRFE-YPAAPGFS-ILDACYDLTGRDEVNVPLLTLTLEGGAQVTVDAAGMLF 481
Query: 366 VASV--SQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
V SQVCL A+ P + + +GN QQR V YD G RLGF +C+
Sbjct: 482 VVRKDGSQVCLAMASLPYEDQTPIIGNYQQRNKRVVYDTVGSRLGFADEDCT 533
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 94/162 (58%), Gaps = 12/162 (7%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y + ++G P + + DTGSD+ W QC+PC C+ Q P F SKS ++ IPC+S
Sbjct: 87 YLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSSSYKNIPCSSKL 146
Query: 192 CRILRESFPFGNCNSK-ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLG 249
C +R++ +C+ + C + I Y D S S G + D ++++ +++G +P ++G
Sbjct: 147 CHSVRDT----SCSDQNSCQYKISYGDSSHSQGDLSVDTLSLE--STSGSPVSFPKIVIG 200
Query: 250 CINNSSGDKSGA-SGIMGLDRSPVSIITRTNTSY---FSYCL 287
C +++G GA SGI+GL PVS+IT+ +S FSYCL
Sbjct: 201 CGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCL 242
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 50/119 (42%), Gaps = 5/119 (4%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T +D L SA + K + CY L + E P I +HF G D+
Sbjct: 319 LTLIPSDVYTNLESAV-VDLVKLDRVDDPNQQFSLCYSLKSNE-YDFPIITVHF-KGADV 375
Query: 357 ELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
EL T V + VC F P SI GN+ Q+ V YD+ + + F P +C+
Sbjct: 376 ELHSISTFVPITDGIVCFAFQP-SPQLGSI-FGNLAQQNLLVGYDLQQKTVSFKPTDCT 432
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 108/413 (26%), Positives = 156/413 (37%), Gaps = 75/413 (18%)
Query: 69 GPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRT------EAFTFPA 122
GPCS L+ I +L D R+ +R +K P T + P
Sbjct: 52 GPCSPLSADIP-----FSAVLTHDAARIASFAARLAKKSSPSSASATTQAAGSSLASVPL 106
Query: 123 NINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSK 180
+V Y+ + +G P + +++DTGS +TW QC PC + C +Q P F S
Sbjct: 107 TPGTSVGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSS 166
Query: 181 TFFKIPCNSTSCRILRESF--PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANS- 237
++ + C+S C L + P S C + Y D S S G+ + D ++ ANS
Sbjct: 167 SYAAVSCSSPQCDGLSTATLNPAVCSPSNVCIYQASYGDSSFSVGYLSKDTVSF-GANSV 225
Query: 238 -----------NGYFTRYPFLLGCINN------------------------SSGDKS-GA 261
G F R L+G N SSG S G+
Sbjct: 226 PNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPSTSSSGYLSIGS 285
Query: 262 SGIMGLDRSPVSIITRTNTSYF-------------------SYCLPSPYGSTGYITFGKT 302
G +P+ T ++ YF LP+ S IT T
Sbjct: 286 YNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSLPTIIDSGTVITRLPT 345
Query: 303 DTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRG 362
L A MK K +LDTC++ A + VP +++ F GG L+L
Sbjct: 346 SVYTALSKAVAAAMKGSTKRAAAYSILDTCFEGQASKLRAVPAVSMAFSGGATLKLSAGN 405
Query: 363 TLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
LV + CL FA P ++ +GN QQ+ V YDV R+GF CS
Sbjct: 406 LLVDVDGATTCLAFA---PARSAAIIGNTQQQTFSVVYDVKSNRIGFAAAGCS 455
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 96/186 (51%), Gaps = 21/186 (11%)
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
V +G ++++DT S++TW QC PC C Q+DP F S S ++ +PCNS+SC
Sbjct: 154 VATVGLGGGEATVIVDTASELTWVQCAPCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDA 213
Query: 195 LR-----ESFPFGNCNSKE-----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRY 244
L+ S C ++ C + + Y DGS S G A DR+++ +G
Sbjct: 214 LQLATGGTSGGAAACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSLAGEVIDG----- 268
Query: 245 PFLLGCINNSSGDK-SGASGIMGLDRSPVSIITRTNTSY---FSYCLP-SPYGSTGYITF 299
F+ GC ++ G G SG+MGL RS +S++++T + FSYCLP S+G +
Sbjct: 269 -FVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTMDQFGGVFSYCLPLKESDSSGSLVI 327
Query: 300 GKTDTV 305
G +V
Sbjct: 328 GDDSSV 333
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 62/111 (55%), Gaps = 3/111 (2%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL- 364
N +++ F + +Y +A G +LDTC++++ V VP + + F GGV++E+D G L
Sbjct: 400 NAVKAEFLSQFAEYPQAPGFS-ILDTCFNMTGLREVQVPSLKLVFDGGVEVEVDSGGVLY 458
Query: 365 -VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V + SQVCL A + + +GN QQ+ V +D +G ++GF C
Sbjct: 459 FVSSDSSQVCLAMAPLKSEYETNIIGNYQQKNLRVIFDTSGSQVGFAQETC 509
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 102/202 (50%), Gaps = 14/202 (6%)
Query: 90 RQDQQRLHLKNSRRLRKPFPEFLKRTE-AFTFPANINDTV-ADEYYIVVAIGEPKQYVSL 147
R+ +R+ L++ R P L + A P +D V EY + +AIG P Q V L
Sbjct: 51 RELMRRMALRSKARA----PRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQL 106
Query: 148 LLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSK 207
LDTGS + WTQC+PC CF Q P++ AS+S TF C+ST C++ N +
Sbjct: 107 TLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQ 166
Query: 208 ECPFNIQYADGSGSGGFWATDRIT-IQEANSNGYFTRYPFLLGC-INNSSGDKSGASGIM 265
C ++ Y D S + GF + ++ + A+ G + GC +NN+ +S +GI
Sbjct: 167 TCAYSYSYGDKSATIGFLDVETVSFVAGASVPG------VVFGCGLNNTGIFRSNETGIA 220
Query: 266 GLDRSPVSIITRTNTSYFSYCL 287
G R P+S+ ++ FS+C
Sbjct: 221 GFGRGPLSLPSQLKVGNFSHCF 242
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 92/180 (51%), Gaps = 11/180 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + + IG P +Y S +LDTGSD+ WTQC PC+ C Q P+F ++S T+ + C S
Sbjct: 89 EYLMEMGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASP 148
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
+C L +P C K C + Y D + + G A + T + F GC
Sbjct: 149 ACNALY--YPL--CYQKVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISF--GC 202
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLP---SPYGSTGYITFGKTDTVNT 307
N ++G + SG++G R +S++++ + FSYCL SP S Y FG T+N+
Sbjct: 203 GNLNAGSLANGSGMVGFGRGSLSLVSQLGSPRFSYCLTSFLSPVPSRLY--FGVYATLNS 260
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 99/169 (58%), Gaps = 12/169 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + +++G P + + DTGSD+ WTQC+PC +C+QQ P F SKS T+ K+ C+S
Sbjct: 84 EYLMKLSVGTPPFPIIAVADTGSDIIWTQCEPCTNCYQQDLPMFNPSKSTTYRKVSCSSP 143
Query: 191 SCRILRESFPFGNCNSK-ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLL 248
C E +C+ K +C ++I Y D S S G +A D +T+ +++G +P +
Sbjct: 144 VCSFTGED---NSCSFKPDCTYSISYGDNSHSQGDFAVDTLTM--GSTSGRVVAFPRTAI 198
Query: 249 GCINNSSGD-KSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGS 293
GC ++++G + SGI+GL P S+I + ++ FSYCL +P G+
Sbjct: 199 GCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCL-TPIGN 246
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 101 bits (251), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 73/228 (32%), Positives = 115/228 (50%), Gaps = 21/228 (9%)
Query: 68 YGPCSRLNQGISTHAPSLEEILRQDQQRLHLK-NSRRLRKPFPEFLKRTEAFTFPANIND 126
+G CS L ++ S +++ Q +R + + N+ R + P T P
Sbjct: 78 HGACSPLR---PINSSSWIDLVSQSFERDNARLNTIRSKNSGP----YTTMSNLPLQSGT 130
Query: 127 TVADEYYIVVA-IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKI 185
TV YIV A G P + L++DTGSD+TW QCKPC C+ Q D F +S ++ +
Sbjct: 131 TVGTGNYIVTAGFGTPAKNSLLIIDTGSDLTWIQCKPCADCYSQVDAIFEPKQSSSYKTL 190
Query: 186 PCNSTSCR--ILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
PC S +C I ES P C C + I Y DGS S G ++ + +T+ G +
Sbjct: 191 PCLSATCTELITSESNPT-PCLLGGCVYEINYGDGSSSQGDFSQETLTL------GSDSF 243
Query: 244 YPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLP 288
F GC + ++G G+SG++GL ++ +S +++ + Y F+YCLP
Sbjct: 244 QNFAFGCGHTNTGLFKGSSGLLGLGQNSLSFPSQSKSKYGGQFAYCLP 291
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 57/112 (50%), Gaps = 3/112 (2%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
N L+++F + + AK +LDTCYDLS + V +P I HF D+ + G LV
Sbjct: 371 NALKTSFRSKTRDLPSAKPFS-ILDTCYDLSRHSQVRIPTITFHFQNNADVAVSDVGILV 429
Query: 366 VA--SVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
SQVCL FA+ +GN QQ+ V +D R+GF G+C+
Sbjct: 430 PVQNGGSQVCLAFASASQMDGFNIIGNFQQQRMRVAFDTGAGRIGFASGSCA 481
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 53/172 (30%), Positives = 89/172 (51%), Gaps = 9/172 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + + G P+ + S +DT SD+ W QC+PC+ C++Q DP F S ++ +PC S
Sbjct: 91 EYLVKLGTGTPQHFFSAAIDTASDLVWMQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSD 150
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
+C L + + C + +Y+ + G A D++ I G + + GC
Sbjct: 151 TCAQL-DGHRCHEDDDGACQYTYKYSGHGVTKGTLAIDKLAI------GGDVFHAVVFGC 203
Query: 251 INNSSGDKSG-ASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGST-GYITFG 300
++S G + ASG++GL R P+S++++ + F YCLP P T G + G
Sbjct: 204 SDSSVGGPAAQASGLVGLGRGPLSLVSQLSVHRFMYCLPPPMSRTSGKLVLG 255
>gi|21668075|gb|AAM74221.1|AF518565_1 putative chloroplast nucleoid DNA-binding protein [Brassica
oleracea]
Length = 165
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 66/108 (61%), Gaps = 1/108 (0%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
LR AF +M +YK + +LDTC+DL+ ++TV +P ++ +F GG +EL +G L
Sbjct: 59 LRGAFKAKMSQYKNTSAVS-ILDTCFDLTGFKTVTIPTVSFYFNGGAVVELGSKGVLYAF 117
Query: 368 SVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+SQVCL FA D N+ GNVQQ+ EV YD A R+GF P CS
Sbjct: 118 KMSQVCLAFAGNSDDNNAAIFGNVQQQTLEVVYDGAAGRVGFAPNGCS 165
>gi|125555051|gb|EAZ00657.1| hypothetical protein OsI_22678 [Oryza sativa Indica Group]
Length = 435
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 101/361 (27%), Positives = 142/361 (39%), Gaps = 85/361 (23%)
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCN 188
A EY ++ G P Q + DT V+ +CKPC+ DP F S+S +F IPC
Sbjct: 85 ALEYRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGG-APCDPAFEPSRSSSFAAIPCG 143
Query: 189 STSCRILRESFPFGNCNSKECPF-----NIQYADGS------------------------ 219
S C + C CPF N+ A+G+
Sbjct: 144 SPECAV--------ECTGASCPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEV 195
Query: 220 --------GSGGFWATDRITIQEAN---SNGYFTRYPFLLGCINNSSGDKSGASGIMGLD 268
G+ G R + A+ SNG T C+ +SS S +G
Sbjct: 196 GADADTFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLSIGAS 255
Query: 269 R----------SPVSIITRTNTSYFSYCL------------PSPYGSTGYI-------TF 299
R +P+S SYF + P+ + + G + TF
Sbjct: 256 RPEYSGGDIKYAPMSSNPNHPNSYFVELVGISVGGEDLPVPPAVFAAHGTLLEAATEFTF 315
Query: 300 GKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELD 359
LR AF + M Y A +LDTCY+L+ ++ VP +A+ F GG +LELD
Sbjct: 316 LAPAAYAALRDAFRRDMAPYPAAPPFR-VLDTCYNLTGLASLAVPTVALRFAGGTELELD 374
Query: 360 VRGTLVVASVSQV-----CLGFATYPPDPNSIT-LGNVQQRGHEVHYDVAGRRLGFGPGN 413
VR + A S V CL FA P ++ +G + QR EV YD+ G R+GF PG
Sbjct: 375 VRQMMYFADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRGGRVGFIPGR 434
Query: 414 C 414
C
Sbjct: 435 C 435
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 89/330 (26%), Positives = 134/330 (40%), Gaps = 54/330 (16%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V +G P L+LDTGSDV W QC PC C+ Q F +S+++ + C +
Sbjct: 141 EYFASVGVGTPPTPALLVLDTGSDVVWLQCAPCRQCYAQSGRVFDPRRSRSYAAVRCGAP 200
Query: 191 SC-RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL-L 248
C + + C + + Y DGS + G AT+ + R P + +
Sbjct: 201 PCRGLDAGGGGGCDRRRGTCLYQVAYGDGSVTAGDLATETLWFARG------ARVPRVAV 254
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS---------------- 289
GC +++ G A+G++GL R +S+ T+T Y FSYC
Sbjct: 255 GCGHDNEGLFVAAAGLLGLGRGRLSLPTQTARRYGRRFSYCFQGSDLDHRTIIRTVHQHV 314
Query: 290 ----------------PYGSTGYITFGKTDTVNTL--------RSAFHKRMKKYKKAKGL 325
P G + +V L R AF + A G
Sbjct: 315 GGARVRGVGERSLRLDPSTGRGGVILDSGTSVTRLARPVYVAVREAFRAAAGGLRLAPGG 374
Query: 326 EDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-VASVSQVCLGFATYPPDPN 384
L DTCYDL V VP +++H GG ++ L L+ V + CL A D
Sbjct: 375 FSLFDTCYDLRGRRVVKVPTVSVHLAGGAEVALPPENYLIPVDTRGTFCLALAGT--DGG 432
Query: 385 SITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+GN+QQ+G V +D +R+ P +C
Sbjct: 433 VSIVGNIQQQGFRVVFDGDRQRVALVPKSC 462
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 92/180 (51%), Gaps = 11/180 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + + IG P +Y S +LDTGSD+ WTQC PC+ C Q P+F ++S T+ + C S
Sbjct: 89 EYLMEMGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASP 148
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
+C L +P C K C + Y D + + G A + T + F GC
Sbjct: 149 ACNALY--YPL--CYQKVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISF--GC 202
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLP---SPYGSTGYITFGKTDTVNT 307
N ++G + SG++G R +S++++ + FSYCL SP S Y FG T+N+
Sbjct: 203 GNLNAGLLANGSGMVGFGRGSLSLVSQLGSPRFSYCLTSFLSPVPSRLY--FGVYATLNS 260
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 98/169 (57%), Gaps = 12/169 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + +++G P + + DTGSD+ WTQC PC +C+QQ P F SKS T+ K+ C+S
Sbjct: 84 EYLMKLSVGTPPFPIIAVADTGSDIIWTQCVPCTNCYQQDLPMFNPSKSTTYRKVSCSSP 143
Query: 191 SCRILRESFPFGNCNSK-ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLL 248
C E +C+ K +C ++I Y D S S G +A D +T+ +++G +P +
Sbjct: 144 VCSFTGED---NSCSFKPDCTYSISYGDNSHSQGDFAVDTLTM--GSTSGRVVAFPRTAI 198
Query: 249 GCINNSSGD-KSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGS 293
GC ++++G + SGI+GL P S+I + ++ FSYCL +P G+
Sbjct: 199 GCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCL-TPIGN 246
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 64/202 (31%), Positives = 100/202 (49%), Gaps = 18/202 (8%)
Query: 128 VADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQ-RDPFFYASKSKTFFKIP 186
V +EY + V++G P + V+L LDTGSD+ WTQC PC+ CF+Q P + S T +P
Sbjct: 86 VTNEYLMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAAPVLDPAASSTHAALP 145
Query: 187 CNSTSCRILRESFPFGNCNS-----KECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
C++ CR L PF +C + C + Y D S + G ATD T ++ G
Sbjct: 146 CDAPLCRAL----PFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGGDDNAGGL 201
Query: 242 TRYPFLLGCINNSSGD-KSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFG 300
GC + + G ++ +GI G R S+ ++ N + FSYC S + +
Sbjct: 202 AARRVTFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNVTSFSYCFTSMFDT------- 254
Query: 301 KTDTVNTLRSAFHKRMKKYKKA 322
K+ +V TL +A + + + A
Sbjct: 255 KSSSVVTLGAAAAELLHTHHAA 276
Score = 41.6 bits (96), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 37/89 (41%), Gaps = 4/89 (4%)
Query: 329 LDTCYDL---SAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNS 385
LD C+ L + + VP + +H GG D EL RG V + L
Sbjct: 360 LDLCFALPVAALWRRPAVPALTLHLDGGADWELP-RGNYVFEDYAARVLCVVLDAAAGEQ 418
Query: 386 ITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ +GN QQ+ V YD+ L F P C
Sbjct: 419 VVIGNYQQQNTHVVYDLENDVLSFAPARC 447
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 63/184 (34%), Positives = 99/184 (53%), Gaps = 14/184 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + +++G P + + DTGSD+ WTQCKPC C++Q DP F SKT+ C++
Sbjct: 94 EYLMSLSLGTPPFKIMGIADTGSDLIWTQCKPCERCYKQVDPLFDPKSSKTYRDFSCDAR 153
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLG 249
C +L +S C+ C + Y D S + G A+D IT+ ++ G +P ++G
Sbjct: 154 QCSLLDQS----TCSGNICQYQYSYGDRSYTMGNVASDTITLD--STTGSPVSFPKTVIG 207
Query: 250 CINNSSGDKSG-ASGIMGLDRSPVSIITRTNTSY---FSYC---LPSPYGSTGYITFGKT 302
C + + G S SGI+GL P+S+I++ +S FSYC L S G++ + FG
Sbjct: 208 CGHENDGTFSDKGSGIVGLGAGPLSLISQMGSSVGGKFSYCLVPLSSRAGNSSKLNFGSN 267
Query: 303 DTVN 306
V+
Sbjct: 268 AVVS 271
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 74/230 (32%), Positives = 109/230 (47%), Gaps = 21/230 (9%)
Query: 85 LEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQY 144
+ + LR+D +H N+R+L + P I+ T A EY + +AIG P
Sbjct: 47 VRDALRRD---MHRHNARQLAASS----SNGTTVSAPTQISPT-AGEYLMTLAIGTPPVS 98
Query: 145 VSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNST---SCRILRESFP 200
+ DTGSD+ WTQC PC CFQQ P + S S TF +PCNS+ L + P
Sbjct: 99 YQAIADTGSDLIWTQCAPCSSQCFQQPTPLYNPSSSTTFAVLPCNSSLSMCAAALAGTTP 158
Query: 201 FGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSG-DKS 259
C C +N+ Y G S + ++ T + GC N S G + S
Sbjct: 159 PPGCT---CMYNMTYGSGWTS-VYQGSETFTFGSSTPANQTGVPGIAFGCSNASGGFNTS 214
Query: 260 GASGIMGLDRSPVSIITRTNTSYFSYCLPSPY---GSTGYITFGKTDTVN 306
ASG++GL R +S++++ FSYCL +PY ST + G + ++N
Sbjct: 215 SASGLVGLGRGSLSLVSQLGVPKFSYCL-TPYQDTNSTSTLLLGPSASLN 263
Score = 39.7 bits (91), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 42/89 (47%), Gaps = 5/89 (5%)
Query: 329 LDTCYDLSAYETV--VVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSI 386
LD C++L + + +P + +HF G D+ L +++ S + CL SI
Sbjct: 358 LDLCFELPSSTSAPPTMPSMTLHF-DGADMVLPADSYMMLDS-NLWCLAMQNQTDGGVSI 415
Query: 387 TLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
LGN QQ+ + YDV L F P CS
Sbjct: 416 -LGNYQQQNMHILYDVGQETLTFAPAKCS 443
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 103/411 (25%), Positives = 172/411 (41%), Gaps = 93/411 (22%)
Query: 87 EILRQDQQRLHLKN-----SRRLRKPFPEFLKRTEAFTFPANINDTVAD---EYYIVVAI 138
+++ +D + L N + RL + F F+ +EA P V+ EY + ++I
Sbjct: 38 DLIHRDSPKSPLYNPSETPAERLDRFFRRFMSFSEASISPNTPEPPVSSNNGEYLMKISI 97
Query: 139 GEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRES 198
G P V + DTGSD+ WTQC PC+ C++Q++P F SKS +F ++ C S CR+L
Sbjct: 98 GTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCESQQCRLLDTV 157
Query: 199 FPFGNCN--SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSG 256
+C+ K C F+ Y DGS + G AT+ +T+ +NS + + GC +N+SG
Sbjct: 158 ----SCSQPQKLCDFSYGYGDGSLAQGVIATETLTLN-SNSGQPTSILNIVFGCGHNNSG 212
Query: 257 D-KSGASGIMGLDRSPVSIITRTNTSY-----FSYCLPSPYGS----TGYITFGK----- 301
G+ G P+S+ ++ ++ FS CL P+ + T I FG
Sbjct: 213 TFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCL-VPFRTDPSITSKIIFGPEAEVS 271
Query: 302 -TDTVNT-------------------------------------------------LRSA 311
+D V+T L
Sbjct: 272 GSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRD 331
Query: 312 FHKRM-KKYKKAKGLEDLLD------TCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL 364
F+ R+ + K+A +E + D CY + T++ I G D++L T
Sbjct: 332 FYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSA---TLIDGPILTAHFDGADVQLKPLNTF 388
Query: 365 VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ C FA P D ++ GN Q + +D+ G+++ F +C+
Sbjct: 389 ISPKEGVYC--FAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCT 437
>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
Length = 443
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 79/290 (27%), Positives = 122/290 (42%), Gaps = 39/290 (13%)
Query: 138 IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRE 197
+G P V + DTGS++ W QC PC HC+ Q P F ++S T+ + +S C +R
Sbjct: 63 LGVPSTLVYGIADTGSELIWLQCLPCTHCYNQTPPIFDPAESYTYETVSSDSPICNAVRR 122
Query: 198 SFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSN----GYFTRYPFLLGCI 251
+C K C + Y DG+ + G +TD ++ GY T GC
Sbjct: 123 I----SCREGDKSCCYQHTYGDGTTTKGTLSTDVFAFEDPTRTIVEVGYLT-----FGCS 173
Query: 252 NNSSGDKSG-ASGIMGLDRSPVSIITRTNTSYFSYCLPSP----------YGSTGYITFG 300
+++ G +G++GL+R P S++++ FSYC+ P +GS I G
Sbjct: 174 HDTKARLKGHQAGVVGLNRHPNSLVSQLKVKKFSYCMVIPDDHGSGSRMYFGSRAVILGG 233
Query: 301 KTDTVNTLRSAFHKRMK--KYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLEL 358
KT + S + +K + KG D L + P I HF G D L
Sbjct: 234 KTPLLKGDYSHYFVTLKGISVGEEKGRSDELAS----------AGPDITFHFY-GADFIL 282
Query: 359 DVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLG 408
T V CL + LGN+QQ+ + V YD+ + +
Sbjct: 283 TKXTTYVEVEKGLWCLAMLSSNSTRKLSILGNIQQQNYHVGYDLEAQEVA 332
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 29/120 (24%), Positives = 57/120 (47%), Gaps = 5/120 (4%)
Query: 159 QCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADG 218
+ + CF Q P F SKS T+ +P ++ +C + + + ++C + I Y G
Sbjct: 327 EAQEVAQCFNQTPPIFDPSKSSTYSTVPWDAPTC--YQAGGYACHIDEEDCCYRISYGSG 384
Query: 219 SGSG-GFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGAS-GIMGLDRSPVSIIT 276
S S G + D ++ N + + GC + ++G G GI+GL++ +S+++
Sbjct: 385 STSTEGTISIDAFAFEDNRQNMVDVXH-LVFGCSDYTTGTFKGYEVGIVGLNQDSLSLVS 443
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 96/384 (25%), Positives = 146/384 (38%), Gaps = 102/384 (26%)
Query: 128 VADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRD-PFFYASKSKTFFKIP 186
V +EY + +++G P + V+L LDTGSD+ WTQC PC++CF Q P + S T +
Sbjct: 90 VTNEYLVHLSVGTPPRPVALTLDTGSDLVWTQCAPCLNCFDQGAIPVLDPAASSTHAAVR 149
Query: 187 CNSTSCRILRESFPFGNC-------NSKECPFNIQYADGSGSGGFWATDRITIQEANS-- 237
C++ CR L PF +C + C + Y D S + G A+DR T ++
Sbjct: 150 CDAPVCRAL----PFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGDNAD 205
Query: 238 NGYFTRYPFLLGCINNSSGD-KSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGST-G 295
G + GC + + G ++ +GI G R S+ ++ + FSYC S + ST
Sbjct: 206 GGGVSERRLTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQLGVTSFSYCFTSMFESTSS 265
Query: 296 YITFG-------------------------------KTDTVNTLRSAFHKRMKKYKKAKG 324
+T G K TV R +R ++ ++A
Sbjct: 266 LVTLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQRLREASA 325
Query: 325 LED----------------------------------LLDTCYDLSAYET---------- 340
+ D LD C+ L +
Sbjct: 326 IIDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSALDLCFALPSAAAPKSAFGWRWR 385
Query: 341 -------VVVPKIAIHFLGGVDLELDVRGTLVV---ASVSQVCLGFATYPPDPNSITLGN 390
V VP++ H GG D EL + A V + L AT D ++ +GN
Sbjct: 386 GRGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCLVLDAATGGGD-QTVVIGN 444
Query: 391 VQQRGHEVHYDVAGRRLGFGPGNC 414
QQ+ V YD+ L F P C
Sbjct: 445 YQQQNTHVVYDLENDVLSFAPARC 468
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 93/162 (57%), Gaps = 12/162 (7%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y + ++G P + + DTGSD+ W QC+PC C+ Q P F SKS ++ IPC S
Sbjct: 87 YLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSSSYKNIPCLSKL 146
Query: 192 CRILRESFPFGNCNSK-ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLG 249
C +R++ +C+ + C + I Y D S S G + D ++++ +++G +P ++G
Sbjct: 147 CHSVRDT----SCSDQNSCQYKISYGDSSHSQGDLSVDTLSLE--STSGSPVSFPKTVIG 200
Query: 250 CINNSSGDKSGA-SGIMGLDRSPVSIITRTNTSY---FSYCL 287
C +++G GA SGI+GL PVS+IT+ +S FSYCL
Sbjct: 201 CGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCL 242
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 54/166 (32%), Positives = 90/166 (54%), Gaps = 11/166 (6%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
Y + +AIG P ++ +LDTGSD+ WTQC PC CF Q P + ++S T+ + C S
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151
Query: 191 SCRILRESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
C+ L+ P+ C+ + C + Y DG+ + G AT+ T+ S+ F
Sbjct: 152 MCQALQS--PWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTL---GSDTAVRGVAF-- 204
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGST 294
GC + G +SG++G+ R P+S++++ + FSYC +P+ +T
Sbjct: 205 GCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCF-TPFNAT 249
Score = 42.0 bits (97), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 50/109 (45%), Gaps = 8/109 (7%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
L A R++ A G L C+ ++ E V VP++ +HF G D+EL R + VV
Sbjct: 336 LARALASRVR-LPLASGAHLGLSLCFAAASPEAVEVPRLVLHF-DGADMELR-RESYVVE 392
Query: 368 --SVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
S CLG + LG++QQ+ + YD+ L F P C
Sbjct: 393 DRSAGVACLGMVS---ARGMSVLGSMQQQNTHILYDLERGILSFEPAKC 438
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 54/166 (32%), Positives = 90/166 (54%), Gaps = 11/166 (6%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
Y + +AIG P ++ +LDTGSD+ WTQC PC CF Q P + ++S T+ + C S
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151
Query: 191 SCRILRESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
C+ L+ P+ C+ + C + Y DG+ + G AT+ T+ S+ F
Sbjct: 152 MCQALQS--PWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTL---GSDTAVRGVAF-- 204
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGST 294
GC + G +SG++G+ R P+S++++ + FSYC +P+ +T
Sbjct: 205 GCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCF-TPFNAT 249
Score = 43.1 bits (100), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 56/134 (41%), Gaps = 13/134 (9%)
Query: 289 SPYGSTGYITFGKTDTVNTLRSAFHKRMK------KYKKAKGLEDLLDTCYDLSAYETVV 342
+P G G I T SAF + + A G L C+ ++ E V
Sbjct: 310 TPMGDGGVIIDSGTTFTALEESAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAVE 369
Query: 343 VPKIAIHFLGGVDLELDVRGTLVVA--SVSQVCLGFATYPPDPNSITLGNVQQRGHEVHY 400
VP++ +HF G D+EL R + VV S CLG + LG++QQ+ + Y
Sbjct: 370 VPRLVLHF-DGADMELR-RESYVVEDRSAGVACLGMVS---ARGMSVLGSMQQQNTHILY 424
Query: 401 DVAGRRLGFGPGNC 414
D+ L F P C
Sbjct: 425 DLERGILSFEPAKC 438
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 92/322 (28%), Positives = 137/322 (42%), Gaps = 66/322 (20%)
Query: 126 DTVAD--EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
DTV D EY + + IG P V +LDTGS++ WTQC PC+HC+ Q+ P F SKS TF
Sbjct: 57 DTVFDTYEYLMKLQIGTPPFEVEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFK 116
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
+ CN+ CP+ + Y D S + G AT+ +TI + S F
Sbjct: 117 ETRCNTP---------------DHSCPYKLVYDDKSYTQGTLATETVTIH-STSGVPFVM 160
Query: 244 YPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITRTNTSYFSYCLPSP----------- 290
++GC N+SG + +SGI+GL R +S+I++ +Y + S
Sbjct: 161 PETIIGCSRNNSGSGFRPSSSGIVGLSRGSLSLISQMGGAYPGDGVVSTTMFAKTAKRGQ 220
Query: 291 -YGSTGYITFGKTDTVNTLRSAFHKR-----------------------MKKYKKAKGLE 326
Y + ++ G T + T+ + FH K ++ +
Sbjct: 221 YYLNLDAVSVGDTR-IETVGTPFHALNGNIVIDSGTPLTYFPVSYCNLVRKAVERVVTAD 279
Query: 327 DLLD------TCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQV-CLGFATY 379
++D CY + E + P I +HF GG DL LD + + V CL
Sbjct: 280 RVVDPSRNDMLCYYSNTIE--IFPVITVHFSGGADLVLDKYNMYMELNRGGVFCLAIICN 337
Query: 380 PPDPNSITLGNVQQRGHEVHYD 401
P +I GN Q V YD
Sbjct: 338 NPTQVAI-FGNRAQNNFLVGYD 358
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 62/185 (33%), Positives = 90/185 (48%), Gaps = 25/185 (13%)
Query: 126 DTVADE--YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
DTV D Y + + +G P + ++DTGS++TWTQC PC+HC++Q P F SKS TF
Sbjct: 372 DTVFDNSVYLMKLQVGTPPFEIEAVIDTGSEITWTQCLPCVHCYKQNAPIFDPSKSSTF- 430
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
+E C+ CP+ + Y D + + G ATD +TI + S F
Sbjct: 431 ------------KEK----RCHDHSCPYEVDYFDKTYTKGTLATDTVTIH-STSGEPFVM 473
Query: 244 YPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFG 300
++GC N+S + G +GL+ P+S+IT+ Y SYC T I FG
Sbjct: 474 AETIIGCGRNNSWFRPSFEGFVGLNWGPLSLITQMGGEYPGLMSYCFAG--NGTSKINFG 531
Query: 301 KTDTV 305
V
Sbjct: 532 TNAIV 536
Score = 39.3 bits (90), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 32/90 (35%), Positives = 40/90 (44%), Gaps = 6/90 (6%)
Query: 327 DLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA-SVSQVCLGFATYPPDPNS 385
DLL CY + E + P I +HF GG DL LD + + S CL P +
Sbjct: 623 DLL--CYYSNTTE--IFPVITMHFSGGADLVLDKYNMFMESYSGGLFCLAIICNNPTQEA 678
Query: 386 ITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
I GN Q V YD + + F P NCS
Sbjct: 679 I-FGNRAQNNFLVGYDSSSLLVSFKPTNCS 707
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 64/182 (35%), Positives = 85/182 (46%), Gaps = 7/182 (3%)
Query: 131 EYYIVVAIGEPK-QYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNS 189
EY I IG P+ Q V+L +DTGSDV WTQC+PC CF Q P F S S T + C
Sbjct: 91 EYLIHFGIGTPRPQQVALEVDTGSDVVWTQCRPCFDCFTQPLPRFDTSASDTVHGVLCTD 150
Query: 190 TSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
CR LR C C + + Y D S + G A D T + G T + G
Sbjct: 151 PICRALRPH----ACFLGGCTYQVNYGDNSVTIGQLAKDSFTF-DGKGGGKVTVPDLVFG 205
Query: 250 CINNSSGD-KSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFGKTDTVNTL 308
C ++G+ S +GI G R P+S+ + S FSYC + + S F + L
Sbjct: 206 CGQYNTGNFHSNETGIAGFGRGPLSLPRQLGVSSFSYCFTTIFESKSTPVFLGGAPADGL 265
Query: 309 RS 310
R+
Sbjct: 266 RA 267
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/170 (34%), Positives = 89/170 (52%), Gaps = 9/170 (5%)
Query: 121 PANINDTV-ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKS 179
P +D V EY + +AIG P Q V L LDTGS + WTQC+PC CF Q P++ AS+S
Sbjct: 23 PGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRS 82
Query: 180 KTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRIT-IQEANSN 238
TF C+ST C++ N + C ++ Y D S + GF + ++ + A+
Sbjct: 83 STFALPSCDSTQCKLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVAGASVP 142
Query: 239 GYFTRYPFLLGC-INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
G + GC +NN+ +S +GI G R P+S+ ++ FS+C
Sbjct: 143 G------VVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCF 186
>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
Length = 280
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 57/157 (36%), Positives = 87/157 (55%), Gaps = 10/157 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + IGEP ++LDTGSD++W QC PC C++Q DP F + S ++ + C +
Sbjct: 131 EYFSRIGIGEPPSQAYMVLDTGSDISWVQCAPCADCYRQADPIFEPTASASYAPLSCEAA 190
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
CR L +S C + C + + Y DGS + G + T+ +TI G LGC
Sbjct: 191 QCRYLDQS----QCRNGNCLYQVSYGDGSYTVGDFVTETVTI------GVNKVKNVALGC 240
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+N+ G GA+G++GL P+S + N++ FSYCL
Sbjct: 241 GHNNEGLFVGAAGLIGLGGGPLSFPAQLNSTSFSYCL 277
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/180 (32%), Positives = 93/180 (51%), Gaps = 15/180 (8%)
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
V +G ++++DT S++TW QC PC C Q+ P F + S ++ +PCNS+SC
Sbjct: 127 VATVGLGGGEATVIVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDA 186
Query: 195 LR-----ESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
L+ + G C + + Y DGS S G A D++++ +G F+ G
Sbjct: 187 LQVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEVIDG------FVFG 240
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLP-SPYGSTGYITFGKTDTV 305
C ++ G G SG+MGL RS +S+I++T + FSYCLP S+G + G +V
Sbjct: 241 CGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPLKESESSGSLVLGDDTSV 300
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 61/111 (54%), Gaps = 3/111 (2%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL- 364
N +++ F + +Y +A G +LDTC++L+ + V +P + F G V++E+D G L
Sbjct: 360 NAVKAEFLSQFAEYPQAPGF-SILDTCFNLTGFREVQIPSLKFVFEGNVEVEVDSSGVLY 418
Query: 365 -VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V + SQVCL A+ + + +GN QQ+ V +D G ++GF C
Sbjct: 419 FVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETC 469
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 59/180 (32%), Positives = 93/180 (51%), Gaps = 15/180 (8%)
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
V +G ++++DT S++TW QC PC C Q+ P F + S ++ +PCNS+SC
Sbjct: 128 VATVGLGGGEATVIVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDA 187
Query: 195 LR-----ESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
L+ + G C + + Y DGS S G A D++++ +G F+ G
Sbjct: 188 LQVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEVIDG------FVFG 241
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLP-SPYGSTGYITFGKTDTV 305
C ++ G G SG+MGL RS +S+I++T + FSYCLP S+G + G +V
Sbjct: 242 CGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPLKESESSGSLVLGDDTSV 301
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 61/111 (54%), Gaps = 3/111 (2%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL- 364
N +++ F + +Y +A G +LDTC++L+ + V +P + F G V++E+D G L
Sbjct: 361 NAVKAEFLSQFAEYPQAPGF-SILDTCFNLTGFREVQIPSLKFVFEGNVEVEVDSSGVLY 419
Query: 365 -VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V + SQVCL A+ + + +GN QQ+ V +D G ++GF C
Sbjct: 420 FVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETC 470
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 102/360 (28%), Positives = 139/360 (38%), Gaps = 94/360 (26%)
Query: 126 DTVADE--YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
DT+ D Y + + +G P + +DTGSD+ WTQC PC +C+ Q P F S S TF
Sbjct: 53 DTLFDYNIYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFK 112
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
+ CN SC + I YAD + S G AT+ +TI + S F
Sbjct: 113 EKRCNGNSCH-----------------YKIIYADTTYSKGTLATETVTIH-STSGEPFVM 154
Query: 244 YPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFG 300
+GC +NSS K SG++GL P S+IT+ Y SYC S T I FG
Sbjct: 155 PETTIGCGHNSSWFKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQ--GTSKINFG 212
Query: 301 KT---------------------------------DT-VNTLRSAFH------------- 313
DT V T+ + FH
Sbjct: 213 TNAIVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTT 272
Query: 314 -------------KRMKKYKKAKGLEDLLDTCYDLSAYETVVV---PKIAIHFLGGVDLE 357
+ + Y A D T D+ Y T + P I +HF GG DL
Sbjct: 273 LTYFPVSYCNLVREAVDHYVTAVRTAD--PTGNDMLCYYTDTIDIFPVITMHFSGGADLV 330
Query: 358 LDVRGTLVVASVSQ--VCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
LD + + + ++++ CL P P GN Q V YD + + F P NCS
Sbjct: 331 LD-KYNMYIETITRGTFCLAIICNNP-PQDAIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 388
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 72/233 (30%), Positives = 113/233 (48%), Gaps = 24/233 (10%)
Query: 85 LEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVAD--EYYIVVAIGEPK 142
+ + LR+D +H + SR L F L ++ T A + + EY + ++IG P
Sbjct: 49 VRDALRRD---MHRQQSRSL---FGRELAESDGTTVSARTRKDLPNGGEYLMTLSIGTPP 102
Query: 143 QYVSLLLDTGSDVTWTQCKPCI--HCFQQRDPFFYASKSKTFFKIPCNST---SCRILRE 197
+ DTGSD+ WTQC PC CF Q P + + S TF +PCNS+ +L
Sbjct: 103 LSYPAIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLPCNSSLSMCAGVLAG 162
Query: 198 SFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL-LGCINNSSG 256
P C C +N Y G + G ++ T A ++ R P + GC N SS
Sbjct: 163 KAPPPGC---ACMYNQTYGTG-WTAGVQGSETFTFGSAAADQ--ARVPGIAFGCSNASSS 216
Query: 257 DKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPY---GSTGYITFGKTDTVN 306
D +G++G++GL R +S++++ FSYCL +P+ ST + G + +N
Sbjct: 217 DWNGSAGLVGLGRGSLSLVSQLGAGRFSYCL-TPFQDTNSTSTLLLGPSAALN 268
Score = 38.5 bits (88), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 37/89 (41%), Gaps = 5/89 (5%)
Query: 329 LDTCYDLSAYETV--VVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSI 386
LD CY L + +P + +HF G D+ L ++ S CL D
Sbjct: 361 LDLCYALPTPTSAPPAMPSMTLHF-DGADMVLPADSYMISGS-GVWCLAMRNQT-DGAMS 417
Query: 387 TLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
T GN QQ+ + YDV L F P CS
Sbjct: 418 TFGNYQQQNMHILYDVRNEMLSFAPAKCS 446
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 90/332 (27%), Positives = 136/332 (40%), Gaps = 70/332 (21%)
Query: 146 SLLLDTGSDVTWTQCKPCI--HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFG- 202
++++DT SD+ W QC PC C Q+DP + +KS TF IPC S +C+ L S+ G
Sbjct: 170 TVVVDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNGC 229
Query: 203 NCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGA- 261
+ + EC + + Y DG + G + TD +T+ F GC + G S
Sbjct: 230 SPTTDECKYIVNYGDGKATTGTYVTDTLTMSPT-----IVVKDFRFGCSHAVRGSFSNQN 284
Query: 262 SGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDTVNTLRSAFHKRMKK 318
+GI+ L S++ +T +Y FSYC+P P S G+++ G +L+ ++ +K
Sbjct: 285 AGILALGGGRGSLLEQTADAYGNAFSYCIPKP-SSAGFLSLGG-PVEASLKFSYTPLIKN 342
Query: 319 YKKAK----GLEDLL---------------DTCYDLSAYETVVVPKIAIHFLGGVDLELD 359
LE ++ D A T + P++ +
Sbjct: 343 KHAPTFYIVHLEAIIVAGKQLAVPPTAFATGAVMDSGAVVTQLPPQVYAALRAAFRSAMA 402
Query: 360 VRGTLV-------------------VASVSQVCLGFATYPPDPNSITL------------ 388
G L V VS V G AT +P SI L
Sbjct: 403 AYGPLAAPVRNLDTCYDFTRFPDVKVPKVSLVFAGGATLDLEPASIILDGCLAFAATPGE 462
Query: 389 ------GNVQQRGHEVHYDVAGRRLGFGPGNC 414
GNVQQ+ +EV YDV G ++GF G C
Sbjct: 463 ESVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 56/167 (33%), Positives = 88/167 (52%), Gaps = 19/167 (11%)
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
V +G ++++DT S++TW QC PC C Q+ P F S S ++ +PC+S SC
Sbjct: 144 VATVGLGGGEATVIVDTASELTWVQCAPCESCHDQQGPLFDPSSSPSYAAVPCDSPSCDA 203
Query: 195 LRESFPFGN------CNS---KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
L++ G C++ C + + Y DGS S G A DR+++ +G
Sbjct: 204 LQQQLATGAGAGAPPCDAGRPAACSYALSYRDGSYSRGVLAHDRLSLAGEVIDG------ 257
Query: 246 FLLGCINNSSGDK-SGASGIMGLDRSPVSIITRTNTSY---FSYCLP 288
F+ GC ++ G G SG+MGL RS +S++++T + FSYCLP
Sbjct: 258 FVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTVDQFGGVFSYCLP 304
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 63/111 (56%), Gaps = 3/111 (2%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL- 364
N +R+ F ++ +Y +A G +LDTC++++ + V VP + + F GG ++E+D G L
Sbjct: 388 NAVRAEFMSQLAEYPQAPGFS-ILDTCFNMTGLKEVQVPSLTLVFDGGAEVEVDSGGVLY 446
Query: 365 -VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V + SQVCL A+ + + +GN QQ+ V +D + ++GF C
Sbjct: 447 FVSSDSSQVCLAVASLKSEDETSIIGNYQQKNLRVVFDTSASQVGFAQETC 497
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 68/217 (31%), Positives = 111/217 (51%), Gaps = 21/217 (9%)
Query: 87 EILRQDQQRLHLKN-----SRRLRKPFPEFLKRTEAFTFPANINDTVAD---EYYIVVAI 138
+++ +D + L N + RL + F F+ +EA P V+ EY + ++I
Sbjct: 38 DLIHRDSPKSPLYNPSETPAERLDRFFRRFMSFSEASISPNTPEPPVSSNNGEYLMKISI 97
Query: 139 GEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRES 198
G P V + DTGSD+ WTQC PC+ C++Q++P F SKS +F ++ C S CR+L
Sbjct: 98 GTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCESQQCRLLDTV 157
Query: 199 FPFGNCN--SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSG 256
+C+ K C F+ Y DGS + G AT+ +T+ +NS + + GC +N+SG
Sbjct: 158 ----SCSQPQKLCDFSYGYGDGSLAQGVIATETLTLN-SNSGQPXSIXNIVFGCGHNNSG 212
Query: 257 D-KSGASGIMGLDRSPVSIITRTNTSY-----FSYCL 287
G+ G P+S+ ++ ++ FS CL
Sbjct: 213 TFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCL 249
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 53/175 (30%), Positives = 90/175 (51%), Gaps = 14/175 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ + +G P + +++D+GSD+ W QC+PC C+ Q DP F + S ++ + C ST
Sbjct: 133 EYFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSSYAGVSCAST 192
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C + + C+ C + + Y DGS + G A + +T G +GC
Sbjct: 193 VCSHVDNA----GCHEGRCRYEVSYGDGSYTKGTLALETLTF------GRTLIRNVAIGC 242
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS-PYGSTGYITFGK 301
+++ G GA+G++GL P+S + + FSYCL S S+G + FG+
Sbjct: 243 GHHNQGMFVGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRGIQSSGLLQFGR 297
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 55/114 (48%), Gaps = 4/114 (3%)
Query: 302 TDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVR 361
T R AF + +A G+ + DTCYDL + +V VP ++ +F GG L L R
Sbjct: 363 TAAYEAFRDAFIAQTTNLPRASGVS-IFDTCYDLFGFVSVRVPTVSFYFSGGPILTLPAR 421
Query: 362 GTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L+ V V C FA P +GN+QQ G E+ D A +GFGP C
Sbjct: 422 NFLIPVDDVGSFCFAFA--PSSSGLSIIGNIQQEGIEISVDGANGFVGFGPNVC 473
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 95/393 (24%), Positives = 154/393 (39%), Gaps = 81/393 (20%)
Query: 86 EEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYV 145
E +R+D R+ + + + +F A + + V Y + +++G P
Sbjct: 44 SEAVRRDSHRIAFLSDATAAG---KATTTNSSVSFQALLENGVGG-YNMNISVGTPLLTF 99
Query: 146 SLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCN 205
S++ DTGSD+ WTQC PC CFQQ P F + S TF K+PC S+ C+ L S CN
Sbjct: 100 SVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSFCQFLPNSIR--TCN 157
Query: 206 SKECPFNIQYADGSGSGGFWATDRITIQEAN----------SNGY------FTRYPFLLG 249
+ C +N +Y G + G+ AT+ + + +A+ NG R+ + L
Sbjct: 158 ATGCVYNYKYGSGY-TAGYLATETLKVGDASFPSVAFGCSTENGLGQLDLGVGRFSYCL- 215
Query: 250 CINNSSGDKSGASGIM----------GLDRSPVSIITRTNTSYF----------SYCLPS 289
SG +GAS I+ + +P + SY+ LP
Sbjct: 216 ----RSGSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPV 271
Query: 290 PYGSTGY----------------ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCY 333
+ G+ +T+ D ++ AF + G LD C+
Sbjct: 272 TTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRG-LDLCF 330
Query: 334 D--LSAYETVVVPKIAIHFLGGVD---------LELDVRGTLVVASVSQVCLGFATYPPD 382
+ VP + + F GG + +E D +G++ VA CL D
Sbjct: 331 KSTGGGGGGIAVPSLVLRFDGGAEYAVPTYFAGVETDSQGSVTVA-----CLMMLPAKGD 385
Query: 383 PNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+GNV Q + YD+ G F P +C+
Sbjct: 386 QPMSVIGNVMQMDMHLLYDLDGGIFSFAPADCA 418
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 64/184 (34%), Positives = 94/184 (51%), Gaps = 13/184 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + +G P +YV ++LDTGSD+ W QC PCI C+ Q DP F +KS++F IPC S
Sbjct: 144 EYFTRLGVGTPARYVYMVLDTGSDIVWIQCAPCIKCYSQTDPVFDPTKSRSFANIPCGSP 203
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
CR R +P + + C + + Y DGS + G ++T+ +T + +LGC
Sbjct: 204 LCR--RLDYPGCSTKKQICLYQVSYGDGSFTVGEFSTETLTFRGTRVG------RVVLGC 255
Query: 251 INNSSG---DKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGST--GYITFGKTDTV 305
+++ G +G G+ S S I R S FSYCL S+ I FG +
Sbjct: 256 GHDNEGLFVGAAGLLGLGRGRLSFPSQIGRRFNSKFSYCLGDRSASSRPSSIVFGDSAIS 315
Query: 306 NTLR 309
T R
Sbjct: 316 RTTR 319
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 54/109 (49%), Gaps = 5/109 (4%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-V 366
LR AF K+A L DTC+DLS V VP + +HF G D+ L L+ V
Sbjct: 384 LRDAFLVGASNLKRAPEFS-LFDTCFDLSGKTEVKVPTVVLHFRG-ADVPLPASNYLIPV 441
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ C FA + I GN+QQ+G V YD+A R+GF P C+
Sbjct: 442 DNSGSFCFAFAGTASGLSII--GNIQQQGFRVVYDLATSRVGFAPRGCA 488
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 75/227 (33%), Positives = 107/227 (47%), Gaps = 22/227 (9%)
Query: 89 LRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDT-VADEYYIVVAIGEPKQYVSL 147
LR+D +H N+R+L L + T A D+ A EY + +AIG P
Sbjct: 57 LRRD---MHRHNARKLA------LAASSGATVSAPTQDSPTAGEYLMALAIGTPPLPYQA 107
Query: 148 LLDTGSDVTWTQCKPCI-HCFQQRDPFFYASKSKTFFKIPCNS--TSCRILRESFPFGNC 204
+ DTGSD+ WTQC PC CF+Q P + S S TF +PCNS + C
Sbjct: 108 IADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPP 167
Query: 205 NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLGCINNSSG-DKSGAS 262
C +N+ Y G S F ++ T + R P GC SSG + S AS
Sbjct: 168 PGCACTYNVTYGSGWTS-VFQGSETFTFGSTPAG--HARVPGIAFGCSTASSGFNASSAS 224
Query: 263 GIMGLDRSPVSIITRTNTSYFSYCLPSPY---GSTGYITFGKTDTVN 306
G++GL R +S++++ FSYCL +PY ST + G + ++N
Sbjct: 225 GLVGLGRGRLSLVSQLGVPKFSYCL-TPYQDTNSTSTLLLGPSASLN 270
Score = 38.1 bits (87), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 37/89 (41%), Gaps = 4/89 (4%)
Query: 329 LDTCYDLSAYETV--VVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSI 386
LD C+ L + + +P + +HF G D+ L ++ CL D
Sbjct: 364 LDLCFMLPSSTSAPPAMPSMTLHF-NGADMVLPADSYMMSDDSGLWCLAMQNQT-DGEVN 421
Query: 387 TLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
LGN QQ+ + YD+ L F P CS
Sbjct: 422 ILGNYQQQNMHILYDIGQETLSFAPAKCS 450
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 58/157 (36%), Positives = 89/157 (56%), Gaps = 9/157 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + +G P + + L+LDTGSDV W QC+PC C+QQ DP F + S T+ + C++
Sbjct: 161 EYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAP 220
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C +L S C S +C + + Y DGS + G ATD +T ++G LGC
Sbjct: 221 QCSLLETS----ACRSNKCLYQVSYGDGSFTVGELATDTVTF---GNSGKINNVA--LGC 271
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+++ G +GA+G++GL +SI + + FSYCL
Sbjct: 272 GHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCL 308
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 59/119 (49%), Gaps = 3/119 (2%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T +T N+LR AF K KK L DTCYD S+ TV VP +A HF GG L
Sbjct: 384 VTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSL 443
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+L + L+ V C FA P + +GNVQQ+G + YD++ +G C
Sbjct: 444 DLPAKNYLIPVDDSGTFCFAFA--PTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500
>gi|298204765|emb|CBI25263.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 74/240 (30%), Positives = 106/240 (44%), Gaps = 32/240 (13%)
Query: 202 GNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKS 259
G C S C + I Y DGS + G +++ G F+ GC N+ G
Sbjct: 124 GVCGSAAPICNYAINYGDGSFTRGELGHEKLKF------GTILVKDFIFGCGRNNKGLFG 177
Query: 260 GASGIMGLDRSPVSIITRTNTS------YF---------SYCLPSPYGSTGYITFGKTDT 304
G SG+MGL RS +S+I++T+ + YF L +P I
Sbjct: 178 GVSGLMGLGRSDLSLISQTSENPQLYNFYFINLTGISIGGVALQAPSVGPSRILVDSGTV 237
Query: 305 VNTLRSAFHKRMKK--YKKAKGLE-----DLLDTCYDLSAYETVVVPKIAIHFLGGVDLE 357
+ L +K +K K+ G +LDTC++LSAY+ V +P I +HF G +L
Sbjct: 238 ITRLPPTIYKALKAEFLKQFTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELT 297
Query: 358 LDVRGT--LVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+DV G V + SQVCL A+ LGN QQ+ V YD ++GF CS
Sbjct: 298 VDVTGVFYFVKSDASQVCLALASLEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETCS 357
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 72/128 (56%), Gaps = 12/128 (9%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + +G P +YV ++LDTGSD+ W QC PC C+ Q DP F KSKT+ IPC+S
Sbjct: 141 EYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSP 200
Query: 191 SCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
CR L + CN+ K C + + Y DGS + G ++T+ +T + G L
Sbjct: 201 HCRRLDSA----GCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKG------VAL 250
Query: 249 GCINNSSG 256
GC +++ G
Sbjct: 251 GCGHDNEG 258
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 42/109 (38%), Positives = 57/109 (52%), Gaps = 5/109 (4%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-V 366
+R AF K K+A L DTC+DLS V VP + +HF G D+ L L+ V
Sbjct: 381 MRDAFRVGAKTLKRAPDFS-LFDTCFDLSNMNEVKVPTVVLHFRG-ADVSLPATNYLIPV 438
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ + C FA + I GN+QQ+G V YD+A R+GF PG C+
Sbjct: 439 DTNGKFCFAFAGTMGGLSII--GNIQQQGFRVVYDLASSRVGFAPGGCA 485
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 56/157 (35%), Positives = 87/157 (55%), Gaps = 9/157 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V +G P + ++LDTGSD+ W QC+PC C+QQ DP F + S T+ + C S
Sbjct: 160 EYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTASSTYAPVTCQSQ 219
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C L S +C S +C + + Y DGS + G +AT+ ++ + S LGC
Sbjct: 220 QCSSLEMS----SCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSGSVKNVA-----LGC 270
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+++ G GA+G++GL P+S+ + + FSYCL
Sbjct: 271 GHDNEGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCL 307
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/120 (40%), Positives = 62/120 (51%), Gaps = 6/120 (5%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
IT +T N LR AF RM + K L DTCYDLS +V VP ++ HF G
Sbjct: 383 ITRLQTQAYNPLRDAF-VRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSW 441
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSITL-GNVQQRGHEVHYDVAGRRLGFGPGNC 414
L L+ V S C FA P +S+++ GNVQQ+G V +D+A R+GF P C
Sbjct: 442 NLPAANYLIPVDSAGTYCFAFA---PTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 498
>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
Length = 405
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 65/176 (36%), Positives = 85/176 (48%), Gaps = 19/176 (10%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y IG P Q VS ++D ++ WTQC PC CF+Q P F +KS TF +PC S
Sbjct: 57 YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGSHL 116
Query: 192 CRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCI 251
C + ES NC S C + G +GG TD I A F GC+
Sbjct: 117 CESIPESSR--NCTSDVCIYEAPTKAGD-TGGMAGTDTFAIGAAKETLGF-------GCV 166
Query: 252 NNSSGDK-----SGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFGKT 302
+ DK G SGI+GL R+P S++T+ N + FSYCL S+G + G T
Sbjct: 167 VMT--DKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGK--SSGALFLGAT 218
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 72/128 (56%), Gaps = 12/128 (9%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + +G P +YV ++LDTGSD+ W QC PC C+ Q DP F KSKT+ IPC+S
Sbjct: 141 EYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSP 200
Query: 191 SCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
CR L + CN+ K C + + Y DGS + G ++T+ +T + G L
Sbjct: 201 HCRRLDSA----GCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKG------VAL 250
Query: 249 GCINNSSG 256
GC +++ G
Sbjct: 251 GCGHDNEG 258
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 42/109 (38%), Positives = 57/109 (52%), Gaps = 5/109 (4%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-V 366
+R AF K K+A L DTC+DLS V VP + +HF G D+ L L+ V
Sbjct: 381 MRDAFRVGAKALKRAPDFS-LFDTCFDLSNMNEVKVPTVVLHFRG-ADVSLPATNYLIPV 438
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ + C FA + I GN+QQ+G V YD+A R+GF PG C+
Sbjct: 439 DTNGKFCFAFAGTMGGLSII--GNIQQQGFRVVYDLASSRVGFAPGGCA 485
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 102/360 (28%), Positives = 139/360 (38%), Gaps = 94/360 (26%)
Query: 126 DTVADE--YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
DT+ D Y + + +G P + +DTGSD+ WTQC PC +C+ Q P F S S TF
Sbjct: 53 DTLFDYNIYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFK 112
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
+ CN SC + I YAD + S G AT+ +TI + S F
Sbjct: 113 EKRCNGNSCH-----------------YKIIYADTTYSKGTLATETVTIH-STSGEPFVM 154
Query: 244 YPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFG 300
+GC +NSS K SG++GL P S+IT+ Y SYC S T I FG
Sbjct: 155 PETTIGCGHNSSWFKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQ--GTSKINFG 212
Query: 301 KT---------------------------------DT-VNTLRSAFH------------- 313
DT V T+ + FH
Sbjct: 213 TNAIVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTT 272
Query: 314 -------------KRMKKYKKAKGLEDLLDTCYDLSAYETVVV---PKIAIHFLGGVDLE 357
+ + Y A D T D+ Y T + P I +HF GG DL
Sbjct: 273 LTYFPVSYCNLVREAVDHYVTAVRTAD--PTGNDMLCYYTDTIDIFPVITMHFSGGADLV 330
Query: 358 LDVRGTLVVASVSQ--VCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
LD + + + ++++ CL P P GN Q V YD + + F P NCS
Sbjct: 331 LD-KYNMYIETITRGTFCLAIICNNP-PQDAIFGNRAQNNFLVGYDSSSLLVFFSPTNCS 388
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 72/128 (56%), Gaps = 12/128 (9%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + +G P +YV ++LDTGSD+ W QC PC C+ Q DP F KSKT+ IPC+S
Sbjct: 141 EYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSP 200
Query: 191 SCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
CR L + CN+ K C + + Y DGS + G ++T+ +T + G L
Sbjct: 201 HCRRLDSA----GCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKG------VAL 250
Query: 249 GCINNSSG 256
GC +++ G
Sbjct: 251 GCGHDNEG 258
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 56/109 (51%), Gaps = 5/109 (4%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-V 366
+R AF K K+A L DTC+DLS V VP + +HF D+ L L+ V
Sbjct: 381 MRDAFRVGAKTLKRAPNFS-LFDTCFDLSNMNEVKVPTVVLHFRR-ADVSLPATNYLIPV 438
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ + C FA + I GN+QQ+G V YD+A R+GF PG C+
Sbjct: 439 DTNGKFCFAFAGTMGGLSII--GNIQQQGFRVVYDLASSRVGFAPGGCA 485
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 58/157 (36%), Positives = 89/157 (56%), Gaps = 9/157 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + +G P + + L+LDTGSDV W QC+PC C+QQ DP F + S T+ + C++
Sbjct: 161 EYFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAP 220
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C +L S C S +C + + Y DGS + G ATD +T ++G LGC
Sbjct: 221 QCSLLETS----ACRSNKCLYQVSYGDGSFTVGELATDTVTF---GNSGKINNVA--LGC 271
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+++ G +GA+G++GL +SI + + FSYCL
Sbjct: 272 GHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCL 308
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 59/119 (49%), Gaps = 3/119 (2%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T +T N+LR AF K KK L DTCYD S+ TV VP +A HF GG L
Sbjct: 384 VTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSL 443
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+L + L+ V C FA P + +GNVQQ+G + YD++ +G C
Sbjct: 444 DLPAKNYLIPVDDSGTFCFAFA--PTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 53/142 (37%), Positives = 77/142 (54%), Gaps = 7/142 (4%)
Query: 149 LDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKE 208
+DTGSD+ WTQC PC+ C Q P+F KS T+ +PC S+ C L +C K
Sbjct: 1 MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSP----SCFKKM 56
Query: 209 CPFNIQYADGSGSGGFWATDRITIQEANSNGY-FTRYPFLLGCINNSSGDKSGASGIMGL 267
C + Y D + + G A + T ANS T F GC + ++GD + +SG++G
Sbjct: 57 CVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAF--GCGSLNAGDLANSSGMVGF 114
Query: 268 DRSPVSIITRTNTSYFSYCLPS 289
R P+S++++ S FSYCL S
Sbjct: 115 GRGPLSLVSQLGPSRFSYCLTS 136
>gi|125553836|gb|EAY99441.1| hypothetical protein OsI_21409 [Oryza sativa Indica Group]
Length = 376
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 87/312 (27%), Positives = 134/312 (42%), Gaps = 36/312 (11%)
Query: 16 CSSNNGAYADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDK-----ASLEVVSKYGP 70
CSS A+ D + +++ SS+ P C+ + A P A L +VS GP
Sbjct: 15 CSSTLVAHGGDAEAGAYMLIATSSMKPKASCSGHKVA-PSNEASLNSTWAPLHLVS--GP 71
Query: 71 CSRL------NQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFL-------KRTEA 117
CS N S+ ++L DQ R+ R + + T+
Sbjct: 72 CSPAYSRGTDNSSTDDDVTSIAKMLDADQHRVAYIQKRLAGGDTSNGVAGASWDGQTTDV 131
Query: 118 FTFPANINDTVADEYYIVVAIGEPKQYV--SLLLDTGSDVTWTQCKPC--IHCFQQRDPF 173
T+ N V + A + V ++++D+GSDV W QC+PC + C QRDP
Sbjct: 132 GTYLPASNVGVGAKMIGTTAAPDGTSAVRQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPL 191
Query: 174 FYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQ 233
F + S T+ +PC+S +C L + G + +C F Y DG+ + G +++D +T+
Sbjct: 192 FDPATSTTYSAVPCSSAACARLGP-YRRGCSANVQCQFGFTYTDGATATGTYSSDDLTLG 250
Query: 234 EANSNGYFTRYPFLLGCINNSSGDKSG--ASGIMGLDRSPVSIITRTNTSY---FSYCLP 288
Y FL GC + G SG + L S + +T T Y FSYC+P
Sbjct: 251 P-----YDVVRGFLFGCAHADRGSTFSFDVSGTLALGGGAQSFVQQTATQYGRVFSYCIP 305
Query: 289 SPYGSTGYITFG 300
S G+IT G
Sbjct: 306 PSPSSLGFITLG 317
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 74/238 (31%), Positives = 109/238 (45%), Gaps = 32/238 (13%)
Query: 89 LRQDQQRLH----LKNSRRLRKPFPEFLKRTEAFTFPANIND----------TVADEYYI 134
+R + R+H + S+ +R + R A A+ +D TV E+ +
Sbjct: 28 VRVELTRVHADPSVTASQFVRAALHRDMHRHNARKLAASSSDGTVSAPVSPTTVPGEFLM 87
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPCI-HCFQQRDPFFYASKSKTFFKIPCNSTSCR 193
+AIG P + DTGSD+ WTQC PC CFQQ P + S S TF +PCNS+
Sbjct: 88 TLAIGTPPLPFLAIADTGSDLIWTQCAPCSRQCFQQPTPLYNPSSSTTFSALPCNSS--- 144
Query: 194 ILRESFPFGNCNSK-ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCIN 252
G C C +N+ Y G + F T+ T + GC N
Sbjct: 145 -------LGLCAPACACMYNMTYGSG-WTYVFQGTETFTFGSSTPADQVRVPGIAFGCSN 196
Query: 253 NSSG-DKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPY---GSTGYITFGKTDTVN 306
SSG + S ASG++GL R +S++++ FSYCL +PY ST + G + ++N
Sbjct: 197 ASSGFNASSASGLVGLGRGSLSLVSQLGAPKFSYCL-TPYQDTNSTSTLLLGPSASLN 253
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 60/157 (38%), Positives = 86/157 (54%), Gaps = 10/157 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V IG+P L+LDTGSDV W QC PC C+QQ DP F + S +F + CN+
Sbjct: 148 EYFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQADPIFEPASSASFSTLSCNTR 207
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
CR L S C + C + + Y DGS + G + T+ IT+ A + +GC
Sbjct: 208 QCRSLDVS----ECRNDTCLYEVSYGDGSYTVGDFVTETITLGSAPVDN------VAIGC 257
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+N+ G GA+G++GL +S ++ N + FSYCL
Sbjct: 258 GHNNEGLFVGAAGLLGLGGGSLSFPSQINATSFSYCL 294
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 49/120 (40%), Positives = 66/120 (55%), Gaps = 6/120 (5%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
IT +TD N+LR AF KR + G+ L DTCYDLS+ V VP ++ HF G +L
Sbjct: 369 ITRLQTDVYNSLRDAFVKRTRDLPSTNGIA-LFDTCYDLSSKGNVEVPTVSFHFPDGKEL 427
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSIT-LGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L + LV + S C FA P +S++ +GNVQQ+G V YD+ +GF P C
Sbjct: 428 PLPAKNYLVPLDSEGTFCFAFA---PTASSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 52/162 (32%), Positives = 83/162 (51%), Gaps = 11/162 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
+Y++ +G P Q SL++D+GSD+ W QC PC C+ Q P + S S TF +PC S+
Sbjct: 63 QYFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPCRQCYAQDSPLYVPSNSSTFSPVPCLSS 122
Query: 191 SCRIL--RESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
C ++ E FP C + YAD S S G +A + T+ +
Sbjct: 123 DCLLIPATEGFPCDFRYPGACAYEYLYADTSSSKGVFAYESATVDGVRID------KVAF 176
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
GC +++ G + A G++GL + P+S ++ +Y F+YCL
Sbjct: 177 GCGSDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCL 218
Score = 39.7 bits (91), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 27/98 (27%), Positives = 42/98 (42%), Gaps = 2/98 (2%)
Query: 319 YKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFAT 378
Y +A+ ++ L D C +L+ + P I F G + + V + + CL A
Sbjct: 319 YPRAESVQGL-DLCVELTGVDQPSFPSFTIEFDDGAVFQPEAENYFVDVAPNVRCLAMAG 377
Query: 379 YP-PDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
P T+GN+ Q+ V YD +GF P CS
Sbjct: 378 LASPLGGFNTIGNLLQQNFFVQYDREENLIGFAPAKCS 415
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/162 (38%), Positives = 92/162 (56%), Gaps = 9/162 (5%)
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCN 188
+ EY++ + +G P + ++LDTGSDV W QC PC C+ Q DP F +KSKTF +PC
Sbjct: 133 SGEYFMRLGVGTPATNMYMVLDTGSDVVWLQCSPCKVCYNQSDPVFNPAKSKTFATVPCG 192
Query: 189 STSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
S CR L +S + SK C + + Y DGS + G ++T+ +T A + L
Sbjct: 193 SRLCRRLDDSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLTFHGARVD------HVAL 246
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
GC +++ G GA+G++GL R +S ++T Y FSYCL
Sbjct: 247 GCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCL 288
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/103 (38%), Positives = 54/103 (52%), Gaps = 5/103 (4%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-V 366
LR AF + K+A L DTC+DLS TV VP + HF GG ++ L L+ V
Sbjct: 381 LRDAFRLGATRLKRAPSYS-LFDTCFDLSGMTTVKVPTVVFHFTGG-EVSLPASNYLIPV 438
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGF 409
+ + C FA + I GN+QQ+G V YD+ G R+GF
Sbjct: 439 NNQGRFCFAFAGTMGSLSII--GNIQQQGFRVAYDLVGSRVGF 479
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 102/196 (52%), Gaps = 17/196 (8%)
Query: 112 LKRTEAFT--FPANINDTVAD------EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC 163
+ R FT F N N V+ EY I ++G P V +DTGS++ W QC+PC
Sbjct: 61 INRVNYFTKEFSLNKNQPVSTLTPELGEYLISYSVGTPPFKVYGFMDTGSNIVWLQCQPC 120
Query: 164 IHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGG 223
CF Q P F SKS ++ IPC S++C+ ++ + C ++I Y + S G
Sbjct: 121 NTCFNQTSPIFNPSKSSSYKNIPCTSSTCKDTNDTHISCSNGGDVCEYSITYGGDAKSQG 180
Query: 224 FWATDRITIQEANSNGYFTRYP-FLLGCIN-NSSGDKSGASGIMGLDRSPVSIITRTNT- 280
+ D +T+ +++G +P ++GC + N D S +SG++G+ R P+S+I + +
Sbjct: 181 DLSNDSLTLD--STSGSSVLFPNIVIGCGHINVLQDNSQSSGVVGMGRGPMSLIKQVGSS 238
Query: 281 ---SYFSYCLPSPYGS 293
S FSYCL PY S
Sbjct: 239 SVGSKFSYCL-IPYNS 253
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 74/297 (24%), Positives = 132/297 (44%), Gaps = 33/297 (11%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
+Y +++G P + S++ DTGSD+ W QCKPC CF Q+DP F S ++ + C T
Sbjct: 39 DYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDT 98
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C +S P +C S +C ++ Y DGSG+ G +++ +T+ + GC
Sbjct: 99 LC----DSLPRKSC-SPDCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKN-IAFGC 152
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPY----GSTGYITFGKTD 303
+ + G + ASG++GL R +S +++ + FSYCL P+ T + FG
Sbjct: 153 GHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCL-VPWRDAPSKTSPMFFGDES 211
Query: 304 TVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGT 363
+ ++ H +E Y + + + + G D++ D G
Sbjct: 212 SSHSSGKKLHYAFTPMIHNPAMESF----YYVKLKDISIAGRALRIPAGSFDIKPDGSGG 267
Query: 364 LVVASVSQVCLGFATYPPDPNSITLGNVQQR-----------GHEVHYDVAGRRLGF 409
++ S + + L P P I L ++ + G ++ YDV+G + +
Sbjct: 268 MIFDSGTTLTL----LPDAPYQIVLRALRSKISFPKIDGSSAGLDLCYDVSGSKASY 320
Score = 47.0 bits (110), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 59/135 (43%), Gaps = 12/135 (8%)
Query: 290 PYGSTGYITFGKTDTVNTLRSAFHKRMKK-------YKKAKGLEDLLDTCYDLSAYET-- 340
P GS G I F T+ L A ++ + + + K G LD CYD+S +
Sbjct: 262 PDGSGGMI-FDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGSSAGLDLCYDVSGSKASY 320
Query: 341 -VVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVH 399
+ +P + HF G D +L V + A+ + + A + + GN+ Q+ V
Sbjct: 321 KMKIPAMVFHFEG-ADYQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMMQQNFRVM 379
Query: 400 YDVAGRRLGFGPGNC 414
YD+ ++G+ P C
Sbjct: 380 YDIGSSKIGWAPSQC 394
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 73/244 (29%), Positives = 116/244 (47%), Gaps = 16/244 (6%)
Query: 72 SRLNQGISTHAPS-LEEILRQDQQRLHLKNSRRLR-KPFPEFLKRTEAFTFPANINDTVA 129
+R++ T AP + + LR+D R ++ R R + E RT T A +
Sbjct: 51 TRIHSDPDTTAPQFVRDALRRDMHRQRSRSFGRDRDRELAESDGRT---TVSARTRKDLP 107
Query: 130 D--EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIP 186
+ EY + +AIG P + + DTGSD+ WTQC PC CF+Q P + + S TF +P
Sbjct: 108 NGGEYLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLP 167
Query: 187 CNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP- 245
CNS+ C +N Y G + G ++ T + ++ R P
Sbjct: 168 CNSSLSMCAGALAGAAPPPGCACMYNQTYGTG-WTAGVQGSETFTFGSSAADQ--ARVPG 224
Query: 246 FLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPY---GSTGYITFGKT 302
GC N SS D +G++G++GL R +S++++ FSYCL +P+ ST + G +
Sbjct: 225 VAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCL-TPFQDTNSTSTLLLGPS 283
Query: 303 DTVN 306
+N
Sbjct: 284 AALN 287
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 75/227 (33%), Positives = 107/227 (47%), Gaps = 22/227 (9%)
Query: 89 LRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANI-NDTVADEYYIVVAIGEPKQYVSL 147
LR+D +H N+R+L L + T A N A EY + +AIG P
Sbjct: 55 LRRD---MHRHNARKLA------LAASSGATVSAPTQNSPTAGEYLMALAIGTPPLPYQA 105
Query: 148 LLDTGSDVTWTQCKPCI-HCFQQRDPFFYASKSKTFFKIPCNS--TSCRILRESFPFGNC 204
+ DTGSD+ WTQC PC CF+Q P + S S TF +PCNS + C
Sbjct: 106 IADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPP 165
Query: 205 NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLGCINNSSG-DKSGAS 262
C +N+ Y G S F ++ T + +R P GC SSG + S AS
Sbjct: 166 PGCACTYNVTYGSGWTS-VFQGSETFTFGSTPAGQ--SRVPGIAFGCSTASSGFNASSAS 222
Query: 263 GIMGLDRSPVSIITRTNTSYFSYCLPSPY---GSTGYITFGKTDTVN 306
G++GL R +S++++ FSYCL +PY ST + G + ++N
Sbjct: 223 GLVGLGRGRLSLVSQLGVPKFSYCL-TPYQDTNSTSTLLLGPSASLN 268
Score = 38.1 bits (87), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 37/89 (41%), Gaps = 4/89 (4%)
Query: 329 LDTCYDLSAYETV--VVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSI 386
LD C+ L + + +P + +HF G D+ L ++ CL D
Sbjct: 362 LDLCFMLPSSTSAPPAMPSMTLHF-NGADMVLPADSYMMSDDSGLWCLAMQNQT-DGEVN 419
Query: 387 TLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
LGN QQ+ + YD+ L F P CS
Sbjct: 420 ILGNYQQQNMHILYDIGQETLSFAPAKCS 448
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 112/427 (26%), Positives = 173/427 (40%), Gaps = 100/427 (23%)
Query: 62 LEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFP 121
+++V P S + G + + +++ Q RL +L+ E +K EA +
Sbjct: 57 IDLVRTDSPLSPFSPGNISSTERFKRAIKRSQDRLE-----KLQMSVDE-VKAVEAPVYA 110
Query: 122 ANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKT 181
N E+ + +AIG P S +LDTGSD+TWTQCKPC C+ Q P + S+S T
Sbjct: 111 GN------GEFLMKMAIGTPSLSFSAILDTGSDLTWTQCKPCTDCYPQPTPIYDPSQSST 164
Query: 182 FFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
+ K+PC+S+ C+ L P +C+ C + Y D S + G + + T+ +
Sbjct: 165 YSKVPCSSSMCQAL----PMYSCSGANCEYLYSYGDQSSTQGILSYESFTLTSQS----- 215
Query: 242 TRYPFL-LGC-INNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGS--- 293
P + GC N G S G++G R P+S+I++ S FSYCL S S
Sbjct: 216 --LPHIAFGCGQENEGGGFSQGGGLVGFGRGPLSLISQLGQSLGNKFSYCLVSITDSPSK 273
Query: 294 TGYITFGKTDTVN--TLRSA-----------FHKRMKKYKKAKGLEDLLDTCYDL----- 335
T + GKT ++N T+ S ++ ++ L D+ D +DL
Sbjct: 274 TSPLFIGKTASLNAKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTFDLQLDGT 333
Query: 336 ----------------SAYETV---VVPKIAI-----------------------HF--- 350
S Y+ V V+ I + HF
Sbjct: 334 GGVIIDSGTTVTYLEQSGYDVVKKAVISSINLPQVDGSNIGLDLCFEPQSGSSTSHFPTI 393
Query: 351 ---LGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRL 407
G D L + S CL A P + SI GN+QQ+ +++ YD L
Sbjct: 394 TFHFEGADFNLPKENYIYTDSSGIACL--AMLPSNGMSI-FGNIQQQNYQILYDNERNVL 450
Query: 408 GFGPGNC 414
F P C
Sbjct: 451 SFAPTVC 457
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 57/157 (36%), Positives = 87/157 (55%), Gaps = 9/157 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + +G P + + L+LDTGSDV W QC+PC C+QQ DP F + S T+ + C++
Sbjct: 161 EYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCSDCYQQSDPVFNPTSSSTYKSLTCSAP 220
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C +L S C S +C + + Y DGS + G ATD +T + LGC
Sbjct: 221 QCSLLETS----ACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKIN-----DVALGC 271
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+++ G +GA+G++GL +SI + + FSYCL
Sbjct: 272 GHDNEGLFTGAAGLLGLGGGALSITNQMKATSFSYCL 308
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 60/119 (50%), Gaps = 3/119 (2%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T +T N+LR AF K KK L DTCYD S+ +V VP +A HF GG L
Sbjct: 384 VTRLQTQAYNSLRDAFLKLTTNLKKGTSSISLFDTCYDFSSLSSVKVPTVAFHFTGGKSL 443
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+L + L+ V C FA P + +GNVQQ+G + YD+A + +G C
Sbjct: 444 DLPAKNYLIPVDDNGTFCFAFA--PTSSSLSIIGNVQQQGTRITYDLANKIIGLSGNKC 500
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/178 (34%), Positives = 92/178 (51%), Gaps = 17/178 (9%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCI-HCFQQRDPFFYASKSKTFFKIPCNS 189
E+ + V G P Q +L +DTGSDV+W QC PC HC++Q DP F +KS T+ +PC
Sbjct: 160 EFVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGHCYKQHDPVFDPTKSATYSAVPCGH 219
Query: 190 TSCRILRESFPFGNC-NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FL 247
C G C NS C + + Y DGS + G + + +++ P F
Sbjct: 220 PQCAAAG-----GKCSNSGTCLYKVTYGDGSSTAGVLSHETLSLSSTRD------LPGFA 268
Query: 248 LGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKT 302
GC + G+ G G++GL R +S+ ++ ++ FSYCLPS + GY+T G T
Sbjct: 269 FGCGQTNLGEFGGVDGLVGLGRGALSLPSQAAATFGATFSYCLPSYDTTHGYLTMGST 326
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 39/121 (32%), Positives = 59/121 (48%), Gaps = 4/121 (3%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T+ + +LR F M +YK A D DTCYD + + + +P +A F G
Sbjct: 386 LTYLPPEAYASLRDRFKFTMTQYKPAPAY-DPFDTCYDFTGHNAIFMPAVAFKFSDGAVF 444
Query: 357 ELDVRGTLVVASVSQVCLGFATYPPDPNSI---TLGNVQQRGHEVHYDVAGRRLGFGPGN 413
+L L+ + G + P P+++ +GN QQRG EV YDVA ++GFG
Sbjct: 445 DLSPVAILIYPDDTAPATGCLAFVPRPSTMPFNIIGNTQQRGTEVIYDVAAEKIGFGQFT 504
Query: 414 C 414
C
Sbjct: 505 C 505
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 56/157 (35%), Positives = 87/157 (55%), Gaps = 9/157 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V +G P + ++LDTGSD+ W QC+PC C+QQ DP F + S T+ + C S
Sbjct: 19 EYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTASSTYAPVTCQSQ 78
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C L S +C S +C + + Y DGS + G +AT+ ++ + S LGC
Sbjct: 79 QCSSLEMS----SCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSGS-----VKNVALGC 129
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+++ G GA+G++GL P+S+ + + FSYCL
Sbjct: 130 GHDNEGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCL 166
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/120 (40%), Positives = 62/120 (51%), Gaps = 6/120 (5%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
IT +T N LR AF RM + K L DTCYDLS +V VP ++ HF G
Sbjct: 242 ITRLQTQAYNPLRDAF-VRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSW 300
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSITL-GNVQQRGHEVHYDVAGRRLGFGPGNC 414
L L+ V S C FA P +S+++ GNVQQ+G V +D+A R+GF P C
Sbjct: 301 NLPAANYLIPVDSAGTYCFAFA---PTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 357
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 57/157 (36%), Positives = 90/157 (57%), Gaps = 9/157 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ V IG P + +++DTGSDV W QCKPC C+QQ DP F + S +F ++ C +
Sbjct: 159 EYFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPCDDCYQQVDPIFDPASSSSFSRLGCQTP 218
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
CR L + F C + C + + Y DGS + G +AT+ ++ + S +GC
Sbjct: 219 QCRNL-DVFA---CRNDSCLYQVSYGDGSYTVGDFATETVSFGNSGSVDKVA-----IGC 269
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+++ G GA+G++GL P+S+ ++ S FSYCL
Sbjct: 270 GHDNEGLFVGAAGLIGLGGGPLSLTSQIKASSFSYCL 306
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 58/119 (48%), Gaps = 4/119 (3%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T +T N LR F K K G L DTCY+LS+ +V VP +A F GG L
Sbjct: 381 VTRLQTQAYNALRDTFVKLTKDLPSTSGFA-LFDTCYNLSSRTSVRVPTVAFLFDGGKSL 439
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L L+ V S CL FA P + +GNVQQ+G V YD+A ++ F C
Sbjct: 440 PLPPSNYLIPVDSAGTFCLAFA--PTTASLSIIGNVQQQGTRVTYDLANSQVSFSSRKC 496
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 74/297 (24%), Positives = 131/297 (44%), Gaps = 33/297 (11%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
+Y +++G P + S++ DTGSD+ W QCKPC CF Q+DP F S ++ + C T
Sbjct: 39 DYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDT 98
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C +S P +C S C ++ Y DGSG+ G +++ +T+ + GC
Sbjct: 99 LC----DSLPRKSC-SPNCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKN-IAFGC 152
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPY----GSTGYITFGKTD 303
+ + G + ASG++GL R +S +++ + FSYCL P+ T + FG
Sbjct: 153 GHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCL-VPWRDAPSKTSPMFFGDES 211
Query: 304 TVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGT 363
+ ++ H +E Y + + + + G D++ D G
Sbjct: 212 SSHSSGKKLHYAFTPMIHNPAMESF----YYVKLKDISIAGRALRIPAGSFDIKPDGSGG 267
Query: 364 LVVASVSQVCLGFATYPPDPNSITLGNVQQR-----------GHEVHYDVAGRRLGF 409
++ S + + L P P I L ++ + G ++ YDV+G + +
Sbjct: 268 MIFDSGTTLTL----LPDAPYQIVLRALRSKVSFPEIDGSSAGLDLCYDVSGSKASY 320
Score = 45.1 bits (105), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 35/135 (25%), Positives = 59/135 (43%), Gaps = 12/135 (8%)
Query: 290 PYGSTGYITFGKTDTVNTLRSAFHKRMKKYKKAK-------GLEDLLDTCYDLSAYETVV 342
P GS G I F T+ L A ++ + + ++K G LD CYD+S +
Sbjct: 262 PDGSGGMI-FDSGTTLTLLPDAPYQIVLRALRSKVSFPEIDGSSAGLDLCYDVSGSKASY 320
Query: 343 ---VPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVH 399
+P + HF G D +L V + A+ + + A + + GN+ Q+ V
Sbjct: 321 KKKIPAMVFHFEG-ADHQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMMQQNFRVM 379
Query: 400 YDVAGRRLGFGPGNC 414
YD+ ++G+ P C
Sbjct: 380 YDIGSSKIGWAPSQC 394
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 60/178 (33%), Positives = 92/178 (51%), Gaps = 14/178 (7%)
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRD--PFFYASKSKTFFKIP 186
A Y + +++G P +++DTGS++ W QC PC CF + P ++S TF ++P
Sbjct: 88 AGAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLP 147
Query: 187 CNSTSCRILRESFPFGNCNS-KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
CN + C+ L S CN+ C +N Y G + G+ AT+ +T+ +G F +
Sbjct: 148 CNGSFCQYLPTSSRPRTCNATAACAYNYTYGSGY-TAGYLATETLTV----GDGTFPKVA 202
Query: 246 FLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGY--ITFGK 301
F GC + D S SGI+GL R P+S++++ FSYCL S G I FG
Sbjct: 203 F--GCSTENGVDNS--SGIVGLGRGPLSLVSQLAVGRFSYCLRSDMADGGASPILFGS 256
Score = 44.7 bits (104), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 56/136 (41%), Gaps = 22/136 (16%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKK---AKGLEDLLDTCYDLSAY---ETVVVPKIAIHF 350
+T+ D ++ AF +M + A G LD CY SA + V VP++A+ F
Sbjct: 323 LTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALRF 382
Query: 351 LGGVD-----------LELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVH 399
GG +E D +G + VA CL D +GN+ Q +
Sbjct: 383 AGGAKYNVPVQNYFAGVEADSQGRVTVA-----CLLVLPATDDLPISIIGNLMQMDMHLL 437
Query: 400 YDVAGRRLGFGPGNCS 415
YD+ G F P +C+
Sbjct: 438 YDIDGGMFSFAPADCA 453
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 60/178 (33%), Positives = 92/178 (51%), Gaps = 14/178 (7%)
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRD--PFFYASKSKTFFKIP 186
A Y + +++G P +++DTGS++ W QC PC CF + P ++S TF ++P
Sbjct: 88 AGAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLP 147
Query: 187 CNSTSCRILRESFPFGNCNS-KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
CN + C+ L S CN+ C +N Y G + G+ AT+ +T+ +G F +
Sbjct: 148 CNGSFCQYLPTSSRPRTCNATAACAYNYTYGSGY-TAGYLATETLTV----GDGTFPKVA 202
Query: 246 FLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGY--ITFGK 301
F GC + D S SGI+GL R P+S++++ FSYCL S G I FG
Sbjct: 203 F--GCSTENGVDNS--SGIVGLGRGPLSLVSQLAVGRFSYCLRSDMADGGASPILFGS 256
Score = 44.7 bits (104), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 56/136 (41%), Gaps = 22/136 (16%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKK---AKGLEDLLDTCYDLSAY---ETVVVPKIAIHF 350
+T+ D ++ AF +M + A G LD CY SA + V VP++A+ F
Sbjct: 323 LTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALRF 382
Query: 351 LGGVD-----------LELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVH 399
GG +E D +G + VA CL D +GN+ Q +
Sbjct: 383 AGGAKYNVPVQNYFAGVEADSQGRVTVA-----CLLVLPATDDLPISIIGNLMQMDMHLL 437
Query: 400 YDVAGRRLGFGPGNCS 415
YD+ G F P +C+
Sbjct: 438 YDIDGGMFSFAPADCA 453
>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
Length = 405
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 65/176 (36%), Positives = 85/176 (48%), Gaps = 19/176 (10%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y IG P Q VS ++D ++ WTQC PC CF+Q P F +KS TF +PC S
Sbjct: 57 YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGSHL 116
Query: 192 CRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCI 251
C + ES NC S C + G +GG TD I A F GC+
Sbjct: 117 CESIPESSR--NCTSDVCIYEAPTKAGD-TGGKAGTDTFAIGAAKETLGF-------GCV 166
Query: 252 NNSSGDK-----SGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFGKT 302
+ DK G SGI+GL R+P S++T+ N + FSYCL S+G + G T
Sbjct: 167 VMT--DKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGK--SSGALFLGAT 218
>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 439
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 84/279 (30%), Positives = 123/279 (44%), Gaps = 71/279 (25%)
Query: 33 HIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQD 92
H VSSLLP N C + QG L + KYGPCS + PS +EI +D
Sbjct: 42 HSTPVSSLLPKNKCLASARGGSQG-----LPITQKYGPCSGSGH---SQPPSPQEIFGRD 93
Query: 93 QQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADE---YYIVVAIGEPKQYVSLLL 149
+ R+ NS+ + PE LK N+ + DE + + VA G P Q +L+L
Sbjct: 94 ESRVSFINSK-FNQYAPENLKDHTP-------NNKLFDEDGNFLVDVAFGTPPQNFTLIL 145
Query: 150 DTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKEC 209
DTGS +TWTQCK C + E
Sbjct: 146 DTGSSITWTQCKAC------------------------------------------TVEN 163
Query: 210 PFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD-KSGASGIMGLD 268
+N+ Y D S S G + D +T++ ++ F ++ F G N+ GD SG G++GL
Sbjct: 164 NYNMTYGDDSTSVGNYGCDTMTLEPSD---VFQKFQFGRG--RNNKGDFGSGVDGMLGLG 218
Query: 269 RSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDT 304
+ +S +++T + + FSYCLP S G + FG+ T
Sbjct: 219 QGQLSTVSQTASKFNKVFSYCLPEE-DSIGSLLFGEKAT 256
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/99 (38%), Positives = 57/99 (57%), Gaps = 5/99 (5%)
Query: 320 KKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATY 379
++ KG D+LDTCY+LS + V++P+I +HF GG D+ L+ + + S++CL FA
Sbjct: 342 RRKKG--DILDTCYNLSGRKDVLLPEIVLHFGGGADVRLNGTNIVWGSDESRLCLAFAGN 399
Query: 380 PP---DPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+P +GN QQ V YD+ G R+GF CS
Sbjct: 400 SKSTMNPELTIIGNRQQLSLTVLYDIQGGRIGFRSNGCS 438
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 56/157 (35%), Positives = 85/157 (54%), Gaps = 10/157 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ V IG+P ++LDTGSDV+W QC PC C+QQ DP F S ++ I C++
Sbjct: 148 EYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPVSSNSYSPIRCDAP 207
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C+ S C + C + + Y DGS + G +AT+ +T+ A +GC
Sbjct: 208 QCK----SLDLSECRNGTCLYEVSYGDGSYTVGEFATETVTLGTAAVEN------VAIGC 257
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+N+ G GA+G++GL +S + N + FSYCL
Sbjct: 258 GHNNEGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCL 294
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 49/120 (40%), Positives = 70/120 (58%), Gaps = 6/120 (5%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T +++ + LR AF K K KA G+ L DTCYDLS+ E+V VP ++ HF G +L
Sbjct: 369 VTRLRSEVYDALRDAFVKGAKGIPKANGVS-LFDTCYDLSSRESVQVPTVSFHFPEGREL 427
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSIT-LGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L R L+ V SV C FA P +S++ +GNVQQ+G V +D+A +GF +C
Sbjct: 428 PLPARNYLIPVDSVGTFCFAFA---PTTSSLSIMGNVQQQGTRVGFDIANSLVGFSADSC 484
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 67/230 (29%), Positives = 105/230 (45%), Gaps = 41/230 (17%)
Query: 81 HAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADE--YYIVVAI 138
H +++ I R+ + N++ P+ +TV D Y + + +
Sbjct: 28 HGFTMDLIHRRSNASSRVSNTQSGSSPYA----------------NTVFDNSVYLMKLQV 71
Query: 139 GEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRES 198
G P + ++DTGS++TWTQC PC+HC++Q P F SKS TF +E
Sbjct: 72 GTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTF-------------KEK 118
Query: 199 FPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDK 258
C+ CP+ + Y D + + G AT+ IT+ + S F ++GC +N+S K
Sbjct: 119 ----RCDGHSCPYEVDYFDHTYTMGTLATETITLH-STSGEPFVMPETIIGCGHNNSWFK 173
Query: 259 SGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDTV 305
SG++GL+ P S+IT+ Y SYC T I FG V
Sbjct: 174 PSFSGMVGLNWGPSSLITQMGGEYPGLMSYCFSGQ--GTSKINFGANAIV 221
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 52/121 (42%), Gaps = 7/121 (5%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGL-EDLLDTCYDLSAYETVVVPKIAIHFLGGVD 355
+T+ N +R A + + A D+L CY+ + + P I +HF GGVD
Sbjct: 277 LTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDML--CYNSDTID--IFPVITMHFSGGVD 332
Query: 356 LELDVRGTLVVASVSQV-CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L LD + ++ V CL P +I GN Q V YD + + F P NC
Sbjct: 333 LVLDKYNMYMESNNGGVFCLAIICNSPTQEAI-FGNRAQNNFLVGYDSSSLLVSFSPTNC 391
Query: 415 S 415
S
Sbjct: 392 S 392
>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
Japonica Group]
Length = 377
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 65/176 (36%), Positives = 85/176 (48%), Gaps = 19/176 (10%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y IG P Q VS ++D ++ WTQC PC CF+Q P F +KS TF +PC S
Sbjct: 57 YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGSHL 116
Query: 192 CRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCI 251
C + ES NC S C + G +GG TD I A F GC+
Sbjct: 117 CESIPESSR--NCTSDVCIYEAPTKAGD-TGGKAGTDTFAIGAAKETLGF-------GCV 166
Query: 252 NNSSGDK-----SGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFGKT 302
+ DK G SGI+GL R+P S++T+ N + FSYCL S+G + G T
Sbjct: 167 VMT--DKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGK--SSGALFLGAT 218
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 72/220 (32%), Positives = 104/220 (47%), Gaps = 19/220 (8%)
Query: 96 LHLKNSRRLRKPFPEFLKRTEAFTFPANINDT-VADEYYIVVAIGEPKQYVSLLLDTGSD 154
+H N+R+L L + T A D+ A EY + +AIG P + DTGSD
Sbjct: 1 MHRHNARKLA------LAASSGATVSAPTQDSPTAGEYLMALAIGTPPLPYQAIADTGSD 54
Query: 155 VTWTQCKPCI-HCFQQRDPFFYASKSKTFFKIPCNS--TSCRILRESFPFGNCNSKECPF 211
+ WTQC PC CF+Q P + S S TF +PCNS + C C +
Sbjct: 55 LIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPPPGCACTY 114
Query: 212 NIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL-LGCINNSSG-DKSGASGIMGLDR 269
N+ Y G S F ++ T + R P + GC SSG + S ASG++GL R
Sbjct: 115 NVTYGSGWTS-VFQGSETFTFGSTPAG--HARVPGIAFGCSTASSGFNASSASGLVGLGR 171
Query: 270 SPVSIITRTNTSYFSYCLPSPY---GSTGYITFGKTDTVN 306
+S++++ FSYCL +PY ST + G + ++N
Sbjct: 172 GRLSLVSQLGVPKFSYCL-TPYQDTNSTSTLLLGPSASLN 210
Score = 38.1 bits (87), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 37/89 (41%), Gaps = 4/89 (4%)
Query: 329 LDTCYDLSAYETV--VVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSI 386
LD C+ L + + +P + +HF G D+ L ++ CL D
Sbjct: 304 LDLCFMLPSSTSAPPAMPSMTLHF-NGADMVLPADSYMMSDDSGLWCLAMQNQT-DGEVN 361
Query: 387 TLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
LGN QQ+ + YD+ L F P CS
Sbjct: 362 ILGNYQQQNMHILYDIGQETLSFAPAKCS 390
>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 452
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 61/189 (32%), Positives = 96/189 (50%), Gaps = 18/189 (9%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCR 193
+ VA+G P Q V+++LDTGS+++W C H D F AS S ++ +PC+S +C
Sbjct: 65 VPVAVGTPPQNVTMVLDTGSELSWLLCNGSRH-----DAPFDASASSSYAPVPCSSPACT 119
Query: 194 ILRESFPFGN-CNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCIN 252
L P C+S C ++ YAD S + G A D + + P L GCI
Sbjct: 120 WLGRDLPVRPFCDSSACRVSLSYADASSADGLLAADTFLLGS-------SPMPALFGCIT 172
Query: 253 --NSSGDKSGA--SGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFGKTDTVNTL 308
+SS D S +G++G++R +S +T+T T F+YC+ + G G + G DT L
Sbjct: 173 SYSSSTDPSETPPTGLLGMNRGGLSFVTQTATRRFAYCIAAGQGP-GILLLGGNDTETPL 231
Query: 309 RSAFHKRMK 317
S +++
Sbjct: 232 TSPPQQQLN 240
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 64/183 (34%), Positives = 98/183 (53%), Gaps = 9/183 (4%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY I ++G P + ++DTGSD+ W QCKPC C+ Q F SKS T+ +P +ST
Sbjct: 85 EYLISYSVGIPPFQLYGIIDTGSDMIWLQCKPCEKCYNQTTRIFDPSKSNTYKILPFSST 144
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
+C+ + ++ + N K C + I Y DGS S G + + +T+ N + R ++GC
Sbjct: 145 TCQSVEDT-SCSSDNRKMCEYTIYYGDGSYSQGDLSVETLTLGSTNGSSVKFRRT-VIGC 202
Query: 251 -INNSSGDKSGASGIMGLDRSPVSIIT---RTNTSY---FSYCLPSPYGSTGYITFGKTD 303
NN+ + +SGI+GL PVS+I R ++S FSYCL S + + FG
Sbjct: 203 GRNNTVSFEGKSSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLASMSNISSKLNFGDAA 262
Query: 304 TVN 306
V+
Sbjct: 263 VVS 265
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 74/229 (32%), Positives = 111/229 (48%), Gaps = 21/229 (9%)
Query: 85 LEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQY 144
+ + LR+D R H + +R L RT A P + EY + +AIG P
Sbjct: 48 VRDALRRDMHR-HARFTRELASSG----DRTVAA--PTRKDLPNGGEYIMTLAIGTPPLS 100
Query: 145 VSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNS--TSCRILRESFPF 201
+ DTGSD+ WTQC PC CF+Q + S S TF +PCNS + C L P
Sbjct: 101 YPAIADTGSDLIWTQCAPCGSQCFKQAGQPYNPSSSTTFGVLPCNSSVSMCAALAGPSPP 160
Query: 202 GNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL-LGCINNSSGDKSG 260
C+ C +N Y G + G + + T ++ TR P + GC N SS D +G
Sbjct: 161 PGCS---CMYNQTYGTG-WTAGIQSVETFTFGSTPADQ--TRVPGIAFGCSNASSDDWNG 214
Query: 261 ASGIMGLDRSPVSIITRTNTSYFSYCLPSPY---GSTGYITFGKTDTVN 306
++G++GL R +S++++ FSYCL +P+ ST + G + +N
Sbjct: 215 SAGLVGLGRGSMSLVSQLGAGMFSYCL-TPFQDANSTSTLLLGPSAALN 262
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 64/185 (34%), Positives = 89/185 (48%), Gaps = 15/185 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + +G P +YV ++LDTGSDV W QC PC C+ Q DP F +KS+T+ IPC +
Sbjct: 128 EYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPVFDPTKSRTYAGIPCGAP 187
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
CR R P N +K C + + Y DGS + G ++T+ +T + TR LGC
Sbjct: 188 LCR--RLDSPGCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRTR----VTRVA--LGC 239
Query: 251 INNSSG----DKSGASGIMGLDRSPVSIITRTNTSYFSYCL--PSPYGSTGYITFGKTDT 304
+++ G G PV R N FSYCL S + FG +
Sbjct: 240 GHDNEGLFIGAAGLLGLGRGRLSFPVQTGRRFNQK-FSYCLVDRSASAKPSSVVFGDSAV 298
Query: 305 VNTLR 309
T R
Sbjct: 299 SRTAR 303
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 41/108 (37%), Positives = 54/108 (50%), Gaps = 5/108 (4%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-V 366
LR AF K+A L DTC+DLS V VP + +HF G D+ L L+ V
Sbjct: 368 LRDAFRVGASHLKRAAEFS-LFDTCFDLSGLTEVKVPTVVLHFRGA-DVSLPATNYLIPV 425
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ C FA + I GN+QQ+G V +D+AG R+GF P C
Sbjct: 426 DNSGSFCFAFAGTMSGLSII--GNIQQQGFRVSFDLAGSRVGFAPRGC 471
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 79/239 (33%), Positives = 117/239 (48%), Gaps = 23/239 (9%)
Query: 81 HAPSLEEILRQDQQRLHLKNSR-RLRKPFPEFL-----KRTEAFTFPANINDTVADEYYI 134
H S + + + ++ R +K R RL++ L EA P N E+ +
Sbjct: 46 HVDSGKNLTKLERIRHGVKRGRNRLQRLQAMALVASSSSEIEAPVLPGN------GEFLM 99
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
+AIG P + S +LDTGSD+ WTQCKPC CF Q P F KS +F K+ C+S C
Sbjct: 100 KLAIGTPPETYSAILDTGSDLIWTQCKPCTQCFHQSTPIFDPKKSSSFSKLSCSSQLC-- 157
Query: 195 LRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNS 254
E+ P +CN+ C + Y D S + G A++ +T +A+ F G N
Sbjct: 158 --EALPQSSCNNG-CEYLYSYGDYSSTQGILASETLTFGKAS----VPNVAFGCGADNEG 210
Query: 255 SGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGS-TGYITFGKTDTVNTLRSAF 312
SG GA G++GL R P+S++++ FSYCL + + T + G +VN SA
Sbjct: 211 SGFSQGA-GLVGLGRGPLSLVSQLKEPKFSYCLTTVDDTKTSTLLMGSLASVNASSSAI 268
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 54/131 (41%), Gaps = 12/131 (9%)
Query: 292 GSTGYITFGKTDTVNTLRSAFHKRMKKYKKAKGL------EDLLDTCYDLSAYET-VVVP 344
GS G I T SAF+ K++ L LD C+ L + T + VP
Sbjct: 310 GSGGLIIDSGTTITYLEESAFNLVAKEFTAKINLPVDSSGSTGLDVCFTLPSGSTNIEVP 369
Query: 345 KIAIHFLGGVDLELDVRGTLVV-ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVA 403
K+ HF G DLEL ++ +S+ CL + GNVQQ+ V +D+
Sbjct: 370 KLVFHF-DGADLELPAENYMIGDSSMGVACLAMGS---SSGMSIFGNVQQQNMLVLHDLE 425
Query: 404 GRRLGFGPGNC 414
L F P C
Sbjct: 426 KETLSFLPTQC 436
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 60/179 (33%), Positives = 87/179 (48%), Gaps = 9/179 (5%)
Query: 111 FLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQR 170
F T + PA + A EY + +AIG P L DTGSD+TWTQC+PC CF Q
Sbjct: 73 FTMSTSSDAGPARLRSGQA-EYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQD 131
Query: 171 DPFFYASKSKTFFKIPCNSTSCRILRESFPFGNC--NSKECPFNIQYADGSGSGGFWATD 228
P + + S +F +PC S +C + S NC +S C + Y DG+ S G T+
Sbjct: 132 TPIYDTAVSSSFSPVPCASATCLPIWSSR---NCTASSSPCRYRYAYGDGAYSAGVLGTE 188
Query: 229 RITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+T A + GC ++ G ++G +GL R +S++ + FSYCL
Sbjct: 189 TLTFPGAPG---VSVGGIAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCL 244
Score = 38.9 bits (89), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 36/73 (49%), Gaps = 2/73 (2%)
Query: 343 VPKIAIHFLGGVDLELDVRGTLVV-ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYD 401
+P + +HF GG D+ L + S CL A P SI LGN QQ+ ++ +D
Sbjct: 373 MPDMVLHFAGGADMRLHRDNYMSFNQEESSFCLNIAGSPSADVSI-LGNFQQQNIQMLFD 431
Query: 402 VAGRRLGFGPGNC 414
+ +L F P +C
Sbjct: 432 ITVGQLSFMPTDC 444
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 68/205 (33%), Positives = 96/205 (46%), Gaps = 17/205 (8%)
Query: 90 RQDQQRLHLKN-SRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLL 148
R+ QR+ L++ +R R+ T+ + T EY + +AIG P Q V L
Sbjct: 42 RELMQRMALRSKARAARRLSSSASAPVSPGTYDNGVPTT---EYLVHLAIGTPPQPVQLT 98
Query: 149 LDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNS-- 206
LDTGSD+ WTQC+PC CF Q P+F S S T C+ST C + P +C S
Sbjct: 99 LDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTLC----QGLPVASCGSPK 154
Query: 207 ----KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGAS 262
+ C + Y D S + GF D+ T A ++ F G NN KS +
Sbjct: 155 FWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGAS--VPGVAFGCGLFNNGV-FKSNET 211
Query: 263 GIMGLDRSPVSIITRTNTSYFSYCL 287
GI G R P+S+ ++ FS+C
Sbjct: 212 GIAGFGRGPLSLPSQLKVGNFSHCF 236
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 53/164 (32%), Positives = 84/164 (51%), Gaps = 11/164 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
E+ + +++G P + ++DTGSD+ WTQCKPC+ CF Q P F + S T+ +PC+S
Sbjct: 115 EFLMDLSVGTPALPYAAIVDTGSDLVWTQCKPCVECFNQTTPVFDPAASSTYAALPCSSA 174
Query: 191 SCRILRESFPFGNCNSKECP----FNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPF 246
C L S + +S + Y D S + G AT+ T+ G
Sbjct: 175 LCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTLARQKVPG------V 228
Query: 247 LLGCINNSSGDK-SGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
GC + + GD + +G++GL R P+S++++ FSYCL S
Sbjct: 229 AFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGIDRFSYCLTS 272
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 55/125 (44%), Gaps = 10/125 (8%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYET-----VVVPKIAIHFL 351
IT+ + LR AF M E LD C+ A V VPK+ +HF
Sbjct: 353 ITYLELRAYRALRKAFVAHMS-LPTVDASEIGLDLCFQGPAGAVDQDVQVQVPKLVLHFD 411
Query: 352 GGVDLELDVRGTLVVASVS-QVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFG 410
GG DL+L +V+ S S +CL T +GN QQ+ + YDVAG L F
Sbjct: 412 GGADLDLPAENYMVLDSASGALCL---TVMASRGLSIIGNFQQQNFQFVYDVAGDTLSFA 468
Query: 411 PGNCS 415
P C+
Sbjct: 469 PAECN 473
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 56/157 (35%), Positives = 84/157 (53%), Gaps = 10/157 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ V IG+P ++LDTGSDV+W QC PC C+QQ DP F S ++ I C+
Sbjct: 148 EYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPISSNSYSPIRCDEP 207
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C+ S C + C + + Y DGS + G +AT+ +T+ A +GC
Sbjct: 208 QCK----SLDLSECRNGTCLYEVSYGDGSYTVGEFATETVTLGSAAVEN------VAIGC 257
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+N+ G GA+G++GL +S + N + FSYCL
Sbjct: 258 GHNNEGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCL 294
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 47/120 (39%), Positives = 69/120 (57%), Gaps = 6/120 (5%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T +++ + LR AF K K KA G+ L DTCYDLS+ E+V +P ++ F G +L
Sbjct: 369 VTRLRSEVYDALRDAFVKGAKGIPKANGVS-LFDTCYDLSSRESVEIPTVSFRFPEGREL 427
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSIT-LGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L R L+ V SV C FA P +S++ +GNVQQ+G V +D+A +GF +C
Sbjct: 428 PLPARNYLIPVDSVGTFCFAFA---PTTSSLSIIGNVQQQGTRVGFDIANSLVGFSVDSC 484
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 56/157 (35%), Positives = 86/157 (54%), Gaps = 10/157 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V IG+P V ++LDTGSDV W QC PC C+ Q DP F + S ++ + C++
Sbjct: 143 EYFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQADPIFEPASSTSYSPLSCDTK 202
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C +S C + C + + Y DGS + G + T+ IT+ A+ + +GC
Sbjct: 203 QC----QSLDVSECRNNTCLYEVSYGDGSYTVGDFVTETITLGSASVDN------VAIGC 252
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+N+ G GA+G++GL +S ++ N S FSYCL
Sbjct: 253 GHNNEGLFIGAAGLLGLGGGKLSFPSQINASSFSYCL 289
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 56/119 (47%), Gaps = 4/119 (3%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T +T N LR AF K K + L DTCYDLS +V VP + H GG L
Sbjct: 364 VTRLQTAAYNALRDAFVKGTKDLPVTSEVA-LFDTCYDLSRKTSVEVPTVTFHLAGGKVL 422
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L L+ V S C FA P +GNVQQ+G V +D+A +GF P C
Sbjct: 423 PLPATNYLIPVDSDGTFCFAFA--PTSSALSIIGNVQQQGTRVGFDLANSLVGFEPRQC 479
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 68/205 (33%), Positives = 96/205 (46%), Gaps = 17/205 (8%)
Query: 90 RQDQQRLHLKN-SRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLL 148
R+ QR+ L++ +R R+ T+ + T EY + +AIG P Q V L
Sbjct: 42 RELMQRMALRSKARAARRLSSSASAPVSPGTYDNGVPTT---EYLVHLAIGTPPQPVQLT 98
Query: 149 LDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNS-- 206
LDTGSD+ WTQC+PC CF Q P+F S S T C+ST C + P +C S
Sbjct: 99 LDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTLC----QGLPVASCGSPK 154
Query: 207 ----KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGAS 262
+ C + Y D S + GF D+ T A ++ F G NN KS +
Sbjct: 155 FWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGAS--VPGVAFGCGLFNNGV-FKSNET 211
Query: 263 GIMGLDRSPVSIITRTNTSYFSYCL 287
GI G R P+S+ ++ FS+C
Sbjct: 212 GIAGFGRGPLSLPSQLKVGNFSHCF 236
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 61/158 (38%), Positives = 83/158 (52%), Gaps = 8/158 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY I +AIG P +LDTGSD+ WTQCKPC C++Q P F KS +F K+ C S+
Sbjct: 107 EYLIELAIGTPPVSYPAVLDTGSDLIWTQCKPCTRCYKQPTPIFDPKKSSSFSKVSCGSS 166
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C L P C S C + Y D S + G AT+ T S + + GC
Sbjct: 167 LCSAL----PSSTC-SDGCEYVYSYGDYSMTQGVLATETFTF--GKSKNKVSVHNIGFGC 219
Query: 251 INNSSGDK-SGASGIMGLDRSPVSIITRTNTSYFSYCL 287
++ GD ASG++GL R P+S++++ FSYCL
Sbjct: 220 GEDNEGDGFEQASGLVGLGRGPLSLVSQLKEQRFSYCL 257
Score = 41.6 bits (96), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 29/88 (32%), Positives = 42/88 (47%), Gaps = 6/88 (6%)
Query: 329 LDTCYDLSAYETVV-VPKIAIHFLGGVDLELDVRGTLVVAS-VSQVCLGFATYPPDPNSI 386
LD C+ L + T V +PK+ HF GG DLEL ++ S + CL
Sbjct: 367 LDLCFSLPSGSTQVEIPKLVFHFKGG-DLELPAENYMIGDSNLGVACLAMGA---SSGMS 422
Query: 387 TLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
GNVQQ+ V++D+ + F P +C
Sbjct: 423 IFGNVQQQNILVNHDLEKETISFVPTSC 450
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 68/205 (33%), Positives = 96/205 (46%), Gaps = 17/205 (8%)
Query: 90 RQDQQRLHLKN-SRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLL 148
R+ QR+ L++ +R R+ T+ + T EY + +AIG P Q V L
Sbjct: 42 RELMQRMALRSKARAARRLSSSASAPVSPGTYDNGVPTT---EYLVHLAIGTPPQPVQLT 98
Query: 149 LDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNS-- 206
LDTGSD+ WTQC+PC CF Q P+F S S T C+ST C + P +C S
Sbjct: 99 LDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTLC----QGLPVASCGSPK 154
Query: 207 ----KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGAS 262
+ C + Y D S + GF D+ T A ++ F G NN KS +
Sbjct: 155 FWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGAS--VPGVAFGCGLFNNGV-FKSNET 211
Query: 263 GIMGLDRSPVSIITRTNTSYFSYCL 287
GI G R P+S+ ++ FS+C
Sbjct: 212 GIAGFGRGPLSLPSQLKVGNFSHCF 236
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 89/157 (56%), Gaps = 9/157 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V +G P + ++LDTGSD+ W QC+PC C+QQ DP F + S ++ + C+S
Sbjct: 158 EYFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSDPIFTPAASSSYSPLTCDSQ 217
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C L+ S +C + +C + + Y DGS + G + T+ ++ G T LGC
Sbjct: 218 QCNSLQMS----SCRNGQCRYQVNYGDGSFTFGDFVTETMSF-----GGSGTVNSIALGC 268
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+++ G GA+G++GL P+S+ ++ + FSYCL
Sbjct: 269 GHDNEGLFVGAAGLLGLGGGPLSLTSQLKATSFSYCL 305
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 44/120 (36%), Positives = 65/120 (54%), Gaps = 6/120 (5%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
IT +++ N+LR +F + + G+ L DTCYDLS +V VP ++ HF GG
Sbjct: 380 ITRLQSEAYNSLRDSFVSMSRHLRSTSGVA-LFDTCYDLSGQSSVKVPTVSFHFDGGKSW 438
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSIT-LGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+L L+ V S C FA P +S++ +GNVQQ+G V +D+A R+GF C
Sbjct: 439 DLPAANYLIPVDSAGTYCFAFA---PTTSSLSIIGNVQQQGTRVSFDLANNRVGFSTNKC 495
>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 418
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 94/356 (26%), Positives = 136/356 (38%), Gaps = 92/356 (25%)
Query: 138 IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRE 197
IG P Q S ++D ++ WTQC C CF+Q P F + S TF PC + +C+
Sbjct: 73 IGTPPQPASAIIDVAGELVWTQCSMCSRCFKQDLPLFVPNASSTFRPEPCGTDACK---- 128
Query: 198 SFPFGNCNSKECPFN--IQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSS 255
S P NC+S C + I G + G ATD I A ++ F GC+ S
Sbjct: 129 SIPTSNCSSNMCTYEGTINSKLGGHTLGIVATDTFAIGTATASLGF-------GCVVASG 181
Query: 256 GDKSGA-SGIMGLDRSPVSIITRTNTSYFSYCLPSPY----------GSTGYITFGKTDT 304
D G SG++GL R+P S++++ N + FSYCL +P+ GS+ + G T
Sbjct: 182 IDTMGGPSGLIGLGRAPSSLVSQMNITKFSYCL-TPHDSGKNSRLLLGSSAKLAGGGNST 240
Query: 305 -----------------------------------------VNTLR-------SAFHKRM 316
V TL SA+
Sbjct: 241 TTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIALPPSGNTVLVQTLAPMSFLVDSAYQALK 300
Query: 317 KKYKKAKGLEDL------LDTCYDLSAYETVVVPKIAIHFLGGV--------DLELDV-- 360
K+ KA G D C+ + P + F G +DV
Sbjct: 301 KEVTKAVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTFQQGAAALTVPPPKYLIDVGE 360
Query: 361 -RGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+GT+ +A +S L T D N LG++QQ D+ + L F P +CS
Sbjct: 361 EKGTVCMAILSTSWLN--TTALDENLNILGSLQQENTHFLLDLEKKTLSFEPADCS 414
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 64/186 (34%), Positives = 94/186 (50%), Gaps = 17/186 (9%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + +G P +YV ++LDTGSDV W QC PC C+ Q DP F +KS++F IPC S
Sbjct: 146 EYFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSRSFANIPCGSP 205
Query: 191 SCRILRESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
CR L C++K+ C + + Y DGS + G ++T+ +T + L
Sbjct: 206 LCRRLDSP----GCSTKKHICLYQVSYGDGSFTYGEFSTETLTFRGTRVG------RVAL 255
Query: 249 GCINNSSG---DKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGST--GYITFGKTD 303
GC +++ G +G G+ S S I R + FSYCL S+ Y+ FG +
Sbjct: 256 GCGHDNEGLFIGAAGLLGLGRGRLSFPSQIGRRFSRKFSYCLVDRSASSKPSYMVFGDSA 315
Query: 304 TVNTLR 309
T R
Sbjct: 316 ISRTAR 321
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/109 (36%), Positives = 54/109 (49%), Gaps = 5/109 (4%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-V 366
LR AF K+A L DTC+DLS V VP + +HF G D+ L L+ V
Sbjct: 386 LRDAFRVGASNLKRAPEFS-LFDTCFDLSGKTEVKVPTVVLHFRG-ADVSLPASNYLIPV 443
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ C FA + + GN+QQ+G V YD+A R+GF P C+
Sbjct: 444 DNSGSFCFAFAGTMSGLSIV--GNIQQQGFRVVYDLAASRVGFAPRGCA 490
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 68/184 (36%), Positives = 98/184 (53%), Gaps = 13/184 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + +G P +Y+ ++LDTGSDV W QCKPC C+ Q D F SKSK+F IPC S
Sbjct: 129 EYFTRLGVGTPPKYLYMVLDTGSDVVWLQCKPCTKCYSQTDQIFDPSKSKSFAGIPCYSP 188
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
CR L P + + C + + Y DGS + G ++T+ +T + A +GC
Sbjct: 189 LCRRLDS--PGCSLKNNLCQYQVSYGDGSFTFGDFSTETLTFRRAAVPR------VAIGC 240
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGST--GYITFGKTDTV 305
+++ G GA+G++GL R +S T+T T + FSYCL S I FG +
Sbjct: 241 GHDNEGLFVGAAGLLGLGRGGLSFPTQTGTRFNNKFSYCLTDRTASAKPSSIVFGDSAVS 300
Query: 306 NTLR 309
T R
Sbjct: 301 RTAR 304
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 43/110 (39%), Positives = 56/110 (50%), Gaps = 5/110 (4%)
Query: 307 TLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV- 365
+LR AF K+A L DTCYDLS V VP + +HF G D+ L LV
Sbjct: 368 SLRDAFRVGASHLKRAPEFS-LFDTCYDLSGLSEVKVPTVVLHFRGA-DVSLPAANYLVP 425
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
V + C FA + I GN+QQ+G V +D+AG R+GF P C+
Sbjct: 426 VDNSGSFCFAFAGTMSGLSII--GNIQQQGFRVVFDLAGSRVGFAPRGCA 473
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 53/150 (35%), Positives = 80/150 (53%), Gaps = 23/150 (15%)
Query: 89 LRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVAD---EYYIVVAIGEPKQYV 145
L +D R+H NSR A F +++ ++ EY+ + +G P +Y+
Sbjct: 78 LHRDTLRVHALNSR--------------AAGFSSSVVSGLSQGSGEYFTRLGVGTPPRYL 123
Query: 146 SLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCN 205
++LDTGSDV W QC PC C+ Q DP F KSK+F IPC+S CR L S C+
Sbjct: 124 YMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPYKSKSFAGIPCSSPLCRRLDSS----GCS 179
Query: 206 SKE--CPFNIQYADGSGSGGFWATDRITIQ 233
++ C + + Y DGS + G +AT+ +T +
Sbjct: 180 TRRHTCLYQVSYGDGSFTTGDFATETLTFR 209
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 55/111 (49%), Gaps = 5/111 (4%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
LR AF + K+ L DTCYDLS +V VP + +HF G D+ L L+
Sbjct: 347 TALRDAFRVGARHLKRGPEFS-LFDTCYDLSGQSSVKVPTVVLHFRG-ADMALPATNYLI 404
Query: 366 -VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
V C FA + I GN+QQ+G V YD+AG R+GF P C+
Sbjct: 405 PVDENGSFCFAFAGTISGLSII--GNIQQQGFRVVYDLAGSRIGFAPRGCT 453
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 90/301 (29%), Positives = 131/301 (43%), Gaps = 43/301 (14%)
Query: 27 NDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAP--S 84
++ ++ + V+ SS P VC R + P + + +GPCS S AP S
Sbjct: 33 DEANYYYFVAASS--PNPVCQGHRVSPPLS-GGGWVPLSRPHGPCSS-----SMDAPPSS 84
Query: 85 LEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYI--VVAIGEPK 142
+ E LR DQ R R+L P + + V + V GEP
Sbjct: 85 VAETLRWDQHRAGYIQ-RKLEDQVPITRSVITQVSHQGVVQPKVGTQGQGTGVQPAGEPV 143
Query: 143 QYV----------SLLLDTGSDVTWTQCKPCI--HCFQQRDPFFYASKSKTFFKIPCNST 190
++++DT SDV W QC PC HC Q D + SKS + PC+S
Sbjct: 144 GDAPTGGSGGVAQTMVIDTASDVPWVQCAPCPAPHCHAQTDVLYDPSKSSSSAAFPCSSP 203
Query: 191 SCRILRESFPFGN-CN--SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
+CR L P+ N C +C + +QY DGS S G + +D +T+ A + + F
Sbjct: 204 ACRNLG---PYANGCTPAGDQCQYRVQYPDGSASAGTYISDVLTLNPAKPASAISEFRF- 259
Query: 248 LGCIN-----NSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITF 299
GC + S +K+ SGIM L R S+ T+T +Y FSYCLP +G+
Sbjct: 260 -GCSHALLQPGSFSNKT--SGIMALGRGAQSLPTQTKATYGDVFSYCLPPTPVHSGFFIL 316
Query: 300 G 300
G
Sbjct: 317 G 317
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 45/114 (39%), Positives = 57/114 (50%), Gaps = 12/114 (10%)
Query: 307 TLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYET-----VVVPKIAIHFLG-GVDLELDV 360
LR+AF M+ Y+ A E L DTCYD S V +PKI + F G +ELD
Sbjct: 384 ALRAAFVAEMRAYRAAAPKEHL-DTCYDFSGAAPGGGGGVKLPKITLVFDGPNGAVELDP 442
Query: 361 RGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
G L+ CL FA D + +GNVQQ+ EV Y+V G +GF G C
Sbjct: 443 SGVLLDG-----CLAFAPNTDDQMTGIIGNVQQQALEVLYNVDGATVGFRRGAC 491
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 69/233 (29%), Positives = 113/233 (48%), Gaps = 20/233 (8%)
Query: 78 ISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLK--RTEAFTFPANI---NDTVADEY 132
+ H + +++D R+ RRL P +K R + F ++ + + EY
Sbjct: 85 VHGHRRGFNDRMKRDAIRVATL-VRRLSHGAPAAVKDSRYKVANFATDVISGMEAGSGEY 143
Query: 133 YIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSC 192
++ + +G P + +++D+GSD+ W QCKPC C+QQ DP F + S +F + C S C
Sbjct: 144 FVRIGVGSPPRNQYMVIDSGSDIVWVQCKPCSRCYQQSDPVFDPADSSSFAGVSCGSDVC 203
Query: 193 RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCIN 252
L + CN+ C + + Y DGS + G A + +T+ G +GC +
Sbjct: 204 DRLENT----GCNAGRCRYEVSYGDGSYTKGTLALETLTV------GQVMIRDVAIGCGH 253
Query: 253 NSSGDKSGASGIMGLDRSPVSIITR---TNTSYFSYCLPS-PYGSTGYITFGK 301
+ G GA+G++GL +S I + FSYCL S GSTG + FG+
Sbjct: 254 TNQGMFIGAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGALEFGR 306
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 60/109 (55%), Gaps = 6/109 (5%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-V 366
R +F + +A G+ + DTCYDL+ +E+V VP ++ +F G L L R L+ V
Sbjct: 378 FRDSFTAQTSNLPRAPGVS-IFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPARNFLIPV 436
Query: 367 ASVSQVCLGFATYPPDPNSIT-LGNVQQRGHEVHYDVAGRRLGFGPGNC 414
CL FA P P+ ++ +GN+QQ G ++ +D A +GFGP C
Sbjct: 437 DGGGTFCLAFA---PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC 482
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 63/160 (39%), Positives = 90/160 (56%), Gaps = 9/160 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ + +G P V ++LDTGSDV W QC PC C+ Q D F KSKTF +PC S
Sbjct: 137 EYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQSDVIFDPKKSKTFATVPCGSR 196
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
CR L +S SK C + + Y DGS + G ++T+ +T A + P LGC
Sbjct: 197 LCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVD----HVP--LGC 250
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
+++ G GA+G++GL R +S ++T + Y FSYCL
Sbjct: 251 GHDNEGLFVGAAGLLGLGRGGLSFPSQTKSRYNGKFSYCL 290
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 54/103 (52%), Gaps = 5/103 (4%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-V 366
LR AF K K+A L DTC+DLS TV VP + HF GG ++ L L+ V
Sbjct: 383 LRDAFRLGATKLKRAPSYS-LFDTCFDLSGMTTVKVPTVVFHF-GGGEVSLPASNYLIPV 440
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGF 409
+ + C FA + I GN+QQ+G V YD+ G R+GF
Sbjct: 441 NTEGRFCFAFAGTMGSLSII--GNIQQQGFRVAYDLVGSRVGF 481
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 63/160 (39%), Positives = 89/160 (55%), Gaps = 9/160 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ + +G P V ++LDTGSDV W QC PC C+ Q D F KSKTF +PC S
Sbjct: 134 EYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSR 193
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
CR L +S SK C + + Y DGS + G ++T+ +T A + P LGC
Sbjct: 194 LCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVD----HVP--LGC 247
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
+++ G GA+G++GL R +S ++T Y FSYCL
Sbjct: 248 GHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCL 287
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 54/103 (52%), Gaps = 5/103 (4%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-V 366
LR AF K K+A L DTC+DLS TV VP + HF GG ++ L L+ V
Sbjct: 380 LRDAFRLGATKLKRAPSYS-LFDTCFDLSGMTTVKVPTVVFHF-GGGEVSLPASNYLIPV 437
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGF 409
+ + C FA + I GN+QQ+G V YD+ G R+GF
Sbjct: 438 NTEGRFCFAFAGTMGSLSII--GNIQQQGFRVAYDLVGSRVGF 478
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 125/482 (25%), Positives = 189/482 (39%), Gaps = 97/482 (20%)
Query: 16 CSSNNGAYADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLN 75
CSS A ++ +V+ SSL P C R + PQ + + + + +GPCS L
Sbjct: 14 CSSPVALLAAAHEHDEYTLVAKSSLKPKATCTGYRVSPPQ--NITWVPLNAPHGPCSPLP 71
Query: 76 QGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANIN---------- 125
+ APSL +L DQ R+ R P L F N N
Sbjct: 72 ---GSAAPSLAALLLHDQLRVDGIERRLSDNPHDSKLVPAGGEDFQTNGNLLQVNYGNSG 128
Query: 126 ---DTVADEYYIVVAIGE--------PKQYVSLLLDTGSDVTWTQCKPCI--HCFQQRDP 172
+ A + +V A P +++LD+ SDV W QC PC C Q D
Sbjct: 129 QPMSSEAQQSGVVNASAAGGGSRSKLPGVIQTVVLDSASDVPWVQCVPCPIPPCHPQVDS 188
Query: 173 FFYASKSKTFFKIPCNSTSCRILRESFPFGN-CNSKECPFNIQYADGSGSGGFWATDRIT 231
F+ S+S + C+S +C L P+ N C + +C + ++Y DGS + G + D +T
Sbjct: 189 FYDPSRSPSSAPFSCSSPTCTALG---PYANGCANNQCQYLVRYPDGSSTSGAYIADLLT 245
Query: 232 IQEANSNGYFTRYPFLLGCINNSSGD-KSGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
+ N+ F GC + G + A+GIM L P S++++T + Y FSYC+
Sbjct: 246 LDAGNAVSGFK-----FGCSHAEQGSFDARAAGIMALGGGPESLLSQTASRYGNAFSYCI 300
Query: 288 PSPYGSTGYITFG----------------------------KTDTVNTLR----SAFHKR 315
P+ +G+ T G +T TV R A
Sbjct: 301 PATASDSGFFTLGVPRRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAA 360
Query: 316 MKKYKKAKGLEDLLDTCYDL--SAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVC 373
+ L T Y SA+ + + + G +D D G + + + ++
Sbjct: 361 GSVLDSRTAITRLPPTAYQALRSAFRSSMTMYRSAPPKGYLDTCYDFTGVVNI-RLPKIS 419
Query: 374 LGF---ATYPPDPNSI------------------TLGNVQQRGHEVHYDVAGRRLGFGPG 412
L F A P DP+ I LG+VQQ+ EV YDV G +GF G
Sbjct: 420 LVFDRNAVLPLDPSGILFNDCLAFTSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQG 479
Query: 413 NC 414
C
Sbjct: 480 AC 481
>gi|14532550|gb|AAK64003.1| AT3g61820/F15G16_210 [Arabidopsis thaliana]
Length = 362
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 63/160 (39%), Positives = 89/160 (55%), Gaps = 9/160 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ + +G P V ++LDTGSDV W QC PC C+ Q D F KSKTF +PC S
Sbjct: 134 EYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSR 193
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
CR L +S SK C + + Y DGS + G ++T+ +T A + P LGC
Sbjct: 194 LCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVD----HVP--LGC 247
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
+++ G GA+G++GL R +S ++T Y FSYCL
Sbjct: 248 GHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCL 287
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 53/150 (35%), Positives = 83/150 (55%), Gaps = 12/150 (8%)
Query: 147 LLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESF---PFG 202
++LDTGS ++W QC+PC ++C Q DP + S SKT+ K+ C S C L+ + P
Sbjct: 1 MILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLC 60
Query: 203 NCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGAS 262
+S C + Y D S S G+ + D +T+ + + FT GC ++ G A+
Sbjct: 61 ETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQTLPQFT-----YGCGQDNQGLFGRAA 115
Query: 263 GIMGLDRSPVSIITRTNTSY---FSYCLPS 289
GI+GL R +S++ + +T Y FSYCLP+
Sbjct: 116 GIIGLARDKLSMLAQLSTKYGHAFSYCLPT 145
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 48/131 (36%), Positives = 62/131 (47%), Gaps = 2/131 (1%)
Query: 285 YCLPSPYGSTGYITFGKTDTVNTLRSAFHKRMK-KYKKAKGLEDLLDTCYDLSAYETVVV 343
Y +P+ S IT LR AF K M KY KA +LDTC+ S V
Sbjct: 202 YRVPTLIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYS-ILDTCFKGSLKSISAV 260
Query: 344 PKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVA 403
P+I + F GG DL L L+ A CL FA +GN QQ+ + + YDV+
Sbjct: 261 PEIKMIFQGGADLTLRAPSILIEADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVS 320
Query: 404 GRRLGFGPGNC 414
R+GF PG+C
Sbjct: 321 TSRIGFAPGSC 331
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 102/433 (23%), Positives = 171/433 (39%), Gaps = 102/433 (23%)
Query: 61 SLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTF 120
S+E++ S +H + ++ R+H N F+F
Sbjct: 27 SVELIHPISSKSPFYNTAESHFQRMSNNMKHSTNRVHYLN---------------HVFSF 71
Query: 121 PAN------INDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFF 174
P N ++ + D Y I IG P + ++DT +D W QC PC CF P F
Sbjct: 72 PPNKVPNIVVSPFMGDGYIISFLIGTPPFQLYGVMDTANDNIWFQCNPCKPCFNTTSPMF 131
Query: 175 YASKSKTFFKIPCNSTSCRILRESFPFGNCNS---KECPFNIQYADGSGSGGFWATDRIT 231
SKS T+ IPC+S C+ + + +C+S K C ++ Y + S G + D +T
Sbjct: 132 DPSKSSTYKTIPCSSPKCKNVENT----HCSSDDKKVCEYSFTYGGEAYSQGDLSIDTLT 187
Query: 232 IQEANSNGYFTRYPFLLGCINNSSGDKSG-ASGIMGLDRSPVSIITRTNTSY---FSYCL 287
+ +N++ + ++GC + + G G SG +GL R P+S I++ N+S FSYCL
Sbjct: 188 LN-SNNDTPISFKNIVIGCGHRNKGPLEGYVSGNIGLGRGPLSFISQLNSSIGGKFSYCL 246
Query: 288 P---SPYGSTGYITFGK---------------------TDTVNTLRSAFHKRMKKYKKAK 323
S G +G + FG + T+N L H + K++ +
Sbjct: 247 VPLFSNEGISGKLHFGDKSVVSGVGTVSTPITAGEIGYSTTLNALSVGDH--IIKFENST 304
Query: 324 GLED-----LLDTCYDL--------SAYETVVVPKIAI---------------------- 348
D ++D+ L S E++V + +
Sbjct: 305 SKNDNLGNTIIDSGTTLTILPENVYSRLESIVTSMVKLERAKSPNQQFKLCYKATLKNLD 364
Query: 349 ------HFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDV 402
HF G D+ L+ T VC F + P +I +GN+ Q+ V +D+
Sbjct: 365 VPIITAHF-NGADVHLNSLNTFYPIDHEVVCFAFVSVGNFPGTI-IGNIAQQNFLVGFDL 422
Query: 403 AGRRLGFGPGNCS 415
+ F P +C+
Sbjct: 423 QKNIISFKPTDCT 435
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 94/354 (26%), Positives = 138/354 (38%), Gaps = 77/354 (21%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + IG P++ L LDTGSDVTW QC PC C+ Q DP + S S ++ ++ C S
Sbjct: 11 EYFARMGIGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSA 70
Query: 191 SCRILR---------------------------ESFPFGNCNSKECPFNIQYADG-SGSG 222
C+ L ESF G NS NI + G S SG
Sbjct: 71 LCQALDYSACQGMGCSYRVVYGDSSASSGDLGIESFYLGP-NSSTAMRNIAFGCGHSNSG 129
Query: 223 GFWATDRITIQEANSNGYFTRYPFLLG-----CINNSSGDKSGASGIMGLDRSPVSIITR 277
F + + +F++ +G C+ + S + R+ + R
Sbjct: 130 LFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFAAR 189
Query: 278 ---------TNTSYFSYC---------LPSP---YGSTGYITFGKTDTVNT--------- 307
NT Y++ LP P + TG T G T
Sbjct: 190 FTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGNGTGGAILDSGTSVTRVVPPA 249
Query: 308 ---LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL 364
LR A+ + A G+ LLDTC++ TV +P + +HF GVD+ L L
Sbjct: 250 YAVLRDAYRAASRNLPPAPGVY-LLDTCFNFQGLPTVQIPSLVLHFDNGVDMVLPGGNIL 308
Query: 365 V-VASVSQVCLGFATYPPDPNSI---TLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ V CL FA P+S+ +GNVQQ+ + +D+ + P C
Sbjct: 309 IPVDRSGTFCLAFA-----PSSMPISVIGNVQQQTFRIGFDLQRSLIAIAPREC 357
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 52/162 (32%), Positives = 82/162 (50%), Gaps = 10/162 (6%)
Query: 127 TVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIP 186
T + + + +G P Q ++ D +D TW QC+PCI C+ Q D F S+S ++ +
Sbjct: 182 TGTSNFLVQIGVGGPPQKFYMIFDLQTDFTWLQCQPCIKCYDQPDSIFDPSQSSSYTLLS 241
Query: 187 CNSTSCRILRESFPFGNC-NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C + C +L P +C + C +NI Y DG+ + G + ++ + S+G+ R
Sbjct: 242 CETKHCNLL----PNSSCSDDGYCRYNITYKDGTNTEGVLINETVSFE---SSGWVDRVS 294
Query: 246 FLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
LGC N + G G+ G GL R +S +R N S SYCL
Sbjct: 295 --LGCSNKNQGPFVGSDGTFGLGRGSLSFPSRINASSMSYCL 334
Score = 47.0 bits (110), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 52/122 (42%), Gaps = 11/122 (9%)
Query: 290 PYGSTGYI-------TFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVV 342
PYG+ G I T + DT N +R AF + + ++ K DTCY+LS+ TV
Sbjct: 396 PYGNGGMIVSSSSLITMLENDTYNVVRDAFVAKTQHLERLKAFLQF-DTCYNLSSNNTVE 454
Query: 343 VPKIAIHFLGGVDLELDVRGTL-VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYD 401
+P + G L L V C FA P + LG +QQ G V +D
Sbjct: 455 LPILEFEVNDGKSWLLPKESYLYAVDKNGTFCFAFA--PSKGSFSILGTLQQYGTRVTFD 512
Query: 402 VA 403
+
Sbjct: 513 LV 514
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 72/244 (29%), Positives = 115/244 (47%), Gaps = 14/244 (5%)
Query: 72 SRLNQGISTHAPS-LEEILRQDQQRLHLKNSRRLR-KPFPEFLKRTEAFTFPANINDTVA 129
+R++ T AP + + LR+D R ++ R R + E RT T A +
Sbjct: 51 TRIHSDPDTTAPQFVRDALRRDMHRQRSRSFGRDRDRELAESDGRTST-TVSARTRKDLP 109
Query: 130 D--EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIP 186
+ EY + +AIG P + + DTGSD+ WTQC PC CF+Q P + + S TF +P
Sbjct: 110 NGGEYLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLP 169
Query: 187 CNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP- 245
CNS+ C + Y G + G ++ T + ++ R P
Sbjct: 170 CNSSLSMCAGALAGAAPPPGCACMYYQTYGTG-WTAGVQGSETFTFGSSAADQ--ARVPG 226
Query: 246 FLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPY---GSTGYITFGKT 302
GC N SS D +G++G++GL R +S++++ FSYCL +P+ ST + G +
Sbjct: 227 VAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCL-TPFQDTNSTSTLLLGPS 285
Query: 303 DTVN 306
+N
Sbjct: 286 AALN 289
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 56/157 (35%), Positives = 84/157 (53%), Gaps = 9/157 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + +G P + + ++LDTGSDV W QC PC C+QQ DP F + S TF + C+
Sbjct: 163 EYFSRIGVGTPAKEMYVVLDTGSDVNWIQCLPCSECYQQSDPIFDPTSSSTFKSLTCSDP 222
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C S C S +C + + Y DGS + G +ATD +T E+ LGC
Sbjct: 223 KC----ASLDVSACRSNKCLYQVSYGDGSFTVGNYATDTVTFGESGKVN-----DVALGC 273
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+++ G +GA+G++GL +S+ + FSYCL
Sbjct: 274 GHDNEGLFTGAAGLLGLGGGALSMTNQIKAKSFSYCL 310
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 43/119 (36%), Positives = 58/119 (48%), Gaps = 3/119 (2%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T +T N+LR AF K +KK L DTCYD S+ TV VP + HF GG L
Sbjct: 386 VTRLQTQAYNSLRDAFVKLTTDFKKGTSPISLFDTCYDFSSLSTVKVPTVTFHFTGGKSL 445
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L + L+ + C FA P + +GNVQQ+G + YD+A +G C
Sbjct: 446 NLPAKNYLIPIDDAGTFCFAFA--PTSSSLSIIGNVQQQGTRITYDLANNLIGLSANKC 502
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 60/162 (37%), Positives = 93/162 (57%), Gaps = 12/162 (7%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y + + +G P + + ++DTGSD+ W QCKPC C+ Q DP + S S TF K C+++S
Sbjct: 4 YTMEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSDPIYDPSASSTFAKTSCSTSS 63
Query: 192 CRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLL 248
C+ L P C+S K C + QY D S + G +A + +T++ +S G +P F
Sbjct: 64 CQSL----PASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLR--SSGGSSKAFPNFQF 117
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
GC +SG GA+GI+GL + +S+ T+ ++ FSYCL
Sbjct: 118 GCGRLNSGSFGGAAGIVGLGQGKISLSTQLGSAINNKFSYCL 159
>gi|125575538|gb|EAZ16822.1| hypothetical protein OsJ_32294 [Oryza sativa Japonica Group]
Length = 392
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/167 (35%), Positives = 81/167 (48%), Gaps = 15/167 (8%)
Query: 124 INDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
I+ T A Y IG P Q S ++D ++ WTQCK C CF+Q P F + S T+
Sbjct: 43 IHWTQAMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYR 102
Query: 184 KIPCNSTSCRILRESFPFG--NCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
PC + C ES P NC+ C + G +GG TD + A ++ F
Sbjct: 103 AEPCGTPLC----ESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAVGTAKASLAF 157
Query: 242 TRYPFLLGCINNSSGDKSGA-SGIMGLDRSPVSIITRTNTSYFSYCL 287
GC+ S D G SGI+GL R+P S++T+T + FSYCL
Sbjct: 158 -------GCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCL 197
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 68/210 (32%), Positives = 96/210 (45%), Gaps = 11/210 (5%)
Query: 86 EEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYV 145
E+L + RL S R R + + + DT EY + +AIG P Q V
Sbjct: 378 REVLHRMAARLLFSASGRAAS------ARVDPGPYANGVPDT---EYLVHLAIGTPPQPV 428
Query: 146 SLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILR-ESFPFGNC 204
L+LDTGSD+ WTQC+PC CF + S S TF +PC+S C L S N
Sbjct: 429 QLILDTGSDLVWTQCRPCPVCFSRALGPLDPSNSSTFDVLPCSSPVCDNLTWSSCGKHNW 488
Query: 205 NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC-INNSSGDKSGASG 263
++ C + YADGS + G + T A+ G T GC + N+ S +G
Sbjct: 489 GNQTCVYVYAYADGSITTGHLDAETFTFAAADGTGQATVPDLAFGCGLFNNGIFTSNETG 548
Query: 264 IMGLDRSPVSIITRTNTSYFSYCLPSPYGS 293
I G R +S+ ++ FS+C + GS
Sbjct: 549 IAGFGRGALSLPSQLKVDNFSHCFTAITGS 578
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/157 (36%), Positives = 89/157 (56%), Gaps = 9/157 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V IG P ++V +++DTGSDV W QC PC C+QQ DP F S S ++ + C +
Sbjct: 154 EYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYAPLTCETH 213
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C+ L S C + C + + Y DGS + G +AT+ IT+ +G + +GC
Sbjct: 214 QCKSLDVS----ECRNDSCLYEVSYGDGSYTVGDFATETITL-----DGSASLNNVAIGC 264
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+++ G GA+G++GL +S ++ N S FSYCL
Sbjct: 265 GHDNEGLFVGAAGLLGLGGGSLSFPSQINASSFSYCL 301
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/119 (36%), Positives = 62/119 (52%), Gaps = 4/119 (3%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T ++D N+LR +F + + G+ L DTCYDLS+ +V VP ++ HF G L
Sbjct: 376 VTRLQSDVYNSLRDSFVRGTQHLPSTSGVA-LFDTCYDLSSRSSVEVPTVSFHFPDGKYL 434
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L + L+ V S C FA P +GNVQQ+G V YD++ +GF P C
Sbjct: 435 ALPAKNYLIPVDSAGTFCFAFA--PTTSALSIIGNVQQQGTRVSYDLSNSLVGFSPNGC 491
>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
Length = 447
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/139 (35%), Positives = 71/139 (51%), Gaps = 10/139 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ +V +G P L++DTGSD+ W QC PC C+ QR F +S T+ ++PC+S
Sbjct: 85 EYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSP 144
Query: 191 SCRILRESFP---FGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
CR LR FP G C + + Y DGS S G ATD++ T
Sbjct: 145 QCRALR--FPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFANDTYVNNVT----- 197
Query: 248 LGCINNSSGDKSGASGIMG 266
LGC ++ G A+G++G
Sbjct: 198 LGCGRDNEGLFDSAAGLLG 216
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/99 (36%), Positives = 46/99 (46%), Gaps = 9/99 (9%)
Query: 324 GLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELD-------VRGTLVVASVSQVCLGF 376
G + D CYDL P I +HF GG D+ L V G A+ + CLGF
Sbjct: 350 GEHSVFDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGF 409
Query: 377 ATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
D +GNVQQ+G V +DV R+GF P C+
Sbjct: 410 EAA--DDGLSVIGNVQQQGFRVVFDVEKERIGFAPKGCT 446
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/161 (33%), Positives = 79/161 (49%), Gaps = 4/161 (2%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + +AIG P L DTGSD+TWTQCKPC CF Q P + + S +F +PC S
Sbjct: 94 EYLMELAIGTPPVPFVALADTGSDLTWTQCKPCKLCFPQDTPIYDTAASASFSPVPCASA 153
Query: 191 SCR-ILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNG---YFTRYPF 246
+C I R S + C + Y DG+ S G T+ +T ++ +
Sbjct: 154 TCLPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPGVSVGGV 213
Query: 247 LLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
GC ++ G ++G +GL R +S++ + FSYCL
Sbjct: 214 AFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCL 254
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 38/132 (28%), Positives = 55/132 (41%), Gaps = 9/132 (6%)
Query: 292 GSTGYITFGKTDTVNTLRSAF-----HKRMKKYKKAKGLEDLLDTCYDLSAYETVV--VP 344
GS G I T + SAF H + L C+ +A E + +P
Sbjct: 327 GSGGMIVDSGTIFTVLVESAFRVVVNHVAGVLNQPVVNASSLDSPCFPATAGEQQLPDMP 386
Query: 345 KIAIHFLGGVDLELDVRGTLVV-ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVA 403
+ +HF GG D+ L + S CL A P SI LGN QQ+ ++ +D+
Sbjct: 387 DMLLHFAGGADMRLHRDNYMSFNQESSSFCLNIAGAPSAYGSI-LGNFQQQNIQMLFDIT 445
Query: 404 GRRLGFGPGNCS 415
+L F P +CS
Sbjct: 446 VGQLSFVPTDCS 457
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 64/206 (31%), Positives = 98/206 (47%), Gaps = 16/206 (7%)
Query: 88 ILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSL 147
++R++ H+ RRL + L E P + Y + ++IG P +
Sbjct: 33 LIRKNSSHAHVLPLRRLME-----LSAMEKTLTPQSPIYAYLGHYLMELSIGTPPFKIYG 87
Query: 148 LLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCN-S 206
+ DTGSD+TWT C PC +C++QR+P F KS T+ I C+S C L G C+
Sbjct: 88 IADTGSDLTWTSCVPCNNCYKQRNPMFDPQKSTTYRNISCDSKLCHKLDT----GVCSPQ 143
Query: 207 KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC-INNSSGDKSGASGIM 265
K C + YA + + G A + IT+ + + GC NN+ G GI+
Sbjct: 144 KRCNYTYAYASAAITRGVLAQETITLSSTKGKSVPLK-GIVFGCGHNNTGGFNDHEMGII 202
Query: 266 GLDRSPVSIITRTNTSY----FSYCL 287
GL PVS+I++ +S+ FS CL
Sbjct: 203 GLGGGPVSLISQMGSSFGGKRFSQCL 228
>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
Length = 287
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/111 (44%), Positives = 67/111 (60%), Gaps = 3/111 (2%)
Query: 306 NTLRSAFHKRM--KKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGT 363
++LRSAF M + YK+A L LLDTCYDL+ V +P +++ F GGV L++D G
Sbjct: 178 SSLRSAFAASMAARGYKRAPALS-LLDTCYDLTGASEVAIPTVSLLFQGGVSLDVDASGI 236
Query: 364 LVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ ASVSQ CLGFA + +GN Q + V YD+A + +GF PG C
Sbjct: 237 IYAASVSQACLGFAGNEAADDVAIVGNTQLKTFGVVYDIASKVVGFCPGAC 287
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 51/101 (50%), Gaps = 8/101 (7%)
Query: 203 NCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGAS 262
C+ C + +QY DGS + GF+A D +T+ ++ F GC + G A+
Sbjct: 15 GCSGGHCLYGVQYGDGSYTIGFFAMDTLTLSSHDA-----IKGFRFGCGERNEGLFGEAA 69
Query: 263 GIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFG 300
G++GL R S+ +T Y F++C P+ TGY+ FG
Sbjct: 70 GLLGLGRGKTSLPVQTYDKYGGVFAHCFPARSSGTGYLEFG 110
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/169 (34%), Positives = 78/169 (46%), Gaps = 6/169 (3%)
Query: 125 NDTVADEYYIVVAIGEPK-QYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
N V EY I ++IG P+ Q V L LDTGSDV WTQC+PC CF Q P F + S T
Sbjct: 85 NTDVNSEYLIHLSIGAPRSQPVVLTLDTGSDVVWTQCEPCAECFTQPLPRFDTAASNTVR 144
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
+ C+ C E C C + Y DGS S G + D T + G T
Sbjct: 145 SVACSDPLCNAHSEH----GCFLHGCTYVSGYGDGSLSFGHFLRDSFTFDDGKGGGKVTV 200
Query: 244 YPFLLGC-INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPY 291
GC + N+ +GI G R P+S+ ++ FSYC + +
Sbjct: 201 PDIGFGCGMYNAGRFLQTETGIAGFGRGPLSLPSQLKVRQFSYCFTTRF 249
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 48/108 (44%), Gaps = 5/108 (4%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
L+SAF + ED D C+ +T +PK+ H L G D +L +
Sbjct: 336 LKSAFIAQAALPVNKTADED--DICFSWDGKKTAAMPKLVFH-LEGADWDLPRENYVTED 392
Query: 368 SVS-QVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
S QVC+ +T ++ +GN QQ+ + YD+A +L P C
Sbjct: 393 RESGQVCVAVSTSGQMDRTL-IGNFQQQNTHIVYDLAAGKLLLVPAQC 439
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 91/324 (28%), Positives = 151/324 (46%), Gaps = 36/324 (11%)
Query: 3 ILSKAFLLFICLLCSSNNGAYADDNDLSHSHIVSVSSLLPPNVCNRTRTALP-----QGP 57
++S L F+C + +S + D + ++ V + L + +A QG
Sbjct: 6 MVSALALFFVCFVSTSVGEIF--DELSAGQQVLDVEAALKLRISRSKVSAQEWSETVQGE 63
Query: 58 DKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSR----------RLRKP 107
+K S+ + + + S L+E L++D R+ N+R KP
Sbjct: 64 EKNSIVLQVVHRDSLSSSSNTSLVKEILQERLKRDAARVDSINARVQLAAMGVSKAEMKP 123
Query: 108 F--PEFLKRTEAFTFPANINDTVAD---EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKP 162
R +A F ++I +A EY+ + +G P +Y ++LDTGSD+ W QC P
Sbjct: 124 LNGSSIDARFDAKDFSSSIISGLAQGSGEYFTRLGVGTPPRYTYMVLDTGSDIMWIQCLP 183
Query: 163 CIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSG 222
C C+ Q DP F + S T+ K+PC + C+ L S G N + C + + Y DGS +
Sbjct: 184 CAKCYGQTDPLFNPAASSTYRKVPCATPLCKKLDIS---GCRNKRYCEYQVSYGDGSFTV 240
Query: 223 GFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY 282
G ++T+ +T + G R LGC +++ G GA+G++GL R +S ++T +
Sbjct: 241 GDFSTETLTFR-----GQVIRR-VALGCGHDNEGLFIGAAGLLGLGRGSLSFPSQTGAQF 294
Query: 283 ---FSYCLP--SPYGSTGYITFGK 301
FSYCL S G+ + FGK
Sbjct: 295 SKRFSYCLVDRSASGTASSLIFGK 318
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 43/110 (39%), Positives = 57/110 (51%), Gaps = 4/110 (3%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+T+R AF K A G L DTCYDLS +TV VP + HF GG + L L+
Sbjct: 389 STMRDAFRVGTGNLKSAGGFS-LFDTCYDLSGLKTVKVPTLVFHFQGGAHISLPATNYLI 447
Query: 366 -VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V S + C FA + I GN+QQ+G+ V +D R+GF G+C
Sbjct: 448 PVDSSATFCFAFAGNTGGLSII--GNIQQQGYRVVFDSLANRVGFKAGSC 495
>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
Length = 373
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/168 (35%), Positives = 86/168 (51%), Gaps = 19/168 (11%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y + IG P Q S ++ + WTQC PC CF+Q P F S S T+ PC +
Sbjct: 28 YMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLFNRSASSTYRPEPCGTAL 87
Query: 192 CRILRESFPFGNCNSKE-CPFNIQ--YADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
C ES P C+ C + ++ + D SG GG TD I A ++ F
Sbjct: 88 C----ESVPASTCSGDGVCSYEVETMFGDTSGIGG---TDTFAIGTATASLAF------- 133
Query: 249 GCINNSSGDK-SGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTG 295
GC +S+ + GASG++GL R+P S++ + N + FSYCL +P+G+ G
Sbjct: 134 GCAMDSNIKQLLGASGVVGLGRTPWSLVGQMNATAFSYCL-APHGAAG 180
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 102/421 (24%), Positives = 164/421 (38%), Gaps = 107/421 (25%)
Query: 75 NQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVA---DE 131
NQ S +P + I +RL E+LK A+++ V
Sbjct: 38 NQIYSLQSPQVSHIKEASVERL-------------EYLKAKATGDIIAHLSPNVPIIPQA 84
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
+ + ++IG P L +DT SD+ W QC+PCI+C+ Q P F S+S T + S
Sbjct: 85 FLVNISIGSPPVTQLLHMDTASDLLWLQCRPCINCYAQSLPIFDPSRSYTH-----RNES 139
Query: 192 CRILRESFPF--GNCNSKECPFNIQYADGSGSGGFWATDRI---TIQEANSNGYFTRYPF 246
CR + S P N ++ C ++++Y DG+GS G A + + TI + +S+ +
Sbjct: 140 CRTSQYSMPSLRFNAKTRSCEYSMRYMDGTGSKGILAKEMLMFNTIYDESSSA--ALHDV 197
Query: 247 LLGCINNSSGDKSGASGIMGLDRSPVSIITR--TNTSYFSYCLPSP--------YGSTGY 296
+ GC +++ G+ +GI+GL S++ R T SY L P G G
Sbjct: 198 VFGCGHDNYGEPLVGTGILGLGYGEFSLVHRFGTKFSYCFGSLDDPSYPHNVLVLGDDGA 257
Query: 297 ITFGKT--------------------------------------------DTVNTLRSAF 312
G T DT N+L S
Sbjct: 258 NILGDTTPLEIYNGFYYVTIEAISVDGIILPIDPWVFNRNHQTGLGGTIIDTGNSLTSLV 317
Query: 313 HKRMKKYKKAKGLEDLLD----------------TCYDLSAYETVV---VPKIAIHFLGG 353
+ K K +ED + CY+ + +V P + HF G
Sbjct: 318 EEAYKPLKNK--IEDYFEGRFTAADVNQDDMFKVECYNGNLERDLVESGFPIVTFHFSDG 375
Query: 354 VDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
+L LDV+ + S + CL A P + NSI G Q+ + + YD+ +++ F +
Sbjct: 376 AELSLDVKSVFMKLSPNVFCL--AVTPGNMNSI--GATAQQSYNIGYDLEAKKISFERID 431
Query: 414 C 414
C
Sbjct: 432 C 432
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 62/169 (36%), Positives = 82/169 (48%), Gaps = 13/169 (7%)
Query: 125 NDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFK 184
N EY + +AIG P Q V L LDTGSD+ WTQC+PC CF Q P+F S S T
Sbjct: 28 NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSL 87
Query: 185 IPCNSTSCRILRESFPFGNCNS------KECPFNIQYADGSGSGGFWATDRITIQEANSN 238
C+ST C+ L P +C S + C + Y D S + GF D+ T A ++
Sbjct: 88 TSCDSTLCQGL----PVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGAS 143
Query: 239 GYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
F G NN KS +GI G R P+S+ ++ FS+C
Sbjct: 144 --VPGVAFGCGLFNNGV-FKSNETGIAGFGRGPLSLPSQLKVGNFSHCF 189
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 59/158 (37%), Positives = 83/158 (52%), Gaps = 8/158 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + +AIG P +LDTGSD+ WTQCKPC C++Q P F KS +F K+ C S+
Sbjct: 107 EYLMELAIGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVSCGSS 166
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C + P C S C + Y D S + G AT+ T S + + GC
Sbjct: 167 LC----SAVPSSTC-SDGCEYVYSYGDYSMTQGVLATETFTF--GKSKNKVSVHNIGFGC 219
Query: 251 INNSSGDK-SGASGIMGLDRSPVSIITRTNTSYFSYCL 287
++ GD ASG++GL R P+S++++ FSYCL
Sbjct: 220 GEDNEGDGFEQASGLVGLGRGPLSLVSQLKEPRFSYCL 257
Score = 42.4 bits (98), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 30/88 (34%), Positives = 42/88 (47%), Gaps = 6/88 (6%)
Query: 329 LDTCYDLSAYETVV-VPKIAIHFLGGVDLELDVRGTLVVAS-VSQVCLGFATYPPDPNSI 386
LD C+ L + T V +PKI HF GG DLEL ++ S + CL
Sbjct: 367 LDLCFSLPSGSTQVEIPKIVFHFKGG-DLELPAENYMIGDSNLGVACLAMGA---SSGMS 422
Query: 387 TLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
GNVQQ+ V++D+ + F P +C
Sbjct: 423 IFGNVQQQNILVNHDLEKETISFVPTSC 450
>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
Length = 394
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 59/167 (35%), Positives = 81/167 (48%), Gaps = 15/167 (8%)
Query: 124 INDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
I+ T A Y IG P Q S ++D ++ WTQCK C CF+Q P F + S T+
Sbjct: 43 IHWTQAMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYR 102
Query: 184 KIPCNSTSCRILRESFPFG--NCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
PC + C ES P NC+ C + G +GG TD + A ++ F
Sbjct: 103 AEPCGTPLC----ESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAVGTAKASLAF 157
Query: 242 TRYPFLLGCINNSSGDKSGA-SGIMGLDRSPVSIITRTNTSYFSYCL 287
GC+ S D G SGI+GL R+P S++T+T + FSYCL
Sbjct: 158 -------GCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCL 197
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 95/393 (24%), Positives = 156/393 (39%), Gaps = 86/393 (21%)
Query: 98 LKNSRRLRKPFPEFLKRTEAFTFPANINDTV---------ADEYYIVVAIGEPKQYVSLL 148
L + RL F L R+ A N + + EY + V+IG P +
Sbjct: 49 LSHYDRLTNAFRRSLSRSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGM 108
Query: 149 LDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKE 208
DTGSD+ W QC PC+ C++Q P F KS +F +PCNS +C+ + +S +C ++
Sbjct: 109 ADTGSDLMWAQCLPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDS----HCGAQG 164
Query: 209 -CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGL 267
C ++ Y D + + G ++ITI ++ ++GC + S G ASG++GL
Sbjct: 165 VCDYSYTYGDQTYTKGDLGFEKITIGSSSVKS-------VIGCGHESGGGFGFASGVIGL 217
Query: 268 DRSPVSIITRTNTS-----YFSYCLPSPYG-STGYITFGKTDTVN--------------- 306
+S++++ + + FSYCLP+ + G I FG+ V+
Sbjct: 218 GGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPV 277
Query: 307 -----TLRSAFHKRMKKYKKAKGLEDLLDTCYDLS-----AYETVV-------------- 342
TL + + AK ++D+ LS Y+ VV
Sbjct: 278 TYYYVTLEAISIGNERHMASAKQGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKD 337
Query: 343 --------------------VPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPD 382
+P I F GG ++ L T + + CL P
Sbjct: 338 PGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPT 397
Query: 383 PNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+GN+ + YD+ +RL F P C+
Sbjct: 398 DEFGIIGNLALANFLIGYDLEAKRLSFKPTVCT 430
>gi|297737850|emb|CBI27051.3| unnamed protein product [Vitis vinifera]
Length = 256
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 58/157 (36%), Positives = 89/157 (56%), Gaps = 9/157 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V IG P ++V +++DTGSDV W QC PC C+QQ DP F S S ++ + C +
Sbjct: 52 EYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYAPLTCETH 111
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C+ L S C + C + + Y DGS + G +AT+ IT+ +G + +GC
Sbjct: 112 QCKSLDVS----ECRNDSCLYEVSYGDGSYTVGDFATETITL-----DGSASLNNVAIGC 162
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+++ G GA+G++GL +S ++ N S FSYCL
Sbjct: 163 GHDNEGLFVGAAGLLGLGGGSLSFPSQINASSFSYCL 199
>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 59/167 (35%), Positives = 81/167 (48%), Gaps = 15/167 (8%)
Query: 124 INDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
I+ T A Y IG P Q S ++D ++ WTQCK C CF+Q P F + S T+
Sbjct: 43 IHWTQAMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYR 102
Query: 184 KIPCNSTSCRILRESFPFG--NCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
PC + C ES P NC+ C + G +GG TD + A ++ F
Sbjct: 103 AEPCGTPLC----ESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAVGTAKASLAF 157
Query: 242 TRYPFLLGCINNSSGDKSGA-SGIMGLDRSPVSIITRTNTSYFSYCL 287
GC+ S D G SGI+GL R+P S++T+T + FSYCL
Sbjct: 158 -------GCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCL 197
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 62/185 (33%), Positives = 96/185 (51%), Gaps = 11/185 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V IG P V ++LDTGSDV+W QC PC C++Q DP F + S +F + C +
Sbjct: 150 EYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPIFEPTSSASFTSLSCETE 209
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C+ L S C + C + + Y DGS + G + T+ +T+ G + +GC
Sbjct: 210 QCKSLDVS----ECRNGTCLYEVSYGDGSYTVGDFVTETVTL------GSTSLGNIAIGC 259
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL-PSPYGSTGYITFGKTDTVNTLR 309
+N+ G GA+G++GL +S ++ N S FSYCL ST + F T + +
Sbjct: 260 GHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRDSDSTSTLDFNSPITPDAVT 319
Query: 310 SAFHK 314
+ H+
Sbjct: 320 APLHR 324
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 47/119 (39%), Positives = 62/119 (52%), Gaps = 4/119 (3%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T +T N LR AF K + A+G+ L DTCYDLS+ V VP ++ HF G +L
Sbjct: 371 VTRLQTTVYNVLRDAFVKSTHDLQTARGVA-LFDTCYDLSSKSRVEVPTVSFHFANGNEL 429
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L + L+ V S C FA P D LGN QQ+G V +D+A +GF P C
Sbjct: 430 PLPAKNYLIPVDSEGTFCFAFA--PTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 59/167 (35%), Positives = 81/167 (48%), Gaps = 15/167 (8%)
Query: 124 INDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
I+ T A Y IG P Q S ++D ++ WTQCK C CF+Q P F + S T+
Sbjct: 43 IHWTQAMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCGRCFEQGTPLFDPTASNTYR 102
Query: 184 KIPCNSTSCRILRESFP--FGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
PC + C ES P NC+ C + G +GG TD + A ++ F
Sbjct: 103 AEPCGTPLC----ESIPSDVRNCSGNVCAYEASTNAGD-TGGKVGTDTFAVGTAKASLAF 157
Query: 242 TRYPFLLGCINNSSGDKSGA-SGIMGLDRSPVSIITRTNTSYFSYCL 287
GC+ S D G SGI+GL R+P S++T+T + FSYCL
Sbjct: 158 -------GCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCL 197
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 67/187 (35%), Positives = 106/187 (56%), Gaps = 19/187 (10%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y + V +G + +SL++DTGSD+TW QC+PC C+ Q+ P + S S ++ + CNS++
Sbjct: 135 YIVTVELG--GKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSST 192
Query: 192 CRIL----RESFPFGNCN---SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRY 244
C+ L S P G N C + + Y DGS + G A++ I + +
Sbjct: 193 CQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLEN----- 247
Query: 245 PFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS-PYGSTGYITFG 300
F+ GC N+ G G+SG+MGL RS VS++++T ++ FSYCLPS G++G ++FG
Sbjct: 248 -FVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFG 306
Query: 301 KTDTVNT 307
+V T
Sbjct: 307 NDSSVYT 313
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 56/109 (51%), Gaps = 3/109 (2%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGT--LV 365
++ F K+ + A G +LDTC++L++YE + +P I + F G +LE+DV G V
Sbjct: 375 VKIEFLKQFSGFPTAPGY-SILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFV 433
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
S VCL A+ + +GN QQ+ V YD RLG NC
Sbjct: 434 KPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 482
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 86/298 (28%), Positives = 131/298 (43%), Gaps = 43/298 (14%)
Query: 57 PDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLH---LKNSRRLRKPFPEFLK 113
PD SLE+V +Y S G T + ++ + R H + S
Sbjct: 25 PDGFSLEIVHRYSRESPFYPGNITDYERITRLVELSKIRAHNLAITTSSGFSP------- 77
Query: 114 RTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPF 173
EAF + +DT Y + V IG P + L+ DTGS + WTQC+PC F+Q P
Sbjct: 78 --EAFRLRISQDDTC---YLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTRRFRQLPPI 132
Query: 174 FYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQ 233
F ++ S+T+ +PC C + F C +C + I YA GS + G A D +Q
Sbjct: 133 FNSTASRTYRDLPCQHQFCTNNQNVF---QCRDDKCVYRIAYAGGSATAGVAAQD--ILQ 187
Query: 234 EANSNGYFTRYPFLLGCINNSSG-----DKSGASGIMGLDRSPVSIITRTN---TSYFSY 285
A ++ R PF GC ++ GI+GL+ SPVS++ + N + FSY
Sbjct: 188 SAEND----RIPFYFGCSRDNQNFSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSY 243
Query: 286 C-----LPSPYGSTGYITFGKTDTVNTLRSAFHKRMKK-YKKAKGLEDLLDTCYDLSA 337
C L SP +T + FG N +R + K + + +G+ + D+S
Sbjct: 244 CLNLFDLSSPSHATSLLRFG-----NDIRKSRRKYLSTPFVSPRGMPNYFLNLIDVSV 296
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 67/187 (35%), Positives = 106/187 (56%), Gaps = 19/187 (10%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y + V +G + +SL++DTGSD+TW QC+PC C+ Q+ P + S S ++ + CNS++
Sbjct: 135 YIVTVELG--GKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSST 192
Query: 192 CRIL----RESFPFGNCN---SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRY 244
C+ L S P G N C + + Y DGS + G A++ I + +
Sbjct: 193 CQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLEN----- 247
Query: 245 PFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS-PYGSTGYITFG 300
F+ GC N+ G G+SG+MGL RS VS++++T ++ FSYCLPS G++G ++FG
Sbjct: 248 -FVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFG 306
Query: 301 KTDTVNT 307
+V T
Sbjct: 307 NDSSVYT 313
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 56/109 (51%), Gaps = 3/109 (2%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGT--LV 365
++ F K+ + A G +LDTC++L++YE + +P I + F G +LE+DV G V
Sbjct: 375 VKIEFLKQFSGFPTAPGY-SILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFV 433
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
S VCL A+ + +GN QQ+ V YD RLG NC
Sbjct: 434 KPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDSTQERLGIVGENC 482
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 41/126 (32%), Positives = 70/126 (55%), Gaps = 4/126 (3%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+I +++G P + + L++DTGSD+ W QC PC++C+ Q D F KS T+ + C++
Sbjct: 57 EYFIRISVGTPPRRMYLVMDTGSDILWLQCAPCVNCYHQSDAIFDPYKSSTYSTLGCSTR 116
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C L G C + +C + + Y DGS + G + TD +++ + G LGC
Sbjct: 117 QCLNLD----IGTCQANKCLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLGC 172
Query: 251 INNSSG 256
+++ G
Sbjct: 173 GHDNEG 178
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 43/111 (38%), Positives = 57/111 (51%), Gaps = 7/111 (6%)
Query: 307 TLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV- 365
+LR AF G L DTCYDLS +V VP + +HF GG DL+L L+
Sbjct: 301 SLRDAFRAGTSDLAPTAGFS-LFDTCYDLSGLASVDVPTVTLHFQGGTDLKLPASNYLIP 359
Query: 366 VASVSQVCLGFA-TYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
V + + CL FA T P +GN+QQ+G V YD ++GF P C+
Sbjct: 360 VDNSNTFCLAFAGTTGPS----IIGNIQQQGFRVIYDNLHNQVGFVPSQCN 406
>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 506
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 89/319 (27%), Positives = 145/319 (45%), Gaps = 49/319 (15%)
Query: 23 YADDNDLSHSHIV-SVSSLLPPN---VCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGI 78
+A + +LS+ H+V + SSL N VC R + P + + + PCS G
Sbjct: 28 HAAEAELSNHHVVVAASSLELANASPVCQGHRVS-PSSSGGSWAPLSHLHSPCSPAAGGR 86
Query: 79 STHAP--SLEEILRQDQQRL-HLK-----NSRRLRKPFPEFLKRTEAFTFPA-NIN---- 125
+ P +L L+ D+ R H++ N+ + E + T+ + PA N+N
Sbjct: 87 DSAPPPKTLSATLQWDEHRAGHIQRKLSGNAAPMDDAGEETPQSTQVTSSPAANVNVGKS 146
Query: 126 --DTVADEYYIVVAIGE------PKQYVSLLLDTGSDVTWTQCKPC--IHCFQQRDPFFY 175
D+ ++ + A G P S+++DT SDV W QC PC C+ Q D +
Sbjct: 147 STDSAFEQGIVPAATGPGGQKKLPGVAQSMVVDTASDVPWVQCAPCPQPQCYAQSDVLYD 206
Query: 176 ASKSKTFFKIPCNSTSCRILRE--SFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQ 233
+KS PC+S CR L + G N+ C + + Y DGSG+ G + +D +T+
Sbjct: 207 PTKSILSAPFPCSSPQCRSLGRYANGCTGAGNTGTCQYRVLYPDGSGTSGTYVSDLLTL- 265
Query: 234 EANSNGYFTRYPF-----LL--GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---- 282
A+ G +++ F LL G NN + +G M L R S+ ++T ++
Sbjct: 266 NADPKGAVSKFQFGCSHALLRPGSFNNKT------AGFMALGRGAQSLSSQTKGTFSKGN 319
Query: 283 -FSYCLPSPYGSTGYITFG 300
FSYCLP G+++ G
Sbjct: 320 VFSYCLPPTGSHKGFLSLG 338
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 55/108 (50%), Gaps = 6/108 (5%)
Query: 307 TLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVV 366
LR+AF +M+ Y+ A + LDTCYD + V +PK+ + F +ELD G ++
Sbjct: 405 ALRAAFRAQMRAYR-AVAPKGQLDTCYDFTGVPMVRLPKVTLVFDRNAAVELDPSGVML- 462
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
CL FA D +GNVQQ+ EV Y+V G +GF C
Sbjct: 463 ----DSCLAFAPNANDFMPGIIGNVQQQTLEVLYNVDGASVGFRRAAC 506
>gi|413936471|gb|AFW71022.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
Length = 315
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 69/242 (28%), Positives = 110/242 (45%), Gaps = 26/242 (10%)
Query: 10 LFICLLCSSNNGAYADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYG 69
+F+C S+ +GA +D+ ++ V SS P +VC+ Q + +V ++G
Sbjct: 9 IFLCFYLSTVHGA-GEDSFVT----VPSSSFEPESVCSGEFVKPEQNGSTVYVPLVHRHG 63
Query: 70 PCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVA 129
PC+ +ST S +I R+ + R P ++ R + + PA++ +V
Sbjct: 64 PCA-PAPSLSTDTRSFADIFRRSRAR-------------PSYIVRGKKVSVPAHLGTSVM 109
Query: 130 D-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCI--HCFQQRDPFFYASKSKTFFKIP 186
EY + V+ G P +++DTGSDV+W QCKPC CF Q+DP + S S T+ +P
Sbjct: 110 SLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVP 169
Query: 187 CNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPF 246
C S C+ L CN E G G + +AN+ + R F
Sbjct: 170 CASDVCKKLAADAYGSGCNEWETVRVCHLVCGRNKHG----GCVQPGQANAGAWGHRTKF 225
Query: 247 LL 248
LL
Sbjct: 226 LL 227
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 62/185 (33%), Positives = 96/185 (51%), Gaps = 11/185 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V IG P V ++LDTGSDV+W QC PC C++Q DP F + S +F + C +
Sbjct: 150 EYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPXFEPTSSASFTSLSCETE 209
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C+ L S C + C + + Y DGS + G + T+ +T+ G + +GC
Sbjct: 210 QCKSLDVS----ECRNGTCLYEVSYGDGSYTVGDFVTETVTL------GSTSLGNIAIGC 259
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL-PSPYGSTGYITFGKTDTVNTLR 309
+N+ G GA+G++GL +S ++ N S FSYCL ST + F T + +
Sbjct: 260 GHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRDSDSTSTLDFNSPITPDAVT 319
Query: 310 SAFHK 314
+ H+
Sbjct: 320 APLHR 324
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 47/119 (39%), Positives = 62/119 (52%), Gaps = 4/119 (3%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T +T N LR AF K + A+G+ L DTCYDLS+ V VP ++ HF G +L
Sbjct: 371 VTRLQTTVYNVLRDAFVKSTHDLQTARGVA-LFDTCYDLSSKSRVEVPTVSFHFANGNEL 429
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L + L+ V S C FA P D LGN QQ+G V +D+A +GF P C
Sbjct: 430 PLPAKNYLIPVDSEGTFCFAFA--PTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 56/180 (31%), Positives = 90/180 (50%), Gaps = 14/180 (7%)
Query: 126 DTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKI 185
D + EY++ + +G P + +++D+GSD+ W QCKPC C+ Q DP F + S +F +
Sbjct: 37 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGV 96
Query: 186 PCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C+S C + + CNS C + + Y DGS + G A + +T+ G
Sbjct: 97 SCSSAVCDQVDNA----GCNSGRCRYEVSYGDGSSTKGTLALETLTL------GRTVVQN 146
Query: 246 FLLGCINNSSG---DKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPY-GSTGYITFGK 301
+GC + + G +G G+ G S V ++R + FSYCL S S G++ FG
Sbjct: 147 VAIGCGHMNQGMFVGAAGLLGLGGGSMSFVGQLSRERGNAFSYCLVSRVTNSNGFLEFGS 206
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 56/110 (50%), Gaps = 6/110 (5%)
Query: 307 TLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV- 365
R AF + +A G+ + DTCY+L + +V VP ++ +F GG L L L+
Sbjct: 277 AFRDAFIDQTGNLPRASGVS-IFDTCYNLFGFLSVRVPTVSFYFSGGPILTLPANNFLIP 335
Query: 366 VASVSQVCLGFATYPPDPNSIT-LGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V C FA P P+ ++ LGN+QQ G ++ D A +GFGP C
Sbjct: 336 VDDAGTFCFAFA---PSPSGLSILGNIQQEGIQISVDGANEFVGFGPNVC 382
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 74/231 (32%), Positives = 109/231 (47%), Gaps = 26/231 (11%)
Query: 85 LEEILRQDQQR---LHLKNSRRLR---KPFPEFLKRTE-AFTFPANINDTVAD---EYYI 134
LEE LR+D +R L + +RLR P E A F + +A EY+
Sbjct: 140 LEETLRRDARRVRGLEQRIEKRLRLNKDPAGSHENVAEVAAEFGGEVVSGMAQGSGEYFT 199
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
+ +G P + ++LDTGSDV W QC+PC C+ Q DP F S S +F + CNS C
Sbjct: 200 RIGVGTPMREQYMVLDTGSDVVWIQCEPCSKCYSQVDPIFNPSLSASFSTLGCNSAVCSY 259
Query: 195 LRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNS 254
L NC+ C + + Y DGS + G +AT+ +T G + +GC +++
Sbjct: 260 LDAY----NCHGGGCLYKVSYGDGSYTIGSFATEMLTF------GTTSVRNVAIGCGHDN 309
Query: 255 SG----DKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYG-STGYITFG 300
+G GL P + T+T + FSYCL + S+G + FG
Sbjct: 310 AGLFVGAAGLLGLGAGLLSFPSQLGTQTGRA-FSYCLVDRFSESSGTLEFG 359
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 59/119 (49%), Gaps = 4/119 (3%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T +T + +R AF ++ KA+G+ + DTCYDLS V VP + HF G L
Sbjct: 423 VTRLQTPVYDAVRDAFVAGTRQLPKAEGVS-IFDTCYDLSGLPLVNVPTVVFHFSNGASL 481
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L + ++ + + C FA P + +GN+QQ+G V +D A +GF C
Sbjct: 482 ILPAKNYMIPMDFMGTFCFAFA--PATSDLSIMGNIQQQGIRVSFDTANSLVGFALRQC 538
>gi|413936472|gb|AFW71023.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
Length = 289
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 76/264 (28%), Positives = 112/264 (42%), Gaps = 71/264 (26%)
Query: 205 NSKECPFNIQYADGSGSGGFWATDRITIQEAN--SNGYFTRYPFLLGCINNSSGDKSGAS 262
+ K+C F I YADG+ + G ++ D++T+ N YF GC + +
Sbjct: 33 SGKQCGFAISYADGTSTVGAYSQDKLTLAPGAIVQNFYF-------GCGHGKHAVRGLFD 85
Query: 263 GIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFG---------------------- 300
G++GL R S+ R FSYCLPS G++ G
Sbjct: 86 GVLGLGRLRESLGARYG-GVFSYCLPSVSSKPGFLALGAGKNPSGFVFTPMGTVPGQPTF 144
Query: 301 -------------KTD----------------TVNTLRSAFHKRMKK-YKKAKGLEDL-- 328
K D + L+S ++ ++ ++KA L
Sbjct: 145 STVTLAGINVGGKKLDLRPSAFSGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLP 204
Query: 329 ---LDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNS 385
LDTCY+L+ Y+ VVVPKIA+ F GG + LDV ++V CL FA PD ++
Sbjct: 205 NGDLDTCYNLTGYKNVVVPKIALTFTGGATINLDVPNGILVNG----CLAFAESGPDGSA 260
Query: 386 ITLGNVQQRGHEVHYDVAGRRLGF 409
LGNV QR EV +D + + GF
Sbjct: 261 GVLGNVNQRAFEVLFDTSTSKFGF 284
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 95/401 (23%), Positives = 148/401 (36%), Gaps = 72/401 (17%)
Query: 81 HAPSL--EEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAI 138
+ PSL E ++ R ++ RRLR + + I D EY + I
Sbjct: 44 YNPSLTPSERIKNTVLRSFARSKRRLR-----LSQNDDRSPGTITIPDEPITEYLMRFYI 98
Query: 139 GEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRES 198
G P + DTGSD+ W QC PC C Q P F KS TF +PC+S C +L S
Sbjct: 99 GTPPVERFAIADTGSDLIWVQCAPCEKCVPQNAPLFDPRKSSTFKTVPCDSQPCTLLPPS 158
Query: 199 FPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDK 258
S +C + Y D + G + I N+ F + F NN + D+
Sbjct: 159 QRACVGKSGQCYYQYIYGDHTLVSGILGFESINFGSKNNAIKFPKLTFGCTFSNNDTVDE 218
Query: 259 SGAS-GIMGLDRSPVSIITRTNTSY---FSYCLPS-PYGSTGYITFGKTDTVNTLRSAFH 313
S + G++GL P+S+I++ FSYC P ST + FG V ++
Sbjct: 219 SKRNMGLVGLGVGPLSLISQLGYQIGRKFSYCFPPLSSNSTSKMRFGNDAIVKQIKGVVS 278
Query: 314 KRM---------------------KKYKKAKGLED---LLDT------------------ 331
+ KK K ++ D L+D+
Sbjct: 279 TPLIIKSIGPSYYYLNLEGVSIGNKKVKTSESQTDGNILIDSGTSFTILKQSFYNKFVAL 338
Query: 332 CYDLSAYETVVVPKIAIHF-----------------LGGVDLELDVRGTLVVASVSQVCL 374
++ E V +P + +F G + +D + +C+
Sbjct: 339 VKEVYGVEAVKIPPLVYNFCFENKGKRKRFPDVVFLFTGAKVRVDASNLFEAEDNNLLCM 398
Query: 375 GFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
A D + GN Q G++V YD+ G + F P +C+
Sbjct: 399 -VALPTSDEDDSIFGNHAQIGYQVEYDLQGGMVSFAPADCA 438
>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
Length = 409
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 54/162 (33%), Positives = 85/162 (52%), Gaps = 13/162 (8%)
Query: 146 SLLLDTGSDVTWTQCKPC--IHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGN 203
++++D+GSDV W QC+PC + C QRDP F + S T+ +PC+S +C L + G
Sbjct: 82 TVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLGP-YRRGC 140
Query: 204 CNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD--KSGA 261
+ +C F I YA+G+ + G +++D +T+ Y FL GC + G
Sbjct: 141 LANSQCQFGITYANGATATGTYSSDDLTLGP-----YDVVRGFLFGCAHADQGSTFSYDV 195
Query: 262 SGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFG 300
+G + L S + +T + Y FSYC+P S G+I FG
Sbjct: 196 AGTLALGGGSQSFVQQTASQYSRVFSYCVPPSTSSFGFIMFG 237
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 44/109 (40%), Positives = 57/109 (52%), Gaps = 6/109 (5%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
LR+AF M Y+ A + +LDTCYD S ++ +P IA+ F GG + LD G L+
Sbjct: 307 QALRAAFRSAMTMYRPAPPVS-ILDTCYDFSGVRSITLPSIALVFDGGATVNLDAAGILL 365
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
Q CL FA D +GNVQQR EV YDV G+ + F C
Sbjct: 366 -----QGCLAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 409
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 55/157 (35%), Positives = 89/157 (56%), Gaps = 9/157 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V +G+P + ++LDTGSD+ W QC+PC C+QQ DP F S +F +PC S
Sbjct: 154 EYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQ 213
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C+ L S C + +C + + Y DGS + G + T+ +T ++G +GC
Sbjct: 214 QCQALETS----GCRASKCLYQVSYGDGSFTVGEFVTETLTF---GNSGMINDVA--VGC 264
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+++ G G++G++GL P+S+ ++ S FSYCL
Sbjct: 265 GHDNEGLFVGSAGLLGLGGGPLSLTSQMKASSFSYCL 301
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 52/120 (43%), Positives = 67/120 (55%), Gaps = 6/120 (5%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
IT +T NTLR AF R KK G L DTCYDLS+ V +P ++ F GG L
Sbjct: 376 ITRLQTQAYNTLRDAFVSRTPYLKKTNGFA-LFDTCYDLSSQSRVTIPTVSFEFAGGKSL 434
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSITL-GNVQQRGHEVHYDVAGRRLGFGPGNC 414
+L + L+ V SV C FA P +S+++ GNVQQ+G VHYD+A +GF P C
Sbjct: 435 QLPPKNYLIPVDSVGTFCFAFA---PTTSSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 79/299 (26%), Positives = 129/299 (43%), Gaps = 41/299 (13%)
Query: 34 IVSVSSLLPPNVCNRTRTA---LPQGPDKASLEVVSKYGPCS----RLNQGISTHAPSLE 86
+++ S++ P C+ + A +P P+ + YGPCS N + A S+
Sbjct: 35 VIATSTMKPKTFCSGHKVAPGDVPS-PNSTWAPLHHLYGPCSPAPSSANSTAADVAASMA 93
Query: 87 EILRQDQQRL-----HLKNSRRLRKPFPEFLKRTEAF-------------TFPANINDTV 128
+++ DQ+R L + ++P F RT + + P ++
Sbjct: 94 DMVDDDQRRADYIQKRLTGATDDKQPM-AFSSRTSQYEKNGQYATNGGLGSVP-HLKSLS 151
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC--IHCFQQRDPFFYASKSKTFFKIP 186
G ++++D+GSDV+W QCKPC C +QRDP F + S T+ +P
Sbjct: 152 TTATTNSAPDGTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVP 211
Query: 187 CNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPF 246
C S +C L + G + +C F I Y DGS + G ++ D +T+ Y F
Sbjct: 212 CTSAACAQLGP-YRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGP-----YDVIRGF 265
Query: 247 LLGCINNSSGDK--SGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFG 300
GC + G +G + L S++ +T T Y FSYCLP S G++ G
Sbjct: 266 RFGCAHADRGSAFDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLG 324
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 40/109 (36%), Positives = 56/109 (51%), Gaps = 6/109 (5%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
LR+AF M Y+ A + +LDTCYD + ++ +P IA+ F GG + LD G L+
Sbjct: 393 QALRAAFRSAMTMYRAAPPVS-ILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL 451
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ CL FA D +GNVQQ+ EV YDV + + F C
Sbjct: 452 GS-----CLAFAPTASDRMPGFIGNVQQKTLEVVYDVPAKAMRFRTAAC 495
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 56/161 (34%), Positives = 80/161 (49%), Gaps = 9/161 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + V IG P L+ DTGSDV W QC PC C+ Q DP F + S +F +PCNS
Sbjct: 122 EYLVRVGIGSPPLEQHLVADTGSDVIWVQCSPCSDCYAQGDPLFDPANSASFSPVPCNSG 181
Query: 191 SCR-ILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
CR R S EC + + Y D S + G A + +T+ +G +G
Sbjct: 182 VCRAAARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTL-----DGGTEVQGVAMG 236
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITR---TNTSYFSYCL 287
C + + G + A+G++GL P+S++ + FSYCL
Sbjct: 237 CGHENRGLFAEAAGLLGLGWGPMSLVGQLGGAAGGAFSYCL 277
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 59/116 (50%), Gaps = 11/116 (9%)
Query: 307 TLRSAFHKRMKK-YKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLG------GVDLELD 359
LR AF ++ +A G+ L DTCYDLS Y +V VP +A++F G L L
Sbjct: 367 ALRGAFAGAFEEGAPRAPGVS-LFDTCYDLSGYASVRVPTVALYFGGGGQGQEAASLTLP 425
Query: 360 VRGTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
R LV V CL FA P+ LGN+QQ+G E+ D A +GFGP C
Sbjct: 426 ARNLLVPVDDGGTYCLAFAAVASGPS--ILGNIQQQGIEITVDSASGYVGFGPATC 479
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 65/162 (40%), Positives = 92/162 (56%), Gaps = 16/162 (9%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + +G P +YV ++LDTGSDV W QC PC C+ Q DP F KS +F I C S
Sbjct: 146 EYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCRSP 205
Query: 191 SCRILRESFPFGNCNSKE-CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLL 248
C LR P CNS++ C + + Y DGS + G ++T+ +T + TR P L
Sbjct: 206 LC--LRLDSP--GCNSRQSCLYQVAYGDGSFTFGEFSTETLTFRG-------TRVPKVAL 254
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
GC +++ G GA+G++GL R +S T+T + FSYCL
Sbjct: 255 GCGHDNEGLFVGAAGLLGLGRGRLSFPTQTGLRFGRKFSYCL 296
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 54/110 (49%), Gaps = 5/110 (4%)
Query: 307 TLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVV 366
+LR AF K+A L DTC+DLS V VP + +HF G D+ L L+
Sbjct: 384 SLRDAFRAGAADLKRAPDYS-LFDTCFDLSGKTEVKVPTVVMHFRG-ADVSLPATNYLIP 441
Query: 367 ASVSQV-CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ V C FA + I GN+QQ+G V +DVA R+GF C+
Sbjct: 442 VDTNGVFCFAFAGTMSGLSII--GNIQQQGFRVVFDVAASRIGFAARGCA 489
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 90/353 (25%), Positives = 142/353 (40%), Gaps = 75/353 (21%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + IG P ++DTGS + W QC PC +CF Q P F KS T+ C+S
Sbjct: 88 EYLMRFYIGSPPVERLAMVDTGSSLIWLQCSPCHNCFPQETPLFEPLKSSTYKYATCDSQ 147
Query: 191 SCRILRESFPFGNCNS-KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
C +L+ S +C +C + I Y D S S G T+ ++ + + G
Sbjct: 148 PCTLLQPS--QRDCGKLGQCIYGIMYGDKSFSVGILGTETLSFGSTGGAQTVSFPNTIFG 205
Query: 250 C-INNSSG--DKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTG-------- 295
C ++N+ + GI GL P+S++++ FSYCL PY ST
Sbjct: 206 CGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQIGHKFSYCL-LPYDSTSTSKLKFGS 264
Query: 296 ------------------------------------YITFGKTD-------------TVN 306
++ G+TD N
Sbjct: 265 EAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVVSTGQTDGNIVIDSGTPLTYLEN 324
Query: 307 TLRSAFHKRMKKYKKAKGLEDL---LDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGT 363
T + F +++ K L+DL L TC+ A + +P IA F G + L +
Sbjct: 325 TFYNNFVASLQETLGVKLLQDLPSPLKTCFPNRA--NLAIPDIAFQFTGA-SVALRPKNV 381
Query: 364 LVVASVSQVCLGFATYPPDPNSITL-GNVQQRGHEVHYDVAGRRLGFGPGNCS 415
L+ + S + L A P I+L G++ Q +V YD+ G+++ F P +C+
Sbjct: 382 LIPLTDSNI-LCLAVVPSSGIGISLFGSIAQYDFQVEYDLEGKKVSFAPTDCA 433
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 91/346 (26%), Positives = 132/346 (38%), Gaps = 71/346 (20%)
Query: 126 DTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKI 185
D+ Y + +IG P Q +S L DTGSD+ W +C C C Q P +Y +KS +F K+
Sbjct: 76 DSGGGAYDMTFSIGTPPQELSALADTGSDLIWAKCGACTRCVPQGSPSYYPNKSSSFSKL 135
Query: 186 PCNSTSCRILRESFPFGNCNS--KECPFNIQYADGSG----SGGFWATDRITIQEANSNG 239
PC+ + C L P C++ EC + Y S + G+ ++ T+ G
Sbjct: 136 PCSGSLCSDL----PSSQCSAGGAECDYKYSYGLASDPHHYTQGYLGSETFTLGSDAVPG 191
Query: 240 YFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITF 299
GC S G SG++GL R P+S++++ N FSYCL S T + F
Sbjct: 192 ------IGFGCTTMSEGGYGSGSGLVGLGRGPLSLVSQLNVGAFSYCLTSDAAKTSPLLF 245
Query: 300 GKTD-TVNTLRSAFHKRMKKYKKAKGLED--------------------------LLDTC 332
G T ++S R Y LE L +
Sbjct: 246 GSGALTGAGVQSTPLLRTSTYYYTVNLESISIGAATTAGTGSSGIIFDSGTTVAFLAEPA 305
Query: 333 YDLS------------------AYET------VVVPKIAIHFLGGVDLELDVRGTLVVAS 368
Y L+ YE V P + +HF GG D++L
Sbjct: 306 YTLAKEAVLSQTTNLTMASGRDGYEVCFQTSGAVFPSMVLHFDGG-DMDLPTENYFGAVD 364
Query: 369 VSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
S C P+ +GN+ Q + + YDV L F P NC
Sbjct: 365 DSVSCW---IVQKSPSLSIVGNIMQMNYHIRYDVEKSMLSFQPANC 407
>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
Length = 460
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 76/275 (27%), Positives = 127/275 (46%), Gaps = 40/275 (14%)
Query: 33 HIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQD 92
H + ++SLLP + C P G L + YGPCS+L Q +PS ++I QD
Sbjct: 40 HTLDINSLLPKSNCTA-----PVGGGSQGLPITYSYGPCSQLGQ---KKSPSRQQIFLQD 91
Query: 93 QQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADE--YYIVVAIGEPKQYVSLLLD 150
+ R+ N+ K F ++ + + DT+ ++ + + V G P+Q +L++D
Sbjct: 92 RSRVRSINA----KIFGQYSTQESKDGWSPESMDTLNEDGLFLVNVGFGTPQQKFNLIID 147
Query: 151 TGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECP 210
TGSD TW QC C F S S ++ C P + N
Sbjct: 148 TGSDTTWIQCNSCSLGNCHNKKTFNPSLSSSYSNRSC-----------IPSTDTN----- 191
Query: 211 FNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRS 270
+ ++Y D S S G + D +T++ F ++ F GC ++ G+ ASG++GL +
Sbjct: 192 YTMKYEDNSYSKGVFVCDEVTLKPD----VFPKFQF--GCGDSGGGEFGTASGVLGLAKG 245
Query: 271 P-VSIITRTNTSY---FSYCLPSPYGSTGYITFGK 301
S+I++T + + FSYC P + G + FG+
Sbjct: 246 EQYSLISQTASKFKKKFSYCFPPKEHTLGSLLFGE 280
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 66/140 (47%), Gaps = 6/140 (4%)
Query: 277 RTNTSYFSYCLPSPYGSTG-YITFGKTDTVNTLRSAFHKRMKKYKKAKG--LEDLLDTCY 333
R N S + P +G IT T LR+AF + M E LLDTCY
Sbjct: 316 RLNVSSSLFASPGTIIDSGTVITRLPTAAYEALRTAFQQEMLHCPSISPPPQEKLLDTCY 375
Query: 334 DLSAY--ETVVVPKIAIHFLGGVDLELDVRGTLVV-ASVSQVCLGFATYPPDPNSITLGN 390
+L + +P+I +HF+G VD+ L G L ++Q CL FA + +GN
Sbjct: 376 NLKGCGGRNIKLPEIVLHFVGEVDVSLHPSGILWANGDLTQACLAFARKSNPSHVTIIGN 435
Query: 391 VQQRGHEVHYDVAGRRLGFG 410
QQ +V YD+ G RLGFG
Sbjct: 436 RQQVSLKVVYDIEGGRLGFG 455
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 71/238 (29%), Positives = 106/238 (44%), Gaps = 31/238 (13%)
Query: 63 EVVSKYGPCS--RLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTF 120
E++ + P S R N +T L + R ++R L L F+
Sbjct: 21 ELIHREHPSSPLRSNTSKTTTEIFLAAVKRGAERRAQLSK---------HILAEGRLFST 71
Query: 121 PANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSK 180
P + EY I ++ G P Q S+++DTGSD+ WTQC PC C F KS
Sbjct: 72 PVASGN---GEYLIDISFGSPPQKASVIVDTGSDLIWTQCLPCETCNAAASVIFDPVKSS 128
Query: 181 TFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGY 240
T+ + C S C S PF +C + C ++ Y DGS + G +
Sbjct: 129 TYDTVSCASNFC----SSLPFQSCTTS-CKYDYMYGDGSSTSG-------ALSTETVTVG 176
Query: 241 FTRYP-FLLGCINNSSGDKSGASGIMGLDRSPVSIITRTN---TSYFSYCLPSPYGST 294
P GC + + G +GA+GI+GL + P+S+I++ + + FSYCL P GST
Sbjct: 177 TGTIPNVAFGCGHTNLGSFAGAAGIVGLGQGPLSLISQASSITSKKFSYCL-VPLGST 233
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 32/119 (26%), Positives = 51/119 (42%), Gaps = 6/119 (5%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T+ +T N L +A + + +A G LD C+ + P + HF G D
Sbjct: 303 LTYLETGAFNALVAALKAEVP-FPEADGSLYGLDYCFSTAGVANPTYPTMTFHF-KGADY 360
Query: 357 ELDVRGTLVVASVS-QVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
EL V +CL A +GN+QQ+ H + +D+ +R+GF NC
Sbjct: 361 ELPPENVFVALDTGGSICLAMAA---STGFSIMGNIQQQNHLIVHDLVNQRVGFKEANC 416
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 77/285 (27%), Positives = 132/285 (46%), Gaps = 30/285 (10%)
Query: 29 LSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEI 88
+SHS +++ L N+C AL G S+E++ + S + T +
Sbjct: 1 MSHSSCLTLVLLCLYNIC--FSEALKSG---FSVEIIHRDSSRSPFYRATETQFQRVTNA 55
Query: 89 LRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLL 148
+R+ R + F + + A P + D +Y + ++G P V +
Sbjct: 56 VRRSMNRAN---------HFNQISVYSNAVESPVTLLDD--GDYLMSYSLGTPPFPVYGI 104
Query: 149 LDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKE 208
+DT SD+ W QC+ C C+ P F S SKT+ +PC+ST+C+ ++ + +C+S E
Sbjct: 105 VDTASDIIWVQCQLCETCYNDTSPMFDPSYSKTYKNLPCSSTTCKSVQGT----SCSSDE 160
Query: 209 ---CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLGCINNSSGDKSGASGI 264
C + Y DGS S G + +T+ + N F +P ++GCI N++ GI
Sbjct: 161 RKICEHTVNYKDGSHSQGDLIVETVTL--GSYNDPFVHFPRTVIGCIRNTNVSFDSI-GI 217
Query: 265 MGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDTVN 306
+GL PVS++ + ++S FSYCL + + FG V+
Sbjct: 218 VGLGGGPVSLVPQLSSSISKKFSYCLAPISDRSSKLKFGDAAMVS 262
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/113 (31%), Positives = 54/113 (47%), Gaps = 6/113 (5%)
Query: 303 DTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRG 362
D + L SA +K + L+ CY S Y+ V VP I HF G D++L+
Sbjct: 323 DVYSKLESAVADVVKLERAEDPLKQF-SLCYK-STYDKVDVPVITAHF-SGADVKLNALN 379
Query: 363 TLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
T +VAS VCL F + + GN+ Q+ V YD+ + + F P +C+
Sbjct: 380 TFIVASHRVVCLAFLS---SQSGAIFGNLAQQNFLVGYDLQRKIVSFKPTDCT 429
>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
Length = 367
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 56/153 (36%), Positives = 73/153 (47%), Gaps = 15/153 (9%)
Query: 138 IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRE 197
IG P Q S +D ++ WTQC CIHCF+Q P F + S TF PC + C+
Sbjct: 30 IGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCK---- 85
Query: 198 SFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA--NSNGYFTRYPFLLGCINNSS 255
S P C S C F+ G + G ATD I A S G+ GC+ S
Sbjct: 86 SIPTPKCASDVCAFDGVTGLGGHTVGIVATDTFAIGTAAPASLGF--------GCVVASD 137
Query: 256 GDKSGA-SGIMGLDRSPVSIITRTNTSYFSYCL 287
D G SG +GL R+P S++ + + FSYCL
Sbjct: 138 IDTMGGPSGFIGLGRTPWSLVAQMKLTRFSYCL 170
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 52/187 (27%), Positives = 92/187 (49%), Gaps = 14/187 (7%)
Query: 54 PQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLK 113
P+ SLE++ + + + TH L E L++D+QR+ S+ +
Sbjct: 50 PRDGGTLSLELIHRNSLLREAKEKLHTHEQLLLETLQRDEQRVRWIESK------AQLAG 103
Query: 114 RTEAFTFPANINDTV-------ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC 166
+ + ++N V + EY++ + +G P + + +++DTGSD+ W QC+PC C
Sbjct: 104 KKKDEASSTDLNGPVTSGLLYGSGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQPCKSC 163
Query: 167 FQQRDPFFYASKSKTFFKIPCNSTSCRILR-ESFPFGNCNSKECPFNIQYADGSGSGGFW 225
++Q DP F S +F +IPC S C+ L S + C + + Y DGS S G +
Sbjct: 164 YKQADPIFDPRNSSSFQRIPCLSPLCKALEIHSCSGSRGATSRCSYQVAYGDGSFSVGDF 223
Query: 226 ATDRITI 232
++D T+
Sbjct: 224 SSDLFTL 230
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 39/122 (31%), Positives = 55/122 (45%), Gaps = 10/122 (8%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T T T+R AF A L DTCY+ S +V VP + +HF G DL
Sbjct: 367 VTRFPTSVYATIRDAFRNATTNLPSAPRYS-LFDTCYNFSGKASVDVPALVLHFENGADL 425
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSITL---GNVQQRGHEVHYDVAGRRLGFGPG 412
+L L+ + + CL FA P S+ L GN+QQ+ + +D+ L F P
Sbjct: 426 QLPPTNYLIPINTAGSFCLAFA-----PTSMELGIIGNIQQQSFRIGFDLQKSHLAFAPQ 480
Query: 413 NC 414
C
Sbjct: 481 QC 482
>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
Length = 486
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 94/326 (28%), Positives = 152/326 (46%), Gaps = 51/326 (15%)
Query: 9 LLFICLLCSSNNGAYADDNDLSHSHIVSVSSLLPPNV-CNRTRTALPQGPDKASLEVVSK 67
+LF+ L C ++ A D +L + +V VS L P C+ R P + + + +
Sbjct: 6 ILFLLLGCPTSRAA---DEELELT-VVDVSLLQEPRASCSGHRVMPPHPYNNSWVPLFRP 61
Query: 68 YGPCSRLNQGISTHA----PSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPAN 123
GPCS +G + A PSL ++LRQD+ R+H + RR+ +F P +
Sbjct: 62 LGPCSPSFKGAAAAAARTKPSLADVLRQDRLRVHHIH-RRVSGSSRGARASKGSFKEPVS 120
Query: 124 INDT-VADEYYIVVAIG------EPKQY--------------VSLLLDTGSDVTWTQCKP 162
+ +T + + I V +G EP V+++LDT DV W +C P
Sbjct: 121 VEETQLHHQAAISVEVGTSQTSSEPSSGIHPAAATDGSSSPPVTVVLDTAGDVPWMRCVP 180
Query: 163 CIHCFQQ---RDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYA-DG 218
C F Q DP ++S T+ PCNS++C+ L + G + +C + + A D
Sbjct: 181 CT--FAQCADYDP----TRSSTYSAFPCNSSACKQLGR-YANGCDANGQCQYMVVTAGDS 233
Query: 219 SGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD-KSGASGIMGLDRSPVSIITR 277
+ G +++D +TI NS + F GC N G ++ A GIM L R S++ +
Sbjct: 234 FTTSGTYSSDVLTI---NSGDRVEGFRF--GCSQNEQGSFENQADGIMALGRGVQSLMAQ 288
Query: 278 TNTSY---FSYCLPSPYGSTGYITFG 300
T+++Y FSYCLP + G+ G
Sbjct: 289 TSSTYGDAFSYCLPPTETTKGFFQIG 314
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 44/107 (41%), Positives = 61/107 (57%), Gaps = 7/107 (6%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
LR+AF RM+ Y+ A E+L DTCYDL+ +P+IA+ F G +E+D G L+
Sbjct: 387 LRAAFRNRMR-YRVAPPQEEL-DTCYDLTGVRYPRLPRIALVFDGNAVVEMDRSGILLNG 444
Query: 368 SVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
CL FA+ D + LGNVQQ+ +V +DV G R+GF C
Sbjct: 445 -----CLAFASNDDDSSPSILGNVQQQTIQVLHDVGGGRIGFRSAAC 486
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 55/156 (35%), Positives = 87/156 (55%), Gaps = 10/156 (6%)
Query: 136 VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRIL 195
++IG+P LL+DTGSD+TW QC PC C+ Q PFF+ S+S T+ + SC
Sbjct: 92 ISIGDPPVPQLLLIDTGSDLTWIQCLPC-KCYPQTIPFFHPSRSSTY-----RNASCESA 145
Query: 196 RESFP--FGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINN 253
+ P F + + C ++++Y D S + G A +++T Q ++ G ++ + GC +
Sbjct: 146 PHAMPQIFRDEKTGNCRYHLRYRDFSNTRGILAKEKLTFQTSD-EGLISKPNIVFGCGQD 204
Query: 254 SSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
+SG + SG++GL SI+TR S FSYC S
Sbjct: 205 NSG-FTQYSGVLGLGPGTFSIVTRNFGSKFSYCFGS 239
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 92/354 (25%), Positives = 137/354 (38%), Gaps = 77/354 (21%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + IG P++ L LDTGSDVTW QC PC C+ Q DP + S S ++ ++ C S
Sbjct: 44 EYFARMGIGSPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSA 103
Query: 191 SCRILR---------------------------ESFPFGNCNSKECPFNIQYADG-SGSG 222
C+ L ESF G NS NI + G S SG
Sbjct: 104 LCQALDYSACQGMGCSYRVVYGDSSASSGDLGIESFYLGP-NSSTAMRNIAFGCGHSNSG 162
Query: 223 GFWATDRITIQEANSNGYFTRYPFLLG-----CINNSSGDKSGASGIMGLDRSPVSIITR 277
F + + +F++ +G C+ + S + R+ + R
Sbjct: 163 LFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFAAR 222
Query: 278 ---------TNTSYFSY---------CLPSP---YGSTGYITFGKTDTVNT--------- 307
+T Y++ LP P + TG T G T
Sbjct: 223 FTPLLKNPRIDTFYYAILTGISVGGTALPIPPAQFALTGNGTGGAILDSGTSVTRVVPAA 282
Query: 308 ---LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL 364
LR A+ + A G+ LLDTC++ TV +P + +HF VD+ L L
Sbjct: 283 YAVLRDAYRAASRNLPPAPGVY-LLDTCFNFQGLPTVQIPSLVLHFDNDVDMVLPGGNIL 341
Query: 365 V-VASVSQVCLGFATYPPDPNSI---TLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ V CL FA P+S+ +GNVQQ+ + +D+ + P C
Sbjct: 342 IPVDRSGTFCLAFA-----PSSMPISVIGNVQQQTFRIGFDLQRSLIAIAPREC 390
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 129/520 (24%), Positives = 209/520 (40%), Gaps = 119/520 (22%)
Query: 3 ILSKAFLLFICLLCSSNNGAYADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASL 62
++ A LL +C+ S A ADD + +V SSL P VC R +S
Sbjct: 1 MVCAARLLILCIATSLLADAGADDQ--VNYVVVETSSLKPSAVCKGHRVHPSVNNYSSSW 58
Query: 63 EVVSK-YGPCS-RLNQGIS---THAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEA 117
+S +GPCS +G + + + ++++LR DQ R + E + +++
Sbjct: 59 TPLSNPHGPCSPSWEEGAAMDYSASSMVDDMLRWDQHRAGYIQRKLSGNVSHEDTEISDS 118
Query: 118 FTFPANINDTVA-------------------DEYYIVV----------AIG-------EP 141
T ++N A D ++ VV A G P
Sbjct: 119 TTTLESVNGGGAGDFSMGDDGTGGMAKAQQQDTHHQVVEELSSAADPAATGGSRRSRLRP 178
Query: 142 KQYVSLLLDTGSDVTWTQCKPC--IHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESF 199
+LLDT SDV W QC PC C+ Q D + SKS++ C+S +CR L
Sbjct: 179 GVRQLMLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQLG--- 235
Query: 200 PFGN-CNSK-----ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLGCIN 252
P+ N C+S +C + ++Y DGS + G D++++ ++ P F GC +
Sbjct: 236 PYANGCSSSSNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPT------SQVPKFEFGCSH 289
Query: 253 NSSG--DKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFG------- 300
+ G +S +GIM L R S++++T+T Y FSYC P G+ G
Sbjct: 290 AARGSFSRSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCFPPTASHKGFFVLGVPRRSSS 349
Query: 301 KTDTVNTLRSA--FHKRMKKYKKAKGLEDLLDTCYDLSAY---ETVV--VPKIAIHFL-- 351
+ L++ + R++ A D+ T + A TV+ +P A L
Sbjct: 350 RYAVTPMLKTPMLYQVRLEAIAVAGQRLDVPPTVFAAGAALDSRTVITRLPPTAYQALRS 409
Query: 352 ---------------GGVDLELDVRG--TLVVASVSQV-------------------CLG 375
G +D D G ++++ ++S V CL
Sbjct: 410 AFRDKMSMYRPAAANGQLDTCYDFTGVSSIMLPTISLVFDRTGAGVQLDPSGVLFGSCLA 469
Query: 376 FATYPPDPNSI-TLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
FA+ D + +G +Q + EV Y+VAG +GF G C
Sbjct: 470 FASTAGDDRATGIIGFLQLQTIEVLYNVAGGSVGFRRGAC 509
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 75/252 (29%), Positives = 111/252 (44%), Gaps = 20/252 (7%)
Query: 58 DKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLH--LKNSRRLRKPFPEFLKRT 115
+ +SL V+ G CS S+ ++ E ++ D R +K K +
Sbjct: 50 ETSSLSVMHIQGKCSPFRLLNSSWWTAVSESIKGDTARYRAMVKGGWSAGK---TMVNPQ 106
Query: 116 EAFTFPANINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFF 174
E P ++ YI+ + G P Q +LDTGS++ W C PC C ++ P F
Sbjct: 107 EDADIPLASGQAISSSNYIIKLGFGTPPQSFYTVLDTGSNIAWIPCNPCSGCSSKQQP-F 165
Query: 175 YASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQE 234
SKS T+ + C S C++LR + NS C +Y D S D I E
Sbjct: 166 EPSKSSTYNYLTCASQQCQLLRVCTK--SDNSVNCSLTQRYGDQS------EVDEILSSE 217
Query: 235 ANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPY 291
S G F+ GC N + G ++G R+P+S +++T T Y FSYCLPS +
Sbjct: 218 TLSVGSQQVENFVFGCSNAARGLIQRTPSLVGFGRNPLSFVSQTATLYDSTFSYCLPSLF 277
Query: 292 GS--TGYITFGK 301
S TG + GK
Sbjct: 278 SSAFTGSLLLGK 289
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 54/113 (47%), Gaps = 6/113 (5%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
N +R +F ++ A DL DTCY+ + + V P I +HF +DL L + L
Sbjct: 360 NAMRDSFRSQLSNLTMASP-TDLFDTCYNRPSGD-VEFPLITLHFDDNLDLTLPLDNILY 417
Query: 366 VAS--VSQVCLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ S +CL F P + + T GN QQ+ + +DVA RLG NC
Sbjct: 418 PGNDDGSVLCLAFGLPPGGGDDVLSTFGNYQQQKLRIVHDVAESRLGIASENC 470
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 52/166 (31%), Positives = 83/166 (50%), Gaps = 8/166 (4%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC--FQQRDPFFYASKSKTFFKIPCN 188
EY + ++IG P Q + ++DTGSD+ W +C C HC + F++ S ++ K+PCN
Sbjct: 4 EYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCN 63
Query: 189 STSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRY--PF 246
ST C + + C + C + +Y DGS + G +DRI+ + + + F
Sbjct: 64 STHCSGMSSAGIGPRCE-ETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGF 122
Query: 247 LLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
L GC GD + G++GL + S+I + FSYCL S
Sbjct: 123 LFGCARKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVS 168
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 55/179 (30%), Positives = 92/179 (51%), Gaps = 14/179 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNS 189
Y + +G P +++D+GS +TW QC PC + C Q P + S T+ +PC++
Sbjct: 107 NYITRLGLGTPTTTYVMVVDSGSSLTWLQCAPCAVSCHPQAGPLYDPRASSTYAAVPCSA 166
Query: 190 TSCRILRESF--PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-F 246
C L+ + P S C + Y DGS S G+ + D +++ + S +P F
Sbjct: 167 PQCAELQAATLNPSSCSGSGVCQYQASYGDGSFSFGYLSKDTVSLSSSGS------FPGF 220
Query: 247 LLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLP-SPYGSTGYITFGK 301
GC ++ G A+G++GL R+ +S++++ S F+YCLP S S GY++FG
Sbjct: 221 YYGCGQDNVGLFGRAAGLIGLARNKLSLLSQLAPSVGNSFAYCLPTSAAASAGYLSFGS 279
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/89 (38%), Positives = 45/89 (50%), Gaps = 4/89 (4%)
Query: 327 DLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSI 386
+L TC+ + VP + + F GG L L LV + + CL FA P D +I
Sbjct: 366 SILQTCFK-GQVAKLPVPAVNMAFAGGATLRLTPGNVLVDVNETTTCLAFA--PTDSTAI 422
Query: 387 TLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+GN QQ+ V YDV G R+GF G CS
Sbjct: 423 -IGNTQQQTFSVVYDVKGSRIGFAAGGCS 450
>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 397
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 55/153 (35%), Positives = 73/153 (47%), Gaps = 15/153 (9%)
Query: 138 IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRE 197
IG P Q S +D ++ WTQC CIHCF+Q P F + S TF PC + C+
Sbjct: 60 IGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCK---- 115
Query: 198 SFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA--NSNGYFTRYPFLLGCINNSS 255
S P C S C ++ G + G ATD I A S G+ GC+ S
Sbjct: 116 SIPTPKCASDVCAYDGVTGLGGHTVGIVATDTFAIGTAAPASLGF--------GCVVASD 167
Query: 256 GDKSGA-SGIMGLDRSPVSIITRTNTSYFSYCL 287
D G SG +GL R+P S++ + + FSYCL
Sbjct: 168 IDTMGGPSGFIGLGRTPWSLVAQMKLTRFSYCL 200
>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
Length = 720
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 82/162 (50%), Gaps = 13/162 (8%)
Query: 146 SLLLDTGSDVTWTQCKPC--IHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGN 203
++++D+GSDV+W QCKPC C +QRDP F + S T+ +PC S +C L + G
Sbjct: 169 TVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGP-YRRGC 227
Query: 204 CNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDK--SGA 261
+ +C F I Y DGS + G ++ D +T+ Y F GC + G
Sbjct: 228 SANAQCQFGINYGDGSTATGTYSFDDLTLGP-----YDVIRGFRFGCAHADRGSAFDYDV 282
Query: 262 SGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFG 300
+G + L S++ +T T Y FSYCLP S G++ G
Sbjct: 283 AGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLG 324
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 43/109 (39%), Positives = 58/109 (53%), Gaps = 6/109 (5%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
LR+AF + M Y+ A + +LDTCYD + ++ +P IA+ F GG + LD G L+
Sbjct: 618 QALRAAFRRAMTMYRTAPPVS-ILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL 676
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
Q CL FA D +GNVQQR EV YDV G+ + F C
Sbjct: 677 -----QGCLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 720
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 48/92 (52%), Gaps = 6/92 (6%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
LR+AF M Y+ A + +LDTCYD + ++ +P IA+ F GG + LD G L+
Sbjct: 393 QALRAAFRSAMTMYRAAPPVS-ILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL 451
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHE 397
+ CL FA D +GNVQQ+ E
Sbjct: 452 GS-----CLAFAPTASDRMPGFIGNVQQKTLE 478
>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
Length = 629
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 82/162 (50%), Gaps = 13/162 (8%)
Query: 146 SLLLDTGSDVTWTQCKPC--IHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGN 203
++++D+GSDV+W QCKPC C +QRDP F + S T+ +PC S +C L + G
Sbjct: 78 TVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQL-GPYRRGC 136
Query: 204 CNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDK--SGA 261
+ +C F I Y DGS + G ++ D +T+ Y F GC + G
Sbjct: 137 SANAQCQFGINYGDGSTATGTYSFDDLTLGP-----YDVIRGFRFGCAHADRGSAFDYDV 191
Query: 262 SGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFG 300
+G + L S++ +T T Y FSYCLP S G++ G
Sbjct: 192 AGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLG 233
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 43/109 (39%), Positives = 58/109 (53%), Gaps = 6/109 (5%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
LR+AF + M Y+ A + +LDTCYD + ++ +P IA+ F GG + LD G L+
Sbjct: 527 QALRAAFRRAMTMYRTAPPVS-ILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL 585
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
Q CL FA D +GNVQQR EV YDV G+ + F C
Sbjct: 586 -----QGCLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 629
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 48/92 (52%), Gaps = 6/92 (6%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
LR+AF M Y+ A + +LDTCYD + ++ +P IA+ F GG + LD G L+
Sbjct: 302 QALRAAFRSAMTMYRAAPPVS-ILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL 360
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHE 397
+ CL FA D +GNVQQ+ E
Sbjct: 361 GS-----CLAFAPTASDRMPGFIGNVQQKTLE 387
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 65/186 (34%), Positives = 96/186 (51%), Gaps = 19/186 (10%)
Query: 112 LKRTEAFTFPANINDTVAD-------EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCI 164
L+R A A+ N + E+ + +AIG P + S ++DTGSD+ WTQCKPC
Sbjct: 73 LERLNAMVLAASSNAEINSPVLSGNGEFLMNLAIGTPPETYSAIMDTGSDLIWTQCKPCT 132
Query: 165 HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGF 224
CF Q P F KS +F K+ C+S C+ L +S +C S C + Y D S + G
Sbjct: 133 QCFDQPSPIFDPKKSSSFSKLSCSSQLCKALPQS----SC-SDSCEYLYTYGDYSSTQGT 187
Query: 225 WATDRITIQEANSNGYFTRYPFLLGCINNSSGDK-SGASGIMGLDRSPVSIITRTNTSYF 283
AT+ T G + GC ++ GD + SG++GL R P+S++++ + F
Sbjct: 188 MATETFTF------GKVSIPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLSLVSQLKEAKF 241
Query: 284 SYCLPS 289
SYCL S
Sbjct: 242 SYCLTS 247
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 57/131 (43%), Gaps = 12/131 (9%)
Query: 292 GSTGYITFGKTDTVNTLRSAFHKRMKKYKKAKGL------EDLLDTCYDL-SAYETVVVP 344
G+ G I T SAF K++ GL L+ CY+L S + VP
Sbjct: 313 GTGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLPVDNSGATGLELCYNLPSDTSELEVP 372
Query: 345 KIAIHFLGGVDLELDVRGTLVV-ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVA 403
K+ +HF G DLEL ++ +S+ +CL + GNVQQ+ V +D+
Sbjct: 373 KLVLHFTG-ADLELPGENYMIADSSMGVICLAMGS---SGGMSIFGNVQQQNMFVSHDLE 428
Query: 404 GRRLGFGPGNC 414
L F P NC
Sbjct: 429 KETLSFLPTNC 439
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 70/223 (31%), Positives = 111/223 (49%), Gaps = 16/223 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+I + +G P ++V L+LDTGSD++W QC PC CF+Q P + ++S ++ I C
Sbjct: 169 EYFIDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGPHYNPNESSSYRNISCYDP 228
Query: 191 SCRILRESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNG---YFTRYP 245
C+++ P +C ++ CP+ YADGS + G +A + T+ NG +
Sbjct: 229 RCQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHVVD 288
Query: 246 FLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGY---ITF 299
+ GC + + G GA G++GL R P+S ++ + Y FSYCL + +T + F
Sbjct: 289 VMFGCGHWNKGFFHGAGGLLGLGRGPLSFPSQLQSIYGHSFSYCLTDLFSNTSVSSKLIF 348
Query: 300 GKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVV 342
G+ + H + K G E DT Y L VV
Sbjct: 349 GEDKELLN-----HHNLNFTKLLAGEETPDDTFYYLQIKSIVV 386
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 55/125 (44%), Gaps = 4/125 (3%)
Query: 292 GSTGYITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFL 351
GST +TF + ++ AF K++K + A + ++ CY++S V +P IHF
Sbjct: 413 GST--LTFFPDSAYDVIKEAFEKKIKLQQIAAD-DFIMSPCYNVSGAMQVELPDYGIHFA 469
Query: 352 GGVDLELDVRGTLVVASVSQV-CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFG 410
G +V CL P + +GN+ Q+ + YDV RLG+
Sbjct: 470 DGAVWNFPAENYFYQYEPDEVICLAILKTPNHSHLTIIGNLLQQNFHILYDVKRSRLGYS 529
Query: 411 PGNCS 415
P C+
Sbjct: 530 PRRCA 534
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 87/157 (55%), Gaps = 9/157 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V +G+P + ++LDTGSD+ W QC+PC C+QQ DP F S +F +PC S
Sbjct: 154 EYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQ 213
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C+ L S C + +C + + Y DGS + G + + +T ++G +GC
Sbjct: 214 QCQALETS----GCRASKCLYQVSYGDGSFTVGEFVIETLTF---GNSGMINNVA--VGC 264
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+++ G G++G++GL +S+ ++ S FSYCL
Sbjct: 265 GHDNEGLFVGSAGLLGLGGGSLSLTSQMKASSFSYCL 301
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 52/120 (43%), Positives = 67/120 (55%), Gaps = 6/120 (5%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
IT +T NTLR AF R KK G L DTCYDLS+ V +P ++ F GG L
Sbjct: 376 ITRLQTQAYNTLRDAFVSRTPYLKKTNGFA-LFDTCYDLSSQSRVTIPTVSFEFAGGKSL 434
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSITL-GNVQQRGHEVHYDVAGRRLGFGPGNC 414
+L + L+ V SV C FA P +S+++ GNVQQ+G VHYD+A +GF P C
Sbjct: 435 QLPPKNYLIPVDSVGTFCFAFA---PTTSSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 68/200 (34%), Positives = 95/200 (47%), Gaps = 19/200 (9%)
Query: 94 QRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADE----YYIVVAIGEPKQYVSLLL 149
Q L N R R R AF + VAD+ + + ++G P + +
Sbjct: 24 QSLDRNNVERRRT-------RRAAFIXDEIQANMVADDRGQAFLVNFSVGRPPVPQLVGI 76
Query: 150 DTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNS-KE 208
DTGSD+ W QC+PC CF+Q P F SKS T+ + +S C + P N +
Sbjct: 77 DTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPIC----PNSPQKKYNHLNQ 132
Query: 209 CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSG-ASGIMGL 267
C +N YADGS S G AT+ I E + G T + GC +++ G G SGI+GL
Sbjct: 133 CIYNASYADGSTSSGNLATEDIVF-ETSDQGTVTVSSVVFGCGHSNRGRFDGQQSGILGL 191
Query: 268 DRSPVSIITRTNTSYFSYCL 287
SI++R S FSYC+
Sbjct: 192 SAGDQSIVSRLG-SRFSYCI 210
Score = 38.1 bits (87), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 21/71 (29%), Positives = 32/71 (45%)
Query: 344 PKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVA 403
P++A HF G DL LD V + CL +G + Q+ + V YD+
Sbjct: 333 PELAFHFAEGADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLI 392
Query: 404 GRRLGFGPGNC 414
G+R+ F +C
Sbjct: 393 GKRVYFQRTDC 403
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 71/221 (32%), Positives = 110/221 (49%), Gaps = 15/221 (6%)
Query: 87 EILRQDQQRLHLKN-----SRRLRKPFPEFLKRTEAFTFPANINDTV---ADEYYIVVAI 138
E++ +D L N S RL F + R+ FT ++ + EY++ ++I
Sbjct: 32 ELIHRDSPHSPLYNPHHTVSDRLNAAFLRSISRSRRFTTKTDLQSGLISNGGEYFMSISI 91
Query: 139 GEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRES 198
G P V + DTGSD+TW QCKPC C++Q P F KS T+ C+S +C+ L E
Sbjct: 92 GTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLFDKKKSSTYKTESCDSKTCQALSEH 151
Query: 199 FPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLGCINNSSGD 257
+ + C + Y D S + G AT+ TI +S+G +P + GC N+ G
Sbjct: 152 EEGCDESKDICKYRYSYGDNSFTKGDVATE--TISIDSSSGSSVSFPGTVFGCGYNNGGT 209
Query: 258 -KSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGST 294
+ SGI+GL P+S++++ +S FSYCL +T
Sbjct: 210 FEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTAATT 250
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 68/200 (34%), Positives = 95/200 (47%), Gaps = 19/200 (9%)
Query: 94 QRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADE----YYIVVAIGEPKQYVSLLL 149
Q L N R R R AF + VAD+ + + ++G P + +
Sbjct: 24 QSLDRNNVERRRT-------RRAAFITDEIQANMVADDRGQAFLVNFSVGRPPVPQLVGI 76
Query: 150 DTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNS-KE 208
DTGSD+ W QC+PC CF+Q P F SKS T+ + +S C + P N +
Sbjct: 77 DTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPIC----PNSPQKKYNHLNQ 132
Query: 209 CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSG-ASGIMGL 267
C +N YADGS S G AT+ I E + G T + GC +++ G G SGI+GL
Sbjct: 133 CIYNASYADGSTSSGNLATEDIVF-ETSDQGTVTVSSVVFGCGHSNRGRFDGQQSGILGL 191
Query: 268 DRSPVSIITRTNTSYFSYCL 287
SI++R S FSYC+
Sbjct: 192 SAGDQSIVSRLG-SRFSYCI 210
Score = 38.1 bits (87), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 21/71 (29%), Positives = 32/71 (45%)
Query: 344 PKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVA 403
P++A HF G DL LD V + CL +G + Q+ + V YD+
Sbjct: 333 PELAFHFAEGADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLI 392
Query: 404 GRRLGFGPGNC 414
G+R+ F +C
Sbjct: 393 GKRVYFQRTDC 403
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 60/187 (32%), Positives = 90/187 (48%), Gaps = 16/187 (8%)
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCN 188
+ EY+ + +G P +YV ++LDTGSD+ W QC PC +C+ Q DP F KS +F K+ C
Sbjct: 126 SGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCR 185
Query: 189 STSCRILRESFPFGNCNSKE-CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
+ CR L CN ++ C + + Y DGS + G + T+ +T +
Sbjct: 186 TPLCRRLESP----GCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE------QVA 235
Query: 248 LGCINNSSG---DKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGST--GYITFGKT 302
LGC +++ G +G G+ S S RT FSYCL S+ + FG +
Sbjct: 236 LGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNS 295
Query: 303 DTVNTLR 309
T R
Sbjct: 296 AVSRTAR 302
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 43/109 (39%), Positives = 54/109 (49%), Gaps = 5/109 (4%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-V 366
LR AF K A L DTCYDLS TV VP + +HF G D+ L L+ V
Sbjct: 367 LRDAFRAGASSLKSAPEFS-LFDTCYDLSGKTTVKVPTVVLHFRGA-DVSLPASNYLIPV 424
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ C FA + I GN+QQ+G V YD+A R+GF P C+
Sbjct: 425 DGSGRFCFAFAGTTSGLSII--GNIQQQGFRVVYDLASSRVGFSPRGCA 471
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 60/160 (37%), Positives = 91/160 (56%), Gaps = 11/160 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + +G P +YV ++LDTGSDV W QC PC C+ Q D F +KS+T+ IPC +
Sbjct: 117 EYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQTDHVFDPTKSRTYAGIPCGAP 176
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
CR R P + +K C + + Y DGS + G ++T+ +T + TR LGC
Sbjct: 177 LCR--RLDSPGCSNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRNR----VTRVA--LGC 228
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
+++ G +GA+G++GL R +S +T + FSYCL
Sbjct: 229 GHDNEGLFTGAAGLLGLGRGRLSFPVQTGRRFNHKFSYCL 268
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 53/108 (49%), Gaps = 5/108 (4%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-V 366
LR AF K+A L DTC+DLS V VP + +HF G D+ L L+ V
Sbjct: 357 LRDAFRIGASHLKRAPEFS-LFDTCFDLSGLTEVKVPTVVLHFRGA-DVSLPATNYLIPV 414
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ C FA + I GN+QQ+G + YD+ G R+GF P C
Sbjct: 415 DNSGSFCFAFAGTMSGLSII--GNIQQQGFRISYDLTGSRVGFAPRGC 460
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 52/166 (31%), Positives = 83/166 (50%), Gaps = 8/166 (4%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC--FQQRDPFFYASKSKTFFKIPCN 188
EY + ++IG P Q + ++DTGSD+ W +C C HC + F++ S ++ K+PCN
Sbjct: 4 EYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCN 63
Query: 189 STSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRY--PF 246
ST C + + C + C + +Y DGS + G +DRI+ + + + F
Sbjct: 64 STHCSGMSSAGIGPRCE-ETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGF 122
Query: 247 LLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
L GC GD + G++GL + S+I + FSYCL S
Sbjct: 123 LFGCGRKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVS 168
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 65/219 (29%), Positives = 103/219 (47%), Gaps = 23/219 (10%)
Query: 83 PSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPK 142
PS + L D +RLH + RR KP P F+K + + + +Y++ + IG+P
Sbjct: 42 PSPTQALALDTRRLHFLSLRR--KPVP-FVKSPVV-----SGASSGSGQYFVDLRIGQPP 93
Query: 143 QYVSLLLDTGSDVTWTQCKPCIHCFQQRDP-FFYASKSKTFFKIPCNSTSCRILRESFPF 201
Q + L+ DTGSD+ W +C C +C F+ S TF C CR++ +
Sbjct: 94 QSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPKPGRA 153
Query: 202 GNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD 257
CN CP+ YADGS + G +A + +++ ++ + GC SG
Sbjct: 154 PRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTSSGKEAKLKS-VAFGCGFRISGQ 212
Query: 258 K------SGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
+GA+G+MGL R P+S ++ + FSYCL
Sbjct: 213 SVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCL 251
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 68/200 (34%), Positives = 95/200 (47%), Gaps = 19/200 (9%)
Query: 94 QRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADE----YYIVVAIGEPKQYVSLLL 149
Q L N R R R AF + VAD+ + + ++G P + +
Sbjct: 56 QSLDRNNVERRRT-------RRAAFITDEIQANMVADDRGQAFLVNFSVGRPPVPQLVGI 108
Query: 150 DTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNS-KE 208
DTGSD+ W QC+PC CF+Q P F SKS T+ + +S C + P N +
Sbjct: 109 DTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPIC----PNSPQKKYNHLNQ 164
Query: 209 CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSG-ASGIMGL 267
C +N YADGS S G AT+ I E + G T + GC +++ G G SGI+GL
Sbjct: 165 CIYNASYADGSTSSGNLATEDIVF-ETSDQGTVTVSSVVFGCGHSNRGRFDGQQSGILGL 223
Query: 268 DRSPVSIITRTNTSYFSYCL 287
SI++R S FSYC+
Sbjct: 224 SAGDQSIVSRLG-SRFSYCI 242
Score = 38.5 bits (88), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 21/71 (29%), Positives = 32/71 (45%)
Query: 344 PKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVA 403
P++A HF G DL LD V + CL +G + Q+ + V YD+
Sbjct: 365 PELAFHFAEGADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLI 424
Query: 404 GRRLGFGPGNC 414
G+R+ F +C
Sbjct: 425 GKRVYFQRTDC 435
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 60/187 (32%), Positives = 90/187 (48%), Gaps = 16/187 (8%)
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCN 188
+ EY+ + +G P +YV ++LDTGSD+ W QC PC +C+ Q DP F KS +F K+ C
Sbjct: 39 SGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCR 98
Query: 189 STSCRILRESFPFGNCNSKE-CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
+ CR L CN ++ C + + Y DGS + G + T+ +T +
Sbjct: 99 TPLCRRLESP----GCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE------QVA 148
Query: 248 LGCINNSSG---DKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGST--GYITFGKT 302
LGC +++ G +G G+ S S RT FSYCL S+ + FG +
Sbjct: 149 LGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNS 208
Query: 303 DTVNTLR 309
T R
Sbjct: 209 AVSRTAR 215
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 43/109 (39%), Positives = 54/109 (49%), Gaps = 5/109 (4%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-V 366
LR AF K A L DTCYDLS TV VP + +HF G D+ L L+ V
Sbjct: 280 LRDAFRAGASSLKSAPEFS-LFDTCYDLSGKTTVKVPTVVLHFRG-ADVSLPASNYLIPV 337
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ C FA + I GN+QQ+G V YD+A R+GF P C+
Sbjct: 338 DGSGRFCFAFAGTTSGLSII--GNIQQQGFRVVYDLASSRVGFSPRGCA 384
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 54/173 (31%), Positives = 81/173 (46%), Gaps = 16/173 (9%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
Y + AIG P +S +LDTGSD+ WTQC PC CF Q P + ++S T+ + C S
Sbjct: 100 YLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVTYANVSCGSR 159
Query: 191 SCRIL---------RESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
C L S C + Y DGS + G AT+ T
Sbjct: 160 LCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTFGAGT----- 214
Query: 242 TRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGST 294
T + GC ++ G +SG++G+ R P+S++++ + FSYC +P+ T
Sbjct: 215 TVHDLAFGCGTDNLGGTDNSSGLVGMGRGPLSLVSQLGVTKFSYCF-TPFNDT 266
Score = 43.9 bits (102), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 39/77 (50%), Gaps = 5/77 (6%)
Query: 339 ETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQV-CLGFATYPPDPNSITLGNVQQRGHE 397
E V VP++ +HF G D+EL +V V+ V CLG + LG++QQ+
Sbjct: 385 EAVDVPRLVLHF-DGADMELPRSSAVVEDRVAGVACLGIVSAR---GMSVLGSMQQQNMH 440
Query: 398 VHYDVAGRRLGFGPGNC 414
V YDV L F P NC
Sbjct: 441 VRYDVGRDVLSFEPANC 457
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 56/165 (33%), Positives = 86/165 (52%), Gaps = 19/165 (11%)
Query: 146 SLLLDTGSDVTWTQCKPCIH--CFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGN 203
+++LDT SDVTW QC PC C+ Q+D + +KS + CNS +C L P+ N
Sbjct: 145 TMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLG---PYAN 201
Query: 204 --CNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD---K 258
N+ +C + ++Y DG+ + G + +D +TI A + F GC + G
Sbjct: 202 GCTNNNQCQYRVRYPDGTSTAGTYISDLLTITPATAVRSFQ-----FGCSHGVQGSFSFG 256
Query: 259 SGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFG 300
S A+GIM L P S++++T +Y FS+C P P G+ T G
Sbjct: 257 SSAAGIMALGGGPESLVSQTAATYGRVFSHCFPPPT-RRGFFTLG 300
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 52/109 (47%), Gaps = 6/109 (5%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
LR AF RM Y+ A + LDTCYD++ + +P+I + F +ELD G L
Sbjct: 367 QALRQAFRDRMAMYQPAPP-KGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLF 425
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
Q CL F P D +GN+Q + EV Y++ +GF C
Sbjct: 426 -----QGCLAFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 469
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 63/111 (56%), Gaps = 2/111 (1%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ V +G P ++ SL+LDTGSD+ W QC PCI CF+Q P++ S +F I C+
Sbjct: 194 EYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDP 253
Query: 191 SCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNG 239
C+++ P C + + CP+ Y DGS + G +A + T+ NG
Sbjct: 254 RCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNG 304
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 54/119 (45%), Gaps = 2/119 (1%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T+ ++ AF +++K Y+ +GL L CY++S E + +P I F G
Sbjct: 441 LTYFAEPAYEIIKEAFVRKIKGYELVEGLPPL-KPCYNVSGIEKMELPDFGILFADGAVW 499
Query: 357 ELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
V + VCL P SI +GN QQ+ + YD+ RLG+ P C+
Sbjct: 500 NFPVENYFIQIDPDVVCLAILGNPRSALSI-IGNYQQQNFHILYDMKKSRLGYAPMKCA 557
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 99/218 (45%), Gaps = 12/218 (5%)
Query: 76 QGISTHAPSLEEILRQDQQRLHLKN--SRRLRKPFPEFLKRTEAFTFPANINDTVADEYY 133
+ + H P + + H+ + S R + +K + F +++ + +
Sbjct: 9 ESVVRHNPDARVPVTPEDHIQHMTDISSARFKYLQNSIVKELGSSDFQVDVHQAIKTSLF 68
Query: 134 IV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQR--DPFFYASKSKTFFKIPCNST 190
V ++G+P ++DTGS + W QC PC HC P F + S TF + C+
Sbjct: 69 FVNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTFVECSCDDR 128
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
CR P G+C+S +C + Y G+GS G A +R+T N N T+ P GC
Sbjct: 129 FCRYA----PNGHCSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQ-PIAFGC 183
Query: 251 IN-NSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+ N +S +GI+GL P S+ + S FSYC+
Sbjct: 184 GHENGEQLESEFTGILGLGAKPTSLAVQLG-SKFSYCI 220
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 62/185 (33%), Positives = 94/185 (50%), Gaps = 14/185 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + ++IG P + + DTGSD+ WTQC PC C+QQ P F +S T+ K+ C+S+
Sbjct: 85 EYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCSSS 144
Query: 191 SCRILRESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
CR L ++ +C++ E C + I Y D S + G A D +T+ + R ++
Sbjct: 145 QCRALEDA----SCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLR-NMII 199
Query: 249 GCINNSSGDKSGA----SGIMGLDRSPVSIITRTNTSYFSYCL---PSPYGSTGYITFGK 301
GC + ++G A G+ G S VS + ++ FSYCL S G T I FG
Sbjct: 200 GCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKINFGT 259
Query: 302 TDTVN 306
V+
Sbjct: 260 NGIVS 264
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 98/402 (24%), Positives = 160/402 (39%), Gaps = 99/402 (24%)
Query: 90 RQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVA---DEYYIVVAIGEPKQYVS 146
R+ H+K + R E+LK A+++ V + + ++IG P
Sbjct: 43 RKPPHVYHIKEASVERL---EYLKAKTTGDIIAHLSPNVPIIPQAFLVNISIGSPPITQL 99
Query: 147 LLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPF--GNC 204
L +DT SD+ W QC PCI+C+ Q P F S+S T + +CR + S P N
Sbjct: 100 LHMDTASDLLWIQCLPCINCYAQSLPIFDPSRSYTH-----RNETCRTSQYSMPSLKFNA 154
Query: 205 NSKECPFNIQYADGSGSGGFWATDRI---TIQEANSNGYFTRYPFLLGCINNSSGDKSGA 261
N++ C ++++Y D +GS G A + + TI + +S+ + + GC +++ G+
Sbjct: 155 NTRSCEYSMRYVDDTGSKGILAREMLLFNTIYDESSSA--ALHDVVFGCGHDNYGEPLVG 212
Query: 262 SGIMGLDRSPVSIITRTNTSYFSYCLPS------PY-----GSTGYITFGKT-------- 302
+GI+GL S++ R FSYC S P+ G G G T
Sbjct: 213 TGILGLGYGEFSLVHRFGKK-FSYCFGSLDDPSYPHNVLVLGDDGANILGDTTPLEIHNG 271
Query: 303 ------------------------------------DTVNTLRSAFHKRMKKYKKAKGLE 326
DT N+L S + K K +E
Sbjct: 272 FYYVTIEAISVDGIILPIDPRVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNR--IE 329
Query: 327 DLLD---TCYDLSAYETVVV----------------PKIAIHFLGGVDLELDVRGTLVVA 367
D+ + T D+S + + + P + HF G +L LDV+ +
Sbjct: 330 DIFEGRFTAADVSQDDMIKMECYNGNFERDLVESGFPIVTFHFSEGAELSLDVKSLFMKL 389
Query: 368 SVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGF 409
S + CL A P + NSI G Q+ + + YD+ + F
Sbjct: 390 SPNVFCL--AVTPGNLNSI--GATAQQSYNIGYDLEAMEVSF 427
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 56/165 (33%), Positives = 86/165 (52%), Gaps = 19/165 (11%)
Query: 146 SLLLDTGSDVTWTQCKPCIH--CFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGN 203
+++LDT SDVTW QC PC C+ Q+D + +KS + CNS +C L P+ N
Sbjct: 170 TMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLG---PYAN 226
Query: 204 --CNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD---K 258
N+ +C + ++Y DG+ + G + +D +TI A + F GC + G
Sbjct: 227 GCTNNNQCQYRVRYPDGTSTAGTYISDLLTITPATAVRSFQ-----FGCSHGVQGSFSFG 281
Query: 259 SGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFG 300
S A+GIM L P S++++T +Y FS+C P P G+ T G
Sbjct: 282 SSAAGIMALGGGPESLVSQTAATYGRVFSHCFPPPT-RRGFFTLG 325
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/109 (33%), Positives = 52/109 (47%), Gaps = 6/109 (5%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
LR AF RM Y+ A + LDTCYD++ + +P+I + F +ELD G L
Sbjct: 392 QALRQAFRDRMAMYQPAPP-KGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLF 450
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
Q CL F P D +GN+Q + EV Y++ +GF C
Sbjct: 451 -----QGCLAFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 494
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 75/257 (29%), Positives = 116/257 (45%), Gaps = 37/257 (14%)
Query: 82 APSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANIND-TVADEYYIVVAIGE 140
A SL ++ R D++R+ +SR R+ + AF P + T +Y++ +G
Sbjct: 40 AASLADLARMDRERMAFISSRGRRRA----AETASAFAMPLSSGAYTGTGQYFVRFRVGT 95
Query: 141 PKQYVSLLLDTGSDVTWTQCK----------------PCIHCFQQRDPFFYASKSKTFFK 184
P Q L+ DTGSD+TW +C P R F KS+T+
Sbjct: 96 PAQPFLLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRR-TFRPDKSRTWAP 154
Query: 185 IPCNSTSCRILRESFPF--GNCN--SKECPFNIQYADGSGSGGFWATDRITIQEANSNGY 240
IPC+S +C RES PF C + C ++ +Y DGS + G D TI +
Sbjct: 155 IPCSSATC---RESLPFSLAACATPANPCAYDYRYKDGSAARGTVGVDSATIALSGRAAR 211
Query: 241 FTRYP-FLLGCINNSSGDKSGAS-GIMGLDRSPVSIITRTNTSY---FSYCLP---SPYG 292
+ +LGC + +G AS G++ L S +S +R + + FSYCL +P
Sbjct: 212 KAKLRGVVLGCTTSYNGQSFLASDGVLSLGYSNISFASRAASRFGGRFSYCLVDHLAPRN 271
Query: 293 STGYITFGKTDTVNTLR 309
+T Y+TFG ++ R
Sbjct: 272 ATSYLTFGPNPAFSSRR 288
Score = 47.0 bits (110), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 48/109 (44%), Gaps = 7/109 (6%)
Query: 310 SAFHKRMKKYKKAKGLEDLLDTCYDLSAYE----TVVVPKIAIHFLGGVDLELDVRGTLV 365
+A KR+ + D D CY+ ++ +P +A+HF G LE + ++
Sbjct: 377 AALSKRLAGLPRVT--MDPFDYCYNWTSPSGSDVAAPLPMLAVHFAGSARLEPPAKSYVI 434
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
A+ C+G P P +GN+ Q+ H YD+ RRL F C
Sbjct: 435 DAAPGVKCIGLQEGP-WPGLSVIGNILQQEHLWEYDLKNRRLRFKRSRC 482
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 60/162 (37%), Positives = 92/162 (56%), Gaps = 14/162 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
+Y+ + +G P + V ++ DTGSDV+W QC PC C++Q+DP F S S +F + C S+
Sbjct: 80 DYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSSFKPLACASS 139
Query: 191 SCRILRESFPFGNCNSK-ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
C L+ C+ K EC + + Y DGS + G ++T+ ++ E +G
Sbjct: 140 ICGKLK----IKGCSRKNECMYQVSYGDGSFTVGDFSTETLSFGEHAVRS------VAMG 189
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLP 288
C N+ G GA+G++GL R P+S ++T TSY FSYCLP
Sbjct: 190 CGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLP 231
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 55/114 (48%), Gaps = 5/114 (4%)
Query: 302 TDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVR 361
T LR AF + + + A G+ L DTCYDLS+ +T +P + + F GG + L
Sbjct: 311 TPAYTALRDAF-RSLVTFPSAPGIS-LFDTCYDLSSMKTATLPAVVLDFDGGASMPLPAD 368
Query: 362 GTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
G LV V CL FA P + +GNVQQ+ + D ++G P C
Sbjct: 369 GILVNVDDEGTYCLAFA--PEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 420
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 65/210 (30%), Positives = 103/210 (49%), Gaps = 25/210 (11%)
Query: 102 RRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCK 161
R + P ++ F N++ TV+ +A+G P Q V+++LDTGS+++W C
Sbjct: 61 RARQMPARALPRQPSKLRFHHNVSLTVS------LAVGTPPQNVTMVLDTGSELSWLLCA 114
Query: 162 PCIHCFQQRDPF----FYASKSKTFFKIPCNSTSCRILR-ESFPFGNCNSKECPFNIQYA 216
P R+ F F S TF +PC S CR S P + S C ++ YA
Sbjct: 115 PA----GARNKFSAMSFRPRASSTFAAVPCASAQCRSRDLPSPPACDGASSRCSVSLSYA 170
Query: 217 DGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCIN---NSSGDKSGASGIMGLDRSPVS 273
DGS S G ATD + +G R F GC++ +SS D ++G++G++R +S
Sbjct: 171 DGSSSDGALATDVFAV----GSGPPLRAAF--GCMSSAFDSSPDGVASAGLLGMNRGALS 224
Query: 274 IITRTNTSYFSYCLPSPYGSTGYITFGKTD 303
+++ +T FSYC+ S G + G +D
Sbjct: 225 FVSQASTRRFSYCI-SDRDDAGVLLLGHSD 253
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/185 (31%), Positives = 91/185 (49%), Gaps = 14/185 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + ++IG P + DTGSD+ W QC PC C++Q++P F S ++ I C +
Sbjct: 59 EYLMELSIGTPPIKIYAEADTGSDLVWFQCIPCTKCYKQQNPMFDPRSSSSYTNITCGTE 118
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
SC L S + + K C + YAD S + G A + +T+ + + GC
Sbjct: 119 SCNKLDSS--LCSTDQKTCNYTYSYADNSITQGVLAQETLTLTSTTGEPVAFQ-GIIFGC 175
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTS------YFSYCLPSPYGS----TGYITFG 300
+N+SG G++GL R P+S+I++ +S FS CL P+ + T + FG
Sbjct: 176 GHNNSGFNDREMGLIGLGRGPLSLISQIGSSLGAGGNMFSQCL-VPFNTDPSITSQMNFG 234
Query: 301 KTDTV 305
K V
Sbjct: 235 KGSEV 239
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 60/165 (36%), Positives = 91/165 (55%), Gaps = 8/165 (4%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ V IG P ++ SL+LDTGSD+ W QC PC CF+Q P++ +S +F I C+
Sbjct: 89 EYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKESSSFRNIGCHDP 148
Query: 191 SCRILRESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNG--YFTRYP- 245
C ++ P C ++ CP+ Y D S + G +AT+ T+ + G F R
Sbjct: 149 RCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKRVEN 208
Query: 246 FLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
+ GC + + G GASG++GL R P+S ++ + Y FSYCL
Sbjct: 209 VMFGCGHWNRGLFHGASGLLGLGRGPLSFSSQLQSLYGHSFSYCL 253
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 51/109 (46%), Gaps = 3/109 (2%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
++ AF K++K Y + +LD CY++S E + +P I F G V +
Sbjct: 347 IKDAFVKKVKGYPIVQDFP-ILDPCYNVSGVEKIDLPDFGILFADGAVWNFPVENYFIRL 405
Query: 368 SVSQV-CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+V CL P SI +GN QQ+ V YD RLG+ P NC+
Sbjct: 406 DPEEVVCLAILGTPRSALSI-IGNYQQQNFHVLYDTKKSRLGYAPMNCA 453
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 77/280 (27%), Positives = 123/280 (43%), Gaps = 38/280 (13%)
Query: 62 LEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQR----LH-LKNSRRLRKPFPEFLKRTE 116
LE++ ++ P ++ T L+E++ D R LH L+ + R+ E L +
Sbjct: 3 LELIHRHSP--QVMGRPKTQLQRLKELVHSDSVRQLMILHKLRGGQIPRRKAKEVLSSSS 60
Query: 117 AFTFPANIN-------DTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQ- 168
I D +Y++ +G P Q L+ DTGSD+TW CK HC
Sbjct: 61 GRGSDDAIEVPMHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCK--YHCRSR 118
Query: 169 ----------QRDPFFYASKSKTFFKIPCNSTSCRI-LRESFPFGNCNS--KECPFNIQY 215
+ F+A+ S +F IPC + C+I L + F NC + C ++ +Y
Sbjct: 119 NCSNRKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRY 178
Query: 216 ADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDK-SGASGIMGLDRSPVSI 274
+DGS + GF+A + +T+ E + L+GC + G A G+MGL S S
Sbjct: 179 SDGSTALGFFANETVTV-ELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSF 237
Query: 275 ITRTNTSY---FSYCLP---SPYGSTGYITFGKTDTVNTL 308
+ + FSYCL S + Y+TFG + + L
Sbjct: 238 AIKAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEAL 277
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/119 (26%), Positives = 62/119 (52%), Gaps = 1/119 (0%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+TF + +A + K++K + L+ C++ + +E +VP++ HF G +
Sbjct: 332 LTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGFEESLVPRLVFHFADGAEF 391
Query: 357 ELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
E V+ ++ A+ CLGF + P + +GN+ Q+ H +D+ ++LGF P +C+
Sbjct: 392 EPPVKSYVISAADGVRCLGFVSVAW-PGTSVVGNIMQQNHLWEFDLGLKKLGFAPSSCT 449
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 64/186 (34%), Positives = 94/186 (50%), Gaps = 19/186 (10%)
Query: 112 LKRTEAFTFPANINDTVA-------DEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCI 164
L+R +A A+ N + E+ + +AIG P + S ++DTGSD+ WTQCKPC
Sbjct: 70 LQRFKAMALVASSNSEIDAPVLPGNGEFLMKLAIGTPPETYSAIMDTGSDLIWTQCKPCT 129
Query: 165 HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGF 224
CF Q P F KS +F K+ C+S C E+ P C S C + Y D S + G
Sbjct: 130 QCFDQPTPIFDPKKSSSFSKLSCSSKLC----EALPQSTC-SDGCEYLYGYGDYSSTQGM 184
Query: 225 WATDRITIQEANSNGYFTRYPFLLGCINNSSGDK-SGASGIMGLDRSPVSIITRTNTSYF 283
A++ +T G + GC ++ G S SG++GL R P+S++++ F
Sbjct: 185 LASETLTF------GKVSVPEVAFGCGEDNEGSGFSQGSGLVGLGRGPLSLVSQLKEPKF 238
Query: 284 SYCLPS 289
SYCL S
Sbjct: 239 SYCLTS 244
Score = 43.5 bits (101), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 54/131 (41%), Gaps = 12/131 (9%)
Query: 292 GSTGYITFGKTDTVNTLRSAFHKRMKKYKKAKGL------EDLLDTCYDLSAYET-VVVP 344
GS G I T +SAF K++ L L+ C+ L + T + VP
Sbjct: 310 GSGGLIIDSGTTITYLEQSAFDLVAKEFTSQINLPVDNSGSTGLEVCFTLPSGSTDIEVP 369
Query: 345 KIAIHFLGGVDLELDVRGTLVV-ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVA 403
K+ HF G DLEL ++ AS+ CL + GN+QQ+ V +D+
Sbjct: 370 KLVFHF-DGADLELPAENYMIADASMGVACLAMGS---SSGMSIFGNIQQQNMLVLHDLE 425
Query: 404 GRRLGFGPGNC 414
L F P C
Sbjct: 426 KETLSFLPTQC 436
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 75/246 (30%), Positives = 111/246 (45%), Gaps = 40/246 (16%)
Query: 74 LNQGIST---HAPSLEEILRQDQQR--LHLKNSRRLRKPFPEFLKRTEAFTFPANINDTV 128
LN G S H S + L Q Q H+ N+ R +T P +
Sbjct: 24 LNNGFSVELIHRDSSKSPLYQPTQNKYQHIVNAARRSINRANHFYKTALTNTPQSTVIPD 83
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCN 188
EY + ++G P + + DTGSD+ W QC+PC C+ Q P F SKS T+ IPC+
Sbjct: 84 HGEYLMTYSVGTPPFKLYGIADTGSDIVWLQCEPCKECYNQTTPKFKPSKSSTYKNIPCS 143
Query: 189 STSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FL 247
S C+ SG G + D +T++ +S G+ +P +
Sbjct: 144 SDLCK-------------------------SGQQGNLSVDTLTLE--SSTGHPISFPKTV 176
Query: 248 LGC-INNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYC-LPSPYGS--TGYITFG 300
+GC +N+ + +SGI+GL P S+IT+ +S FSYC LP+P S T + FG
Sbjct: 177 IGCGTDNTVSFEGASSGIVGLGGGPASLITQLGSSIDAKFSYCLLPNPVESNTTSKLNFG 236
Query: 301 KTDTVN 306
T V+
Sbjct: 237 DTAVVS 242
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/123 (33%), Positives = 59/123 (47%), Gaps = 8/123 (6%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T TD N L SA + + K K+ L + CY +++ + P I HF G D+
Sbjct: 296 LTVIPTDVYNNLESAVLE-LVKLKRVNDPTRLFNLCYSVTS-DGYDFPIITTHF-KGADV 352
Query: 357 ELDVRGTLVVASVSQVCLGFAT----YPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPG 412
+L T V + VCL FAT P D SI GN+ Q+ V YD+ + + F P
Sbjct: 353 KLHPISTFVDVADGIVCLAFATTSAFIPSDVVSI-FGNLAQQNLLVGYDLQQKIVSFKPT 411
Query: 413 NCS 415
+CS
Sbjct: 412 DCS 414
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 71/214 (33%), Positives = 104/214 (48%), Gaps = 15/214 (7%)
Query: 100 NSRRLRKPFPEFLKRTEAFTFPANINDTVAD------EYYIVVAIGEPKQYVSLLLDTGS 153
+S+RLR + R FT N D EY + V+IG P + + DTGS
Sbjct: 52 SSQRLRNAIHRSVNRVFHFTEKDNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGS 111
Query: 154 DVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNI 213
D+ WTQC PC C+ Q DP F S T+ + C+S+ C L E+ + N C +++
Sbjct: 112 DLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCTAL-ENQASCSTNDNTCSYSL 170
Query: 214 QYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD-KSGASGIMGLDRSPV 272
Y D S + G A D +T+ +++ + ++GC +N++G SGI+GL PV
Sbjct: 171 SYGDNSYTKGNIAVDTLTLGSSDTRPMQLKN-IIIGCGHNNAGTFNKKGSGIVGLGGGPV 229
Query: 273 SIITRTNTSY---FSYC---LPSPYGSTGYITFG 300
S+I + S FSYC L S T I FG
Sbjct: 230 SLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFG 263
Score = 45.1 bits (105), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 53/120 (44%), Gaps = 9/120 (7%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T T+ + L A + KK + + L CY SA + VP I +HF G D+
Sbjct: 324 LTLLPTEFYSELEDAVASSIDAEKK-QDPQSGLSLCY--SATGDLKVPVITMHF-DGADV 379
Query: 357 ELDVRGTLVVASVSQVCLGFATYPPDPNSITL-GNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+LD V S VC F P S ++ GNV Q V YD + + F P +C+
Sbjct: 380 KLDSSNAFVQVSEDLVCFAFRGSP----SFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 68/190 (35%), Positives = 93/190 (48%), Gaps = 30/190 (15%)
Query: 126 DTVADE--YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
DTV D Y + + +G P + +DTGSD+ WTQC PC +C+ Q P F SKS TF
Sbjct: 53 DTVFDYSIYLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCPNCYTQFAPIFDPSKSSTF- 111
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
+E C+ CP+ I YAD S S G AT+ +TIQ + S F
Sbjct: 112 ------------KEK----RCHGNSCPYEIIYADESYSTGILATETVTIQ-STSGEPFVM 154
Query: 244 YPFLLGC-INNSS----GDKSGASGIMGLDRSPVSIITRTNT---SYFSYCLPSPYGSTG 295
+GC +NNS+ G + +SGI+GL+ P S+I++ + SYC S T
Sbjct: 155 AETSIGCGLNNSNLMTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLISYCFSSQ--GTS 212
Query: 296 YITFGKTDTV 305
I FG V
Sbjct: 213 KINFGTNAVV 222
Score = 42.0 bits (97), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 43/92 (46%), Gaps = 8/92 (8%)
Query: 326 EDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVS--QVCLGFATYPPDP 383
E+LL CY+ E + P I +HF GG DL LD + + V +++ CL P
Sbjct: 308 ENLL--CYNWDTME--IFPVITLHFAGGADLVLD-KYNMYVETITGGTFCLAIGCVDPSM 362
Query: 384 NSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+I GN V YD + + F P NCS
Sbjct: 363 PAI-FGNRAHNNLLVGYDSSTLVISFSPTNCS 393
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 87/296 (29%), Positives = 135/296 (45%), Gaps = 45/296 (15%)
Query: 94 QRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVAD-------------------EYYI 134
+R + + RL+K E K E + PA ++ AD EY+I
Sbjct: 139 ERKNQDTTSRLKKSNVERKKPMEEVSSPAESPESYADYFSGQLMATLESGVSLGSGEYFI 198
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
V IG P ++ SL+LDTGSD+ W QC PC CF+Q P++ S +F I CN C++
Sbjct: 199 DVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSISFRNITCNDPRCQL 258
Query: 195 LRESFPFGNC--NSKECPFNIQYADGSGSGGFWATDRITIQEANS---NGYFTRYP-FLL 248
+ P C ++ CP+ Y D S + G +A + T+ +S F R +
Sbjct: 259 VSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMF 318
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL---PSPYGSTGYITFGKT 302
GC + + G GA+G++GL R P+S ++ + Y FSYCL S + + FG+
Sbjct: 319 GCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRDSDTSVSSKLIFGED 378
Query: 303 DTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLEL 358
+ T H + G E+ +DT Y L +I F+GG L++
Sbjct: 379 KDLLT-----HPELNFTSLIAGKENPVDTFYYL---------QIKSIFVGGEKLQI 420
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 51/109 (46%), Gaps = 3/109 (2%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-V 366
++ AF +++K YK + +L CY++S + + P+ I F G V + +
Sbjct: 454 IKEAFLRKVKGYKLVEDFP-ILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRI 512
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ VCL P SI +GN QQ+ + YD RLG+ P C+
Sbjct: 513 QQLDIVCLAMLGTPKSALSI-IGNYQQQNFHILYDTKNSRLGYAPMRCA 560
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 71/214 (33%), Positives = 104/214 (48%), Gaps = 15/214 (7%)
Query: 100 NSRRLRKPFPEFLKRTEAFTFPANINDTVAD------EYYIVVAIGEPKQYVSLLLDTGS 153
+S+RLR + R FT N D EY + V+IG P + + DTGS
Sbjct: 52 SSQRLRNAIHRSVNRVFHFTEKDNTPQPQIDLTSNSGEYLMNVSIGTPPFPIMAIADTGS 111
Query: 154 DVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNI 213
D+ WTQC PC C+ Q DP F S T+ + C+S+ C L E+ + N C +++
Sbjct: 112 DLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCTAL-ENQASCSTNDNTCSYSL 170
Query: 214 QYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD-KSGASGIMGLDRSPV 272
Y D S + G A D +T+ +++ + ++GC +N++G SGI+GL PV
Sbjct: 171 SYGDNSYTKGNIAVDTLTLGSSDTRPMQLKN-IIIGCGHNNAGTFNKKGSGIVGLGGGPV 229
Query: 273 SIITRTNTSY---FSYC---LPSPYGSTGYITFG 300
S+I + S FSYC L S T I FG
Sbjct: 230 SLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFG 263
Score = 45.1 bits (105), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 53/120 (44%), Gaps = 9/120 (7%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T T+ + L A + KK + + L CY SA + VP I +HF G D+
Sbjct: 324 LTLLPTEFYSELEDAVASSIDAEKK-QDPQSGLSLCY--SATGDLKVPVITMHF-DGADV 379
Query: 357 ELDVRGTLVVASVSQVCLGFATYPPDPNSITL-GNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+LD V S VC F P S ++ GNV Q V YD + + F P +C+
Sbjct: 380 KLDSSNAFVQVSEDLVCFAFRGSP----SFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 60/172 (34%), Positives = 82/172 (47%), Gaps = 16/172 (9%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCK----PCIHCFQQ---RDPFFYASKSKTFF 183
+Y + +A G P Q V L+ DTGSD+ W QC P C ++ R P F ASKS T
Sbjct: 52 QYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLS 111
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECP----FNIQYADGSGSGGFWATDRITIQEANSNG 239
+PC++ C ++ G S P + YADGS + GF A D TI S G
Sbjct: 112 VVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSGG 171
Query: 240 YFTRYPFLLGC-INNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
R GC N G SG G++GL + +S ++ + + FSYCL
Sbjct: 172 AAVR-GVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCL 222
Score = 39.3 bits (90), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 45/92 (48%), Gaps = 7/92 (7%)
Query: 329 LDTCYDLSAYETVV-----VPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGF-ATYPPD 382
L+ CY++S+ + P++ I F G+ LEL LV + CL T P
Sbjct: 336 LELCYNVSSSSSSAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPTLSPF 395
Query: 383 PNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
++ LGN+ Q+G+ V +D A R+GF C
Sbjct: 396 AFNV-LGNLMQQGYHVEFDRASARIGFARTEC 426
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 65/171 (38%), Positives = 93/171 (54%), Gaps = 17/171 (9%)
Query: 129 ADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPC 187
A YY++ +IG P + ++DTGSD W QCKPC C Q P F SKS T+ I C
Sbjct: 86 AGSYYVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKPCLNQTSPIFNPSKSSTYKNIRC 145
Query: 188 NSTSCRILRESFPFGNCNS---KECPFNIQYADGSGSGGFWATDRITIQEANSN-GYFTR 243
+S C+ ++ C+S ++C + I Y D SGS G + D +T+ NSN G
Sbjct: 146 SSPICKRGEKT----RCSSNRKRKCEYEITYLDRSGSQGDISKDTLTL---NSNDGSPIS 198
Query: 244 YP-FLLGCINNSSGDKSG-ASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
+P ++GC + +S G ASGI+G R SI+++ +S FSYCL S
Sbjct: 199 FPKIVIGCGHKNSLTTEGLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLAS 249
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 32/104 (30%), Positives = 46/104 (44%), Gaps = 12/104 (11%)
Query: 316 MKKYKKAKGLEDLLDTCYD--LSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVC 373
M K K+ K L CY L YE VP I HF G D++L+ T + + +C
Sbjct: 342 MVKLKRVKDPTQQLSLCYKTTLKKYE---VPIITAHFRGA-DVKLNAFNTFIQMNHEVMC 397
Query: 374 LGF--ATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
F + +P + GN+ Q+ V YD + F P NC+
Sbjct: 398 FAFNSSAFP----WVVYGNIAQQNFLVGYDTLKNIISFKPTNCT 437
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 64/219 (29%), Positives = 97/219 (44%), Gaps = 18/219 (8%)
Query: 80 THAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIG 139
T A L L + + R+R+ +R + + +Y IG
Sbjct: 19 TRAAGLRLELTHVDAKQNCSTEERMRRATERTHRRLASMGEASAPVHWAESQYIAEYLIG 78
Query: 140 EPKQYVSLLLDTGSDVTWTQCKPC--IHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRE 197
+P Q ++DTGS++ WTQC C CF Q F+ S+S+T + CN T+C + E
Sbjct: 79 DPPQQAEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYDPSRSRTARPVACNDTACALGSE 138
Query: 198 SFPFGNC--NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSS 255
+ C ++K C Y G GG T+ T Q + N GCI +
Sbjct: 139 T----RCARDNKACAVLTAYGAGV-IGGVLGTEAFTFQPQSEN-----VSLAFGCIAATR 188
Query: 256 ---GDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPY 291
G GASGI+GL R +S++++ + FSYCL +PY
Sbjct: 189 LTPGSLDGASGIIGLGRGNLSLVSQLGDNKFSYCL-TPY 226
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 47/128 (36%), Positives = 76/128 (59%), Gaps = 8/128 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + ++G P V ++DTGSD+ W QC+PC C++Q P F SKSKT+ +PC+S
Sbjct: 90 EYLMRYSVGSPPFQVLGIVDTGSDILWLQCEPCEDCYKQTTPIFDPSKSKTYKTLPCSSN 149
Query: 191 SCRILRESFPFGNCNSKE-CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLL 248
+C LR + C+S C ++I Y DGS S G + + +T+ +++G +P ++
Sbjct: 150 TCESLRNT----ACSSDNVCEYSIDYGDGSHSDGDLSVETLTL--GSTDGSSVHFPKTVI 203
Query: 249 GCINNSSG 256
GC +N+ G
Sbjct: 204 GCGHNNGG 211
Score = 43.1 bits (100), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 29/98 (29%), Positives = 46/98 (46%), Gaps = 5/98 (5%)
Query: 318 KYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFA 377
K ++A+ LL CY ++ E + +P I HF G D+EL+ T V VC F
Sbjct: 345 KLERARDPSKLLSLCYKTTSDE-LDLPVITAHF-KGADVELNPISTFVPVEKGVVCFAFI 402
Query: 378 TYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ GN+ Q+ V YD+ + + F P +C+
Sbjct: 403 S---SKIGAIFGNLAQQNLLVGYDLVKKTVSFKPTDCT 437
>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 491
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 85/304 (27%), Positives = 130/304 (42%), Gaps = 38/304 (12%)
Query: 20 NGAYADDNDLSHSHIVSVSSLLPP-NVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGI 78
+G AD +V S LL P ++C+ + + + + YGPCS ++G
Sbjct: 28 HGGGADQERHQRYMVVQTSHLLEPKSICSGLKVT--PSANGTWVPLHRPYGPCSP-SEGT 84
Query: 79 STHAPSLEEILRQDQQRLHLKNSRR-------LRKPFPEFLKRTEAFTFPANINDTVADE 131
PSL E+LR DQ R + L P F
Sbjct: 85 P---PSLVEMLRWDQARTDYVRRKATGEVDDVLEPDRPHVDMMQMDFMLRGTFGIGSGSG 141
Query: 132 YYIVVAIGEPKQYV----SLLLDTGSDVTWTQCKPCI--HCFQQRDPFFYASKSKTFFKI 185
Y V+ + + ++ +DT DV W QC PC+ C+ QR+ FF +S T +
Sbjct: 142 YGAVIDGDDDDDPMILSQTMAIDTTEDVPWIQCLPCLIPQCYPQRNAFFDPRRSSTGAPV 201
Query: 186 PCNSTSCRILRESFPFGNCNSK-----ECPFNIQYADGSGSGGFWATDRITIQEANSNGY 240
C S +CR L + N SK +C + I+Y+D + G + TD +TI +
Sbjct: 202 RCGSRACRTLGG---YANGCSKPNSTGDCLYRIEYSDHRLTLGTYMTDTLTISPST---- 254
Query: 241 FTRYPFLLGCINNSSGDKSG-ASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGY 296
T F GC + G S ASG M L P S++++T +Y FSYC+P P + G+
Sbjct: 255 -TFLNFRFGCSHAVRGKFSAQASGTMSLGGGPQSLLSQTARAYGNAFSYCVPGP-SAAGF 312
Query: 297 ITFG 300
++ G
Sbjct: 313 LSIG 316
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/122 (37%), Positives = 59/122 (48%), Gaps = 6/122 (4%)
Query: 293 STGYITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLG 352
S+ IT LR AF M+ YK + LDTC+D V VP +++ F G
Sbjct: 376 SSAVITQLPPTAYRALRLAFRNAMRAYKT-RAPTGNLDTCFDFVGVSKVTVPTVSLVFDG 434
Query: 353 GVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPG 412
G +EL + L+ CL FA D +GNVQQ+ HEV YDVAG +GF G
Sbjct: 435 GAVIELGLLSVLL-----DSCLAFAPMAADFALGFIGNVQQQTHEVLYDVAGGAVGFRHG 489
Query: 413 NC 414
C
Sbjct: 490 AC 491
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 40/104 (38%), Positives = 59/104 (56%), Gaps = 2/104 (1%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + +G P +YV ++LDTGSD+ W QC PC C+ Q DP F KS++F I C S
Sbjct: 125 EYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSFASIACRSP 184
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQE 234
C L P N + C + + Y DGS + G ++T+ +T +
Sbjct: 185 LCHRLDS--PGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRR 226
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 42/109 (38%), Positives = 55/109 (50%), Gaps = 5/109 (4%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-V 366
R AF K+A L DTC+DLS V VP + +HF G D+ L L+ V
Sbjct: 365 FRDAFRAGASNLKRAPQFS-LFDTCFDLSGKTEVKVPTVVLHFRG-ADVSLPASNYLIPV 422
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ CL FA + I GN+QQ+G V YD+AG R+GF P C+
Sbjct: 423 DTSGNFCLAFAGTMGGLSII--GNIQQQGFRVVYDLAGSRVGFAPHGCA 469
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 63/111 (56%), Gaps = 2/111 (1%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ V +G P ++ SL+LDTGSD+ W QC PCI CF+Q P++ S +F I C+
Sbjct: 196 EYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDP 255
Query: 191 SCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNG 239
C+++ P C + + CP+ Y DGS + G +A + T+ NG
Sbjct: 256 RCQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNG 306
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 33/119 (27%), Positives = 53/119 (44%), Gaps = 2/119 (1%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T+ ++ AF +++K Y+ +GL L CY++S E + +P I F
Sbjct: 443 LTYFAEPAYEIIKEAFVRKIKGYQLVEGLPPL-KPCYNVSGIEKMELPDFGILFADEAVW 501
Query: 357 ELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
V + VCL P SI +GN QQ+ + YD+ RLG+ P C+
Sbjct: 502 NFPVENYFIWIDPEVVCLAILGNPRSALSI-IGNYQQQNFHILYDMKKSRLGYAPMKCA 559
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 53/161 (32%), Positives = 80/161 (49%), Gaps = 22/161 (13%)
Query: 85 LEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVAD-------------- 130
LEE LR++ R+ R RK LK+ A ++ N+ A+
Sbjct: 97 LEEKLRREAARVRALEQRIERK---LKLKKDPAGSY-ENVAGVTAEFGSEVVSGMEQGSG 152
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + IG P + ++LDTGSDV W QC+PC C+ Q DP F S S +F + C+S
Sbjct: 153 EYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCDSA 212
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRIT 231
C L + +C+ C + + Y DGS + G +AT+ +T
Sbjct: 213 VCSQLDAN----DCHGGGCLYEVSYGDGSYTVGSYATETLT 249
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 41/119 (34%), Positives = 61/119 (51%), Gaps = 4/119 (3%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T +T + LR AF + +A G+ + DTCYDLSA ++V +P + HF G
Sbjct: 380 VTRLQTSAYDALRDAFIAGTQHLPRADGIS-IFDTCYDLSALQSVSIPAVGFHFSNGAGF 438
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L + L+ + S+ C FA P D N +GN+QQ+G V +D A +GF C
Sbjct: 439 ILPAKNCLIPMDSMGTFCFAFA--PADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 495
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 57/184 (30%), Positives = 89/184 (48%), Gaps = 14/184 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EYY + +G P Q L++DTGS++TW QC PC C D + A++S ++ + CN++
Sbjct: 99 EYYTSIKLGSPGQEAILIVDTGSELTWLQCLPCKVCAPSVDTIYDAARSASYRPVTCNNS 158
Query: 191 SCRILRESFPFGNC-NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
+ C +C F Y DGS S G +TD + ++ T F G
Sbjct: 159 QLCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAFG 218
Query: 250 CINNSSGD----KSGASGIMGLDRSPVSIITRTNTSY---FSYCLP---SPYGSTGYITF 299
C + GD +GASGI+GL+ +++ + + FS+C P S STG + F
Sbjct: 219 C---AQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSHLNSTGVVFF 275
Query: 300 GKTD 303
G +
Sbjct: 276 GNAE 279
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 55/118 (46%), Gaps = 13/118 (11%)
Query: 308 LRSAFHKRMK---KYKKAKGLEDLLDTCYDLSAYET----VVVPKIAIHFLGGVDLELDV 360
LR AF K K+ + DL TC+ +S + +P +++ F GV + +
Sbjct: 344 LREAFLKHRPPSLKHLEGDSFGDL-GTCFKVSNDDIDELHRTLPSLSLVFEDGVTIGIPS 402
Query: 361 RGTLVVASVSQ----VCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
G L+ + Q +C F P+P ++ +GN QQ+ V YD+ R+GF +C
Sbjct: 403 IGVLLPVARFQNHVKMCFAFEDGGPNPVNV-IGNYQQQNLWVEYDIQRSRVGFARASC 459
>gi|359476206|ref|XP_002262837.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 462
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 80/276 (28%), Positives = 134/276 (48%), Gaps = 40/276 (14%)
Query: 33 HIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQD 92
H + ++SLLP + C + P G L + YGPCS+L Q +PS ++I QD
Sbjct: 40 HTLDINSLLPKSNC-----SAPVGGGSQGLPITYSYGPCSQLGQ---KKSPSRQQIFLQD 91
Query: 93 QQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIV-VAIGEPKQYVSLLLDT 151
+ R+ N+R L + E K + P +++ D +++V V G+P+Q ++L++DT
Sbjct: 92 RSRVRSINARILGQYSTEESKDGGS---PESMHSLNEDGFFLVNVGFGKPQQNLNLIIDT 148
Query: 152 GSDVTWTQCKPCI--HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKEC 209
GSD TW +C C +C ++ P F S S ++ C S +
Sbjct: 149 GSDTTWIRCNSCSLGNCHNKKIPTFNPSLSSSYSNRSC----------------IPSTKT 192
Query: 210 PFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDR 269
+ + Y D S S G + D +T++ F + F GC ++ GD ASG++GL +
Sbjct: 193 NYTMNYEDNSYSKGVFVCDEVTLKP----DVFPK--FQFGCGDSGGGDFGSASGVLGLAQ 246
Query: 270 SP-VSIITRTNTSY---FSYCLPSPYGSTGYITFGK 301
S+I++T + + FSYC P + G + FG+
Sbjct: 247 GEQYSLISQTASKFKKKFSYCFPHNENTRGSLLFGE 282
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 43/119 (36%), Positives = 58/119 (48%), Gaps = 5/119 (4%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKG--LEDLLDTCYDLSAY--ETVVVPKIAIHFLG 352
IT T LR+AF + M E LDTCY+L + +P+I +HF+G
Sbjct: 339 ITHLPTAAYEALRTAFQQEMLHCPSVSPPPQEKPLDTCYNLKGCGGRNIKLPEIVLHFVG 398
Query: 353 GVDLELDVRGTLVV-ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFG 410
VD+ L G L ++Q CL FA + +GN QQ +V YD+ G RLGFG
Sbjct: 399 EVDVSLHPSGILWANGDLTQACLAFARKSHPSHVTIIGNRQQVSLKVVYDIEGGRLGFG 457
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 90/268 (33%), Positives = 123/268 (45%), Gaps = 37/268 (13%)
Query: 61 SLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRT-EAFT 119
S ++ Y CS T + E +R D RL FLKRT +
Sbjct: 53 SFPLIHIYSECSPFRPPNRTWESLMSEKIRGDANRLR-------------FLKRTSRSSK 99
Query: 120 FPANINDTV---ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYA 176
AN N V + EY I V G PKQ + L+DTGSDV W CK C C P F
Sbjct: 100 QDANANVPVRSGSGEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGC-HSTAPIFDP 158
Query: 177 SKSKTFFKIPCNSTSCRILRESFPFGNC--NSKECPFNIQYADGSGSGGFWATDRITIQE 234
+KS ++ C+S C+ + GNC NSK C F + Y DG+ G A+D IT+
Sbjct: 159 AKSSSYKPFACDSQPCQEIS-----GNCGGNSK-CQFEVSYGDGTQVDGTLASDAITL-- 210
Query: 235 ANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTS-----YFSYCLPS 289
+ Y + F GC + S D S + G+MGL +S++T+ T+ FSYCLPS
Sbjct: 211 --GSQYLPNFSF--GCAESLSEDTSPSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPS 266
Query: 290 PYGSTGYITFGKTDTVNTLRSAFHKRMK 317
S+G + GK V++ F +K
Sbjct: 267 SSTSSGSLVLGKEAAVSSSSLKFTTLIK 294
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 39/108 (36%), Positives = 58/108 (53%), Gaps = 6/108 (5%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
LR AF +++ + +ED+ DTCYDLS+ +V VP I +H VDL L L+
Sbjct: 348 LRDAFRQQLSSLQPTP-VEDM-DTCYDLSS-SSVDVPTITLHLDRNVDLVLPKENILITQ 404
Query: 368 SVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
CL F++ D SI +GNVQQ+ + +DV ++GF C+
Sbjct: 405 ESGLACLAFSST--DSRSI-IGNVQQQNWRIVFDVPNSQVGFAQEQCA 449
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 60/214 (28%), Positives = 95/214 (44%), Gaps = 16/214 (7%)
Query: 86 EEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVAD--EYYIVVAIGEPKQ 143
E +R +R +++R R+ T A + + EY + ++IG P
Sbjct: 39 SEFVRGALRRDMHRHARFAREQLAPSSAAAAGLTVGAPTQKDLRNGGEYIMTLSIGTPPL 98
Query: 144 YVSLLLDTGSDVTWTQCKPCI--------HCFQQRDPFFYASKSKTFFKIPCNS--TSCR 193
+ DTGSD+ WTQC PC CF+Q + S S TF +PCNS + C
Sbjct: 99 SYRAIADTGSDLIWTQCAPCGDTVTDTDNQCFKQSGCLYNPSSSTTFGVLPCNSPLSMCA 158
Query: 194 ILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINN 253
+ P C C +N Y G + G + + T +++ GC N
Sbjct: 159 AMAGPSPPPGC---ACMYNQTYGTG-WTAGVQSVETFTFGSSSTPPAVRVPNIAFGCSNA 214
Query: 254 SSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
SS D +G++G++GL R +S++++ FSYCL
Sbjct: 215 SSNDWNGSAGLVGLGRGSMSLVSQLGAGAFSYCL 248
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 55/166 (33%), Positives = 87/166 (52%), Gaps = 16/166 (9%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
+Y + ++IG P + DTGSD+ W QC PC +C++Q +P F + S TF I C S
Sbjct: 58 DYLMELSIGTPPVKIYAQADTGSDLIWLQCIPCTNCYKQLNPMFDSQSSSTFSNIACGSE 117
Query: 191 SCRILRESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
SC L + +C+ + C +N Y DGS + G A + +T+ + +
Sbjct: 118 SCSKLYST----SCSPDQINCKYNYSYVDGSETQGVLAQETLTLTSTTGEPVAFK-GVIF 172
Query: 249 GCINNSSG---DKSGASGIMGLDRSPVSIITRTNTS----YFSYCL 287
GC +N++G DK GI+GL R P+S++++ +S FS CL
Sbjct: 173 GCGHNNNGAFNDKE--MGIIGLGRGPLSLVSQIGSSLGGNMFSQCL 216
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 87/296 (29%), Positives = 135/296 (45%), Gaps = 45/296 (15%)
Query: 94 QRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVAD-------------------EYYI 134
+R + + RL+K E K E + PA ++ AD EY+I
Sbjct: 139 ERKNQDTTSRLKKSNVERKKPMEEVSSPAESPESYADYFSGQLMATLESGVSLGSGEYFI 198
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
V IG P ++ SL+LDTGSD+ W QC PC CF+Q P++ S +F I CN C++
Sbjct: 199 DVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSISFRNITCNDPRCQL 258
Query: 195 LRESFPFGNC--NSKECPFNIQYADGSGSGGFWATDRITIQEANS---NGYFTRYP-FLL 248
+ P C ++ CP+ Y D S + G +A + T+ +S F R +
Sbjct: 259 VSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMF 318
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL---PSPYGSTGYITFGKT 302
GC + + G GA+G++GL R P+S ++ + Y FSYCL S + + FG+
Sbjct: 319 GCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRDSDTSVSSKLIFGED 378
Query: 303 DTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLEL 358
+ T H + G E+ +DT Y L +I F+GG L++
Sbjct: 379 KDLLT-----HPELNFTSLIAGKENPVDTFYYL---------QIKSIFVGGEKLQI 420
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 51/109 (46%), Gaps = 3/109 (2%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-V 366
++ AF +++K YK + +L CY++S + + P+ I F G V + +
Sbjct: 454 IKEAFLRKVKGYKLVEDFP-ILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRI 512
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ VCL P SI +GN QQ+ + YD RLG+ P C+
Sbjct: 513 QQLDIVCLAMLGTPKSALSI-IGNYQQQNFHILYDTKNSRLGYAPMRCA 560
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 60/172 (34%), Positives = 84/172 (48%), Gaps = 16/172 (9%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCK----PCIHCFQQ---RDPFFYASKSKTFF 183
+Y + +A G P Q V L+ DTGSD+ W QC P C ++ R P F ASKS T
Sbjct: 53 QYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLS 112
Query: 184 KIPCNSTSCRILRESFPFG-NCNSKE---CPFNIQYADGSGSGGFWATDRITIQEANSNG 239
+PC++ C ++ G +C+ C + YADGS + GF A D TI S G
Sbjct: 113 VVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSGG 172
Query: 240 YFTRYPFLLGC-INNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
R GC N G SG G++GL + +S ++ + + FSYCL
Sbjct: 173 AAVR-GVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCL 223
Score = 42.7 bits (99), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 29/92 (31%), Positives = 46/92 (50%), Gaps = 7/92 (7%)
Query: 329 LDTCYDLSAYETVV-----VPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGF-ATYPPD 382
L+ CY++S+ ++ P++ I F G+ LEL LV + CL T P
Sbjct: 337 LELCYNVSSSSSLAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPTLSPF 396
Query: 383 PNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
++ LGN+ Q+G+ V +D A R+GF C
Sbjct: 397 AFNV-LGNLMQQGYHVEFDRASARIGFARTEC 427
>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
Length = 424
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 69/240 (28%), Positives = 104/240 (43%), Gaps = 22/240 (9%)
Query: 103 RLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQC-K 161
R RK F + + FP + N Y + + IG+P + L LDTGSD+TW QC
Sbjct: 28 RWRKAADRFTRAASSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDA 87
Query: 162 PCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGS 221
PC+HC + P + S IPCN C+ L + ++C + ++YADG S
Sbjct: 88 PCVHCLEAPHPLYQPSND----LIPCNDPLCKALHFNGNHRCETPEQCDYEVEYADGGSS 143
Query: 222 GGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSG---ASGIMGLDRSPVSIITRT 278
G D ++ R LGC + SG G++GL R VSI+++
Sbjct: 144 LGVLVRDVFSLNYTKGLRLTPR--LALGCGYDQIPGASGHHPLDGVLGLGRGKVSILSQL 201
Query: 279 NTSYF-----SYCLPSPYGSTGYITFGKTDTVNTLRSAF----HKRMKKYKKAKGLEDLL 329
++ + +CL S G I F D ++ R ++ + K Y A G E L
Sbjct: 202 HSQGYVKNVVGHCLSSLGGG---ILFFGNDLYDSSRVSWTPMARENSKHYSPAMGGELLF 258
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 58/161 (36%), Positives = 92/161 (57%), Gaps = 14/161 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIH---CFQQRDPFFYASKSKTFFKIPC 187
EY + +G+P + L+ DTGSDVTW QC+PC C++Q DP F S ++ + C
Sbjct: 147 EYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSC 206
Query: 188 NSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
NS C++L ++ NCNS C + + Y DGS + G AT+ ++ +NS P L
Sbjct: 207 NSQQCKLLDKA----NCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNS------IPNL 256
Query: 248 -LGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+GC +++ G +G +G++GL +S+ ++ S FSYCL
Sbjct: 257 PIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCL 297
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 50/114 (43%), Gaps = 4/114 (3%)
Query: 302 TDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVR 361
+D +LR AF K A G+ + DTCY+ S V VP IA G L L R
Sbjct: 377 SDVYESLREAFVKLTSSLSPAPGIS-VFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPAR 435
Query: 362 GTLVVASVSQV-CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L++ + CL F + I G+ QQ+G V YD+ +GF C
Sbjct: 436 NYLIMLDTAGTYCLAFIKTKSSLSII--GSFQQQGIRVSYDLTNSLVGFSTNKC 487
>gi|125596976|gb|EAZ36756.1| hypothetical protein OsJ_21092 [Oryza sativa Japonica Group]
Length = 435
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 100/356 (28%), Positives = 142/356 (39%), Gaps = 75/356 (21%)
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIP-- 186
A EY ++ G P Q + DT V+ +CKPC+ DP F S+S +F IP
Sbjct: 85 ALEYRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGG-APCDPAFEPSRSSSFAAIPCG 143
Query: 187 -------CNSTSCRILRESFPFGNC---------NSKECPFNIQYADGS----------- 219
C SC + FGN ++ P + +A +
Sbjct: 144 SPECAVECTGASCPFTIQ---FGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEVGADAD 200
Query: 220 ---GSGGFWATDRITIQEAN---SNGYFTRYPFLLGCINNSSGDKSGASGIMGLDR---- 269
G+ G R + A+ SNG T C+ +SS S +G R
Sbjct: 201 TFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLSIGASRPEYS 260
Query: 270 ------SPVSIITRTNTSYFSYCL------------PSPYGSTGYI-------TFGKTDT 304
+P+S SYF + P+ + + G + TF
Sbjct: 261 GGDIKYAPMSSNPNHPNSYFVDLVGISVGGEDLPVPPAVFAAHGTLLEAATEFTFLAPAA 320
Query: 305 VNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL 364
LR AF K M Y A +LDTCY+L+ ++ VP +A+ F GG +LELDVR +
Sbjct: 321 YAALRDAFRKDMAPYPAAPPFR-VLDTCYNLTGLASLAVPAVALRFAGGTELELDVRQMM 379
Query: 365 VVASVSQV-----CLGFATYPPDPNSIT-LGNVQQRGHEVHYDVAGRRLGFGPGNC 414
A S V CL FA P ++ +G + QR EV YD+ G R+GF PG C
Sbjct: 380 YFADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRGGRVGFIPGRC 435
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 70/240 (29%), Positives = 107/240 (44%), Gaps = 42/240 (17%)
Query: 109 PEFLKRTEAFTFPANINDTVAD-------------EYYIVVAIGEPKQYVSLLLDTGSDV 155
PE ++R A + N+ T A+ +Y +G+P Q L+DTGS +
Sbjct: 50 PERVRRAIALSRQINLASTRAEGGGVSAPVHWATRQYIAEYMVGDPPQRAEALIDTGSSL 109
Query: 156 TWTQCKPCIH--CFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCN-SKECPFN 212
WTQC C+ C +Q P+F AS S +F +PC +C F C C F
Sbjct: 110 IWTQCTACLRKVCVRQDLPYFNASSSGSFAPVPCQDKACAGNYLHF----CALDGTCTFR 165
Query: 213 IQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCIN----NSSGDKSGASGIMGLD 268
+ Y G G GF TD T Q + F GC++ + GASG++GL
Sbjct: 166 VTYGAG-GIIGFLGTDAFTFQSGGATLAF-------GCVSFTRFAAPDVLHGASGLIGLG 217
Query: 269 RSPVSIITRTNTSYFSYCLPSPY----GSTGYITFGKTDTVN-----TLRSAFHKRMKKY 319
R +S+ ++T FSYCL +PY G++ ++ G +++ + AF + K Y
Sbjct: 218 RGRLSLASQTGAKRFSYCL-TPYFHNNGASSHLFVGAAASLSGGGGAVMSMAFVESPKDY 276
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 35/74 (47%), Gaps = 3/74 (4%)
Query: 342 VVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYD 401
VVP + +HF GG D+ L S C+ A SI +GN QQ+ + +D
Sbjct: 371 VVPTLVLHFSGGADMALPPENYWAPLEKSTACM--AIVRGYLQSI-IGNFQQQNMHILFD 427
Query: 402 VAGRRLGFGPGNCS 415
V G RL F +CS
Sbjct: 428 VGGGRLSFQNADCS 441
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 58/161 (36%), Positives = 92/161 (57%), Gaps = 14/161 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIH---CFQQRDPFFYASKSKTFFKIPC 187
EY + +G+P + L+ DTGSDVTW QC+PC C++Q DP F S ++ + C
Sbjct: 147 EYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSC 206
Query: 188 NSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
NS C++L ++ NCNS C + + Y DGS + G AT+ ++ +NS P L
Sbjct: 207 NSQQCKLLDKA----NCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNS------IPNL 256
Query: 248 -LGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+GC +++ G +G +G++GL +S+ ++ S FSYCL
Sbjct: 257 PIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCL 297
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 37/114 (32%), Positives = 50/114 (43%), Gaps = 4/114 (3%)
Query: 302 TDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVR 361
+D +LR AF K A G+ + DTCY+ S V VP IA G L L R
Sbjct: 377 SDVYESLREAFVKLTSSLSPAPGIS-VFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPAR 435
Query: 362 GTLVVASVSQV-CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L++ + CL F + I G+ QQ+G V YD+ +GF C
Sbjct: 436 NYLIMLDTAGTYCLAFIKTKSSLSII--GSFQQQGIRVSYDLTNSIVGFSTNKC 487
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 54/156 (34%), Positives = 85/156 (54%), Gaps = 10/156 (6%)
Query: 136 VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRIL 195
++IG P LL+DTGSD+TW C PC C+ Q PFF+ S+S T+ + SC
Sbjct: 82 ISIGNPPVPQLLLIDTGSDLTWIHCLPC-KCYPQTIPFFHPSRSSTY-----RNASCVSA 135
Query: 196 RESFP--FGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINN 253
+ P F + + C ++++Y D S + G A +++T E + +G ++ + GC +
Sbjct: 136 PHAMPQIFRDEKTGNCQYHLRYRDFSNTRGILAEEKLTF-ETSDDGLISKQNIVFGCGQD 194
Query: 254 SSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
+SG + SG++GL SI+TR S FSYC S
Sbjct: 195 NSG-FTKYSGVLGLGPGTFSIVTRNFGSKFSYCFGS 229
>gi|296082173|emb|CBI21178.3| unnamed protein product [Vitis vinifera]
Length = 372
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 57/153 (37%), Positives = 77/153 (50%), Gaps = 19/153 (12%)
Query: 33 HIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQD 92
H VSSLLP N C+ + QG L + KYGPCS + PS +EI +D
Sbjct: 74 HSTPVSSLLPKNKCSASARGGSQG-----LPITQKYGPCSGSGH---SQPPSPQEIFGRD 125
Query: 93 QQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADE---YYIVVAIGEPKQYVSLLL 149
+ R+ NS+ + PE LK N+ + DE + + VA G P Q +L+L
Sbjct: 126 ESRVSFINSK-FNQYAPENLKDHTP-------NNKLFDEDGNFLVDVAFGTPPQKFTLIL 177
Query: 150 DTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTF 182
DTGS +TWTQCKPC+ C + F S S T+
Sbjct: 178 DTGSSITWTQCKPCVRCLKASRRHFDPSASLTY 210
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 56/163 (34%), Positives = 81/163 (49%), Gaps = 11/163 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
Y + V+IG P + + DTGSD+TWT C PC C++QR+P F KS ++ I C+S
Sbjct: 24 HYLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRNPIFDPQKSTSYRNISCDSK 83
Query: 191 SCRILRESFPFGNCN-SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
C L G C+ K C + YA + + G A + IT+ + + G
Sbjct: 84 LCHKLDT----GVCSPQKHCNYTYAYASAAITQGVLAQETITLSSTKGESVPLK-GIVFG 138
Query: 250 C-INNSSGDKSGASGIMGLDRSPVSIITRTNTSY----FSYCL 287
C NN+ G GI+GL PVS I++ +S+ FS CL
Sbjct: 139 CGHNNTGGFNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCL 181
>gi|54290724|dbj|BAD62394.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 523
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 100/356 (28%), Positives = 142/356 (39%), Gaps = 75/356 (21%)
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIP-- 186
A EY ++ G P Q + DT V+ +CKPC+ DP F S+S +F IP
Sbjct: 173 ALEYRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGG-APCDPAFEPSRSSSFAAIPCG 231
Query: 187 -------CNSTSCRILRESFPFGNC---------NSKECPFNIQYADGS----------- 219
C SC + FGN ++ P + +A +
Sbjct: 232 SPECAVECTGASCPF---TIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEVGADAD 288
Query: 220 ---GSGGFWATDRITIQEAN---SNGYFTRYPFLLGCINNSSGDKSGASGIMGLDR---- 269
G+ G R + A+ SNG T C+ +SS S +G R
Sbjct: 289 TFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLSIGASRPEYS 348
Query: 270 ------SPVSIITRTNTSYFSYCL------------PSPYGSTGYI-------TFGKTDT 304
+P+S SYF + P+ + + G + TF
Sbjct: 349 GGDIKYAPMSSNPNHPNSYFVDLVGISVGGEDLPVPPAVFAAHGTLLEAATEFTFLAPAA 408
Query: 305 VNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL 364
LR AF K M Y A +LDTCY+L+ ++ VP +A+ F GG +LELDVR +
Sbjct: 409 YAALRDAFRKDMAPYPAAPPFR-VLDTCYNLTGLASLAVPAVALRFAGGTELELDVRQMM 467
Query: 365 VVASVSQV-----CLGFATYPPDPNSIT-LGNVQQRGHEVHYDVAGRRLGFGPGNC 414
A S V CL FA P ++ +G + QR EV YD+ G R+GF PG C
Sbjct: 468 YFADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRGGRVGFIPGRC 523
>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 373
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 56/183 (30%), Positives = 102/183 (55%), Gaps = 19/183 (10%)
Query: 130 DEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCI-HCF---QQRDPFFYASKSKTFFKI 185
+++++ +++G P + + +DTGS ++W QC+ CI HC+ Q+ P F S S T+ ++
Sbjct: 21 NQFFMGISLGTPAVFNLVTIDTGSTISWVQCQYCIVHCYTQDQRAGPTFNTSSSSTYRRV 80
Query: 186 PCNSTSCRILR--ESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
C++ C + ++ P G C +E C ++++YA G S G+ + DR+T+ ANS +
Sbjct: 81 GCSAQVCHDMHVSQNIPSG-CVEEEDSCIYSLRYASGEYSAGYLSQDRLTL--ANS---Y 134
Query: 242 TRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITR----TNTSYFSYCLPSPYGSTGYI 297
+ F+ GC +++ + A GI+G S + TN S FSYC PS + G++
Sbjct: 135 SIQKFIFGCGSDNRYNGHSA-GIIGFGNKSYSFFNQIAQLTNYSAFSYCFPSNQENEGFL 193
Query: 298 TFG 300
+ G
Sbjct: 194 SIG 196
>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
Length = 334
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 83/320 (25%), Positives = 141/320 (44%), Gaps = 46/320 (14%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + ++IG P V + DTGSD+ WTQC PC+ C++Q++P F SKS +F ++ C S
Sbjct: 23 EYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCESQ 82
Query: 191 SCRILRE-----SFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
CR+L + FG ++ FN G+GG + +T Q ++ G ++
Sbjct: 83 QCRLLDTPTSILNIVFGCGHNNSGTFNENEMGLFGTGGRPLS--LTSQIMSTLGSGRKFS 140
Query: 246 FLLGCINNSSGDKSGASGIM--------GLDRSPVSIITRTNTSY--------------F 283
C+ D S S I+ G D ++T+ + +Y F
Sbjct: 141 ---QCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLF 197
Query: 284 SYCLPSPYGSTGYITFGKTDTVNTL--RSAFHKRMKKYKKAKGLEDLLDT------CYDL 335
+ SP + G + F T TL R +++ ++ K+A +E + D CY
Sbjct: 198 PFSSSSPMATKGNV-FIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCY-- 254
Query: 336 SAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRG 395
+ + P + HF G D++L T + C FA P D ++ GN Q
Sbjct: 255 RSATLIDGPILTAHF-DGADVQLKPLNTFISPKEGVYC--FAMQPIDGDTGIFGNFVQMN 311
Query: 396 HEVHYDVAGRRLGFGPGNCS 415
+ +D+ G+++ F +C+
Sbjct: 312 FLIGFDLDGKKVSFKAVDCT 331
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/213 (32%), Positives = 101/213 (47%), Gaps = 26/213 (12%)
Query: 86 EEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYV 145
+E +R+ +R H RRL + P + N+T +Y IG+P Q
Sbjct: 49 KERMRRATERTH----RRLAS----MAGGGGEASAPIHWNET---QYIAEYLIGDPPQQA 97
Query: 146 SLLLDTGSDVTWTQCKPCIH--CFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGN 203
+ ++DTGS++ WTQC C CF Q F+ S+S+T + CN T+C + E+
Sbjct: 98 AAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSRSRTAKPVACNDTACLLGSET----R 153
Query: 204 C--NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSS---GDK 258
C + K C Y G+ GGF T+ T S+ F GCI S G
Sbjct: 154 CARDGKACAVLTAYGAGA-IGGFLGTEVFTFGHGQSSENNVSLAF--GCITASRLTPGSL 210
Query: 259 SGASGIMGLDRSPVSIITRTNTSYFSYCLPSPY 291
GASGI+GL R +S+ ++ + FSYCL +PY
Sbjct: 211 DGASGIIGLGRGKLSLPSQLGDNKFSYCL-TPY 242
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 92/407 (22%), Positives = 157/407 (38%), Gaps = 88/407 (21%)
Query: 89 LRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLL 148
L Q + R L++ R L+ F+ + YY V +G P ++
Sbjct: 40 LSQLRARDELRHRRMLQSS-----SGVVDFSVQGTFDPFQVGLYYTKVQLGTPPVEFNVQ 94
Query: 149 LDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGN 203
+DTGSDV W C C C Q + FF S T I C+ C ++S
Sbjct: 95 IDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCNNGKQSSD-AT 153
Query: 204 CNSK--ECPFNIQYADGSGSGGFWATDRI---TIQEANSNGYFTRYPFLLGCINNSSGDK 258
C+S+ +C + QY DGSG+ G++ +D + TI E + T P + GC N +GD
Sbjct: 154 CSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSMTTNSTA-PVVFGCSNQQTGDL 212
Query: 259 S----GASGIMGLDRSPVSIITRTNTS-----YFSYCLPSPYGSTGYITFGKTDTVNTLR 309
+ GI G + +S+I++ ++ FS+CL G + G+ N +
Sbjct: 213 TKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCLKGDSSGGGILVLGEIVEPNIVY 272
Query: 310 SAF-------------------------------HKRMKKYKKAKGLEDLLDTCYD---- 334
++ + R L L + YD
Sbjct: 273 TSLVPAQPHYNLNLQSISVNGQTLQIDSSVFATSNSRGTIVDSGTTLAYLAEEAYDPFVS 332
Query: 335 ----------------------LSAYETVVVPKIAIHFLGGVDLELDVRGTLV----VAS 368
+++ T V P+++++F GG + L + L+ +
Sbjct: 333 AITAAIPQSVRTVVSRGNQCYLITSSVTDVFPQVSLNFAGGASMILRPQDYLIQQNSIGG 392
Query: 369 VSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ C+GF +I LG++ + V YD+AG+R+G+ +CS
Sbjct: 393 AAVWCIGFQKIQGQGITI-LGDLVLKDKIVVYDLAGQRIGWANYDCS 438
>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
Length = 446
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/186 (31%), Positives = 92/186 (49%), Gaps = 21/186 (11%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIH--CFQQRDPFFYASKSKTFFKIPCN 188
+Y IG+P Q L+DTGSD+ WTQC C+ C +Q P++ +S S TF +PC
Sbjct: 89 QYVAEYLIGDPPQRAEALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPVPCA 148
Query: 189 STSCRILRESFPFGNCN-SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
+ C + F C+ + C Y G +G T+ Q + F
Sbjct: 149 ARICAANDDIIHF--CDLAAGCSVIAGYGAGVVAGTL-GTEAFAFQSGTAELAF------ 199
Query: 248 LGCINNS---SGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPY----GSTGYITFG 300
GC+ + G GASG++GL R +S++++T + FSYCL +PY G+TG++ G
Sbjct: 200 -GCVTFTRIVQGALHGASGLIGLGRGRLSLVSQTGATKFSYCL-TPYFHNNGATGHLFVG 257
Query: 301 KTDTVN 306
+ ++
Sbjct: 258 ASASLG 263
>gi|296084856|emb|CBI28265.3| unnamed protein product [Vitis vinifera]
Length = 446
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 84/314 (26%), Positives = 137/314 (43%), Gaps = 47/314 (14%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y I ++ G P Q + L++DTGSD+ W PC H + R+ F S + IP +S+S
Sbjct: 90 YSIPLSFGTPPQTLPLIMDTGSDLVWF---PCTHRYVCRNCSFSTSNPSSNIFIPKSSSS 146
Query: 192 CRIL-------------------RESFPFG-NCNSKECPFNIQYADGSGSGGFWATDRIT 231
++L R+ P NC P+ + Y G +GG ++ +
Sbjct: 147 SKVLGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSGI-TGGIMLSETLD 205
Query: 232 IQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPY 291
+ F++GC S S +GI G R P S+ ++ FSYCL S
Sbjct: 206 LPGKGVPN------FIVGC---SVLSTSQPAGISGFGRGPPSLPSQLGLKKFSYCLLSRR 256
Query: 292 GSTGYITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDL--LDTCYDLSAYETVVVPKIAIH 349
T + + L +A ++ + K+A +E + L C+++S T P++ +
Sbjct: 257 YDD---TTESSSLIFELVAAEFEKQVQSKRATEVEGITGLRPCFNISGLNTPSFPELTLK 313
Query: 350 FLGGVDLELDVRGTLV-VASVSQVCLGFAT-------YPPDPNSITLGNVQQRGHEVHYD 401
F GG ++EL + + + VCL T + P +I LGN QQ+ V YD
Sbjct: 314 FRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGP-AIILGNFQQQNFYVEYD 372
Query: 402 VAGRRLGFGPGNCS 415
+ RLGF +C+
Sbjct: 373 LRNERLGFRQQSCN 386
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 77/280 (27%), Positives = 122/280 (43%), Gaps = 38/280 (13%)
Query: 62 LEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQR----LH-LKNSRRLRKPFPEFLKRTE 116
LE++ ++ P ++ T L+E++ D R LH L+ + R+ E L +
Sbjct: 3 LELIHRHSP--QVMGRPKTQLQRLKELVHSDSVRQLMILHKLRGGQIPRRKAKEVLSSSS 60
Query: 117 AFTFPANIN-------DTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQ- 168
I D +Y + +G P Q L+ DTGSD+TW CK HC
Sbjct: 61 GRGSDDAIEVPMHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCK--YHCRSR 118
Query: 169 ----------QRDPFFYASKSKTFFKIPCNSTSCRI-LRESFPFGNCNS--KECPFNIQY 215
+ F+A+ S +F IPC + C+I L + F NC + C ++ +Y
Sbjct: 119 NCSNRKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRY 178
Query: 216 ADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDK-SGASGIMGLDRSPVSI 274
+DGS + GF+A + +T+ E + L+GC + G A G+MGL S S
Sbjct: 179 SDGSTALGFFANETVTV-ELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSF 237
Query: 275 ITRTNTSY---FSYCLP---SPYGSTGYITFGKTDTVNTL 308
+ + FSYCL S + Y+TFG + + L
Sbjct: 238 AIKAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEAL 277
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/119 (26%), Positives = 62/119 (52%), Gaps = 1/119 (0%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+TF + +A + K++K + L+ C++ + +E +VP++ HF G +
Sbjct: 332 LTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGFEESLVPRLVFHFADGAEF 391
Query: 357 ELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
E V+ ++ A+ CLGF + P + +GN+ Q+ H +D+ ++LGF P +C+
Sbjct: 392 EPPVKSYVISAADGVRCLGFVSVAW-PGTSVVGNIMQQNHLWEFDLGLKKLGFAPSSCT 449
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 60/165 (36%), Positives = 75/165 (45%), Gaps = 7/165 (4%)
Query: 131 EYYIVVAIGEPK-QYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNS 189
EY I IG P+ Q V+L +DTGSD+ WTQC PC CF Q P F S S TF + C
Sbjct: 86 EYLIHFNIGTPRPQRVALTMDTGSDLVWTQCTPCPVCFDQPFPLFDPSVSSTFRAVACPD 145
Query: 190 TSCRILRESFPFGNCNSK--ECPFNIQYADGSGSGGFWATDRITIQEANSNGY--FTRYP 245
CR C K C + Y D S + G+ D T N G
Sbjct: 146 PICRP-SSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFTFMSPNGEGAPPVAVSG 204
Query: 246 FLLGCINNSSGD-KSGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
GC + ++G S SGI G R P+S+ ++ FSYCL S
Sbjct: 205 LAFGCGDYNTGVFASNESGIAGFGRGPLSLPSQLRVGRFSYCLTS 249
>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 413
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 55/159 (34%), Positives = 74/159 (46%), Gaps = 14/159 (8%)
Query: 132 YYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
YY+ IG P Q S ++D ++ WTQC C CF+Q P F + S TF PC +
Sbjct: 61 YYVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTA 120
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSG-SGGFWATDRITIQEANSNGYFTRYPFLLG 249
C ES P +C+ C + G + GF ATD I A F G
Sbjct: 121 VC----ESIPTRSCSGDVCSYKGPPTQLRGNTSGFAATDTFAIGTATVRLAF-------G 169
Query: 250 CINNSSGDK-SGASGIMGLDRSPVSIITRTNTSYFSYCL 287
C+ S D G SG +GL R+P S++ + + FSYCL
Sbjct: 170 CVVASDIDTMDGPSGFIGLGRTPWSLVAQMKLTRFSYCL 208
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/171 (29%), Positives = 85/171 (49%), Gaps = 20/171 (11%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
+Y++ ++G P+Q L++DTGSD+ + QC PC C++Q P + S S TF +PC+S
Sbjct: 33 QYFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQDGPLYQPSNSSTFTPVPCDSA 92
Query: 191 SCRILRESFPFGN-CNSK--------ECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
C ++ P G C+S C + +Y D S + G +A + T+ N
Sbjct: 93 ECLLIPA--PVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYETATVGGIRVNH-- 148
Query: 242 TRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
GC N + G A G++GL + +S ++ ++ F+YCL S
Sbjct: 149 ----VAFGCGNRNQGSFVSAGGVLGLGQGALSFTSQAGYAFENKFAYCLTS 195
Score = 39.3 bits (90), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 28/118 (23%), Positives = 48/118 (40%), Gaps = 2/118 (1%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T+ + +AF K + Y +A L C ++S + + P I F G
Sbjct: 273 VTYWSPQAYARIIAAFEKSVP-YPRAPPSPQGLPLCVNVSGIDHPIYPSFTIEFDQGATY 331
Query: 357 ELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ + S + CL D ++ +GN+ Q+ + V YD R+GF NC
Sbjct: 332 RPNQGNYFIEVSPNIDCLAMLESSSDGFNV-IGNIIQQNYLVQYDREEHRIGFAHANC 388
>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 756
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 64/190 (33%), Positives = 91/190 (47%), Gaps = 30/190 (15%)
Query: 126 DTVADE--YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
DT+ D Y + + +G P ++ +DTGSD+ WTQC PC C+ Q DP F SKS TF
Sbjct: 74 DTLFDYNIYLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDCYSQFDPIFDPSKSSTFN 133
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
+ C+ K C + I Y D + S G AT+ +TI + S F
Sbjct: 134 E-----------------QRCHGKSCHYEIIYEDNTYSKGILATETVTIH-STSGEPFVM 175
Query: 244 YPFLLGC-INNSSGDKSG----ASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTG 295
+GC ++N+ D SG +SGI+GL+ P S+I++ + Y SYC T
Sbjct: 176 AETTIGCGLHNTDLDNSGFASSSSGIVGLNMGPRSLISQMDLPYPGLISYCFSGQ--GTS 233
Query: 296 YITFGKTDTV 305
I FG V
Sbjct: 234 KINFGTNAIV 243
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 66/190 (34%), Positives = 92/190 (48%), Gaps = 30/190 (15%)
Query: 126 DTVADE--YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
DT+ D Y + + +G P + +DTGSD+ WTQC PC +C+ Q P F SKS TF
Sbjct: 413 DTLYDYSIYLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAPIFDPSKSSTF- 471
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
RE CN C + I YAD + S G AT+ +TI + S F
Sbjct: 472 ------------REQ----RCNGNSCHYEIIYADKTYSKGILATETVTI-PSTSGEPFVM 514
Query: 244 YPFLLGC-INNS----SGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTG 295
+GC ++N+ SG S +SGI+GL+ P+S+I++ + Y SYC T
Sbjct: 515 AETKIGCGLDNTNLQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLISYCFSGQ--GTS 572
Query: 296 YITFGKTDTV 305
I FG V
Sbjct: 573 KINFGTNAIV 582
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 34/121 (28%), Positives = 50/121 (41%), Gaps = 7/121 (5%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAK-GLEDLLDTCYDLSAYETVVVPKIAIHFLGGVD 355
+T+ N +R A + + K G ++LL CY + + P I +HF GG D
Sbjct: 637 LTYFPMSYCNLVREAVEQVVTAVKVPDMGSDNLL--CYYSDTID--IFPVITMHFSGGAD 692
Query: 356 LELDVRGTLVVASVSQV-CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L LD + + CL P ++ GN Q V YD + + F P NC
Sbjct: 693 LVLDKYNMYLETITGGIFCLAIGCNDPSMPAV-FGNRAQNNFLVGYDPSSNVISFSPTNC 751
Query: 415 S 415
S
Sbjct: 752 S 752
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 94/197 (47%), Gaps = 25/197 (12%)
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDP--FFYASKSKTFFKIP 186
+ +Y++ + IG P Q + L+ DTGSD+ W +C PC +C R P F+A S T+ I
Sbjct: 83 SGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNC-SHRSPGSAFFARHSTTYSAIH 141
Query: 187 CNSTSCRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANS----- 237
C S C+++ P CN C + YAD S + GF++ + +T+ +
Sbjct: 142 CYSPQCQLVPHPHP-NPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTSTGKVKKL 200
Query: 238 NGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSI---ITRTNTSYFSYCL------P 288
NG F + + + GA G+MGL R+P+S + R S FSYCL P
Sbjct: 201 NGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSKFSYCLMDYTLSP 260
Query: 289 SPYGSTGYITFGKTDTV 305
P T ++T G V
Sbjct: 261 PP---TSFLTIGGAQNV 274
Score = 41.2 bits (95), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 29/119 (24%), Positives = 46/119 (38%), Gaps = 1/119 (0%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+TF + AF KR+K A+ D C ++S +P+++ + GG
Sbjct: 336 LTFITEPAYTEILKAFKKRVKLPSPAEPTPGF-DLCMNVSGVTRPALPRMSFNLAGGSVF 394
Query: 357 ELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
R + CL D LGN+ Q+G + +D RLGF C+
Sbjct: 395 SPPPRNYFIETGDQIKCLAVQPVSQDGGFSVLGNLMQQGFLLEFDRDKSRLGFTRRGCA 453
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 89/157 (56%), Gaps = 10/157 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V +G+P + ++LDTGSDV W QCKPC C+QQ DP F + S ++ + C++
Sbjct: 156 EYFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSDPIFDPTASSSYNPLTCDAQ 215
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C+ L S C + +C + + Y DGS + G + T+ ++ + N +GC
Sbjct: 216 QCQDLEMS----ACRNGKCLYQVSYGDGSFTVGEYVTETVSFGAGSVNR------VAIGC 265
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+++ G G++G++GL P+S+ ++ + FSYCL
Sbjct: 266 GHDNEGLFVGSAGLLGLGGGPLSLTSQIKATSFSYCL 302
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 43/119 (36%), Positives = 64/119 (53%), Gaps = 4/119 (3%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
IT +T N++R AF ++ + A+G+ L DTCYDLS+ ++V VP ++ HF G
Sbjct: 377 ITRLRTQAYNSVRDAFKRKTSNLRPAEGVA-LFDTCYDLSSLQSVRVPTVSFHFSGDRAW 435
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L + L+ V C FA P + +GNVQQ+G V +D+A +GF P C
Sbjct: 436 ALPAKNYLIPVDGAGTYCFAFA--PTTSSMSIIGNVQQQGTRVSFDLANSLVGFSPNKC 492
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 63/111 (56%), Gaps = 2/111 (1%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ V IG P ++ SL+LDTGSD+ W QC PCI CF+Q P++ +S +F I C+
Sbjct: 191 EYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKESSSFENITCHDP 250
Query: 191 SCRILRESFPFGNC--NSKECPFNIQYADGSGSGGFWATDRITIQEANSNG 239
C+++ P C ++ CP+ Y D S + G +A + T+ NG
Sbjct: 251 RCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNG 301
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 54/119 (45%), Gaps = 2/119 (1%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T+ ++ AF K++K Y+ +G L CY++S E + +P I F G
Sbjct: 438 LTYFAEPAYEIIKEAFMKKIKGYELVEGFPPL-KPCYNVSGIEKMELPDFGILFSDGAMW 496
Query: 357 ELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ V + VCL P SI +GN QQ+ + YD+ RLG+ P C+
Sbjct: 497 DFPVENYFIQIEPDLVCLAILGTPKSALSI-IGNYQQQNFHILYDMKKSRLGYAPMKCT 554
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 63/215 (29%), Positives = 99/215 (46%), Gaps = 28/215 (13%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQC-------KPCIHCFQQRDPFFYASKSKTFFK 184
+ + V IG P Q +L++DTGSD+ WTQC + +QR+P + +S +F
Sbjct: 84 HSLTVGIGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQREPLYEPRRSSSFAY 143
Query: 185 IPCNSTSCRILRESFPFGNC-NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
+PC+ C+ F + NC + C ++ Y +GG A++ T N+
Sbjct: 144 LPCSDRLCQ--EGQFSYKNCARNNRCMYDELYGSAE-AGGVLASETFTFG-VNAK---VS 196
Query: 244 YPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL---------PSPYGST 294
P GC S+GD GASG+MGL +S++++ + FSYCL P +G+
Sbjct: 197 LPLGFGCGALSAGDLVGASGLMGLSPGIMSLVSQLSVPRFSYCLTPFAERKTSPLLFGAM 256
Query: 295 G----YITFGKTDTVNTLRSAFHKRMKKYKKAKGL 325
Y T G T + LR+ + Y GL
Sbjct: 257 ADLRRYRTTGTVQTTSILRNPAMETAYYYVPLVGL 291
Score = 45.4 bits (106), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 41/150 (27%), Positives = 56/150 (37%), Gaps = 12/150 (8%)
Query: 277 RTNTSYFSYCLPSPYGSTGYITFGKTDTVNTLRSAFHKRMKKYKKAKGL-------EDLL 329
R + S + P GS G I + +AF K +A L ED
Sbjct: 297 RLDVPATSLGMIKPDGSGGTIVDSGSTMSYLEETAFRAVKKAVVEAVRLPVANGTDEDYD 356
Query: 330 D--TCYDLS---AYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPN 384
D C+ L A E V P + +HF GG + L +CL T P
Sbjct: 357 DYELCFALPTGVAMEAVKTPPLVLHFDGGAAMTLPRDNYFQEPRAGLMCLAVGTSPDGFG 416
Query: 385 SITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+GNVQQ+ V +DV ++ F P C
Sbjct: 417 VSIIGNVQQQNMHVLFDVRNQKFSFAPTKC 446
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 92/176 (52%), Gaps = 13/176 (7%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCKP-CIHCFQQRDPF-FYASKSKTFFKIPCNSTS 191
+ +A+G P Q V+++LDTGS+++W C P R F S TF +PC+S
Sbjct: 68 VSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCDSAQ 127
Query: 192 CRILR-ESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
CR S P + SK+C ++ YADGS S G AT+ T+ + G R F GC
Sbjct: 128 CRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQ----GPPLRAAF--GC 181
Query: 251 IN---NSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFGKTD 303
+ ++S D +G++G++R +S +++ +T FSYC+ S G + G +D
Sbjct: 182 MATAFDTSPDGVATAGLLGMNRGALSFVSQASTRRFSYCI-SDRDDAGVLLLGHSD 236
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 92/162 (56%), Gaps = 14/162 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
+Y+ + +G P + V ++ DTGSDV+W QC PC C++Q+DP F S S +F + C S+
Sbjct: 13 DYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSSFKPLACASS 72
Query: 191 SCRILRESFPFGNCNSK-ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
C L+ C+ K +C + + Y DGS + G ++T+ ++ E +G
Sbjct: 73 ICGKLK----IKGCSRKNKCMYQVSYGDGSFTVGDFSTETLSFGEHAVRS------VAMG 122
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLP 288
C N+ G GA+G++GL R P+S ++T TSY FSYCLP
Sbjct: 123 CGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLP 164
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 55/114 (48%), Gaps = 5/114 (4%)
Query: 302 TDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVR 361
T LR AF + + + A G+ L DTCYDLS+ +T +P + + F GG + L
Sbjct: 244 TPAYTALRDAF-RSLVTFPSAPGIS-LFDTCYDLSSMKTATLPAVVLDFDGGASMPLPAD 301
Query: 362 GTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
G LV V CL FA P + +GNVQQ+ + D ++G P C
Sbjct: 302 GILVNVDDEGTYCLAFA--PEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 353
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/348 (26%), Positives = 139/348 (39%), Gaps = 69/348 (19%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC--IHCFQQRDPFFYASKSKTFFKIPCNS 189
Y + + IG P + DTGSD+TW QC PC CF Q P + S TF +PC+S
Sbjct: 96 YLMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFTLLPCDS 155
Query: 190 TSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
C L S + + +C + Y D S S G ++D I + + Y ++ F G
Sbjct: 156 QPCTQLPYS-QYVCSDYGDCIYAYTYGDNSYSYGGLSSDSIRLMLLQLH-YNSKICFGCG 213
Query: 250 CINNSSGDKSG-ASGIMGLDRSPVSIITRTNTSY---FSYC-LPSPYGSTGYITF----- 299
N + DKSG +GI+GL P+S++++ FSYC LP S + F
Sbjct: 214 FQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCLLPFSSNSNSKLKFGEAAI 273
Query: 300 ------------------------------------GKTD---------TVNTLRSAFHK 314
G+TD T+ L +F+
Sbjct: 274 VQGNGVVSTPLIIKPDLPFYYLNLEGITVGAKTVKTGQTDGNIIIDSGSTLTYLEESFYN 333
Query: 315 R-MKKYKKAKGLED------LLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
+ K+ +E+ D C+ + P + HF GG D+ L TLV+
Sbjct: 334 EFVSLVKETVAVEEDQYIPYPFDFCFTYKEGMSTP-PDVVFHFTGG-DVVLKPMNTLVLI 391
Query: 368 SVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ +C D +I GN+ Q V YD+ G ++ F P +CS
Sbjct: 392 EDNLICSTVVPSHFDGIAI-FGNLGQIDFHVGYDIQGGKVSFAPTDCS 438
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 64/125 (51%), Gaps = 5/125 (4%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ V +G P ++ SL+LDTGSD+ W QC PC CF Q + F+ S +F I CN
Sbjct: 161 EYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNEAFYDPKTSASFKNITCNDP 220
Query: 191 SCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP--- 245
C ++ P C S + CP+ Y D S + G +A + T+ + G + Y
Sbjct: 221 RCSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYKVEN 280
Query: 246 FLLGC 250
+ GC
Sbjct: 281 MMFGC 285
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 51/110 (46%), Gaps = 3/110 (2%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYE--TVVVPKIAIHFLGGVDLELDVRGTLV 365
+++ F ++MK+ +LD C+++S E + +P++ I F G + +
Sbjct: 419 IKNKFAEKMKENYLVFRDFPVLDPCFNVSGIEENNIHLPELGIAFADGAVWNFPAENSFI 478
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
S VCL P SI +GN QQ+ + YD RLGF P C+
Sbjct: 479 WLSEDLVCLAILGTPKSTFSI-IGNYQQQNFHILYDTKMSRLGFTPTKCA 527
>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
Length = 396
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/159 (35%), Positives = 75/159 (47%), Gaps = 14/159 (8%)
Query: 132 YYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
YY+ IG P Q S ++D ++ WTQC C CF+Q P F + S TF PC +
Sbjct: 44 YYVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTA 103
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSG-SGGFWATDRITIQEANSNGYFTRYPFLLG 249
C ES P +C+ C + G + GF ATD I A R F G
Sbjct: 104 VC----ESIPTRSCSGDVCSYKGPPTQLRGNTSGFAATDTFAIGTAT-----VRLAF--G 152
Query: 250 CINNSSGDK-SGASGIMGLDRSPVSIITRTNTSYFSYCL 287
C+ S D G SG +GL R+P S++ + + FSYCL
Sbjct: 153 CVVASDIDTMDGPSGFIGLGRTPWSLVAQMKLTRFSYCL 191
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/157 (35%), Positives = 88/157 (56%), Gaps = 10/157 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V IG+P + V ++LDTGSDV W QC PC C+ Q +P F S S ++ + C++
Sbjct: 147 EYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTP 206
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C L S C + C + + Y DGS + G +AT+ +TI G +GC
Sbjct: 207 QCNALEVS----ECRNATCLYEVSYGDGSYTVGDFATETLTI------GSTLVQNVAVGC 256
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+++ G GA+G++GL +++ ++ NT+ FSYCL
Sbjct: 257 GHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCL 293
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 48/119 (40%), Positives = 67/119 (56%), Gaps = 4/119 (3%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T +T+ N+LR +F K +KA G+ + DTCY+LSA TV VP +A HF GG L
Sbjct: 368 VTRLQTEIYNSLRDSFVKGTLDLEKAAGVA-MFDTCYNLSAKTTVEVPTVAFHFPGGKML 426
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L + ++ V SV CL FA P + +GNVQQ+G V +D+A +GF C
Sbjct: 427 ALPAKNYMIPVDSVGTFCLAFA--PTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 66/234 (28%), Positives = 102/234 (43%), Gaps = 27/234 (11%)
Query: 60 ASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFT 119
++L+V Y PCS PS + L+ ++ L ++ + R F L ++
Sbjct: 32 SNLQVFHVYSPCSPF-------WPS--KPLKWEESVLQMQAKDQARLQFLSSLVARKSVV 82
Query: 120 FPANINDTVADEYYIVVA-IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASK 178
A+ V YIV A IG P Q + L +DT +D W C C+ C F K
Sbjct: 83 PIASGRQIVQSPTYIVRAKIGTPAQTMLLAMDTSNDAAWIPCSGCVGC---SSTVFNNVK 139
Query: 179 SKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSN 238
S TF + C + C+ P C C FN+ Y S + + D +T+ +
Sbjct: 140 STTFKTVGCEAPQCK----QVPNSKCGGSACAFNMTYGSSSIAANL-SQDVVTLATDSIP 194
Query: 239 GYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
Y GC+ ++G G++GL R P+S++++T Y FSYCLPS
Sbjct: 195 SY------TFGCLTEATGSSIPPQGLLGLGRGPMSLLSQTQNLYQSTFSYCLPS 242
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 51/111 (45%), Gaps = 10/111 (9%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
+R AF KR+ DTCY +V P I F G+++ L L+ +
Sbjct: 328 VRDAFRKRVGNATVTS--LGGFDTCYT----SPIVAPTITFMF-SGMNVTLPPDNLLIHS 380
Query: 368 SVSQV-CLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ S + CL A P + NS+ + N+QQ+ H + +DV RLG C+
Sbjct: 381 TASSITCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRLGVAREPCT 431
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 93/391 (23%), Positives = 148/391 (37%), Gaps = 88/391 (22%)
Query: 107 PFPEFLKRTEAFTFPANIND-TVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIH 165
P PE AF P T +Y++ +G P Q L+ DTGSD+TW +C+
Sbjct: 88 PMPE----ASAFAMPLTSGAYTGTGQYFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRA 143
Query: 166 CFQQRDPF-----FYASKSKTFFKIPCNSTSCRILRESFPFGNCNS-----KECPFNIQY 215
P F + SK++ IPC+S +C+ F NC++ C ++ +Y
Sbjct: 144 SSPDASPLASPRVFRPANSKSWAPIPCSSDTCKSY-VPFSLANCSAGTTPPAPCGYDYRY 202
Query: 216 ADGSGSGGFWATDRITI--QEANSNGYFTRYPFLLGCINNSSGDKSGAS-GIMGLDRSPV 272
D S + G TD TI + S+ +LGC + G +S G++ L S +
Sbjct: 203 KDKSSARGVVGTDAATIALSGSGSDRKAKLQEVVLGCTTSYDGQSFQSSDGVLSLGNSNI 262
Query: 273 SIITRTNTSY---FSYCLP---SPYGSTGYITFG-------------------------K 301
S +R + FSYCL +P +T Y+TFG
Sbjct: 263 SFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVT 322
Query: 302 TDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLS-----AYETVV-------------- 342
D V+ A + + + K +LD+ L+ AY+ VV
Sbjct: 323 VDAVSVAGKALNIPAEVWDVKKNGGAILDSGTSLTILATPAYKAVVAALSKQLARVPRVT 382
Query: 343 ------------------VPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPN 384
VP++ + F G L + ++ A+ C+G P
Sbjct: 383 MDPFEYCYNWTATRRPPAVPRLEVRFAGSARLRPPTKSYVIDAAPGVKCIGLQEG-VWPG 441
Query: 385 SITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+GN+ Q+ H +D+A R L F C+
Sbjct: 442 VSVIGNILQQEHLWEFDLANRWLRFQESRCA 472
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 104/214 (48%), Gaps = 14/214 (6%)
Query: 19 NNGAYADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGI 78
NN +Y L+ ++ + ++P V +G +K ++VV + +L+ G
Sbjct: 35 NNSSYPTFQHLNVKETIAGTRIIPLEVSEDHE----EGGEKWMMKVVHR----DQLSFGN 86
Query: 79 ST-HAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVA 137
S H L+ L++D +R+ RRL + + T + + + EY++ +
Sbjct: 87 SDDHRHRLDGRLKRDAKRVA-SLIRRLSSGGGGSYRVDDFGTDVISGMEQGSGEYFVRIG 145
Query: 138 IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRE 197
+G P + +++D+GSD+ W QC+PC C+ Q DP F + S +F + C+S+ C L
Sbjct: 146 VGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCSSSVCDRLEN 205
Query: 198 SFPFGNCNSKECPFNIQYADGSGSGGFWATDRIT 231
+ C++ C + + Y DGS + G A + +T
Sbjct: 206 A----GCHAGRCRYEVSYGDGSYTKGTLALETLT 235
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 54/110 (49%), Gaps = 4/110 (3%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
R AF + +A G+ + DTCYDL + +V VP ++ +F GG L L R L+
Sbjct: 373 QAFRDAFLAQTANLPRATGVA-IFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLI 431
Query: 366 -VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ C FA P LGN+QQ G ++ +D A +GFGP C
Sbjct: 432 PMDDAGTFCFAFA--PSTSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 479
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 69/227 (30%), Positives = 109/227 (48%), Gaps = 16/227 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ V +G P ++ SL+LDTGSD+ W QC PC CFQQ F+ S ++ I CN
Sbjct: 169 EYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDPKASASYKNITCNDQ 228
Query: 191 SCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRY---P 245
C ++ P C S + CP+ Y D S + G +A + T+ + G Y
Sbjct: 229 RCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVEN 288
Query: 246 FLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGY---ITF 299
+ GC + + G GA+G++GL R P+S ++ + Y FSYCL T + F
Sbjct: 289 MMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIF 348
Query: 300 GKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKI 346
G+ + + H + G E+L+DT Y + +V ++
Sbjct: 349 GEDKDLLS-----HPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEV 390
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 41/88 (46%), Gaps = 1/88 (1%)
Query: 328 LLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSIT 387
+LD C+++S V +P++ I F G + + + VCL P SI
Sbjct: 447 ILDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSAFSI- 505
Query: 388 LGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+GN QQ+ + YD RLG+ P C+
Sbjct: 506 IGNYQQQNFHILYDTKRSRLGYAPTKCA 533
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 89/268 (33%), Positives = 122/268 (45%), Gaps = 37/268 (13%)
Query: 61 SLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRT-EAFT 119
S ++ Y CS T + E +R D RL FLKRT +
Sbjct: 53 SFPLIHIYSECSPFRPPNRTWESLMSEKIRGDANRLR-------------FLKRTSRSSK 99
Query: 120 FPANINDTV---ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYA 176
AN N V + EY I V G PKQ + L+DTGSDV W CK C C P F
Sbjct: 100 EDANANVPVRSGSGEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGC-HSTAPIFDP 158
Query: 177 SKSKTFFKIPCNSTSCRILRESFPFGNC--NSKECPFNIQYADGSGSGGFWATDRITIQE 234
+KS ++ C+S C+ + GNC NSK C F + Y DG+ G A+D IT+
Sbjct: 159 AKSSSYKPFACDSQPCQEIS-----GNCGGNSK-CQFEVLYGDGTQVDGTLASDAITL-- 210
Query: 235 ANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTS-----YFSYCLPS 289
+ Y + F GC + S D + G+MGL +S++T+ T+ FSYCLPS
Sbjct: 211 --GSQYLPNFSF--GCAESLSEDTYSSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPS 266
Query: 290 PYGSTGYITFGKTDTVNTLRSAFHKRMK 317
S+G + GK V++ F +K
Sbjct: 267 SSTSSGSLVLGKEAAVSSSSLKFTTLIK 294
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 41/119 (34%), Positives = 61/119 (51%), Gaps = 6/119 (5%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
IT+ LR AF +++ + +ED+ DTCYDLS+ +V VP I +H VDL
Sbjct: 337 ITYLVPSAYKDLRDAFRQQLSSLQPTP-VEDM-DTCYDLSS-SSVDVPTITLHLDRNVDL 393
Query: 357 ELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
L L+ CL F++ D SI +GNVQQ+ + +DV ++GF C+
Sbjct: 394 VLPKENILITQESGLSCLAFSST--DSRSI-IGNVQQQNWRIVFDVPNSQVGFAQEQCA 449
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 76/240 (31%), Positives = 104/240 (43%), Gaps = 33/240 (13%)
Query: 54 PQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLK 113
PQG + L V PCS Q + S E L +D+ RL +S + P
Sbjct: 27 PQG-HPSDLRVFHVNSPCSPFKQ---PNTVSWESTLLKDKARLQYLSSLAKKPSVPIASG 82
Query: 114 RTEAFTFPANINDTVADEYYIVVA-IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDP 172
R V YIV A IG P Q + + LDT +D W C C+ C
Sbjct: 83 RA-----------IVQSPTYIVRANIGTPAQPMLVALDTSNDAAWVPCSGCVGCASSV-- 129
Query: 173 FFYASKSKTFFKIPCNSTSCRILRESFPFGNCNS-KECPFNIQYADGSGSGGFWATDRIT 231
F SKS + + C++ C+ P C + K C FN+ Y GS D +T
Sbjct: 130 LFDPSKSSSSRNLQCDAPQCK----QAPNPTCTAGKSCGFNMTYG-GSTIEASLTQDTLT 184
Query: 232 IQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLP 288
+ +N Y F GCI+ ++G A G+MGL R P+S+I++T Y FSYCLP
Sbjct: 185 L----ANDVIKSYTF--GCISKATGTSLPAQGLMGLGRGPLSLISQTQNLYMSTFSYCLP 238
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/164 (34%), Positives = 89/164 (54%), Gaps = 13/164 (7%)
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCN 188
+ EY++ + +G P + V+++ DTGSDV W QC PC C+ Q DP F S S TF I C
Sbjct: 78 SGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFSSTFQSITCG 137
Query: 189 STSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
S+ C+ L C +C + + Y DGS + G ++T+ ++ N +
Sbjct: 138 SSLCQQLL----IRGCRRNQCLYQVSYGDGSFTVGEFSTETLSFGSNAVNS------VAI 187
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
GC +N+ G +GA+G++GL + +S ++ Y FSYCLP+
Sbjct: 188 GCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPT 231
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 56/115 (48%), Gaps = 3/115 (2%)
Query: 302 TDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVR 361
T N +R AF M K L DTCYDLS ++++P ++ F GG + L +
Sbjct: 311 TSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATMALPAQ 370
Query: 362 GTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+V V + CL FA P N +GN+QQ+ + +D G R+G G C+
Sbjct: 371 NIMVPVDNSGTYCLAFA--PNSENFSIIGNIQQQSFRMSFDSTGNRVGIGANQCN 423
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/164 (34%), Positives = 89/164 (54%), Gaps = 13/164 (7%)
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCN 188
+ EY++ + +G P + V+++ DTGSDV W QC PC C+ Q DP F S S TF I C
Sbjct: 78 SGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFSSTFQSITCG 137
Query: 189 STSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
S+ C+ L C +C + + Y DGS + G ++T+ ++ N +
Sbjct: 138 SSLCQQLL----IRGCRRNQCLYQVSYGDGSFTVGEFSTETLSFGSNAVNS------VAI 187
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
GC +N+ G +GA+G++GL + +S ++ Y FSYCLP+
Sbjct: 188 GCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPT 231
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/114 (33%), Positives = 55/114 (48%), Gaps = 3/114 (2%)
Query: 302 TDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVR 361
T N +R AF M K L DTCYDLS ++++P ++ F GG + L +
Sbjct: 311 TSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATMALPAQ 370
Query: 362 GTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+V V + CL FA P N +GN+QQ+ + +D G R+G G C
Sbjct: 371 NIMVPVDNSGTYCLAFA--PNSENFSIIGNIQQQSFRMSFDSTGNRVGIGANQC 422
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/184 (30%), Positives = 89/184 (48%), Gaps = 14/184 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EYY + +G P Q L++DTGS++TW +C PC C D + A++S ++ + CN++
Sbjct: 99 EYYTSIKLGSPGQEAILIVDTGSELTWLKCLPCKVCAPSVDTIYDAARSVSYKPVTCNNS 158
Query: 191 SCRILRESFPFGNC-NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
+ C +C F Y DGS S G +TD + ++ T F G
Sbjct: 159 QLCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAFG 218
Query: 250 CINNSSGD----KSGASGIMGLDRSPVSIITRTNTSY---FSYCLP---SPYGSTGYITF 299
C + GD +GASGI+GL+ +++ + + FS+C P S STG + F
Sbjct: 219 C---AQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSHLNSTGVVFF 275
Query: 300 GKTD 303
G +
Sbjct: 276 GNAE 279
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 55/118 (46%), Gaps = 13/118 (11%)
Query: 308 LRSAFHKRMK---KYKKAKGLEDLLDTCYDLSAYET----VVVPKIAIHFLGGVDLELDV 360
LR AF K K+ + DL TC+ +S + +P +++ F GV + +
Sbjct: 344 LREAFLKHRPPSLKHLEGDSFGDL-GTCFKVSNDDIDELHRTLPSLSLVFEDGVTIGIPS 402
Query: 361 RGTLVVASVSQ----VCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
G L+ + Q +C F P+P ++ +GN QQ+ V YD+ R+GF +C
Sbjct: 403 IGVLLPVARYQNHVKMCFAFEDGGPNPVNV-IGNYQQQNLWVEYDIQRSRVGFARASC 459
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 70/221 (31%), Positives = 107/221 (48%), Gaps = 20/221 (9%)
Query: 101 SRRLRKPFPEFLKRTEAFT----FPANINDTVAD------EYYIVVAIGEPKQYVSLLLD 150
S+R+R R FT A++N D EY + +++G P + + D
Sbjct: 53 SQRIRNAIHRSFNRVSHFTDLSEMDASLNSPQTDITPCGGEYLMNLSLGTPPSPIMAVAD 112
Query: 151 TGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNS--KE 208
TGS++ WTQCKPC C+ Q DP F S T+ + C+S+ C L +C++ K
Sbjct: 113 TGSNLIWTQCKPCDDCYTQVDPLFDPKASSTYKDVSCSSSQCTALENQ---ASCSTEDKT 169
Query: 209 CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC-INNSSGDKSGASGIMGL 267
C + + YADGS + G +A D +T+ + N ++GC NN+ ++ +SG++GL
Sbjct: 170 CSYLVSYADGSYTMGKFAVDTLTLGSTD-NRPVQLKNIIIGCGQNNAVTFRNKSSGVVGL 228
Query: 268 DRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDTV 305
VS+I + S FSYCL T I FG V
Sbjct: 229 GGGAVSLIKQLGDSIDGKFSYCLVPENDQTSKINFGTNAVV 269
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 98/427 (22%), Positives = 164/427 (38%), Gaps = 88/427 (20%)
Query: 69 GPCSRLNQGISTHAPSLEEILRQDQQR-----LHLKNSRRLRKPFPEFLKRTEAFTFPAN 123
G +RL+ + S+ R D++R L + R R+ + + A + P +
Sbjct: 22 GKSARLDLFPAAPGASVTARARGDRRRHAYISAQLPSRRGGRQRVAAEVASSSAVSLPMS 81
Query: 124 INDTVA-DEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTF 182
+Y++ V +G P Q +L+ DTGS++TW +C F SK++
Sbjct: 82 SGAYAGTGQYFVKVLVGTPAQEFTLVADTGSELTWVKCA---GGASPPGLVFRPEASKSW 138
Query: 183 FKIPCNSTSCRILRESFPFGNCNSKE--CPFNIQYADGS-GSGGFWATDRITIQEANSNG 239
+PC+S +C+ L F NC+S C ++ +Y +GS G+ G TD TI A G
Sbjct: 139 APVPCSSDTCK-LDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATI--ALPGG 195
Query: 240 YFTRYP-FLLGCINNSSGDK-SGASGIMGLDRSPVSIITRTNTSY---FSYCLP---SPY 291
+ +LGC + G G++ L + +S +R + FSYCL +P
Sbjct: 196 KVAQLQDVVLGCSSTHDGQSFKSVDGVLSLGNAKISFASRAAARFGGSFSYCLVDHLAPR 255
Query: 292 GSTGYITFG-------------------------KTDTVNTLRSAFHKRMKKYKKAKGLE 326
+TGY+ FG K D V+ A + + G
Sbjct: 256 NATGYLAFGPGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVWDPKSG-G 314
Query: 327 DLLDTCYDLS-----AYETVV---------------------------------VPKIAI 348
+LD+ L+ AY+ VV +PK+A+
Sbjct: 315 VILDSGTTLTVLATPAYKAVVAALTKLLAGVPKVDFPPFEHCYNWTAPRPGAPEIPKLAV 374
Query: 349 HFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLG 408
F G LE + ++ C+G P +GN+ Q+ H +D+ +
Sbjct: 375 QFTGCARLEPPAKSYVIDVKPGVKCIGLQEG-EWPGVSVIGNIMQQEHLWEFDLKNMEVR 433
Query: 409 FGPGNCS 415
F P C+
Sbjct: 434 FMPSTCT 440
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 106/211 (50%), Gaps = 22/211 (10%)
Query: 87 EILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVS 146
EI +R H + +R + L + F P + EY I ++ G P Q +
Sbjct: 52 EIFIAAVKRGHERRARLAK----HVLAGDQLFETPVASGN---GEYLIDISYGNPPQKST 104
Query: 147 LLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNS 206
++DTGSD+ W QC PC C++ F SKS ++ + C S C+ L PF +C +
Sbjct: 105 AIVDTGSDLNWVQCLPCKSCYETLSAKFDPSKSASYKTLGCGSNFCQDL----PFQSC-A 159
Query: 207 KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMG 266
C ++ Y DGS + G +TD +TI G F GC N++ G +GA G++G
Sbjct: 160 ASCQYDYMYGDGSSTSGALSTDDVTI----GTGKIPNVAF--GCGNSNLGTFAGAGGLVG 213
Query: 267 LDRSPVSIITR---TNTSYFSYCLPSPYGST 294
L + P+S++++ T T FSYCL P GST
Sbjct: 214 LGKGPLSLVSQLGGTATKKFSYCL-VPLGST 243
Score = 45.8 bits (107), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 33/126 (26%), Positives = 52/126 (41%), Gaps = 20/126 (15%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGG--- 353
+T+ D N + +A K Y +A G L+ C+ + P + HF G
Sbjct: 313 LTYLDVDAFNPMVAAL-KAALPYPEADGSFYGLEYCFSTAGVANPTYPTVVFHFNGADVA 371
Query: 354 -----VDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLG 408
+ LD GT +A S GF+ + GN+QQ H + +D+ +R+G
Sbjct: 372 LAPDNTFIALDFEGTTCLAMASST--GFSIF---------GNIQQLNHVIVHDLVNKRIG 420
Query: 409 FGPGNC 414
F NC
Sbjct: 421 FKSANC 426
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 58/175 (33%), Positives = 90/175 (51%), Gaps = 14/175 (8%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSC- 192
+ +A+G P Q V+++LDTGS+++W C D F S TF +PC S C
Sbjct: 63 VSLAVGTPPQNVTMVLDTGSELSWLLCATGRAAAAAAD-SFRPRASATFAAVPCGSARCS 121
Query: 193 -RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCI 251
R L + P + S+ C ++ YADGS S G ATD + +A R F GC+
Sbjct: 122 SRDL-PAPPSCDAASRRCRVSLSYADGSASDGALATDVFAVGDAPP----LRSAF--GCM 174
Query: 252 N---NSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFGKTD 303
+ +SS D +G++G++R +S +T+ +T FSYC+ S G + G +D
Sbjct: 175 SAAYDSSPDAVATAGLLGMNRGALSFVTQASTRRFSYCI-SDRDDAGVLLLGHSD 228
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 91/176 (51%), Gaps = 13/176 (7%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCKP-CIHCFQQRDPF-FYASKSKTFFKIPCNSTS 191
+ +A+G P Q V+++LDTGS+++W C P R F S TF +PC S
Sbjct: 67 VSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCGSAQ 126
Query: 192 CRILR-ESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
CR S P + SK+C ++ YADGS S G AT+ T+ + G R F GC
Sbjct: 127 CRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQ----GPPLRAAF--GC 180
Query: 251 IN---NSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFGKTD 303
+ ++S D +G++G++R +S +++ +T FSYC+ S G + G +D
Sbjct: 181 MATAFDTSPDGVATAGLLGMNRGALSFVSQASTRRFSYCI-SDRDDAGVLLLGHSD 235
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 56/157 (35%), Positives = 87/157 (55%), Gaps = 10/157 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V IG P + V ++LDTGSDV W QC PC C+ Q +P F S S ++ + C++
Sbjct: 150 EYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTP 209
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C L S C + C + + Y DGS + G +AT+ +TI G +GC
Sbjct: 210 QCNALEVS----ECRNATCLYEVSYGDGSYTVGDFATETLTI------GSTLVQNVAVGC 259
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+++ G GA+G++GL +++ ++ NT+ FSYCL
Sbjct: 260 GHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCL 296
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 47/119 (39%), Positives = 66/119 (55%), Gaps = 4/119 (3%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T +T N+LR +F K +KA G+ + DTCY+LSA T+ VP +A HF GG L
Sbjct: 371 VTRLQTGIYNSLRDSFLKGTSDLEKAAGVA-MFDTCYNLSAKTTIEVPTVAFHFPGGKML 429
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L + ++ V SV CL FA P + +GNVQQ+G V +D+A +GF C
Sbjct: 430 ALPAKNYMIPVDSVGTFCLAFA--PTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 486
>gi|297805186|ref|XP_002870477.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316313|gb|EFH46736.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 57/215 (26%), Positives = 99/215 (46%), Gaps = 16/215 (7%)
Query: 117 AFTFPANI-NDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFY 175
AF+FP + ++ YY + IG P + ++++DTGSDV W C C+ C Q FF
Sbjct: 66 AFSFPVERGTNPISRIYYTTLQIGTPPREFNVVIDTGSDVLWVSCISCVGCPLQNVTFFD 125
Query: 176 ASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA 235
S + K+ C+ C + C+ E + ++Y+DGS + G++ +D I+ +
Sbjct: 126 PGASSSAVKLACSDKRC--FSDLHKKSGCSPLE--YKVEYSDGSFTSGYYISDLISFETV 181
Query: 236 NSNGYFTR--YPFLLGCINNSSG----DKSGASGIMGLDRSPVSIIT-----RTNTSYFS 284
S+ + PF+ GC N +G ++ GI+GL + + +++ R FS
Sbjct: 182 MSSNLTVKSSAPFVFGCSNLHAGLISLPETSIHGIVGLGKGRLLVVSQLSSQRLAPEVFS 241
Query: 285 YCLPSPYGSTGYITFGKTDTVNTLRSAFHKRMKKY 319
CL G I G+ NT+ + + Y
Sbjct: 242 LCLSGGQEGGGVIILGENRLPNTVYTPLVRSQTHY 276
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 59/101 (58%), Gaps = 4/101 (3%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+I + +G P + +++D+GSD+ W QC+PC C+ Q DP F + S +F +PC+S+
Sbjct: 141 EYFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQCYHQTDPVFDPADSASFMGVPCSSS 200
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRIT 231
C + + C++ C + + Y DGS + G A + +T
Sbjct: 201 VCERIENA----GCHAGGCRYEVMYGDGSYTKGTLALETLT 237
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 57/109 (52%), Gaps = 4/109 (3%)
Query: 307 TLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV- 365
R AF + +A G+ + DTCY+L+ + +V VP ++ +F GG L L R L+
Sbjct: 376 AFRDAFIGQTGNLPRASGVS-IFDTCYNLNGFVSVRVPTVSFYFAGGPILTLPARNFLIP 434
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V V C FA P + I GN+QQ G ++ +D A +GFGP C
Sbjct: 435 VDDVGTFCFAFAASPSGLSII--GNIQQEGIQISFDGANGFVGFGPNVC 481
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 85/364 (23%), Positives = 143/364 (39%), Gaps = 83/364 (22%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFFKIP 186
YY V +G P ++ +DTGSDV W C C C Q + FF S T I
Sbjct: 75 YYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMIA 134
Query: 187 CNSTSCRILRESFPFGNCNSK--ECPFNIQYADGSGSGGFWATDRI---TIQEANSNGYF 241
C+ C +S C+S+ +C + QY DGSG+ G++ +D + TI E +
Sbjct: 135 CSDQRCNNGIQSSD-ATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNS 193
Query: 242 TRYPFLLGCINNSSGDKS----GASGIMGLDRSPVSIITRTNTS-----YFSYCLPSPYG 292
T P + GC N +GD + GI G + +S+I++ ++ FS+CL
Sbjct: 194 TA-PVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGDSS 252
Query: 293 STGYITFGKTDTVNTLRSAF-------------------------------HKRMKKYKK 321
G + G+ N + ++ + R
Sbjct: 253 GGGILVLGEIVEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRGTIVDS 312
Query: 322 AKGLEDLLDTCYD--------------------------LSAYETVVVPKIAIHFLGGVD 355
L L + YD +++ T V P+++++F GG
Sbjct: 313 GTTLAYLAEEAYDPFVSAITASIPQSVHTVVSRGNQCYLITSSVTEVFPQVSLNFAGGAS 372
Query: 356 LELDVRGTLV----VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGP 411
+ L + L+ + + C+GF +I LG++ + V YD+AG+R+G+
Sbjct: 373 MILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITI-LGDLVLKDKIVVYDLAGQRIGWAN 431
Query: 412 GNCS 415
+CS
Sbjct: 432 YDCS 435
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 60/220 (27%), Positives = 104/220 (47%), Gaps = 14/220 (6%)
Query: 89 LRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIV-VAIGEPKQYVSL 147
+ + +R+ ++ R+ + + F N+ + + ++V ++G+P
Sbjct: 55 VAERAERIVKTSATRIAYLYAQIKGDIHMNDFELNLLPSTYEPLFLVNFSMGQPATPQLA 114
Query: 148 LLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNS- 206
++DTGS++ W +C PC C QQ P SKS T+ +PC +T C P CN
Sbjct: 115 IMDTGSNILWVRCAPCKRCTQQNGPLLDPSKSSTYASLPCTNTMCHYA----PSAYCNRL 170
Query: 207 KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGA--SGI 264
+C +N+ YA G S G AT+++ I ++ G + GC ++ +GD +G+
Sbjct: 171 NQCGYNLSYATGLSSAGVLATEQL-IFHSSDEGVNAVPSVVFGC-SHENGDYKDRRFTGV 228
Query: 265 MGLDRSPVSIITRTNTSYFSYCL---PSPYGSTGYITFGK 301
GL + S +TR S FSYCL P+ + FG+
Sbjct: 229 FGLGKGITSFVTRMG-SKFSYCLGNIADPHYGYNQLVFGE 267
>gi|297838267|ref|XP_002887015.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297332856|gb|EFH63274.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 324
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 77/291 (26%), Positives = 132/291 (45%), Gaps = 54/291 (18%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDP-----FFYASKSKTFFKIPCN 188
I + IG P Q ++LDTGS ++W QC +++ P F S S +F +PC+
Sbjct: 76 ISLPIGTPPQAQQMVLDTGSQLSWIQCH------RKKLPPKPKTSFDPSLSSSFSTLPCS 129
Query: 189 STSC--RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPF 246
C RI + P +++ C ++ YADG+ + G ++IT SN T P
Sbjct: 130 HPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITF----SNTEITP-PL 184
Query: 247 LLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFGKTDTVN 306
+LGC SS D+ GI+G++R +S +++ + FSYC+P G+ G
Sbjct: 185 ILGCATESSDDR----GILGMNRGRLSFVSQAKITKFSYCIPPKSNRPGFTPTG------ 234
Query: 307 TLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPK--IAIHFLGGVDLELDVRGTL 364
+F+ + +KG + + + ++VPK + ++ G+ R ++
Sbjct: 235 ----SFY--LGDNPNSKGFKYV--SLLTFPERVEILVPKERVLVNVGDGIHCVGIGRSSM 286
Query: 365 VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ A+ S +GNV Q+ V +DV RR+GF +CS
Sbjct: 287 LGAA----------------SNIIGNVHQQNLWVEFDVTNRRVGFARADCS 321
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 35/101 (34%), Positives = 57/101 (56%), Gaps = 4/101 (3%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ + +G P + +++D+GSD+ W QCKPC C+ Q DP F + S +F + C+S
Sbjct: 42 EYFVRIGLGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSA 101
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRIT 231
C + + CNS C + + Y DGS + G A + +T
Sbjct: 102 VCDRVENA----GCNSGRCRYEVSYGDGSYTKGTLALETLT 138
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 38/115 (33%), Positives = 60/115 (52%), Gaps = 6/115 (5%)
Query: 302 TDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVR 361
T R+AF ++ + +A G+ + DTCY+L + +V VP ++ +F GG L +
Sbjct: 272 TVAYEAFRNAFIEQTQNLPRASGVS-IFDTCYNLFGFLSVRVPTVSFYFSGGPILTIPAN 330
Query: 362 GTLV-VASVSQVCLGFATYPPDPNSIT-LGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L+ V C FA P P+ ++ LGN+QQ G ++ D A +GFGP C
Sbjct: 331 NFLIPVDDAGTFCFAFA---PSPSGLSILGNIQQEGIQISVDEANEFVGFGPNIC 382
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 87/162 (53%), Gaps = 7/162 (4%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ ++IG P + DTGSD+TW QCKPC C++Q P F KS T+ C+S
Sbjct: 84 EYFMSISIGTPPSKFLAIADTGSDLTWVQCKPCQQCYKQNTPLFDKKKSSTYKTESCDSI 143
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLG 249
+C L E + + C + Y D S + G AT+ I+I +S+G +P G
Sbjct: 144 TCNALSEHEEGCDESRNACKYRYSYGDESFTKGEVATETISID--SSSGSPVSFPGTAFG 201
Query: 250 CINNSSGD-KSGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
C N+ G + SGI+GL P+S++++ +S FSYCL
Sbjct: 202 CGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCL 243
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 56/211 (26%), Positives = 101/211 (47%), Gaps = 13/211 (6%)
Query: 98 LKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVT 156
L + R +K + + + P +VA Y+ + +G P +++DTGS +T
Sbjct: 96 LLHGHRKKKAGGVGGSQASSSSVPLTPGASVAVGNYVTRLGLGTPATSYVMVVDTGSSLT 155
Query: 157 WTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESF--PFGNCNSKECPFNI 213
W QC PC + C +Q P F S T+ + C+S+ C L+ + P S C +
Sbjct: 156 WLQCSPCSVSCHRQAGPVFDPRASGTYAAVQCSSSECGELQAATLNPSACSVSNVCIYQA 215
Query: 214 QYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVS 273
Y D S S G+ + D ++ + G++ GC ++ G ++G++GL ++ +S
Sbjct: 216 SYGDSSYSVGYLSKDTVSFGSGSFPGFY------YGCGQDNEGLFGRSAGLIGLAKNKLS 269
Query: 274 IITRTNTSY---FSYCLPSPYGSTGYITFGK 301
++ + S FSYCLP+ + GY++ G
Sbjct: 270 LLYQLAPSLGYAFSYCLPTSSAAAGYLSIGS 300
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/88 (39%), Positives = 46/88 (52%), Gaps = 4/88 (4%)
Query: 328 LLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSIT 387
+LDTC+ SA + VP++ + F GG L L L+ S CL FA P +
Sbjct: 386 ILDTCFRGSA-AGLRVPRVDMAFAGGATLALSPGNVLIDVDDSTTCLAFA---PTGGTAI 441
Query: 388 LGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+GN QQ+ V YDVA R+GF G CS
Sbjct: 442 IGNTQQQTFSVVYDVAQSRIGFAAGGCS 469
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 101/427 (23%), Positives = 175/427 (40%), Gaps = 85/427 (19%)
Query: 58 DKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEA 117
D ++E++++ P S T + +R+ R+H + K ++
Sbjct: 27 DGFTVELINRDSPKSPFYNPRETPTQRIVSAVRRSMSRVHHFSPT----------KNSDI 76
Query: 118 FTFPANINDTVAD--EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFY 175
FT A ++ +++ EY + ++G P + + DTGSD+ WTQCKPC C++Q P F
Sbjct: 77 FTDTAQ-SEMISNQGEYLMKFSLGTPAFDILAIADTGSDLIWTQCKPCDQCYEQDAPLFD 135
Query: 176 ASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA 235
S T+ I C++ C +L+E +K C ++ Y D S + G A D IT+
Sbjct: 136 PKSSSTYRDISCSTKQCDLLKEGASCSGEGNKTCHYSYSYGDRSFTSGNVAADTITL--G 193
Query: 236 NSNGYFTRYP-FLLGCINNSSGD-KSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS- 289
+++G P ++GC +N+ G SGI+GL P+S+I++ ++ FSYCL
Sbjct: 194 STSGRPVLLPKAIIGCGHNNGGSFTEKGSGIVGLGGGPISLISQLGSTIDGKFSYCLVPL 253
Query: 290 ----------PYGSTGYITFGKTDTVNTLR--------------SAFHKRMK----KYKK 321
+GS G ++ G + + S +R+K +
Sbjct: 254 SSNATNSSKLNFGSNGIVSGGGVQSTPLISKDPDTFYFLTLEAVSVGSERIKFPGSSFGT 313
Query: 322 AKG-----------------LEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLEL------ 358
++G +L D A V P + +D +L
Sbjct: 314 SEGNIIIDSGTTLTLFPEDFFSELSSAVQDAVAGTPVEDPSGILSLCYSIDADLKFPSIT 373
Query: 359 ------DVR----GTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLG 408
DV+ T V S + +C F P + GN+ Q V YD+ G+ +
Sbjct: 374 AHFDGADVKLNPLNTFVQVSDTVLCFAFN---PINSGAIFGNLAQMNFLVGYDLEGKTVS 430
Query: 409 FGPGNCS 415
F P +C+
Sbjct: 431 FKPTDCT 437
>gi|413923781|gb|AFW63713.1| hypothetical protein ZEAMMB73_300584, partial [Zea mays]
Length = 190
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 52/153 (33%), Positives = 80/153 (52%), Gaps = 11/153 (7%)
Query: 67 KYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEF--LKRTEAFTFPANI 124
+Y PCS + S P+LEE LR+DQ R + +++ F +++++A T P +
Sbjct: 22 RYDPCSPVP---SKKVPTLEERLRRDQLR-----AAYIKRKFSGAGDIEQSDAATVPTTL 73
Query: 125 NDTVAD-EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
+++ EY I V IG P ++ +DTGSDV+W QCKPC C + D F S S T+
Sbjct: 74 GTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSSSSTYS 133
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECPFNIQYA 216
C+S C L +S C S +C + + Y
Sbjct: 134 PFSCSSAPCAQLSQSQEGNGCMSSQCQYIVNYG 166
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 58/172 (33%), Positives = 85/172 (49%), Gaps = 12/172 (6%)
Query: 122 ANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKT 181
A IN + ++ + + IG P ++ L+DTGSD+ W QC PC+ C++Q P F KS T
Sbjct: 59 APINAYIG-QHLMEIYIGTPPIKITGLVDTGSDLIWIQCAPCLGCYKQIKPMFDPLKSST 117
Query: 182 FFKIPCNSTSCRILRESFPFGNCN-SKECPFNIQYADGSGSGGFWATDRITIQEANSNGY 240
+ I C+S C L G C+ K C + Y D S + G A D T +N+
Sbjct: 118 YNNISCDSPLCHKLDT----GVCSPEKRCNYTYGYGDNSLTKGVLAQDTATF-TSNTGKP 172
Query: 241 FTRYPFLLGC-INNSSGDKSGASGIMGLDRSPVSIITRTNTSY----FSYCL 287
+ FL GC NN+ G G++GL P S+I++ + FS CL
Sbjct: 173 VSLSRFLFGCGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCL 224
>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 456
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 57/104 (54%), Gaps = 3/104 (2%)
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCN 188
+ EY++ + IG P Y +++D+GSD+ W QC+PC C+ Q DP F + S +F + C+
Sbjct: 126 SGEYFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPCDQCYNQTDPIFNPATSASFIGVACS 185
Query: 189 STSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITI 232
S C L + C C + + Y DGS + G A + ITI
Sbjct: 186 SNVCNQLDDDVA---CRKGRCGYQVAYGDGSYTKGTLALETITI 226
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 44/120 (36%), Positives = 61/120 (50%), Gaps = 6/120 (5%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
IT T N R AF + +A G+ + DTCYDL+ + TV VP ++ +F GG L
Sbjct: 341 ITRLPTVAYNAFRDAFIAQTTNLPRAPGVS-IFDTCYDLNGFVTVRVPTVSFYFSGGQIL 399
Query: 357 ELDVRGTLVVA-SVSQVCLGFATYPPDPNSIT-LGNVQQRGHEVHYDVAGRRLGFGPGNC 414
R L+ A V C FA P P+ ++ +GN+QQ G +V D +GFGP C
Sbjct: 400 TFPARNFLIPADDVGTFCFAFA---PSPSGLSIIGNIQQEGIQVSIDGTNGFVGFGPNVC 456
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 102/230 (44%), Gaps = 22/230 (9%)
Query: 63 EVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPA 122
E V++ P +R+ H L +I +S R + K + F
Sbjct: 37 ESVARLNPNARVPITPEDHIKHLTDI-----------SSARFKYLQNSIDKELGSSNFQV 85
Query: 123 NINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQR--DPFFYASKS 179
++ + ++V ++G+P ++DTGS + W QC+PC HC P F + S
Sbjct: 86 DVEQAIKTSLFLVNFSVGQPPVPQLTIMDTGSSLLWIQCQPCKHCSSDHMIHPVFNPALS 145
Query: 180 KTFFKIPCNSTSCRILRESFPFGNC-NSKECPFNIQYADGSGSGGFWATDRITIQEANSN 238
TF + C+ CR P G+C +S +C + Y G+GS G A +R+T N N
Sbjct: 146 STFVECSCDDRFCRYA----PNGHCGSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGN 201
Query: 239 GYFTRYPFLLGC-INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
T+ P GC N +S +GI+GL P S+ + S FSYC+
Sbjct: 202 TVVTQ-PIAFGCGYENGEQLESHFTGILGLGAKPTSLAVQLG-SKFSYCI 249
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 78/261 (29%), Positives = 123/261 (47%), Gaps = 42/261 (16%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + +G P ++LDTGSDV W QC PC C+ Q F +S+++ + C++
Sbjct: 141 EYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYDQSGQVFDPRRSRSYGAVGCSAP 200
Query: 191 SCRILRESFPFGNCN--SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
CR L G C+ K C + + Y DGS + G +AT+ +T G L
Sbjct: 201 LCRRLDS----GGCDLRRKACLYQVAYGDGSVTAGDFATETLTFA-----GGARVARIAL 251
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSI---ITRTNTSYFSYCL--------PSPYGSTGYI 297
GC +++ G A+G++GL R +S I+R FSYCL P+ + ST +
Sbjct: 252 GCGHDNEGLFVAAAGLLGLGRGSLSFPAQISRRYGRSFSYCLVDRTSSANPASHSST--V 309
Query: 298 TFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGV--- 354
TFG +T+ ++F +K + ++T Y Y +V + + GV
Sbjct: 310 TFGSGAVGSTVAASFTPMVKNPR--------METFY----YVQLVGISVGGARVSGVADS 357
Query: 355 DLELDV---RGTLVVASVSQV 372
DL LD RG ++V S + V
Sbjct: 358 DLRLDPSSGRGGVIVDSGTSV 378
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 56/110 (50%), Gaps = 3/110 (2%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+ LR AF + + G L DTCYDLS + V VP +++HF GG + L L+
Sbjct: 387 SALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYLI 446
Query: 366 -VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V S C FA D +GN+QQ+G V +D G+R+GF P C
Sbjct: 447 PVDSKGTFCFAFAGT--DGGVSIIGNIQQQGFRVVFDGDGQRVGFVPKGC 494
>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
Length = 396
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 55/161 (34%), Positives = 79/161 (49%), Gaps = 18/161 (11%)
Query: 138 IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRE 197
IG P Q S ++D ++ WTQC C CF+Q P F + S TF PC + +C+
Sbjct: 49 IGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACK---- 104
Query: 198 SFPFGNCNSKECPF----NIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINN 253
S P NC+ C + NI+ D + G T+ I A ++ F GC+
Sbjct: 105 STPTSNCSGDVCTYESTTNIRL-DRHTTLGIVGTETFAIGTATASLAF-------GCVVA 156
Query: 254 SSGDK-SGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGS 293
S D G SG +GL R+P S++ + + FSYCL SP G+
Sbjct: 157 SDIDTMDGTSGFIGLGRTPRSLVAQMKLTKFSYCL-SPRGT 196
>gi|238479902|ref|NP_001154646.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332643534|gb|AEE77055.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 350
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 84/341 (24%), Positives = 136/341 (39%), Gaps = 43/341 (12%)
Query: 83 PSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPK 142
PS + L D +RLH + RR KP P F+K + + + +Y++ + IG+P
Sbjct: 43 PSPTQALALDTRRLHFLSLRR--KPIP-FVKSPVV-----SGAASGSGQYFVDLRIGQPP 94
Query: 143 QYVSLLLDTGSDVTWTQCKPCIHCFQQRDP-FFYASKSKTFFKIPCNSTSCRILRESFPF 201
Q + L+ DTGSD+ W +C C +C F+ S TF C CR++ +
Sbjct: 95 QSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPKPDRA 154
Query: 202 GNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL-LGCINNSSG 256
CN C + YADGS + G +A + +++ S+G R + GC SG
Sbjct: 155 PICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLK--TSSGKEARLKSVAFGCGFRISG 212
Query: 257 DKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFGKTDTVNTLRSAFHKRM 316
+G +D G+T + F ++ +A +R+
Sbjct: 213 QSVSGNGGTVVDS----------------------GTT--LAFLAEPAYRSVIAAVRRRV 248
Query: 317 KKYKKAKGLEDLLDTCYDLSAYET--VVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCL 374
K A L D C ++S ++P++ F GG R + CL
Sbjct: 249 -KLPIADALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCL 307
Query: 375 GFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ P +GN+ Q+G +D RLGF C+
Sbjct: 308 AIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 348
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 67/220 (30%), Positives = 104/220 (47%), Gaps = 25/220 (11%)
Query: 83 PSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPK 142
PS + L D +RLH + RR KP P F+K + + + +Y++ + IG+P
Sbjct: 43 PSPTQALALDTRRLHFLSLRR--KPIP-FVKSPVV-----SGAASGSGQYFVDLRIGQPP 94
Query: 143 QYVSLLLDTGSDVTWTQCKPCIHCFQQRDP-FFYASKSKTFFKIPCNSTSCRILRESFPF 201
Q + L+ DTGSD+ W +C C +C F+ S TF C CR++ +
Sbjct: 95 QSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPKPDRA 154
Query: 202 GNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL-LGCINNSSG 256
CN C + YADGS + G +A + +++ S+G R + GC SG
Sbjct: 155 PICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLK--TSSGKEARLKSVAFGCGFRISG 212
Query: 257 DK------SGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
+GA+G+MGL R P+S ++ + FSYCL
Sbjct: 213 QSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCL 252
>gi|413953789|gb|AFW86438.1| hypothetical protein ZEAMMB73_078928 [Zea mays]
Length = 155
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 49/108 (45%), Positives = 63/108 (58%), Gaps = 8/108 (7%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDV-RGTLVV 366
LRSAF K M+ Y+ + LDTCY+L+ Y+ VVVPKIA+ F GG + LDV G+LV
Sbjct: 55 LRSAFRKAMEAYRLLPNGD--LDTCYNLTGYKNVVVPKIALTFTGGATINLDVPNGSLV- 111
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
CL FA PD ++ LGNV QR EV +D + + GF C
Sbjct: 112 ----NGCLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 155
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 53/161 (32%), Positives = 88/161 (54%), Gaps = 12/161 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ V IG P + L++DTGSDV W QC PC C++Q D F S +F ++ C++
Sbjct: 13 EYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCSTP 72
Query: 191 SCRILRESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
C++L C S + C + + Y DGS + G A+D ++ ++ P +
Sbjct: 73 QCKLLDVK----ACASTDNRCLYQVSYGDGSFTVGDLASDSFSVSRGRTS------PVVF 122
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
GC +++ G GA+G++GL +S ++ ++ FSYCL S
Sbjct: 123 GCGHDNEGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVS 163
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 54/110 (49%), Gaps = 4/110 (3%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+R AF +K +A L DTCYD SA +V +P ++ HF GG ++L LV
Sbjct: 251 TVMRDAFRSATQKLPRAADFS-LFDTCYDFSALTSVTIPTVSFHFEGGASVQLPPSNYLV 309
Query: 366 -VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V + C F+ D + I GN+QQ+ V D+ R+GF P C
Sbjct: 310 PVDTSGTFCFAFSKTSLDLSII--GNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
Length = 397
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 55/161 (34%), Positives = 79/161 (49%), Gaps = 18/161 (11%)
Query: 138 IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRE 197
IG P Q S ++D ++ WTQC C CF+Q P F + S TF PC + +C+
Sbjct: 49 IGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACK---- 104
Query: 198 SFPFGNCNSKECPF----NIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINN 253
S P NC+ C + NI+ D + G T+ I A ++ F GC+
Sbjct: 105 STPTSNCSGDVCTYESTTNIRL-DRHTTLGIVGTETFAIGTATASLAF-------GCVVA 156
Query: 254 SSGDK-SGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGS 293
S D G SG +GL R+P S++ + + FSYCL SP G+
Sbjct: 157 SDIDTMDGTSGFIGLGRTPRSLVAQMKLTKFSYCL-SPRGT 196
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 66/231 (28%), Positives = 101/231 (43%), Gaps = 18/231 (7%)
Query: 87 EILRQDQQRLHLKNSRRLRK------PFPEF-LKRTEAFTFPANINDTVADEYYIVVAIG 139
E+L + QR L+ + + K P P L P + EY +A+G
Sbjct: 82 ELLARRLQRDELRAAWIISKAAANGTPPPVVGLSTGRGLVAPVVSRAPTSGEYMAKIAVG 141
Query: 140 EPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESF 199
P L LDT SD+TW QC+PC C+ Q P F S ++ ++ ++ C+ L S
Sbjct: 142 TPAVQALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPDCQALGRSG 201
Query: 200 PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD-K 258
G+ C + +QY DG GS D + + G Y +GC +++ G
Sbjct: 202 -GGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGGVRQAY-LSIGCGHDNKGLFG 259
Query: 259 SGASGIMGLDRSPVSIITRTN----TSYFSYCL----PSPYGSTGYITFGK 301
+ A+GI+GL R +SI + + FSYCL P + +TFG
Sbjct: 260 APAAGILGLGRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSSTLTFGA 310
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 38/92 (41%), Positives = 53/92 (57%), Gaps = 2/92 (2%)
Query: 324 GLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-VASVSQVCLGFATYPPD 382
G L DTCY + V VP +++HF GGV++ L + L+ V S VC FA D
Sbjct: 404 GPSGLFDTCYTVGGRAGVKVPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAGTG-D 462
Query: 383 PNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ +GN+ Q+G V YD+AG+R+GF P NC
Sbjct: 463 RSVSVIGNILQQGFRVVYDLAGQRVGFAPNNC 494
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 66/252 (26%), Positives = 103/252 (40%), Gaps = 31/252 (12%)
Query: 73 RLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEY 132
RL + + LEE+ R+D R H + RRL + F + N + Y
Sbjct: 37 RLQRAVPHQGVPLEELRRRDAAR-HRVSRRRLLGGVAGVVD----FPVEGSANPYMVGLY 91
Query: 133 YIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFFKIPC 187
+ V +G P + + +DTGSD+ W C PC C + F S T +I C
Sbjct: 92 FTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRITC 151
Query: 188 NSTSCRILRESFPFG-------NCNSKECPFNIQYADGSGSGGFWATDRITIQE--ANSN 238
+ C F G N S C + Y DGSG+ G++ +D + + N
Sbjct: 152 SDDRC---TAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQ 208
Query: 239 GYFTRYPFLLGCINNSSGDKSGA----SGIMGLDRSPVSIITRTNT-----SYFSYCLPS 289
+ + GC N+ SGD + A GI G + +S+I++ N+ FS+CL
Sbjct: 209 TANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKG 268
Query: 290 PYGSTGYITFGK 301
G + G+
Sbjct: 269 SDNGGGILVLGE 280
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 51/162 (31%), Positives = 86/162 (53%), Gaps = 16/162 (9%)
Query: 80 THAPSLEEILRQDQQRLHLKNSR-RLRKPFPEFLKRTEAFTFPANINDTV-------ADE 131
TH L E L++D++R+ S+ +L K+ EA + ++N V + E
Sbjct: 1 THEQLLLETLQRDERRVRWIESKAKLAGK-----KKDEASS--TDLNGPVTSGLLYGSGE 53
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y++ + +G P + + +++DTGSD+ W QC+PC C++Q DP F S +F +IPC S
Sbjct: 54 YFVRLGLGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSPL 113
Query: 192 CRILR-ESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITI 232
C+ L S + C + + Y DGS S G +++D T+
Sbjct: 114 CKALEVHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTL 155
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/117 (32%), Positives = 53/117 (45%), Gaps = 10/117 (8%)
Query: 302 TDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVR 361
T T+R AF A L DTCY+ S +V VP + +HF G DL+L
Sbjct: 297 TSVYATIRDAFRNATINLPSAPRYS-LFDTCYNFSGKASVDVPALVLHFENGADLQLPPT 355
Query: 362 GTLV-VASVSQVCLGFATYPPDPNSITL---GNVQQRGHEVHYDVAGRRLGFGPGNC 414
L+ + + CL FA P S+ L GN+QQ+ + +D+ L F P C
Sbjct: 356 NYLIPINTAGSFCLAFA-----PTSMELGIIGNIQQQSFRIGFDLQKSHLAFAPQQC 407
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 70/221 (31%), Positives = 96/221 (43%), Gaps = 22/221 (9%)
Query: 86 EEILRQDQQRLHLKNSRRLRKPFPEFLKR-----TEAFTFP---ANINDTVAD-EYYIVV 136
E+LR+ R + SR R + A T P + D D EY I +
Sbjct: 45 RELLRRLATRSRARASRLYSSSSSSSSARPAGAGSHAVTAPLARGTVGDADIDSEYLIHL 104
Query: 137 AIGEPK-QYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRIL 195
+IG P+ Q V+L LDTGSD+ WTQC C CF Q P F A S+T +PC+ C
Sbjct: 105 SIGTPRPQRVALTLDTGSDLVWTQCA-CHVCFAQPFPTFDALASQTTLAVPCSDPICT-- 161
Query: 196 RESFPFGNC--NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL------ 247
+P C N C + YAD S + G D T + N + +
Sbjct: 162 SGKYPLSGCTFNDNTCFYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKAHAGVAVPNVR 221
Query: 248 LGCINNSSGD-KSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
GC + G KS SGI G R P+S+ ++ + FS+C
Sbjct: 222 FGCGQYNKGIFKSNESGIAGFSRGPMSLPSQLKVARFSHCF 262
>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
gi|219886805|gb|ACL53777.1| unknown [Zea mays]
gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
Length = 440
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 66/214 (30%), Positives = 103/214 (48%), Gaps = 33/214 (15%)
Query: 84 SLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQ 143
++EE +R+ +R H RRL T P I+ +Y IG+P Q
Sbjct: 37 TVEERVRRATERTH----RRL--------ASMGGVTAP--IHWGGQSQYIAEYLIGDPPQ 82
Query: 144 YVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFG 202
++DTGS++ WTQC C CF+Q P++ S+S+ + CN +C + E+
Sbjct: 83 RAEAIIDTGSNLIWTQCSRCRPTCFRQNLPYYDPSRSRAARAVGCNDAACALGSET---- 138
Query: 203 NC--NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCI---NNSSGD 257
C ++K C Y G+ +G AT+ +T Q + + GCI S G
Sbjct: 139 QCLSDNKTCAVVTGYGAGNIAGTL-ATENLTFQSETVS-------LVFGCIVVTKLSPGS 190
Query: 258 KSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPY 291
+GASGI+GL R +S+ ++ + FSYCL +PY
Sbjct: 191 LNGASGIIGLGRGKLSLPSQLGDTRFSYCL-TPY 223
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 59/181 (32%), Positives = 91/181 (50%), Gaps = 19/181 (10%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQC------KPCIHCFQQRDPFFYASKSKTFFKIPC 187
+ +A+G P Q V+++LDTGS+++W C F S TF +PC
Sbjct: 65 VSLAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAVPC 124
Query: 188 NSTSC--RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
ST C R L + P + S++C ++ YADGS S G ATD + EA R
Sbjct: 125 GSTQCSSRDL-PAPPSCDGASRQCHVSLSYADGSASDGALATDVFAVGEAPP----LRSA 179
Query: 246 FLLGCIN---NSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFGKT 302
F GC++ +SS D +G++G++R +S +T+ +T FSYC+ S G + G +
Sbjct: 180 F--GCMSTAYDSSPDGVATAGLLGMNRGTLSFVTQASTRRFSYCI-SDRDDAGVLLLGHS 236
Query: 303 D 303
D
Sbjct: 237 D 237
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 67/237 (28%), Positives = 113/237 (47%), Gaps = 28/237 (11%)
Query: 85 LEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTV---ADEYYIVVAIGEP 141
+E+++ DQ+R H SR KR ++ + +Y+ + +G P
Sbjct: 45 IEDVIGADQKR-HSLISR----------KRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTP 93
Query: 142 KQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI-LRESFP 200
+ +++DTGS++TW C+ R F A +SK+F + C + +C++ L F
Sbjct: 94 AKKFRVVVDTGSELTWVNCRYRARGKDNRR-VFRADESKSFKTVGCLTQTCKVDLMNLFS 152
Query: 201 FGNC--NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLGCINNSSGD 257
C S C ++ +YADGS + G +A + IT+ +NG R P L+GC ++ +G
Sbjct: 153 LTTCPTPSTPCSYDYRYADGSAAQGVFAKETITV--GLTNGRMARLPGHLIGCSSSFTGQ 210
Query: 258 K-SGASGIMGLDRSPVSIITRTNTSY---FSYCLP---SPYGSTGYITFGKTDTVNT 307
GA G++GL S S + + Y FSYCL S + Y+ FG + + T
Sbjct: 211 SFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKT 267
Score = 42.7 bits (99), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 23/88 (26%), Positives = 41/88 (46%), Gaps = 2/88 (2%)
Query: 329 LDTCYDL-SAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSIT 387
++ C+ S + +P++ H GG E + LV A+ CLGF + P +
Sbjct: 353 IEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVS-AGTPATNV 411
Query: 388 LGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+GN+ Q+ + +D+ L F P C+
Sbjct: 412 IGNIMQQNYLWEFDLMASTLSFAPSACT 439
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 71/217 (32%), Positives = 101/217 (46%), Gaps = 16/217 (7%)
Query: 103 RLRKPFPEFLKRTEAF-TFPANINDTVAD------EYYIVVAIGEPKQYVSLLLDTGSDV 155
RLR F + R F T +IN D EY++ ++IG P V ++ DTGSD+
Sbjct: 58 RLRNAFSRSISRVNVFKTKAVDINSFQNDLVPNGGEYFMKMSIGTPLVEVIVIADTGSDL 117
Query: 156 TWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQY 215
TW QC PC C++Q+ P F S+S ++ + C S C L S ++ C ++ Y
Sbjct: 118 TWVQCLPCDPCYRQKSPLFDPSRSSSYRHMLCGSRFCNALDVSEQACTMDTNICEYHYSY 177
Query: 216 ADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD----KSGASGIMGLDRSP 271
D S + G AT++ TI +S P + GC + G SG G+ G S
Sbjct: 178 GDKSYTNGNLATEKFTIGSTSSRPVHLS-PIVFGCGTGNGGTFDELGSGIVGLGGGALSL 236
Query: 272 VSIITRTNTSYFSYC---LPSPYGSTGYITFGKTDTV 305
VS ++ FSYC L T I FG TD+V
Sbjct: 237 VSQLSSIIKGKFSYCLVPLSEQSNVTSKIKFG-TDSV 272
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 39/101 (38%), Positives = 58/101 (57%), Gaps = 4/101 (3%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + IG P + ++LDTGSDV W QC+PC C+ Q DP F S S +F + C+S
Sbjct: 7 EYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCDSA 66
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRIT 231
C L + +C+ C + + Y DGS + G +AT+ +T
Sbjct: 67 VCSQLDAN----DCHGGGCLYEVSYGDGSYTVGSYATETLT 103
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 40/115 (34%), Positives = 59/115 (51%), Gaps = 4/115 (3%)
Query: 301 KTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDV 360
+T + LR AF + +A G+ + DTCYDLSA ++V +P + HF G L
Sbjct: 238 QTSAYDALRDAFIAGTQHLPRADGIS-IFDTCYDLSALQSVSIPAVGFHFSNGAGFILPA 296
Query: 361 RGTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ L+ + S+ C FA P D N +GN+QQ+G V +D A +GF C
Sbjct: 297 KNCLIPMDSMGTFCFAFA--PADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 349
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 53/161 (32%), Positives = 87/161 (54%), Gaps = 12/161 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ V IG P + L++DTGSDV W QC PC C++Q D F S +F ++ C++
Sbjct: 13 EYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCSTP 72
Query: 191 SCRILRESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
C++L C S + C + + Y DGS + G A+D + ++ P +
Sbjct: 73 QCKLLDVK----ACASTDNRCLYQVSYGDGSFTVGDLASDSFLVSRGRTS------PVVF 122
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
GC +++ G GA+G++GL +S ++ ++ FSYCL S
Sbjct: 123 GCGHDNEGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVS 163
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 54/110 (49%), Gaps = 4/110 (3%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+R AF +K +A L DTCYD SA +V +P ++ HF GG ++L LV
Sbjct: 251 TVMRDAFRSATQKLPRAADFS-LFDTCYDFSALTSVTIPTVSFHFEGGASVQLPPSNYLV 309
Query: 366 -VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V + C F+ D + I GN+QQ+ V D+ R+GF P C
Sbjct: 310 PVDTSGTFCFAFSKTSLDLSII--GNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 63/125 (50%), Gaps = 5/125 (4%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ V +G P ++ SL+LDTGSD+ W QC PC CFQQ F+ S ++ I CN
Sbjct: 154 EYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCHDCFQQNGAFYDPKASASYKNITCNDP 213
Query: 191 SCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRY---P 245
C ++ P C S + CP+ Y D S + G +A + T+ S G Y
Sbjct: 214 RCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGSSELYNVEN 273
Query: 246 FLLGC 250
+ GC
Sbjct: 274 MMFGC 278
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 43/88 (48%), Gaps = 1/88 (1%)
Query: 328 LLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSIT 387
+LD C+++S +++ +P++ I F G + + + VCL P SI
Sbjct: 432 ILDPCFNVSGIDSIQLPELGIAFADGAVWNFPTENSFIWLNEDLVCLAILGTPKSAFSI- 490
Query: 388 LGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+GN QQ+ + YD RLG+ P C+
Sbjct: 491 IGNYQQQNFHILYDTKRSRLGYAPTKCA 518
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 86/162 (53%), Gaps = 7/162 (4%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
E+++ + IG P V + DTGSD+TW QCKPC C+++ P F KS T+ PC+S
Sbjct: 84 EFFMSITIGTPPMKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSR 143
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLG 249
+C L S + + C + Y D S S G AT+ I+I A +G +P + G
Sbjct: 144 NCHALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSA--SGSPVSFPGTVFG 201
Query: 250 CINNSSGD-KSGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
C N+ G SGI+GL +S+I++ +S FSYCL
Sbjct: 202 CGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCL 243
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 57/192 (29%), Positives = 92/192 (47%), Gaps = 20/192 (10%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
+ + V +G P Q ++LD GSD+ WTQC +Q +P F A++S +F +PC+S
Sbjct: 107 HSLTVGVGTPPQPSKVILDLGSDLLWTQCSLVGPTAKQLEPVFDAARSSSFSVLPCDSKL 166
Query: 192 CRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCI 251
C +F C ++C + Y + + G AT+ T +G F GC
Sbjct: 167 CEA--GTFTNKTCTDRKCAYENDYGIMTAT-GVLATETFTF--GAHHGVSANLTF--GCG 219
Query: 252 NNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL---------PSPYGSTG----YIT 298
++G + ASGI+GL P+S++ + + FSYCL P +G+ Y T
Sbjct: 220 KLANGTIAEASGILGLSPGPLSMLKQLAITKFSYCLTPFADRKTSPVMFGAMADLGKYKT 279
Query: 299 FGKTDTVNTLRS 310
GK T+ L++
Sbjct: 280 TGKVQTIPLLKN 291
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/175 (31%), Positives = 90/175 (51%), Gaps = 14/175 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ + +G P + +++D+GSD+ W QC+PC C+ Q DP F + S +F + C ST
Sbjct: 135 EYFVRIGVGSPPRNQYVVMDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSSFSGVSCAST 194
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C + + C+ C + + Y DGS + G A + IT G +GC
Sbjct: 195 VCSHVDNA----ACHEGRCRYEVSYGDGSYTKGTLALETITF------GRTLIRNVAIGC 244
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITR---TNTSYFSYCLPS-PYGSTGYITFGK 301
+++ G GA+G++GL P+S + + FSYCL S S+G + FG+
Sbjct: 245 GHHNQGMFVGAAGLLGLGGGPMSFVGQLGGQTGGAFSYCLVSRGIESSGLLEFGR 299
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 54/114 (47%), Gaps = 4/114 (3%)
Query: 302 TDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVR 361
T R F + +A G+ + DTCYDL + +V VP ++ +F GG L L R
Sbjct: 365 TVAYEAFRDGFIAQTTNLPRASGVS-IFDTCYDLFGFVSVRVPTVSFYFSGGPILTLPAR 423
Query: 362 GTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L+ V V C FA P +GN+QQ G ++ D A +GFGP C
Sbjct: 424 NFLIPVDDVGTFCFAFA--PSSSGLSIIGNIQQEGIQISVDGANGFVGFGPNVC 475
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 57/101 (56%), Gaps = 4/101 (3%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + +G P + ++LDTGSDV W QC+PC C+ Q DP F S S +F + C+S
Sbjct: 156 EYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRECYSQADPIFNPSYSASFSTVGCDSA 215
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRIT 231
C L +C+S C + Y DGS S G +AT+ +T
Sbjct: 216 VCSQLDAY----DCHSGGCLYEASYGDGSYSTGSFATETLT 252
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 55/115 (47%), Gaps = 6/115 (5%)
Query: 302 TDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVR 361
T + +R AF + + + + DTCYDLS + V VP + HF G L L +
Sbjct: 388 TSAYDAVRDAFVAGTGQLPRTDAVS-IFDTCYDLSGLQFVSVPTVGFHFSNGASLILPAK 446
Query: 362 GTLV-VASVSQVCLGFATYPPDPNSIT-LGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L+ + +V C FA P +S++ +GN QQ+ V +D A +GF C
Sbjct: 447 NYLIPMDTVGTFCFAFA---PAASSVSIMGNTQQQHIRVSFDSANSLVGFAFDQC 498
>gi|194702702|gb|ACF85435.1| unknown [Zea mays]
gi|414885969|tpg|DAA61983.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 163
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 46/109 (42%), Positives = 61/109 (55%), Gaps = 2/109 (1%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
LRS+ M +YK+A + DTCYDL+ +ETV +P +A+ F G + L G L
Sbjct: 56 LRSSVRSAMGRYKRAPS-STIFDTCYDLTGHETVRIPSVALVFADGATVHLHPSGVLYTW 114
Query: 368 S-VSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
S VSQ CL F P D + LGN QQR V YDV +++GFG C+
Sbjct: 115 SNVSQTCLAFLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGFGANGCA 163
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 76/129 (58%), Gaps = 8/129 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + ++G P + ++DTGSD+ W QC+PC C+ Q P F S+SKT+ +PC+S
Sbjct: 93 EYLMSYSVGTPPFQILGIVDTGSDIIWLQCQPCEDCYNQTTPIFDPSQSKTYKTLPCSSN 152
Query: 191 SCRILRESFPFGNC--NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FL 247
C+ ++ + +C N+ EC + I Y D S S G + + +T+ +++G ++P +
Sbjct: 153 ICQSVQSA---ASCSSNNDECEYTITYGDNSHSQGDLSVETLTL--GSTDGSSVQFPKTV 207
Query: 248 LGCINNSSG 256
+GC +N+ G
Sbjct: 208 IGCGHNNKG 216
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 26/98 (26%), Positives = 45/98 (45%), Gaps = 4/98 (4%)
Query: 318 KYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFA 377
+ ++ + L CY ++ + + VP I HF G D+EL+ T + VC F
Sbjct: 349 ELERVEDPSKFLRLCYRTTSSDELNVPVITAHF-KGADVELNPISTFIEVDEGVVCFAFR 407
Query: 378 TYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ P GN+ Q+ V YD+ + + F P +C+
Sbjct: 408 SSKIGP---IFGNLAQQNLLVGYDLVKQTVSFKPTDCT 442
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 66/252 (26%), Positives = 103/252 (40%), Gaps = 31/252 (12%)
Query: 73 RLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEY 132
RL + + LEE+ R+D R H + RRL + F + N + Y
Sbjct: 35 RLQRAVPHKGVPLEELRRRDAAR-HRVSRRRLLGGVAGVVD----FPVEGSANPYMVGLY 89
Query: 133 YIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFFKIPC 187
+ V +G P + + +DTGSD+ W C PC C + F S T +I C
Sbjct: 90 FTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRITC 149
Query: 188 NSTSCRILRESFPFG-------NCNSKECPFNIQYADGSGSGGFWATDRITIQE--ANSN 238
+ C F G N S C + Y DGSG+ G++ +D + + N
Sbjct: 150 SDDRC---TAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQ 206
Query: 239 GYFTRYPFLLGCINNSSGDKSGA----SGIMGLDRSPVSIITRTNT-----SYFSYCLPS 289
+ + GC N+ SGD + A GI G + +S+I++ N+ FS+CL
Sbjct: 207 TANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKG 266
Query: 290 PYGSTGYITFGK 301
G + G+
Sbjct: 267 SDNGGGILVLGE 278
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 91/348 (26%), Positives = 132/348 (37%), Gaps = 77/348 (22%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y + +++G P + + DTGSD+ W Q +PC C F +S TF ++ C+S
Sbjct: 55 YVMDISVGTPGKRFRAIADTGSDLVWVQSEPCTGC--SGGTIFDPRQSSTFREMDCSSQL 112
Query: 192 CRILRESFPFGNC--NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLL 248
C L G+C S C ++ +Y G G F R TI ++G ++P F +
Sbjct: 113 CTELP-----GSCEPGSSACSYSYEYGSGETEGEF---ARDTISLGTTSGGSQKFPSFAV 164
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNT---SYFSYCL------------------ 287
GC +SG G G++GL + PVS+ ++ + S FSYCL
Sbjct: 165 GCGMVNSG-FDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSA 223
Query: 288 -------------------PSPY--------------GSTGYITFGKTDTVNTLRSAFHK 314
P+ Y GS G T+ + S +
Sbjct: 224 ALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSPGTTIIDSGTTLTYVPSGVYG 283
Query: 315 R-------MKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVV- 366
R M + G LD CYD S+ P + I G LVV
Sbjct: 284 RVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAGATMTPPSSNYFLVVD 343
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
S VCL + P SI +GNV Q+G+ + YD L F C
Sbjct: 344 DSGDTVCLAMGSAGGLPVSI-IGNVMQQGYHILYDRGSSELSFVQAKC 390
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 56/183 (30%), Positives = 88/183 (48%), Gaps = 11/183 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
+ + V+IG P Q +L+LDTGSD+ WTQCK + P + +KS +F PC+
Sbjct: 88 HHTLTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQHREKPLYDPAKSSSFAAAPCDGR 147
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C SF NC+ +C + Y + G A++ T E GC
Sbjct: 148 LCET--GSFNTKNCSRNKCIYTYNYGSATTKGEL-ASETFTFGEHRR----VSVSLDFGC 200
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPY---GSTGYITFGKTDTVNT 307
+SG GASGI+G+ +S++++ FSYCL +P+ +T +I FG ++
Sbjct: 201 GKLTSGSLPGASGILGISPDRLSLVSQLQIPRFSYCL-TPFLDRNTTSHIFFGAMADLSK 259
Query: 308 LRS 310
R+
Sbjct: 260 YRT 262
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 40/92 (43%), Gaps = 9/92 (9%)
Query: 330 DTCYDL-----SAYETVV-VPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDP 383
+ C+ L A ET V VP + HF GG + L +V S ++CL ++
Sbjct: 355 ELCFQLPRNGGGAVETAVQVPPLVYHFDGGAAMLLRRDSYMVEVSAGRMCLVISS---GA 411
Query: 384 NSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+GN QQ+ V +DV F P C+
Sbjct: 412 RGAIIGNYQQQNMHVLFDVENHEFSFAPTQCN 443
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 61/199 (30%), Positives = 93/199 (46%), Gaps = 24/199 (12%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQ-----------QRDPFFYASKS 179
+Y + +G P Q L+ DTGSD+TW CK HC + F+A+ S
Sbjct: 11 QYSVAFKVGTPSQKFMLVADTGSDLTWMSCK--YHCRSRNCSNRKARRIRHKRVFHANLS 68
Query: 180 KTFFKIPCNSTSCRI-LRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEAN 236
+F IPC + C+I L + F NC + C ++ +Y+DGS + GF+A + +T+ E
Sbjct: 69 SSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTV-ELK 127
Query: 237 SNGYFTRYPFLLGCINNSSGDK-SGASGIMGLDRSPVSIITRTNTSY---FSYCLP---S 289
+ L+GC + G A G+MGL S S + + FSYCL S
Sbjct: 128 EGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLS 187
Query: 290 PYGSTGYITFGKTDTVNTL 308
+ Y+TFG + + L
Sbjct: 188 HKNVSNYLTFGSSRSKEAL 206
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/119 (26%), Positives = 62/119 (52%), Gaps = 1/119 (0%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+TF + +A + K++K + L+ C++ + +E +VP++ HF G +
Sbjct: 261 LTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGFEESLVPRLVFHFADGAEF 320
Query: 357 ELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
E V+ ++ A+ CLGF + P + +GN+ Q+ H +D+ ++LGF P +C+
Sbjct: 321 EPPVKSYVISAADGVRCLGFVSVAW-PGTSVVGNIMQQNHLWEFDLGLKKLGFAPSSCT 378
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 68/237 (28%), Positives = 113/237 (47%), Gaps = 28/237 (11%)
Query: 85 LEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANIN---DTVADEYYIVVAIGEP 141
+E+++ DQ+R H SR KR ++ D +Y+ + +G P
Sbjct: 67 IEDVIGADQKR-HSLISR----------KRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTP 115
Query: 142 KQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI-LRESFP 200
+ +++DTGS++TW C+ R F A +SK+F + C + +C++ L F
Sbjct: 116 AKKFRVVVDTGSELTWVNCRYRARGKDNRR-VFRADESKSFKTVGCLTQTCKVDLMNLFS 174
Query: 201 FGNC--NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLGCINNSSGD 257
C S C ++ +YADGS + G +A + IT+ +NG R P L+GC ++ +G
Sbjct: 175 LTTCPTPSTPCSYDYRYADGSAAQGVFAKETITV--GLTNGRMARLPGHLIGCSSSFTGQ 232
Query: 258 K-SGASGIMGLDRSPVSIITRTNTSY---FSYCLP---SPYGSTGYITFGKTDTVNT 307
GA G++GL S S + + Y FSYCL S + Y+ FG + + T
Sbjct: 233 SFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKT 289
Score = 43.1 bits (100), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 23/88 (26%), Positives = 41/88 (46%), Gaps = 2/88 (2%)
Query: 329 LDTCYDL-SAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSIT 387
++ C+ S + +P++ H GG E + LV A+ CLGF + P +
Sbjct: 375 IEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVS-AGTPATNV 433
Query: 388 LGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+GN+ Q+ + +D+ L F P C+
Sbjct: 434 IGNIMQQNYLWEFDLMASTLSFAPSACT 461
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 58/101 (57%), Gaps = 4/101 (3%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ + +G P + +++D+GSD+ W QC+PC C+QQ DP F + S T+ I C+S+
Sbjct: 136 EYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSATYAGISCDSS 195
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRIT 231
C L + CN C + + Y DGS + G A + +T
Sbjct: 196 VCDRLDNA----GCNDGRCRYEVSYGDGSYTRGTLALETLT 232
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 46/88 (52%), Gaps = 3/88 (3%)
Query: 328 LLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-VASVSQVCLGFATYPPDPNSI 386
+ DTCY+L+ + +V VP ++ +F GG L L R L+ V C FA + I
Sbjct: 391 IFDTCYNLNGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDGEGTFCFAFAASASGLSII 450
Query: 387 TLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
GN+QQ G ++ D + +GFGP C
Sbjct: 451 --GNIQQEGIQISIDGSNGFVGFGPTIC 476
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 63/125 (50%), Gaps = 5/125 (4%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ V +G P ++ SL+LDTGSD+ W QC PC CF Q F+ S +F I CN
Sbjct: 159 EYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYDPKTSASFKNITCNDP 218
Query: 191 SCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP--- 245
C ++ P C S + CP+ Y D S + G +A + T+ + G + Y
Sbjct: 219 RCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKVGN 278
Query: 246 FLLGC 250
+ GC
Sbjct: 279 MMFGC 283
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 52/110 (47%), Gaps = 3/110 (2%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYE--TVVVPKIAIHFLGGVDLELDVRGTLV 365
+++ F ++MK+ +LD C+++S E + +P++ I F+ G + +
Sbjct: 417 IKNKFAEKMKENYPIFRDFPVLDPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFI 476
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
S VCL P SI +GN QQ+ + YD RLGF P C+
Sbjct: 477 WLSEDLVCLAILGTPKSTFSI-IGNYQQQNFHILYDTKRSRLGFTPTKCA 525
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 56/173 (32%), Positives = 89/173 (51%), Gaps = 14/173 (8%)
Query: 122 ANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKT 181
A IN + +Y + + IG P +S +DTGSD+ W QC PC+ C+ Q +P F KS T
Sbjct: 55 APINAYIG-QYLMELYIGTPPIKISGTVDTGSDLIWVQCVPCLGCYNQINPMFDPLKSST 113
Query: 182 FFKIPCNSTSCRILRESFPF-GNCN-SKECPFNIQYADGSGSGGFWATDRITIQEANSNG 239
+ I C+S C P+ G C+ K C + YAD S + G A + +T+ +N+
Sbjct: 114 YTNISCDSPLCYK-----PYIGECSPEKRCDYTYGYADSSLTKGVLAQETVTL-TSNTGK 167
Query: 240 YFTRYPFLLGCINNSSGD-KSGASGIMGLDRSPVSIITRTNTSY----FSYCL 287
+ L GC +N++G+ G++GL P S++++ + FS CL
Sbjct: 168 PISLQGILFGCGHNNTGNFNDHEMGLIGLGGGPTSLVSQIGPLFGGKKFSQCL 220
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 67/125 (53%), Gaps = 3/125 (2%)
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCN 188
A EY++ V +G P ++ L++DTGSD+TW QCKPC CF Q P F S+S +F IPCN
Sbjct: 168 AGEYFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCN 227
Query: 189 STSCRILRESFPFGNCNS---KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
+ +C ++ N + K C + Y D S + G A + +++ ++
Sbjct: 228 AAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRD 287
Query: 246 FLLGC 250
++GC
Sbjct: 288 MVIGC 292
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 42/134 (31%), Positives = 59/134 (44%), Gaps = 10/134 (7%)
Query: 289 SPYGSTGYI-------TFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETV 341
+P GS G I T+ D + SAF R+ Y +A D+L CY+ + V
Sbjct: 401 APNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARIS-YPRADPF-DILGICYNATGRTAV 458
Query: 342 VVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYD 401
P ++I F G +L+L + + A P D SI +GN QQ+ YD
Sbjct: 459 PFPTLSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDGMSI-IGNFQQQNIHFLYD 517
Query: 402 VAGRRLGFGPGNCS 415
V RLGF +CS
Sbjct: 518 VQHARLGFANTDCS 531
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 67/125 (53%), Gaps = 3/125 (2%)
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCN 188
A EY++ V +G P ++ L++DTGSD+TW QCKPC CF Q P F S+S +F IPCN
Sbjct: 84 AGEYFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCN 143
Query: 189 STSCRILRESFPFGNCNS---KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
+ +C ++ N + K C + Y D S + G A + +++ ++
Sbjct: 144 AAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRD 203
Query: 246 FLLGC 250
++GC
Sbjct: 204 MVIGC 208
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 54/119 (45%), Gaps = 3/119 (2%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T+ D + SAF R+ Y +A D+L CY+ + V P ++I F G +L
Sbjct: 332 LTYLNRDAYRAVESAFLARIS-YPRADPF-DILGICYNATGRAAVPFPALSIVFQNGAEL 389
Query: 357 ELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+L + + A P D SI +GN QQ+ YDV RLGF +CS
Sbjct: 390 DLPQENYFIQPDPQEAKHCLAILPTDGMSI-IGNFQQQNIHFLYDVQHARLGFANTDCS 447
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 86/360 (23%), Positives = 144/360 (40%), Gaps = 84/360 (23%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
+Y++ + +G P Q +L+ DTGSD+TW +C F S+++ IPC+S
Sbjct: 115 QYFVKLRVGTPVQEFTLVADTGSDLTWVKCAGA----SPPGRVFRPKTSRSWAPIPCSSD 170
Query: 191 SCRILRESFPFGNCNS--KECPFNIQYADGS-GSGGFWATDRITIQEANSNGYFTRYP-F 246
+C+ L F NC+S C ++ +Y +GS G+ G T+ TI A G +
Sbjct: 171 TCK-LDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATI--ALPGGKVAQLKDV 227
Query: 247 LLGCINNSSGDK-SGASGIMGLDRSPVSIITRTNTSY---FSYCLP---SPYGSTGYITF 299
+LGC ++ G A G++ L + +S T+ + FSYCL +P +TGY+ F
Sbjct: 228 VLGCSSSHDGQSFRSADGVLSLGNAKISFATQAAARFGGSFSYCLVDHLAPRNATGYLAF 287
Query: 300 G-------------------------KTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYD 334
G K D ++ A + + AK +LD+
Sbjct: 288 GPGQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAEVW-DAKSGGVILDSGNT 346
Query: 335 LS-----AYETVV----------------------------------VPKIAIHFLGGVD 355
L+ AY+ VV +PK+A+ F G
Sbjct: 347 LTVLAAPAYKAVVAALSKHLDGVPKVSFPPFEHCYNWTARRPGAPEIIPKLAVQFAGSAR 406
Query: 356 LELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
LE + ++ C+G P +GN+ Q+ H +D+ ++ F NC+
Sbjct: 407 LEPPAKSYVIDVKPGVKCIGVQEG-EWPGLSVIGNIMQQEHLWEFDLKNMQVRFKQSNCT 465
>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
Length = 366
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 57/102 (55%), Gaps = 4/102 (3%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + +G P + ++LDTGSDV W QC+PC C+ Q DP F S S +F + C+S
Sbjct: 156 EYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRECYSQADPIFNPSYSASFSTVGCDSA 215
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITI 232
C L +C+S C + Y DGS S G +AT+ +T
Sbjct: 216 VCSQLDAY----DCHSGGCLYEASYGDGSYSTGSFATETLTF 253
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 55/153 (35%), Positives = 85/153 (55%), Gaps = 12/153 (7%)
Query: 138 IGEPKQYVSLLLDTGSDVTWTQCKPCI---HCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
+G+P+Q +LDTGSDVTW QC PC C++Q P F S ++ + C+S C++
Sbjct: 3 VGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQL 62
Query: 195 LRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNS 254
L E+ CN C + ++Y DGS + G AT+ +T +NS + +GC +++
Sbjct: 63 LDEA----GCNVNSCIYKVEYGDGSFTIGELATETLTFVHSNSIPNIS-----IGCGHDN 113
Query: 255 SGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
G GA G++GL +SI ++ S FSYCL
Sbjct: 114 EGLFVGADGLIGLGGGAISISSQLKASSFSYCL 146
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/121 (35%), Positives = 54/121 (44%), Gaps = 8/121 (6%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
IT +D LR AF A + DTCYDLS+ V VP IA G L
Sbjct: 221 ITQLPSDVYEVLREAFLGLTTNLPPAPEISPF-DTCYDLSSQSNVEVPTIAFILPGENSL 279
Query: 357 ELDVRGTLV-VASVSQVCLGF--ATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
+L + L+ V S CL F AT+P +GN QQ+G V YD+ +GF
Sbjct: 280 QLPAKNCLIQVDSAGTFCLAFVSATFPLS----IIGNFQQQGIRVSYDLTNSLVGFSTNK 335
Query: 414 C 414
C
Sbjct: 336 C 336
>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
Length = 467
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 57/196 (29%), Positives = 87/196 (44%), Gaps = 16/196 (8%)
Query: 132 YYIVVAIGEPKQYVS---LLLDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFF 183
Y + + IG P +S +L DTGSD++WTQC+PC +C + DP SKS+TF
Sbjct: 102 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDP----SKSRTFR 157
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA-NSNGYFT 242
++ C C + G S C F +Y DG G +D A + GY
Sbjct: 158 RLSCFDPMCELCTAVVD-GGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 216
Query: 243 RYPFLLGC--INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFG 300
GC + +S + ++GI+ L S +T+ FSYC+P+ +
Sbjct: 217 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVDRFSYCIPASEITDDDDDDD 276
Query: 301 KTDTVNTLRSAFHKRM 316
+ + + LR H RM
Sbjct: 277 EERSASFLRFGSHARM 292
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 62/215 (28%), Positives = 96/215 (44%), Gaps = 13/215 (6%)
Query: 79 STHAPSLEEILRQDQQRLHLKNSRRLRKPFP---EFLKRTEAFTFPANINDTVADEYYIV 135
+T A L L++D R S+ P L F P + EY
Sbjct: 82 ATPAQLLARRLQRDVLRAAWIISKAAANGTPPPVAGLSSARGFVAPVVSRAPTSGEYIAK 141
Query: 136 VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRIL 195
+A+G P L LDT SD+TW QC+PC C+ Q P F S ++ ++ N+ C+ L
Sbjct: 142 IAVGTPGVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYREMSFNAADCQAL 201
Query: 196 RESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL-LGCINNS 254
S G+ C + + Y DGS + G + + +T R P + +GC +++
Sbjct: 202 GRS-GGGDAKRGTCVYTVGYGDGSTTVGDFIEETLTFAGG------VRLPRISIGCGHDN 254
Query: 255 SGD-KSGASGIMGLDRSPVSIITRTN-TSYFSYCL 287
G + A+GI+GL R +S + + FSYCL
Sbjct: 255 KGLFGAPAAGILGLGRGLMSFPNQIDHNGTFSYCL 289
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 51/92 (55%), Gaps = 3/92 (3%)
Query: 324 GLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-VASVSQVCLGFATYPPD 382
G DTCY + VP +++HF G V+++L + L+ V S+ VC FA
Sbjct: 401 GPSGFFDTCYTVGGRGMKKVPTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDH 460
Query: 383 PNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
SI +GN+QQ+G + YD+ G R+GF P +C
Sbjct: 461 SVSI-IGNIQQQGFRIVYDIGG-RVGFAPNSC 490
>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
Length = 488
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 57/196 (29%), Positives = 87/196 (44%), Gaps = 16/196 (8%)
Query: 132 YYIVVAIGEPKQYVS---LLLDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFF 183
Y + + IG P +S +L DTGSD++WTQC+PC +C + DP SKS+TF
Sbjct: 123 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDP----SKSRTFR 178
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA-NSNGYFT 242
++ C C + G S C F +Y DG G +D A + GY
Sbjct: 179 RLSCFDPMCELCTAVVD-GGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 237
Query: 243 RYPFLLGC--INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFG 300
GC + +S + ++GI+ L S +T+ FSYC+P+ +
Sbjct: 238 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVDRFSYCIPASEITDDDDDDD 297
Query: 301 KTDTVNTLRSAFHKRM 316
+ + + LR H RM
Sbjct: 298 EERSASFLRFGSHARM 313
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 62/111 (55%), Gaps = 2/111 (1%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+I V +G P ++ SL+LDTGSD+ W QC PC CF+Q P + +S ++ I C+ +
Sbjct: 180 EYFIDVFVGTPPKHFSLILDTGSDLNWIQCVPCYECFEQNGPHYDPGQSSSYRNIGCHDS 239
Query: 191 SCRILRESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNG 239
C ++ P C ++ CP+ Y D S + G +A + T+ S+G
Sbjct: 240 RCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSG 290
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 49/110 (44%), Gaps = 3/110 (2%)
Query: 307 TLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV- 365
++ AF ++K Y K +L+ CY+++ E +P I F G V +
Sbjct: 437 VIKEAFMAKVKGYPVVKDFP-VLEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVENYFIE 495
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ VCL PP SI +GN QQ+ + YD RLGF P C+
Sbjct: 496 IEPREVVCLAILGTPPSALSI-IGNYQQQNFHILYDTKKSRLGFAPTKCA 544
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 49/169 (28%), Positives = 83/169 (49%), Gaps = 16/169 (9%)
Query: 80 THAPSLEEILRQDQQRLHLKNSR------------RLRKPFPEFLKRTEAFTFPA-NIND 126
++A +++ L++D R+ NSR F F P + D
Sbjct: 80 SYAERMQQRLKRDAARVAAINSRLELAVNGIKRSSLKPDSSSSFTMAESDFQSPVVSGMD 139
Query: 127 TVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIP 186
+ EY+ + +G P++ ++LDTGSDVTW QC+PC C+QQ DP + + S ++ +
Sbjct: 140 QGSGEYFSRIGVGAPRRDQLMVLDTGSDVTWIQCEPCSDCYQQSDPIYNPALSSSYKLVG 199
Query: 187 CNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA 235
C + C+ L S G + C + + Y DGS + G +AT+ +T+ A
Sbjct: 200 CQANLCQQLDVS---GCSRNGSCLYQVSYGDGSYTQGNFATETLTLGGA 245
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 45/120 (37%), Positives = 65/120 (54%), Gaps = 6/120 (5%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T +T ++LR AF K G+ L DTCYDLS+ E+V VP + HF GG +
Sbjct: 370 VTRLQTAAYDSLRDAFRAGTKNLPSTDGVS-LFDTCYDLSSKESVDVPTVVFHFSGGGSM 428
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSITL-GNVQQRGHEVHYDVAGRRLGFGPGNC 414
L + LV V S+ C FA P +S+++ GN+QQ+G V +D A ++GF C
Sbjct: 429 SLPAKNYLVPVDSMGTFCFAFA---PTSSSLSIVGNIQQQGIRVSFDRANNQVGFAVNKC 485
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 59/166 (35%), Positives = 80/166 (48%), Gaps = 18/166 (10%)
Query: 128 VADEYYIVVA-IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIP 186
V YIV A IG P Q + + LDT +D W C C+ C F SKS + +
Sbjct: 83 VQSPTYIVRANIGTPAQAMLVALDTSNDAAWIPCSGCVGC--SSSVLFDPSKSSSSRTLQ 140
Query: 187 CNSTSCRILRESFPFGNCN-SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C + C+ P +C SK C FN+ Y GS + D +T+ + Y
Sbjct: 141 CEAPQCK----QAPNPSCTVSKSCGFNMTYG-GSAIEAYLTQDTLTL----ATDVIPNYT 191
Query: 246 FLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLP 288
F GCIN +SG A G+MGL R P+S+I+++ Y FSYCLP
Sbjct: 192 F--GCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLP 235
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 60/199 (30%), Positives = 96/199 (48%), Gaps = 31/199 (15%)
Query: 113 KRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDP 172
K T F N+ T + + IG P Q ++++LDTGS+++W +CK ++P
Sbjct: 54 KTTGKLLFHHNVTLTAS------LTIGTPPQNITMVLDTGSELSWLRCK--------KEP 99
Query: 173 ----FFYASKSKTFFKIPCNSTSC--RILRESFPFGNCNSKECPFNIQYADGSGSGGFWA 226
F SKT+ KIPC+S +C R + P +K C F I YAD S G A
Sbjct: 100 NFTSIFNPLASKTYTKIPCSSQTCKTRTSDLTLPVTCDPAKLCHFIISYADASSVEGHLA 159
Query: 227 TDRITIQEANSNGYFTRYPFLLGCINNSSG----DKSGASGIMGLDRSPVSIITRTNTSY 282
+ G TR + GC+++ S + + +G+MG++R +S + +
Sbjct: 160 FETFRF------GSLTRPATVFGCMDSGSSSNTEEDAKTTGLMGMNRGSLSFVNQMGFRK 213
Query: 283 FSYCLPSPYGSTGYITFGK 301
FSYC+ S STG++ G+
Sbjct: 214 FSYCI-SGLDSTGFLLLGE 231
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 61/198 (30%), Positives = 97/198 (48%), Gaps = 20/198 (10%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + +G P ++LDTGSDV W QC PC C++Q F +S+++ + C +
Sbjct: 139 EYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYEQSGQVFDPRRSRSYNAVGCAAP 198
Query: 191 SCRILRESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
CR L G C+ + C + + Y DGS + G +AT+ +T G L
Sbjct: 199 LCRRLDS----GGCDLRRSACLYQVAYGDGSVTAGDFATETLTFA-----GGARVARVAL 249
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGS------TGYITF 299
GC +++ G A+G++GL R +S T+ + Y FSYCL S + +TF
Sbjct: 250 GCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGRSFSYCLVDRTSSANTASRSSTVTF 309
Query: 300 GKTDTVNTLRSAFHKRMK 317
G +T+ S+F +K
Sbjct: 310 GSGAVGSTVASSFTPMVK 327
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 55/110 (50%), Gaps = 3/110 (2%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+ LR AF + + G L DTCYDLS + V VP +++HF GG + L L+
Sbjct: 385 SALRDAFRGAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYLI 444
Query: 366 -VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V S C FA D +GN+QQ+G V +D G+R+ F P C
Sbjct: 445 PVDSKGTFCFAFAGT--DGGVSIIGNIQQQGFRVVFDGDGQRVAFTPKGC 492
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 108/452 (23%), Positives = 171/452 (37%), Gaps = 98/452 (21%)
Query: 50 RTALPQG--PDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKP 107
R P+G P + LE+V G S + +++ R R L +SRR R+
Sbjct: 26 RHQRPRGRKPARPRLELVPA-------APGASLSDRARDDLHRHAYIRSQLASSRRGRR- 77
Query: 108 FPEFLKRTEAFTFPANIND-TVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC 166
AF P + T +Y++ +G P Q L+ DTGSD+TW +C+
Sbjct: 78 --AAEVGASAFAMPLSSGAYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAGAA 135
Query: 167 FQQRDP----FFYASKSKTFFKIPCNSTSCRILRESFPFGNCNS--KECPFNIQYADGSG 220
F + SK++ I C+S +C F NC+S C ++ +Y DGS
Sbjct: 136 AGTGAGSPARVFRTAASKSWAPIACSSDTCTSY-VPFSLANCSSPASPCAYDYRYRDGSA 194
Query: 221 SGGFWATDRITIQEANSNGYFTRYP----------FLLGCINNSSGDKSGAS-GIMGLDR 269
+ G TD TI ++ +G +LGC G +S G++ L
Sbjct: 195 ARGVVGTDSATIALSSGSGRGGGDSSGGRRAKLQGVVLGCAATYDGQSFQSSDGVLSLGN 254
Query: 270 SPVSIITRTNTSY---FSYCLP---SPYGSTGYITFGKTDTVNTLRSA--FHKRMKK--- 318
S +S +R + FSYCL +P +T Y+TFG T ++ +RM
Sbjct: 255 SNISFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPGATAPAAQTPLLLDRRMTPFYA 314
Query: 319 ------YKKAKGLE-------------DLLDTCYDLS-----AYETVV------------ 342
Y + L+ +LD+ L+ AY VV
Sbjct: 315 VTVDAVYVAGEALDIPADVWDVDRNGGAILDSGTSLTILATPAYRAVVTALSKHLAGLPR 374
Query: 343 -------------------VPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDP 383
+PK+ +HF G LE + ++ A+ C+G P
Sbjct: 375 VTMDPFEYCYNWTDAGALEIPKMEVHFAGSARLEPPAKSYVIDAAPGVKCIGVQE-GSWP 433
Query: 384 NSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+GN+ Q+ H +D+ R L F C+
Sbjct: 434 GVSVIGNILQQEHLWEFDLRDRWLRFKHTRCA 465
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 63/183 (34%), Positives = 89/183 (48%), Gaps = 17/183 (9%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + ++IG P + + DTGSD+TW Q KPC C+ Q+ P F S S TF K+PC +
Sbjct: 79 EYMMNLSIGTPPFPILAIADTGSDLTWLQSKPCDQCYPQKGPIFDPSNSTTFHKLPCTTA 138
Query: 191 SCRILRESFPFGNC-NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
C L ES +C + C + Y D S + G+ A+D +T+ N++ F G
Sbjct: 139 PCNALDES--ARSCTDPTTCGYTYSYGDHSYTTGYLASDTVTV--GNASVQIRNVAFGCG 194
Query: 250 CINNSSGDKS--GASGIMGLDRSPVSIITRTNTSYFSYCL----------PSPYGSTGYI 297
N + D+ G G+ G + S VS + T FSYCL PS +T I
Sbjct: 195 TRNGGNFDEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCLLPLENEISSQPSDSPATSRI 254
Query: 298 TFG 300
FG
Sbjct: 255 VFG 257
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 54/119 (45%), Gaps = 4/119 (3%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+TF + + L +A + +K + + C+ S E V +P + +HF GG D+
Sbjct: 333 LTFLEEEFYGALEAALVEEIKMERVNDVKNSMFSLCFK-SGKEEVELPLMKVHFRGGADV 391
Query: 357 ELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
EL T V A VC F P + I GN+ Q V YD+ R + F P +CS
Sbjct: 392 ELKPVNTFVRAEEGLVC--FTMLPTNDVGI-YGNLAQMNFVVGYDLGKRTVSFLPADCS 447
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 46/161 (28%), Positives = 77/161 (47%), Gaps = 21/161 (13%)
Query: 90 RQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPAN------INDTVAD----------EYY 133
R RLH + RR L+R A+ +ND +D EY+
Sbjct: 75 RNHHHRLHAR-MRRDTDRVSAILRRISGKVVVASSDSRYEVNDFGSDVVSGMDQGSGEYF 133
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCR 193
+ + +G P + +++D+GSD+ W QC+PC C++Q DP F +KS ++ + C S+ C
Sbjct: 134 VRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCD 193
Query: 194 ILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQE 234
+ S C+S C + + Y DGS + G A + +T +
Sbjct: 194 RIENS----GCHSGGCRYEVMYGDGSYTKGTLALETLTFAK 230
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 55/114 (48%), Gaps = 4/114 (3%)
Query: 302 TDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVR 361
T R F + +A G+ + DTCYDLS + +V VP ++ +F G L L R
Sbjct: 361 TGAYAAFRDGFKSQTANLPRASGVS-IFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPAR 419
Query: 362 GTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L+ V C FA P + I GN+QQ G +V +D A +GFGP C
Sbjct: 420 NFLMPVDDSGTYCFAFAASPTGLSII--GNIQQEGIQVSFDGANGFVGFGPNVC 471
>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 450
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 44/127 (34%), Positives = 75/127 (59%), Gaps = 4/127 (3%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + ++G P + ++DTGS +TW QC+ C C++Q P F SKSKT+ +PC+S
Sbjct: 96 EYLMSYSVGTPPFEILGVVDTGSGITWMQCQRCEDCYEQTTPIFDPSKSKTYKTLPCSSN 155
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLG 249
C+ + S P + + C + I+Y DGS S G + + +T+ ++NG ++P ++G
Sbjct: 156 MCQSVI-STPSCSSDKIGCKYTIKYGDGSHSQGDLSVETLTL--GSTNGSSVQFPNTVIG 212
Query: 250 CINNSSG 256
C +N+ G
Sbjct: 213 CGHNNKG 219
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 43/89 (48%), Gaps = 4/89 (4%)
Query: 327 DLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSI 386
+ L CY + + VP I HF G D+EL+ T V + VC FA + + SI
Sbjct: 364 NFLSLCYQTTPSGQLDVPVITAHF-KGADVELNPISTFVQVAEGVVC--FAFHSSEVVSI 420
Query: 387 TLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
GN+ Q V YD+ + + F P +C+
Sbjct: 421 -FGNLAQLNLLVGYDLMEQTVSFKPTDCT 448
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 58/168 (34%), Positives = 79/168 (47%), Gaps = 7/168 (4%)
Query: 125 NDTVADEYYIVVAIGEPK-QYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
+D + EY I + IG P+ Q V L LDTGSD+ WTQC C CF Q P F AS S TF
Sbjct: 87 SDVGSSEYLIHLGIGTPRPQRVVLHLDTGSDLVWTQCA-CTVCFDQPVPVFRASVSHTFS 145
Query: 184 KIPCNSTSCRILRESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
++PC+ C P C +++ C + Y D S + G A D T + +
Sbjct: 146 RVPCSDPLCG-HAVYLPLSGCAARDRSCFYAYGYMDHSITTGKMAEDTFTFKAPDRADTA 204
Query: 242 TRYPFL-LGC-INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
P + GC + N SGI G P+S+ ++ FSYC
Sbjct: 205 AAVPNIRFGCGMMNYGLFTPNQSGIAGFGTGPLSLPSQLKVRRFSYCF 252
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 57/192 (29%), Positives = 99/192 (51%), Gaps = 15/192 (7%)
Query: 113 KRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDP 172
+ + +F N++ TV+ + +G P Q V+++LDTGS+++W CK + DP
Sbjct: 43 RPSSKLSFHHNVSLTVS------LTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHSVFDP 96
Query: 173 FFYASKSKTFFKIPCNSTSCRILRESFPFG-NCNSKE-CPFNIQYADGSGSGGFWATDRI 230
+S ++ IPC S +CR F +C+ K+ C I YAD S G A+D
Sbjct: 97 L----RSSSYSPIPCTSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASD-- 150
Query: 231 TIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSP 290
T NS T + + +++S + S +G++G++R +S +T+ FSYC+ S
Sbjct: 151 TFHIGNSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQKFSYCI-SG 209
Query: 291 YGSTGYITFGKT 302
S+G + FG++
Sbjct: 210 QDSSGILLFGES 221
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 57/192 (29%), Positives = 99/192 (51%), Gaps = 15/192 (7%)
Query: 113 KRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDP 172
+ + +F N++ TV+ + +G P Q V+++LDTGS+++W CK + DP
Sbjct: 50 RPSSKLSFHHNVSLTVS------LTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHSVFDP 103
Query: 173 FFYASKSKTFFKIPCNSTSCRILRESFPFG-NCNSKE-CPFNIQYADGSGSGGFWATDRI 230
+S ++ IPC S +CR F +C+ K+ C I YAD S G A+D
Sbjct: 104 L----RSSSYSPIPCTSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASD-- 157
Query: 231 TIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSP 290
T NS T + + +++S + S +G++G++R +S +T+ FSYC+ S
Sbjct: 158 TFHIGNSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQKFSYCI-SG 216
Query: 291 YGSTGYITFGKT 302
S+G + FG++
Sbjct: 217 QDSSGILLFGES 228
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 54/169 (31%), Positives = 80/169 (47%), Gaps = 21/169 (12%)
Query: 121 PANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSK 180
PA + A EY + +AIG P L DTGSD+TWTQCKPC CF Q P + + S
Sbjct: 73 PARLRSGQA-EYLMELAIGTPPVPFIALADTGSDLTWTQCKPCKLCFGQDTPIYDTTTSS 131
Query: 181 TFFKIPCNSTSCRILRESFPFGNCN--SKECPFNIQYADGSGSGGFWATDRITIQEANSN 238
+F +PC+S +C + S C+ S C + Y DG+ ++ + I
Sbjct: 132 SFSPLPCSSATCLPIWSS----RCSTPSATCRYRYAYDDGA-----YSPECAGISVGG-- 180
Query: 239 GYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
GC ++ G ++G +GL R +S++ + FSYCL
Sbjct: 181 -------IAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCL 222
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 53/162 (32%), Positives = 81/162 (50%), Gaps = 14/162 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + +G P ++LDTGSDV W QC PC C+ Q P F +S ++ + C +
Sbjct: 139 EYFTKIGVGTPSTPALMVLDTGSDVVWLQCAPCRRCYDQSGPVFDPRRSSSYGAVDCAAP 198
Query: 191 SCRILRESFPFGNCN--SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
CR L G C+ + C + + Y DGS + G +AT+ +T L
Sbjct: 199 LCRRLDS----GGCDLRRRACLYQVAYGDGSVTAGDFATETLTFAGGARVARVA-----L 249
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
GC +++ G A+G++GL R +S T+ + Y FSYCL
Sbjct: 250 GCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGKSFSYCL 291
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 55/110 (50%), Gaps = 3/110 (2%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+ LR AF + + G L DTCYDL + V VP +++HF GG + L L+
Sbjct: 386 SALRDAFRAAAAGLRLSPGGFSLFDTCYDLGGRKVVKVPTVSMHFAGGAEAALPPENYLI 445
Query: 366 -VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V S C FA D +GN+QQ+G V +D G+R+GF P C
Sbjct: 446 PVDSRGTFCFAFAGT--DGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 493
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 60/184 (32%), Positives = 90/184 (48%), Gaps = 23/184 (12%)
Query: 131 EYYIVVAIGEPKQ----YVSLLL-DTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKI 185
EY + +G P + + +LL D GSDVTW QC PC C+ Q P + KS + +
Sbjct: 124 EYIAKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPCFRCYHQPGPVYNRLKSSSASDV 183
Query: 186 PCNSTSCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
C + +CR L S G C EC + ++Y DGS S G + + +T R
Sbjct: 184 GCYAPACRALGSS---GGCVQFLNECQYKVEYGDGSSSAGDFGVETLTFPPG------VR 234
Query: 244 YPFL-LGCINNSSGD-KSGASGIMGLDRSPVSIITRTNTSY---FSYCLP--SPYGSTGY 296
P + +GC +++ G + A+GI+GL R +S ++ Y FSYCL G +
Sbjct: 235 VPGVAIGCGSDNQGLFPAPAAGILGLGRGSLSFPSQIAGRYGRSFSYCLAGQGTGGRSST 294
Query: 297 ITFG 300
+TFG
Sbjct: 295 LTFG 298
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 51/107 (47%), Gaps = 7/107 (6%)
Query: 307 TLRSAFHKRMKK---YKKAKGLEDLLDTCYD-LSAYETVVVPKIAIHFLGGVDLELDVRG 362
R AF K + G DTCY + VP +++HF GGV+++L +
Sbjct: 377 AFRDAFRVAAVKELGWPSPGGPFAFFDTCYSSVRGRVMKKVPAVSMHFAGGVEVKLPPQN 436
Query: 363 TLVVASVSQ--VCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRL 407
L+ ++ +C FA SI +GN+Q +G V YDV G+R+
Sbjct: 437 YLIPVDSNKGTMCFAFAGSGDRGVSI-IGNIQLQGFRVVYDVDGQRV 482
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 65/234 (27%), Positives = 110/234 (47%), Gaps = 35/234 (14%)
Query: 78 ISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVA 137
+S++ P + LR + R + R F K T+ F N+ TV+ +
Sbjct: 23 LSSNQPPIVLALRTQKHRTPISTPRL----FSTTSKTTDKLLFHHNVTLTVS------LT 72
Query: 138 IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDP----FFYASKSKTFFKIPCNSTSCR 193
G P Q ++++LDTGS+++W CK ++P F SKT+ KIPC+S +C
Sbjct: 73 AGTPLQNITMVLDTGSELSWLHCK--------KEPNFNSIFNPLASKTYTKIPCSSPTCE 124
Query: 194 ILRESFPFG-NCN-SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCI 251
P +C+ +K C F I YAD S G A + + G T + GC+
Sbjct: 125 TRTRDLPLPVSCDPAKLCHFIISYADASSVEGNLAFETFRV------GSVTGPATVFGCM 178
Query: 252 N----NSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFGK 301
+ ++S + + +G+MG++R +S + + FSYC+ S S+G + G+
Sbjct: 179 DSGFSSNSEEDAKTTGLMGMNRGSLSFVNQMGFRKFSYCI-SDRDSSGVLLLGE 231
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 63/196 (32%), Positives = 98/196 (50%), Gaps = 13/196 (6%)
Query: 103 RLRKPFPEFLKRTEAFTFPANINDTVAD------EYYIVVAIGEPKQYVSLLLDTGSDVT 156
RL F + R+ F + D + E+++ + IG P V + DTGSD+T
Sbjct: 50 RLNAAFLRSVSRSRRFNHQLSQTDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLT 109
Query: 157 WTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYA 216
W QCKPC C+++ P F KS T+ PC+S +C+ L + + ++ C + Y
Sbjct: 110 WVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYG 169
Query: 217 DGSGSGGFWATDRITIQEANSNGYFTRYP-FLLGCINNSSGD-KSGASGIMGLDRSPVSI 274
D S S G AT+ ++I A +G +P + GC N+ G SGI+GL +S+
Sbjct: 170 DQSFSKGDVATETVSIDSA--SGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSL 227
Query: 275 ITRTNTSY---FSYCL 287
I++ +S FSYCL
Sbjct: 228 ISQLGSSISKKFSYCL 243
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 57/193 (29%), Positives = 97/193 (50%), Gaps = 24/193 (12%)
Query: 115 TEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFF 174
++ +F N+ TV +A+G+P Q +S++LDTGS+++W CK + +P
Sbjct: 54 SDKLSFRHNVTLTVT------LAVGDPPQNISMVLDTGSELSWLHCKKSPNLGSVFNPV- 106
Query: 175 YASKSKTFFKIPCNSTSCRILRESFPF-GNCNSKE--CPFNIQYADGSGSGGFWATDRIT 231
S T+ +PC+S CR P +C+ K C I YAD + G A +
Sbjct: 107 ---SSSTYSPVPCSSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFV 163
Query: 232 IQEANSNGYFTRYPFLLGCIN----NSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
I G TR L GC++ ++S + + ++G+MG++R +S + + S FSYC+
Sbjct: 164 I------GSVTRPGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI 217
Query: 288 PSPYGSTGYITFG 300
S S+G++ G
Sbjct: 218 -SGSDSSGFLLLG 229
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 59/166 (35%), Positives = 81/166 (48%), Gaps = 18/166 (10%)
Query: 128 VADEYYIVVA-IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIP 186
V YIV A IG P Q + + LDT +D W C C+ C F SKS + +
Sbjct: 83 VQSPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGC--SSSVLFDPSKSSSSRTLQ 140
Query: 187 CNSTSCRILRESFPFGNCN-SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C + C+ P +C SK C FN+ Y GS + D +T+ ++ Y
Sbjct: 141 CEAPQCK----QAPNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTL----ASDVIPNYT 191
Query: 246 FLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLP 288
F GCIN +SG A G+MGL R P+S+I+++ Y FSYCLP
Sbjct: 192 F--GCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLP 235
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 64/237 (27%), Positives = 105/237 (44%), Gaps = 15/237 (6%)
Query: 79 STHAPSLEEILRQDQQRLH--LKNSRRLRKPFPEF--LKRTEAFTFPANINDTVADEYYI 134
+T A L L++D+ R + + P P+ L P + +Y
Sbjct: 84 ATGAELLARRLQRDELRAAWIISTAAANGTPPPDVVGLSTGRGLVAPVVSRAPTSGDYIA 143
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
+A+G P L LDT SD+TW QC+PC C+ Q P F S ++ ++ ++ C+
Sbjct: 144 KIAVGTPAVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPDCQA 203
Query: 195 LRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL-LGCINN 253
L S G+ C + + Y DG G G + ++E + R +L +GC ++
Sbjct: 204 LGRSG-GGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTFAGGVRQAYLSIGCGHD 262
Query: 254 SSGD-KSGASGIMGLDRSPVSIITRTN----TSYFSYCL----PSPYGSTGYITFGK 301
+ G + A+GI+GL R +SI + + FSYCL P + +TFG
Sbjct: 263 NKGLFGAPAAGILGLSRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSSTLTFGA 319
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/96 (38%), Positives = 52/96 (54%), Gaps = 6/96 (6%)
Query: 324 GLEDLLDTCYDLSAY----ETVVVPKIAIHFLGGVDLELDVRGTLV-VASVSQVCLGFAT 378
G L DTCY + V VP +++HF GGV+L L + L+ V S VC FA
Sbjct: 413 GPSGLFDTCYTVGGRAGLRHCVKVPAVSMHFAGGVELSLQPKNYLITVDSRGTVCFAFAG 472
Query: 379 YPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
D + +GN+ Q+G V YD+ G+R+GF P +C
Sbjct: 473 TG-DRSVSVIGNILQQGFRVVYDIGGQRVGFAPNSC 507
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 59/166 (35%), Positives = 81/166 (48%), Gaps = 18/166 (10%)
Query: 128 VADEYYIVVA-IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIP 186
V YIV A IG P Q + + LDT +D W C C+ C F SKS + +
Sbjct: 83 VQSPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGC--SSSVLFDPSKSSSSRTLQ 140
Query: 187 CNSTSCRILRESFPFGNCN-SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C + C+ P +C SK C FN+ Y GS + D +T+ ++ Y
Sbjct: 141 CEAPQCK----QAPNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTL----ASDVIPNYT 191
Query: 246 FLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLP 288
F GCIN +SG A G+MGL R P+S+I+++ Y FSYCLP
Sbjct: 192 F--GCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLP 235
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 61/185 (32%), Positives = 95/185 (51%), Gaps = 13/185 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + +++G P + + DTGSD+ WTQCKPC C+ Q DP F S T+ + C+S+
Sbjct: 93 EYLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVDPLFDPKASSTYKDVSCSSS 152
Query: 191 SCRILRESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
C L +C++++ C ++ Y D S + G A D +T+ ++ + ++
Sbjct: 153 QCTALENQ---ASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTDTRPVQLKN-III 208
Query: 249 GCINNSSGD-KSGASGIMGLDRSPVSIITRTNTSY---FSYC---LPSPYGSTGYITFGK 301
GC +N++G SGI+GL VS+IT+ S FSYC L S T I FG
Sbjct: 209 GCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSENDRTSKINFGT 268
Query: 302 TDTVN 306
V+
Sbjct: 269 NAVVS 273
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 36/129 (27%), Positives = 53/129 (41%), Gaps = 15/129 (11%)
Query: 295 GYITFGKTDTVNTLRSAFHKRMKKY-------KKAKGLEDLLDTCYDLSAYETVVVPKIA 347
G I T+ L + F+ ++ +K + + L CY SA + VP I
Sbjct: 317 GNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQTGLSLCY--SATGDLKVPAIT 374
Query: 348 IHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITL-GNVQQRGHEVHYDVAGRR 406
+HF G D+ L V S VC F P S ++ GNV Q V YD +
Sbjct: 375 MHF-DGADVNLKPSNCFVQISEDLVCFAFRGSP----SFSIYGNVAQMNFLVGYDTVSKT 429
Query: 407 LGFGPGNCS 415
+ F P +C+
Sbjct: 430 VSFKPTDCA 438
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 69/258 (26%), Positives = 99/258 (38%), Gaps = 22/258 (8%)
Query: 58 DKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEA 117
D + L V+ Y CS P +E + K+ RL+ ++T A
Sbjct: 31 DTSDLSVIPIYSKCSPF-------VPPKQESWVNTVITMASKDPERLKYLSTLADQKTTA 83
Query: 118 FTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYAS 177
Y + V +G P Q + ++LDT +D W C C C F +
Sbjct: 84 VPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSSTT---FLPN 140
Query: 178 KSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANS 237
S T + C+ C +R F S C FN Y S D IT+
Sbjct: 141 ASTTLGSLDCSGAQCSQVR-GFSCPATGSSACLFNQSYGGDSSLTATLVQDAITLANDVI 199
Query: 238 NGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS--PYG 292
G F GCIN SG G++GL R P+S+I++ Y FSYCLPS Y
Sbjct: 200 PG------FTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYY 253
Query: 293 STGYITFGKTDTVNTLRS 310
+G + G ++R+
Sbjct: 254 FSGSLKLGPVGQPKSIRT 271
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 51/106 (48%), Gaps = 9/106 (8%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-V 366
+R F K++ + G DTC+ +A P I +HF G++L L + +L+
Sbjct: 334 IRDEFRKQVNGPISSLGA---FDTCF--AATNEAEAPAITLHF-EGLNLVLPMENSLIHS 387
Query: 367 ASVSQVCLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGRRLGFG 410
+S S CL A P + NS+ + N+QQ+ + +D RLG
Sbjct: 388 SSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRLGIA 433
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 60/111 (54%), Gaps = 2/111 (1%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ V +G P ++ SL+LDTGSD+ W QC PC CF+Q P++ S +F I C+
Sbjct: 194 EYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQNGPYYDPKDSSSFKNITCHDP 253
Query: 191 SCRILRESFPFGNC--NSKECPFNIQYADGSGSGGFWATDRITIQEANSNG 239
C+++ P C ++ CP+ Y D S + G +A + T+ G
Sbjct: 254 RCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEG 304
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 55/120 (45%), Gaps = 3/120 (2%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T+ ++ AF +++K + + L CY++S E + +P+ AI F G
Sbjct: 441 LTYFAEPAYEIIKEAFMRKIKGFPLVETFPPL-KPCYNVSGVEKMELPEFAILFADGAMW 499
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ V + + VCL P SI +GN QQ+ + YD+ RLG+ P C+
Sbjct: 500 DFPVENYFIQIEPEDVVCLAILGTPRSALSI-IGNYQQQNFHILYDLKKSRLGYAPMKCA 558
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 72/254 (28%), Positives = 107/254 (42%), Gaps = 21/254 (8%)
Query: 62 LEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFP 121
L V+ YG CS AP E + + K+ R+R ++T A
Sbjct: 32 LSVIPIYGKCSPFT------APKSESWMNTVID-MASKDPARIRYLSSLTAQKTVAAPIA 84
Query: 122 ANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKT 181
+ Y + V +G P Q + ++LDT +D W C CI C F A S T
Sbjct: 85 SGQQVLNVGNYVVRVQLGTPGQTMYMVLDTSNDAAWAPCSGCIGCSSTTT--FSAQNSST 142
Query: 182 FFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
F + C+ C R + +C FN Y G F AT +Q++ G
Sbjct: 143 FATLDCSKPECTQAR-GLSCPTTGNVDCLFNQTYG---GDSTFSAT---LVQDSLHLGPN 195
Query: 242 TRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS--PYGSTGY 296
F GCI+++SG G+MGL R P+S+I+++ + Y FSYCLPS Y +G
Sbjct: 196 VIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSGLFSYCLPSFKSYYFSGS 255
Query: 297 ITFGKTDTVNTLRS 310
+ G +R+
Sbjct: 256 LKLGPVGQPKAIRT 269
Score = 42.7 bits (99), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 53/106 (50%), Gaps = 9/106 (8%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
+R F K++ G DTC+ + V P I +H L G+DL+L + +L+ +
Sbjct: 332 VRDEFRKQVGGSFSPLGA---FDTCF--ATNNEVSAPAITLH-LSGLDLKLPMENSLIHS 385
Query: 368 SV-SQVCLGFATYPPDPNSIT--LGNVQQRGHEVHYDVAGRRLGFG 410
S S CL A P + NS+ + N+QQ+ H + +D+ +LG
Sbjct: 386 SAGSLACLAMAAAPNNVNSVVNVIANLQQQNHRILFDINNSKLGIA 431
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 90/348 (25%), Positives = 131/348 (37%), Gaps = 77/348 (22%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y + +++G P + + DTGSD+ W Q +PC C F +S TF ++ C+S
Sbjct: 55 YVMDISVGTPGKRFRAIADTGSDLVWVQSEPCTGC--SGGTIFDPRQSSTFREMDCSSQL 112
Query: 192 CRILRESFPFGNC--NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLL 248
C L G+C S C ++ +Y G G F R TI ++ ++P F +
Sbjct: 113 CAELP-----GSCEPGSSTCSYSYEYGSGETEGEF---ARDTISLGTTSDGSQKFPSFAV 164
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNT---SYFSYCL------------------ 287
GC +SG G G++GL + PVS+ ++ + S FSYCL
Sbjct: 165 GCGMVNSG-FDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSA 223
Query: 288 -------------------PSPY--------------GSTGYITFGKTDTVNTLRSAFHK 314
P+ Y GS G T+ + S +
Sbjct: 224 ALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSPGTTIIDSGTTLTYVPSGVYG 283
Query: 315 R-------MKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVV- 366
R M + G LD CYD S+ P + I G LVV
Sbjct: 284 RVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAGATMTPPSSNYFLVVD 343
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
S VCL + P SI +GNV Q+G+ + YD L F C
Sbjct: 344 DSGDTVCLAMGSASGLPVSI-IGNVMQQGYHILYDRGSSELSFVQAKC 390
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 64/222 (28%), Positives = 98/222 (44%), Gaps = 27/222 (12%)
Query: 96 LHLKNSRRLRKPFPEFLKRTEAFTFP-ANINDTVA-DEYYIVVAIGEPKQYVSLLLDTGS 153
L + RRLR+ PE + +FP + ND A YY +++G P Q + +DTGS
Sbjct: 9 LRKHDQRRLRRMLPEVV------SFPISGDNDIFAMGLYYTRISLGTPPQQFYVDVDTGS 62
Query: 154 DVTWTQCKPCIHCFQQRD---PF--FYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKE 208
+V W +C PC C D P F KS T I C C +L + C+ +
Sbjct: 63 NVAWVKCAPCTGCEHSGDVPVPMSTFDPRKSTTKISISCTDAECGVLNKKL---QCSPER 119
Query: 209 --CPFNIQYADGSGSGGFWATDRITIQEA---NSNGYFTRYPFLLGCINNSSGDKSGASG 263
CP+++ Y DGS + G++ D T + NS + GC +G S G
Sbjct: 120 LSCPYSLLYGDGSSTAGYYLNDVFTFNQVPSDNSTAKSGTARLVFGCGGTQTGSWS-VDG 178
Query: 264 IMGLDRSPVSI---ITRTNTSY--FSYCLPSPYGSTGYITFG 300
++G + VS+ + + N S F++CL G + G
Sbjct: 179 LLGFGPTTVSLPNQLAQQNISVNIFAHCLQGDVSGRGSLVIG 220
>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
Length = 419
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 59/168 (35%), Positives = 80/168 (47%), Gaps = 20/168 (11%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC--IHCFQQRDPFFYASKSKTFFKIPCNS 189
Y IG P Q VS ++D ++ WTQC C CF+Q P F S S T+ C S
Sbjct: 62 YVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGS 121
Query: 190 TSCRILRESFPFGNCNSK-ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
C+ S P NC+ EC + G + G +TD I I A F
Sbjct: 122 PLCK----SIPTRNCSGDGECGYEAPSMFGD-TFGIASTDAIAIGNAEGRLAF------- 169
Query: 249 GCINNSSGDKSGA----SGIMGLDRSPVSIITRTNTSYFSYCLPSPYG 292
GC+ S G GA SG +GL R+P S++ ++N + FSYCL +P+G
Sbjct: 170 GCVVASDGSIDGAMDGPSGFVGLGRTPWSLVGQSNVTAFSYCL-APHG 216
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 90/422 (21%), Positives = 159/422 (37%), Gaps = 89/422 (21%)
Query: 73 RLNQGI-STHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADE 131
+L +GI + H L ++ +D+ R + R L+ L F + V
Sbjct: 30 KLERGIPANHEMELSQLKARDKAR----HGRLLQS-----LGGVIDFPVDGTFDPFVVGL 80
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFFKIP 186
YY + +G P + + +DTGSDV W C C C Q + FF S T +
Sbjct: 81 YYTKIRLGSPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTATPVS 140
Query: 187 CNSTSCRILRESFPFG-NCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYF--TR 243
C+ C +S G + + C + QY DGSG+ GF+ +D + + +
Sbjct: 141 CSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNST 200
Query: 244 YPFLLGCINNSSGD----KSGASGIMGLDRSPVSIITRTNTS-----YFSYCLPSPYGST 294
P + GC + +GD GI G + +S+I++ + FS+CL G
Sbjct: 201 APVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCLKGENGGG 260
Query: 295 GYITFGKTD----------------TVNTLRSAFHKRM-----KKYKKAKGLEDLLDT-- 331
G + G+ VN L + + + + + G ++DT
Sbjct: 261 GILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGT 320
Query: 332 ----------------------------------CYDLSAYETVVVPKIAIHFLGGVDLE 357
CY ++ + P ++++F GG +
Sbjct: 321 TLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQCYVIATSVADIFPPVSLNFAGGASMF 380
Query: 358 LDVRGTLV----VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
L+ + L+ V + C+GF +I LG++ + YD+ G+R+G+ +
Sbjct: 381 LNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITI-LGDLVLKDKIFVYDLVGQRIGWANYD 439
Query: 414 CS 415
CS
Sbjct: 440 CS 441
>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
Length = 471
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/108 (40%), Positives = 56/108 (51%), Gaps = 5/108 (4%)
Query: 307 TLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVV 366
LR AF M Y + G LDTCYD + +V VP +++ F GG + LD G +V
Sbjct: 369 ALRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV- 427
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ CL F P D +GNVQQ+ HEV YDV G +GF G C
Sbjct: 428 ----EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 471
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 73/295 (24%), Positives = 123/295 (41%), Gaps = 40/295 (13%)
Query: 31 HSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILR 90
H +V SSLL P A+P + + + YGPCS S L ++LR
Sbjct: 18 HYIVVETSSLLKPKAICSGLKAMPSS-NGTWVALHRPYGPCSPSPTTTSPPL--LVDMLR 74
Query: 91 QDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVV-------------- 136
D +LH RR + + + +D + +
Sbjct: 75 WD--KLHTDAIRRKATAGGDVVLEPDKPIVDVQQSDYKMQASFGIGTGGRSGSSSSSSSR 132
Query: 137 -----AIGEPKQYVSLLLDTGSDVTWTQCKPC--IHCFQQRDPFFYASKSKTFFKIPCNS 189
AI +P + +DT D+ W QC PC C+ Q++ F +S+T +PC S
Sbjct: 133 ISRPSAIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGS 192
Query: 190 TSCRILRESFPFG-NCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
+C L +G C++ +C + + Y DG + G + D +T+ + F
Sbjct: 193 AACGELGR---YGAGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPST-----VVMNFRF 244
Query: 249 GCINNSSGDKSGA-SGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITF 299
GC + G+ S + SG M L S++++T ++ FSYC+P P S+G+++
Sbjct: 245 GCSHAVRGNFSASTSGTMSLGGGRQSLLSQTAATFGNAFSYCVPDP-SSSGFLSL 298
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 66/217 (30%), Positives = 105/217 (48%), Gaps = 24/217 (11%)
Query: 103 RLRKPFPEFLKRTEAFTFPANIN-------DTV--ADEYYIVVAIGEPKQYVSLLLDTGS 153
RL+ F + R FT P +++ D + EY++ ++IG P V ++ DTGS
Sbjct: 57 RLQSSFHRSISRANRFT-PNSVSAAKTLEYDIIPGGGEYFMRISIGTPPIEVLVIADTGS 115
Query: 154 DVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNS--KECPF 211
D+ W QC+PC C++Q+ P F +S T+ ++ C + C L + + K C +
Sbjct: 116 DLIWVQCQPCQECYKQKSPIFNPKQSSTYRRVLCETRYCNALNSDMRACSAHGFFKACGY 175
Query: 212 NIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD-KSGASGIMGLDRS 270
+ Y D S + G+ AT+R I N+ + GC N++ G+ SGI+GL
Sbjct: 176 SYSYGDHSFTMGYLATERFIIGSTNN----SIQELAFGCGNSNGGNFDEVGSGIVGLGGG 231
Query: 271 PVSIITRTNTSY---FSYC----LPSPYGSTGYITFG 300
+S+I++ T FSYC L S G I FG
Sbjct: 232 SLSLISQLGTKIDNKFSYCLVPILEKSNFSLGKIVFG 268
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 50/176 (28%), Positives = 95/176 (53%), Gaps = 17/176 (9%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCR 193
+ + +G P Q VS+++DTGS+++W C + DP ++S ++ IPC+S +C
Sbjct: 33 VSLTVGTPPQNVSMVIDTGSELSWLHCNKTLSYPTTFDP----TRSTSYQTIPCSSPTCT 88
Query: 194 ILRESFPF-GNCNSKE-CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCI 251
+ FP +C+S C + YAD S S G A+D I ++ +G + GC+
Sbjct: 89 NRTQDFPIPASCDSNNLCHATLSYADASSSDGNLASDVFHIGSSDISG------LVFGCM 142
Query: 252 N----NSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFGKTD 303
+ ++S + S ++G+MG++R +S +++ FSYC+ S +G + G+++
Sbjct: 143 DSVFSSNSDEDSKSTGLMGMNRGSLSFVSQLGFPKFSYCI-SGTDFSGLLLLGESN 197
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 58/193 (30%), Positives = 94/193 (48%), Gaps = 24/193 (12%)
Query: 115 TEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFF 174
++ +F N+ TV +A+G P Q +S++LDTGS+++W CK + +P
Sbjct: 50 SDKLSFRHNVTLTVT------LAVGSPPQNISMVLDTGSELSWLHCKKSPNLGSVFNPV- 102
Query: 175 YASKSKTFFKIPCNSTSCRILRESFPF-GNCNSKE--CPFNIQYADGSGSGGFWATDRIT 231
S T+ +PC+S CR P +C+ K C I YAD + G A D
Sbjct: 103 ---SSSTYSPVPCSSPICRTRTRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFV 159
Query: 232 IQEANSNGYFTRYPFLLGCINNS----SGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
I G TR L GC+++ S + + ++G+MG++R +S + + S FSYC+
Sbjct: 160 I------GSVTRPGTLFGCMDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI 213
Query: 288 PSPYGSTGYITFG 300
S S+G + G
Sbjct: 214 -SGSDSSGILLLG 225
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 58/185 (31%), Positives = 93/185 (50%), Gaps = 15/185 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
+Y + + +G P + L+DTGSD+ W QC PC C++Q+ P F +SKT+ IPC S
Sbjct: 81 DYLMKLTLGSPPVDIYGLVDTGSDLVWAQCTPCGGCYRQKSPMFEPLRSKTYSPIPCESE 140
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C F + K C ++ YAD S + G A + IT + + + GC
Sbjct: 141 QCSF----FGYSCSPQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPVVVG-DIIFGC 195
Query: 251 INNSSGD-KSGASGIMGLDRSPVSIITRTNTSY----FSYCLPSPY----GSTGYITFGK 301
+++SG GI+G+ P+S++++ T Y FS CL P+ ++G I FG+
Sbjct: 196 GHSNSGTFNENDMGIIGMGGGPLSLVSQIGTLYGSKRFSQCL-VPFHTDAHTSGTINFGE 254
Query: 302 TDTVN 306
V+
Sbjct: 255 ESDVS 259
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 61/240 (25%), Positives = 101/240 (42%), Gaps = 22/240 (9%)
Query: 79 STHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTV-------ADE 131
+T A + + + +RL SR + P+ + A N DTV
Sbjct: 43 TTAAINFTQAALESHRRLSFLASRSSQVDKPQ---SSSASQLSNNDTDTVPLRMDGGGGA 99
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y + +IG P Q ++ L DTGSD+ WT+C ++ + S TF ++PC+
Sbjct: 100 YDMEFSIGTPPQKLTALADTGSDLIWTKCDAGGGAAWGGSSSYHPNASSTFTRLPCSDRL 159
Query: 192 CRILRESFPFGNCNSKECPFNIQYADGSG-----SGGFWATDRITIQEANSNGYFTRYPF 246
C LR S+ C + + +YA G G + GF ++ T+ G
Sbjct: 160 CAALR-SYSLARCAAGGAECDYKYAYGLGDDPDFTQGFLGSETFTLGGDAVPGVG----- 213
Query: 247 LLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFGKTDTVN 306
GC GD +G++GL R P+S++++ + F YCL + + FG T+
Sbjct: 214 -FGCTTALEGDYGEGAGLVGLGRGPLSLVSQLDAGTFMYCLTADASKASPLLFGALATMT 272
Score = 41.2 bits (95), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 34/73 (46%), Gaps = 3/73 (4%)
Query: 342 VVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYD 401
++P + +HF GG D+ L V +V VC P+ +GN+ Q + V +D
Sbjct: 364 LIPAMVLHFDGGADMALPVANYVVEVDDGVVCW---VVQRSPSLSIIGNIMQMNYLVLHD 420
Query: 402 VAGRRLGFGPGNC 414
V L F P NC
Sbjct: 421 VRKSVLSFQPANC 433
>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
Length = 431
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 52/165 (31%), Positives = 87/165 (52%), Gaps = 8/165 (4%)
Query: 128 VADE-YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIP 186
++DE Y + + IG P Q +L+ DT SD+TWTQC +Q +P F +KS +F +
Sbjct: 86 ISDEGYTVTIGIGTPPQLHTLIADTASDLTWTQCNLFNDTAKQVEPLFDPAKSSSFAFVT 145
Query: 187 CNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPF 246
C+S C ++ C++K C + Y + G A + T+ + N + + F
Sbjct: 146 CSSKLCT--EDNPGTKRCSNKTCRYVYPYVSVE-AAGVLAYESFTLSDNNQHICMS---F 199
Query: 247 LLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPY 291
GC + G+ GASGI+G+ + +S++++ FSYCL +PY
Sbjct: 200 GFGCGALTDGNLLGASGILGMSPAILSMVSQLAIPKFSYCL-TPY 243
>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 686
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 86/180 (47%), Gaps = 16/180 (8%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
Y+ + +G P + L +DTGSD+TW QC PC C + +P + K +P +
Sbjct: 314 YFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNPLYKPKKGNL---VPLKDS 370
Query: 191 SCRILRESFPFGNCNS-KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
C ++ + G C + ++C + I+YAD S S G A+D + + A NG T+ + G
Sbjct: 371 LCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVLASDDLHLMLA--NGSLTKLGIMFG 428
Query: 250 CINNSSG----DKSGASGIMGLDRSPVSIIT-----RTNTSYFSYCLPSPYGSTGYITFG 300
C + G + GI+GL ++ VS+ + R + +CL S GY+ G
Sbjct: 429 CAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTSDATGGGYMFLG 488
>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
Length = 487
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/108 (40%), Positives = 56/108 (51%), Gaps = 5/108 (4%)
Query: 307 TLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVV 366
LR AF M Y + G LDTCYD + +V VP +++ F GG + LD G +V
Sbjct: 385 ALRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV- 443
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ CL F P D +GNVQQ+ HEV YDV G +GF G C
Sbjct: 444 ----EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 487
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 49/170 (28%), Positives = 85/170 (50%), Gaps = 16/170 (9%)
Query: 137 AIGEPKQYVSLLLDTGSDVTWTQCKPC--IHCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
AI +P + +DT D+ W QC PC C+ Q++ F +S+T +PC S +C
Sbjct: 154 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 213
Query: 195 LRESFPFG-NCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINN 253
L +G C++ +C + + Y DG + G + D +T+ + F GC +
Sbjct: 214 LGR---YGAGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPST-----VVMNFRFGCSHA 265
Query: 254 SSGDKSGA-SGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITF 299
G+ S + SG M L S++++T ++ FSYC+P P S+G+++
Sbjct: 266 VRGNFSASTSGTMSLGGGRQSLLSQTAATFGNAFSYCVPDP-SSSGFLSL 314
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 58/165 (35%), Positives = 89/165 (53%), Gaps = 8/165 (4%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY++ V IG P ++ SL+LDTGSD+ W QC PC CF Q P++ +S +F I C+
Sbjct: 191 EYFMDVFIGTPPRHFSLILDTGSDLNWIQCVPCYDCFVQNGPYYDPKESSSFKNIGCHDP 250
Query: 191 SCRILRESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNG--YFTRYP- 245
C ++ P C ++ CP+ Y D S + G +A + T+ + G F R
Sbjct: 251 RCHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKRVEN 310
Query: 246 FLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
+ GC + + G GA+G++GL R P+S ++ + Y FSYCL
Sbjct: 311 VMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCL 355
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 39/135 (28%), Positives = 60/135 (44%), Gaps = 10/135 (7%)
Query: 289 SPYGSTGYI-------TFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETV 341
SP G+ G I ++ + ++ AF K++K Y K +LD CY++S E +
Sbjct: 423 SPEGAGGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIKDFP-ILDPCYNVSGVEKM 481
Query: 342 VVPKIAIHFLGGVDLELDVRGTLVVASVSQ-VCLGFATYPPDPNSITLGNVQQRGHEVHY 400
+P+ I F G V + + VCL P SI +GN QQ+ + Y
Sbjct: 482 ELPEFRILFEDGAVWNFPVENYFIKLEPEEIVCLAILGTPRSALSI-IGNYQQQNFHILY 540
Query: 401 DVAGRRLGFGPGNCS 415
D RLG+ P C+
Sbjct: 541 DTKKSRLGYAPMKCA 555
>gi|357118734|ref|XP_003561105.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 404
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 42/110 (38%), Positives = 60/110 (54%), Gaps = 6/110 (5%)
Query: 306 NTLRSAFHKRMKKYKKA-KGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL 364
LR AF M++Y++ G + +LDTCYD V VP +++ F GG + L+
Sbjct: 300 RALRRAFRNAMRRYRRVPAGGKQILDTCYDFEGLGNVTVPAVSLVFSGGAVVRLEP---- 355
Query: 365 VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+A + + CL F P D + +GNVQQ+ HEV YDV R +GF G C
Sbjct: 356 -MAVMMEGCLAFVPTPADSDLGFIGNVQQQTHEVLYDVGARNVGFRRGAC 404
>gi|413944378|gb|AFW77027.1| hypothetical protein ZEAMMB73_570500 [Zea mays]
Length = 484
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 102/443 (23%), Positives = 161/443 (36%), Gaps = 103/443 (23%)
Query: 60 ASLEVVSKYGPCSRLNQGISTHAP---SLEEILRQDQQRLH--LKNSRRLRKPFPEFLKR 114
+++ VV + PCS L P S+ ++L +D RL L +
Sbjct: 57 SAVPVVHRLSPCSPLAGAARNQQPERRSVADVLHRDALRLRSLLHREEDNHRTPAPAAPP 116
Query: 115 TEAFTFPANINDTV----ADEYYIVVAIGEPKQYVSLLLDTGS-DVTWTQCKPCIHCFQQ 169
+ P+ A EY++V G P Q + + DT + T QC PC
Sbjct: 117 GGGVSIPSRGEPIEELPGAFEYHVVAGFGTPMQKLPVGFDTTTTGATLLQCTPC---GSG 173
Query: 170 RDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSK-ECPFNIQYAD---GSGSGGFW 225
D F S S + ++PC S C PF C+ + C ++ + + G+ +
Sbjct: 174 ADHAFDPSASSSVSQVPCGSPDC-------PFHGCSGRPSCTLSVSFNNTLLGNATFFTD 226
Query: 226 ATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTS---- 281
A + + R+ L G + D G++GI+ L R+ S+ +R S
Sbjct: 227 TLTLTPSSSATVDKF--RFACLEGIAPGPAED--GSAGILDLSRNSHSLPSRLVASSPPH 282
Query: 282 --YFSYCLPSPYGSTGYITFGKT------------------------------------- 302
FSYCLP+ G+++ G T
Sbjct: 283 AVAFSYCLPASTADVGFLSLGATKPELLGRKVSYTPLRGSPSNGNLYVVDLVGLGLGGPD 342
Query: 303 -----------DTVNTLRSAF---------------HKRMKKYKKAKGLEDLLDTCYDLS 336
DT+ L + F K M +Y A L LDTCY+ +
Sbjct: 343 LPIPPAAIAGDDTILELHTTFTYLKPQVYKVLRDSFRKSMSEYPAAPPLGS-LDTCYNFT 401
Query: 337 AYETVVVPKIAIHFLGGVDLELDVRGTLVVAS----VSQVCLGFATYPPDPNSIT-LGNV 391
+ VP + + F GG D++L + + S CL F D + T +G++
Sbjct: 402 GLDAFSVPAVTLKFAGGADVDLWMDEMMYFTDPDNHFSIGCLAFVAQDDDCDGGTVIGSM 461
Query: 392 QQRGHEVHYDVAGRRLGFGPGNC 414
Q EV YDV G ++GF P C
Sbjct: 462 AQMSTEVVYDVRGGKVGFVPYRC 484
>gi|125595873|gb|EAZ35653.1| hypothetical protein OsJ_19940 [Oryza sativa Japonica Group]
Length = 468
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 44/109 (40%), Positives = 56/109 (51%), Gaps = 5/109 (4%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
LR AF M Y + G LDTCYD + +V VP +++ F GG + LD G +V
Sbjct: 365 RALRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV 424
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ CL F P D +GNVQQ+ HEV YDV G +GF G C
Sbjct: 425 -----EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 468
Score = 45.8 bits (107), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 20/58 (34%), Positives = 31/58 (53%), Gaps = 2/58 (3%)
Query: 137 AIGEPKQYVSLLLDTGSDVTWTQCKPC--IHCFQQRDPFFYASKSKTFFKIPCNSTSC 192
AI +P + +DT D+ W QC PC C+ Q++ F +S+T +PC S +C
Sbjct: 154 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAAC 211
>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
Length = 419
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 57/164 (34%), Positives = 76/164 (46%), Gaps = 19/164 (11%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC--IHCFQQRDPFFYASKSKTFFKIPCN 188
Y IG P Q VS ++D ++ WTQC C CF+Q P F S S T+ C
Sbjct: 61 HYVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCG 120
Query: 189 STSCRILRESFPFGNCNSK-ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
S C+ S P NC+ EC + G + G +TD I I A F
Sbjct: 121 SPLCK----SIPTRNCSGDGECGYEAPSMFGD-TFGIASTDAIAIGNAEGRLAF------ 169
Query: 248 LGCINNSSGDKSGA----SGIMGLDRSPVSIITRTNTSYFSYCL 287
GC+ S G GA SG +GL R+P S++ ++N + FSYCL
Sbjct: 170 -GCVVASDGSIDGAMDGPSGFVGLGRTPWSLVGQSNVTAFSYCL 212
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 70/244 (28%), Positives = 114/244 (46%), Gaps = 34/244 (13%)
Query: 74 LNQGIS---THAPSLEEILRQDQQRLHLKNSRRLRKPFP---EFLKRTEAFTFPANINDT 127
LN G + H S + Q Q + + + +R+ F K + T P + ++
Sbjct: 25 LNNGFTLELIHRDSSKSPFYQPTQNKYERIANAVRRSINRVNHFYKYSLTST-PQSTVNS 83
Query: 128 VADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPC 187
EY + +IG P V +DTGSD+ W QC+PC C+ Q P F S S ++ IPC
Sbjct: 84 DKGEYLMSYSIGTPPFKVFGFVDTGSDLVWLQCEPCKQCYPQITPIFDPSLSSSYQNIPC 143
Query: 188 NSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-F 246
S +C +R + +C+ + G+ + + +T+ ++ GY +P
Sbjct: 144 LSDTCHSMRTT----SCDVR---------------GYLSVETLTLD--STTGYSVSFPKT 182
Query: 247 LLGCINNSSGDKSG-ASGIMGLDRSPVSIITRTNTSY---FSYCL-PSPYGSTGYITFGK 301
++GC ++G G +SGI+GL P+S+ ++ TS FSYCL P ST + FG
Sbjct: 183 MIGCGYRNTGTFHGPSSGIVGLGSGPMSLPSQLGTSIGGKFSYCLGPWLPNSTSKLNFGD 242
Query: 302 TDTV 305
V
Sbjct: 243 AAIV 246
>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
Length = 473
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 86/180 (47%), Gaps = 16/180 (8%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCK-PCIHCFQQRDPFFYASKSKTFFKIPCNST 190
Y+ + +G P + L +DTGSD+TW QC PC C + +P + K +P +
Sbjct: 101 YFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNPLYKPKKGNL---VPLKDS 157
Query: 191 SCRILRESFPFGNCNS-KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
C ++ + G C + ++C + I+YAD S S G A+D + + A NG T+ + G
Sbjct: 158 LCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVLASDDLHLMLA--NGSLTKLGIMFG 215
Query: 250 CINNSSG----DKSGASGIMGLDRSPVSIIT-----RTNTSYFSYCLPSPYGSTGYITFG 300
C + G + GI+GL ++ VS+ + R + +CL S GY+ G
Sbjct: 216 CAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTSDATGGGYMFLG 275
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/184 (30%), Positives = 78/184 (42%), Gaps = 15/184 (8%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y + V +G P Q + ++LDT +D W C C C F + S T + C+
Sbjct: 45 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSSTT---FLPNASTTLGSLDCSEAQ 101
Query: 192 CRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCI 251
C +R F S C FN Y S D IT+ G F GCI
Sbjct: 102 CSQVR-GFSCPATGSSACLFNQSYGGDSSLAATLVQDAITLANDVIPG------FTFGCI 154
Query: 252 NNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS--PYGSTGYITFGKTDTVN 306
N SG G++GL R P+S+I++ Y FSYCLPS Y +G + G
Sbjct: 155 NAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPK 214
Query: 307 TLRS 310
++R+
Sbjct: 215 SIRT 218
>gi|449439393|ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 570
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/200 (28%), Positives = 91/200 (45%), Gaps = 16/200 (8%)
Query: 112 LKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCK-PCIHCFQQR 170
LK + FP + YY + +GEP + L +DTGSD+TW QC PC C + R
Sbjct: 179 LKSDSSAVFPVRGDIYPDGLYYTYIMVGEPPRPYFLDIDTGSDLTWVQCDAPCSSCGKGR 238
Query: 171 DPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNS-KECPFNIQYADGSGSGGFWATDR 229
P + + + + C ++ ++ C + ++C + +QYAD S S G D
Sbjct: 239 SPLYKPRRENV---VSFKDSLCMEVQRNYDGDQCAACQQCNYEVQYADQSSSLGVLVKDE 295
Query: 230 ITIQEANSNGYFTRYPFLLGCINNSSG----DKSGASGIMGLDRSPVSIITRTNT----- 280
T++ SNG T+ + GC + G S GI+GL R+ VS+ ++ +
Sbjct: 296 FTLR--FSNGSLTKLNAIFGCAYDQQGLLLNTLSKTDGILGLSRAKVSLPSQLASRGIIN 353
Query: 281 SYFSYCLPSPYGSTGYITFG 300
+ +CL GY+ G
Sbjct: 354 NVVGHCLTGDPAGGGYLFLG 373
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/184 (30%), Positives = 78/184 (42%), Gaps = 15/184 (8%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y + V +G P Q + ++LDT +D W C C C F + S T + C+
Sbjct: 45 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSSTT---FLPNASTTLGSLDCSEAQ 101
Query: 192 CRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCI 251
C +R F S C FN Y S D IT+ G F GCI
Sbjct: 102 CSQVR-GFSCPATGSSACLFNQSYGGDSSLAATLVQDAITLANDVIPG------FTFGCI 154
Query: 252 NNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS--PYGSTGYITFGKTDTVN 306
N SG G++GL R P+S+I++ Y FSYCLPS Y +G + G
Sbjct: 155 NAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPK 214
Query: 307 TLRS 310
++R+
Sbjct: 215 SIRT 218
>gi|115460260|ref|NP_001053730.1| Os04g0595000 [Oryza sativa Japonica Group]
gi|113565301|dbj|BAF15644.1| Os04g0595000, partial [Oryza sativa Japonica Group]
Length = 471
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 52/169 (30%), Positives = 78/169 (46%), Gaps = 16/169 (9%)
Query: 132 YYIVVAIGEPKQYVS---LLLDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFF 183
Y + + IG P +S +L DTGSD++WTQC+PC +C + DP SKS+TF
Sbjct: 104 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDP----SKSRTFR 159
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA-NSNGYFT 242
++ C C + G S C F +Y DG G +D A + GY
Sbjct: 160 RLSCFDPMCELCTAVVD-GGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 218
Query: 243 RYPFLLGC--INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
GC + +S + ++GI+ L S +T+ FSYC+P+
Sbjct: 219 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVDRFSYCIPA 267
>gi|222629462|gb|EEE61594.1| hypothetical protein OsJ_16002 [Oryza sativa Japonica Group]
Length = 468
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 52/169 (30%), Positives = 78/169 (46%), Gaps = 16/169 (9%)
Query: 132 YYIVVAIGEPKQYVS---LLLDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFF 183
Y + + IG P +S +L DTGSD++WTQC+PC +C + DP SKS+TF
Sbjct: 101 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDP----SKSRTFR 156
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA-NSNGYFT 242
++ C C + G S C F +Y DG G +D A + GY
Sbjct: 157 RLSCFDPMCELCTAVVD-GGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 215
Query: 243 RYPFLLGC--INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
GC + +S + ++GI+ L S +T+ FSYC+P+
Sbjct: 216 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVDRFSYCIPA 264
>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
Length = 489
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 52/169 (30%), Positives = 78/169 (46%), Gaps = 16/169 (9%)
Query: 132 YYIVVAIGEPKQYVS---LLLDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFF 183
Y + + IG P +S +L DTGSD++WTQC+PC +C + DP SKS+TF
Sbjct: 122 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDP----SKSRTFR 177
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA-NSNGYFT 242
++ C C + G S C F +Y DG G +D A + GY
Sbjct: 178 RLSCFDPMCELCTAVVD-GGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 236
Query: 243 RYPFLLGC--INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
GC + +S + ++GI+ L S +T+ FSYC+P+
Sbjct: 237 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVDRFSYCIPA 285
>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
gi|194688798|gb|ACF78483.1| unknown [Zea mays]
gi|194703430|gb|ACF85799.1| unknown [Zea mays]
gi|194707192|gb|ACF87680.1| unknown [Zea mays]
gi|223944599|gb|ACN26383.1| unknown [Zea mays]
gi|223948667|gb|ACN28417.1| unknown [Zea mays]
gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 450
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 105/441 (23%), Positives = 173/441 (39%), Gaps = 93/441 (21%)
Query: 47 NRTRTALPQGPDKA--SLEVVSKYGPCSRLNQGISTHAPS----LEEILRQDQQRLHLKN 100
+ +R++ P P A +L+V +GPCS L G T APS L + +D RL +
Sbjct: 29 SHSRSSCPATPPDAGNTLQVSHAFGPCSPLGPG--TAAPSWAGFLADQASRDASRLLYLD 86
Query: 101 SRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVA-IGEPKQYVSLLLDTGSDVTWTQ 159
S +R R A+ A+ + Y+V A +G P Q + L +DT +D +W
Sbjct: 87 SLAVRG-------RARAYAPIASGRQLLQTPTYVVRASLGTPPQQLLLAVDTSNDASWIP 139
Query: 160 CKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNC--NSKECPFNIQYAD 217
C C C F + S ++ +PC S C P C K C F++ YAD
Sbjct: 140 CAGCAGCPTSSAAPFDPASSASYRTVPCGSPLC----AQAPNAACPPGGKACGFSLTYAD 195
Query: 218 GSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITR 277
S + D + + Y GC+ ++G + G++GL R P+S +++
Sbjct: 196 SSLQAAL-SQDSLAVAGNAVKAY------TFGCLQRATGTAAPPQGLLGLGRGPLSFLSQ 248
Query: 278 TNTSY---FSYCLPS--PYGSTGYITFGKTDT----------VNTLRSAFH--------- 313
T Y FSYCLPS +G + G+ N RS+ +
Sbjct: 249 TKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQPQRIKTTPLLANPHRSSLYYVNMTGIRV 308
Query: 314 ----KRMKKYKKAKGLEDLLDT--------------------------CYDLSAYET--- 340
+ + A G +LD+ L ++T
Sbjct: 309 GRKVVPIPAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRVGAPVSSLGGFDTCFN 368
Query: 341 ---VVVPKIAIHFLGGVDLELDVRGTLVVASVSQV-CLGFATYPPDPNSI--TLGNVQQR 394
V P + + F G+ + L ++ ++ + CL A P N++ + ++QQ+
Sbjct: 369 TTAVAWPPVTLLF-DGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQ 427
Query: 395 GHEVHYDVAGRRLGFGPGNCS 415
H V +DV R+GF C+
Sbjct: 428 NHRVLFDVPNGRVGFARERCT 448
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 51/163 (31%), Positives = 79/163 (48%), Gaps = 14/163 (8%)
Query: 103 RLRKPFPEFLKRTEAFTFPANINDTVAD-------EYYIVVAIGEPKQYVSLLLDTGSDV 155
RL+K F + R F +++ EY + +++G P + + DTGSD+
Sbjct: 59 RLQKAFHRSISRANHFRANGVSTNSIQSPVISNNGEYLMNISLGTPPVSMHGIADTGSDL 118
Query: 156 TWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNC-NSKECPFNIQ 214
W QCKPC C++Q +P F +KSKT+ + C SC L G C + C ++
Sbjct: 119 LWRQCKPCDSCYEQIEPIFDPAKSKTYQILSCEGKSCSNLGGQ---GGCSDDNTCIYSYS 175
Query: 215 YADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLGCINNSSG 256
Y DGS + G A D +TI ++ G P + GC +N+ G
Sbjct: 176 YGDGSHTSGDLAVDTLTI--GSTTGRPVSVPKVVFGCGHNNGG 216
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 42/96 (43%), Gaps = 6/96 (6%)
Query: 320 KKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATY 379
K + ++ CY S + +P I HF+G DLEL T V C FA
Sbjct: 355 KPVRDPNNVFSLCY--SNLSGLRIPTITAHFVG-ADLELKPLNTFVQVQEDLFC--FAMI 409
Query: 380 PPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
P +I GN+ Q V YD+ R + F P +C+
Sbjct: 410 PVSDLAI-FGNLAQMNFLVGYDLKSRTVSFKPTDCT 444
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 96/200 (48%), Gaps = 28/200 (14%)
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDP--FFYASKSKTFFKIP 186
+ +Y++ + +G P Q + L+ DTGSD+TW +C C P F A S TF
Sbjct: 80 SGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLARHSTTFSPTH 139
Query: 187 CNSTSCRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFT 242
C S+ C+++ + P CN C + Y+DGS + GF++ + T+ ++
Sbjct: 140 CFSSLCQLVPQPNP-NPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGREMKL 198
Query: 243 RYPFLLGCINNSSGDK------SGASGIMGLDRSPVSIITRTNTSY---FSYCL------ 287
+ GC ++SG +GASG+MGL R P+S ++ + FSYCL
Sbjct: 199 KS-IAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRSFSYCLLDYTLS 257
Query: 288 PSPYGSTGYITFGKTDTVNT 307
P P T Y+ G D V+T
Sbjct: 258 PPP---TSYLMIG--DVVST 272
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 55/173 (31%), Positives = 78/173 (45%), Gaps = 21/173 (12%)
Query: 136 VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF---KIPCNSTSC 192
++IG+P +++DTGSD+ W C PC +C F SKS TF K PC+
Sbjct: 105 ISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNDLGLLFDPSKSSTFSPLCKTPCD---- 160
Query: 193 RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCIN 252
F C PF + YAD S + G + D + E G L GC +
Sbjct: 161 --------FEGCRCDPIPFTVTYADNSTASGTFGRDTVVF-ETTDEGTSRISDVLFGCGH 211
Query: 253 NSSGDKS-GASGIMGLDRSPVSIITRTNTSYFSYC---LPSPYGSTGYITFGK 301
N D G +GI+GL+ P S++T+ FSYC L PY + + G+
Sbjct: 212 NIGHDTDPGHNGILGLNNGPDSLVTKLGQK-FSYCIGNLADPYYNYHQLILGE 263
>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
Length = 442
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 90/180 (50%), Gaps = 23/180 (12%)
Query: 115 TEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFF 174
++ +F N+ TV +A+G+P Q +S++LDTGS+++W CK + +P
Sbjct: 54 SDKLSFRHNVTLTVT------LAVGDPPQNISMVLDTGSELSWLHCKKSPNLGSVFNPV- 106
Query: 175 YASKSKTFFKIPCNSTSCRILRESFPF-GNCNSKE--CPFNIQYADGSGSGGFWATDRIT 231
S T+ +PC+S CR P +C+ K C I YAD + G A +
Sbjct: 107 ---SSSTYSPVPCSSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFV 163
Query: 232 IQEANSNGYFTRYPFLLGCIN----NSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
I G TR L GC++ ++S + + ++G+MG++R +S + + S FSYC+
Sbjct: 164 I------GSVTRPGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCI 217
>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 481
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 56/180 (31%), Positives = 81/180 (45%), Gaps = 27/180 (15%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC----------IHCFQQRDPFFYASKSK 180
+Y IG+P Q ++DTGSD+ WTQC C CF Q P++ S S+
Sbjct: 77 QYIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLSR 136
Query: 181 TFFKIPCNSTS---CRILRESFPF---GNCNSKECPFNIQYADGSGSGGFWATDRITIQE 234
T +PC+ C + E+ G C Y G G TD T
Sbjct: 137 TARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAGVALG-VLGTDAFTFPS 195
Query: 235 ANSNGYFTRYPFLLGCINN---SSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPY 291
++S GC++ S G +GASGI+GL R +S++++ N + FSYCL +PY
Sbjct: 196 SSS------VTLAFGCVSQTRISPGALNGASGIIGLGRGALSLVSQLNATEFSYCL-TPY 248
>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
Length = 447
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 57/188 (30%), Positives = 88/188 (46%), Gaps = 22/188 (11%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCR 193
+ VA+G P Q V+++LDTGS+++W C P F AS S ++ +PC ST+C
Sbjct: 57 VPVAVGTPPQNVTMVLDTGSELSWLLCNGSYA--PPLTPAFNASGSSSYGAVPCPSTACE 114
Query: 194 ILRESFPFGN-CN---SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
P C+ S C ++ YAD S + G ATD + Y G
Sbjct: 115 WRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGAPPVAVGAY---FG 171
Query: 250 CI--------NNSSGDKS----GASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYI 297
CI NS+G + A+G++G++R +S +T+T T F+YC+ +P G +
Sbjct: 172 CITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTRRFAYCI-APGEGPGVL 230
Query: 298 TFGKTDTV 305
G V
Sbjct: 231 LLGDDGGV 238
>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 447
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 57/188 (30%), Positives = 88/188 (46%), Gaps = 22/188 (11%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCR 193
+ VA+G P Q V+++LDTGS+++W C P F AS S ++ +PC ST+C
Sbjct: 57 VPVAVGTPPQNVTMVLDTGSELSWLLCNGSYA--PPLTPAFNASGSSSYGAVPCPSTACE 114
Query: 194 ILRESFPFGN-CN---SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
P C+ S C ++ YAD S + G ATD + Y G
Sbjct: 115 WRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGAPPVAVGAY---FG 171
Query: 250 CI--------NNSSGDKS----GASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYI 297
CI NS+G + A+G++G++R +S +T+T T F+YC+ +P G +
Sbjct: 172 CITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTRRFAYCI-APGEGPGVL 230
Query: 298 TFGKTDTV 305
G V
Sbjct: 231 LLGDDGGV 238
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 55/104 (52%), Gaps = 6/104 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + +G P ++LDTGSDV W QC PC C++Q P F +S ++ + C +
Sbjct: 128 EYFTKIGVGTPATQALMVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAA 187
Query: 191 SCRILRESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITI 232
CR L G C+ + C + + Y DGS + G + T+ +T
Sbjct: 188 LCRRLDS----GGCDLRRGACMYQVAYGDGSVTAGDFVTETLTF 227
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 54/111 (48%), Gaps = 4/111 (3%)
Query: 306 NTLRSAFHKRMKK-YKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL 364
+ LR AF + + G L DTCYDL V VP +++HF GG + L L
Sbjct: 377 SALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYL 436
Query: 365 V-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ V S C FA D +GN+QQ+G V +D G+R+GF P C
Sbjct: 437 IPVDSRGTFCFAFAG--TDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 485
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 59/197 (29%), Positives = 93/197 (47%), Gaps = 25/197 (12%)
Query: 127 TVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQR-DPFFYASKSKTFFKI 185
T + +Y++ + +G P Q + L+ DTGSD+ W +C C +C + F A S TF
Sbjct: 84 TGSGQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSAFLARHSTTFSPN 143
Query: 186 PCNSTSCRILRESFP-FGNCNSKE----CPFNIQYADGSGSGGFWATDRITI-----QEA 235
C ++C+++ P CN C + Y DGS + GF++ + T+ +EA
Sbjct: 144 HCYDSACQLV--PLPKHHRCNHARLHSPCRYEYSYGDGSKTSGFFSKETTTLNTSSGREA 201
Query: 236 NSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL----- 287
G F + + S +GA G+MGL R P+S+ ++ + FSYCL
Sbjct: 202 KLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNKFSYCLMDHDI 261
Query: 288 -PSPYGSTGYITFGKTD 303
PSP T Y+ G T
Sbjct: 262 SPSP---TSYLLIGSTQ 275
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 58/188 (30%), Positives = 92/188 (48%), Gaps = 21/188 (11%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
+Y + ++IG P +DTGSD+ W QC PC +C++Q +P F S T+ I S
Sbjct: 58 DYLMELSIGTPPVKTYAQVDTGSDLIWLQCIPCTNCYKQLNPMFDPQSSSTYSNIAYGSE 117
Query: 191 SCRILRESFPFGNC--NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
SC L + +C + C + Y D S + G A + +T+ + +
Sbjct: 118 SCSKLYST----SCSPDQNNCNYTYSYEDDSITEGVLAQETLTLTSTTGKPVALK-GVIF 172
Query: 249 GCINNSSG---DKSGASGIMGLDRSPVSIITRTNTSY----FSYCLPSPYGSTGYIT--- 298
GC +N++G DK GI+GL R P+S++++ +S+ FS CL P+ + IT
Sbjct: 173 GCGHNNNGVFNDKE--MGIIGLGRGPLSLVSQIGSSFGGKMFSQCL-VPFHTNPSITSPM 229
Query: 299 -FGKTDTV 305
FGK V
Sbjct: 230 SFGKGSEV 237
>gi|297605070|ref|NP_001056627.2| Os06g0118000 [Oryza sativa Japonica Group]
gi|55296430|dbj|BAD68553.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|215692556|dbj|BAG87976.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255676664|dbj|BAF18541.2| Os06g0118000 [Oryza sativa Japonica Group]
Length = 175
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 44/109 (40%), Positives = 57/109 (52%), Gaps = 6/109 (5%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
LR+AF M Y+ A + +LDTCYD S ++ +P IA+ F GG + LD G L+
Sbjct: 73 QALRAAFRSAMTMYRPAPPVS-ILDTCYDFSGVRSITLPSIALVFDGGATVNLDAAGILL 131
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
Q CL FA D +GNVQQR EV YDV G+ + F C
Sbjct: 132 -----QGCLAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 175
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 52/162 (32%), Positives = 82/162 (50%), Gaps = 14/162 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V +G P ++LDTGSDV W QC PC HC+ Q F +S+++ + C +
Sbjct: 121 EYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAP 180
Query: 191 SCRILRESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
CR L + C+ + C + + Y DGS + G +A++ +T +
Sbjct: 181 ICRRLDSA----GCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGARVQRVA-----I 231
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
GC +++ G ASG++GL R +S T+ S+ FSYCL
Sbjct: 232 GCGHDNEGLFIAASGLLGLGRGRLSFPTQIARSFGRSFSYCL 273
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 51/109 (46%), Gaps = 1/109 (0%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+R AF + + G L DTCY+LS V VP +++H GG + L L+
Sbjct: 368 EAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLI 427
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
S FA D +GN+QQ+G V +D +R+GF P +C
Sbjct: 428 PVDTSGT-FCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 475
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 55/163 (33%), Positives = 81/163 (49%), Gaps = 16/163 (9%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + +G P ++LDTGSDV W QC PC C+ Q F S ++ + C +
Sbjct: 146 EYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGAVDCAAP 205
Query: 191 SCRILRESFPFGNCN--SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL- 247
CR L G C+ K C + + Y DGS + G +AT+ +T R P +
Sbjct: 206 LCRRLDS----GGCDLRRKACLYQVAYGDGSVTAGDFATETLTFASG------ARVPRVA 255
Query: 248 LGCINNSSGDKSGASGIMGLDRSPVSI---ITRTNTSYFSYCL 287
LGC +++ G A+G++GL R +S I+R FSYCL
Sbjct: 256 LGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCL 298
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 42/109 (38%), Positives = 55/109 (50%), Gaps = 3/109 (2%)
Query: 307 TLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV- 365
LR AF + + G L DTCYDLS + V VP +++HF GG + L L+
Sbjct: 394 ALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIP 453
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V S C FA D +GN+QQ+G V +D G+RLGF P C
Sbjct: 454 VDSRGTFCFAFAGT--DGGVSIIGNIQQQGFRVVFDGDGQRLGFVPKGC 500
>gi|115448347|ref|NP_001047953.1| Os02g0720500 [Oryza sativa Japonica Group]
gi|113537484|dbj|BAF09867.1| Os02g0720500, partial [Oryza sativa Japonica Group]
Length = 172
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 45/110 (40%), Positives = 58/110 (52%), Gaps = 8/110 (7%)
Query: 306 NTLRSAFHKRMKKYK-KAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL 364
+ LRSAF M Y + +LDTCYD + Y TV +P I+I F GG ++L G L
Sbjct: 70 SALRSAFRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAFGGGAAMDLGTSGIL 129
Query: 365 VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ CL FA D + LGNVQQR EV +D G +GF P +C
Sbjct: 130 -----TSGCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMPASC 172
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 88/408 (21%), Positives = 151/408 (37%), Gaps = 84/408 (20%)
Query: 86 EEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYV 145
E L Q + R ++ R L+ L F + V YY + +G P +
Sbjct: 40 EMELSQLKARDEARHGRLLQS-----LGGVIDFPVDGTFDPFVVGLYYTKLRLGTPPRDF 94
Query: 146 SLLLDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFFKIPCNSTSCRILRESFP 200
+ +DTGSDV W C C C Q + FF S T I C+ C +S
Sbjct: 95 YVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSD 154
Query: 201 FG-NCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYF--TRYPFLLGCINNSSGD 257
G + + C + QY DGSG+ GF+ +D + + + P + GC + +GD
Sbjct: 155 SGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGD 214
Query: 258 ----KSGASGIMGLDRSPVSIITRTNTS-----YFSYCLPSPYGSTGYITFGKTD----- 303
GI G + +S+I++ + FS+CL G G + G+
Sbjct: 215 LVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVLGEIVEPNMV 274
Query: 304 -----------TVNTLRSAFHKRM-----KKYKKAKGLEDLLDT---------------- 331
VN L + + + + + G ++DT
Sbjct: 275 FTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFV 334
Query: 332 --------------------CYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV----VA 367
CY ++ + P ++++F GG + L+ + L+ V
Sbjct: 335 EAITNAVSQSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVG 394
Query: 368 SVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ C+GF +I LG++ + YD+ G+R+G+ +CS
Sbjct: 395 GTAVWCIGFQRIQNQGITI-LGDLVLKDKIFVYDLVGQRIGWANYDCS 441
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 56/163 (34%), Positives = 93/163 (57%), Gaps = 12/163 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
+Y + ++G P ++DTGSD+ W QC+PC C+ Q P F SKS ++ I C+S
Sbjct: 86 DYIMSYSVGTPPIKSYGIVDTGSDIVWLQCEPCEQCYNQTTPKFNPSKSSSYKNISCSSK 145
Query: 191 SCRILRESFPFGNCNSKE-CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLL 248
C+ +R++ +CN K+ C ++I Y + S S G + + +T++ ++ G +P ++
Sbjct: 146 LCQSVRDT----SCNDKKNCEYSINYGNQSHSQGDLSLETLTLE--STTGRPVSFPKTVI 199
Query: 249 GCINNSSGD-KSGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
GC N+ G K +SG++GL P S+IT+ S FSYCL
Sbjct: 200 GCGTNNIGSFKRVSSGVVGLGGGPASLITQLGPSIGGKFSYCL 242
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 30/123 (24%), Positives = 50/123 (40%), Gaps = 5/123 (4%)
Query: 293 STGYITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLG 352
S+ +TF +D L SA + ++ CY++S+ E P + HF
Sbjct: 318 SSTIVTFVPSDVYTKLNSAI-VDLVTLERVDDPNQQFSLCYNVSSDEEYDFPYMTAHF-K 375
Query: 353 GVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPG 412
G D+ L T V + +C FA P G+ Q+ V YD+ + + F
Sbjct: 376 GADILLYATNTFVEVARDVLCFAFA---PSNGGAIFGSFSQQDFMVGYDLQQKTVSFKSV 432
Query: 413 NCS 415
+C+
Sbjct: 433 DCT 435
>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
Length = 413
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 61/221 (27%), Positives = 98/221 (44%), Gaps = 18/221 (8%)
Query: 110 EFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQ 168
F + + FP + N Y + + IG+P + L LDTGSD+TW QC PC+ C +
Sbjct: 26 RFTRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLE 85
Query: 169 QRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATD 228
P + S IPCN C+ L + ++C + ++YADG S G D
Sbjct: 86 APHPLYQPSSD----LIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRD 141
Query: 229 RITIQEANSNGYFTRYPFLLGCINNSSGDKSG---ASGIMGLDRSPVSIITRTNTSYF-- 283
++ R LGC + S G++GL R VSI+++ ++ +
Sbjct: 142 VFSMNYTQGLRLTPR--LALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVK 199
Query: 284 ---SYCLPSPYGSTGYITFGKTDTVNTLRSAFHKRMKKYKK 321
+CL S G G + FG D ++ R ++ ++Y K
Sbjct: 200 NVIGHCLSSLGG--GILFFGD-DLYDSSRVSWTPMSREYSK 237
>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
Length = 501
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 55/163 (33%), Positives = 81/163 (49%), Gaps = 16/163 (9%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ + +G P ++LDTGSDV W QC PC C+ Q F S ++ + C +
Sbjct: 146 EYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGAVDCAAP 205
Query: 191 SCRILRESFPFGNCN--SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL- 247
CR L G C+ K C + + Y DGS + G +AT+ +T R P +
Sbjct: 206 LCRRLDS----GGCDLRRKACLYQVAYGDGSVTAGDFATETLTFASG------ARVPRVA 255
Query: 248 LGCINNSSGDKSGASGIMGLDRSPVSI---ITRTNTSYFSYCL 287
LGC +++ G A+G++GL R +S I+R FSYCL
Sbjct: 256 LGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCL 298
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 37/88 (42%), Positives = 48/88 (54%), Gaps = 3/88 (3%)
Query: 328 LLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-VASVSQVCLGFATYPPDPNSI 386
L DTCYDLS + V VP +++HF GG + L L+ V S C FA D
Sbjct: 416 LFDTCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGT--DGGVS 473
Query: 387 TLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+GN+QQ+G V +D G+RLGF P C
Sbjct: 474 IIGNIQQQGFRVVFDGDGQRLGFVPKGC 501
>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 61/221 (27%), Positives = 98/221 (44%), Gaps = 18/221 (8%)
Query: 110 EFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQ 168
F + + FP + N Y + + IG+P + L LDTGSD+TW QC PC+ C +
Sbjct: 38 RFTRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLE 97
Query: 169 QRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATD 228
P + S IPCN C+ L + ++C + ++YADG S G D
Sbjct: 98 APHPLYQPSSD----LIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRD 153
Query: 229 RITIQEANSNGYFTRYPFLLGCINNSSGDKSG---ASGIMGLDRSPVSIITRTNTSYF-- 283
++ R LGC + S G++GL R VSI+++ ++ +
Sbjct: 154 VFSMNYTQGLRLTPR--LALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVK 211
Query: 284 ---SYCLPSPYGSTGYITFGKTDTVNTLRSAFHKRMKKYKK 321
+CL S G G + FG D ++ R ++ ++Y K
Sbjct: 212 NVIGHCLSSLGG--GILFFGD-DLYDSSRVSWTPMSREYSK 249
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 69/258 (26%), Positives = 99/258 (38%), Gaps = 22/258 (8%)
Query: 58 DKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEA 117
D + L V+ Y CS P +E + K+ RL+ ++T A
Sbjct: 31 DTSDLSVIPIYSKCSPF-------VPPKQESWVNTVITMASKDPERLKYLSTLADQKTTA 83
Query: 118 FTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYAS 177
Y + V +G P Q + ++LDT +D W PC C F +
Sbjct: 84 VPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWV---PCSGCTGFSSTTFLPN 140
Query: 178 KSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANS 237
S T + C+ C +R F S C FN Y S D IT+
Sbjct: 141 ASTTLGSLDCSGAQCSQVR-GFSCPATGSSACLFNQSYGGDSSLTATLVQDAITLANDVI 199
Query: 238 NGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS--PYG 292
G F GCIN SG G++GL R P+S+I++ Y FSYCLPS Y
Sbjct: 200 PG------FTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYY 253
Query: 293 STGYITFGKTDTVNTLRS 310
+G + G ++R+
Sbjct: 254 FSGSLKLGPVGQPKSIRT 271
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 51/106 (48%), Gaps = 9/106 (8%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-V 366
+R F K++ + G DTC+ +A P I +HF G++L L + +L+
Sbjct: 334 IRDEFRKQVNGPISSLGA---FDTCF--AATNEAEAPAITLHF-EGLNLVLPMENSLIHS 387
Query: 367 ASVSQVCLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGRRLGFG 410
+S S CL A P + NS+ + N+QQ+ + +D RLG
Sbjct: 388 SSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRLGIA 433
>gi|125554848|gb|EAZ00454.1| hypothetical protein OsI_22475 [Oryza sativa Indica Group]
Length = 538
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 89/180 (49%), Gaps = 16/180 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCK-PCIHCFQQRDPFFYASKSKTFFKIPCNS 189
+YY + IG P + L +DTGSD+TW QC PC +C + P + K +P
Sbjct: 158 QYYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPEKPNV---VPPRD 214
Query: 190 TSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
+ C+ L+ + +G+ SK+C + I YAD S S G A D +Q ++G F+ G
Sbjct: 215 SYCQELQGNQNYGD-TSKQCDYEITYADRSSSMGILARD--NMQLITADGERENLDFVFG 271
Query: 250 CINNSSGD----KSGASGIMGLDRSPVSIITRTN-----TSYFSYCLPSPYGSTGYITFG 300
C + G+ + GI+GL + +S+ T+ ++ F +C+ + + GY+ G
Sbjct: 272 CGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAADPSNGGYMFLG 331
>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 61/221 (27%), Positives = 98/221 (44%), Gaps = 18/221 (8%)
Query: 110 EFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQ 168
F + + FP + N Y + + IG+P + L LDTGSD+TW QC PC+ C +
Sbjct: 38 RFTRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLE 97
Query: 169 QRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATD 228
P + S IPCN C+ L + ++C + ++YADG S G D
Sbjct: 98 APHPLYQPSSD----LIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRD 153
Query: 229 RITIQEANSNGYFTRYPFLLGCINNSSGDKSG---ASGIMGLDRSPVSIITRTNTSYF-- 283
++ R LGC + S G++GL R VSI+++ ++ +
Sbjct: 154 VFSMNYTKGLRLTPR--LALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVK 211
Query: 284 ---SYCLPSPYGSTGYITFGKTDTVNTLRSAFHKRMKKYKK 321
+CL S G G + FG D ++ R ++ ++Y K
Sbjct: 212 NVIGHCLSSLGG--GILFFGD-DLYDSSRVSWTPMSREYSK 249
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 449
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 81/269 (30%), Positives = 117/269 (43%), Gaps = 37/269 (13%)
Query: 60 ASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSR-----RLRKPFPEFLKR 114
A + +VS + N G S + ++ +D L N R RLR F + R
Sbjct: 14 AFISMVSAFSLVEARNAGFSAN------LIHRDSSVSPLYNPRDTYFDRLRNSFHRSISR 67
Query: 115 TEAFTFPANIN-------DTV--ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIH 165
F P +I+ D V EY + ++IG P+ + + DTGSD+ W QC+PC
Sbjct: 68 ANRFK-PNSISARALVQSDIVPGGGEYLMRISIGNPQVEILAIADTGSDLIWVQCQPCEM 126
Query: 166 CFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNS----KECPFNIQYADGSGS 221
C++Q P F +S ++ + C + C L +C++ K C + Y D S S
Sbjct: 127 CYKQNSPIFDPRRSSSYRNVLCGNEFCNKLDGEAR--SCDARGFVKTCGYTYSYGDQSFS 184
Query: 222 GGFWATDRITIQEANSN-----GYFTRYPFLLGCINNSSGDK--SGASGIMGLDRSPVSI 274
G A +R I NSN YF F G N + D+ SG G+ G S VS
Sbjct: 185 DGHLAIERFGIGSTNSNTSAAIAYFQEVAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQ 244
Query: 275 ITRTNTSYFSYCL-PSPYGS--TGYITFG 300
+ + FSYCL P+ S T I FG
Sbjct: 245 LGPKLSGKFSYCLVPTSEQSNYTSKINFG 273
>gi|79495937|ref|NP_567922.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660833|gb|AEE86233.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 401
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 61/221 (27%), Positives = 98/221 (44%), Gaps = 18/221 (8%)
Query: 110 EFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQ 168
F + + FP + N Y + + IG+P + L LDTGSD+TW QC PC+ C +
Sbjct: 35 RFTRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLE 94
Query: 169 QRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATD 228
P + S IPCN C+ L + ++C + ++YADG S G D
Sbjct: 95 APHPLYQPSSD----LIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRD 150
Query: 229 RITIQEANSNGYFTRYPFLLGCINNSSGDKSG---ASGIMGLDRSPVSIITRTNTSYF-- 283
++ R LGC + S G++GL R VSI+++ ++ +
Sbjct: 151 VFSMNYTQGLRLTPR--LALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVK 208
Query: 284 ---SYCLPSPYGSTGYITFGKTDTVNTLRSAFHKRMKKYKK 321
+CL S G G + FG D ++ R ++ ++Y K
Sbjct: 209 NVIGHCLSSLGG--GILFFGD-DLYDSSRVSWTPMSREYSK 246
>gi|115467508|ref|NP_001057353.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|53791766|dbj|BAD53531.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|53793187|dbj|BAD54393.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113595393|dbj|BAF19267.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|125596798|gb|EAZ36578.1| hypothetical protein OsJ_20919 [Oryza sativa Japonica Group]
gi|215767941|dbj|BAH00170.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 538
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 89/180 (49%), Gaps = 16/180 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCK-PCIHCFQQRDPFFYASKSKTFFKIPCNS 189
+YY + IG P + L +DTGSD+TW QC PC +C + P + K +P
Sbjct: 158 QYYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPEKPNV---VPPRD 214
Query: 190 TSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
+ C+ L+ + +G+ SK+C + I YAD S S G A D +Q ++G F+ G
Sbjct: 215 SYCQELQGNQNYGD-TSKQCDYEITYADRSSSMGILARD--NMQLITADGERENLDFVFG 271
Query: 250 CINNSSGD----KSGASGIMGLDRSPVSIITRTN-----TSYFSYCLPSPYGSTGYITFG 300
C + G+ + GI+GL + +S+ T+ ++ F +C+ + + GY+ G
Sbjct: 272 CGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAADPSNGGYMFLG 331
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 88/408 (21%), Positives = 151/408 (37%), Gaps = 84/408 (20%)
Query: 86 EEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYV 145
E L Q + R ++ R L+ L F + V YY + +G P +
Sbjct: 40 EMELSQLKARDEARHGRLLQS-----LGGVIDFPVDGTFDPFVVGLYYTKLRLGTPPRDF 94
Query: 146 SLLLDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFFKIPCNSTSCRILRESFP 200
+ +DTGSDV W C C C Q + FF S T I C+ C +S
Sbjct: 95 YVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSD 154
Query: 201 FG-NCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYF--TRYPFLLGCINNSSGD 257
G + + C + QY DGSG+ GF+ +D + + + P + GC + +GD
Sbjct: 155 SGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGD 214
Query: 258 ----KSGASGIMGLDRSPVSIITRTNTS-----YFSYCLPSPYGSTGYITFGKTD----- 303
GI G + +S+I++ + FS+CL G G + G+
Sbjct: 215 LVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVLGEIVEPNMV 274
Query: 304 -----------TVNTLRSAFHKRM-----KKYKKAKGLEDLLDT---------------- 331
VN L + + + + + G ++DT
Sbjct: 275 FTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFV 334
Query: 332 --------------------CYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV----VA 367
CY ++ + P ++++F GG + L+ + L+ V
Sbjct: 335 EAITNAVSQSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVG 394
Query: 368 SVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ C+GF +I LG++ + YD+ G+R+G+ +CS
Sbjct: 395 GTAVWCIGFQRIQNQGITI-LGDLVLKDKIFVYDLVGQRIGWANYDCS 441
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 89/342 (26%), Positives = 141/342 (41%), Gaps = 48/342 (14%)
Query: 84 SLEEILRQDQQRLHL------KNSRRLRKPFPEFLKRTEAFTFPANIND-TVADEYYIVV 136
SL ++ R D+QR+ + +R AF P T +Y++
Sbjct: 42 SLADLARSDRQRMAFIASHGRRRTRETAAGSSSASSAAAAFAMPLTSGAYTGIGQYFVRF 101
Query: 137 AIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDP---------FFYASKSKTFFKIPC 187
+G P Q L+ DTGSD+TW +C+ P F S+T+ I C
Sbjct: 102 RVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSRTWAPISC 161
Query: 188 NSTSCRILRESFPF--GNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
S +C +S PF C + C ++ +Y DGS + G T+ TI + +
Sbjct: 162 ASDTC---TKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGREERKAK 218
Query: 244 YP-FLLGCINNSSGDKSGAS-GIMGLDRSPVSIITRTNTSY---FSYCLP---SPYGSTG 295
+LGC ++ +G AS G++ L S +S + + + FSYCL SP +T
Sbjct: 219 LKGLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFGGRFSYCLVDHLSPRNATS 278
Query: 296 YITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDT----CYDLS------AYETVVVPK 345
Y+TFG V++ R++ +A+ LLD YD+S A E + +P+
Sbjct: 279 YLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAGEFLKIPR 338
Query: 346 IAIHFLGGVDLELDVRGTLVV-------ASVSQVCLGFATYP 380
G + LD +L V A V+ + G A P
Sbjct: 339 AVWDVEAGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAGLP 380
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 45/93 (48%), Gaps = 5/93 (5%)
Query: 327 DLLDTCYDLSAYE----TVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPD 382
D + CY+ ++ V VPK+A+HF G LE + ++ A+ C+G P
Sbjct: 385 DPFEYCYNWTSPSGKDADVAVPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEG-PW 443
Query: 383 PNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
P +GN+ Q+ H +D+ RRL F C+
Sbjct: 444 PGISVIGNILQQEHLWEFDIKNRRLKFQRSRCT 476
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 57/174 (32%), Positives = 83/174 (47%), Gaps = 14/174 (8%)
Query: 121 PANIN-DTVADE-YYIVVAIGEPKQYVSLLLDTGSDVTWTQCK----PCIHCFQQRDPFF 174
PA++ ++D+ + + V IG P Q L++DTGSD+ WTQCK + P +
Sbjct: 78 PADVRLSPLSDQGHSLTVGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVY 137
Query: 175 YASKSKTFFKIPCNSTSCRILRESFPFGNCNSK-ECPFNIQYADGSGSGGFWATDRITIQ 233
+S TF +PC+ C+ F F NC SK C + Y + G A++ T
Sbjct: 138 DPGESSTFAFLPCSDRLCQ--EGQFSFKNCTSKNRCVYEDVYGSAAAV-GVLASETFTF- 193
Query: 234 EANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
R F GC S+G GA+GI+GL +S+IT+ FSYCL
Sbjct: 194 -GARRAVSLRLGF--GCGALSAGSLIGATGILGLSPESLSLITQLKIQRFSYCL 244
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 23/79 (29%), Positives = 32/79 (40%)
Query: 336 SAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRG 395
+A E V VP + +HF GG + L +CL +GNVQQ+
Sbjct: 370 AAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQN 429
Query: 396 HEVHYDVAGRRLGFGPGNC 414
V +DV + F P C
Sbjct: 430 MHVLFDVQHHKFSFAPTQC 448
>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 105/441 (23%), Positives = 173/441 (39%), Gaps = 93/441 (21%)
Query: 47 NRTRTALPQGPDKA--SLEVVSKYGPCSRLNQGISTHAPS----LEEILRQDQQRLHLKN 100
+ +R++ P P A +L+V +GPCS L G T APS L + +D RL +
Sbjct: 29 SHSRSSCPATPPDAGNTLQVSHAFGPCSPLGPG--TAAPSWAGFLADQASRDASRLLYLD 86
Query: 101 SRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVA-IGEPKQYVSLLLDTGSDVTWTQ 159
S +R R A+ A+ + Y+V A +G P Q + L +DT +D +W
Sbjct: 87 SLAVRG-------RARAYAPIASGRQLLQTLTYVVRASLGTPPQQLLLAVDTSNDASWIP 139
Query: 160 CKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNC--NSKECPFNIQYAD 217
C C C F + S ++ +PC S C P C K C F++ YAD
Sbjct: 140 CAGCAGCPTSSAAPFDPAASASYRTVPCGSPLC----AQAPNAACPPGGKACGFSLTYAD 195
Query: 218 GSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITR 277
S + D + + Y GC+ ++G + G++GL R P+S +++
Sbjct: 196 SSLQAAL-SQDSLAVAGNAVKAY------TFGCLQRATGTAAPPQGLLGLGRGPLSFLSQ 248
Query: 278 TNTSY---FSYCLPS--PYGSTGYITFGKTDT----------VNTLRSAFH--------- 313
T Y FSYCLPS +G + G+ N RS+ +
Sbjct: 249 TKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQPQRIKTTPLLANPHRSSLYYVNMTGVRV 308
Query: 314 ----KRMKKYKKAKGLEDLLDT--------------------------CYDLSAYET--- 340
+ + A G +LD+ L ++T
Sbjct: 309 GRKVVPIPAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRVGAPVSSLGGFDTCFN 368
Query: 341 ---VVVPKIAIHFLGGVDLELDVRGTLVVASVSQV-CLGFATYPPDPNSI--TLGNVQQR 394
V P + + F G+ + L ++ ++ + CL A P N++ + ++QQ+
Sbjct: 369 TTAVAWPPMTLLF-DGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQ 427
Query: 395 GHEVHYDVAGRRLGFGPGNCS 415
H V +DV R+GF C+
Sbjct: 428 NHRVLFDVPNGRVGFARERCT 448
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 54/187 (28%), Positives = 96/187 (51%), Gaps = 22/187 (11%)
Query: 107 PFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC 166
P F + F N++ TV+ + +G P Q VS++LDTGS+++W +C
Sbjct: 66 PSGSFPRSPNKLHFHHNVSLTVS------LTVGTPPQNVSMVLDTGSELSWLRCNK-TQT 118
Query: 167 FQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPF-GNCNSKE-CPFNIQYADGSGSGGF 224
FQ F ++S ++ +PC+S +C FP +C+S + C + YAD S S G
Sbjct: 119 FQTT---FDPNRSSSYSPVPCSSLTCTDRTRDFPIPASCDSNQLCHAILSYADASSSEGN 175
Query: 225 WATDRITIQEANSNGYFTRYPFLLGCINNS----SGDKSGASGIMGLDRSPVSIITRTNT 280
A+D I ++ G + GC+++S + + S +G+MG++R +S +++ +
Sbjct: 176 LASDTFYIGNSDMPGT------IFGCMDSSFSTNTEEDSKNTGLMGMNRGSLSFVSQMDF 229
Query: 281 SYFSYCL 287
FSYC+
Sbjct: 230 PKFSYCI 236
>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 114/430 (26%), Positives = 173/430 (40%), Gaps = 92/430 (21%)
Query: 57 PDK-ASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRT 115
PD A+L+V +GPCS L G + APS L R SR L
Sbjct: 38 PDAGATLQVSHAFGPCSPL--GNAAAAPSWAGFLADQSSR---DASRLLY--LDSLAVAG 90
Query: 116 EAFTFPANINDTVADEYYIVVA-IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFF 174
A+ A+ + Y+V A +G P Q + L +DT +D W C C C PF
Sbjct: 91 RAYAPIASGRQLLQTPTYVVRARLGTPPQQLLLAVDTSNDAAWIPCSGCAGC-PTTTPFN 149
Query: 175 YASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQE 234
A+ SK++ +PC S +C R P + N+K C F++ YAD S + D + +
Sbjct: 150 PAA-SKSYRAVPCGSPACS--RAPNPSCSLNTKSCGFSLTYADSSLEAAL-SQDSLAV-- 203
Query: 235 ANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS-- 289
+N Y F GC+ ++G + G++GL R P+S +++T Y FSYCLPS
Sbjct: 204 --ANDVVKSYTF--GCLQKATGTATPPQGLLGLGRGPLSFLSQTKDMYEGTFSYCLPSFK 259
Query: 290 PYGSTGYITFGKTDT----------VNTLRSAFHK------RMKK-----------YKKA 322
+G + G+ VN RS+ + R+ K + A
Sbjct: 260 SLNFSGTLRLGRKGQPLRIKTTPLLVNPHRSSLYYVSMTGIRVGKKVVPIPPAALAFDPA 319
Query: 323 KGLEDLLDT---------------------------CYDLSAYET-----VVVPKIAIHF 350
G +LD+ L ++T V P + F
Sbjct: 320 TGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRIRGAPLSSLGGFDTCYNTTVKWPPVTFMF 379
Query: 351 LG-GVDLELDVRGTLVVASV--SQVCLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGR 405
G V L D LV+ S + CL A P N++ + ++QQ+ H + +DV
Sbjct: 380 TGMQVTLPAD---NLVIHSTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRILFDVPNG 436
Query: 406 RLGFGPGNCS 415
R+GF C+
Sbjct: 437 RVGFAREQCT 446
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 52/162 (32%), Positives = 81/162 (50%), Gaps = 14/162 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V +G P ++LDTGSDV W QC PC HC+ Q F +S+++ + C +
Sbjct: 127 EYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAP 186
Query: 191 SCRILRESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
CR L + C+ + C + + Y DGS + G +A++ +T +
Sbjct: 187 ICRRLDSA----GCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGARVQRVA-----I 237
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSI---ITRTNTSYFSYCL 287
GC +++ G ASG++GL R +S I R+ FSYCL
Sbjct: 238 GCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCL 279
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 51/109 (46%), Gaps = 1/109 (0%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+R AF + + G L DTCY+LS V VP +++H GG + L L+
Sbjct: 374 EAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLI 433
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
S FA D +GN+QQ+G V +D +R+GF P +C
Sbjct: 434 PVDTSGT-FCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 481
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 60/222 (27%), Positives = 96/222 (43%), Gaps = 24/222 (10%)
Query: 94 QRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGS 153
+ L + RRLR+ PE + AF + + YY + +G P Q + +DTGS
Sbjct: 14 RTLREHDQRRLRRILPEVV----AFPISGDDDTFTTGLYYTRIYLGTPPQQFYVHVDTGS 69
Query: 154 DVTWTQCKPCIHCFQQRD-----PFFYASKSKTFFKIPCNSTSCRILRESFPFGNC--NS 206
DV W C PC +C + + F KS + I C C + S C NS
Sbjct: 70 DVAWVNCVPCTNCKRASNVALPISIFDPEKSTSKTSISCTDEECYLASNS----KCSFNS 125
Query: 207 KECPFNIQYADGSGSGGFWATDRITIQE---ANSNGYFTRYPFLLGCINNSSGDKSGASG 263
CP++ Y DGS + G+ D ++ + NS GC +N +G G
Sbjct: 126 MSCPYSTLYGDGSSTAGYLINDVLSFNQVPSGNSTATSGTARLTFGCGSNQTGTWL-TDG 184
Query: 264 IMGLDRSPVSI---ITRTNTSY--FSYCLPSPYGSTGYITFG 300
++G ++ VS+ +++ N S F++CL +G + G
Sbjct: 185 LVGFGQAEVSLPSQLSKQNVSVNIFAHCLQGDNKGSGTLVIG 226
>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
Length = 418
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 88/351 (25%), Positives = 130/351 (37%), Gaps = 72/351 (20%)
Query: 126 DTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKI 185
D+ Y + ++G P Q +S L DTGSD+ W +C C C + +Y +KS +F K+
Sbjct: 75 DSGGGAYDMTFSMGTPPQTLSALADTGSDLIWAKCGACKRCAPRGSASYYPTKSSSFSKL 134
Query: 186 PCNSTSCRILRESFPFGNCNSKE-----CPFNIQYADGSG----SGGFWATDRITIQEAN 236
PC+S CR L ES C C + Y S + G+ ++ T+
Sbjct: 135 PCSSALCRTL-ESQSLATCGGTRARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTLGSDA 193
Query: 237 SNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGY 296
G GC S G SG++GL R +S++ + FSYCL S ++
Sbjct: 194 VQG------IGFGCTTMSEGGYGSGSGLVGLGRGKLSLVRQLKVGAFSYCLTSDPSTSSP 247
Query: 297 ITFG----------KTDTVNTLRSAFH------------------KRMKKYKKAKGLEDL 328
+ FG T VN S F+ + + L L
Sbjct: 248 LLFGAGALTGPGVQSTPLVNLKTSTFYTVNLDSISIGAAKTPGTGRHGIIFDSGTTLTFL 307
Query: 329 LDTCYDLS------------------AYET-------VVVPKIAIHFLGGVDLELDVRGT 363
+ Y L+ YE V P + +HF GG D+ L
Sbjct: 308 AEPAYTLAEAGLLSQTTNLTRVPGTDGYEVCFQTSGGAVFPSMVLHFDGG-DMALKTENY 366
Query: 364 LVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ S C P SI +GN+ Q + + YD+ L F P NC
Sbjct: 367 FGAVNDSVSCW-LVQKSPSEMSI-VGNIMQMDYHIRYDLDKSVLSFQPTNC 415
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 52/162 (32%), Positives = 81/162 (50%), Gaps = 14/162 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V +G P ++LDTGSDV W QC PC HC+ Q F +S+++ + C +
Sbjct: 121 EYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAP 180
Query: 191 SCRILRESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
CR L + C+ + C + + Y DGS + G +A++ +T +
Sbjct: 181 ICRRLDSA----GCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGARVQRVA-----I 231
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSI---ITRTNTSYFSYCL 287
GC +++ G ASG++GL R +S I R+ FSYCL
Sbjct: 232 GCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCL 273
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 35/109 (32%), Positives = 51/109 (46%), Gaps = 1/109 (0%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+R AF + + G L DTCY+LS V VP +++H GG + L L+
Sbjct: 368 EAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLI 427
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
S FA D +GN+QQ+G V +D +R+GF P +C
Sbjct: 428 PVDTSGT-FCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 475
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 51/169 (30%), Positives = 85/169 (50%), Gaps = 15/169 (8%)
Query: 127 TVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIP 186
T + EY +A+G P L +DTGSD+TW QC+PC C+ Q P F S ++ ++
Sbjct: 129 TTSGEYMAKIAVGTPAVEALLAMDTGSDITWLQCQPCRRCYPQSGPVFDPRHSTSYREMG 188
Query: 187 CNSTSCRILRESFPFGNCNSKECPFNIQYA-DGSGSGGFWATDRITIQEANSNGYFTRYP 245
++ C+ L S G+ C + + Y DGS + G + + +T + P
Sbjct: 189 YDAPDCQALGRSG-GGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFAGG------VQVP 241
Query: 246 FL-LGCINNSSGD-KSGASGIMGLDRSPVSIITRT-----NTSYFSYCL 287
+ +GC +++ G + A+GI+GL R +S ++ N + FSYCL
Sbjct: 242 HMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQIAALGYNVTSFSYCL 290
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 52/92 (56%), Gaps = 3/92 (3%)
Query: 324 GLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-VASVSQVCLGFATYPPD 382
G DTCY + + VP +++HF GGV+L L + L+ V S+ VC FA
Sbjct: 404 GPSGFFDTCYTMGG-RAMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDR 462
Query: 383 PNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
SI +GN+QQ+G V Y++ G R+GF P +C
Sbjct: 463 SVSI-IGNIQQQGFRVVYNIGGGRVGFAPNSC 493
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 56/189 (29%), Positives = 88/189 (46%), Gaps = 11/189 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + ++G P + DTGSD++W QC PC C+ Q P F ++S T+ +PC S
Sbjct: 87 EYLMRFSLGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQEAPLFDPTQSSTYVDVPCESQ 146
Query: 191 SCRILRESFPFGNC-NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLL 248
C + ++ C +SK+C + QY S + G D I+ +P +
Sbjct: 147 PCTLFPQN--QRECGSSKQCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKSVF 204
Query: 249 GCI---NNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL-PSPYGSTGYITFGK 301
GC N + + A+G +GL P+S+ ++ FSYC+ P STG + FG
Sbjct: 205 GCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIGHKFSYCMVPFSSTSTGKLKFGS 264
Query: 302 TDTVNTLRS 310
N + S
Sbjct: 265 MAPTNEVVS 273
>gi|357125298|ref|XP_003564331.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 524
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 59/117 (50%), Gaps = 14/117 (11%)
Query: 308 LRSAFHKRMKKYK----------KAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLE 357
LR AF M+ Y+ G E +LDTCYD + V VP +++ F GG ++
Sbjct: 412 LRLAFRNAMRGYRMNTRNGSTSSTPAGGEMILDTCYDFEGLDNVTVPTVSLVFFGGAVVD 471
Query: 358 LDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
LD A + + CL F P D + +GNVQQ+ HEV YDV R +GF G C
Sbjct: 472 LDP----TTAVMMEGCLAFVPTPADFDLGFIGNVQQQTHEVLYDVGARNVGFRRGAC 524
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 87/321 (27%), Positives = 137/321 (42%), Gaps = 54/321 (16%)
Query: 21 GAYADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGIST 80
GA D +V S P ++C+ + A+P G ++ + + Y PCS + S
Sbjct: 28 GAGGDQERRQRFTVVQTSHFQPQSICSGLK-AIPSGKNRTWVPLHRPYSPCSPSSS-PSP 85
Query: 81 HAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVA--- 137
PSL EILR DQ R + RR A +PA + +V+ + +V+
Sbjct: 86 PPPSLLEILRWDQVRT--ASVRRKAMSGHAGSHDDVAEYYPATPHVSVSQRDFALVSTFG 143
Query: 138 IGEPKQ--------------YVSLLLDTGSDVTW--TQCKPCIHCFQQRDPFFYASKSKT 181
IG ++ +DT D+ W + P C+ QR+ F +KS +
Sbjct: 144 IGSGAAGSLDDDDDGDPMVLAQTMAIDTTIDIPWIQCRPCPPPQCYPQRNALFDPTKSFS 203
Query: 182 FFKIPCNSTSCRILRESFPFGN------------------CNSKECPFNIQYADGSGSGG 223
+PC S +CR L +GN ++ +C + + Y+DG S G
Sbjct: 204 AAAVPCGSRACRALGN---YGNGCSNNSRRNKKKNKSKSNNSTGDCNYRVAYSDGRVSSG 260
Query: 224 FWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSG-ASGIMGLDRSPVSIITRTNTSY 282
+ TD +TI S F + F GC + G SG SG M L S++++T +Y
Sbjct: 261 TYMTDILTISPGTS---FLNFRF--GCSHGVRGSFSGETSGTMSLGGGRQSLLSQTARAY 315
Query: 283 ---FSYCLPSPYGSTGYITFG 300
FSYC+P P S G+++ G
Sbjct: 316 GNAFSYCVPKPSAS-GFLSLG 335
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 54/163 (33%), Positives = 90/163 (55%), Gaps = 14/163 (8%)
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCI---HCFQQRDPFFYASKSKTFFKI 185
A EY+ + +G+P Q + DTGSDV+W QC+PC C++Q P F S ++ +
Sbjct: 181 AGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPL 240
Query: 186 PCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C+S C +L E+ C++ C + ++Y DGS + G AT+ + + +NS P
Sbjct: 241 SCDSEQCHLLDEA----ACDANSCIYEVEYGDGSFTVGELATETFSFRHSNS------IP 290
Query: 246 FL-LGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
L +GC +++ G GA G++GL +S+ ++ + FSYCL
Sbjct: 291 NLPIGCGHDNEGLFVGADGLIGLGGGAISLSSQLEATSFSYCL 333
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 46/121 (38%), Positives = 59/121 (48%), Gaps = 8/121 (6%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
IT +D + LR AF K A G+ DTCYDLS+ V VP IA G L
Sbjct: 408 ITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPF-DTCYDLSSQSNVEVPTIAFILPGENSL 466
Query: 357 ELDVRGTLV-VASVSQVCLGF--ATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
+L + L+ V S CL F +T+P +GNVQQ+G V YD+A +GF
Sbjct: 467 QLPAKNCLIQVDSAGTFCLAFLPSTFPLS----IIGNVQQQGIRVSYDLANSLVGFSTDK 522
Query: 414 C 414
C
Sbjct: 523 C 523
>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
Length = 557
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 54/184 (29%), Positives = 88/184 (47%), Gaps = 24/184 (13%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCK-PCIHCFQQRDPFFYASKSKTFFKIPCNS 189
+YY + +G P + L +DTGSD+TW QC PC +C + P + +K K +P
Sbjct: 186 QYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPTKEKI---VPPRD 242
Query: 190 TSCRILRESFPFGNCN----SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C+ L+ GN N K+C + I+YAD S S G A D + + +NG +
Sbjct: 243 LLCQELQ-----GNQNYCETCKQCDYEIEYADQSSSMGVLARDDMHL--IATNGGREKLD 295
Query: 246 FLLGCINNSSGD----KSGASGIMGLDRSPVSIITRTN-----TSYFSYCLPSPYGSTGY 296
F+ GC + G + GI+GL + +S+ ++ ++ F +C+ G GY
Sbjct: 296 FVFGCAYDQQGQLLSSPAKTDGILGLSNAAISLPSQLASHGIISNIFGHCITREQGGGGY 355
Query: 297 ITFG 300
+ G
Sbjct: 356 MFLG 359
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 54/163 (33%), Positives = 91/163 (55%), Gaps = 14/163 (8%)
Query: 129 ADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCI---HCFQQRDPFFYASKSKTFFKI 185
A EY+ + +G+P Q + DTGSDV+W QC+PC C++Q P F S ++ +
Sbjct: 181 AGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPL 240
Query: 186 PCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C+S C +L E+ C++ C + ++Y DGS + G AT+ + + +NS P
Sbjct: 241 SCDSEQCHLLDEA----ACDANSCIYEVEYGDGSFTVGELATETFSFRHSNS------IP 290
Query: 246 FL-LGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
L +GC +++ G GA+G++GL +S+ ++ + FSYCL
Sbjct: 291 NLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCL 333
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 46/121 (38%), Positives = 58/121 (47%), Gaps = 8/121 (6%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
IT +D + LR AF K A G+ DTCYDLS+ V VP IA G L
Sbjct: 408 ITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPF-DTCYDLSSQSNVEVPTIAFILPGENSL 466
Query: 357 ELDVRGTLV-VASVSQVCLGF--ATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
+L + L V S CL F +T+P +GNVQQ+G V YD+A +GF
Sbjct: 467 QLPAKNCLFQVDSAGTFCLAFLPSTFPLS----IIGNVQQQGIRVSYDLANSLVGFSTDK 522
Query: 414 C 414
C
Sbjct: 523 C 523
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 55/178 (30%), Positives = 89/178 (50%), Gaps = 23/178 (12%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKP---CIHCFQQRDPFFYASKSKTFFKIPC 187
+Y++ + +G P + L++DTGSD+TW QC P + P++ S S ++ +IPC
Sbjct: 58 QYFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREIPC 117
Query: 188 NSTSCRILRESFPFGNCNS----KECPFNIQYADGSGSGGFWATDRITIQEANSNG---- 239
C+ L P G+ S C + Y+D S + G A + I+++ +G
Sbjct: 118 TDDECQFL--PAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKRAG 175
Query: 240 -YFTRYPFL----LGCINNSSGDK-SGASGIMGLDRSPVSIITRTNTS----YFSYCL 287
+ TR + LGC S G GASG++GL + P+S+ T+T + FSYCL
Sbjct: 176 NHKTRRIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGGIFSYCL 233
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 47/89 (52%), Gaps = 1/89 (1%)
Query: 321 KAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYP 380
+A+ + + + CY+++ E + PK+ + F GG +EL +V+ + + C+
Sbjct: 334 RAQEIPEGFELCYNVTRMEKGM-PKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVT 392
Query: 381 PDPNSITLGNVQQRGHEVHYDVAGRRLGF 409
S LGN+ Q+ H + YD+A R+GF
Sbjct: 393 TTNGSNILGNLLQQDHHIEYDLAKARIGF 421
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 74/264 (28%), Positives = 112/264 (42%), Gaps = 46/264 (17%)
Query: 59 KASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNS--RRLRKPFPEFLKRTE 116
+ SL +G L IS+ +P Q Q+ + N R+R +L
Sbjct: 16 RLSLTKTQNHGFNVELIHPISSRSPFYNPKETQIQRISSILNYSINRVR-----YLNHVF 70
Query: 117 AFTFPANINDT-----VADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRD 171
+F+ P I D + Y + +IG P + L+DTG+D W QCKPC C Q
Sbjct: 71 SFS-PNKIQDVPLSSFMGAGYVMSYSIGTPPFQLYSLIDTGNDNIWFQCKPCKPCLNQTS 129
Query: 172 PFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRIT 231
P F+ SKS T+ IPC S C+ + G + D +T
Sbjct: 130 PMFHPSKSSTYKTIPCTSPICK-------------------------NADGHYLGVDTLT 164
Query: 232 IQEANSNGYFTRYPFLLGCINNSSGDKSG-ASGIMGLDRSPVSIITRTNTSY---FSYCL 287
+ +N+ + ++GC + + G G SG +GL R P+S I++ N+S FSYCL
Sbjct: 165 L-NSNNGTPISFKNIVIGCGHRNQGPLEGYVSGNIGLARGPLSFISQLNSSIGGKFSYCL 223
Query: 288 P---SPYGSTGYITFGKTDTVNTL 308
S + + FG TV+ L
Sbjct: 224 VPLFSKENVSSKLHFGDKSTVSGL 247
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 70/252 (27%), Positives = 102/252 (40%), Gaps = 38/252 (15%)
Query: 54 PQGPDKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRR-LRKPFPEFL 112
P + LE+V ++ G +++ +++D+ R N R + +
Sbjct: 27 PVAVNSMRLELVHRHHERFAGGGGDVDRVEAVKGFVKRDKLRRQRMNQRWGVVSNYDSRR 86
Query: 113 KRTEAFTFPANIN-------DTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIH 165
K E T PA + D EY+ V +G P Q L++DTGS+ TW C
Sbjct: 87 KGFEMTTTPAEVEMPMHSGRDDALGEYFAEVKVGSPGQRFWLVVDTGSEFTWLNC----- 141
Query: 166 CFQQRDPFFYASKSKTFFKIPCNSTSCRI-LRESFPFGNC--NSKECPFNIQYADGSGSG 222
SK+F + C S C++ L E F C S C ++I YADGS +
Sbjct: 142 -------------SKSFEAVTCASRKCKVDLSELFSLSVCPKPSDPCLYDISYADGSSAK 188
Query: 223 GFWATDRITIQEANS-NGYFTRYPFLLGCIN---NSSGDKSGASGIMGLDRSPVSIITRT 278
GF+ TD IT+ N G +GC N GI+GL + S I +
Sbjct: 189 GFFGTDSITVGLTNGKQGKLNN--LTIGCTKSMLNGVNFNEETGGILGLGFAKDSFIDKA 246
Query: 279 NTSY---FSYCL 287
Y FSYCL
Sbjct: 247 ANKYGAKFSYCL 258
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 52/106 (49%), Gaps = 1/106 (0%)
Query: 311 AFHKRMKKYKKAKGLE-DLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASV 369
A K + K K+ G + D L+ C+D ++ VVP++ HF GG E V+ ++ +
Sbjct: 348 ALTKSLTKVKRVTGEDFDALEFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAP 407
Query: 370 SQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
C+G + +GN+ Q+ H +D++ +GF P C+
Sbjct: 408 LVKCIGIVPIDGIGGASVIGNIMQQNHLWEFDLSTNTVGFAPSTCT 453
>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
Length = 557
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 55/184 (29%), Positives = 87/184 (47%), Gaps = 24/184 (13%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNS 189
+YY + IG P + L +DTGSD+TW QC PC +C + P + +K K +P
Sbjct: 186 QYYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKI---VPPRD 242
Query: 190 TSCRILRESFPFGNCN----SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C+ L+ GN N K+C + I+YAD S S G A D + + +NG +
Sbjct: 243 LLCQELQ-----GNQNYCETCKQCDYEIEYADQSSSMGVLARDDMHM--IATNGGREKLD 295
Query: 246 FLLGCINNSSGD----KSGASGIMGLDRSPVSIITRTNT-----SYFSYCLPSPYGSTGY 296
F+ GC + G + GI+GL + +S ++ + + F +C+ G GY
Sbjct: 296 FVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQGGGGY 355
Query: 297 ITFG 300
+ G
Sbjct: 356 MFLG 359
>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 414
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 58/213 (27%), Positives = 95/213 (44%), Gaps = 20/213 (9%)
Query: 102 RRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQC- 160
R + PE +RT A+ NI YY+ + IG P + L +DTGSD+TW QC
Sbjct: 3 RLSKASVPETAQRTAAYPIGGNIYPD--GLYYMAMRIGNPAKLYYLDMDTGSDLTWLQCD 60
Query: 161 KPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNS--KECPFNIQYADG 218
PC C + +++ + C +C ++ F C+ ++C + + Y DG
Sbjct: 61 APCRSCAVGPHGLYDPKRARV---VDCRRPTCAQVQRGGQF-TCSGDVRQCDYEVDYVDG 116
Query: 219 SGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGA----SGIMGLDRSPVSI 274
S + G D IT+ N + TR ++GC + G + A G++GL S +S+
Sbjct: 117 SSTMGILVEDTITLVLTNGTRFQTR--AVIGCGYDQQGTLAKAPAVTDGVIGLSSSKISL 174
Query: 275 ITR-----TNTSYFSYCLPSPYGSTGYITFGKT 302
++ + +CL GY+ FG T
Sbjct: 175 PSQLAAKGIANNVIGHCLAGGSNGGGYLFFGDT 207
>gi|125552105|gb|EAY97814.1| hypothetical protein OsI_19735 [Oryza sativa Indica Group]
Length = 424
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 88/354 (24%), Positives = 128/354 (36%), Gaps = 77/354 (21%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC----------IHCFQQRDPFFYASKSK 180
+Y IG+P Q ++DTGSD+ WTQC C CF Q P++ S S+
Sbjct: 77 QYIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLSR 136
Query: 181 TFFKIPCNSTS---CRILRESFP---FGNCNSKECPFNIQYADGSGSGGFWATDRITIQE 234
T +PC+ C + E+ G C Y G G TD T
Sbjct: 137 TARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAGVAL-GVLGTDAFTFPS 195
Query: 235 ANSNGYFTRYPFLLGCINN---SSGDKSGASGIMGLDRSPVSIITRTNT-SYFSY----- 285
++S GC++ S G +GASGI+GL R +S+ + + S F Y
Sbjct: 196 SSS------VTLAFGCVSQTRISPGALTGASGIIGLGRGALSLNPKDSPFSTFYYLPLVG 249
Query: 286 -------------------CLPSPYGSTGYITFGK--TDTVNTLRSAFHKRMKKYKKAKG 324
P + I G T V+ A K + + + G
Sbjct: 250 LAAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSG 309
Query: 325 ------------LEDLLDTCYDLSAYETVVVPKIAIHF----LGGVDLELDVRGTLVVAS 368
LE ++ D + VP + + F GG +L +
Sbjct: 310 SLVPPPAKLGGALELCVEAGDDGDSLAAAAVPSLVLRFDDGVGGGRELVIPAEKYWARVE 369
Query: 369 VSQVCL-------GFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
S C+ G AT P + +I +GN Q+ V YD+A L F P NCS
Sbjct: 370 ASTWCMAVVSSASGNATLPTNETTI-IGNFMQQDMRVLYDLANGLLSFQPANCS 422
>gi|449467979|ref|XP_004151699.1| PREDICTED: probable aspartic protease At2g35615-like, partial
[Cucumis sativus]
Length = 209
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 49/169 (28%), Positives = 80/169 (47%), Gaps = 21/169 (12%)
Query: 98 LKNSRRLRKPFPEFLKRTEAFTFPANINDTV---------ADEYYIVVAIGEPKQYVSLL 148
L + RL F L R+ A N + + EY + V+IG P +
Sbjct: 49 LSHYDRLTNAFRRSLSRSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGM 108
Query: 149 LDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKE 208
DTGSD+ W QC PC+ C++Q P F KS +F +PCNS +C+ + +S +C ++
Sbjct: 109 ADTGSDLMWAQCLPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAIDDS----HCGAQG 164
Query: 209 -CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSG 256
C ++ Y D + + G ++ITI ++ ++GC + S G
Sbjct: 165 VCDYSYTYGDQTYTKGDLGFEKITIGSSSVKS-------VIGCGHESGG 206
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 76/171 (44%), Gaps = 9/171 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + IG P DTGSD+ W QC PC CF Q P F KS TF C S
Sbjct: 89 EYLMRFYIGTPPVERLATADTGSDLIWVQCSPCASCFPQSTPLFQPLKSSTFMPTTCRSQ 148
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADG-SGSGGFWATD--RITIQEANSNGYFTRYPFL 247
C +L G S EC + +Y D S S G +T+ R Q F F
Sbjct: 149 PCTLLLPEQK-GCGKSGECIYTYKYGDQYSFSEGLLSTETLRFDSQGGVQTVAFPNSFFG 207
Query: 248 LGCINNSSGDKS-GASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGST 294
G NN + S +GIMGL P+S++++ FSYCL P GST
Sbjct: 208 CGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQIGHKFSYCL-LPLGST 257
>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 413
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 58/195 (29%), Positives = 96/195 (49%), Gaps = 21/195 (10%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNS 189
YY+ + IG+P + L +DTGSD+TW QC PC C + P + +K+K +PC +
Sbjct: 51 HYYVTMNIGDPAKPYFLDIDTGSDLTWLQCDAPCQSCNKVPHPLYKPTKNKL---VPCAA 107
Query: 190 TSCRILRES-FPFGNCN-SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
+ C L + P C ++C + I+Y D + S G TD T+ NS+ R F
Sbjct: 108 SICTTLHSAQSPNKKCAVPQQCDYQIKYTDSASSLGVLVTDNFTLPLRNSSS--VRPSFT 165
Query: 248 LGCINNSSGDKSGA-----SGIMGLDRSPVSIITRTNT-----SYFSYCLPSPYGSTGYI 297
GC + K+G G++GL + VS++++ + +CL + G G++
Sbjct: 166 FGCGYDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGHCLSTNGG--GFL 223
Query: 298 TFGKTDTVNTLRSAF 312
FG + V T R+ +
Sbjct: 224 FFGD-NVVPTSRATW 237
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 52/167 (31%), Positives = 81/167 (48%), Gaps = 16/167 (9%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCI---HCFQQRDPFFYASKSKTFFKIPC 187
+Y IG+P Q + L+DTGS++ WTQC C +Q P++ S+S TF +PC
Sbjct: 83 QYIAEYLIGDPPQRAAALIDTGSNLIWTQCGTTCGLKACAKQDLPYYNLSRSSTFAAVPC 142
Query: 188 NSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
++ + C F Y GS G T+ T Q + F
Sbjct: 143 ADSAKLCAANGVHLCGLDG-SCTFAASYGAGSVFGSL-GTEAFTFQSGAAKLGF------ 194
Query: 248 LGCIN---NSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPY 291
GC++ + G +GASG++GL R +S++++T + FSYCL +PY
Sbjct: 195 -GCVSLTRITKGALNGASGLIGLGRGRLSLVSQTGATKFSYCL-TPY 239
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 56/178 (31%), Positives = 90/178 (50%), Gaps = 23/178 (12%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKP---CIHCFQQRDPFFYASKSKTFFKIPC 187
+Y++ + +G P + L++DTGSD+TW QC P + P++ S S ++ +IPC
Sbjct: 26 QYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREIPC 85
Query: 188 NSTSCRILRESFPFG-NCNSKE---CPFNIQYADGSGSGGFWATDRITIQEANSNG---- 239
C L P G +C+ K C + Y+D S + G A + I+++ +G
Sbjct: 86 TDDECLFLPA--PIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKRAG 143
Query: 240 -YFTRYPFL----LGCINNSSGDK-SGASGIMGLDRSPVSIITRTNTS----YFSYCL 287
+ TR + LGC S G GASG++GL + P+S+ T+T + FSYCL
Sbjct: 144 NHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGGIFSYCL 201
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/89 (28%), Positives = 47/89 (52%), Gaps = 1/89 (1%)
Query: 321 KAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYP 380
+A+ + + + CY+++ E + PK+ + F GG +EL +V+ + + C+
Sbjct: 302 RAQEIPEGFELCYNVTRMEKGM-PKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVT 360
Query: 381 PDPNSITLGNVQQRGHEVHYDVAGRRLGF 409
S LGN+ Q+ H + YD+A R+GF
Sbjct: 361 TTNGSNILGNLLQQDHHIEYDLAKARIGF 389
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 57/204 (27%), Positives = 101/204 (49%), Gaps = 31/204 (15%)
Query: 98 LKNSRRLRKPFPEFLKR-----TEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTG 152
+ +S+ +KP LK + +F N+ TV+ + +G P Q V+++LDTG
Sbjct: 27 VSSSQLTQKPLLLPLKTQTQTPSRKLSFHHNVTLTVS------LTVGSPPQNVTMVLDTG 80
Query: 153 SDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSC--RILRESFPFGNC--NSKE 208
S+++W CK + +P +S + T PCNS+ C R + P +C N+K
Sbjct: 81 SELSWLHCKKLPNLNSTFNPLLSSSYTPT----PCNSSICTTRTRDLTIP-ASCDPNNKL 135
Query: 209 CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSS-----GDKSGASG 263
C + YAD S + G A + ++ A G L GC++++ + S +G
Sbjct: 136 CHVIVSYADASSAEGTLAAETFSLAGAAQPGT------LFGCMDSAGYTSDINEDSKTTG 189
Query: 264 IMGLDRSPVSIITRTNTSYFSYCL 287
+MG++R +S++T+ + FSYC+
Sbjct: 190 LMGMNRGSLSLVTQMSLPKFSYCI 213
>gi|222635873|gb|EEE66005.1| hypothetical protein OsJ_21949 [Oryza sativa Japonica Group]
Length = 100
Score = 77.0 bits (188), Expect = 2e-11, Method: Composition-based stats.
Identities = 37/98 (37%), Positives = 54/98 (55%), Gaps = 1/98 (1%)
Query: 317 KKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGF 376
+ Y+KA + LLDTCYD + V +P +++ F GG L++D G + S SQVCL F
Sbjct: 4 RGYRKAAAVS-LLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTVSASQVCLAF 62
Query: 377 ATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
A + +GN Q + V YD+ + +GF PG C
Sbjct: 63 AGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 100
>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 321
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 65/206 (31%), Positives = 98/206 (47%), Gaps = 30/206 (14%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y I V +G P + L +DTGS +W C+ C C F S+S T K+ C ++
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFCE-CDGCHTNPRTFL-QSRSTTCAKVSCGTSM 58
Query: 192 CRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-F 246
C +L S P +C E CPF + Y DGS S G D +T + + P F
Sbjct: 59 C-LLGGSDP--HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPSF 109
Query: 247 LLGCINNSSG--DKSGASGIMGLDRSPVSIITRTNTSY--FSYCLP---SPYG----STG 295
GC +S G + G++G+ P+S++ +++ ++ FSYCLP S G +TG
Sbjct: 110 SFGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTG 169
Query: 296 YITFGKTDTVNTLRSAFHKRMKKYKK 321
Y + GK T +R + +M KK
Sbjct: 170 YFSLGKVATRTDVR---YTKMVARKK 192
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/194 (28%), Positives = 95/194 (48%), Gaps = 23/194 (11%)
Query: 115 TEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFF 174
+ +F N+ TV+ + +G P Q V+++LDTGS+++W CK + +P
Sbjct: 29 SNKLSFHHNVTLTVS------LTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTSVFNPL- 81
Query: 175 YASKSKTFFKIPCNSTSCRILRESFPFG-NCNSKE-CPFNIQYADGSGSGGFWATDRITI 232
S ++ IPC+S CR P C+ K+ C + YAD S G A+D I
Sbjct: 82 ---SSSSYSPIPCSSPVCRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRI 138
Query: 233 QEANSNGYFTRYPFLLGCIN----NSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLP 288
+ G L GC++ ++S + + +G+MG++R +S +T+ FSYC+
Sbjct: 139 GSSALPGT------LFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCI- 191
Query: 289 SPYGSTGYITFGKT 302
S S+G + FG +
Sbjct: 192 SGRDSSGVLLFGDS 205
>gi|388515789|gb|AFK45956.1| unknown [Medicago truncatula]
Length = 225
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/110 (40%), Positives = 57/110 (51%), Gaps = 4/110 (3%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
N R AF + K G+ + DTCYDL+ + TV VP I+ +FLGG L L R L+
Sbjct: 119 NAFRDAFVAQTTNLPKTSGVS-IFDTCYDLNGFVTVRVPTISFYFLGGPILTLPARNFLI 177
Query: 366 -VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V SV C FA P +GN+QQ G E+ D A +GFGP C
Sbjct: 178 PVDSVGTFCFAFA--PSSSGLSIIGNIQQEGIEISVDGANGYIGFGPNIC 225
>gi|4490316|emb|CAB38807.1| nucellin-like protein [Arabidopsis thaliana]
gi|7270297|emb|CAB80066.1| nucellin-like protein [Arabidopsis thaliana]
Length = 420
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 60/216 (27%), Positives = 96/216 (44%), Gaps = 18/216 (8%)
Query: 115 TEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPF 173
+ FP + N Y + + IG+P + L LDTGSD+TW QC PC+ C + P
Sbjct: 21 VSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPL 80
Query: 174 FYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQ 233
+ S IPCN C+ L + ++C + ++YADG S G D ++
Sbjct: 81 YQPSSD----LIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMN 136
Query: 234 EANSNGYFTRYPFLLGCINNSSGDKSG---ASGIMGLDRSPVSIITRTNTSYF-----SY 285
R LGC + S G++GL R VSI+++ ++ + +
Sbjct: 137 YTQGLRLTPR--LALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGH 194
Query: 286 CLPSPYGSTGYITFGKTDTVNTLRSAFHKRMKKYKK 321
CL S G G + FG D ++ R ++ ++Y K
Sbjct: 195 CLSSLGG--GILFFGD-DLYDSSRVSWTPMSREYSK 227
>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
Length = 418
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 59/192 (30%), Positives = 95/192 (49%), Gaps = 21/192 (10%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNS 189
YY+ + IG+P + L +DTGSD+TW QC PC C + P + +K+K +PC +
Sbjct: 56 HYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPTKNKL---VPCAN 112
Query: 190 TSCRILRE-SFPFGNCNS-KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
+ C L S P C + ++C + I+Y D + S G TD ++ N + R
Sbjct: 113 SICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTDSFSLPLRNKSN--VRPSLS 170
Query: 248 LGCINNSSGDKSGAS-----GIMGLDRSPVSIITR-----TNTSYFSYCLPSPYGSTGYI 297
GC + K+GA+ G++GL R VS++++ + +CL + G G++
Sbjct: 171 FGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLSTSGG--GFL 228
Query: 298 TFGKTDTVNTLR 309
FG D V T R
Sbjct: 229 FFGD-DMVPTSR 239
>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
Length = 395
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/182 (30%), Positives = 90/182 (49%), Gaps = 20/182 (10%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
YY+ ++IG P + L +DTGSD+TW QC PC+ C + P + +K+K +PC
Sbjct: 58 YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKL---VPCVDQ 114
Query: 191 SCRILRESFPFGN-CNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C L + C+S ++C + I+YAD S G TD ++ ANS+ R
Sbjct: 115 MCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSS--IVRPGLA 172
Query: 248 LGCINN----SSGDKSGASGIMGLDRSPVSIITR-----TNTSYFSYCLPSPYGSTGYIT 298
GC + SS + S G++GL VS++++ + +CL + G G++
Sbjct: 173 FGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGG--GFLF 230
Query: 299 FG 300
FG
Sbjct: 231 FG 232
>gi|296082634|emb|CBI21639.3| unnamed protein product [Vitis vinifera]
Length = 278
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/144 (32%), Positives = 69/144 (47%), Gaps = 17/144 (11%)
Query: 85 LEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQY 144
L+ +++ + RL +RL F EA N E+ + +AIG P +
Sbjct: 54 LQRAMKRGKLRL-----QRLSAKTASFESSVEAPVHAGN------GEFLMKLAIGTPAET 102
Query: 145 VSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPC------NSTSCRILRES 198
S ++DTGSD+ WTQCKPC CF Q P F KS +F K+PC +ST + E+
Sbjct: 103 YSAIMDTGSDLIWTQCKPCKDCFDQPTPIFDPKKSSSFSKLPCSSDLYYSSTQGVLATET 162
Query: 199 FPFGNCNSKECPFNIQYADGSGSG 222
F FG+ + + F + SG
Sbjct: 163 FAFGDASVSKIGFGCGEDNDGNSG 186
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 89/359 (24%), Positives = 134/359 (37%), Gaps = 75/359 (20%)
Query: 120 FPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC--IHCFQQRDPFFYAS 177
P ++D+ Y + ++G P Q ++ L DTGSD+ W +C C Q P + +
Sbjct: 80 IPLRMDDS-GGAYDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPN 138
Query: 178 KSKTFFKIPCNSTSCRILR-ESFPFGNCNSKECPFNIQYADGSG----SGGFWATDRITI 232
S TF K+PC+ C +LR +S + EC + Y G + GF A + T+
Sbjct: 139 ASSTFAKLPCSDRLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTL 198
Query: 233 QEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL----- 287
G GC S G SG++GL R P+S++++ N S F YCL
Sbjct: 199 ------GADAVPSVRFGCTTASEGGYGSGSGLVGLGRGPLSLVSQLNASTFMYCLTSDAS 252
Query: 288 ---PSPYGSTGYITFGKTDTVNTLRSAFHKRMK--------------------------- 317
P +GS +T + + L S +
Sbjct: 253 KASPLLFGSLASLTGAQVQSTGLLASTTFYAVNLRSISIGSATTPGVGEPEGVVFDSGTT 312
Query: 318 -------KYKKAKGL---EDLLDTCYDLSAYET------------VVVPKIAIHFLGGVD 355
Y +AK + LD D +E VP + +HF G D
Sbjct: 313 LTYLAEPAYSEAKAAFLSQTSLDQVEDTDGFEACFQKPANGRLSNAAVPTMVLHF-DGAD 371
Query: 356 LELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ L V +V VC P+ +GN+ Q + V +DV L F P NC
Sbjct: 372 MALPVANYVVEVEDGVVCW---IVQRSPSLSIIGNIMQVNYLVLHDVHRSVLSFQPANC 427
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 60/203 (29%), Positives = 94/203 (46%), Gaps = 10/203 (4%)
Query: 113 KRTEAFTFPANINDTVA---DEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQ 169
+R A A + VA EY + + +G P + +++DTGSD+ W QC PC+ CF+Q
Sbjct: 130 RRALAERIVATVESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQ 189
Query: 170 RDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNC---NSKECPFNIQYADGSGSGGFWA 226
R P F + S ++ + C C ++ C +S CP+ Y D S + G A
Sbjct: 190 RGPVFDPAASLSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLA 249
Query: 227 TDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---F 283
+ T+ + GC +++ G GA+G++GL R +S ++ Y F
Sbjct: 250 LEAFTVNLTAPGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAF 309
Query: 284 SYCLPSPYGSTG-YITFGKTDTV 305
SYCL S G I FG D +
Sbjct: 310 SYCLVDHGSSVGSKIVFGDDDAL 332
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 49/111 (44%), Gaps = 2/111 (1%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+R AF +RM K +L CY++S E V VP+ ++ F G + V
Sbjct: 403 EVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVEVPEFSLLFADGAVWDFPAENYFV 462
Query: 366 VASVSQV-CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ CL P SI +GN QQ+ V YD+ RLGF P C+
Sbjct: 463 RLDPDGIMCLAVLGTPRSAMSI-IGNFQQQNFHVLYDLQNNRLGFAPRRCA 512
>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
Length = 421
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/182 (30%), Positives = 90/182 (49%), Gaps = 20/182 (10%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
YY+ ++IG P + L +DTGSD+TW QC PC+ C + P + +K+K +PC
Sbjct: 58 YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKL---VPCVDQ 114
Query: 191 SCRILRESFPFGN-CNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C L + C+S ++C + I+YAD S G TD ++ ANS+ R
Sbjct: 115 MCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSS--IVRPGLA 172
Query: 248 LGCINN----SSGDKSGASGIMGLDRSPVSIITR-----TNTSYFSYCLPSPYGSTGYIT 298
GC + SS + S G++GL VS++++ + +CL + G G++
Sbjct: 173 FGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGG--GFLF 230
Query: 299 FG 300
FG
Sbjct: 231 FG 232
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/208 (27%), Positives = 91/208 (43%), Gaps = 21/208 (10%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFFKIP 186
YY V +G P ++ +DTGSDV W C C C Q + FF S T I
Sbjct: 25 YYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMIA 84
Query: 187 CNSTSCRILRESFPFGNCNSK--ECPFNIQYADGSGSGGFWATDRI---TIQEANSNGYF 241
C+ C +S C+S+ +C + QY DGSG+ G++ +D + TI E +
Sbjct: 85 CSDQRCNNGIQSSD-ATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNS 143
Query: 242 TRYPFLLGCINNSSGDKS----GASGIMGLDRSPVSIITRTNTS-----YFSYCLPSPYG 292
T P + GC N +GD + GI G + +S+I++ ++ FS+CL
Sbjct: 144 TA-PVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGDSS 202
Query: 293 STGYITFGKTDTVNTLRSAFHKRMKKYK 320
G + G+ N + ++ Y
Sbjct: 203 GGGILVLGEIVEPNIVYTSLVPAQPHYN 230
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/182 (31%), Positives = 88/182 (48%), Gaps = 8/182 (4%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + V +G P + +++DTGSD+ W QC PC+ CF QR P F S ++ + C T
Sbjct: 149 EYLVEVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFDQRGPVFDPMASTSYRNVTCGDT 208
Query: 191 SCRILRESFPFGNCNSKE---CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C ++ C S CP+ Y D S + G A + T+ S+ +
Sbjct: 209 RCGLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTASSSRRVD-GVV 267
Query: 248 LGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTG-YITFGKTD 303
LGC + + G GA+G++GL R P+S ++ Y FSYCL + G I FG +
Sbjct: 268 LGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHAFSYCLVDHGSAVGSKIVFGDDN 327
Query: 304 TV 305
+
Sbjct: 328 VL 329
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 48/111 (43%), Gaps = 2/111 (1%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+R AF RM K +L CY++S E V VP+ ++ F G + +
Sbjct: 399 KAIRQAFVDRMDKAYPLIADFPVLSPCYNVSGVERVEVPEFSLLFADGAVWDFPAENYFI 458
Query: 366 VASVSQV-CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ CL P SI +GN QQ+ V YD+ RLGF P C+
Sbjct: 459 RLDTEGIMCLAVLGTPRSAMSI-IGNYQQQNFHVLYDLHHNRLGFAPRRCA 508
>gi|125595845|gb|EAZ35625.1| hypothetical protein OsJ_19916 [Oryza sativa Japonica Group]
Length = 152
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/141 (32%), Positives = 73/141 (51%), Gaps = 11/141 (7%)
Query: 146 SLLLDTGSDVTWTQCKPCIH--CFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGN 203
+++LDT SDVTW QC PC C+ Q+D + +KS + CNS +C L + G
Sbjct: 18 TMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGP-YANGC 76
Query: 204 CNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD---KSG 260
N+ +C + ++Y DG+ + G + +D +TI A + F GC G S
Sbjct: 77 TNNNQCQYRVRYPDGTSTAGTYISDLLTITPATA-----VRSFQFGCSKGVKGSFSFGSS 131
Query: 261 ASGIMGLDRSPVSIITRTNTS 281
A+GIM L P S++++T +
Sbjct: 132 AAGIMALGGGPESLVSQTGAT 152
>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 64/207 (30%), Positives = 98/207 (47%), Gaps = 27/207 (13%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y I V +G P + + +DTGS TW C+ C C F S+S T K+ C ++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTTWVFCE-CDGCHTNPRTFL-QSRSTTCAKVSCGTSM 58
Query: 192 CRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C +L S P +C E CPF + Y DGS S G D +T + FT
Sbjct: 59 C-LLGGSDP--HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----- 110
Query: 248 LGCINNSSG--DKSGASGIMGLDRSPVSIITRTNTSY--FSYCLP---SPYG----STGY 296
GC +S G + G++G+ P+S++ +++ ++ FSYCLP S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAK 323
+ GK T +R + K + + K +
Sbjct: 171 FSLGKVATRTDVR--YTKMVARRKNTE 195
>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 421
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/182 (30%), Positives = 90/182 (49%), Gaps = 20/182 (10%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
YY+ ++IG P + L +DTGSD+TW QC PC+ C + P + +K+K +PC
Sbjct: 58 YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKL---VPCVDQ 114
Query: 191 SCRILRESFPFGN-CNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C L + C+S ++C + I+YAD S G TD ++ ANS+ R
Sbjct: 115 MCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSS--IVRPGLA 172
Query: 248 LGCINN----SSGDKSGASGIMGLDRSPVSIITR-----TNTSYFSYCLPSPYGSTGYIT 298
GC + SS + S G++GL VS++++ + +CL + G G++
Sbjct: 173 FGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGG--GFLF 230
Query: 299 FG 300
FG
Sbjct: 231 FG 232
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 60/203 (29%), Positives = 94/203 (46%), Gaps = 10/203 (4%)
Query: 113 KRTEAFTFPANINDTVA---DEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQ 169
+R A A + VA EY + + +G P + +++DTGSD+ W QC PC+ CF+Q
Sbjct: 130 RRALAERIVATVESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQ 189
Query: 170 RDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNC---NSKECPFNIQYADGSGSGGFWA 226
R P F + S ++ + C C ++ C +S CP+ Y D S + G A
Sbjct: 190 RGPVFDPATSLSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLA 249
Query: 227 TDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---F 283
+ T+ + GC +++ G GA+G++GL R +S ++ Y F
Sbjct: 250 LEAFTVNLTAPGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAF 309
Query: 284 SYCLPSPYGSTG-YITFGKTDTV 305
SYCL S G I FG D +
Sbjct: 310 SYCLVDHGSSVGSKIVFGDDDAL 332
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 49/111 (44%), Gaps = 2/111 (1%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+R AF +RM K +L CY++S E V VP+ ++ F G + V
Sbjct: 403 EVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVEVPEFSLLFADGAVWDFPAENYFV 462
Query: 366 VASVSQV-CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ CL P SI +GN QQ+ V YD+ RLGF P C+
Sbjct: 463 RLDPDGIMCLAVLGTPRSAMSI-IGNFQQQNFHVLYDLQNNRLGFAPRRCA 512
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 71/265 (26%), Positives = 113/265 (42%), Gaps = 22/265 (8%)
Query: 73 RLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRK-----PFPEFLKRTEAFTFPANINDT 127
R +G T SL ++ +D R+ R R P +R + A +
Sbjct: 84 RAAEGGRTREESLLDLAEKDAVRIETMYRRAARSGGGRMPASSSPRRALSERMVATVESG 143
Query: 128 VA---DEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFK 184
VA EY + V +G P + +++DTGSD+ W QC PC+ CF+QR P F + S ++
Sbjct: 144 VAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRN 203
Query: 185 IPCNSTSCRILRESFPFGNCNSKE--------CPFNIQYADGSGSGGFWATDRITIQEAN 236
+ C C + + + CP+ Y D S + G A + T+
Sbjct: 204 VTCGDHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFTVNLTA 263
Query: 237 SNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGS 293
+ GC + + G GA+G++GL R P+S ++ Y FSYCL
Sbjct: 264 PGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVDHGSD 323
Query: 294 TG-YITFGKTDTVNTLRSAFHKRMK 317
G + FG+ D + L A H ++K
Sbjct: 324 VGSKVVFGEDD--DALALAAHPQLK 346
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 49/113 (43%), Gaps = 4/113 (3%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+R AF RM + +L CY++S E VP++++ F G + +
Sbjct: 412 QVIRHAFMDRMSRSYPLVPEFPVLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFI 471
Query: 366 VASV---SQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
S +CL P SI +GN QQ+ V YD+ RLGF P C+
Sbjct: 472 RLDPDGGSIMCLAVLGTPRTGMSI-IGNFQQQNFHVVYDLQNNRLGFAPRRCA 523
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/192 (29%), Positives = 94/192 (48%), Gaps = 23/192 (11%)
Query: 115 TEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFF 174
+ +F N+ TV+ + +G P Q V+++LDTGS+++W CK + +P
Sbjct: 989 SNKLSFHHNVTLTVS------LTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTSVFNPL- 1041
Query: 175 YASKSKTFFKIPCNSTSCRILRESFPFG-NCNSKE-CPFNIQYADGSGSGGFWATDRITI 232
S ++ IPC+S CR P C+ K+ C + YAD S G A+D I
Sbjct: 1042 ---SSSSYSPIPCSSPICRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRI 1098
Query: 233 QEANSNGYFTRYPFLLGCIN----NSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLP 288
+ G L GC++ ++S + + +G+MG++R +S +T+ FSYC+
Sbjct: 1099 GSSALPGT------LFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFSYCI- 1151
Query: 289 SPYGSTGYITFG 300
S S+G + FG
Sbjct: 1152 SGRDSSGVLLFG 1163
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/144 (32%), Positives = 69/144 (47%), Gaps = 14/144 (9%)
Query: 285 YCLPSPYGSTGYI-------TFGKTDTVNTLRSAFHKRMKKYKKAKGLEDL--LDTCYDL 335
Y +PS G G I TF + + + K M KYK+AK +ED L CY++
Sbjct: 299 YLVPSKEGDGGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLGPCYNI 358
Query: 336 SAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNS-----ITLGN 390
+ V VPK+ F GG +++L + + + VC+ T P +P S I LGN
Sbjct: 359 TGQSEVDVPKLTFSFKGGANMDLPLTDYFSLVTDGVVCMTVLTDPDEPGSTTGPAIILGN 418
Query: 391 VQQRGHEVHYDVAGRRLGFGPGNC 414
QQ+ + YD+ +R GF P C
Sbjct: 419 YQQQNFYIEYDLKKQRFGFKPQQC 442
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 65/209 (31%), Positives = 94/209 (44%), Gaps = 38/209 (18%)
Query: 98 LKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTW 157
L +R L++P P + +P + Y ++ ++G P Q VSL+LDTGS + W
Sbjct: 46 LSRARHLKRP-PTLTGKVTLPAYPRSYGG-----YSVIFSLGTPPQKVSLVLDTGSSLVW 99
Query: 158 TQCK------PCIHC-FQQRD----PFFYASKSKTFFKIPCNSTSCRILRESFPFG---N 203
T C C +C F D P + +KS T +PC S C + FG N
Sbjct: 100 TPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSSTVQSLPCRSPKCNWV-----FGSDLN 154
Query: 204 CN-SKECP-FNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLGCINNSSGDKSG 260
C+ +K CP + ++Y GS +G +D + + + N R P FL GC S
Sbjct: 155 CSTTKRCPYYGLEYGLGSTTGQL-VSDVLGLSKLN------RIPDFLFGC---SLVSNRQ 204
Query: 261 ASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
GI G R SI + + FSYCL S
Sbjct: 205 PEGIAGFGRGLASIPAQLGLTKFSYCLVS 233
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 55/208 (26%), Positives = 89/208 (42%), Gaps = 30/208 (14%)
Query: 118 FTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC-----FQQRDP 172
F + N + Y+ V +G P + + +DTGSD+ W C PC C +
Sbjct: 77 FPVEGSANPFMVGLYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLE 136
Query: 173 FFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKE---CPFNIQYADGSGSGGFWATDR 229
FF S T KIPC+ C ++ C + + C + Y DGSG+ G++ +D
Sbjct: 137 FFNPDTSSTSSKIPCSDDRCTAALQTSE-AVCQTSDNSPCGYTFTYGDGSGTSGYYVSDT 195
Query: 230 ITI-------QEANSNGYFTRYPFLLGCINNSSGDKS----GASGIMGLDRSPVSIITRT 278
+ Q ANS+ + GC N+ SGD + GI G + +S++++
Sbjct: 196 MYFDTVMGNEQTANSSA-----SIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQL 250
Query: 279 NT-----SYFSYCLPSPYGSTGYITFGK 301
N+ FS+CL G + G+
Sbjct: 251 NSLGVSPKVFSHCLKGSDNGGGILVLGE 278
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 58/208 (27%), Positives = 95/208 (45%), Gaps = 19/208 (9%)
Query: 136 VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSC--R 193
+ IG P Q ++LDTGS ++W QC F S S TF +PC C R
Sbjct: 101 LPIGTPPQVQPMVLDTGSQLSWIQCHKKAPAKPPPTASFDPSLSSTFSTLPCTHPVCKPR 160
Query: 194 ILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINN 253
I + P ++ C ++ YADG+ + G ++ T S FT P +LGC
Sbjct: 161 IPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTF----SRSLFTP-PLILGCATE 215
Query: 254 SSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSP-----YGSTGYITFGKTDTVNTL 308
S+ + GI+G++R +S +++ + FSYC+P+ Y TG G NT
Sbjct: 216 STDPR----GILGMNRGRLSFASQSKITKFSYCVPTRVTRPGYTPTGSFYLGHNPNSNTF 271
Query: 309 RSAFHKRMKKYKKAKGLEDLLDTCYDLS 336
R + M + +++ + +L Y ++
Sbjct: 272 R---YIEMLTFARSQRMPNLDPLAYTVA 296
Score = 42.7 bits (99), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 31/107 (28%), Positives = 47/107 (43%), Gaps = 5/107 (4%)
Query: 311 AFHKRMKKYKKAKGLEDLLDTCYDLSAYET-VVVPKIAIHFLGGVDLELDVRGTLVVASV 369
A RMKK G+ D+ C+D +A E ++ + F GV + + L
Sbjct: 350 AVGPRMKKGYVYGGVADM---CFDGNAIEIGRLIGDMVFEFEKGVQIVVPKERVLATVEG 406
Query: 370 SQVCLGFATYPP-DPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
C+G A S +GN Q+ V +D+ RR+GFG +CS
Sbjct: 407 GVHCIGIANSDKLGAASNIIGNFHQQNLWVEFDLVNRRMGFGTADCS 453
>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 430
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 55/174 (31%), Positives = 87/174 (50%), Gaps = 22/174 (12%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDP-----FFYASKSKTFFKIPCN 188
I + IG P Q ++LDTGS ++W QC +++ P F S S +F +PC+
Sbjct: 74 ISLPIGTPPQAQQMVLDTGSQLSWIQCH------RKKLPPKPKTSFDPSLSSSFSTLPCS 127
Query: 189 STSC--RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPF 246
C RI + P +++ C ++ YADG+ + G ++IT SN T P
Sbjct: 128 HPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITF----SNTEITP-PL 182
Query: 247 LLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFG 300
+LGC SS D+ GI+G++R +S +++ S FSYC+P G+ G
Sbjct: 183 ILGCATESSDDR----GILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTPTG 232
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 79/165 (47%), Gaps = 14/165 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNS 189
Y + +G P +++DTGS +TW QC PC + C +Q P F S T+ + C++
Sbjct: 121 NYVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSA 180
Query: 190 TSCRILRESF--PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-F 246
C L + P +S C + Y D S S G+ + D ++ T P F
Sbjct: 181 QQCSDLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGS-------TSLPNF 233
Query: 247 LLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLP 288
GC ++ G ++G++GL R+ +S++ + S F+YCLP
Sbjct: 234 YYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLP 278
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 59/129 (45%), Gaps = 5/129 (3%)
Query: 287 LPSPYGSTGYITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKI 346
LP+ S IT T + L A MK +A +LDTC+ A V P +
Sbjct: 335 LPTIIDSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYS-ILDTCFKGQASR-VSAPAV 392
Query: 347 AIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRR 406
+ F GG L+L + LV S CL FA P ++ +GN QQ+ V YDV R
Sbjct: 393 TMSFAGGAALKLSAQNLLVDVDDSTTCLAFA---PARSAAIIGNTQQQTFSVVYDVKSSR 449
Query: 407 LGFGPGNCS 415
+GF G CS
Sbjct: 450 IGFAAGGCS 458
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 52/203 (25%), Positives = 86/203 (42%), Gaps = 20/203 (9%)
Query: 118 FTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC-----FQQRDP 172
F + N + Y+ V +G P + + +DTGSD+ W C PC C +
Sbjct: 77 FPVEGSANPFMVGLYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLE 136
Query: 173 FFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKE---CPFNIQYADGSGSGGFWATDR 229
FF S T KIPC+ C ++ C + + C + Y DGSG+ G++ +D
Sbjct: 137 FFNPDTSSTSSKIPCSDDRCTAALQTSE-AVCQTSDNSPCGYTFTYGDGSGTSGYYVSDT 195
Query: 230 ITIQE--ANSNGYFTRYPFLLGCINNSSGDKS----GASGIMGLDRSPVSIITRTNT--- 280
+ N + + GC N+ SGD + GI G + +S++++ N+
Sbjct: 196 MYFDSVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGV 255
Query: 281 --SYFSYCLPSPYGSTGYITFGK 301
FS+CL G + G+
Sbjct: 256 SPKVFSHCLKGSDNGGGILVLGE 278
>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 423
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 54/181 (29%), Positives = 90/181 (49%), Gaps = 18/181 (9%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
YY+ + +G P + L +DTGSD+TW QC PC +C + K+K + C+
Sbjct: 40 YYMALLLGSPPKLYFLDMDTGSDLTWAQCDAPCRNCAIGPHGLYNPKKAKV---VDCHLP 96
Query: 191 SCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
C +++ + CNS K+C + ++YADGS + G D +T++ +NG + ++
Sbjct: 97 VCAQIQQGGSY-ECNSDVKQCDYEVEYADGSSTMGVLVEDTLTVRL--TNGTLIQTKAII 153
Query: 249 GCINNSSGD--KSGAS--GIMGLDRSPVSIITRTN-----TSYFSYCLPSPYGSTGYITF 299
GC + G KS AS G++GL S V++ + + +CL GY+ F
Sbjct: 154 GCGYDQQGTLAKSPASTDGVIGLSSSKVALPAQLAEKGIIKNVLGHCLADGSNGGGYLFF 213
Query: 300 G 300
G
Sbjct: 214 G 214
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 83/362 (22%), Positives = 141/362 (38%), Gaps = 82/362 (22%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFFKIP 186
YY V +G P + ++ +DTGSDV W C C C Q + FF S + +
Sbjct: 84 YYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVS 143
Query: 187 CNSTSC--RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSN--GYFT 242
C+ C ES G + C ++ +Y DGSG+ G++ +D ++ ++ +
Sbjct: 144 CSDRRCYSNFQTES---GCSPNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINS 200
Query: 243 RYPFLLGCINNSSGD----KSGASGIMGLDRSPVSIITRTNTS-----YFSYCLPSPYGS 293
PF+ GC N SGD + GI GL + +S+I++ FS+CL
Sbjct: 201 SAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSG 260
Query: 294 TGYITFGKTDTVNTLRSAFHKRMKKYKK---------------------AKGLEDLLDT- 331
G + G+ +T+ + Y A G ++DT
Sbjct: 261 GGIMVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDTG 320
Query: 332 -----------------------------------CYDLSAYETVVVPKIAIHFLGGVDL 356
C++++A + V P++++ F GG +
Sbjct: 321 TTLAYLPDEAYSPFIQAVANAVSQYGRPITYESYQCFEITAGDVDVFPQVSLSFAGGASM 380
Query: 357 ELDVRGTLVVASVSQV---CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
L R L + S S C+GF +I LG++ + V YD+ +R+G+ +
Sbjct: 381 VLGPRAYLQIFSSSGSSIWCIGFQRMSHRRITI-LGDLVLKDKVVVYDLVRQRIGWAEYD 439
Query: 414 CS 415
CS
Sbjct: 440 CS 441
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 53/175 (30%), Positives = 81/175 (46%), Gaps = 14/175 (8%)
Query: 126 DTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQ-RDPFFYASKSKTFFK 184
T + +Y++ + +G P Q + L+ DTGSD+ W +C C +C F S +F
Sbjct: 82 STGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSP 141
Query: 185 IPCNSTSCRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQ-----EA 235
C CR+L + P CN C F YADGS S GF++ + T++ E
Sbjct: 142 FHCFDPHCRLLPHA-PHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSGSEI 200
Query: 236 NSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
+ G F + + S +GA G+MGL R +S ++ + FSYCL
Sbjct: 201 HLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYCL 255
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 55/208 (26%), Positives = 89/208 (42%), Gaps = 30/208 (14%)
Query: 118 FTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC-----FQQRDP 172
F + N + Y+ V +G P + + +DTGSD+ W C PC C +
Sbjct: 77 FPVEGSANPFMVGLYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLE 136
Query: 173 FFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKE---CPFNIQYADGSGSGGFWATDR 229
FF S T KIPC+ C ++ C + + C + Y DGSG+ G++ +D
Sbjct: 137 FFNPDTSSTSSKIPCSDDRCTAALQTSE-AVCQTSDNSPCGYTFTYGDGSGTSGYYVSDT 195
Query: 230 ITI-------QEANSNGYFTRYPFLLGCINNSSGDKS----GASGIMGLDRSPVSIITRT 278
+ Q ANS+ + GC N+ SGD + GI G + +S++++
Sbjct: 196 MYFDTVMGNEQTANSSA-----SIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQL 250
Query: 279 NT-----SYFSYCLPSPYGSTGYITFGK 301
N+ FS+CL G + G+
Sbjct: 251 NSLGVSPKVFSHCLKGSDNGGGILVLGE 278
>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
Length = 430
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 55/174 (31%), Positives = 87/174 (50%), Gaps = 22/174 (12%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDP-----FFYASKSKTFFKIPCN 188
I + IG P Q ++LDTGS ++W QC +++ P F S S +F +PC+
Sbjct: 74 ISLPIGTPPQAQQMVLDTGSQLSWIQCH------RKKLPPKPKTSFDPSLSSSFSTLPCS 127
Query: 189 STSC--RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPF 246
C RI + P +++ C ++ YADG+ + G ++IT SN T P
Sbjct: 128 HPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITF----SNTEITP-PL 182
Query: 247 LLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFG 300
+LGC SS D+ GI+G++R +S +++ S FSYC+P G+ G
Sbjct: 183 ILGCATESSDDR----GILGMNRGRLSFVSQAKISKFSYCIPPKSNRPGFTPTG 232
>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 451
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 56/182 (30%), Positives = 90/182 (49%), Gaps = 20/182 (10%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
YY+ ++IG P + L +DTGSD+TW QC PC+ C + P + +K+K +PC
Sbjct: 58 YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKL---VPCVDQ 114
Query: 191 SCRILRESFPFGN-CNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C L + C+S ++C + I+YAD S G TD ++ ANS+ R
Sbjct: 115 MCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSS--IVRPGLA 172
Query: 248 LGCINN----SSGDKSGASGIMGLDRSPVSIITR-----TNTSYFSYCLPSPYGSTGYIT 298
GC + SS + S G++GL VS++++ + +CL + G G++
Sbjct: 173 FGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRGG--GFLF 230
Query: 299 FG 300
FG
Sbjct: 231 FG 232
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 64/206 (31%), Positives = 98/206 (47%), Gaps = 30/206 (14%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y I V +G P + + +DTGS +W C+ C C F S+S T K+ C ++
Sbjct: 82 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFL-QSRSTTCAKVSCGTSM 139
Query: 192 CRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-F 246
C +L S P +C E CPF + Y DGS S G D +T + + P F
Sbjct: 140 C-LLGGSDP--HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPGF 190
Query: 247 LLGCINNSSG--DKSGASGIMGLDRSPVSIITRTNTSY--FSYCLP---SPYG----STG 295
GC +S G + G++G+ P+S++ +++ ++ FSYCLP S G +TG
Sbjct: 191 SFGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTG 250
Query: 296 YITFGKTDTVNTLRSAFHKRMKKYKK 321
Y + GK T +R + +M KK
Sbjct: 251 YFSLGKVATRTDVR---YTKMVARKK 273
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 56/111 (50%), Gaps = 9/111 (8%)
Query: 310 SAFHKRMKKY--KKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
S +R+++ K+ E+ CYD+ + + +P I++HF G +L G V
Sbjct: 321 SVLSQRIRELLLKRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARFDLGSHGVFVER 380
Query: 368 SVSQV---CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGP-GNC 414
SV + CL FA P + SI +G++ Q EV YD+ + +G GP G C
Sbjct: 381 SVQEQDVWCLAFA--PTESVSI-IGSLMQTSKEVVYDLKRQLIGIGPSGAC 428
>gi|224164381|ref|XP_002338678.1| predicted protein [Populus trichocarpa]
gi|222873177|gb|EEF10308.1| predicted protein [Populus trichocarpa]
Length = 102
Score = 75.9 bits (185), Expect = 4e-11, Method: Composition-based stats.
Identities = 41/102 (40%), Positives = 58/102 (56%), Gaps = 4/102 (3%)
Query: 316 MKKYKKAKGLEDLLDTCYDLS--AYETVVVPKIAIHFLGGVDLELDVRGTLVVAS-VSQV 372
M Y KG L CYD S A + + +P+I+I F GGV++++D G + A+ + +V
Sbjct: 2 MTNYTLTKGTSGL-QPCYDFSKHANDNITIPQISIFFEGGVEVDIDDSGIFIAANGLEEV 60
Query: 373 CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
CL F D + GNVQQ+ +EV YDVA +GF PG C
Sbjct: 61 CLAFKDNGNDTDVAIFGNVQQKTYEVVYDVAKGMVGFAPGGC 102
>gi|361067981|gb|AEW08302.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156226|gb|AFG60348.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156228|gb|AFG60350.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156229|gb|AFG60351.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156230|gb|AFG60352.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156231|gb|AFG60353.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156232|gb|AFG60354.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156233|gb|AFG60355.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156235|gb|AFG60357.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156237|gb|AFG60359.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156238|gb|AFG60360.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156240|gb|AFG60362.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156241|gb|AFG60363.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
Length = 154
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 65/133 (48%), Gaps = 9/133 (6%)
Query: 68 YGPCSRLNQGISTHAPSL-EEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANIND 126
+G CS L S+ L + L +D RL SR P+ T P +
Sbjct: 12 HGACSPLRPANSSKWIDLVSQSLERDNDRLKTIRSRN-SGPY------TTMSNLPLQSGN 64
Query: 127 TVADEYYIVVA-IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKI 185
V YIV A G P + L++DTGSD+TW QCKPC+ C+ Q DP F S+S ++ +
Sbjct: 65 KVGTGNYIVTAGFGTPTKKFLLIIDTGSDLTWIQCKPCLGCYSQVDPIFEPSQSSSYKSL 124
Query: 186 PCNSTSCRILRES 198
PC S +C L S
Sbjct: 125 PCLSATCTELLTS 137
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 63/231 (27%), Positives = 101/231 (43%), Gaps = 45/231 (19%)
Query: 81 HAPS---LEEIL---RQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYI 134
H PS LE I+ R+D RL +S+ T + P + Y +
Sbjct: 30 HPPSSSPLESIIALAREDDARLLFLSSKA---------ASTGVSSAPVASGQS-PPSYVV 79
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
+G P Q + L LDT +D TW C PC C F + S ++ +PC+ST C +
Sbjct: 80 RAGLGSPAQPILLALDTSADATWAHCSPCGTC-PSSGSLFAPANSTSYAPLPCSSTMCTV 138
Query: 195 LRESFPFGNCNSKE----------CPFNIQYADGSGSGGFWATDRITI-QEANSNGYFTR 243
L+ C +++ C F +AD S A+D + + ++A N
Sbjct: 139 LQGQ----PCPAQDPYDSSAPLPMCAFTKPFADASFQASL-ASDWLHLGKDAIPN----- 188
Query: 244 YPFLLGCINNSSGDKSG--ASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
+ GC++ SG + G++GL R P++++++ Y FSYCLPS
Sbjct: 189 --YAFGCVSAVSGPTANLPKQGLLGLGRGPMALLSQVGNMYNGVFSYCLPS 237
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/111 (31%), Positives = 55/111 (49%), Gaps = 4/111 (3%)
Query: 307 TLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVV 366
LR F + + L DTC++ V P + +H GG+DL L + TL+
Sbjct: 322 ALREEFRRHVAAPSGYTSL-GAFDTCFNTDEVAAGVAPAVTVHMDGGLDLALPMENTLIH 380
Query: 367 ASVSQV-CLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+S + + CL A P + N++ L N+QQ+ V +DVA R+GF +C
Sbjct: 381 SSATPLACLAMAEAPQNVNAVVNVLANLQQQNLRVVFDVANSRVGFARESC 431
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 49/163 (30%), Positives = 84/163 (51%), Gaps = 20/163 (12%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSC- 192
I + IG P Q V+++LDTGS+++W CK + +P +S + T PCNS+ C
Sbjct: 61 ISLTIGSPPQNVTMVLDTGSELSWLHCKKLPNLNSTFNPLLSSSYTPT----PCNSSVCM 116
Query: 193 -RILRESFPFGNC--NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
R + P +C N+K C + YAD S + G A + ++ A G L G
Sbjct: 117 TRTRDLTIP-ASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQPGT------LFG 169
Query: 250 CINNSS-----GDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
C++++ + + +G+MG++R +S++T+ FSYC+
Sbjct: 170 CMDSAGYTSDINEDAKTTGLMGMNRGSLSLVTQMVLPKFSYCI 212
>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
Length = 321
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 64/205 (31%), Positives = 97/205 (47%), Gaps = 28/205 (13%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y I V +G P + + +DTGS +W C+ C C F S+S T K+ C ++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFL-QSRSTTCAKVSCGTSM 58
Query: 192 CRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C +L S P +C E CPF + Y DGS S G D +T + FT
Sbjct: 59 C-LLGGSDP--HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFT----- 110
Query: 248 LGCINNSSG--DKSGASGIMGLDRSPVSIITRTNTSY--FSYCLP---SPYG----STGY 296
GC +S G + G++G+ P+S++ +++ ++ FSYCLP S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGY 170
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKK 321
+ GK T +R + +M KK
Sbjct: 171 FSLGKVATRTDVR---YTKMVARKK 192
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 64/229 (27%), Positives = 99/229 (43%), Gaps = 14/229 (6%)
Query: 73 RLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRK-----PFPEFLKRTEAFTFPANINDT 127
R +G T S + +D R+ + R R P +R + A +
Sbjct: 82 RSAEGGRTRKESFLDKAEKDAVRIETMHRRAARSGVARMPASSSPRRALSERMVATVESG 141
Query: 128 VA---DEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFK 184
VA EY I V +G P + +++DTGSD+ W QC PC+ CF+QR P F + S ++
Sbjct: 142 VAVGSGEYLIDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRN 201
Query: 185 IPCNSTSCRILRESFPFGNCN---SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
+ C C ++ C CP+ Y D S + G A + T+
Sbjct: 202 VTCGDQRCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASR 261
Query: 242 TRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCL 287
+ GC + + G GA+G++GL R P+S ++ Y FSYCL
Sbjct: 262 RVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCL 310
Score = 45.8 bits (107), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 48/111 (43%), Gaps = 2/111 (1%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+R AF M + +L+ CY++S E VP++++ F G + V
Sbjct: 399 QVIRQAFVDLMSRLYPLIPDFPVLNPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFV 458
Query: 366 VASVSQV-CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ CL P SI +GN QQ+ V YD+ RLGF P C+
Sbjct: 459 RLDPDGIMCLAVRGTPRTGMSI-IGNFQQQNFHVVYDLQNNRLGFAPRRCA 508
>gi|383156225|gb|AFG60347.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156227|gb|AFG60349.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
Length = 154
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/133 (36%), Positives = 65/133 (48%), Gaps = 9/133 (6%)
Query: 68 YGPCSRLNQGISTHAPSL-EEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANIND 126
+G CS L S+ L + L +D RL SR P+ T P +
Sbjct: 12 HGACSPLRPANSSKWIDLISQSLERDNDRLKTIRSRN-SGPY------TTMSNLPLQSGN 64
Query: 127 TVADEYYIVVA-IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKI 185
V YIV A G P + L++DTGSD+TW QCKPC+ C+ Q DP F S+S ++ +
Sbjct: 65 KVGTGNYIVTAGFGTPTKKFLLIIDTGSDLTWIQCKPCLGCYSQVDPIFEPSQSSSYKSL 124
Query: 186 PCNSTSCRILRES 198
PC S +C L S
Sbjct: 125 PCLSATCTELLTS 137
>gi|326517745|dbj|BAK03791.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 556
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 105/211 (49%), Gaps = 17/211 (8%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDP--FFYASKSKTFFKIPCN 188
+ + + +G P + + +DTG+ +++ QC+PC + C +Q D F SKS++F ++ C+
Sbjct: 206 FLMPIKLGTPPVWNLVAVDTGATLSFVQCEPCTLRCHKQTDAGEIFDPSKSESFSRVGCS 265
Query: 189 STSCRILRESFPFGN--CNSKE--CPFNIQYADGSG-SGGFWATDRITIQEANSNGYFTR 243
CR ++ + + C KE C +++ + S S G DR+ I + + GY
Sbjct: 266 ENKCRTVQRALHLQSKACMEKEDSCLYSMTFGGTSSYSVGKLVRDRLAIGKY-AKGY--S 322
Query: 244 YP-FLLGCINNSSGDKSGASGIMGLDRSPVSIITRT----NTSYFSYCLPSPYGSTGYIT 298
+P FL GC ++ + A G++G P S + N FSYC PS TGY++
Sbjct: 323 FPDFLFGCSLDTEYHQYEA-GLVGFADEPFSFFEQVAPLVNYKAFSYCFPSDRRKTGYLS 381
Query: 299 FGKTDTVNTLRSAFHKRMKKYKKAKGLEDLL 329
G VN+ + ++ + A L+++L
Sbjct: 382 IGDYTRVNSTYTPLFLARQQSRYALKLDEVL 412
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 80/164 (48%), Gaps = 12/164 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNS 189
Y + +G P + +++DTGS +TW QC PC + C +Q P F S ++ + C++
Sbjct: 136 NYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSCST 195
Query: 190 TSCRILRESF--PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C L + P +S C + Y D S S G+ + D ++ G + F
Sbjct: 196 PQCNDLSTATLNPAACSSSDVCIYQASYGDSSFSVGYLSKDTVSF------GSNSVPNFY 249
Query: 248 LGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLP 288
GC ++ G ++G+MGL R+ +S++ + + FSYCLP
Sbjct: 250 YGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLP 293
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 63/129 (48%), Gaps = 5/129 (3%)
Query: 287 LPSPYGSTGYITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKI 346
LP+ S IT T + L A MK K+A +LDTC+ + ++ VP +
Sbjct: 350 LPTIIDSGTVITRLPTTVYDALSKAVAGAMKGTKRADAYS-ILDTCF-VGQASSLRVPAV 407
Query: 347 AIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRR 406
++ F GG L+L + LV S CL FA P ++ +GN QQ+ V YDV R
Sbjct: 408 SMAFSGGAALKLSAQNLLVDVDSSTTCLAFA---PARSAAIIGNTQQQTFSVVYDVKSNR 464
Query: 407 LGFGPGNCS 415
+GF G C+
Sbjct: 465 IGFAAGGCT 473
>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 65/115 (56%), Gaps = 8/115 (6%)
Query: 118 FTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYAS 177
F P I D A + ++IG P V ++LDTGSD+ W QC+PC C++Q+DP + +
Sbjct: 94 FVPPPLIRDKSA--FLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRT 151
Query: 178 KSKTFFKIPCNSTSCRIL-RESFPFGNC-NSKECPFNIQYADGSGSGGFWATDRI 230
KS ++ ++ CN C L RE G C +S C + YADGS + G + +++
Sbjct: 152 KSDSYTEMLCNEPPCLSLGRE----GQCSDSGSCLYQTSYADGSRTSGLLSYEKV 202
>gi|145356007|ref|XP_001422234.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582474|gb|ABP00551.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 76/352 (21%), Positives = 118/352 (33%), Gaps = 85/352 (24%)
Query: 143 QYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFG 202
Q L++DTGS T+ CK C C + ++ +S F ++ C S L E G
Sbjct: 49 QTYDLIVDTGSARTYVPCKGCARCGEHAHGYYDYDRSMEFERLDCGEASDATLCEETMKG 108
Query: 203 NCNSK-ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD--KS 259
C S C + + YA+GS S G+ DR+ + E + GC + +
Sbjct: 109 TCQSDGRCSYVVSYAEGSSSRGYVVRDRVRLGEGTLSAMLA-----FGCEEAETNAIYEQ 163
Query: 260 GASGIMGLDRSPVSIITRTNTS-----YFSYCLPSPYGSTGYITFGKTD----------- 303
A G+ G R ++ + ++ FS+C+ + G +T G+ D
Sbjct: 164 KADGLFGFGRGTATVHAQLASAGLIENVFSFCVEGFGANGGVLTLGRFDFGADAPALART 223
Query: 304 --TVNTLRSAFHK------------------------------------------RMKKY 319
+ AFH R+
Sbjct: 224 PLVADPANPAFHNVRTSSWKLGDSLIEHLNSYTTTLDSGTTFTFVPRSVWVSFKTRLDTQ 283
Query: 320 KKAKGLEDLL-------DTCYDLSAYETVVV----------PKIAIHFLGGVDLELDVRG 362
GLE + D CY +SA + P + I + GGV L L
Sbjct: 284 ATQAGLEIVAGPDPQYDDVCYGVSAAAMNMTLSQSTVSEWFPPLTIAYEGGVSLTLGPEN 343
Query: 363 TLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L + + N I LG + R + +DVA R+G P NC
Sbjct: 344 YLFAHETNSAAFCVGIFANPNNQILLGQITMRDTLMEFDVANSRVGMAPANC 395
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 50/169 (29%), Positives = 82/169 (48%), Gaps = 11/169 (6%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSC- 192
+ + IG P Q ++LDTGS ++W QC + F S S +F +PCN C
Sbjct: 79 VSLPIGTPPQSQQMILDTGSQLSWIQCHKKVPRKPPPSTVFDPSLSSSFSVLPCNHPLCK 138
Query: 193 -RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCI 251
RI + P ++ C ++ YADG+ + G ++IT + S P +LGC
Sbjct: 139 PRIPDFTLPTSCDLNRLCHYSYFYADGTLAEGNLVREKITFSTSQSTP-----PLILGCA 193
Query: 252 NNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFG 300
++S DK GI+G++ +S ++ + FSYC+P+ G+ G
Sbjct: 194 EDASDDK----GILGMNLGRLSFASQAKITKFSYCVPTRQVRPGFTPTG 238
Score = 41.2 bits (95), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 49/103 (47%), Gaps = 5/103 (4%)
Query: 315 RMKKYKKAKGLEDLLDTCYDLSAYET-VVVPKIAIHFLGGVDLELDVRGTLVVASVSQVC 373
R+KK G+ D+ C+D +A E ++ + F GV++ ++ L C
Sbjct: 334 RLKKGYVYSGVSDM---CFDGNAMEIGRLIGNMVFEFDKGVEIVIEKGRVLADVGGGVHC 390
Query: 374 LGFA-TYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+G + S +GN Q+ V +D+A RR+GFG +CS
Sbjct: 391 VGIGRSEMLGAASNIIGNFHQQNLWVEFDIANRRVGFGKADCS 433
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 57/202 (28%), Positives = 101/202 (50%), Gaps = 21/202 (10%)
Query: 109 PEFLKRT-EAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCF 167
PE ++R+ + F NI+ TV+ + +G P Q V++++DTGS+++W C +
Sbjct: 55 PESVRRSPDKLPFRHNISLTVS------LTVGTPPQNVTMVIDTGSELSWLHCNTSQNSS 108
Query: 168 QQRDPFFYASKSKTFFKIPCNSTSCRILRESFPF-GNCNSKE-CPFNIQYADGSGSGGFW 225
F S ++ IPC+S++C FP +C+S + C + YAD S S G
Sbjct: 109 SSSST-FNPVWSSSYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNL 167
Query: 226 ATDRITIQEANSNGYFTRYPFLLGCIN----NSSGDKSGASGIMGLDRSPVSIITRTNTS 281
ATD I + + GC++ ++S + S +G+MG++R +S +++
Sbjct: 168 ATDTFYIGSSGIPN------VVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFP 221
Query: 282 YFSYCLPSPYGSTGYITFGKTD 303
FSYC+ S Y +G + G +
Sbjct: 222 KFSYCI-SEYDFSGLLLLGDAN 242
>gi|297744129|emb|CBI37099.3| unnamed protein product [Vitis vinifera]
Length = 299
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 42/119 (35%), Positives = 63/119 (52%), Gaps = 11/119 (9%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPC--- 187
E+ + +AIG P + S ++DTGSD+ WTQCKPC CF Q P F KS +F K+PC
Sbjct: 96 EFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSD 155
Query: 188 ---NSTSCRILRESFPFGNCNSKECPF-----NIQYADGSGSGGFWATDRITIQEANSN 238
+ST + E+F FG+ + + F N A G+G F + ++ + + S
Sbjct: 156 LYHSSTQGVLATETFTFGDASVSKIGFGCGEDNRGRAYSQGAGLFISQMKLDVDASGST 214
>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
vinifera]
Length = 437
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 72/240 (30%), Positives = 107/240 (44%), Gaps = 28/240 (11%)
Query: 55 QGPDKAS-LEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLK 113
+ PD+ S L+V+ Y PCS +E L ++ L ++ + R F L
Sbjct: 31 ETPDQGSTLQVLHVYSPCSPFRP---------KEPLSWEESVLQMQAKDKARLQFLSSLV 81
Query: 114 RTEAFTFPANINDTVADEYYIVVA-IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDP 172
++ A+ V + YIV A IG P Q + + +DT SDV W C C+ C
Sbjct: 82 ARKSVVPIASGRQIVQNPTYIVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGC---SST 138
Query: 173 FFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITI 232
F + S T+ + C + C+ P C C FN+ Y GS + D IT+
Sbjct: 139 LFNSPASTTYKSLGCQAAQCK----QVPKPTCGGGVCSFNLTYG-GSSLAANLSQDTITL 193
Query: 233 QEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
GY GCI ++G A G++GL R P+S++++T Y FSYCLPS
Sbjct: 194 ATDAVPGYS------FGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS 247
Score = 46.6 bits (109), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 51/111 (45%), Gaps = 9/111 (8%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
+R AF R+ + L DTCY + + P I F G +++ L L+ +
Sbjct: 333 VRDAFRNRVGRNLTVTSLGGF-DTCYTVP----IAAPTITFMFTG-MNVTLPPDNLLIHS 386
Query: 368 SV-SQVCLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ S CL A P + NS+ + N+QQ+ H + YDV RLG C+
Sbjct: 387 TAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELCT 437
>gi|88174593|gb|ABD39371.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 64/206 (31%), Positives = 98/206 (47%), Gaps = 30/206 (14%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y I V +G P + + +DTGS +W C+ C C F S+S T K+ C ++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFL-QSRSTTCAKVSCGTSM 58
Query: 192 CRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-F 246
C +L S P +C E CPF + Y DGS S G D +T + + P F
Sbjct: 59 C-LLGGSDP--HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPGF 109
Query: 247 LLGCINNSSG--DKSGASGIMGLDRSPVSIITRTNTSY--FSYCLP---SPYG----STG 295
GC +S G + G++G+ P+S++ +++ ++ FSYCLP S G +TG
Sbjct: 110 SFGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTG 169
Query: 296 YITFGKTDTVNTLRSAFHKRMKKYKK 321
Y + GK T +R + +M KK
Sbjct: 170 YFSLGKVATRTDVR---YTKMVARKK 192
>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 63/207 (30%), Positives = 98/207 (47%), Gaps = 27/207 (13%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y I V +G P + + +DTGS +W C+ C C F S+S T K+ C ++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSASWVFCE-CDGCHTNPRTFL-QSRSTTCAKVSCGTSM 58
Query: 192 CRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C +L S P +C E CPF + Y DGS S G D +T + FT
Sbjct: 59 C-LLGGSDP--HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----- 110
Query: 248 LGCINNSSG--DKSGASGIMGLDRSPVSIITRTNTSY--FSYCLP---SPYG----STGY 296
GC +S G + G++G+ P+S++ +++ ++ FSYCLP S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAK 323
+ GK T +R + K + + K +
Sbjct: 171 FSLGKVATRTDVR--YTKMVARRKNTE 195
>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 64/206 (31%), Positives = 98/206 (47%), Gaps = 30/206 (14%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y I V +G P + + +DTGS +W C+ C C F S+S T K+ C ++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFL-QSRSTTCAKVSCGTSM 58
Query: 192 CRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-F 246
C +L S P +C E CPF + Y DGS S G D +T + + P F
Sbjct: 59 C-LLGGSDP--HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPGF 109
Query: 247 LLGCINNSSG--DKSGASGIMGLDRSPVSIITRTNTSY--FSYCLP---SPYG----STG 295
GC +S G + G++G+ P+S++ +++ ++ FSYCLP S G +TG
Sbjct: 110 SFGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTG 169
Query: 296 YITFGKTDTVNTLRSAFHKRMKKYKK 321
Y + GK T +R + +M KK
Sbjct: 170 YFSLGKVATRTDVR---YTKMVARKK 192
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 54/174 (31%), Positives = 80/174 (45%), Gaps = 22/174 (12%)
Query: 136 VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF---KIPCNSTSC 192
++IG+P +++DTGSD+ W C PC +C F S S TF K PC+ C
Sbjct: 105 ISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNHLGLLFDPSMSSTFSPLCKTPCDFKGC 164
Query: 193 RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLGCI 251
C+ PF + YAD S + G + D + + + +R P L GC
Sbjct: 165 S---------RCD--PIPFTVTYADNSTASGMFGRDTVVFETTDEGT--SRIPDVLFGCG 211
Query: 252 NNSSGDKS-GASGIMGLDRSPVSIITRTNTSYFSYC---LPSPYGSTGYITFGK 301
+N D G +GI+GL+ P S+ T+ FSYC L PY + + G+
Sbjct: 212 HNIGQDTDPGHNGILGLNNGPDSLATKIGQK-FSYCIGDLADPYYNYHQLILGE 264
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 66/128 (51%), Gaps = 8/128 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
+Y + + +G P V L+DTGSD+ W QC PC C++Q+ P F +S T+ IPC+S
Sbjct: 49 DYLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPCQGCYRQKSPMFEPLRSNTYTPIPCDSE 108
Query: 191 SCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
C L FG+ S K C ++ YAD S + G A + +T + +
Sbjct: 109 ECNSL-----FGHSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPVVVG-DIVF 162
Query: 249 GCINNSSG 256
GC +++SG
Sbjct: 163 GCGHSNSG 170
>gi|194701538|gb|ACF84853.1| unknown [Zea mays]
gi|194703714|gb|ACF85941.1| unknown [Zea mays]
Length = 208
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 43/109 (39%), Positives = 58/109 (53%), Gaps = 6/109 (5%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+ L SAF MK+Y A+ +LDTC+D S +V +P +A+ F GG + LD G ++
Sbjct: 106 SALSSAFKAGMKQYPPAQ-PSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL 164
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
CL FA D + +GNVQQR EV YDV +GF G C
Sbjct: 165 -----SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 208
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 74/161 (45%), Gaps = 18/161 (11%)
Query: 125 NDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQR---DPFFYASKSK 180
+D + YY V IG P Q +L++DTGS VT+ C C HC + DP F S
Sbjct: 91 DDLLTKGYYTSRVFIGTPAQEFALIVDTGSTVTYVPCSSCTHCGHHQACFDPRFKPDNSS 150
Query: 181 TFFKIPCNSTSCRILRESFPFGNCNSK--ECPFNIQYADGSGSGGFWATDRITIQEANSN 238
++ + CNS C C+++ +C + YA+ S S G D + +
Sbjct: 151 SYQTVSCNSPDCITKM-------CDARVHQCKYERVYAEMSSSKGVLGKDLLGFGNGSR- 202
Query: 239 GYFTRYPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITR 277
+P L GC +GD A GIMGL R P+SI+ +
Sbjct: 203 --LQPHPLLFGCETAETGDLYLQHADGIMGLGRGPLSIVDQ 241
>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 63/207 (30%), Positives = 98/207 (47%), Gaps = 27/207 (13%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y I V +G P + + +DTGS +W C+ C C F S+S T K+ C ++
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSASWVFCE-CDGCHTNPRTFL-QSRSTTCAKVSCGTSM 58
Query: 192 CRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C +L S P +C E CPF + Y DGS S G D +T + FT
Sbjct: 59 C-LLGGSDP--HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----- 110
Query: 248 LGCINNSSG--DKSGASGIMGLDRSPVSIITRTNTSY--FSYCLP---SPYG----STGY 296
GC +S G + G++G+ P+S++ +++ ++ FSYCLP S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAK 323
+ GK T +R + K + + K +
Sbjct: 171 FSLGKVATRTDVR--YTKMVARRKNTE 195
>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 63/207 (30%), Positives = 98/207 (47%), Gaps = 27/207 (13%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y I V +G P + + +DTGS +W C+ C C F S+S T K+ C ++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFL-QSRSTTCAKVSCGTSM 58
Query: 192 CRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C +L S P +C E CPF + Y DGS S G D +T + FT
Sbjct: 59 C-LLGGSDP--HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----- 110
Query: 248 LGCINNSSG--DKSGASGIMGLDRSPVSIITRTNTSY--FSYCLP---SPYG----STGY 296
GC +S G + G++G+ P+S++ +++ ++ FSYCLP S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAK 323
+ GK T +R + K + + K +
Sbjct: 171 FSLGKVATRTDVR--YTKMVARRKNTE 195
>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 424
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 58/180 (32%), Positives = 84/180 (46%), Gaps = 15/180 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPF---FYASKSKTFFKIP 186
EY + IG P V LDT + + W QC C C ++ F +SKS T+ P
Sbjct: 74 EYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCNSQCEPEKRGLTTKFLSSKSFTYEMEP 133
Query: 187 CNSTSCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRY 244
C S C L F CNS K C + + Y D + G ++D S+G
Sbjct: 134 CGSNFCNSLT---GFQTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFD--TSDGMLVDV 188
Query: 245 PFL-LGCINNS-SGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLP--SPYGSTGYITFG 300
FL GC +GD+ +G +GL+++P+S+I++ FSYCL + GST + FG
Sbjct: 189 GFLNFGCSEAPLTGDEQSYTGNVGLNQTPLSLISQLGIKKFSYCLVPFNNLGSTSKMYFG 248
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 32/117 (27%), Positives = 56/117 (47%), Gaps = 5/117 (4%)
Query: 301 KTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDL-SAYETVVVPKIAIHFLGGVDLELD 359
+TD ++L + F ++ ++ + C++L +A + P + +HF G DL L+
Sbjct: 309 ETDAFDSLLAKFLTLKDFPQRKDDPKERFELCFELQNANDLESFPDVTVHF-DGADLILN 367
Query: 360 VRGTLV-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
V T V + CL P SI LGN Q + + V YD+ + + F P +C+
Sbjct: 368 VESTFVKIEDDGIFCLALLR-SGSPVSI-LGNFQLQNYHVGYDLEAQVISFAPVDCA 422
>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
gi|224030351|gb|ACN34251.1| unknown [Zea mays]
Length = 342
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 73/144 (50%), Gaps = 9/144 (6%)
Query: 159 QCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADG 218
QC+PC+ C++Q DP F S ++ +PC S +C L + + C + +Y+
Sbjct: 2 QCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQL-DGHRCHEDDDGACQYTYKYSGH 60
Query: 219 SGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSG-ASGIMGLDRSPVSIITR 277
+ G A D++ I G + + GC ++S G + ASG++GL R P+S++++
Sbjct: 61 GVTKGTLAIDKLAI------GGDVFHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQ 114
Query: 278 TNTSYFSYCLPSPYGST-GYITFG 300
+ F YCLP P T G + G
Sbjct: 115 LSVHRFMYCLPPPMSRTSGKLVLG 138
>gi|224130234|ref|XP_002328687.1| predicted protein [Populus trichocarpa]
gi|222838863|gb|EEE77214.1| predicted protein [Populus trichocarpa]
Length = 603
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 52/173 (30%), Positives = 83/173 (47%), Gaps = 17/173 (9%)
Query: 139 GEPKQYVSLLLDTGSDVTWTQCK-PCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRE 197
G P+ Y L DTGSD+TW QC PC C + + ++ + +P C ++
Sbjct: 198 GPPQPYY-LDFDTGSDLTWIQCDAPCTSCAKGANAWYKPRRGNI---VPPKDLLCMEVQR 253
Query: 198 SFPFGNCNS-KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSG 256
+ G C + +C + I+YAD S S G ATD++ + A NG T+ F+ GC + G
Sbjct: 254 NQKAGYCETCDQCDYEIEYADHSSSMGVLATDKLLLMVA--NGSLTKLNFIFGCAYDQQG 311
Query: 257 ----DKSGASGIMGLDRSPVSIITRTNT-----SYFSYCLPSPYGSTGYITFG 300
GI+GL R+ VS+ ++ + + +CL + G GY+ G
Sbjct: 312 LLLKTLVKTDGILGLSRAKVSLPSQLASQGIINNVIGHCLTTDLGGGGYMFLG 364
>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
Length = 453
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 86/165 (52%), Gaps = 15/165 (9%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNS 189
EY + +AIG P Q + DTGSD+ WTQC PC CF+Q P + S S TF +PC+S
Sbjct: 91 EYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSS 150
Query: 190 ------TSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
R+ + P G C C +N Y G S G ++ T + ++ R
Sbjct: 151 ALNLCAAEARLAGATPPPG-C---ACRYNQTYGTGWTS-GLQGSETFTFGSSPADQ--VR 203
Query: 244 YPFL-LGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
P + GC N SS D +G++G++GL R +S++++ FSYCL
Sbjct: 204 VPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL 248
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 45/157 (28%), Positives = 66/157 (42%), Gaps = 23/157 (14%)
Query: 278 TNTSYFSYCLPSPYGS-------TGYITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDL-- 328
T S + LP P G+ TG + T+ +L A +KR++ + L L
Sbjct: 299 TGISVGAAALPIPPGAFALRADGTGGLIIDSGTTITSLVDAAYKRVRA--AVRSLVKLPV 356
Query: 329 --------LDTCYDL--SAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFAT 378
LD C+ L S+ +P + +HF GG D+ L V +++ CL +
Sbjct: 357 TDGSNATGLDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMILDG-GMWCLAMRS 415
Query: 379 YPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
D TLGN QQ+ + YDV L F P CS
Sbjct: 416 QT-DGELSTLGNYQQQNLHILYDVQKETLSFAPAKCS 451
>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 453
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 86/165 (52%), Gaps = 15/165 (9%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNS 189
EY + +AIG P Q + DTGSD+ WTQC PC CF+Q P + S S TF +PC+S
Sbjct: 91 EYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSS 150
Query: 190 ------TSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
R+ + P G C C +N Y G S G ++ T + ++ R
Sbjct: 151 ALNLCAAEARLAGATPPPG-C---ACRYNQTYGTGWTS-GLQGSETFTFGSSPADQ--VR 203
Query: 244 YPFL-LGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
P + GC N SS D +G++G++GL R +S++++ FSYCL
Sbjct: 204 VPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL 248
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 45/157 (28%), Positives = 65/157 (41%), Gaps = 23/157 (14%)
Query: 278 TNTSYFSYCLPSPYGS-------TGYITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDL-- 328
T S LP P G+ TG + T+ +L A +KR++ + L L
Sbjct: 299 TGISVGPAALPIPPGAFALRADGTGGLIIDSGTTITSLVDAAYKRVRA--AVRSLVKLPV 356
Query: 329 --------LDTCYDL--SAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFAT 378
LD C+ L S+ +P + +HF GG D+ L V +++ CL +
Sbjct: 357 TDGSNATGLDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMILDG-GMWCLAMRS 415
Query: 379 YPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
D TLGN QQ+ + YDV L F P CS
Sbjct: 416 QT-DGELSTLGNYQQQNLHILYDVQKETLSFAPAKCS 451
>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
Length = 458
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 86/165 (52%), Gaps = 15/165 (9%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNS 189
EY + +AIG P Q + DTGSD+ WTQC PC CF+Q P + S S TF +PC+S
Sbjct: 96 EYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSS 155
Query: 190 ------TSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
R+ + P G C C +N Y G S G ++ T + ++ R
Sbjct: 156 ALNLCAAEARLAGATPPPG-C---ACRYNQTYGTGWTS-GLQGSETFTFGSSPADQ--VR 208
Query: 244 YPFL-LGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
P + GC N SS D +G++G++GL R +S++++ FSYCL
Sbjct: 209 VPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL 253
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 45/157 (28%), Positives = 65/157 (41%), Gaps = 23/157 (14%)
Query: 278 TNTSYFSYCLPSPYGS-------TGYITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDL-- 328
T S LP P G+ TG + T+ +L A +KR++ + L L
Sbjct: 304 TGISVGPAALPIPPGAFALRADGTGGLIIDSGTTITSLVDAAYKRVRA--AVRSLVKLPV 361
Query: 329 --------LDTCYDL--SAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFAT 378
LD C+ L S+ +P + +HF GG D+ L V +++ CL +
Sbjct: 362 TDGSNATGLDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMILDG-GMWCLAMRS 420
Query: 379 YPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
D TLGN QQ+ + YDV L F P CS
Sbjct: 421 QT-DGELSTLGNYQQQNLHILYDVQKETLSFAPAKCS 456
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 53/194 (27%), Positives = 85/194 (43%), Gaps = 30/194 (15%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFFKIP 186
Y+ V +G P + + +DTGSD+ W C PC C + FF S T KIP
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176
Query: 187 CNSTSCRILRESFPFGNCNSKE---CPFNIQYADGSGSGGFWATDRITI-------QEAN 236
C+ C ++ C + + C + Y DGSG+ G++ +D + Q AN
Sbjct: 177 CSDDRCTAALQTSE-AVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTAN 235
Query: 237 SNGYFTRYPFLLGCINNSSGDKS----GASGIMGLDRSPVSIITRTNT-----SYFSYCL 287
S+ + GC N+ SGD + GI G + +S++++ N+ FS+CL
Sbjct: 236 SSA-----SIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL 290
Query: 288 PSPYGSTGYITFGK 301
G + G+
Sbjct: 291 KGSDNGGGILVLGE 304
>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
Length = 452
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 52/182 (28%), Positives = 83/182 (45%), Gaps = 23/182 (12%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCK----PCIHCFQQRDPFFYASKSKTFFKIPCNS 189
+ VA+G P Q V+++LDTGS+++W +C P Q F S S T+ C+S
Sbjct: 62 VPVAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPP-PQAPAAFNGSASSTYAAAHCSS 120
Query: 190 TSCRILRESFPF----GNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C+ P S C ++ YAD S + G A D + G
Sbjct: 121 PECQWRGRDLPVPPFCAGPPSXSCRVSLSYADASSADGILAADTFLL------GGAPPVX 174
Query: 246 FLLGCINN-------SSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYIT 298
L GC+ + +S D A+G++G++R +S +T+T T F+YC+ +P G +
Sbjct: 175 ALFGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATLRFAYCI-APGDGPGLLV 233
Query: 299 FG 300
G
Sbjct: 234 LG 235
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 61/193 (31%), Positives = 91/193 (47%), Gaps = 25/193 (12%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y I V +G P + + +DTGS +W C+ C C F S+S T K+ C ++
Sbjct: 82 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFL-QSRSTTCAKVSCGTSM 139
Query: 192 CRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C +L S P +C E CPF + Y DGS S G D +T + FT
Sbjct: 140 C-LLGGSDP--HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----- 191
Query: 248 LGCINNSSG--DKSGASGIMGLDRSPVSIITRTNTSY--FSYCLP---SPYG----STGY 296
GC +S G + G++G+ P+S++ +++ + FSYCLP S G +TGY
Sbjct: 192 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGY 251
Query: 297 ITFGKTDTVNTLR 309
+ GK T +R
Sbjct: 252 FSLGKVATRTDVR 264
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 56/111 (50%), Gaps = 9/111 (8%)
Query: 310 SAFHKRMKKY--KKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
S +R+++ ++ E+ CYD+ + + +P I++HF G +L G V
Sbjct: 321 SVLSQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARFDLGSHGVFVER 380
Query: 368 SVSQV---CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGP-GNC 414
SV + CL FA P + SI +G++ Q EV YD+ + +G GP G C
Sbjct: 381 SVQEQDVWCLAFA--PTESVSI-IGSLMQTSKEVVYDLKRQLIGIGPSGAC 428
>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
Length = 573
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 54/184 (29%), Positives = 87/184 (47%), Gaps = 24/184 (13%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCK-PCIHCFQQRDPFFYASKSKTFFKIPCNS 189
+YY + +G P + L +DTGSD+TW QC PC +C + P + +K K +P
Sbjct: 202 QYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKI---VPPKD 258
Query: 190 TSCRILRESFPFGNCN----SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C+ L+ GN N K+C + I+YAD S S G A D + I +NG +
Sbjct: 259 LLCQELQ-----GNQNYCETCKQCDYEIEYADRSSSMGVLARDDMHI--ITTNGGREKLD 311
Query: 246 FLLGCINNSSGD----KSGASGIMGLDRSPVSIITRTN-----TSYFSYCLPSPYGSTGY 296
F+ GC + G + GI+GL + +S+ ++ ++ F +C+ GY
Sbjct: 312 FVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRDPNGGGY 371
Query: 297 ITFG 300
+ G
Sbjct: 372 MFLG 375
>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
Length = 454
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 52/182 (28%), Positives = 83/182 (45%), Gaps = 23/182 (12%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCK----PCIHCFQQRDPFFYASKSKTFFKIPCNS 189
+ VA+G P Q V+++LDTGS+++W +C P Q F S S T+ C+S
Sbjct: 64 VPVAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPP-PQAPAAFNGSASSTYAAAHCSS 122
Query: 190 TSCRILRESFPF----GNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C+ P S C ++ YAD S + G A D + G
Sbjct: 123 PECQWRGRDLPVPPFCAGPPSNSCRVSLSYADASSADGILAADTFLL------GGAPPVR 176
Query: 246 FLLGCINN-------SSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYIT 298
L GC+ + +S D A+G++G++R +S +T+T T F+YC+ +P G +
Sbjct: 177 ALFGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATLRFAYCI-APGDGPGLLV 235
Query: 299 FG 300
G
Sbjct: 236 LG 237
>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
Length = 574
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 54/184 (29%), Positives = 87/184 (47%), Gaps = 24/184 (13%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCK-PCIHCFQQRDPFFYASKSKTFFKIPCNS 189
+YY + +G P + L +DTGSD+TW QC PC +C + P + +K K +P
Sbjct: 203 QYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKI---VPPKD 259
Query: 190 TSCRILRESFPFGNCN----SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C+ L+ GN N K+C + I+YAD S S G A D + I +NG +
Sbjct: 260 LLCQELQ-----GNQNYCETCKQCDYEIEYADRSSSMGVLARDDMHI--ITTNGGREKLD 312
Query: 246 FLLGCINNSSGD----KSGASGIMGLDRSPVSIITRTN-----TSYFSYCLPSPYGSTGY 296
F+ GC + G + GI+GL + +S+ ++ ++ F +C+ GY
Sbjct: 313 FVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRDPNGGGY 372
Query: 297 ITFG 300
+ G
Sbjct: 373 MFLG 376
>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
Length = 418
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 58/192 (30%), Positives = 94/192 (48%), Gaps = 21/192 (10%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNS 189
YY+ + IG+P + L +DTGSD+TW QC PC C + P + +K+K +PC +
Sbjct: 56 HYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPTKNKL---VPCAN 112
Query: 190 TSCRILRE-SFPFGNCNS-KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
+ C L S P C + ++C + I+Y D + S G D ++ N + R
Sbjct: 113 SICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVMDSFSLPLRNKSN--VRPSLS 170
Query: 248 LGCINNSSGDKSGAS-----GIMGLDRSPVSIITR-----TNTSYFSYCLPSPYGSTGYI 297
GC + K+GA+ G++GL R VS++++ + +CL + G G++
Sbjct: 171 FGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLSTSGG--GFL 228
Query: 298 TFGKTDTVNTLR 309
FG D V T R
Sbjct: 229 FFGD-DMVPTSR 239
>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 60/200 (30%), Positives = 96/200 (48%), Gaps = 19/200 (9%)
Query: 105 RKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCI 164
R P P + +TF +NI ++A + + IG P Q L+LDTGS ++W QC P
Sbjct: 59 RNPSPP----SSPYTFRSNIKYSMA--LILSLPIGTPSQSQELVLDTGSQLSWIQCHPKK 112
Query: 165 HCFQQRDPF--FYASKSKTFFKIPCNSTSC--RILRESFPFGNCNSKECPFNIQYADGSG 220
P F S S +F +PC+ C RI + P +++ C ++ YADG+
Sbjct: 113 IKKPLPPPTTSFDPSLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTF 172
Query: 221 SGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNT 280
+ G ++ T + T P +LGC S+ +K GI+G++ +S I++
Sbjct: 173 AEGNLVKEKFTFSNSQ-----TTPPLILGCAKESTDEK----GILGMNLGRLSFISQAKI 223
Query: 281 SYFSYCLPSPYGSTGYITFG 300
S FSYC+P+ G + G
Sbjct: 224 SKFSYCIPTRSNRPGLASTG 243
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 65/246 (26%), Positives = 101/246 (41%), Gaps = 28/246 (11%)
Query: 52 ALPQGPDKAS-LEVVSKYGPCSRLNQ-GISTHAPSLEEILRQDQQRLHLKNSRRLRKPFP 109
A P K S L V+ YG CS NQ + ++ + +D R+ +S
Sbjct: 24 ASPSSESKGSDLSVIHVYGQCSPFNQHKAGSWVNTVINMASKDPARVTYLSS-------- 75
Query: 110 EFLKRTEAFTFPANINDTVAD--EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCF 167
+ +A + P V + Y + V +G P Q + ++LDT D W C C C
Sbjct: 76 -LVASPKATSVPIASGQQVLNIGNYVVRVKLGTPGQLMFMVLDTSRDAAWVPCADCAGC- 133
Query: 168 QQRDPFFYASKSKTFFKIPCNSTSCRILRE-SFPFGNCNSKECPFNIQYADGSGSGGFWA 226
P F + S T+ + C+ C +R S P + C FN Y S +
Sbjct: 134 --SSPTFSPNTSSTYASLQCSVPQCTQVRGLSCP--TTGTAACFFNQTYGGDSSFSAMLS 189
Query: 227 TDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---F 283
D + + Y GC+N SG G++GL R P+S+++++ + Y F
Sbjct: 190 QDSLGLAVDTLPSYS------FGCVNAVSGSTLPPQGLLGLGRGPMSLLSQSGSLYSGVF 243
Query: 284 SYCLPS 289
SYC PS
Sbjct: 244 SYCFPS 249
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 53/106 (50%), Gaps = 9/106 (8%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
+R F K++K G DTC+ +A + P + HF G +DL+L + TL+ +
Sbjct: 335 IRDEFRKQVKGPFATIGA---FDTCF--AATNEDIAPPVTFHFTG-MDLKLPLENTLIHS 388
Query: 368 SV-SQVCLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGRRLGFG 410
S S CL A P + NS+ + N+QQ+ + +DV RLG
Sbjct: 389 SAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFDVTNSRLGIA 434
>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
Length = 396
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 56/163 (34%), Positives = 77/163 (47%), Gaps = 14/163 (8%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
Y + + IG P Q VS ++D G ++ WTQC + C CF+Q P F + S TF PC +
Sbjct: 51 YVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEPCGAA 110
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGS--GSGGFWATDRITIQEANSNGYFTRYPFLL 248
C ES P +C A S + G TD + I A + R F
Sbjct: 111 VC----ESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGTAAT----ARLAF-- 160
Query: 249 GCINNSSGDKS-GASGIMGLDRSPVSIITRTNTSYFSYCLPSP 290
GC S D G+SG +GL R+ +S+ + N + FSYCL P
Sbjct: 161 GCAVASEMDTMWGSSGSVGLGRTNLSLAAQMNATAFSYCLAPP 203
>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 551
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 51/181 (28%), Positives = 88/181 (48%), Gaps = 18/181 (9%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCK-PCIHCFQQRDPFFYASKSKTFFKIPCNS 189
+YY + +G P + L +DTGSD+TW QC PC +C + P + +K K +P
Sbjct: 190 QYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKI---VPPRD 246
Query: 190 TSCRILRESFPFGNCNS-KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
+ C+ L+ + C + K+C + I+YAD S S G A D + + +NG + F+
Sbjct: 247 SLCQELQGDQNY--CETCKQCDYEIEYADRSSSMGVLAKDDMHL--IATNGGREKLDFVF 302
Query: 249 GCINNSSGD----KSGASGIMGLDRSPVSIITRTN-----TSYFSYCLPSPYGSTGYITF 299
GC + G + GI+GL + +S+ ++ ++ F +C+ GY+
Sbjct: 303 GCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGHCITRETNGGGYMFL 362
Query: 300 G 300
G
Sbjct: 363 G 363
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 50/169 (29%), Positives = 80/169 (47%), Gaps = 11/169 (6%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSC- 192
+ + IG P Q ++LDTGS ++W QC + F S S +F +PCN C
Sbjct: 84 VSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPRKPPPSSVFDPSLSSSFSVLPCNHPLCK 143
Query: 193 -RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCI 251
RI + P ++ C ++ YADG+ + G ++IT + S P +LGC
Sbjct: 144 PRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITFSRSQSTP-----PLILGCA 198
Query: 252 NNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFG 300
S S A GI+G++ +S ++ + FSYC+P+ G+ G
Sbjct: 199 EES----SDAKGILGMNLGRLSFASQAKLTKFSYCVPTRQVRPGFTPTG 243
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 62/229 (27%), Positives = 106/229 (46%), Gaps = 20/229 (8%)
Query: 86 EEILRQDQQRLHLKNSRR---LRKPFPEFLKRTEAFTFPANIN-DTVADEYYIVVAIGEP 141
++L+ D R + +S R RK F + + P + D+ +Y++ + IG P
Sbjct: 73 RQLLQSDNARRQMISSLRHGTRRKAF----EVSHTAQIPIHSGADSGQSQYFVSIRIGTP 128
Query: 142 K-QYVSLLLDTGSDVTWTQCKPCIHCFQQRDP----FFYASKSKTFFKIPCNSTSCRI-L 195
+ Q L+ DTGSD+TW C+ + +P F A+ S +F IPC+S C+I L
Sbjct: 129 RPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFRANDSSSFRTIPCSSDDCKIEL 188
Query: 196 RESFPFGNCNSKECP--FNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINN 253
++ F C + P F+ +Y +G + G +A + +T+ N + + L+GC +
Sbjct: 189 QDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVTVG-LNDHKKIRLFDVLIGCTES 247
Query: 254 SSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITF 299
+ G+MGL S+ R + FSYCL S+ + F
Sbjct: 248 FNETNGFPDGVMGLGYRKHSLALRLAEIFGNKFSYCLVDHLSSSNHKNF 296
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 56/113 (49%), Gaps = 7/113 (6%)
Query: 304 TVNTLRSAFHKRMKKYKKAKGLE--DLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVR 361
V+ L+ F K+KK +E +L + C++ ++ VP++ IHF G + V+
Sbjct: 372 VVDALKPIF----DKHKKVVPIELPELNNFCFEDKGFDRAAVPRLLIHFADGAIFKPPVK 427
Query: 362 GTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
++ + CLG P S LGNV Q+ H YD+ +LGFGP +C
Sbjct: 428 SYIIDVAEGIKCLGIIK-ADFPGSSILGNVMQQNHLWEYDLGRGKLGFGPSSC 479
>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 396
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 56/163 (34%), Positives = 77/163 (47%), Gaps = 14/163 (8%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
Y + + IG P Q VS ++D G ++ WTQC + C CF+Q P F + S TF PC +
Sbjct: 51 YVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEPCGAA 110
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGS--GSGGFWATDRITIQEANSNGYFTRYPFLL 248
C ES P +C A S + G TD + I A + R F
Sbjct: 111 VC----ESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGTAAT----ARLAF-- 160
Query: 249 GCINNSSGDKS-GASGIMGLDRSPVSIITRTNTSYFSYCLPSP 290
GC S D G+SG +GL R+ +S+ + N + FSYCL P
Sbjct: 161 GCAVASEMDTMWGSSGSVGLGRTNLSLAAQMNATAFSYCLAPP 203
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 62/217 (28%), Positives = 95/217 (43%), Gaps = 29/217 (13%)
Query: 66 SKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANIN 125
S++ SR + H E L R HL+ S+ P R F +
Sbjct: 36 SRHHEGSRPAMILPLHHSVPESSLSHFNPRRHLQGSQSEHHPN----ARMRLF------D 85
Query: 126 DTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFK 184
D + + YY + IG P Q +L++DTGS VT+ C C HC +DP F S+T+
Sbjct: 86 DLLRNGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCKHCGSHQDPKFRPEASETYQP 145
Query: 185 IPCNSTSCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFT 242
+ C + C NC+ K+C + +YA+ S S G D ++ + +
Sbjct: 146 VKC-TWQC----------NCDDDRKQCTYERRYAEMSTSSGVLGEDVVSF---GNQSELS 191
Query: 243 RYPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITR 277
+ GC N+ +GD A GIMGL R +SI+ +
Sbjct: 192 PQRAIFGCENDETGDIYNQRADGIMGLGRGDLSIMDQ 228
>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 72/240 (30%), Positives = 109/240 (45%), Gaps = 29/240 (12%)
Query: 84 SLEEILRQDQQRLHLKNS---RRLRKPFPEFLKRTEAFTFPANIND-TVADEYYIVVAIG 139
SL ++ R D+QR+ S RR R+ AF P T +Y++ +G
Sbjct: 44 SLADLARSDRQRMAFIASHGRRRARETAAG--SSAAAFEMPLTSGAYTGIGQYFVRFRVG 101
Query: 140 EPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPF---FYASKSKTFFKIPCNSTSCRIL 195
P Q L+ DTGSD+TW +C +P + + F S+T+ I C S +C
Sbjct: 102 TPAQPFLLVADTGSDLTWVKCRRPAANSSESGSGSGRAFRPEDSRTWAPISCASDTC--- 158
Query: 196 RESFPF--GNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP----FL 247
+S PF C + C ++ +Y DGS + G T+ TI + G R +
Sbjct: 159 TKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATI-ALSGRGREERKAKLKGLV 217
Query: 248 LGCINNSSGDKSGAS-GIMGLDRSPVSIITRTNTSY---FSYCLP---SPYGSTGYITFG 300
LGC ++ +G S G++ L S VS + + + FSYCL SP +T Y+TFG
Sbjct: 218 LGCTSSYTGPSFEVSDGVLSLGYSDVSFASHAASRFAGRFSYCLVDHLSPRNATSYLTFG 277
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 44/90 (48%), Gaps = 2/90 (2%)
Query: 327 DLLDTCYDL-SAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNS 385
D + CY+ S V +PK+A+HF G LE + ++ A+ C+G P P
Sbjct: 387 DPFEYCYNWTSPSGDVTLPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQE-GPWPGI 445
Query: 386 ITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+GN+ Q+ H +D+ RRL F C+
Sbjct: 446 SVIGNILQQEHLWEFDIKNRRLKFQRSRCT 475
>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 63/207 (30%), Positives = 97/207 (46%), Gaps = 27/207 (13%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y I V +G P + + +DTGS +W C+ C C F S+S T K+ C ++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFL-QSRSTTCAKVSCGTSM 58
Query: 192 CRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C +L S P +C E CPF + Y DGS S G D +T + FT
Sbjct: 59 C-LLGGSDP--HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----- 110
Query: 248 LGCINNSSG--DKSGASGIMGLDRSPVSIITRTNTSY--FSYCLP---SPYG----STGY 296
GC +S G + G++G+ P+S++ +++ + FSYCLP S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAK 323
+ GK T +R + K + + K +
Sbjct: 171 FSLGKVATRTDVR--YTKMVARRKNTE 195
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 58/191 (30%), Positives = 86/191 (45%), Gaps = 23/191 (12%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRD---PF-FYASK-SKTFFKIP 186
Y+ + +G P + + +DTGSD+ W C PC C + D P Y SK S T +
Sbjct: 74 YFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVG 133
Query: 187 CNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPF 246
C C + +S G K C +++ Y DGS S G + D IT+++ N P
Sbjct: 134 CEDDFCSFIMQSETCG--AKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGN--LRTAPL 189
Query: 247 ----LLGCINNSSGD----KSGASGIMGLDRSPVSIITR-----TNTSYFSYCLPSPYGS 293
+ GC N SG S GIMG +S SII++ + FS+CL + G
Sbjct: 190 AQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGG 249
Query: 294 TGYITFGKTDT 304
G G+ ++
Sbjct: 250 -GIFAVGEVES 259
>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 63/207 (30%), Positives = 97/207 (46%), Gaps = 27/207 (13%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y I V +G P + + +DTGS +W C+ C C F S+S T K+ C ++
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFL-QSRSTTCAKVSCGTSM 58
Query: 192 CRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C +L S P +C E CPF + Y DGS S G D +T + FT
Sbjct: 59 C-LLGGSDP--HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----- 110
Query: 248 LGCINNSSG--DKSGASGIMGLDRSPVSIITRTNTSY--FSYCLP---SPYG----STGY 296
GC +S G + G++G+ P+S++ +++ + FSYCLP S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAK 323
+ GK T +R + K + + K +
Sbjct: 171 FSLGKVATRTDVR--YTKMVARRKNTE 195
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 77/158 (48%), Gaps = 19/158 (12%)
Query: 125 NDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
+D +++ YY + IG P Q +L++DTGS VT+ C C C + +DP F S T+
Sbjct: 80 DDLLSNGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSTCEQCGKHQDPRFQPESSSTYK 139
Query: 184 KIPCNSTSCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
+ CN P NC+ K+C + +YA+ S S G A D ++ +
Sbjct: 140 PMQCN-----------PSCNCDDEGKQCTYERRYAEMSSSSGLLAEDVLSFGNESE---L 185
Query: 242 TRYPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITR 277
T + GC +G+ A GIMGL R P+S++ +
Sbjct: 186 TPQRAIFGCETVETGELFSQRADGIMGLGRGPLSVVDQ 223
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 57/231 (24%), Positives = 99/231 (42%), Gaps = 23/231 (9%)
Query: 89 LRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLL 148
L Q + R HL+++R L+ F+ F+ + + + Y+ V +G P + ++
Sbjct: 42 LAQLRARDHLRHARLLQG----FVGGVVDFSVQGSSDPYLVGLYFTRVKLGTPPREFNVQ 97
Query: 149 LDTGSDVTWTQCKPCIHCFQQRD-----PFFYASKSKTFFKIPCNSTSCRILRESFPFGN 203
+DTGSDV W C C +C Q +F + S T +PC+ C ++
Sbjct: 98 IDTGSDVLWVTCSSCSNCPQTSGLGIQLNYFDTTSSSTARLVPCSHPICTSQIQTTA-TQ 156
Query: 204 C--NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYF--TRYPFLLGCINNSSGDKS 259
C S +C + QY DGSG+ G++ +D + + GC SGD +
Sbjct: 157 CPPQSNQCSYAFQYGDGSGTSGYYVSDTFYFDAVLGESLIANSSAAIVFGCSTYQSGDLT 216
Query: 260 ----GASGIMGLDRSPVSIITRTNTS-----YFSYCLPSPYGSTGYITFGK 301
GI G + +S+I++ ++ FS+CL G + G+
Sbjct: 217 KTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCLKGEDSGGGILVLGE 267
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 58/191 (30%), Positives = 86/191 (45%), Gaps = 23/191 (12%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRD---PF-FYASK-SKTFFKIP 186
Y+ + +G P + + +DTGSD+ W C PC C + D P Y SK S T +
Sbjct: 78 YFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVG 137
Query: 187 CNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPF 246
C C + +S G K C +++ Y DGS S G + D IT+++ N P
Sbjct: 138 CEDDFCSFIMQSETCG--AKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGN--LRTAPL 193
Query: 247 ----LLGCINNSSGD----KSGASGIMGLDRSPVSIITR-----TNTSYFSYCLPSPYGS 293
+ GC N SG S GIMG +S SII++ + FS+CL + G
Sbjct: 194 AQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGG 253
Query: 294 TGYITFGKTDT 304
G G+ ++
Sbjct: 254 -GIFAVGEVES 263
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 61/207 (29%), Positives = 98/207 (47%), Gaps = 26/207 (12%)
Query: 115 TEAFTFPANIND-TVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQC-------KPCIHC 166
+ AF P T +Y++ + +G P Q L+ DTGSD+TW +C
Sbjct: 86 SSAFAMPLTSGAYTGTGQYFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAAS 145
Query: 167 FQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNS--KECPFNIQYADGSGSGGF 224
QR F + SK++ +PC+S +C+ F NC+S C ++ +Y D S + G
Sbjct: 146 PPQR--VFRPAGSKSWSPLPCDSDTCKSY-VPFSLANCSSPPDPCSYDYRYKDNSSARGV 202
Query: 225 WATDRITIQEANSNGYFTRYPFL----LGCINNSSGDKSGAS-GIMGLDRSPVSIITRTN 279
D T+ + ++G TR L LGC + G +S G++ L S +S +R
Sbjct: 203 VGLDSATVSLSGNDG--TRKAKLQEVVLGCTTSYDGQSFKSSDGVLSLGNSNISFASRAA 260
Query: 280 TSY---FSYCLP---SPYGSTGYITFG 300
+ + FSYCL +P +T ++TFG
Sbjct: 261 SRFGGRFSYCLVDHLAPRNATSFLTFG 287
>gi|414869114|tpg|DAA47671.1| TPA: hypothetical protein ZEAMMB73_872184 [Zea mays]
Length = 492
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 54/184 (29%), Positives = 87/184 (47%), Gaps = 20/184 (10%)
Query: 126 DTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKI 185
D A +Y + V G P+Q + LDT V+ CKPC DP F S+S TF +
Sbjct: 143 DAGALDYTVNVGYGTPEQQFPMFLDTIFGVSLVLCKPCAPGSTSCDPAFDTSQSTTFTHV 202
Query: 186 PCNSTSCRILRESFPFGNCNSKE-CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRY 244
PC+S C NC++ CPFN+ + +G+ ++ D +T+ + + FT
Sbjct: 203 PCDSPDCPST------ANCSAGSVCPFNLFFVEGT-----FSQDVLTVAPSVAVQDFT-- 249
Query: 245 PFLLGCINNSSGDKSGASGIMGLDRSPVSIITR---TNTSYFSYCLPSPYGSTGYITFGK 301
C++ + D G + L R S+ +R + ++ FSYC+P S G+++ G
Sbjct: 250 ---FVCLDAGASDGMPEVGTLDLSRDRNSLPSRLAGSASAAFSYCMPQYPDSPGFLSLGD 306
Query: 302 TDTV 305
TV
Sbjct: 307 DATV 310
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/120 (32%), Positives = 55/120 (45%), Gaps = 9/120 (7%)
Query: 303 DTVNTLRSAFHKRMKKYKKA-KGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVR 361
D LR AF + M +Y ++ G D DTCY+ + + + VP + F G L +D
Sbjct: 374 DAYTPLRDAFRQAMAQYNRSVPGFYDF-DTCYNFTGLQELTVPLVEFKFGNGDSLLIDGD 432
Query: 362 GTLVVASVSQ-----VCLGFATYPPDPN--SITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L S+ CL F+T D + S +G EV YDVAG +GF P +C
Sbjct: 433 QMLYYDIPSEGPFTVTCLAFSTLDVDDDDVSAVIGAYSLATTEVVYDVAGGTVGFIPESC 492
>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 61/193 (31%), Positives = 91/193 (47%), Gaps = 25/193 (12%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y I V +G P + + +DTGS +W C+ C C F S+S T K+ C ++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFL-QSRSTTCAKVSCGTSM 58
Query: 192 CRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C +L S P +C E CPF + Y DGS S G D +T + FT
Sbjct: 59 C-LLGGSDP--HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----- 110
Query: 248 LGCINNSSG--DKSGASGIMGLDRSPVSIITRTNTSY--FSYCLP---SPYG----STGY 296
GC +S G + G++G+ P+S++ +++ + FSYCLP S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 297 ITFGKTDTVNTLR 309
+ GK T +R
Sbjct: 171 FSLGKVATRTDVR 183
>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 61/193 (31%), Positives = 91/193 (47%), Gaps = 25/193 (12%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y I V +G P + + +DTGS +W C+ C C F S+S T K+ C ++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFL-QSRSTTCAKVSCGTSM 58
Query: 192 CRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C +L S P +C E CPF + Y DGS S G D +T + FT
Sbjct: 59 C-LLGGSDP--HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----- 110
Query: 248 LGCINNSSG--DKSGASGIMGLDRSPVSIITRTNTSY--FSYCLP---SPYG----STGY 296
GC +S G + G++G+ P+S++ +++ + FSYCLP S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 297 ITFGKTDTVNTLR 309
+ GK T +R
Sbjct: 171 FSLGKVATRTDVR 183
>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 402
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 58/110 (52%), Gaps = 3/110 (2%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGT--LV 365
L++ F K+ + A +LDTC++LSAY+ V +P I +HF G +L +DV G V
Sbjct: 292 LKAEFLKQFTGFPPAPAF-SILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFV 350
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ SQVCL A+ LGN QQ+ V YD ++GF CS
Sbjct: 351 KSDASQVCLALASLEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETCS 400
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 48/93 (51%), Gaps = 11/93 (11%)
Query: 202 GNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKS 259
G C S C + I Y DGS + G +++ G F+ GC N+ G
Sbjct: 124 GVCGSAAPICNYAINYGDGSFTRGELGHEKLKF------GTILVKDFIFGCGRNNKGLFG 177
Query: 260 GASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
G SG+MGL RS +S+I++T+ + FSYCLPS
Sbjct: 178 GVSGLMGLGRSDLSLISQTSGIFGGVFSYCLPS 210
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 57/191 (29%), Positives = 85/191 (44%), Gaps = 23/191 (12%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRD---PF-FYASK-SKTFFKIP 186
Y+ + +G P + + +DTGSD+ W C PC C + D P Y SK S T +
Sbjct: 77 YFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKASSTSKNVG 136
Query: 187 CNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPF 246
C C + +S G K C +++ Y DGS S G + D IT+ + N P
Sbjct: 137 CEDAFCSFIMQSETCG--AKKPCSYHVVYGDGSTSDGDFVKDNITLDQVTGN--LRTAPL 192
Query: 247 ----LLGCINNSSGD----KSGASGIMGLDRSPVSIITRTNTS-----YFSYCLPSPYGS 293
+ GC N SG +S GIMG +S S+I++ FS+CL + G
Sbjct: 193 AQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDNMNGG 252
Query: 294 TGYITFGKTDT 304
G G+ ++
Sbjct: 253 -GIFAIGEVES 262
>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 62/207 (29%), Positives = 98/207 (47%), Gaps = 27/207 (13%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y V +G P + + +DTGS ++W C+ C C F S+S T K+ C ++
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSISWVFCE-CDGCHTNPRTFL-QSRSTTCAKVSCGTSM 58
Query: 192 CRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C +L S P +C E CPF + Y DGS S G D +T + FT
Sbjct: 59 C-LLGGSDP--HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----- 110
Query: 248 LGCINNSSG--DKSGASGIMGLDRSPVSIITRTNTSY--FSYCLP---SPYG----STGY 296
GC +S G + G++G+ P+S++ +++ ++ FSYCLP S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAK 323
+ GK T +R + K + + K +
Sbjct: 171 FSLGKVATRTDVR--YTKMVARRKNTE 195
>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 421
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 96/201 (47%), Gaps = 27/201 (13%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
YY+ ++IG P + L +DTGSD+TW QC PC+ C + P + +K+K +PC
Sbjct: 58 YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCNKVPHPLYRPTKNKI---VPCVDQ 114
Query: 191 SCRILRESFPFGN-CNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C L + C+S ++C + I+YAD S G TD ++ ANS+ R
Sbjct: 115 LCSSLHGGLSGKHKCDSPKQQCDYEIKYADQGSSLGVLLTDSFAVRLANSS--IVRPSLA 172
Query: 248 LGCINN----SSGDKSGASGIMGLDRSPVSIITR-----TNTSYFSYCLPSPYGSTGYIT 298
GC + SS + + G++GL +S++++ + +CL G G++
Sbjct: 173 FGCGYDQQVGSSTEVAPTDGVLGLGSGSISLLSQLKQHGITKNVVGHCLSIRGG--GFLF 230
Query: 299 FG-------KTDTVNTLRSAF 312
FG + V +RSAF
Sbjct: 231 FGDNLVPYSRATWVPMVRSAF 251
>gi|383156234|gb|AFG60356.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156236|gb|AFG60358.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
gi|383156239|gb|AFG60361.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
Length = 154
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 63/133 (47%), Gaps = 9/133 (6%)
Query: 68 YGPCSRLNQGISTHAPSL-EEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANIND 126
+G CS L S+ L + L +D RL SR T P +
Sbjct: 12 HGACSPLRPANSSKWIDLVSQSLERDNDRLKTIRSRNSGS-------YTTMSNLPLQSGN 64
Query: 127 TVADEYYIVVA-IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKI 185
V YIV A G P + L++DTGSD+TW QCKPC+ C+ Q DP F S+S ++ +
Sbjct: 65 KVGTGNYIVTAGFGTPTKKFLLIIDTGSDLTWIQCKPCLGCYSQVDPIFEPSQSSSYKSL 124
Query: 186 PCNSTSCRILRES 198
PC S +C L S
Sbjct: 125 PCLSATCTELLTS 137
>gi|356518800|ref|XP_003528065.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 438
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 67/234 (28%), Positives = 98/234 (41%), Gaps = 21/234 (8%)
Query: 109 PEFLKRTEA---FTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCK-PCI 164
P L R A FP + N Y + + IG+P + L +DTGSD+TW QC PC
Sbjct: 51 PYILNRFRAGSSVVFPVHGNVYPVGFYNVTLNIGQPPRPYFLDIDTGSDLTWLQCDAPCS 110
Query: 165 HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGF 224
C Q P + S +PC + C L S + +C + +QYAD S G
Sbjct: 111 RCSQTPHPLYRPSND----FVPCRHSLCASLHHSDNYDCEVPHQCDYEVQYADHYSSLGV 166
Query: 225 WATDRITIQEANSNGYFTRYPFLLGCINNS---SGDKSGASGIMGLDRSPVSIITRTNT- 280
D T+ +NG + LGC + G++GL R S+ ++ N+
Sbjct: 167 LLHDVYTLNF--TNGVQLKVRMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQ 224
Query: 281 ----SYFSYCLPSPYGSTGYITFGKT-DTVNTLRSAFHKRMKKYKKAKGLEDLL 329
+ +CL + G GYI FG D+ + R K+ A G +LL
Sbjct: 225 GLVRNVIGHCLSAQGG--GYIFFGDVYDSSRLTWTPMSSRDYKHYSAAGAAELL 276
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 65/116 (56%), Gaps = 8/116 (6%)
Query: 118 FTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYAS 177
F P I D A + ++IG P V ++LDTGSD+ W QC+PC C++Q+DP + +
Sbjct: 81 FVPPPLIRDKSA--FLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRT 138
Query: 178 KSKTFFKIPCNSTSCRIL-RESFPFGNC-NSKECPFNIQYADGSGSGGFWATDRIT 231
KS ++ ++ CN C L RE G C +S C + YADG+ + G + +++
Sbjct: 139 KSDSYTEMLCNEPPCVSLGRE----GQCSDSGSCLYQTAYADGARTSGLLSYEKVA 190
>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
Length = 456
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 87/349 (24%), Positives = 132/349 (37%), Gaps = 78/349 (22%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCK---PCIHCFQQRDPFFYASKSKTFFKIPC 187
EY+ V +G P ++LDTGSDV W + P + +Q A + C
Sbjct: 121 EYFAQVGVGTPATTALMVLDTGSDVVWAPVRALPPLLRAVRQGSSTGAAPAPTPRWN--C 178
Query: 188 NSTSCRILRESFPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
+ CR L + C+ + C + + Y DGS + G +A++ +T
Sbjct: 179 VAPICRRLDSA----GCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGARVQRVA--- 231
Query: 246 FLLGCINNSSGDKSGASGIMGLDRSPVSI---ITRTNTSYFSYCL-----------PSPY 291
+GC +++ G ASG++GL R +S I R+ FSYCL +
Sbjct: 232 --IGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSRRARPSRRW 289
Query: 292 GST--------------------------------------GYITFGKTDTVNTLRSAFH 313
G T G I T R +
Sbjct: 290 GGTPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYE 349
Query: 314 KRMKKYKKAK-GLE------DLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV- 365
++ A GL L DTCY+LS V VP +++H GG + L L+
Sbjct: 350 AVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIP 409
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V + C FA D +GN+QQ+G V +D +R+GF P +C
Sbjct: 410 VDTSGTFC--FAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 456
>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
Length = 480
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 58/190 (30%), Positives = 89/190 (46%), Gaps = 17/190 (8%)
Query: 127 TVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIH-CFQQRDPFFYASKSKTFFKI 185
T +Y++ +G P Q L+ DTGSD+TW +C F A+ S+++ I
Sbjct: 107 TGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCSGAGDGTGDAPRRVFRAAASRSWAPI 166
Query: 186 PCNSTSCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANS---NGY 240
C+S +C F NC+S C ++ +Y DGS + G TD TI + S +G
Sbjct: 167 ACSSDTCTSY-VPFSLANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALSGSESRDGG 225
Query: 241 FTRYPF---LLGCINNSSGDK-SGASGIMGLDRSPVSIITRTNTSY---FSYCLP---SP 290
R +LGC + G + G++ L S +S +R + FSYCL +P
Sbjct: 226 GRRAKLQGVVLGCTASYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAP 285
Query: 291 YGSTGYITFG 300
+T Y+TFG
Sbjct: 286 RNATSYLTFG 295
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 47/160 (29%), Positives = 78/160 (48%), Gaps = 14/160 (8%)
Query: 136 VAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
+ +G P +++DTGS +TW QC PC + C +Q P F S T+ + C++ C
Sbjct: 1 MGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSD 60
Query: 195 LRESF--PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLGCI 251
L + P +S C + Y D S S G+ + D ++ T P F GC
Sbjct: 61 LPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGS-------TSLPNFYYGCG 113
Query: 252 NNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLP 288
++ G ++G++GL R+ +S++ + S F+YCLP
Sbjct: 114 QDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLP 153
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 59/129 (45%), Gaps = 5/129 (3%)
Query: 287 LPSPYGSTGYITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKI 346
LP+ S IT T + L A MK +A +LDTC+ A V P +
Sbjct: 210 LPTIIDSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYS-ILDTCFKGQA-SRVSAPAV 267
Query: 347 AIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRR 406
+ F GG L+L + LV S CL FA P ++ +GN QQ+ V YDV R
Sbjct: 268 TMSFAGGAALKLSAQNLLVDVDDSTTCLAFA---PARSAAIIGNTQQQTFSVVYDVKSSR 324
Query: 407 LGFGPGNCS 415
+GF G CS
Sbjct: 325 IGFAAGGCS 333
>gi|367066697|gb|AEX12632.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066699|gb|AEX12633.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066701|gb|AEX12634.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066703|gb|AEX12635.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066705|gb|AEX12636.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066707|gb|AEX12637.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066709|gb|AEX12638.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066711|gb|AEX12639.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066713|gb|AEX12640.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066715|gb|AEX12641.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066717|gb|AEX12642.1| hypothetical protein 2_5918_01 [Pinus taeda]
Length = 137
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 55/102 (53%), Gaps = 4/102 (3%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
E+ + +AIG+P S +LDTGSD+TWTQC PC C++Q P + S S T+ + C S+
Sbjct: 20 EFLMQLAIGKPSLAYSAILDTGSDLTWTQCMPCSDCYKQPTPIYDPSLSSTYGTVSCKSS 79
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITI 232
C L P C S C + Y D S + G + + T+
Sbjct: 80 LCLAL----PASACISATCEYLYTYGDYSSTQGILSYETFTL 117
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/175 (30%), Positives = 84/175 (48%), Gaps = 23/175 (13%)
Query: 112 LKRTEAFTFPAN----INDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC 166
L+R+E+ P +D + + YY + IG P Q +L++DTGS VT+ C C HC
Sbjct: 64 LQRSESKRHPNARMRLYDDLLINGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCEHC 123
Query: 167 FQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNC--NSKECPFNIQYADGSGSGGF 224
+ +DP F S+T+ + C P NC ++ +C ++ QYA+ S S G
Sbjct: 124 GRHQDPKFQPDLSETYQPVKCT-----------PDCNCDGDTNQCMYDRQYAEMSSSSGV 172
Query: 225 WATDRITIQEANSNGYFTRYPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITR 277
D ++ + + GC N+ +GD A GIMGL R +SI+ +
Sbjct: 173 LGEDVVSFGNLSE---LAPQRAVFGCENDETGDLYSQRADGIMGLGRGDLSIMDQ 224
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 92/187 (49%), Gaps = 23/187 (12%)
Query: 120 FPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKS 179
F N++ TV+ + +G P Q V+++LDTGS+++W CK Q + F S
Sbjct: 63 FHHNVSLTVS------LTVGSPPQNVTMVLDTGSELSWLHCKKT----QFLNSVFNPLSS 112
Query: 180 KTFFKIPCNSTSC--RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANS 237
KT+ K+PC S +C R + P +K C + YAD + G A + +
Sbjct: 113 KTYSKVPCLSPTCKTRTRDLTIPVSCDATKLCHVIVSYADATSIEGNLAFETFRL----- 167
Query: 238 NGYFTRYPFLLGCIN----NSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGS 293
G T+ + GC++ ++S + S +G++G++R +S + + FSYC+ S + S
Sbjct: 168 -GSLTKPATIFGCMDSGFSSNSEEDSKTTGLIGMNRGSLSFVNQMGYPKFSYCI-SGFDS 225
Query: 294 TGYITFG 300
G + G
Sbjct: 226 AGVLLLG 232
>gi|115475303|ref|NP_001061248.1| Os08g0207800 [Oryza sativa Japonica Group]
gi|45735815|dbj|BAD12851.1| unknown protein [Oryza sativa Japonica Group]
gi|113623217|dbj|BAF23162.1| Os08g0207800 [Oryza sativa Japonica Group]
gi|125602549|gb|EAZ41874.1| hypothetical protein OsJ_26419 [Oryza sativa Japonica Group]
Length = 449
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 59/225 (26%), Positives = 91/225 (40%), Gaps = 23/225 (10%)
Query: 116 EAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFY 175
E F P I + V Y + IGE +Q LL+DTGS + WTQC C HC P +
Sbjct: 68 EKFVTPFRIYEDVV--YLAEMEIGERQQKQYLLIDTGSSLVWTQCDECPHCHIGDVPPYG 125
Query: 176 ASKSKTFFKIPCNSTSCRILRESF-------PFGN---CNSKECPFNIQY---ADGSGSG 222
S+S+TF ++ C E+ P G C + C F Y G
Sbjct: 126 RSQSRTFQEVSCGDDDDNDKEEAIASYCPAKPPGYITLCVNGRCMFKALYNLTGQGETVQ 185
Query: 223 GFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGA---SGIMGLDRSPVSIITRTN 279
G+ + D + Y ++ + GC + + + +GI+GL S + +T
Sbjct: 186 GYMSMDTFHFIDDRRFDYQAKFRMVFGCAHQENIVLTAVKECTGILGLGMGDASFLRQTG 245
Query: 280 TSYFSYCLPS-----PYGSTGYITFGKTDTVNTLRSAFHKRMKKY 319
+ FSYC+P Y ++ FG ++ + R KY
Sbjct: 246 ITKFSYCVPPRMPGYSYRRHSWLRFGSHAQISGKKVPLVMRWGKY 290
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 55/210 (26%), Positives = 87/210 (41%), Gaps = 32/210 (15%)
Query: 118 FTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC-----FQQRDP 172
F + N + Y+ V +G P + + +DTGSD+ W C PC C +
Sbjct: 75 FPVEGSANPYMVGLYFTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLE 134
Query: 173 FFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKE-----CPFNIQYADGSGSGGFWAT 227
FF S T +IPC+ C ++ C S + C + Y DGSG+ GF+ +
Sbjct: 135 FFNPDSSSTSSRIPCSDDRCTAALQTGE-AVCQSSDSPSSPCGYTFTYGDGSGTSGFYVS 193
Query: 228 DRITI-------QEANSNGYFTRYPFLLGCINNSSGD----KSGASGIMGLDRSPVSIIT 276
D + Q ANS+ + GC N+ SGD GI G + +S+++
Sbjct: 194 DTMYFDTVMGNEQTANSSA-----SVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVS 248
Query: 277 R-----TNTSYFSYCLPSPYGSTGYITFGK 301
+ + FS+CL G + G+
Sbjct: 249 QLYSLGVSPKTFSHCLKGSDNGGGILVLGE 278
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/192 (26%), Positives = 82/192 (42%), Gaps = 17/192 (8%)
Query: 118 FTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRD-----P 172
F N TV Y+ + +G P + + +DTGSD+ W C C C ++ D
Sbjct: 55 FNLGGNGLPTVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLT 114
Query: 173 FFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITI 232
+ +SKT + C C E G CP++I Y DGS + G++ D +T
Sbjct: 115 LYDPKRSKTSEFVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTF 174
Query: 233 QEANSNGYFT--RYPFLLGCINNSSGDKSGAS-----GIMGLDRSPVSIITRTNTS---- 281
N N + + GC SG + +S GI+G ++ S++++ S
Sbjct: 175 NRVNGNPHTATQNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVK 234
Query: 282 -YFSYCLPSPYG 292
FS+CL + G
Sbjct: 235 KIFSHCLDTNVG 246
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 90/406 (22%), Positives = 154/406 (37%), Gaps = 85/406 (20%)
Query: 85 LEEILRQDQQRLHLKNSRRLRKPF--PEFLKRTEA--------FTFPANINDTVADEYYI 134
+L D R+ +R + P P L+R + + P +V Y+
Sbjct: 63 FSAVLTHDHARIASLAARLAKTPSSRPTKLRRGSSSSPDAESLASVPLGPGTSVGVGNYV 122
Query: 135 V-VAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNSTSC 192
+ +G P + +++DTGS +TW QC PC + C +Q P F S ++ + C++ C
Sbjct: 123 TRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPRSSSSYASVSCSAPQC 182
Query: 193 RILRESF--PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLG 249
L + P S C + Y D S S G+ + D ++ T P F G
Sbjct: 183 DALTTATLNPSTCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGS-------TSVPNFYYG 235
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPY-------------GS 293
C ++ G ++G++GL R+ +S++ + S FSYCLP+ G
Sbjct: 236 CGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSGYLSIGSYNPGQ 295
Query: 294 TGYITFGKTDTVNTLRSAFHKRMKKYKKA-----------KGLEDLLDTCYDLSAYETVV 342
Y K+ ++L + +M A L ++D+ ++ T V
Sbjct: 296 YSYTPMAKSSLDDSL---YFIKMTGITVAGKPLSVSASAYSSLPTIIDSGTVITRLPTDV 352
Query: 343 VPKIAIHFLGGVD----------LELDVRGTLVVASVSQVCLGFA--------------- 377
++ G + L+ +G V QV + FA
Sbjct: 353 YSALSKAVAGAMKGTPRASAFSILDTCFQGQASRLRVPQVSMAFAGGAALKLKATNLLVD 412
Query: 378 --------TYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ P ++ +GN QQ+ V YDV ++GF G CS
Sbjct: 413 VDSATTCLAFAPARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 458
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 82/362 (22%), Positives = 140/362 (38%), Gaps = 82/362 (22%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFFKIP 186
YY V +G P + ++ +DTGSDV W C C C Q + FF S + +
Sbjct: 84 YYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVS 143
Query: 187 CNSTSC--RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSN--GYFT 242
C+ C ES G + C ++ +Y DGSG+ GF+ +D ++ ++ +
Sbjct: 144 CSDRRCYSNFQTES---GCSPNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAINS 200
Query: 243 RYPFLLGCINNSSGD----KSGASGIMGLDRSPVSIITRTNTS-----YFSYCLPSPYGS 293
PF+ GC N +GD + GI GL + +S+I++ FS+CL
Sbjct: 201 SAPFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSG 260
Query: 294 TGYITFGKTDTVNTLRSAFHKRMKKYKK---------------------AKGLEDLLDT- 331
G + G+ +T+ + Y A G ++DT
Sbjct: 261 GGIMVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDTG 320
Query: 332 -----------------------------------CYDLSAYETVVVPKIAIHFLGGVDL 356
C++++A + V P++++ F GG +
Sbjct: 321 TTLAYLPDEAYSPFIQAIANAVSQYGRPITYESYQCFEITAGDVDVFPEVSLSFAGGASM 380
Query: 357 ELDVRGTLVVASVSQV---CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
L L + S S C+GF +I LG++ + V YD+ +R+G+ +
Sbjct: 381 VLRPHAYLQIFSSSGSSIWCIGFQRMSHRRITI-LGDLVLKDKVVVYDLVRQRIGWAEYD 439
Query: 414 CS 415
CS
Sbjct: 440 CS 441
>gi|194690050|gb|ACF79109.1| unknown [Zea mays]
Length = 166
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 63/111 (56%), Gaps = 3/111 (2%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL- 364
N +R+ F ++ +Y +A G +LDTC++++ + V VP + + F GG ++E+D G L
Sbjct: 54 NAVRAEFMSQLAEYPQAPGF-SILDTCFNMTGLKEVQVPSLTLVFDGGAEVEVDSGGVLY 112
Query: 365 -VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V + SQVCL A+ + + +GN QQ+ V +D + ++GF C
Sbjct: 113 FVSSDSSQVCLAVASLKSEDETSIIGNYQQKNLRVVFDTSASQVGFAQETC 163
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 71/249 (28%), Positives = 105/249 (42%), Gaps = 30/249 (12%)
Query: 51 TALPQGPDKASLEVVSKYGPCSRLNQGISTH--APSLEEILRQ---DQQRLHLKNSRRLR 105
TA P G D L ++ CS TH A ++ +L D RL +S
Sbjct: 30 TAAPDGSDD--LSIIPINAKCSPF---APTHVSASVIDTVLHMASSDSHRLTYLSSLVAG 84
Query: 106 KPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIH 165
KP P + A+ N Y + +G P Q + ++LDT +D W C C
Sbjct: 85 KPKPTSVPV-------ASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSG 137
Query: 166 CFQQRDPFFYASKSKTFFKIPCNSTSCRILRE-SFPFGNCNSKECPFNIQYADGSGSGGF 224
C F + S T+ + C++ C R + P + C FN Y S
Sbjct: 138 C-SNASTSFNTNSSSTYSTVSCSTAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSAS 196
Query: 225 WATDRITIQ-EANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY- 282
D +T+ + N F GCIN++SG+ G+MGL R P+S++++T + Y
Sbjct: 197 LVQDTLTLAPDVIPN-------FSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYS 249
Query: 283 --FSYCLPS 289
FSYCLPS
Sbjct: 250 GVFSYCLPS 258
Score = 47.0 bits (110), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 51/109 (46%), Gaps = 8/109 (7%)
Query: 311 AFHKRMKKYKKAKGLEDL--LDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVAS 368
A +K L DTC+ SA V PKI +H + +DL+L + TL+ +S
Sbjct: 343 AIRDEFRKQVNVSSFSTLGAFDTCF--SADNENVAPKITLH-MTSLDLKLPMENTLIHSS 399
Query: 369 VSQV-CLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ CL A + N++ + N+QQ+ + +DV R+G P C
Sbjct: 400 AGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPC 448
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 61/186 (32%), Positives = 90/186 (48%), Gaps = 13/186 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + V +G P + +++DTGSD+ W QC PC+ CF+Q P F + S ++ + C
Sbjct: 148 EYLVDVYLGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQSGPIFDPAASISYRNVTCGDD 207
Query: 191 SCRILR---ESFPFGNC---NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRY 244
CR++ ES P C S CP+ Y D S + G A + T+ S G
Sbjct: 208 RCRLVSPPAESAP-RECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQS-GTRRVD 265
Query: 245 PFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY----FSYCLPSPYGSTG-YITF 299
GC + + G GA+G++GL R P+S ++ Y FSYCL + G I F
Sbjct: 266 GVAFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRGVYGGHAFSYCLVEHGSAAGSKIIF 325
Query: 300 GKTDTV 305
G D +
Sbjct: 326 GHDDAL 331
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 51/112 (45%), Gaps = 4/112 (3%)
Query: 306 NTLRSAFHKRMK-KYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL 364
+R AF RM Y G +L CY++S E V VP++++ F G E
Sbjct: 395 QAIRQAFIDRMSPSYPLILGFP-VLSPCYNVSGAEKVEVPELSLVFADGAAWEFPAENYF 453
Query: 365 V-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ + +CL P SI +GN QQ+ V YD+ RLGF P C+
Sbjct: 454 IRLEPEGIMCLAVLGTPRSGMSI-IGNYQQQNFHVLYDLEHNRLGFAPRRCA 504
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 60/223 (26%), Positives = 100/223 (44%), Gaps = 38/223 (17%)
Query: 84 SLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVA-IGEPK 142
S+ + R D RL +S K A A + A Y+V A +G P
Sbjct: 41 SIIALARDDDARLLFLSS-----------KAATAGVSSAPVASGQAPPSYVVRAGLGSPS 89
Query: 143 QYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRES---F 199
Q + L LDT +D TW C PC C F + S ++ +PC+S+ C + +
Sbjct: 90 QQLLLALDTSADATWAHCSPCGTC--PSSSLFAPANSSSYASLPCSSSWCPLFQGQACPA 147
Query: 200 PFGNCNSK-------ECPFNIQYADGSGSGGFWATDRITI-QEANSNGYFTRYPFLLGCI 251
P G ++ C F+ +AD S A+D + + ++A N + GC+
Sbjct: 148 PQGGGDAAPPPATLPTCAFSKPFADASFQAAL-ASDTLRLGKDAIPN-------YTFGCV 199
Query: 252 NNSSGDKSGA--SGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
++ +G + G++GL R P++++++ + Y FSYCLPS
Sbjct: 200 SSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPS 242
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 57/121 (47%), Gaps = 4/121 (3%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
IT LR F +++ L DTC++ P + +H GGVDL
Sbjct: 318 ITRWTAPVYAALREEFRRQVAAPSGYTSL-GAFDTCFNTDEVAAGGAPAVTVHMDGGVDL 376
Query: 357 ELDVRGTLVVASVSQV-CLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGRRLGFGPGN 413
L + TL+ +S + + CL A P + NS+ + N+QQ+ V +DVA R+GF +
Sbjct: 377 ALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSRVGFAKES 436
Query: 414 C 414
C
Sbjct: 437 C 437
>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 47/160 (29%), Positives = 71/160 (44%), Gaps = 12/160 (7%)
Query: 143 QYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPC-NSTSCRILRESFPF 201
Q L LD G ++W QC PC HC Q P F +KS TF IP N+ CR P+
Sbjct: 109 QNYQLALDMGGGLSWMQCLPCRHCLLQMSPVFDPTKSPTFSNIPAHNTVWCRP-----PY 163
Query: 202 GNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSS--GDKS 259
+ C F+I Y D + + G+ A D + N + + + GC + + ++
Sbjct: 164 QPLANGACGFDIAYRDNTHASGYLARDTFSFPAGNDD-FVPLSAIVFGCAHQTEHFKNQR 222
Query: 260 GASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITF 299
+GI+GL P + T++ LP+ G Y F
Sbjct: 223 AVAGILGLGMGPAG---KPPTAFTKQVLPAHGGRFSYCPF 259
>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
gi|219888491|gb|ACL54620.1| unknown [Zea mays]
Length = 557
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 54/184 (29%), Positives = 86/184 (46%), Gaps = 24/184 (13%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCK-PCIHCFQQRDPFFYASKSKTFFKIPCNS 189
+YY + IG P + L +DTGSD+TW QC PC + + P + +K K +P
Sbjct: 186 QYYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNFAKGPHPLYKPAKEKI---VPPRD 242
Query: 190 TSCRILRESFPFGNCN----SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C+ L+ GN N K+C + I+YAD S S G A D + + +NG +
Sbjct: 243 LLCQELQ-----GNQNYCETCKQCDYEIEYADQSSSMGVLARDDMHM--IATNGGREKLD 295
Query: 246 FLLGCINNSSGD----KSGASGIMGLDRSPVSIITRTNT-----SYFSYCLPSPYGSTGY 296
F+ GC + G + GI+GL + +S ++ + + F +C+ G GY
Sbjct: 296 FVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQGGGGY 355
Query: 297 ITFG 300
+ G
Sbjct: 356 MFLG 359
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 70/239 (29%), Positives = 104/239 (43%), Gaps = 30/239 (12%)
Query: 61 SLEVVSKYGPCSRLNQGISTHAPS----LEEILRQDQQRLHLKNSRRLRKPFPEFLKRTE 116
+L+V +GPCS L G T APS L + +D RL +S R +
Sbjct: 43 TLQVSHAFGPCSPLGPG--TTAPSWAGFLADQASRDASRLLYLDSLAARG-------KAR 93
Query: 117 AFTFPANINDTVADEYYIVVA-IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFY 175
A+ A+ + Y+V A +G P Q + L +DT +D W C C C P F
Sbjct: 94 AYAPIASGRQLLQTPTYVVRARLGTPPQQLLLAVDTSNDAAWIPCAGCAGCPTSSAPPFD 153
Query: 176 ASKSKTFFKIPCNSTSCRILRESFPFGNC--NSKECPFNIQYADGSGSGGFWATDRITIQ 233
+ S ++ +PC S C P C K C F++ YAD S + D + +
Sbjct: 154 PAASTSYRSVPCGSPLC----AQAPNAACPPGGKACGFSLTYADSSLQAAL-SQDSLAVA 208
Query: 234 EANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
Y GC+ ++G + G++GL R P+S +++T Y FSYCLPS
Sbjct: 209 GDAVKTY------TFGCLQKATGTAAPPQGLLGLGRGPLSFLSQTRDMYQGTFSYCLPS 261
>gi|361067983|gb|AEW08303.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
Length = 154
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 64/130 (49%), Gaps = 9/130 (6%)
Query: 68 YGPCSRLNQGISTHAPSL-EEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANIND 126
+G CS L S+ L + L +D RL SR P+ T P +
Sbjct: 12 HGACSPLRPINSSSWIDLVSQSLERDDDRLKTIRSRN-SSPY------TTMSNLPLQSGN 64
Query: 127 TVADEYYIVVA-IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKI 185
V YIV A +G P + L++DTGSD+TW QCKPC+ C+ Q DP F +S ++ +
Sbjct: 65 KVGTGNYIVTAGLGTPTKNFLLIIDTGSDLTWIQCKPCLDCYSQVDPIFEPRQSSSYKSL 124
Query: 186 PCNSTSCRIL 195
PC S +C L
Sbjct: 125 PCLSATCTEL 134
>gi|357124567|ref|XP_003563970.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 395
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 55/184 (29%), Positives = 81/184 (44%), Gaps = 24/184 (13%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCK-PCIHCFQQRDPFFYASKSKTFFKIPCNS 189
+YY + IG P + L +DTGSD TW C PC +C + P + ++ K
Sbjct: 15 QYYTSINIGNPPRPYFLDIDTGSDFTWIHCDAPCTNCTKGPHPVYKPTEGKIVHP---RD 71
Query: 190 TSCRILRESFPFGNCN----SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C L+ GN N K+C + I YAD S S G A D + + A +G
Sbjct: 72 PLCEELQ-----GNQNYCETCKQCDYEITYADRSSSKGVLARDNMQLTTA--DGEMKNVD 124
Query: 246 FLLGCINNSSG----DKSGASGIMGLDRSPVSIITRTNTS-----YFSYCLPSPYGSTGY 296
F+ GC +N G + GI+GL +S+ T+ S F +C+ + S GY
Sbjct: 125 FVFGCAHNQQGKLLDSPTSTDGILGLSNGAISLSTQLANSGIISNVFGHCMATDPSSGGY 184
Query: 297 ITFG 300
+ G
Sbjct: 185 MFLG 188
>gi|367066719|gb|AEX12643.1| hypothetical protein 2_5918_01 [Pinus radiata]
Length = 137
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 38/102 (37%), Positives = 55/102 (53%), Gaps = 4/102 (3%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
E+ + +AIG+P S +LDTGSD+TWTQC PC C++Q P + S S T+ + C S+
Sbjct: 20 EFLMQLAIGKPSLAYSAILDTGSDLTWTQCIPCSDCYKQPTPIYDPSLSSTYGTVSCKSS 79
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITI 232
C L P C S C + Y D S + G + + T+
Sbjct: 80 LCLAL----PASACISATCEYLYTYGDYSSTQGILSYETFTL 117
>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 62/207 (29%), Positives = 97/207 (46%), Gaps = 27/207 (13%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y V +G P + + +DTGS +W C+ C C F S+S T K+ C ++
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFL-QSRSTTCAKVSCGTSM 58
Query: 192 CRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C +L S P +C E CPF + Y DGS S G D +T + FT
Sbjct: 59 C-LLGGSDP--HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----- 110
Query: 248 LGCINNSSG--DKSGASGIMGLDRSPVSIITRTNTSY--FSYCLP---SPYG----STGY 296
GC +S G + G++G+ P+S++ +++ ++ FSYCLP S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAK 323
+ GK T +R + K + + K +
Sbjct: 171 FSLGKVATRTDVR--YTKMVARRKNTE 195
>gi|326493614|dbj|BAJ85268.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 399
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 57/199 (28%), Positives = 90/199 (45%), Gaps = 17/199 (8%)
Query: 8 FLLFICLLCSSNNGAYADDNDLSHSHIVSVSSLLPPNVCNRTRTALPQGPDKASLEVVSK 67
LL C++C + + A + I+S SSL P VC+ + +P A++ + +
Sbjct: 9 LLLLSCIICHALIASAAGEQSYK---ILSTSSLKPQAVCSEPK-GIPSSSSGATVPLNHR 64
Query: 68 YGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANI--- 124
+GPCS + P+ EE LR+D R + R + ++EA P +
Sbjct: 65 HGPCS---PAPTKKEPTFEERLRRDTLRAGYVQRKFSRNHRGGGVLQSEAVKVPTELGSP 121
Query: 125 NDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFK 184
DT+ EY I V +G P + +DTGSDV+W +C F +KS T+
Sbjct: 122 QDTL--EYVITVGVGSPAVQQIMTIDTGSDVSWLRCNSNSTAGTM---PFDPTKSTTYTA 176
Query: 185 IPCNSTSCRIL--RESFPF 201
PC + +C L R+ FP
Sbjct: 177 FPCGAAACAQLGHRQGFPV 195
>gi|357143660|ref|XP_003573001.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 151
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/110 (39%), Positives = 60/110 (54%), Gaps = 7/110 (6%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYET-VVVPKIAIHFLGGVDLELDVRGTL 364
L SAF MK+Y A+ + +L+TC+D + E V +P +A+ GG ++LD G +
Sbjct: 48 EALSSAFKDGMKQYPPAEP-QSILNTCFDFTGQENNVTIPSVALVLDGGAVVDLDPNGII 106
Query: 365 VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ + CL FA D +S +GNVQQR EV YDV GF PG C
Sbjct: 107 LSS-----CLAFAATDDDRSSGIIGNVQQRTFEVLYDVGQSVFGFRPGVC 151
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 60/223 (26%), Positives = 100/223 (44%), Gaps = 38/223 (17%)
Query: 84 SLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVA-IGEPK 142
S+ + R D RL +S K A A + A Y+V A +G P
Sbjct: 43 SIIALARDDDARLLFLSS-----------KAATAGVSSAPVASGQAPPSYVVRAGLGSPS 91
Query: 143 QYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRES---F 199
Q + L LDT +D TW C PC C F + S ++ +PC+S+ C + +
Sbjct: 92 QQLLLALDTSADATWAHCSPCGTC--PSSSLFAPANSSSYASLPCSSSWCPLFQGQACPA 149
Query: 200 PFGNCNSK-------ECPFNIQYADGSGSGGFWATDRITI-QEANSNGYFTRYPFLLGCI 251
P G ++ C F+ +AD S A+D + + ++A N + GC+
Sbjct: 150 PQGGGDAAPPPATLPTCAFSKPFADASFQAAL-ASDTLRLGKDAIPN-------YTFGCV 201
Query: 252 NNSSGDKSGA--SGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
++ +G + G++GL R P++++++ + Y FSYCLPS
Sbjct: 202 SSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPS 244
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 57/121 (47%), Gaps = 4/121 (3%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
IT LR F +++ L DTC++ P + +H GGVDL
Sbjct: 320 ITRWTAPVYAALREEFRRQVAAPSGYTSL-GAFDTCFNTDEVAAGGAPAVTVHMDGGVDL 378
Query: 357 ELDVRGTLVVASVSQV-CLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGRRLGFGPGN 413
L + TL+ +S + + CL A P + NS+ + N+QQ+ V +DVA R+GF +
Sbjct: 379 ALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSRIGFAKES 438
Query: 414 C 414
C
Sbjct: 439 C 439
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 60/233 (25%), Positives = 94/233 (40%), Gaps = 22/233 (9%)
Query: 86 EEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYV 145
E L Q + R ++ R L+ L F + V YY + +G P +
Sbjct: 40 EMELSQLKARDEARHGRLLQS-----LGGVIDFPVDGTFDPFVVGLYYTKLRLGTPPRDF 94
Query: 146 SLLLDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFFKIPCNSTSCRILRESFP 200
+ +DTGSDV W C C C Q + FF S T I C+ C +S
Sbjct: 95 YVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSD 154
Query: 201 FG-NCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYF--TRYPFLLGCINNSSGD 257
G + + C + QY DGSG+ GF+ +D + + + P + GC + +GD
Sbjct: 155 SGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGD 214
Query: 258 ----KSGASGIMGLDRSPVSIITRTNTS-----YFSYCLPSPYGSTGYITFGK 301
GI G + +S+I++ + FS+CL G G + G+
Sbjct: 215 LVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVLGE 267
>gi|413919745|gb|AFW59677.1| hypothetical protein ZEAMMB73_406599 [Zea mays]
Length = 246
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 55/191 (28%), Positives = 90/191 (47%), Gaps = 19/191 (9%)
Query: 124 INDTVADEYYIV--VAIGEPKQYVSLLLDTGSDVTWTQCKPCI-HCFQQ---RDPFFYAS 177
I DT +++ + + IG P + +DTGS ++W QC+PC HC +Q F S
Sbjct: 32 IEDTNINKFLFLMPINIGTPPVMNLVGIDTGSTLSWVQCRPCEPHCHKQAAKAGQIFDPS 91
Query: 178 KSKTFFKIPCNSTSCRILRES--FPFGNCNSK--ECPFNIQYADGSGSGGFWATDRITIQ 233
+S TF + CNS C +++++ F NC K C +++ Y GG+ T +
Sbjct: 92 RSTTFRRAGCNSRECFVVKDALKLEFANCMEKVNTCLYSMIY-----EGGWAYTASKVVW 146
Query: 234 EANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVS----IITRTNTSYFSYCLPS 289
+ G F+ GC + +G +G + +S + ++ N FSYCLPS
Sbjct: 147 DNLIIGTNISLSFMFGCSLDVEYGNYKEAGTVGFGTTSISFFEQVSSQINYKAFSYCLPS 206
Query: 290 PYGSTGYITFG 300
+TGY+ G
Sbjct: 207 NETTTGYMNLG 217
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 61/205 (29%), Positives = 91/205 (44%), Gaps = 32/205 (15%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPF-----------FYASKS 179
+Y++ +G P Q L+ DTGSD+TW +C+P + F KS
Sbjct: 94 QYFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKS 153
Query: 180 KTFFKIPCNSTSCRILRESFPF--GNCNS--KECPFNIQYADGSGSGGFWATDRITIQ-- 233
KT+ IPC S +C +S PF C + C ++ +Y DGS + G T+ TI
Sbjct: 154 KTWAPIPCASDTC---SKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGTESATIALS 210
Query: 234 -----EANSNGYFTRYPFLLGCINNSSGDKSGAS-GIMGLDRSPVSIITRTNTSY---FS 284
N +LGC + +G AS G++ L S VS + + + FS
Sbjct: 211 SSSSSSKNKVKKAKLQGLVLGCTGSYTGPSFEASDGVLSLGYSNVSFASHAASRFGGRFS 270
Query: 285 YCLP---SPYGSTGYITFGKTDTVN 306
YCL SP +T Y+TFG ++
Sbjct: 271 YCLVDHLSPRNATSYLTFGPNSALS 295
Score = 46.6 bits (109), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 29/110 (26%), Positives = 53/110 (48%), Gaps = 7/110 (6%)
Query: 310 SAFHKRMKKYKKAKGLEDLLDTCYDLSAY----ETVVVPKIAIHFLGGVDLELDVRGTLV 365
+A K++ ++ + D + CY+ ++ E +PK+A+HF G LE + ++
Sbjct: 371 AALGKKLARFPRVA--MDPFEYCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVI 428
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
A+ C+G P P +GN+ Q+ H +D+ RRL F C+
Sbjct: 429 DAAPGVKCIGVQEG-PWPGISVIGNILQQEHLWEFDLKNRRLRFKRSRCT 477
>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 62/207 (29%), Positives = 97/207 (46%), Gaps = 27/207 (13%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y V +G P + + +DTGS +W C+ C C F S+S T K+ C ++
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFL-QSRSTTCAKVSCGTSM 58
Query: 192 CRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C +L S P +C E CPF + Y DGS S G D +T + FT
Sbjct: 59 C-LLGGSDP--HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----- 110
Query: 248 LGCINNSSG--DKSGASGIMGLDRSPVSIITRTNTSY--FSYCLP---SPYG----STGY 296
GC +S G + G++G+ P+S++ +++ ++ FSYCLP S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAK 323
+ GK T +R + K + + K +
Sbjct: 171 FSLGKVATRTDVR--YTKMVARRKNTE 195
>gi|54290725|dbj|BAD62395.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 500
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 69/237 (29%), Positives = 99/237 (41%), Gaps = 41/237 (17%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQC---KPCIHC--FQQRDPFFYASKSKTFFKI 185
+Y +VV G P Q +++ DTG ++ +C +P C DP S+S TF +
Sbjct: 145 DYTVVVGYGTPAQQLAMAFDTGLGISLVRCAACRPGAPCDGLASFDP----SRSSTFAPV 200
Query: 186 PCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
PC S CR S +C PF G A D +T+ + S FT
Sbjct: 201 PCGSPDCRSGCSSGSTPSCPLTSFPFL---------SGAVAQDVLTLTPSASVDDFT--- 248
Query: 246 FLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNT---SYFSYCLP-SPYGSTGYITFGK 301
GC+ SSG+ GA+G++ L R S+ +R FSYCLP S S G++ G+
Sbjct: 249 --FGCVEGSSGEPLGAAGLLDLSRDSRSVASRLAADAGGTFSYCLPLSTTSSHGFLAIGE 306
Query: 302 TDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLEL 358
D H R + L YD A+ V +A LGG D+ +
Sbjct: 307 ADVP-------HNRTARVTAVAPL------VYD-PAFPNHYVIDLAGVSLGGRDIPI 349
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/133 (30%), Positives = 60/133 (45%), Gaps = 17/133 (12%)
Query: 298 TFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAY-ETVVVPKIAIHFLGGVDL 356
T+ K LR AF + M +Y +A + DL DTCY+ + V++P + + F G
Sbjct: 369 TYMKPSMYAPLRDAFRRAMARYPRAPAMGDL-DTCYNFTGVRHEVLIPLVHLTFRGIGGG 427
Query: 357 ELDVRGTLVVASV----------SQVCLGFATYPPD-----PNSITLGNVQQRGHEVHYD 401
L + S CL FA P D P ++ +G + Q EV +D
Sbjct: 428 GGGQVLGLGADQMFYMSEPGNFFSVTCLAFAALPSDGDAEAPLAMVMGTLAQSSMEVVHD 487
Query: 402 VAGRRLGFGPGNC 414
V G ++GF PG+C
Sbjct: 488 VPGGKIGFIPGSC 500
>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 62/207 (29%), Positives = 97/207 (46%), Gaps = 27/207 (13%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y V +G P + + +DTGS +W C+ C C F S+S T K+ C ++
Sbjct: 1 YVTSVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFL-QSRSTTCAKVSCGTSM 58
Query: 192 CRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C +L S P +C E CPF + Y DGS S G D +T + FT
Sbjct: 59 C-LLGGSDP--HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFT----- 110
Query: 248 LGCINNSSG--DKSGASGIMGLDRSPVSIITRTNTSY--FSYCLP---SPYG----STGY 296
GC +S G + G++G+ P+S++ +++ ++ FSYCLP S G +TGY
Sbjct: 111 FGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSERGFFSKTTGY 170
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAK 323
+ GK T +R + K + + K +
Sbjct: 171 FSLGKVATRTDVR--YTKMVARRKNTE 195
>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 431
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 60/224 (26%), Positives = 95/224 (42%), Gaps = 33/224 (14%)
Query: 81 HAPS---LEEIL---RQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYI 134
H PS LE I+ R D RL +S+ + T + Y +
Sbjct: 31 HPPSPSPLESIIALARADDARLLFLSSKA---------ASSGGVTSAPVASGQTPPSYVV 81
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
+G P Q + L LDT +D TW+ C PC C F + S ++ +PC S C +
Sbjct: 82 RAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCASDWCPL 139
Query: 195 LRESFPFGNCNSK----ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
N ++ C F+ +AD S +D + + + GY GC
Sbjct: 140 FEGQPCPANQDASAPLPACAFSKPFADTSFQASL-GSDTLRLGKDAIAGY------AFGC 192
Query: 251 INNSSGDKSG--ASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
+ +G + G++GL R P+S++++T ++Y FSYCLPS
Sbjct: 193 VGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNGVFSYCLPS 236
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 53/121 (43%), Gaps = 4/121 (3%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
IT LR F +++ L DTC++ P + +H GGVDL
Sbjct: 311 ITRWTAPVYAALREEFRRQVAAPSGYTSL-GAFDTCFNTDEVAAGGAPPVTLHMDGGVDL 369
Query: 357 ELDVRGTLVVASVSQV-CLGFATYP--PDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
L + TL+ +S + + CL A P + + N+QQ+ V DVAG R+GF
Sbjct: 370 TLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREP 429
Query: 414 C 414
C
Sbjct: 430 C 430
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 58/231 (25%), Positives = 102/231 (44%), Gaps = 23/231 (9%)
Query: 89 LRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLL 148
L Q + R L+++R L+ F+ F+ + + + Y+ V +G P + ++
Sbjct: 27 LHQLRARDRLRHARLLQG----FVGGVVDFSVQGSSDPYLVGLYFTKVKLGSPPREFNVQ 82
Query: 149 LDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGN 203
+DTGSDV W C C +C + FF +S S T ++ C+ C ++
Sbjct: 83 IDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQVRCSDPICTSAVQTTA-TQ 141
Query: 204 CNSK--ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL--GCINNSSGDKS 259
C+S+ +C + QY DGSG+ G++ +D + L+ GC SGD +
Sbjct: 142 CSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLIDNSSALIVFGCSAYQSGDLT 201
Query: 260 ----GASGIMGLDRSPVSIITRTNT-----SYFSYCLPSPYGSTGYITFGK 301
GI G + +S+I++ +T FS+CL G + G+
Sbjct: 202 KTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCLKGDGSGGGILVLGE 252
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 47/160 (29%), Positives = 80/160 (50%), Gaps = 13/160 (8%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCR 193
I + +G P Q +S+++DTGS+++W C PFF + S ++ I C+S +C
Sbjct: 68 ISITVGTPPQNMSMVIDTGSELSWLHCNTNTTATIPY-PFFNPNISSSYTPISCSSPTCT 126
Query: 194 ILRESFPF-GNCNSKE-CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCI 251
FP +C+S C + YAD S S G A+D + + G + GC+
Sbjct: 127 TRTRDFPIPASCDSNNLCHATLSYADASSSEGNLASDTFGFGSSFNPG------IVFGCM 180
Query: 252 NNS----SGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
N+S S S +G+MG++ +S++++ FSYC+
Sbjct: 181 NSSYSTNSESDSNTTGLMGMNLGSLSLVSQLKIPKFSYCI 220
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/180 (31%), Positives = 79/180 (43%), Gaps = 9/180 (5%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + IG P + DT SD+ W QC PC CF Q P F KS TF + C+S
Sbjct: 89 EYLMRFYIGTPPVERLAIADTASDLIWVQCSPCETCFPQDTPLFEPHKSSTFANLSCDSQ 148
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C + + C + Y DGS + G T+ +I + F + F G
Sbjct: 149 PCT--SSNIYYCPLVGNLCLYTNTYGDGSSTKGVLCTE--SIHFGSQTVTFPKTIFGCGS 204
Query: 251 INNSSGDKSG-ASGIMGLDRSPVSIITRTNTSY---FSYC-LPSPYGSTGYITFGKTDTV 305
N+ S +GI+GL P+S++++ FSYC LP ST + FG T+
Sbjct: 205 NNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQIGHKFSYCLLPFTSTSTIKLKFGNDTTI 264
>gi|376337718|gb|AFB33415.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
gi|376337720|gb|AFB33416.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
Length = 154
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 64/133 (48%), Gaps = 9/133 (6%)
Query: 68 YGPCSRLNQGISTHAPSL-EEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANIND 126
+G CS L S+ L + L +D RL SR P+ T P
Sbjct: 12 HGACSPLRPTNSSKWIDLVSQSLERDNDRLKTIRSRN-SGPY------TTMSNLPLQSGS 64
Query: 127 TVADEYYIVVA-IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKI 185
V YI+ A G P + L++DTGSD+TW QCKPC+ C+ Q DP F S+S ++ +
Sbjct: 65 EVGTGNYILTAGFGTPTKKFLLVIDTGSDLTWIQCKPCLGCYSQVDPIFEPSQSSSYKSL 124
Query: 186 PCNSTSCRILRES 198
PC S +C L S
Sbjct: 125 PCLSATCTELLTS 137
>gi|413948408|gb|AFW81057.1| hypothetical protein ZEAMMB73_038743 [Zea mays]
Length = 469
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 69/254 (27%), Positives = 107/254 (42%), Gaps = 34/254 (13%)
Query: 73 RLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANIND-TVADE 131
RL+ + SL E R D +R S+ + AF P + T +
Sbjct: 45 RLDLVPAAPGASLGERARDDARRHAYIRSQLASRRRRAADVGASAFAMPLSSGAYTGTGQ 104
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPF--FYASKSKTFFKIPCNS 189
Y++ +G P Q L+ DTGSD+TW +C+ P F AS+S+++ + C+S
Sbjct: 105 YFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRASESRSWAPLACSS 164
Query: 190 TSCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITI--------------- 232
+C F NC+S C ++ +Y DGS + G TD TI
Sbjct: 165 DTCTSY-VPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSEDGSGGGG 223
Query: 233 QEANSNGYFTRYPFLLGCINNSSGDKSGAS-GIMGLDRSPVSIITRTNTSY---FSYCLP 288
+ A G +LGC G +S G++ L S +S +R + FSYCL
Sbjct: 224 RRAKLQG------VVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLV 277
Query: 289 ---SPYGSTGYITF 299
+P ++ Y+TF
Sbjct: 278 DHLAPRNASSYLTF 291
>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 457
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 52/179 (29%), Positives = 84/179 (46%), Gaps = 8/179 (4%)
Query: 115 TEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKP--CIHCFQQRDP 172
T + +P + Y + +IG P + D+GS + W QC C +C++Q+ P
Sbjct: 84 TSSMKYPISRMSYTDKAYVMKFSIGSPAVDTYAIPDSGSSLVWLQCGTPYCRNCYRQKIP 143
Query: 173 FFYASKSKTFFKIPCNSTSCRI-LRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRIT 231
F SKS T+ K CN+ CR+ L + + ++ C ++ Y D S + G +TD T
Sbjct: 144 LFNPSKSVTYMKRLCNTAECRVALGDEYWRCKKPNQICKYHEDYLDDSYTEGVISTDIFT 203
Query: 232 IQEANSNGYFTRYPF--LLGC-INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
E S F Y + GC NNS G++GL + S++ + + FSYC+
Sbjct: 204 FPEHISG--FGNYTLRIIFGCGYNNSDPQHFYPPGLVGLTNNKASLVGQMDVDQFSYCV 260
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 56/170 (32%), Positives = 75/170 (44%), Gaps = 29/170 (17%)
Query: 125 NDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFK 184
N EY + +AIG P Q V L LDTGSD+ WTQC+PC CF Q P+F S S T
Sbjct: 82 NGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSL 141
Query: 185 IPCNSTSCRILR-ESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
C+ST C+ L S P +D+ T A ++
Sbjct: 142 TSCDSTLCQGLPVASLP-------------------------RSDKFTFVGAGAS--VPG 174
Query: 244 YPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGS 293
F G NN KS +GI G R P+S+ ++ FS+C + G+
Sbjct: 175 VAFGCGLFNNGV-FKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGA 223
>gi|358346726|ref|XP_003637416.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
gi|355503351|gb|AES84554.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
Length = 165
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 47/120 (39%), Positives = 65/120 (54%), Gaps = 6/120 (5%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+T ++D N +R AF K K + L DTCYDLS+ +V VP +A HF G L
Sbjct: 50 VTRLQSDVYNVVRDAFVKGTKDLLATNEVS-LFDTCYDLSSKTSVEVPTVAFHFGEGKVL 108
Query: 357 ELDVRGTLV-VASVSQVCLGFATYPPDPNSIT-LGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L + LV V SV C FA P +S++ +GN+QQ+G V +D+A +GF P C
Sbjct: 109 VLPAKNYLVPVDSVGTFCFAFA---PTMSSLSIIGNIQQQGTRVSFDLANSLVGFSPNRC 165
>gi|255685716|gb|ACU28347.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
gi|255685726|gb|ACU28352.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
gi|255685728|gb|ACU28353.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
Length = 91
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 52/101 (51%), Gaps = 15/101 (14%)
Query: 136 VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRIL 195
+ IG P + +LDTGS++ WTQC PC+HC+ Q+ P F SKS TF + CN+
Sbjct: 3 LQIGTPPFEIEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFKETRCNT------ 56
Query: 196 RESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEAN 236
CP+ I Y D S + G AT+ +TI +
Sbjct: 57 ---------PDHSCPYKIVYDDKSYTQGTLATETVTIHSTS 88
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 74/162 (45%), Gaps = 14/162 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
Y +G P Q + + +D +D W C C C P F ++S T+ +PC S
Sbjct: 101 NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGC-AASSPSFSPTQSSTYRTVPCGSP 159
Query: 191 SC-RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
C ++ S P G +S C FN+ YA S D + ++ N Y F G
Sbjct: 160 QCAQVPSPSCPAGVGSS--CGFNLTYA-ASTFQAVLGQDSLALE----NNVVVSYTF--G 210
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLP 288
C+ SG+ G++G R P+S +++T +Y FSYCLP
Sbjct: 211 CLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLP 252
Score = 45.8 bits (107), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 51/106 (48%), Gaps = 10/106 (9%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
+R AF R++ A L DTCY++ TV VP + F G V + L ++ +
Sbjct: 339 VRDAFRGRVRT-PVAPPLGGF-DTCYNV----TVSVPTVTFMFAGAVAVTLPEENVMIHS 392
Query: 368 SVSQV-CLGFATYPPDPNSITL---GNVQQRGHEVHYDVAGRRLGF 409
S V CL A P D + L ++QQ+ V +DVA R+GF
Sbjct: 393 SSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGF 438
>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 564
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 51/181 (28%), Positives = 86/181 (47%), Gaps = 18/181 (9%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCK-PCIHCFQQRDPFFYASKSKTFFKIPCNS 189
+YY + +G P + L +DTGSD+TW QC PC +C + P + +K K +P
Sbjct: 193 QYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKI---VPPRD 249
Query: 190 TSCRILRESFPF-GNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
C+ L+ + C K+C + I+YAD S S G A D + + +NG + F+
Sbjct: 250 LLCQELQGDQNYCATC--KQCDYEIEYADRSSSMGVLAKDDMHM--IATNGGREKLDFVF 305
Query: 249 GCINNSSGD----KSGASGIMGLDRSPVSIITRTN-----TSYFSYCLPSPYGSTGYITF 299
GC + G + GI+GL + +S+ ++ ++ F +C+ GY+
Sbjct: 306 GCAYDQQGQLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHCITKEPNGGGYMFL 365
Query: 300 G 300
G
Sbjct: 366 G 366
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 49/162 (30%), Positives = 74/162 (45%), Gaps = 14/162 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
Y +G P Q + + +D +D W C C C P F ++S T+ +PC S
Sbjct: 82 NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGC-AASSPSFSPTQSSTYRTVPCGSP 140
Query: 191 SC-RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
C ++ S P G +S C FN+ YA S D + ++ N Y F G
Sbjct: 141 QCAQVPSPSCPAGVGSS--CGFNLTYA-ASTFQAVLGQDSLALE----NNVVVSYTF--G 191
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLP 288
C+ SG+ G++G R P+S +++T +Y FSYCLP
Sbjct: 192 CLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLP 233
Score = 45.4 bits (106), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 51/106 (48%), Gaps = 10/106 (9%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
+R AF R++ A L DTCY++ TV VP + F G V + L ++ +
Sbjct: 320 VRDAFRGRVRT-PVAPPLGGF-DTCYNV----TVSVPTVTFMFAGAVAVTLPEENVMIHS 373
Query: 368 SVSQV-CLGFATYPPDPNSITL---GNVQQRGHEVHYDVAGRRLGF 409
S V CL A P D + L ++QQ+ V +DVA R+GF
Sbjct: 374 SSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGF 419
>gi|255685718|gb|ACU28348.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
gi|255685720|gb|ACU28349.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
gi|255685724|gb|ACU28351.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
Length = 91
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 37/101 (36%), Positives = 53/101 (52%), Gaps = 15/101 (14%)
Query: 136 VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRIL 195
+ IG P + +LDTGS++ WTQC PC+HC+ Q+ P F SKS TF + CN+
Sbjct: 3 LQIGTPPFEIEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFKETRCNT------ 56
Query: 196 RESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEAN 236
+ CP+ I Y D S + G AT+ +TI +
Sbjct: 57 ---------PNHSCPYKIVYDDKSYTLGTLATETVTIHSTS 88
>gi|376337722|gb|AFB33417.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
gi|376337724|gb|AFB33418.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
gi|376337726|gb|AFB33419.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
gi|376337728|gb|AFB33420.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
gi|376337730|gb|AFB33421.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
gi|376337732|gb|AFB33422.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
Length = 154
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 64/133 (48%), Gaps = 9/133 (6%)
Query: 68 YGPCSRLNQGISTHAPSL-EEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANIND 126
+G CS L S+ L + L +D RL SR P+ T P
Sbjct: 12 HGACSPLRPTNSSKWIDLVSQSLERDNDRLKTIRSRN-SGPY------TTMSNLPLQSGS 64
Query: 127 TVADEYYIVVA-IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKI 185
V YI+ A G P + L++DTGSD+TW QCKPC+ C+ Q DP F S+S ++ +
Sbjct: 65 EVGTGNYILTAGFGTPTKKFLLVIDTGSDLTWIQCKPCLGCYSQVDPIFDPSQSSSYKSL 124
Query: 186 PCNSTSCRILRES 198
PC S +C L S
Sbjct: 125 PCLSATCTELLTS 137
>gi|218201673|gb|EEC84100.1| hypothetical protein OsI_30414 [Oryza sativa Indica Group]
Length = 366
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 60/232 (25%), Positives = 90/232 (38%), Gaps = 30/232 (12%)
Query: 116 EAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFY 175
E F P I + VA Y + IGE +Q LL+DTGS + WTQC C HC P +
Sbjct: 68 EKFVTPFRIYEDVA--YLAEMEIGERQQKQYLLIDTGSSLVWTQCDECPHCHIGDVPPYG 125
Query: 176 ASKSKTFFKIPCNSTSCRILRESF-------PFGN---CNSKECPFNIQY---ADGSGSG 222
S+S+TF ++ C E+ P G C + C F Y G
Sbjct: 126 RSQSRTFQEVSCGDDDDNDKEEAIASYCPAKPPGYITLCVNGRCMFKALYNLTGQGETVQ 185
Query: 223 GFWATDRITIQEANSNGYFTRYPFLLGC----------INNSSGDKSGASGIMGLDRSPV 272
G+ + D + Y ++ + GC + + +GI+GL
Sbjct: 186 GYMSMDTFHFIDDRRFDYQAKFRMVFGCAHQENIVLTAVKECTTAVKECTGILGLGMGDA 245
Query: 273 SIITRTNTSYFSYCLPS-----PYGSTGYITFGKTDTVNTLRSAFHKRMKKY 319
S + +T + FSYC P Y ++ FG ++ + R KY
Sbjct: 246 SFLRQTGITKFSYCAPPRMPGYSYRRDSWLRFGSHAQISGKKVPLVMRWGKY 297
>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
gi|194690728|gb|ACF79448.1| unknown [Zea mays]
Length = 431
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 60/224 (26%), Positives = 94/224 (41%), Gaps = 33/224 (14%)
Query: 81 HAPS---LEEIL---RQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYI 134
H PS LE I+ R D RL +S+ + T + Y +
Sbjct: 31 HPPSPSPLESIIALARADDARLLFLSSKA---------ASSGGITSAPVASGQTPPSYVV 81
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
+G P Q + L LDT +D TW+ C PC C F + S ++ +PC S C +
Sbjct: 82 RAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCASDWCPL 139
Query: 195 LRESFPFGNCNSK----ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
N ++ C F+ +AD S +D + + + GY GC
Sbjct: 140 FEGQPCPANQDASAPLPACAFSKPFADTSFQASL-GSDTLRLGKDAIAGY------AFGC 192
Query: 251 INNSSGDKSG--ASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
+ +G + G++GL R P+S++++T + Y FSYCLPS
Sbjct: 193 VGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPS 236
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 53/121 (43%), Gaps = 4/121 (3%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
IT LR F +++ L DTC++ P + +H GGVDL
Sbjct: 311 ITRWTAPVYAALREEFRRQVAAPSGYTSL-GAFDTCFNTDEVAAGGAPPVTLHMDGGVDL 369
Query: 357 ELDVRGTLVVASVSQV-CLGFATYP--PDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
L + TL+ +S + + CL A P + + N+QQ+ V DVAG R+GF
Sbjct: 370 TLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREP 429
Query: 414 C 414
C
Sbjct: 430 C 430
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 72/156 (46%), Gaps = 15/156 (9%)
Query: 125 NDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
+D + + YY + IG P Q +L++DTGS VT+ C C C + +DP F S T+
Sbjct: 82 DDLLLNGYYTTRIWIGTPPQTFALIVDTGSTVTYVPCSTCEQCGRHQDPKFEPELSSTYQ 141
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
+ CN C E K+C + QYA+ S S G D I+ +
Sbjct: 142 PVSCN-IDCTCDNE--------RKQCVYERQYAEMSSSSGVLGEDIISF---GNQSELVP 189
Query: 244 YPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITR 277
+ GC N +GD A GIMGL R +SI+ +
Sbjct: 190 QRAIFGCENQETGDLYSQRADGIMGLGRGDLSIVDQ 225
>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
Length = 434
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 100/428 (23%), Positives = 159/428 (37%), Gaps = 94/428 (21%)
Query: 58 DKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEA 117
D ++L+V + CS PS + L K+ R++ F + R
Sbjct: 31 DGSTLKVFHIFSQCSPFK-------PSKPMSWEESVLNLQAKDQARMQY-FSSLVARKSV 82
Query: 118 FTFPANINDTVADEYYIVVA-IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYA 176
A+ + YIV A G P Q + L LDT SD W C C+ C + F
Sbjct: 83 VPI-ASARQIIQSPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGCSTSKP--FAP 139
Query: 177 SKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEAN 236
KS +F + C S C+ P C C FN Y S + D +T+
Sbjct: 140 IKSTSFRNVSCGSPHCK----QVPNPTCGGSACAFNFTYGSSSIAASV-VQDTLTLAADP 194
Query: 237 SNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS--PY 291
GY GC+N ++G + G++GL R P+S+++++ Y FSYCLPS
Sbjct: 195 IPGY------TFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKSI 248
Query: 292 GSTGYITFGKTDTVNTLR-----------SAFHKRMKKYKKAKGLEDL------------ 328
+G + G ++ S ++ + K + + D+
Sbjct: 249 NFSGSLRLGPVYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTG 308
Query: 329 LDTCYD----------------LSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASV--- 369
T +D + + V PK+ + LGG D +V +VV ++
Sbjct: 309 AGTIFDSGTVFTRLAEPVYTAVRNEFRRRVGPKLPVTTLGGFDTCYNV--PIVVPTITFL 366
Query: 370 --------------------SQVCLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGRRL 407
S CL A P + NS+ + N+QQ+ H V +DV R+
Sbjct: 367 FSGMNVALPPDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRI 426
Query: 408 GFGPGNCS 415
G C+
Sbjct: 427 GIARELCT 434
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 60/198 (30%), Positives = 91/198 (45%), Gaps = 15/198 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + V +G P + +++DTGSD+ W QC PC+ CF+QR P F + S ++ + C
Sbjct: 145 EYLMDVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNLTCGDP 204
Query: 191 SCRILRESFPFGNCNSKE-----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C + + CP+ Y D S S G A + T+
Sbjct: 205 RCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVNLTAPGASSRVDG 264
Query: 246 FLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY----FSYCLPSPYGS--TGYITF 299
+ GC + + G GA+G++GL R P+S ++ Y FSYCL +GS + F
Sbjct: 265 VVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGGHTFSYCLVD-HGSDVASKVVF 323
Query: 300 GKTDTVNTLRSAFHKRMK 317
G+ D L A H R+K
Sbjct: 324 GEDD---ALALAAHPRLK 338
Score = 45.8 bits (107), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 32/111 (28%), Positives = 47/111 (42%), Gaps = 2/111 (1%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+R AF RM +L CY++S E VP++++ F G + +
Sbjct: 401 QVIRRAFIDRMSGSYPPVPDFPVLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFI 460
Query: 366 VASVSQV-CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ CL P SI +GN QQ+ V YD+ RLGF P C+
Sbjct: 461 RLDPDGIMCLAVLGTPRTGMSI-IGNFQQQNFHVAYDLHNNRLGFAPRRCA 510
>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 430
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 57/191 (29%), Positives = 92/191 (48%), Gaps = 23/191 (12%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNS 189
YY+ + IG+P + L +DTGSD+TW QC PC C + P++ +K+K +PC +
Sbjct: 72 HYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPWYKPTKNKI---VPCAA 128
Query: 190 TSCRILRESFPFGNCN-SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
+ C L P C ++C + I+Y D + S G D T+ NS+ R
Sbjct: 129 SLCTSLT---PNKKCAVPQQCDYQIKYTDKASSLGVLIADNFTLSLRNSST--VRANLTF 183
Query: 249 GCINNSSGDKSGA-----SGIMGLDRSPVSIITR-----TNTSYFSYCLPSPYGSTGYIT 298
GC + K+GA G++GL + VS++++ + +C + G G++
Sbjct: 184 GCGYDQQVGKNGAVQAATDGLLGLGKGAVSLLSQLKQQGVTKNVLGHCFSTNGG--GFLF 241
Query: 299 FGKTDTVNTLR 309
FG D V T R
Sbjct: 242 FGD-DIVPTSR 251
>gi|356507437|ref|XP_003522473.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 440
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 66/234 (28%), Positives = 96/234 (41%), Gaps = 21/234 (8%)
Query: 109 PEFLKRTEA---FTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCI 164
P L R A FP + N Y + + IG+P + L +DTGSD+TW QC PC
Sbjct: 53 PYILNRFRAGSSVVFPVHGNVYPVGFYNVTLNIGQPPRPYFLDIDTGSDLTWLQCDAPCS 112
Query: 165 HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGF 224
C Q P + S +PC C L S + +C + +QYAD S G
Sbjct: 113 RCSQTPHPLYRPSND----LVPCRHALCASLHLSDNYDCEVPHQCDYEVQYADHYSSLGV 168
Query: 225 WATDRITIQEANSNGYFTRYPFLLGCINNS---SGDKSGASGIMGLDRSPVSIITRTNT- 280
D T+ +NG + LGC + G++GL R S+ ++ N+
Sbjct: 169 LLHDVYTLNF--TNGVQLKVRMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQ 226
Query: 281 ----SYFSYCLPSPYGSTGYITFGKT-DTVNTLRSAFHKRMKKYKKAKGLEDLL 329
+ +CL + G GYI FG D+ + R K+ G +LL
Sbjct: 227 GLVRNVIGHCLSAQGG--GYIFFGDVYDSFRLTWTPMSSRDYKHYSVAGAAELL 278
>gi|125524351|gb|EAY72465.1| hypothetical protein OsI_00321 [Oryza sativa Indica Group]
Length = 343
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 42/65 (64%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+ V +G P + + ++LDTGSDVTW QC+PC C+QQ DP F S S ++ + C++
Sbjct: 166 EYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSTSYASVACDNP 225
Query: 191 SCRIL 195
C L
Sbjct: 226 RCHDL 230
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 76/158 (48%), Gaps = 19/158 (12%)
Query: 125 NDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
+D +++ YY + IG P Q +L++DTGS VT+ C C C + +DP F S T+
Sbjct: 69 DDLLSNGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSSCEQCGKHQDPRFQPDLSSTYR 128
Query: 184 KIPCNSTSCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
+ CN P NC+ K+C + +YA+ S S G A D ++ +
Sbjct: 129 PVKCN-----------PSCNCDDEGKQCTYERRYAEMSSSSGVIAEDVVSF---GNESEL 174
Query: 242 TRYPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITR 277
+ GC N +GD A GIMGL R +S++ +
Sbjct: 175 KPQRAVFGCENVETGDLYSQRADGIMGLGRGRLSVVDQ 212
>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 63/206 (30%), Positives = 97/206 (47%), Gaps = 30/206 (14%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y I V +G P + + +DTGS +W C+ C C F S+S T K+ C ++
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFL-QSRSTTCAKVSCGTSM 58
Query: 192 CRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-F 246
C +L S P +C E CPF + Y DGS S G D +T + + P F
Sbjct: 59 C-LLGGSDP--HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPGF 109
Query: 247 LLGCINNSSG--DKSGASGIMGLDRSPVSIITRTNTSY--FSYCLP---SPYG----STG 295
GC +S G + G++G+ +S++ +++ ++ FSYCLP S G +TG
Sbjct: 110 SFGCNMDSFGANEFGNVDGLLGMGAGAMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTG 169
Query: 296 YITFGKTDTVNTLRSAFHKRMKKYKK 321
Y + GK T +R + +M KK
Sbjct: 170 YFSLGKVATRTDVR---YTKMVARKK 192
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 52/193 (26%), Positives = 81/193 (41%), Gaps = 26/193 (13%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFFKIP 186
Y+ V +G P + + +DTGSD+ W C PC C + F S T +I
Sbjct: 5 YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 64
Query: 187 CNSTSCRILRESFPFG-------NCNSKECPFNIQYADGSGSGGFWATDRITIQE--ANS 237
C+ C F G N S C + Y DGSG+ G++ +D + + N
Sbjct: 65 CSDDRC---TAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNE 121
Query: 238 NGYFTRYPFLLGCINNSSGDKSGA----SGIMGLDRSPVSIITRTNT-----SYFSYCLP 288
+ + GC N+ SGD + A GI G + +S+I++ N+ FS+CL
Sbjct: 122 QTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLK 181
Query: 289 SPYGSTGYITFGK 301
G + G+
Sbjct: 182 GSDNGGGILVLGE 194
>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 63/206 (30%), Positives = 97/206 (47%), Gaps = 30/206 (14%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y I V +G P + + +DTGS +W C+ C C F S+S T K+ C ++
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFL-QSRSTTCAKVSCGTSM 58
Query: 192 CRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-F 246
C +L S P +C E CPF + Y DGS S G D +T + + P F
Sbjct: 59 C-LLGGSDP--HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPGF 109
Query: 247 LLGCINNSSG--DKSGASGIMGLDRSPVSIITRTNTSY--FSYCLP---SPYG----STG 295
GC +S G + G++G+ +S++ +++ ++ FSYCLP S G +TG
Sbjct: 110 SFGCNMDSFGANEFGNVDGLLGMGAGAMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTG 169
Query: 296 YITFGKTDTVNTLRSAFHKRMKKYKK 321
Y + GK T +R + +M KK
Sbjct: 170 YFSLGKVATRTDVR---YTKMVARKK 192
>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 100/428 (23%), Positives = 159/428 (37%), Gaps = 94/428 (21%)
Query: 58 DKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEA 117
D ++L+V + CS PS + L K+ R++ F + R
Sbjct: 31 DGSTLKVFHIFSQCSPFK-------PSKPMSWEESVLNLQAKDQARMQY-FSSLVARKSV 82
Query: 118 FTFPANINDTVADEYYIVVA-IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYA 176
A+ + YIV A G P Q + L LDT SD W C C+ C + F
Sbjct: 83 VPI-ASARQIIQSPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGCSTSKP--FAP 139
Query: 177 SKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEAN 236
KS +F + C S C+ P C C FN Y S + D +T+
Sbjct: 140 IKSTSFRNVSCGSPHCK----QVPNPTCGGSACAFNFTYGSSSIAASV-VQDTLTLATDP 194
Query: 237 SNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS--PY 291
GY GC+N ++G + G++GL R P+S+++++ Y FSYCLPS
Sbjct: 195 IPGY------TFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKSI 248
Query: 292 GSTGYITFGKTDTVNTLR-----------SAFHKRMKKYKKAKGLEDL------------ 328
+G + G ++ S ++ + K + + D+
Sbjct: 249 NFSGSLRLGPVYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTG 308
Query: 329 LDTCYD----------------LSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASV--- 369
T +D + + V PK+ + LGG D +V +VV ++
Sbjct: 309 AGTIFDSGTVFTRLAEPVYTAVRNEFRRRVGPKLPVTTLGGFDTCYNV--PIVVPTITFL 366
Query: 370 --------------------SQVCLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGRRL 407
S CL A P + NS+ + N+QQ+ H V +DV R+
Sbjct: 367 FSGMNVTLPPDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRI 426
Query: 408 GFGPGNCS 415
G C+
Sbjct: 427 GIARELCT 434
>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
gi|194703964|gb|ACF86066.1| unknown [Zea mays]
gi|219886221|gb|ACL53485.1| unknown [Zea mays]
gi|219886359|gb|ACL53554.1| unknown [Zea mays]
gi|223950085|gb|ACN29126.1| unknown [Zea mays]
gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 431
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 60/224 (26%), Positives = 94/224 (41%), Gaps = 33/224 (14%)
Query: 81 HAPS---LEEIL---RQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYI 134
H PS LE I+ R D RL +S+ + T + Y +
Sbjct: 31 HPPSPSPLESIIALARADDARLLFLSSKA---------ASSGGVTSAPVASGQTPPSYVV 81
Query: 135 VVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRI 194
+G P Q + L LDT +D TW+ C PC C F + S ++ +PC S C +
Sbjct: 82 RAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCASDWCPL 139
Query: 195 LRESFPFGNCNSK----ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
N ++ C F+ +AD S +D + + + GY GC
Sbjct: 140 FEGQPCPANQDASAPLPACAFSKPFADTSFQASL-GSDTLRLGKDAIAGY------AFGC 192
Query: 251 INNSSGDKSG--ASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
+ +G + G++GL R P+S++++T + Y FSYCLPS
Sbjct: 193 VGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPS 236
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 53/121 (43%), Gaps = 4/121 (3%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
IT LR F +++ L DTC++ P + +H GGVDL
Sbjct: 311 ITRWTAPVYAALREEFRRQVAAPSGYTSL-GAFDTCFNTDEVAAGGAPPVTLHMDGGVDL 369
Query: 357 ELDVRGTLVVASVSQV-CLGFATYP--PDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
L + TL+ +S + + CL A P + + N+QQ+ V DVAG R+GF
Sbjct: 370 TLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREP 429
Query: 414 C 414
C
Sbjct: 430 C 430
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 77/158 (48%), Gaps = 19/158 (12%)
Query: 125 NDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
+D + + YY + IG P Q +L++DTGS VT+ C C HC +DP F S+T+
Sbjct: 85 DDLLRNGYYTARLWIGTPPQRFALIVDTGSTVTYVPCSTCRHCGSHQDPKFRPEDSETYQ 144
Query: 184 KIPCNSTSCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
+ C + C NC++ K+C + +YA+ S S G D ++
Sbjct: 145 PVKC-TWQC----------NCDNDRKQCTYERRYAEMSTSSGALGEDVVSFGNQTE---L 190
Query: 242 TRYPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITR 277
+ + GC N+ +GD A GIMGL R +SI+ +
Sbjct: 191 SPQRAIFGCENDETGDIYNQRADGIMGLGRGDLSIMDQ 228
>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
Length = 429
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 76/153 (49%), Gaps = 12/153 (7%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
YY+ + IG P + L +DTGSD+TW QC PC C + P + +K+K +PC
Sbjct: 66 YYVAMNIGNPPKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTKNKL---VPCVDQ 122
Query: 191 SCRILRESFPFGN-CNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C L + C+S ++C + I+YAD S G D ++ A NG R
Sbjct: 123 LCASLHNGLNRKHKCDSPYEQCDYVIKYADQGSSTGVLVNDSFALRLA--NGSVVRPSLA 180
Query: 248 LGCINN---SSGDKSGASGIMGLDRSPVSIITR 277
GC + SSG+ S G++GL VS++++
Sbjct: 181 FGCGYDQQVSSGEMSPTDGVLGLGTGSVSLLSQ 213
>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
vinifera]
Length = 451
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 72/250 (28%), Positives = 108/250 (43%), Gaps = 34/250 (13%)
Query: 55 QGPDKAS-LEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLK 113
+ PD+ S L+V+ Y PCS +E L ++ L ++ + R F L
Sbjct: 31 ETPDQGSTLQVLHVYSPCSPFRP---------KEPLSWEESVLQMQAKDKARLQFLSSLV 81
Query: 114 RTEAFTFPANINDTVADEYYIVVA-IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDP 172
++ A+ V + YIV A IG P Q + + +DT SDV W C C+ C
Sbjct: 82 ARKSVVPIASGRQIVQNPTYIVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGC---SST 138
Query: 173 FFYASKSKTFFKIPCNSTSCRILRESF----------PFGNCNSKECPFNIQYADGSGSG 222
F + S T+ + C + C+ + P C C FN+ Y GS
Sbjct: 139 LFNSPASTTYKSLGCQAAQCKQVLHLLSPLLTSPSVVPKPTCGGGVCSFNLTYG-GSSLA 197
Query: 223 GFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY 282
+ D IT+ GY GCI ++G A G++GL R P+S++++T Y
Sbjct: 198 ANLSQDTITLATDAVPGYS------FGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLY 251
Query: 283 ---FSYCLPS 289
FSYCLPS
Sbjct: 252 QSTFSYCLPS 261
Score = 46.6 bits (109), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 33/111 (29%), Positives = 51/111 (45%), Gaps = 9/111 (8%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
+R AF R+ + L DTCY + + P I F G +++ L L+ +
Sbjct: 347 VRDAFRNRVGRNLTVTSLGGF-DTCYTVP----IAAPTITFMFTG-MNVTLPPDNLLIHS 400
Query: 368 SV-SQVCLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ S CL A P + NS+ + N+QQ+ H + YDV RLG C+
Sbjct: 401 TAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELCT 451
>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 323
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 61/208 (29%), Positives = 95/208 (45%), Gaps = 27/208 (12%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y I V +G P + L +DTGS +W C+ C C F S+S T K+ C ++
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFCE-CDGCHTNPRTFL-QSRSTTCAKVSCGTSM 58
Query: 192 CRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-F 246
C +L S P +C E CPF + Y DGS S G D +T + + P F
Sbjct: 59 C-LLGGSDP--HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ------KIPGF 109
Query: 247 LLGCINNSSG--DKSGASGIMGLDRSPVSIITRTNTSY--FSYCLP---SPYG----STG 295
GC +S G + G++G+ P+S++ +++ ++ FSYCLP S G +TG
Sbjct: 110 SFGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTG 169
Query: 296 YITFGKTDTVNTLRSAFHKRMKKYKKAK 323
Y + G + K + + K +
Sbjct: 170 YFSLGGKIAATRTDVRYTKMVARRKNTE 197
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 60/205 (29%), Positives = 91/205 (44%), Gaps = 29/205 (14%)
Query: 78 ISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIV-V 136
IS S +L +D + HL+N L KP + +D + + YY +
Sbjct: 44 ISPTNSSHRRVLDRDHRLRHLQN---LVKPHSSNARMRLH-------DDLLTNGYYTTRL 93
Query: 137 AIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILR 196
IG P Q +L++DTGS VT+ C C+ C +DP F S T+ + CN+ C
Sbjct: 94 WIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCNA-DC---- 148
Query: 197 ESFPFGNC--NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNS 254
NC N +C + +YA+ S S G A D ++ + + + GC
Sbjct: 149 ------NCDENGVQCTYERRYAEMSTSSGVLAEDVMSFGKESE---LVPQRAVFGCETME 199
Query: 255 SGD--KSGASGIMGLDRSPVSIITR 277
SGD A GIMGL R +S++ +
Sbjct: 200 SGDLYTQRADGIMGLGRGTLSVMDQ 224
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 60/205 (29%), Positives = 91/205 (44%), Gaps = 29/205 (14%)
Query: 78 ISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIV-V 136
IS S +L +D + HL+N L KP + +D + + YY +
Sbjct: 44 ISPTNSSHRRVLDRDHRLRHLQN---LVKPHSSNARMRLH-------DDLLTNGYYTTRL 93
Query: 137 AIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILR 196
IG P Q +L++DTGS VT+ C C+ C +DP F S T+ + CN+ C
Sbjct: 94 WIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCNA-DC---- 148
Query: 197 ESFPFGNC--NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNS 254
NC N +C + +YA+ S S G A D ++ + + + GC
Sbjct: 149 ------NCDENGVQCTYERRYAEMSTSSGVLAEDVMSFGKESE---LVPQRAVFGCETME 199
Query: 255 SGD--KSGASGIMGLDRSPVSIITR 277
SGD A GIMGL R +S++ +
Sbjct: 200 SGDLYTQRADGIMGLGRGTLSVMDQ 224
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 53/192 (27%), Positives = 90/192 (46%), Gaps = 25/192 (13%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYAS-----KSKTFFKIP 186
Y+ + +G P + + +DTGSD+ W CKPC C + + F+ S S T K+
Sbjct: 74 YFTKIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPSKTNLNFHLSLFDVNASSTSKKVG 133
Query: 187 CNSTSCRILRESFPFGNCN-SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C+ C + +S +C + C ++I YAD S S G + D++T+++ G P
Sbjct: 134 CDDDFCSFISQS---DSCQPAVGCSYHIVYADESTSEGNFIRDKLTLEQV--TGDLQTGP 188
Query: 246 F----LLGCINNSSGD----KSGASGIMGLDRSPVSIITRTNTS-----YFSYCLPSPYG 292
+ GC ++ SG S G+MG +S S++++ + FS+CL + G
Sbjct: 189 LGQEVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKG 248
Query: 293 STGYITFGKTDT 304
G G D+
Sbjct: 249 G-GIFAVGVVDS 259
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 55/187 (29%), Positives = 83/187 (44%), Gaps = 24/187 (12%)
Query: 125 NDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
+D +++ YY + IG P Q +L++DTGS VT+ C C C + +DP F S ++
Sbjct: 72 DDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSSSYK 131
Query: 184 KIPCNSTSCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
+ CN P NC+ K C + +YA+ S S G + D I+ +
Sbjct: 132 ALKCN-----------PDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISF---GNESQL 177
Query: 242 TRYPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITR-----TNTSYFSYCLPSPYGST 294
T + GC N +GD A GIMGL R +S++ + FS C
Sbjct: 178 TPQRAVFGCENVETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGG 237
Query: 295 GYITFGK 301
G + GK
Sbjct: 238 GAMVLGK 244
>gi|388520263|gb|AFK48193.1| unknown [Lotus japonicus]
Length = 157
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 75/151 (49%), Gaps = 3/151 (1%)
Query: 266 GLDRSPVSIITR-TNTSYFSYCLPSPYGSTGYITFGKTDTVNTLRSAFHKRM-KKYKKAK 323
GLD + +++ + + SY +P+ S IT L+++F + M KKY +A
Sbjct: 7 GLDLTAITVGGKPLGLAASSYKVPTIIDSGTVITRLPMPVYTALKNSFVRIMSKKYAQAP 66
Query: 324 GLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDP 383
G+ +LDTC+ + E VP+I + F GG DL L TL+ CL A +
Sbjct: 67 GIS-ILDTCFKGNVKEMSEVPEIQMIFGGGADLPLKAHNTLIELDKGVTCLAIAGSSENN 125
Query: 384 NSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+GN QQ+ +V YDVA ++GF G C
Sbjct: 126 PIAIIGNYQQQTFKVAYDVANSKIGFAAGGC 156
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 62/217 (28%), Positives = 99/217 (45%), Gaps = 24/217 (11%)
Query: 125 NDTVADE-YYIVVAIGEPKQYVSLLLDTGSDVTWTQCK----PCIHCFQQRDPFFYASKS 179
N ++D+ + + V I +P++ L++DTGSD+ WTQCK P + +S
Sbjct: 8 NILLSDQGHSLTVGIVQPRK---LIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGES 64
Query: 180 KTFFKIPCNSTSCRILRESFPFGNCNSK-ECPFNIQYADGSGSGGFWATDRITIQEANSN 238
TF +PC+ C+ F F NC SK C + Y + G A++ T
Sbjct: 65 STFAFLPCSDRLCQ--EGQFSFKNCTSKNRCVYEDVYGSAAAVG-VLASETFTF--GARR 119
Query: 239 GYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYG--STGY 296
R F GC S+G GA+GI+GL +S+IT+ FSYCL +P+ T
Sbjct: 120 AVSLRLGF--GCGALSAGSLIGATGILGLSPESLSLITQLKIQRFSYCL-TPFADKKTSP 176
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCY 333
+ FG ++ HK + + + + ++T Y
Sbjct: 177 LLFGAMADLSR-----HKTTRPIQTTAIVSNPVETVY 208
Score = 41.2 bits (95), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 23/79 (29%), Positives = 32/79 (40%)
Query: 336 SAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRG 395
+A E V VP + +HF GG + L +CL +GNVQQ+
Sbjct: 292 AAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQN 351
Query: 396 HEVHYDVAGRRLGFGPGNC 414
V +DV + F P C
Sbjct: 352 MHVLFDVQHHKFSFAPTQC 370
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 76/173 (43%), Gaps = 21/173 (12%)
Query: 136 VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF---KIPCNSTSC 192
++IG+P +++DTGSD+ W C PC +C F S S TF K PC
Sbjct: 105 LSIGQPSIPQLVVMDTGSDILWIMCNPCTNCDNHLGLLFDPSMSSTFSPLCKTPCG---- 160
Query: 193 RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCIN 252
F C PF I Y D S + G + D I + E G ++GC +
Sbjct: 161 --------FKGCKCDPIPFTISYVDNSSASGTFGRD-ILVFETTDEGTSQISDVIIGCGH 211
Query: 253 NSSGDKS-GASGIMGLDRSPVSIITRTNTSYFSYC---LPSPYGSTGYITFGK 301
N + G +GI+GL+ P S+ T+ FSYC L PY + + G+
Sbjct: 212 NIGFNSDPGYNGILGLNNGPNSLATQIGRK-FSYCIGNLADPYYNYNQLRLGE 263
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 50/158 (31%), Positives = 75/158 (47%), Gaps = 19/158 (12%)
Query: 125 NDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
+D +++ YY + IG P Q +L++DTGS VT+ C C HC + +DP F +S T+
Sbjct: 80 DDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSDCEHCGKHQDPRFQPDESSTYH 139
Query: 184 KIPCNSTSCRILRESFPFGNCNSK--ECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
+ CN C NC+ C + +YA+ S S G D I+ +
Sbjct: 140 PVKCN-MDC----------NCDHDGVNCVYERRYAEMSSSSGVLGEDIISF---GNQSEV 185
Query: 242 TRYPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITR 277
+ GC N +GD A GIMGL R +SI+ +
Sbjct: 186 VPQRAVFGCENVETGDLYSQRADGIMGLGRGQLSIVDQ 223
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 87/206 (42%), Gaps = 20/206 (9%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRD-----PFFYASKSKTFFKIP 186
YY V +G P + + +DTGSD+ W C C C ++ + + SKT +P
Sbjct: 72 YYTKVGLGSPAKEFYVQVDTGSDILWVNCAGCTACPKKSGLGMDLTLYDPNGSKTSNAVP 131
Query: 187 CNSTSCRILRESFPFGNCNSK-ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFT--R 243
C C S P C CP++I Y DGS + G + D +T E + N +
Sbjct: 132 CGDGFCTD-TYSGPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTFDEVSGNLHTKPDN 190
Query: 244 YPFLLGCINNSSGDKSGAS-----GIMGLDRSPVSIITRTNTS-----YFSYCLPSPYGS 293
+ GC SG S S GI+G ++ S++++ S FS+CL S +G
Sbjct: 191 SSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHCLDSHHGG 250
Query: 294 TGYITFGKTDTVNTLRSAFHKRMKKY 319
G + G+ + RM Y
Sbjct: 251 -GIFSIGQVMEPKFNTTPLVPRMAHY 275
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 54/205 (26%), Positives = 86/205 (41%), Gaps = 26/205 (12%)
Query: 118 FTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC-----FQQRDP 172
FT + VA YY + +G P + + +DTGSD+ W CKPC C
Sbjct: 27 FTLQGTADPYVAGLYYTRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTSGLGVALN 86
Query: 173 FFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITI 232
FF S T + C + C + + C ++ +Y DGSG+ G++ +D
Sbjct: 87 FFDPRGSSTASPLSCIDSKCVSSNQISESVCTTDRYCGYSFEYGDGSGTLGYYVSD---- 142
Query: 233 QEANSNGYFTRY-------PFLLGCINNSSGDKS----GASGIMGLDRSPVSIITRTNTS 281
E + N Y +Y GC N SGD + GI G ++ +S++++ N+
Sbjct: 143 -EFDYNQYVNQYVTNNASAKITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQ 201
Query: 282 -----YFSYCLPSPYGSTGYITFGK 301
FS+CL G + G+
Sbjct: 202 GLAPKIFSHCLEGADPGGGILVLGE 226
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 65/218 (29%), Positives = 103/218 (47%), Gaps = 30/218 (13%)
Query: 85 LEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQY 144
+E+I+ DQ+R L + +R K + I+ A +Y+ V +G P +
Sbjct: 49 IEDIIGADQKRHSLISRKRK-------FKGGVKMDLGSGIDYGTA-QYFTEVRVGTPAKK 100
Query: 145 VSLLLDTGSDVTWTQCKPCIHCFQQRDP-------FFYASKSKTFFKIPCNSTSCRI-LR 196
+++DTGS++TW C+ ++ R F A +SK+F + C + +C++ L
Sbjct: 101 FRVVVDTGSELTWVNCR-----YRGRGKGKVKNRRVFRAEESKSFKTVGCFTQTCKVDLM 155
Query: 197 ESFPFGNC--NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC-INN 253
F C S C ++ +YADGS + G +A + IT+ N R L+GC +
Sbjct: 156 NLFSLSTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRKARLR-GLLVGCSSSF 214
Query: 254 SSGDKSGASGIMGLDRSPVSIITRTNTSYF----SYCL 287
S GA G++GL S S T T TS F SYCL
Sbjct: 215 SGQSFQGADGVLGLAFSDFS-FTSTATSLFGAKLSYCL 251
Score = 42.0 bits (97), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 22/80 (27%), Positives = 37/80 (46%), Gaps = 1/80 (1%)
Query: 336 SAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRG 395
S + +P++ H GG E + LV A+ CLGF + P + +GN+ Q+
Sbjct: 371 SGFNESKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFMS-AGTPATNVVGNIMQQN 429
Query: 396 HEVHYDVAGRRLGFGPGNCS 415
+ +D+ L F P C+
Sbjct: 430 YLWEFDLMASTLSFAPSTCT 449
>gi|168027607|ref|XP_001766321.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682535|gb|EDQ68953.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 381
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 48/183 (26%), Positives = 87/183 (47%), Gaps = 18/183 (9%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
YY+ + IG P + L +DTGSD+TW QC PC C + K++ + C
Sbjct: 23 YYMAMLIGAPAKLYYLDMDTGSDLTWLQCDAPCRSCASGPHGLYDPKKARL---VDCRVP 79
Query: 191 SCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
C ++++ + C ++C ++++YADGS + G D IT+ +NG ++ ++
Sbjct: 80 LCALVQQGGSYA-CGGPVRQCDYDVEYADGSSTMGVLMEDTITLLL--TNGTRSKTTAII 136
Query: 249 GCINNSSG----DKSGASGIMGLDRSPVSIITRTNT-----SYFSYCLPSPYGSTGYITF 299
GC + G + G+MGL + +S+ ++ + +CL GY+ F
Sbjct: 137 GCGYDQQGTLAQTPASTDGVMGLSSAKISLPSQLAKKGIVRNVIGHCLAGGSNGGGYLFF 196
Query: 300 GKT 302
G +
Sbjct: 197 GDS 199
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 54/187 (28%), Positives = 83/187 (44%), Gaps = 24/187 (12%)
Query: 125 NDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
+D +++ YY + IG P Q +L++DTGS VT+ C C C + +DP F S ++
Sbjct: 68 DDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQ 127
Query: 184 KIPCNSTSCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
+ CN P NC+ K C + +YA+ S S G + D I+ +
Sbjct: 128 ALKCN-----------PDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISF---GNESQL 173
Query: 242 TRYPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITR-----TNTSYFSYCLPSPYGST 294
+ + GC N +GD A GIMGL R +S++ + FS C
Sbjct: 174 SPQRAVFGCENEETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGG 233
Query: 295 GYITFGK 301
G + GK
Sbjct: 234 GAMVLGK 240
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 54/187 (28%), Positives = 83/187 (44%), Gaps = 24/187 (12%)
Query: 125 NDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
+D +++ YY + IG P Q +L++DTGS VT+ C C C + +DP F S ++
Sbjct: 68 DDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQ 127
Query: 184 KIPCNSTSCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
+ CN P NC+ K C + +YA+ S S G + D I+ +
Sbjct: 128 ALKCN-----------PDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISF---GNESQL 173
Query: 242 TRYPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITR-----TNTSYFSYCLPSPYGST 294
+ + GC N +GD A GIMGL R +S++ + FS C
Sbjct: 174 SPQRAVFGCENEETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGG 233
Query: 295 GYITFGK 301
G + GK
Sbjct: 234 GAMVLGK 240
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 51/163 (31%), Positives = 76/163 (46%), Gaps = 11/163 (6%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
Y + +G P Q + ++LDT +D W C C C F + S T+ + C++T
Sbjct: 104 NYVVRARLGTPPQLMFMVLDTSNDAVWLPCSGCSGC-SNASTSFNTNSSSTYSTVSCSTT 162
Query: 191 SCRILRE-SFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
C R + P C FN Y S D +T+ S + F G
Sbjct: 163 QCTQARGLTCPSSTPQPSICSFNQSYGGDSSFSANLVQDTLTL----SPDVIPNFSF--G 216
Query: 250 CINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
CIN++SG+ G+MGL R P+S++++T + Y FSYCLPS
Sbjct: 217 CINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPS 259
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 54/110 (49%), Gaps = 9/110 (8%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
+R F K++ G DTC+ SA V PKI +H + +DL+L + TL+ +
Sbjct: 345 IRDEFRKQVNGSFSTLGA---FDTCF--SADNENVTPKITLH-MTSLDLKLPMENTLIHS 398
Query: 368 SVSQV-CLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
S + CL A + N++ + N+QQ+ + +DV R+G P C
Sbjct: 399 SAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPC 448
>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
gi|255638149|gb|ACU19388.1| unknown [Glycine max]
Length = 437
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 63/235 (26%), Positives = 98/235 (41%), Gaps = 24/235 (10%)
Query: 58 DKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEA 117
D ++L+V + PCS PS + +L K+ R++ + R
Sbjct: 32 DGSTLQVFHVFSPCSPFR-------PSKPMSWEESVLKLQAKDQARMQY-LSSLVARRSI 83
Query: 118 FTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYAS 177
+ T + Y + IG P Q + L +DT +D +W C C+ C F +
Sbjct: 84 VPIASGRQITQSPTYIVKAKIGTPAQTLLLAMDTSNDASWVPCTACVGCSTTTP--FAPA 141
Query: 178 KSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANS 237
KS TF K+ C ++ C+ +R C+ C FN Y S + D +T+
Sbjct: 142 KSTTFKKVGCGASQCKQVRNP----TCDGSACAFNFTYGTSSVAASL-VQDTVTLATDPV 196
Query: 238 NGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
Y GCI +G G++GL R P+S++ +T Y FSYCLPS
Sbjct: 197 PAY------AFGCIQKVTGSSVPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLPS 245
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/115 (31%), Positives = 56/115 (48%), Gaps = 11/115 (9%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDL--LDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGT 363
N +R+ F +R+ +KK + L DTCY +V P I F G+++ L
Sbjct: 329 NAVRNEFRRRIAVHKKLT-VTSLGGFDTCYT----APIVAPTITFMF-SGMNVTLPPDNI 382
Query: 364 LVVASVSQV-CLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
L+ ++ V CL A P + NS+ + N+QQ+ H V +DV RLG C+
Sbjct: 383 LIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFDVPNSRLGVARELCT 437
>gi|326498555|dbj|BAJ98705.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 508
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 50/181 (27%), Positives = 87/181 (48%), Gaps = 18/181 (9%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNS 189
+YY + IG P + L +DTGS +TW QC PC +C + P + +K +P
Sbjct: 128 QYYTSINIGNPARPYFLDVDTGSALTWIQCDAPCTNCTKGPHPLYKPAKENI---VPPRD 184
Query: 190 TSCRILRESFPFGNCNS-KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
+ C+ L+ + + C++ K+C + I YAD S S G A D + + A +G +
Sbjct: 185 SHCQELQGNQNY--CDTCKQCDYEIAYADRSSSAGVLARDNMELITA--DGERENMDLVF 240
Query: 249 GCINNSSGDKSG----ASGIMGLDRSPVSIITRTN-----TSYFSYCLPSPYGSTGYITF 299
GC ++ G G + GI+GL +S+ T+ ++ F +C+ + + Y+
Sbjct: 241 GCAHDQQGKLLGSPASSDGILGLSNGAMSLPTQLAKQGIISNVFGHCIATDPSGSAYMFL 300
Query: 300 G 300
G
Sbjct: 301 G 301
>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 55/169 (32%), Positives = 72/169 (42%), Gaps = 26/169 (15%)
Query: 125 NDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQ-----------RDP 172
+D + YY V IG P +L++DTGS VT+ C C HC RDP
Sbjct: 32 DDLLTKGYYTSRVFIGTPPNEFALIVDTGSTVTYVPCSSCTHCGHHQASFSTHRLFCRDP 91
Query: 173 FFYASKSKTFFKIPCNSTSCRILRESFPFGNC--NSKECPFNIQYADGSGSGGFWATDRI 230
F S ++ KI C S+ C G C NS +C + YA+ S S G D +
Sbjct: 92 RFKPENSSSYQKIGCRSSDCIT-------GLCDSNSHQCKYERMYAEMSTSKGVLGKDLL 144
Query: 231 TIQEANSNGYFTRYPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITR 277
A+ GC SGD A GIMGL R P+SI+ +
Sbjct: 145 DFGPASR---LQSQLLSFGCETAESGDLYLQVADGIMGLGRGPLSIVDQ 190
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 85/354 (24%), Positives = 136/354 (38%), Gaps = 78/354 (22%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + + IG P + DTGSD+ W QC PC +CF Q P F KS TF C+S
Sbjct: 91 EYLMTLYIGTPPVERLAIADTGSDLIWVQCSPCQNCFPQDTPLFEPLKSSTFKAATCDSQ 150
Query: 191 SCRILRESFPFGNCNS-KECPFNIQYADGSGSGGFWATDRITI----------------- 232
C + S C +C ++ Y D S + G T+ ++
Sbjct: 151 PCTSVPPS--QRQCGKVGQCIYSYSYGDKSFTVGVVGTETLSFGSTGDAQTVSFPSSIFG 208
Query: 233 ------------------------------QEANSNGYFTRYPFLLGCINNSSGDKSGAS 262
Q GY Y L N++S K G+
Sbjct: 209 CGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQIGYKFSYCLLPFSSNSTSKLKFGSE 268
Query: 263 GIM---GLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFGKTD--------TV-----N 306
I+ G+ +P+ II S++ L + + G+TD TV
Sbjct: 269 AIVTTNGVVSTPL-IIKPLFPSFYFLNLEAVTIGQKVVPTGRTDGNIIIDSGTVLTYLEQ 327
Query: 307 TLRSAFHKRMKKYKKAKGLEDL---LDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGT 363
T + F +++ + +DL C+ Y + +P IA F G + L +
Sbjct: 328 TFYNNFVASLQEVLSVESAQDLPFPFKFCF---PYRDMTIPVIAFQFTGA-SVALQPKNL 383
Query: 364 LV-VASVSQVCLGFATYPPDPNSITL-GNVQQRGHEVHYDVAGRRLGFGPGNCS 415
L+ + + +CL A P + I++ GNV Q +V YD+ G+++ F P +C+
Sbjct: 384 LIKLQDRNMLCL--AVVPSSLSGISIFGNVAQFDFQVVYDLEGKKVSFAPTDCT 435
>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
Length = 358
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 61/205 (29%), Positives = 93/205 (45%), Gaps = 24/205 (11%)
Query: 118 FTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYA 176
F N+ T YY+ + IG P + L +DTGSD+TW QC PC C + P +
Sbjct: 42 FQLQGNVYPT--GHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRP 99
Query: 177 SKSKTFFKIPCNSTSCRILRESFPFGN-CNS-KECPFNIQYADGSGSGGFWATDRITIQE 234
+ + +PC + C L N C S K+C + I+Y D + S G D ++
Sbjct: 100 TANSL---VPCANALCTALHSGHGSNNKCPSPKQCDYQIKYTDSASSQGVLINDNFSLPM 156
Query: 235 ANSNGYFTRYPFLLGCINNSSGDKSGA-----SGIMGLDRSPVSIITR-----TNTSYFS 284
+SN R GC + K+GA G++GL R VS++++ +
Sbjct: 157 RSSN---IRPGLTFGCGYDQQVGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLG 213
Query: 285 YCLPSPYGSTGYITFGKTDTVNTLR 309
+CL + G G++ FG D V T R
Sbjct: 214 HCLSTNGG--GFLFFGD-DIVPTSR 235
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 79/366 (21%), Positives = 135/366 (36%), Gaps = 89/366 (24%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRD-----PFFYASKSKTFFKIP 186
Y+ V +G P ++ +DTGSD+ W C C +C FF A S T +
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 159
Query: 187 CNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQE-------ANSNG 239
C+ C + ++ + +C ++ +Y DGSG+ G++ TD ANS+
Sbjct: 160 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 219
Query: 240 YFTRYPFLLGCINNSSGDKS----GASGIMGLDRSPVSIITRTNTS-----YFSYCLPSP 290
P + GC SGD + GI G + +S++++ ++ FS+CL
Sbjct: 220 -----PIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGD 274
Query: 291 YGSTGYITFGKTDT----------------VNTLRSAFHKRMKKYKKA------------ 322
G G+ +N L + +M A
Sbjct: 275 GSGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIV 334
Query: 323 ---KGLEDLLDTCYDL---------SAYETVVV-----------------PKIAIHFLGG 353
L L+ YDL S T ++ P ++++F GG
Sbjct: 335 DTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGG 394
Query: 354 VDLELDVRGTL----VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGF 409
+ L + L + S C+GF P + LG++ + YD+A +R+G+
Sbjct: 395 ASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQT--ILGDLVLKDKVFVYDLARQRIGW 452
Query: 410 GPGNCS 415
+CS
Sbjct: 453 ASYDCS 458
>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 407
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 55/208 (26%), Positives = 100/208 (48%), Gaps = 20/208 (9%)
Query: 102 RRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCK 161
R P F + F NI+ TV+ + +G P Q VS+++DTGS+++W C
Sbjct: 7 RTEEIPSNSFPRSPNKLPFRHNISLTVS------LTVGTPPQNVSMVIDTGSELSWLYCN 60
Query: 162 PCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPF-GNCNSKE-CPFNIQYADGS 219
F ++S ++ IPC+S++C F +C+S C + YAD S
Sbjct: 61 KTTTTTSYPT-TFNQTRSISYRPIPCSSSTCTNQTRDFSIPASCDSNSLCHATLSYADAS 119
Query: 220 GSGGFWATDRITIQEANSNGYFTRYPFLLGCIN----NSSGDKSGASGIMGLDRSPVSII 275
S G A+D + ++ G + GC++ ++S + S +G+MG++R +S +
Sbjct: 120 SSEGNLASDTFHMGASDIPG------MVFGCMDSVFSSNSDEDSKNTGLMGMNRGSLSFV 173
Query: 276 TRTNTSYFSYCLPSPYGSTGYITFGKTD 303
++ FSYC+ S +G + G+++
Sbjct: 174 SQMGFPKFSYCI-SGTDFSGMLLLGESN 200
>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
Length = 449
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 72/238 (30%), Positives = 107/238 (44%), Gaps = 23/238 (9%)
Query: 57 PDK-ASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRT 115
PD A+L+V +GPCS L G + APS L R SR L +
Sbjct: 37 PDAGATLQVSHAFGPCSPL--GAESAAPSWAGFLADQAAR---DASRLLY--LDSLAVKG 89
Query: 116 EAFTFPANINDTVADEYYIVVA-IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFF 174
A+ A+ + Y+V A +G P Q + L +DT +D W C C C PF
Sbjct: 90 RAYAPIASGRQLLQTPTYVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGC-PTSSPFN 148
Query: 175 YASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQE 234
A+ S ++ +PC S C + P + N+K C F++ YAD S + D + +
Sbjct: 149 PAA-SASYRPVPCGSPQCVLAPN--PSCSPNAKSCGFSLSYADSSLQAAL-SQDTLAVAG 204
Query: 235 ANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
Y GC+ ++G + G++GL R P+S +++T Y FSYCLPS
Sbjct: 205 DVVKAY------TFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPS 256
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 29/111 (26%), Positives = 49/111 (44%), Gaps = 8/111 (7%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
LR +R+ A DTCY+ TV P + + F G+ + L ++
Sbjct: 342 LRDEVRRRVGAGAAAVSSLGGFDTCYN----TTVAWPPVTLLF-DGMQVTLPEENVVIHT 396
Query: 368 SVSQV-CLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ CL A P N++ + ++QQ+ H V +DV R+GF +C+
Sbjct: 397 TYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESCT 447
>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 459
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 59/212 (27%), Positives = 96/212 (45%), Gaps = 23/212 (10%)
Query: 91 QDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYY------------IVVAI 138
+D+ + LKNS KR A + + AD+ Y + +I
Sbjct: 57 KDRAKRMLKNSNARFDYVQAISKRNSAVVDYDGGDTSAADDAYEASLLSELCTFLVNFSI 116
Query: 139 GEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRES 198
G+P ++DTGS +TW QC+PCI+C QQ+ P + S S T + R
Sbjct: 117 GQPPVPQYAVMDTGSSLTWIQCEPCINCHQQKGPLYNPSSSST------YVSCSDFDRTD 170
Query: 199 FPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSS--- 255
F + +C ++ YAD + + G +A +++ E +G + + GC +N++
Sbjct: 171 TTFTATHGSDCNYSQTYADKTTTRGTYAREQLLF-ETPDDGITIMHDVIFGCGHNNTQLP 229
Query: 256 GDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
G ASG+ GL S SII++ FSYC+
Sbjct: 230 GPTGYASGVFGLGDSGSSIISKLGFG-FSYCI 260
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 79/174 (45%), Gaps = 19/174 (10%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRD-----PFFYASKSKTFFKIP 186
YY V +G P + ++ +DTGSD+ W C C +C Q FF S T IP
Sbjct: 78 YYTKVKMGTPPKEFNVQIDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAALIP 137
Query: 187 CNSTSCRILRESFPFGNCNSK--ECPFNIQYADGSGSGGFWATDRI--TIQEANSNGYFT 242
C+ C R C+ + +C + QY DGSG+ G++ +D + ++ +
Sbjct: 138 CSDPICT-SRVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFSLIMGQPPAVNS 196
Query: 243 RYPFLLGCINNSSGDKS----GASGIMGLDRSPVSIITRTNT-----SYFSYCL 287
+ GC + SGD + GI G P+S++++ ++ FS+CL
Sbjct: 197 SATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCL 250
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 53/189 (28%), Positives = 92/189 (48%), Gaps = 27/189 (14%)
Query: 110 EFLKRTEAFTFPANINDTVADEYYIVV----AIGEPKQYVSLLLDTGSDVTWTQCKPCIH 165
+ LK+T + + +N+ V Y+V +IGEP ++DTGS +TW C PC
Sbjct: 70 KILKQTFSNDYISNL---VPSPRYVVFLMNFSIGEPPIPQLAVMDTGSSLTWVMCHPCSS 126
Query: 166 CFQQRDPFFYASKSKTFFKIPCNS-TSCRILRESFPFGNCNSKECPFNIQYADGSGSGGF 224
C QQ P F SKS T+ + C+ C ++ + ECP++++Y S G
Sbjct: 127 CSQQSVPIFDPSKSSTYSNLSCSECNKCDVV----------NGECPYSVEYVGSGSSQGI 176
Query: 225 WATDRITIQEANSNGYFTRYPFLL-GC-----INNSSGDKSGASGIMGLDRSPVSIITRT 278
+A +++T++ + + + P L+ GC I+++ G +G+ GL S++
Sbjct: 177 YAREQLTLETIDES--IIKVPSLIFGCGRKFSISSNGYPYQGINGVFGLGSGRFSLLPSF 234
Query: 279 NTSYFSYCL 287
FSYC+
Sbjct: 235 GKK-FSYCI 242
>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 444
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 54/187 (28%), Positives = 88/187 (47%), Gaps = 15/187 (8%)
Query: 118 FTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCK--PCIHCFQQRDPFFY 175
+TF +N ++A + + IG P Q L+LDTGS ++W QC F
Sbjct: 69 YTFRSNFKYSMA--LILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFD 126
Query: 176 ASKSKTFFKIPCNSTSC--RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQ 233
S S +F +PC+ C RI + P +++ C ++ YADG+ + G ++ T
Sbjct: 127 PSLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFS 186
Query: 234 EANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGS 293
+ T P +LGC S+ K GI+G++ +S I++ S FSYC+P+
Sbjct: 187 NSQ-----TTPPLILGCAKESTDVK----GILGMNLGRLSFISQAKISKFSYCIPTRSNR 237
Query: 294 TGYITFG 300
G + G
Sbjct: 238 PGLASTG 244
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/157 (29%), Positives = 74/157 (47%), Gaps = 17/157 (10%)
Query: 125 NDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
+D + + YY + IG P Q +L++D+GS VT+ C C C +DP F S ++
Sbjct: 81 DDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYS 140
Query: 184 KIPCN-STSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFT 242
+ CN +C + + K+C + QYA+ S S G D ++ +
Sbjct: 141 PVKCNVDCTC----------DSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESE---LK 187
Query: 243 RYPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITR 277
+ GC N+ +GD A GIMGL R +SI+ +
Sbjct: 188 AQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQ 224
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 33/105 (31%), Positives = 51/105 (48%), Gaps = 3/105 (2%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY + V +G P + +++DTGSD+ W QC PC+ CF Q P F + S ++ + C
Sbjct: 150 EYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFDQVGPVFDPAASSSYRNVTCGDQ 209
Query: 191 SCRILRESFPFGNC---NSKECPFNIQYADGSGSGGFWATDRITI 232
C ++ P C CP+ Y D S + G A + T+
Sbjct: 210 RCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTV 254
Score = 46.6 bits (109), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 48/111 (43%), Gaps = 2/111 (1%)
Query: 306 NTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+R AF RM + +L CY++S + VP++++ F G + +
Sbjct: 406 QVIRQAFIDRMGRSYPLIPDFPVLSPCYNVSGVDRPEVPELSLLFADGAVWDFPAENYFI 465
Query: 366 VASVSQV-CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ CL P SI +GN QQ+ V YD+ RLGF P C+
Sbjct: 466 RLDPDGIMCLAVLGTPRTGMSI-IGNFQQQNFHVVYDLKNNRLGFAPRRCA 515
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 80/377 (21%), Positives = 138/377 (36%), Gaps = 89/377 (23%)
Query: 121 PANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRD-----PFFY 175
P + + Y+ V +G P ++ +DTGSD+ W C C +C FF
Sbjct: 94 PYLVGSKMTMLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFD 153
Query: 176 ASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQE- 234
A S T + C+ C + ++ + +C ++ +Y DGSG+ G++ TD
Sbjct: 154 APGSLTAGSVTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAI 213
Query: 235 ------ANSNGYFTRYPFLLGCINNSSGDKS----GASGIMGLDRSPVSIITRTNTS--- 281
ANS+ P + GC SGD + GI G + +S++++ ++
Sbjct: 214 LGESLVANSSA-----PIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGIT 268
Query: 282 --YFSYCLPSPYGSTGYITFGKTDT----------------VNTLRSAFHKRMKKYKKA- 322
FS+CL G G+ +N L + +M A
Sbjct: 269 PPVFSHCLKGDGSGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAV 328
Query: 323 --------------KGLEDLLDTCYDL---------SAYETVVV---------------- 343
L L+ YDL S T ++
Sbjct: 329 FEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQCYLVSTSISDM 388
Query: 344 -PKIAIHFLGGVDLELDVRGTL----VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEV 398
P ++++F GG + L + L + S C+GF P + LG++ +
Sbjct: 389 FPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQT--ILGDLVLKDKVF 446
Query: 399 HYDVAGRRLGFGPGNCS 415
YD+A +R+G+ +CS
Sbjct: 447 VYDLARQRIGWASYDCS 463
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 51/188 (27%), Positives = 90/188 (47%), Gaps = 15/188 (7%)
Query: 115 TEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFF 174
T +F N+ TV+ + +G P Q V+++LDTGS+++W CK Q + F
Sbjct: 59 TRKVSFYHNVTLTVS------LTVGTPPQSVTMVLDTGSELSWLHCKKQ----QNINSVF 108
Query: 175 YASKSKTFFKIPCNSTSCRILRESF--PFGNCNSKECPFNIQYADGSGSGGFWATDRITI 232
S ++ IPC S C+ F P ++ C + YAD + G A+D I
Sbjct: 109 NPHLSSSYTPIPCMSPICKTRTRDFLIPVSCDSNNLCHVTVSYADFTSLEGNLASDTFAI 168
Query: 233 QEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYG 292
+ G + + ++++ + S +G+MG++R +S +T+ FSYC+ S
Sbjct: 169 SGSGQPGII--FGSMDSGFSSNANEDSKTTGLMGMNRGSLSFVTQMGFPKFSYCI-SGKD 225
Query: 293 STGYITFG 300
++G + FG
Sbjct: 226 ASGVLLFG 233
>gi|255685722|gb|ACU28350.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
Length = 91
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 51/101 (50%), Gaps = 15/101 (14%)
Query: 136 VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRIL 195
+ IG P +LDTGS++ WTQC PC+HC+ Q+ P F SKS TF + CN+
Sbjct: 3 LQIGTPPFEXEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFKETRCNT------ 56
Query: 196 RESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEAN 236
CP+ J Y D S + G AT+ +TI +
Sbjct: 57 ---------PBHSCPYKJVYDDKSYTXGTLATETVTIHSTS 88
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/157 (29%), Positives = 74/157 (47%), Gaps = 17/157 (10%)
Query: 125 NDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
+D + + YY + IG P Q +L++D+GS VT+ C C C +DP F S ++
Sbjct: 80 DDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCSSCEQCGNHQDPRFQPDLSSSYS 139
Query: 184 KIPCN-STSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFT 242
+ CN +C + + K+C + QYA+ S S G D ++ +
Sbjct: 140 PVKCNVDCTC----------DSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESE---LK 186
Query: 243 RYPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITR 277
+ GC N+ +GD A GIMGL R +SI+ +
Sbjct: 187 PQHAIFGCENSETGDLFSQHADGIMGLGRGQLSIMDQ 223
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 53/192 (27%), Positives = 87/192 (45%), Gaps = 25/192 (13%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFFKIP 186
Y+ + +G P + + +DTGSD+ W CKPC C R F + S T K+
Sbjct: 74 YFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVG 133
Query: 187 CNSTSCRILRESFPFGNCN-SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C+ C + +S +C + C ++I YAD S S G + D +T+++ G P
Sbjct: 134 CDDDFCSFISQS---DSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQV--TGDLKTGP 188
Query: 246 F----LLGCINNSSGD----KSGASGIMGLDRSPVSIITRTNTS-----YFSYCLPSPYG 292
+ GC ++ SG S G+MG +S S++++ + FS+CL + G
Sbjct: 189 LGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKG 248
Query: 293 STGYITFGKTDT 304
G G D+
Sbjct: 249 G-GIFAVGVVDS 259
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/158 (31%), Positives = 74/158 (46%), Gaps = 19/158 (12%)
Query: 125 NDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
+D + + YY + IG P Q +L++DTGS VT+ C C C + +DP F S T+
Sbjct: 76 DDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQ 135
Query: 184 KIPCNSTSCRILRESFPFGNCNSK--ECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
+ C + C NC+S +C + QYA+ S S G D I+ +
Sbjct: 136 PVKC-TIDC----------NCDSDRMQCVYERQYAEMSTSSGVLGEDLISF---GNQSEL 181
Query: 242 TRYPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITR 277
+ GC N +GD A GIMGL R +SI+ +
Sbjct: 182 APQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQ 219
>gi|449459186|ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 418
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 53/178 (29%), Positives = 77/178 (43%), Gaps = 17/178 (9%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCK-PCIHCFQQRDPFFYASKSKTFFKIPCNST 190
Y + + +G+P + L DTGSD+TW QC PC C + P + S +PC
Sbjct: 57 YNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDL----VPCKDP 112
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C L S N +C + ++YADG S G D + N + R LGC
Sbjct: 113 LCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPR--LALGC 170
Query: 251 INNSSGDKSG---ASGIMGLDRSPVSIITRTNT-----SYFSYCLPSPYGSTGYITFG 300
+ S GI+GL R VSI+++ + + +C S G GY+ FG
Sbjct: 171 GYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGG--GYLFFG 226
>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 414
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 53/170 (31%), Positives = 84/170 (49%), Gaps = 26/170 (15%)
Query: 140 EPKQYVSLLLDTGSD-VTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRES 198
+P +L + D +TWTQCKPC+ C + F S S T+ C ++
Sbjct: 82 QPPSPQEILAEMNPDSITWTQCKPCVRCLKDSHRHFDPSASLTYSLGSCIPST------- 134
Query: 199 FPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD- 257
GN +N+ Y D S S G + D +T++ ++ F ++ F GC N+ GD
Sbjct: 135 --VGNT------YNMTYGDKSTSVGNYGCDTMTLEPSD---VFPKFQF--GCGRNNEGDF 181
Query: 258 KSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGSTGYITFGKTDT 304
SGA G++GL + +S +++T + + FSYCLP S G + FG+ T
Sbjct: 182 GSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPEE-DSIGSLLFGEKAT 230
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 55/103 (53%), Gaps = 6/103 (5%)
Query: 306 NTLRSAFHKRMKKYKKAKGLE---DLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRG 362
+ L +AF K M KY + G D+LDTCY+LS + V++P+I +HF G D+ L+ +
Sbjct: 298 SALTAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKR 357
Query: 363 TLVVASVSQVCLGFATYPP---DPNSITLGNVQQRGHEVHYDV 402
+ S++CL FA + +GN QQ V YD+
Sbjct: 358 VIWGNDASRLCLAFAGNSKSTMNSELTIIGNRQQVSLTVLYDI 400
>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
Length = 519
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 71/279 (25%), Positives = 109/279 (39%), Gaps = 60/279 (21%)
Query: 86 EEILRQDQQRLHLKNSRRLRKP-------------FPEFLKRTEAFTFPANIND-TVADE 131
+E+ R DQ+R S R+ EAF P + T +
Sbjct: 47 DEVARMDQERTAFICSHARRRATEAGDAKHKAKAKAKGAPAADEAFAMPLSSGAYTGTGQ 106
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIH-------------------------- 165
Y++ +G P + L+ DTGSD+TW +C H
Sbjct: 107 YFVRFRVGTPARPFLLVADTGSDLTWVKCHRHDHDAPAPGYGYAAPASNDSSTSSLSAAA 166
Query: 166 -CFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPF--GNCNS--KECPFNIQYADGSG 220
F +S+T+ IPC+S +C S PF C + C ++ +Y DGS
Sbjct: 167 ASSSSHARVFRPDRSRTWAPIPCSSDTCT---ASLPFSLAACPTPGSPCAYDYRYKDGSA 223
Query: 221 SGGFWATDRITIQEANSNGYFTRYP-----FLLGCINNSSGDKSGAS-GIMGLDRSPVSI 274
+ G TD TI + + +LGC + +GD AS G++ L S +S
Sbjct: 224 ARGTVGTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSFLASDGVLSLGYSNISF 283
Query: 275 ITRTNTSY---FSYCLP---SPYGSTGYITFGKTDTVNT 307
+R + FSYCL +P +T Y+TFG V++
Sbjct: 284 ASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSS 322
Score = 46.2 bits (108), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 49/105 (46%), Gaps = 10/105 (9%)
Query: 320 KKAKGLE----DLLDTCYDLSAYET-----VVVPKIAIHFLGGVDLELDVRGTLVVASVS 370
KK GL D D CY+ ++ T V +P++A+HF G L+ + ++ A+
Sbjct: 415 KKLAGLPRVTMDPFDYCYNWTSPSTGEDLTVAMPELAVHFAGSARLQPPAKSYVIDAAPG 474
Query: 371 QVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
C+G P +GN+ Q+ H +D+ RRL F C+
Sbjct: 475 VKCIGLQEG-EWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRCT 518
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 50/171 (29%), Positives = 79/171 (46%), Gaps = 15/171 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCK-PCI--HCFQQRDPFFYASKSKTFFKIPC 187
+Y IG P Q L+DTGSD+ WTQC C+ C +Q P++ S+S TF +PC
Sbjct: 85 QYIASYLIGSPPQRTEALIDTGSDLIWTQCATTCLPKSCAKQGLPYYNLSQSSTFVPVPC 144
Query: 188 NSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
+ + C F Y G G T+ + ++ F
Sbjct: 145 ADKAGFCAANGVHLCGLDGS-CTFIASYGAGRVIGSL-GTESFAFESGTTSLAF------ 196
Query: 248 LGCIN---NSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTG 295
GC++ +SG + ASG++GL R +S++++ + FSYCL + S+G
Sbjct: 197 -GCVSLTRITSGALNDASGLIGLGRGRLSLVSQIGATRFSYCLTPYFHSSG 246
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 78/362 (21%), Positives = 137/362 (37%), Gaps = 79/362 (21%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFFKIP 186
YY + +G P + + +DTGSDV W C C C FF S T I
Sbjct: 52 YYTRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTASLIS 111
Query: 187 CNSTSCRI-LRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYF--TR 243
C+ C + L+ S + + C +N QY DGSG+ G++ +D + +
Sbjct: 112 CSDQRCSLGLQSSDSVCSAQNNLCGYNFQYGDGSGTSGYYVSDLLHFDTVLGGSVMNNSS 171
Query: 244 YPFLLGCINNSSGDKS----GASGIMGLDRSPVSIITRTNTS-----YFSYCLPSPYGST 294
P + GC +GD + GI G + +S++++ + FS+CL
Sbjct: 172 APIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKGDDSGG 231
Query: 295 GYITFG-----------------------KTDTVNTLRSAFHKRMKKYKKAKG------- 324
G + G ++ +VN A + ++G
Sbjct: 232 GILVLGEIVEPNIVYTPLVPSQPHYNLNMQSISVNGQTLAIDPSVFGTSSSQGTIIDSGT 291
Query: 325 -LEDLLDTCYD--LSAYETVV------------------------VPKIAIHFLGGVDLE 357
L L + YD +SA ++V P+++++F GG +
Sbjct: 292 TLAYLAEAAYDPFISAITSIVSPSVRPYLSKGNHCYLISSSINDIFPQVSLNFAGGASMI 351
Query: 358 LDVRGTLV----VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
L + L+ + + C+GF +I LG++ + YD+A +R+G+ +
Sbjct: 352 LIPQDYLIQQSSIGGAALWCIGFQKIQGQGITI-LGDLVLKDKIFVYDIANQRIGWANYD 410
Query: 414 CS 415
CS
Sbjct: 411 CS 412
>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 441
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 69/252 (27%), Positives = 110/252 (43%), Gaps = 34/252 (13%)
Query: 43 PNVCNRTRTALPQGPDKASLEVVSKYGPCS--RLNQGISTHAPSLEEILRQDQQRLHLKN 100
P+ CN P ++L+V + PCS R ++ +S A ++ ++ +DQ RL +
Sbjct: 28 PSNCN------PAADRSSTLQVFHIFSPCSPFRPSKPLS-WADNVLQMQAKDQARLQFLS 80
Query: 101 SRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQC 160
S R+ F + P + + IG P Q + L LDT +D W C
Sbjct: 81 SLVARRSFVPIASARQLIQSP---------TFVVRAKIGTPAQTLLLALDTSNDAAWIPC 131
Query: 161 KPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSG 220
CI C F + KS +F +PC S C P +C+ C FN+ Y +
Sbjct: 132 SGCIGC--PSTTVFSSDKSSSFRPLPCQSPQC----NQVPNPSCSGSACGFNLTYGSSTV 185
Query: 221 SGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNT 280
+ D +T+ + Y GCI ++G G++GL R P+S++ ++ +
Sbjct: 186 AADL-VQDNLTLATDSVPSY------TFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQS 238
Query: 281 SY---FSYCLPS 289
Y FSYCLPS
Sbjct: 239 LYQSTFSYCLPS 250
Score = 45.4 bits (106), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 29/111 (26%), Positives = 53/111 (47%), Gaps = 9/111 (8%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
+R F +R+ + L DTCY + ++ P I F G+++ L L+ +
Sbjct: 336 VRDEFRRRVGRNVTVSSLGGF-DTCYTVP----IISPTITFMF-AGMNVTLPPDNFLIHS 389
Query: 368 SV-SQVCLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ S CL A P + NS+ + ++QQ+ H + +D+ R+G +CS
Sbjct: 390 TAGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVARESCS 440
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 50/164 (30%), Positives = 76/164 (46%), Gaps = 13/164 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
Y + +G P Q + ++LDT +D W C C C F + S T+ + C++
Sbjct: 29 NYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGC-SNASTSFNTNSSSTYSTVSCSTA 87
Query: 191 SCRILRE-SFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQ-EANSNGYFTRYPFLL 248
C R + P + C FN Y S D +T+ + N F
Sbjct: 88 QCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPDVIPN-------FSF 140
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
GCIN++SG+ G+MGL R P+S++++T + Y FSYCLPS
Sbjct: 141 GCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPS 184
Score = 47.0 bits (110), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 51/109 (46%), Gaps = 8/109 (7%)
Query: 311 AFHKRMKKYKKAKGLEDL--LDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVAS 368
A +K L DTC+ SA V PKI +H + +DL+L + TL+ +S
Sbjct: 269 AIRDEFRKQVNVSSFSTLGAFDTCF--SADNENVAPKITLH-MTSLDLKLPMENTLIHSS 325
Query: 369 VSQV-CLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ CL A + N++ + N+QQ+ + +DV R+G P C
Sbjct: 326 AGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPC 374
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 52/189 (27%), Positives = 84/189 (44%), Gaps = 20/189 (10%)
Query: 128 VADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRD---PF--FYASKSKTF 182
+A Y+ V +G P + +L +DTGSD+ W C PCI C D P + S +
Sbjct: 32 IAGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASS 91
Query: 183 FKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFT 242
K+PC+ SC ++ + G + +C ++ QY DGSG+ G+ D + N+
Sbjct: 92 SKVPCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYM-VNATA--- 147
Query: 243 RYPFLLGCINNSSGDKSGAS----GIMGLDRSPVSIITR-----TNTSYFSYCLPSPYGS 293
+ GC SGD S + GI+G S +S ++ + F++CL
Sbjct: 148 --TVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERG 205
Query: 294 TGYITFGKT 302
G + G
Sbjct: 206 GGILVLGNV 214
>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
Length = 468
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 89/390 (22%), Positives = 143/390 (36%), Gaps = 111/390 (28%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKP---CIHC-FQQRD----PFFYASKSKTFF 183
Y + +++G P Q V L++DTGS + W C C C F D P F S +
Sbjct: 84 YSMSLSLGTPSQTVKLIMDTGSSLVWFPCTSRYVCASCNFPNTDITKIPKFMPRLSSSSK 143
Query: 184 KIPCNSTSCRILRESFPFG--------NCN--SKEC-----PFNIQYADGSGSGGFWAT- 227
I C + C ++ FG NCN ++ C P+ IQY GS +G +
Sbjct: 144 LIGCKNPKC-----AWVFGSSVQSKCHNCNPQAQNCTQACPPYIIQYGLGSTAGLLLSET 198
Query: 228 ---------------DRITIQEANSNGYFTR----YPFLLG------CINNSSGDKSGAS 262
++ ++ F R P LG C+ + D S S
Sbjct: 199 INFPNKTISDFLAGCSLLSTRQPEGIAGFGRSQESLPLQLGLKKFSYCLVSRRFDDSPVS 258
Query: 263 GIMGLDRSPVS----------------IITRTNTSY---------------------FSY 285
+ LD P + + +++N ++ +S+
Sbjct: 259 SDLILDMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKIIVGKTHVKVPYSF 318
Query: 286 CLPSPYGSTGYI-------TFGKTDTVNTLRSAFHKRMKKYKKAKGLEDL--LDTCYDLS 336
+P G+ G I TF + L F K+M Y A ++ L L C+D+S
Sbjct: 319 LVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMANYTVATNVQKLTGLRPCFDIS 378
Query: 337 AYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCL-----------GFATYPPDPNS 385
++VV+P + F GG ++L + + VCL G +
Sbjct: 379 GEKSVVIPDLTFQFKGGAKMQLPLSNYFAFVDMGVVCLTIVSDNAAALGGDGGVRSSGPA 438
Query: 386 ITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
I LGN QQ+ + YD+ R GF +C+
Sbjct: 439 IILGNFQQQNFYIEYDLENDRFGFKEQSCA 468
>gi|125532795|gb|EAY79360.1| hypothetical protein OsI_34488 [Oryza sativa Indica Group]
Length = 342
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 50/165 (30%), Positives = 72/165 (43%), Gaps = 44/165 (26%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y + IG P Q S ++ + WTQC PC CF+Q P F + +T F
Sbjct: 28 YMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLFNRYEVETMF-------- 79
Query: 192 CRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCI 251
D SG GG TD I A ++ F GC
Sbjct: 80 ------------------------GDTSGIGG---TDTFAIGTATASLAF-------GCA 105
Query: 252 NNSSGDK-SGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTG 295
+S+ + GASG++GL R+P S++ + N + FSYCL +P+G+ G
Sbjct: 106 MDSNIKQLLGASGVVGLGRTPWSLVGQMNATAFSYCL-APHGAAG 149
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/157 (29%), Positives = 74/157 (47%), Gaps = 17/157 (10%)
Query: 125 NDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
+D + + YY + IG P Q +L++D+GS VT+ C C C +DP F S ++
Sbjct: 81 DDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYS 140
Query: 184 KIPCN-STSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFT 242
+ CN +C + + K+C + QYA+ S S G D ++ +
Sbjct: 141 PVKCNVDCTC----------DSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESE---LK 187
Query: 243 RYPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITR 277
+ GC N+ +GD A GIMGL R +SI+ +
Sbjct: 188 PQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQ 224
>gi|222613193|gb|EEE51325.1| hypothetical protein OsJ_32293 [Oryza sativa Japonica Group]
Length = 371
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 49/148 (33%), Positives = 65/148 (43%), Gaps = 15/148 (10%)
Query: 147 LLLDTGSDVT---WT----QCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESF 199
LL D G V W+ C CIHCF+Q P F + S TF PC + C+ S
Sbjct: 35 LLADGGGAVVPFHWSPELYNCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCK----SI 90
Query: 200 PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKS 259
P C S C ++ G + G ATD I A R P +S +
Sbjct: 91 PTPKCASDVCAYDGVTGLGGHTVGIVATDTFAIGTAAP----ARPPASGASWRATSTPWA 146
Query: 260 GASGIMGLDRSPVSIITRTNTSYFSYCL 287
G SG +GL R+P S++ + + FSYCL
Sbjct: 147 GPSGFIGLGRTPWSLVAQMKLTRFSYCL 174
>gi|3641868|emb|CAA09458.1| hypothetical protein [Cicer arietinum]
Length = 110
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 59/109 (54%), Gaps = 4/109 (3%)
Query: 307 TLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV- 365
++R AF + + + A+G+ + DTCYDLS+ +V VP ++ HF +L + L+
Sbjct: 5 SVRDAFKRLTQNLRSAEGVA-IFDTCYDLSSLRSVRVPTVSFHFGNDRVWDLPAKNYLIP 63
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V S C FA P + +GNVQQ+G V +D+A +GF P C
Sbjct: 64 VDSDGTFCFAFA--PTSSSLSIIGNVQQQGTRVSFDIANSLVGFSPNKC 110
>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
Length = 494
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 59/209 (28%), Positives = 91/209 (43%), Gaps = 48/209 (22%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIH---------------CFQQRDPFFY 175
+Y++ +G P Q L+ DTGSD+TW +C+ F
Sbjct: 109 QYFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPAAAPSPAVAPPRVFR 168
Query: 176 ASKSKTFFKIPCNSTSCRILRESFPF--GNCNSK--ECPFNIQYADGSGSGGFWATDRIT 231
SKT+ IPC+S +C + + PF NC+S C ++ +Y D S + G TD T
Sbjct: 169 PGDSKTWSPIPCSSETC---KSTIPFSLANCSSSTAACSYDYRYNDNSAARGVVGTDSAT 225
Query: 232 I-------------QEANSNGYFTRYPFLLGCINNSSGDKSGAS-GIMGLDRSPVSIITR 277
+ ++A G +LGC +G AS G++ L S +S +R
Sbjct: 226 VALSGGRGGGGGGDRKAKLQG------VVLGCTTAHAGQGFEASDGVLSLGYSNISFASR 279
Query: 278 TNTSY---FSYCLP---SPYGSTGYITFG 300
+ + FSYCL +P +T Y+TFG
Sbjct: 280 AASRFGGRFSYCLVDHLAPRNATSYLTFG 308
Score = 39.3 bits (90), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 43/93 (46%), Gaps = 5/93 (5%)
Query: 327 DLLDTCYDLSAYET----VVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPD 382
D D CY+ +A + VPK+A+ F G LE + ++ A+ C+G
Sbjct: 402 DPFDYCYNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKSYVIDAAPGVKCIGVQE-GAW 460
Query: 383 PNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
P +GN+ Q+ H +D+ R L F +C+
Sbjct: 461 PGVSVIGNILQQEHLWEFDLNNRWLRFRQTSCT 493
>gi|15219354|ref|NP_175079.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12320825|gb|AAG50556.1|AC074228_11 nucellin, putative [Arabidopsis thaliana]
gi|332193902|gb|AEE32023.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 405
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 89/386 (23%), Positives = 153/386 (39%), Gaps = 87/386 (22%)
Query: 108 FPEFLKRT-EAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIH 165
F F+K + + FP + N Y +++ IG P + +DTGSD+TW QC PC
Sbjct: 24 FKTFIKSSPSSVVFPLSGNVFPLGYYSVLMQIGSPPKAFQFDIDTGSDLTWVQCDAPCSG 83
Query: 166 CFQQRDPFFYASKSKTFFKIPCNSTSCRILR-ESFPFGNCNSKECPFNIQYADGSGSGGF 224
C + Y K IPC++ C L + P ++C + ++YAD S G
Sbjct: 84 CTLPPN-LQYKPKGNI---IPCSNPICTALHWPNKPHCPNPQEQCDYEVKYADQGSSMGA 139
Query: 225 WATDRITIQEANSNGYFTRYPFLLGCINN----SSGDKSGASGIMGLDRSPVSIITRTNT 280
TD+ ++ NG F + P GC + S+ +G++GL R + ++T+ +
Sbjct: 140 LVTDQFPLKLV--NGSFMQPPVAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLVS 197
Query: 281 SYFS-----YCLPSPYGSTGYITFGKTDTVNTLRSAFHKRMK----------------KY 319
+ + +CL S G G++ FG + V ++ A+ + K
Sbjct: 198 AGLTRNVVGHCLSSKGG--GFLFFGD-NLVPSIGVAWTPLLSQDNHYTTGPADLLFNGKP 254
Query: 320 KKAKGLEDLLDTC-----YDLSAYETVV-------------------------------- 342
KGL+ + DT ++ AY+T++
Sbjct: 255 TGLKGLKLIFDTGSSYTYFNSKAYQTIINLIGNDLKVSPLKVAKEDKTLPICWKGAKPFK 314
Query: 343 --------VPKIAIHFLGG---VDLELDVRGTLVVASVSQVCLGF--ATYPPDPNSITLG 389
I I+F G L L L+V+ VCLG + NS +G
Sbjct: 315 SVLEVKNFFKTITINFTNGRRNTQLYLAPELYLIVSKTGNVCLGLLNGSEVGLQNSNVIG 374
Query: 390 NVQQRGHEVHYDVAGRRLGFGPGNCS 415
++ +G + YD ++LG+ +C+
Sbjct: 375 DISMQGLMMIYDNEKQQLGWVSSDCN 400
>gi|222631382|gb|EEE63514.1| hypothetical protein OsJ_18330 [Oryza sativa Japonica Group]
Length = 464
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 51/163 (31%), Positives = 74/163 (45%), Gaps = 27/163 (16%)
Query: 148 LLDTGSDVTWTQCKPC----------IHCFQQRDPFFYASKSKTFFKIPCNSTS---CRI 194
++DTGSD+ WTQC C CF Q P++ S S+T +PC+ C +
Sbjct: 77 VVDTGSDLVWTQCSTCRLPAVAAAGGGGCFPQNLPYYNFSLSRTARAVPCDDDDGALCGV 136
Query: 195 LRESFPF---GNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCI 251
E+ G C Y G G TD T ++S GC+
Sbjct: 137 APETAGCARGGGSGDDACVVAASYGAGVALG-VLGTDAFTFPSSSS------VTLAFGCV 189
Query: 252 NN---SSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPY 291
+ S G +GASGI+GL R +S++++ N + FSYCL +PY
Sbjct: 190 SQTRISPGALNGASGIIGLGRGALSLVSQLNATEFSYCL-TPY 231
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 74/156 (47%), Gaps = 19/156 (12%)
Query: 125 NDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
+D + + YY + IG P Q +L++DTGS VT+ C C C + +DP F S T+
Sbjct: 5 DDLLINGYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSSCEQCGRHQDPKFQPDLSSTYQ 64
Query: 184 KIPCNSTSCRILRESFPFGNCN--SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
+ CN C NC+ ++C + QYA+ S S G D I+ ++
Sbjct: 65 SVKCN-IDC----------NCDDEKQQCVYERQYAEMSTSSGVLGEDIISFGNLSA---L 110
Query: 242 TRYPFLLGCINNSSGD--KSGASGIMGLDRSPVSII 275
+ GC N +GD A GIMG+ R +SI+
Sbjct: 111 APQRAVFGCENMETGDLYSQHADGIMGMGRGDLSIV 146
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 86/164 (52%), Gaps = 14/164 (8%)
Query: 102 RRLRKPFPEFLKRTEAF-TFPANINDTVAD------EYYIVVAIGEPKQYVSLLLDTGSD 154
+RL+K F + R F A+ ND +D Y + +++G P + + DTGSD
Sbjct: 57 QRLQKAFRRSILRGNHFRAMRASPNDIQSDVISGGGAYLMNISLGTPPVPMLGIADTGSD 116
Query: 155 VTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKE-CPFNI 213
+ W QC PC +C++Q +P F +S+T+ + C++ C+ L + G+C+ C ++
Sbjct: 117 LIWRQCLPCPNCYEQVEPLFDPKESETYKTLDCDNEFCQDLGQQ---GSCDDDNTCTYSY 173
Query: 214 QYADGSGSGGFWATDRITIQEANSNGYFTRYPFL-LGCINNSSG 256
Y D S + G ++D +TI ++ G +P + GC +++ G
Sbjct: 174 SYGDRSYTRGDLSSDTLTI--GSTEGDPASFPGIAFGCGHDNGG 215
>gi|325190367|emb|CCA24840.1| aspartyl protease family A01B putative [Albugo laibachii Nc14]
Length = 603
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 56/201 (27%), Positives = 86/201 (42%), Gaps = 27/201 (13%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
YYI + IG P Q SLLLDT S T CK C+ C DP++ +KS+T C + +
Sbjct: 120 YYIDLYIGIPLQKASLLLDTTSQHTVFPCKNCVACADHMDPYYDIAKSQTSNFTKCGAEN 179
Query: 192 CRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEAN------SNGYFTR-- 243
+C ++C Y+DGS G D + + ++G
Sbjct: 180 V--------CNSCEDEKCRVEQSYSDGSFWSGLVVEDLVWVASPKTGDIEMTSGIIRNFG 231
Query: 244 YPFLLGCINNSSG--DKSGASGIMGLDRSPVSIIT------RTNTSYFSYCLPSPYGSTG 295
+P C + G + +GI+GLDRS SI+ R + FSYCL + + G
Sbjct: 232 FPMRFACETSEDGIFSQQRENGILGLDRSNHSILNFMVQAKRIDHRIFSYCL---HDTGG 288
Query: 296 YITFGKTDTVNTLRSAFHKRM 316
G D+++ + R+
Sbjct: 289 TFVLGGFDSMHHTSDMIYTRI 309
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 56/179 (31%), Positives = 84/179 (46%), Gaps = 16/179 (8%)
Query: 138 IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRE 197
IG P + V LL+DT S++TW Q C +C + P F S +F PC S+ C + R
Sbjct: 5 IGTPPREVLLLVDTASELTWVQGTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVC-LGRS 63
Query: 198 SFPFGN-CN--SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNS 254
F + CN + C F + Y DGS + G A + ++Q + T + GC +
Sbjct: 64 KLGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAAS-TLGDVIFGCASKD 122
Query: 255 -SGDKSGASGIMGLDRS----PVSIITRTNTSY---FSYCLPS---PYGSTGYITFGKT 302
+SG +GL+R P I +R+ + FSYC P+ S+G I FG +
Sbjct: 123 LQRPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSSGVIIFGDS 181
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/116 (28%), Positives = 53/116 (45%), Gaps = 8/116 (6%)
Query: 307 TLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVV--VPKIAIHFLGGVDLELDVRGTL 364
L AF +R+ + G + + CYD++A + + P + +HF VD+EL
Sbjct: 255 ALVEAFGRRVLHLNRTSGSDFTKELCYDVAAGDARLPTAPLVTLHFKNNVDMELREASVW 314
Query: 365 V----VASVSQVCLGF--ATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
V V +CL F A +GN QQ+ + + +D+ R+GF P NC
Sbjct: 315 VPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQQQDYLIEHDLERSRIGFAPANC 370
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 77/365 (21%), Positives = 133/365 (36%), Gaps = 89/365 (24%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRD-----PFFYASKSKTFFKIP 186
Y+ V +G P ++ +DTGSD+ W C C +C FF A S T +
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 159
Query: 187 CNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQE-------ANSNG 239
C+ C + ++ + +C ++ +Y DGSG+ G++ TD ANS+
Sbjct: 160 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 219
Query: 240 YFTRYPFLLGCINNSSGDKS----GASGIMGLDRSPVSIITRTNTS-----YFSYCLPSP 290
P + GC SGD + GI G + +S++++ ++ FS+CL
Sbjct: 220 -----PIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGD 274
Query: 291 YGSTGYITFGKTDT----------------VNTLRSAFHKRMKKYKKA------------ 322
G G+ +N L + +M A
Sbjct: 275 GSGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIV 334
Query: 323 ---KGLEDLLDTCYDL--------------------------SAYETVVVPKIAIHFLGG 353
L L+ YDL S + + P ++++F GG
Sbjct: 335 DTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGG 394
Query: 354 VDLELDVRGTL----VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGF 409
+ L + L + S C+GF P + LG++ + YD+A +R+G+
Sbjct: 395 ASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEE--QTILGDLVLKDKVFVYDLARQRIGW 452
Query: 410 GPGNC 414
+C
Sbjct: 453 ASYDC 457
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/157 (29%), Positives = 73/157 (46%), Gaps = 17/157 (10%)
Query: 125 NDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
+D + + YY + IG P Q +L++D+GS VT+ C C C +DP F S T+
Sbjct: 80 DDLLTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYS 139
Query: 184 KIPCN-STSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFT 242
+ CN +C + + +C + QYA+ S S G D ++ +
Sbjct: 140 PVKCNVDCTC----------DSDKNQCTYERQYAEMSSSSGVLGEDIVSF---GTESELK 186
Query: 243 RYPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITR 277
+ GC N+ +GD A GIMGL R +SI+ +
Sbjct: 187 PQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQ 223
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 71/156 (45%), Gaps = 15/156 (9%)
Query: 125 NDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
+D + YY V IG P SL++DTGS VT+ C C HC +DP F + S ++
Sbjct: 27 DDLLTKGYYTSRVKIGTPPHEFSLIVDTGSTVTYVPCSSCTHCGNHQDPRFSPALSSSYK 86
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
+ C S C G C+ + QYA+ S S G D I ++ G
Sbjct: 87 PLECGS-ECST-------GFCDGSR-KYQRQYAEKSTSSGVLGKDVIGFSNSSDLG---G 134
Query: 244 YPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITR 277
+ GC +GD A GI+GL R P+SII +
Sbjct: 135 QRLVFGCETAETGDLYDQTADGIIGLGRGPLSIIDQ 170
>gi|357159746|ref|XP_003578546.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 530
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 56/199 (28%), Positives = 89/199 (44%), Gaps = 24/199 (12%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDP--------FFYASKSKTFF 183
+Y VVA+G P + LDTGSD+ W C CI C P + KS T
Sbjct: 99 HYAVVALGTPNVTFLVALDTGSDLFWVPCD-CIKCAPLASPDYGDLKFDMYSPRKSSTSR 157
Query: 184 KIPCNSTSCRILRESFPFGNCN--SKECPFNIQY-ADGSGSGGFWATDRITIQEANSNGY 240
K+PC+S+ C P +C+ S CP++IQY ++ + S G D + + +
Sbjct: 158 KVPCSSSLCD------PQADCSAASNSCPYSIQYLSENTSSKGVLVEDVLYLTTESGQSK 211
Query: 241 FTRYPFLLGCINNSSGDKSGAS---GIMGL---DRSPVSIITRTNTSYFSYCLPSPYGST 294
T+ P GC SG G++ G++GL +S S++ + S+ +
Sbjct: 212 ITQAPITFGCGQVQSGSFLGSAAPNGLLGLGMDSKSVPSLLASKGIAANSFSMCFGEDGH 271
Query: 295 GYITFGKTDTVNTLRSAFH 313
G I FG T + + L + +
Sbjct: 272 GRINFGDTGSSDQLETPLN 290
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 53/192 (27%), Positives = 87/192 (45%), Gaps = 25/192 (13%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFFKIP 186
Y+ + +G P + + +DTGSD+ W CKPC C R F + S T K+
Sbjct: 74 YFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVG 133
Query: 187 CNSTSCRILRESFPFGNCN-SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C+ C + +S +C + C ++I YAD S S G + D +T+++ G P
Sbjct: 134 CDDDFCSFISQS---DSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQV--TGDLKTGP 188
Query: 246 F----LLGCINNSSGD----KSGASGIMGLDRSPVSIITRTNTS-----YFSYCLPSPYG 292
+ GC ++ SG S G+MG +S S++++ + FS+CL + G
Sbjct: 189 LGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKG 248
Query: 293 STGYITFGKTDT 304
G G D+
Sbjct: 249 G-GIFAVGVVDS 259
>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
gi|219888509|gb|ACL54629.1| unknown [Zea mays]
gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
Length = 415
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 54/183 (29%), Positives = 86/183 (46%), Gaps = 21/183 (11%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNS 189
YY+ + IG P + L +DTGSD+TW QC PC C + P + + ++ +PC +
Sbjct: 52 HYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANRL---VPCAN 108
Query: 190 TSCRILRESFPFGN-CNS-KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C L N C S K+C + I+Y D + S G D ++ +SN R
Sbjct: 109 ALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSN---IRPGLT 165
Query: 248 LGCINNSSGDKSGA-----SGIMGLDRSPVSIITR-----TNTSYFSYCLPSPYGSTGYI 297
GC + K+GA G++GL R VS++++ + +CL + G G++
Sbjct: 166 FGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGG--GFL 223
Query: 298 TFG 300
FG
Sbjct: 224 FFG 226
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 46/157 (29%), Positives = 73/157 (46%), Gaps = 17/157 (10%)
Query: 125 NDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
+D + + YY + IG P Q +L++D+GS VT+ C C C +DP F S T+
Sbjct: 80 DDLLTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYS 139
Query: 184 KIPCN-STSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFT 242
+ CN +C + + +C + QYA+ S S G D ++ +
Sbjct: 140 PVKCNVDCTC----------DSDKNQCTYERQYAEMSSSSGVLGEDIVSF---GTESELK 186
Query: 243 RYPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITR 277
+ GC N+ +GD A GIMGL R +SI+ +
Sbjct: 187 PQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQ 223
>gi|449533387|ref|XP_004173657.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 254
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/166 (30%), Positives = 80/166 (48%), Gaps = 21/166 (12%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTF--------FKI 185
+ + IG P Q L+LDTGS ++W QC ++R P K+ TF +
Sbjct: 69 VSLPIGTPPQPTDLVLDTGSQLSWIQCH--DKKVKKRLPPLPKPKTATFDPSLSSSFSLL 126
Query: 186 PCNSTSC--RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
PCN C RI + P ++ C ++ YADG+ + G ++ T + S
Sbjct: 127 PCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSNSLSTP---- 182
Query: 244 YPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
P +LGC S+ ++ GI+G++ +S I++ S FSYC+PS
Sbjct: 183 -PVILGCAQGSTENR----GILGMNHGRLSFISQAKISKFSYCVPS 223
>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 415
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 54/183 (29%), Positives = 86/183 (46%), Gaps = 21/183 (11%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNS 189
YY+ + IG P + L +DTGSD+TW QC PC C + P + + ++ +PC +
Sbjct: 52 HYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANRL---VPCAN 108
Query: 190 TSCRILRESFPFGN-CNS-KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C L N C S K+C + I+Y D + S G D ++ +SN R
Sbjct: 109 ALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSN---IRPGLT 165
Query: 248 LGCINNSSGDKSGA-----SGIMGLDRSPVSIITR-----TNTSYFSYCLPSPYGSTGYI 297
GC + K+GA G++GL R VS++++ + +CL + G G++
Sbjct: 166 FGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGG--GFL 223
Query: 298 TFG 300
FG
Sbjct: 224 FFG 226
>gi|242086418|ref|XP_002443634.1| hypothetical protein SORBIDRAFT_08g022645 [Sorghum bicolor]
gi|241944327|gb|EES17472.1| hypothetical protein SORBIDRAFT_08g022645 [Sorghum bicolor]
Length = 486
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 84/290 (28%), Positives = 121/290 (41%), Gaps = 46/290 (15%)
Query: 37 VSSLLPPNVCNRTRTALPQGPDKASLEVVSKYGPCSRLNQGISTHAPSL--EEILRQDQQ 94
S L P C+ T L D L +V + P S L G+ PSL ++L +D
Sbjct: 54 ASRLPPATTCSSMATGL----DNNKLPIVHRQSPWSPL-HGL----PSLTTADVLHRDTS 104
Query: 95 RLHLKNSRR-----LRKPFPEFLKRTEAFTFPANINDTV-----ADEYYIVVAIGEPKQY 144
+ + + P P L A PAN + A +Y ++V+ G P+Q
Sbjct: 105 LVRRRRRFSSQSSVVAAPTPA-LSPAAATIIPANGSSDPSTLPGALDYIVLVSYGSPEQQ 163
Query: 145 VSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNC 204
+ L T + +CKPC +P F +S TF +PC+S C + NC
Sbjct: 164 FPVFLGTNVGTSLLRCKPCASGSDDCNPAFDTLQSSTFAHVPCSSPDCPV--------NC 215
Query: 205 NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGI 264
+S CPF Y GG +ATD +T+ A S+ + F+ + + S D A G
Sbjct: 216 SSSVCPFYDLYGT---VGGTFATDVLTL--APSSMAVHDFRFVCMDVESPSPDLPEA-GS 269
Query: 265 MGLDR---------SPVSIITRTNTSYFSYCLPSPYGSTGYITFGKTDTV 305
+ L R S S I T S FSYCLP S G+++ G TV
Sbjct: 270 IDLSRHRNSLPSQLSSSSGIAPTAAS-FSYCLPQSRNSQGFLSLGGDATV 318
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 47/109 (43%), Gaps = 17/109 (15%)
Query: 287 LPSPYGSTGY----ITFGKTDTV------NTLRSAFHKRMKKY--KKAKGLEDLLDTCYD 334
LP P G+ G + G T T+ TLR AF K M +Y + + D DTC++
Sbjct: 357 LPIPSGTFGNASTNLDVGATFTMLAPEAYTTLRDAFRKEMSQYNNRSSPAGFDGFDTCFN 416
Query: 335 LSAYETVVVPKIAIHFLGGVDLELDVRGTL-----VVASVSQVCLGFAT 378
+ +VVP + + F G L +D L + CL F++
Sbjct: 417 FTGLNELVVPLVQLKFSNGESLMIDGDQMLYYHDPAAGPFTMACLAFSS 465
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 52/189 (27%), Positives = 84/189 (44%), Gaps = 20/189 (10%)
Query: 128 VADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRD---PF--FYASKSKTF 182
+A Y+ V +G P + +L +DTGSD+ W C PCI C D P + S +
Sbjct: 32 IAGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASS 91
Query: 183 FKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFT 242
K+PC+ SC ++ + G + +C ++ QY DGSG+ G+ D + N+
Sbjct: 92 SKVPCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYM-VNATA--- 147
Query: 243 RYPFLLGCINNSSGDKSGAS----GIMGLDRSPVSIITR-----TNTSYFSYCLPSPYGS 293
+ GC SGD S + GI+G S +S ++ + F++CL
Sbjct: 148 --TVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERG 205
Query: 294 TGYITFGKT 302
G + G
Sbjct: 206 GGILVLGNV 214
>gi|259490398|ref|NP_001159203.1| uncharacterized protein LOC100304289 [Zea mays]
gi|223942623|gb|ACN25395.1| unknown [Zea mays]
Length = 378
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 56/195 (28%), Positives = 88/195 (45%), Gaps = 33/195 (16%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPF--FYASKSKTFFKIPCN 188
+Y++ +G P Q L+ DTGSD+TW +C+ P F AS+S+++ + C+
Sbjct: 13 QYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRASESRSWAPLACS 72
Query: 189 STSCRILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITI-------------- 232
S +C F NC+S C ++ +Y DGS + G TD TI
Sbjct: 73 SDTCTSY-VPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSEDGSGGG 131
Query: 233 -QEANSNGYFTRYPFLLGCINNSSGDKSGAS-GIMGLDRSPVSIITRTNTSY---FSYCL 287
+ A G +LGC G +S G++ L S +S +R + FSYCL
Sbjct: 132 GRRAKLQG------VVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCL 185
Query: 288 P---SPYGSTGYITF 299
+P ++ Y+TF
Sbjct: 186 VDHLAPRNASSYLTF 200
>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
Length = 414
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 93/391 (23%), Positives = 151/391 (38%), Gaps = 88/391 (22%)
Query: 95 RLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVA-IGEPKQYVSLLLDTGS 153
++ K++ RL+ F + L ++ A+ + YIV A IG P Q + L +DT +
Sbjct: 42 QMQAKDTTRLQ--FLDSLVARKSVVPIASGRQIIQSPTYIVRAKIGTPPQTLLLAMDTSN 99
Query: 154 DVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNI 213
D W C C C F KS TF + C + C+ P C C FN+
Sbjct: 100 DAAWIPCTACDGCAST---LFAPEKSTTFKNVSCAAPECK----QVPNPGCGVSSCNFNL 152
Query: 214 QYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVS 273
Y S + D IT+ Y GC++ ++G + G++GL R P+S
Sbjct: 153 TYGSSSIAANL-VQDTITLATDPVPSY------TFGCVSKTTGTSAPPQGLLGLGRGPLS 205
Query: 274 IITRTNTSY---FSYCLPS--PYGSTGYITFGKTDTVNTLR-----------SAFHKRMK 317
++++T Y FSYCLPS +G + G ++ S ++ ++
Sbjct: 206 LLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLE 265
Query: 318 KYKKAKGLEDL------------LDTCYDLSAYETVVV----------------PKIAIH 349
+ + + D+ T +D T +V PK+ +
Sbjct: 266 AIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTVFTRLVAPVYVAVRDEFRRRVGPKLTVT 325
Query: 350 FLGGVDLELDVRGTLVVASV-----------------------SQVCLGFATYPPDPNSI 386
LGG D +V +VV ++ S CL A P + NS+
Sbjct: 326 SLGGFDTCYNV--PIVVPTITFIFTGMNVTLPQDNILIHSTAGSTTCLAMAGAPDNVNSV 383
Query: 387 --TLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ N+QQ+ H V YDV R+G C+
Sbjct: 384 LNVIANMQQQNHRVLYDVPNSRVGVARELCT 414
>gi|255685714|gb|ACU28346.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
Length = 91
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 51/101 (50%), Gaps = 15/101 (14%)
Query: 136 VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRIL 195
+ IG P + +LDTGS++ WTQC PC+HC+ Q+ P F SKS TF + CN+
Sbjct: 3 LQIGTPPFEIEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFKETRCNT------ 56
Query: 196 RESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEAN 236
C + I Y D S + G AT+ +TI +
Sbjct: 57 ---------PDHSCSYKIVYDDKSYTQGTLATETVTIHSTS 88
>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 434
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 75/158 (47%), Gaps = 16/158 (10%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCK-PCIHCFQQRDPFFYASKSKTFFKIPCNSTSC 192
+ + IG P Q ++LDTGS ++W QCK P DP S +F +PCN + C
Sbjct: 80 VSLPIGTPPQTQQMVLDTGSQLSWIQCKVPPKTPPTAFDPLL----SSSFSVLPCNHSLC 135
Query: 193 --RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
R+ + P ++ C ++ YADG+ + G ++ T + T P +LGC
Sbjct: 136 KPRVPDYTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSSSQ-----TTPPLILGC 190
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLP 288
D S GI+G++ +S + S FSYC+P
Sbjct: 191 AT----DSSDTQGILGMNLGRLSFSSLAKISKFSYCVP 224
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 44/100 (44%), Gaps = 2/100 (2%)
Query: 318 KYKKAKGLEDLLDTCYDLSAYET-VVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGF 376
K KK LD C+D A ++ +A F GV++ ++ L CLG
Sbjct: 332 KLKKGYVYGGSLDMCFDGDAMVIGRMIGNMAFEFENGVEIVVEREKMLADVGGGVQCLGI 391
Query: 377 ATYPP-DPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
S +GN Q+ V +D+ GRR+GFG +CS
Sbjct: 392 GRSDLLGVASNIIGNFHQQDLWVEFDLVGRRVGFGRTDCS 431
>gi|296085499|emb|CBI29231.3| unnamed protein product [Vitis vinifera]
Length = 308
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 75/305 (24%), Positives = 123/305 (40%), Gaps = 48/305 (15%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y + +++G P + + DTGSD+ W QC PC C++Q +P F KSKT+ T
Sbjct: 29 YLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKKSKTY------KTL 82
Query: 192 CRILRESFPFGNCNSKECPF-NIQYADGSGSGGFW------------ATDRITIQEANSN 238
+ E+F G+ F + + G +GG + + +Q ++
Sbjct: 83 GYLSSETFTIGSTEGDPASFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKV 142
Query: 239 GYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGS-TGYI 297
G Y C+ S D + +S I + ++++ + TS SP + I
Sbjct: 143 GGQFSY-----CLVPLSSDSTASSKI---NFGKSAVVSGSGTS-------SPAAAEESNI 187
Query: 298 TFGKTDTVNTLRSAFHKRMKK-YKKAKGLEDLLD------TCYDLSAYETVVVPKIAIHF 350
T+ L F+ M+ K G + D CY S + + +P I HF
Sbjct: 188 IIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCY--SGVKKLEIPTITAHF 245
Query: 351 LGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFG 410
+ G D++L T V A VC P N GN+ Q V YD+ ++ F
Sbjct: 246 I-GADVQLPPLNTFVQAQEDLVCFSMI---PSSNLAIFGNLSQMNFLVGYDLKNNKVSFK 301
Query: 411 PGNCS 415
P +C+
Sbjct: 302 PTDCT 306
>gi|255685712|gb|ACU28345.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
Length = 91
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 51/101 (50%), Gaps = 15/101 (14%)
Query: 136 VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRIL 195
+ IG P + +LDTGS++ WTQC PC+HC+ Q+ P F SKS TF + CN+
Sbjct: 3 LQIGTPPFEIEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFKETRCNT------ 56
Query: 196 RESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEAN 236
C + I Y D S + G AT+ +TI +
Sbjct: 57 ---------PDHSCXYKIVYDDKSYTQGTLATETVTIHSTS 88
>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 432
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 76/155 (49%), Gaps = 14/155 (9%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
YY+ + IG P + L +D+GSD+TW QC PC C + P + +KSK +PC
Sbjct: 64 YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKSKL---VPCVHR 120
Query: 191 SCRILRESFPFGN--CNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPF 246
C L + G C S ++C + I+YAD S G D ++ +NG R
Sbjct: 121 LCASLHNALTGGKHRCESPHEQCDYVIKYADQGSSTGVLVNDSFALRL--TNGSVARPSV 178
Query: 247 LLGCINNS---SGDKSGAS-GIMGLDRSPVSIITR 277
GC + SGD S + G++GL VS++++
Sbjct: 179 AFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQ 213
>gi|242041431|ref|XP_002468110.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
gi|241921964|gb|EER95108.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
Length = 467
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 56/204 (27%), Positives = 89/204 (43%), Gaps = 33/204 (16%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCK----PCIHCFQQRDPFFYASKSKTFFKIPCNS 189
+ VA+G P Q V+++LDTGS+++W C P Q F S S T+ C+S
Sbjct: 61 VPVAVGAPPQNVTMVLDTGSELSWLLCNGSRVPSTPPQPQAPAAFNGSASSTYAAAHCSS 120
Query: 190 T-SCRILRESFPF----GNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRY 244
+ C+ P S C ++ YAD S + G A D + G
Sbjct: 121 SPECQWRGRDLPVPPFCAGPPSNSCRVSLSYADASSADGVLAADTFLL------GGAPPV 174
Query: 245 PFLLGCI----NNSSGDKSG-------------ASGIMGLDRSPVSIITRTNTSYFSYCL 287
L GCI ++S+ D +G A+G++G++R +S +T+T T F+YC+
Sbjct: 175 RALFGCITSYSSSSTADGNGNGNDASATNSSEAATGLLGMNRGSLSFVTQTGTLRFAYCI 234
Query: 288 PSPYGSTGYITFGKTDTVNTLRSA 311
+P G + G L +A
Sbjct: 235 -APGDGPGLLVLGGDGDGAALSAA 257
>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
Length = 422
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 45/155 (29%), Positives = 77/155 (49%), Gaps = 13/155 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNS 189
Y +++ IG P + L +DTGSD+TW QC PC C + D + + ++PC S
Sbjct: 67 HYSVILNIGNPPKAFDLDIDTGSDLTWVQCDAPCKGCTKPLDKLYKPKNN----RVPCAS 122
Query: 190 TSCRILRESFPFGNCN--SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
+ C+ ++ + NC+ +++C + ++YAD S G +D ++ N + R F
Sbjct: 123 SLCQAIQNN----NCDIPTEQCDYEVEYADLGSSLGVLLSDYFPLRLNNGSLLQPRIAFG 178
Query: 248 LGCINNSSGDKS--GASGIMGLDRSPVSIITRTNT 280
G G S +GI+GL R SI+++ T
Sbjct: 179 CGYDQKYLGPHSPPDTAGILGLGRGKASILSQLRT 213
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 51/184 (27%), Positives = 85/184 (46%), Gaps = 13/184 (7%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
+Y + +IGEP + +DTGSD+ W +C PC C P + ++S++ K+PC+S
Sbjct: 86 KYIMQFSIGEPPLLIWAEVDTGSDLMWVKCSPCNGCNPPPSPLYDPARSRSSGKLPCSSQ 145
Query: 191 SCRILRESFPFGNCNSKECPF-NIQYADG-SG---SGGFWATDRITIQEA--NSNGYFTR 243
C+ L + S + P YA G SG + G T+ T + +N F R
Sbjct: 146 LCQALGRGRIISDQCSDDPPLCGYHYAYGHSGDHSTQGVLGTETFTFGDGYVANNVSFGR 205
Query: 244 YPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFGKTD 303
+ G G +G++GL R +S++++ F+YCL + I FG
Sbjct: 206 SDTIDGS------QFGGTAGLVGLGRGHLSLVSQLGAGRFAYCLAADPNVYSTILFGSLA 259
Query: 304 TVNT 307
++T
Sbjct: 260 ALDT 263
>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 440
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 50/170 (29%), Positives = 80/170 (47%), Gaps = 12/170 (7%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPF-FYASKSKTFFKIPCNSTSC 192
+ + IG P Q ++LDTGS ++W QC + F S S +F +PCN C
Sbjct: 82 VSLPIGTPPQTQQMVLDTGSQLSWIQCHKKSVPKKPPPTTSFDPSLSSSFSVLPCNHPLC 141
Query: 193 --RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
RI + P ++ C ++ YADG+ + G ++IT + S P +LGC
Sbjct: 142 KPRIPDFTLPTTCDQNRLCHYSYFYADGTYAEGSLVREKITFSSSQSTP-----PLILGC 196
Query: 251 INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFG 300
S+ +K GI+G++ S ++ S FSYC+P+ G + G
Sbjct: 197 AEASTDEK----GILGMNLGRRSFASQAKISKFSYCVPTRQARAGLSSTG 242
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 44/100 (44%), Gaps = 2/100 (2%)
Query: 318 KYKKAKGLEDLLDTCYDLSAYET-VVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGF 376
K KK + D C+D + E ++ + F GV++ +D L C+G
Sbjct: 338 KLKKGYVYGGVSDMCFDGNPMEIGRLIGNMVFEFEKGVEIVIDKWRVLADVGGGVHCIGI 397
Query: 377 A-TYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ S +GN Q+ V YD+A RR+G G +CS
Sbjct: 398 GRSEMLGAASNIIGNFHQQNLWVEYDLANRRIGLGKADCS 437
>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Brachypodium distachyon]
Length = 429
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 91/366 (24%), Positives = 150/366 (40%), Gaps = 92/366 (25%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQ---QRDPFFYASKSKTFFKIP 186
++++ +++G P + +DTGS ++W C+ C I C + F KS T+ +
Sbjct: 74 KFFMDISLGTPPVANLVTVDTGSTLSWVVCQRCQISCHTTAPEAGSVFDPDKSTTYELVG 133
Query: 187 CNSTSCRILRESF--PFGNCNSKE-CPFNIQYADG-SG--SGGFWATDRITIQEANS--N 238
C+S C ++ S PFG + C ++++Y G SG S G TD++T+ ++S +
Sbjct: 134 CSSRDCADVQRSLVAPFGCIEETDTCLYSLRYGSGPSGQYSAGRLGTDKLTLASSSSIID 193
Query: 239 GYFTRYPFLLGCINNSS--GDKSGASGIMGLDRSPVSIITR-TNTSYFSYCLPSPYGSTG 295
G F+ GC + S G +SG G G + S + + R TN FSYC P + + G
Sbjct: 194 G------FIFGCSGDDSFKGYESGVIGFGGANFSFFNQVARQTNYRAFSYCFPGDHTAEG 247
Query: 296 YITFG--------------------------------------------------KTDTV 305
+++ G + TV
Sbjct: 248 FLSIGAYPKDELVYTNLIPHFGDRSVYSLQQIDMMVDGNRLQVDQSEYTKRMMVVDSGTV 307
Query: 306 NTLR-----SAFHKRMKKYKKAKG-LEDLL--DTCYDLSAYETV---VVPKIAIHFLGGV 354
+T AF K M +AKG L D + +TC+ + ++V +P + + F+ G
Sbjct: 308 DTFLLGPVFDAFSKAMASAMQAKGFLSDTVGTETCFRPNGGDSVDSGDLPTVEMRFI-GT 366
Query: 355 DLELDVRGTL--VVASVSQVCLGFATYPPDP----NSITLGNVQQRGHEVHYDVAGRRLG 408
L+L ++ S ++CL F PD N LGN V YD+ G
Sbjct: 367 TLKLPPENVFHDLLPSHDKICLAFK---PDVAGVRNVQILGNKATXSFRVVYDLQAMYFG 423
Query: 409 FGPGNC 414
F G C
Sbjct: 424 FQAGAC 429
>gi|159463556|ref|XP_001690008.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158283996|gb|EDP09746.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 547
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 67/145 (46%), Gaps = 10/145 (6%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
YY + IG P Q VS +LDTGS + C C C + F S T C+
Sbjct: 81 YYTYLTIGTPGQTVSGILDTGSTLPAFPCSGCTRCGPSKTGMFKPELSSTSSTFGCSDAR 140
Query: 192 CRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCI 251
C S +CN+++C ++I+Y +GS + GF A D + + + F+ GC
Sbjct: 141 CFCGANSC---SCNNEQCGYSIRYLEGSSTSGFLAEDMLAVGDGGPAAN-----FVFGCA 192
Query: 252 NNSSG--DKSGASGIMGLDRSPVSI 274
+ SG A G+ G+ R+P S+
Sbjct: 193 QSESGLLYSQIADGVFGMGRTPASL 217
>gi|326505434|dbj|BAJ95388.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 529
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 53/197 (26%), Positives = 87/197 (44%), Gaps = 20/197 (10%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDP--------FFYASKSKTFF 183
+Y VVA+G P + LDTGSD+ W C C+ C P + KS T
Sbjct: 108 HYAVVALGTPNVTFLVALDTGSDLFWVPCD-CLKCAPLSSPDYGNLKFDVYSPRKSSTSR 166
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECPFNIQY-ADGSGSGGFWATDRITIQEANSNGYFT 242
K+PC+S C + E + S CP+ I+Y +D + S G D + + + + T
Sbjct: 167 KVPCSSNMCDLQTEC----SAASNSCPYKIEYLSDNTSSKGVLVEDVMYLATESGHSKIT 222
Query: 243 RYPFLLGCINNSSGDKSGAS---GIMGL---DRSPVSIITRTNTSYFSYCLPSPYGSTGY 296
+ P GC +G G++ G++GL +S S++ + S+ + G
Sbjct: 223 QAPITFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASQGVAANSFSMCFGEDGHGR 282
Query: 297 ITFGKTDTVNTLRSAFH 313
I FG T + + L + +
Sbjct: 283 INFGDTGSADQLETPLN 299
>gi|449508697|ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Cucumis sativus]
Length = 418
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 46/150 (30%), Positives = 66/150 (44%), Gaps = 10/150 (6%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCK-PCIHCFQQRDPFFYASKSKTFFKIPCNST 190
Y + + +G+P + L DTGSD+TW QC PC C + P + S +PC
Sbjct: 57 YNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQPSNDL----VPCKDP 112
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGC 250
C L S N +C + ++YADG S G D + N + R LGC
Sbjct: 113 LCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPIRPR--LALGC 170
Query: 251 INNSSGDKSG---ASGIMGLDRSPVSIITR 277
+ S GI+GL R VSI+++
Sbjct: 171 GYDQDPGSSSYHPMDGILGLGRGAVSIVSQ 200
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 67/138 (48%), Gaps = 10/138 (7%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
+Y + +G P++ S+++DTGS +T+ CK C HC + +F KS T K+ C
Sbjct: 13 FYTTLKLGTPERTFSVIIDTGSTITYIPCKDCSHCGKHTAEWFDPDKSTTAKKLACGDPL 72
Query: 192 CRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCI 251
C P CN+ C ++ YA+ S S G+ D +++S + GC
Sbjct: 73 CNC---GTPSCTCNNDRCYYSRTYAERSSSEGWMIEDTFGFPDSDSP-----VRLVFGCE 124
Query: 252 NNSSGD--KSGASGIMGL 267
N +G+ + A GIMG+
Sbjct: 125 NGETGEIYRQMADGIMGM 142
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 73/158 (46%), Gaps = 19/158 (12%)
Query: 125 NDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
+D + + YY + IG P Q +L++DTGS VT+ C C C + +DP F S T+
Sbjct: 104 DDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQ 163
Query: 184 KIPCNSTSCRILRESFPFGNCNSK--ECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
+ C + C NC+ +C + QYA+ S S G D I+ +
Sbjct: 164 PVKC-TIDC----------NCDGDRMQCVYERQYAEMSTSSGVLGEDVISF---GNQSEL 209
Query: 242 TRYPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITR 277
+ GC N +GD A GIMGL R +SI+ +
Sbjct: 210 APQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQ 247
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 76/361 (21%), Positives = 130/361 (36%), Gaps = 79/361 (21%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFFKIP 186
YY + +G P + + +DTGSDV W C C C FF S T I
Sbjct: 90 YYTRLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLIS 149
Query: 187 CNSTSCRI-LRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYF--TR 243
C+ C + L+ S + +C + QY DGSG+ G++ +D + +
Sbjct: 150 CSDQRCSLGLQSSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSVMKNSS 209
Query: 244 YPFLLGCINNSSGDKS----GASGIMGLDRSPVSIITRTNTS-----YFSYCLPSPYGST 294
P + GC +GD + GI G + +S+I++ + FS+CL
Sbjct: 210 APIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCLKGDDSGG 269
Query: 295 GYITFGKTDTVNTLRSAFHKRMKKYK-------------------------------KAK 323
G + G+ N + + Y
Sbjct: 270 GILVLGEIVEPNIVYTPLVPSQPHYNLNLQSIYVNGQTLAIDPSVFATSSNQGTIIDSGT 329
Query: 324 GLEDLLDTCYD--LSAYETVV------------------------VPKIAIHFLGGVDLE 357
L L + YD +SA + V P+++++F GG +
Sbjct: 330 TLAYLTEAAYDPFISAITSTVSPSVSPYLSKGNQCYLTSSSINDVFPQVSLNFAGGTSMI 389
Query: 358 LDVRGTLV----VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
L + L+ + + C+GF +I LG++ + YD+AG+R+G+ +
Sbjct: 390 LIPQDYLIQQSSINGAALWCVGFQKIQGQEITI-LGDLVLKDKIFVYDIAGQRIGWANYD 448
Query: 414 C 414
C
Sbjct: 449 C 449
>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
Length = 471
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 54/178 (30%), Positives = 83/178 (46%), Gaps = 16/178 (8%)
Query: 127 TVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKP--CIHCFQQRDPFFYASKSKTFF 183
++ D+ Y++ IG P + DTGS++ W QC C +C++Q+ P F +KS T+
Sbjct: 102 SIIDKVYVMKFNIGSPPVETYAIPDTGSNIVWIQCGSPICTNCYKQKIPLFNPTKSSTYA 161
Query: 184 KIPCNSTSCR-ILRESFPFGNCNS--KECPFNIQYADGSGSGGFWATDRITIQE--ANSN 238
C C+ L + C S + C ++I Y D S S G +TD IT E A
Sbjct: 162 IRLCGHRECKQALWGLGEYLGCKSSVQVCRYHISYEDHSFSEGTISTDIITFPEHIAEFG 221
Query: 239 GYFTRYPFLLGCINNSSG------DKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSP 290
Y R GC N+S + A G++GL S++ + FSYC+ +P
Sbjct: 222 NYSLR--MFFGCGYNNSETPGQDPNSFTAPGVVGLGNEMASLVGQLTLGQFSYCISTP 277
>gi|357117301|ref|XP_003560410.1| PREDICTED: uncharacterized protein LOC100833752 [Brachypodium
distachyon]
Length = 473
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 52/179 (29%), Positives = 78/179 (43%), Gaps = 22/179 (12%)
Query: 132 YYIVVAIGEPKQY--VSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPC-N 188
Y + V +G Y L +D + +W QC PC C Q +P F +KS TF + N
Sbjct: 101 YAVAVGVGTEHGYENYELEMDMAAGFSWMQCAPCHPCLPQLNPVFDPAKSPTFRPVSGHN 160
Query: 189 STSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FL 247
+ CR P+ C F I Y +G+ + G+ A D + ++N F P +
Sbjct: 161 AVLCRP-----PYHPLQDGRCGFGIAYRNGASAAGYLARDTFSFPTGDNN--FQHLPGIV 213
Query: 248 LGCIN-----NSSGDKSGASGI-MGLDRSPVSIITR----TNTSYFSYCLPSPYGSTGY 296
GC N ++ G +G G+ MG + P++ R FSYC P G+T Y
Sbjct: 214 FGCANRIARFDTHGALAGVLGMGMGAEGKPLTGFMRQLYHNGGGRFSYCPIVP-GTTAY 271
>gi|222624328|gb|EEE58460.1| hypothetical protein OsJ_09701 [Oryza sativa Japonica Group]
Length = 360
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 63/243 (25%), Positives = 106/243 (43%), Gaps = 46/243 (18%)
Query: 209 CPFNIQYADGSGSGGFWATDRITI-QEANSNGYFTRYPFLLGCINNSSGDKSGA--SGIM 265
C F+ +AD S A+D + + ++A N + GC+++ +G + G++
Sbjct: 126 CAFSKPFADASFQAAL-ASDTLRLGKDAIPN-------YTFGCVSSVTGPTTNMPRQGLL 177
Query: 266 GLDRSPVSIITRTNTSY----------------FSYCLPSPYGS------TGYITFGKTD 303
GL R P++++++ + Y C P GS TG T +
Sbjct: 178 GLGRGPMALLSQAGSLYNGRLPLLPPELQVILLLRACSGFPAGSFAFDAATGAGTVVDSG 237
Query: 304 TVNT---------LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGV 354
TV T LR F +++ L DTC++ P + +H GGV
Sbjct: 238 TVITRWTAPVYAALREEFRRQVAAPSGYTSL-GAFDTCFNTDEVAAGGAPAVTVHMDGGV 296
Query: 355 DLELDVRGTLVVASVSQV-CLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGRRLGFGP 411
DL L + TL+ +S + + CL A P + NS+ + N+QQ+ V +DVA R+GF
Sbjct: 297 DLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSRVGFAK 356
Query: 412 GNC 414
+C
Sbjct: 357 ESC 359
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 46/156 (29%), Positives = 72/156 (46%), Gaps = 15/156 (9%)
Query: 125 NDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
+D + + YY + IG P Q +L++D+GS VT+ C C C +DP F S T+
Sbjct: 77 DDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYS 136
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
+ C S C + + +C + QYA+ S S G D ++ +
Sbjct: 137 PVKC-SADCTC--------DSDKSQCTYERQYAEMSSSSGVLGEDIVSF---GTESELKP 184
Query: 244 YPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITR 277
+ GC N+ +GD A GIMGL R +SI+ +
Sbjct: 185 QRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQ 220
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 47/156 (30%), Positives = 71/156 (45%), Gaps = 15/156 (9%)
Query: 125 NDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
+D + + YY + IG P Q +L++D+GS VT+ C C C +DP F S T+
Sbjct: 83 DDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQDPRFQPDLSSTYS 142
Query: 184 KIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTR 243
+ CN C E +C + QYA+ S S G D ++ + +
Sbjct: 143 PVKCN-VDCTCDNE--------RSQCTYERQYAEMSSSSGVLGEDIMSFGKESE---LKP 190
Query: 244 YPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITR 277
+ GC N +GD A GIMGL R +SI+ +
Sbjct: 191 QRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQ 226
>gi|242095590|ref|XP_002438285.1| hypothetical protein SORBIDRAFT_10g011125 [Sorghum bicolor]
gi|241916508|gb|EER89652.1| hypothetical protein SORBIDRAFT_10g011125 [Sorghum bicolor]
Length = 74
Score = 68.6 bits (166), Expect = 6e-09, Method: Composition-based stats.
Identities = 31/69 (44%), Positives = 45/69 (65%)
Query: 328 LLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSIT 387
+LDTCYD++ ++ VVVP +A+ F GG L++D G ++VA VSQ CL FA +
Sbjct: 4 VLDTCYDVTGHDEVVVPSVALLFGGGARLDVDASGIVLVADVSQACLAFAPNADRGSVNI 63
Query: 388 LGNVQQRGH 396
+ N+QQR H
Sbjct: 64 IANMQQRTH 72
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 48/88 (54%), Gaps = 6/88 (6%)
Query: 147 LLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNS 206
++LDTGSDV W QC PC C++Q P F +S ++ + C + CR L G C+
Sbjct: 1 MVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDS----GGCDL 56
Query: 207 KE--CPFNIQYADGSGSGGFWATDRITI 232
+ C + + Y DGS + G + T+ +T
Sbjct: 57 RRGACMYQVAYGDGSVTAGDFVTETLTF 84
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 54/111 (48%), Gaps = 4/111 (3%)
Query: 306 NTLRSAFHKRMKK-YKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL 364
+ LR AF + + G L DTCYDL V VP +++HF GG + L L
Sbjct: 234 SALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYL 293
Query: 365 V-VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ V S C FA D +GN+QQ+G V +D G+R+GF P C
Sbjct: 294 IPVDSRGTFCFAFAG--TDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342
>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 372
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 94/197 (47%), Gaps = 23/197 (11%)
Query: 121 PANINDTVADE------YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQ---R 170
PA+ + + D+ Y++ +++G P + + +DTGS ++W QCK C I C+ Q
Sbjct: 8 PADSSTVIGDDSMRKNKYFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKA 67
Query: 171 DPFFYASKSKTFFKIPCNSTSCRILRE--SFPFGNCNSKE-CPFNIQYADGSGSGGFWAT 227
F S T+ K+ C++ +C + + +G + C ++++Y G S G+
Sbjct: 68 GQIFNPYNSSTYSKVGCSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGK 127
Query: 228 DRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVS----IITRTNTSYF 283
DR+T+ S F+ GC ++ + A GI+G S + +T+ + F
Sbjct: 128 DRLTLASNRSID-----NFIFGCGEDNLYNGVNA-GIIGFGTKSYSFFNQVCQQTDYTAF 181
Query: 284 SYCLPSPYGSTGYITFG 300
SYC P + + G +T G
Sbjct: 182 SYCFPRDHENEGSLTIG 198
>gi|115465777|ref|NP_001056488.1| Os05g0591300 [Oryza sativa Japonica Group]
gi|113580039|dbj|BAF18402.1| Os05g0591300 [Oryza sativa Japonica Group]
Length = 453
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 55/182 (30%), Positives = 81/182 (44%), Gaps = 13/182 (7%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQ---RDPFFYASKSKTFFKIPC 187
+ I V +G P + +DTGS ++W QC+PC I C Q P F S S TF + C
Sbjct: 53 FLIPVKLGTPAVQYLVTMDTGSSLSWVQCRPCTIKCHVQPAKVGPIFDPSNSSTFRHVGC 112
Query: 188 NSTSCRILRESFPFGN--CNSKE--CPFNIQYADGSG-SGGFWATDRITIQEANSN-GYF 241
+++ C L + + C E C + + Y G S G TDR+ + +
Sbjct: 113 STSICSYLGRTLRIQSKACMEWEDICLYTMSYGGGWAYSVGKAVTDRLVLGGGETTRTTL 172
Query: 242 TRYPFLLGCINNSSGDKSGASGIMGLDRSPVS---IITRTNTSYFSYCLPSPYGSTGYIT 298
+ F+ GC ++ +GI GL S S I + FSYCLPS GY++
Sbjct: 173 SLANFVFGCSMDTQYSTHKEAGIFGLGTSNYSFEQIAPLLSYKAFSYCLPSDEAHQGYLS 232
Query: 299 FG 300
G
Sbjct: 233 IG 234
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 66/144 (45%), Gaps = 18/144 (12%)
Query: 138 IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRE 197
IG P Q +L++DTGS VT+ C C C +DP F S T+ + CN
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCN--------- 52
Query: 198 SFPFGNCNSK--ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSS 255
P C+++ +C + QYA+ S S G D ++ + + GC N +
Sbjct: 53 --PDCTCDTENDQCTYERQYAEMSSSSGILGEDLVSFGNMSE---LKPQRAVFGCENAET 107
Query: 256 GD--KSGASGIMGLDRSPVSIITR 277
GD A GIMGL R +SI+ +
Sbjct: 108 GDLFSQHADGIMGLGRGDLSIVDQ 131
>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
Length = 353
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 48/182 (26%), Positives = 88/182 (48%), Gaps = 17/182 (9%)
Query: 130 DEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQ---RDPFFYASKSKTFFKI 185
++Y++ +++G P + + +DTGS ++W QCK C I C+ Q F S T+ K+
Sbjct: 4 NKYFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKV 63
Query: 186 PCNSTSCRILRESFP--FGNCNSKE-CPFNIQYADGSGSGGFWATDRITIQEANSNGYFT 242
C++ +C + +G + C ++++Y G S G+ DR+T+ S
Sbjct: 64 GCSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNRSID--- 120
Query: 243 RYPFLLGCINNSSGDKSGASGIMGLDRSPVS----IITRTNTSYFSYCLPSPYGSTGYIT 298
F+ GC ++ + A GI+G S + +T+ + FSYC P + + G +T
Sbjct: 121 --NFIFGCGEDNLYNGVNA-GIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENEGSLT 177
Query: 299 FG 300
G
Sbjct: 178 IG 179
>gi|195647908|gb|ACG43422.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|414587776|tpg|DAA38347.1| TPA: aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 498
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 53/190 (27%), Positives = 87/190 (45%), Gaps = 27/190 (14%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF-------- 183
+Y +V +G P Q + LDTGSD+ W C+ C C P AS S TF+
Sbjct: 109 HYALVTVGTPGQTFMVALDTGSDLFWLPCQ-CDGC---TPPATAASGSATFYIPGMSSTS 164
Query: 184 -KIPCNSTSCRILRESFPFGNCNSK-ECPFNIQYAD-GSGSGGFWATDRITIQEANSNGY 240
+PCNS C + +E C++ +CP+ + Y G+ S GF D + + N++
Sbjct: 165 KAVPCNSNFCDLQKE------CSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQ 218
Query: 241 FTRYPFLLGCINNSSG---DKSGASGIMGLDRSPV---SIITRTNTSYFSYCLPSPYGST 294
+ +LGC +G D + +G+ GL V SI+ + + S+ +
Sbjct: 219 ILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGI 278
Query: 295 GYITFGKTDT 304
G I+FG ++
Sbjct: 279 GRISFGDQES 288
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 48/157 (30%), Positives = 75/157 (47%), Gaps = 15/157 (9%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSC- 192
I + IG P Q ++LDTGS ++W QC H Q F S S TF +PC C
Sbjct: 77 INLPIGTPPQTQPMVLDTGSQLSWIQC----HKKQPPTASFDPSLSSTFSILPCTHPLCK 132
Query: 193 -RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCI 251
RI + P ++ C ++ YADG+ + G ++ T + S P +LGC
Sbjct: 133 PRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSRSVSTP-----PLILGCA 187
Query: 252 NNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLP 288
S+ + GI+G++ +S ++ + FSYC+P
Sbjct: 188 TESTDPR----GILGMNLGRLSFAKQSKITKFSYCVP 220
>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 59/207 (28%), Positives = 93/207 (44%), Gaps = 25/207 (12%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y I V +G P + + +DTGS +W C+ C C F S+S T K+ C ++
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFL-QSRSTTCAKVSCGTSM 58
Query: 192 CRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C +L S P +C E CPF + Y DGS S G D +T + FT
Sbjct: 59 C-LLGGSDP--HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFT----- 110
Query: 248 LGCINNSSG--DKSGASGIMGLDRSPVSIITRTNTSY--FSYCLP---SPYG----STGY 296
GC +S G + G++G+ +S++ +++ ++ FSYCLP S G +TGY
Sbjct: 111 FGCNMDSFGANEFGNVDGLLGMGAGQMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGY 170
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAK 323
+ G + K + + K +
Sbjct: 171 FSLGGKIAATRTDVRYTKMVARRKNTE 197
>gi|194700652|gb|ACF84410.1| unknown [Zea mays]
gi|414587775|tpg|DAA38346.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 500
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 53/190 (27%), Positives = 87/190 (45%), Gaps = 27/190 (14%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF-------- 183
+Y +V +G P Q + LDTGSD+ W C+ C C P AS S TF+
Sbjct: 109 HYALVTVGTPGQTFMVALDTGSDLFWLPCQ-CDGC---TPPATAASGSATFYIPGMSSTS 164
Query: 184 -KIPCNSTSCRILRESFPFGNCNSK-ECPFNIQYAD-GSGSGGFWATDRITIQEANSNGY 240
+PCNS C + +E C++ +CP+ + Y G+ S GF D + + N++
Sbjct: 165 KAVPCNSNFCDLQKE------CSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQ 218
Query: 241 FTRYPFLLGCINNSSG---DKSGASGIMGLDRSPV---SIITRTNTSYFSYCLPSPYGST 294
+ +LGC +G D + +G+ GL V SI+ + + S+ +
Sbjct: 219 ILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGI 278
Query: 295 GYITFGKTDT 304
G I+FG ++
Sbjct: 279 GRISFGDQES 288
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 66/144 (45%), Gaps = 18/144 (12%)
Query: 138 IGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRE 197
IG P Q +L++DTGS VT+ C C C +DP F S T+ + CN
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCN--------- 52
Query: 198 SFPFGNCNSK--ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSS 255
P C+++ +C + QYA+ S S G D ++ + + GC N +
Sbjct: 53 --PDCTCDTENDQCTYERQYAEMSSSSGILGEDLVSFGNMSE---LKPQRAVFGCENAET 107
Query: 256 GD--KSGASGIMGLDRSPVSIITR 277
GD A GIMGL R +SI+ +
Sbjct: 108 GDLFSQHADGIMGLGRGDLSIVDQ 131
>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 59/207 (28%), Positives = 93/207 (44%), Gaps = 25/207 (12%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
Y I V +G P + + +DTGS +W C+ C C F S+S T K+ C ++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFL-QSRSTTCAKVSCGTSM 58
Query: 192 CRILRESFPFGNCNSKE----CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C +L S P +C E CPF + Y DGS S G D +T + FT
Sbjct: 59 C-LLGGSDP--HCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFT----- 110
Query: 248 LGCINNSSG--DKSGASGIMGLDRSPVSIITRTNTSY--FSYCLP---SPYG----STGY 296
GC +S G + G++G+ +S++ +++ ++ FSYCLP S G +TGY
Sbjct: 111 FGCNMDSFGANEFGNVDGLLGMGAGQMSVLKQSSPTFDGFSYCLPLQMSERGFFSKTTGY 170
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAK 323
+ G + K + + K +
Sbjct: 171 FSLGGKIAATRTDVRYTKMVARRKNTE 197
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 53/189 (28%), Positives = 89/189 (47%), Gaps = 18/189 (9%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDP--FFYASKSKTFFKIPCN 188
EY + V +G P + + DTGSD+ W C D F+ S+S T+ + C
Sbjct: 99 EYLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPSRSTTYSLLSCQ 158
Query: 189 STSCRILRESFPFGNCNS-KECPFNIQYADGSGSGGFWATDRITIQEANSNGYF-TRYPF 246
S +C+ L ++ +C++ EC + Y DGS + G +T+ + A G R P
Sbjct: 159 SAACQALSQA----SCDADSECQYQYAYGDGSRTIGVLSTETFSFAAAGGGGEGQVRVPR 214
Query: 247 L-LGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY-----FSYCLPSPYG---STGYI 297
+ GC S+G + G++GL +S++++ + FSYCL PY S+ +
Sbjct: 215 VSFGCSTGSAGSFR-SDGLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYAAANSSSTL 273
Query: 298 TFGKTDTVN 306
+FG V+
Sbjct: 274 SFGARAVVS 282
Score = 46.2 bits (108), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 31/122 (25%), Positives = 52/122 (42%), Gaps = 4/122 (3%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDL---SAYETVVVPKIAIHFLGG 353
+TF + L + +R++ +A+ E LL CYD+ S E +P + + F GG
Sbjct: 330 LTFLDPALLRPLVAELERRIR-LPRAQPPEQLLQLCYDVQGKSQAEDFGIPDVTLRFGGG 388
Query: 354 VDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGN 413
+ L T + +CL LGN+ Q+ V YD+ R + F +
Sbjct: 389 ASVTLRPENTFSLLEEGTLCLVLVPVSESQPVSILGNIAQQNFHVGYDLDARTVTFAAVD 448
Query: 414 CS 415
C+
Sbjct: 449 CT 450
>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 51/169 (30%), Positives = 84/169 (49%), Gaps = 10/169 (5%)
Query: 123 NINDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQ-RDPFFYASKSK 180
N++ + ++ ++V ++G+P ++DTGS + W QC PC C QQ P F S S
Sbjct: 92 NLHPSASEPLFLVNFSMGQPPVPQLAIMDTGSSLLWIQCAPCKSCSQQIIGPMFDPSISS 151
Query: 181 TFFKIPCNSTSCRILRESFPFGNCN-SKECPFNIQYADGSGSGGFWATDRITIQEANSNG 239
T+ + C + CR P G C+ S +C +N Y +G S G AT+++ I ++ G
Sbjct: 152 TYDSLSCKNIICRYA----PSGECDSSSQCVYNQTYVEGLPSVGVIATEQL-IFGSSDEG 206
Query: 240 YFTRYPFLLGCINNSSGDKSGA-SGIMGLDRSPVSIITRTNTSYFSYCL 287
L GC + + K +G+ GL S++ + S FSYC+
Sbjct: 207 RNAVNNVLFGCSHRNGNYKDRRFTGVFGLGSGITSVVNQMG-SKFSYCI 254
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 56/208 (26%), Positives = 88/208 (42%), Gaps = 24/208 (11%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRD-----PFFYASKSKTFFKIP 186
Y+ + +G P + + +DTGSD+ W C C C ++ D + S+T I
Sbjct: 70 YFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPRKSDLGIDLTLYDPKGSETSELIS 129
Query: 187 CNSTSCRILRESFPFGNCNSK-ECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C+ C + P C S+ CP++I Y DGS + G++ D +T N N P
Sbjct: 130 CDQEFCSATYDG-PIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNHVNDN--LRTAP 186
Query: 246 ----FLLGCINNSSGDKSGAS-----GIMGLDRSPVSIITRTNTS-----YFSYCLPSPY 291
+ GC SG S +S GI+G +S S++++ S FS+CL +
Sbjct: 187 QNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCLDNIR 246
Query: 292 GSTGYITFGKTDTVNTLRSAFHKRMKKY 319
G G G+ + RM Y
Sbjct: 247 GG-GIFAIGEVVEPKVSTTPLVPRMAHY 273
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 61/206 (29%), Positives = 84/206 (40%), Gaps = 30/206 (14%)
Query: 89 LRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVA-IGEPKQYVSL 147
+ +DQ RL +S +K A+ + YIV A +G P Q + +
Sbjct: 1 MAKDQARLQFLSSLVAKKSVVPI----------ASGRGVIQSPSYIVKAKVGTPPQTLLM 50
Query: 148 LLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSK 207
LD D W CK C+ C F KS TF + C + C+ P C
Sbjct: 51 ALDNSYDAAWIPCKGCVGC---SSTVFNTVKSTTFKTLGCGAPQCK----QVPNPICGGS 103
Query: 208 ECPFNIQYADGSGSGGFWAT-DRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMG 266
C +N Y GS + R TI A S Y F GCI ++G G++G
Sbjct: 104 TCTWNTTY----GSSTILSNLTRDTI--ALSMDPVPYYAF--GCIQKATGSSVPPQGLLG 155
Query: 267 LDRSPVSIITRTNTSY---FSYCLPS 289
R P+S +++T Y FSYCLPS
Sbjct: 156 FGRGPLSFLSQTQNLYKSTFSYCLPS 181
Score = 41.6 bits (96), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 28/111 (25%), Positives = 49/111 (44%), Gaps = 10/111 (9%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
+R+ F KR+ + DTCY + +VP G+++ + L+ +
Sbjct: 267 VRNEFRKRVGNATVSS--LGGFDTCYSVP-----IVPPTITFMFSGMNVTMPPENLLIHS 319
Query: 368 SVSQV-CLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ CL A P + NS+ + ++QQ+ H + +DV RLG CS
Sbjct: 320 TAGVTSCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRLGVAREQCS 370
>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like [Glycine max]
Length = 444
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 62/235 (26%), Positives = 96/235 (40%), Gaps = 24/235 (10%)
Query: 58 DKASLEVVSKYGPCSRLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEA 117
D ++L+V + PCS PS + +L K+ R++ + R
Sbjct: 40 DGSTLQVFHVFSPCSPFR-------PSKPMSWEESVLQLQAKDQARMQY-LSNLVARRSI 91
Query: 118 FTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYAS 177
+ T + Y + G P Q + L +DT +D W C C+ C F
Sbjct: 92 VPIASGRQITQSPTYIVRAKFGTPAQTLLLAMDTSNDAAWVPCTACVGCSTTTP--FAPP 149
Query: 178 KSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANS 237
KS TF K+ C ++ C+ +R C+ C FN Y S + D +T+
Sbjct: 150 KSTTFKKVGCGASQCKQVRNP----TCDGSACAFNFTYGTSSVAASL-VQDTVTLATDPV 204
Query: 238 NGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
Y GCI ++G G++GL R P+S++ +T Y FSYCLPS
Sbjct: 205 PAY------TFGCIQKATGSSLPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLPS 253
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 56/113 (49%), Gaps = 11/113 (9%)
Query: 308 LRSAFHKRMKKYKKAKGLEDL--LDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV 365
+R+ F +R+ +KK + L DTCY + +V P I F G+++ L L+
Sbjct: 338 VRNEFRRRVSVHKKLT-VTSLGGFDTCYTVP----IVAPTITFMF-SGMNVTLPPDNILI 391
Query: 366 VASVSQV-CLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
++ V CL A P + NS+ + N+QQ+ H V +DV RLG C+
Sbjct: 392 HSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFDVPNSRLGVARELCT 444
>gi|110739922|dbj|BAF01866.1| chloroplast nucleoid DNA binding protein like [Arabidopsis
thaliana]
Length = 142
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 42/109 (38%), Positives = 57/109 (52%), Gaps = 5/109 (4%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV-V 366
+R AF K K+A L DTC+DLS V VP + +HF G D+ L L+ V
Sbjct: 38 MRDAFRVGAKTLKRAPDFS-LFDTCFDLSNMNEVKVPTVVLHFRGA-DVSLPATNYLIPV 95
Query: 367 ASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ + C FA + I GN+QQ+G V YD+A R+GF PG C+
Sbjct: 96 DTNGKFCFAFAGTMGGLSII--GNIQQQGFRVVYDLASSRVGFAPGGCA 142
>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
Length = 423
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 87/332 (26%), Positives = 129/332 (38%), Gaps = 69/332 (20%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
Y +G P Q + + +D +D W C C C P F ++S T+ +PC S
Sbjct: 101 NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGC-AASSPSFSPTQSSTYRTVPCGSP 159
Query: 191 SC-RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
C ++ S P G +S C FN+ YA S D + ++ N Y F G
Sbjct: 160 QCAQVPSPSCPAGVGSS--CGFNLTYA-ASTFQAVLGQDSLALE----NNVVVSYTF--G 210
Query: 250 CINNSSGDKSGASG----------IMGLDR---SPVSIITRTNTSYFSYC--LPSPY--- 291
C+ +G+ A+G ++ D+ P+ R T+ Y PS Y
Sbjct: 211 CLRVVNGNSRAAAGAHRLRPRAALLLVADQGHLGPIGQPKRIKTTPLLYNPHRPSLYYVN 270
Query: 292 ------GS---------------TGYITFGKTDTVNT---------LRSAFHKRMKKYKK 321
GS TG T T+ T +R AF R++
Sbjct: 271 MIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRVRT-PV 329
Query: 322 AKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQV-CLGFATYP 380
A L DTCY++ TV VP + F G V + L ++ +S V CL A P
Sbjct: 330 APPLGG-FDTCYNV----TVSVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLAMAAGP 384
Query: 381 PDPNSI---TLGNVQQRGHEVHYDVAGRRLGF 409
D + L ++QQ+ V +DVA R+GF
Sbjct: 385 SDGVNAALNVLASMQQQNQRVLFDVANGRVGF 416
>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
Length = 450
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 95/377 (25%), Positives = 146/377 (38%), Gaps = 102/377 (27%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCKP---CIHC-FQQRDP----FFYASKSKTFFKI 185
I ++ G P Q +S L+DTGSDV W C C +C F DP F S + +
Sbjct: 80 ISLSFGTPPQKLSFLVDTGSDVVWAPCTTDYTCTNCSFSAADPKKVPIFDPKLSSSSKIL 139
Query: 186 PCNSTSCRILRESFPF-------GNCNSKEC----PFNIQYADGSGSGGFWATD----RI 230
C + C + FP+ N NSK C P++ QY G+ SG F + R
Sbjct: 140 DCRNPKC--VSTYFPYVHLGCPRCNGNSKHCSYACPYSTQYGTGASSGYFLLENLKFPRK 197
Query: 231 TIQ------------EANSNGY--FTR----YPFLLG------CINNSSGDKSGASGIMG 266
TI+ E +S+ F R P +G C+N+ D + SG +
Sbjct: 198 TIRNFLLGCTTSAARELSSDALAGFGRSMFSLPIQMGVKKFAYCLNSHDYDDTRNSGKLI 257
Query: 267 LD---------------RSP-------------VSI---ITRTNTSYFS--------YCL 287
LD +SP + I + R + Y + +
Sbjct: 258 LDYRDGKTKGLSYTPFLKSPPASAFYYHLGVKDIKIGNKLLRIPSKYLAPGSDGRSGVII 317
Query: 288 PSPYGSTGYITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDL--LDTCYDLSAYETVVVPK 345
S YG GY+T + + K+M KY+++ E L CY+ + ++++ +P
Sbjct: 318 DSGYGGAGYMT---GPVFKIVTNELKKQMSKYRRSLEAETQTGLTPCYNFTGHKSIKIPP 374
Query: 346 IAIHFLGGVDLELDVRGTLVVA-SVSQVCLGFAT-------YPPDPNSITLGNVQQRGHE 397
+ F GG ++ + + ++ S C T PDP SI LGN Q +
Sbjct: 375 LIYQFRGGANMVVPGKNYFGISPQESLACFLMDTNGTNALEITPDP-SIILGNSQHVDYY 433
Query: 398 VHYDVAGRRLGFGPGNC 414
V YD+ R GF C
Sbjct: 434 VEYDLKNDRFGFRRQTC 450
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 94/382 (24%), Positives = 147/382 (38%), Gaps = 103/382 (26%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKP---CIHC-FQQRDPF----FYASKSKTFF 183
Y ++ G P+Q + L+ DTGS + W C C C F + DP F S +
Sbjct: 81 YSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSK 140
Query: 184 KIPCNSTSCRILRESFPFG--------NCNSKE------CP-FNIQYADGSGSGGFWAT- 227
+ C + C S+ FG +CN K CP + +QY GS +G +
Sbjct: 141 LVGCQNPKC-----SWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSET 195
Query: 228 ----DR-----------ITIQEANSNGYFTR----YPFLLG------CINNSSGDKSGAS 262
D+ ++I + + F R P +G C+ + D S S
Sbjct: 196 LDFPDKXIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHS 255
Query: 263 GIMGLDRSPV--SIITRT---------NTSY---------------------FSYCLPSP 290
G + LD + V S +T T N +Y + + +P P
Sbjct: 256 GQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGP 315
Query: 291 YGSTGYI-------TFGKTDTVNTLRSAFHKRMKKYKKAKGLEDL--LDTCYDLSAYETV 341
G+ G I TF + + F K++ + +A +E L L C+D+S ++V
Sbjct: 316 DGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCFDISKEKSV 375
Query: 342 VVPKIAIHFLGGVDLELDVRGTLVVASVSQV-CLGFATYPPDPN-------SITLGNVQQ 393
P++ F GG L + + S S V CL T+ + S+ LG QQ
Sbjct: 376 KFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGGGGGPSVILGAFQQ 435
Query: 394 RGHEVHYDVAGRRLGFGPGNCS 415
+ V YD+ +RLGF CS
Sbjct: 436 QNFYVEYDLVNQRLGFRQQTCS 457
>gi|20975624|emb|CAD31717.1| putative nucleoid DNA-binding protein [Cicer arietinum]
Length = 144
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 41/115 (35%), Positives = 59/115 (51%), Gaps = 6/115 (5%)
Query: 302 TDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVR 361
T + R AF + ++ + + DTCYDL + +V VP I+ +FLGG L L R
Sbjct: 34 TAAYDAFRDAFIGQTTNLPRSSDVS-IFDTCYDLYGFVSVRVPTISFYFLGGPILTLPAR 92
Query: 362 GTLV-VASVSQVCLGFATYPPDPNSIT-LGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L+ V V C FA P P+ ++ +GN+QQ G E+ D +GFGP C
Sbjct: 93 NFLIPVNDVGTFCFAFA---PSPSGLSIIGNIQQEGIEISVDGVNGFVGFGPNIC 144
>gi|326526699|dbj|BAK00738.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 182
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 63/129 (48%), Gaps = 5/129 (3%)
Query: 287 LPSPYGSTGYITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKI 346
LP+ S IT T + L A MK K+A +LDTC+ + ++ VP +
Sbjct: 59 LPTIIDSGTVITRLPTTVYDALSKAVAGAMKGTKRADAYS-ILDTCF-VGQASSLRVPAV 116
Query: 347 AIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRR 406
++ F GG L+L + LV S CL FA P ++ +GN QQ+ V YDV R
Sbjct: 117 SMAFSGGAALKLSAQNLLVDVDSSTTCLAFA---PARSAAIIGNTQQQTFSVVYDVKSNR 173
Query: 407 LGFGPGNCS 415
+GF G C+
Sbjct: 174 IGFAAGGCT 182
>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
SURVIVAL 1; Flags: Precursor
gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 453
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 45/171 (26%), Positives = 85/171 (49%), Gaps = 17/171 (9%)
Query: 141 PKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPF--FYASKSKTFFKIPCNSTSCRILRES 198
P Q +S+++DTGS+++W +C +P F ++S ++ IPC+S +CR
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRS----SNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRD 137
Query: 199 FPF-GNCNS-KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSG 256
F +C+S K C + YAD S S G A + + ++ + GC+ + SG
Sbjct: 138 FLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSN-----LIFGCMGSVSG 192
Query: 257 ----DKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFGKTD 303
+ + +G++G++R +S I++ FSYC+ G++ G ++
Sbjct: 193 SDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCISGTDDFPGFLLLGDSN 243
>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
Length = 461
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 66/237 (27%), Positives = 97/237 (40%), Gaps = 48/237 (20%)
Query: 116 EAFTFPANIND-TVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCK------------- 161
EAF P + T +Y++ +G P + L+ DTGSD+TW +C+
Sbjct: 38 EAFAMPLSSGAYTGTGQYFVRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAP 97
Query: 162 ---------------PCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPF--GNC 204
F +S+T+ IPC+S +C S PF C
Sbjct: 98 GYNYGYGAPASNDSSSVSAAASSPARVFRPDRSRTWAPIPCSSDTCT---ASLPFSLAAC 154
Query: 205 NS--KECPFNIQYADGSGSGGFWATDRITIQ-EANSNGYFTRYPFL----LGCINNSSGD 257
+ C + +Y DGS + G TD TI G R L LGC + +G+
Sbjct: 155 PTPGSPCAYEYRYKDGSAARGTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGE 214
Query: 258 KSGAS-GIMGLDRSPVSIITRTNTSY---FSYCLP---SPYGSTGYITFGKTDTVNT 307
AS G++ L S VS +R + FSYCL +P +T Y+TFG V++
Sbjct: 215 SFLASDGVLSLGYSNVSFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSS 271
Score = 43.1 bits (100), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 47/104 (45%), Gaps = 10/104 (9%)
Query: 320 KKAKGLE----DLLDTCYDLSAYET-----VVVPKIAIHFLGGVDLELDVRGTLVVASVS 370
KK GL D D CY+ ++ T V VP +A+HF G L+ + ++ A+
Sbjct: 357 KKLVGLPRVAMDPFDYCYNWTSPLTGEDLAVAVPALAVHFAGSARLQPPPKSYVIDAAPG 416
Query: 371 QVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
C+G P +GN+ Q+ H +D+ RRL F C
Sbjct: 417 VKCIGLQE-GDWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRC 459
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 54/216 (25%), Positives = 96/216 (44%), Gaps = 21/216 (9%)
Query: 89 LRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLL 148
L Q + R L+++R L+ F+ F+ + + + Y+ V +G P + ++
Sbjct: 27 LSQLRARDRLRHARLLQG----FVGGVVDFSVQGSPDPYLVGLYFTKVKLGSPPREFNVQ 82
Query: 149 LDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFFKIPCNSTSC-RILRESFPFG 202
+DTGSDV W C C +C + FF +S S T + C+ C ++ +
Sbjct: 83 IDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGLVHCSDPICTSAVQTTVTQC 142
Query: 203 NCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL--GCINNSSGDKS- 259
+ + +C + QY DGSG+ G++ +D + L+ GC SGD +
Sbjct: 143 SPQTNQCSYTFQYEDGSGTSGYYVSDTLYFDAILGESLVVNSSALIVFGCSTFQSGDLTM 202
Query: 260 ---GASGIMGLDRSPVSIITRTNTS-----YFSYCL 287
GI G + +S+I++ +T FS+CL
Sbjct: 203 TDKAVDGIFGFGQGELSVISQLSTHGITPRVFSHCL 238
>gi|6580159|emb|CAB62657.2| putative protein [Arabidopsis thaliana]
Length = 475
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 61/243 (25%), Positives = 102/243 (41%), Gaps = 16/243 (6%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
YY V++G P + LDTGSD+ W C C + + P ST+
Sbjct: 102 YYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDLEDIGVPQSVPLNLYTPNASTT 161
Query: 192 CRILRES----FPFGNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
+R S F C+S CP+ I Y++ +G+ G D + + + N +
Sbjct: 162 SSSIRCSDKRCFGSKKCSSPSSICPYQISYSNSTGTKGTLLQDVLHLATEDENLTPVKAN 221
Query: 246 FLLGCINNSSG---DKSGASGIMGL---DRSPVSIITRTNTSY--FSYCLPSPYGSTGYI 297
LGC +G + +G++GL S S++ + N + FS C G+ G I
Sbjct: 222 VTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFSMCFGRVIGNVGRI 281
Query: 298 TFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVV-VPKIAIHFLGGVDL 356
+FG + + F + ++ E + CYDLS T + P + + F+GG +
Sbjct: 282 SFGDRGYTDQEETPFISVAPR-RRPVDPELPFEFCYDLSPNATTIQFPLVEMTFIGGSKI 340
Query: 357 ELD 359
L+
Sbjct: 341 ILN 343
>gi|125555052|gb|EAZ00658.1| hypothetical protein OsI_22679 [Oryza sativa Indica Group]
Length = 316
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 78/180 (43%), Gaps = 27/180 (15%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-----IHCFQQRDPFFYASKSKTFFKI 185
+Y +VV G P Q +++ DTG ++ +C C DP S+S TF +
Sbjct: 145 DYTVVVGYGTPAQQLAMAFDTGLGISLARCAACRPGAPCDGLASFDP----SRSSTFAPV 200
Query: 186 PCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
PC S CR S +C PF G A D +T+ + S FT
Sbjct: 201 PCGSPDCRSGCSSGSTPSCPLTSFPFL---------SGAVAQDVLTLTPSASVDDFT--- 248
Query: 246 FLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNT---SYFSYCLP-SPYGSTGYITFGK 301
GC+ SSG+ GA+G++ L R S+ +R FSYCLP S S G++ K
Sbjct: 249 --FGCVEGSSGEPLGAAGLLDLSRDSRSVASRLAADAGGTFSYCLPLSTTSSHGFLATAK 306
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 94/382 (24%), Positives = 147/382 (38%), Gaps = 103/382 (26%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKP---CIHC-FQQRDPF----FYASKSKTFF 183
Y ++ G P+Q + L+ DTGS + W C C C F + DP F S +
Sbjct: 81 YSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSK 140
Query: 184 KIPCNSTSCRILRESFPFG--------NCNSKE------CP-FNIQYADGSGSGGFWAT- 227
+ C + C S+ FG +CN K CP + +QY GS +G +
Sbjct: 141 LVGCQNPKC-----SWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTAGLLLSET 195
Query: 228 ----DR-----------ITIQEANSNGYFTR----YPFLLG------CINNSSGDKSGAS 262
D+ ++I + + F R P +G C+ + D S S
Sbjct: 196 LDFPDKKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGLKKFAYCLASRKFDDSPHS 255
Query: 263 GIMGLDRSPV--SIITRT---------NTSY---------------------FSYCLPSP 290
G + LD + V S +T T N +Y + + +P P
Sbjct: 256 GQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGP 315
Query: 291 YGSTGYI-------TFGKTDTVNTLRSAFHKRMKKYKKAKGLEDL--LDTCYDLSAYETV 341
G+ G I TF + + F K++ + +A +E L L C+D+S ++V
Sbjct: 316 DGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCFDISKEKSV 375
Query: 342 VVPKIAIHFLGGVDLELDVRGTLVVASVSQV-CLGFATYPPDPN-------SITLGNVQQ 393
P++ F GG L + + S S V CL T+ + S+ LG QQ
Sbjct: 376 KFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGGGGGPSVILGAFQQ 435
Query: 394 RGHEVHYDVAGRRLGFGPGNCS 415
+ V YD+ +RLGF CS
Sbjct: 436 QNFYVEYDLVNQRLGFRQQTCS 457
>gi|326490597|dbj|BAJ89966.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 450
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 60/212 (28%), Positives = 95/212 (44%), Gaps = 30/212 (14%)
Query: 90 RQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLL 149
R+ + + S RL++ P F N++ TV+ V +G P Q V+++L
Sbjct: 29 REGKAGAAVLLSLRLQEVAPPPRALANRLRFRHNVSLTVS------VVVGTPPQNVTMVL 82
Query: 150 DTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGN-CN--- 205
DTGS+++ C PF AS S T+ + C+S +C P C+
Sbjct: 83 DTGSELSGLLCNG--SSLSPPAPF-NASASLTYSAVDCSSPACVWRGRDLPVRPFCDAPP 139
Query: 206 SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCI---------NNSSG 256
S C +I YAD S + G D + P L GCI N+S+
Sbjct: 140 STSCRVSISYADASSADGHLVADTFILGT-------QAVPALFGCITSYSSSTAINSSAT 192
Query: 257 DKS-GASGIMGLDRSPVSIITRTNTSYFSYCL 287
D S A+G++G++R +S +T+T T F+YC+
Sbjct: 193 DPSEAATGLLGMNRGSLSFVTQTATLRFAYCI 224
>gi|242072510|ref|XP_002446191.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
gi|241937374|gb|EES10519.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
Length = 499
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 47/157 (29%), Positives = 74/157 (47%), Gaps = 24/157 (15%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF-------- 183
+Y +V +G P Q + LDTGSD+ W C+ C C P AS S TF+
Sbjct: 108 HYALVTVGTPGQTFMVALDTGSDLFWLPCQ-CDGC---TPPATAASGSATFYIPGMSSTS 163
Query: 184 -KIPCNSTSCRILRESFPFGNCNSK-ECPFNIQYAD-GSGSGGFWATDRITIQEANSNGY 240
+PCNS C + +E C++ +CP+ + Y G+ S GF D + + N++
Sbjct: 164 KAVPCNSNFCDLQKE------CSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQ 217
Query: 241 FTRYPFLLGCINNSSG---DKSGASGIMGLDRSPVSI 274
+ +LGC +G D + +G+ GL VS+
Sbjct: 218 ILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSV 254
>gi|242041951|ref|XP_002468370.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
gi|241922224|gb|EER95368.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
Length = 408
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 127/333 (38%), Gaps = 52/333 (15%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
Y + +G P Q + L LDT +D TW+ C PC C F + S ++ +PC S
Sbjct: 78 SYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCASD 135
Query: 191 SCRILRESFPFGNCNSKECPFNI---QYADGSGSGGFWATDRI----TIQEANSNGYFT- 242
C + R G ++ Q A + G A R T A +G +
Sbjct: 136 WCPLFRRPAVPGEPGRVGAAADVRLLQAASRTPRSGVLAATRCGWARTPSPATRSGPMSL 195
Query: 243 ------RYPFLLG-CINN------SSGDKSGASG----------IMGLDRSPVSIITRTN 279
RY + C+ + S + GA+G + R + + T
Sbjct: 196 LSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTG 255
Query: 280 TSYFSYCLPSPYGS------TGYITFGKTDTVNT---------LRSAFHKRMKKYKKAKG 324
S + +P GS TG T + TV T LR F +++
Sbjct: 256 LSVGRALVKAPAGSFAFDPSTGAGTVIDSGTVITRWTAPVYAALRDEFRRQVAAPSGYTS 315
Query: 325 LEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVASVSQV-CLGFATYP--P 381
L DTC++ P + +H GGVDL L + TL+ +S + + CL A P
Sbjct: 316 L-GAFDTCFNTDEVAAGGAPPVTLHMGGGVDLTLPMENTLIHSSATPLACLAMAEAPQNV 374
Query: 382 DPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
+ + N+QQ+ V DVAG R+GF C
Sbjct: 375 NSVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 407
>gi|297852200|ref|XP_002893981.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
gi|297339823|gb|EFH70240.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
Length = 354
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 82/317 (25%), Positives = 134/317 (42%), Gaps = 54/317 (17%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHC----FQQRDPFFYASKSKTFFKIP 186
Y +++ IG P + +DTGSD+TW QC PC C +Q Y K T +P
Sbjct: 54 YSVLLQIGTPPKAFEFDIDTGSDLTWVQCDAPCTGCTLPPIRQ-----YKPKGNT---VP 105
Query: 187 CNSTSCRILRESFPFGNCNSKECP-------FNIQYADGSGSGGFWATDRITIQEANSNG 239
C C L FP N +CP + + YAD S G D+ ++ N +
Sbjct: 106 CLDPICLALH--FP----NKPQCPNPKEQCDYEVNYADQGSSMGALVIDQFPLKLLNGSA 159
Query: 240 YFTRYPFLLGC----INNSSGDKSGASGIMGLDRSPVSIITRTNTSYFS-----YCLPSP 290
R F GC I + +G++GL R + ++ + + + +CL S
Sbjct: 160 MQPRLAF--GCGYDQILPKAHPPPATAGVLGLGRGKIGVLPQLVAAGLTRNVVGHCLSSK 217
Query: 291 YGSTGYITFGKTDTVNTLRSAFHKRMK-KYKKAKGL-EDLLDTCYDLSAYETVVVPK--- 345
G GY+ FG T + TL A+ + +Y + D L D + +++V+ K
Sbjct: 218 GG--GYLFFGDT-LIPTLGVAWTPLLSPEYTFFFHICRDRLQR--DYTFFKSVLEFKNFF 272
Query: 346 --IAIHFLGG---VDLELDVRGTLVVASVSQVCLGF--ATYPPDPNSITLGNVQQRGHEV 398
I I+F L++ L+++ CLG + NS +G++ +G V
Sbjct: 273 KTITINFTNARRITQLQIPPESYLIISKTGNACLGLLNGSEVGLQNSNVIGDISMQGLMV 332
Query: 399 HYDVAGRRLGFGPGNCS 415
YD ++LG+ NC+
Sbjct: 333 IYDNEKQQLGWVSSNCN 349
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 52/168 (30%), Positives = 81/168 (48%), Gaps = 17/168 (10%)
Query: 126 DTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKI 185
D + EY I +AIG P +S ++DTGSD+ WT+C PC C + S S T+ K+
Sbjct: 36 DIGSGEYLIQMAIGTPALSLSAIMDTGSDLVWTKCNPCTDC--STSSIYDPSSSSTYSKV 93
Query: 186 PCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C S+ C + F N +C + Y D S + G + + +I +
Sbjct: 94 LCQSSLC---QPPSIFSCNNDGDCEYVYPYGDRSSTSGILSDETFSISSQ------SLPN 144
Query: 246 FLLGCINNSSG-DKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
GC +++ G DK G G++G R +S++++ S FSYCL S
Sbjct: 145 ITFGCGHDNQGFDKVG--GLVGFGRGSLSLVSQLGPSMGNKFSYCLVS 190
>gi|168051774|ref|XP_001778328.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670305|gb|EDQ56876.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 165
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 58/110 (52%), Gaps = 2/110 (1%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
EY+I + I P +++ +++DTGSD+TW QC PC+HC+ Q+ F S+++ + C
Sbjct: 11 EYFIDIFIDTPPRHILVIIDTGSDLTWVQCTPCLHCYLQKGLVFNPHSSESYDPVACGEP 70
Query: 191 SCRILRESFPFGNC--NSKECPFNIQYADGSGSGGFWATDRITIQEANSN 238
+ S C +S+ C + Y D S + +AT+ T+ + N
Sbjct: 71 KRAFVESSNNRSTCVTDSQGCSYFYWYGDSSNTTSDFATETFTVNKTIKN 120
>gi|226501154|ref|NP_001146408.1| uncharacterized protein LOC100279988 [Zea mays]
gi|219887047|gb|ACL53898.1| unknown [Zea mays]
gi|414587777|tpg|DAA38348.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 416
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 53/190 (27%), Positives = 87/190 (45%), Gaps = 27/190 (14%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF-------- 183
+Y +V +G P Q + LDTGSD+ W C+ C C P AS S TF+
Sbjct: 7 HYALVTVGTPGQTFMVALDTGSDLFWLPCQ-CDGC---TPPATAASGSATFYIPGMSSTS 62
Query: 184 -KIPCNSTSCRILRESFPFGNCNSK-ECPFNIQYAD-GSGSGGFWATDRITIQEANSNGY 240
+PCNS C + +E C++ +CP+ + Y G+ S GF D + + N++
Sbjct: 63 KAVPCNSNFCDLQKE------CSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQ 116
Query: 241 FTRYPFLLGCINNSSG---DKSGASGIMGLDRSPV---SIITRTNTSYFSYCLPSPYGST 294
+ +LGC +G D + +G+ GL V SI+ + + S+ +
Sbjct: 117 ILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGI 176
Query: 295 GYITFGKTDT 304
G I+FG ++
Sbjct: 177 GRISFGDQES 186
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 77/366 (21%), Positives = 133/366 (36%), Gaps = 89/366 (24%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRD-----PFFYASKSKTFFKIP 186
Y+ V +G P ++ +DTGSD+ W C C +C FF A S T +
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGSVT 159
Query: 187 CNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQE-------ANSNG 239
C+ C + ++ + +C ++ +Y DGSG+ G++ TD ANS+
Sbjct: 160 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 219
Query: 240 YFTRYPFLLGCINNSSGDKS----GASGIMGLDRSPVSIITRTNTS-----YFSYCLPSP 290
P + GC SGD + GI G + +S++++ ++ FS+CL
Sbjct: 220 -----PIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGD 274
Query: 291 YGSTGYITFGKTDTVNTLRS-------------------------------AFHKRMKKY 319
G G+ + S A + R
Sbjct: 275 GSGGGVFVLGEILVPGMVYSPLLPSQPHYNLNLLSIGVNGQILPIDAAVFEASNTRGTIV 334
Query: 320 KKAKGLEDLLDTCYD---------LSAYETVVV-----------------PKIAIHFLGG 353
L L+ YD +S T+++ P ++++F GG
Sbjct: 335 DTGTTLTYLVKEAYDPFLNAISNSVSQLVTLIISNGEQCYLVSTSISDMFPPVSLNFAGG 394
Query: 354 VDLELDVRGTL----VVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGF 409
+ L + L S C+GF P + LG++ + YD+A +R+G+
Sbjct: 395 ASMMLRPQDYLFHYGFYDGASMWCIGFQKAPEE--QTILGDLVLKDKVFVYDLARQRIGW 452
Query: 410 GPGNCS 415
+CS
Sbjct: 453 ANYDCS 458
>gi|356529585|ref|XP_003533370.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1388
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 52/194 (26%), Positives = 88/194 (45%), Gaps = 17/194 (8%)
Query: 119 TFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCK-PCIHCFQQRDPFFYAS 177
FP + N Y+ ++ +G P + L +DTGSD+TW QC PCI C + + +
Sbjct: 179 VFPVSGNVYPDGLYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCISCGKGAHVLYKPT 238
Query: 178 KSKTFFKIPCNSTSCRILRESFPFGNCNSK--ECPFNIQYADGSGSGGFWATDRITIQEA 235
+S + C ++++ G+ + +C + IQYAD S S G D + +
Sbjct: 239 RSNVVSSV---DALCLDVQKNQKNGHHDESLLQCDYEIQYADHSSSLGVLVRDELHL--V 293
Query: 236 NSNGYFTRYPFLLGCINNSSG----DKSGASGIMGLDRSPVSIITRTNT-----SYFSYC 286
+NG T+ + GC + +G GIMGL R+ VS+ + + + +C
Sbjct: 294 TTNGSKTKLNVVFGCGYDQAGLLLNTLGKTDGIMGLSRAKVSLPYQLASKGLIKNVVGHC 353
Query: 287 LPSPYGSTGYITFG 300
L + GY+ G
Sbjct: 354 LSNDGAGGGYMFLG 367
>gi|413953656|gb|AFW86305.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 406
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 57/219 (26%), Positives = 96/219 (43%), Gaps = 29/219 (13%)
Query: 96 LHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADE--YYIVVAIGEPKQYVSLLLDTGS 153
L+ K RR +P+ + FP ++ + E YY +++G P + L +DTGS
Sbjct: 127 LYPKPPRRGGDDWPQ-----NSTLFPHSLAGNLFPEGLYYTAISLGSPPRPYFLDVDTGS 181
Query: 154 DVTWTQCK--PCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPF 211
TW QC PC C + P + ++T +P + C + N +C +
Sbjct: 182 HTTWVQCDAPPCASCAKGAHPLY--RPARTADALPASDPLCEGAQHE------NPNQCDY 233
Query: 212 NIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGA----SGIMGL 267
I YADGS S G + D ++Q +G + GC + G A G++GL
Sbjct: 234 EISYADGSSSMGVYVRD--SMQFVGEDGERENADIVFGCGYDQQGVLLNALETTDGVLGL 291
Query: 268 DRSPVSIITRTN-----TSYFSYCLPS-PYGSTGYITFG 300
+S+ T+ ++ F +C+ + P G+ GY+ G
Sbjct: 292 TNKALSLPTQLASRGIISNAFGHCMSTDPSGAGGYLFLG 330
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 96/420 (22%), Positives = 158/420 (37%), Gaps = 97/420 (23%)
Query: 58 DKASLEVVSKYGPCS--RLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRT 115
+ ++L+V+ + PCS R ++ +S S+ ++ +D RL +S RK
Sbjct: 27 NGSTLQVIHVFSPCSPFRPSKPLSWEE-SVLQMQAKDTTRLQFLDSLVARKSIVPIASGR 85
Query: 116 EAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFY 175
+ P Y + IG P Q + L +DT +D W C C C F
Sbjct: 86 QIIQSPT---------YIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCAST---LFA 133
Query: 176 ASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEA 235
KS TF + C + C+ P C FN+ Y S + D IT+
Sbjct: 134 PEKSTTFKNVSCAAPECK----QVPNPGCGVSSRNFNLTYGSSSIAANL-VQDTITLATD 188
Query: 236 NSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS--P 290
Y GC++ ++G + G++GL R P+S++++T Y FSYCLPS
Sbjct: 189 PVPSY------TFGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKS 242
Query: 291 YGSTGYITFGKTDTVNTLR-----------SAFHKRMKKYKKAKGLEDL----------- 328
+G + G ++ S ++ ++ + + + D+
Sbjct: 243 LNFSGSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTT 302
Query: 329 -LDTCYDLSAYETVVV----------------PKIAIHFLGGVDLELDVRGTLVVASV-- 369
T +D T +V PK+ + LGG D +V +VV ++
Sbjct: 303 GAGTIFDSGTVFTRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYNV--PIVVPTITF 360
Query: 370 ---------------------SQVCLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGRR 406
S CL A P + NS+ + N+QQ+ H V YDV R
Sbjct: 361 IFTGMNVTLPQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPNSR 420
>gi|413953655|gb|AFW86304.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 535
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 49/181 (27%), Positives = 81/181 (44%), Gaps = 22/181 (12%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCK--PCIHCFQQRDPFFYASKSKTFFKIPCNS 189
YY +++G P + L +DTGS TW QC PC C + P + ++T +P +
Sbjct: 160 YYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAHPLY--RPARTADALPASD 217
Query: 190 TSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLG 249
C + N +C + I YADGS S G + D ++Q +G + G
Sbjct: 218 PLCEGAQHE------NPNQCDYEISYADGSSSMGVYVRD--SMQFVGEDGERENADIVFG 269
Query: 250 CINNSSGDKSGA----SGIMGLDRSPVSIITRTN-----TSYFSYCLPS-PYGSTGYITF 299
C + G A G++GL +S+ T+ ++ F +C+ + P G+ GY+
Sbjct: 270 CGYDQQGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGHCMSTDPSGAGGYLFL 329
Query: 300 G 300
G
Sbjct: 330 G 330
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 82/381 (21%), Positives = 138/381 (36%), Gaps = 84/381 (22%)
Query: 118 FTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC-----FQQRDP 172
F N + Y+ V +G P + + +DTGSDV W C C C Q
Sbjct: 70 FPVQGTFNPFLVGLYFTRVQLGSPPKDFYVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLT 129
Query: 173 FFYASKSKTFFKIPCNSTSCRI-LRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRIT 231
FF S T + C+ C ++ S + + +C + QY DGSG+ G++ D +
Sbjct: 130 FFDPGSSTTAALVSCSDQRCTAGIQSSDSLCSSRTNQCGYTFQYGDGSGTSGYYVADLMH 189
Query: 232 IQE-ANSNG--------YFTRYPFLLGCINNSSGDKS--GASGIMGLDRSPVSIITRTNT 280
+ S+G Y + F+ + KS GI G + +S+I++ +
Sbjct: 190 LDTLLLSSGELSQICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLAS 249
Query: 281 S-----YFSYCLPSPYGSTGYITFGKTDTVNTLRSAFHKRMKKYK--------------- 320
FS+CL G + G+ N + + Y
Sbjct: 250 QGITPRVFSHCLKGDDSGGGVLVLGEIVEPNIVYTPLVPSQPHYNLYLQSISVAGQTLAI 309
Query: 321 ----------------KAKGLEDLLDTCYD--LSAYETV--------------------- 341
L L + YD +SA +V
Sbjct: 310 DPSVFGASSNQGTIVDSGTTLAYLAEGAYDPFVSAITSVVSLNARTYLSKGNQCYLVTSS 369
Query: 342 ---VVPKIAIHFLGGVDLELDVRGTLV----VASVSQVCLGFATYPPDPNSITLGNVQQR 394
V P+++++F GG L L+ + L+ V + C+GF P +I LG++ +
Sbjct: 370 VNDVFPQVSLNFAGGASLILNPQDYLLQQNSVGGAAVWCVGFQKTPGQQITI-LGDLVLK 428
Query: 395 GHEVHYDVAGRRLGFGPGNCS 415
YD+A +R+G+ +CS
Sbjct: 429 DKIFVYDIANQRVGWTNYDCS 449
>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 578
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 53/194 (27%), Positives = 85/194 (43%), Gaps = 18/194 (9%)
Query: 120 FPANINDTVADEYYIVVAIGEPK--QYVSLLLDTGSDVTWTQCK-PCIHCFQQRDPFFYA 176
FP N YY + +G+P+ QY L +DTGSD+TW QC PC C + + +
Sbjct: 186 FPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSDLTWIQCDAPCTSCAKGANQLYKP 245
Query: 177 SKSKTFFKIPCNSTSCRILRESFPFGNCNS-KECPFNIQYADGSGSGGFWATDRITIQEA 235
K + + C ++ + +C S +C + I+YAD S S G D+ ++
Sbjct: 246 RKDNL---VRSSEPFCVEVQRNQLTEHCESCHQCDYEIEYADHSYSMGVLTKDKFHLKL- 301
Query: 236 NSNGYFTRYPFLLGCINNSSG----DKSGASGIMGLDRSPVSIITRTN-----TSYFSYC 286
NG + GC + G GI+GL R+ +S+ ++ ++ +C
Sbjct: 302 -HNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHC 360
Query: 287 LPSPYGSTGYITFG 300
L S GYI G
Sbjct: 361 LASDLNGEGYIFMG 374
>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/171 (26%), Positives = 85/171 (49%), Gaps = 17/171 (9%)
Query: 141 PKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPF--FYASKSKTFFKIPCNSTSCRILRES 198
P Q +S+++DTGS+++W +C +P F ++S ++ IPC+S +CR
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRS----SNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRD 137
Query: 199 FPF-GNCNS-KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSG 256
F +C+S K C + YAD S S G A + + ++ + GC+ + SG
Sbjct: 138 FLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSN-----LIFGCMGSVSG 192
Query: 257 ----DKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGSTGYITFGKTD 303
+ + +G++G++R +S I++ FSYC+ G++ G ++
Sbjct: 193 SDPEEDTKTTGLLGMNRGSLSFISQMGFPKFSYCISGTDDFPGFLLLGDSN 243
>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 438
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 66/235 (28%), Positives = 100/235 (42%), Gaps = 29/235 (12%)
Query: 60 ASLEVVSKYGPCS--RLNQGISTHAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEA 117
++LEV + PCS R ++ +S A S+ ++ +DQ RL S + +
Sbjct: 33 STLEVFHVFSPCSPFRPSKPLS-WAESVLQLQAKDQARLQFLASMVAGRSIVPIASGRQI 91
Query: 118 FTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYAS 177
P Y + IG P Q + L +DT +D W C C C F
Sbjct: 92 IQSP---------TYIVRAKIGTPPQTLLLAIDTSNDAAWIPCTACDGCTST---LFAPE 139
Query: 178 KSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANS 237
KS TF + C S C P +C + C FN+ Y S + D +T+
Sbjct: 140 KSTTFKNVSCGSPEC----NKVPSPSCGTSACTFNLTYGSSSIAANV-VQDTVTLATDPI 194
Query: 238 NGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
GY GC+ ++G + G++GL R P+S++++T Y FSYCLPS
Sbjct: 195 PGY------TFGCVAKTTGPSTPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPS 243
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 54/114 (47%), Gaps = 11/114 (9%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLL---DTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTL 364
+R F +R+ KA L DTCY + +V P I F G+++ L L
Sbjct: 329 VRDEFRRRVAMAAKANLTVTSLGGFDTCYTVP----IVAPTITFMF-SGMNVTLPQDNIL 383
Query: 365 VVASV-SQVCLGFATYPPDPNSI--TLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ ++ S CL A+ P + NS+ + N+QQ+ H V YDV RLG C+
Sbjct: 384 IHSTAGSTSCLAMASAPDNVNSVLNVIANMQQQNHRVLYDVPNSRLGVARELCT 437
>gi|356500374|ref|XP_003519007.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Glycine max]
Length = 454
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 57/216 (26%), Positives = 96/216 (44%), Gaps = 23/216 (10%)
Query: 99 KNSRRLRKPFPE----FLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSD 154
+N+++ + P+ + L + F N+ Y + + IG P + L +D+GSD
Sbjct: 29 RNAKKPKTPYSDNNHHRLSSSAVFKLQGNVYPL--GHYTVSLNIGYPPKLYDLDIDSGSD 86
Query: 155 VTWTQCK-PCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKE--CPF 211
+TW QC PC C + RD + + + + C C + S + NC S + C +
Sbjct: 87 LTWVQCDAPCKGCTKPRDQLYKPNHN----LVQCVDQLCSEVHLSMAY-NCPSPDDPCDY 141
Query: 212 NIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKS--GASGIMGLDR 269
++YAD S G D I Q N + R F G SG S SG++GL
Sbjct: 142 EVEYADHGSSLGVLVRDYIPFQFTNGSVVRPRVAFGCGYDQKYSGSNSPPATSGVLGLGN 201
Query: 270 SPVSIITRTNT-----SYFSYCLPSPYGSTGYITFG 300
SI+++ ++ + +CL + G G++ FG
Sbjct: 202 GRASILSQLHSLGLIRNVVGHCLSAQGG--GFLFFG 235
>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
Length = 414
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 73/148 (49%), Gaps = 19/148 (12%)
Query: 165 HCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGF 224
C + P F + S TF K+PC S+ C+ L P+ CN+ C + Y G + G+
Sbjct: 87 ECAARPAPPFQPASSSTFSKLPCASSLCQFLTS--PYLTCNATGCVYYYPYGMGF-TAGY 143
Query: 225 WATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFS 284
AT+ + + A+ G GC + +G + +SGI+GL RSP+S++++ FS
Sbjct: 144 LATETLHVGGASFPG------VAFGC-STENGVGNSSSGIVGLGRSPLSLVSQVGVGRFS 196
Query: 285 YCL---------PSPYGSTGYITFGKTD 303
YCL P +GS +T GK+
Sbjct: 197 YCLRSDADAGDSPILFGSLAKVTGGKSS 224
>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
Length = 437
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 62/221 (28%), Positives = 90/221 (40%), Gaps = 20/221 (9%)
Query: 90 RQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLL 149
+Q Q L N RR R FL + +FP N + YY + +G P Q + +++
Sbjct: 49 KQHLQHLVEHNDRRGR-----FL---QGISFPLKGNYSDLGLYYTEIGLGNPVQKLKVIV 100
Query: 150 DTGSDVTWTQCKPCIHCFQQRDPF----FYASKSKTFFKIPCNSTSCRILRESFPFGNCN 205
DTGSD+ W +C PC C ++D Y + + + S E + N
Sbjct: 101 DTGSDILWVKCSPCRSCLSKQDIIPPLSIYNLSASSTSSVSSCSDPLCTGEEVVCSRSGN 160
Query: 206 SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKSGASGIM 265
+ C + Y D S S G + D + N +R F GC N +G GIM
Sbjct: 161 NSACAYVSSYQDKSASVGAYVRDDMHYVLHGGNATTSRIFF--GCATNITGSWP-VDGIM 217
Query: 266 GL----DRSPVSIITRTNTS-YFSYCLPSPYGSTGYITFGK 301
G P I T+ N S FS+CL G + FG+
Sbjct: 218 GFGLISKTVPNQIATQRNMSRVFSHCLGGEKHGGGILEFGE 258
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/158 (30%), Positives = 74/158 (46%), Gaps = 19/158 (12%)
Query: 125 NDTVADEYYIV-VAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFF 183
+D + + YY + IG P Q +L++DTGS VT+ C C C + +DP F S T+
Sbjct: 73 DDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPDLSSTYQ 132
Query: 184 KIPCNSTSCRILRESFPFGNCNSK--ECPFNIQYADGSGSGGFWATDRITIQEANSNGYF 241
+ C + C NC++ +C + QYA+ S S G D ++ +
Sbjct: 133 PVKC-TLDC----------NCDNDRMQCVYERQYAEMSTSSGVLGEDVVSF---GNQSEL 178
Query: 242 TRYPFLLGCINNSSGD--KSGASGIMGLDRSPVSIITR 277
+ GC N +GD A GIMGL R +SI+ +
Sbjct: 179 APQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQ 216
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 49/193 (25%), Positives = 81/193 (41%), Gaps = 13/193 (6%)
Query: 106 KPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIH 165
KP P+ R A Y +G P Q + + +D +D W C C+
Sbjct: 74 KPKPKGHSRHTFVPIAAGRQILRTPSYVARARLGTPPQTLLVAIDPSNDAAWVPCSACLG 133
Query: 166 CFQ-QRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNS---KECPFNIQYADGSGS 221
C P F ++S T+ + C + C + + P +C + C FN+ YA S
Sbjct: 134 CAPGASSPSFDPTQSSTYRPVRCGAPQCAQVPPATP--SCPAGPGASCAFNLSYAS-STL 190
Query: 222 GGFWATDRITIQEANSNGYFTRYPFLLGCIN--NSSGDKSGASGIMGLDRSPVSIITRTN 279
D +++ ++N + + GC+ SG G++G R P+S +++T
Sbjct: 191 HAVLGQDALSLSDSNGAAVPDDH-YTFGCLRVVTGSGGSVPPQGLVGFGRGPLSFLSQTK 249
Query: 280 TSY---FSYCLPS 289
+Y FSYCLPS
Sbjct: 250 ATYGSIFSYCLPS 262
Score = 44.7 bits (104), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 52/106 (49%), Gaps = 8/106 (7%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLVVA 367
LR+AF +R A L DTCY ++ ++V P +A F GG + L ++ +
Sbjct: 349 LRNAF-RRGVSAPAAPALGGF-DTCYYVNGTKSV--PAVAFVFAGGARVTLPEENVVISS 404
Query: 368 SVSQV-CLGFATYPPDPNSI---TLGNVQQRGHEVHYDVAGRRLGF 409
+ V CL A P D + L ++QQ+ H V +DV R+GF
Sbjct: 405 TSGGVACLAMAAGPSDGVNAGLNVLASMQQQNHRVVFDVGNGRVGF 450
>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 47/168 (27%), Positives = 79/168 (47%), Gaps = 23/168 (13%)
Query: 149 LDTGSDVTWTQCKPCIH----CFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGN- 203
+DTG++++W QC+ C + CF +DP + +S+SK++ + CN + SF N
Sbjct: 105 IDTGNELSWIQCEGCQNKGNMCFPHKDPPYTSSQSKSYKPVSCN-------QHSFCEPNQ 157
Query: 204 CNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSG------- 256
C C +N+ Y GS + G A + T +N + GC +S
Sbjct: 158 CKEGLCAYNVTYGPGSYTSGNLANETFTFY-SNHGKHTALKSISFGCSTDSRNMIYAFLL 216
Query: 257 DKSGASGIMGLDRSPVSIITRTNT---SYFSYCLPSPYGSTGYITFGK 301
DK+ SG++G+ P S + + + FSYC+ + Y+ FGK
Sbjct: 217 DKNPVSGVLGMGWGPRSFLAQLGSISHGKFSYCITANNTHNTYLRFGK 264
>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
gi|238015146|gb|ACR38608.1| unknown [Zea mays]
gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
Length = 467
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/166 (28%), Positives = 80/166 (48%), Gaps = 14/166 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNS 189
Y + +G P + +++DTGS +TW QC PC + C +Q P F S ++ + C++
Sbjct: 128 NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCSA 187
Query: 190 TSCRILRESF--PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-F 246
C L + P S C + Y D S S G+ + D ++ T P F
Sbjct: 188 QQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGS-------TSVPNF 240
Query: 247 LLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
GC ++ G ++G++GL R+ +S++ + S FSYCLP+
Sbjct: 241 YYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPT 286
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 61/129 (47%), Gaps = 5/129 (3%)
Query: 287 LPSPYGSTGYITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKI 346
LP+ S IT T + L A MK +A +LDTC+ A + VP++
Sbjct: 344 LPTIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFS-ILDTCFQGQAAR-LRVPEV 401
Query: 347 AIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRR 406
+ F GG L+L R LV + CL FA P ++ +GN QQ+ V YDV +
Sbjct: 402 TMAFAGGAALKLAARNLLVDVDSATTCLAFA---PARSAAIIGNTQQQTFSVVYDVKNSK 458
Query: 407 LGFGPGNCS 415
+GF G CS
Sbjct: 459 IGFAAGGCS 467
>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
Length = 389
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/96 (38%), Positives = 52/96 (54%), Gaps = 3/96 (3%)
Query: 308 LRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDLELDVRGT--LV 365
L++ F K+ + A +LDTC++LSAY+ V +P I +HF G +L +DV G V
Sbjct: 235 LKAEFLKQFTGFPPAPAFS-ILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFV 293
Query: 366 VASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYD 401
+ SQVCL A+ LGN QQ+ V YD
Sbjct: 294 KSDASQVCLALASLEYQDEVAILGNYQQKNLRVIYD 329
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 34/93 (36%), Positives = 48/93 (51%), Gaps = 11/93 (11%)
Query: 202 GNCNSKE--CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKS 259
G C S C + I Y DGS + G +++ G F+ GC N+ G
Sbjct: 67 GVCGSAAPICNYAINYGDGSFTRGELGHEKLKF------GTILVKDFIFGCGRNNKGLFG 120
Query: 260 GASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
G SG+MGL RS +S+I++T+ + FSYCLPS
Sbjct: 121 GVSGLMGLGRSDLSLISQTSGIFGGVFSYCLPS 153
>gi|118486628|gb|ABK95151.1| unknown [Populus trichocarpa]
Length = 393
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 44/153 (28%), Positives = 72/153 (47%), Gaps = 11/153 (7%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCK-PCIHCFQQRDPFFYASKSKTFFKIPCNST 190
Y + + IG+P + L +DTGSD+TW QC PC+ C + P++ + +PC
Sbjct: 34 YNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPHPYYRPRNN----LVPCMDP 89
Query: 191 SCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL-LG 249
C+ L + N +C + ++YADG S G TD + N P L LG
Sbjct: 90 ICQSLHSNGDHRCENPGQCDYEVEYADGGSSFGVLVTDTFNL---NFTSEKRHSPLLALG 146
Query: 250 CINNS--SGDKSGASGIMGLDRSPVSIITRTNT 280
C + G G++GL + SI+++ ++
Sbjct: 147 CGYDQFPGGSHHPIDGVLGLGKGKSSIVSQLSS 179
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 58/227 (25%), Positives = 96/227 (42%), Gaps = 26/227 (11%)
Query: 97 HLKNSRRLRKPFPEFLKRTEA---FTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGS 153
HL++ R+R L+ + F+ + + YY V +G P + + +DTGS
Sbjct: 47 HLRSRDRVRHG--RMLQSSGGVIDFSVSGTYDPFLVGLYYTRVQLGNPPKDFYVQIDTGS 104
Query: 154 DVTWTQCKPCIHC-----FQQRDPFFYASKSKTFFKIPCNSTSCRI---LRESFPFGNCN 205
DV W C C C Q FF S T + C+ C + +S FG N
Sbjct: 105 DVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTASLVSCSDQICALGVQSSDSACFGQSN 164
Query: 206 SKECPFNIQYADGSGSGGFWATDRITIQEA--NSNGYFTRYPFLLGCINNSSGDKS---- 259
+C + QY DGSG+ G++ D I + +S + + GC + +GD +
Sbjct: 165 --QCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVTSNSSASVVFGCSTSQTGDLTKSDR 222
Query: 260 GASGIMGLDRSPVSIITRTNT-----SYFSYCLPSPYGSTGYITFGK 301
GI G + +S+I++ ++ FS+CL G + G+
Sbjct: 223 AVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCLKGDDSGGGILVLGE 269
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 25/88 (28%), Positives = 46/88 (52%), Gaps = 5/88 (5%)
Query: 332 CYDLSAYETVVVPKIAIHFLGGVDLELDVRGTLV----VASVSQVCLGFATYPPDPNSIT 387
CY S+ + + P+++++F GG L L + L+ V + C+GF P +I
Sbjct: 357 CYVTSSSVSDIFPQVSLNFAGGASLVLGAQDYLIQQNSVGGTTVWCIGFQKIPGQGITI- 415
Query: 388 LGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
LG++ + YD+A +R+G+ +CS
Sbjct: 416 LGDLVLKDKIFIYDLANQRIGWTNYDCS 443
>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
gi|194706308|gb|ACF87238.1| unknown [Zea mays]
Length = 467
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/166 (28%), Positives = 80/166 (48%), Gaps = 14/166 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNS 189
Y + +G P + +++DTGS +TW QC PC + C +Q P F S ++ + C++
Sbjct: 128 NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCSA 187
Query: 190 TSCRILRESF--PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-F 246
C L + P S C + Y D S S G+ + D ++ T P F
Sbjct: 188 QQCSDLTTATLSPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGS-------TSVPNF 240
Query: 247 LLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
GC ++ G ++G++GL R+ +S++ + S FSYCLP+
Sbjct: 241 YYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPT 286
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 61/129 (47%), Gaps = 5/129 (3%)
Query: 287 LPSPYGSTGYITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKI 346
LP+ S IT T + L A MK +A +LDTC+ A + VP++
Sbjct: 344 LPTIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFS-ILDTCFQGQAAR-LRVPEV 401
Query: 347 AIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRR 406
+ F GG L+L R LV + CL FA P ++ +GN QQ+ V YDV +
Sbjct: 402 TMAFAGGAALKLAARNLLVDVDSATTCLAFA---PARSAAIIGNTQQQTFSVVYDVKNSK 458
Query: 407 LGFGPGNCS 415
+GF G CS
Sbjct: 459 IGFAAGGCS 467
>gi|357464807|ref|XP_003602685.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355491733|gb|AES72936.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 440
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 61/232 (26%), Positives = 95/232 (40%), Gaps = 18/232 (7%)
Query: 108 FPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCK-PCIHC 166
F + + FP + N Y + + IG P + L +DTGSD+TW QC PC C
Sbjct: 61 FLNRFRSGSSVVFPVHGNVYPVGFYNVTINIGYPPRPYFLDIDTGSDLTWLQCDAPCSRC 120
Query: 167 FQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKECPFNIQYADGSGSGGFWA 226
Q P + S +PC C + ++ + +C + ++YAD S G
Sbjct: 121 SQTPHPLYRPSND----LVPCRHPLCASVHQTDNYECEVEHQCDYEVEYADHYSSLGVLV 176
Query: 227 TDRITIQEANSNGYFTRYPFLLGCINNSSGDKSG---ASGIMGLDRSPVSIITRTN---- 279
D + +NG + LGC + S G++GL R S+I++ N
Sbjct: 177 NDVYVLNF--TNGVQLKVRMALGCGYDQIFPDSSYHPVDGMLGLGRGKSSLISQLNGQGL 234
Query: 280 -TSYFSYCLPSPYGSTGYITFGKT-DTVNTLRSAFHKRMKKYKKAKGLEDLL 329
+ +CL + G GYI FG D+ + R K+ A E +L
Sbjct: 235 VRNVVGHCLSAQGG--GYIFFGDVYDSSRLAWTPMSSRDYKHYSAGAAELVL 284
>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
Length = 383
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 76/154 (49%), Gaps = 13/154 (8%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
YY+ + IG P + L +D+GSD+TW QC PC C + P + +KSK +PC
Sbjct: 66 YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKSKL---VPCVHR 122
Query: 191 SCRILRESFPFGN-CNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C L + C+S ++C + I+YAD S G D ++ +NG R
Sbjct: 123 LCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRL--TNGSVARPSVA 180
Query: 248 LGCINNS---SGDKSGAS-GIMGLDRSPVSIITR 277
GC + SGD S + G++GL VS++++
Sbjct: 181 FGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQ 214
>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
Length = 433
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 76/154 (49%), Gaps = 13/154 (8%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
YY+ + IG P + L +D+GSD+TW QC PC C + P + +KSK +PC
Sbjct: 66 YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKSKL---VPCVHR 122
Query: 191 SCRILRESFPFGN-CNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C L + C+S ++C + I+YAD S G D ++ +NG R
Sbjct: 123 LCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRL--TNGSVARPSVA 180
Query: 248 LGCINNS---SGDKSGAS-GIMGLDRSPVSIITR 277
GC + SGD S + G++GL VS++++
Sbjct: 181 FGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQ 214
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 49/178 (27%), Positives = 78/178 (43%), Gaps = 18/178 (10%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRD-----PFFYASKSKTFFKIP 186
YY + IG P + +DTGSD+ W C C +C ++ D + S T I
Sbjct: 73 YYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLIT 132
Query: 187 CNSTSCRILRESFPFGNCNSK-ECPFNIQYADGSGSGGFWATDRITIQEA--NSNGYFTR 243
C+ C ++ P C C + + Y DGS + G++ D I +Q A N T
Sbjct: 133 CDQPFCSATYDA-PIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSETN 191
Query: 244 YPFLLGCINNSSGDKSGAS----GIMGLDRSPVSIITRTNTS-----YFSYCLPSPYG 292
+ GC SG+ +S GI+G ++ S+I++ + F++CL S G
Sbjct: 192 GSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISG 249
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 57/191 (29%), Positives = 87/191 (45%), Gaps = 23/191 (12%)
Query: 96 LHLKNSRRLRKPFPEFLKRTEAF-TFPANI---NDTVADEYYIV-VAIGEPKQYVSLLLD 150
L NS R L+R+E+ T A + +D + YY + IG P Q +L++D
Sbjct: 51 LSAPNSSRTLSHSRRHLQRSESHSTATARMPLYDDLIPYGYYTTRIWIGTPPQTFALIVD 110
Query: 151 TGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSK--E 208
TGS +T+ C C C + +DP F S T+ + C S C C+S+
Sbjct: 111 TGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKC-SMEC----------TCDSEMMH 159
Query: 209 CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD--KSGASGIMG 266
C ++ QYA+ S S G D ++ + + + GC N +GD A GIMG
Sbjct: 160 CVYDRQYAEMSSSSGVLGEDIVSFGKQSE---LKPQRTVFGCENVETGDIYSQRADGIMG 216
Query: 267 LDRSPVSIITR 277
L R +SI+ +
Sbjct: 217 LGRGDLSIVDQ 227
>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
gi|194692946|gb|ACF80557.1| unknown [Zea mays]
Length = 424
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 76/154 (49%), Gaps = 13/154 (8%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQC-KPCIHCFQQRDPFFYASKSKTFFKIPCNST 190
YY+ + IG P + L +D+GSD+TW QC PC C + P + +KSK +PC
Sbjct: 57 YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPTKSKL---VPCVHR 113
Query: 191 SCRILRESFPFGN-CNS--KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFL 247
C L + C+S ++C + I+YAD S G D ++ +NG R
Sbjct: 114 LCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRL--TNGSVARPSVA 171
Query: 248 LGCINNS---SGDKSGAS-GIMGLDRSPVSIITR 277
GC + SGD S + G++GL VS++++
Sbjct: 172 FGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQ 205
>gi|125553570|gb|EAY99279.1| hypothetical protein OsI_21243 [Oryza sativa Indica Group]
gi|125605796|gb|EAZ44832.1| hypothetical protein OsJ_29469 [Oryza sativa Japonica Group]
Length = 534
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 64/277 (23%), Positives = 109/277 (39%), Gaps = 59/277 (21%)
Query: 81 HAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFP--ANINDTVADEYYIVVAI 138
H P E R+ + L K+ RR+ + PE + +T+ F P + +N Y +VV I
Sbjct: 56 HVPESGEERREHFRALMAKDMRRMMRQVPELMSKTDMFELPMRSALNIAQVGMYVVVVRI 115
Query: 139 GEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDP-------------------------- 172
G P SL L+T ++VTW C+ + + + P
Sbjct: 116 GTPALPYSLALETANEVTWINCR--LRRRKGKHPGRPHVPPAATTMSIQVDDDGGGGGSG 173
Query: 173 -----------FFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKE----CPFNIQYAD 217
++ +KS ++ + C+ +C L P+ C S + C + D
Sbjct: 174 GKSKVTKVIMNWYRPAKSSSWRRFRCSQRACMDL----PYNTCESPDQNTSCTYYQVMKD 229
Query: 218 GSGSGGFWATDRITIQEANSNGYFTRYP-FLLGCIN-NSSGDKSGASGIMGLDRSPVSI- 274
+ + G + ++ T+ A S+G + P ++GC G + GI+ L SP S
Sbjct: 230 STITSGIYGQEKATV--AVSDGTMKKLPGLVIGCSTFEHGGAVNSHDGILSLGNSPSSFG 287
Query: 275 --ITRTNTSYFSYCL---PSPYGSTGYITFGKTDTVN 306
R S+CL S ++ Y+TFG V
Sbjct: 288 IAAARRFGGRLSFCLLATTSGRNASSYLTFGANPAVQ 324
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 49/178 (27%), Positives = 78/178 (43%), Gaps = 18/178 (10%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRD-----PFFYASKSKTFFKIP 186
YY + IG P + +DTGSD+ W C C +C ++ D + S T I
Sbjct: 73 YYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLIT 132
Query: 187 CNSTSCRILRESFPFGNCNSK-ECPFNIQYADGSGSGGFWATDRITIQEA--NSNGYFTR 243
C+ C ++ P C C + + Y DGS + G++ D I +Q A N T
Sbjct: 133 CDQPFCSATYDA-PIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSETN 191
Query: 244 YPFLLGCINNSSGDKSGAS----GIMGLDRSPVSIITRTNTS-----YFSYCLPSPYG 292
+ GC SG+ +S GI+G ++ S+I++ + F++CL S G
Sbjct: 192 GSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISG 249
>gi|226492334|ref|NP_001147965.1| pepsin A precursor [Zea mays]
gi|195614874|gb|ACG29267.1| pepsin A [Zea mays]
Length = 538
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 59/246 (23%), Positives = 100/246 (40%), Gaps = 44/246 (17%)
Query: 94 QRLHLKNSRRLRKPFPEFLKRTEAFTFP--ANINDTVADEYYIVVAIGEPKQYVSLLLDT 151
+R K S +L PE + T F P + +N Y + V G P +L+LDT
Sbjct: 91 RRRQAKESSKL----PEVMSATSMFELPMRSALNIAHVGMYLVSVRFGTPALPYNLVLDT 146
Query: 152 GSDVTWTQCK--------------------PCIHCFQQRDPFFYASKSKTFFKIPCNSTS 191
+D+TW C+ +R ++ +KS ++ +I C+
Sbjct: 147 ANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAAKEARRKNWYRPAKSSSWRRIRCSQKE 206
Query: 192 CRILRESFPFGNCNS----KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-F 246
C +L P+ C S + C + Q DG+ + G + ++ T+ S+G + P
Sbjct: 207 CALL----PYNTCQSPSKAESCSYYQQMQDGTLTMGIYGKEKATV--TVSDGRMAKLPGL 260
Query: 247 LLGC-INNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGS---TGYITF 299
+LGC + + G G++ L +S + FS+CL S S + Y+TF
Sbjct: 261 ILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVHAAKRFGQRFSFCLLSANSSRDASSYLTF 320
Query: 300 GKTDTV 305
G V
Sbjct: 321 GPNPAV 326
>gi|414587774|tpg|DAA38345.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 520
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 52/191 (27%), Positives = 86/191 (45%), Gaps = 27/191 (14%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC----------FQQRDPFFYASKSKT 181
+Y +V +G P Q + LDTGSD+ W C+ C C FQ F+ S T
Sbjct: 109 HYALVTVGTPGQTFMVALDTGSDLFWLPCQ-CDGCTPPATAASGSFQAT--FYIPGMSST 165
Query: 182 FFKIPCNSTSCRILRESFPFGNCNSK-ECPFNIQYAD-GSGSGGFWATDRITIQEANSNG 239
+PCNS C + +E C++ +CP+ + Y G+ S GF D + + N++
Sbjct: 166 SKAVPCNSNFCDLQKE------CSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHP 219
Query: 240 YFTRYPFLLGCINNSSG---DKSGASGIMGLDRSPV---SIITRTNTSYFSYCLPSPYGS 293
+ +LGC +G D + +G+ GL V SI+ + + S+ +
Sbjct: 220 QILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDG 279
Query: 294 TGYITFGKTDT 304
G I+FG ++
Sbjct: 280 IGRISFGDQES 290
>gi|115465837|ref|NP_001056518.1| Os05g0596000 [Oryza sativa Japonica Group]
gi|55733881|gb|AAV59388.1| unknown protein [Oryza sativa Japonica Group]
gi|57900669|gb|AAW57794.1| unknown protein [Oryza sativa Japonica Group]
gi|113580069|dbj|BAF18432.1| Os05g0596000 [Oryza sativa Japonica Group]
gi|215697162|dbj|BAG91156.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215768162|dbj|BAH00391.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 535
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 64/277 (23%), Positives = 109/277 (39%), Gaps = 59/277 (21%)
Query: 81 HAPSLEEILRQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFP--ANINDTVADEYYIVVAI 138
H P E R+ + L K+ RR+ + PE + +T+ F P + +N Y +VV I
Sbjct: 57 HVPESGEERREHFRALMAKDMRRMMRQVPELMSKTDMFELPMRSALNIAQVGMYVVVVRI 116
Query: 139 GEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDP-------------------------- 172
G P SL L+T ++VTW C+ + + + P
Sbjct: 117 GTPALPYSLALETANEVTWINCR--LRRRKGKHPGRPHVPPAATTMSIQVDDDGGGGGSG 174
Query: 173 -----------FFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSKE----CPFNIQYAD 217
++ +KS ++ + C+ +C L P+ C S + C + D
Sbjct: 175 GKSKVTKVIMNWYRPAKSSSWRRFRCSQRACMDL----PYNTCESPDQNTSCTYYQVMKD 230
Query: 218 GSGSGGFWATDRITIQEANSNGYFTRYP-FLLGCIN-NSSGDKSGASGIMGLDRSPVSI- 274
+ + G + ++ T+ A S+G + P ++GC G + GI+ L SP S
Sbjct: 231 STITSGIYGQEKATV--AVSDGTMKKLPGLVIGCSTFEHGGAVNSHDGILSLGNSPSSFG 288
Query: 275 --ITRTNTSYFSYCL---PSPYGSTGYITFGKTDTVN 306
R S+CL S ++ Y+TFG V
Sbjct: 289 IAAARRFGGRLSFCLLATTSGRNASSYLTFGANPAVQ 325
>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
Length = 437
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 66/224 (29%), Positives = 94/224 (41%), Gaps = 26/224 (11%)
Query: 90 RQDQQRLHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLL 149
+ Q L N RR R FL + +FP N + YY + +G P Q + +++
Sbjct: 49 KHHLQHLVEHNDRRGR-----FL---QGISFPLKGNYSDLGLYYTEIGLGNPVQKLKVIV 100
Query: 150 DTGSDVTWTQCKPCIHCFQQRD-----PFFYASKSKTFFKIPCNSTSCRILRESFPFGNC 204
DTGSD+ W +C PC C ++D + S S T C+ C +
Sbjct: 101 DTGSDILWVKCSPCRSCLSKQDIIPPLSIYNLSASSTSSVSSCSDPLCTGEQAVCSRSGS 160
Query: 205 NSKECPFNIQYADGSGSGGFWATDRI--TIQEANSNGYFTRYPFLLGCINNSSGDKSGAS 262
NS C + I Y D S S G + D + +Q N+ T GC N +G A
Sbjct: 161 NS-ACAYGISYQDKSTSIGAYVKDDMHYVLQGGNA----TTSHIFFGCAINITGSWP-AD 214
Query: 263 GIMGLDR----SPVSIITRTNTS-YFSYCLPSPYGSTGYITFGK 301
GIMG + P I T+ N S FS+CL G + FG+
Sbjct: 215 GIMGFGQISKTVPNQIATQRNMSRVFSHCLGGEKHGGGILEFGE 258
>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 64/234 (27%), Positives = 100/234 (42%), Gaps = 40/234 (17%)
Query: 96 LHLKNSRRLRKPFPEFLKRTEAFTFPANINDTVADE-----------------YYIVVAI 138
L S L PFP L + T P+ + A + + I
Sbjct: 13 LSFSQSNSLSLPFPLSLSEKPSNTIPSYSSQLYAKRPSSYGSFKLPFKYSSTALVVSLPI 72
Query: 139 GEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKS--------KTFFKIPCNST 190
G P Q L+LDTGS ++W QC ++R P K+ +F +PCN
Sbjct: 73 GTPPQPTDLVLDTGSQLSWIQCHD--KKIKKRLPPLPKPKTTSFDPSLSSSFSLLPCNHP 130
Query: 191 SC--RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLL 248
C RI + P ++ C ++ YADG+ + G ++ T ++ + P +L
Sbjct: 131 ICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKS-----LSTPPVIL 185
Query: 249 GCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPSPYGS--TGYITFG 300
GC S+ ++ GI+G++R +S I++ S FSYC+PS GS TG G
Sbjct: 186 GCAQASTENR----GILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLG 235
Score = 39.7 bits (91), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 47/101 (46%), Gaps = 6/101 (5%)
Query: 320 KKAKGLEDLLDTCYDLSAYETV--VVPKIAIHFLGGVDLELDVRGTLVVASVSQ--VCLG 375
KK D+ D C+D V + I+ F GV++ + RG V+ V + C+G
Sbjct: 328 KKGYVYADVADMCFDAGVTAEVGRRIGGISFEFDNGVEIFVG-RGEGVLTEVEKGVKCVG 386
Query: 376 FA-TYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ S +G V Q+ V YD+A +R+GFG CS
Sbjct: 387 IGRSERLGIGSNIIGTVHQQNMWVEYDLANKRVGFGGAECS 427
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 57/191 (29%), Positives = 87/191 (45%), Gaps = 23/191 (12%)
Query: 96 LHLKNSRRLRKPFPEFLKRTEAF-TFPANI---NDTVADEYYIV-VAIGEPKQYVSLLLD 150
L NS R L+R+E+ T A + +D + YY + IG P Q +L++D
Sbjct: 51 LSAPNSSRTLSHSRRHLQRSESHSTATARMPLYDDLIPYGYYTTRIWIGTPPQTFALIVD 110
Query: 151 TGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNSK--E 208
TGS +T+ C C C + +DP F S T+ + C S C C+S+
Sbjct: 111 TGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKC-SMEC----------TCDSEMMH 159
Query: 209 CPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGD--KSGASGIMG 266
C ++ QYA+ S S G D ++ + + + GC N +GD A GIMG
Sbjct: 160 CVYDRQYAEMSSSSGVLGEDIVSFGKQSE---LKPQRTVFGCENVETGDIYSQRADGIMG 216
Query: 267 LDRSPVSIITR 277
L R +SI+ +
Sbjct: 217 LGRGDLSIVDQ 227
>gi|18402471|ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15 [Arabidopsis thaliana]
gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein [Arabidopsis thaliana]
gi|14532748|gb|AAK64075.1| unknown protein [Arabidopsis thaliana]
gi|332194267|gb|AEE32388.1| aspartyl protease [Arabidopsis thaliana]
Length = 583
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 52/194 (26%), Positives = 85/194 (43%), Gaps = 18/194 (9%)
Query: 120 FPANINDTVADEYYIVVAIGEPK--QYVSLLLDTGSDVTWTQCK-PCIHCFQQRDPFFYA 176
FP N YY + +G+P+ QY L +DTGS++TW QC PC C + + +
Sbjct: 191 FPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKP 250
Query: 177 SKSKTFFKIPCNSTSCRILRESFPFGNC-NSKECPFNIQYADGSGSGGFWATDRITIQEA 235
K + + C ++ + +C N +C + I+YAD S S G D+ ++
Sbjct: 251 RKDNL---VRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKL- 306
Query: 236 NSNGYFTRYPFLLGCINNSSG----DKSGASGIMGLDRSPVSIITRTN-----TSYFSYC 286
NG + GC + G GI+GL R+ +S+ ++ ++ +C
Sbjct: 307 -HNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHC 365
Query: 287 LPSPYGSTGYITFG 300
L S GYI G
Sbjct: 366 LASDLNGEGYIFMG 379
Score = 39.3 bits (90), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 18/53 (33%), Positives = 31/53 (58%), Gaps = 2/53 (3%)
Query: 364 LVVASVSQVCLGF--ATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNC 414
L++++ VCLG + D ++I LG++ RGH + YD RR+G+ +C
Sbjct: 516 LIISNKGNVCLGILDGSSVHDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDC 568
>gi|356554625|ref|XP_003545645.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 452
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 56/212 (26%), Positives = 93/212 (43%), Gaps = 19/212 (8%)
Query: 99 KNSRRLRKPFPEFLKRTEAFTFPANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWT 158
+N+++L L + F N+ Y + + IG P + L +D+GSD+TW
Sbjct: 33 RNAKKLSSDNHHRLSSSAVFKVQGNVYPL--GHYTVSLNIGYPPKLYDLDIDSGSDLTWV 90
Query: 159 QCK-PCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCNS--KECPFNIQY 215
QC PC C + RD + + + + C C ++ S + C S +C + ++Y
Sbjct: 91 QCDAPCKGCTKPRDQLYKPNHN----LVQCVDQLCSEVQLSMEY-TCASPDDQCDYEVEY 145
Query: 216 ADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCINNSSGDKS--GASGIMGLDRSPVS 273
AD S G D I Q N + R F G SG S SG++GL S
Sbjct: 146 ADHGSSLGVLVRDYIPFQFTNGSVVRPRVAFGCGYDQKYSGSNSPPATSGVLGLGNGRAS 205
Query: 274 IITRTNT-----SYFSYCLPSPYGSTGYITFG 300
I+++ ++ + +CL + G G++ FG
Sbjct: 206 ILSQLHSLGLIHNVVGHCLSARGG--GFLFFG 235
>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
Length = 506
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 54/186 (29%), Positives = 81/186 (43%), Gaps = 19/186 (10%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYAS---KSKTFFKIPCN 188
YY + +G P Q+++ ++DTGSD+ W +CK C C +++ +S + P
Sbjct: 88 YYAQIGVGHPVQFLNAIVDTGSDILWFKCKLCQGCSSKKNVIVCSSIIMQGPITLYDPEL 147
Query: 189 STSCRILRESFPF----GNC--NSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFT 242
S + S P G+C N+ C ++I Y D S S G + D + + S
Sbjct: 148 SITASPATCSDPLCSEGGSCRGNNNSCAYDISYEDTSSSTGIYFRDVVHLGHKAS----L 203
Query: 243 RYPFLLGCINNSSGDKSGASGIMGLDRSPVSI---ITRTNTSY--FSYCLPSPYGSTGYI 297
LGC + SG GIMG RS VS+ + SY F +CL G +
Sbjct: 204 NTTMFLGCATSISGLWP-VDGIMGFGRSKVSVPNQLAAQAGSYNIFYHCLSGEKEGGGIL 262
Query: 298 TFGKTD 303
GK D
Sbjct: 263 VLGKND 268
>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 445
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 75/158 (47%), Gaps = 15/158 (9%)
Query: 134 IVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHCFQQRDPFFYASKSKTFFKIPCNSTSC- 192
+ + IG P Q ++LDTGS ++W QC H F S S +F+ +PC C
Sbjct: 90 VTLPIGTPPQPQQMVLDTGSQLSWIQC----HNKTPPTASFDPSLSSSFYVLPCTHPLCK 145
Query: 193 -RILRESFPFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYPFLLGCI 251
R+ + P ++ C ++ YADG+ + G +++ + T P +LGC
Sbjct: 146 PRVPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLAFSPSQ-----TTPPLILGC- 199
Query: 252 NNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCLPS 289
S + A GI+G++ +S + + FSYC+P+
Sbjct: 200 ---SSESRDARGILGMNLGRLSFPFQAKVTKFSYCVPT 234
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 46/100 (46%), Gaps = 2/100 (2%)
Query: 318 KYKKAKGLEDLLDTCYDLSAYET-VVVPKIAIHFLGGVDLELDVRGTLVVASVSQVCLGF 376
+ KK + D C+D +A E ++ +A F GV++ + L C+G
Sbjct: 342 RVKKGYVYGGVADMCFDGNAMEIGRLLGDVAFEFEKGVEIVVPKERVLADVGGGVHCVGI 401
Query: 377 A-TYPPDPNSITLGNVQQRGHEVHYDVAGRRLGFGPGNCS 415
+ S +GN Q+ V +D+A RR+GFG +CS
Sbjct: 402 GRSERLGAASNIIGNFHQQNLWVEFDLANRRIGFGVADCS 441
>gi|308080924|ref|NP_001183009.1| uncharacterized protein LOC100501329 [Zea mays]
gi|238008766|gb|ACR35418.1| unknown [Zea mays]
Length = 205
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 63/125 (50%), Gaps = 15/125 (12%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCK-PCIHCFQQRDPFFYASKSKTFFKIPCNS 189
+YY + IG P + L +DTGSD+TW QC PC +C + P + +K K +P
Sbjct: 89 QYYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKI---VPPRD 145
Query: 190 TSCRILRESFPFGNCN----SKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP 245
C+ L+ GN N K+C + I+YAD S S G A D + + +NG +
Sbjct: 146 LLCQELQ-----GNQNYCETCKQCDYEIEYADQSSSMGVLARDDMHM--IATNGGREKLD 198
Query: 246 FLLGC 250
F+ GC
Sbjct: 199 FVFGC 203
>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
Length = 508
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 45/165 (27%), Positives = 69/165 (41%), Gaps = 22/165 (13%)
Query: 132 YYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPCIHC-----FQQRDPFFYASKSKTFFKIP 186
Y + ++G P Q V+ +LD SD W QC C C P FYA S T ++
Sbjct: 97 YVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIREVR 156
Query: 187 CNSTSCRILRESFPFGNCNSKECPFNIQYADGSGS----GGFWATDRITIQEANSNGYFT 242
C + C+ L C++ + P Y G G+ G A D ++G
Sbjct: 157 CANRGCQRLVPQ----TCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRADG--- 209
Query: 243 RYPFLLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSYFSYCL 287
+ GC + GD G++GL R +S++++ FSY L
Sbjct: 210 ---VIFGCAVATEGD---IGGVIGLGRGELSLVSQLQIGRFSYYL 248
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 55/114 (48%), Gaps = 3/114 (2%)
Query: 297 ITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKIAIHFLGGVDL 356
+TF +R A ++ + A G E LD CY + T VP +A+ F GG +
Sbjct: 327 VTFLDAGAYKVVRQAMASKIG-LRAADGSELGLDLCYTSESLATAKVPSMALVFAGGAVM 385
Query: 357 ELDVRGTLVVASVSQV-CLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRRLGF 409
EL++ + S + + CL P S+ LG++ Q G + YD++G RL F
Sbjct: 386 ELEMGNYFYMDSTTGLECLTILPSPAGDGSL-LGSLIQVGTHMIYDISGSRLVF 438
>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
gi|223975971|gb|ACN32173.1| unknown [Zea mays]
gi|224034191|gb|ACN36171.1| unknown [Zea mays]
gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
Length = 465
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 48/166 (28%), Positives = 80/166 (48%), Gaps = 14/166 (8%)
Query: 131 EYYIVVAIGEPKQYVSLLLDTGSDVTWTQCKPC-IHCFQQRDPFFYASKSKTFFKIPCNS 189
Y + +G P + +++DTGS +TW QC PC + C +Q P F S ++ + C++
Sbjct: 126 NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSA 185
Query: 190 TSCRILRESF--PFGNCNSKECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-F 246
C L + P S C + Y D S S G+ + D ++ T P F
Sbjct: 186 QQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGS-------TSVPNF 238
Query: 247 LLGCINNSSGDKSGASGIMGLDRSPVSIITRTNTSY---FSYCLPS 289
GC ++ G ++G++GL R+ +S++ + S FSYCLP+
Sbjct: 239 YYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPT 284
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 61/129 (47%), Gaps = 5/129 (3%)
Query: 287 LPSPYGSTGYITFGKTDTVNTLRSAFHKRMKKYKKAKGLEDLLDTCYDLSAYETVVVPKI 346
LP+ S IT T + L A MK +A +LDTC+ A + VP++
Sbjct: 342 LPTIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFS-ILDTCFQGQAAR-LRVPEV 399
Query: 347 AIHFLGGVDLELDVRGTLVVASVSQVCLGFATYPPDPNSITLGNVQQRGHEVHYDVAGRR 406
+ F GG L+L R LV + CL FA P ++ +GN QQ+ V YDV +
Sbjct: 400 TMAFAGGAALKLAARNLLVDVDSATTCLAFA---PARSAAIIGNTQQQTFSVVYDVKNSK 456
Query: 407 LGFGPGNCS 415
+GF G CS
Sbjct: 457 IGFAAGGCS 465
>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
Length = 538
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 94/232 (40%), Gaps = 40/232 (17%)
Query: 108 FPEFLKRTEAFTFP--ANINDTVADEYYIVVAIGEPKQYVSLLLDTGSDVTWTQCK---- 161
PE + T F P + +N Y + V G P +L+LDT +D+TW C+
Sbjct: 101 LPEVMSATSMFELPMRSALNIAHVGMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRR 160
Query: 162 ----------------PCIHCFQQRDPFFYASKSKTFFKIPCNSTSCRILRESFPFGNCN 205
+R ++ +KS ++ +I C+ C +L P+ C
Sbjct: 161 KGKHYGRTMSVGAGDDGAAAKEARRKNWYRPAKSSSWRRIRCSQKECALL----PYNTCQ 216
Query: 206 S----KECPFNIQYADGSGSGGFWATDRITIQEANSNGYFTRYP-FLLGC-INNSSGDKS 259
S + C + Q DG+ + G + ++ T+ S+G + P +LGC + + G
Sbjct: 217 SPSKAESCSYYQQMQDGTLTMGIYGKEKATV--TVSDGRMAKLPGLILGCSVLEAGGSVD 274
Query: 260 GASGIMGLDRSPVSIITRTNTSY---FSYCLPSPYGS---TGYITFGKTDTV 305
G++ L +S + FS+CL S S + Y+TFG V
Sbjct: 275 AHDGVLSLGNGEMSFAVHAAKRFGQRFSFCLLSANSSRDASSYLTFGPNPAV 326
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.137 0.424
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,906,945,276
Number of Sequences: 23463169
Number of extensions: 295948716
Number of successful extensions: 675931
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 935
Number of HSP's successfully gapped in prelim test: 1353
Number of HSP's that attempted gapping in prelim test: 670202
Number of HSP's gapped (non-prelim): 3770
length of query: 415
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 270
effective length of database: 8,957,035,862
effective search space: 2418399682740
effective search space used: 2418399682740
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 78 (34.7 bits)