BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 011042
(495 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 708 bits (1827), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/462 (76%), Positives = 400/462 (86%), Gaps = 17/462 (3%)
Query: 34 HFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSS 93
HFQ+LNV E+I K SQY ELF+ N+ + E +W L+LVHRDK+
Sbjct: 37 HFQLLNVKEAIT-----ETKASQYQELFDNQNDTLT------EGKWKLKLVHRDKI---- 81
Query: 94 NTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGS 153
T N + H H+FHAR+QRD KRVATL+RRLS A ++ + V++FG +VVSGM+QGS
Sbjct: 82 -TAFNKSSYDHSHNFHARIQRDKKRVATLIRRLSPRDATSS-YSVEEFGAEVVSGMNQGS 139
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
GEYF+RIGVGSPPR QY+VIDSGSDIVWVQCQPC+QCY Q+DPVFDPADSASF GV CSS
Sbjct: 140 GEYFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQCYHQTDPVFDPADSASFMGVPCSS 199
Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFV 273
+VC+R+ENAGCHAG CRYEV YGDGSYTKGTLALETLT GRTVV+NVAIGCGH+N+GMFV
Sbjct: 200 SVCERIENAGCHAGGCRYEVMYGDGSYTKGTLALETLTFGRTVVRNVAIGCGHRNRGMFV 259
Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVR 333
GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT S+GSL FGR A+PVGAAW+PL+R
Sbjct: 260 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTDSAGSLEFGRGAMPVGAAWIPLIR 319
Query: 334 NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRD 393
NPRAPSFYY+ LSG+GVGGM++PISED+F+L +MG+ GVVMDTGTAVTR+PT AY AFRD
Sbjct: 320 NPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVMDTGTAVTRIPTVAYVAFRD 379
Query: 394 AFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
AF+ QTGNLPRASGVSIFDTCYNL+GFVSVRVPTVSFYF+GGP+LTLPA NFLIPVDD G
Sbjct: 380 AFIGQTGNLPRASGVSIFDTCYNLNGFVSVRVPTVSFYFAGGPILTLPARNFLIPVDDVG 439
Query: 454 TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
TFCFAFA SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC
Sbjct: 440 TFCFAFAASPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 481
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 699 bits (1804), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 351/461 (76%), Positives = 399/461 (86%), Gaps = 23/461 (4%)
Query: 35 FQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSN 94
FQ LNV E+I G+R ++S+ +E +W +++VHRD++S ++
Sbjct: 42 FQHLNVKETIAGTRIIPLEVSEDHE--------------EGGEKWMMKVVHRDQLSFGNS 87
Query: 95 TTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSG 154
H+H R++RD KRVA+L+RRLS GG + V DFGTDV+SGM+QGSG
Sbjct: 88 DD-------HRHRLDGRLKRDAKRVASLIRRLSSGGG--GSYRVDDFGTDVISGMEQGSG 138
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC+QCY QSDPVFDPADSASF+GVSCSS+
Sbjct: 139 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCSSS 198
Query: 215 VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVG 274
VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLT GRT+V++VAIGCGH+N+GMFVG
Sbjct: 199 VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTFGRTMVRSVAIGCGHRNRGMFVG 258
Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRN 334
AAGLLGLGGGSMS VGQLGGQTGGAFSYCLVSRGT SSGSLVFGREALP GAAWVPLVRN
Sbjct: 259 AAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTDSSGSLVFGREALPAGAAWVPLVRN 318
Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
PRAPSFYY+GL+GLGVGG+R+PISE++FRLT++GD GVVMDTGTAVTRLPT AY+AFRDA
Sbjct: 319 PRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDA 378
Query: 395 FVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
F+AQT NLPRA+GV+IFDTCY+L GFVSVRVPTVSFYFSGGP+LTLPA NFLIP+DDAGT
Sbjct: 379 FLAQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGT 438
Query: 455 FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
FCFAFAPS SGLSI+GNIQQEGIQISFDGANG+VGFGPN+C
Sbjct: 439 FCFAFAPSTSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 479
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 667 bits (1721), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/462 (72%), Positives = 388/462 (83%), Gaps = 13/462 (2%)
Query: 34 HFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSS 93
HFQ LNV + + ++ + + Y L +H ++ + +S A++ L+LVHRDK+
Sbjct: 25 HFQQLNVKQILTETKLN--PTNTYKHL--QHQKLNIATEASSPAKYKLKLVHRDKVP--- 77
Query: 94 NTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGS 153
T N H HR + F+ARMQRD KRVA L R L+ G A+ + FG+DVVSGM+QGS
Sbjct: 78 -TFNTSHDHRTR--FNARMQRDTKRVAALRRHLAAGKPTYAE---EAFGSDVVSGMEQGS 131
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
GEYFVRIGVGSPPR+QY+VIDSGSDI+WVQC+PC+QCY QSDPVF+PADS+S++GVSC+S
Sbjct: 132 GEYFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSSYAGVSCAS 191
Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFV 273
VC ++NAGCH GRCRYEVSYGDGSYTKGTLALETLT GRT+++NVAIGCGH NQGMFV
Sbjct: 192 TVCSHVDNAGCHEGRCRYEVSYGDGSYTKGTLALETLTFGRTLIRNVAIGCGHHNQGMFV 251
Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVR 333
GAAGLLGLG G MS VGQLGGQ GG FSYCLVSRG SSG L FGREA+PVGAAWVPL+
Sbjct: 252 GAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRGIQSSGLLQFGREAVPVGAAWVPLIH 311
Query: 334 NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRD 393
NPRA SFYYVGLSGLGVGG+R+PISED+F+L+++GD GVVMDTGTAVTRLPT AYEAFRD
Sbjct: 312 NPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELGDGGVVMDTGTAVTRLPTAAYEAFRD 371
Query: 394 AFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
AF+AQT NLPRASGVSIFDTCY+L GFVSVRVPTVSFYFSGGP+LTLPA NFLIPVDD G
Sbjct: 372 AFIAQTTNLPRASGVSIFDTCYDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDDVG 431
Query: 454 TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+FCFAFAPS SGLSIIGNIQQEGI+IS DGANGFVGFGPNVC
Sbjct: 432 SFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGFVGFGPNVC 473
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 653 bits (1684), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 333/469 (71%), Positives = 382/469 (81%), Gaps = 16/469 (3%)
Query: 30 ASDTHFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKM 89
+S T FQ LNV K ++ D + L + S SD + L L+HRDK+
Sbjct: 27 SSSTKFQYLNV----KATKLDFNDGQILHALNFSDGHRQVSGYKSDNNTFKLNLLHRDKL 82
Query: 90 SSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAK---HEVQDFGTDVV 146
S H H H+ F+ RM+RD RVATLVRRLS G A K ++V +F TDV+
Sbjct: 83 S---------HVHGHRRGFNDRMKRDAIRVATLVRRLSHGAPAAVKDSRYKVANFATDVI 133
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASF 206
SGM+ GSGEYFVRIGVGSPPR+QYMVIDSGSDIVWVQC+PCS+CY+QSDPVFDPADS+SF
Sbjct: 134 SGMEAGSGEYFVRIGVGSPPRNQYMVIDSGSDIVWVQCKPCSRCYQQSDPVFDPADSSSF 193
Query: 207 SGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH 266
+GVSC S VCDRLEN GC+AGRCRYEVSYGDGSYTKGTLALETLT+G+ ++++VAIGCGH
Sbjct: 194 AGVSCGSDVCDRLENTGCNAGRCRYEVSYGDGSYTKGTLALETLTVGQVMIRDVAIGCGH 253
Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA 326
NQGMF+GAAGLLGLGGGSMS +GQLGGQTGGAFSYCLVSRGTGS+G+L FGR ALPVGA
Sbjct: 254 TNQGMFIGAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGALEFGRGALPVGA 313
Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
W+ L+RNPRAPSFYY+GL+G+GVGG+R+ + E+ F+LT+ G +GVVMDTGTAVTR PT
Sbjct: 314 TWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTNGVVMDTGTAVTRFPTA 373
Query: 387 AYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
AY AFRD+F AQT NLPRA GVSIFDTCY+L+GF SVRVPTVSFYFS GPVLTLPA NFL
Sbjct: 374 AYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPARNFL 433
Query: 447 IPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
IPVD GTFC AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN+C
Sbjct: 434 IPVDGGGTFCLAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC 482
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 649 bits (1673), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/418 (77%), Positives = 365/418 (87%), Gaps = 28/418 (6%)
Query: 78 RWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHE 137
+W +++VHRD++S ++ H+H R++RD KRVA+L+RRLS GG +
Sbjct: 132 KWMMKVVHRDQLSFGNSDD-------HRHRLDGRLKRDAKRVASLIRRLSSGGG--GSYR 182
Query: 138 VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
V DFGTDV+SGM+QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC+QCY QSDPV
Sbjct: 183 VDDFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPV 242
Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV 257
FDPADSASF+GVSCSS+VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLT GRT+V
Sbjct: 243 FDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTFGRTMV 302
Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF 317
++VAIGCGH+N+GMFVGAAGLLGLGGGSMS VGQLGGQTGGAFSYCLVS
Sbjct: 303 RSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVS----------- 351
Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
AAWVPLVRNPRAPSFYY+GL+GLGVGG+R+PISE++FRLT++GD GVVMDTG
Sbjct: 352 --------AAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTG 403
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPV 437
TAVTRLPT AY+AFRDAF+AQT NLPRA+GV+IFDTCY+L GFVSVRVPTVSFYFSGGP+
Sbjct: 404 TAVTRLPTLAYQAFRDAFLAQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFSGGPI 463
Query: 438 LTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
LTLPA NFLIP+DDAGTFCFAFAPS SGLSI+GNIQQEGIQISFDGANG+VGFGPN+C
Sbjct: 464 LTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 521
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 648 bits (1672), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/463 (73%), Positives = 389/463 (84%), Gaps = 12/463 (2%)
Query: 34 HFQILNVNESIKGSRTDHAKMSQYNELFERHNN-ISSSNTSSDEARWNLELVHRDKMSSS 92
HFQ LNV + I + +Q ++ HN ++S+ +S A++ L+LVHRDK+ +
Sbjct: 24 HFQQLNVKQIILTETKLYPNPTQPSK--HPHNKKLNSATEASSSAKYKLKLVHRDKVPTF 81
Query: 93 SNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQG 152
+ YH H+ F+ARMQRD KR A+L+RRL+ G A + FG+DVVSGM+QG
Sbjct: 82 NT------YHDHRTRFNARMQRDTKRAASLLRRLAAGKPTYA---AEAFGSDVVSGMEQG 132
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
SGEYFVRIGVGSPPR+QY+V+DSGSDI+WVQC+PC+QCY QSDPVF+PADS+SFSGVSC+
Sbjct: 133 SGEYFVRIGVGSPPRNQYVVMDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSSFSGVSCA 192
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
S VC ++NA CH GRCRYEVSYGDGSYTKGTLALET+T GRT+++NVAIGCGH NQGMF
Sbjct: 193 STVCSHVDNAACHEGRCRYEVSYGDGSYTKGTLALETITFGRTLIRNVAIGCGHHNQGMF 252
Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLV 332
VGAAGLLGLGGG MS VGQLGGQTGGAFSYCLVSRG SSG L FGREA+PVGAAWVPL+
Sbjct: 253 VGAAGLLGLGGGPMSFVGQLGGQTGGAFSYCLVSRGIESSGLLEFGREAMPVGAAWVPLI 312
Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
NPRA SFYY+GLSGLGVGG+R+ ISED+F+L+++GD GVVMDTGTAVTRLPT AYEAFR
Sbjct: 313 HNPRAQSFYYIGLSGLGVGGLRVSISEDVFKLSELGDGGVVMDTGTAVTRLPTVAYEAFR 372
Query: 393 DAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA 452
D F+AQT NLPRASGVSIFDTCY+L GFVSVRVPTVSFYFSGGP+LTLPA NFLIPVDD
Sbjct: 373 DGFIAQTTNLPRASGVSIFDTCYDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDDV 432
Query: 453 GTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
GTFCFAFAPS SGLSIIGNIQQEGIQIS DGANGFVGFGPNVC
Sbjct: 433 GTFCFAFAPSSSGLSIIGNIQQEGIQISVDGANGFVGFGPNVC 475
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 638 bits (1645), Expect = e-180, Method: Compositional matrix adjust.
Identities = 316/384 (82%), Positives = 352/384 (91%), Gaps = 2/384 (0%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
M RDVKRVA+L+ RLS G AAK+EV+DFG+DVVSGM+QGSGEYFVRIG+GSPPRSQYM
Sbjct: 1 MHRDVKRVASLIHRLSSG--SAAKYEVEDFGSDVVSGMNQGSGEYFVRIGLGSPPRSQYM 58
Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRY 231
VIDSGSDIVWVQC+PC+QCY Q+DP+FDPADSASF GVSCSSAVCDR+ENAGC++GRCRY
Sbjct: 59 VIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCDRVENAGCNSGRCRY 118
Query: 232 EVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQ 291
EVSYGDGSYTKGTLALETLT GRTVV+NVAIGCGH N+GMFVGAAGLLGLGGGSMS +GQ
Sbjct: 119 EVSYGDGSYTKGTLALETLTFGRTVVRNVAIGCGHSNRGMFVGAAGLLGLGGGSMSFMGQ 178
Query: 292 LGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVG 351
L GQTG AFSYCLVSRGT ++G L FG EA+PVGAAW+PLVRNPRAPSFYY+ L GLGVG
Sbjct: 179 LSGQTGNAFSYCLVSRGTNTNGFLEFGSEAMPVGAAWIPLVRNPRAPSFYYIRLLGLGVG 238
Query: 352 GMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF 411
R+P+SED+F+L ++G GVVMDTGTAVTR PT AYEAFR+AF+ QT NLPRASGVSIF
Sbjct: 239 DTRVPVSEDVFQLNELGSGGVVMDTGTAVTRFPTVAYEAFRNAFIEQTQNLPRASGVSIF 298
Query: 412 DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGN 471
DTCYNL GF+SVRVPTVSFYFSGGP+LT+PA+NFLIPVDDAGTFCFAFAPSPSGLSI+GN
Sbjct: 299 DTCYNLFGFLSVRVPTVSFYFSGGPILTIPANNFLIPVDDAGTFCFAFAPSPSGLSILGN 358
Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
IQQEGIQIS D AN FVGFGPN+C
Sbjct: 359 IQQEGIQISVDEANEFVGFGPNIC 382
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 633 bits (1632), Expect = e-179, Method: Compositional matrix adjust.
Identities = 315/384 (82%), Positives = 352/384 (91%), Gaps = 2/384 (0%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
MQRDVKRV +L+RR+S G A + V+DFG++VVSGMDQGSGEYFVRIGVGSPPRSQYM
Sbjct: 1 MQRDVKRVVSLIRRVSSG--STASYGVEDFGSEVVSGMDQGSGEYFVRIGVGSPPRSQYM 58
Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRY 231
VIDSGSDIVWVQC+PC+QCY Q+DP+FDPADSASF GVSCSSAVCD+++NAGC++GRCRY
Sbjct: 59 VIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCDQVDNAGCNSGRCRY 118
Query: 232 EVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQ 291
EVSYGDGS TKGTLALETLT+GRTVV+NVAIGCGH NQGMFVGAAGLLGLGGGSMS VGQ
Sbjct: 119 EVSYGDGSSTKGTLALETLTLGRTVVQNVAIGCGHMNQGMFVGAAGLLGLGGGSMSFVGQ 178
Query: 292 LGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVG 351
L + G AFSYCLVSR T S+G L FG EA+PVGAAW+PL+RNP +PS+YY+GLSGLGVG
Sbjct: 179 LSRERGNAFSYCLVSRVTNSNGFLEFGSEAMPVGAAWIPLIRNPHSPSYYYIGLSGLGVG 238
Query: 352 GMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF 411
M++PISED+F LT++G+ GVVMDTGTAVTR PT AYEAFRDAF+ QTGNLPRASGVSIF
Sbjct: 239 DMKVPISEDIFELTELGNGGVVMDTGTAVTRFPTVAYEAFRDAFIDQTGNLPRASGVSIF 298
Query: 412 DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGN 471
DTCYNL GF+SVRVPTVSFYFSGGP+LTLPA+NFLIPVDDAGTFCFAFAPSPSGLSI+GN
Sbjct: 299 DTCYNLFGFLSVRVPTVSFYFSGGPILTLPANNFLIPVDDAGTFCFAFAPSPSGLSILGN 358
Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
IQQEGIQIS DGAN FVGFGPNVC
Sbjct: 359 IQQEGIQISVDGANEFVGFGPNVC 382
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 630 bits (1626), Expect = e-178, Method: Compositional matrix adjust.
Identities = 326/440 (74%), Positives = 373/440 (84%), Gaps = 12/440 (2%)
Query: 59 ELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKR 118
E NN S+ S+ +++ L L+HRD+ S + Y H H HARM+RD R
Sbjct: 41 ETLPDFNNTHFSDDSN--SKYTLRLLHRDRFPS-------VTYRNHHHRLHARMRRDTDR 91
Query: 119 VATLVRRLSGG---GADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDS 175
V+ ++RR+SG + +++EV DFG+DVVSGMDQGSGEYFVRIGVGSPPR QYMVIDS
Sbjct: 92 VSAILRRISGKVVVASSDSRYEVNDFGSDVVSGMDQGSGEYFVRIGVGSPPRDQYMVIDS 151
Query: 176 GSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSY 235
GSD+VWVQCQPC CYKQSDPVFDPA S S++GVSC S+VCDR+EN+GCH+G CRYEV Y
Sbjct: 152 GSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMY 211
Query: 236 GDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQ 295
GDGSYTKGTLALETLT +TVV+NVA+GCGH+N+GMF+GAAGLLG+GGGSMS VGQL GQ
Sbjct: 212 GDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQ 271
Query: 296 TGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
TGGAF YCLVSRGT S+GSLVFGREALPVGA+WVPLVRNPRAPSFYYVGL GLGVGG+RI
Sbjct: 272 TGGAFGYCLVSRGTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRI 331
Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCY 415
P+ + +F LT+ GD GVVMDTGTAVTRLPT AY AFRD F +QT NLPRASGVSIFDTCY
Sbjct: 332 PLPDGVFDLTETGDGGVVMDTGTAVTRLPTGAYAAFRDGFKSQTANLPRASGVSIFDTCY 391
Query: 416 NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQE 475
+LSGFVSVRVPTVSFYF+ GPVLTLPA NFL+PVDD+GT+CFAFA SP+GLSIIGNIQQE
Sbjct: 392 DLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQE 451
Query: 476 GIQISFDGANGFVGFGPNVC 495
GIQ+SFDGANGFVGFGPNVC
Sbjct: 452 GIQVSFDGANGFVGFGPNVC 471
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 630 bits (1625), Expect = e-178, Method: Compositional matrix adjust.
Identities = 325/433 (75%), Positives = 372/433 (85%), Gaps = 11/433 (2%)
Query: 65 NNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVR 124
NN S+ SS +++ L L+HRD+ S + Y H H HARM+RD RV+ ++R
Sbjct: 47 NNTHFSDESS--SKYTLRLLHRDRFPS-------VTYRNHHHRLHARMRRDTDRVSAILR 97
Query: 125 RLSGG--GADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWV 182
R+SG + +++EV DFG+D+VSGMDQGSGEYFVRIGVGSPPR QYMVIDSGSD+VWV
Sbjct: 98 RISGKVIPSSDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWV 157
Query: 183 QCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTK 242
QCQPC CYKQSDPVFDPA S S++GVSC S+VCDR+EN+GCH+G CRYEV YGDGSYTK
Sbjct: 158 QCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTK 217
Query: 243 GTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
GTLALETLT +TVV+NVA+GCGH+N+GMF+GAAGLLG+GGGSMS VGQL GQTGGAF Y
Sbjct: 218 GTLALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGY 277
Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
CLVSRGT S+GSLVFGREALPVGA+WVPLVRNPRAPSFYYVGL GLGVGG+RIP+ + +F
Sbjct: 278 CLVSRGTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVF 337
Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS 422
LT+ GD GVVMDTGTAVTRLPT AY AFRD F +QT NLPRASGVSIFDTCY+LSGFVS
Sbjct: 338 DLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVS 397
Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
VRVPTVSFYF+ GPVLTLPA NFL+PVDD+GT+CFAFA SP+GLSIIGNIQQEGIQ+SFD
Sbjct: 398 VRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFD 457
Query: 483 GANGFVGFGPNVC 495
GANGFVGFGPNVC
Sbjct: 458 GANGFVGFGPNVC 470
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 618 bits (1594), Expect = e-174, Method: Compositional matrix adjust.
Identities = 327/467 (70%), Positives = 384/467 (82%), Gaps = 12/467 (2%)
Query: 29 AASDTHFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDK 88
AA+ Q+LNV ++IK + T +++ Q EL E + N SS +++W L+L HRDK
Sbjct: 22 AATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIFELDNNSS-QSQWKLKLFHRDK 80
Query: 89 MSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSG 148
+ + + H F R+ RD KRV++L+R L + + +V DFG+DVVSG
Sbjct: 81 LPLNFDPD-------HPRRFKERISRDSKRVSSLLRLL----SSGSDEQVTDFGSDVVSG 129
Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG 208
+QGSGEYFVRIGVGSPPRSQY+VIDSGSDIVWVQCQPCS+CY+QSDPVFDPA SA+++G
Sbjct: 130 TEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSATYAG 189
Query: 209 VSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKN 268
+SC S+VCDRL+NAGC+ GRCRYEVSYGDGSYT+GTLALETLT GR +++N+AIGCGH N
Sbjct: 190 ISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVLIRNIAIGCGHMN 249
Query: 269 QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAW 328
+GMF+GAAGLLGLGGG+MS VGQLGGQTGGAFSYCLVSRGT S+G+L FGR A+PVGAAW
Sbjct: 250 RGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGRGAMPVGAAW 309
Query: 329 VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
VPL+RNPRAPSFYYVGLSGLGVGG+R+PI E +F LT +G GVVMDTGTAVTRLP PAY
Sbjct: 310 VPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTRLPAPAY 369
Query: 389 EAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIP 448
EAFRD F+ QT NLPR+ VSIFDTCYNL+GFVSVRVPTVSFYFSGGP+LTLPA NFLIP
Sbjct: 370 EAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILTLPARNFLIP 429
Query: 449 VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
VD GTFCFAFA S SGLSIIGNIQQEGIQIS DG+NGFVGFGP +C
Sbjct: 430 VDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 476
>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 456
Score = 566 bits (1458), Expect = e-158, Method: Compositional matrix adjust.
Identities = 291/466 (62%), Positives = 345/466 (74%), Gaps = 39/466 (8%)
Query: 33 THFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSS 92
++FQ LNV +I ++ K +N + + +W +L HRD +
Sbjct: 27 SYFQHLNVENAISETKLKPLKQQNHN---------------TQQPQWKTKLFHRDNI--- 68
Query: 93 SNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ--DFGTDVVSGMD 150
N+ H+ F +R+ RD+KRV L+ RL+ + FG+DVVSG +
Sbjct: 69 -----NLKKTTHKTRFISRINRDIKRVTFLLNRLNKNTQEQQTTTATEASFGSDVVSGTE 123
Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
+GSGEYFVRIG+GSP QYMVIDSGSDIVW+QC+PC QCY Q+DP+F+PA SASF GV+
Sbjct: 124 EGSGEYFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPCDQCYNQTDPIFNPATSASFIGVA 183
Query: 211 CSSAVCDRLEN-AGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQ 269
CSS VC++L++ C GRC Y+V+YGDGSYTKGTLALET+TIGRTV+++ AIGCGH N+
Sbjct: 184 CSSNVCNQLDDDVACRKGRCGYQVAYGDGSYTKGTLALETITIGRTVIQDTAIGCGHWNE 243
Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV 329
GMFVGAAGLLGLGGG MS VGQLG QTGGAF YCLVSR A+PVGA WV
Sbjct: 244 GMFVGAAGLLGLGGGPMSFVGQLGAQTGGAFGYCLVSR-------------AMPVGAMWV 290
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
PL+ NP PSFYYV LSGL VGG+R+PISE +F+LT +G GVVMDTGTA+TRLPT AY
Sbjct: 291 PLIHNPFYPSFYYVSLSGLAVGGIRVPISEQIFQLTDIGTGGVVMDTGTAITRLPTVAYN 350
Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
AFRDAF+AQT NLPRA GVSIFDTCY+L+GFV+VRVPTVSFYFSGG +LT PA NFLIP
Sbjct: 351 AFRDAFIAQTTNLPRAPGVSIFDTCYDLNGFVTVRVPTVSFYFSGGQILTFPARNFLIPA 410
Query: 450 DDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
DD GTFCFAFAPSPSGLSIIGNIQQEGIQ+S DG NGFVGFGPNVC
Sbjct: 411 DDVGTFCFAFAPSPSGLSIIGNIQQEGIQVSIDGTNGFVGFGPNVC 456
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 516 bits (1328), Expect = e-143, Method: Compositional matrix adjust.
Identities = 274/461 (59%), Positives = 328/461 (71%), Gaps = 32/461 (6%)
Query: 51 HAKMSQYNELFERHNNISSSNTS-----------SDEARWNLELVHRDKMSSSSNTTNNM 99
HA ++ + RHN + ++ S + R + LV RD ++ S+
Sbjct: 20 HASSLRFQYIDRRHNFTAKQASTSSSSSPSSAHGSRDRRPSFALVRRDAVTGST------ 73
Query: 100 HYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFG---TDVVSGMDQGSGEY 156
Y +H+ + RD R L RLS A ++ F + VVSG+D+GSGEY
Sbjct: 74 -YPSRRHAVLDLVARDNARAEYLASRLS-----PAAYQPTGFSGSESKVVSGLDEGSGEY 127
Query: 157 FVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC 216
FVR+G+GSPP QY+V+DSGSD++WVQC+PC +CY Q+DP+FDPA SA+FS V C SAVC
Sbjct: 128 FVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPATSATFSAVPCGSAVC 187
Query: 217 DRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGA 275
L +GC +G C YEVSYGDGSYTKG LALETLT+G T V+ VAIGCGH+N+G+FVGA
Sbjct: 188 RTLRTSGCGDSGGCDYEVSYGDGSYTKGALALETLTLGGTAVEGVAIGCGHRNRGLFVGA 247
Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR-EALPVGAAWVPLVRN 334
AGLLGLG G MSLVGQLGG GGAFSYCL SRG +GSLV GR EA+P GA WVPLVRN
Sbjct: 248 AGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRG---AGSLVLGRSEAVPEGAVWVPLVRN 304
Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
P+APSFYYVGLSG+GVG R+P+ EDLF+LT+ G GVVMDTGTAVTRLP AY A RDA
Sbjct: 305 PQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVMDTGTAVTRLPQEAYAALRDA 364
Query: 395 FVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
FVA G LPRA GVS+ DTCY+LSG+ SVRVPTVSFYF G LTLPA N L+ V D G
Sbjct: 365 FVAAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFDGAATLTLPARNLLLEV-DGGI 423
Query: 455 FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+C AFAPS SG SI+GNIQQEGIQI+ D ANG++GFGP C
Sbjct: 424 YCLAFAPSSSGPSILGNIQQEGIQITVDSANGYIGFGPTTC 464
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 514 bits (1325), Expect = e-143, Method: Compositional matrix adjust.
Identities = 279/489 (57%), Positives = 336/489 (68%), Gaps = 28/489 (5%)
Query: 12 QVLLLHLLCSIITTSTSAASDTHFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSN 71
+V+L L S S AS F +N + + T A S H + +++N
Sbjct: 8 KVILFLLFVSTSVLIVSPASPPRFHYINPH-----NFTTPASSSSSASASAVHRSRNNNN 62
Query: 72 TSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGA 131
S L LVHRD +S ++ Y +H + RD RV L +RL A
Sbjct: 63 PS-------LSLVHRDAISGAT-------YPSRRHQVVGLVARDNARVEHLEKRLV---A 105
Query: 132 DAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCY 191
+ + +D ++VV G+D GSGEYFVR+GVGSPP QY+V+DSGSD++WVQC+PC QCY
Sbjct: 106 STSPYLPEDLVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCY 165
Query: 192 KQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG----RCRYEVSYGDGSYTKGTLAL 247
Q+DP+FDPA S+SFSGVSC SA+C L GC G +C Y V+YGDGSYTKG LAL
Sbjct: 166 AQTDPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELAL 225
Query: 248 ETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
ETLT+G T V+ VAIGCGH+N G+FVGAAGLLGLG G+MSLVGQLGG GG FSYCL SR
Sbjct: 226 ETLTLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASR 285
Query: 308 GTGSSGSLVFGR-EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
G G +GSLV GR EA+PVGA WVPLVRN +A SFYYVGL+G+GVGG R+P+ + LF+LT+
Sbjct: 286 GAGGAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTE 345
Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
G GVVMDTGTAVTRLP AY A R AF G LPR+ VS+ DTCY+LSG+ SVRVP
Sbjct: 346 DGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVP 405
Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
TVSFYF G VLTLPA N L+ V A FC AFAPS SG+SI+GNIQQEGIQI+ D ANG
Sbjct: 406 TVSFYFDQGAVLTLPARNLLVEVGGA-VFCLAFAPSSSGISILGNIQQEGIQITVDSANG 464
Query: 487 FVGFGPNVC 495
+VGFGPN C
Sbjct: 465 YVGFGPNTC 473
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 514 bits (1325), Expect = e-143, Method: Compositional matrix adjust.
Identities = 278/489 (56%), Positives = 336/489 (68%), Gaps = 28/489 (5%)
Query: 12 QVLLLHLLCSIITTSTSAASDTHFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSN 71
+V+L L S S AS F +N + + T A S H + +++N
Sbjct: 8 KVILFLLFVSTSVLIVSPASPPRFHYINPH-----NFTTPASSSSSASASAVHRSRNNNN 62
Query: 72 TSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGA 131
S L LVHRD +S ++ Y +H + RD RV L +RL A
Sbjct: 63 PS-------LSLVHRDAISGAT-------YPSRRHQVVGLVARDNARVEHLEKRLV---A 105
Query: 132 DAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCY 191
+ + +D ++VV G+D GSGEYFVR+GVGSPP QY+V+DSGSD++WVQC+PC QCY
Sbjct: 106 STSPYLPEDLVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCY 165
Query: 192 KQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG----RCRYEVSYGDGSYTKGTLAL 247
Q+DP+FDPA S+SFSGVSC SA+C L GC G +C Y V+YGDGSYTKG LAL
Sbjct: 166 AQTDPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELAL 225
Query: 248 ETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
ETLT+G T V+ VAIGCGH+N G+FVGAAGLLGLG G+MSL+GQLGG GG FSYCL SR
Sbjct: 226 ETLTLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLIGQLGGAAGGVFSYCLASR 285
Query: 308 GTGSSGSLVFGR-EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
G G +GSLV GR EA+PVGA WVPLVRN +A SFYYVGL+G+GVGG R+P+ + LF+LT+
Sbjct: 286 GAGGAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTE 345
Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
G GVVMDTGTAVTRLP AY A R AF G LPR+ VS+ DTCY+LSG+ SVRVP
Sbjct: 346 DGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVP 405
Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
TVSFYF G VLTLPA N L+ V A FC AFAPS SG+SI+GNIQQEGIQI+ D ANG
Sbjct: 406 TVSFYFDQGAVLTLPARNLLVEVGGA-VFCLAFAPSSSGISILGNIQQEGIQITVDSANG 464
Query: 487 FVGFGPNVC 495
+VGFGPN C
Sbjct: 465 YVGFGPNTC 473
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 511 bits (1317), Expect = e-142, Method: Compositional matrix adjust.
Identities = 267/432 (61%), Positives = 319/432 (73%), Gaps = 23/432 (5%)
Query: 74 SDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADA 133
S + R + LV RD ++ ++ Y +H+ + RD R L RLS
Sbjct: 53 SRDRRPSFALVRRDAVTGAT-------YPSPRHAVLDLVSRDNARAEYLASRLS-----P 100
Query: 134 AKHEVQDFGTD--VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCY 191
A FG++ VVSG+D+GSGEYFVR+G+GSPP QY+V+DSGSD++WVQC+PC +CY
Sbjct: 101 AYQPTDFFGSESKVVSGLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECY 160
Query: 192 KQSDPVFDPADSASFSGVSCSSAVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETL 250
Q+DP+FDPA SA+FS VSC SA+C L +GC +G C YEVSYGDGSYTKGTLALETL
Sbjct: 161 AQADPLFDPASSATFSAVSCGSAICRTLRTSGCGDSGGCEYEVSYGDGSYTKGTLALETL 220
Query: 251 TIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG-- 308
T+G T V+ VAIGCGH+N+G+FVGAAGLLGLG G MSLVGQLGG GGAFSYCL SRG
Sbjct: 221 TLGGTAVEGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGGS 280
Query: 309 ----TGSSGSLVFGR-EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
++GSLV GR EA+P GA WVPLVRNP+APSFYYVG+SG+GVG R+P+ + LF+
Sbjct: 281 GSGAADAAGSLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQ 340
Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
LT+ G GVVMDTGTAVTRLP AY A RDAFV G LPRA GVS+ DTCY+LSG+ SV
Sbjct: 341 LTEDGGGGVVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSLLDTCYDLSGYTSV 400
Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDG 483
RVPTVSFYF G LTLPA N L+ V D G +C AFAPS SGLSI+GNIQQEGIQI+ D
Sbjct: 401 RVPTVSFYFDGAATLTLPARNLLLEV-DGGIYCLAFAPSSSGLSILGNIQQEGIQITVDS 459
Query: 484 ANGFVGFGPNVC 495
ANG++GFGP C
Sbjct: 460 ANGYIGFGPATC 471
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 489 bits (1260), Expect = e-135, Method: Compositional matrix adjust.
Identities = 271/489 (55%), Positives = 327/489 (66%), Gaps = 37/489 (7%)
Query: 12 QVLLLHLLCSIITTSTSAASDTHFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSN 71
+V+L L S S AS F +N + + T A S H + +++N
Sbjct: 8 KVILFLLFVSTSVLIVSPASPPRFHYINPH-----NFTTPASSSSSASASAVHRSRNNNN 62
Query: 72 TSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGA 131
S L LVHRD +S ++ Y +H + RD RV L +RL A
Sbjct: 63 PS-------LSLVHRDAISGAT-------YPSRRHQVVGLVARDNARVEHLEKRLV---A 105
Query: 132 DAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCY 191
+ + +D ++VV G+D GSGEYFVR+GVGSPP QY+V+DSGSD++WVQC+PC QCY
Sbjct: 106 STSPYLPEDLVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCY 165
Query: 192 KQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG----RCRYEVSYGDGSYTKGTLAL 247
Q+DP+FDPA S+SFSGVSC SA+C L GC G +C Y V+YGDGSYTKG LAL
Sbjct: 166 AQTDPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELAL 225
Query: 248 ETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
ETLT+G T V+ VAIGCGH+N G+FVGAAGLLGLG G+MSLVGQLGG GG FSYCL SR
Sbjct: 226 ETLTLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASR 285
Query: 308 GTGSSGSLVFGR-EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
G G +GSLV GR EA+P R RA SFYYVGL+G+GVGG R+P+ + LF+LT+
Sbjct: 286 GAGGAGSLVLGRTEAVP---------RGRRASSFYYVGLTGIGVGGERLPLQDSLFQLTE 336
Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
G GVVMDTGTAVTRLP AY A R AF G LPR+ VS+ DTCY+LSG+ SVRVP
Sbjct: 337 DGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVP 396
Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
TVSFYF G VLTLPA N L+ V A FC AFAPS SG+SI+GNIQQEGIQI+ D ANG
Sbjct: 397 TVSFYFDQGAVLTLPARNLLVEVGGA-VFCLAFAPSSSGISILGNIQQEGIQITVDSANG 455
Query: 487 FVGFGPNVC 495
+VGFGPN C
Sbjct: 456 YVGFGPNTC 464
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 477 bits (1228), Expect = e-132, Method: Compositional matrix adjust.
Identities = 256/439 (58%), Positives = 307/439 (69%), Gaps = 31/439 (7%)
Query: 74 SDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADA 133
S ++R +L LV RD+++ S+ Y +H+ + RD R L RLS
Sbjct: 99 SRDSRPSLALVRRDEVTGST-------YPSLRHAVLDLVARDNARAEYLATRLS------ 145
Query: 134 AKHEVQDFG---TDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC 190
++ F + VVSG+D+GSGEY VR+ VGSPP QY+V+DSGSD++WVQC+PC +C
Sbjct: 146 PAYQPPGFSGSESKVVSGLDEGSGEYLVRVSVGSPPTEQYLVVDSGSDVMWVQCKPCLEC 205
Query: 191 YKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC---HAGRCRYEVSYGDGSYTKGTLAL 247
Y Q+DP+FDPA SA+FSGVSC SA+C L + C G C YEVSY DGSYTKG LAL
Sbjct: 206 YVQADPLFDPATSATFSGVSCGSAICRILPTSACGDGELGGCEYEVSYADGSYTKGALAL 265
Query: 248 ETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
ETLT+G T V+ V IGCGH+N+G+FVGAAGL+GLG G MSLVGQLGG+ GGAFSYCL SR
Sbjct: 266 ETLTLGGTAVEGVVIGCGHRNRGLFVGAAGLMGLGWGPMSLVGQLGGEVGGAFSYCLASR 325
Query: 308 GTGSSGS-------LVFGR-EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISE 359
G SG+ LV GR EA+P GA WVPLVRNPRAPSFYYVGLSG+ VG R+P+
Sbjct: 326 GGYGSGAADDDAGWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLPLQA 385
Query: 360 DLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV-AQTGNLPRASGV--SIFDTCYN 416
LF+LT+ G VVMDTGT VTRLP AY A RDAFV A G +PRA GV S+ DTCY+
Sbjct: 386 GLFQLTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCYD 445
Query: 417 LSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEG 476
LSG+ SVRVPTVSF F G L L A N L+ V D G +C AFAPS SGLSI+GN QQ G
Sbjct: 446 LSGYASVRVPTVSFCFDGDARLILAARNVLLEV-DMGIYCLAFAPSSSGLSIMGNTQQAG 504
Query: 477 IQISFDGANGFVGFGPNVC 495
IQI+ D ANG++GFGP C
Sbjct: 505 IQITVDSANGYIGFGPANC 523
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 473 bits (1216), Expect = e-130, Method: Compositional matrix adjust.
Identities = 263/488 (53%), Positives = 318/488 (65%), Gaps = 48/488 (9%)
Query: 12 QVLLLHLLCSIITTSTSAASDTHFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSN 71
+V+L L S S AS F +N + + T A S H + +++N
Sbjct: 8 KVILFLLFVSTSVLIVSPASPPRFHYINPH-----NFTTPASSSSSASASAVHRSRNNNN 62
Query: 72 TSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGA 131
S L LVHRD +S ++ Y +H + RD RV L +RL A
Sbjct: 63 PS-------LSLVHRDAISGAT-------YPSRRHQVVGLVARDNARVEHLEKRLV---A 105
Query: 132 DAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCY 191
+ + +D ++VV G+D GSGEYFVR+GVGSPP QY+V+DSGSD++WVQC+PC QCY
Sbjct: 106 STSPYLPEDLVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCY 165
Query: 192 KQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG----RCRYEVSYGDGSYTKGTLAL 247
Q+DP+FDPA S+SFSGVSC SA+C L GC G +C Y V+YGDGSYTKG LAL
Sbjct: 166 AQTDPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELAL 225
Query: 248 ETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
ETLT+G T V+ VAIGCGH+N G+FVGAAGLLGLG G+MSLVGQLGG GG FSYCL SR
Sbjct: 226 ETLTLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASR 285
Query: 308 GTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
G G +GSL A SFYYVGL+G+GVGG R+P+ + LF+LT+
Sbjct: 286 GAGGAGSL---------------------ASSFYYVGLTGIGVGGERLPLQDSLFQLTED 324
Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
G GVVMDTGTAVTRLP AY A R AF G LPR+ VS+ DTCY+LSG+ SVRVPT
Sbjct: 325 GAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPT 384
Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGF 487
VSFYF G VLTLPA N L+ V A FC AFAPS SG+SI+GNIQQEGIQI+ D ANG+
Sbjct: 385 VSFYFDQGAVLTLPARNLLVEVGGA-VFCLAFAPSSSGISILGNIQQEGIQITVDSANGY 443
Query: 488 VGFGPNVC 495
VGFGPN C
Sbjct: 444 VGFGPNTC 451
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 461 bits (1185), Expect = e-127, Method: Compositional matrix adjust.
Identities = 257/428 (60%), Positives = 317/428 (74%), Gaps = 22/428 (5%)
Query: 76 EARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAK 135
+ R +L L+HRD +S + Y +H+ RD RV L RRLS
Sbjct: 66 DGRPSLALLHRDAVSGRT-------YPSTRHAMLGLAARDGARVEYLQRRLS------PT 112
Query: 136 HEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD 195
+ G++VVSG+ +GSGEYFVR+GVGSPP QY+V+DSGSD++W+QC+PC++CY+Q+D
Sbjct: 113 TMTTEVGSEVVSGISEGSGEYFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAECYQQAD 172
Query: 196 PVFDPADSASFSGVSCSSAVCDRLE--NAGC-HAGRCRYEVSYGDGSYTKGTLALETLTI 252
P+FDPA SASF+ V C S VC L ++GC +G CRY+VSYGDGSYT+G LA+ETLT
Sbjct: 173 PLFDPAASASFTAVPCDSGVCRTLPGGSSGCADSGACRYQVSYGDGSYTQGVLAMETLTF 232
Query: 253 G-RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
G T V+ VAIGCGH+N+G+FVGAAGLLGLG G MSLVGQLGG GGAFSYCL SRG +
Sbjct: 233 GDSTPVQGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGADA 292
Query: 312 -SGSLVFGR-EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
+GSLVFGR +A+PVGA WVPL+RN + PSFYYVGL+GLGVGG R+P+ + LF LT+ G
Sbjct: 293 GAGSLVFGRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLTEDGG 352
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQT-GNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
GVVMDTGTAVTRLP AY A RDAF + G+LPRA GVS+ DTCY+LSG+ SVRVPTV
Sbjct: 353 GGVVMDTGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGVSLLDTCYDLSGYASVRVPTV 412
Query: 429 SFYFS-GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGF 487
+ YF G LTLPA N L+ + G +C AFA S SGLSI+GNIQQ+GIQI+ D ANG+
Sbjct: 413 ALYFGRDGAALTLPARNLLVEM-GGGVYCLAFAASASGLSILGNIQQQGIQITVDSANGY 471
Query: 488 VGFGPNVC 495
VGFGP+ C
Sbjct: 472 VGFGPSTC 479
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 451 bits (1160), Expect = e-124, Method: Compositional matrix adjust.
Identities = 229/428 (53%), Positives = 298/428 (69%), Gaps = 14/428 (3%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLS---------GG 129
W+++LVHRD + Y R +++R+ RV L +R+ G
Sbjct: 71 WSVQLVHRDSLLFKGAANATASYERR---LEEKLRREAARVRALEQRIERKLKLKKDPAG 127
Query: 130 GADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ 189
+ +FG++VVSGM+QGSGEYF RIG+G+P R QYMV+D+GSD+VW+QC+PC +
Sbjct: 128 SYENVAGVTAEFGSEVVSGMEQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRE 187
Query: 190 CYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALET 249
CY Q+DP+F+P+ S SFS V C SAVC +L+ CH G C YEVSYGDGSYT G+ A ET
Sbjct: 188 CYSQADPIFNPSSSVSFSTVGCDSAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATET 247
Query: 250 LTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT 309
LT G T ++NVAIGCGH N G+FVGAAGLLGLG GS+S QLG QTG AFSYCLV R +
Sbjct: 248 LTFGTTSIQNVAIGCGHDNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVDRDS 307
Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI-PISEDLFRLTQ-M 367
SSG+L FG E++P+G+ + PLV NP P+FYY+ + + VGG+ + + + FR+ +
Sbjct: 308 ESSGTLEFGPESVPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETT 367
Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
G G+++D+GTAVTRL T AY+A RDAF+A T +LPRA G+SIFDTCY+LS SV +P
Sbjct: 368 GRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPA 427
Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGF 487
V F+FS G LPA N LIP+D GTFCFAFAP+ S LSI+GNIQQ+GI++SFD AN
Sbjct: 428 VGFHFSNGAGFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSL 487
Query: 488 VGFGPNVC 495
VGF + C
Sbjct: 488 VGFAIDQC 495
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 446 bits (1147), Expect = e-122, Method: Compositional matrix adjust.
Identities = 238/443 (53%), Positives = 298/443 (67%), Gaps = 14/443 (3%)
Query: 64 HNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLV 123
+ T + W++++VHRD + Y R ++RD +RV L
Sbjct: 99 RDEYEKRETKPRQTPWSVQVVHRDSLLVKDAANATASYERR---LEETLRRDARRVRGLE 155
Query: 124 ----RRLSGGGADAAKHE-----VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVID 174
+RL A HE +FG +VVSGM QGSGEYF RIGVG+P R QYMV+D
Sbjct: 156 QRIEKRLRLNKDPAGSHENVAEVAAEFGGEVVSGMAQGSGEYFTRIGVGTPMREQYMVLD 215
Query: 175 SGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVS 234
+GSD+VW+QC+PCS+CY Q DP+F+P+ SASFS + C+SAVC L+ CH G C Y+VS
Sbjct: 216 TGSDVVWIQCEPCSKCYSQVDPIFNPSLSASFSTLGCNSAVCSYLDAYNCHGGGCLYKVS 275
Query: 235 YGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGG 294
YGDGSYT G+ A E LT G T V+NVAIGCGH N G+FVGAAGLLGLG G +S QLG
Sbjct: 276 YGDGSYTIGSFATEMLTFGTTSVRNVAIGCGHDNAGLFVGAAGLLGLGAGLLSFPSQLGT 335
Query: 295 QTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMR 354
QTG AFSYCLV R + SSG+L FG E++P+G+ PL+ NP P+FYYV L + VGG
Sbjct: 336 QTGRAFSYCLVDRFSESSGTLEFGPESVPLGSILTPLLTNPSLPTFYYVPLISISVGGAL 395
Query: 355 I-PISEDLFRLTQM-GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD 412
+ + D+FR+ + G G ++D+GTAVTRL TP Y+A RDAFVA T LP+A GVSIFD
Sbjct: 396 LDSVPPDVFRIDETSGRGGFIVDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSIFD 455
Query: 413 TCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNI 472
TCY+LSG V VPTV F+FS G L LPA N++IP+D GTFCFAFAP+ S LSI+GNI
Sbjct: 456 TCYDLSGLPLVNVPTVVFHFSNGASLILPAKNYMIPMDFMGTFCFAFAPATSDLSIMGNI 515
Query: 473 QQEGIQISFDGANGFVGFGPNVC 495
QQ+GI++SFD AN VGF C
Sbjct: 516 QQQGIRVSFDTANSLVGFALRQC 538
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 443 bits (1139), Expect = e-121, Method: Compositional matrix adjust.
Identities = 232/430 (53%), Positives = 298/430 (69%), Gaps = 16/430 (3%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
W++ LVHRD M +SN N + Y R++RD RVA + RL + +
Sbjct: 59 WSIPLVHRDAMKGNSNKNNELSYAER---MQQRLKRDAARVAAINSRLELAVNGIKRSSL 115
Query: 139 ------------QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP 186
DF + VVSGMDQGSGEYF RIGVG+P R Q MV+D+GSD+ W+QC+P
Sbjct: 116 KPDSSSSFTMAESDFQSPVVSGMDQGSGEYFSRIGVGAPRRDQLMVLDTGSDVTWIQCEP 175
Query: 187 CSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTL 245
CS CY+QSDP+++PA S+S+ V C + +C +L+ +GC G C Y+VSYGDGSYT+G
Sbjct: 176 CSDCYQQSDPIYNPALSSSYKLVGCQANLCQQLDVSGCSRNGSCLYQVSYGDGSYTQGNF 235
Query: 246 ALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
A ETLT+G ++NVAIGCGH N+G+FVGAAGLLGLGGGS+S QL + G FSYCLV
Sbjct: 236 ATETLTLGGAPLQNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQLTDENGKIFSYCLV 295
Query: 306 SRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
R + SS +L FGR A+P GA P+++N R +FYYV LSG+ VGG + IS+ +F +
Sbjct: 296 DRDSESSSTLQFGRAAVPNGAVLAPMLKNSRLDTFYYVSLSGISVGGKMLSISDSVFGID 355
Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV 425
G+ GV++D+GTAVTRL T AY++ RDAF A T NLP GVS+FDTCY+LS SV V
Sbjct: 356 ASGNGGVIVDSGTAVTRLQTAAYDSLRDAFRAGTKNLPSTDGVSLFDTCYDLSSKESVDV 415
Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
PTV F+FSGG ++LPA N+L+PVD GTFCFAFAP+ S LSI+GNIQQ+GI++SFD AN
Sbjct: 416 PTVVFHFSGGGSMSLPAKNYLVPVDSMGTFCFAFAPTSSSLSIVGNIQQQGIRVSFDRAN 475
Query: 486 GFVGFGPNVC 495
VGF N C
Sbjct: 476 NQVGFAVNKC 485
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 439 bits (1129), Expect = e-120, Method: Compositional matrix adjust.
Identities = 228/434 (52%), Positives = 297/434 (68%), Gaps = 10/434 (2%)
Query: 71 NTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHAR-------MQRDVKRVATLV 123
T + W++E+VHRD + + Y R R ++R ++R TL
Sbjct: 66 ETKPRRSPWSVEVVHRDALLLKNAANATASYERRLKEKLRREAVRVRGLERQIERTLTLN 125
Query: 124 RRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQ 183
+ + A+ + DFG +VVSGM+QGSGEYF RIGVG+P R QYMV+D+GSD+ W+Q
Sbjct: 126 KDPVNRYENVAEVDA-DFGGEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQ 184
Query: 184 CQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKG 243
C+PC +CY Q+DP+F+P+ SASFS V C SAVC +L+ CH+G C YE SYGDGSY+ G
Sbjct: 185 CEPCRECYSQADPIFNPSYSASFSTVGCDSAVCSQLDAYDCHSGGCLYEASYGDGSYSTG 244
Query: 244 TLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC 303
+ A ETLT G T V NVAIGCGHKN G+F+GAAGLLGLG G++S Q+G QTG FSYC
Sbjct: 245 SFATETLTFGTTSVANVAIGCGHKNVGLFIGAAGLLGLGAGALSFPNQIGTQTGHTFSYC 304
Query: 304 LVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI-PISEDLF 362
LV R + SSG L FG +++PVG+ + PL +NP P+FYY+ ++ + VGG + I ++F
Sbjct: 305 LVDRESDSSGPLQFGPKSVPVGSIFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEVF 364
Query: 363 RLTQM-GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV 421
R+ + G G ++D+GT VTRL T AY+A RDAFVA TG LPR VSIFDTCY+LSG
Sbjct: 365 RIDETSGHGGFIIDSGTVVTRLVTSAYDAVRDAFVAGTGQLPRTDAVSIFDTCYDLSGLQ 424
Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISF 481
V VPTV F+FS G L LPA N+LIP+D GTFCFAFAP+ S +SI+GN QQ+ I++SF
Sbjct: 425 FVSVPTVGFHFSNGASLILPAKNYLIPMDTVGTFCFAFAPAASSVSIMGNTQQQHIRVSF 484
Query: 482 DGANGFVGFGPNVC 495
D AN VGF + C
Sbjct: 485 DSANSLVGFAFDQC 498
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 417 bits (1071), Expect = e-114, Method: Compositional matrix adjust.
Identities = 209/349 (59%), Positives = 264/349 (75%), Gaps = 2/349 (0%)
Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG 208
M+QGSGEYF RIG+G+P R QYMV+D+GSD+VW+QC+PC +CY Q+DP+F+P+ S SFS
Sbjct: 1 MEQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFST 60
Query: 209 VSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKN 268
V C SAVC +L+ CH G C YEVSYGDGSYT G+ A ETLT G T ++NVAIGCGH N
Sbjct: 61 VGCDSAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETLTFGTTSIQNVAIGCGHDN 120
Query: 269 QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAW 328
G+FVGAAGLLGLG GS+S QLG QTG AFSYCLV R + SSG+L FG E++P+G+ +
Sbjct: 121 VGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVDRDSESSGTLEFGPESVPIGSIF 180
Query: 329 VPLVRNPRAPSFYYVGLSGLGVGGMRI-PISEDLFRLTQ-MGDDGVVMDTGTAVTRLPTP 386
PLV NP P+FYY+ + + VGG+ + + + FR+ + G G+++D+GTAVTRL T
Sbjct: 181 TPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTAVTRLQTS 240
Query: 387 AYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
AY+A RDAF+A T +LPRA G+SIFDTCY+LS SV +P V F+FS G LPA N L
Sbjct: 241 AYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVGFHFSNGAGFILPAKNCL 300
Query: 447 IPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
IP+D GTFCFAFAP+ S LSI+GNIQQ+GI++SFD AN VGF + C
Sbjct: 301 IPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 349
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 414 bits (1063), Expect = e-113, Method: Compositional matrix adjust.
Identities = 247/445 (55%), Positives = 296/445 (66%), Gaps = 28/445 (6%)
Query: 68 SSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLS 127
+ S SS R +L+L+HRD +S + + + +H+ A RD RVA L RRLS
Sbjct: 46 APSVPSSTTRRPSLQLLHRDTVSGTKHPS-------RRHAVLALASRDTARVAYLQRRLS 98
Query: 128 GGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC 187
+ ++ V+ GT V GSGEY VR+G+GSPP Q++V D+GSD++WVQC PC
Sbjct: 99 PSPSPSSTSSVESGGTIV----SHGSGEYLVRVGIGSPPLEQHLVADTGSDVIWVQCSPC 154
Query: 188 SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN-----AGCHAGRCRYEVSYGDGSYTK 242
S CY Q DP+FDPA+SASFS V C+S VC G G C Y+VSYGD SYT
Sbjct: 155 SDCYAQGDPLFDPANSASFSPVPCNSGVCRAAARYSSSSCGGGGGECEYKVSYGDKSYTN 214
Query: 243 GTLALETLTI-GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFS 301
G LALETLT+ G T V+ VA+GCGH+N+G+F AAGLLGLG G MSLVGQLGG GGAFS
Sbjct: 215 GVLALETLTLDGGTEVQGVAMGCGHENRGLFAEAAGLLGLGWGPMSLVGQLGGAAGGAFS 274
Query: 302 YCLVSRGTGSSGS---LVFGRE-ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPI 357
YCL +G LV GRE A P GA WVPLVRNP APSFYYVG++GLGV G R+ +
Sbjct: 275 YCLAGYYSGEGSGSGSLVLGREDAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVAGERLQL 334
Query: 358 SEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV-AQTGNLPRASGVSIFDTCYN 416
+ LF L G GVVMDTGTAVTRLP AY A R AF A PRA GVS+FDTCY+
Sbjct: 335 QDGLFDLGDDGGGGVVMDTGTAVTRLPAEAYAALRGAFAGAFEEGAPRAPGVSLFDTCYD 394
Query: 417 LSGFVSVRVPTVSFYFSG------GPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIG 470
LSG+ SVRVPTV+ YF G LTLPA N L+PVDD GT+C AFA SG SI+G
Sbjct: 395 LSGYASVRVPTVALYFGGGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAAVASGPSILG 454
Query: 471 NIQQEGIQISFDGANGFVGFGPNVC 495
NIQQ+GI+I+ D A+G+VGFGP C
Sbjct: 455 NIQQQGIEITVDSASGYVGFGPATC 479
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 412 bits (1060), Expect = e-112, Method: Compositional matrix adjust.
Identities = 211/411 (51%), Positives = 281/411 (68%), Gaps = 13/411 (3%)
Query: 95 TTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV---------QDFGTDV 145
T +H+ ++ +R+ RD R +L RL D +K ++ +D T V
Sbjct: 91 TIYKIHHKDYKSLVLSRLHRDTVRFNSLTARLQLALEDISKSDLKPLETEIKPEDLSTPV 150
Query: 146 VSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSAS 205
SG QGSGEYF R+GVG+P R YMV+D+GSDI W+QCQPC+ CY+Q+DP+FDP S++
Sbjct: 151 TSGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTASST 210
Query: 206 FSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGC 264
++ V+C S C LE + C +G+C Y+V+YGDGSYT G A E+++ G + VKNVA+GC
Sbjct: 211 YAPVTCQSQQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSGSVKNVALGC 270
Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
GH N+G+FVGAAGLLGLGGG +SL QL +FSYCLV+R + S +L F L V
Sbjct: 271 GHDNEGLFVGAAGLLGLGGGPLSLTNQLKAT---SFSYCLVNRDSAGSSTLDFNSAQLGV 327
Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
+ PL++N + +FYYVGLSG+ VGG + I E FRL + G+ G+++D GTA+TRL
Sbjct: 328 DSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAITRLQ 387
Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
T AY RDAFV T NL S V++FDTCY+LSG SVRVPTVSF+F+ G LPA+N
Sbjct: 388 TQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSWNLPAAN 447
Query: 445 FLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+LIPVD AGT+CFAFAP+ S LSIIGN+QQ+G +++FD AN +GF PN C
Sbjct: 448 YLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 498
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 224/510 (43%), Positives = 322/510 (63%), Gaps = 25/510 (4%)
Query: 1 MAFSQTTLLLKQVLLLHLLCSIITTSTSAASDTHFQILNVNESIKGSRTDHAKMSQYNEL 60
MAF + LL V L L + +S S ++ T +L+V S++ ++T + + L
Sbjct: 1 MAFPRFLSLLTTVTLSLFLTATDASSRSLSTSTKTTVLDVVSSLQQTQTILSLDPTRSSL 60
Query: 61 F-ERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRV 119
+ +IS + + +LEL RD + +S + ++ +R++RD RV
Sbjct: 61 TATKPESISDPVFFNSSSPLSLELHSRDTLVAS-------QHKDYKSLVLSRLERDSSRV 113
Query: 120 ATLVR--RLSGGGADAA----------KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPR 167
A + R + G D + +++ + T VVSG+ QGSGEYF RIGVG+P +
Sbjct: 114 AGIAAKIRFAVEGIDRSDLKPVNNEDTRYQPEALTTPVVSGVSQGSGEYFSRIGVGTPAK 173
Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG 227
Y+V+D+GSD+ W+QC+PCS CY+QSDPVF+P S+++ ++CS+ C LE + C +
Sbjct: 174 EMYLVLDTGSDVNWIQCEPCSDCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSN 233
Query: 228 RCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSM 286
+C Y+VSYGDGS+T G LA +T+T G + + +VA+GCGH N+G+F GAAGLLGLGGG++
Sbjct: 234 KCLYQVSYGDGSFTVGELATDTVTFGNSGKINDVALGCGHDNEGLFTGAAGLLGLGGGAL 293
Query: 287 SLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLS 346
S+ Q+ +FSYCLV R +G S SL F L G A PL+RN + +FYYVGLS
Sbjct: 294 SITNQMKAT---SFSYCLVDRDSGKSSSLDFNSVQLGSGDATAPLLRNQKIDTFYYVGLS 350
Query: 347 GLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR-A 405
G VGG ++ + + +F + G GV++D GTAVTRL T AY + RDAF+ T NL +
Sbjct: 351 GFSVGGQKVMMPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTTNLKKGT 410
Query: 406 SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG 465
S +S+FDTCY+ S SV+VPTV+F+F+GG L LPA N+LIPVDD GTFCFAFAP+ S
Sbjct: 411 SSISLFDTCYDFSSLSSVKVPTVAFHFTGGKSLDLPAKNYLIPVDDNGTFCFAFAPTSSS 470
Query: 466 LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
LSIIGN+QQ+G +I++D AN +G N C
Sbjct: 471 LSIIGNVQQQGTRITYDLANKIIGLSGNKC 500
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 227/514 (44%), Positives = 322/514 (62%), Gaps = 33/514 (6%)
Query: 1 MAFSQTTLLLKQVLLLHLLCSIITTSTSAASDTHFQILNVNESIKGSRT----DHAKMSQ 56
MAF + LL V L L + +S S ++ +L+V S++ ++T D + S
Sbjct: 1 MAFPRFLSLLAVVTLSLFLTTTDASSRSLSTPPKTNVLDVVSSLQQTQTILSLDPTRSSL 60
Query: 57 YNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFH-ARMQRD 115
E ++ N+SS +LEL RD +S H+ S +R++RD
Sbjct: 61 TTTKPESLSDPVFFNSSS---PLSLELHSRDTFVASQ--------HKDYKSLTLSRLERD 109
Query: 116 VKRVATLVR--RLSGGGADAA----------KHEVQDFGTDVVSGMDQGSGEYFVRIGVG 163
RVA +V R + G D + +++ +D T VVSG QGSGEYF RIGVG
Sbjct: 110 SSRVAGIVAKIRFAVEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVG 169
Query: 164 SPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG 223
+P + Y+V+D+GSD+ W+QC+PC+ CY+QSDPVF+P S+++ ++CS+ C LE +
Sbjct: 170 TPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSA 229
Query: 224 CHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLG 282
C + +C Y+VSYGDGS+T G LA +T+T G + + NVA+GCGH N+G+F GAAGLLGLG
Sbjct: 230 CRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLG 289
Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYY 342
GG +S+ Q+ +FSYCLV R +G S SL F L G A PL+RN + +FYY
Sbjct: 290 GGVLSITNQMKAT---SFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYY 346
Query: 343 VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL 402
VGLSG VGG ++ + + +F + G GV++D GTAVTRL T AY + RDAF+ T NL
Sbjct: 347 VGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNL 406
Query: 403 PR-ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP 461
+ +S +S+FDTCY+ S +V+VPTV+F+F+GG L LPA N+LIPVDD+GTFCFAFAP
Sbjct: 407 KKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAP 466
Query: 462 SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ S LSIIGN+QQ+G +I++D + +G N C
Sbjct: 467 TSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 227/514 (44%), Positives = 322/514 (62%), Gaps = 33/514 (6%)
Query: 1 MAFSQTTLLLKQVLLLHLLCSIITTSTSAASDTHFQILNVNESIKGSRT----DHAKMSQ 56
MAF + LL V L L + +S S ++ +L+V S++ ++T D + S
Sbjct: 1 MAFPRFLSLLAVVTLSLFLTTTDASSRSLSTPPKTNVLDVVSSLQQTQTILSLDPTRSSL 60
Query: 57 YNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFH-ARMQRD 115
E ++ N+SS +LEL RD +S H+ S +R++RD
Sbjct: 61 TTTKPESLSDPVFFNSSS---PLSLELHSRDTFVASQ--------HKDYKSLTLSRLERD 109
Query: 116 VKRVATLVR--RLSGGGADAA----------KHEVQDFGTDVVSGMDQGSGEYFVRIGVG 163
RVA +V R + G D + +++ +D T VVSG QGSGEYF RIGVG
Sbjct: 110 SSRVAGIVAKIRFAVEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVG 169
Query: 164 SPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG 223
+P + Y+V+D+GSD+ W+QC+PC+ CY+QSDPVF+P S+++ ++CS+ C LE +
Sbjct: 170 TPAKDMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSA 229
Query: 224 CHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLG 282
C + +C Y+VSYGDGS+T G LA +T+T G + + NVA+GCGH N+G+F GAAGLLGLG
Sbjct: 230 CRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLG 289
Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYY 342
GG +S+ Q+ +FSYCLV R +G S SL F L G A PL+RN + +FYY
Sbjct: 290 GGVLSITNQMKAT---SFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYY 346
Query: 343 VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL 402
VGLSG VGG ++ + + +F + G GV++D GTAVTRL T AY + RDAF+ T NL
Sbjct: 347 VGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNL 406
Query: 403 PR-ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP 461
+ +S +S+FDTCY+ S +V+VPTV+F+F+GG L LPA N+LIPVDD+GTFCFAFAP
Sbjct: 407 KKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAP 466
Query: 462 SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ S LSIIGN+QQ+G +I++D + +G N C
Sbjct: 467 TSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 220/432 (50%), Positives = 284/432 (65%), Gaps = 16/432 (3%)
Query: 68 SSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLS 127
S ++T+ A ++++L H D +S +S + F R+QRD RV +
Sbjct: 49 SPTDTAESSATFSVQLHHVDALSFNSTP---------ETLFTTRLQRDAARVEAISYLAE 99
Query: 128 GGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC 187
G K F + V+SG+ QGSGEYF RIGVG+PPR YMV+D+GSDIVW+QC PC
Sbjct: 100 TAGT--GKRVGTGFSSSVISGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPC 157
Query: 188 SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTL 245
+CY QSDPVFDP S SF+ ++C S +C RL++ GC+ + C Y+VSYGDGS+T G
Sbjct: 158 KRCYAQSDPVFDPRKSRSFASIACRSPLCHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDF 217
Query: 246 ALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
+ ETLT RT V VA+GCGH N+G+FVGAAGLLGLG G +S Q G + FSYCLV
Sbjct: 218 STETLTFRRTRVARVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLV 277
Query: 306 SRGTGSS-GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFR 363
R S S+VFG A+ A + PLV NP+ +FYYV L G+ VGG R+P I+ LF+
Sbjct: 278 DRSASSKPSSMVFGDSAVSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFK 337
Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
L Q G+ GV++D+GT+VTRL PAY AFRDAF A NL RA S+FDTC++LSG V
Sbjct: 338 LDQTGNGGVIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQFSLFDTCFDLSGKTEV 397
Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDG 483
+VPTV +F G V +LPASN+LIPVD +G FC AFA + GLSIIGNIQQ+G ++ +D
Sbjct: 398 KVPTVVLHFRGADV-SLPASNYLIPVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVVYDL 456
Query: 484 ANGFVGFGPNVC 495
A VGF P+ C
Sbjct: 457 AGSRVGFAPHGC 468
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 219/473 (46%), Positives = 302/473 (63%), Gaps = 35/473 (7%)
Query: 37 ILNVNESIKGSR---TDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSS 93
+L+V SI+ ++ + KMS +N+ T+S E +EL+ R + ++
Sbjct: 33 VLDVAASIQRTKNIFSSGPKMSPFNQ--------QEKETTSSE--LTVELLSRTSIQKTT 82
Query: 94 NTTNNMHYHRHQHSFHARMQRDVKRVATLVRRL-----SGGGADAAKHEV------QDFG 142
+T ++ +R+QRD RV +LV RL S +D E +D
Sbjct: 83 HTG-------YKSLTLSRLQRDSARVKSLVTRLDLAINSISSSDLKPLETDSEFKPEDLQ 135
Query: 143 TDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPAD 202
+ ++SG QGSGEYF R+G+G PP Y+++D+GSD+ WVQC PC+ CY+Q+DP+F+PA
Sbjct: 136 SPIISGTSQGSGEYFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQADPIFEPAS 195
Query: 203 SASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAI 262
SASFS +SC++ C L+ + C C YEVSYGDGSYT G ET+T+G V NVAI
Sbjct: 196 SASFSTLSCNTRQCRSLDVSECRNDTCLYEVSYGDGSYTVGDFVTETITLGSAPVDNVAI 255
Query: 263 GCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL 322
GCGH N+G+FVGAAGLLGLGGGS+S Q+ +FSYCLV R + S+ +L F L
Sbjct: 256 GCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAT---SFSYCLVDRDSESASTLEFN-STL 311
Query: 323 PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
P A PL+RN +FYYVGL+GL VGG + I E F++ + G+ GV++D+GTA+TR
Sbjct: 312 PPNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIVDSGTAITR 371
Query: 383 LPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPA 442
L T Y + RDAFV +T +LP +G+++FDTCY+LS +V VPTVSF+F G L LPA
Sbjct: 372 LQTDVYNSLRDAFVKRTRDLPSTNGIALFDTCYDLSSKGNVEVPTVSFHFPDGKELPLPA 431
Query: 443 SNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
N+L+P+D GTFCFAFAP+ S LSIIGN+QQ+G ++ +D N VGF PN C
Sbjct: 432 KNYLVPLDSEGTFCFAFAPTASSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 223/515 (43%), Positives = 317/515 (61%), Gaps = 33/515 (6%)
Query: 1 MAFSQTTLLLKQVLLLHLLCSIITTSTSAASDTHFQILNVNESIKGSR----TDHAKMSQ 56
MAF + LL V L L + +S S ++ +L+V S++ ++ D + S
Sbjct: 1 MAFPRFLSLLSVVTLSICLTTTDASSRSLSTSHKTTVLDVVSSLQQTQHILSVDPTRSSL 60
Query: 57 YNEL--FERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQR 114
+ F+ ++ N+SS +LEL RD + +S + ++ +R++R
Sbjct: 61 TARIPEFKPESDPVFLNSSS---PLSLELHSRDTLVAS-------QHKDYKSLVLSRLER 110
Query: 115 DVKRVATLVRR------------LSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGV 162
D RVA + + L D + + +D T VVSG QGSGEYF RIGV
Sbjct: 111 DSSRVAGIAAKIRFAVEGIDRSDLKPVDIDETRFQPEDLTTPVVSGTSQGSGEYFSRIGV 170
Query: 163 GSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA 222
G+P + Y+V+D+GSD+ W+QC PCS+CY+QSDP+FDP S++F ++CS C L+ +
Sbjct: 171 GTPAKEMYVVLDTGSDVNWIQCLPCSECYQQSDPIFDPTSSSTFKSLTCSDPKCASLDVS 230
Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGL 281
C + +C Y+VSYGDGS+T G A +T+T G + V +VA+GCGH N+G+F GAAGLLGL
Sbjct: 231 ACRSNKCLYQVSYGDGSFTVGNYATDTVTFGESGKVNDVALGCGHDNEGLFTGAAGLLGL 290
Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFY 341
GGG++S+ Q+ + +FSYCLV R + S SL F + G A PL+RN + +FY
Sbjct: 291 GGGALSMTNQIKAK---SFSYCLVDRDSAKSSSLDFNSVQIGAGDATAPLLRNSKMDTFY 347
Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
YVGLSG VGG ++ I LF + G GV++D GTAVTRL T AY + RDAFV T +
Sbjct: 348 YVGLSGFSVGGQQVSIPSSLFEVDASGAGGVILDCGTAVTRLQTQAYNSLRDAFVKLTTD 407
Query: 402 LPR-ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA 460
+ S +S+FDTCY+ S +V+VPTV+F+F+GG L LPA N+LIP+DDAGTFCFAFA
Sbjct: 408 FKKGTSPISLFDTCYDFSSLSTVKVPTVTFHFTGGKSLNLPAKNYLIPIDDAGTFCFAFA 467
Query: 461 PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
P+ S LSIIGN+QQ+G +I++D AN +G N C
Sbjct: 468 PTSSSLSIIGNVQQQGTRITYDLANNLIGLSANKC 502
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 200/358 (55%), Positives = 260/358 (72%), Gaps = 4/358 (1%)
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
+D T V SG QGSGEYF R+GVG+P R YMV+D+GSDI W+QCQPC+ CY+Q+DP+F
Sbjct: 3 EDLSTPVTSGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIF 62
Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VV 257
DP S++++ V+C S C LE + C +G+C Y+V+YGDGSYT G A E+++ G + V
Sbjct: 63 DPTASSTYAPVTCQSQQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSGSV 122
Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF 317
KNVA+GCGH N+G+FVGAAGLLGLGGG +SL QL +FSYCLV+R + S +L F
Sbjct: 123 KNVALGCGHDNEGLFVGAAGLLGLGGGPLSLTNQLKAT---SFSYCLVNRDSAGSSTLDF 179
Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
L V + PL++N + +FYYVGLSG+ VGG + I E FRL + G+ G+++D G
Sbjct: 180 NSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCG 239
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPV 437
TA+TRL T AY RDAFV T NL S V++FDTCY+LSG SVRVPTVSF+F+ G
Sbjct: 240 TAITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKS 299
Query: 438 LTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
LPA+N+LIPVD AGT+CFAFAP+ S LSIIGN+QQ+G +++FD AN +GF PN C
Sbjct: 300 WNLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 357
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 207/431 (48%), Positives = 279/431 (64%), Gaps = 24/431 (5%)
Query: 85 HRDKMSSSSNTTNNMH----YHRHQHSFH-----ARMQRDVKRVATLVRRLSGGGADAAK 135
++ ++SSS T +H + +H + +R++RD RV ++ RL +
Sbjct: 53 QQEIVTSSSQLTMELHSRTSVQKTKHPDYRSLTLSRLERDSARVKSINTRLDLAIHGLST 112
Query: 136 HEVQDFGTD-----------VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC 184
+++ TD ++SG QGSGEYF R+G+G P YMV+D+GSD+ W+QC
Sbjct: 113 SDLKPLDTDSQFRAEDLQGPIISGTSQGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQC 172
Query: 185 QPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGT 244
PC+ CY Q+DP+F+PA S S+S +SC + C L+ + C C YEVSYGDGSYT G
Sbjct: 173 APCADCYHQADPIFEPASSTSYSPLSCDTKQCQSLDVSECRNNTCLYEVSYGDGSYTVGD 232
Query: 245 LALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL 304
ET+T+G V NVAIGCGH N+G+F+GAAGLLGLGGG +S Q+ +FSYCL
Sbjct: 233 FVTETITLGSASVDNVAIGCGHNNEGLFIGAAGLLGLGGGKLSFPSQINA---SSFSYCL 289
Query: 305 VSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL 364
V R + S+ +L F LP A PL+RN +FYYVG++GL VGG + I E +F +
Sbjct: 290 VDRDSDSASTLEFNSALLP-HAITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEM 348
Query: 365 TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR 424
+ G+ G+++D+GTAVTRL T AY A RDAFV T +LP S V++FDTCY+LS SV
Sbjct: 349 DESGNGGIIIDSGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEVALFDTCYDLSRKTSVE 408
Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGA 484
VPTV+F+ +GG VL LPA+N+LIPVD GTFCFAFAP+ S LSIIGN+QQ+G ++ FD A
Sbjct: 409 VPTVTFHLAGGKVLPLPATNYLIPVDSDGTFCFAFAPTSSALSIIGNVQQQGTRVGFDLA 468
Query: 485 NGFVGFGPNVC 495
N VGF P C
Sbjct: 469 NSLVGFEPRQC 479
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 217/476 (45%), Positives = 304/476 (63%), Gaps = 32/476 (6%)
Query: 31 SDTHFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMS 90
S T ILNV +SI RT + + N+ E+ ++ SSS ++L+L R +
Sbjct: 29 STTTTSILNVADSIH--RTKYTSSFRLNQQEEQTHSASSS--------FSLQLHSRVSVR 78
Query: 91 SSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAK-----------HEVQ 139
+ + ++ AR+ RD RV +L+ RL + +K E Q
Sbjct: 79 GT-------EHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPISTMYTTEEQ 131
Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
D ++SG QGSGEYF R+G+G P R YMV+D+GSD+ W+QC PC+ CY Q++P+F+
Sbjct: 132 DIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFE 191
Query: 200 PADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKN 259
P+ S+S+ +SC + C+ LE + C C YEVSYGDGSYT G A ETLTIG T+V+N
Sbjct: 192 PSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQN 251
Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR 319
VA+GCGH N+G+FVGAAGLLGLGGG ++L QL +FSYCLV R + S+ ++ FG
Sbjct: 252 VAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTT---SFSYCLVDRDSDSASTVDFGT 308
Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
P A PL+RN + +FYY+GL+G+ VGG + I + F + + G G+++D+GTA
Sbjct: 309 SLSP-DAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTA 367
Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
VTRL T Y + RD+FV T +L +A+GV++FDTCYNLS +V VPTV+F+F GG +L
Sbjct: 368 VTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHFPGGKMLA 427
Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
LPA N++IPVD GTFC AFAP+ S L+IIGN+QQ+G +++FD AN +GF N C
Sbjct: 428 LPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 208/413 (50%), Positives = 287/413 (69%), Gaps = 16/413 (3%)
Query: 94 NTTNNMHYHRHQHSFHARMQRDVKRVATLVRRL-------SGGGADAAKHEVQ--DFGTD 144
+T + + ++ +R+ RD RV + RL S + E+Q D T
Sbjct: 88 DTIHKTPHKDYKALVLSRLHRDSSRVQAITTRLQLILNGVSKSDLKPLQTEIQPQDLSTP 147
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
V SG QGSGEYF R+GVG+P +S YMV+D+GSDI W+QCQPCS CY+QSDP+F PA S+
Sbjct: 148 VSSGTSQGSGEYFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSDPIFTPAASS 207
Query: 205 SFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIG 263
S+S ++C S C+ L+ + C G+CRY+V+YGDGS+T G ET++ G + V ++A+G
Sbjct: 208 SYSPLTCDSQQCNSLQMSSCRNGQCRYQVNYGDGSFTFGDFVTETMSFGGSGTVNSIALG 267
Query: 264 CGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP 323
CGH N+G+FVGAAGLLGLGGG +SL QL +FSYCLV+R + +S +L F + P
Sbjct: 268 CGHDNEGLFVGAAGLLGLGGGPLSLTSQLKAT---SFSYCLVNRDSAASSTLDF--NSAP 322
Query: 324 VGAAWV-PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
VG + + PL+++ + +FYYVGLSG+ VGG + I +++F+L GD GV++D GTA+TR
Sbjct: 323 VGDSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDGGVIVDCGTAITR 382
Query: 383 LPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPA 442
L + AY + RD+FV+ + +L SGV++FDTCY+LSG SV+VPTVSF+F GG LPA
Sbjct: 383 LQSEAYNSLRDSFVSMSRHLRSTSGVALFDTCYDLSGQSSVKVPTVSFHFDGGKSWDLPA 442
Query: 443 SNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+N+LIPVD AGT+CFAFAP+ S LSIIGN+QQ+G ++SFD AN VGF N C
Sbjct: 443 ANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVSFDLANNRVGFSTNKC 495
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 214/461 (46%), Positives = 293/461 (63%), Gaps = 28/461 (6%)
Query: 52 AKMSQYNELF--ERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFH 109
A + + ++F E ++ T SD + +L+L R + +S + ++
Sbjct: 37 ASIQRTQQVFAVEPKSSTPDETTVSDPSSLSLQLNSRISVMKAS-------HSDYKSLTL 89
Query: 110 ARMQRDVKRVATLVRRLSGG-----GAD----------AAKHEVQDFGTDVVSGMDQGSG 154
+R++RD RV +L R+ G D ++ +DF + +VSG QGSG
Sbjct: 90 SRLKRDSARVRSLTARIDLAIRGITGTDLEPLGNGGGGGSQFGTEDFESPIVSGASQGSG 149
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
EYF R+G+G PP YMV+D+GSD+ WVQC PC++CY+Q+DP+F+P SASF+ +SC +
Sbjct: 150 EYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPIFEPTSSASFTSLSCETE 209
Query: 215 VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVG 274
C L+ + C G C YEVSYGDGSYT G ET+T+G T + N+AIGCGH N+G+F+G
Sbjct: 210 QCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLGSTSLGNIAIGCGHNNEGLFIG 269
Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRN 334
AAGLLGLGGGS+S QL +FSYCLV R + S+ +L F P A PL RN
Sbjct: 270 AAGLLGLGGGSLSFPSQLN---ASSFSYCLVDRDSDSTSTLDFNSPITP-DAVTAPLHRN 325
Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
P +F+Y+GL+G+ VGG +PI E F++++ G+ G+++D+GTAVTRL T Y RDA
Sbjct: 326 PNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTRLQTTVYNVLRDA 385
Query: 395 FVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
FV T +L A GV++FDTCY+LS V VPTVSF+F+ G L LPA N+LIPVD GT
Sbjct: 386 FVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFANGNELPLPAKNYLIPVDSEGT 445
Query: 455 FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
FCFAFAP+ S LSI+GN QQ+G ++ FD AN VGF PN C
Sbjct: 446 FCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 223/482 (46%), Positives = 314/482 (65%), Gaps = 42/482 (8%)
Query: 36 QILNVNESIKGSRTDHAKMS--QYNELF--ERHNNISSSNTSSDEARWNLELVHRDKMSS 91
Q+L+V ++K R +K+S +++E E N+I L++VHRD +SS
Sbjct: 34 QVLDVEAALK-LRISRSKVSAQEWSETVQGEEKNSIV------------LQVVHRDSLSS 80
Query: 92 SSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRR---------------LSGGGADAAKH 136
SSNT+ + R++RD RV ++ R L+G DA +
Sbjct: 81 SSNTSL------VKEILQERLKRDAARVDSINARVQLAAMGVSKAEMKPLNGSSIDA-RF 133
Query: 137 EVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP 196
+ +DF + ++SG+ QGSGEYF R+GVG+PPR YMV+D+GSDI+W+QC PC++CY Q+DP
Sbjct: 134 DAKDFSSSIISGLAQGSGEYFTRLGVGTPPRYTYMVLDTGSDIMWIQCLPCAKCYGQTDP 193
Query: 197 VFDPADSASFSGVSCSSAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT 255
+F+PA S+++ V C++ +C +L+ +GC R C Y+VSYGDGS+T G + ETLT
Sbjct: 194 LFNPAASSTYRKVPCATPLCKKLDISGCRNKRYCEYQVSYGDGSFTVGDFSTETLTFRGQ 253
Query: 256 VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR-GTGSSGS 314
V++ VA+GCGH N+G+F+GAAGLLGLG GS+S Q G Q FSYCLV R +G++ S
Sbjct: 254 VIRRVALGCGHDNEGLFIGAAGLLGLGRGSLSFPSQTGAQFSKRFSYCLVDRSASGTASS 313
Query: 315 LVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI-PISEDLFRLTQMGDDGVV 373
L+FG+ A+P A + PL+ NP+ +FYYV L G+ VGG R+ I +FR+ G+ GV+
Sbjct: 314 LIFGKAAIPKSAIFTPLLSNPKLDTFYYVELVGISVGGRRLTSIPASVFRMDATGNGGVI 373
Query: 374 MDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFS 433
+D+GT+VTRL AY RDAF TGNL A G S+FDTCY+LSG +V+VPT+ F+F
Sbjct: 374 IDSGTSVTRLVDSAYSTMRDAFRVGTGNLKSAGGFSLFDTCYDLSGLKTVKVPTLVFHFQ 433
Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
GG ++LPA+N+LIPVD + TFCFAFA + GLSIIGNIQQ+G ++ FD VGF
Sbjct: 434 GGAHISLPATNYLIPVDSSATFCFAFAGNTGGLSIIGNIQQQGYRVVFDSLANRVGFKAG 493
Query: 494 VC 495
C
Sbjct: 494 SC 495
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 214/461 (46%), Positives = 292/461 (63%), Gaps = 28/461 (6%)
Query: 52 AKMSQYNELF--ERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFH 109
A + + ++F E ++ T SD + +L+L R + +S + ++
Sbjct: 37 ASIQRTQQVFAVEPKSSTPDETTVSDPSSLSLQLNSRISVMKAS-------HSDYKSLTL 89
Query: 110 ARMQRDVKRVATLVRRLSGG-----GAD----------AAKHEVQDFGTDVVSGMDQGSG 154
+R++RD RV +L R+ G D ++ +DF + +VSG QGSG
Sbjct: 90 SRLKRDSARVRSLTARIDLAIRGITGTDLEPLGNGGGGGSQFGTEDFESPIVSGASQGSG 149
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
EYF R+G+G PP YMV+D+GSD+ WVQC PC++CY+Q+DP F+P SASF+ +SC +
Sbjct: 150 EYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPXFEPTSSASFTSLSCETE 209
Query: 215 VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVG 274
C L+ + C G C YEVSYGDGSYT G ET+T+G T + N+AIGCGH N+G+F+G
Sbjct: 210 QCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLGSTSLGNIAIGCGHNNEGLFIG 269
Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRN 334
AAGLLGLGGGS+S QL +FSYCLV R + S+ +L F P A PL RN
Sbjct: 270 AAGLLGLGGGSLSFPSQLN---ASSFSYCLVDRDSDSTSTLDFNSPITP-DAVTAPLHRN 325
Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
P +F+Y+GL+G+ VGG +PI E F++++ G+ G+++D+GTAVTRL T Y RDA
Sbjct: 326 PNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTRLQTTVYNVLRDA 385
Query: 395 FVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
FV T +L A GV++FDTCY+LS V VPTVSF+F+ G L LPA N+LIPVD GT
Sbjct: 386 FVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFANGNELPLPAKNYLIPVDSEGT 445
Query: 455 FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
FCFAFAP+ S LSI+GN QQ+G ++ FD AN VGF PN C
Sbjct: 446 FCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 390 bits (1002), Expect = e-106, Method: Compositional matrix adjust.
Identities = 212/471 (45%), Positives = 306/471 (64%), Gaps = 33/471 (7%)
Query: 37 ILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTT 96
ILNV +SI RT + + N+ E+ ++ SSS ++L+L R + +
Sbjct: 37 ILNVADSIH--RTKYTSSFRLNQQEEQTHSRSSS--------FSLQLHSRVSVRGT---- 82
Query: 97 NNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ------------DFGTD 144
+ ++ AR+ RD RV +L+ RL + +K +++ D
Sbjct: 83 ---EHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPVTTMYTTTEEEDIEAP 139
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
++SG QGSGEYF R+G+G+P R YMV+D+GSD+ W+QC PC+ CY Q++P+F+P+ S+
Sbjct: 140 LISGTTQGSGEYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSS 199
Query: 205 SFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC 264
S+ +SC + C+ LE + C C YEVSYGDGSYT G A ETLTIG T+V+NVA+GC
Sbjct: 200 SYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQNVAVGC 259
Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
GH N+G+FVGAAGLLGLGGG ++L QL +FSYCLV R + S+ ++ FG +LP
Sbjct: 260 GHSNEGLFVGAAGLLGLGGGLLALPSQLNTT---SFSYCLVDRDSDSASTVEFG-TSLPP 315
Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
A PL+RN + +FYY+GL+G+ VGG + I + F + + G G+++D+GTAVTRL
Sbjct: 316 DAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQ 375
Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
T Y + RD+F+ T +L +A+GV++FDTCYNLS ++ VPTV+F+F GG +L LPA N
Sbjct: 376 TGIYNSLRDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIEVPTVAFHFPGGKMLALPAKN 435
Query: 445 FLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
++IPVD GTFC AFAP+ S L+IIGN+QQ+G +++FD AN +GF N C
Sbjct: 436 YMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 486
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 390 bits (1002), Expect = e-106, Method: Compositional matrix adjust.
Identities = 217/432 (50%), Positives = 282/432 (65%), Gaps = 26/432 (6%)
Query: 68 SSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLS 127
S S SS +A L+L H D +S + T+ F+ R+ RD RV L R +
Sbjct: 43 SQSLQSSPDAPLTLDLHHLDSLSLNKTPTD---------LFNLRLHRDTLRVHALNSRAA 93
Query: 128 GGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC 187
G F + VVSG+ QGSGEYF R+GVG+PPR YMV+D+GSD+VW+QC PC
Sbjct: 94 G------------FSSSVVSGLSQGSGEYFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPC 141
Query: 188 SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTL 245
+CY QSDP+F+P S SF+G+ CSS +C RL+++GC R C Y+VSYGDGS+T G
Sbjct: 142 RKCYSQSDPIFNPYKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDF 201
Query: 246 ALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
A ETLT + VA+GCGH N+G+FVGAAGLLGLG G +S Q G + FSYCLV
Sbjct: 202 ATETLTFRGNKIAKVALGCGHHNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLV 261
Query: 306 SRGTGSS-GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFR 363
R S S+VFG A+ A + PL+RNP+ +FYYVGL G+ VGG+R+ +S LF+
Sbjct: 262 DRSASSKPSSMVFGDAAISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFK 321
Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
L G+ GV++D+GT+VTRL PAY A RDAF +L R S+FDTCY+LSG SV
Sbjct: 322 LDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRVGARHLKRGPEFSLFDTCYDLSGQSSV 381
Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDG 483
+VPTV +F G + LPA+N+LIPVD+ G+FCFAFA + SGLSIIGNIQQ+G ++ +D
Sbjct: 382 KVPTVVLHFRGAD-MALPATNYLIPVDENGSFCFAFAGTISGLSIIGNIQQQGFRVVYDL 440
Query: 484 ANGFVGFGPNVC 495
A +GF P C
Sbjct: 441 AGSRIGFAPRGC 452
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 206/355 (58%), Positives = 250/355 (70%), Gaps = 7/355 (1%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
VVSG+ QGSGEYF R+G+GSP R YMV+D+GSD+ WVQCQPC+ CY+QSDPVFDP+ SA
Sbjct: 158 VVSGVGQGSGEYFSRVGIGSPARELYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSA 217
Query: 205 SFSGVSCSSAVCDRLENAGCH--AGRCRYEVSYGDGSYTKGTLALETLTIG-RTVVKNVA 261
S++ VSC S C L+ A C G C YEV+YGDGSYT G A ETLT+G T V NVA
Sbjct: 218 SYAAVSCDSPRCRDLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDSTPVTNVA 277
Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA 321
IGCGH N+G+FVGAAGLL LGGG +S Q+ T FSYCLV R + ++ +L FG +
Sbjct: 278 IGCGHDNEGLFVGAAGLLALGGGPLSFPSQISAST---FSYCLVDRDSPAASTLQFGADG 334
Query: 322 LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM-GDDGVVMDTGTAV 380
PLVR+PR +FYYV LSG+ VGG + I F + G GV++D+GTAV
Sbjct: 335 AEADTVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSGSGGVIVDSGTAV 394
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
TRL + AY A RDAFV T +LPR SGVS+FDTCY+LS SV VP VS F GG L L
Sbjct: 395 TRLQSSAYAALRDAFVRGTPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRL 454
Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
PA N+LIPVD AGT+C AFAP+ + +SIIGN+QQ+G ++SFD A G VGF PN C
Sbjct: 455 PAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKGVVGFTPNKC 509
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 387 bits (993), Expect = e-105, Method: Compositional matrix adjust.
Identities = 212/377 (56%), Positives = 258/377 (68%), Gaps = 10/377 (2%)
Query: 123 VRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWV 182
+R +G AA +Q VVSG+ QGSGEYF R+G+GSP R YMV+D+GSD+ WV
Sbjct: 136 LRPANGSAVFAASAAIQG---PVVSGVGQGSGEYFSRVGIGSPARQLYMVLDTGSDVTWV 192
Query: 183 QCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH--AGRCRYEVSYGDGSY 240
QCQPC+ CY+QSDPVFDP+ SAS++ VSC S C L+ A C G C YEV+YGDGSY
Sbjct: 193 QCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGACLYEVAYGDGSY 252
Query: 241 TKGTLALETLTIG-RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA 299
T G A ETLT+G T V NVAIGCGH N+G+FVGAAGLL LGGG +S Q+ T
Sbjct: 253 TVGDFATETLTLGDSTPVGNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISAST--- 309
Query: 300 FSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISE 359
FSYCLV R + ++ +L FG A G PLVR+PR +FYYV LSG+ VGG + I
Sbjct: 310 FSYCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPA 369
Query: 360 DLFRLTQM-GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLS 418
F + G GV++D+GTAVTRL + AY A RDAFV +LPR SGVS+FDTCY+LS
Sbjct: 370 SAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSLFDTCYDLS 429
Query: 419 GFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQ 478
SV VP VS F GG L LPA N+LIPVD AGT+C AFAP+ + +SIIGN+QQ+G +
Sbjct: 430 DRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTR 489
Query: 479 ISFDGANGFVGFGPNVC 495
+SFD A G VGF PN C
Sbjct: 490 VSFDTARGAVGFTPNKC 506
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 386 bits (992), Expect = e-104, Method: Compositional matrix adjust.
Identities = 218/430 (50%), Positives = 281/430 (65%), Gaps = 18/430 (4%)
Query: 73 SSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRD---VKRVATLVRRLSGG 129
S E+ L L H D +SS+ Q F +R+QRD VK +ATL ++ G
Sbjct: 66 SDSESSITLNLDHIDALSSNKTP---------QELFSSRLQRDSRRVKSIATLAAQIPGR 116
Query: 130 GADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ 189
A F + VVSG+ QGSGEYF R+GVG+P R YMV+D+GSDIVW+QC PC +
Sbjct: 117 NVTHAPR-TGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRR 175
Query: 190 CYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLAL 247
CY QSDP+FDP S +++ + CSS C RL++AGC+ R C Y+VSYGDGS+T G +
Sbjct: 176 CYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFST 235
Query: 248 ETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
ETLT R VK VA+GCGH N+G+FVGAAGLLGLG G +S GQ G + FSYCLV R
Sbjct: 236 ETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDR 295
Query: 308 GTGSS-GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLT 365
S S+VFG A+ A + PL+ NP+ +FYYV L G+ VGG R+P ++ LF+L
Sbjct: 296 SASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLD 355
Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV 425
Q+G+ GV++D+GT+VTRL PAY A RDAF L RA S+FDTC++LS V+V
Sbjct: 356 QIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKALKRAPDFSLFDTCFDLSNMNEVKV 415
Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
PTV +F G V +LPA+N+LIPVD G FCFAFA + GLSIIGNIQQ+G ++ +D A+
Sbjct: 416 PTVVLHFRGADV-SLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLAS 474
Query: 486 GFVGFGPNVC 495
VGF P C
Sbjct: 475 SRVGFAPGGC 484
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 205/397 (51%), Positives = 261/397 (65%), Gaps = 15/397 (3%)
Query: 110 ARMQRDVKRVATLVRRLS-----------GGGADAAKHEVQDFGTDVVSGMDQGSGEYFV 158
+R+ RD RV L RL A+ E VVSG QGSGEYF+
Sbjct: 92 SRLARDSARVKALQTRLDLFLKRVSNSDLHPAESKAEFESNALQGPVVSGTSQGSGEYFL 151
Query: 159 RIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
R+G+G PP Y+V+D+GSD+ W+QC PCS+CY+QSDP+FDP S S+S + C C
Sbjct: 152 RVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPISSNSYSPIRCDEPQCKS 211
Query: 219 LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGL 278
L+ + C G C YEVSYGDGSYT G A ET+T+G V+NVAIGCGH N+G+FVGAAGL
Sbjct: 212 LDLSECRNGTCLYEVSYGDGSYTVGEFATETVTLGSAAVENVAIGCGHNNEGLFVGAAGL 271
Query: 279 LGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAP 338
LGLGGG +S Q+ + FSYCLV+R + + +L F LP AA PL+RNP
Sbjct: 272 LGLGGGKLSFPAQVNATS---FSYCLVNRDSDAVSTLEFN-SPLPRNAATAPLMRNPELD 327
Query: 339 SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ 398
+FYY+GL G+ VGG +PI E F + +G G+++D+GTAVTRL + Y+A RDAFV
Sbjct: 328 TFYYLGLKGISVGGEALPIPESSFEVDAIGGGGIIIDSGTAVTRLRSEVYDALRDAFVKG 387
Query: 399 TGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFA 458
+P+A+GVS+FDTCY+LS SV +PTVSF F G L LPA N+LIPVD GTFCFA
Sbjct: 388 AKGIPKANGVSLFDTCYDLSSRESVEIPTVSFRFPEGRELPLPARNYLIPVDSVGTFCFA 447
Query: 459 FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
FAP+ S LSIIGN+QQ+G ++ FD AN VGF + C
Sbjct: 448 FAPTTSSLSIIGNVQQQGTRVGFDIANSLVGFSVDSC 484
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 206/406 (50%), Positives = 266/406 (65%), Gaps = 16/406 (3%)
Query: 102 HRHQHSFH-ARMQRDVKRVATLVRRLS-----------GGGADAAKHEVQDFGTDVVSGM 149
HR S +R+ RD RV +L RL A+ E VVSG
Sbjct: 83 HRDYKSLTLSRLARDSARVKSLQTRLDLVLKRVSNSDLHPAESNAEFEANALQGPVVSGT 142
Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGV 209
QGSGEYF+R+G+G PP Y+V+D+GSD+ W+QC PCS+CY+QSDP+FDP S S+S +
Sbjct: 143 SQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPVSSNSYSPI 202
Query: 210 SCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQ 269
C + C L+ + C G C YEVSYGDGSYT G A ET+T+G V+NVAIGCGH N+
Sbjct: 203 RCDAPQCKSLDLSECRNGTCLYEVSYGDGSYTVGEFATETVTLGTAAVENVAIGCGHNNE 262
Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV 329
G+FVGAAGLLGLGGG +S Q+ + FSYCLV+R + + +L F LP
Sbjct: 263 GLFVGAAGLLGLGGGKLSFPAQVNATS---FSYCLVNRDSDAVSTLEFN-SPLPRNVVTA 318
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
PL RNP +FYY+GL G+ VGG +PI E +F + +G G+++D+GTAVTRL + Y+
Sbjct: 319 PLRRNPELDTFYYLGLKGISVGGEALPIPESIFEVDAIGGGGIIIDSGTAVTRLRSEVYD 378
Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
A RDAFV +P+A+GVS+FDTCY+LS SV+VPTVSF+F G L LPA N+LIPV
Sbjct: 379 ALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVQVPTVSFHFPEGRELPLPARNYLIPV 438
Query: 450 DDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
D GTFCFAFAP+ S LSI+GN+QQ+G ++ FD AN VGF + C
Sbjct: 439 DSVGTFCFAFAPTTSSLSIMGNVQQQGTRVGFDIANSLVGFSADSC 484
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 217/422 (51%), Positives = 281/422 (66%), Gaps = 18/422 (4%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRD---VKRVATLVRRLSGGGADAAKHE 137
L L H D +SS+ T + + F +R+QRD VK +ATL ++ G A
Sbjct: 74 LNLDHIDALSSN-KTPDEL--------FSSRLQRDSRRVKSIATLAAQIPGRNVTHAPRP 124
Query: 138 VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
F + VVSG+ QGSGEYF R+GVG+P R YMV+D+GSDIVW+QC PC +CY QSDP+
Sbjct: 125 -GGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPI 183
Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRT 255
FDP S +++ + CSS C RL++AGC+ R C Y+VSYGDGS+T G + ETLT R
Sbjct: 184 FDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRN 243
Query: 256 VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS-GS 314
VK VA+GCGH N+G+FVGAAGLLGLG G +S GQ G + FSYCLV R S S
Sbjct: 244 RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSS 303
Query: 315 LVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVV 373
+VFG A+ A + PL+ NP+ +FYYVGL G+ VGG R+P ++ LF+L Q+G+ GV+
Sbjct: 304 VVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVI 363
Query: 374 MDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFS 433
+D+GT+VTRL PAY A RDAF L RA S+FDTC++LS V+VPTV +F
Sbjct: 364 IDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFR 423
Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
G V +LPA+N+LIPVD G FCFAFA + GLSIIGNIQQ+G ++ +D A+ VGF P
Sbjct: 424 GADV-SLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPG 482
Query: 494 VC 495
C
Sbjct: 483 GC 484
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 206/445 (46%), Positives = 293/445 (65%), Gaps = 24/445 (5%)
Query: 61 FERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVA 120
F++ ++ SN+S ++L+L RD + +N + ++ +R+ RD RV
Sbjct: 61 FQQQVHLVPSNSS---FSFSLQLHPRDSL-------HNAGHKDYKSLVLSRLSRDSSRVK 110
Query: 121 TLVRRLSGGGADAAKHEVQ---------DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
++ RL ++ + +++ D T ++SG QGSGEYF R+GVG P + YM
Sbjct: 111 SIYDRLEFALSELKRSDLEPLKTEILPEDLSTPIISGTSQGSGEYFSRVGVGQPAKPFYM 170
Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRY 231
V+D+GSDI W+QCQPC+ CY+Q+DP+FDP S+SF+ + C S C LE +GC A +C Y
Sbjct: 171 VLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQQCQALETSGCRASKCLY 230
Query: 232 EVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVG 290
+VSYGDGS+T G +ETLT G + ++ NVA+GCGH N+G+FVG+AGLLGLGGGS+SL
Sbjct: 231 QVSYGDGSFTVGEFVIETLTFGNSGMINNVAVGCGHDNEGLFVGSAGLLGLGGGSLSLTS 290
Query: 291 QLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGV 350
Q+ +FSYCLV R + SS L F A P + PL+++ + +FYYVGL+G+ V
Sbjct: 291 QM---KASSFSYCLVDRDSSSSSDLEFNSAA-PSDSVNAPLLKSGKVDTFYYVGLTGMSV 346
Query: 351 GGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI 410
GG + I +LF++ G G+++D+GTA+TRL T AY RDAFV++T L + +G ++
Sbjct: 347 GGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQAYNTLRDAFVSRTPYLKKTNGFAL 406
Query: 411 FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIG 470
FDTCY+LS V +PTVSF F+GG L LP N+LIPVD GTFCFAFAP+ S LSIIG
Sbjct: 407 FDTCYDLSSQSRVTIPTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIG 466
Query: 471 NIQQEGIQISFDGANGFVGFGPNVC 495
N+QQ+G ++ +D AN VGF P+ C
Sbjct: 467 NVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 216/422 (51%), Positives = 278/422 (65%), Gaps = 18/422 (4%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRV---ATLVRRLSGGGADAAKHE 137
L L H D +SS+ Q F +R+QRD +RV ATL ++ G A
Sbjct: 74 LNLDHIDALSSNKTP---------QELFSSRLQRDSRRVRSIATLAAQIPGRNVTHAPRP 124
Query: 138 VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
F + VVSG+ QGSGEYF R+GVG+P R YMV+D+GSDIVW+QC PC +CY QSDP+
Sbjct: 125 -GGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPI 183
Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRT 255
FDP S +++ + CSS C RL++AGC+ R C Y+VSYGDGS+T G + ETLT R
Sbjct: 184 FDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRN 243
Query: 256 VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS-GS 314
VK VA+GCGH N+G+FVGAAGLLGLG G +S GQ G + FSYCLV R S S
Sbjct: 244 RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSS 303
Query: 315 LVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVV 373
+VFG A+ A + PL+ NP+ +FYYVGL G+ VGG R+P ++ LF+L Q+G+ GV+
Sbjct: 304 VVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVI 363
Query: 374 MDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFS 433
+D+GT+VTRL PAY A RDAF L RA S+FDTC++LS V+VPTV +F
Sbjct: 364 IDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSLFDTCFDLSNMNEVKVPTVVLHFR 423
Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
V +LPA+N+LIPVD G FCFAFA + GLSIIGNIQQ+G ++ +D A+ VGF P
Sbjct: 424 RADV-SLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPG 482
Query: 494 VC 495
C
Sbjct: 483 GC 484
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 203/409 (49%), Positives = 277/409 (67%), Gaps = 19/409 (4%)
Query: 102 HRHQHSFH-----ARMQRDVKRVATLVRRLSGGGADAAKHEVQDFG---------TDVVS 147
H+ H + AR++RD RV +L R+ A K +++ T +VS
Sbjct: 87 HKSSHKDYKSLVLARLERDSDRVRSLATRMDLAIAGITKSDLKPVEKELEAEALETPLVS 146
Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFS 207
G QGSGEYF R+G+GSPP+ YMV+D+GSD+ WVQC PC+ CY+Q+DP+F+P+ S+S++
Sbjct: 147 GASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYA 206
Query: 208 GVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGH 266
++C + C L+ + C C YEVSYGDGSYT G A ET+T+ G + NVAIGCGH
Sbjct: 207 PLTCETHQCKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETITLDGSASLNNVAIGCGH 266
Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA 326
N+G+FVGAAGLLGLGGGS+S Q+ +FSYCLV+R T S+ +L F +P +
Sbjct: 267 DNEGLFVGAAGLLGLGGGSLSFPSQINA---SSFSYCLVNRDTDSASTLEFN-SPIPSHS 322
Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
PL+RN + +FYY+G++G+GVGG + I F + + G+ G+++D+GTAVTRL +
Sbjct: 323 VTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGIIVDSGTAVTRLQSD 382
Query: 387 AYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
Y + RD+FV T +LP SGV++FDTCY+LS SV VPTVSF+F G L LPA N+L
Sbjct: 383 VYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFHFPDGKYLALPAKNYL 442
Query: 447 IPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
IPVD AGTFCFAFAP+ S LSIIGN+QQ+G ++S+D +N VGF PN C
Sbjct: 443 IPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYDLSNSLVGFSPNGC 491
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 213/488 (43%), Positives = 303/488 (62%), Gaps = 27/488 (5%)
Query: 22 IITTSTSAASDTHFQILNVNESIKGSR---TDHAKMSQYNELFERHNNISSSNTSSDEAR 78
+ + S +D+H +L+V+ SI+ + + + +S+ ++ +R +S + +S +
Sbjct: 22 VFSRELSLDTDSHSSVLDVSGSIRKTLDVLSHKSSVSKPSD--QRDEKTTSFSPTSLASS 79
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
++LEL R+ + S + ++ +R+ RD RV + +L + K ++
Sbjct: 80 FSLELHPRELLHGGS-------HKDYRALMLSRLARDSARVKAINTKLQLAVSGTDKSDL 132
Query: 139 ----------QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS 188
QDF T V SG QGSGEYF+R+G+G P ++ YMVID+GSD+ W+QC+PC
Sbjct: 133 VPMDTEILHPQDFSTPVTSGTSQGSGEYFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPCD 192
Query: 189 QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALE 248
CY+Q DP+FDPA S+SFS + C + C L+ C C Y+VSYGDGSYT G A E
Sbjct: 193 DCYQQVDPIFDPASSSSFSRLGCQTPQCRNLDVFACRNDSCLYQVSYGDGSYTVGDFATE 252
Query: 249 TLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
T++ G + V VAIGCGH N+G+FVGAAGL+GLGGG +SL Q+ +FSYCLV+R
Sbjct: 253 TVSFGNSGSVDKVAIGCGHDNEGLFVGAAGLIGLGGGPLSLTSQI---KASSFSYCLVNR 309
Query: 308 GTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
+ S +L F A P + P+ +N + +FYYVG++G+ VGG ++ I +F +
Sbjct: 310 DSVDSSTLEFN-SAKPSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGS 368
Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
G G+++D GTAVTRL T AY A RD FV T +LP SG ++FDTCYNLS SVRVPT
Sbjct: 369 GKGGIIVDCGTAVTRLQTQAYNALRDTFVKLTKDLPSTSGFALFDTCYNLSSRTSVRVPT 428
Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGF 487
V+F F GG L LP SN+LIPVD AGTFC AFAP+ + LSIIGN+QQ+G ++++D AN
Sbjct: 429 VAFLFDGGKSLPLPPSNYLIPVDSAGTFCLAFAPTTASLSIIGNVQQQGTRVTYDLANSQ 488
Query: 488 VGFGPNVC 495
V F C
Sbjct: 489 VSFSSRKC 496
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 204/445 (45%), Positives = 291/445 (65%), Gaps = 24/445 (5%)
Query: 61 FERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVA 120
F++ ++ SN+S ++L+L RD + +N + ++ +R+ RD RV
Sbjct: 61 FQQQVHLVPSNSS---FSFSLQLHPRDSL-------HNAGHKDYKSLVLSRLSRDSSRVK 110
Query: 121 TLVRRLSGGGADAAKHEVQ---------DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
++ RL ++ + +++ D T ++SG QGSGEYF R+GVG P + YM
Sbjct: 111 SIYDRLEFALSELKRSDLEPLKTEILPEDLSTPIISGTSQGSGEYFSRVGVGQPAKPFYM 170
Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRY 231
V+D+GSDI W+QCQPC+ CY+Q+DP+FDP S+SF+ + C S C LE +GC A +C Y
Sbjct: 171 VLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQQCQALETSGCRASKCLY 230
Query: 232 EVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVG 290
+VSYGDGS+T G ETLT G + ++ +VA+GCGH N+G+FVG+AGLLGLGGG +SL
Sbjct: 231 QVSYGDGSFTVGEFVTETLTFGNSGMINDVAVGCGHDNEGLFVGSAGLLGLGGGPLSLTS 290
Query: 291 QLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGV 350
Q+ +FSYCLV R + SS L F A P + PL+++ + +FYYVGL+G+ V
Sbjct: 291 QM---KASSFSYCLVDRDSSSSSDLEFNSAA-PSDSVNAPLLKSGKVDTFYYVGLTGMSV 346
Query: 351 GGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI 410
GG + I +LF++ G G+++D+GTA+TRL T AY RDAFV++T L + +G ++
Sbjct: 347 GGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQAYNTLRDAFVSRTPYLKKTNGFAL 406
Query: 411 FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIG 470
FDTCY+LS V +PTVSF F+GG L LP N+LIPVD GTFCFAFAP+ S LSIIG
Sbjct: 407 FDTCYDLSSQSRVTIPTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIG 466
Query: 471 NIQQEGIQISFDGANGFVGFGPNVC 495
N+QQ+G ++ +D AN VGF P+ C
Sbjct: 467 NVQQQGTRVHYDLANSVVGFSPHKC 491
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 380 bits (976), Expect = e-103, Method: Compositional matrix adjust.
Identities = 217/467 (46%), Positives = 289/467 (61%), Gaps = 17/467 (3%)
Query: 34 HFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSS 93
FQ L +N A + F + +S +SS +++L H D +SS
Sbjct: 33 QFQTLTLNPLPNKPTISWADTEPGTQTFT--DQTTSEPSSSATTFLSVQLHHIDALSSDK 90
Query: 94 NTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSG-GGADAAKHEVQDFGTDVVSGMDQG 152
++ Q F++R+ RD RV +L+ + GG + + F + V+SG+ QG
Sbjct: 91 SS---------QDLFNSRLVRDAARVKSLISLAATVGGTNLTRARGPGFSSSVISGLAQG 141
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
SGEYF R+GVG+P R YMV+D+GSDIVW+QC PC +CY Q+DPVFDP S SF+ + C
Sbjct: 142 SGEYFTRLGVGTPARYVYMVLDTGSDIVWIQCAPCIKCYSQTDPVFDPTKSRSFANIPCG 201
Query: 213 SAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
S +C RL+ GC + C Y+VSYGDGS+T G + ETLT T V V +GCGH N+G
Sbjct: 202 SPLCRRLDYPGCSTKKQICLYQVSYGDGSFTVGEFSTETLTFRGTRVGRVVLGCGHDNEG 261
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS-GSLVFGREALPVGAAWV 329
+FVGAAGLLGLG G +S Q+G + FSYCL R S S+VFG A+ +
Sbjct: 262 LFVGAAGLLGLGRGRLSFPSQIGRRFNSKFSYCLGDRSASSRPSSIVFGDSAISRTTRFT 321
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
PL+ NP+ +FYYV L G+ VGG R+ IS LF+L G+ GV++D+GT+VTRL AY
Sbjct: 322 PLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTGNGGVIIDSGTSVTRLTRAAY 381
Query: 389 EAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIP 448
A RDAF+ NL RA S+FDTC++LSG V+VPTV +F G V LPASN+LIP
Sbjct: 382 VALRDAFLVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVLHFRGADV-PLPASNYLIP 440
Query: 449 VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
VD++G+FCFAFA + SGLSIIGNIQQ+G ++ +D A VGF P C
Sbjct: 441 VDNSGSFCFAFAGTASGLSIIGNIQQQGFRVVYDLATSRVGFAPRGC 487
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 217/467 (46%), Positives = 290/467 (62%), Gaps = 15/467 (3%)
Query: 34 HFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSS 93
FQ L VN A +E + S+S +S +++L H D +SS
Sbjct: 33 QFQTLTVNPLPNKPTLSWADTEPESEPETQTLTDSTSTEASTTTSLSVQLHHLDALSSDE 92
Query: 94 NTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSG-GGADAAKHEVQDFGTDVVSGMDQG 152
Q F++R+ RD RV +L + G + + F + V SG+ QG
Sbjct: 93 TP---------QDLFNSRLARDASRVKSLTSLAAAVGSTNRTRARGPGFSSSVTSGLAQG 143
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
SGEYF R+GVG+P R +MV+D+GSD+VW+QC PC +CY Q+DPVF+P S SF+ + C
Sbjct: 144 SGEYFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSRSFANIPCG 203
Query: 213 SAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
S +C RL++ GC + C Y+VSYGDGS+T G + ETLT T V VA+GCGH N+G
Sbjct: 204 SPLCRRLDSPGCSTKKHICLYQVSYGDGSFTYGEFSTETLTFRGTRVGRVALGCGHDNEG 263
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS-LVFGREALPVGAAWV 329
+F+GAAGLLGLG G +S Q+G + FSYCLV R S S +VFG A+ A +
Sbjct: 264 LFIGAAGLLGLGRGRLSFPSQIGRRFSRKFSYCLVDRSASSKPSYMVFGDSAISRTARFT 323
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
PLV NP+ +FYYV L G+ VGG R+P I+ LF+L G+ GV++D+GT+VTRL PAY
Sbjct: 324 PLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNGGVIIDSGTSVTRLTRPAY 383
Query: 389 EAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIP 448
A RDAF NL RA S+FDTC++LSG V+VPTV +F G V +LPASN+LIP
Sbjct: 384 VALRDAFRVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVLHFRGADV-SLPASNYLIP 442
Query: 449 VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
VD++G+FCFAFA + SGLSI+GNIQQ+G ++ +D A VGF P C
Sbjct: 443 VDNSGSFCFAFAGTMSGLSIVGNIQQQGFRVVYDLAASRVGFAPRGC 489
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 379 bits (974), Expect = e-102, Method: Compositional matrix adjust.
Identities = 214/475 (45%), Positives = 301/475 (63%), Gaps = 26/475 (5%)
Query: 33 THFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEAR--WNLELVHRDKMS 90
+H +L+V+ S+ + H +S +L E ++ + + TS + ++L+L R+
Sbjct: 32 SHTNVLDVSSSLHQA---HQILSFNPQLLEEQSSETETPTSPSSSSSSFSLQLHPRE--- 85
Query: 91 SSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV----------QD 140
T N + ++ +R+ RD RV +L +L + + ++ +D
Sbjct: 86 ----TLLNEQHPNYKTLVLSRLARDTARVNSLNTKLQLALSSLNRSDLYPTETELLRPED 141
Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
T V SG QGSGEYF R+GVG P + YMV+D+GSD+ W+QC+PCS CY+QSDP+FDP
Sbjct: 142 LSTPVSSGTAQGSGEYFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSDPIFDP 201
Query: 201 ADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNV 260
S+S++ ++C + C LE + C G+C Y+VSYGDGS+T G ET++ G V V
Sbjct: 202 TASSSYNPLTCDAQQCQDLEMSACRNGKCLYQVSYGDGSFTVGEYVTETVSFGAGSVNRV 261
Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE 320
AIGCGH N+G+FVG+AGLLGLGGG +SL Q+ +FSYCLV R +G S +L F
Sbjct: 262 AIGCGHDNEGLFVGSAGLLGLGGGPLSLTSQIKAT---SFSYCLVDRDSGKSSTLEFN-S 317
Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
P + PL++N + +FYYV L+G+ VGG + + + F + Q G GV++D+GTA+
Sbjct: 318 PRPGDSVVAPLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAGGVIVDSGTAI 377
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
TRL T AY + RDAF +T NL A GV++FDTCY+LS SVRVPTVSF+FSG L
Sbjct: 378 TRLRTQAYNSVRDAFKRKTSNLRPAEGVALFDTCYDLSSLQSVRVPTVSFHFSGDRAWAL 437
Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
PA N+LIPVD AGT+CFAFAP+ S +SIIGN+QQ+G ++SFD AN VGF PN C
Sbjct: 438 PAKNYLIPVDGAGTYCFAFAPTTSSMSIIGNVQQQGTRVSFDLANSLVGFSPNKC 492
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 210/398 (52%), Positives = 266/398 (66%), Gaps = 12/398 (3%)
Query: 105 QHSFHARMQRDVKRVATLVRRLSGGGADAAKHE----VQDFGTDVVSGMDQGSGEYFVRI 160
+ FH R+QRD RV ++LS GA + F + V+SG+ QGSGEYF RI
Sbjct: 78 EELFHLRLQRDAIRV----KKLSSLGATSRNLSKPGGTTGFSSSVISGLAQGSGEYFTRI 133
Query: 161 GVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLE 220
GVG+PP+ YMV+D+GSDIVW+QC PC CY Q+DPVF+P S SF+ V C + +C RLE
Sbjct: 134 GVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLE 193
Query: 221 NAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLL 279
+ GC+ + C Y+VSYGDGSYT G ETLT RT V+ VA+GCGH N+G+FVGAAGLL
Sbjct: 194 SPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVALGCGHDNEGLFVGAAGLL 253
Query: 280 GLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS-GSLVFGREALPVGAAWVPLVRNPRAP 338
GLG G +S Q G FSYCLV R S S+VFG A+ A + PL+ NPR
Sbjct: 254 GLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLD 313
Query: 339 SFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVA 397
+FYYV L G+ VGG + I+ F+L + G+ GV++D GT+VTRL PAY A RDAF A
Sbjct: 314 TFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRA 373
Query: 398 QTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCF 457
+L A S+FDTCY+LSG +V+VPTV +F G V +LPASN+LIPVD +G FCF
Sbjct: 374 GASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADV-SLPASNYLIPVDGSGRFCF 432
Query: 458 AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
AFA + SGLSIIGNIQQ+G ++ +D A+ VGF P C
Sbjct: 433 AFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 470
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 206/367 (56%), Positives = 254/367 (69%), Gaps = 11/367 (2%)
Query: 132 DAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCY 191
+A+ E+Q VVSG+ GSGEYF R+GVGSP R YMV+D+GSD+ WVQCQPC+ CY
Sbjct: 146 EASAAEIQG---PVVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCY 202
Query: 192 KQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH--AGRCRYEVSYGDGSYTKGTLALET 249
+QSDPVFDP+ S S++ V+C + C L+ A C G C YEV+YGDGSYT G A ET
Sbjct: 203 QQSDPVFDPSLSTSYASVACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATET 262
Query: 250 LTIGRTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG 308
LT+G + V +VAIGCGH N+G+FVGAAGLL LGGG +S Q+ T FSYCLV R
Sbjct: 263 LTLGDSAPVSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATT---FSYCLVDRD 319
Query: 309 TGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
+ SS +L FG A PL+R+PR +FYYVGLSGL VGG + I F + G
Sbjct: 320 SPSSSTLQFGDAA--DAEVTAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTG 377
Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
GV++D+GTAVTRL + AY A RDAFV T +LPR SGVS+FDTCY+LS SV VP V
Sbjct: 378 AGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAV 437
Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
S F+GG L LPA N+LIPVD AGT+C AFAP+ + +SIIGN+QQ+G ++SFD A V
Sbjct: 438 SLRFAGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTV 497
Query: 489 GFGPNVC 495
GF N C
Sbjct: 498 GFTTNKC 504
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 208/424 (49%), Positives = 275/424 (64%), Gaps = 14/424 (3%)
Query: 76 EARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAK 135
E +L L H D +S + + FH R++RD RV TL +
Sbjct: 59 EPTTSLSLHHIDALSFNKTPS---------QLFHLRLERDAARVKTLTHLAAATNKTRPA 109
Query: 136 HEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD 195
+ F + VVSG+ QGSGEYF R+GVG+PP+ YMV+D+GSD+VW+QC+PC++CY Q+D
Sbjct: 110 NPGSGFSSSVVSGLSQGSGEYFTRLGVGTPPKYLYMVLDTGSDVVWLQCKPCTKCYSQTD 169
Query: 196 PVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIG 253
+FDP+ S SF+G+ C S +C RL++ GC C+Y+VSYGDGS+T G + ETLT
Sbjct: 170 QIFDPSKSKSFAGIPCYSPLCRRLDSPGCSLKNNLCQYQVSYGDGSFTFGDFSTETLTFR 229
Query: 254 RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR-GTGSS 312
R V VAIGCGH N+G+FVGAAGLLGLG G +S Q G + FSYCL R +
Sbjct: 230 RAAVPRVAIGCGHDNEGLFVGAAGLLGLGRGGLSFPTQTGTRFNNKFSYCLTDRTASAKP 289
Query: 313 GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDG 371
S+VFG A+ A + PLV+NP+ +FYYV L G+ VGG + IS FRL G+ G
Sbjct: 290 SSIVFGDSAVSRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNGG 349
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
V++D+GT+VTRL PAY + RDAF +L RA S+FDTCY+LSG V+VPTV +
Sbjct: 350 VIIDSGTSVTRLTRPAYVSLRDAFRVGASHLKRAPEFSLFDTCYDLSGLSEVKVPTVVLH 409
Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFG 491
F G V +LPA+N+L+PVD++G+FCFAFA + SGLSIIGNIQQ+G ++ FD A VGF
Sbjct: 410 FRGADV-SLPAANYLVPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVVFDLAGSRVGFA 468
Query: 492 PNVC 495
P C
Sbjct: 469 PRGC 472
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 374 bits (960), Expect = e-101, Method: Compositional matrix adjust.
Identities = 220/441 (49%), Positives = 276/441 (62%), Gaps = 33/441 (7%)
Query: 75 DEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRD-----------------VK 117
+E R L L RD + + Y + AR++RD V
Sbjct: 73 EEGRLALRLHSRDFLPEEQGRQRHASY---RSLVLARLRRDSARAAAVSARAAMAADGVS 129
Query: 118 RVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGS 177
R + ++ A AA E+Q VVSG+ GSGEYF R+GVGSP R YMV+D+GS
Sbjct: 130 RFDLVPANVTAFEASAA--EIQG---PVVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGS 184
Query: 178 DIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH--AGRCRYEVSY 235
D+ WVQCQPC+ CY+QSDPVFDP+ S S++ V+C + C L+ A C G C YEV+Y
Sbjct: 185 DVTWVQCQPCADCYQQSDPVFDPSLSTSYASVACDNPRCHDLDAAACRNSTGACLYEVAY 244
Query: 236 GDGSYTKGTLALETLTIGRTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGG 294
GDGSYT G A ETLT+G + V +VAIGCGH N+G+FVGAAGLL LGGG +S Q+
Sbjct: 245 GDGSYTVGDFATETLTLGDSAPVSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISA 304
Query: 295 QTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMR 354
T FSYCLV R + SS +L FG A PL+R+PR +FYYVGLSG+ VGG
Sbjct: 305 TT---FSYCLVDRDSPSSSTLQFGDAA--DAEVTAPLIRSPRTSTFYYVGLSGISVGGQI 359
Query: 355 IPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTC 414
+ I F + G GV++D+GTAVTRL + AY A RDAFV T +LPR SGVS+FDTC
Sbjct: 360 LSIPPSAFAMDGTGAGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTC 419
Query: 415 YNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQ 474
Y+LS SV VP VS F+GG L LPA N+LIPVD AGT+C AFAP+ + +SIIGN+QQ
Sbjct: 420 YDLSDRTSVEVPAVSLRFAGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQ 479
Query: 475 EGIQISFDGANGFVGFGPNVC 495
+G ++SFD A VGF N C
Sbjct: 480 QGTRVSFDTAKSTVGFTSNKC 500
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 209/442 (47%), Positives = 282/442 (63%), Gaps = 18/442 (4%)
Query: 62 ERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVAT 121
E IS+ S + + L HRD ++ ++ + F+ R+QRD RV
Sbjct: 57 ETETQISTLPVSETDPTMTMHLEHRDVLAFNAT---------PEALFNLRLQRDAFRVEA 107
Query: 122 LVR-----RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSG 176
L + G + + F + V SG+ QGSGEYF R+GVG+PP+ YMV+D+G
Sbjct: 108 LSKMAAAAGGRRAGRNGTHAQGGGFSSSVTSGLAQGSGEYFTRLGVGTPPKYVYMVLDTG 167
Query: 177 SDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR-CRYEVSY 235
SD+VW+QC PC +CY Q+DPVFDP S SFS +SC S +C RL++ GC++ + C Y+V+Y
Sbjct: 168 SDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCRSPLCLRLDSPGCNSRQSCLYQVAY 227
Query: 236 GDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQ 295
GDGS+T G + ETLT T V VA+GCGH N+G+FVGAAGLLGLG G +S Q G +
Sbjct: 228 GDGSFTFGEFSTETLTFRGTRVPKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPTQTGLR 287
Query: 296 TGGAFSYCLVSRGTGSS-GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMR 354
G FSYCLV R S S+VFG+ A+ A + PL+ NP+ +FYY+ L+G+ VGG R
Sbjct: 288 FGRKFSYCLVDRSASSKPSSVVFGQSAVSRTAVFTPLITNPKLDTFYYLELTGISVGGAR 347
Query: 355 IP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDT 413
+ I+ LF+L G+ GV++D+GT+VTRL AY + RDAF A +L RA S+FDT
Sbjct: 348 VAGITASLFKLDTAGNGGVIIDSGTSVTRLTRRAYVSLRDAFRAGAADLKRAPDYSLFDT 407
Query: 414 CYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQ 473
C++LSG V+VPTV +F G V +LPA+N+LIPVD G FCFAFA + SGLSIIGNIQ
Sbjct: 408 CFDLSGKTEVKVPTVVMHFRGADV-SLPATNYLIPVDTNGVFCFAFAGTMSGLSIIGNIQ 466
Query: 474 QEGIQISFDGANGFVGFGPNVC 495
Q+G ++ FD A +GF C
Sbjct: 467 QQGFRVVFDVAASRIGFAARGC 488
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 208/420 (49%), Positives = 280/420 (66%), Gaps = 15/420 (3%)
Query: 80 NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
+L L H D +SS+ + F R+QRD KRV +V L+ A+
Sbjct: 63 SLHLHHIDALSSNKTP---------EQLFQLRLQRDAKRVEGVVA-LAALNQSHARRSGS 112
Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
F + ++SG+ QGSGEYF RIGVG+P R YMV+D+GSD+VW+QC PC +CY Q+DPVFD
Sbjct: 113 SFSSSIISGLAQGSGEYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPVFD 172
Query: 200 PADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVV 257
P S +++G+ C + +C RL++ GC+ C+Y+VSYGDGS+T G + ETLT RT V
Sbjct: 173 PTKSRTYAGIPCGAPLCRRLDSPGCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRTRV 232
Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR-GTGSSGSLV 316
VA+GCGH N+G+F+GAAGLLGLG G +S Q G + FSYCLV R + S+V
Sbjct: 233 TRVALGCGHDNEGLFIGAAGLLGLGRGRLSFPVQTGRRFNQKFSYCLVDRSASAKPSSVV 292
Query: 317 FGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMD 375
FG A+ A + PL++NP+ +FYY+ L G+ VGG + +S LFRL G+ GV++D
Sbjct: 293 FGDSAVSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNGGVIID 352
Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGG 435
+GT+VTRL PAY A RDAF +L RA+ S+FDTC++LSG V+VPTV +F G
Sbjct: 353 SGTSVTRLTRPAYIALRDAFRVGASHLKRAAEFSLFDTCFDLSGLTEVKVPTVVLHFRGA 412
Query: 436 PVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
V +LPA+N+LIPVD++G+FCFAFA + SGLSIIGNIQQ+G ++SFD A VGF P C
Sbjct: 413 DV-SLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVSFDLAGSRVGFAPRGC 471
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 202/367 (55%), Positives = 254/367 (69%), Gaps = 11/367 (2%)
Query: 132 DAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCY 191
+A+ E+Q VVSG+ QGSGEYF R+GVG P R YMV+D+GSD+ W+QCQPC+ CY
Sbjct: 142 EASAAEIQG---PVVSGVGQGSGEYFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADCY 198
Query: 192 KQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH--AGRCRYEVSYGDGSYTKGTLALET 249
QSDPV+DP+ S S++ V C S C L+ A C G C YEV+YGDGSYT G A ET
Sbjct: 199 AQSDPVYDPSVSTSYATVGCDSPRCRDLDAAACRNSTGSCLYEVAYGDGSYTVGDFATET 258
Query: 250 LTIGRTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG 308
LT+G + V NVAIGCGH N+G+FVGAAGLL LGGG +S Q+ T FSYCLV R
Sbjct: 259 LTLGDSAPVSNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATT---FSYCLVDRD 315
Query: 309 TGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
+ SS +L FG P A PL+R+PR +FYYV LSG+ VGG + I F + G
Sbjct: 316 SPSSSTLQFGDSEQP--AVTAPLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDAG 373
Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
GV++D+GTAVTRL + AY A R+AFV T +LPRASGVS+FDTCY+L+G SV+VP V
Sbjct: 374 SGGVIVDSGTAVTRLQSGAYGALREAFVQGTQSLPRASGVSLFDTCYDLAGRSSVQVPAV 433
Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
+ +F GG L LPA N+LIPVD AGT+C AFA + +SIIGN+QQ+G+++SFD A V
Sbjct: 434 ALWFEGGGELKLPAKNYLIPVDAAGTYCLAFAGTSGPVSIIGNVQQQGVRVSFDTAKNTV 493
Query: 489 GFGPNVC 495
GF + C
Sbjct: 494 GFTADKC 500
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 365 bits (938), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 204/383 (53%), Positives = 258/383 (67%), Gaps = 8/383 (2%)
Query: 120 ATLVRRLSGGGADAAKHE----VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDS 175
A V++LS GA + F + V+SG+ QGSGEYF RIGVG+PP+ YMV+D+
Sbjct: 2 AIRVKKLSSLGATSRNLSKPGGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDT 61
Query: 176 GSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR-CRYEVS 234
GSDIVW+QC PC CY Q+DPVF+P S SF+ V C + +C RLE+ GC+ + C Y+VS
Sbjct: 62 GSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLESPGCNQRQTCLYQVS 121
Query: 235 YGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGG 294
YGDGSYT G ETLT RT V+ VA+GCGH N+G+FVGAAGLLGLG G +S Q G
Sbjct: 122 YGDGSYTTGEFVTETLTFRRTKVEQVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGR 181
Query: 295 QTGGAFSYCLVSRGTGSS-GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGM 353
FSYCLV R S S+VFG A+ A + PL+ NPR +FYYV L G+ VGG
Sbjct: 182 TFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGT 241
Query: 354 RIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD 412
+ I+ F+L + G+ GV++D GT+VTRL PAY A RDAF A +L A S+FD
Sbjct: 242 PVSGITASHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFD 301
Query: 413 TCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNI 472
TCY+LSG +V+VPTV +F G V +LPASN+LIPVD +G FCFAFA + SGLSIIGNI
Sbjct: 302 TCYDLSGKTTVKVPTVVLHFRGADV-SLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNI 360
Query: 473 QQEGIQISFDGANGFVGFGPNVC 495
QQ+G ++ +D A+ VGF P C
Sbjct: 361 QQQGFRVVYDLASSRVGFSPRGC 383
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 363 bits (931), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 198/395 (50%), Positives = 266/395 (67%), Gaps = 10/395 (2%)
Query: 105 QHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGS 164
+ FH R+QRD KRV L+ ++ A + F + ++SG+ QGSGEYF RIGVG+
Sbjct: 72 EQLFHLRLQRDAKRVEALLNQI-----HARRSAGSSFSSSIISGLAQGSGEYFTRIGVGT 126
Query: 165 PPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC 224
P R YMV+D+GSD+VW+QC PC +CY Q+D VFDP S +++G+ C + +C RL++ GC
Sbjct: 127 PARYVYMVLDTGSDVVWLQCAPCRKCYTQTDHVFDPTKSRTYAGIPCGAPLCRRLDSPGC 186
Query: 225 HAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
C+Y+VSYGDGS+T G + ETLT R V VA+GCGH N+G+F GAAGLLGLG
Sbjct: 187 SNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRNRVTRVALGCGHDNEGLFTGAAGLLGLG 246
Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSR-GTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFY 341
G +S Q G + FSYCLV R + S++FG A+ A + PL++NP+ +FY
Sbjct: 247 RGRLSFPVQTGRRFNHKFSYCLVDRSASAKPSSVIFGDSAVSRTAHFTPLIKNPKLDTFY 306
Query: 342 YVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG 400
Y+ L G+ VGG + +S LFRL G+ GV++D+GT+VTRL PAY A RDAF
Sbjct: 307 YLELLGISVGGAPVRGLSASLFRLDAAGNGGVIIDSGTSVTRLTRPAYIALRDAFRIGAS 366
Query: 401 NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA 460
+L RA S+FDTC++LSG V+VPTV +F G V +LPA+N+LIPVD++G+FCFAFA
Sbjct: 367 HLKRAPEFSLFDTCFDLSGLTEVKVPTVVLHFRGADV-SLPATNYLIPVDNSGSFCFAFA 425
Query: 461 PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ SGLSIIGNIQQ+G +IS+D VGF P C
Sbjct: 426 GTMSGLSIIGNIQQQGFRISYDLTGSRVGFAPRGC 460
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 361 bits (926), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 198/361 (54%), Positives = 248/361 (68%), Gaps = 15/361 (4%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
VVSG+ QGSGEYF RIG+GSP R YMV+D+GSD+ W+QC PC+ CY QSDP+FDPA S+
Sbjct: 185 VVSGVGQGSGEYFSRIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSDPLFDPALSS 244
Query: 205 SFSGVSCSSAVCDRLENAGCHAG------RCRYEVSYGDGSYTKGTLALETLTIG---RT 255
S++ V C S C L+ + CH C YEV+YGDGSYT G A ETLT+G
Sbjct: 245 SYATVPCDSPHCRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTLGGDGSA 304
Query: 256 VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSL 315
V +VAIGCGH N+G+FVGAAGLL LGGG +S Q+ + FSYCLV R + S+ +L
Sbjct: 305 AVHDVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQI---SATEFSYCLVDRDSPSASTL 361
Query: 316 VFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVM 374
FG A PL+R+PR+ +FYYV L+G+ VGG + I F + + G GV++
Sbjct: 362 QFG--ASDSSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDEQGSGGVIV 419
Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSG 434
D+GTAVTRL + AY A RDAFV T LPRASGVS+FDTCY+L+G SV+VP VS F G
Sbjct: 420 DSGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSLFDTCYDLAGRSSVQVPAVSLRFEG 479
Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNV 494
G L LPA N+LIPVD AGT+C AFA + +SI+GN+QQ+GI++SFD A VGF PN
Sbjct: 480 GGELKLPAKNYLIPVDGAGTYCLAFAATGGAVSIVGNVQQQGIRVSFDTAKNTVGFSPNK 539
Query: 495 C 495
C
Sbjct: 540 C 540
>gi|388515789|gb|AFK45956.1| unknown [Medicago truncatula]
Length = 225
Score = 358 bits (920), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 182/225 (80%), Positives = 203/225 (90%)
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVP 330
MFVGAAGLLGLG G MS VGQLGGQ GG FSYCLVSRGT SSGSL FGRE++PVGA+WV
Sbjct: 1 MFVGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRGTESSGSLEFGRESVPVGASWVS 60
Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
L+ NPRAPSFYY+GLSGLGVGG+R+PISED+FRL ++G+ GVVMDTGTAVTRLP AY A
Sbjct: 61 LIHNPRAPSFYYIGLSGLGVGGLRVPISEDIFRLNELGEGGVVMDTGTAVTRLPAAAYNA 120
Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
FRDAFVAQT NLP+ SGVSIFDTCY+L+GFV+VRVPT+SFYF GGP+LTLPA NFLIPVD
Sbjct: 121 FRDAFVAQTTNLPKTSGVSIFDTCYDLNGFVTVRVPTISFYFLGGPILTLPARNFLIPVD 180
Query: 451 DAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
GTFCFAFAPS SGLSIIGNIQQEGI+IS DGANG++GFGPN+C
Sbjct: 181 SVGTFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGYIGFGPNIC 225
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 355 bits (912), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 198/402 (49%), Positives = 266/402 (66%), Gaps = 15/402 (3%)
Query: 108 FHARMQRDVKRVATLVRRLS-GGGADAAKHEVQD---FGTDVVSGMDQGSGEYFVRIGVG 163
F+ R+QRD RV +L + G + K + F V+SG+ QGSGEYF+R+GVG
Sbjct: 84 FNLRLQRDSLRVESLTSLAAVSAGRNVTKRPPRSAGGFSGVVISGLSQGSGEYFMRLGVG 143
Query: 164 SPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG 223
+P + YMV+D+GSD+VW+QC PC CY QSDPVF+PA S +F+ V C S +C RL+++
Sbjct: 144 TPATNMYMVLDTGSDVVWLQCSPCKVCYNQSDPVFNPAKSKTFATVPCGSRLCRRLDDSS 203
Query: 224 -CHAGR---CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLL 279
C + R C Y+VSYGDGS+T G + ETLT V +VA+GCGH N+G+FVGAAGLL
Sbjct: 204 ECVSRRSKACLYQVSYGDGSFTVGDFSTETLTFHGARVDHVALGCGHDNEGLFVGAAGLL 263
Query: 280 GLGGGSMSLVGQLGGQTGGAFSYCLVSR-----GTGSSGSLVFGREALPVGAAWVPLVRN 334
GLG G +S Q + G FSYCLV R + ++VFG A+P A + PL+ N
Sbjct: 264 GLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNGAVPKTAVFTPLLTN 323
Query: 335 PRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRD 393
P+ +FYY+ L G+ VGG R+P +SE F+L G+ GV++D+GT+VTRL AY A RD
Sbjct: 324 PKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVALRD 383
Query: 394 AFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
AF L RA S+FDTC++LSG +V+VPTV F+F+GG V +LPASN+LIPV++ G
Sbjct: 384 AFRLGATRLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFTGGEV-SLPASNYLIPVNNQG 442
Query: 454 TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
FCFAFA + LSIIGNIQQ+G ++++D VGF C
Sbjct: 443 RFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 484
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 354 bits (908), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 200/429 (46%), Positives = 273/429 (63%), Gaps = 22/429 (5%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLS-GGGADAAKHEVQ 139
+ L H D +SS S+ + F+ R+QRD RV ++ + G +A K +
Sbjct: 63 VHLSHVDALSSFSDAS-------PADLFNLRLQRDSLRVKSITSLAAVSTGRNATKRTPR 115
Query: 140 D---FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP 196
F V+SG+ QGSGEYF+R+GVG+P + YMV+D+GSD+VW+QC PC CY Q+D
Sbjct: 116 TAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDA 175
Query: 197 VFDPADSASFSGVSCSSAVCDRLENAG-CHAGR---CRYEVSYGDGSYTKGTLALETLTI 252
+FDP S +F+ V C S +C RL+++ C R C Y+VSYGDGS+T+G + ETLT
Sbjct: 176 IFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTF 235
Query: 253 GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----- 307
V +V +GCGH N+G+FVGAAGLLGLG G +S Q + G FSYCLV R
Sbjct: 236 HGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGS 295
Query: 308 GTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQ 366
+ ++VFG A+P + + PL+ NP+ +FYY+ L G+ VGG R+P +SE F+L
Sbjct: 296 SSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDA 355
Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
G+ GV++D+GT+VTRL PAY A RDAF L RA S+FDTC++LSG +V+VP
Sbjct: 356 TGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVP 415
Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
TV F+F GG V +LPASN+LIPV+ G FCFAFA + LSIIGNIQQ+G ++++D
Sbjct: 416 TVVFHFGGGEV-SLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGS 474
Query: 487 FVGFGPNVC 495
VGF C
Sbjct: 475 RVGFLSRAC 483
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 353 bits (907), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 202/449 (44%), Positives = 278/449 (61%), Gaps = 22/449 (4%)
Query: 61 FERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVA 120
+ + S + S ++ L H D +SS S+ + F R+QRD RV
Sbjct: 46 WPESKSFSDESVSESTTSLSVHLSHVDALSSFSDAS-------PVDLFKLRLQRDSLRVK 98
Query: 121 TLVRRLS-GGGADAAKHEVQD---FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSG 176
++ + G +A K + F V+SG+ QGSGEYF+R+GVG+P + YMV+D+G
Sbjct: 99 SITSLAAVSTGRNATKRTPRSAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTG 158
Query: 177 SDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG-CHAGR---CRYE 232
SD+VW+QC PC CY QSD +FDP S +F+ V C S +C RL+++ C R C Y+
Sbjct: 159 SDVVWLQCSPCKACYNQSDVIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQ 218
Query: 233 VSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQL 292
VSYGDGS+T+G + ETLT V +V +GCGH N+G+FVGAAGLLGLG G +S Q
Sbjct: 219 VSYGDGSFTEGDFSTETLTFHGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQT 278
Query: 293 GGQTGGAFSYCLVSR-----GTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSG 347
+ G FSYCLV R + ++VFG +A+P + + PL+ NP+ +FYY+ L G
Sbjct: 279 KSRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNDAVPKTSVFTPLLTNPKLDTFYYLQLLG 338
Query: 348 LGVGGMRIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
+ VGG R+P +SE F+L G+ GV++D+GT+VTRL AY A RDAF L RA
Sbjct: 339 ISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFRLGATKLKRAP 398
Query: 407 GVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL 466
S+FDTC++LSG +V+VPTV F+F GG V +LPASN+LIPV+ G FCFAFA + L
Sbjct: 399 SYSLFDTCFDLSGMTTVKVPTVVFHFGGGEV-SLPASNYLIPVNTEGRFCFAFAGTMGSL 457
Query: 467 SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
SIIGNIQQ+G ++++D VGF C
Sbjct: 458 SIIGNIQQQGFRVAYDLVGSRVGFLSRAC 486
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 353 bits (905), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 194/406 (47%), Positives = 261/406 (64%), Gaps = 19/406 (4%)
Query: 108 FHARMQRDVKRVATLVRRL---------SGGGADAAKHEVQDFGTDVVSGMDQGSGEYFV 158
H + RD RVA++ R+ S K QDF VVSG+ GSGEYF+
Sbjct: 1 MHVTISRDNLRVASIHGRINQTVNGLTRSRSRDRQTKVPSQDFQAPVVSGLSLGSGEYFI 60
Query: 159 RIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
RI VG+PPR Y+V+D+GSDI+W+QC PC CY QSD +FDP S+++S + CS+ C
Sbjct: 61 RISVGTPPRRMYLVMDTGSDILWLQCAPCVNCYHQSDAIFDPYKSSTYSTLGCSTRQCLN 120
Query: 219 LENAGCHAGRCRYEVSYGDGSYTKGTLALETLT------IGRTVVKNVAIGCGHKNQGMF 272
L+ C A +C Y+V YGDGS+T G + ++ +G+ V+ + +GCGH N+G F
Sbjct: 121 LDIGTCQANKCLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLGCGHDNEGYF 180
Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS--GSLVFGREALP-VGAAWV 329
VGAAGLLGLG G +S Q+ Q GG FSYCL R T S+ SLVFG A+P GA +
Sbjct: 181 VGAAGLLGLGKGPLSFPNQVDPQNGGRFSYCLTDRETDSTEGSSLVFGEAAVPPAGARFT 240
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
P N R P+FYY+ ++G+ VGG + I F+L +G+ GV++D+GT+VTRL AY
Sbjct: 241 PQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQLDSLGNGGVIIDSGTSVTRLQNAAYA 300
Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
+ RDAF A T +L +G S+FDTCY+LSG SV VPTV+ +F GG L LPASN+LIPV
Sbjct: 301 SLRDAFRAGTSDLAPTAGFSLFDTCYDLSGLASVDVPTVTLHFQGGTDLKLPASNYLIPV 360
Query: 450 DDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
D++ TFC AFA + +G SIIGNIQQ+G ++ +D + VGF P+ C
Sbjct: 361 DNSNTFCLAFAGT-TGPSIIGNIQQQGFRVIYDNLHNQVGFVPSQC 405
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 352 bits (902), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 193/426 (45%), Positives = 265/426 (62%), Gaps = 18/426 (4%)
Query: 84 VHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV----- 138
+HRD S N + ++ R+ RD R+ ++ R+S G A K +
Sbjct: 1 MHRDSADSPYRPANATVHGLVRN----RLHRDELRLLSISSRISLGVAGIPKSSLTNPLK 56
Query: 139 -------QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCY 191
QDF T + SG+ GSGEYFV +GVG+PPR+ MV D+GSD++W+QC PC CY
Sbjct: 57 NTNPFLQQDFETPLRSGLSDGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCY 116
Query: 192 KQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLT 251
Q+DP+F+P+ S++F ++C S++C +L GC +C Y+VSYGDGS+T G + ETL+
Sbjct: 117 GQTDPLFNPSFSSTFQSITCGSSLCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLS 176
Query: 252 IGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
G V +VAIGCGH NQG+F GAAGLLGLG G +S Q+G G FSYCL +R +
Sbjct: 177 FGSNAVNSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRESTG 236
Query: 312 SGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL-TQMGDD 370
S L+FG +A+ A + L+ NP+ +FYYV + G+ VGG + I L + G+
Sbjct: 237 SVPLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGNG 296
Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAFVA-QTGNLPRASGVSIFDTCYNLSGFVSVRVPTVS 429
GV++D+GTAVTRL T AY RDAF A + SG S+FDTCY+LSG S+ +P VS
Sbjct: 297 GVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVS 356
Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVG 489
F F+GG + LPA N ++PVD++GT+C AFAP+ SIIGNIQQ+ ++SFD VG
Sbjct: 357 FVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGNRVG 416
Query: 490 FGPNVC 495
G N C
Sbjct: 417 IGANQC 422
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 352 bits (902), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 193/426 (45%), Positives = 265/426 (62%), Gaps = 18/426 (4%)
Query: 84 VHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV----- 138
+HRD S N + ++ R+ RD R+ ++ R+S G A K +
Sbjct: 1 MHRDSADSPYRPANATVHGLVRN----RLHRDELRLLSISSRISLGVAGIPKSSLTNPLK 56
Query: 139 -------QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCY 191
QDF T + SG+ GSGEYFV +GVG+PPR+ MV D+GSD++W+QC PC CY
Sbjct: 57 NTNPFLQQDFETPLRSGLSDGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCY 116
Query: 192 KQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLT 251
Q+DP+F+P+ S++F ++C S++C +L GC +C Y+VSYGDGS+T G + ETL+
Sbjct: 117 GQTDPLFNPSFSSTFQSITCGSSLCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLS 176
Query: 252 IGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
G V +VAIGCGH NQG+F GAAGLLGLG G +S Q+G G FSYCL +R +
Sbjct: 177 FGSNAVNSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRESTG 236
Query: 312 SGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL-TQMGDD 370
S L+FG +A+ A + L+ NP+ +FYYV + G+ VGG + I L + G+
Sbjct: 237 SVPLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGNG 296
Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAFVA-QTGNLPRASGVSIFDTCYNLSGFVSVRVPTVS 429
GV++D+GTAVTRL T AY RDAF A + SG S+FDTCY+LSG S+ +P VS
Sbjct: 297 GVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVS 356
Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVG 489
F F+GG + LPA N ++PVD++GT+C AFAP+ SIIGNIQQ+ ++SFD VG
Sbjct: 357 FVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGNRVG 416
Query: 490 FGPNVC 495
G N C
Sbjct: 417 IGANQC 422
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 351 bits (900), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 194/435 (44%), Positives = 261/435 (60%), Gaps = 18/435 (4%)
Query: 75 DEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAA 134
D +LEL+HR+ + + H H+ +QRD +RV + + G
Sbjct: 52 DGGTLSLELIHRNSLLREAKE----KLHTHEQLLLETLQRDEQRVRWIESKAQLAGKKKD 107
Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQS 194
+ D V SG+ GSGEYFVR+GVG+P RS +MV+D+GSD+ W+QCQPC CYKQ+
Sbjct: 108 EASSTDLNGPVTSGLLYGSGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQA 167
Query: 195 DPVFDPADSASFSGVSCSSAVCDRLENAGCH-----AGRCRYEVSYGDGSYTKGTLALET 249
DP+FDP +S+SF + C S +C LE C RC Y+V+YGDGS++ G + +
Sbjct: 168 DPIFDPRNSSSFQRIPCLSPLCKALEIHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDL 227
Query: 250 LTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQL-----GGQTGGAFSYC 303
T+G + +VA GCG N+G+F GAAGLLGLG G +S Q+ T +FSYC
Sbjct: 228 FTLGTGSKAMSVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYC 287
Query: 304 LVSRG---TGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISED 360
LV R T SS SL+FG A+P AA PL++NP+ +FYY + G+ VGG ++PIS
Sbjct: 288 LVDRSNPMTRSSSSLIFGAAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLK 347
Query: 361 LFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGF 420
+L+Q G GV++D+GT+VTR PT Y RDAF T NLP A S+FDTCYN SG
Sbjct: 348 SLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTNLPSAPRYSLFDTCYNFSGK 407
Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQIS 480
SV VP + +F G L LP +N+LIP++ AG+FC AFAP+ L IIGNIQQ+ +I
Sbjct: 408 ASVDVPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIG 467
Query: 481 FDGANGFVGFGPNVC 495
FD + F P C
Sbjct: 468 FDLQKSHLAFAPQQC 482
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 350 bits (897), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 185/360 (51%), Positives = 246/360 (68%), Gaps = 10/360 (2%)
Query: 143 TDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPAD 202
+ V SG+ GSGEYFVR+G+GSP + QY+V+D+GSD+ W+QC PC CYKQ+D VFDP
Sbjct: 1 SQVTSGLAFGSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRA 60
Query: 203 SASFSGVSCSSAVCDRLENAGCHA--GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNV 260
S+SF +SCS+ C L+ C + RC Y+VSYGDGS+T G LA ++ ++ R V
Sbjct: 61 SSSFRRLSCSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFSVSRGRTSPV 120
Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG--SSGSLVFG 318
GCGH N+G+FVGAAGLLGLG G +S QL + FSYCLVSR G +S +L+FG
Sbjct: 121 VFGCGHDNEGLFVGAAGLLGLGAGKLSFPSQLSSR---KFSYCLVSRDNGVRASSALLFG 177
Query: 319 REALPVGA--AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ-MGDDGVVMD 375
ALP A A+ L++NP+ +FYY GLSG+ +GG + I F+L+ G GV++D
Sbjct: 178 DSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIID 237
Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGG 435
+GT+VTRLPT AY RDAF + T LPRA+ S+FDTCY+ S SV +PTVSF+F GG
Sbjct: 238 SGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHFEGG 297
Query: 436 PVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ LP SN+L+PVD +GTFCFAF+ + LSIIGNIQQ+ ++++ D + VGF P C
Sbjct: 298 ASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 349 bits (895), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 185/360 (51%), Positives = 245/360 (68%), Gaps = 10/360 (2%)
Query: 143 TDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPAD 202
+ V SG+ GSGEYFVR+G+GSP + QY+V+D+GSD+ W+QC PC CYKQ+D VFDP
Sbjct: 1 SQVTSGLAFGSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRA 60
Query: 203 SASFSGVSCSSAVCDRLENAGCHA--GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNV 260
S+SF +SCS+ C L+ C + RC Y+VSYGDGS+T G LA ++ + R V
Sbjct: 61 SSSFRRLSCSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFLVSRGRTSPV 120
Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG--SSGSLVFG 318
GCGH N+G+FVGAAGLLGLG G +S QL + FSYCLVSR G +S +L+FG
Sbjct: 121 VFGCGHDNEGLFVGAAGLLGLGAGKLSFPSQLSSR---KFSYCLVSRDNGVRASSALLFG 177
Query: 319 REALPVGA--AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ-MGDDGVVMD 375
ALP A A+ L++NP+ +FYY GLSG+ +GG + I F+L+ G GV++D
Sbjct: 178 DSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIID 237
Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGG 435
+GT+VTRLPT AY RDAF + T LPRA+ S+FDTCY+ S SV +PTVSF+F GG
Sbjct: 238 SGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHFEGG 297
Query: 436 PVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ LP SN+L+PVD +GTFCFAF+ + LSIIGNIQQ+ ++++ D + VGF P C
Sbjct: 298 ASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 347 bits (891), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 189/329 (57%), Positives = 229/329 (69%), Gaps = 7/329 (2%)
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH--AGR 228
MV+D+GSD+ WVQCQPC+ CY+QSDPVFDP+ SAS++ VSC S C L+ A C G
Sbjct: 1 MVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGA 60
Query: 229 CRYEVSYGDGSYTKGTLALETLTIG-RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMS 287
C YEV+YGDGSYT G A ETLT+G T V NVAIGCGH N+G+FVGAAGLL LGGG +S
Sbjct: 61 CLYEVAYGDGSYTVGDFATETLTLGDSTPVGNVAIGCGHDNEGLFVGAAGLLALGGGPLS 120
Query: 288 LVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSG 347
Q+ T FSYCLV R + ++ +L FG A G PLVR+PR +FYYV LSG
Sbjct: 121 FPSQISAST---FSYCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALSG 177
Query: 348 LGVGGMRIPISEDLFRLTQM-GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
+ VGG + I F + G GV++D+GTAVTRL + AY A RDAFV +LPR S
Sbjct: 178 ISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTS 237
Query: 407 GVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL 466
GVS+FDTCY+LS SV VP VS F GG L LPA N+LIPVD AGT+C AFAP+ + +
Sbjct: 238 GVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAV 297
Query: 467 SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
SIIGN+QQ+G ++SFD A G VGF PN C
Sbjct: 298 SIIGNVQQQGTRVSFDTARGAVGFTPNKC 326
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 343 bits (881), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 206/444 (46%), Positives = 273/444 (61%), Gaps = 26/444 (5%)
Query: 65 NNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVR 124
N S+ + + R+ LVHRD S ++ + Y R++RD KR A L
Sbjct: 62 NLASAEDAPASTVRF--RLVHRDDFSVNATAAELLAY---------RLERDAKRAARL-- 108
Query: 125 RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC 184
+ G A+ + VVSG+ QGSGEYF +IGVG+P MV+D+GSD+VW+QC
Sbjct: 109 SAAAGPANGTRRGGGGVVAPVVSGLAQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQC 168
Query: 185 QPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTK 242
PC +CY+QS VFDP S S++ V C++ +C RL++ GC R C Y+V+YGDGS T
Sbjct: 169 APCRRCYEQSGQVFDPRRSRSYNAVGCAAPLCRRLDSGGCDLRRSACLYQVAYGDGSVTA 228
Query: 243 GTLALETLTI-GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFS 301
G A ETLT G V VA+GCGH N+G+FV AAGLLGLG GS+S Q+ + G +FS
Sbjct: 229 GDFATETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGRSFS 288
Query: 302 YCLVSRGTGS-----SGSLVFGREAL--PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMR 354
YCLV R + + S ++ FG A+ V +++ P+V+NPR +FYYV L G+ VGG R
Sbjct: 289 YCLVDRTSSANTASRSSTVTFGSGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGAR 348
Query: 355 IP--ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG-VSIF 411
+P + DL G GV++D+GT+VTRL PAY A RDAF L + G S+F
Sbjct: 349 VPGVANSDLRLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRGAAAGLRLSPGGFSLF 408
Query: 412 DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGN 471
DTCY+LSG V+VPTVS +F+GG LP N+LIPVD GTFCFAFA + G+SIIGN
Sbjct: 409 DTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGN 468
Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
IQQ+G ++ FDG V F P C
Sbjct: 469 IQQQGFRVVFDGDGQRVAFTPKGC 492
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 342 bits (878), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 204/433 (47%), Positives = 267/433 (61%), Gaps = 30/433 (6%)
Query: 80 NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
+ +VHRD + ++ T + HR +QRD +R A R+S + +
Sbjct: 66 HFRVVHRDTFAVNA-TAGELLKHR--------LQRDKRRAA----RISEAAGAGGGNGRK 112
Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
VVSG+ QGSGEYF +IGVG+P MV+D+GSD+VWVQC PC +CY+QS PVFD
Sbjct: 113 GVAAPVVSGLAQGSGEYFTKIGVGTPATQALMVLDTGSDVVWVQCAPCRRCYEQSGPVFD 172
Query: 200 PADSASFSGVSCSSAVCDRLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTI-GRTV 256
P S+S+ V C +A+C RL++ GC G C Y+V+YGDGS T G ETLT G
Sbjct: 173 PRRSSSYGAVGCGAALCRRLDSGGCDLRRGACMYQVAYGDGSVTAGDFVTETLTFAGGAR 232
Query: 257 VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--------- 307
V VA+GCGH N+G+FV AAGLLGLG G +S Q+ + G +FSYCLV R
Sbjct: 233 VARVALGCGHDNEGLFVAAAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAP 292
Query: 308 GTGSSGSLVFGREALPV-GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRL- 364
G+ S ++ FG ++ A++ P+VRNPR +FYYV L G+ VGG R+P ++E RL
Sbjct: 293 GSHRSSTVSFGAGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLD 352
Query: 365 TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS--GVSIFDTCYNLSGFVS 422
G GV++D+GT+VTRL +Y A RDAF A R S G S+FDTCY+L G
Sbjct: 353 PSTGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRV 412
Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
V+VPTVS +F+GG LP N+LIPVD GTFCFAFA + G+SIIGNIQQ+G ++ FD
Sbjct: 413 VKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFD 472
Query: 483 GANGFVGFGPNVC 495
G VGF P C
Sbjct: 473 GDGQRVGFAPKGC 485
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 341 bits (875), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 207/443 (46%), Positives = 270/443 (60%), Gaps = 46/443 (10%)
Query: 80 NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRL------------S 127
+ +VHRD ++++ + RH R+QRD +R A + + S
Sbjct: 70 HFRVVHRDAFAANATAAELL---RH------RLQRDKRRAARISKAAAGGGAGAANGTRS 120
Query: 128 GGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC 187
GGA AA VVSG+ QGSGEYF +IGVG+P MV+D+GSD+VW+QC PC
Sbjct: 121 RGGAVAAP---------VVSGLAQGSGEYFTKIGVGTPSTPALMVLDTGSDVVWLQCAPC 171
Query: 188 SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTL 245
+CY QS PVFDP S+S+ V C++ +C RL++ GC R C Y+V+YGDGS T G
Sbjct: 172 RRCYDQSGPVFDPRRSSSYGAVDCAAPLCRRLDSGGCDLRRRACLYQVAYGDGSVTAGDF 231
Query: 246 ALETLTI-GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL 304
A ETLT G V VA+GCGH N+G+FV AAGLLGLG GS+S Q+ + G +FSYCL
Sbjct: 232 ATETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGKSFSYCL 291
Query: 305 VSR---------GTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
V R S ++ FG + A++ P+VRNPR +FYYV L G+ VGG R+
Sbjct: 292 VDRTSSSSSGAASRSRSSTVTFGPPSASA-ASFTPMVRNPRMETFYYVQLVGISVGGARV 350
Query: 356 P-ISEDLFRL-TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS-GVSIFD 412
P ++E RL G GV++D+GT+VTRL P+Y A RDAF A L + G S+FD
Sbjct: 351 PGVAESDLRLDPSTGRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFD 410
Query: 413 TCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNI 472
TCY+L G V+VPTVS +F+GG LP N+LIPVD GTFCFAFA + G+SIIGNI
Sbjct: 411 TCYDLGGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNI 470
Query: 473 QQEGIQISFDGANGFVGFGPNVC 495
QQ+G ++ FDG VGF P C
Sbjct: 471 QQQGFRVVFDGDGQRVGFAPKGC 493
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 338 bits (866), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 186/406 (45%), Positives = 249/406 (61%), Gaps = 14/406 (3%)
Query: 104 HQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVG 163
H+ +QRD +RV + + G + D V SG+ GSGEYFVR+G+G
Sbjct: 2 HEQLLLETLQRDERRVRWIESKAKLAGKKKDEASSTDLNGPVTSGLLYGSGEYFVRLGLG 61
Query: 164 SPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG 223
+P RS +MV+D+GSD+ W+QCQPC CYKQ+DP+FDP +S+SF + C S +C LE
Sbjct: 62 TPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSPLCKALEVHS 121
Query: 224 CH-----AGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAG 277
C RC Y+V+YGDGS++ G + + T+G + +VA GCG N+G+F GAAG
Sbjct: 122 CSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKAMSVAFGCGFDNEGLFAGAAG 181
Query: 278 LLGLGGGSMSLVGQL-----GGQTGGAFSYCLVSRG---TGSSGSLVFGREALPVGAAWV 329
LLGLG G +S Q+ T +FSYCLV R T SS SL+FG A+P AA
Sbjct: 182 LLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLIFGVAAIPSTAALS 241
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
PL++NP+ +FYY + G+ VGG ++PIS +L+Q G GV++D+GT+VTR PT Y
Sbjct: 242 PLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYA 301
Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
RDAF T NLP A S+FDTCYN SG SV VP + +F G L LP +N+LIP+
Sbjct: 302 TIRDAFRNATINLPSAPRYSLFDTCYNFSGKASVDVPALVLHFENGADLQLPPTNYLIPI 361
Query: 450 DDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ AG+FC AFAP+ L IIGNIQQ+ +I FD + F P C
Sbjct: 362 NTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQKSHLAFAPQQC 407
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 337 bits (863), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 210/469 (44%), Positives = 283/469 (60%), Gaps = 27/469 (5%)
Query: 52 AKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTT-NNMHY---HRHQHS 107
AK Q L + + + SS+ AR + + V ++++ + T + + + HR
Sbjct: 28 AKPVQTQSLLVTPLSPTPFSASSELARGDDKDVFAGNLAAAEDATPSTVQFSVVHRDDFV 87
Query: 108 FHA--------RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVR 159
+A R+QRD KR A + A+ + VVSG+ QGSGEYF +
Sbjct: 88 VNATAAELLGHRLQRDGKRAARISAAAGA--ANGTRRTGSGVVAPVVSGLAQGSGEYFTK 145
Query: 160 IGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL 219
IGVG+P MV+D+GSD+VW+QC PC +CY QS VFDP S S+ V CS+ +C RL
Sbjct: 146 IGVGTPATPALMVLDTGSDVVWLQCAPCRRCYDQSGQVFDPRRSRSYGAVGCSAPLCRRL 205
Query: 220 ENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGMFVGAA 276
++ GC R C Y+V+YGDGS T G A ETLT G V +A+GCGH N+G+FV AA
Sbjct: 206 DSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAGGARVARIALGCGHDNEGLFVAAA 265
Query: 277 GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS-----SGSLVFGREAL--PVGAAWV 329
GLLGLG GS+S Q+ + G +FSYCLV R + + S ++ FG A+ V A++
Sbjct: 266 GLLGLGRGSLSFPAQISRRYGRSFSYCLVDRTSSANPASHSSTVTFGSGAVGSTVAASFT 325
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRL-TQMGDDGVVMDTGTAVTRLPTPA 387
P+V+NPR +FYYV L G+ VGG R+ +++ RL G GV++D+GT+VTRL PA
Sbjct: 326 PMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRLDPSSGRGGVIVDSGTSVTRLARPA 385
Query: 388 YEAFRDAFVAQTGNLPRASG-VSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
Y A RDAF A L + G S+FDTCY+LSG V+VPTVS +F+GG LP N+L
Sbjct: 386 YSALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYL 445
Query: 447 IPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
IPVD GTFCFAFA + G+SIIGNIQQ+G ++ FDG VGF P C
Sbjct: 446 IPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFVPKGC 494
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 333 bits (854), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 177/370 (47%), Positives = 245/370 (66%), Gaps = 10/370 (2%)
Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQS 194
K QDF V+SG+ GSGEYF+R+ VG+PPR Y+V+D+GSDI+W+QC PC CY Q
Sbjct: 16 KVPSQDFQAPVISGLSLGSGEYFIRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCYHQC 75
Query: 195 DPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI-- 252
D VFDP S+++S + C+S C L+ GC +C Y+V YGDGS++ G A + +++
Sbjct: 76 DEVFDPYKSSTYSTLGCNSRQCLNLDVGGCVGNKCLYQVDYGDGSFSTGEFATDAVSLNS 135
Query: 253 ----GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG 308
G+ V+ + +GCGH N+G FVGAAGLLGLG G +S Q+ + GG FSYCL R
Sbjct: 136 TSGGGQVVLNKIPLGCGHDNEGYFVGAAGLLGLGKGPLSFPNQINSENGGRFSYCLTGRD 195
Query: 309 TGSS--GSLVFGREALP-VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
T S+ SL+FG A+P G + P N R +FYY+ ++G+ VGG + I F+L
Sbjct: 196 TDSTERSSLIFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQLD 255
Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV 425
+G+ GV++D+GT+VTRL AY + R+AF A T +L + S+FDTCYNLS SV V
Sbjct: 256 SLGNGGVIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSLFDTCYNLSDLSSVDV 315
Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
PTV+ +F GG L LPASN+L+PVD++ TFC AFA + +G SIIGNIQQ+G ++ +D +
Sbjct: 316 PTVTLHFQGGADLKLPASNYLVPVDNSSTFCLAFAGT-TGPSIIGNIQQQGFRVIYDNLH 374
Query: 486 GFVGFGPNVC 495
VGF P+ C
Sbjct: 375 NQVGFVPSQC 384
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 330 bits (847), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 171/390 (43%), Positives = 238/390 (61%), Gaps = 6/390 (1%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
M+RD R+ + R+ + V SG+ GSGEYF R+G+GSP RS Y+
Sbjct: 1 MERDEARLRWIHHRIQSSDHRHRRGRSLLQTAQVSSGLSLGSGEYFARMGIGSPQRSYYL 60
Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRY 231
+D+GSD+ W+QC PCS CY Q DP++DP++S+S+ V C SA+C L+ + C C Y
Sbjct: 61 ELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSALCQALDYSACQGMGCSY 120
Query: 232 EVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSL 288
V YGD S + G L +E+ +G T ++N+A GCGH N G+F G AGLLG+GGG++S
Sbjct: 121 RVVYGDSSASSGDLGIESFYLGPNSSTAMRNIAFGCGHSNSGLFRGEAGLLGMGGGTLSF 180
Query: 289 VGQLGGQTGGAFSYCLVSRGT---GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGL 345
Q+ G AFSYCLV R + S L+FGR A+P A + PL++NPR +FYY L
Sbjct: 181 FSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFAARFTPLLKNPRIDTFYYAIL 240
Query: 346 SGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA 405
+G+ VGG +PI F LT G G ++D+GT+VTR+ AY RDA+ A + NLP A
Sbjct: 241 TGISVGGTALPIPPAQFALTGNGTGGAILDSGTSVTRVVPAAYAVLRDAYRAASRNLPPA 300
Query: 406 SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG 465
GV + DTC+N G +V++P++ +F + LP N LIPVD +GTFC AFAPS
Sbjct: 301 PGVYLLDTCFNFQGLPTVQIPSLVLHFDNDVDMVLPGGNILIPVDRSGTFCLAFAPSSMP 360
Query: 466 LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+S+IGN+QQ+ +I FD + P C
Sbjct: 361 ISVIGNVQQQTFRIGFDLQRSLIAIAPREC 390
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 330 bits (846), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 192/412 (46%), Positives = 262/412 (63%), Gaps = 17/412 (4%)
Query: 97 NNMHYHRHQHSFHARMQRDVKRVA----TLVRRLSGG---GADAAKHEVQDFGT-DVVSG 148
+N Y + AR+ RD RV L R L+GG G + + D T VVSG
Sbjct: 80 HNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSG 139
Query: 149 MDQGSG-EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ---CYKQSDPVFDPADSA 204
+GSG EY +IGVG P + Y+V D+GSD+ W+QCQPC+ CYKQ DP+FDP S+
Sbjct: 140 QSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSS 199
Query: 205 SFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIG 263
S+S +SC+S C L+ A C++ C Y+V YGDGS+T G LA ETL+ G + + N+ IG
Sbjct: 200 SYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNSIPNLPIG 259
Query: 264 CGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP 323
CGH N+G+F G AGL+GLGGG++SL QL +FSYCLV+ + SS +L F +P
Sbjct: 260 CGHDNEGLFAGGAGLIGLGGGAISLSSQL---KASSFSYCLVNLDSDSSSTLEFNSN-MP 315
Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
+ PLV+N R S+ YV + G+ VGG +PIS F + + G G+++D+GT ++RL
Sbjct: 316 SDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISRL 375
Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPAS 443
P+ YE+ R+AFV T +L A G+S+FDTCYN SG +V VPT++F S G L LPA
Sbjct: 376 PSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPAR 435
Query: 444 NFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
N+LI +D AGT+C AF + S LSIIG+ QQ+GI++S+D N VGF N C
Sbjct: 436 NYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLVGFSTNKC 487
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 329 bits (843), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 166/357 (46%), Positives = 230/357 (64%), Gaps = 6/357 (1%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
+ SG+ GSGEYF R+G+G+P RS Y+ +D+GSD+ W+QC PCS CY Q DP++DP++S+
Sbjct: 1 ISSGLSLGSGEYFARMGIGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSS 60
Query: 205 SFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVA 261
S+ V C SA+C L+ + C C Y V YGD S + G L +E+ +G T ++N+A
Sbjct: 61 SYRRVYCGSALCQALDYSACQGMGCSYRVVYGDSSASSGDLGIESFYLGPNSSTAMRNIA 120
Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT---GSSGSLVFG 318
GCGH N G+F G AGLLG+GGG++S Q+ G AFSYCLV R + S L+FG
Sbjct: 121 FGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFG 180
Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
R A+P A + PL++NPR +FYY L+G+ VGG +PI F LT G G ++D+GT
Sbjct: 181 RTAIPFAARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGNGTGGAILDSGT 240
Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
+VTR+ PAY RDA+ A + NLP A GV + DTC+N G +V++P++ +F G +
Sbjct: 241 SVTRVVPPAYAVLRDAYRAASRNLPPAPGVYLLDTCFNFQGLPTVQIPSLVLHFDNGVDM 300
Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
LP N LIPVD +GTFC AFAPS +S+IGN+QQ+ +I FD + P C
Sbjct: 301 VLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQRSLIAIAPREC 357
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 329 bits (843), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 192/412 (46%), Positives = 262/412 (63%), Gaps = 17/412 (4%)
Query: 97 NNMHYHRHQHSFHARMQRDVKRVA----TLVRRLSGG---GADAAKHEVQDFGT-DVVSG 148
+N Y + AR+ RD RV L R L+GG G + + D T VVSG
Sbjct: 80 HNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSG 139
Query: 149 MDQGSG-EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ---CYKQSDPVFDPADSA 204
+GSG EY +IGVG P + Y+V D+GSD+ W+QCQPC+ CYKQ DP+FDP S+
Sbjct: 140 QSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSS 199
Query: 205 SFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIG 263
S+S +SC+S C L+ A C++ C Y+V YGDGS+T G LA ETL+ G + + N+ IG
Sbjct: 200 SYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNSIPNLPIG 259
Query: 264 CGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP 323
CGH N+G+F G AGL+GLGGG++SL QL +FSYCLV+ + SS +L F +P
Sbjct: 260 CGHDNEGLFAGGAGLIGLGGGAISLSSQL---KASSFSYCLVNLDSDSSSTLEFN-SYMP 315
Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
+ PLV+N R S+ YV + G+ VGG +PIS F + + G G+++D+GT ++RL
Sbjct: 316 SDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISRL 375
Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPAS 443
P+ YE+ R+AFV T +L A G+S+FDTCYN SG +V VPT++F S G L LPA
Sbjct: 376 PSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPAR 435
Query: 444 NFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
N+LI +D AGT+C AF + S LSIIG+ QQ+GI++S+D N VGF N C
Sbjct: 436 NYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSIVGFSTNKC 487
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 328 bits (842), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 187/394 (47%), Positives = 247/394 (62%), Gaps = 14/394 (3%)
Query: 106 HSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSP 165
S + +++ +K RR++G + V SG QG+GEYF RIGVG P
Sbjct: 140 QSLNRKLELSLKGGKQFGRRINGSDS------TNSLTAPVTSGASQGAGEYFARIGVGQP 193
Query: 166 PRSQYMVIDSGSDIVWVQCQPC---SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA 222
+S + V D+GSD+ W+QCQPC + CYKQ P+FDP S+S+S +SC S C L+ A
Sbjct: 194 VQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEA 253
Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGL 281
C A C YEV YGDGS+T G LA ET + + + N+ IGCGH N+G+FVGA GL+GL
Sbjct: 254 ACDANSCIYEVEYGDGSFTVGELATETFSFRHSNSIPNLPIGCGHDNEGLFVGADGLIGL 313
Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFY 341
GGG++SL QL +FSYCLV + SS +L F + P + PLV+N R P+F
Sbjct: 314 GGGAISLSSQL---EATSFSYCLVDLDSESSSTLDFNADQ-PSDSLTSPLVKNDRFPTFR 369
Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
YV + G+ VGG +PIS F + + G G+++D+GT +T +P+ Y+ RDAFV T N
Sbjct: 370 YVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKN 429
Query: 402 LPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP 461
LP A GVS FDTCY+LS +V VPT++F G L LPA N LI VD AGTFC AF P
Sbjct: 430 LPPAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLIQVDSAGTFCLAFLP 489
Query: 462 SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
S LSIIGN+QQ+GI++S+D AN VGF + C
Sbjct: 490 STFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 328 bits (842), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 197/434 (45%), Positives = 262/434 (60%), Gaps = 28/434 (6%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATL-----VRRLSGGGADAAK 135
L +VHRD + ++ + + R++RD +R + + + G
Sbjct: 76 LRVVHRDDFAVNATAAELLAH---------RLRRDKRRASRISAAAGGAAAANGTRVGGG 126
Query: 136 HEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD 195
F VVSG+ QGSGEYF +IGVG+P MV+D+GSD+VW+QC PC +CY QS
Sbjct: 127 GGGSGFVAPVVSGLAQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSG 186
Query: 196 PVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIG 253
+FDP S S+ V C++ +C RL++ GC R C Y+V+YGDGS T G A ETLT
Sbjct: 187 QMFDPRASHSYGAVDCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFA 246
Query: 254 RTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS 312
V VA+GCGH N+G+FV AAGLLGLG GS+S Q+ + G +FSYCLV R + S+
Sbjct: 247 SGARVPRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSA 306
Query: 313 GS------LVFGREAL--PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP--ISEDLF 362
+ + FG A+ A++ P+V+NPR +FYYV L G+ VGG R+P DL
Sbjct: 307 SATSRSSTVTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLR 366
Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG-VSIFDTCYNLSGFV 421
G GV++D+GT+VTRL PAY A RDAF A L + G S+FDTCY+LSG
Sbjct: 367 LDPSTGRGGVIVDSGTSVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGLK 426
Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISF 481
V+VPTVS +F+GG LP N+LIPVD GTFCFAFA + G+SIIGNIQQ+G ++ F
Sbjct: 427 VVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVF 486
Query: 482 DGANGFVGFGPNVC 495
DG +GF P C
Sbjct: 487 DGDGQRLGFVPKGC 500
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 327 bits (838), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 170/356 (47%), Positives = 231/356 (64%), Gaps = 2/356 (0%)
Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
F + ++SG+ GSG+YF RIGVG+P RS YMV D+GSD+ W+QC PC +CY+Q DP+F+P
Sbjct: 66 FASPLISGIAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNP 125
Query: 201 ADSASFSGVSCSSAVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKN 259
+ S+SF ++C+S++C +L+ GC C Y+VSYGDGS+T G + ETL+ G V++
Sbjct: 126 SLSSSFKPLACASSICGKLKIKGCSRKNECMYQVSYGDGSFTVGDFSTETLSFGEHAVRS 185
Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR 319
VA+GCG NQG+F GAAGLLGLG G +S Q G FSYCL R + + SLVFG
Sbjct: 186 VAMGCGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRESAIAASLVFGP 245
Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
A+P A + L+ N R ++YYVGL+ + V G + I D F + G GV++D+GTA
Sbjct: 246 SAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTA 305
Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
++RL TPAY A RDAF + P A G+S+FDTCY+LS + +P V F GG +
Sbjct: 306 ISRLTTPAYTALRDAFRSLV-TFPSAPGISLFDTCYDLSSMKTATLPAVVLDFDGGASMP 364
Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
LPA L+ VDD GT+C AFAP SIIGN+QQ+ +IS D +G P+ C
Sbjct: 365 LPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 420
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 327 bits (837), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 187/394 (47%), Positives = 247/394 (62%), Gaps = 14/394 (3%)
Query: 106 HSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSP 165
S + +++ +K RR++G + V SG QG+GEYF RIGVG P
Sbjct: 140 QSLNRKLELSLKGGKQFGRRINGSDS------TNSLTAPVTSGASQGAGEYFARIGVGQP 193
Query: 166 PRSQYMVIDSGSDIVWVQCQPC---SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA 222
+S + V D+GSD+ W+QCQPC + CYKQ P+FDP S+S+S +SC S C L+ A
Sbjct: 194 VQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEA 253
Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGL 281
C A C YEV YGDGS+T G LA ET + + + N+ IGCGH N+G+FVGAAGL+GL
Sbjct: 254 ACDANSCIYEVEYGDGSFTVGELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGL 313
Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFY 341
GGG++SL QL +FSYCLV + SS +L F + P + PLV+N R P+F
Sbjct: 314 GGGAISLSSQL---EATSFSYCLVDLDSESSSTLDFNADQ-PSDSLTSPLVKNDRFPTFR 369
Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
YV + G+ VGG +PIS F + + G G+++D+GT +T +P+ Y+ RDAFV T N
Sbjct: 370 YVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKN 429
Query: 402 LPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP 461
LP A GVS FDTCY+LS +V VPT++F G L LPA N L VD AGTFC AF P
Sbjct: 430 LPPAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLP 489
Query: 462 SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
S LSIIGN+QQ+GI++S+D AN VGF + C
Sbjct: 490 STFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 326 bits (835), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 197/448 (43%), Positives = 267/448 (59%), Gaps = 28/448 (6%)
Query: 63 RHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATL 122
+ +S + ++ + + L HR+ + ++ ++ + + + + A AT
Sbjct: 47 QEQQLSLAAPRTNASTLHFRLAHREHFALNATASDLLAHLLARDAARAAALLAAPNNATR 106
Query: 123 VRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWV 182
RR G F ++SG+ QGSGEYF ++GVG+P + MV+D+GSD+VW+
Sbjct: 107 PRRRGG------------FAAPLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWL 154
Query: 183 QCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSY 240
QC PC CY QS VFDP S S++ V C + +C RL++AGC R C Y+V+YGDGS
Sbjct: 155 QCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSV 214
Query: 241 TKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA 299
T G A ETLT R V+ VAIGCGH N+G+F+ A+GLLGLG G +S Q+ G +
Sbjct: 215 TAGDFASETLTFARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRS 274
Query: 300 FSYCLVSRGTG--------SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVG 351
FSYCLV R + S+ + G A GA++ P+ RNPR +FYYV L G VG
Sbjct: 275 FSYCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVG 334
Query: 352 GMRIP-ISEDLFRLT-QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS--G 407
G R+ +S+ RL G GV++D+GT+VTRL P YEA RDAF A L R S G
Sbjct: 335 GARVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGL-RVSPGG 393
Query: 408 VSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS 467
S+FDTCYNLSG V+VPTVS + +GG + LP N+LIPVD +GTFCFA A + G+S
Sbjct: 394 FSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVS 453
Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
IIGNIQQ+G ++ FDG VGF P C
Sbjct: 454 IIGNIQQQGFRVVFDGDAQRVGFVPKSC 481
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 326 bits (835), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 197/448 (43%), Positives = 267/448 (59%), Gaps = 28/448 (6%)
Query: 63 RHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATL 122
+ +S + ++ + + L HR+ + ++ ++ + + + + A AT
Sbjct: 41 QEQQLSLAAPRTNASTLHFRLAHREHFALNATASDLLAHLLARDAARAAALLAAPNNATR 100
Query: 123 VRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWV 182
RR G F ++SG+ QGSGEYF ++GVG+P + MV+D+GSD+VW+
Sbjct: 101 PRRRGG------------FAAPLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWL 148
Query: 183 QCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSY 240
QC PC CY QS VFDP S S++ V C + +C RL++AGC R C Y+V+YGDGS
Sbjct: 149 QCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSV 208
Query: 241 TKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA 299
T G A ETLT R V+ VAIGCGH N+G+F+ A+GLLGLG G +S Q+ G +
Sbjct: 209 TAGDFASETLTFARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRS 268
Query: 300 FSYCLVSRGTG--------SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVG 351
FSYCLV R + S+ + G A GA++ P+ RNPR +FYYV L G VG
Sbjct: 269 FSYCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVG 328
Query: 352 GMRIP-ISEDLFRLT-QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS--G 407
G R+ +S+ RL G GV++D+GT+VTRL P YEA RDAF A L R S G
Sbjct: 329 GARVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGL-RVSPGG 387
Query: 408 VSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS 467
S+FDTCYNLSG V+VPTVS + +GG + LP N+LIPVD +GTFCFA A + G+S
Sbjct: 388 FSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVS 447
Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
IIGNIQQ+G ++ FDG VGF P C
Sbjct: 448 IIGNIQQQGFRVVFDGDAQRVGFVPKSC 475
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 325 bits (833), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 197/448 (43%), Positives = 267/448 (59%), Gaps = 28/448 (6%)
Query: 63 RHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATL 122
+ +S + ++ + + L HR+ + ++ ++ + + + + A AT
Sbjct: 41 QEQQLSLAAPRTNASTLHFRLAHREHFALNATASDLLAHLLARDAARAAALLAAPNNATR 100
Query: 123 VRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWV 182
RR G F ++SG+ QGSGEYF ++GVG+P + MV+D+GSD+VW+
Sbjct: 101 PRRRGG------------FAAPLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWL 148
Query: 183 QCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSY 240
QC PC CY QS VFDP S S++ V C + +C RL++AGC R C Y+V+YGDGS
Sbjct: 149 QCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSV 208
Query: 241 TKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA 299
T G A ETLT R V+ VAIGCGH N+G+F+ A+GLLGLG G +S Q+ G +
Sbjct: 209 TAGDFASETLTFARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPTQIARSFGRS 268
Query: 300 FSYCLVSRGTG--------SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVG 351
FSYCLV R + S+ + G A GA++ P+ RNPR +FYYV L G VG
Sbjct: 269 FSYCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVG 328
Query: 352 GMRIP-ISEDLFRLT-QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS--G 407
G R+ +S+ RL G GV++D+GT+VTRL P YEA RDAF A L R S G
Sbjct: 329 GARVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGL-RVSPGG 387
Query: 408 VSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS 467
S+FDTCYNLSG V+VPTVS + +GG + LP N+LIPVD +GTFCFA A + G+S
Sbjct: 388 FSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVS 447
Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
IIGNIQQ+G ++ FDG VGF P C
Sbjct: 448 IIGNIQQQGFRVVFDGDAQRVGFVPKSC 475
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 323 bits (829), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 169/352 (48%), Positives = 230/352 (65%), Gaps = 2/352 (0%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
++SG+ GSG+YF RIGVG+P RS YMV D+GSD+ W+QC PC +CY+Q DP+F+P+ S+
Sbjct: 3 LISGIAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSS 62
Query: 205 SFSGVSCSSAVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIG 263
SF ++C+S++C +L+ GC +C Y+VSYGDGS+T G + ETL+ G V++VA+G
Sbjct: 63 SFKPLACASSICGKLKIKGCSRKNKCMYQVSYGDGSFTVGDFSTETLSFGEHAVRSVAMG 122
Query: 264 CGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP 323
CG NQG+F GAAGLLGLG G +S Q G FSYCL R + + SLVFG A+P
Sbjct: 123 CGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRESAIAASLVFGPSAVP 182
Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
A + L+ N R ++YYVGL+ + V G + I D F + G GV++D+GTA++RL
Sbjct: 183 EKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTAISRL 242
Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPAS 443
TPAY A RDAF + P A G+S+FDTCY+LS + +P V F GG + LPA
Sbjct: 243 TTPAYTALRDAFRSLV-TFPSAPGISLFDTCYDLSSMKTATLPAVVLDFDGGASMPLPAD 301
Query: 444 NFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L+ VDD GT+C AFAP SIIGN+QQ+ +IS D +G P+ C
Sbjct: 302 GILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 353
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 320 bits (819), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 159/368 (43%), Positives = 231/368 (62%), Gaps = 14/368 (3%)
Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
F + SG+ G+GEYF +GVG+P R Y+V+D+GSDI W+QC PC+ CYKQ D +F+P
Sbjct: 1 FEAPIFSGLAFGTGEYFAVVGVGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDALFNP 60
Query: 201 ADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI------GR 254
+ S+SF + CSS++C L+ GC + +C Y+ YGDGS+T G L + + + G+
Sbjct: 61 SSSSSFKVLDCSSSLCLNLDVMGCLSNKCLYQADYGDGSFTMGELVTDNVVLDDAFGPGQ 120
Query: 255 TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS-- 312
V+ N+ +GCGH N+G F AAG+LGLG G +S L T FSYCL R + +
Sbjct: 121 VVLTNIPLGCGHDNEGTFGTAAGILGLGRGPLSFPNNLDASTRNIFSYCLPDRESDPNHK 180
Query: 313 GSLVFGREALPVGAA----WVPLVRNPRAPSFYYVGLSGLGVGG-MRIPISEDLFRLTQM 367
+LVFG A+P A ++P +RNPR ++YYV ++G+ VGG + I +F+L
Sbjct: 181 STLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLDSH 240
Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
G+ G + D+GT +TRL AY A RDAF A T +L A+ IFDTCY+ +G S+ VPT
Sbjct: 241 GNGGTIFDSGTTITRLEARAYTAVRDAFRAATMHLTSAADFKIFDTCYDFTGMNSISVPT 300
Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGF 487
V+F+F G + LP SN+++PV + FCFAFA S G S+IGN+QQ+ ++ +D +
Sbjct: 301 VTFHFQGDVDMRLPPSNYIVPVSNNNIFCFAFAAS-MGPSVIGNVQQQSFRVIYDNVHKQ 359
Query: 488 VGFGPNVC 495
+G P+ C
Sbjct: 360 IGLLPDQC 367
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 156/389 (40%), Positives = 229/389 (58%), Gaps = 17/389 (4%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
RD R+ T+ + +G + + +Q G G+G Y V G G+P ++ +
Sbjct: 101 FDRDNDRLNTIWSKNNGTYSTMSNLPLQP-------GSKVGTGNYIVTAGFGTPAKNSLL 153
Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG-CHAGRCR 230
+ID+GSD+ W+QC+PCS CY Q DP+F+P S+S+ +SC S+ C L C G C
Sbjct: 154 IIDTGSDVTWIQCKPCSDCYSQVDPIFEPQQSSSYKHLSCLSSACTELTTMNHCRLGGCV 213
Query: 231 YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVG 290
YE++YGDGS ++G + ETLT+G + A GCGH N G+F G+AGLLGLG ++S
Sbjct: 214 YEINYGDGSRSQGDFSQETLTLGSDSFPSFAFGCGHTNTGLFKGSAGLLGLGRTALSFPS 273
Query: 291 QLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLG 349
Q + GG FSYCL + S+GS G+ ++P A +VPLV N PSFY+VGL+G+
Sbjct: 274 QTKSKYGGQFSYCLPDFVSSTSTGSFSVGQGSIPATATFVPLVSNSNYPSFYFVGLNGIS 333
Query: 350 VGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS 409
VGG R+ I + +G G ++D+GT +TRL AY+A + +F ++T NLP A S
Sbjct: 334 VGGERLSIPPAV-----LGRGGTIVDSGTVITRLVPQAYDALKTSFRSKTRNLPSAKPFS 388
Query: 410 IFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD-DAGTFCFAFAPSPSGLS- 467
I DTCY+LS + VR+PT++F+F + + A L + D C AFA + +S
Sbjct: 389 ILDTCYDLSSYSQVRIPTITFHFQNNADVAVSAVGILFTIQSDGSQVCLAFASASQSIST 448
Query: 468 -IIGNIQQEGIQISFDGANGFVGFGPNVC 495
IIGN QQ+ ++++FD G +GF P C
Sbjct: 449 NIIGNFQQQRMRVAFDTGAGRIGFAPGSC 477
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 298 bits (763), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 174/342 (50%), Positives = 223/342 (65%), Gaps = 17/342 (4%)
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC--HAGR 228
MV+D+GSD+VWVQC PC +CY+QS PVFDP S+S+ V C +A+C RL++ GC G
Sbjct: 1 MVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLRRGA 60
Query: 229 CRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMS 287
C Y+V+YGDGS T G ETLT G V VA+GCGH N+G+FV AAGLLGLG G +S
Sbjct: 61 CMYQVAYGDGSVTAGDFVTETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGGLS 120
Query: 288 LVGQLGGQTGGAFSYCLVSR---------GTGSSGSLVFGREALPVGAA-WVPLVRNPRA 337
Q+ + G +FSYCLV R G+ S ++ FG ++ +A + P+VRNPR
Sbjct: 121 FPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSASFTPMVRNPRM 180
Query: 338 PSFYYVGLSGLGVGGMRIP-ISEDLFRLT-QMGDDGVVMDTGTAVTRLPTPAYEAFRDAF 395
+FYYV L G+ VGG R+P ++E RL G GV++D+GT+VTRL +Y A RDAF
Sbjct: 181 ETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAF 240
Query: 396 VAQTGNLPRAS--GVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
A R S G S+FDTCY+L G V+VPTVS +F+GG LP N+LIPVD G
Sbjct: 241 RAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPVDSRG 300
Query: 454 TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
TFCFAFA + G+SIIGNIQQ+G ++ FDG VGF P C
Sbjct: 301 TFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 297 bits (761), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 163/338 (48%), Positives = 219/338 (64%), Gaps = 8/338 (2%)
Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQ---CYKQSDPVFDPADSASFSGVSCSSAVCDR 218
VG P + + V+D+GSD+ W+QC PC+ CY+Q P+FDP S+S++ VSC S C
Sbjct: 3 VGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQL 62
Query: 219 LENAGCHAGRCRYEVSYGDGSYTKGTLALETLT-IGRTVVKNVAIGCGHKNQGMFVGAAG 277
L+ AGC+ C Y+V YGDGS+T G LA ETLT + + N++IGCGH N+G+FVGA G
Sbjct: 63 LDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSNSIPNISIGCGHDNEGLFVGADG 122
Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRA 337
L+GLGGG++S+ QL +FSYCLV + S +L F + P + PLV+N R
Sbjct: 123 LIGLGGGAISISSQL---KASSFSYCLVDIDSPSFSTLDFNTDP-PSDSLISPLVKNDRF 178
Query: 338 PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVA 397
PSF YV + G+ VGG +PIS F + + G G+++D+GT +T+LP+ YE R+AF+
Sbjct: 179 PSFRYVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEVLREAFLG 238
Query: 398 QTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCF 457
T NLP A +S FDTCY+LS +V VPT++F G L LPA N LI VD AGTFC
Sbjct: 239 LTTNLPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLIQVDSAGTFCL 298
Query: 458 AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
AF + LSIIGN QQ+GI++S+D N VGF N C
Sbjct: 299 AFVSATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 289 bits (740), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 192/444 (43%), Positives = 252/444 (56%), Gaps = 36/444 (8%)
Query: 73 SSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGAD 132
++ + ++ L+HRD+ ++ N T R R+QRDV R A ++ + + G
Sbjct: 62 AASSSTLHIRLLHRDRFAA--NATPAQLLAR-------RLQRDVLRAAWIISKAAANGTP 112
Query: 133 ---AAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ 189
A + F VVS SGEY +I VG+P + +D+ SD+ W+QCQPC +
Sbjct: 113 PPVAGLSSARGFVAPVVSRAPT-SGEYIAKIAVGTPGVEALLALDTASDLTWLQCQPCRR 171
Query: 190 CYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG---CHAGRCRYEVSYGDGSYTKGTLA 246
CY QS PVFDP S S+ +S ++A C L +G G C Y V YGDGS T G
Sbjct: 172 CYPQSGPVFDPRHSTSYREMSFNAADCQALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFI 231
Query: 247 LETLTI-GRTVVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL 304
ETLT G + ++IGCGH N+G+F AAG+LGLG G MS Q+ G FSYCL
Sbjct: 232 EETLTFAGGVRLPRISIGCGHDNKGLFGAPAAGILGLGRGLMSFPNQI--DHNGTFSYCL 289
Query: 305 VS--RGTGS-SGSLVFGREAL----PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP- 356
V G GS S +L FG A+ PV ++ P V N P+FYYV L+G+ VGG+R+P
Sbjct: 290 VDFLSGPGSLSSTLTFGAGAVDTSPPV--SFTPTVLNLNMPTFYYVRLTGISVGGVRVPG 347
Query: 357 -ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS--GVS-IFD 412
DL G GV++D+GTAVTRL PAY AFRDAF A +L + S G S FD
Sbjct: 348 VTERDLQLDPYTGRGGVIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFD 407
Query: 413 TCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS-PSGLSIIGN 471
TCY + G +VPTVS +F+G + L N+LIPVD GT CFAFA + +SIIGN
Sbjct: 408 TCYTVGGRGMKKVPTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHSVSIIGN 467
Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
IQQ+G +I +D G VGF PN C
Sbjct: 468 IQQQGFRIVYD-IGGRVGFAPNSC 490
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 285 bits (729), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 141/351 (40%), Positives = 205/351 (58%), Gaps = 5/351 (1%)
Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
D + G+ G+ + V+IGVG PP+ YM+ D +D W+QCQPC +CY Q D +FD
Sbjct: 171 DLNASLNPGITTGTSNFLVQIGVGGPPQKFYMIFDLQTDFTWLQCQPCIKCYDQPDSIFD 230
Query: 200 PADSASFSGVSCSSAVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VV 257
P+ S+S++ +SC + C+ L N+ C G CRY ++Y DG+ T+G L ET++ + V
Sbjct: 231 PSQSSSYTLLSCETKHCNLLPNSSCSDDGYCRYNITYKDGTNTEGVLINETVSFESSGWV 290
Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF 317
V++GC +KNQG FVG+ G GLG GS+S ++ + SYCLV G S S +
Sbjct: 291 DRVSLGCSNKNQGPFVGSDGTFGLGRGSLSFPSRINA---SSMSYCLVESKDGYSSSTLE 347
Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
G+ L++NP+A + YYVGL G+ VGG +I + F + G+ G+++ +
Sbjct: 348 FNSPPCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMIVSSS 407
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPV 437
+ +T L Y RDAFVA+T +L R FDTCYNLS +V +P + F + G
Sbjct: 408 SLITMLENDTYNVVRDAFVAKTQHLERLKAFLQFDTCYNLSSNNTVELPILEFEVNDGKS 467
Query: 438 LTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
LP ++L VD GTFCFAFAPS SI+G +QQ G +++FD N FV
Sbjct: 468 WLLPKESYLYAVDKNGTFCFAFAPSKGSFSILGTLQQYGTRVTFDLVNSFV 518
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 281 bits (718), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 162/371 (43%), Positives = 213/371 (57%), Gaps = 20/371 (5%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
V+SG+ SGEYF I VG PP +VID+GSD++W+QC PC CY+Q P++DP S+
Sbjct: 77 VMSGVPFDSGEYFAVINVGDPPTRALVVIDTGSDLIWLQCVPCRHCYRQVTPLYDPRSSS 136
Query: 205 SFSGVSCSSAVC-DRLENAGCHA--GRCRYEVSYGDGSYTKGTLALETLTI-GRTVVKNV 260
+ + C+S C D L GC A G C Y V YGDGS + G LA + L T V NV
Sbjct: 137 THRRIPCASPRCRDVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDRLVFPDDTHVHNV 196
Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL---VSRGTGSSGSLVF 317
+GCGH N G+ AAGLLG+G G +S QL G FSYCL +SR S LVF
Sbjct: 197 TLGCGHDNVGLLESAAGLLGVGRGQLSFPTQLAPAYGHVFSYCLGDRLSRAQNGSSYLVF 256
Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP--ISEDLFRLTQMGDDGVVMD 375
GR P A+ PL NPR PS YYV + G VGG R+ + L G G+V+D
Sbjct: 257 GRTPEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPATGRGGIVVD 316
Query: 376 TGTAVTRLPTPAYEAFRDAF---VAQTGNLPR-ASGVSIFDTCYNLSG----FVSVRVPT 427
+GTA++R AY A RDAF A G + + A+ S+FD CY+L G +VRVP+
Sbjct: 317 SGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGAPAAAVRVPS 376
Query: 428 VSFYFSGGPVLTLPASNFLIPV---DDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGA 484
+ +F+GG + LP +N+LIPV D FC + GL+++GN+QQ+G + FD
Sbjct: 377 IVLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGLVFDVE 436
Query: 485 NGFVGFGPNVC 495
G +GF PN C
Sbjct: 437 RGRIGFTPNGC 447
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 279 bits (714), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 162/369 (43%), Positives = 212/369 (57%), Gaps = 18/369 (4%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
V+SG+ SGEYF IGVG PP +VID+GSD++W+QC PC +CY+Q P++DP +S
Sbjct: 81 VMSGVPFDSGEYFAVIGVGDPPTHALVVIDTGSDLIWLQCLPCRRCYRQVTPLYDPRNSK 140
Query: 205 SFSGVSCSSAVCD-RLENAGCHA--GRCRYEVSYGDGSYTKGTLALETLTI-GRTVVKNV 260
+ + C+S C L GC A G C Y V YGDGS + G LA +TL + T V NV
Sbjct: 141 THRRIPCASPQCRGVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDTLVLPDDTRVHNV 200
Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL---VSRGTGSSGSLVF 317
+GCGH N+G+ AAGLLG G G +S QL G FSYCL +SR SS LVF
Sbjct: 201 TLGCGHDNEGLLASAAGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRMSRARNSSSYLVF 260
Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP--ISEDLFRLTQMGDDGVVMD 375
GR A+ PL NPR PS YYV + G VGG R+ + L G GVV+D
Sbjct: 261 GRTPELPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALNPATGRGGVVVD 320
Query: 376 TGTAVTRLPTPAYEAFRDAFV---AQTGNLPRASGVSIFDTCYNLSGF---VSVRVPTVS 429
+GTA++R AY A RDAFV A G + S+FDTCY++ G VRVP++
Sbjct: 321 SGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDTCYDVHGNGPGTGVRVPSIV 380
Query: 430 FYFSGGPVLTLPASNFLIPV---DDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
+F+ + LP +N+LIPV D FC + GL+++GN+QQ+G + FD G
Sbjct: 381 LHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGVVFDVERG 440
Query: 487 FVGFGPNVC 495
+GF PN C
Sbjct: 441 RIGFTPNGC 449
>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
Length = 366
Score = 279 bits (714), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 143/290 (49%), Positives = 194/290 (66%), Gaps = 8/290 (2%)
Query: 71 NTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHAR-------MQRDVKRVATLV 123
T + W++E+VHRD + + Y R R ++R ++R TL
Sbjct: 66 ETKPRRSPWSVEVVHRDALLLKNAANATASYERRLKEKLRREAVRVRGLERQIERTLTLN 125
Query: 124 RRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQ 183
+ + A+ + DFG +VVSGM+QGSGEYF RIGVG+P R QYMV+D+GSD+ W+Q
Sbjct: 126 KDPVNRYENVAEVDA-DFGGEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQ 184
Query: 184 CQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKG 243
C+PC +CY Q+DP+F+P+ SASFS V C SAVC +L+ CH+G C YE SYGDGSY+ G
Sbjct: 185 CEPCRECYSQADPIFNPSYSASFSTVGCDSAVCSQLDAYDCHSGGCLYEASYGDGSYSTG 244
Query: 244 TLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC 303
+ A ETLT G T V NVAIGCGHKN G+F+GAAGLLGLG G++S Q+G QTG FSYC
Sbjct: 245 SFATETLTFGTTSVANVAIGCGHKNVGLFIGAAGLLGLGAGALSFPNQIGTQTGHTFSYC 304
Query: 304 LVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGM 353
LV R + SSG L FG +++PVG+ + PL +NP P+FYY+ ++ + + +
Sbjct: 305 LVDRESDSSGPLQFGPKSVPVGSIFTPLEKNPHLPTFYYLSVTAISISAI 354
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 278 bits (712), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 152/393 (38%), Positives = 227/393 (57%), Gaps = 21/393 (5%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
+RD R+ T+ + SG + +Q SG G+G Y V G G+P ++ +
Sbjct: 100 FERDNARLNTIRSKNSGPYTTMSNLPLQ-------SGTTVGTGNYIVTAGFGTPAKNSLL 152
Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL-----ENAGCHA 226
+ID+GSD+ W+QC+PC+ CY Q D +F+P S+S+ + C SA C L C
Sbjct: 153 IIDTGSDLTWIQCKPCADCYSQVDAIFEPKQSSSYKTLPCLSATCTELITSESNPTPCLL 212
Query: 227 GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSM 286
G C YE++YGDGS ++G + ETLT+G +N A GCGH N G+F G++GLLGLG S+
Sbjct: 213 GGCVYEINYGDGSSSQGDFSQETLTLGSDSFQNFAFGCGHTNTGLFKGSSGLLGLGQNSL 272
Query: 287 SLVGQLGGQTGGAFSYCLV-SRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGL 345
S Q + GG F+YCL + S+GS G+ ++P A + PLV N P+FY+VGL
Sbjct: 273 SFPSQSKSKYGGQFAYCLPDFGSSTSTGSFSVGKGSIPASAVFTPLVSNFMYPTFYFVGL 332
Query: 346 SGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA 405
+G+ VGG R+ I + +G ++D+GT +TRL AY A + +F ++T +LP A
Sbjct: 333 NGISVGGDRLSIPPAV-----LGRGSTIVDSGTVITRLLPQAYNALKTSFRSKTRDLPSA 387
Query: 406 SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT-FCFAFAPSPS 464
SI DTCY+LS VR+PT++F+F + + L+PV + G+ C AFA +
Sbjct: 388 KPFSILDTCYDLSRHSQVRIPTITFHFQNNADVAVSDVGILVPVQNGGSQVCLAFASASQ 447
Query: 465 --GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
G +IIGN QQ+ ++++FD G +GF C
Sbjct: 448 MDGFNIIGNFQQQRMRVAFDTGAGRIGFASGSC 480
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 275 bits (703), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 163/441 (36%), Positives = 245/441 (55%), Gaps = 36/441 (8%)
Query: 71 NTSSDEARWNLELVHR-----DKMSSSSNTTNNMHYHRHQHSF-HARMQRDVKRVATLVR 124
+TSS + +LE++HR D++S++ + + + F H+++ +++ V
Sbjct: 53 HTSSLGEQSSLEVIHRHGPCGDEVSNAPTAAEMLVKDQSRVDFIHSKIAGELESV----D 108
Query: 125 RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC 184
RL G A + SG GSG Y V +G+G+P + ++ D+GSD+ W QC
Sbjct: 109 RLRGSKATKIPAK---------SGATIGSGNYIVSVGLGTPKKYLSLIFDTGSDLTWTQC 159
Query: 185 QPCSQ-CYKQSDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAGR-CRYEVSYGD 237
QPC++ CY Q DPVF P+ S ++S +SCSS C +LE+ GC A R C Y + YGD
Sbjct: 160 QPCARYCYNQKDPVFVPSQSTTYSNISCSSPDCSQLESGTGNQPGCSAARACIYGIQYGD 219
Query: 238 GSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT 296
S++ G A ETLT+ T V++N GCG N+G+F AAGL+GLG +S+V Q +
Sbjct: 220 QSFSVGYFAKETLTLTSTDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKY 279
Query: 297 GGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP 356
G FSYCL + + S+G L FG + P+ + +FY V + G+ VGG +IP
Sbjct: 280 GQVFSYCL-PKTSSSTGYLTFGGGGGGGALKYTPITKAHGVANFYGVDIVGMKVGGTQIP 338
Query: 357 ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYN 416
IS +F + G ++D+GT +TRLP AY A + AF P+A +SI DTCY+
Sbjct: 339 ISSSVFSTS-----GAIIDSGTVITRLPPDAYSALKSAFEKGMAKYPKAPELSILDTCYD 393
Query: 417 LSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQ 474
LS + ++++P V F F GG L L + + C AFA PS ++IIGN+QQ
Sbjct: 394 LSKYSTIQIPKVGFVFKGGEELDLDGIGIMYGASTS-QVCLAFAGNQDPSTVAIIGNVQQ 452
Query: 475 EGIQISFDGANGFVGFGPNVC 495
+ +Q+ +D G +GFG N C
Sbjct: 453 KTLQVVYDVGGGKIGFGYNGC 473
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 275 bits (702), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 161/431 (37%), Positives = 232/431 (53%), Gaps = 24/431 (5%)
Query: 76 EARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHAR-MQRDVKRVATLVRRLSGGGADAA 134
+ R +LE+VH+ S + H+ H + + +D RVA++ RL+ A +
Sbjct: 72 DQRASLEVVHKHGPCS------KLRPHKANSPSHTQILAQDESRVASIQSRLAKNLAGGS 125
Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQ 193
+ S GSG Y V +G+GSP R + D+GSD+ W QC+PC CY+Q
Sbjct: 126 NLKASKATLPSKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQ 185
Query: 194 SDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAGRCRYEVSYGDGSYTKGTLALE 248
+ +FDP+ S S+S VSC S C++LE+A GC + C Y + YGDGSY+ G A E
Sbjct: 186 REHIFDPSTSLSYSNVSCDSPSCEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFARE 245
Query: 249 TLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
L++ T V N GCG N+G+F G AGLLGL +SLV Q + G FSYCL
Sbjct: 246 KLSLTSTDVFNNFQFGCGQNNRGLFGGTAGLLGLARNPLSLVSQTAQKYGKVFSYCL-PS 304
Query: 308 GTGSSGSLVFGR-EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
+ S+G L FG + + P N PSFY++ + G+ VG ++PI + +F
Sbjct: 305 SSSSTGYLSFGSGDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFSTA- 363
Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
G ++D+GT ++RLP Y + + F + PR GVSI DTCY+LS + +V+VP
Sbjct: 364 ----GTIIDSGTVISRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDLSKYKTVKVP 419
Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGA 484
+ YFSGG + L A +I V C AFA ++IIGN+QQ+ I + +D A
Sbjct: 420 KIILYFSGGAEMDL-APEGIIYVLKVSQVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDA 478
Query: 485 NGFVGFGPNVC 495
G VGF P+ C
Sbjct: 479 EGRVGFAPSGC 489
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 274 bits (700), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 154/370 (41%), Positives = 207/370 (55%), Gaps = 10/370 (2%)
Query: 136 HEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD 195
H+ + V+SG+ SGEYF +GVG+PP +VID+GSD+VW+QC+PC CY+Q
Sbjct: 79 HDDDHLHSPVISGLPFASGEYFASVGVGTPPTPALLVIDTGSDVVWLQCKPCVHCYRQLS 138
Query: 196 PVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR- 254
P++DP S++++ CS C + G C Y + YGD S T G LA + L
Sbjct: 139 PLYDPRGSSTYAQTPCSPPQCRNPQTCDGTTGGCGYRIVYGDASSTSGNLATDRLVFSND 198
Query: 255 TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL--VSRGTGSS 312
T V NV +GCGH N+G+F AAGLLG+ G+ S Q+ G F+YCL +R SS
Sbjct: 199 TSVGNVTLGCGHDNEGLFGSAAGLLGVARGNNSFATQVADSYGRYFAYCLGDRTRSGSSS 258
Query: 313 GSLVFGREAL-PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP--ISEDLFRLTQMGD 369
LVFGR A P + + PL NPR PS YYV + G VGG + + L G
Sbjct: 259 SYLVFGRTAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATGR 318
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAF---VAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
GVV+D+GT++TR AY A RDAF A+ G G+S+FD CY+L G P
Sbjct: 319 GGVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVFDACYDLRGVAVADAP 378
Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF-APSPSGLSIIGNIQQEGIQISFDGAN 485
V +F+GG + LP N+L+P + CFA A GLS+IGN+ Q+ ++ FD N
Sbjct: 379 GVVLHFAGGADVALPPENYLVPEESGRYHCFALEAAGHDGLSVIGNVLQQRFRVVFDVEN 438
Query: 486 GFVGFGPNVC 495
VGF PN C
Sbjct: 439 ERVGFEPNGC 448
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 273 bits (699), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 154/441 (34%), Positives = 235/441 (53%), Gaps = 22/441 (4%)
Query: 65 NNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVR 124
+++ S + D+ R +LE++H+ S + R Q + +D RV ++
Sbjct: 52 SSVCSPSPKGDDKRASLEVIHKHGPCSKLSQDKGRSPSRTQM-----LDQDESRVNSIRS 106
Query: 125 RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC 184
RL+ AD K + SG G+G Y V +G+G+P R + D+GSD+ W QC
Sbjct: 107 RLAKNPADGGKLKGSKVTLPSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQC 166
Query: 185 QPCSQ-CYKQSDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAGRCRYEVSYGDG 238
+PC++ CY Q +P+F+P+ S S++ +SCSS CD L++ C A C Y + YGD
Sbjct: 167 EPCARYCYHQQEPIFNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYGDQ 226
Query: 239 SYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTG 297
SY+ G A + L + T V N GCG N+G+FVG AGL+GLG ++SLV Q + G
Sbjct: 227 SYSVGFFAQDKLALTSTDVFNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLVSQTAQKYG 286
Query: 298 GAFSYCLVSRGTGSSGSLVFGREA-LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP 356
FSYCL S + S+G L FG + P + N + PSFY++ L + VGG ++
Sbjct: 287 KLFSYCLPSTSS-STGYLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLS 345
Query: 357 ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYN 416
S +F G ++D+GT ++RLP AY R +F Q P+A+ SI DTCY+
Sbjct: 346 TSASVFSTA-----GTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASILDTCYD 400
Query: 417 LSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQ 474
S + +V VP ++ YFS G + L S + + C AFA + ++I+GN+QQ
Sbjct: 401 FSQYDTVDVPKINLYFSDGAEMDLDPSGIFY-ILNISQVCLAFAGNSDATDIAILGNVQQ 459
Query: 475 EGIQISFDGANGFVGFGPNVC 495
+ + +D A G +GF P C
Sbjct: 460 KTFDVVYDVAGGRIGFAPGGC 480
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 273 bits (699), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 165/452 (36%), Positives = 236/452 (52%), Gaps = 47/452 (10%)
Query: 58 NELFERHNNISSSNTSSDEARWNLELVHRD-------KMSSSSNTTNNMHYHRHQHSFHA 110
N+ F+ N++S LE+VHR ++N +NM
Sbjct: 54 NQTFKVSNSLS------------LEVVHRSGPCIQVLNQEKAANAPSNMEI--------- 92
Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
+ +D RV ++ RLS G K V SG GSG+Y V +G+G+P +
Sbjct: 93 -LLQDRHRVDSIHARLSSHGVFQEKQAT----LPVQSGASIGSGDYAVTVGLGTPKKEFT 147
Query: 171 MVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG---CHA 226
++ D+GSD+ W QC+PC++ CYKQ +P DP S S+ +SCSSA C L+ G C +
Sbjct: 148 LIFDTGSDLTWTQCEPCAKTCYKQKEPRLDPTKSTSYKNISCSSAFCKLLDTEGGESCSS 207
Query: 227 GRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGS 285
C Y+V YGDGSY+ G A ETLT+ + V KN GCG +N G+F GAAGLLGLG
Sbjct: 208 PTCLYQVQYGDGSYSIGFFATETLTLSSSNVFKNFLFGCGQQNSGLFRGAAGLLGLGRTK 267
Query: 286 MSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGL 345
+SL Q + FSYCL + + S G L FG + + PL + ++ FY + +
Sbjct: 268 LSLPSQTAQKYKKLFSYCLPA-SSSSKGYLSFGGQVSKT-VKFTPLSEDFKSTPFYGLDI 325
Query: 346 SGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA 405
+ L VGG ++ I +F + G V+D+GT +TRLP+ AY A AF + P
Sbjct: 326 TELSVGGNKLSIDASIFSTS-----GTVIDSGTVITRLPSTAYSALSSAFQKLMTDYPST 380
Query: 406 SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG 465
G SIFDTCY+ S ++++P V F GG + + S L PV+ C AFA +
Sbjct: 381 DGYSIFDTCYDFSKNETIKIPKVGVSFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNGDD 440
Query: 466 L--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ +I GN QQ+ Q+ +D A G VGF P+ C
Sbjct: 441 VKAAIFGNTQQKTYQVVYDDAKGRVGFAPSGC 472
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 174/434 (40%), Positives = 231/434 (53%), Gaps = 41/434 (9%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+ L H D + N + R H RM R V R AT V+ ++GGG
Sbjct: 40 LRVRLTHVD---AHGNYSRLQLLQRAARRSHHRMSRLVAR-ATGVKAVAGGG-------- 87
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
D+ + G+GE+ + + +G+P S ++D+GSD+VW QC+PC C+KQS PVF
Sbjct: 88 -----DLQVPVHAGNGEFLMDVAIGTPALSYAAIVDTGSDLVWTQCKPCVDCFKQSTPVF 142
Query: 199 DPADSASFSGVSCSSAVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV 257
DP+ S++++ V CSSA+C L + C A +C Y +YGD S T+G LA ET T+G+
Sbjct: 143 DPSSSSTYATVPCSSALCSDLPTSTCTSASKCGYTYTYGDASSTQGVLASETFTLGKEKK 202
Query: 258 K--NVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--RGTGSS 312
K VA GCG N+G F AGL+GLG G +SLV QLG FSYCL S G G S
Sbjct: 203 KLPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLD---KFSYCLTSLDDGDGKS 259
Query: 313 GSLVFG--------REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL 364
L+ G PV PLV+NP PSFYYV L+GL VG RI + F +
Sbjct: 260 PLLLGGSAAAISESAATAPV--QTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAI 317
Query: 365 TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYN--LSGFV 421
G GV++D+GT++T L Y A + AFVAQ LP G I D C+ G
Sbjct: 318 QDDGTGGVIVDSGTSITYLELQGYRALKKAFVAQMA-LPTVDGSEIGLDLCFQGPAKGVD 376
Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISF 481
V+VP + +F GG L LPA N+++ +G C APS GLSIIGN QQ+ Q +
Sbjct: 377 EVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVAPS-RGLSIIGNFQQQNFQFVY 435
Query: 482 DGANGFVGFGPNVC 495
D A + F P C
Sbjct: 436 DVAGDTLSFAPVQC 449
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 271 bits (694), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 158/397 (39%), Positives = 214/397 (53%), Gaps = 17/397 (4%)
Query: 109 HARMQR-DVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPR 167
H + R D RV ++ +LS A E + G GSG Y V +G+G+P
Sbjct: 56 HVEILRLDQARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKN 115
Query: 168 SQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSASFSGVSCSSAVCDRLE----NA 222
++ D+GSD+ W QCQPC + CY Q +P+F+P+ S S+ VSCSSA C L NA
Sbjct: 116 DLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNA 175
Query: 223 G-CHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLG 280
G C A C Y + YGD S++ G LA E T+ + V V GCG NQG+F G AGLLG
Sbjct: 176 GSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLG 235
Query: 281 LGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSF 340
LG +S Q FSYCL S + +G L FG + + P+ SF
Sbjct: 236 LGRDKLSFPSQTATAYNKIFSYCLPSSAS-YTGHLTFGSAGISRSVKFTPISTITDGTSF 294
Query: 341 YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG 400
Y + + + VGG ++PI +F G ++D+GT +TRLP AY A R +F A+
Sbjct: 295 YGLNIVAITVGGQKLPIPSTVFS-----TPGALIDSGTVITRLPPKAYAALRSSFKAKMS 349
Query: 401 NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA 460
P SGVSI DTC++LSGF +V +P V+F FSGG V+ L S + V C AFA
Sbjct: 350 KYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVEL-GSKGIFYVFKISQVCLAFA 408
Query: 461 --PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
S +I GN+QQ+ +++ +DGA G VGF PN C
Sbjct: 409 GNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGC 445
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 271 bits (693), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 162/428 (37%), Positives = 225/428 (52%), Gaps = 23/428 (5%)
Query: 78 RWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQR-DVKRVATLVRRLSGGGADAAKH 136
+ +L + HR T + ++ + H + R D RV ++ +LS A
Sbjct: 59 KSSLHVTHRH------GTCSRLNNGKATSPDHVEILRLDQARVNSIHSKLSKKLATDHVS 112
Query: 137 EVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSD 195
E + G GSG Y V +G+G+P ++ D+GSD+ W QCQPC + CY Q +
Sbjct: 113 ESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKE 172
Query: 196 PVFDPADSASFSGVSCSSAVCDRLE----NAG-CHAGRCRYEVSYGDGSYTKGTLALETL 250
P+F+P+ S S+ VSCSSA C L NAG C A C Y + YGD S++ G LA E
Sbjct: 173 PIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKF 232
Query: 251 TIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT 309
T+ + V V GCG NQG+F G AGLLGLG +S Q FSYCL S +
Sbjct: 233 TLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPSSAS 292
Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
+G L FG + + P+ SFY + + + VGG ++PI +F
Sbjct: 293 -YTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFST----- 346
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVS 429
G ++D+GT +TRLP AY A R +F A+ P SGVSI DTC++LSGF +V +P V+
Sbjct: 347 PGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVA 406
Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGF 487
F FSGG V+ L S + V C AFA S +I GN+QQ+ +++ +DGA G
Sbjct: 407 FSFSGGAVVEL-GSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGR 465
Query: 488 VGFGPNVC 495
VGF PN C
Sbjct: 466 VGFAPNGC 473
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 271 bits (692), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 160/423 (37%), Positives = 230/423 (54%), Gaps = 21/423 (4%)
Query: 72 TSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGG-G 130
T + + +LE+VH+ S N + HS + +D +RV + RLS G
Sbjct: 63 TKGPKTKASLEVVHKHGPCSQLNDHDGKAKSTTPHS--DILNQDKERVKYINSRLSKNLG 120
Query: 131 ADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ- 189
D++ E+ SG GSG YFV +G+G+P R ++ D+GSD+ W QC+PC++
Sbjct: 121 QDSSVEELDSATLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARS 180
Query: 190 CYKQSDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAGR--CRYEVSYGDGSYTK 242
CYKQ D +FDP+ S S+S ++C+SA+C +L A GC A C Y + YGD S++
Sbjct: 181 CYKQQDVIFDPSKSTSYSNITCTSALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSV 240
Query: 243 GTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFS 301
G + E LT+ T VV N GCG NQG+F G+AGL+GLG +S V Q + FS
Sbjct: 241 GYFSRERLTVTATDVVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAKYRKIFS 300
Query: 302 YCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
YCL S + S+G L FG A + P R SFY + ++ + VGG+++P+S
Sbjct: 301 YCLPSTSS-STGHLSFGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSST 359
Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV 421
F G ++D+GT +TRLP AY A R AF P A +SI DTCY+LSG+
Sbjct: 360 F-----STGGAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSILDTCYDLSGYK 414
Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQI 479
+PT+ F F+GG + LP L V C AFA + S ++I GN+QQ I++
Sbjct: 415 VFSIPTIEFSFAGGVTVKLPPQGILF-VASTKQVCLAFAANGDDSDVTIYGNVQQRTIEV 473
Query: 480 SFD 482
+D
Sbjct: 474 VYD 476
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 147/356 (41%), Positives = 205/356 (57%), Gaps = 13/356 (3%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADS 203
SG +G Y V +G+G+P +V D+GSD WVQC+PC +CYKQ +P+FDPA S
Sbjct: 152 ATSGRAVSTGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEPLFDPAKS 211
Query: 204 ASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIG 263
++++ VSC+ + C L+ GC G C Y V YGDGSYT G A +TLTI +K G
Sbjct: 212 STYANVSCTDSACADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHDAIKGFRFG 271
Query: 264 CGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP 323
CG KN G+F AGL+GLG G SL Q + GGAF+YCL + TG +G L FG +
Sbjct: 272 CGEKNNGLFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTG-TGYLDFGPGSAG 330
Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
A P++ + + +FYYVG++G+ VGG ++P++E +F G ++D+GT +TRL
Sbjct: 331 NNARLTPMLTD-KGQTFYYVGMTGIRVGGQQVPVAESVFSTA-----GTLVDSGTVITRL 384
Query: 384 PTPAYEAFRDAF--VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
P AY A AF V +A G SI DTCY+ +G V +PTVS F GG L +
Sbjct: 385 PATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVD 444
Query: 442 ASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
S + + +A C AFA + ++I+GN QQ+ + +D VGF P C
Sbjct: 445 VSGIVYAISEA-QVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 269 bits (687), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 155/399 (38%), Positives = 221/399 (55%), Gaps = 23/399 (5%)
Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
R++R V R + RL+ AA V G V + + G+GE+ +++ +GSPPRS
Sbjct: 324 RLRRGVARGKNRLHRLNAMVLAAANATV---GDQVKAPVVAGNGEFLMKLAIGSPPRSFS 380
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
++D+GSD++W QC+PC QC+ QS P+FDP S+SF +SCSS +C L + C + C
Sbjct: 381 AIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPTSTCSSDGCE 440
Query: 231 YEVSYGDGSYTKGTLALETLTIGRTVVKNVAI-----GCGHKNQGM-FVGAAGLLGLGGG 284
Y +YGD S T+G LA ET T G + ++I GCG+ N G F AGL+GLG G
Sbjct: 441 YLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRG 500
Query: 285 SMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA--LPVGA----AWVPLVRNPRAP 338
+SLV QL Q F+YCL + SL+ G A P + PL++NP P
Sbjct: 501 PLSLVSQLKEQ---KFAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQP 557
Query: 339 SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ 398
SFYY+ L G+ VGG ++ I + F L G GV++D+GT +T + A+ + ++ F+AQ
Sbjct: 558 SFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQ 617
Query: 399 TGNLP-RASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFC 456
NLP SG D C+NL +G V VP ++F+F G L LP N++I AG C
Sbjct: 618 M-NLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFKGAD-LELPGENYMIGDSKAGLLC 675
Query: 457 FAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
A S G+SI GN+QQ+ + D + F P C
Sbjct: 676 LAIG-SSRGMSIFGNLQQQNFMVVHDLQEETLSFLPTQC 713
>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
Length = 456
Score = 269 bits (687), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 179/444 (40%), Positives = 248/444 (55%), Gaps = 39/444 (8%)
Query: 63 RHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATL 122
+ +S + ++ + + L HR+ + ++ ++ + + + + A AT
Sbjct: 41 QEQQLSLAAPRTNASTLHFRLAHREHFALNATASDLLAHLLARDAARAAALLAAPNNATR 100
Query: 123 VRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWV 182
RR G F ++SG+ QG+GEYF ++GVG+P + MV+D+GSD+VW
Sbjct: 101 PRRRGG------------FAAPLLSGLPQGTGEYFAQVGVGTPATTALMVLDTGSDVVWA 148
Query: 183 QCQ---PCSQCYKQ-SDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYG 236
+ P + +Q S PA + ++ C + +C RL++AGC R C Y+V+YG
Sbjct: 149 PVRALPPLLRAVRQGSSTGAAPAPTPRWN---CVAPICRRLDSAGCDRRRNSCLYQVAYG 205
Query: 237 DGSYTKGTLALETLTIGRTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQ 295
DGS T G A ETLT R V+ VAIGCGH N+G+F+ A+GLLGLG G +S Q+
Sbjct: 206 DGSVTAGDFASETLTFARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARS 265
Query: 296 TGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
G +FSYCLV R + W PR +FYYV L G VGG R+
Sbjct: 266 FGRSFSYCLVDRTSSRRARPS---------RRWG---GTPRMATFYYVHLLGFSVGGARV 313
Query: 356 P-ISEDLFRLT-QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS--GVSIF 411
+S+ RL G GV++D+GT+VTRL P YEA RDAF A L R S G S+F
Sbjct: 314 KGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGL-RVSPGGFSLF 372
Query: 412 DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGN 471
DTCYNLSG V+VPTVS + +GG + LP N+LIPVD +GTFCFA A + G+SIIGN
Sbjct: 373 DTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGN 432
Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
IQQ+G ++ FDG VGF P C
Sbjct: 433 IQQQGFRVVFDGDAQRVGFVPKSC 456
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 269 bits (687), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 166/425 (39%), Positives = 230/425 (54%), Gaps = 33/425 (7%)
Query: 90 SSSSNTTNNMHYHRH-----------QHSFHARMQRDVKRVATLVRRLSGG-GADAAKHE 137
S+S T +H HRH S R+QRD R A + R+ SG G D + +
Sbjct: 56 STSGGITVPLH-HRHGPCSPVPSNKMPASLEERLQRDQLRAAYIKRKFSGAKGGDVEQSD 114
Query: 138 VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
T + G + EY + +G+GSP +Q M +D+GSD+ WVQC+PCSQC+ + D +
Sbjct: 115 AATVPTTL--GTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSL 172
Query: 198 FDPADSASFSGVSCSSAVCDRLENA----GCHAGRCRYEVSYGDGSYTKGTLALETLTIG 253
FDP+ S+++S SCSSA C +L + GC + +C+Y VSY DGS T GT + +TLT+G
Sbjct: 173 FDPSASSTYSPFSCSSAACVQLSQSQQGNGCSSSQCQYIVSYVDGSSTTGTYSSDTLTLG 232
Query: 254 RTVVKNVAIGCGHKNQGMFVGAA-GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS 312
+K GC G F GL+GLGG + SLV Q G G AFSYCL GSS
Sbjct: 233 SNAIKGFQFGCSQSESGGFSDQTDGLMGLGGDAQSLVSQTAGTFGKAFSYCLPPT-PGSS 291
Query: 313 GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
G L G A G P++R+ + P++Y V L + VGG ++ I +F G
Sbjct: 292 GFLTLG-AASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVFSA------GS 344
Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYF 432
VMD+GT +TRLP AY A AF A P A I DTC++ SG SV +P+V+ F
Sbjct: 345 VMDSGTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGILDTCFDFSGQSSVSIPSVALVF 404
Query: 433 SGGPVLTLPASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGF 490
SGG V+ L + ++ +D+ +C AFA + S L IGN+QQ ++ +D G VGF
Sbjct: 405 SGGAVVNLDFNGIMLELDN---WCLAFAANSDDSSLGFIGNVQQRTFEVLYDVGGGAVGF 461
Query: 491 GPNVC 495
C
Sbjct: 462 RAGAC 466
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 268 bits (686), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 164/459 (35%), Positives = 235/459 (51%), Gaps = 46/459 (10%)
Query: 61 FERHNNISSSNTSSDEARWN-----LELVHRDKMSSSSNTTNN-MHYHRHQHSFHAR-MQ 113
F+ +SS S E RW LE+ H+D S N + H F R +Q
Sbjct: 43 FQWKQGSNSSTCLSQETRWENGATILEMKHKDSCSGKILDWNKKLKKHLIMDDFQLRSLQ 102
Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
+K + +SG D + D + SG+ + Y V + +G R +++
Sbjct: 103 SRMKSI------ISGRNID----DSVDAPIPLTSGIRLQTLNYIVTVELGG--RKMTVIV 150
Query: 174 DSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA-------GCHA 226
D+GSD+ WVQCQPC +CY Q DPVF+P+ S S+ V CSS C L++A G +
Sbjct: 151 DTGSDLSWVQCQPCKRCYNQQDPVFNPSTSPSYRTVLCSSPTCQSLQSATGNLGVCGSNP 210
Query: 227 GRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGS 285
C Y V+YGDGSYT+G L E L +G T V N GCG NQG+F GA+GL+GLG S
Sbjct: 211 PSCNYVVNYGDGSYTRGELGTEHLDLGNSTAVNNFIFGCGRNNQGLFGGASGLVGLGRSS 270
Query: 286 MSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG------REALPVGAAWVPLVRNPRAPS 339
+SL+ Q GG FSYCL T +SGSLV G + P+ ++ ++ NP+ P
Sbjct: 271 LSLISQTSAMFGGVFSYCLPITETEASGSLVMGGNSSVYKNTTPI--SYTRMIPNPQLP- 327
Query: 340 FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQT 399
FY++ L+G+ VG + + + G DG+++D+GT +TRLP Y+A +D FV Q
Sbjct: 328 FYFLNLTGITVGSVAV-------QAPSFGKDGMMIDSGTVITRLPPSIYQALKDEFVKQF 380
Query: 400 GNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN-FLIPVDDAGTFCFA 458
P A I DTC+NLSG+ V +P + +F G L + + F DA C A
Sbjct: 381 SGFPSAPAFMILDTCFNLSGYQEVEIPNIKMHFEGNAELNVDVTGVFYFVKTDASQVCLA 440
Query: 459 FA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
A + + IIGN QQ+ ++ +D +GF C
Sbjct: 441 IASLSYENEVGIIGNYQQKNQRVIYDTKGSMLGFAAEAC 479
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 268 bits (685), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 156/408 (38%), Positives = 223/408 (54%), Gaps = 23/408 (5%)
Query: 102 HRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIG 161
H + R++R V R + RL+ AA V G V + + G+GE+ +++
Sbjct: 60 HVKNLTRFERLRRGVARGKNRLHRLNAMVLAAANATV---GDQVKAPVVAGNGEFLMKLA 116
Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
+GSPPRS ++D+GSD++W QC+PC QC+ QS P+FDP S+SF +SCSS +C L
Sbjct: 117 IGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPT 176
Query: 222 AGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAI-----GCGHKNQGM-FVGA 275
+ C + C Y +YGD S T+G LA ET T G + ++I GCG+ N G F
Sbjct: 177 STCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQG 236
Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA--LPVGA----AWV 329
AGL+GLG G +SLV QL Q F+YCL + SL+ G A P +
Sbjct: 237 AGLVGLGRGPLSLVSQLKEQ---KFAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKTT 293
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
PL++NP PSFYY+ L G+ VGG ++ I + F L G GV++D+GT +T + A+
Sbjct: 294 PLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFT 353
Query: 390 AFRDAFVAQTGNLP-RASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
+ ++ F+AQ NLP SG D C+NL +G V VP ++F+F G L LP N++I
Sbjct: 354 SLKNEFIAQM-NLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFKGAD-LELPGENYMI 411
Query: 448 PVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
AG C A S G+SI GN+QQ+ + D + F P C
Sbjct: 412 GDSKAGLLCLAIG-SSRGMSIFGNLQQQNFMVVHDLQEETLSFLPTQC 458
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 268 bits (684), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 166/404 (41%), Positives = 225/404 (55%), Gaps = 27/404 (6%)
Query: 116 VKRVATLVRRLSGGGADAAKHE-----VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
KR + L +RL+ ADAA++ + V SG+ SGEYF +GVG+P
Sbjct: 44 AKRGSLLRQRLA---ADAARYASLVDATGRLHSPVFSGIPFESGEYFALVGVGTPSTKAM 100
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA---- 226
+VID+GSD+VW+QC PC +CY Q VFDP S+++ V CSS C L GC +
Sbjct: 101 LVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQCRALRFPGCDSGGAA 160
Query: 227 -GRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGG 284
G CRY V+YGDGS + G LA + L T V NV +GCG N+G+F AAGLLG+G G
Sbjct: 161 GGGCRYMVAYGDGSSSTGDLATDKLAFANDTYVNNVTLGCGRDNEGLFDSAAGLLGVGRG 220
Query: 285 SMSLVGQLGGQTGGAFSYCLVSRGTGSSGS--LVFGREALPVGAAWVPLVRNPRAPSFYY 342
+S+ Q+ G F YCL R + S+ S LVFGR P A+ L+ NPR PS YY
Sbjct: 221 KISISTQVAPAYGSVFEYCLGDRTSRSTRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYY 280
Query: 343 VGLSGLGVGGMRIP--ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG 400
V ++G VGG R+ + L T G GVV+D+GTA++R AY A RDAF A+
Sbjct: 281 VDMAGFSVGGERVTGFSNASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARAR 340
Query: 401 NLPRASGV---SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD----DAG 453
S+FD CY+L G + P + +F+GG + LP N+ +PVD A
Sbjct: 341 AAGMRRLAGEHSVFDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAA 400
Query: 454 TF--CFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
++ C F + GLS+IGN+QQ+G ++ FD +GF P C
Sbjct: 401 SYRRCLGFEAADDGLSVIGNVQQQGFRVVFDVEKERIGFAPKGC 444
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 267 bits (683), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 147/356 (41%), Positives = 204/356 (57%), Gaps = 13/356 (3%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADS 203
SG +G Y V +G+G+P +V D+GSD WVQC+PC +CYKQ P+FDPA S
Sbjct: 152 ATSGRAVSTGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKGPLFDPAKS 211
Query: 204 ASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIG 263
++++ VSC+ + C L+ GC G C Y V YGDGSYT G A +TLTI +K G
Sbjct: 212 STYANVSCTDSACADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHDAIKGFRFG 271
Query: 264 CGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP 323
CG KN G+F AGL+GLG G SL Q + GGAF+YCL + TG +G L FG +
Sbjct: 272 CGEKNNGLFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTG-TGYLDFGPGSAG 330
Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
A P++ + + +FYYVG++G+ VGG ++P++E +F G ++D+GT +TRL
Sbjct: 331 NNARLTPMLTD-KGQTFYYVGMTGIRVGGQQVPVAESVFSTA-----GTLVDSGTVITRL 384
Query: 384 PTPAYEAFRDAF--VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
P AY A AF V +A G SI DTCY+ +G V +PTVS F GG L +
Sbjct: 385 PATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVD 444
Query: 442 ASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
S + + +A C AFA + ++I+GN QQ+ + +D VGF P C
Sbjct: 445 VSGIVYAISEA-QVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 267 bits (682), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 161/432 (37%), Positives = 226/432 (52%), Gaps = 34/432 (7%)
Query: 89 MSSSSNTTNNMHYHRHQHSFHARMQR--------------DVKRVATLVRRLSGGGADAA 134
+S ++TT + + H+H +R+ D RV ++ +LS
Sbjct: 52 LSPRASTTKSSLHVTHRHGTCSRLNNGKATSPDHVEILRLDQARVNSIHSKLSK--KLTT 109
Query: 135 KHEVQDFGTDVVS--GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CY 191
H Q TD+ + G GSG Y V +G+G+P ++ D+GSD+ W QCQPC + CY
Sbjct: 110 NHVSQSQSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCY 169
Query: 192 KQSDPVFDPADSASFSGVSCSSAVCDRLE----NAG-CHAGRCRYEVSYGDGSYTKGTLA 246
Q +P+F+P+ S S+ VSCSSA C L NAG C A C Y + YGD S++ G LA
Sbjct: 170 DQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLA 229
Query: 247 LETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
+ T+ + V V GCG NQG+F G AGLLGLG +S Q FSYCL
Sbjct: 230 KDKFTLTSSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLP 289
Query: 306 SRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
S + +G L FG + + P+ SFY + + + VGG ++PI +F
Sbjct: 290 SSAS-YTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFS-- 346
Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV 425
G ++D+GT +TRLP AY A R +F A+ P SGVSI DTC++LSGF +V +
Sbjct: 347 ---TPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTI 403
Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDG 483
P V+F FSGG V+ L + C AFA S +I GN+QQ+ +++ +DG
Sbjct: 404 PKVAFSFSGGAVVELGSKGIFYAF-KISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDG 462
Query: 484 ANGFVGFGPNVC 495
A G VGF PN C
Sbjct: 463 AGGRVGFAPNGC 474
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 165/404 (40%), Positives = 224/404 (55%), Gaps = 27/404 (6%)
Query: 116 VKRVATLVRRLSGGGADAAKHE-----VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
KR + L +RL+ ADAA++ + V SG+ SGEYF +GVG+P
Sbjct: 44 AKRGSLLRQRLA---ADAARYASLVDATGRLHSPVFSGIPFESGEYFALVGVGTPSTKAM 100
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA---- 226
+VID+GSD+VW+QC PC +CY Q VFDP S+++ V CSS C L GC +
Sbjct: 101 LVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQCRALRFPGCDSGGAA 160
Query: 227 -GRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGG 284
G CRY V+YGDGS + G LA + L T V NV +GCG N+G+F AAGLLG+ G
Sbjct: 161 GGGCRYMVAYGDGSSSTGELATDKLAFANDTYVNNVTLGCGRDNEGLFDSAAGLLGVARG 220
Query: 285 SMSLVGQLGGQTGGAFSYCLVSRGTGSSGS--LVFGREALPVGAAWVPLVRNPRAPSFYY 342
+S+ Q+ G F YCL R + S+ S LVFGR P A+ L+ NPR PS YY
Sbjct: 221 KISISTQVAPAYGSVFEYCLGDRTSRSTRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYY 280
Query: 343 VGLSGLGVGGMRIP--ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG 400
V ++G VGG R+ + L T G GVV+D+GTA++R AY A RDAF A+
Sbjct: 281 VDMAGFSVGGERVTGFSNASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARAR 340
Query: 401 NLPRASGV---SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD----DAG 453
S+FD CY+L G + P + +F+GG + LP N+ +PVD A
Sbjct: 341 AAGMRRLAGEHSVFDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAA 400
Query: 454 TF--CFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
++ C F + GLS+IGN+QQ+G ++ FD +GF P C
Sbjct: 401 SYRRCLGFEAADDGLSVIGNVQQQGFRVVFDVEKERIGFAPKGC 444
>gi|147866052|emb|CAN80962.1| hypothetical protein VITISV_022007 [Vitis vinifera]
Length = 150
Score = 265 bits (678), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 125/147 (85%), Positives = 141/147 (95%)
Query: 349 GVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV 408
GVGG+R+PISE++FRLT++GD GVVMDTGTAVTRLPT AY+AFRDAF+AQT NLPRA+GV
Sbjct: 4 GVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGV 63
Query: 409 SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSI 468
+IFDTCY+L GFVSVRVPTVSFYFSGGP+LTLPA NFLIP+DDAGTFCFAFAPS SGLSI
Sbjct: 64 AIFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSI 123
Query: 469 IGNIQQEGIQISFDGANGFVGFGPNVC 495
+GNIQQEGIQISFDGANG+VGFGPN+C
Sbjct: 124 LGNIQQEGIQISFDGANGYVGFGPNIC 150
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 265 bits (676), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 154/419 (36%), Positives = 228/419 (54%), Gaps = 22/419 (5%)
Query: 76 EARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGG-GADAA 134
+ + +LE+VH+ S N + + HS + +D +RV + R+S G D++
Sbjct: 66 KRKASLEVVHKHGPCSQLNNHDGKAKSKTPHS--EILNQDKERVKYINSRISKNLGQDSS 123
Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQ 193
E+ SG GSG YFV +G+G+P R ++ D+GSD+ W QC+PC++ CYKQ
Sbjct: 124 VSELDSVTLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQ 183
Query: 194 SDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAGR--CRYEVSYGDGSYTKGTLA 246
D +FDP+ S S+S ++C+S +C +L A GC A C Y + YGD S++ G +
Sbjct: 184 QDAIFDPSKSTSYSNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYFS 243
Query: 247 LETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
E L++ T +V N GCG NQG+F G+AGL+GLG +S V Q FSYCL
Sbjct: 244 RERLSVTATDIVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAVYRKIFSYCLP 303
Query: 306 SRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
+ + S+G L FG + P R SFY + ++G+ VGG ++P+S F
Sbjct: 304 ATSS-STGRLSFGTTTTSY-VKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTF--- 358
Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV 425
G ++D+GT +TRLP AY A R AF P A +SI DTCY+LSG+ +
Sbjct: 359 --STGGAIIDSGTVITRLPPTAYTALRSAFRQGMSKYPSAGELSILDTCYDLSGYEVFSI 416
Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFD 482
P + F F+GG + LP L V A C AFA + S ++I GN+QQ+ I++ +D
Sbjct: 417 PKIDFSFAGGVTVQLPPQGILY-VASAKQVCLAFAANGDDSDVTIYGNVQQKTIEVVYD 474
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 265 bits (676), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 167/425 (39%), Positives = 237/425 (55%), Gaps = 30/425 (7%)
Query: 90 SSSSNTTNNMHYHRH----------QHSFHARMQRDVKRVATLVRRLSGGGADAAKH--- 136
SS+ T +H HRH + R+ RD R A + R+ SGGG + ++
Sbjct: 53 SSTGAATVPLH-HRHGPCSPLPTKKMPTLEERLHRDQLRAAYIQRKFSGGGVNGSRGGAG 111
Query: 137 EVQDFGTDVVS--GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQS 194
+VQ V + G + EY + + +GSP +SQ M+ID+GSD+ WVQC+PCSQC+ Q+
Sbjct: 112 DVQQSHATVPTTLGTSLDTLEYLITVRLGSPGKSQTMLIDTGSDVSWVQCKPCSQCHSQA 171
Query: 195 DPVFDPADSASFSGVSCSSAVCDRL--ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI 252
DP+FDP+ S+++S SCSSA C +L E GC + +C+Y V+YGDGS T GT + +TL +
Sbjct: 172 DPLFDPSSSSTYSPFSCSSAACAQLGQEGNGCSSSQCQYTVTYGDGSSTTGTYSSDTLAL 231
Query: 253 GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS 312
G V+ GC + G GL+GLGGG+ SLV Q G G AFSYCL + + SS
Sbjct: 232 GSNAVRKFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTFGAAFSYCLPAT-SSSS 290
Query: 313 GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
G L G A G P++R+ + P+FY V + + VGG ++ I +F G
Sbjct: 291 GFLTLG--AGTSGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVF------SAGT 342
Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYF 432
+MD+GT +TRLP AY A AF A P A I DTC++ SG SV +PTV+ F
Sbjct: 343 IMDSGTVLTRLPPTAYSALSSAFKAGMKQYPSAPPSGILDTCFDFSGQSSVSIPTVALVF 402
Query: 433 SGGPVLTLPASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGF 490
SGG V+ + + ++ ++ C AFA + S L IIGN+QQ ++ +D G VGF
Sbjct: 403 SGGAVVDIASDGIMLQTSNS-ILCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGGGAVGF 461
Query: 491 GPNVC 495
C
Sbjct: 462 KAGAC 466
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 264 bits (675), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 182/428 (42%), Positives = 248/428 (57%), Gaps = 43/428 (10%)
Query: 80 NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
+ LVHRD + +++ + + R+QRD++R A ++ + AA
Sbjct: 67 QVRLVHRDSFAVNASAADLLA---------RRLQRDMRRAAWIITK-------AATPADP 110
Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQ-----YMVIDSGSDIVWVQCQPCSQCYKQS 194
+ GT VV+G SGEY +I VG+P + + D GSD+ W+QC PC +CY Q
Sbjct: 111 ENGT-VVTGAPT-SGEYIAKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPCFRCYHQP 168
Query: 195 DPVFDPADSASFSGVSCSSAVCDRL-ENAGC--HAGRCRYEVSYGDGSYTKGTLALETLT 251
PV++ S+S S V C + C L + GC C+Y+V YGDGS + G +ETLT
Sbjct: 169 GPVYNRLKSSSASDVGCYAPACRALGSSGGCVQFLNECQYKVEYGDGSSSAGDFGVETLT 228
Query: 252 IGRTV-VKNVAIGCGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT 309
V V VAIGCG NQG+F AAG+LGLG GS+S Q+ G+ G +FSYCL +GT
Sbjct: 229 FPPGVRVPGVAIGCGSDNQGLFPAPAAGILGLGRGSLSFPSQIAGRYGRSFSYCLAGQGT 288
Query: 310 -GSSGSLVFGREA-----LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLF 362
G S +L FG A ++ P++ N R +FYYVGL G+ VGG+R+ ++E
Sbjct: 289 GGRSSTLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTESDL 348
Query: 363 RL-TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF-VAQTGNL--PRASG-VSIFDTCY-N 416
RL G GV++D+GTAVTRL PAY AFRDAF VA L P G + FDTCY +
Sbjct: 349 RLDPSTGHGGVIVDSGTAVTRLSGPAYAAFRDAFRVAAVKELGWPSPGGPFAFFDTCYSS 408
Query: 417 LSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD-DAGTFCFAFAPS-PSGLSIIGNIQQ 474
+ G V +VP VS +F+GG + LP N+LIPVD + GT CFAFA S G+SIIGNIQ
Sbjct: 409 VRGRVMKKVPAVSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAFAGSGDRGVSIIGNIQL 468
Query: 475 EGIQISFD 482
+G ++ +D
Sbjct: 469 QGFRVVYD 476
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 263 bits (672), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 167/423 (39%), Positives = 231/423 (54%), Gaps = 30/423 (7%)
Query: 90 SSSSNTTNNMHYHRH----------QHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
SSS+ +HRH + + RD R A + R+ SGGG +
Sbjct: 52 SSSAGAATVPLHHRHGPCSPLPTKKMPTLEETLHRDQLRAAYIQRKFSGGGGAGGDVQRS 111
Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
D G + EY + +G+GSP SQ M+ID+GSD+ WVQC+PCSQC+ Q+DP+FD
Sbjct: 112 DATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFD 171
Query: 200 PADSASFSGVSCSSAVCDRL--ENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV 256
P+ S+++S SC SA C +L E GC + +C+Y V+YGDGS T GT + +TL +G +
Sbjct: 172 PSSSSTYSPFSCGSAACAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSA 231
Query: 257 VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLV 316
VK+ GC + G GL+GLGGG+ SLV Q G G AFSYCL + SSG L
Sbjct: 232 VKSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPS-SSGFLT 290
Query: 317 FGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
G + +V P++R+ + P+FY V L + VGG ++ I +F G VM
Sbjct: 291 LGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSA------GTVM 344
Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSG 434
D+GT +TRLP AY A AF A P A I DTC++ SG SV +P+V+ FSG
Sbjct: 345 DSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSG 404
Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGFGP 492
G V++L AS ++ + C AFA + S L IIGN+QQ ++ +D G VGF
Sbjct: 405 GAVVSLDASGIIL------SNCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRA 458
Query: 493 NVC 495
C
Sbjct: 459 GAC 461
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 160/414 (38%), Positives = 222/414 (53%), Gaps = 31/414 (7%)
Query: 102 HRHQHSFHARMQ-------RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSG 154
H H + ++Q R R++ LV R + G AA D+ + G+G
Sbjct: 63 HVDAHGNYTKLQLLRRAARRSHHRMSRLVARTATGSVKAAA------APDLQVPVHAGNG 116
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
E+ + + +G+P + ++D+GSD+VW QC+PC +C+ QS PVFDP+ S+++S + CSS+
Sbjct: 117 EFLMDMSIGTPALAYAAIVDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYSTLPCSSS 176
Query: 215 VCDRLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM- 271
+C L + C A C Y +YGD S T+G LA ET T+ +T + VA GCG N+G
Sbjct: 177 LCSDLPTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTLAKTKLPGVAFGCGDTNEGDG 236
Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL----PVGAA 327
F AGL+GLG G +SLV QLG G FSYCL S S L+ G A AA
Sbjct: 237 FTQGAGLVGLGRGPLSLVSQLG---LGKFSYCLTSLDDTSKSPLLLGSLAAISTDTASAA 293
Query: 328 WV---PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
+ PL++NP PSFYYV L L VG RIP+ F + G GV++D+GT++T L
Sbjct: 294 AIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTGGVIVDSGTSITYLE 353
Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYN--LSGFVSVRVPTVSFYFSGGPVLTLP 441
Y + AF AQ LP A G ++ D C+ SG V VP + +F GG L LP
Sbjct: 354 LQGYRPLKKAFAAQM-KLPVADGSAVGLDLCFKAPASGVDDVEVPKLVLHFDGGADLDLP 412
Query: 442 ASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
A N+++ +G C S GLSIIGN QQ+ IQ +D + F P C
Sbjct: 413 AENYMVLDSASGALCLTVMGS-RGLSIIGNFQQQNIQFVYDVDKDTLSFAPVQC 465
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 262 bits (670), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 156/395 (39%), Positives = 226/395 (57%), Gaps = 20/395 (5%)
Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
R+Q +KR + ++RL+ A+ + +D + + + G+GEY + + +G+PP S
Sbjct: 66 RVQHGIKRGKSRLQRLNAMVLAASTLDSED---QLEAPIHAGNGEYLMELAIGTPPVSYP 122
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
V+D+GSD++W QC+PC+QCYKQ P+FDP S+SFS VSC S++C + ++ C G C
Sbjct: 123 AVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVSCGSSLCSAVPSSTCSDG-CE 181
Query: 231 YEVSYGDGSYTKGTLALETLTIGRTV----VKNVAIGCGHKNQGM-FVGAAGLLGLGGGS 285
Y SYGD S T+G LA ET T G++ V N+ GCG N+G F A+GL+GLG G
Sbjct: 182 YVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFEQASGLVGLGRGP 241
Query: 286 MSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV---PLVRNPRAPSFYY 342
+SLV QL FSYCL L+ G A V PL++NP PSFYY
Sbjct: 242 LSLVSQLKEP---RFSYCLTPMDDTKESILLLGSLGKVKDAKEVVTTPLLKNPLQPSFYY 298
Query: 343 VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL 402
+ L G+ VG R+ I + F + G+ GV++D+GT +T + A+EA + F++QT L
Sbjct: 299 LSLEGISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYIEQKAFEALKKEFISQT-KL 357
Query: 403 PRASGVSI-FDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA 460
P S D C++L SG V +P + F+F GG L LPA N++I + G C A
Sbjct: 358 PLDKTSSTGLDLCFSLPSGSTQVEIPKIVFHFKGGD-LELPAENYMIGDSNLGVACLAMG 416
Query: 461 PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
S SG+SI GN+QQ+ I ++ D + F P C
Sbjct: 417 AS-SGMSIFGNVQQQNILVNHDLEKETISFVPTSC 450
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 262 bits (669), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 146/355 (41%), Positives = 199/355 (56%), Gaps = 17/355 (4%)
Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC 211
G+GE+ + + +G+P + +ID+GSD+VW QC+PC +C+ QS PVFDP+ S++++ + C
Sbjct: 98 GNGEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYAALPC 157
Query: 212 SSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
SS +C L ++ C + +C Y +YGD S T+G LA ET T+ +T + +VA GCG N+G
Sbjct: 158 SSTLCSDLPSSKCTSAKCGYTYTYGDSSSTQGVLAAETFTLAKTKLPDVAFGCGDTNEGD 217
Query: 272 -FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-------LP 323
F AGL+GLG G +SLV QLG FSYCL S S L+ G A
Sbjct: 218 GFTQGAGLVGLGRGPLSLVSQLGLN---KFSYCLTSLDDTSKSPLLLGSLATISESAAAA 274
Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
PL+RNP PSFYYV L GL VG I + F + G GV++D+GT++T L
Sbjct: 275 SSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTGGVIVDSGTSITYL 334
Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYN--LSGFVSVRVPTVSFYFSGGPVLTL 440
Y A + AF AQ LP A G I DTC+ SG V VP + F+ G L L
Sbjct: 335 ELQGYRALKKAFAAQM-KLPAADGSGIGLDTCFEAPASGVDQVEVPKLVFHLDGAD-LDL 392
Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
PA N+++ +G C S GLSIIGN QQ+ IQ +D + F P C
Sbjct: 393 PAENYMVLDSGSGALCLTVMGS-RGLSIIGNFQQQNIQFVYDVGENTLSFAPVQC 446
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 261 bits (668), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 151/355 (42%), Positives = 204/355 (57%), Gaps = 15/355 (4%)
Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASF 206
G+ G+G Y V + +G+P +V D+GSD WVQCQPC + CY+Q +P+FDP SA++
Sbjct: 153 GVALGTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATY 212
Query: 207 SGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH 266
+ +SCSS+ C L +GC G C Y + YGDGSYT G A +TLT+ +KN GCG
Sbjct: 213 ANISCSSSYCSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLAYDTIKNFRFGCGE 272
Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA 326
KN+G+F AAGLLGLG G SL Q + GG F+YCL + G +G L G A A
Sbjct: 273 KNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPATSAG-TGFLDLGPGAPAANA 331
Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
P++ + R P+FYYVG++G+ VGG +PI +F G ++D+GT +TRLP
Sbjct: 332 RLTPMLVD-RGPTFYYVGMTGIKVGGHVLPIPGSVFSTA-----GTLVDSGTVITRLPPS 385
Query: 387 AYEAFRDAFVAQTGNL--PRASGVSIFDTCYNLSGFV--SVRVPTVSFYFSGGPVLTLPA 442
AY R AF L A SI DTCY+L+G S+ +P VS F GG L + A
Sbjct: 386 AYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGACLDVDA 445
Query: 443 SNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
S L V D C AFAP+ + ++I+GN QQ+ + +D VGF P C
Sbjct: 446 SGILY-VADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 499
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 261 bits (668), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 153/431 (35%), Positives = 225/431 (52%), Gaps = 40/431 (9%)
Query: 83 LVHRDKMSSSSNTTNNMHYHRH-QHSFHAR-MQRDVKRVATLVRRLSGGGADAAKHEVQD 140
+ H+D S N R +F R +Q +K + LSG D+ ++
Sbjct: 1 MKHKDSCSGKILDWNKKLQKRLIMDNFQLRSLQSRIKNII-----LSGNIDDSVDTQIP- 54
Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
+ SG+ S Y V + +G R +++D+GSD+ WVQCQPC++CY Q DPVF+P
Sbjct: 55 ----LTSGIRLQSLNYIVTVELGG--RKMTVIVDTGSDLSWVQCQPCNRCYNQQDPVFNP 108
Query: 201 ADSASFSGVSCSSAVCDRLENA-------GCHAGRCRYEVSYGDGSYTKGTLALETLTIG 253
+ S S+ V C+S C L+ A G + C Y V+YGDGSYT G + +E L +G
Sbjct: 109 SKSPSYRTVLCNSLTCRSLQLATGNSGVCGSNPPTCNYVVNYGDGSYTSGEVGMEHLNLG 168
Query: 254 RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSG 313
T V N GCG KNQG+F GA+GL+GLG +SL+ Q+ GG FSYCL + +SG
Sbjct: 169 NTTVNNFIFGCGRKNQGLFGGASGLVGLGRTDLSLISQISPMFGGVFSYCLPTTEAEASG 228
Query: 314 SLVFG------REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
SLV G + P+ ++ ++ NP P FY++ L+G+ VGG+ + +
Sbjct: 229 SLVMGGNSSVYKNTTPI--SYTRMIHNPLLP-FYFLNLTGITVGGVEV-------QAPSF 278
Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
G D +++D+GT ++RLP Y+A + FV Q P A I D+C+NLSG+ V++P
Sbjct: 279 GKDRMIIDSGTVISRLPPSIYQALKAEFVKQFSGYPSAPSFMILDSCFNLSGYQEVKIPD 338
Query: 428 VSFYFSGGPVLTLPASNFLIPVD-DAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGA 484
+ YF G L + + V DA C A A P + IIGN QQ+ +I +D
Sbjct: 339 IKMYFEGSAELNVDVTGVFYSVKTDASQVCLAIASLPYEDEVGIIGNYQQKNQRIIYDTK 398
Query: 485 NGFVGFGPNVC 495
+GF C
Sbjct: 399 GSMLGFAEEAC 409
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 261 bits (667), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 151/355 (42%), Positives = 204/355 (57%), Gaps = 15/355 (4%)
Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASF 206
G+ G+G Y V + +G+P +V D+GSD WVQCQPC + CY+Q +P+FDP SA++
Sbjct: 88 GVALGTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATY 147
Query: 207 SGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH 266
+ +SCSS+ C L +GC G C Y + YGDGSYT G A +TLT+ +KN GCG
Sbjct: 148 ANISCSSSYCSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLAYDTIKNFRFGCGE 207
Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA 326
KN+G+F AAGLLGLG G SL Q + GG F+YCL + G +G L G A A
Sbjct: 208 KNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPATSAG-TGFLDLGPGAPAANA 266
Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
P++ + R P+FYYVG++G+ VGG +PI +F G ++D+GT +TRLP
Sbjct: 267 RLTPMLVD-RGPTFYYVGMTGIKVGGHVLPIPGSVFSTA-----GTLVDSGTVITRLPPS 320
Query: 387 AYEAFRDAFVAQTGNL--PRASGVSIFDTCYNLSGFV--SVRVPTVSFYFSGGPVLTLPA 442
AY R AF L A SI DTCY+L+G S+ +P VS F GG L + A
Sbjct: 321 AYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGACLDVDA 380
Query: 443 SNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
S L V D C AFAP+ + ++I+GN QQ+ + +D VGF P C
Sbjct: 381 SGILY-VADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 434
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 261 bits (666), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 148/388 (38%), Positives = 207/388 (53%), Gaps = 17/388 (4%)
Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
R+QR +KR ++RLS A F + V + + G+GE+ +++ +G+P +
Sbjct: 60 RLQRAMKRGKLRLQRLSAKTAS--------FESSVEAPVHAGNGEFLMKLAIGTPAETYS 111
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
++D+GSD++W QC+PC C+ Q P+FDP S+SFS + CSS +C L + C G C
Sbjct: 112 AIMDTGSDLIWTQCKPCKDCFDQPTPIFDPKKSSSFSKLPCSSDLCAALPISSCSDG-CE 170
Query: 231 YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLV 289
Y SYGD S T+G LA ET G V + GCG N G F AGL+GLG G +SL+
Sbjct: 171 YLYSYGDYSSTQGVLATETFAFGDASVSKIGFGCGEDNDGSGFSQGAGLVGLGRGPLSLI 230
Query: 290 GQLGGQTGGAFSYCLVSRGTGSS-GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGL 348
QLG FSYCL S SL+ G EA A PL++NP PSFYY+ L G+
Sbjct: 231 SQLGEP---KFSYCLTSMDDSKGISSLLVGSEATMKNAITTPLIQNPSQPSFYYLSLEGI 287
Query: 349 GVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV 408
VG +PI + F + G G+++D+GT +T L A+ A + F++Q SG
Sbjct: 288 SVGDTLLPIEKSTFSIQNDGSGGLIIDSGTTITYLEDSAFAALKKEFISQLKLDVDESGS 347
Query: 409 SIFDTCYNLSGFVS-VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS 467
+ D C+ L S V VP + F+F G L LPA N++I G C S SG+S
Sbjct: 348 TGLDLCFTLPPDASTVDVPQLVFHFEGAD-LKLPAENYIIADSGLGVICLTMG-SSSGMS 405
Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
I GN QQ+ I + D + F P C
Sbjct: 406 IFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 261 bits (666), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 153/394 (38%), Positives = 222/394 (56%), Gaps = 17/394 (4%)
Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
R+Q +KR + +++L+ A D + + + G+GEY + + +G+PP S
Sbjct: 65 RVQHGIKRGKSRLQKLNA--MVLAASSTPDSEDQLEAPIHAGNGEYLIELAIGTPPVSYP 122
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
V+D+GSD++W QC+PC++CYKQ P+FDP S+SFS VSC S++C L ++ C G C
Sbjct: 123 AVLDTGSDLIWTQCKPCTRCYKQPTPIFDPKKSSSFSKVSCGSSLCSALPSSTCSDG-CE 181
Query: 231 YEVSYGDGSYTKGTLALETLTIGRTV----VKNVAIGCGHKNQGM-FVGAAGLLGLGGGS 285
Y SYGD S T+G LA ET T G++ V N+ GCG N+G F A+GL+GLG G
Sbjct: 182 YVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFEQASGLVGLGRGP 241
Query: 286 MSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV---PLVRNPRAPSFYY 342
+SLV QL Q FSYCL L+ G A V PL++NP PSFYY
Sbjct: 242 LSLVSQLKEQ---RFSYCLTPIDDTKESVLLLGSLGKVKDAKEVVTTPLLKNPLQPSFYY 298
Query: 343 VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL 402
+ L + VG R+ I + F + G+ GV++D+GT +T + AYEA + F++QT
Sbjct: 299 LSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYVQQKAYEALKKEFISQTKLA 358
Query: 403 PRASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP 461
+ + D C++L SG V +P + F+F GG L LPA N++I + G C A
Sbjct: 359 LDKTSSTGLDLCFSLPSGSTQVEIPKLVFHFKGGD-LELPAENYMIGDSNLGVACLAMGA 417
Query: 462 SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
S SG+SI GN+QQ+ I ++ D + F P C
Sbjct: 418 S-SGMSIFGNVQQQNILVNHDLEKETISFVPTSC 450
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 260 bits (665), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 166/423 (39%), Positives = 230/423 (54%), Gaps = 30/423 (7%)
Query: 90 SSSSNTTNNMHYHRH----------QHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
SSS+ +HRH + + RD R A + R+ SGGG +
Sbjct: 122 SSSAGAATVPLHHRHGPCSPLPTKKMPTLEETLHRDQLRAAYIQRKFSGGGGAGGDVQRS 181
Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
D G + EY + +G+GSP SQ M+ID+GSD+ WVQC+PCSQC+ Q+DP+FD
Sbjct: 182 DATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFD 241
Query: 200 PADSASFSGVSCSSAVCDRL--ENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV 256
P+ S+++S SC SA C +L E GC + +C+Y V+YGDGS T GT + +TL +G +
Sbjct: 242 PSSSSTYSPFSCGSADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSA 301
Query: 257 VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLV 316
V++ GC + G GL+GLGGG+ SLV Q G G AFSYCL + SSG L
Sbjct: 302 VRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPS-SSGFLT 360
Query: 317 FGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
G + +V P++R+ + P+FY V L + VGG ++ I +F G VM
Sbjct: 361 LGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF------SAGTVM 414
Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSG 434
D+GT +TRLP AY A AF A P A I DTC++ SG SV +P+V+ FSG
Sbjct: 415 DSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSG 474
Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGP 492
G V++L AS ++ + C AFA S L IIGN+QQ ++ +D G VGF
Sbjct: 475 GAVVSLDASGIIL------SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRA 528
Query: 493 NVC 495
C
Sbjct: 529 GAC 531
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 148/412 (35%), Positives = 230/412 (55%), Gaps = 18/412 (4%)
Query: 91 SSSNTTNNMHYHRHQHSFHARMQ-----RDVKRVATLVRRLSGGGADAAKHEVQDFG-TD 144
S+S T N H+ F ++ +++ + L R + G + E G +
Sbjct: 24 STSRTALNHHHEPKVAGFQIMLEHVDSGKNLTKFELLERAVERGSRRLQRLEAMLNGPSG 83
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
V + + G GEY + + +G+P + ++D+GSD++W QCQPC+QC+ QS P+F+P S+
Sbjct: 84 VETPVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSS 143
Query: 205 SFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC 264
SFS + CSS +C L++ C C+Y YGDGS T+G++ ETLT G + N+ GC
Sbjct: 144 SFSTLPCSSQLCQALQSPTCSNNSCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGC 203
Query: 265 GHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-- 321
G NQG G AGL+G+G G +SL QL FSYC+ G+ +S +L+ G A
Sbjct: 204 GENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSTSSTLLLGSLANS 260
Query: 322 LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL-TQMGDDGVVMDTGTAV 380
+ G+ L+ + + P+FYY+ L+GL VG +PI +F+L + G G+++D+GT +
Sbjct: 261 VTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTL 320
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNL-SGFVSVRVPTVSFYFSGGPVL 438
T AY+A R AF++Q NL +G S FD C+ + S ++++PT +F GG L
Sbjct: 321 TYFADNAYQAVRQAFISQM-NLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGGD-L 378
Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
LP+ N+ I + G C A S G+SI GNIQQ+ + + +D N V F
Sbjct: 379 VLPSENYFISPSN-GLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVSF 429
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 166/423 (39%), Positives = 230/423 (54%), Gaps = 30/423 (7%)
Query: 90 SSSSNTTNNMHYHRH----------QHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
SSS+ +HRH + + RD R A + R+ SGGG +
Sbjct: 52 SSSAGAATVPLHHRHGPCSPLPTKKMPTLEETLHRDQLRAAYIQRKFSGGGGAGGDVQRS 111
Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
D G + EY + +G+GSP SQ M+ID+GSD+ WVQC+PCSQC+ Q+DP+FD
Sbjct: 112 DATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFD 171
Query: 200 PADSASFSGVSCSSAVCDRL--ENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV 256
P+ S+++S SC SA C +L E GC + +C+Y V+YGDGS T GT + +TL +G +
Sbjct: 172 PSSSSTYSPFSCGSADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSA 231
Query: 257 VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLV 316
V++ GC + G GL+GLGGG+ SLV Q G G AFSYCL + SSG L
Sbjct: 232 VRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPS-SSGFLT 290
Query: 317 FGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
G + +V P++R+ + P+FY V L + VGG ++ I +F G VM
Sbjct: 291 LGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSA------GTVM 344
Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSG 434
D+GT +TRLP AY A AF A P A I DTC++ SG SV +P+V+ FSG
Sbjct: 345 DSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSG 404
Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGP 492
G V++L AS ++ + C AFA S L IIGN+QQ ++ +D G VGF
Sbjct: 405 GAVVSLDASGIIL------SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRA 458
Query: 493 NVC 495
C
Sbjct: 459 GAC 461
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 148/412 (35%), Positives = 231/412 (56%), Gaps = 18/412 (4%)
Query: 91 SSSNTTNNMHYHRHQHSFHARMQ-----RDVKRVATLVRRLSGGGADAAKHEVQDFG-TD 144
S+S T N H+ F ++ +++ + L R + G + E G +
Sbjct: 24 STSRTALNHHHEPKVAGFQIMLEHVDSGKNLTKFELLERAVERGSRRLQRLEAMLNGPSG 83
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
V + + G GEY + + +G+P + ++D+GSD++W QCQPC+QC+ QS P+F+P S+
Sbjct: 84 VETPVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSS 143
Query: 205 SFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC 264
SFS + CSS +C L++ C C+Y YGDGS T+G++ ETLT G + N+ GC
Sbjct: 144 SFSTLPCSSQLCQALQSPTCSNNSCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGC 203
Query: 265 GHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-- 321
G NQG G AGL+G+G G +SL QL FSYC+ G+ +S +L+ G A
Sbjct: 204 GENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSNSSTLLLGSLANS 260
Query: 322 LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL-TQMGDDGVVMDTGTAV 380
+ G+ L+++ + P+FYY+ L+GL VG +PI +F+L + G G+++D+GT +
Sbjct: 261 VTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTL 320
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNL-SGFVSVRVPTVSFYFSGGPVL 438
T AY+A R AF++Q NL +G S FD C+ + S ++++PT +F GG L
Sbjct: 321 TYFVDNAYQAVRQAFISQM-NLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGGD-L 378
Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
LP+ N+ I + G C A S G+SI GNIQQ+ + + +D N V F
Sbjct: 379 VLPSENYFISPSN-GLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVSF 429
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 163/427 (38%), Positives = 223/427 (52%), Gaps = 30/427 (7%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
L L HR + + + + SF ++ D +R + RR+SG A A ++
Sbjct: 67 LRLTHRHGPCAPAGKASALG---SPPSFLDTLRADQRRAEYIQRRVSGAAAAAPGMQLAG 123
Query: 141 FGTDVVS---GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ--CYKQSD 195
V G G+ +Y V + +G+P +Q + +D+GSD+ WVQC+PC CY Q D
Sbjct: 124 SKAATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRD 183
Query: 196 PVFDPADSASFSGVSCSSAVCDRLE--NAGCHAGRCRYEVSYGDGSYTKGTLALETLTI- 252
P+FDP S+S+S V C++A C +L + GC G+C Y VSYGDGS T G + +TLT+
Sbjct: 184 PLFDPTRSSSYSAVPCAAASCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLT 243
Query: 253 GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS 312
G +K GCGH QG+F G GLLGLG SLV Q GG FSYCL S
Sbjct: 244 GSNALKGFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPT-QNSV 302
Query: 313 GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
G + G + G + PL+ P++Y V L+G+ VGG + I +F G
Sbjct: 303 GYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFA------SGA 356
Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSF 430
V+DTGT VTRLP AY A R AF A P A I DTCY+ + + +V +PT+S
Sbjct: 357 VVDTGTVVTRLPPTAYSALRSAFRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTISI 416
Query: 431 YFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFV 488
F GG + L S L + C AFAP+ S SI+GN+QQ ++ FDG+ V
Sbjct: 417 AFGGGAAMDLGTSGILT------SGCLAFAPTGGDSQASILGNVQQRSFEVRFDGST--V 468
Query: 489 GFGPNVC 495
GF P C
Sbjct: 469 GFMPASC 475
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 159/422 (37%), Positives = 227/422 (53%), Gaps = 25/422 (5%)
Query: 91 SSSNTTNNMHYHRHQHSFHAR---------MQRDVKRVATLVRRLSGGGADAAKHEVQDF 141
S+S+ N +H AR + D RV ++ R+++ +
Sbjct: 70 SNSSALNVVHRQGPCSPLQARGAPPPHAELLNDDQARVDSIHRKIAAAASPVLDQARGKK 129
Query: 142 GTDVVS--GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
G + + G+ G+G Y V +G+G+P R +V D+GSD+ WVQC PCS CY+Q DP+FD
Sbjct: 130 GVTLPAQRGISLGTGNYVVSMGLGTPARDMTVVFDTGSDLSWVQCTPCSDCYEQKDPLFD 189
Query: 200 PADSASFSGVSCSSAVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VV 257
PA S+++S V C+S C L++ C +CRYEV YGD S T G LA +TLT+ ++ V+
Sbjct: 190 PARSSTYSAVPCASPECQGLDSRSCSRDKKCRYEVVYGDQSQTDGALARDTLTLTQSDVL 249
Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF 317
GCG ++ G+F A GL+GLG +SL Q + G FSYCL S + ++G L
Sbjct: 250 PGFVFGCGEQDTGLFGRADGLVGLGREKVSLSSQAASKYGAGFSYCLPSSPS-AAGYLSL 308
Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
G A P A + + +PSFYYV L G+ V G + +S +F G V+D+G
Sbjct: 309 GGPA-PANARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVFSAA-----GTVIDSG 362
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGN--LPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGG 435
T +TRLP Y A R AF G RA +SI DTCY+ +G +VR+P+V+ F+GG
Sbjct: 363 TVITRLPPRVYAALRSAFARSMGRYGYKRAPALSILDTCYDFTGHTTVRIPSVALVFAGG 422
Query: 436 PVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS--IIGNIQQEGIQISFDGANGFVGFGPN 493
+ L S L V C AFAP+ G IIGN QQ+ + + +D A +GFG N
Sbjct: 423 AAVGLDFSGVLY-VAKVSQACLAFAPNGDGADAGIIGNTQQKTLAVVYDVARQKIGFGAN 481
Query: 494 VC 495
C
Sbjct: 482 GC 483
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 152/402 (37%), Positives = 219/402 (54%), Gaps = 23/402 (5%)
Query: 109 HAR-MQRDVKRVATLVRRLSG---GGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGS 164
HA + RD RV ++ R + AD + G+ G+ Y V +G+G+
Sbjct: 87 HAEILDRDQDRVDSIHRLAAARPSSTADDPSSASKGVSLPARRGVPLGTANYIVSVGLGT 146
Query: 165 PPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC 224
P R +V D+GSD+ WVQC+PC CY+Q DP+FDP+ S ++S V C + C RL++ C
Sbjct: 147 PKRDLLVVFDTGSDLSWVQCKPCDGCYQQHDPLFDPSQSTTYSAVPCGAQECRRLDSGSC 206
Query: 225 HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV-------VKNVAIGCGHKNQGMFVGAAG 277
+G+CRYEV YGD S T G LA +TLT+G + ++ GCG + G+F A G
Sbjct: 207 SSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSDQLQEFVFGCGDDDTGLFGKADG 266
Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRA 337
L GLG +SL Q + G FSYCL S T + G L G A P A + +V
Sbjct: 267 LFGLGRDRVSLASQAAAKYGAGFSYCLPSSST-AEGYLSLG-SAAPPNARFTAMVTRSDT 324
Query: 338 PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF-- 395
PSFYY+ L G+ V G + +S +FR G V+D+GT +TRLP+ AY A R +F
Sbjct: 325 PSFYYLNLVGIKVAGRTVRVSPAVFRTP-----GTVIDSGTVITRLPSRAYAALRSSFAG 379
Query: 396 VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF 455
+ + + RA +SI DTCY+ +G V++P+V+ F GG L L L V +
Sbjct: 380 LMRRYSYKRAPALSILDTCYDFTGRNKVQIPSVALLFDGGATLNLGFGEVLY-VANKSQA 438
Query: 456 CFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C AFA + + ++I+GN+QQ+ + +D AN +GFG C
Sbjct: 439 CLAFASNGDDTSIAILGNMQQKTFAVVYDVANQKIGFGAKGC 480
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 259 bits (661), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 169/457 (36%), Positives = 235/457 (51%), Gaps = 36/457 (7%)
Query: 51 HAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHA 110
H ++ ++ L R + S N +S L L HR + + + + SF
Sbjct: 32 HIQLRDWDSL--RVSAASPRNGTSAV----LRLTHRHGPCAPAGKASALG---SPPSFLD 82
Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVS---GMDQGSGEYFVRIGVGSPPR 167
++ D +R + RR+SG A A ++ V G G+ +Y V + +G+P
Sbjct: 83 TLRADQRRAEYIQRRVSGAAAAAPGMQLAGSKAATVPANLGFSIGTLQYVVTVSLGTPAV 142
Query: 168 SQYMVIDSGSDIVWVQCQPCSQ--CYKQSDPVFDPADSASFSGVSCSSAVCDRLE--NAG 223
+Q + +D+GSD+ WVQC+PC CY Q DP+FDP S+S+S V C++A C +L + G
Sbjct: 143 AQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAAASCSQLALYSNG 202
Query: 224 CHAGRCRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
C G+C Y VSYGDGS T G + +TLT+ G +K GCGH QG+F G GLLGLG
Sbjct: 203 CSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGSNALKGFLFGCGHAQQGLFAGVDGLLGLG 262
Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYY 342
SLV Q GG FSYCL S G + G + G + PL+ P++Y
Sbjct: 263 RQGQSLVSQASSTYGGVFSYCLPPT-QNSVGYISLGGPSSTAGFSTTPLLTASNDPTYYI 321
Query: 343 VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG-- 400
V L+G+ VGG + I +F G V+DTGT VTRLP AY A R AF A
Sbjct: 322 VMLAGISVGGQPLSIDASVFA------SGAVVDTGTVVTRLPPTAYSALRSAFRAAMAPY 375
Query: 401 NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA 460
P A I DTCY+ + + +V +PT+S F GG + L S L + C AFA
Sbjct: 376 GYPSAPATGILDTCYDFTRYGTVTLPTISIAFGGGAAMDLGTSGILT------SGCLAFA 429
Query: 461 PS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
P+ S SI+GN+QQ ++ FDG+ VGF P C
Sbjct: 430 PTGGDSQASILGNVQQRSFEVRFDGST--VGFMPASC 464
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 148/392 (37%), Positives = 213/392 (54%), Gaps = 14/392 (3%)
Query: 109 HAR-MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPR 167
HA + RD RV ++ R +G + G+ G+ Y V +G+G+P R
Sbjct: 140 HAEILDRDQDRVDSIHRMTAGPWTAGQSSASKGVSLPAHRGLRLGTANYIVSVGLGTPRR 199
Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG 227
+V D+GSD+ WVQC+PC+ CYKQ DP+FDP+ S ++S V C + C L++ C +G
Sbjct: 200 DLLVVFDTGSDLSWVQCKPCNNCYKQHDPLFDPSQSTTYSAVPCGAQEC--LDSGTCSSG 257
Query: 228 RCRYEVSYGDGSYTKGTLALETLTIGRTV--VKNVAIGCGHKNQGMFVGAAGLLGLGGGS 285
+CRYEV YGD S T G LA +TLT+G + ++ GCG + G+F A GL GLG
Sbjct: 258 KCRYEVVYGDMSQTDGNLARDTLTLGPSSDQLQGFVFGCGDDDTGLFGRADGLFGLGRDR 317
Query: 286 MSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGL 345
+SL Q + G FSYCL S + G L G A P A + +V PSFYY+ L
Sbjct: 318 VSLASQAAARYGAGFSYCLPSSWR-AEGYLSLGSAAAPPHAQFTAMVTRSDTPSFYYLDL 376
Query: 346 SGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA 405
G+ V G + ++ +F+ G V+D+GT +TRLP+ AY A R +F RA
Sbjct: 377 VGIKVAGRTVRVAPAVFKAP-----GTVIDSGTVITRLPSRAYSALRSSFAGFMRRYKRA 431
Query: 406 SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--P 463
+SI DTCY+ +G V++P+V+ F GG L L L V + C AFA +
Sbjct: 432 PALSILDTCYDFTGRTKVQIPSVALLFDGGATLNLGFGGVLY-VANRSQACLAFASNGDD 490
Query: 464 SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ + I+GN+QQ+ + +D AN +GFG C
Sbjct: 491 TSVGILGNMQQKTFAVVYDLANQKIGFGAKGC 522
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 257 bits (657), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 160/396 (40%), Positives = 222/396 (56%), Gaps = 20/396 (5%)
Query: 107 SFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPP 166
+ + RD R A + R+ SGGG + D G + EY + +G+GSP
Sbjct: 3 TLEETLHRDQLRAAYIQRKFSGGGGAGGDVQRSDATVPTALGTSLNTLEYLITVGLGSPA 62
Query: 167 RSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENAGC 224
SQ M+ID+GSD+ WVQC+PCSQC+ Q+DP+FDP+ S+++S SC SA C +L E GC
Sbjct: 63 TSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSADCAQLGQEGNGC 122
Query: 225 -HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGG 283
+ +C+Y V+YGDGS T GT + +TL +G + V++ GC + G GL+GLGG
Sbjct: 123 SSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSAVRSFQFGCSNVESGFNDQTDGLMGLGG 182
Query: 284 GSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFY 341
G+ SLV Q G G AFSYCL + SSG L G + +V P++R+ + P+FY
Sbjct: 183 GAQSLVSQTAGTLGRAFSYCLPPTPS-SSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFY 241
Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
V L + VGG ++ I +F G VMD+GT +TRLP AY A AF A
Sbjct: 242 GVRLQAIRVGGRQLSIPASVFSA------GTVMDSGTVITRLPPTAYSALSSAFKAGMKQ 295
Query: 402 LPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA- 460
P A I DTC++ SG SV +P+V+ FSGG V++L AS ++ + C AFA
Sbjct: 296 YPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL------SNCLAFAG 349
Query: 461 -PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
S L IIGN+QQ ++ +D G VGF C
Sbjct: 350 NSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 385
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 257 bits (657), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 147/433 (33%), Positives = 224/433 (51%), Gaps = 42/433 (9%)
Query: 83 LVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRL----SGGGADAAKHEV 138
+ HRD +SS +T+ + D RV +L R+ SG DA ++
Sbjct: 1 MKHRDFCNSSGKSTD------WNKKLQKSLILDDFRVRSLQSRIKSIFSGNNIDALDSQI 54
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
+ SG+ + Y V + +G R+ +++D+GSD+ WVQCQPC CY Q DP+F
Sbjct: 55 P-----LSSGVRLQTLNYIVTVEIGG--RNMTVIVDTGSDLTWVQCQPCRLCYNQQDPLF 107
Query: 199 DPADSASFSGVSCSSAVCDRLENA-------GCHAGRCRYEVSYGDGSYTKGTLALETLT 251
+P+ S S+ + C+S+ C L+ A G + C Y V+YGDGSYT+G L +E L
Sbjct: 108 NPSGSPSYQTILCNSSTCQSLQYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLN 167
Query: 252 IGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
+G T V N GCG N+G+F GA+GL+GLG +SLV Q G FSYCL + +
Sbjct: 168 LGTTHVSNFIFGCGRNNKGLFGGASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTAADA 227
Query: 312 SGSLVFG------REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
SGSL+ G + P+ ++ ++ NP+ P+FY++ L+G+ +GG+ + +
Sbjct: 228 SGSLILGGNSSVYKNTTPI--SYTRMIANPQLPTFYFLNLTGISIGGVAL-------QAP 278
Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV 425
G+++D+GT +TRLP P Y + F+ Q P A SI DTC+NL+G+ V +
Sbjct: 279 NYRQSGILIDSGTVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNGYDEVDI 338
Query: 426 PTVSFYFSGGPVLTLPASN-FLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFD 482
PT+ F G LT+ + F DA C A A + IIGN QQ ++ ++
Sbjct: 339 PTIRMQFEGNAELTVDVTGIFYFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYN 398
Query: 483 GANGFVGFGPNVC 495
+GF C
Sbjct: 399 TKESKLGFAAEAC 411
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 176/426 (41%), Positives = 226/426 (53%), Gaps = 58/426 (13%)
Query: 83 LVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATL------VRRLSGGGADAAKH 136
L HR+ ++ + T + HR + RD R + V R GG
Sbjct: 82 LAHREAFAAPNATAAQLLAHR--------LARDAARAEAISVSARNVTRAGGG------- 126
Query: 137 EVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP 196
F VVSG+ QGSGEYF +GVG+PP +V+D+GSD+VW+QC PC QCY QS
Sbjct: 127 ----FSAPVVSGLAQGSGEYFASVGVGTPPTPALLVLDTGSDVVWLQCAPCRQCYAQSGR 182
Query: 197 VFDPADSASFSGVSCSSAVCDRLENAGCHAGR-----CRYEVSYGDGSYTKGTLALETLT 251
VFDP S S++ V C + C L+ G C Y+V+YGDGS T G LA ETL
Sbjct: 183 VFDPRRSRSYAAVRCGAPPCRGLDAGGGGGCDRRRGTCLYQVAYGDGSVTAGDLATETLW 242
Query: 252 IGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG 310
R V VA+GCGH N+G+FV AAGLLGLG G +SL Q + G FSYC +G+
Sbjct: 243 FARGARVPRVAVGCGHDNEGLFVAAAGLLGLGRGRLSLPTQTARRYGRRFSYCF--QGSD 300
Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
+ VG A V G+G +R+ S G
Sbjct: 301 LDHRTIIRTVHQHVGGARV----------------RGVGERSLRLDPS--------TGRG 336
Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS-GVSIFDTCYNLSGFVSVRVPTVS 429
GV++D+GT+VTRL P Y A R+AF A G L A G S+FDTCY+L G V+VPTVS
Sbjct: 337 GVILDSGTSVTRLARPVYVAVREAFRAAAGGLRLAPGGFSLFDTCYDLRGRRVVKVPTVS 396
Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVG 489
+ +GG + LP N+LIPVD GTFC A A + G+SI+GNIQQ+G ++ FDG V
Sbjct: 397 VHLAGGAEVALPPENYLIPVDTRGTFCLALAGTDGGVSIVGNIQQQGFRVVFDGDRQRVA 456
Query: 490 FGPNVC 495
P C
Sbjct: 457 LVPKSC 462
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 166/425 (39%), Positives = 224/425 (52%), Gaps = 35/425 (8%)
Query: 90 SSSSNTTNNMHYHRH----------QHSFHARMQRDVKRVATLVRRLSGG----GADAAK 135
SSS TT +H HRH S R+ RD R A + R+ SG G A
Sbjct: 52 SSSGATTVPLH-HRHGPCSPLPTKKMPSLEDRLHRDQLRAAYIKRKFSGDVKKDGQGAGG 110
Query: 136 HEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD 195
E G + EY + + +GSP ++Q ++IDSGSD+ WVQC+PC QC+ Q D
Sbjct: 111 VEQSHVTVPTTLGTSLNTLEYLITVRLGSPAKTQTVLIDSGSDVSWVQCKPCLQCHSQVD 170
Query: 196 PVFDPADSASFSGVSCSSAVCDRL--ENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTI 252
P+FDP+ S+++S SCSSA C +L + GC + +C+Y V Y DGS T GT + +TL +
Sbjct: 171 PLFDPSLSSTYSPFSCSSAACAQLGQDGNGCSSSSQCQYIVRYADGSSTTGTYSSDTLAL 230
Query: 253 GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS 312
G + N GC H G GL+GLGGG+ SL Q G G AFSYCL + SS
Sbjct: 231 GSNTISNFQFGCSHVESGFNDLTDGLMGLGGGAPSLASQTAGTFGTAFSYCLPPTPS-SS 289
Query: 313 GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
G L G A G P++R+ P+FY V L + VGG ++ I +F G+
Sbjct: 290 GFLTLG--AGTSGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVF------SAGM 341
Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYF 432
VMD+GT +TRLP AY A AF A A SI DTC++ SG SVR+P+V+ F
Sbjct: 342 VMDSGTIITRLPRTAYSALSSAFKAGMKQYRPAPPRSIMDTCFDFSGQSSVRLPSVALVF 401
Query: 433 SGGPVLTLPASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGF 490
SGG V+ L A+ ++ C AFA + S I+GN+QQ ++ +D G VGF
Sbjct: 402 SGGAVVNLDANGIIL------GNCLAFAANSDDSSPGIVGNVQQRTFEVLYDVGGGAVGF 455
Query: 491 GPNVC 495
C
Sbjct: 456 KAGAC 460
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 153/424 (36%), Positives = 231/424 (54%), Gaps = 46/424 (10%)
Query: 100 HYHRHQHSFHARMQRD-------VKRVATLVRRLSGGGADAAKHEVQDFGTDV--VSGMD 150
H + ++ R+Q+ V+ + +RR+ A+ H V+ T + SG++
Sbjct: 6 HCSEKKIDWNRRLQKQLILDDLRVRSMQNRIRRV------ASTHNVEASQTQIPLSSGIN 59
Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
+ Y V +G+GS ++ ++ID+GSD+ WVQC+PC CY Q P+F P+ S+S+ VS
Sbjct: 60 LQTLNYIVTMGLGS--KNMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVS 117
Query: 211 CSSAVCDRLENAGCHAG--------RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAI 262
C+S+ C L+ A + G C Y V+YGDGSYT G L +E L+ G V +
Sbjct: 118 CNSSTCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSFGGVSVSDFVF 177
Query: 263 GCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE-- 320
GCG N+G+F G +GL+GLG +SLV Q GG FSYCL + GSSGSLV G E
Sbjct: 178 GCGRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTEAGSSGSLVMGNESS 237
Query: 321 ----ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGG--MRIPISEDLFRLTQMGDDGVVM 374
A P+ + ++ NP+ +FY + L+G+ VGG ++ P+S G+ G+++
Sbjct: 238 VFKNANPI--TYTRMLSNPQLSNFYILNLTGIDVGGVALKAPLS--------FGNGGILI 287
Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSG 434
D+GT +TRLP+ Y+A + F+ + P A G SI DTC+NL+G+ V +PT+S F G
Sbjct: 288 DSGTVITRLPSSVYKALKAEFLKKFTGFPSAPGFSILDTCFNLTGYDEVSIPTISLRFEG 347
Query: 435 GPVLTLPAS-NFLIPVDDAGTFCFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFG 491
L + A+ F + +DA C A A +IIGN QQ ++ +D VGF
Sbjct: 348 NAQLNVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKVGFA 407
Query: 492 PNVC 495
C
Sbjct: 408 EEPC 411
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 149/354 (42%), Positives = 205/354 (57%), Gaps = 15/354 (4%)
Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASF 206
G G+G Y V +G+G+P +V D+GSD WVQCQPC CY+Q + +FDPA S+++
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 230
Query: 207 SGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCG 265
+ VSC++ C L+ +GC G C Y V YGDGSY+ G A++TLT+ VK GCG
Sbjct: 231 ANVSCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCG 290
Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
+N G+F AAGLLGLG G SL Q G+ GG F++CL +R TG +G L FG + P
Sbjct: 291 ERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARSTG-TGYLDFGAGSPPAT 349
Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
L N P+FYYVG++G+ VGG +PI+ +F G ++D+GT +TRLP
Sbjct: 350 TTTPMLTGN--GPTFYYVGMTGIRVGGRLLPIAPSVFAAA-----GTIVDSGTVITRLPP 402
Query: 386 PAYEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPAS 443
AY + R AF A +A+ VS+ DTCY+ +G V +PTVS F GG L + AS
Sbjct: 403 AAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDAS 462
Query: 444 NFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ V A C AFA + G + I+GN Q + +++D VGF P C
Sbjct: 463 GIMYTV-SASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 515
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 146/409 (35%), Positives = 223/409 (54%), Gaps = 21/409 (5%)
Query: 102 HRHQHSF--------HARMQRDVKRVATLVRRLSGGGADAAKHEVQDFG-TDVVSGMDQG 152
HRH+ H +++ + L R + G + E G + V + + G
Sbjct: 32 HRHEAKVTGFQIMLEHVDSGKNLTKFQLLERAIERGSRRLQRLEAMLNGPSGVETSVYAG 91
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
GEY + + +G+P + ++D+GSD++W QCQPC+QC+ QS P+F+P S+SFS + CS
Sbjct: 92 DGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCS 151
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
S +C L + C C+Y YGDGS T+G++ ETLT G + N+ GCG NQG
Sbjct: 152 SQLCQALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQGFG 211
Query: 273 VG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA--LPVGAAWV 329
G AGL+G+G G +SL QL FSYC+ G+ + +L+ G A + G+
Sbjct: 212 QGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSTPSNLLLGSLANSVTAGSPNT 268
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL-TQMGDDGVVMDTGTAVTRLPTPAY 388
L+++ + P+FYY+ L+GL VG R+PI F L + G G+++D+GT +T AY
Sbjct: 269 TLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAY 328
Query: 389 EAFRDAFVAQTGNLPRASGVSI-FDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
++ R F++Q NLP +G S FD C+ S ++++PT +F GG L LP+ N+
Sbjct: 329 QSVRQEFISQI-NLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGD-LELPSENYF 386
Query: 447 IPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
I + G C A S G+SI GNIQQ+ + + +D N V F C
Sbjct: 387 ISPSN-GLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 149/354 (42%), Positives = 205/354 (57%), Gaps = 15/354 (4%)
Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASF 206
G G+G Y V +G+G+P +V D+GSD WVQCQPC CY+Q + +FDPA S+++
Sbjct: 175 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 234
Query: 207 SGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCG 265
+ VSC++ C L+ +GC G C Y V YGDGSY+ G A++TLT+ VK GCG
Sbjct: 235 ANVSCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCG 294
Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
+N G+F AAGLLGLG G SL Q G+ GG F++CL +R TG +G L FG + P
Sbjct: 295 ERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARSTG-TGYLDFGAGSPPAT 353
Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
L N P+FYYVG++G+ VGG +PI+ +F G ++D+GT +TRLP
Sbjct: 354 TTTPMLTGN--GPTFYYVGMTGIRVGGRLLPIAPSVFAAA-----GTIVDSGTVITRLPP 406
Query: 386 PAYEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPAS 443
AY + R AF A +A+ VS+ DTCY+ +G V +PTVS F GG L + AS
Sbjct: 407 AAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDAS 466
Query: 444 NFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ V A C AFA + G + I+GN Q + +++D VGF P C
Sbjct: 467 GIMYTV-SASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 256 bits (653), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 144/356 (40%), Positives = 202/356 (56%), Gaps = 14/356 (3%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSAS 205
SG G+G Y V +G+G+P +V D+GSD WVQCQPC CY+Q + +FDPA S++
Sbjct: 170 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARSST 229
Query: 206 FSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGC 264
++ VSC++ C L+ GC G C Y V YGDGSY+ G A++TLT+ VK GC
Sbjct: 230 YANVSCAAPACFDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 289
Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR-EALP 323
G +N+G+F AAGLLGLG G SL Q + GG F++CL +R +G +G L FG
Sbjct: 290 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSG-TGYLDFGPGSPAA 348
Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
GA + P+FYYVG++G+ VGG + I + +F G ++D+GT +TRL
Sbjct: 349 AGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATA-----GTIVDSGTVITRL 403
Query: 384 PTPAYEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
P PAY + R AFV+ +A VS+ DTCY+ +G V +PTVS F GG +L +
Sbjct: 404 PPPAYSSLRSAFVSAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAILDVD 463
Query: 442 ASNFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
AS + C FA + G + I+GN Q + +++D VGF P C
Sbjct: 464 ASGIMYAA-SVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 518
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 255 bits (652), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 149/354 (42%), Positives = 204/354 (57%), Gaps = 15/354 (4%)
Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASF 206
G G+G Y V +G+G+P +V D+GSD WVQCQPC CY+Q + +FDPA S+++
Sbjct: 172 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 231
Query: 207 SGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCG 265
+ VSC++ C L+ +GC G C Y V YGDGSY+ G A++TLT+ VK GCG
Sbjct: 232 ANVSCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCG 291
Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
+N G+F AAGLLGLG G SL Q G+ GG F++CL R TG +G L FG + P
Sbjct: 292 ERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPPRSTG-TGYLDFGAGSPPAT 350
Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
L N P+FYYVG++G+ VGG +PI+ +F G ++D+GT +TRLP
Sbjct: 351 TTTPMLTGN--GPTFYYVGMTGIRVGGRLLPIAPSVFAAA-----GTIVDSGTVITRLPP 403
Query: 386 PAYEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPAS 443
AY + R AF A +A+ VS+ DTCY+ +G V +PTVS F GG L + AS
Sbjct: 404 AAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDAS 463
Query: 444 NFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ V A C AFA + G + I+GN Q + +++D VGF P C
Sbjct: 464 GIMYTV-SASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 516
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 254 bits (650), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 152/431 (35%), Positives = 228/431 (52%), Gaps = 35/431 (8%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHAR-MQRDVKRVATLVRRLSGGGADAAKHEVQ 139
LE+ R + S S + + H R +Q +++ R S AD+++ +V
Sbjct: 56 LEMKDRGECSESERKGDWVEKQLVLDGLHVRSIQNHIRK-----RTSSSQIADSSETQV- 109
Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
+ SG+ + Y V +G+GS S +++D+GSD+ WVQC+PC CY Q+ P+F
Sbjct: 110 ----PLTSGIKFQTLNYIVTMGLGSQNMS--VIVDTGSDLTWVQCEPCRSCYNQNGPLFK 163
Query: 200 PADSASFSGVSCSSAVCDRLENAGC-----HAGRCRYEVSYGDGSYTKGTLALETLTIGR 254
P+ S S+ + C+S C LE C + C Y V+YGDGSYT G L +E L G
Sbjct: 164 PSTSPSYQPILCNSTTCQSLELGACGSDPSTSATCDYVVNYGDGSYTSGELGIEKLGFGG 223
Query: 255 TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG-TGSSG 313
V N GCG N+G+F GA+GL+GLG +S++ Q GG FSYCL S G+SG
Sbjct: 224 ISVSNFVFGCGRNNKGLFGGASGLMGLGRSELSMISQTNATFGGVFSYCLPSTDQAGASG 283
Query: 314 SLVFGREA------LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
SLV G ++ P+ A+ ++ N + +FY + L+G+ VGG+ + + F
Sbjct: 284 SLVMGNQSGVFKNVTPI--AYTRMLPNLQLSNFYILNLTGIDVGGVSLHVQASSF----- 336
Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
G+ GV++D+GT ++RL Y+A + F+ Q P A G SI DTC+NL+G+ V +PT
Sbjct: 337 GNGGVILDSGTVISRLAPSVYKALKAKFLEQFSGFPSAPGFSILDTCFNLTGYDQVNIPT 396
Query: 428 VSFYFSGGPVLTLPASN-FLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGA 484
+S YF G L + A+ F + +DA C A A + IIGN QQ ++ +D
Sbjct: 397 ISMYFEGNAELNVDATGIFYLVKEDASRVCLALASLSDEYEMGIIGNYQQRNQRVLYDAK 456
Query: 485 NGFVGFGPNVC 495
VGF C
Sbjct: 457 LSQVGFAKEPC 467
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 254 bits (649), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 150/399 (37%), Positives = 220/399 (55%), Gaps = 22/399 (5%)
Query: 109 HAR-MQRDVKRVATLVRRLSGGG-----ADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGV 162
HA ++RD RV ++ R+++G G D A+ Q G+ G+G Y V +G+
Sbjct: 96 HAEILERDQARVDSIHRKVAGAGGAPSVVDPARASEQGVSLPAQRGISLGTGNYVVSVGL 155
Query: 163 GSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA 222
G+P + ++ D+GSD+ WVQC+PC+ CY+Q DP+FDP+ S++++ V+C + C L+ +
Sbjct: 156 GTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPECQELDAS 215
Query: 223 GCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLG 280
GC + RCRYEV YGD S T G L +TLT+ + + GCG +N G+F GL G
Sbjct: 216 GCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASDTLPGFVFGCGDQNAGLFGQVDGLFG 275
Query: 281 LGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSF 340
LG +SL Q G F+YCL S +G G L G A P A + L + PSF
Sbjct: 276 LGREKVSLPSQGAPSYGPGFTYCLPSSSSG-RGYLSLG-GAPPANAQFTALA-DGATPSF 332
Query: 341 YYVGLSGLGVGG--MRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ 398
YY+ L G+ VGG +RIP + V+D+GT +TRLP AY R AF
Sbjct: 333 YYIDLVGIKVGGRAIRIPATAFAAAGG------TVIDSGTVITRLPPRAYAPLRAAFARS 386
Query: 399 TGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFA 458
+A +SI DTCY+ +G + ++PTV F+GG ++L + L V C A
Sbjct: 387 MAQYKKAPALSILDTCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVLY-VSKVSQACLA 445
Query: 459 FAPSP--SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
FAP+ S ++I+GN QQ+ +++D AN +GFG C
Sbjct: 446 FAPNADDSSIAILGNTQQKTFAVTYDVANQRIGFGAKGC 484
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 254 bits (649), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 156/465 (33%), Positives = 236/465 (50%), Gaps = 41/465 (8%)
Query: 58 NELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHS-----FHARM 112
++ HNNI S S + + R +TT M HR S + +M
Sbjct: 35 KKILSVHNNIWSPKKSYEASS---SCFSRSLGKGRESTTLEMK-HRELCSGKTIDWGKKM 90
Query: 113 QR----DVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRS 168
+R D RV +L R+ + + V + + SG+ + Y V + +G ++
Sbjct: 91 RRALLLDNIRVQSLQLRIKAMTSSTTEQSVSETQIPLTSGIKLETLNYIVTVELGG--KN 148
Query: 169 QYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG- 227
+++D+GSD+ WVQCQPC CY Q P++DP+ S+S+ V C+S+ C L A ++G
Sbjct: 149 MSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATGNSGP 208
Query: 228 ----------RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAG 277
C Y VSYGDGSYT+G LA E++ +G T ++N+ GCG N+G+F GA+G
Sbjct: 209 CGGFNGVVKTTCEYVVSYGDGSYTRGDLASESIVLGDTKLENLVFGCGRNNKGLFGGASG 268
Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE----ALPVGAAWVPLVR 333
L+GLG S+SLV Q G FSYCL S G+SG+L FG + + PLV+
Sbjct: 269 LMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGTLSFGNDFSVYKNSTSVFYTPLVQ 328
Query: 334 NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRD 393
NP+ SFY + L+G +GG+ + T G+++D+GT +TRLP Y+A +
Sbjct: 329 NPQLRSFYILNLTGASIGGVELK--------TLSFGRGILIDSGTVITRLPPSIYKAVKT 380
Query: 394 AFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN-FLIPVDDA 452
F+ Q P A G SI DTC+NL+ + + +PT+ F G L + + F DA
Sbjct: 381 EFLKQFSGFPSAPGYSILDTCFNLTSYEDISIPTIKMIFEGNAELEVDVTGVFYFVKPDA 440
Query: 453 GTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C A A + + IIGN QQ+ ++ +D +G C
Sbjct: 441 SLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIAGENC 485
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 254 bits (649), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 150/399 (37%), Positives = 220/399 (55%), Gaps = 22/399 (5%)
Query: 109 HAR-MQRDVKRVATLVRRLSGGGA-----DAAKHEVQDFGTDVVSGMDQGSGEYFVRIGV 162
HA ++RD RV ++ R+++G G D A+ Q G+ G+G Y V +G+
Sbjct: 96 HAEILERDQARVDSIHRKVAGAGGAPSVVDPARASEQGVSLPAQRGISLGTGNYVVSVGL 155
Query: 163 GSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA 222
G+P + ++ D+GSD+ WVQC+PC+ CY+Q DP+FDP+ S++++ V+C + C L+ +
Sbjct: 156 GTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPECQELDAS 215
Query: 223 GCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLG 280
GC + RCRYEV YGD S T G L +TLT+ + + GCG +N G+F GL G
Sbjct: 216 GCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASDTLPGFVFGCGDQNAGLFGQVDGLFG 275
Query: 281 LGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSF 340
LG +SL Q G F+YCL S +G G L G A P A + L + PSF
Sbjct: 276 LGREKVSLPSQGAPSYGPGFTYCLPSSSSG-RGYLSLG-GAPPANAQFTALA-DGATPSF 332
Query: 341 YYVGLSGLGVGG--MRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ 398
YY+ L G+ VGG +RIP + V+D+GT +TRLP AY R AF
Sbjct: 333 YYIDLVGIKVGGRAIRIPATAFAAAGG------TVIDSGTVITRLPPRAYAPLRAAFARS 386
Query: 399 TGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFA 458
+A +SI DTCY+ +G + ++PTV F+GG ++L + L V C A
Sbjct: 387 MAQYKKAPALSILDTCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVLY-VSKVSQACLA 445
Query: 459 FAPSP--SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
FAP+ S ++I+GN QQ+ +++D AN +GFG C
Sbjct: 446 FAPNADDSSIAILGNTQQKTFAVAYDVANQRIGFGAKGC 484
>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
Length = 501
Score = 254 bits (649), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 178/449 (39%), Positives = 237/449 (52%), Gaps = 57/449 (12%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATL-----VRRLSGGGADAAK 135
L +VHRD + ++ + + R++RD +R + + + G
Sbjct: 76 LRVVHRDDFAVNATAAELLAH---------RLRRDKRRASRISAAAGGAAAANGTRVGGG 126
Query: 136 HEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD 195
F VVSG+ QGSGEYF +IGVG+P MV+D+GSD+VW+QC PC +CY QS
Sbjct: 127 GGGSGFVAPVVSGLAQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSG 186
Query: 196 PVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIG 253
+FDP S S+ V C++ +C RL++ GC R C Y+V+YGDGS T G A ETLT
Sbjct: 187 QMFDPRASHSYGAVDCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFA 246
Query: 254 RTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----- 307
V VA+GCGH N+G+FV AAGLLGLG GS+S Q+ + G +FSYCLV R
Sbjct: 247 SGARVPRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSA 306
Query: 308 -----------GTGSSGSLVFGREAL-PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
G+G+ G+L GR L P G RA + G
Sbjct: 307 SATSRSSTVTFGSGARGAL--GRRVLHPDGEEPQDGDVLLRAAHGHQRRRRARPGRGRVR 364
Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS--------- 406
P + G GV++D+G P+PA+ R RA+
Sbjct: 365 PPPD-----PSTGRGGVIVDSGR-----PSPAWA--RAGRTPPCATRSRAAAAGLRLSPG 412
Query: 407 GVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL 466
G S+FDTCY+LSG V+VPTVS +F+GG LP N+LIPVD GTFCFAFA + G+
Sbjct: 413 GFSLFDTCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGV 472
Query: 467 SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
SIIGNIQQ+G ++ FDG +GF P C
Sbjct: 473 SIIGNIQQQGFRVVFDGDGQRLGFVPKGC 501
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 254 bits (649), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 155/408 (37%), Positives = 219/408 (53%), Gaps = 21/408 (5%)
Query: 100 HYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVR 159
+Y RHQ +R R++ LV R +G ++K G D+ + G+GE+ +
Sbjct: 53 NYSRHQL-LRRAARRSHHRMSRLVARATGVPMTSSKAA---GGGDLQVPVHAGNGEFLMD 108
Query: 160 IGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL 219
+ +G+P + ++D+GSD+VW QC+PC C+KQS PVFDP+ S++++ V CSSA C L
Sbjct: 109 VSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDL 168
Query: 220 ENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM-FVGAAG 277
+ C A +C Y +YGD S T+G LA ET T+ ++ + V GCG N+G F AG
Sbjct: 169 PTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAG 228
Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-------LPVGAAWVP 330
L+GLG G +SLV QLG FSYCL S ++ L+ G A P
Sbjct: 229 LVGLGRGPLSLVSQLGLDK---FSYCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTP 285
Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
L++NP PSFYYV L + VG RI + F + G GV++D+GT++T L Y A
Sbjct: 286 LIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRA 345
Query: 391 FRDAFVAQTGNLPRASGVSI-FDTCYN--LSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
+ AF AQ LP A G + D C+ G V VP + F+F GG L LPA N+++
Sbjct: 346 LKKAFAAQMA-LPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMV 404
Query: 448 PVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+G C S GLSIIGN QQ+ Q +D + + F P C
Sbjct: 405 LDGGSGALCLTVMGS-RGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQC 451
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 254 bits (648), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 147/391 (37%), Positives = 219/391 (56%), Gaps = 18/391 (4%)
Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
R+Q +KR + RL+ A+ + ++ S + G+GE+ + + +G+PP +
Sbjct: 61 RIQHGIKRANHRLERLNAMVLAASSN------AEINSPVLSGNGEFLMNLAIGTPPETYS 114
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
++D+GSD++W QC+PC+QC+ Q P+FDP S+SFS +SCSS +C L + C + C
Sbjct: 115 AIMDTGSDLIWTQCKPCTQCFDQPSPIFDPKKSSSFSKLSCSSQLCKALPQSSC-SDSCE 173
Query: 231 YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLV 289
Y +YGD S T+GT+A ET T G+ + NV GCG N+G F +GL+GLG G +SLV
Sbjct: 174 YLYTYGDYSSTQGTMATETFTFGKVSIPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLSLV 233
Query: 290 GQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA----WVPLVRNPRAPSFYYVGL 345
QL FSYCL S + +L+ G A G + PL++NP PSFYY+ L
Sbjct: 234 SQL---KEAKFSYCLTSIDDTKTSTLLMGSLASVNGTSAAIRTTPLIQNPLQPSFYYLSL 290
Query: 346 SGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA 405
G+ VGG R+PI E F+L G G+++D+GT +T L A++ + F +Q G
Sbjct: 291 EGISVGGTRLPIKESTFQLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLPVDN 350
Query: 406 SGVSIFDTCYNLSGFVS-VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS 464
SG + + CYNL S + VP + +F+G L LP N++I G C A S
Sbjct: 351 SGATGLELCYNLPSDTSELEVPKLVLHFTGAD-LELPGENYMIADSSMGVICLAMG-SSG 408
Query: 465 GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
G+SI GN+QQ+ + +S D + F P C
Sbjct: 409 GMSIFGNVQQQNMFVSHDLEKETLSFLPTNC 439
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 144/354 (40%), Positives = 201/354 (56%), Gaps = 14/354 (3%)
Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSASF 206
G G+G Y V +G+G+P +V D+GSD WVQCQPC CY+Q + +FDPA S+++
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTY 230
Query: 207 SGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCG 265
+ VSC++ C L+ GC G C Y V YGDGSY+ G A++TLT+ VK GCG
Sbjct: 231 ANVSCAAPACSDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCG 290
Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
+N+G+F AAGLLGLG G SL Q + GG F++CL +R TG +G L FG +
Sbjct: 291 ERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTG-TGYLDFGAGSPAAR 349
Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
P++ + P+FYYVGL+G+ VGG + I + +F G ++D+GT +TRLP
Sbjct: 350 LTTTPMLVD-NGPTFYYVGLTGIRVGGRLLYIPQSVFATA-----GTIVDSGTVITRLPP 403
Query: 386 PAYEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPAS 443
AY + R AF A +A VS+ DTCY+ +G V +PTVS F GG L + AS
Sbjct: 404 AAYSSLRSAFAAAMSARGYKKAPAVSLLDTCYDFAGMSQVAIPTVSLLFQGGARLDVDAS 463
Query: 444 NFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ A C AFA + G + I+GN Q + +++D V F P C
Sbjct: 464 GIMYAA-SASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVSFSPGAC 516
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 155/408 (37%), Positives = 219/408 (53%), Gaps = 21/408 (5%)
Query: 100 HYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVR 159
+Y RHQ +R R++ LV R +G ++K G D+ + G+GE+ +
Sbjct: 43 NYSRHQL-LRRAARRSHHRMSRLVARATGVPMTSSKAA---GGGDLQVPVHAGNGEFLMD 98
Query: 160 IGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL 219
+ +G+P + ++D+GSD+VW QC+PC C+KQS PVFDP+ S++++ V CSSA C L
Sbjct: 99 VSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDL 158
Query: 220 ENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM-FVGAAG 277
+ C A +C Y +YGD S T+G LA ET T+ ++ + V GCG N+G F AG
Sbjct: 159 PTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAG 218
Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-------LPVGAAWVP 330
L+GLG G +SLV QLG FSYCL S ++ L+ G A P
Sbjct: 219 LVGLGRGPLSLVSQLGLDK---FSYCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTP 275
Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
L++NP PSFYYV L + VG RI + F + G GV++D+GT++T L Y A
Sbjct: 276 LIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRA 335
Query: 391 FRDAFVAQTGNLPRASGVSI-FDTCYN--LSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
+ AF AQ LP A G + D C+ G V VP + F+F GG L LPA N+++
Sbjct: 336 LKKAFAAQMA-LPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMV 394
Query: 448 PVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+G C S GLSIIGN QQ+ Q +D + + F P C
Sbjct: 395 LDGGSGALCLTVMGS-RGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQC 441
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 149/421 (35%), Positives = 229/421 (54%), Gaps = 42/421 (9%)
Query: 100 HYHRHQHSFHARMQRD-------VKRVATLVRRLSGGGADAAKHEVQDFGTDV--VSGMD 150
H + ++ R+Q+ V+ + +RR+ + H V+ T + SG++
Sbjct: 6 HCSEKKIDWNRRLQKQLISDDLRVRSMQNRIRRV------VSSHNVEASQTQIPLSSGIN 59
Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
+ Y V +G+GS + ++ID+GSD+ WVQC+PC CY Q P+F P+ S+S+ VS
Sbjct: 60 LQTLNYIVTMGLGSTNMT--VIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVS 117
Query: 211 CSSAVCDRLENA-------GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIG 263
C+S+ C L+ A G + C Y V+YGDGSYT G L +E L+ G V + G
Sbjct: 118 CNSSTCQSLQFATGNTGACGSNPSTCNYVVNYGDGSYTNGELGVEQLSFGGVSVSDFVFG 177
Query: 264 CGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-- 321
CG N+G+F G +GL+GLG +SLV Q GG FSYCL + +G+SGSLV G E+
Sbjct: 178 CGRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTESGASGSLVMGNESSV 237
Query: 322 ----LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
P+ + ++ NP+ +FY + L+G+ V G+ + ++ G+ GV++D+G
Sbjct: 238 FKNVTPI--TYTRMLPNPQLSNFYILNLTGIDVDGVAL-------QVPSFGNGGVLIDSG 288
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPV 437
T +TRLP+ Y+A + F+ Q P A G SI DTC+NL+G+ V +PT+S +F G
Sbjct: 289 TVITRLPSSVYKALKALFLKQFTGFPSAPGFSILDTCFNLTGYDEVSIPTISMHFEGNAE 348
Query: 438 LTLPAS-NFLIPVDDAGTFCFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNV 494
L + A+ F + +DA C A A +IIGN QQ ++ +D VGF
Sbjct: 349 LKVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKVGFAEES 408
Query: 495 C 495
C
Sbjct: 409 C 409
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 179/506 (35%), Positives = 260/506 (51%), Gaps = 55/506 (10%)
Query: 9 LLKQVLLLHLLCS----IITTSTSAASDTHFQILNVNESIKGSRTDHAKMSQYNELFERH 64
++++ LLL L+C+ + S AA + ++ + S T S + + +R
Sbjct: 5 VVRRALLLSLICAGALGFLPCSHGAAVAPGYVTVSAAR-FRPSST----CSSLDPVAQRR 59
Query: 65 NNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVR 124
N +S+ L L H+ + S ++ S ++ D +R ++R
Sbjct: 60 RNGTSAV---------LRLTHKHGPCAPSRASS-----LATPSVADTLRADQRRAEYILR 105
Query: 125 RLSGGGADAAKHEVQDFGTDVVS---GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVW 181
R+SG G + T V G + G+ Y V + +G+P +Q + +D+GSD+ W
Sbjct: 106 RVSGRGTPQLWDSKAEAATATVPANWGFNIGTLNYVVTVSLGTPGVAQTLEVDTGSDLSW 165
Query: 182 VQCQPCSQ--CYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENAGCHAGRCRYEVSYGD 237
VQC PC+ CY Q DP+FDPA S+S++ V C VC L + C A +C Y VSYGD
Sbjct: 166 VQCTPCAAPACYSQKDPLFDPAQSSSYAAVPCGGPVCGGLGIYASSCSAAQCGYVVSYGD 225
Query: 238 GSYTKGTLALETLTIG-RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT 296
GS T G + +TLT+ V+ GCGH G F G GLLGLG SLV Q G
Sbjct: 226 GSKTTGVYSSDTLTLSPNDAVRGFFFGCGHAQSG-FTGNDGLLGLGREEASLVEQTAGTY 284
Query: 297 GGAFSYCLVSRGTGSSGSLVFGRE--ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMR 354
GG FSYCL +R + ++G L G A P G + L+ +P A ++Y V L+G+ VGG +
Sbjct: 285 GGVFSYCLPTRPS-TTGYLTLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQ 343
Query: 355 IPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL--PRASGVSIFD 412
+ + +F G V+DTGT +TRLP AY A R AF + + P A I D
Sbjct: 344 LSVPSSVFA------GGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPSAPATGILD 397
Query: 413 TCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF-CFAFAPSPS--GLSII 469
TCYN SG+ +V +P V+ FSGG +TL A L +F C AFAPS S G++I+
Sbjct: 398 TCYNFSGYGTVTLPNVALTFSGGATVTLGADGIL-------SFGCLAFAPSGSDGGMAIL 450
Query: 470 GNIQQEGIQISFDGANGFVGFGPNVC 495
GN+QQ ++ DG + VGF P+ C
Sbjct: 451 GNVQQRSFEVRIDGTS--VGFKPSSC 474
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 147/355 (41%), Positives = 203/355 (57%), Gaps = 13/355 (3%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSAS 205
SG+ +G Y V I +G+P +V D+GSD WVQCQPC + CY+Q +P+F P SA+
Sbjct: 156 SGLSLNTGNYVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQKEPLFTPTKSAT 215
Query: 206 FSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCG 265
++ +SC+S+ C L+ GC G C Y V YGDGSYT G A +TLT+G VK+ GCG
Sbjct: 216 YANISCTSSYCSDLDTRGCSGGHCLYAVQYGDGSYTVGFYAQDTLTLGYDTVKDFRFGCG 275
Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
KN+G+F AAGL+GLG G S+ Q + G F+YC+ + +G +G L FG A
Sbjct: 276 EKNRGLFGKAAGLMGLGRGKTSVPVQAYDKYSGVFAYCIPATSSG-TGFLDFGPGAPAAA 334
Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
A + + P+FYYVG++G+ VGG + I +F D G ++D+GT +TRLP
Sbjct: 335 NARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVF-----SDAGALVDSGTVITRLPP 389
Query: 386 PAYEAFRDAFVAQTGNL--PRASGVSIFDTCYNLSGFV-SVRVPTVSFYFSGGPVLTLPA 442
AYE R AF L A SI DTCY+L+G+ S+ +P VS F GG L + A
Sbjct: 390 SAYEPLRSAFAKGMEGLGYKTAPAFSILDTCYDLTGYQGSIALPAVSLVFQGGACLDVDA 449
Query: 443 SNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
S L V D C AFA + + ++I+GN QQ+ + +D VGF P C
Sbjct: 450 SGILY-VADVSQACLAFAANDDDTDMTIVGNTQQKTYSVLYDLGKKVVGFAPGAC 503
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 138/352 (39%), Positives = 201/352 (57%), Gaps = 14/352 (3%)
Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASF 206
G+ G+ Y + +G G+P ++Q ++ D+GS++ W+QC+PC CY Q +P+FDP S+++
Sbjct: 8 GLYIGTANYVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFDPTLSSTY 67
Query: 207 SGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCG 265
+SC+SA C L + GC C Y V+YGDGS T G LA ET T+ V N GCG
Sbjct: 68 RNISCTSAACTGLSSRGCSGSTCVYGVTYGDGSSTVGFLATETFTLAAGNVFNNFIFGCG 127
Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
NQG+F GAAGL+GLG SL QL G FSYCL S + ++G L G G
Sbjct: 128 QNNQGLFTGAAGLIGLGRSPYSLNSQLATSLGNIFSYCLPSTSS-ATGYLNIGNPLRTPG 186
Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
+ ++ N RAP+ Y++ L G+ VGG R+ +S +F+ G ++D+GT +TRLP
Sbjct: 187 --YTAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQ-----SVGTIIDSGTVITRLPP 239
Query: 386 PAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNF 445
AY A R AF A RA+ SI DTCY+ S +V PT+ +++G V T+P +
Sbjct: 240 TAYGALRTAFRAAMTQYTRAAAASILDTCYDFSRTTTVTFPTIKLHYTGLDV-TIPGAGV 298
Query: 446 LIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
V + C AFA + + IIGN+QQ +++++D A +GF C
Sbjct: 299 FY-VISSSQVCLAFAGNSDSTQIGIIGNVQQRTMEVTYDNALKRIGFAAGAC 349
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 155/431 (35%), Positives = 230/431 (53%), Gaps = 22/431 (5%)
Query: 76 EARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSG--GGADA 133
+ + +LE+VH+ S N + S + M D +RV + RLS GG +
Sbjct: 62 KRKASLEVVHKHGPCSQLNHSGKA---EATISHNDIMNLDNERVKYIQSRLSKNLGGENR 118
Query: 134 AKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYK 192
K E+ SG GS +Y+V +G+G+P R ++ D+GS + W QC+PC+ CYK
Sbjct: 119 VK-ELDSTTLPAKSGRLIGSADYYVVVGLGTPKRDLSLIFDTGSYLTWTQCEPCAGSCYK 177
Query: 193 QSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALET 249
Q DP+FDP+ S+S++ + C+S++C + +AGC + C Y+V YGD S ++G L+ E
Sbjct: 178 QQDPIFDPSKSSSYTNIKCTSSLCTQFRSAGCSSSTDASCIYDVKYGDNSISRGFLSQER 237
Query: 250 LTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG 308
LTI T +V + GCG N+G+F G AGL+GL +S V Q FSYCL S
Sbjct: 238 LTITATDIVHDFLFGCGQDNEGLFRGTAGLMGLSRHPISFVQQTSSIYNKIFSYCLPSTP 297
Query: 309 TGSSGSLVFGREALP-VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQ 366
+ S G L FG A + P SFY + + G+ VGG ++P +S F
Sbjct: 298 S-SLGHLTFGASAATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSSSTFSA-- 354
Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
G ++D+GT +TRLP AY A R AF P A G + DTCY+ SG+ + VP
Sbjct: 355 ---GGSIIDSGTVITRLPPTAYAALRSAFRQFMMKYPVAYGTRLLDTCYDFSGYKEISVP 411
Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISFDGA 484
+ F F+GG + LP L + A C AFA + +G ++I GN+QQ+ +++ +D
Sbjct: 412 RIDFEFAGGVKVELPLVGILYG-ESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVE 470
Query: 485 NGFVGFGPNVC 495
G +GFG C
Sbjct: 471 GGRIGFGAAGC 481
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 146/388 (37%), Positives = 207/388 (53%), Gaps = 17/388 (4%)
Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
R+QR VKR ++RLS A F V + + G+GE+ + + +G+P +
Sbjct: 60 RLQRAVKRGRLRLQRLSAKTAS--------FEPSVEAPVHAGNGEFLMNLAIGTPAETYS 111
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
++D+GSD++W QC+PC C+ Q P+FDP S+SFS + CSS +C L + C G C
Sbjct: 112 AIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLCVALPISSCSDG-CE 170
Query: 231 YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG-MFVGAAGLLGLGGGSMSLV 289
Y SYGD S T+G LA ET T G V + GCG N+G + AGL+GLG G +SL+
Sbjct: 171 YRYSYGDHSSTQGVLATETFTFGDASVSKIGFGCGEDNRGRAYSQGAGLVGLGRGPLSLI 230
Query: 290 GQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGL 348
QLG FSYCL S + +L+ G EA A PL++NP PSFYY+ L G+
Sbjct: 231 SQLGVP---KFSYCLTSIDDSKGISTLLVGSEATVKSAIPTPLIQNPSRPSFYYLSLEGI 287
Query: 349 GVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV 408
VG +PI + F + G G+++D+GT +T L A+ A + F++Q ASG
Sbjct: 288 SVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDNAFAALKKEFISQMKLDVDASGS 347
Query: 409 SIFDTCYNLSGFVS-VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS 467
+ + C+ L S V VP + F+F G L LP N++I C S SG+S
Sbjct: 348 TELELCFTLPPDGSPVEVPQLVFHFEGVD-LKLPKENYIIEDSALRVICLTMG-SSSGMS 405
Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
I GN QQ+ I + D + F P C
Sbjct: 406 IFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 156/418 (37%), Positives = 222/418 (53%), Gaps = 26/418 (6%)
Query: 82 ELVHRDKMSSS--SNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
EL+HR+ SS SNT+ F A ++R +R A L + + G +
Sbjct: 21 ELIHREHPSSPLRSNTSKTT-----TEIFLAAVKRGAERRAQLSKHILAEG--------R 67
Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
F T V SG +GEY + I GSPP+ +++D+GSD++W QC PC C + +FD
Sbjct: 68 LFSTPVASG----NGEYLIDISFGSPPQKASVIVDTGSDLIWTQCLPCETCNAAASVIFD 123
Query: 200 PADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKN 259
P S+++ VSC+S C L C C+Y+ YGDGS T G L+ ET+T+G + N
Sbjct: 124 PVKSSTYDTVSCASNFCSSLPFQSCTT-SCKYDYMYGDGSSTSGALSTETVTVGTGTIPN 182
Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR 319
VA GCGH N G F GAAG++GLG G +SL+ Q T FSYCLV G+ + ++ G
Sbjct: 183 VAFGCGHTNLGSFAGAAGIVGLGQGPLSLISQASSITSKKFSYCLVPLGSTKTSPMLIGD 242
Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
A G A+ L+ N P+FYY L+G+ V G + F + G G ++D+GT
Sbjct: 243 SAAAGGVAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGFILDSGTT 302
Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF--DTCYNLSGFVSVRVPTVSFYFSGGPV 437
+T L T A+ A A A+ P A G S++ D C++ +G + PT++F+F G
Sbjct: 303 LTYLETGAFNALVAALKAEV-PFPEADG-SLYGLDYCFSTAGVANPTYPTMTFHFKGAD- 359
Query: 438 LTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
LP N + +D G+ C A A S +G SI+GNIQQ+ I D N VGF C
Sbjct: 360 YELPPENVFVALDTGGSICLAMAAS-TGFSIMGNIQQQNHLIVHDLVNQRVGFKEANC 416
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 183/463 (39%), Positives = 254/463 (54%), Gaps = 38/463 (8%)
Query: 57 YNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDV 116
Y+ +N S S++S+ ++ L+HRD + ++ + R+QRD
Sbjct: 46 YSAPAAADDNFSVSSSSA----LHIHLLHRDSFAVNATAAELLAR---------RLQRDE 92
Query: 117 KRVATLVRRLSGGGADAAKHEV---QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
R A ++ + + G + + VVS SGEY +I VG+P + +
Sbjct: 93 LRAAWIISKAAANGTPPPVVGLSTGRGLVAPVVSRAPT-SGEYMAKIAVGTPAVQALLAL 151
Query: 174 DSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG---CHAGRCR 230
D+ SD+ W+QCQPC +CY QS PVFDP S S+ ++ + C L +G G C
Sbjct: 152 DTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPDCQALGRSGGGDAKRGTCI 211
Query: 231 YEVSYGDG----SYTKGTLALETLTIGRTVVKN-VAIGCGHKNQGMF-VGAAGLLGLGGG 284
Y V YGDG S + G L ETLT V + ++IGCGH N+G+F AAG+LGLG G
Sbjct: 212 YTVQYGDGHGSTSTSVGDLVEETLTFAGGVRQAYLSIGCGHDNKGLFGAPAAGILGLGRG 271
Query: 285 SMSLVGQLGGQ-TGGAFSYCLVS--RGTGS-SGSLVFGREALPVG--AAWVPLVRNPRAP 338
+S+ Q+ +FSYCLV G GS S +L FG A+ A++ P V N P
Sbjct: 272 QISIPHQIAFLGYNASFSYCLVDFISGPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMP 331
Query: 339 SFYYVGLSGLGVGGMRIP--ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV 396
+FYYV L G+ VGG+R+P DL G GV++D+GT VTRL PAY AFRDAF
Sbjct: 332 TFYYVRLIGVSVGGVRVPGVTERDLQLDPYTGRGGVILDSGTTVTRLARPAYVAFRDAFR 391
Query: 397 AQTGNLPRAS--GVS-IFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
A +L + S G S +FDTCY + G V+VP VS +F+GG ++L N+LIPVD G
Sbjct: 392 AAATSLGQVSTGGPSGLFDTCYTVGGRAGVKVPAVSMHFAGGVEVSLQPKNYLIPVDSRG 451
Query: 454 TFCFAFAPS-PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
T CFAFA + +S+IGNI Q+G ++ +D A VGF PN C
Sbjct: 452 TVCFAFAGTGDRSVSVIGNILQQGFRVVYDLAGQRVGFAPNNC 494
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 252 bits (643), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 146/388 (37%), Positives = 207/388 (53%), Gaps = 17/388 (4%)
Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
R+QR VKR ++RLS A F V + + G+GE+ + + +G+P +
Sbjct: 60 RLQRAVKRGRLRLQRLSAKTAS--------FEPSVEAPVHAGNGEFLMNLAIGTPAETYS 111
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
++D+GSD++W QC+PC C+ Q P+FDP S+SFS + CSS +C L + C G C
Sbjct: 112 AIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLCVALPISSCSDG-CE 170
Query: 231 YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG-MFVGAAGLLGLGGGSMSLV 289
Y SYGD S T+G LA ET T G V + GCG N+G + AGL+GLG G +SL+
Sbjct: 171 YRYSYGDHSSTQGVLATETFTFGDASVSKIGFGCGEDNRGRAYSQGAGLVGLGRGPLSLI 230
Query: 290 GQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGL 348
QLG FSYCL S + +L+ G EA A PL++NP PSFYY+ L G+
Sbjct: 231 SQLGVP---KFSYCLTSIDDSKGISTLLVGSEATVKSAIPTPLIQNPSRPSFYYLSLEGI 287
Query: 349 GVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV 408
VG +PI + F + G G+++D+GT +T L A+ A + F++Q ASG
Sbjct: 288 SVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDSAFAALKKEFISQMKLDVDASGS 347
Query: 409 SIFDTCYNLSGFVS-VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS 467
+ + C+ L S V VP + F+F G L LP N++I C S SG+S
Sbjct: 348 TELELCFTLPPDGSPVDVPQLVFHFEGVD-LKLPKENYIIEDSALRVICLTMG-SSSGMS 405
Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
I GN QQ+ I + D + F P C
Sbjct: 406 IFGNFQQQNIVVLHDLEKETISFAPAQC 433
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 251 bits (642), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 153/392 (39%), Positives = 222/392 (56%), Gaps = 20/392 (5%)
Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
R++ VKR ++RL A V +++ + + G+GE+ +++ +G+PP +
Sbjct: 58 RIRHGVKRGRNRLQRLQ------AMALVASSSSEIEAPVLPGNGEFLMKLAIGTPPETYS 111
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
++D+GSD++W QC+PC+QC+ QS P+FDP S+SFS +SCSS +C+ L + C+ G C
Sbjct: 112 AILDTGSDLIWTQCKPCTQCFHQSTPIFDPKKSSSFSKLSCSSQLCEALPQSSCNNG-CE 170
Query: 231 YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLV 289
Y SYGD S T+G LA ETLT G+ V NVA GCG N+G F AGL+GLG G +SLV
Sbjct: 171 YLYSYGDYSSTQGILASETLTFGKASVPNVAFGCGADNEGSGFSQGAGLVGLGRGPLSLV 230
Query: 290 GQLGGQTGGAFSYCLVSRGTGSSGSLVFGR----EALPVGAAWVPLVRNPRAPSFYYVGL 345
QL FSYCL + + +L+ G A PL+ +P PSFYY+ L
Sbjct: 231 SQLKEP---KFSYCLTTVDDTKTSTLLMGSLASVNASSSAIKTTPLIHSPAHPSFYYLSL 287
Query: 346 SGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP-R 404
G+ VG R+PI + F L G G+++D+GT +T L A+ F A+ NLP
Sbjct: 288 EGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTTITYLEESAFNLVAKEFTAKI-NLPVD 346
Query: 405 ASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP 463
+SG + D C+ L SG ++ VP + F+F G L LPA N++I G C A S
Sbjct: 347 SSGSTGLDVCFTLPSGSTNIEVPKLVFHFDGAD-LELPAENYMIGDSSMGVACLAMG-SS 404
Query: 464 SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
SG+SI GN+QQ+ + + D + F P C
Sbjct: 405 SGMSIFGNVQQQNMLVLHDLEKETLSFLPTQC 436
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 251 bits (641), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 140/355 (39%), Positives = 205/355 (57%), Gaps = 17/355 (4%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
GEY + +G+P R +++D+GSD+ WVQC PC +CY Q+D +F P S SF+ ++C S
Sbjct: 11 GEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQNDALFLPNTSTSFTKLACGS 70
Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-----RTVVKNVAIGCGHKN 268
A+C+ L C+ C Y SYGDGS T G +T+T+ + V N A GCGH N
Sbjct: 71 ALCNGLPFPMCNQTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQVPNFAFGCGHDN 130
Query: 269 QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--RGTGSSGSLVFGREALPV-- 324
+G F GA G+LGLG G +S QL G FSYCLV + L+FG A+P+
Sbjct: 131 EGSFAGADGILGLGQGPLSFHSQLKSVYNGKFSYCLVDWLAPPTQTSPLLFGDAAVPILP 190
Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
++P++ NP+ P++YYV L+G+ VG + IS +F + +G G + D+GT VT+L
Sbjct: 191 DVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAGTIFDSGTTVTQLA 250
Query: 385 TPAYEAFRDAFVAQTGNLPRA-SGVSIFDTCYNLSGFVSVRVPTV---SFYFSGGPVLTL 440
AY+ A A T R +S D C LSGF ++PTV +F+F GG + L
Sbjct: 251 EAAYKEVLAAMNASTMAYSRKIDDISRLDLC--LSGFPKDQLPTVPAMTFHFEGGD-MVL 307
Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
P SN+ I ++ + ++CFA SP ++IIG++QQ+ Q+ +D A +GF P C
Sbjct: 308 PPSNYFIYLESSQSYCFAMTSSPD-VNIIGSVQQQNFQVYYDTAGRKLGFVPKDC 361
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 251 bits (641), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 142/356 (39%), Positives = 197/356 (55%), Gaps = 17/356 (4%)
Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC 211
G+GE+ + + +G+P + ++D+GSD+VW QC+PC C+KQS PVFDP+ S++++ V C
Sbjct: 70 GNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPC 129
Query: 212 SSAVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
SSA C L + C A +C Y +YGD S T+G LA ET T+ ++ + V GCG N+G
Sbjct: 130 SSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDTNEG 189
Query: 271 M-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-------L 322
F AGL+GLG G +SLV QLG FSYCL S ++ L+ G A
Sbjct: 190 DGFSQGAGLVGLGRGPLSLVSQLGLD---KFSYCLTSLDDTNNSPLLLGSLAGISEASAA 246
Query: 323 PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
PL++NP PSFYYV L + VG RI + F + G GV++D+GT++T
Sbjct: 247 ASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITY 306
Query: 383 LPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYN--LSGFVSVRVPTVSFYFSGGPVLT 439
L Y A + AF AQ LP A G + D C+ G V VP + F+F GG L
Sbjct: 307 LEVQGYRALKKAFAAQMA-LPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLD 365
Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
LPA N+++ +G C S GLSIIGN QQ+ Q +D + + F P C
Sbjct: 366 LPAENYMVLDGGSGALCLTVMGS-RGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQC 420
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 251 bits (640), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 147/354 (41%), Positives = 202/354 (57%), Gaps = 20/354 (5%)
Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC 211
G GEY + + +G+P S ++D+GSD++W QC+PC+QC+ Q P+F+P DS+SFS + C
Sbjct: 92 GDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPC 151
Query: 212 SSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
S C L + C+ C+Y YGDGS T+G +A ET T + V N+A GCG NQG
Sbjct: 152 ESQYCQDLPSETCNNNECQYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCGEDNQGF 211
Query: 272 FVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA--LPVGAAW 328
G AGL+G+G G +SL QLG G FSYC+ S G+ S +L G A +P G+
Sbjct: 212 GQGNGAGLIGMGWGPLSLPSQLG---VGQFSYCMTSYGSSSPSTLALGSAASGVPEGSPS 268
Query: 329 VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
L+ + P++YY+ L G+ VGG + I F+L G G+++D+GT +T LP AY
Sbjct: 269 TTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAY 328
Query: 389 EAFRDAFVAQTGNLP----RASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPAS 443
A AF Q NLP +SG+S TC+ S +V+VP +S F GG VL L
Sbjct: 329 NAVAQAFTDQI-NLPTVDESSSGLS---TCFQQPSDGSTVQVPEISMQFDGG-VLNLGEQ 383
Query: 444 NFLI-PVDDAGTFCFAFAPSPS-GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
N LI P + G C A S G+SI GNIQQ+ Q+ +D N V F P C
Sbjct: 384 NILISPAE--GVICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 251 bits (640), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 139/408 (34%), Positives = 215/408 (52%), Gaps = 30/408 (7%)
Query: 108 FHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPR 167
F R+ D V +L H++ D + SG + Y V +G+G +
Sbjct: 97 FQNRIILDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGG--Q 154
Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG 227
+ +++D+GSD+ WVQC PC CY Q +P+F+P++S+SF + C+S C L+ +G
Sbjct: 155 NSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSG 214
Query: 228 --------RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLL 279
C Y++ YGDGSY++G L E LT+G+T + N GCG N+G+F GA+GL+
Sbjct: 215 LCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLM 274
Query: 280 GLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG-------REALPVGAAWVPLV 332
GL +SLV Q G FSYCL + G GSSGSL G + P+ ++ ++
Sbjct: 275 GLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPI--SYTRMI 332
Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV--VMDTGTAVTRLPTPAYEA 390
+NP+ +FY++ L+G+ +GG+ + + RL+ ++GV ++D+GT +TRL Y+A
Sbjct: 333 QNPQMSNFYFLNLTGISIGGVNLNVP----RLSS--NEGVLSLLDSGTVITRLSPSIYKA 386
Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN-FLIPV 449
F+ F Q G SI +TC+NL+G+ V +PTV F F G + + F
Sbjct: 387 FKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVK 446
Query: 450 DDAGTFCFAFAP--SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
DA C AFA IIGN QQ+ ++ ++ VGF C
Sbjct: 447 SDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPC 494
>gi|20975624|emb|CAD31717.1| putative nucleoid DNA-binding protein [Cicer arietinum]
Length = 144
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 117/144 (81%), Positives = 132/144 (91%)
Query: 352 GMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF 411
G+R+PISED+FRL ++G+ GVVMDTGTAVTRLPT AY+AFRDAF+ QT NLPR+S VSIF
Sbjct: 1 GVRVPISEDVFRLNELGEGGVVMDTGTAVTRLPTAAYDAFRDAFIGQTTNLPRSSDVSIF 60
Query: 412 DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGN 471
DTCY+L GFVSVRVPT+SFYF GGP+LTLPA NFLIPV+D GTFCFAFAPSPSGLSIIGN
Sbjct: 61 DTCYDLYGFVSVRVPTISFYFLGGPILTLPARNFLIPVNDVGTFCFAFAPSPSGLSIIGN 120
Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
IQQEGI+IS DG NGFVGFGPN+C
Sbjct: 121 IQQEGIEISVDGVNGFVGFGPNIC 144
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 139/411 (33%), Positives = 216/411 (52%), Gaps = 30/411 (7%)
Query: 105 QHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGS 164
+ F R+ D V +L H++ D + SG + Y V +G+G
Sbjct: 15 EKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGG 74
Query: 165 PPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC 224
++ +++D+GSD+ WVQC PC CY Q +P+F+P++S+SF + C+S C L+
Sbjct: 75 --QNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAG 132
Query: 225 HAG--------RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAA 276
+G C Y++ YGDGSY++G L E LT+G+T + N GCG N+G+F GA+
Sbjct: 133 SSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDNFIFGCGRNNKGLFGGAS 192
Query: 277 GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG-------REALPVGAAWV 329
GL+GL +SLV Q G FSYCL + G GSSGSL G + P+ ++
Sbjct: 193 GLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPI--SYT 250
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV--VMDTGTAVTRLPTPA 387
+++NP+ +FY++ L+G+ +GG+ + + RL+ ++GV ++D+GT +TRL
Sbjct: 251 RMIQNPQMSNFYFLNLTGISIGGVNLNVP----RLSS--NEGVLSLLDSGTVITRLSPSI 304
Query: 388 YEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN-FL 446
Y+AF+ F Q G SI +TC+NL+G+ V +PTV F F G + + F
Sbjct: 305 YKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFY 364
Query: 447 IPVDDAGTFCFAFAP--SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
DA C AFA IIGN QQ+ ++ ++ VGF C
Sbjct: 365 FVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPC 415
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 138/366 (37%), Positives = 210/366 (57%), Gaps = 26/366 (7%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSAS 205
SG+ G+G Y V +G+G+P + ++ D+GSD+ W QCQPC + CY Q P+FDP+ S +
Sbjct: 145 SGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSTSKT 204
Query: 206 FSGVSCSSAVCDRLENA-----GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKN 259
+S +SC+SA C L++A GC + C Y + YGD S+T G A + LT+ + V
Sbjct: 205 YSNISCTSAACSSLKSATGNSPGCSSSNCVYGIQYGDSSFTIGFFAKDKLTLTQNDVFDG 264
Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL-VSRGTGSSGSLVFG 318
GCG N+G+F AGL+GLG +S+V Q + G FSYCL SRG S+G L FG
Sbjct: 265 FMFGCGQNNKGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRG--SNGHLTFG 322
Query: 319 R-------EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
+A+ G + P + + ++Y++ + G+ VGG + IS LF+ + G
Sbjct: 323 NGNGVKASKAVKNGITFTPFASS-QGTAYYFIDVLGISVGGKALSISPMLFQ-----NAG 376
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
++D+GT +TRLP+ AY + + AF P A +S+ DTCY+LS + S+ +P +SF
Sbjct: 377 TIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKISFN 436
Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVG 489
F+G + L + LI + A C AFA + + I GNIQQ+ +++ +D A G +G
Sbjct: 437 FNGNANVELDPNGILI-TNGASQVCLAFAGNGDDDSIGIFGNIQQQTLEVVYDVAGGQLG 495
Query: 490 FGPNVC 495
FG C
Sbjct: 496 FGYKGC 501
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 249 bits (635), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 141/357 (39%), Positives = 203/357 (56%), Gaps = 16/357 (4%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSAS 205
SG G+G Y V +G+G+P +V D+GSD WVQCQPC CY+Q + +FDPA S++
Sbjct: 171 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST 230
Query: 206 FSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGC 264
++ +SC++ C L+ GC G C Y V YGDGSY+ G A++TLT+ VK GC
Sbjct: 231 YANISCAAPACSDLDTRGCSGGNCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 290
Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
G +N+G+F AAGLLGLG G SL Q + GG F++CL +R +G +G L FG +
Sbjct: 291 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSG-TGYLDFGPGSPAA 349
Query: 325 GAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
A + P++ + P+FYYVG++G+ VGG + I + +F G ++D+GT +TR
Sbjct: 350 AGARLTTPMLTD-NGPTFYYVGMTGIRVGGQLLSIPQSVFTTA-----GTIVDSGTVITR 403
Query: 383 LPTPAYEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
LP AY + R AF + +A VS+ DTCY+ +G V +PTVS F GG L +
Sbjct: 404 LPPAAYSSLRSAFASAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDV 463
Query: 441 PASNFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
AS + C FA + G + I+GN Q + +++D VGF P C
Sbjct: 464 DASGIMYAA-SVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 248 bits (634), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 154/428 (35%), Positives = 226/428 (52%), Gaps = 21/428 (4%)
Query: 77 ARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGA-DAAK 135
+R + +VHR S ++ H+ A D R ++ RR+S K
Sbjct: 85 SRTRMPIVHRHGPCSPLADAHDGKLPSHEEILAA----DQNRAKSIQRRVSTTTTVSRGK 140
Query: 136 HEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQS 194
+ SG G+G Y V IG+G+P +V D+GSD WVQC+PC CYKQ
Sbjct: 141 PKRNRPSLPASSGSALGTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQ 200
Query: 195 DPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR 254
+ +FDPA S++++ +SC++ C L GC G C Y V YGDGSY+ G A++TLT+
Sbjct: 201 EKLFDPARSSTYANISCAAPACSDLYIKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSS 260
Query: 255 -TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSG 313
+K GCG +N+G++ AAGLLGLG G SL Q + GG F++C +R +G +G
Sbjct: 261 YDAIKGFRFGCGERNEGLYGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSG-TG 319
Query: 314 SLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
L FG +LP +A + P++ + P+FYYVGL+G+ VGG + I + +F + G
Sbjct: 320 YLDFGPGSLPAVSAKLTTPMLVD-NGPTFYYVGLTGIRVGGKLLSIPQSVFTTS-----G 373
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGN--LPRASGVSIFDTCYNLSGFVSVRVPTVS 429
++D+GT +TRLP AY + R AF + +A +S+ DTCY+ +G V +PTVS
Sbjct: 374 TIVDSGTVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYDFTGMSEVAIPTVS 433
Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGF 487
F GG L + AS +I C FA + I+GN Q + + +D
Sbjct: 434 LLFQGGASLDVHASG-IIYAASVSQACLGFAGNKEDDDVGIVGNTQLKTFGVVYDIGKKV 492
Query: 488 VGFGPNVC 495
VGF P C
Sbjct: 493 VGFCPGAC 500
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 248 bits (633), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 154/402 (38%), Positives = 216/402 (53%), Gaps = 24/402 (5%)
Query: 108 FHARMQRDVKRVATLVRRLSGGGADAAKH-EVQDFGTDVVS-----GMDQGSGEYFVRIG 161
F A + D R+A+ RL+ + ++ Q G+ + S G G G Y R+G
Sbjct: 63 FSAVLTHDAARIASFAARLAKKSSPSSASATTQAAGSSLASVPLTPGTSVGVGNYVTRMG 122
Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSCSSAVCDRLE 220
+G+P + MV+D+GS + W+QC PC C++QS PVFDP S+S++ VSCSS CD L
Sbjct: 123 LGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSCSSPQCDGLS 182
Query: 221 NAGCHAGRCR------YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVG 274
A + C Y+ SYGD S++ G L+ +T++ G V N GCG N+G+F
Sbjct: 183 TATLNPAVCSPSNVCIYQASYGDSSFSVGYLSKDTVSFGANSVPNFYYGCGQDNEGLFGR 242
Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRN 334
+AGL+GL +SL+ QL G +FSYCL S T SSG L G P G ++ P+V N
Sbjct: 243 SAGLMGLARNKLSLLYQLAPTLGYSFSYCLPS--TSSSGYLSIGSYN-PGGYSYTPMVSN 299
Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
S Y++ LSG+ V G + +S ++ ++D+GT +TRLPT Y A A
Sbjct: 300 TLDDSLYFISLSGMTVAGKPLAVSS-----SEYTSLPTIIDSGTVITRLPTSVYTALSKA 354
Query: 395 F-VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
A G+ RA+ SI DTC+ VP VS FSGG L L A N L+ VD A
Sbjct: 355 VAAAMKGSTKRAAAYSILDTCFEGQASKLRAVPAVSMAFSGGATLKLSAGNLLVDVDGAT 414
Query: 454 TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
T C AFAP+ S +IIGN QQ+ + +D + +GF C
Sbjct: 415 T-CLAFAPARSA-AIIGNTQQQTFSVVYDVKSNRIGFAAAGC 454
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 247 bits (631), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 145/357 (40%), Positives = 202/357 (56%), Gaps = 16/357 (4%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSAS 205
SG G+G Y V +G+G+P +V D+GSD WVQCQPC CY+Q + +FDPA S++
Sbjct: 171 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST 230
Query: 206 FSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGC 264
++ VSC++ C L GC G C Y V YGDGSY+ G A++TLT+ VK GC
Sbjct: 231 YANVSCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 290
Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
G +N+G+F AAGLLGLG G SL Q + GG F++CL +R TG +G L FG +L
Sbjct: 291 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTG-TGYLDFGAGSLAA 349
Query: 325 GAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
A + P++ P+FYYVG++G+ VGG + I + +F G ++D+GT +TR
Sbjct: 350 ARARLTTPMLTE-NGPTFYYVGMTGIRVGGQLLSIPQSVFATA-----GTIVDSGTVITR 403
Query: 383 LPTPAYEAFR--DAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
LP AY + R A +A VS+ DTCY+ +G V +PTVS F GG L +
Sbjct: 404 LPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDV 463
Query: 441 PASNFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
AS + A C AFA + G + I+GN Q + +++D VGF P C
Sbjct: 464 DASGIMY-AASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 145/350 (41%), Positives = 199/350 (56%), Gaps = 13/350 (3%)
Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC 211
GSGEY + + +G+P S ++D+GSD++W QC+PC+QC+ Q P+F+P DS+SFS + C
Sbjct: 92 GSGEYLMNVAIGTPASSLSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPC 151
Query: 212 SSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
S C L + C+ C+Y YGDGS T+G +A ET T + V N+A GCG NQG
Sbjct: 152 ESQYCQDLPSESCY-NDCQYTYGYGDGSSTQGYMATETFTFETSSVPNIAFGCGEDNQGF 210
Query: 272 FVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA--LPVGAAW 328
G AGL+G+G G +SL QLG G FSYC+ S G+ S +L G A +P G+
Sbjct: 211 GQGNGAGLIGMGWGPLSLPSQLG---VGQFSYCMTSSGSSSPSTLALGSAASGVPEGSPS 267
Query: 329 VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
L+ + P++YY+ L G+ VGG + I F+L G G+++D+GT +T LP AY
Sbjct: 268 TTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAY 327
Query: 389 EAFRDAFVAQTGNLPRASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
A AF Q P S TC+ L S +V+VP +S F GG VL L N LI
Sbjct: 328 NAVAQAFTDQINLSPVDESSSGLSTCFQLPSDGSTVQVPEISMQFDGG-VLNLGEENVLI 386
Query: 448 -PVDDAGTFCFAF-APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
P + G C A + S G+SI GNIQQ+ Q+ +D N V F P C
Sbjct: 387 SPAE--GVICLAMGSSSQQGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 434
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 137/366 (37%), Positives = 208/366 (56%), Gaps = 26/366 (7%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSAS 205
SG+ G+G Y V +G+G+P + ++ D+GSD+ W QCQPC + CY Q P+FDP+ S +
Sbjct: 145 SGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSASKT 204
Query: 206 FSGVSCSSAVCDRLENA-----GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKN 259
+S +SC+S C L++A GC + C Y + YGD S+T G A +TLT+ + V
Sbjct: 205 YSNISCTSTACSGLKSATGNSPGCSSSNCVYGIQYGDSSFTVGFFAKDTLTLTQNDVFDG 264
Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL-VSRGTGSSGSLVFG 318
GCG N+G+F AGL+GLG +S+V Q + G FSYCL SRG S+G L FG
Sbjct: 265 FMFGCGQNNRGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRG--SNGHLTFG 322
Query: 319 R-------EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
+A+ G + P + + +FY++ + G+ VGG + IS LF+ + G
Sbjct: 323 NGNGVKTSKAVKNGITFTPFASS-QGATFYFIDVLGISVGGKALSISPMLFQ-----NAG 376
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
++D+GT +TRLP+ Y + + F P A +S+ DTCY+LS + S+ +P +SF
Sbjct: 377 TIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKISFN 436
Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVG 489
F+G + L + LI + A C AFA + + I GNIQQ+ +++ +D A G +G
Sbjct: 437 FNGNANVDLEPNGILI-TNGASQVCLAFAGNGDDDTIGIFGNIQQQTLEVVYDVAGGQLG 495
Query: 490 FGPNVC 495
FG C
Sbjct: 496 FGYKGC 501
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 142/356 (39%), Positives = 196/356 (55%), Gaps = 14/356 (3%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSAS 205
SG G+G Y V IG+G+P +V D+GSD WVQCQPC CYKQ + +FDPA S++
Sbjct: 173 SGRALGTGNYVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSST 232
Query: 206 FSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGC 264
++ VSC++ C L GC G C Y V YGDGSY+ G A++TLT+ VK GC
Sbjct: 233 YANVSCAAPACSDLYTRGCSGGHCLYSVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 292
Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR-EALP 323
G +N+G+F AAGLLGLG G SL Q + GG F++CL +R +G +G L FG
Sbjct: 293 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSG-TGYLDFGPGSPAA 351
Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
VGA + P+FYYVG++G+ VGG + I + +F G ++D+GT +TRL
Sbjct: 352 VGARQTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFSTA-----GTIVDSGTVITRL 406
Query: 384 PTPAYEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
P AY + R AF + +A +S+ DTCY+ +G V +P VS F GG L +
Sbjct: 407 PPAAYSSLRSAFASAMAARGYKKAPALSLLDTCYDFTGMSEVAIPKVSLLFQGGAYLDVN 466
Query: 442 ASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
AS + C FA + + I+GN Q + + +D VGF P C
Sbjct: 467 ASGIMYAA-SLSQVCLGFAANEDDDDVGIVGNTQLKTFGVVYDIGKKTVGFSPGAC 521
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 172/430 (40%), Positives = 225/430 (52%), Gaps = 43/430 (10%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
L L HR S++S SF + D +RV + RR+SGGGA AK +Q
Sbjct: 75 LRLAHRCGPSTAS------------ASFAEVQRADEQRVEYIQRRVSGGGARGAKGALQQ 122
Query: 141 FGT-----DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ--CYKQ 193
T V + M G+ +Y V + +G+P SQ + +D+GSD+ WVQC+PCS C Q
Sbjct: 123 LATGSRSATVPTTMGVGTFQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQ 182
Query: 194 SDPVFDPADSASFSGVSCSSAVCD--RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLT 251
D +FDPA S+++S V C + C R+ AGC +C Y VSYGDGS T G +TL
Sbjct: 183 RDQLFDPAKSSTYSAVPCGADACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLA 242
Query: 252 IGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG 310
+ V GCGH GMF G GLL LG SMSL Q G GG FSYCL S+ +
Sbjct: 243 LAPGNTVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQS- 301
Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
++G L G + G A L+ AP+FY V L+G+ VGG ++ + F
Sbjct: 302 AAGYLTLGGPSSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFA------G 355
Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAF---VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
G V+DTGT +TRLP AY A R AF +A G P A I DTCY+ S + V +PT
Sbjct: 356 GTVVDTGTVITRLPPTAYAALRSAFRGAIAPCG-YPSAPANGILDTCYDFSRYGVVTLPT 414
Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGAN 485
V+ FSGG L L A L + C AFAP+ +I+GN+QQ + FDG+
Sbjct: 415 VALTFSGGATLALEAPGIL------SSGCLAFAPNGGDGDAAILGNVQQRSFAVRFDGST 468
Query: 486 GFVGFGPNVC 495
VGF P C
Sbjct: 469 --VGFMPGAC 476
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 245 bits (626), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 161/422 (38%), Positives = 224/422 (53%), Gaps = 28/422 (6%)
Query: 88 KMSSSSNTTNNMHYHRH----------QHSFHARMQRDVKRVATLVRRLSGGGADAAKHE 137
K++ SS +HRH + ++RD R A + R+ SG A E
Sbjct: 49 KVAPSSGVVTVPLHHRHGPCSTVPSTNAPTLEDMLRRDQLRAAYITRKYSGVNGSAGDVE 108
Query: 138 VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
D G + EY + +G+GSP +Q M+ID+GSD+ WVQC+PCSQC+ Q+D +
Sbjct: 109 GSDVTVPTTLGTSLDTLEYLITVGMGSPAVAQTMLIDTGSDVSWVQCKPCSQCHSQADSL 168
Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV 257
FDP+ S+++S SC+SA C +L GC + +C+Y V YGDGS GT + +TL +G + V
Sbjct: 169 FDPSSSSTYSAFSCTSAACAQLRQRGCSSSQCQYTVKYGDGSTGSGTYSSDTLALGSSTV 228
Query: 258 KNVAIGCGHKNQGMFV--GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSL 315
+N GC G + AGL+GLGGG+ SL Q G G AFSYCL GSSG L
Sbjct: 229 ENFQFGCSQSESGNLLQDQTAGLMGLGGGAESLATQTAGTFGKAFSYCLPPT-PGSSGFL 287
Query: 316 VFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMD 375
G P++R+ + PS+Y V L + VGG ++ I F G +MD
Sbjct: 288 TLGASTSGF-VVKTPMLRSTQVPSYYGVLLQAIRVGGRQLNIPASAFSA------GSIMD 340
Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGG 435
+GT +TRLP AY A AF A P A + IFDTC++ SG SV +PTV+ FSGG
Sbjct: 341 SGTIITRLPRTAYSALSSAFKAGMKQYPPAQPMGIFDTCFDFSGQSSVSIPTVALVFSGG 400
Query: 436 PVLTLPASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
V+ L + ++ C AFA + + L IIGN+QQ ++ +D G VGF
Sbjct: 401 AVVDLASDGIIL------GSCLAFAANSDDTSLGIIGNVQQRTFEVLYDVGGGAVGFKAG 454
Query: 494 VC 495
C
Sbjct: 455 AC 456
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 245 bits (626), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 172/430 (40%), Positives = 224/430 (52%), Gaps = 43/430 (10%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
L L HR S++S SF + D +RV + RR+SGGGA AK +Q
Sbjct: 75 LRLAHRCGPSTAS------------ASFAEVQRADEQRVEYIQRRVSGGGARGAKGALQQ 122
Query: 141 FGT-----DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ--CYKQ 193
T V + M G+ +Y V + +G+P SQ + +D+GSD+ WVQC+PCS C Q
Sbjct: 123 LATGSRSATVPTTMGVGTFQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQ 182
Query: 194 SDPVFDPADSASFSGVSCSSAVCD--RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLT 251
D +FDPA S+++S V C + C R+ AGC +C Y VSYGDGS T G +TL
Sbjct: 183 RDQLFDPAKSSTYSAVPCGADACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLA 242
Query: 252 IGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG 310
+ V GCGH GMF G GLL LG SMSL Q G GG FSYCL S+ +
Sbjct: 243 LAPGNTVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQS- 301
Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
++G L G G A L+ AP+FY V L+G+ VGG ++ + F
Sbjct: 302 AAGYLTLGGPTSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFA------G 355
Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAF---VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
G V+DTGT +TRLP AY A R AF +A G P A I DTCY+ S + V +PT
Sbjct: 356 GTVVDTGTVITRLPPTAYAALRSAFRGAIAPYG-YPSAPANGILDTCYDFSRYGVVTLPT 414
Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGAN 485
V+ FSGG L L A L + C AFAP+ +I+GN+QQ + FDG+
Sbjct: 415 VALTFSGGATLALEAPGIL------SSGCLAFAPNGGDGDAAILGNVQQRSFAVRFDGST 468
Query: 486 GFVGFGPNVC 495
VGF P C
Sbjct: 469 --VGFMPGAC 476
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 244 bits (624), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 152/394 (38%), Positives = 214/394 (54%), Gaps = 23/394 (5%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGT-DVVSGMDQGSGEYFVRIGVGSPPRSQY 170
+QR +RVA +LS DA FG+ + S + G+GEY + + +GSPP+S
Sbjct: 4 VQRSHERVAFYTLKLS---PDA-------FGSQEFQSPVKAGNGEYLMTLTLGSPPQSFD 53
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCD--RLENAGCHAGR 228
+++D+GSD+ WVQC PC CY+Q P FDP+ S SF +C+ +C+ L C A
Sbjct: 54 VIVDTGSDLNWVQCLPCRVCYQQPGPKFDPSKSRSFRKAACTDNLCNVSALPLKACAANV 113
Query: 229 CRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGG 284
C+Y+ +YGD S T G LA ET+++ G V N A GCG +N G F GAAGL+GLG G
Sbjct: 114 CQYQYTYGDQSNTNGDLAFETISLNNGAGTQSVPNFAFGCGTQNLGTFAGAAGLVGLGQG 173
Query: 285 SMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVG 344
+SL QL FSYCLVS + S+ L FG A + +V N R P++YYV
Sbjct: 174 PLSLNSQLSHTFANKFSYCLVSLNSLSASPLTFGSIAAAANIQYTSIVVNARHPTYYYVQ 233
Query: 345 LSGLGVGGMRIPISEDLFRLTQ-MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP 403
L+ + VGG + ++ +F + Q G G ++D+GT +T L PAY A A+ + N P
Sbjct: 234 LNSIEVGGQPLNLAPSVFAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAYESFV-NYP 292
Query: 404 RASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD-DAGTFCFAFAP 461
R G + D C+N++G + VP + F F G + N + VD A T C A
Sbjct: 293 RLDGSAYGLDLCFNIAGVSNPSVPDMVFKFQGAD-FQMRGENLFVLVDTSATTLCLAMGG 351
Query: 462 SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
S G SIIGNIQQ+ + +D +GF C
Sbjct: 352 S-QGFSIIGNIQQQNHLVVYDLEAKKIGFATADC 384
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 244 bits (624), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 147/386 (38%), Positives = 214/386 (55%), Gaps = 12/386 (3%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
++RD RV ++ +LS A+ E + SG+ GSG Y V IG+G+P +
Sbjct: 89 IRRDQARVESIYSKLSKNSANEVS-EAKSTELPAKSGITLGSGNYIVTIGIGTPKHDLSL 147
Query: 172 VIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
V D+GSD+ W QC+PC CY Q +P F+P+ S+++ VSCSS +C+ E+ C A C
Sbjct: 148 VFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCEDAES--CSASNCV 205
Query: 231 YEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLV 289
Y + YGD S+T+G LA E T+ + V+++V GCG NQG+F G AGLLGLG G +SL
Sbjct: 206 YSIGYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGENNQGLFDGVAGLLGLGPGKLSLP 265
Query: 290 GQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLG 349
Q FSYCL S + S+G L FG + + P+ P A + Y + + G+
Sbjct: 266 AQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESVKFTPISSFPSAFN-YGIDIIGIS 324
Query: 350 VGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS 409
VG + I+ + F +G ++D+GT TRLPT Y R F + + SG
Sbjct: 325 VGDKELAITPNSFST-----EGAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGYG 379
Query: 410 IFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSII 469
+FDTCY+ +G +V PT++F F+GG V+ L S +P+ C AFA + +I
Sbjct: 380 LFDTCYDFTGLDTVTYPTIAFSFAGGTVVELDGSGISLPI-KISQVCLAFAGNDDLPAIF 438
Query: 470 GNIQQEGIQISFDGANGFVGFGPNVC 495
GN+QQ + + +D A G VGF PN C
Sbjct: 439 GNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 244 bits (623), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 138/335 (41%), Positives = 186/335 (55%), Gaps = 21/335 (6%)
Query: 170 YMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLE--NAGCHAG 227
+++ID+GSDI W+QC PC QCYKQ D +F PA SA++ + C+S +C +L+ + C
Sbjct: 2 FLLIDTGSDITWIQCDPCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSCLNS 61
Query: 228 RCRYEVSYGDGSYTKGTLALETLTIGR-----TVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
C Y VSYGD S T+G ALETLT+ V N A GCGH N+G+F GAAGL+GLG
Sbjct: 62 SCNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKGLFNGAAGLMGLG 121
Query: 283 GGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREA-LPVGAAWVPLVRNPRAPSF 340
S+ Q G FSYCL S T SG L FG A L + PLV + PS
Sbjct: 122 KSSIGFPAQTSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDYDVRFTPLVDSSSGPSQ 181
Query: 341 YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG 400
Y+V ++G+ VG +PIS V++D+GT ++R AYE RDAF
Sbjct: 182 YFVSMTGINVGDELLPISA-----------TVMVDSGTVISRFEQSAYERLRDAFTQILP 230
Query: 401 NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA 460
L A V+ FDTC+ +S + +P ++ +F L L + L PVDD G CFAFA
Sbjct: 231 GLQTAVSVAPFDTCFRVSTVDDINIPLITLHFRDDAELRLSPVHILYPVDD-GVMCFAFA 289
Query: 461 PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
PS SG S++GN QQ+ ++ +D +G C
Sbjct: 290 PSSSGRSVLGNFQQQNLRFVYDIPKSRLGISAFEC 324
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 244 bits (622), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 151/414 (36%), Positives = 227/414 (54%), Gaps = 25/414 (6%)
Query: 99 MHYHRHQHSFHARMQR-DVKRVATLVRRLSGGGADAAKHEVQDF---------GTDVVSG 148
+ + + Q+ F A+++ D + T R+ G +H +Q F +++ +
Sbjct: 31 LEHPKVQNGFRAKLKHVDSGKNLTKFERIQHG-VKRGRHRLQRFKAMALVASSNSEIDAP 89
Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG 208
+ G+GE+ +++ +G+PP + ++D+GSD++W QC+PC+QC+ Q P+FDP S+SFS
Sbjct: 90 VLPGNGEFLMKLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPTPIFDPKKSSSFSK 149
Query: 209 VSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKN 268
+SCSS +C+ L + C G C Y YGD S T+G LA ETLT G+ V VA GCG N
Sbjct: 150 LSCSSKLCEALPQSTCSDG-CEYLYGYGDYSSTQGMLASETLTFGKVSVPEVAFGCGEDN 208
Query: 269 QGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR----EALP 323
+G F +GL+GLG G +SLV QL FSYCL S + +L+ G +A
Sbjct: 209 EGSGFSQGSGLVGLGRGPLSLVSQLKEP---KFSYCLTSVDDTKASTLLMGSLASVKASD 265
Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
PL++N PSFYY+ L G+ VG +PI + F L + G G+++D+GT +T L
Sbjct: 266 SEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTTITYL 325
Query: 384 PTPAYEAFRDAFVAQTGNLP-RASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLP 441
A++ F +Q NLP SG + + C+ L SG + VP + F+F G L LP
Sbjct: 326 EQSAFDLVAKEFTSQI-NLPVDNSGSTGLEVCFTLPSGSTDIEVPKLVFHFDGAD-LELP 383
Query: 442 ASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
A N++I G C A S SG+SI GNIQQ+ + + D + F P C
Sbjct: 384 AENYMIADASMGVACLAMG-SSSGMSIFGNIQQQNMLVLHDLEKETLSFLPTQC 436
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 244 bits (622), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 145/395 (36%), Positives = 219/395 (55%), Gaps = 27/395 (6%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVS-----GMDQGSGEYFVRIGVGSPP 166
++ D +R ++RR+SG GA ++ D+ + G D G+ Y V +G+P
Sbjct: 92 LRADQRRAEHILRRVSGRGAP----QLWDYKAAAATVPANWGYDIGTSNYVVTASLGTPG 147
Query: 167 RSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENA 222
+Q + +D+GSD+ WVQC+PC+ CY+Q DP+FDPA S+S++ V C + C L +
Sbjct: 148 MAQTLEVDTGSDLSWVQCKPCAAPSCYRQKDPLFDPAQSSSYAAVPCGRSACAGLGIYAS 207
Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTIG-RTVVKNVAIGCGH-KNQGMFVGAAGLLG 280
C A +C Y VSYGDGS T G + +TLT+ V+ GCGH ++ G+F G GLLG
Sbjct: 208 ACSAAQCGYVVSYGDGSNTTGVYSSDTLTLAANATVQGFLFGCGHAQSGGLFTGIDGLLG 267
Query: 281 LGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSF 340
G SLV Q G GG FSYCL ++ + + + G + G + L+ +P AP++
Sbjct: 268 FGREQPSLVQQTAGAYGGVFSYCLPTKSSTTGYLTLGGPSGVAPGFSTTQLLPSPNAPTY 327
Query: 341 YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG 400
Y V L+G+ VGG + + F G V+DTGT +TRLP AY A R AF +
Sbjct: 328 YVVMLTGISVGGQPLSVPASAFA------AGTVVDTGTVITRLPPAAYAALRSAFRSGMA 381
Query: 401 NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA 460
+ P A + I DTCY+ +G+ +V + +V+ FS G +TL A + G FA +
Sbjct: 382 SYPSAPPIGILDTCYSFAGYGTVNLTSVALTFSSGATMTLGADGIM----SFGCLAFASS 437
Query: 461 PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
S ++I+GN+QQ ++ DG++ VGF P+ C
Sbjct: 438 GSDGSMAILGNVQQRSFEVRIDGSS--VGFRPSSC 470
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 137/355 (38%), Positives = 201/355 (56%), Gaps = 17/355 (4%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
GEY + +G+P R +++D+GSD+ WVQC PC CY Q+D +F P S SF+ ++C +
Sbjct: 1 GEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQNDSLFIPNTSTSFTKLACGT 60
Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-----RTVVKNVAIGCGHKN 268
+C+ L C+ C Y SYGDGS + G +T+T+ + V N A GCGH N
Sbjct: 61 ELCNGLPYPMCNQTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFGCGHDN 120
Query: 269 QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--RGTGSSGSLVFGREALPV-- 324
+G F GA G+LGLG G +S QL G FSYCLV + L+FG A+P
Sbjct: 121 EGSFAGADGILGLGQGPLSFPSQLKTVFNGKFSYCLVDWLAPPTQTSPLLFGDAAVPTFP 180
Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
G ++ L+ NP+ P++YYV L+G+ VGG + IS F + +G G + D+GT VT+L
Sbjct: 181 GVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTIFDSGTTVTQLA 240
Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTV---SFYFSGGPVLTL 440
++ A A T + PR S S D C L GF ++PTV +F+F GG + L
Sbjct: 241 GEVHQEVLAAMNASTMDYPRKSDDSSGLDLC--LGGFAEGQLPTVPSMTFHFEGGD-MEL 297
Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
P SN+ I ++ + ++CF+ SP ++IIG+IQQ+ Q+ +D +GF P C
Sbjct: 298 PPSNYFIFLESSQSYCFSMVSSPD-VTIIGSIQQQNFQVYYDTVGRKIGFVPKSC 351
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 154/431 (35%), Positives = 226/431 (52%), Gaps = 23/431 (5%)
Query: 78 RWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGG-GADAAKH 136
+ +LE+VH+ S N + S M D +RV + RLS G + +
Sbjct: 60 KASLEVVHKHGPCSQLNHNGKA---KTTISHTDIMNLDNERVKYIQSRLSKNLGRENSVK 116
Query: 137 EVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSD 195
E+ SG GS YFV +G+G+P R +V D+GSD+ W QC+PC+ CYKQ D
Sbjct: 117 ELDSTTLPAKSGSLIGSANYFVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQD 176
Query: 196 PVFDPADSASFSGVSCSSAVCDRLENAGCHA------GRCRYEVSYGDGSYTKGTLALET 249
+FDP+ S+S+ ++C+S++C +L +AG + C Y + YGD S + G L+ E
Sbjct: 177 AIFDPSKSSSYINITCTSSLCTQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGFLSQER 236
Query: 250 LTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG 308
LTI T +V + GCG N+G+F G+AGL+GLG +S V Q FSYCL S
Sbjct: 237 LTITATDIVDDFLFGCGQDNEGLFSGSAGLIGLGRHPISFVQQTSSIYNKIFSYCLPSTS 296
Query: 309 TGSSGSLVFGREALP-VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQ 366
+ S G L FG A + PL +FY + + G+ VGG ++P +S F
Sbjct: 297 S-SLGHLTFGASAATNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTFSA-- 353
Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
G ++D+GT +TRL AY A R AF P A+ +FDTCY+ SG+ + VP
Sbjct: 354 ---GGSIIDSGTVITRLAPTAYAALRSAFRQGMEKYPVANEDGLFDTCYDFSGYKEISVP 410
Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP--SPSGLSIIGNIQQEGIQISFDGA 484
+ F F+GG + LP LI A C AFA + + ++I GN+QQ+ +++ +D
Sbjct: 411 KIDFEFAGGVTVELPLVGILIG-RSAQQVCLAFAANGNDNDITIFGNVQQKTLEVVYDVE 469
Query: 485 NGFVGFGPNVC 495
G +GFG C
Sbjct: 470 GGRIGFGAAGC 480
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 243 bits (620), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 163/448 (36%), Positives = 232/448 (51%), Gaps = 41/448 (9%)
Query: 85 HRDKMSSSSNTTNNMHYHR-------HQHSFHARMQRDVKRVATLVRRLSGGG------- 130
+ + +SSS + HR + SF + ++D R+ T+ RR + G
Sbjct: 64 EQKQPASSSPSLQLRMKHRSAEGGRTRKESFLDKAEKDAVRIETMHRRAARSGVARMPAS 123
Query: 131 ADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC 190
+ + + V SG+ GSGEY + + VG+PPR M++D+GSD+ W+QC PC C
Sbjct: 124 SSPRRALSERMVATVESGVAVGSGEYLIDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDC 183
Query: 191 YKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR---------CRYEVSYGDGSYT 241
++Q PVFDPA S+S+ V+C C + A A R C Y YGD S T
Sbjct: 184 FEQRGPVFDPAASSSYRNVTCGDQRCGLV--APPEAPRACRRPAEDSCPYYYWYGDQSNT 241
Query: 242 KGTLALETLTIGRTV------VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQ 295
G LALE+ T+ T V V GCGH+N+G+F GAAGLLGLG G +S QL
Sbjct: 242 TGDLALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAV 301
Query: 296 TGGAFSYCLVSRGTGSSGSLVFGREALPVG------AAWVPLVRNPRAPSFYYVGLSGLG 349
G FSYCLV G+ + +VFG + L + A+ P +P A +FYYV L G+
Sbjct: 302 YGHTFSYCLVEHGSDAGSKVVFGEDYLVLAHPQLKYTAFAP-TSSP-ADTFYYVKLKGVL 359
Query: 350 VGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL-PRASGV 408
VGG + IS D + + + G G ++D+GT ++ PAY+ R AFV L P
Sbjct: 360 VGGDLLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRLYPLIPDF 419
Query: 409 SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLS 467
+ + CYN+SG VP +S F+ G V PA N+ + +D G C A +P +G+S
Sbjct: 420 PVLNPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFVRLDPDGIMCLAVRGTPRTGMS 479
Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
IIGN QQ+ + +D N +GF P C
Sbjct: 480 IIGNFQQQNFHVVYDLQNNRLGFAPRRC 507
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 243 bits (620), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 154/410 (37%), Positives = 219/410 (53%), Gaps = 32/410 (7%)
Query: 113 QRDVKRVATLVRRLSGGGADAAKHEV-------QDFGTDVVSGMDQGSGEYFVRIGVGSP 165
++D R+ T+ RR + G+ AA+ + + V SG+ GSGEY V + +G+P
Sbjct: 99 EKDAVRIDTMHRRAALSGSAAARRDSAPRRALSERVVATVESGVPVGSGEYLVDVYLGTP 158
Query: 166 PRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH 225
PR M++D+GSD+ W+QC PC C++QS P+FDPA S S+ V+C C +
Sbjct: 159 PRRFRMIMDTGSDLNWLQCAPCLDCFEQSGPIFDPAASISYRNVTCGDDRCRLVSPPAES 218
Query: 226 AGR---------CRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIGCGHKNQGM 271
A R C Y YGD S T G LALE T+ G V VA GCGH+N+G+
Sbjct: 219 APRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQSGTRRVDGVAFGCGHRNRGL 278
Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGG-AFSYCLVSRGTGSSGSLVFGREALPVGAA--- 327
F GAAGLLGLG G +S QL G GG AFSYCLV G+ + ++FG + +
Sbjct: 279 FHGAAGLLGLGRGPLSFASQLRGVYGGHAFSYCLVEHGSAAGSKIIFGHDDALLAHPQLN 338
Query: 328 WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPA 387
+ A +FYY+ L + VGG + IS D + G ++D+GT ++ P PA
Sbjct: 339 YTAFAPTTDADTFYYLQLKSILVGGEAVNISSD-----TLSAGGTIIDSGTTLSYFPEPA 393
Query: 388 YEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
Y+A R AF+ + + + P G + CYN+SG V VP +S F+ G PA N+
Sbjct: 394 YQAIRQAFIDRMSPSYPLILGFPVLSPCYNVSGAEKVEVPELSLVFADGAAWEFPAENYF 453
Query: 447 IPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
I ++ G C A +P SG+SIIGN QQ+ + +D + +GF P C
Sbjct: 454 IRLEPEGIMCLAVLGTPRSGMSIIGNYQQQNFHVLYDLEHNRLGFAPRRC 503
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 243 bits (620), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 142/393 (36%), Positives = 211/393 (53%), Gaps = 15/393 (3%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
+ RD RV + R+++ A+ + + V G + YF + +G+P +
Sbjct: 90 LGRDQDRVDAIRRKVAAVTTAASSSKPKGVPLQVGWGKYLDTTNYFTSLRLGTPATDLLV 149
Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA----G 227
+D+GSD W+QC+PC CY+Q + +FDP+ S+++S ++CSS C L ++ H
Sbjct: 150 ELDTGSDQSWIQCKPCPDCYEQHEALFDPSKSSTYSDITCSSRECQELGSSHKHNCSSDK 209
Query: 228 RCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSM 286
+C YE++Y D SYT G LA +TLT+ T V GCGH N G F GLLGLG G
Sbjct: 210 KCPYEITYADDSYTVGNLARDTLTLSPTDAVPGFVFGCGHNNAGSFGEIDGLLGLGRGKA 269
Query: 287 SLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG--REALPVGAAWVPLVRNPRAPSFYYVG 344
SL Q+ + G FSYCL S + ++G L F A P A + +V + PSFYY+
Sbjct: 270 SLSSQVAARYGAGFSYCLPSSPS-ATGYLSFSGAAAAAPTNAQFTEMVAG-QHPSFYYLN 327
Query: 345 LSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR 404
L+G+ V G I + +F G ++D+GTA + LP AY A R + + G R
Sbjct: 328 LTGITVAGRAIKVPPSVFATAA----GTIIDSGTAFSCLPPSAYAALRSSVRSAMGRYKR 383
Query: 405 ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP- 463
A +IFDTCY+L+G +VR+P+V+ F+ G + L S L + C AF P+P
Sbjct: 384 APSSTIFDTCYDLTGHETVRIPSVALVFADGATVHLHPSGVLYTWSNVSQTCLAFLPNPD 443
Query: 464 -SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ L ++GN QQ + + +D N VGFG N C
Sbjct: 444 DTSLGVLGNTQQRTLAVIYDVDNQKVGFGANGC 476
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 243 bits (620), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 152/469 (32%), Positives = 243/469 (51%), Gaps = 36/469 (7%)
Query: 46 GSRTDHAKMSQYNELFERHNNISSSNTSSDEARWN---LELVHRDKMSSSSNTTNNMHYH 102
G + KM + ++ +R++ S E+R + L +D+ S N
Sbjct: 26 GCELEQKKMFKV-QMLQRNHQFGSKGCILPESRKEKGAIVLEMKDRGYCSERKINWNRKL 84
Query: 103 RHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGV 162
+ Q F R ++ + ++SG + E+Q + SG++ + Y V IG+
Sbjct: 85 QKQLIFDDLRVRSMQN--RIRAKVSGHNSSEQSSEIQ---IPLASGINLETLNYIVTIGL 139
Query: 163 GSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLE-- 220
G+ ++ ++ID+GSD+ WVQC PC CY Q PVF+P++S+S++ + C+S+ C L+
Sbjct: 140 GN--QNMTVIIDTGSDLTWVQCDPCMSCYSQQGPVFNPSNSSSYNSLLCNSSTCQNLQFT 197
Query: 221 ---NAGCHAGR---CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVG 274
C + C + VSYGDGS+T G L +E L+ G V N GCG N+G+F G
Sbjct: 198 TGNTEACESNNPSSCNHTVSYGDGSFTDGELGVEHLSFGGISVSNFVFGCGRNNKGLFGG 257
Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA------LPVGAAW 328
+G++GLG ++S++ Q GG FSYCL + +G+SGSLV G E+ P+ A+
Sbjct: 258 VSGIMGLGRSNLSMISQTNTTFGGVFSYCLPTTDSGASGSLVIGNESSLFKNLTPI--AY 315
Query: 329 VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
+V NP+ +FY + L+G+ VGG+ I + T G+ G+++D+GT +TRL Y
Sbjct: 316 TSMVSNPQLSNFYVLNLTGIDVGGVAI-------QDTSFGNGGILIDSGTVITRLAPSLY 368
Query: 389 EAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIP 448
A + F+ Q P A +SI DTC+NL+G V +PT+S +F L + A L
Sbjct: 369 NALKAEFLKQFSGYPIAPALSILDTCFNLTGIEEVSIPTLSMHFENNVDLNVDAVGILYM 428
Query: 449 VDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
D C A A + ++IIGN QQ ++ +D +GF C
Sbjct: 429 PKDGSQVCLALASLSDENDMAIIGNYQQRNQRVIYDAKQSKIGFAREDC 477
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 144/357 (40%), Positives = 204/357 (57%), Gaps = 16/357 (4%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSAS 205
SG G+G Y V +G+G+P +V D+GSD WVQCQPC CY+Q + +FDP S++
Sbjct: 169 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPVRSST 228
Query: 206 FSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGC 264
++ VSC++ C L GC G C Y V YGDGSY+ G A++TLT+ VK GC
Sbjct: 229 YANVSCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 288
Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
G +N+G+F AAGLLGLG G SL Q + GG F++CL +R TG +G L FG +
Sbjct: 289 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTG-TGYLDFGAGSPAA 347
Query: 325 GAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
+A + P++ + P+FYY+G++G+ VGG + I + +F G ++D+GT +TR
Sbjct: 348 ASARLTTPMLTD-NGPTFYYIGMTGIRVGGQLLSIPQSVFATA-----GTIVDSGTVITR 401
Query: 383 LPTPAYEAFR--DAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
LP PAY + R A +A VS+ DTCY+ +G V +PTVS F GG L +
Sbjct: 402 LPPPAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDV 461
Query: 441 PASNFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
AS + A C AFA + G + I+GN Q + +++D VGF P VC
Sbjct: 462 DASGIMY-AASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGVC 517
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 146/386 (37%), Positives = 213/386 (55%), Gaps = 12/386 (3%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
++RD RV ++ +LS A+ E + SG+ GSG Y V IG+G+P +
Sbjct: 89 IRRDQARVESIYSKLSKNSANEVS-EAKSTELPAKSGITLGSGNYIVTIGIGTPKHDLSL 147
Query: 172 VIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
V D+GSD+ W QC+PC CY Q +P F+P+ S+++ VSCSS +C+ E+ C A C
Sbjct: 148 VFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCEDAES--CSASNCV 205
Query: 231 YEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLV 289
Y + YGD S+T+G LA E T+ + V+++V GCG NQG+F G AGLLGLG G +SL
Sbjct: 206 YSIVYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGENNQGLFDGVAGLLGLGPGKLSLP 265
Query: 290 GQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLG 349
Q FSYCL S + S+G L FG + + P+ P A + Y + + G+
Sbjct: 266 AQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESVKFTPISSFPSAFN-YGIDIIGIS 324
Query: 350 VGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS 409
VG + I+ + F +G ++D+GT TRLPT Y R F + + SG
Sbjct: 325 VGDKELAITPNSFST-----EGAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGYG 379
Query: 410 IFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSII 469
+FDTCY+ +G +V PT++F F+G V+ L S +P+ C AFA + +I
Sbjct: 380 LFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSGISLPI-KISQVCLAFAGNDDLPAIF 438
Query: 470 GNIQQEGIQISFDGANGFVGFGPNVC 495
GN+QQ + + +D A G VGF PN C
Sbjct: 439 GNVQQTTLDVVYDVAGGRVGFAPNGC 464
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 182/468 (38%), Positives = 252/468 (53%), Gaps = 39/468 (8%)
Query: 67 ISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHY-HRHQHSFHA--------RMQRDVK 117
+S SS EA + H++ M++SS++ ++ HR + +A R+QRD
Sbjct: 40 LSPHAHSSPEAAEDGAHAHQEDMAASSSSAMHVRLLHRDSFAVNATGAELLARRLQRDEL 99
Query: 118 RVATLVRRLSGGGADAAKHEVQDFGTDVVSGM---DQGSGEYFVRIGVGSPPRSQYMVID 174
R A ++ + G G +V+ + SG+Y +I VG+P + +D
Sbjct: 100 RAAWIISTAAANGTPPPDVVGLSTGRGLVAPVVSRAPTSGDYIAKIAVGTPAVEALLALD 159
Query: 175 SGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG---CHAGRCRY 231
+ SD+ W+QCQPC +CY QS PVFDP S S+ ++ + C L +G G C Y
Sbjct: 160 TASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPDCQALGRSGGGDAKRGTCIY 219
Query: 232 EVSYGDG------SYTKGTLALETLTIGRTVVKN-VAIGCGHKNQGMF-VGAAGLLGLGG 283
V YGDG S + G L ETLT V + ++IGCGH N+G+F AAG+LGL
Sbjct: 220 TVLYGDGDGHGSTSTSVGDLVEETLTFAGGVRQAYLSIGCGHDNKGLFGAPAAGILGLSR 279
Query: 284 GSMSLVGQLGGQ-TGGAFSYCLVS--RGTGS-SGSLVFGREALPVG--AAWVPLVRNPRA 337
G +S+ Q+ +FSYCLV G GS S +L FG A+ A++ P V N
Sbjct: 280 GQISIPHQIAFLGYNASFSYCLVDFISGPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNM 339
Query: 338 PSFYYVGLSGLGVGGMRIP--ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF 395
P+FYYV L G+ VGG+R+P DL G GV++D+GT VTRL PAY AFRDAF
Sbjct: 340 PTFYYVRLIGVSVGGVRVPGVTERDLQLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAF 399
Query: 396 VAQTGNLPRAS--GVS-IFDTCYNLSGFVSVR----VPTVSFYFSGGPVLTLPASNFLIP 448
A L + S G S +FDTCY + G +R VP VS +F+GG L+L N+LI
Sbjct: 400 RAAATGLGQVSTGGPSGLFDTCYTVGGRAGLRHCVKVPAVSMHFAGGVELSLQPKNYLIT 459
Query: 449 VDDAGTFCFAFAPS-PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
VD GT CFAFA + +S+IGNI Q+G ++ +D VGF PN C
Sbjct: 460 VDSRGTVCFAFAGTGDRSVSVIGNILQQGFRVVYDIGGQRVGFAPNSC 507
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 241 bits (616), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 135/371 (36%), Positives = 204/371 (54%), Gaps = 21/371 (5%)
Query: 143 TDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPAD 202
TD S + G G+Y I +G+P + ++ D+GSD++W+QC+PC C+ Q DP+FDP
Sbjct: 27 TDYESPVASGGGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEG 86
Query: 203 SASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VV 257
S+S++ +SC +CD L C + C Y YGDGS T+GTL+ ET+T+ T
Sbjct: 87 SSSYTTMSCGDTLCDSLPRKSC-SPDCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAA 145
Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS--L 315
KN+A GCGH N+G F A+GL+GLG G++S V QLG G FSYCLV S + +
Sbjct: 146 KNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPM 205
Query: 316 VFGREA------LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
FG E+ + A+ P++ NP SFYYV L + + G + I F + G
Sbjct: 206 FFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGS 265
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSG---FVSVRV 425
G++ D+GT +T LP Y+ A ++ + P+ G S D CY++SG +++
Sbjct: 266 GGMIFDSGTTLTLLPDAPYQIVLRALRSKI-SFPKIDGSSAGLDLCYDVSGSKASYKMKI 324
Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTF-CFAFAPSPSGLSIIGNIQQEGIQISFDGA 484
P + F+F G LP N+ I +DAGT C A S + I GN+ Q+ ++ +D
Sbjct: 325 PAMVFHFEGAD-YQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMMQQNFRVMYDIG 383
Query: 485 NGFVGFGPNVC 495
+ +G+ P+ C
Sbjct: 384 SSKIGWAPSQC 394
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 241 bits (615), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 174/498 (34%), Positives = 247/498 (49%), Gaps = 57/498 (11%)
Query: 14 LLLHLLCSIITTSTSAASDTHFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTS 73
LLL L + A+ ++ I+ VN + + +H+ + +S+S
Sbjct: 3 LLLFSLEKGYAVEENEATKSYLHIIKVNSLLPTTACNHS------------SKVSNS--- 47
Query: 74 SDEARWNLELVHRD-------KMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRL 126
+LE+VHR ++ +NM RD RV ++ RL
Sbjct: 48 -----LSLEVVHRHGPCIGIVNQEKGADAPSNMEI----------FLRDQNRVDSIHARL 92
Query: 127 SGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP 186
S G K Q V SG G+G+Y V +G+G+P + ++ D+GSDI W QC+P
Sbjct: 93 SSRGMFPEK---QATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEP 149
Query: 187 CSQ-CYKQSDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAGRCRYEVSYGDGSY 240
C + CYKQ +P +P+ S S+ +SCSSA+C + + C + C Y+V YGDGSY
Sbjct: 150 CVKTCYKQKEPRLNPSTSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSY 209
Query: 241 TKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA 299
+ G A ETLT+ + V KN GCG +N G+F GAAGLLGLG ++L Q
Sbjct: 210 SIGFFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKL 269
Query: 300 FSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISE 359
FSYCL + + S G L G + + + PL + + FY + ++GL VGG ++ I E
Sbjct: 270 FSYCLPA-SSSSKGYLSLGGQ-VSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDE 327
Query: 360 DLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSG 419
F G V+D+GT +TRL AY AF + P SG SIFDTCY+ S
Sbjct: 328 SAFSA------GTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSK 381
Query: 420 FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGI 477
+ +VR+P V F GG + + S L PV+ C AFA S SI GN+QQ
Sbjct: 382 YDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTY 441
Query: 478 QISFDGANGFVGFGPNVC 495
Q+ +DGA G VGF P C
Sbjct: 442 QVVYDGAKGRVGFAPGGC 459
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 241 bits (615), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 156/427 (36%), Positives = 219/427 (51%), Gaps = 38/427 (8%)
Query: 102 HRHQHSFHARMQ-------RDVKRVATLVRRLSGGGADAAKHEVQDFG----TDVVSGMD 150
H H ++R+Q R R++ LV R +G + ++ D+ +
Sbjct: 51 HVDAHGNYSRLQLLQRAARRSHHRMSRLVARATGAASTSSSKAAAAGDGSGGKDLQVPVH 110
Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
G+GE+ + + VG+P ++D+GSD+VW QC+PC +C+ Q+ PVFDPA S++++ +
Sbjct: 111 AGNGEFLMDLSVGTPALPYAAIVDTGSDLVWTQCKPCVECFNQTTPVFDPAASSTYAALP 170
Query: 211 CSSAVCDRLENAGCHAGRCR--------YEVSYGDGSYTKGTLALETLTIGRTVVKNVAI 262
CSSA+C L + C + Y +YGD S T+G LA ET T+ R V VA
Sbjct: 171 CSSALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTLARQKVPGVAF 230
Query: 263 GCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF---- 317
GCG N+G F AGL+GLG G +SLV QLG FSYCL S + S +
Sbjct: 231 GCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGIDR---FSYCLTSLDDAAGRSPLLLGSA 287
Query: 318 ---GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
A A PLV+NP PSFYYV L+GL VG R+ + F + G GV++
Sbjct: 288 AGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSAFAIQDDGTGGVIV 347
Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYN-----LSGFVSVRVPTV 428
D+GT++T L AY A R AFVA +LP I D C+ + V V+VP +
Sbjct: 348 DSGTSITYLELRAYRALRKAFVAHM-SLPTVDASEIGLDLCFQGPAGAVDQDVQVQVPKL 406
Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
+F GG L LPA N+++ +G C S GLSIIGN QQ+ Q +D A +
Sbjct: 407 VLHFDGGADLDLPAENYMVLDSASGALCLTVMAS-RGLSIIGNFQQQNFQFVYDVAGDTL 465
Query: 489 GFGPNVC 495
F P C
Sbjct: 466 SFAPAEC 472
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 241 bits (615), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 157/441 (35%), Positives = 217/441 (49%), Gaps = 50/441 (11%)
Query: 104 HQHSFHARMQRDVKRVATLVRR---------LSGGGADAAKHEV---------------- 138
H+ SF A RD+ R+ TL +R LS + K V
Sbjct: 115 HKESFVASTTRDLTRIQTLHKRILEKKNQNALSRLNKEEPKQPVVAPAASPESYPANGLS 174
Query: 139 -QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
Q T + SG+ GSGEYF+ + +G+PPR +++D+GSD+ W+QC PC C+ Q+ P
Sbjct: 175 GQLMAT-LESGVSLGSGEYFMDVFIGTPPRHFSLILDTGSDLNWIQCVPCYDCFVQNGPY 233
Query: 198 FDPADSASFSGVSCSSAVCDRLENAG----CHAGR--CRYEVSYGDGSYTKGTLALETLT 251
+DP +S+SF + C C + + C A C Y YGD S T G ALET T
Sbjct: 234 YDPKESSSFKNIGCHDPRCHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFT 293
Query: 252 IGRTV---------VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
+ T V+NV GCGH N+G+F GAAGLLGLG G +S QL G +FSY
Sbjct: 294 VNLTSPAGKSEFKRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSY 353
Query: 303 CLVSRG--TGSSGSLVFGREALPVGAAWV---PLVRNPRAP--SFYYVGLSGLGVGGMRI 355
CLV R T S L+FG + + V LV P +FYYV + + VGG +
Sbjct: 354 CLVDRNSDTNVSSKLIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVL 413
Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCY 415
I E+ + L+ G G ++D+GT ++ P+YE +DAFV + P I D CY
Sbjct: 414 KIPEETWHLSPEGAGGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIKDFPILDPCY 473
Query: 416 NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQ 474
N+SG + +P F G V P N+ I ++ C A +P S LSIIGN QQ
Sbjct: 474 NVSGVEKMELPEFRILFEDGAVWNFPVENYFIKLEPEEIVCLAILGTPRSALSIIGNYQQ 533
Query: 475 EGIQISFDGANGFVGFGPNVC 495
+ I +D +G+ P C
Sbjct: 534 QNFHILYDTKKSRLGYAPMKC 554
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 241 bits (615), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 139/346 (40%), Positives = 190/346 (54%), Gaps = 17/346 (4%)
Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
+G+P + ++D+GSD+VW QC+PC C+KQS PVFDP+ S++++ V CSSA C L
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPT 232
Query: 222 AGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM-FVGAAGLL 279
+ C A +C Y +YGD S T+G LA ET T+ ++ + V GCG N+G F AGL+
Sbjct: 233 SKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLV 292
Query: 280 GLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-------LPVGAAWVPLV 332
GLG G +SLV QLG FSYCL S ++ L+ G A PL+
Sbjct: 293 GLGRGPLSLVSQLGLDK---FSYCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPLI 349
Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
+NP PSFYYV L + VG RI + F + G GV++D+GT++T L Y A +
Sbjct: 350 KNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALK 409
Query: 393 DAFVAQTGNLPRASGVSI-FDTCYN--LSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
AF AQ LP A G + D C+ G V VP + F+F GG L LPA N+++
Sbjct: 410 KAFAAQMA-LPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLD 468
Query: 450 DDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+G C S GLSIIGN QQ+ Q +D + + F P C
Sbjct: 469 GGSGALCLTVMGS-RGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQC 513
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 241 bits (614), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 140/357 (39%), Positives = 201/357 (56%), Gaps = 17/357 (4%)
Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASF 206
G+ G+ Y V IG+G+PP +V D+GSD WVQC+PC CYKQ D +FDPA S+++
Sbjct: 155 GLSLGTANYVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSSTY 214
Query: 207 SGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH 266
+ VSC+ C L+ +GC+AG C Y + YGDGSYT G A +TL + + +K GCG
Sbjct: 215 ANVSCADPACADLDASGCNAGHCLYGIQYGDGSYTVGFFAKDTLAVAQDAIKGFKFGCGE 274
Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF---GREALP 323
KN+G+F AGLLGLG G S+ Q + GG+FSYCL + + ++G L F +
Sbjct: 275 KNRGLFGQTAGLLGLGRGPTSITVQAYEKYGGSFSYCLPAS-SAATGYLEFGPLSPSSSG 333
Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI-PISEDLFRLTQMGDDGVVMDTGTAVTR 382
A P++ + + P+FYYVGL+G+ VGG ++ I E +F + G ++D+GT +TR
Sbjct: 334 SNAKTTPMLTD-KGPTFYYVGLTGIRVGGKQLGAIPESVFS-----NSGTLVDSGTVITR 387
Query: 383 LP--TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
LP A + A +A+ SI DTCY+ +G V +PTVS F GG L L
Sbjct: 388 LPDTAYAALSSAFAAAMAASGYKKAAAYSILDTCYDFTGLSQVSLPTVSLVFQGGACLDL 447
Query: 441 PASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
AS + + + C FA + + I+GN QQ + +D + VGF P C
Sbjct: 448 DASGIVYAISQS-QVCLGFASNGDDESVGIVGNTQQRTYGVLYDVSKKVVGFAPGAC 503
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 156/391 (39%), Positives = 212/391 (54%), Gaps = 20/391 (5%)
Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
RD RV ++ RLS G K Q V SG G+G+Y V +G+G+P + ++
Sbjct: 92 RDQNRVDSIHARLSSRGMFPEK---QATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIF 148
Query: 174 DSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAG 227
D+GSDI W QC+PC + CYKQ +P +P+ S S+ +SCSSA+C + + C +
Sbjct: 149 DTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSALCKLVASGKKFSQSCSSS 208
Query: 228 RCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSM 286
C Y+V YGDGSY+ G A ETLT+ + V KN GCG +N G+F GAAGLLGLG +
Sbjct: 209 TCLYQVQYGDGSYSIGFFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKL 268
Query: 287 SLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLS 346
+L Q FSYCL + + S G L G + + + PL + + FY + ++
Sbjct: 269 ALPSQTAKTYKKLFSYCLPA-SSSSKGYLSLGGQ-VSKSVKFTPLSADFDSTPFYGLDIT 326
Query: 347 GLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
GL VGG ++ I E F G V+D+GT +TRL AY AF + P S
Sbjct: 327 GLSVGGRKLSIDESAFSA------GTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTS 380
Query: 407 GVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPS 464
G SIFDTCY+ S + +VR+P V F GG + + S L PV+ C AFA S
Sbjct: 381 GYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDS 440
Query: 465 GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
SI GN+QQ Q+ +DGA G VGF P C
Sbjct: 441 DTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 471
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 145/357 (40%), Positives = 204/357 (57%), Gaps = 16/357 (4%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSAS 205
SG G+G Y V +G+G+P +V D+GSD WVQCQPC CY+Q + +FDPA S++
Sbjct: 171 SGRALGTGNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST 230
Query: 206 FSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGC 264
++ VSC++ C L GC G C Y V YGDGSY+ G A++TLT+ VK GC
Sbjct: 231 YANVSCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 290
Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
G +N+G+F AAGLLGLG G SL Q + GG F++CL +R TG +G L FG +L
Sbjct: 291 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTG-TGYLDFGAGSLAA 349
Query: 325 GAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
+A + P++ + P+FYYVG++G+ VGG + I + +F G ++D+GT +TR
Sbjct: 350 ASARLTTPMLTD-NGPTFYYVGMTGIRVGGQLLSIPQSVFATA-----GTIVDSGTVITR 403
Query: 383 LPTPAYEAFR--DAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
LP AY + R A +A VS+ DTCY+ +G V +PTVS F GG L +
Sbjct: 404 LPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDV 463
Query: 441 PASNFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
AS + A C AFA + G + I+GN Q + +++D VGF P C
Sbjct: 464 DASGIMY-AASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 240 bits (612), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 135/371 (36%), Positives = 203/371 (54%), Gaps = 21/371 (5%)
Query: 143 TDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPAD 202
TD S + G G+Y I +G+P + ++ D+GSD++W+QC+PC C+ Q DP+FDP
Sbjct: 27 TDYESPVASGGGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEG 86
Query: 203 SASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VV 257
S+S++ +SC +CD L C + C Y YGDGS T+GTL+ ET+T+ T
Sbjct: 87 SSSYTTMSCGDTLCDSLPRKSC-SPNCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAA 145
Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS--L 315
KN+A GCGH N+G F A+GL+GLG G++S V QLG G FSYCLV S + +
Sbjct: 146 KNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPM 205
Query: 316 VFGREA------LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
FG E+ + A+ P++ NP SFYYV L + + G + I F + G
Sbjct: 206 FFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGS 265
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVS---VRV 425
G++ D+GT +T LP Y+ A ++ + P G S D CY++SG + ++
Sbjct: 266 GGMIFDSGTTLTLLPDAPYQIVLRALRSKV-SFPEIDGSSAGLDLCYDVSGSKASYKKKI 324
Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTF-CFAFAPSPSGLSIIGNIQQEGIQISFDGA 484
P + F+F G LP N+ I +DAGT C A S + I GN+ Q+ ++ +D
Sbjct: 325 PAMVFHFEGAD-HQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMMQQNFRVMYDIG 383
Query: 485 NGFVGFGPNVC 495
+ +G+ P+ C
Sbjct: 384 SSKIGWAPSQC 394
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 240 bits (612), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 145/402 (36%), Positives = 207/402 (51%), Gaps = 16/402 (3%)
Query: 107 SFHARMQRDVKRVATLVRRLSGGGADAAKHEV---QDFGTDVVSGMDQGSGEYFVRIGVG 163
S+ + +K R + GG A K V +D + SG S Y +++G G
Sbjct: 72 SWWTAVSESIKGDTARYRAMVKGGWSAGKTMVNPQEDADIPLASGQAISSSNYIIKLGFG 131
Query: 164 SPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCD--RLEN 221
+PP+S Y V+D+GS+I W+ C PCS C + P F+P+ S++++ ++C+S C R+
Sbjct: 132 TPPQSFYTVLDTGSNIAWIPCNPCSGCSSKQQP-FEPSKSSTYNYLTCASQQCQLLRVCT 190
Query: 222 AGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGL 281
++ C YGD S L+ ETL++G V+N GC + +G+ L+G
Sbjct: 191 KSDNSVNCSLTQRYGDQSEVDEILSSETLSVGSQQVENFVFGCSNAARGLIQRTPSLVGF 250
Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGS-SGSLVFGREALPV-GAAWVPLVRNPRAPS 339
G +S V Q FSYCL S + + +GSL+ G+EAL G + PL+ N R PS
Sbjct: 251 GRNPLSFVSQTATLYDSTFSYCLPSLFSSAFTGSLLLGKEALSAQGLKFTPLLSNSRYPS 310
Query: 340 FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQT 399
FYYVGL+G+ VG + I L + G ++D+GT +TRL PAY A RD+F +Q
Sbjct: 311 FYYVGLNGISVGEELVSIPAGTLSLDESTGRGTIIDSGTVITRLVEPAYNAMRDSFRSQL 370
Query: 400 GNLPRASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLIP-VDDAGTFCF 457
NL AS +FDTCYN SG V P ++ +F LTLP N L P DD C
Sbjct: 371 SNLTMASPTDLFDTCYNRPSG--DVEFPLITLHFDDNLDLTLPLDNILYPGNDDGSVLCL 428
Query: 458 AFAPSPSG----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
AF P G LS GN QQ+ ++I D A +G C
Sbjct: 429 AFGLPPGGGDDVLSTFGNYQQQKLRIVHDVAESRLGIASENC 470
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 158/434 (36%), Positives = 215/434 (49%), Gaps = 32/434 (7%)
Query: 82 ELVHRDKMSSSSNTTNNMHYHRH----------QHSFHARMQRDVKRVATLVRRLSGGGA 131
E+ K++SS N HRH + S + RD R A + +LS
Sbjct: 45 EVCSGQKVTSSKNGATLPLVHRHGPCSPVMSKEKPSHEETLGRDQLRAANIHAKLSSPRN 104
Query: 132 DAAKHEVQDFGTDV--VSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS- 188
+AK E+Q G + SG G+ EY + + +G+P +Q M ID+GSD+ WVQC PC+
Sbjct: 105 SSAK-ELQQSGVTIPTSSGYSLGTPEYVITVSLGTPAVTQVMSIDTGSDVSWVQCAPCAA 163
Query: 189 -QCYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENAGCHAGRCRYEVSYGDGSYTKGTL 245
C Q D +FDPA SA++S SCSSA C +L E GC C+Y V Y D S T GT
Sbjct: 164 QSCSSQKDKLFDPAKSATYSAFSCSSAQCAQLGGEGNGCLNSHCQYIVKYVDHSNTTGTY 223
Query: 246 ALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL 304
+TL + + VKN GC H+ G GL+GLGG + SLV Q G AFSYCL
Sbjct: 224 GSDTLGLTTSDAVKNFQFGCSHRANGFVGQLDGLMGLGGDTESLVSQTAATYGKAFSYCL 283
Query: 305 VSRGTGSSGSLVFGREALPVGAAW---VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
+ + G L G A ++ PLVR P+FY V L + V G ++ + +
Sbjct: 284 PPSSSSAGGFLTLGAAAGGTSSSRYSRTPLVRF-NVPTFYGVFLQAITVAGTKLNVPASV 342
Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV 421
F V+D+GT +T+LP AY+A R AF + P A+ V I DTC++ SG
Sbjct: 343 F------SGASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGILDTCFDFSGIK 396
Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISF 481
+VRVP V+ FS G V+ L S AG F I+GN+QQ ++ F
Sbjct: 397 TVRVPVVTLTFSRGAVMDLDVSGIFY----AGCLAFTATAQDGDTGILGNVQQRTFEMLF 452
Query: 482 DGANGFVGFGPNVC 495
D +GF P C
Sbjct: 453 DVGGSTLGFRPGAC 466
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 239 bits (610), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 167/472 (35%), Positives = 239/472 (50%), Gaps = 55/472 (11%)
Query: 71 NTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRH-----------QHSFHARMQRDVKRV 119
NT+ +A + +L+ ++ + + +H R + SF Q+D R+
Sbjct: 44 NTAVADAGCDGKLLAEEEEQKDRSPSLKLHMSRRSPAEATAGRTRKDSFLESAQKDGVRI 103
Query: 120 ATLVRRLS------GGGADAAKHEVQDFGTDVV----SGMDQGSGEYFVRIGVGSPPRSQ 169
AT+ RR++ G A+ + +V SG+ GSGEY V + VG+PPR
Sbjct: 104 ATMHRRVALQAQAQPGRRSASSSPRRALSERLVATVESGVAVGSGEYLVEVYVGTPPRRF 163
Query: 170 YMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG----CH 225
M++D+GSD+ W+QC PC C+ Q PVFDP S S+ V+C C + C
Sbjct: 164 QMIMDTGSDLNWLQCAPCLDCFDQRGPVFDPMASTSYRNVTCGDTRCGLVSPPAAPRTCR 223
Query: 226 AGR---CRYEVSYGDGSYTKGTLALETLTIGRTV-----VKNVAIGCGHKNQGMFVGAAG 277
+ R C Y YGD S T G LALE T+ T V V +GCGH+N+G+F GAAG
Sbjct: 224 SSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTASSSRRVDGVVLGCGHRNRGLFHGAAG 283
Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPR- 336
LLGLG G +S QL G AFSYCLV G+ +VFG + + L+ +P+
Sbjct: 284 LLGLGRGPLSFASQLRAVYGHAFSYCLVDHGSAVGSKIVFGDDNV--------LLSHPQL 335
Query: 337 -----APS-----FYYVGLSGLGVGGMRIPISEDLFRLTQM-GDDGVVMDTGTAVTRLPT 385
APS FYYV L G+ VGG + I + + +++ G G ++D+GT ++ P
Sbjct: 336 NYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKEDGSGGTIIDSGTTLSYFPE 395
Query: 386 PAYEAFRDAFVAQTGN-LPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
PAY+A R AFV + P + + CYN+SG V VP S F+ G V PA N
Sbjct: 396 PAYKAIRQAFVDRMDKAYPLIADFPVLSPCYNVSGVERVEVPEFSLLFADGAVWDFPAEN 455
Query: 445 FLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ I +D G C A +P S +SIIGN QQ+ + +D + +GF P C
Sbjct: 456 YFIRLDTEGIMCLAVLGTPRSAMSIIGNYQQQNFHVLYDLHHNRLGFAPRRC 507
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 155/391 (39%), Positives = 211/391 (53%), Gaps = 20/391 (5%)
Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
RD RV ++ RLS G K V SG G+G+Y V +G+G+P + ++
Sbjct: 32 RDQNRVDSIHARLSSRGMFPEKQATT---LPVQSGASIGAGDYVVTVGLGTPKKEFTLIF 88
Query: 174 DSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAG 227
D+GSDI W QC+PC + CYKQ +P +P+ S S+ +SCSSA+C + + C +
Sbjct: 89 DTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSALCKLVASGKKFSQSCSSS 148
Query: 228 RCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSM 286
C Y+V YGDGSY+ G A ETLT+ + V KN GCG +N G+F GAAGLLGLG +
Sbjct: 149 TCLYQVQYGDGSYSIGFFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKL 208
Query: 287 SLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLS 346
+L Q FSYCL + + S G L G + + + PL + + FY + ++
Sbjct: 209 ALPSQTAKTYKKLFSYCLPA-SSSSKGYLSLGGQ-VSKSVKFTPLSADFDSTPFYGLDIT 266
Query: 347 GLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
GL VGG ++ I E F G V+D+GT +TRL AY AF + P S
Sbjct: 267 GLSVGGRQLSIDESAFSA------GTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTS 320
Query: 407 GVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPS 464
G SIFDTCY+ S + +VR+P V F GG + + S L PV+ C AFA S
Sbjct: 321 GYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDS 380
Query: 465 GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
SI GN+QQ Q+ +DGA G VGF P C
Sbjct: 381 DTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 411
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 238 bits (608), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 151/396 (38%), Positives = 220/396 (55%), Gaps = 20/396 (5%)
Query: 107 SFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPP 166
+ R++RD R A + R+ SG G D + + T + G + EY + +G+GSP
Sbjct: 76 TLEERLRRDQLRAAYIKRKFSGAG-DIEQSDAATVPTTL--GTSLSTLEYVITVGIGSPA 132
Query: 167 RSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL----ENA 222
+Q M +D+GSD+ WVQC+PCSQC+ + D +FDP+ S+++S SCSSA C +L E
Sbjct: 133 VTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSSSSTYSPFSCSSAPCAQLSQSQEGN 192
Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAA-GLLGL 281
GC + +C+Y V+YGD S T GT + +TLT+G + + + GC G F GL+GL
Sbjct: 193 GCMSSQCQYIVNYGDSSSTTGTYSSDTLTLGSSAMTDFQFGCSQSESGGFNDQTDGLMGL 252
Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFY 341
GGG+ SL Q G G AFSYCL +GSSG L G + G P++R+ + P++Y
Sbjct: 253 GGGAQSLASQTAGTFGTAFSYCLPPT-SGSSGFLTLGTGS--SGFVKTPMLRSTQIPTYY 309
Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
V L + VG ++ + +F G +MD+GT +TRLP AY A AF A
Sbjct: 310 VVLLESIKVGSQQLNLPTSVFSA------GSLMDSGTIITRLPPTAYSALSSAFKAGMQQ 363
Query: 402 LPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP 461
P A+ I DTC++ SG S+ +PTV+ FSGG + L ++ + + C AF P
Sbjct: 364 YPPATPSGILDTCFDFSGQSSISIPTVTLVFSGGAAVDLAFDGIMLEISSS-IRCLAFTP 422
Query: 462 S--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ S L IIGN+QQ ++ +D G VGF C
Sbjct: 423 NGDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 458
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 238 bits (607), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 140/392 (35%), Positives = 211/392 (53%), Gaps = 20/392 (5%)
Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
R +R +KR + +L + E + G+GE+ +++ +G+P S
Sbjct: 79 RFKRAIKRSQDRLEKLQMSVDEVKAVEAPVYA---------GNGEFLMKMAIGTPSLSFS 129
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
++D+GSD+ W QC+PC+ CY Q P++DP+ S+++S V CSS++C L C C
Sbjct: 130 AILDTGSDLTWTQCKPCTDCYPQPTPIYDPSQSSTYSKVPCSSSMCQALPMYSCSGANCE 189
Query: 231 YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGS-MSLV 289
Y SYGD S T+G L+ E+ T+ + ++A GCG +N+G G L G +SL+
Sbjct: 190 YLYSYGDQSSTQGILSYESFTLTSQSLPHIAFGCGQENEGGGFSQGGGLVGFGRGPLSLI 249
Query: 290 GQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV---PLVRNPRAPSFYYVGL 345
QLG G FSYCLVS + S S +F + + A V PLV++ P+FYY+ L
Sbjct: 250 SQLGQSLGNKFSYCLVSITDSPSKTSPLFIGKTASLNAKTVSSTPLVQSRSRPTFYYLSL 309
Query: 346 SGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA 405
G+ VGG + I++ F L G GV++D+GT VT L Y+ + A ++ NLP+
Sbjct: 310 EGISVGGQLLDIADGTFDLQLDGTGGVIIDSGTTVTYLEQSGYDVVKKAVISSI-NLPQV 368
Query: 406 SGVSI-FDTCYN-LSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP 463
G +I D C+ SG + PT++F+F G LP N+ I D +G C A PS
Sbjct: 369 DGSNIGLDLCFEPQSGSSTSHFPTITFHFEGAD-FNLPKENY-IYTDSSGIACLAMLPS- 425
Query: 464 SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+G+SI GNIQQ+ QI +D + F P VC
Sbjct: 426 NGMSIFGNIQQQNYQILYDNERNVLSFAPTVC 457
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 238 bits (606), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 163/450 (36%), Positives = 232/450 (51%), Gaps = 42/450 (9%)
Query: 71 NTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGG- 129
+SSD R ++ LVHR + S + + S R++RD R +V + +GG
Sbjct: 9 TSSSDPNRASVPLVHRHGPCAPSAASGG------KPSLAERLRRDRARTNYIVTKATGGR 62
Query: 130 GADAAKHEVQDFGTDVVS--GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC 187
A A + GT + + G S EY V +G+G+P Q ++ID+GSD+ WVQC+PC
Sbjct: 63 TAATALSDAAGGGTSIPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPC 122
Query: 188 S--QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA----GC------HAGRCRYEVSY 235
+CY Q DP+FDP+ S+S++ V C S C +L GC A C Y + Y
Sbjct: 123 GAGECYAQKDPLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEY 182
Query: 236 GDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGG 294
G+ + T G + ETLT+ VV + GCG G + GLLGLGG SLV Q
Sbjct: 183 GNRATTTGVYSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSS 242
Query: 295 QTGGAFSYCLVSRGTGSSGSLVFG------REALPVGAAWVPLVRNPRAPSFYYVGLSGL 348
Q GG FSYCL +G +G L G G ++ P+ R P P+FY V L+G+
Sbjct: 243 QFGGPFSYCLPPT-SGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGI 301
Query: 349 GVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF---VAQTGNLPRA 405
VGG + I F G+V+D+GT +T LP AY A R AF +++ LP +
Sbjct: 302 SVGGAPLAIPPSAF------SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPS 355
Query: 406 SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG 465
+G + DTCY+ +G +V VPT+S FSGG + L A ++ VD G FA A + +
Sbjct: 356 NG-GVLDTCYDFTGHANVTVPTISLTFSGGATIDLAAPAGVL-VD--GCLAFAGAGTDNA 411
Query: 466 LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ IIGN+ Q ++ +D G VGF C
Sbjct: 412 IGIIGNVNQRTFEVLYDSGKGTVGFRAGAC 441
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 159/462 (34%), Positives = 232/462 (50%), Gaps = 63/462 (13%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV-- 138
+EL HRD +SN + ++RD+ R+ + +R+S +A E
Sbjct: 1 MELKHRDHRQPTSN---------RRSLLLESLKRDITRLQSFQKRVSEKLTASANPEAYL 51
Query: 139 ------------------QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIV 180
++ + V SG + G+GEYF+ + VG+PPR ++ID+GSD+
Sbjct: 52 EMTNSSSTKSPPSPSSSWEEVDSTVESGAELGAGEYFMDVFVGNPPRHFLLIIDTGSDLT 111
Query: 181 WVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH-------AGRCRYEV 233
W+QC+PC C+ QS PVFDP+ S SF + C++A CD + + C C+Y
Sbjct: 112 WLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAACDLVVHDECRDNSSKTSPKTCKYFY 171
Query: 234 SYGDGSYTKGTLALETLTIGRT------VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMS 287
YGD S T G LALE+L++ + ++++ IGCGH N+G+F GA GLLGLG G++S
Sbjct: 172 WYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGALS 231
Query: 288 LVGQL-GGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA-----------AWVPLVR-N 334
QL G +FSYCLV R S S A+ GA + P VR N
Sbjct: 232 FPSQLRSSPIGQSFSYCLVDRTNNLSVS-----SAISFGAGFALSRHFDQMKFTPFVRTN 286
Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
+FYY+G+ G+ + +PI + F + G G ++D+GT +T L AY A A
Sbjct: 287 NSVETFYYLGIQGIKIDQELLPIPAERFAIATNGSGGTIIDSGTTLTYLNRDAYRAVESA 346
Query: 395 FVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI-PVDDAG 453
F+A+ + PRA I CYN +G +V P +S F G L LP N+ I P
Sbjct: 347 FLARI-SYPRADPFDILGICYNATGRAAVPFPALSIVFQNGAELDLPQENYFIQPDPQEA 405
Query: 454 TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C A P+ G+SIIGN QQ+ I +D + +GF C
Sbjct: 406 KHCLAILPT-DGMSIIGNFQQQNIHFLYDVQHARLGFANTDC 446
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 149/467 (31%), Positives = 243/467 (52%), Gaps = 54/467 (11%)
Query: 60 LFERHNNISSSNTSS----DEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRD 115
F H I+SS S + L+L H + S N+T+ + F +D
Sbjct: 8 FFSAHLAIASSLKDSGLKHKQPDMQLKLYHMTSLKSPPNSTSLL--------FAYMFAKD 59
Query: 116 VKRVATLVRRLSGGG-ADAAKHEV--QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMV 172
+R+ RL+ A+A+ +V + G + SG+ GSG Y+V++G+GSP + M+
Sbjct: 60 EERIRYFHSRLAKNSDANASSKKVGPKLAGIPLKSGLSMGSGNYYVKMGLGSPTKYYTMI 119
Query: 173 IDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSC-------------SSAVCDR 218
+D+GS W+QCQPC+ C+ Q DPVF+P+ S ++ V C + C +
Sbjct: 120 VDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSSQCSSLKSATLNEPTCSK 179
Query: 219 LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAG 277
NA C Y+ SYGD S++ G L+ + LT+ + + + GCG NQG+F G
Sbjct: 180 QSNA------CVYKASYGDSSFSLGYLSQDVLTLTPSQTLSSFVYGCGQDNQGLFGRTDG 233
Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----GTGSSGSLVFGREALPVGAAW--VPL 331
++GL +S++ QL G+ G AFSYCL + + G L G +L +++ PL
Sbjct: 234 IIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSIGTSSLTPSSSYKFTPL 293
Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
++NP PS Y++ L + V G + ++ +++ ++D+GT +TRLPTP Y
Sbjct: 294 LKNPNNPSLYFIDLESITVAGRPLGVAASSYKVP------TIIDSGTVITRLPTPVYTTL 347
Query: 392 RDAFVA-QTGNLPRASGVSIFDTCY--NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIP 448
++A+V + +A G+S+ DTC+ +L+G +S P + F GG L L N L+
Sbjct: 348 KNAYVTILSKKYQQAPGISLLDTCFKGSLAG-ISEVAPDIRIIFKGGADLQLKGHNSLVE 406
Query: 449 VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ + G C A A S S ++IIGN QQ+ +++++D N VGF P C
Sbjct: 407 L-ETGITCLAMAGS-SSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 138/368 (37%), Positives = 197/368 (53%), Gaps = 15/368 (4%)
Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
F + VVSG GSG+YFV +G+PP+ +++DSGSD++WVQC PC QCY Q P++ P
Sbjct: 49 FQSPVVSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPCRQCYAQDSPLYVP 108
Query: 201 ADSASFSGVSCSSAVCDRL---ENAGC---HAGRCRYEVSYGDGSYTKGTLALETLTIGR 254
++S++FS V C S+ C + E C + G C YE Y D S +KG A E+ T+
Sbjct: 109 SNSSTFSPVPCLSSDCLLIPATEGFPCDFRYPGACAYEYLYADTSSSKGVFAYESATVDG 168
Query: 255 TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--GTGSS 312
+ VA GCG NQG F A G+LGLG G +S Q+G G F+YCLV+ T S
Sbjct: 169 VRIDKVAFGCGSDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVS 228
Query: 313 GSLVFGREALPV--GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
SL+FG E + + P+V NP++P+ YYV + + VGG +PIS+ + + +G+
Sbjct: 229 SSLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLLGNG 288
Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSF 430
G + D+GT +T AY AF + + PRA V D C L+G P+ +
Sbjct: 289 GSIFDSGTTLTYWFPSAYSHILAAFDSGV-HYPRAESVQGLDLCVELTGVDQPSFPSFTI 347
Query: 431 YFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSP-SGLSIIGNIQQEGIQISFDGANGF 487
F G V A N+ + V C A A SP G + IGN+ Q+ + +D
Sbjct: 348 EFDDGAVFQPEAENYFVDV-APNVRCLAMAGLASPLGGFNTIGNLLQQNFFVQYDREENL 406
Query: 488 VGFGPNVC 495
+GF P C
Sbjct: 407 IGFAPAKC 414
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 138/370 (37%), Positives = 194/370 (52%), Gaps = 15/370 (4%)
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
DF + VVSG GSG+YFV +G+PP+ +++DSGSD++WVQC PC QCY Q P++
Sbjct: 48 HDFQSPVVSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPCLQCYAQDTPLY 107
Query: 199 DPADSASFSGVSCSSAVCDRL---ENAGC---HAGRCRYEVSYGDGSYTKGTLALETLTI 252
P++S++F+ V C S C + E C + G C YE Y D S +KG A E+ T+
Sbjct: 108 APSNSSTFNPVPCLSPECLLIPATEGFPCDFHYPGACAYEYRYADTSLSKGVFAYESATV 167
Query: 253 GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--GTG 310
+ VA GCG NQG F A G+LGLG G +S Q+G G F+YCLV+ T
Sbjct: 168 DDVRIDKVAFGCGRDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTS 227
Query: 311 SSGSLVFGREALPV--GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
S L+FG E + + P+V N R P+ YYV + + VGG +PIS + L +G
Sbjct: 228 VSSWLIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLG 287
Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
+ G + D+GT VT PAY AF + PRA+ V D C +++G P+
Sbjct: 288 NGGSIFDSGTTVTYWLPPAYRNILAAF-DKNVRYPRAASVQGLDLCVDVTGVDQPSFPSF 346
Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS---GLSIIGNIQQEGIQISFDGAN 485
+ GG V N+ + V C A A PS G + IGN+ Q+ + +D
Sbjct: 347 TIVLGGGAVFQPQQGNYFVDV-APNVQCLAMAGLPSSVGGFNTIGNLLQQNFLVQYDREE 405
Query: 486 GFVGFGPNVC 495
+GF P C
Sbjct: 406 NRIGFAPAKC 415
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 163/448 (36%), Positives = 231/448 (51%), Gaps = 42/448 (9%)
Query: 73 SSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGG-GA 131
SSD R ++ LVHR + S + + S R++RD R +V + +GG A
Sbjct: 91 SSDPNRASVPLVHRHGPCAPSAASGG------KPSLAERLRRDRARTNYIVTKATGGRTA 144
Query: 132 DAAKHEVQDFGTDVVS--GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS- 188
A + GT + + G S EY V +G+G+P Q ++ID+GSD+ WVQC+PC
Sbjct: 145 ATALSDAAGGGTSIPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGA 204
Query: 189 -QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA----GC------HAGRCRYEVSYGD 237
+CY Q DP+FDP+ S+S++ V C S C +L GC A C Y + YG+
Sbjct: 205 GECYAQKDPLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGN 264
Query: 238 GSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT 296
+ T G + ETLT+ VV + GCG G + GLLGLGG SLV Q Q
Sbjct: 265 RATTTGVYSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQF 324
Query: 297 GGAFSYCLVSRGTGSSGSLVFG------REALPVGAAWVPLVRNPRAPSFYYVGLSGLGV 350
GG FSYCL +G +G L G G ++ P+ R P P+FY V L+G+ V
Sbjct: 325 GGPFSYCLPPT-SGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISV 383
Query: 351 GGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF---VAQTGNLPRASG 407
GG + I F G+V+D+GT +T LP AY A R AF +++ LP ++G
Sbjct: 384 GGAPLAIPPSAF------SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNG 437
Query: 408 VSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS 467
+ DTCY+ +G +V VPT+S FSGG + L A ++ VD G FA A + + +
Sbjct: 438 -GVLDTCYDFTGHANVTVPTISLTFSGGATIDLAAPAGVL-VD--GCLAFAGAGTDNAIG 493
Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
IIGN+ Q ++ +D G VGF C
Sbjct: 494 IIGNVNQRTFEVLYDSGKGTVGFRAGAC 521
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 153/441 (34%), Positives = 228/441 (51%), Gaps = 41/441 (9%)
Query: 80 NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
L L H + SS +T+ SF + +D +RV L RL+ + +
Sbjct: 32 QLNLYHVKGLDSSQTSTS-------PFSFSDMITKDEERVRFLHSRLTNKESASNSATTD 84
Query: 140 DFG------TDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYK 192
G T + SG+ GSG Y+V+IGVG+P + M++D+GS + W+QCQPC C+
Sbjct: 85 KLGGPSLVSTPLKSGLSIGSGNYYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHV 144
Query: 193 QSDPVFDPADSASFSG-----VSCSSAVCDRLENAGCH--AGRCRYEVSYGDGSYTKGTL 245
Q DP+F P+ S ++ CSS L GC G C Y+ SYGD S++ G L
Sbjct: 145 QVDPIFTPSVSKTYKALSCSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYL 204
Query: 246 ALETLTIGRTVVKN--VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC 303
+ + LT+ + + GCG NQG+F +AG++GL +S++GQL + G AFSYC
Sbjct: 205 SQDVLTLTPSAAPSSGFVYGCGQDNQGLFGRSAGIIGLANDKLSMLGQLSNKYGNAFSYC 264
Query: 304 LVSRGTGSSGSLVFGREALPVGAA--------WVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
L S + S V G L +GA+ + PLV+NP+ PS Y++GL+ + V G +
Sbjct: 265 LPSSFSAQPNSSVSGF--LSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPL 322
Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVA-QTGNLPRASGVSIFDTC 414
+S + + ++D+GT +TRLP Y A + +FV + +A G SI DTC
Sbjct: 323 GVSASSYNVP------TIIDSGTVITRLPVAIYNALKKSFVMIMSKKYAQAPGFSILDTC 376
Query: 415 YNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQ 474
+ S VP + F GG L L N L+ ++ GT C A A S + +SIIGN QQ
Sbjct: 377 FKGSVKEMSTVPEIRIIFRGGAGLELKVHNSLVEIEK-GTTCLAIAASSNPISIIGNYQQ 435
Query: 475 EGIQISFDGANGFVGFGPNVC 495
+ +++D AN +GF P C
Sbjct: 436 QTFTVAYDVANSKIGFAPGGC 456
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 133/345 (38%), Positives = 195/345 (56%), Gaps = 9/345 (2%)
Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC 211
GSGEY ++I +G+PP+ ++D+GSD+ WVQC PC++C++Q DP+F P S+S+S SC
Sbjct: 4 GSGEYVLQISLGTPPQQFSAIVDTGSDLCWVQCAPCARCFEQPDPLFIPLASSSYSNASC 63
Query: 212 SSAVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
+ ++CD L C C Y SYGDGS T+G A ET+T+ + + + GCGH +G
Sbjct: 64 TDSLCDALPRPTCSMRNTCTYSYSYGDGSNTRGDFAFETVTLNGSTLARIGFGCGHNQEG 123
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG-TGSSGSLVFGREALPVGAAWV 329
F GA GL+GLG G +SL QL FSYCLV + TG+ + FG A A++
Sbjct: 124 TFAGADGLIGLGQGPLSLPSQLNSSFTHIFSYCLVDQSTTGTFSPITFGNAAENSRASFT 183
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
PL++N PS+YYVG+ + VG R+P FR+ G GV++D+GT +T A+
Sbjct: 184 PLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVILDSGTTITYWRLAAFI 243
Query: 390 AFRDAFVAQTGNLPRASGVSI-FDTCYNLSGF--VSVRVPTVSFYFSGGPVLTLPASNFL 446
Q + P A + CY++S S+ +P+++ + + +P SN
Sbjct: 244 PILAELRRQI-SYPEADPTPYGLNLCYDISSVSASSLTLPSMTVHLTNVD-FEIPVSNLW 301
Query: 447 IPVDDAG-TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
+ VD+ G T C A + S SIIGN+QQ+ I D AN VGF
Sbjct: 302 VLVDNFGETVCTAMSTS-DQFSIIGNVQQQNNLIVTDVANSRVGF 345
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 155/460 (33%), Positives = 235/460 (51%), Gaps = 41/460 (8%)
Query: 58 NELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHS-----FHARM 112
++ HNNI S S + + R +TT M HR S +M
Sbjct: 32 KKILSVHNNIWSPKKSYEAST---SCFSRSLGKGRESTTLEMK-HRELCSGKTIDLGKKM 87
Query: 113 QR----DVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRS 168
+R D RV +L ++ + + V + + SG+ S Y V + +G ++
Sbjct: 88 RRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGG--KN 145
Query: 169 QYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR 228
+++D+GSD+ WVQCQPC CY Q P++DP+ S+S+ V C+S+ C L A ++G
Sbjct: 146 MSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGP 205
Query: 229 C-----------RYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAG 277
C Y VSYGDGSYT+G LA E++ +G T ++N GCG N+G+F G++G
Sbjct: 206 CGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLENFVFGCGRNNKGLFGGSSG 265
Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL----PVGAAWVPLVR 333
L+GLG S+SLV Q G FSYCL S G+SGSL FG ++ ++ PLV+
Sbjct: 266 LMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQ 325
Query: 334 NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRD 393
NP+ SFY + L+G +GG+ + S R G+++D+GT +TRLP Y+A +
Sbjct: 326 NPQLRSFYILNLTGASIGGVELK-SSSFGR-------GILIDSGTVITRLPPSIYKAVKI 377
Query: 394 AFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN-FLIPVDDA 452
F+ Q P A G SI DTC+NL+ + + +P + F G L + + F DA
Sbjct: 378 EFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDA 437
Query: 453 GTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
C A A + + IIGN QQ+ ++ +D +G
Sbjct: 438 SLVCLALASLSYENEVGIIGNYQQKNQRVIYDSTQERLGI 477
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 159/463 (34%), Positives = 232/463 (50%), Gaps = 63/463 (13%)
Query: 80 NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV- 138
+EL HRD + N + ++RD+ R+ + +R+S +A E
Sbjct: 84 KMELKHRDHGQPTRN---------RRSLLLESLKRDITRLQSFQKRVSEKLTASANPEAY 134
Query: 139 -------------------QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDI 179
++ + V SG + G+GEYF+ + VG+PPR ++ID+GSD+
Sbjct: 135 LEMTNSSSTKSPPSPSSSWEEVDSTVESGAELGAGEYFMDVFVGNPPRHFLLIIDTGSDL 194
Query: 180 VWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH-------AGRCRYE 232
W+QC+PC C+ QS PVFDP+ S SF + C++A CD + + C C+Y
Sbjct: 195 TWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAACDLVVHDECRDNSSKTSPKTCKYF 254
Query: 233 VSYGDGSYTKGTLALETLTIGRT------VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSM 286
YGD S T G LALE+L++ + ++++ IGCGH N+G+F GA GLLGLG G++
Sbjct: 255 YWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGAL 314
Query: 287 SLVGQL-GGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA-----------WVPLVR- 333
S QL G +FSYCLV R S S A+ GA + P VR
Sbjct: 315 SFPSQLRSSPIGQSFSYCLVDRTNNLSVS-----SAISFGAGFALSRHFDQMRFTPFVRT 369
Query: 334 NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRD 393
N +FYY+G+ G+ + +PI + F + G G ++D+GT +T L AY A
Sbjct: 370 NNSVETFYYLGIQGIKIDQELLPIPAERFAIAPNGSGGTIIDSGTTLTYLNRDAYRAVES 429
Query: 394 AFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI-PVDDA 452
AF+A+ + PRA I CYN +G +V PT+S F G L LP N+ I P
Sbjct: 430 AFLARI-SYPRADPFDILGICYNATGRTAVPFPTLSIVFQNGAELDLPQENYFIQPDPQE 488
Query: 453 GTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C A P+ G+SIIGN QQ+ I +D + +GF C
Sbjct: 489 AKHCLAILPT-DGMSIIGNFQQQNIHFLYDVQHARLGFANTDC 530
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 155/460 (33%), Positives = 235/460 (51%), Gaps = 41/460 (8%)
Query: 58 NELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHS-----FHARM 112
++ HNNI S S + + R +TT M HR S +M
Sbjct: 32 KKILSVHNNIWSPKKSYEAST---SCFSRSLGKGRESTTLEMK-HRELCSGKTIDLGKKM 87
Query: 113 QR----DVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRS 168
+R D RV +L ++ + + V + + SG+ S Y V + +G ++
Sbjct: 88 RRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGG--KN 145
Query: 169 QYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR 228
+++D+GSD+ WVQCQPC CY Q P++DP+ S+S+ V C+S+ C L A ++G
Sbjct: 146 MSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGP 205
Query: 229 C-----------RYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAG 277
C Y VSYGDGSYT+G LA E++ +G T ++N GCG N+G+F G++G
Sbjct: 206 CGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLENFVFGCGRNNKGLFGGSSG 265
Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL----PVGAAWVPLVR 333
L+GLG S+SLV Q G FSYCL S G+SGSL FG ++ ++ PLV+
Sbjct: 266 LMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQ 325
Query: 334 NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRD 393
NP+ SFY + L+G +GG+ + S R G+++D+GT +TRLP Y+A +
Sbjct: 326 NPQLRSFYILNLTGASIGGVELK-SSSFGR-------GILIDSGTVITRLPPSIYKAVKI 377
Query: 394 AFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN-FLIPVDDA 452
F+ Q P A G SI DTC+NL+ + + +P + F G L + + F DA
Sbjct: 378 EFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDA 437
Query: 453 GTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
C A A + + IIGN QQ+ ++ +D +G
Sbjct: 438 SLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGI 477
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 235 bits (600), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 151/438 (34%), Positives = 223/438 (50%), Gaps = 42/438 (9%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGG----ADAAKH 136
LEL H SS + HA + D RV++L RR+ G +DAA
Sbjct: 43 LELRHHASFSSGGKS--------RAEEAHAVLASDAARVSSLQRRIGSYGLIRSSDAASA 94
Query: 137 EVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP 196
+ V SG + Y +G+G + +++D+ S++ WVQC+PC C+ Q +P
Sbjct: 95 S-KLAQVPVTSGARLRTLNYVATVGIGGGEAT--VIVDTASELTWVQCEPCDACHDQQEP 151
Query: 197 VFDPADSASFSGVSCSSAVCDRLENAGCHAGR--------CRYEVSYGDGSYTKGTLALE 248
+FDP+ S S++ V C+S+ CD L A +G+ C Y +SY DGSY++G LA +
Sbjct: 152 LFDPSSSPSYAAVPCNSSSCDALRVATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAHD 211
Query: 249 TLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG 308
L++ ++ GCG NQG F G +GL+GLG +SL+ Q Q GG FSYCL +
Sbjct: 212 RLSLAGEDIQGFVFGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPPKE 271
Query: 309 TGSSGSLVFG------REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
+GSSGSLV G R + P+ + +V +P FY L+G+ VGG ED+
Sbjct: 272 SGSSGSLVLGDDASVYRNSTPI--VYTAMVSDPLQGPFYLANLTGITVGG------EDVQ 323
Query: 363 R--LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGF 420
+ G ++D+GT +T L Y A R FV+Q P+A+ SI DTC++L+G
Sbjct: 324 SPGFSAGGGGKAIVDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAAPFSILDTCFDLTGL 383
Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVD-DAGTFCFAFA--PSPSGLSIIGNIQQEGI 477
V+VP++ F GG + + + L V DA C A A S IIGN QQ+ +
Sbjct: 384 REVQVPSLKLVFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEYDTPIIGNYQQKNL 443
Query: 478 QISFDGANGFVGFGPNVC 495
++ FD +GF C
Sbjct: 444 RVIFDTVGSQIGFAQETC 461
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 235 bits (600), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 152/401 (37%), Positives = 214/401 (53%), Gaps = 21/401 (5%)
Query: 103 RHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGV 162
R F + Q V + + ++SG G E SG+ G+G Y V +G+
Sbjct: 86 RSHVEFLLQDQLRVDSIQARLSKISGHGI----FEEMVTKLPAQSGIAIGTGNYVVTVGL 141
Query: 163 GSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
G+P +V D+GS I W QCQPC CY Q + FDP S S++ VSCSSA C+ L
Sbjct: 142 GTPKEDFTLVFDTGSGITWTQCQPCLGSCYPQKEQKFDPTKSTSYNNVSCSSASCNLLPT 201
Query: 222 A--GCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAA 276
+ GC A C Y++ YGD SY++G A ETLTI + V N GCG N G+F AA
Sbjct: 202 SERGCSASNSTCLYQIIYGDQSYSQGFFATETLTISSSDVFTNFLFGCGQSNNGLFGQAA 261
Query: 277 GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPR 336
GLLGL S+SL Q + FSYCL S + S+G L FG + + A + P+ +P
Sbjct: 262 GLLGLSSSSVSLPSQTAEKYQKQFSYCLPSTPS-STGYLNFGGK-VSQTAGFTPI--SPA 317
Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV 396
SFY + + G+ V G ++PI +F + G ++D+GT +TRLP AY+A ++AF
Sbjct: 318 FSSFYGIDIVGISVAGSQLPIDPSIFTTS-----GAIIDSGTVITRLPPTAYKALKEAFD 372
Query: 397 AQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFC 456
+ N P+ +G + DTCY+ S + +V P VS F GG + + AS L V+ C
Sbjct: 373 EKMSNYPKTNGDELLDTCYDFSNYTTVSFPKVSVSFKGGVEVDIDASGILYLVNGVKMVC 432
Query: 457 FAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
AFA + S I GN QQ+ ++ +DGA G +GF C
Sbjct: 433 LAFAANKDDSEFGIFGNHQQKTYEVVYDGAKGMIGFAAGAC 473
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 235 bits (600), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 143/417 (34%), Positives = 212/417 (50%), Gaps = 36/417 (8%)
Query: 110 ARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGT--------DVVSGMDQGSGEYFVRIG 161
+R+++D +R ++ + A + +GT + SG+ GSGEYF+ +
Sbjct: 41 SRLKKDKERPEKQIKTVVATAASP-----ESYGTGLSGQLMATLESGVTLGSGEYFMDVF 95
Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
+G+PP+ +++D+GSD+ W+QC PC C++Q+ P +DP +S+SF + C C + +
Sbjct: 96 IGTPPKHYSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKESSSFRNIGCHDPRCHLVSS 155
Query: 222 AG----CHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTV---------VKNVAIGCGH 266
C A C Y YGD S T G A ET T+ T V+NV GCGH
Sbjct: 156 PDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKRVENVMFGCGH 215
Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG--TGSSGSLVFGREALPV 324
N+G+F GA+GLLGLG G +S QL G +FSYCLV R T S L+FG + +
Sbjct: 216 WNRGLFHGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLL 275
Query: 325 GAA---WVPLVRNPRAP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
+ LV P +FYYV + + VGG + I E + +T G G ++D+GT
Sbjct: 276 NHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNIPESTWNMTSDGVGGTIVDSGTT 335
Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
++ PAY+ +DAFV + P I D CYN+SG + +P F+ G V
Sbjct: 336 LSYFTEPAYQIIKDAFVKKVKGYPIVQDFPILDPCYNVSGVEKIDLPDFGILFADGAVWN 395
Query: 440 LPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
P N+ I +D C A +P S LSIIGN QQ+ + +D +G+ P C
Sbjct: 396 FPVENYFIRLDPEEVVCLAILGTPRSALSIIGNYQQQNFHVLYDTKKSRLGYAPMNC 452
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 235 bits (599), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 151/408 (37%), Positives = 209/408 (51%), Gaps = 34/408 (8%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
++ D RV ++ R ++ A QD G+ G+G Y V +G+G+P R +
Sbjct: 45 LEHDQARVDSIHRMIANETAVVG----QDVSLPAERGISVGTGNYVVSVGLGTPARDLTV 100
Query: 172 VIDSGSDIVWVQCQPCSQ--CYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG-- 227
V D+GSD+ WVQC PCS CY Q DP+F P+ S++FS V C C R + C +
Sbjct: 101 VFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSSSTFSAVRCGEPECPRARQS-CSSSPG 159
Query: 228 --RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA-----------IGCGHKNQGMFVG 274
RC YEV YGD S T G L +TLT+G T N + GCG N G+F
Sbjct: 160 DDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNASENNSNKLPGFVFGCGENNTGLFGK 219
Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-LPVGAAWVPLVR 333
A GL GLG G +SL Q G+ G FSYCL S + + G L G A P A + P++
Sbjct: 220 ADGLFGLGRGKVSLSSQAAGKYGEGFSYCLPSSSSNAHGYLSLGTPAPAPAHARFTPMLN 279
Query: 334 NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRD 393
PSFYYV L G+ V G I +S + G+++D+GT +TRL AY A R
Sbjct: 280 RSNTPSFYYVKLVGIRVAGRAIKVSSR----PALWPAGLIVDSGTVITRLAPRAYSALRT 335
Query: 394 AFVAQTGN--LPRASGVSIFDTCYNLSGF--VSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
AF++ G RA +SI DTCY+ + +V +P V+ F+GG +++ S L V
Sbjct: 336 AFLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPAVALVFAGGATISVDFSGVLY-V 394
Query: 450 DDAGTFCFAFAPSPSGLS--IIGNIQQEGIQISFDGANGFVGFGPNVC 495
C AFAP+ +G S I+GN QQ + + +D +GF C
Sbjct: 395 AKVAQACLAFAPNGNGRSAGILGNTQQRTVAVVYDVGRQKIGFAAKGC 442
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 234 bits (598), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 150/379 (39%), Positives = 201/379 (53%), Gaps = 36/379 (9%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
V SG+ GSGEY V + VG+PPR M++D+GSD+ W+QC PC C++Q PVFDPA S
Sbjct: 141 VESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPATSL 200
Query: 205 SFSGVSCSSAVCDRLENA----GC---HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV- 256
S+ V+C C + C H+ C Y YGD S T G LALE T+ T
Sbjct: 201 SYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAP 260
Query: 257 -----VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
V +V GCGH N+G+F GAAGLLGLG G++S QL G AFSYCLV G+
Sbjct: 261 GASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCLVDHGSSV 320
Query: 312 SGSLVFGREALPVGAAWVPLVRNPR-------------APSFYYVGLSGLGVGGMRIPIS 358
+VFG + +G +PR A +FYYV L G+ VGG ++ IS
Sbjct: 321 GSKIVFGDDDALLG--------HPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNIS 372
Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN-LPRASGVSIFDTCYNL 417
+ + + G G ++D+GT ++ PAYE R AFV + P + + CYN+
Sbjct: 373 PSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNV 432
Query: 418 SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEG 476
SG V VP S F+ G V PA N+ + +D G C A +P S +SIIGN QQ+
Sbjct: 433 SGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQN 492
Query: 477 IQISFDGANGFVGFGPNVC 495
+ +D N +GF P C
Sbjct: 493 FHVLYDLQNNRLGFAPRRC 511
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 234 bits (598), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 150/379 (39%), Positives = 201/379 (53%), Gaps = 36/379 (9%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
V SG+ GSGEY V + VG+PPR M++D+GSD+ W+QC PC C++Q PVFDPA S
Sbjct: 141 VESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASL 200
Query: 205 SFSGVSCSSAVCDRLENA----GC---HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV- 256
S+ V+C C + C H+ C Y YGD S T G LALE T+ T
Sbjct: 201 SYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAP 260
Query: 257 -----VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
V +V GCGH N+G+F GAAGLLGLG G++S QL G AFSYCLV G+
Sbjct: 261 GASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCLVDHGSSV 320
Query: 312 SGSLVFGREALPVGAAWVPLVRNPR-------------APSFYYVGLSGLGVGGMRIPIS 358
+VFG + +G +PR A +FYYV L G+ VGG ++ IS
Sbjct: 321 GSKIVFGDDDALLG--------HPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNIS 372
Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN-LPRASGVSIFDTCYNL 417
+ + + G G ++D+GT ++ PAYE R AFV + P + + CYN+
Sbjct: 373 PSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNV 432
Query: 418 SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEG 476
SG V VP S F+ G V PA N+ + +D G C A +P S +SIIGN QQ+
Sbjct: 433 SGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQN 492
Query: 477 IQISFDGANGFVGFGPNVC 495
+ +D N +GF P C
Sbjct: 493 FHVLYDLQNNRLGFAPRRC 511
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 234 bits (597), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 136/349 (38%), Positives = 184/349 (52%), Gaps = 14/349 (4%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSCSS 213
E+ V +G G+P ++ ++ D+GSD+ W+QC PCS CYKQ DP+FDP SA++S V C
Sbjct: 134 EFVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSVVPCGH 193
Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMF 272
C + + C G C Y+V YGDGS + G L+ ETL++ T + A GCG N G F
Sbjct: 194 PQCAAADGSKCSNGTCLYKVEYGDGSSSAGVLSHETLSLTSTRALPGFAFGCGQTNLGDF 253
Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG--REALPVGAAWVP 330
GL+GLG G +SL Q GG FSYCL S T + G L G A +
Sbjct: 254 GDVDGLIGLGRGQLSLSSQAAASFGGTFSYCLPSDNT-THGYLTIGPTTPASNDDVQYTA 312
Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
+V+ PSFY+V L + +GG +P+ LF DDG +D+GT +T LP AY A
Sbjct: 313 MVQKQDYPSFYFVELVSIDIGGYILPVPPTLFT-----DDGTFLDSGTILTYLPPEAYTA 367
Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
RD F A FDTCY+ +G ++ +P VSF FS G V L LI D
Sbjct: 368 LRDRFKFTMTQYKPAPAYDPFDTCYDFTGQSAIFIPAVSFKFSDGSVFDLSFFGILIFPD 427
Query: 451 DAGTF--CFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
D C F PS + +I+GN+QQ ++ +D A +GF C
Sbjct: 428 DTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASASC 476
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 234 bits (596), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 138/373 (36%), Positives = 199/373 (53%), Gaps = 25/373 (6%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASF 206
SG+ GSGEYF+ + VG+PP+ +++D+GSD+ W+QC PC C++Q+ P +DP DS+SF
Sbjct: 186 SGVSLGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQNGPYYDPKDSSSF 245
Query: 207 SGVSCSSAVC------DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT----- 255
++C C D + C Y YGD S T G ALET T+ T
Sbjct: 246 KNITCHDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGK 305
Query: 256 ----VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
+V+NV GCGH N+G+F GAAGLLGLG G +S QL G +FSYCLV R + S
Sbjct: 306 PELKIVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFATQLQSLYGHSFSYCLVDRNSNS 365
Query: 312 SGS--LVFGREALPVG------AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
S S L+FG + + ++V NP +FYYV + + VGG + I E+ +
Sbjct: 366 SVSSKLIFGEDKELLSHPNLNFTSFVGGKENP-VDTFYYVLIKSIMVGGEVLKIPEETWH 424
Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
L+ G G ++D+GT +T PAYE ++AF+ + P CYN+SG +
Sbjct: 425 LSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETFPPLKPCYNVSGVEKM 484
Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFD 482
+P + F+ G + P N+ I ++ C A +P S LSIIGN QQ+ I +D
Sbjct: 485 ELPEFAILFADGAMWDFPVENYFIQIEPEDVVCLAILGTPRSALSIIGNYQQQNFHILYD 544
Query: 483 GANGFVGFGPNVC 495
+G+ P C
Sbjct: 545 LKKSRLGYAPMKC 557
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 234 bits (596), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 164/433 (37%), Positives = 228/433 (52%), Gaps = 40/433 (9%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
L L HR + S ++ S ++ D +R ++RR+SG +
Sbjct: 68 LRLTHRHGPCAPSRASS-----LAAPSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAA 122
Query: 141 FGTDVVS--GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS---QCYKQSD 195
V + G D G+ Y V +G+P +Q M +D+GSD+ WVQC+PCS CY Q D
Sbjct: 123 AAATVPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKD 182
Query: 196 PVFDPADSASFSGVSCSSAVCDRL---ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI 252
P+FDPA S+S++ V C VC L + C A +C Y VSYGDGS T G + +TLT+
Sbjct: 183 PLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL 242
Query: 253 -GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
+ V+ GCGH G+F G GLLGLG SLV Q G GG FSYCL ++ + +
Sbjct: 243 SASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPS-T 301
Query: 312 SGSLVFGREALPVGAA----WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
+G L G P GAA L+ +P AP++Y V L+G+ VGG ++ + F
Sbjct: 302 AGYLTLGLGG-PSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFA---- 356
Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN--LPRASGVSIFDTCYNLSGFVSVRV 425
G V+DTGT +TRLP AY A R AF + + P A I DTCYN +G+ +V +
Sbjct: 357 --GGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTL 414
Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTF-CFAFAPSPS--GLSIIGNIQQEGIQISFD 482
P V+ F G + L A L +F C AFAPS S G++I+GN+QQ ++ D
Sbjct: 415 PNVALTFGSGATVMLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID 467
Query: 483 GANGFVGFGPNVC 495
G + VGF P+ C
Sbjct: 468 GTS--VGFKPSSC 478
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 234 bits (596), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 145/393 (36%), Positives = 215/393 (54%), Gaps = 22/393 (5%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
M+R ++R + +L A H+++D T V D GSGEY +++ +G+P S
Sbjct: 1 MKRAIQRSQERLEKLQITSA-VNTHQMKDIETPVTP--DIGSGEYLIQMAIGTPALSLSA 57
Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA-GRCR 230
++D+GSD+VW +C PC+ C + ++DP+ S+++S V C S++C C+ G C
Sbjct: 58 IMDTGSDLVWTKCNPCTDC--STSSIYDPSSSSTYSKVLCQSSLCQPPSIFSCNNDGDCE 115
Query: 231 YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVG 290
Y YGD S T G L+ ET +I + N+ GCGH NQG F GL+G G GS+SLV
Sbjct: 116 YVYPYGDRSSTSGILSDETFSISSQSLPNITFGCGHDNQG-FDKVGGLVGFGRGSLSLVS 174
Query: 291 QLGGQTGGAFSYCLVSRGTGSSGSLVF-----GREALPVGAAWVPLVRNPRAPSFYYVGL 345
QLG G FSYCLVSR S S +F EA VG+ PLV++ + + YY+ L
Sbjct: 175 QLGPSMGNKFSYCLVSRTDSSKTSPLFIGNTASLEATTVGS--TPLVQS-SSTNHYYLSL 231
Query: 346 SGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA 405
G+ VGG + I F + G G+++D+GT +T L AY+A ++A V+ NLP+A
Sbjct: 232 EGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTFLQQTAYDAVKEAMVSSI-NLPQA 290
Query: 406 SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG 465
G D C+N G + P+++F+F G +P N+L P + C A P+ S
Sbjct: 291 DGQ--LDLCFNQQGSSNPGFPSMTFHFKGAD-YDVPKENYLFPDSTSDIVCLAMMPTNSN 347
Query: 466 L---SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L +I GN+QQ+ QI +D N + F P C
Sbjct: 348 LGNMAIFGNVQQQNYQILYDNENNVLSFAPTAC 380
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 142/402 (35%), Positives = 217/402 (53%), Gaps = 32/402 (7%)
Query: 111 RMQR----DVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPP 166
+M+R D RV +L ++ + + V + + SG+ S Y V + +G
Sbjct: 38 KMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGG-- 95
Query: 167 RSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA 226
++ +++D+GSD+ WVQCQPC CY Q P++DP+ S+S+ V C+S+ C L A ++
Sbjct: 96 KNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNS 155
Query: 227 GRC-----------RYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGA 275
G C Y VSYGDGSYT+G LA E++ +G T ++N GCG N+G+F G+
Sbjct: 156 GPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLENFVFGCGRNNKGLFGGS 215
Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL----PVGAAWVPL 331
+GL+GLG S+SLV Q G FSYCL S G+SGSL FG ++ ++ PL
Sbjct: 216 SGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPL 275
Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
V+NP+ SFY + L+G +GG+ + S R G+++D+GT +TRLP Y+A
Sbjct: 276 VQNPQLRSFYILNLTGASIGGVELK-SSSFGR-------GILIDSGTVITRLPPSIYKAV 327
Query: 392 RDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN-FLIPVD 450
+ F+ Q P A G SI DTC+NL+ + + +P + F G L + + F
Sbjct: 328 KIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKP 387
Query: 451 DAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
DA C A A + + IIGN QQ+ ++ +D +G
Sbjct: 388 DASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGI 429
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 141/398 (35%), Positives = 211/398 (53%), Gaps = 21/398 (5%)
Query: 112 MQRDVKRVATLVRRLSGG-GADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
M D +RV + RLS G + ++ SG GS Y V +G+G+P R
Sbjct: 1 MNLDNERVKYIQSRLSKNLGRENTVKDLDSTTLPAESGSLIGSANYVVVVGLGTPKRDLS 60
Query: 171 MVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA--- 226
+V D+GSD+ W QC+PC+ CYKQ D +FDP+ S+S++ ++C+S++C +L + G +
Sbjct: 61 LVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYTNITCTSSLCTQLTSDGIKSECS 120
Query: 227 ----GRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGL 281
C Y+ YGD S + G L+ E LTI T +V + GCG N+G+F G+AGL+GL
Sbjct: 121 SSTDASCIYDAKYGDNSTSVGFLSQERLTITATDIVDDFLFGCGQDNEGLFNGSAGLMGL 180
Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA-AWVPLVRNPRAPSF 340
G +S+V Q FSYCL + + S G L FG A + + PL SF
Sbjct: 181 GRHPISIVQQTSSNYNKIFSYCLPATSS-SLGHLTFGASAATNASLIYTPLSTISGDNSF 239
Query: 341 YYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQT 399
Y + + + VGG ++P +S F G ++D+GT +TRL Y A R AF
Sbjct: 240 YGLDIVSISVGGTKLPAVSSSTFSA-----GGSIIDSGTVITRLAPTVYAALRSAFRRXM 294
Query: 400 GNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF 459
P A+ + DTCY+LSG+ + VP + F FSGG + L L V+ C AF
Sbjct: 295 EKYPVANEAGLLDTCYDLSGYKEISVPRIDFEFSGGVTVELXHRGILX-VESEQQVCLAF 353
Query: 460 AP--SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
A S + +++ GN+QQ+ +++ +D G +GFG C
Sbjct: 354 AANGSDNDITVFGNVQQKTLEVVYDVKGGRIGFGAAGC 391
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 137/402 (34%), Positives = 213/402 (52%), Gaps = 23/402 (5%)
Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
++QR + R + RL G A A D ++ + GSGE+ + + +G+P
Sbjct: 63 KIQRGINRGFHRLNRL-GAVAVLAVASKPDDTNNIKAPTHGGSGEFLMELSIGNPAVKYS 121
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR-- 228
++D+GSD++W QC+PC++C+ Q P+FDP S+S+S V CSS +C+ L + C+ +
Sbjct: 122 AIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDA 181
Query: 229 CRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSM 286
C Y +YGD S T+G LA ET T + + GCG +N+G F +GL+GLG G +
Sbjct: 182 CEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPL 241
Query: 287 SLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPV----GAAW-------VPLVRN 334
SL+ QL FSYCL S + +S SL G A + GA+ + L+RN
Sbjct: 242 SLISQLKET---KFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRN 298
Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
P PSFYY+ L G+ VG R+ + + F L + G G+++D+GT +T L A++ ++
Sbjct: 299 PDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEE 358
Query: 395 FVAQTGNLPRASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
F ++ SG + D C+ L ++ VP + F+F G L LP N+++ G
Sbjct: 359 FTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFKGAD-LELPGENYMVADSSTG 417
Query: 454 TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C A S +G+SI GN+QQ+ + D V F P C
Sbjct: 418 VLCLAMG-SSNGMSIFGNVQQQNFNVLHDLEKETVSFVPTEC 458
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 139/401 (34%), Positives = 221/401 (55%), Gaps = 24/401 (5%)
Query: 110 ARMQRDVKRVATLVRRLSGGGADAAKH------EVQDFGTDVVSGMDQGSGEYFVRIGVG 163
+R + VK +++ +R+ GA ++H E + G+ GSG Y++++G+G
Sbjct: 68 SRDEEHVKFLSSRLRKKDVQGASFSRHKSGHLLEPNSANIPLNPGLSIGSGNYYLKLGLG 127
Query: 164 SPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA 222
SPP+ M++D+GS + W+QC+PC C+ Q DP+F+P+ S ++ + CSS+ C L+ A
Sbjct: 128 SPPKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTYRPLYCSSSECSLLKAA 187
Query: 223 G-----CHA-GRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGA 275
C A G C Y SYGD SY+ G L+ + LT+ + + + GCG N+G+F A
Sbjct: 188 TLNDPLCTASGVCVYTASYGDASYSMGYLSRDLLTLTPSQTLPSFTYGCGQDNEGLFGKA 247
Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNP 335
AG++GL +S++ QL + G AFSYCL + + G L G+ + P + P++RN
Sbjct: 248 AGIVGLARDKLSMLAQLSPKYGYAFSYCLPTSTSSGGGFLSIGKIS-PSSYKFTPMIRNS 306
Query: 336 RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF 395
+ PS Y++ L+ + V G + ++ +++ ++D+GT VTRLP Y A R+AF
Sbjct: 307 QNPSLYFLRLAAITVAGRPVGVAAAGYQVP------TIIDSGTVVTRLPISIYAALREAF 360
Query: 396 VA-QTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
V + +A SI DTC+ S P + F GG L+L A N LI D G
Sbjct: 361 VKIMSRRYEQAPAYSILDTCFKGSLKSMSGAPEIRMIFQGGADLSLRAPNILIEADK-GI 419
Query: 455 FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C AFA S + ++IIGN QQ+ I++D + +GF P C
Sbjct: 420 ACLAFA-SSNQIAIIGNHQQQTYNIAYDVSASKIGFAPGGC 459
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 164/458 (35%), Positives = 236/458 (51%), Gaps = 40/458 (8%)
Query: 61 FERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVA 120
FE S+S+ +SD R ++ LVHR + S + + S R++RD R
Sbjct: 25 FEPEAACSTSSANSDPNRASVPLVHRHGPCAPSAASGG------KPSLAERLRRDRARAN 78
Query: 121 TLVRRLSGGGADAA--KHEVQDFGTDVVS--GMDQGSGEYFVRIGVGSPPRSQYMVIDSG 176
+V + +GG A V GT + + G S EY V +G+G+P Q ++ID+G
Sbjct: 79 YIVTKAAGGRTAATAVSDAVGGGGTSIPTFLGDSVDSLEYVVTLGIGTPAVQQIVLIDTG 138
Query: 177 SDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA----GCHAGR-- 228
SD+ WVQC+PC +CY Q DP+FDP+ S+S++ V C S C +L GC +G
Sbjct: 139 SDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTSGAAA 198
Query: 229 -CRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSM 286
C Y + YG+ + T G + ETLT+ VV + GCG G + GLLGLGG
Sbjct: 199 LCEYGIEYGNRATTTGVYSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPE 258
Query: 287 SLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE------ALPVGAAWVPLVRNPRAPSF 340
SLV Q Q GG FSYCL +G +G L G G + P+ R P P+F
Sbjct: 259 SLVSQTSSQFGGPFSYCLPPT-SGGAGFLALGAPNSSSSSTAAAGFLFTPMRRIPSVPTF 317
Query: 341 YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF---VA 397
Y V L+G+ VGG + + F G+V+D+GT +T LP AY A R AF ++
Sbjct: 318 YVVTLTGISVGGAPLAVPPSAF------SSGMVIDSGTVITGLPATAYAALRSAFRSAMS 371
Query: 398 QTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCF 457
+ LP ++G ++ DTCY+ +G +V VPT++ FSGG + L A+ + VD G F
Sbjct: 372 EYRLLPPSNG-AVLDTCYDFTGHTNVTVPTIALTFSGGATIDL-ATPAGVLVD--GCLAF 427
Query: 458 AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
A A + + IIGN+ Q ++ +D G VGF C
Sbjct: 428 AGAGTDDTIGIIGNVNQRTFEVLYDSGKGTVGFRAGAC 465
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 232 bits (592), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 155/420 (36%), Positives = 213/420 (50%), Gaps = 24/420 (5%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+ EL++R+ SS + F A ++R +R A L + + G
Sbjct: 28 FRAELIYREHQSSPLRSET---LKTPSEIFIAAVKRGHERRARLAKHVLAGD-------- 76
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
Q F T V SG +GEY + I G+PP+ ++D+GSD+ WVQC PC CY+ F
Sbjct: 77 QLFETPVASG----NGEYLIDISYGNPPQKSTAIVDTGSDLNWVQCLPCKSCYETLSAKF 132
Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVK 258
DP+ SAS+ + C S C L C A C+Y+ YGDGS T G L+ + +TIG +
Sbjct: 133 DPSKSASYKTLGCGSNFCQDLPFQSC-AASCQYDYMYGDGSSTSGALSTDDVTIGTGKIP 191
Query: 259 NVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG 318
NVA GCG+ N G F GA GL+GLG G +SLV QLGG FSYCLV G+ + L G
Sbjct: 192 NVAFGCGNSNLGTFAGAGGLVGLGKGPLSLVSQLGGTATKKFSYCLVPLGSTKTSPLYIG 251
Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
L G A+ P++ N P+FYY L G+ V G + + F + G G+++D+GT
Sbjct: 252 DSTLAGGVAYTPMLTNNNYPTFYYAELQGISVEGKAVNYPANTFDIAATGRGGLILDSGT 311
Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF---DTCYNLSGFVSVRVPTVSFYFSGG 435
+T L +AF A LP F + C++ +G + PTV F+F+G
Sbjct: 312 TLTYLDV---DAFNPMVAALKAALPYPEADGSFYGLEYCFSTAGVANPTYPTVVFHFNGA 368
Query: 436 PVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
V P + F I +D GT C A A S +G SI GNIQQ I D N +GF C
Sbjct: 369 DVALAPDNTF-IALDFEGTTCLAMA-SSTGFSIFGNIQQLNHVIVHDLVNKRIGFKSANC 426
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 232 bits (591), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 136/352 (38%), Positives = 196/352 (55%), Gaps = 16/352 (4%)
Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSASFSGVS 210
G+G Y V IG+G+P +V D+GSD WVQC+PC CY+Q + +FDPA S++ + +S
Sbjct: 182 GTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDANIS 241
Query: 211 CSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQ 269
C++ C L GC G C Y V YGDGSY+ G A++TLT+ +K GCG +N+
Sbjct: 242 CAAPACSDLYTKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAIKGFRFGCGERNE 301
Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV 329
G+F AAGLLGLG G SL Q + GG F++C +R +G +G L FG + P + +
Sbjct: 302 GLFGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSG-TGYLDFGPGSSPAVSTKL 360
Query: 330 --PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPA 387
P++ + +FYYVGL+G+ VGG + I +F G ++D+GT +TRLP A
Sbjct: 361 TTPMLVD-NGLTFYYVGLTGIRVGGKLLSIPPSVFTTA-----GTIVDSGTVITRLPPAA 414
Query: 388 YEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNF 445
Y + R AF + +A +S+ DTCY+ +G V +PTVS F GG L + AS
Sbjct: 415 YSSLRSAFASAIAARGYKKAPALSLLDTCYDFTGMSQVAIPTVSLLFQGGASLDVDASG- 473
Query: 446 LIPVDDAGTFCFAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+I C FA + + I+GN Q + + +D VGF P C
Sbjct: 474 IIYAASVSQACLGFAANEEDDDVGIVGNTQLKTFGVVYDIGKKVVGFSPGAC 525
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 232 bits (591), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 146/375 (38%), Positives = 204/375 (54%), Gaps = 24/375 (6%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
V SG+ GS EY + + VG+PPR M++D+GSD+ W+QC PC C++Q PVFDPA S+
Sbjct: 135 VESGVAVGSAEYLMDVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASS 194
Query: 205 SFSGVSCSSAVCDRL------ENAGCH---AGRCRYEVSYGDGSYTKGTLALETLTIGRT 255
S+ ++C C + C C Y YGD S + G LALE+ T+ T
Sbjct: 195 SYRNLTCGDPRCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVNLT 254
Query: 256 V------VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGG-AFSYCLVSRG 308
V V GCGH+N+G+F GAAGLLGLG G +S QL GG FSYCLV G
Sbjct: 255 APGASSRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGGHTFSYCLVDHG 314
Query: 309 TGSSGSLVFGREALPVGAAWVPLVRNPRAP------SFYYVGLSGLGVGGMRIPISEDLF 362
+ + +VFG + AA L AP +FYYV L+G+ VGG + IS D +
Sbjct: 315 SDVASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRLTGVLVGGELLNISSDTW 374
Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFV 421
++ G G ++D+GT ++ PAY+ R AF+ + +G+ P + CYN+SG
Sbjct: 375 DASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPVPDFPVLSPCYNVSGVE 434
Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQIS 480
VP +S F+ G V PA N+ I +D G C A +P +G+SIIGN QQ+ ++
Sbjct: 435 RPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSIIGNFQQQNFHVA 494
Query: 481 FDGANGFVGFGPNVC 495
+D N +GF P C
Sbjct: 495 YDLHNNRLGFAPRRC 509
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 232 bits (591), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 135/361 (37%), Positives = 193/361 (53%), Gaps = 27/361 (7%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
GEY + +G+GSPPR +ID+GSD++W QC PC C +Q P F+PA S S++ + CSS
Sbjct: 86 GEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSS 145
Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG----RTVVKNVAIGCGHKNQ 269
A+C+ L + C C Y+ YGD + + G LA ET T G R V V+ GCG+ N
Sbjct: 146 AMCNALYSPLCFQNACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSFGCGNMNA 205
Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL------- 322
G +G++G G G++SLV QLG FSYCL S + ++ L FG A
Sbjct: 206 GTLFNGSGMVGFGRGALSLVSQLGSPR---FSYCLTSFMSPATSRLYFGAYATLNSTNTS 262
Query: 323 ---PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM-GDDGVVMDTGT 378
PV + P + NP P+ Y++ ++G+ V G +PI +F + + G GV++D+GT
Sbjct: 263 SSGPVQS--TPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIIDSGT 320
Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGV--SIFDTCYNLSGFVS--VRVPTVSFYFSG 434
VT L PAY + AFVA G LPRA+ FDTC+ V +P + +F G
Sbjct: 321 TVTFLAQPAYAMVQGAFVAWVG-LPRANATPSDTFDTCFKWPPPPRRMVTLPEMVLHFDG 379
Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNV 494
+ LP N+++ G C A PS G SIIG+ Q + + +D N + F P
Sbjct: 380 AD-MELPLENYMVMDGGTGNLCLAMLPSDDG-SIIGSFQHQNFHMLYDLENSLLSFVPAP 437
Query: 495 C 495
C
Sbjct: 438 C 438
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 232 bits (591), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 148/467 (31%), Positives = 242/467 (51%), Gaps = 54/467 (11%)
Query: 60 LFERHNNISSSNTSS----DEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRD 115
F H I+SS S + L+L + S N+T+ + F +D
Sbjct: 8 FFSAHLAIASSLKDSGLKHKQPDMQLKLYPMTSLKSPPNSTSLL--------FAYMFAKD 59
Query: 116 VKRVATLVRRLSGGG-ADAAKHEV--QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMV 172
+R+ RL+ A+A+ +V + G + SG+ GSG Y+V++G+GSP + M+
Sbjct: 60 EERIRYFHSRLAKNSDANASFKKVGPKLAGIPLKSGLSMGSGNYYVKMGLGSPTKYYTMI 119
Query: 173 IDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSC-------------SSAVCDR 218
+D+GS W+QCQPC+ C+ Q DPVF+P+ S ++ V C + C +
Sbjct: 120 VDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSSQCSSLKSATLNEPTCSK 179
Query: 219 LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAG 277
NA C Y+ SYGD S++ G L+ + LT+ + + + GCG NQG+F G
Sbjct: 180 QSNA------CVYKASYGDSSFSLGYLSQDVLTLTPSQTLSSFVYGCGQDNQGLFGRTDG 233
Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----GTGSSGSLVFGREALPVGAAW--VPL 331
++GL +S++ QL G+ G AFSYCL + + G L G +L +++ PL
Sbjct: 234 IIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSIGTSSLTPSSSYKFTPL 293
Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
++NP PS Y++ L + V G + ++ +++ ++D+GT +TRLPTP Y
Sbjct: 294 LKNPNNPSLYFIDLESITVAGRPLGVAASSYKVP------TIIDSGTVITRLPTPVYTTL 347
Query: 392 RDAFVA-QTGNLPRASGVSIFDTCY--NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIP 448
++A+V + +A G+S+ DTC+ +L+G +S P + F GG L L N L+
Sbjct: 348 KNAYVTILSKKYQQAPGISLLDTCFKGSLAG-ISEVAPDIRIIFKGGADLQLKGHNSLVE 406
Query: 449 VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ + G C A A S S ++IIGN QQ+ +++++D N VGF P C
Sbjct: 407 L-ETGITCLAMAGS-SSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 231 bits (590), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 135/361 (37%), Positives = 193/361 (53%), Gaps = 27/361 (7%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
GEY + +G+GSPPR +ID+GSD++W QC PC C +Q P F+PA S S++ + CSS
Sbjct: 83 GEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSS 142
Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG----RTVVKNVAIGCGHKNQ 269
A+C+ L + C C Y+ YGD + + G LA ET T G R V V+ GCG+ N
Sbjct: 143 AMCNALYSPLCFQNACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSFGCGNMNA 202
Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL------- 322
G +G++G G G++SLV QLG FSYCL S + ++ L FG A
Sbjct: 203 GTLFNGSGMVGFGRGALSLVSQLGSPR---FSYCLTSFMSPATSRLYFGAYATLNSTNTS 259
Query: 323 ---PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM-GDDGVVMDTGT 378
PV + P + NP P+ Y++ ++G+ V G +PI +F + + G GV++D+GT
Sbjct: 260 SSGPVQS--TPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIIDSGT 317
Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGV--SIFDTCYNLSGFVS--VRVPTVSFYFSG 434
VT L PAY + AFVA G LPRA+ FDTC+ V +P + +F G
Sbjct: 318 TVTFLAQPAYAMVQGAFVAWVG-LPRANATPSDTFDTCFKWPPPPRRMVTLPEMVLHFDG 376
Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNV 494
+ LP N+++ G C A PS G SIIG+ Q + + +D N + F P
Sbjct: 377 AD-MELPLENYMVMDGGTGNLCLAMLPSDDG-SIIGSFQHQNFHMLYDLENSLLSFVPAP 434
Query: 495 C 495
C
Sbjct: 435 C 435
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 231 bits (590), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 137/402 (34%), Positives = 214/402 (53%), Gaps = 23/402 (5%)
Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
++QR + R + RL G A A D ++ + GSGE+ + + +G+P
Sbjct: 64 KIQRGINRGFHRLNRL-GAVAVLAVASNPDDTNNIKAPTHGGSGEFLMELSIGNPAVKYA 122
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR-- 228
++D+GSD++W QC+PC++C+ Q P+FDP S+S+S V CSS +C+ L + C+ +
Sbjct: 123 AIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDS 182
Query: 229 CRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSM 286
C Y +YGD S T+G LA ET T + + GCG +N+G F +GL+GLG G +
Sbjct: 183 CEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPL 242
Query: 287 SLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPV----GA-------AWVPLVRN 334
SL+ QL FSYCL S + +S SL G A + GA + L+RN
Sbjct: 243 SLISQLKET---KFSYCLTSIEDSEASSSLFIGSLASGIVNKTGANLDGEVTKTMSLLRN 299
Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
P PSFYY+ L G+ VG R+ + + F L++ G G+++D+GT +T L A++ ++
Sbjct: 300 PDQPSFYYLELQGITVGAKRLSVEKSTFELSEDGTGGMIIDSGTTITYLEETAFKVLKEE 359
Query: 395 FVAQTGNLPRASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
F ++ SG + D C+ L + ++ VP + F+F G L LP N+++ G
Sbjct: 360 FTSRMSLPVDDSGSTGLDLCFKLPNAAKNIAVPKLIFHFKGAD-LELPGENYMVADSSTG 418
Query: 454 TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C A S +G+SI GN+QQ+ + D V F P C
Sbjct: 419 VLCLAMG-SSNGMSIFGNVQQQNFNVLHDLEKETVTFVPTEC 459
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 231 bits (590), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 145/370 (39%), Positives = 197/370 (53%), Gaps = 29/370 (7%)
Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ--CYKQSDPVFDPADSAS 205
G+ G+G Y V +G+G+P R +V D+GSD+ WVQC PCS CYKQ DP+F P+DS++
Sbjct: 146 GISVGTGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPSDSST 205
Query: 206 FSGVSCSSAVCDRLENAGCHAG--RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA-- 261
FS V C + C ++ G G RC YEV YGD S T+G L +TLT+G N +
Sbjct: 206 FSAVRCGARECRARQSCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAE 265
Query: 262 ---------IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS 312
GCG N G+F A GL GLG G +SL Q G+ G FSYCL S + +
Sbjct: 266 NDNKLPGFVFGCGENNTGLFGQADGLFGLGRGKVSLSSQAAGKFGEGFSYCLPSSSSSAP 325
Query: 313 GSLVFGREA-LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
G L G P A + P++ PSFYYV L G+ V G I +S L
Sbjct: 326 GYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALP------ 379
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGN--LPRASGVSIFDTCYNLSGF--VSVRVPT 427
+++D+GT +TRL AY A R AF++ G RA +SI DTCY+ + +V +P
Sbjct: 380 LIVDSGTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPA 439
Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS--IIGNIQQEGIQISFDGAN 485
V+ F+GG +++ S L V C AFAP+ G S I+GN QQ + + +D A
Sbjct: 440 VALVFAGGATISVDFSGVLY-VAKVAQACLAFAPNGDGRSAGILGNTQQRTLAVVYDVAR 498
Query: 486 GFVGFGPNVC 495
+GF C
Sbjct: 499 QKIGFAAKGC 508
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 231 bits (589), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 143/402 (35%), Positives = 212/402 (52%), Gaps = 36/402 (8%)
Query: 115 DVKRVATLVRRLSGGGADAAKHEVQDFGT---DVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
D RV++L RR +GGG+ A T V SG + Y +G+G + +
Sbjct: 84 DAARVSSLQRR-AGGGSWAEDEAAAAAATGRVPVTSGARLRTLNYVATVGLGGGEAT--V 140
Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLE---------NA 222
++D+ S++ WVQC PC+ C+ Q P+FDPA S S++ + C+S+ CD L+
Sbjct: 141 IVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACG 200
Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
G C Y +SY DGSY++G LA + L++ V+ GCG NQG F G +GL+GLG
Sbjct: 201 GGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEVIDGFVFGCGTSNQGPFGGTSGLMGLG 260
Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG------REALPVGAAWVPLVRNPR 336
+SL+ Q Q GG FSYCL + + SSGSLV G R + P+ + +V +P
Sbjct: 261 RSQLSLISQTMDQFGGVFSYCLPLKESESSGSLVLGDDTSVYRNSTPI--VYTTMVSDPV 318
Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV 396
FY+V L+G+ +GG + S V++D+GT +T L Y A + F+
Sbjct: 319 QGPFYFVNLTGITIGGQEVESSA----------GKVIVDSGTIITSLVPSVYNAVKAEFL 368
Query: 397 AQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV-DDAGTF 455
+Q P+A G SI DTC+NL+GF V++P++ F F G + + +S L V D+
Sbjct: 369 SQFAEYPQAPGFSILDTCFNLTGFREVQIPSLKFVFEGNVEVEVDSSGVLYFVSSDSSQV 428
Query: 456 CFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C A A S SIIGN QQ+ +++ FD +GF C
Sbjct: 429 CLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETC 470
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 231 bits (588), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 143/402 (35%), Positives = 212/402 (52%), Gaps = 36/402 (8%)
Query: 115 DVKRVATLVRRLSGGGADAAKHEVQDFGT---DVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
D RV++L RR +GGG+ A T V SG + Y +G+G + +
Sbjct: 83 DAARVSSLQRR-AGGGSWAEDEAAAAAATGRVPVTSGARLRTLNYVATVGLGGGEAT--V 139
Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLE---------NA 222
++D+ S++ WVQC PC+ C+ Q P+FDPA S S++ + C+S+ CD L+
Sbjct: 140 IVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACG 199
Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
G C Y +SY DGSY++G LA + L++ V+ GCG NQG F G +GL+GLG
Sbjct: 200 GGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEVIDGFVFGCGTSNQGPFGGTSGLMGLG 259
Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG------REALPVGAAWVPLVRNPR 336
+SL+ Q Q GG FSYCL + + SSGSLV G R + P+ + +V +P
Sbjct: 260 RSQLSLISQTMDQFGGVFSYCLPLKESESSGSLVLGDDTSVYRNSTPI--VYTTMVSDPV 317
Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV 396
FY+V L+G+ +GG + S V++D+GT +T L Y A + F+
Sbjct: 318 QGPFYFVNLTGITIGGQEVESSA----------GKVIVDSGTIITSLVPSVYNAVKAEFL 367
Query: 397 AQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV-DDAGTF 455
+Q P+A G SI DTC+NL+GF V++P++ F F G + + +S L V D+
Sbjct: 368 SQFAEYPQAPGFSILDTCFNLTGFREVQIPSLKFVFEGNVEVEVDSSGVLYFVSSDSSQV 427
Query: 456 CFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C A A S SIIGN QQ+ +++ FD +GF C
Sbjct: 428 CLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETC 469
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 231 bits (588), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 159/448 (35%), Positives = 225/448 (50%), Gaps = 49/448 (10%)
Query: 80 NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
L++VHR + + + H+H + ++RD RV ++ RRL+ AA+
Sbjct: 56 TLQIVHRACLQTGDDIAVPDHHH-----YTGILRRDRHRVRSIYRRLT-----AAETTTT 105
Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--SQCYKQSDPV 197
G+ S EY V IG+G+PPR+ ++ D+GSD+ WVQC PC S CY Q +P+
Sbjct: 106 TTTIPARLGLAFQSLEYVVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPL 165
Query: 198 FDPADSASFSGVSCSSAVCDR--LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-- 253
FDP+ S+++ V CS+ C ++ C A C Y V YGD S T G+LA ET T+
Sbjct: 166 FDPSKSSTYVDVPCSAPECHIGGVQQTRCGATSCEYSVKYGDESETHGSLAEETFTLSPP 225
Query: 254 ---RTVVKNVAIGCGHKNQGMF----VGAAGLLGLGGGSMSLVGQLGGQT---GGAFSYC 303
V GC H+ +F +G AGLLGLG G S++ Q GG FSYC
Sbjct: 226 SPLAPAATGVVFGCSHEYISVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVFSYC 285
Query: 304 LVSRGTGSSGSLVFGREALP----VGAAWVPLVRN-PRAPSFYYVGLSGLGVGGMRIPIS 358
L RG+ + + G A P ++ PL+ + S Y V L+G+ V G + I
Sbjct: 286 LPPRGSSTGYLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIP 345
Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN---LPRASGVSIFDTCY 415
F L G V+D+GT VT +P AY RD F G+ LP S + + DTCY
Sbjct: 346 ASAFSL------GAVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGS-MKLLDTCY 398
Query: 416 NLSGFVSVRVPTVSFYFSGGPVLTLPASNFL--IPVDDAG-----TFCFAFAPSPS-GLS 467
+++G V P V+ F GG + + AS L +P +D C AF P+ S GL
Sbjct: 399 DVTGQDVVTAPRVALEFGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFLPTNSAGLV 458
Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
I+GN+QQ + FD G +GFGPN C
Sbjct: 459 IVGNMQQRAYNVVFDVDGGRIGFGPNGC 486
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 231 bits (588), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 157/454 (34%), Positives = 230/454 (50%), Gaps = 37/454 (8%)
Query: 62 ERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVAT 121
E I ++ SS + ++ L HR S ++ + + ++RD R
Sbjct: 43 EFWGGIEATIPSSSDGTSSVTLSHRYGPCSPADPNSGEKRPTDEE----LLRRDQLRADY 98
Query: 122 LVRRLSGGGADAAKHEVQDFGTDVVS--GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDI 179
+ R+ SG AA + Q V + G + EY + +G+GSP +Q +VID+GSD+
Sbjct: 99 IRRKFSGSNGTAAGEDGQSSKVSVPTTLGSSLDTLEYVISVGLGSPAMTQRVVIDTGSDV 158
Query: 180 VWVQCQPC---SQCYKQSDPVFDPADSASFSGVSCSSAVCDRL----ENAGCHA-GRCRY 231
WVQC+PC S C+ + +FDPA S++++ +CS+A C +L E GC A RC+Y
Sbjct: 159 SWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQY 218
Query: 232 EVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKN--QGMFVGAAGLLGLGGGSMSL 288
V YGDGS T GT + + LT+ G VV+ GC H GM GL+GLGG + SL
Sbjct: 219 IVKYGDGSNTTGTYSSDVLTLSGSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSL 278
Query: 289 VGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA-----AWVPLVRNPRAPSFYYV 343
V Q + G +FSYCL + SSG L G A G A P++R+ + P++Y+
Sbjct: 279 VSQTAARYGKSFSYCLPAT-PASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFA 337
Query: 344 GLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP 403
L + VGG ++ +S +F G ++D+GT +TRLP AY A AF A
Sbjct: 338 ALEDIAVGGKKLGLSPSVFAA------GSLVDSGTVITRLPPAAYAALSSAFRAGMTRYA 391
Query: 404 RASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS- 462
RA + I DTC+N +G V +PTV+ F+GG V+ L A + C AFAP+
Sbjct: 392 RAEPLGILDTCFNFTGLDKVSIPTVALVFAGGAVVDLDAHGIV------SGGCLAFAPTR 445
Query: 463 -PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
IGN+QQ ++ +D G GF C
Sbjct: 446 DDKAFGTIGNVQQRTFEVLYDVGGGVFGFRAGAC 479
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 231 bits (588), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 147/437 (33%), Positives = 230/437 (52%), Gaps = 37/437 (8%)
Query: 89 MSSSSNTTNNMHYHRHQHSF--------HARMQRDVKRVATLVRRLS--GGGADAAKH-- 136
++ SS N H H H S + D + V L RL+ G G+ +AK
Sbjct: 41 INQSSIHLNIYHVHGHGSSLTPNSSSLLSDVLLHDEEHVKALSDRLANKGLGSGSAKPPK 100
Query: 137 -----EVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QC 190
E + G+ GSG Y+V++G+G+PP+ M++D+GS + W+QCQPC+ C
Sbjct: 101 SGHLLEPNSASIPLNPGLSIGSGNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYC 160
Query: 191 YKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH-------AGRCRYEVSYGDGSYTKG 243
+ Q+DP++DP+ S ++ +SC+S C RL+ A + + C Y SYGD S++ G
Sbjct: 161 HAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLCETDSNACLYTASYGDTSFSIG 220
Query: 244 TLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
L+ + LT+ + + GCG NQG+F AAG++GL +S++ QL + G AFSY
Sbjct: 221 YLSQDLLTLTSSQTLPQFTYGCGQDNQGLFGRAAGIIGLARDKLSMLAQLSTKYGHAFSY 280
Query: 303 CLVSRGTGSSGSLVFGREAL-PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
CL + +GSSG ++ P + P++ + + PS Y++ L+ + V G + ++ +
Sbjct: 281 CLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAM 340
Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVA-QTGNLPRASGVSIFDTCYNLSGF 420
+R+ + +D+GT +TRLP Y A R AFV + +A SI DTC+ S
Sbjct: 341 YRVPTL------IDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLK 394
Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQ 478
VP + F GG LTL A + LI D G C AFA S + ++IIGN QQ+
Sbjct: 395 SISAVPEIKMIFQGGADLTLRAPSILIEADK-GITCLAFAGSSGTNQIAIIGNRQQQTYN 453
Query: 479 ISFDGANGFVGFGPNVC 495
I++D + +GF P C
Sbjct: 454 IAYDVSTSRIGFAPGSC 470
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 231 bits (588), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 140/384 (36%), Positives = 203/384 (52%), Gaps = 23/384 (5%)
Query: 119 VATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSD 178
VA+L R D + V + G G G Y R+G+G+P + MV+D+GS
Sbjct: 105 VASLYRANDDAAVDGSLASVP-----LTPGTSYGVGNYVTRMGLGTPAKPYIMVVDTGSS 159
Query: 179 IVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR------Y 231
+ W+QC PC C++QS PVFDP S+S++ VSCS+ C+ L A + C Y
Sbjct: 160 LTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSCSTPQCNDLSTATLNPAACSSSDVCIY 219
Query: 232 EVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQ 291
+ SYGD S++ G L+ +T++ G V N GCG N+G+F +AGL+GL +SL+ Q
Sbjct: 220 QASYGDSSFSVGYLSKDTVSFGSNSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQ 279
Query: 292 LGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVG 351
L G +FSYCL S + S+ P ++ P+V + S Y++ LSG+ V
Sbjct: 280 LAPTLGYSFSYCLPSSSSSGYLSIGSYN---PGQYSYTPMVSSTLDDSLYFIKLSGMTVA 336
Query: 352 GMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF 411
G + +S ++ ++D+GT +TRLPT Y+A A RA SI
Sbjct: 337 GKPLAVSS-----SEYSSLPTIIDSGTVITRLPTTVYDALSKAVAGAMKGTKRADAYSIL 391
Query: 412 DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGN 471
DTC+ + S+RVP VS FSGG L L A N L+ VD + T C AFAP+ S +IIGN
Sbjct: 392 DTCF-VGQASSLRVPAVSMAFSGGAALKLSAQNLLVDVDSSTT-CLAFAPARSA-AIIGN 448
Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
QQ+ + +D + +GF C
Sbjct: 449 TQQQTFSVVYDVKSNRIGFAAGGC 472
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 141/433 (32%), Positives = 222/433 (51%), Gaps = 29/433 (6%)
Query: 70 SNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVR-RLSG 128
S+ + +E +L+LVHR + T+ SF+ ++RD RV ++++ R S
Sbjct: 52 SSKALNEGSSSLKLVHRFGPCNPHRTST-----APASSFNEILRRDKLRVDSIIQARRSM 106
Query: 129 GGADAAKH---EVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ 185
+ +H V +G ++ D Y V +G+G+P + ++ D+GS ++W QC+
Sbjct: 107 NLTSSVEHMKSSVPFYGLSKITASD-----YIVNVGIGTPKKEMPLIFDTGSGLIWTQCK 161
Query: 186 PCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTL 245
PC CY + PVFDP SASF G+ CSS +C + GC + +C Y +Y D S + GTL
Sbjct: 162 PCKACYPKV-PVFDPTKSASFKGLPCSSKLCQSIRQ-GCSSPKCTYLTAYVDNSSSTGTL 219
Query: 246 ALETLTIG--RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC 303
A ET++ + KN+ IGC + G +G +G++GL +SL Q FSYC
Sbjct: 220 ATETISFSHLKYDFKNILIGCSDQVSGESLGESGIMGLNRSPISLASQTANIYDKLFSYC 279
Query: 304 LVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYY-VGLSGLGVGGMRIPISEDLF 362
+ S GS+G L FG + +P + P+ + APS Y + ++G+ VGG ++ I F
Sbjct: 280 IPST-PGSTGHLTFGGK-VPNDVRFSPVSKT--APSSDYDIKMTGISVGGRKLLIDASAF 335
Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS 422
++ +D+G +TRLP AY A R F P DTCY+ S + +
Sbjct: 336 KIAS------TIDSGAVLTRLPPKAYSALRSVFREMMKGYPLLDQDDFLDTCYDFSNYST 389
Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
V +P++S +F GG + + S + V + +C AFA +SI GN QQ+ + FD
Sbjct: 390 VAIPSISVFFEGGVEMDIDVSGIMWQVPGSKVYCLAFAELDDEVSIFGNFQQKTYTVVFD 449
Query: 483 GANGFVGFGPNVC 495
GA +GF P C
Sbjct: 450 GAKERIGFAPGGC 462
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 152/432 (35%), Positives = 213/432 (49%), Gaps = 29/432 (6%)
Query: 82 ELVHRDKMSSSSNTTNNMHYHRH----------QHSFHARMQRDVKRVATLVRRLSGGGA 131
E+ K++ S N + HRH + S ++RD R A + ++S
Sbjct: 44 EVCSGHKVTPSKNGSTLALSHRHGPCSPVISKEKPSHEETLRRDQLRAAYIQAKVSSRYN 103
Query: 132 DAAKHEVQDFGT-DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-- 188
+ AK Q T SG G+ EY + + +G+P +Q M ID+GSD+ WVQC PC+
Sbjct: 104 NVAKELQQSAVTIPTSSGYSLGTTEYVITVTIGTPAVTQVMSIDTGSDVSWVQCAPCAAQ 163
Query: 189 QCYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENAGCHAGRCRYEVSYGDGSYTKGTLA 246
C Q D +FDPA SA++S SC SA C +L E GC +C+Y V YGDGS T GT
Sbjct: 164 SCSSQKDKLFDPAMSATYSAFSCGSAQCAQLGDEGNGCLKSQCQYIVKYGDGSNTAGTYG 223
Query: 247 LETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
+TL++ + VK+ GC H+ G GL+GLGG + SLV Q G AFSYCL
Sbjct: 224 SDTLSLTSSDAVKSFQFGCSHRAAGFVGELDGLMGLGGDTESLVSQTAATYGKAFSYCLP 283
Query: 306 SRGTGSSGSLVFGREALPVGAAW--VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
+ G L G + + P+VR P+FY V L G+ V G + + +F
Sbjct: 284 PPSSSGGGFLTLGAAGGASSSRYSHTPMVRF-SVPTFYGVFLQGITVAGTMLNVPASVF- 341
Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
V+D+GT +T+LP AY+A R AF + P A+ V DTC++ SGF ++
Sbjct: 342 -----SGASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGSLDTCFDFSGFNTI 396
Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDG 483
VPTV+ FS G + L S L AG F I+GN+QQ ++ FD
Sbjct: 397 TVPTVTLTFSRGAAMDLDISGILY----AGCLAFTATAHDGDTGILGNVQQRTFEMLFDV 452
Query: 484 ANGFVGFGPNVC 495
+GF C
Sbjct: 453 GGRTIGFRSGAC 464
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 143/417 (34%), Positives = 209/417 (50%), Gaps = 34/417 (8%)
Query: 104 HQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVG 163
H + D R + R+ A AA + + SG+ + Y I +G
Sbjct: 133 HDRYLRRLLAADESRANSFQLRIRNDRAAAASTQSGSAEVPLTSGIRFQTLNYVTTIALG 192
Query: 164 -----SPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
SP + +++D+GSD+ WVQC+PCS CY Q DP+FDPA SA+++ V C+++ C
Sbjct: 193 GGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACAA 252
Query: 219 LENAG------CHAG--RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
A C G RC Y ++YGDGS+++G LA +T+ +G + GCG N+G
Sbjct: 253 SLKAATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVALGGASLDGFVFGCGLSNRG 312
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG-SSGSLVFG------REALP 323
+F G AGL+GLG +SLV Q + GG FSYCL + +G +SGSL G R P
Sbjct: 313 LFGGTAGLMGLGRTELSLVSQTALRYGGVFSYCLPATTSGDASGSLSLGGDASSYRNTTP 372
Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
V A+ ++ +P P FY++ ++G VGG + +G V++D+GT +TRL
Sbjct: 373 V--AYTRMIADPAQPPFYFLNVTGAAVGGTALAAQ-------GLGASNVLIDSGTVITRL 423
Query: 384 PTPAYEAFRDAFVAQ--TGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
Y R F Q P A G SI DTCY+L+G V+VP ++ GG +T+
Sbjct: 424 APSVYRGVRAEFTRQFAAAGYPTAPGFSILDTCYDLTGHDEVKVPLLTLRLEGGAEVTVD 483
Query: 442 ASNFLIPV-DDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
A+ L V D C A A IIGN QQ+ ++ +D +GF C
Sbjct: 484 AAGMLFVVRKDGSQVCLAMASLSYEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 540
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 187/451 (41%), Positives = 239/451 (52%), Gaps = 54/451 (11%)
Query: 80 NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVA--------------TLVRR 125
++ L+HRD S + N T R R+QRD R A T V
Sbjct: 62 HVRLLHRD--SFAVNATPAQLLAR-------RLQRDELRAAWIIKAAAPAAAANDTPVVG 112
Query: 126 LSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ 185
LS GGA F VVS SGEY +I VG+P + +D+GSDI W+QCQ
Sbjct: 113 LSSGGA---------FVAPVVSRAPTTSGEYMAKIAVGTPAVEALLAMDTGSDITWLQCQ 163
Query: 186 PCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL-ENAGCHAGR--CRYEVSYG-DGSYT 241
PC +CY QS PVFDP S S+ + + C L + G A R C Y V YG DGS T
Sbjct: 164 PCRRCYPQSGPVFDPRHSTSYREMGYDAPDCQALGRSGGGDAKRMTCVYAVGYGDDGSTT 223
Query: 242 KGTLALETLTI-GRTVVKNVAIGCGHKNQGMFVG-AAGLLGLGGGSMSLVGQLG--GQTG 297
G ETLT G V +++IGCGH N+G+F AAG+LGLG G +S Q+ G
Sbjct: 224 VGDFIEETLTFAGGVQVPHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQIAALGYNV 283
Query: 298 GAFSYCLVSRGTGSSGSLVFGREALPVGAA-------WVPLVRNPRAPSFYY-VGLSGLG 349
+FSYCL S G V + GAA + P V+N +FYY +
Sbjct: 284 TSFSYCLADFFLSSPGRSVSSTLTIGDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSV 343
Query: 350 VGGMRIPISEDLFRLTQ-MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS-- 406
G ++ED +L G GV++D+GTAVTRL AY AFRDAF A +L + S
Sbjct: 344 GGVRVPGVTEDDLKLDPYTGRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIG 403
Query: 407 GVS-IFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS-PS 464
G S FDTCY + G +++VPTVS +F+GG LTLP N+LIPVD GT CFAFA +
Sbjct: 404 GPSGFFDTCYTMGG-RAMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDR 462
Query: 465 GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+SIIGNIQQ+G ++ ++ G VGF PN C
Sbjct: 463 SVSIIGNIQQQGFRVVYNIGGGRVGFAPNSC 493
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 151/440 (34%), Positives = 215/440 (48%), Gaps = 42/440 (9%)
Query: 74 SDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADA 133
+D + L+L H D +S Y + Q A + R RVA L A
Sbjct: 23 NDNVGFQLKLTHVDAGTS---------YTKPQLLSRA-IARSKARVAAL------QSAAV 66
Query: 134 AKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQ 193
+ V D T + SGEY V + +G+PP ++D+GSD++W QC PC C Q
Sbjct: 67 SPAPVADPITAARVLVTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCAAQ 126
Query: 194 SDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG 253
P FD SA++ + C S+ C L + C C Y+ YGD + T G LA ET T G
Sbjct: 127 PTPYFDVKRSATYRALPCRSSRCAALSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFG 186
Query: 254 -----RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG 308
+ N++ GCG N G ++G++G G G +SLV QLG FSYCL S
Sbjct: 187 AASSTKVRAANISFGCGSLNAGELANSSGMVGFGRGPLSLVSQLGPSR---FSYCLTSYL 243
Query: 309 TGSSGSLVFGREA----------LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPIS 358
+ + L FG A PV + P V NP P+ Y++ + G+ +G R+PI
Sbjct: 244 SPTPSRLYFGVFANLNSTNTSSGSPVQS--TPFVINPALPNMYFLSVKGISLGTKRLPID 301
Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNL 417
+F + G GV++D+GT++T L AYEA R +A T LP + I DTC+
Sbjct: 302 PLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRG-LASTIPLPAMNDTDIGLDTCFQW 360
Query: 418 SGF--VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQE 475
V+V VP F+F G +TLP N+++ G C A AP+ G +IIGN QQ+
Sbjct: 361 PPPPNVTVTVPDFVFHFDGA-NMTLPPENYMLIASTTGYLCLAMAPTSVG-TIIGNYQQQ 418
Query: 476 GIQISFDGANGFVGFGPNVC 495
+ + +D AN F+ F P C
Sbjct: 419 NLHLLYDIANSFLSFVPAPC 438
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 148/404 (36%), Positives = 212/404 (52%), Gaps = 23/404 (5%)
Query: 100 HYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVR 159
H H S AR+ + T +RR S DA G G G G Y R
Sbjct: 69 HDHARIASLAARLAKTPSSRPTKLRRGSSSSPDAESLASVPLG----PGTSVGVGNYVTR 124
Query: 160 IGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
+G+G+P +S MV+D+GS + W+QC PC C++QS PVF+P S+S++ VSCS+ CD
Sbjct: 125 MGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPRSSSSYASVSCSAPQCDA 184
Query: 219 LENAGCHAGRCR------YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
L A + C Y+ SYGD S++ G L+ +T++ G T V N GCG N+G+F
Sbjct: 185 LTTATLNPSTCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTSVPNFYYGCGQDNEGLF 244
Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLV 332
+AGL+GL +SL+ QL G +FSYCL + + S + P ++ P+
Sbjct: 245 GQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSGYLSIGSYN--PGQYSYTPMA 302
Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
++ S Y++ ++G+ V G + +S + ++D+GT +TRLPT Y A
Sbjct: 303 KSSLDDSLYFIKMTGITVAGKPLSVSASAY-----SSLPTIIDSGTVITRLPTDVYSALS 357
Query: 393 DAFVAQTGNLPRASGVSIFDTCYNLSGFVS-VRVPTVSFYFSGGPVLTLPASNFLIPVDD 451
A PRAS SI DTC+ G S +RVP VS F+GG L L A+N L+ VD
Sbjct: 358 KAVAGAMKGTPRASAFSILDTCFQ--GQASRLRVPQVSMAFAGGAALKLKATNLLVDVDS 415
Query: 452 AGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
A T C AFAP+ S +IIGN QQ+ + +D N +GF C
Sbjct: 416 ATT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKNSKIGFAAGGC 457
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 134/359 (37%), Positives = 195/359 (54%), Gaps = 35/359 (9%)
Query: 163 GSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC-DRLEN 221
GSP + +++D+GSD+ WVQC+PCS CY Q DP+FDPA SA+++ V C+++ C D L
Sbjct: 155 GSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACADSLRA 214
Query: 222 A----------GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
A G + +C Y ++YGDGS+++G LA +T+ +G + GCG N+G+
Sbjct: 215 ATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGASLGGFVFGCGLSNRGL 274
Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG-SSGSLVFG---------REA 321
F G AGL+GLG +SLV Q + GG FSYCL + +G +SGSL G R
Sbjct: 275 FGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSLSLGGGDDAASSYRNT 334
Query: 322 LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
PV A+ ++ +P P FY++ ++G VGG + +G V++D+GT +T
Sbjct: 335 TPV--AYTRMIADPAQPPFYFLNVTGAAVGGTALAAQ-------GLGASNVLIDSGTVIT 385
Query: 382 RLPTPAYEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
RL Y A R F+ Q G P A G SI DTCY+L+G V+VP ++ GG +T
Sbjct: 386 RLAPSVYRAVRAEFMRQFGAAGYPAAPGFSILDTCYDLTGHDEVKVPLLTLRLEGGADVT 445
Query: 440 LPASNFLIPV-DDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ A+ L V D C A A IIGN QQ+ ++ +D +GF C
Sbjct: 446 VDAAGMLFVVRKDGSQVCLAMASLSYEDETPIIGNYQQKNKRVVYDTLGSRLGFADEDC 504
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 134/372 (36%), Positives = 197/372 (52%), Gaps = 23/372 (6%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASF 206
SG+ GSGEYF+ + VG+PP+ +++D+GSD+ W+QC PC +C++Q+ P +DP S+S+
Sbjct: 172 SGVSLGSGEYFIDVFVGTPPKHFSLILDTGSDLNWIQCVPCYECFEQNGPHYDPGQSSSY 231
Query: 207 SGVSCSSAVC------DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV---- 256
+ C + C D + C Y YGD S T G ALET T+ T+
Sbjct: 232 RNIGCHDSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGK 291
Query: 257 -----VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
V+NV GCGH N+G+F GAAGLLGLG G +S QL G +FSYCLV R + +
Sbjct: 292 PELRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDA 351
Query: 312 --SGSLVFGREALPVGAA---WVPLVRNPRAP--SFYYVGLSGLGVGGMRIPISEDLFRL 364
S L+FG + + + LV P +FYYV + + VGG + I E+ +++
Sbjct: 352 NVSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPEEKWQI 411
Query: 365 TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR 424
G G ++D+GT ++ PAY+ ++AF+A+ P + + CYN++G
Sbjct: 412 ATDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLEPCYNVTGVEQPD 471
Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFDG 483
+P FS G V P N+ I ++ C A PS LSIIGN QQ+ I +D
Sbjct: 472 LPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPSALSIIGNYQQQNFHILYDT 531
Query: 484 ANGFVGFGPNVC 495
+GF P C
Sbjct: 532 KKSRLGFAPTKC 543
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 154/463 (33%), Positives = 237/463 (51%), Gaps = 42/463 (9%)
Query: 58 NELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVK 117
+ L E +N N + L L H + SS +T+ SF + +D +
Sbjct: 17 SSLVEFQDN---DNPRQKQEGMQLNLYHVKGLDSSQTSTSPF-------SFSDMITKDEE 66
Query: 118 RVATLVRRLSGGGA--DAAKHEVQDFGTDVVS------GMDQGSGEYFVRIGVGSPPRSQ 169
RV L RL+ + ++A + G +VS G+ GSG Y+V+IG+G+P +
Sbjct: 67 RVRFLHSRLTNKESVRNSATTDKLRGGPSLVSTTPLKSGLSIGSGNYYVKIGLGTPAKYF 126
Query: 170 YMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSC-----SSAVCDRLENAG 223
M++D+GS + W+QCQPC C+ Q DP+F P+ S ++ + C SS L G
Sbjct: 127 SMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSTSKTYKALPCSSSQCSSLKSSTLNAPG 186
Query: 224 CH--AGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKN--VAIGCGHKNQGMFVGAAGLL 279
C G C Y+ SYGD S++ G L+ + LT+ + + GCG NQG+F ++G++
Sbjct: 187 CSNATGACVYKASYGDTSFSIGYLSQDVLTLTPSEAPSSGFVYGCGQDNQGLFGRSSGII 246
Query: 280 GLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS-----SGSLVFGREALPVGA-AWVPLVR 333
GL +S++GQL + G AFSYCL S + SG L G +L + PLV+
Sbjct: 247 GLANDKISMLGQLSKKYGNAFSYCLPSSFSAPNSSSLSGFLSIGASSLTSSPYKFTPLVK 306
Query: 334 NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRD 393
N + PS Y++ L+ + V G + +S + + ++D+GT +TRLP Y A +
Sbjct: 307 NQKIPSLYFLDLTTITVAGKPLGVSASSYNVP------TIIDSGTVITRLPVAVYNALKK 360
Query: 394 AFV-AQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA 452
+FV + +A G SI DTC+ S VP + F GG L L A N L+ ++
Sbjct: 361 SFVLIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIQIIFRGGAGLELKAHNSLVEIEK- 419
Query: 453 GTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
GT C A A S + +SIIGN QQ+ ++++D AN +GF P C
Sbjct: 420 GTTCLAIAASSNPISIIGNYQQQTFKVAYDVANFKIGFAPGGC 462
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 170/513 (33%), Positives = 244/513 (47%), Gaps = 56/513 (10%)
Query: 22 IITTSTSAASDTHFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNL 81
+ S+S F N ++ G D K+ E + +S + S L
Sbjct: 27 VKINSSSPLFGVEFPPFNTAVAVTG--CDSGKLVAAEEALDEQKQPASPSPS-----LKL 79
Query: 82 ELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRL--SGGGADAAKHEVQ 139
L HR + + S ++D R+ T+ RR SGGG A +
Sbjct: 80 RLNHRAAEGGRT----------REESLLDLAEKDAVRIETMYRRAARSGGGRMPASSSPR 129
Query: 140 DFGTD-----VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQS 194
++ V SG+ GSGEY + + VG+PPR M++D+GSD+ W+QC PC C++Q
Sbjct: 130 RALSERMVATVESGVAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQR 189
Query: 195 DPVFDPADSASFSGVSCSSAVCDRLENAG---------CH---AGRCRYEVSYGDGSYTK 242
PVFDPA S+S+ V+C C + C C Y YGD S T
Sbjct: 190 GPVFDPAASSSYRNVTCGDHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTT 249
Query: 243 GTLALETLTIGRTV------VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT 296
G LALE+ T+ T V V GCGH+N+G+F GAAGLLGLG G +S QL
Sbjct: 250 GDLALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVY 309
Query: 297 GGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVR----------NPRAPSFYYVGLS 346
G FSYCLV G+ +VFG + + A P ++ + A +FYYV L
Sbjct: 310 GHTFSYCLVDHGSDVGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLK 369
Query: 347 GLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNLPRA 405
G+ VGG + IS D + + + G G ++D+GT ++ PAY+ R AF+ + + + P
Sbjct: 370 GVLVGGELLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPLV 429
Query: 406 SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG--TFCFAFAPSP 463
+ CYN+SG VP +S F+ G V PA N+ I +D G C A +P
Sbjct: 430 PEFPVLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGGSIMCLAVLGTP 489
Query: 464 -SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+G+SIIGN QQ+ + +D N +GF P C
Sbjct: 490 RTGMSIIGNFQQQNFHVVYDLQNNRLGFAPRRC 522
>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
Length = 493
Score = 228 bits (582), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 175/462 (37%), Positives = 240/462 (51%), Gaps = 47/462 (10%)
Query: 73 SSDEARWNLELVHRDKMSSSSNTTNNMHYHRH----------QHSFHARMQRDVKRVATL 122
S R N +V + + + + T +H HRH + R+ RD R A +
Sbjct: 40 SHQSLRTNKSVVCSESRAPAVHATVPLH-HRHGPCSPLPNKKMPTLEERLHRDKLRAAYI 98
Query: 123 VRRLS--------GGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPP-RSQYMVI 173
R+LS G G D + G + EY + + +GSPP +SQ M+I
Sbjct: 99 HRKLSRGKKQGGGGAGGDVVVQQSHAMTVPTTLGTSLDTLEYVITVRLGSPPGKSQTMLI 158
Query: 174 DSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAVCDRL-----ENAGCHAG 227
D+GSDI WV+C+PC QC Q DP+FDP+ S+++S SCSSA C +L N +G
Sbjct: 159 DTGSDISWVRCKPCWQQCRPQVDPLFDPSLSSTYSPFSCSSAACAQLFQEGNANGCSSSG 218
Query: 228 RCRYEVSYGDGSY-TKGTLALETLTIGR----TVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
+C+Y YGDGS T GT + +TL +G VV GC H G+ AGL+GLG
Sbjct: 219 QCQYIAMYGDGSVGTTGTYSSDTLALGSNSNTVVVSKFRFGCSHAETGITGLTAGLMGLG 278
Query: 283 GGSMSLVGQLGGQTGG-AFSYCLVSRGTGSSGSLVFGREALP-VGAAWVPLVRNPRAPSF 340
GG+ SLV Q G G AFSYCL + SSG L G G P++R+ + P+F
Sbjct: 279 GGAQSLVSQTAGTFGTTAFSYCLPPTPS-SSGFLTLGAAGTSSAGFVKTPMLRSSQVPAF 337
Query: 341 YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVA--- 397
Y V L + VGG ++ I +F G++MD+GT VTRLP AY + AF A
Sbjct: 338 YGVRLEAIRVGGRQLSIPTTVF------SAGMIMDSGTVVTRLPPTAYSSLSSAFKAGMK 391
Query: 398 QTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFS--GGPVLTLPASNFLIPVDDAGTF 455
Q P ++G DTC+++SG SV +PTV+ FS GG V+ L AS L+ ++ + F
Sbjct: 392 QYPPAPSSAGGGFLDTCFDMSGQSSVSMPTVALVFSGAGGAVVNLDASGILLQMETSSIF 451
Query: 456 CFAF-APSPSGLS-IIGNIQQEGIQISFDGANGFVGFGPNVC 495
C AF A S G + IIGN+QQ Q+ +D A G VGF C
Sbjct: 452 CLAFVATSDDGSTGIIGNVQQRTFQVLYDVAGGAVGFKAGAC 493
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 228 bits (582), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 133/353 (37%), Positives = 197/353 (55%), Gaps = 15/353 (4%)
Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASF 206
G+ GSG Y + +G G+P R+Q +V D+GSD+ W+QC+PC+ +CY Q +P+FDP+ S+++
Sbjct: 8 GLFIGSGNYVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDPSLSSTY 67
Query: 207 SGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCG 265
VSC+ C L GC + C Y V YGDGS T G LA++T + KN GCG
Sbjct: 68 RNVSCTEPACVGLSTRGCSSSTCLYGVFYGDGSSTIGFLAMDTFMLTPAQKFKNFIFGCG 127
Query: 266 HKNQGMFVGAAGLLGLGGGSM-SLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
N G+F G AGL+GLG S SL Q+ G FSYCL S + ++G L G
Sbjct: 128 QNNTGLFQGTAGLVGLGRSSTYSLNSQVAPSLGNVFSYCLPSTSS-ATGYLNIGNPQNTP 186
Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
G + ++ + R P+ Y++ L G+ VGG R+ +S +F+ G ++D+GT +TRLP
Sbjct: 187 G--YTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQ-----SVGTIIDSGTVITRLP 239
Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
AY A + A A A V+I DTCY+ S SV P + +F+G V +PA+
Sbjct: 240 PTAYSALKTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIVLHFAGLDV-RIPATG 298
Query: 445 FLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
V ++ C AFA + + IIGN+QQ +++++D +GF C
Sbjct: 299 VFF-VFNSSQVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRIGFSAGAC 350
>gi|356537173|ref|XP_003537104.1| PREDICTED: uncharacterized protein LOC100817302 [Glycine max]
Length = 328
Score = 228 bits (580), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 108/142 (76%), Positives = 121/142 (85%)
Query: 354 RIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDT 413
++ ISEDL+R+T +GD+G VMDTG VTRLPT AY AFRDAFVAQT NLPRA GVSIF+T
Sbjct: 187 QLNISEDLYRVTDLGDEGAVMDTGITVTRLPTVAYGAFRDAFVAQTTNLPRAPGVSIFNT 246
Query: 414 CYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQ 473
CY+L+GFV+VRVPTV FYFSGG +LT+ NFLIP DD GTF FAFA SPS LSIIGNIQ
Sbjct: 247 CYDLNGFVTVRVPTVLFYFSGGQILTILTQNFLIPADDVGTFYFAFAASPSALSIIGNIQ 306
Query: 474 QEGIQISFDGANGFVGFGPNVC 495
QEGIQIS DGANGF+GFG NVC
Sbjct: 307 QEGIQISVDGANGFLGFGRNVC 328
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 228 bits (580), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 141/413 (34%), Positives = 209/413 (50%), Gaps = 29/413 (7%)
Query: 110 ARMQRDVKRVATLVRR----LSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSP 165
+R+Q+ K+ + +S A + ++ Q T + SG+ GSGEYF+ + +G+P
Sbjct: 143 SRLQKSTKKQTNSKQSYKPAVSPVAAASPEYSSQLVAT-LESGVSLGSGEYFMDVFIGTP 201
Query: 166 PRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC------DRL 219
P+ +++D+GSD+ W+QC PC C++QS P +DP +S+SF ++C C D
Sbjct: 202 PKHYSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKESSSFENITCHDPRCKLVSSPDPP 261
Query: 220 ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV---------VKNVAIGCGHKNQG 270
+ C Y YGD S T G ALET T+ T V+NV GCGH N+G
Sbjct: 262 KPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKHVENVMFGCGHWNRG 321
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG--TGSSGSLVFGREALPVGAAW 328
+F GAAGLLGLG G +S QL G +FSYCLV R T S L+FG + +
Sbjct: 322 LFHGAAGLLGLGRGPLSFASQLQSIYGHSFSYCLVDRNSDTSVSSKLIFGEDKELLSHPN 381
Query: 329 VPLV-----RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
+ +FYYVG+ + V G + I E+ + L++ G G ++D+GT +T
Sbjct: 382 LNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKIPEETWHLSKEGGGGTIIDSGTTLTYF 441
Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPAS 443
PAYE ++AF+ + G CYN+SG + +P FS G + P
Sbjct: 442 AEPAYEIIKEAFMKKIKGYELVEGFPPLKPCYNVSGIEKMELPDFGILFSDGAMWDFPVE 501
Query: 444 NFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
N+ I + + C A +P S LSIIGN QQ+ I +D +G+ P C
Sbjct: 502 NYFIQI-EPDLVCLAILGTPKSALSIIGNYQQQNFHILYDMKKSRLGYAPMKC 553
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 228 bits (580), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 138/362 (38%), Positives = 210/362 (58%), Gaps = 27/362 (7%)
Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASF 206
G GSG Y+V++G+GSP R M++D+GS + W+QC+PC C+ Q+DP+FDP+ S ++
Sbjct: 5 GASIGSGNYYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTY 64
Query: 207 SGVSCSSAVCDRLENAGCH-------AGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVK 258
+SC+S+ C L +A + + C Y SYGD SY+ G L+ + LT+ + +
Sbjct: 65 KSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQTLP 124
Query: 259 NVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG 318
GCG ++G+F AAG+LGLG +S++GQ+ + G AFSYCL +RG G G L G
Sbjct: 125 GFVYGCGQDSEGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGG--GFLSIG 182
Query: 319 REALPVGAAW--VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
+ +L G+A+ P+ +P PS Y++ L+ + VGG + ++ +R+ ++D+
Sbjct: 183 KASL-AGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVP------TIIDS 235
Query: 377 GTAVTRLPTPAYEAFRDAFVA-QTGNLPRASGVSIFDTCY--NLSGFVSVRVPTVSFYFS 433
GT +TRLP Y F+ AFV + RA G SI DTC+ NL S VP V F
Sbjct: 236 GTVITRLPMSVYTPFQQAFVKIMSSKYARAPGFSILDTCFKGNLKDMQS--VPEVRLIFQ 293
Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
GG L L N L+ VD+ G C AFA + +G++IIGN QQ+ +++ D + +GF
Sbjct: 294 GGADLNLRPVNVLLQVDE-GLTCLAFAGN-NGVAIIGNHQQQTFKVAHDISTARIGFATG 351
Query: 494 VC 495
C
Sbjct: 352 GC 353
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 228 bits (580), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 167/503 (33%), Positives = 236/503 (46%), Gaps = 72/503 (14%)
Query: 17 HLLCSIITTSTSAASDTHFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSN----T 72
HLLC + S S F+ G + Q E N + S++ T
Sbjct: 8 HLLCLCLVISLSTTYAFGFE---------GRKIAQENHLQLIHAIEISNLLPSADCEHST 58
Query: 73 SSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGG-- 130
+ + +L++VH+ S N N + + + D RV ++ +LS
Sbjct: 59 KVAQNKASLKVVHKHGPCSQLNQQNG-----NAPNLVEILLEDQSRVDSIHAKLSDHSGV 113
Query: 131 --ADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS 188
DAAK + SGM G+G Y V IG+GSP + ++ D+GSD+ W +C
Sbjct: 114 KETDAAKLPTK-------SGMSLGTGNYIVSIGLGSPKKDLMLIFDTGSDLTWARCS--- 163
Query: 189 QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG-----CHAGRCRYEVSYGDGSYTKG 243
+ FDP S S++ VSCS+ +C + +A C A C Y + YGDGSY+ G
Sbjct: 164 -----AAETFDPTKSTSYANVSCSTPLCSSVISATGNPSRCAASTCVYGIQYGDGSYSIG 218
Query: 244 TLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
L E LTIG T + N GCG G+F AAGLLGLG +S+V Q + FSY
Sbjct: 219 FLGKERLTIGSTDIFNNFYFGCGQDVDGLFGKAAGLLGLGRDKLSVVSQTAPKYNQLFSY 278
Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
CL S + S+G L FG + A + PL P SFY + L+G+ VGG ++ I +F
Sbjct: 279 CLPS--SSSTGFLSFG-SSQSKSAKFTPLSSGPS--SFYNLDLTGITVGGQKLAIPLSVF 333
Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS 422
G ++D+GT VTRLP AY A R AF + P +SI DTCY+ S + +
Sbjct: 334 STA-----GTIIDSGTVVTRLPPAAYSALRSAFRKAMASYPMGKPLSILDTCYDFSKYKT 388
Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF--------CFAFAPSPSG--LSIIGNI 472
++VP + FSGG + VD AG F C AFA + +I GN
Sbjct: 389 IKVPKIVISFSGG---------VDVDVDQAGIFVANGLKQVCLAFAGNTGARDTAIFGNT 439
Query: 473 QQEGIQISFDGANGFVGFGPNVC 495
QQ ++ +D + G VGF P C
Sbjct: 440 QQRNFEVVYDVSGGKVGFAPASC 462
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 227 bits (579), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 147/433 (33%), Positives = 223/433 (51%), Gaps = 45/433 (10%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+++EL+HRD S +++QH F +R + R A H
Sbjct: 28 FSVELIHRDSPKSPYYKPTE---NKYQH-FVDAARRSINR---------------ANHFF 68
Query: 139 QDFGTDVV-SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
+D T S + G Y + VG+PP Y + D+GSDIVW+QC+PC QCY Q+ P+
Sbjct: 69 KDSDTSTPESTVIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPI 128
Query: 198 FDPADSASFSGVSCSSAVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRT- 255
F+P+ S+S+ + CSS +C + + C C+Y++SYGD S+++G L+++TL++ T
Sbjct: 129 FNPSKSSSYKNIPCSSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTS 188
Query: 256 ----VVKNVAIGCGHKNQGMFVGA-AGLLGLGGGSMSLVGQLGGQTGGAFSYCLV---SR 307
+ IGCG N G F GA +G++GLGGG +SL+ QLG GG FSYCLV ++
Sbjct: 189 GSPVSFPKIVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNK 248
Query: 308 GTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
+ +S L FG A+ G V PL++ + P FY++ L VG R+ +
Sbjct: 249 ESNASSILSFGDAAVVSGDGVVSTPLIK--KDPVFYFLTLQAFSVGNKRVEFGGS----S 302
Query: 366 QMGDD--GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLSGFVS 422
+ GDD +++D+GT +T +P+ Y A V L R + F CY+L
Sbjct: 303 EGGDDEGNIIIDSGTTLTLIPSDVYTNLESA-VVDLVKLDRVDDPNQQFSLCYSLKS-NE 360
Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
P ++ +F G V S F +P+ D G CFAF PSP SI GN+ Q+ + + +D
Sbjct: 361 YDFPIITVHFKGADVELHSISTF-VPITD-GIVCFAFQPSPQLGSIFGNLAQQNLLVGYD 418
Query: 483 GANGFVGFGPNVC 495
V F P C
Sbjct: 419 LQQKTVSFKPTDC 431
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 227 bits (579), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 136/375 (36%), Positives = 200/375 (53%), Gaps = 31/375 (8%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
V SG + Y +G+G + +++D+ S++ WVQC PC C+ Q DP+FDP+ S
Sbjct: 142 VTSGAKLRTLNYVATVGLGGGEAT--VIVDTASELTWVQCAPCESCHDQQDPLFDPSSSP 199
Query: 205 SFSGVSCSSAVCDRLE---------NAGCH-----AGRCRYEVSYGDGSYTKGTLALETL 250
S++ V C+S+ CD L+ A C A C Y +SY DGSY++G LA + L
Sbjct: 200 SYAAVPCNSSSCDALQLATGGTSGGAAACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRL 259
Query: 251 TIGRTVVKNVAIGCGHKNQG-MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT 309
++ V+ GCG NQG F G +GL+GLG +SLV Q Q GG FSYCL + +
Sbjct: 260 SLAGEVIDGFVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTMDQFGGVFSYCLPLKES 319
Query: 310 GSSGSLVFG------REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
SSGSLV G R + P+ + +V +P FY+V L+G+ VGG + E
Sbjct: 320 DSSGSLVIGDDSSVYRNSTPI--VYASMVSDPLQGPFYFVNLTGITVGGQEV---ESSGF 374
Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
+ G ++D+GT +T L Y A + F++Q P+A G SI DTC+N++G V
Sbjct: 375 SSGGGGGKAIIDSGTVITSLVPSIYNAVKAEFLSQFAEYPQAPGFSILDTCFNMTGLREV 434
Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPV-DDAGTFCFAFAP--SPSGLSIIGNIQQEGIQIS 480
+VP++ F GG + + + L V D+ C A AP S +IIGN QQ+ +++
Sbjct: 435 QVPSLKLVFDGGVEVEVDSGGVLYFVSSDSSQVCLAMAPLKSEYETNIIGNYQQKNLRVI 494
Query: 481 FDGANGFVGFGPNVC 495
FD + VGF C
Sbjct: 495 FDTSGSQVGFAQETC 509
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 227 bits (579), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 138/372 (37%), Positives = 198/372 (53%), Gaps = 24/372 (6%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASF 206
SG+ GSGEYF+ + VG+PP+ +++D+GSD+ W+QC PC C++QS P +DP DS+SF
Sbjct: 186 SGVSLGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSF 245
Query: 207 SGVSCSSAVCDRLENAG----CHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTV---- 256
+SC C + + C A C Y YGDGS T G ALET T+ T
Sbjct: 246 RNISCHDPRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGK 305
Query: 257 -----VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
V+NV GCGH N+G+F GAAGLLGLG G +S Q+ G +FSYCLV R + +
Sbjct: 306 SELKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNA 365
Query: 312 SGS--LVFGREALPVGAAWVPLV-----RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL 364
S S L+FG + + + ++ +FYYV ++ + V + I E+ + L
Sbjct: 366 SVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEETWHL 425
Query: 365 TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR 424
+ G G ++D+GT +T PAYE ++AFV + G+ CYN+SG +
Sbjct: 426 SSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVEGLPPLKPCYNVSGIEKME 485
Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDG 483
+P F+ G V P N+ I + D C A +P S LSIIGN QQ+ I +D
Sbjct: 486 LPDFGILFADGAVWNFPVENYFIQI-DPDVVCLAILGNPRSALSIIGNYQQQNFHILYDM 544
Query: 484 ANGFVGFGPNVC 495
+G+ P C
Sbjct: 545 KKSRLGYAPMKC 556
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 147/416 (35%), Positives = 213/416 (51%), Gaps = 39/416 (9%)
Query: 108 FHARMQRDVKRVATLVRRLSG------------------GGADAAKHEVQDFGTDVV--S 147
F + D RVA L RL+ GGA H D V
Sbjct: 66 FSTVLTHDDARVAHLASRLAASDPPSRRPTSLRKQKKAAGGASGGHHLDDDSLASVPLSP 125
Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASF 206
G G G Y ++G+G+P S MV+D+GS + W+QC PC C++Q P+FDP S+++
Sbjct: 126 GTSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTY 185
Query: 207 SGVSCSSAVCDRLENA-----GCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNV 260
+ V CS++ CD L+ A C A C Y+ SYGD S++ G+L+ +T++ G T +
Sbjct: 186 ASVRCSASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGSLSTDTVSFGSTRYPSF 245
Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE 320
GCG N+G+F +AGL+GL +SL+ QL G +FSYCL + S+G L G
Sbjct: 246 YYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPT--AASTGYLSIGPY 303
Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
++ P+ + S Y++ LSG+ VGG + +S ++ ++D+GT +
Sbjct: 304 NTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSP-----SEYSSLPTIIDSGTVI 358
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS-VRVPTVSFYFSGGPVLT 439
TRLPT + A A RA SI DTC+ G S +RVPTV+ F+GG +
Sbjct: 359 TRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFE--GQASQLRVPTVAMAFAGGASMK 416
Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L N LI VDD+ T C AFAP+ S +IIGN QQ+ + +D A +GF C
Sbjct: 417 LTTRNVLIDVDDSTT-CLAFAPTDS-TAIIGNTQQQTFSVIYDVAQSRIGFSAGGC 470
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 135/373 (36%), Positives = 192/373 (51%), Gaps = 24/373 (6%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASF 206
SG+ GSGEYF+ + +GSPP+ +++D+GSD+ W+QC PC C++Q+ P +DP DS SF
Sbjct: 187 SGVSLGSGEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSISF 246
Query: 207 SGVSCSSAVC------DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV---- 256
++C+ C D C Y YGD S T G ALET T+ T
Sbjct: 247 RNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTG 306
Query: 257 ------VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--G 308
V+NV GCGH N+G+F GAAGLLGLG G +S QL G +FSYCLV R
Sbjct: 307 KSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRDSD 366
Query: 309 TGSSGSLVFGREALPVGAA---WVPLVRNPRAP--SFYYVGLSGLGVGGMRIPISEDLFR 363
T S L+FG + + + L+ P +FYY+ + + VGG ++ I E+ +
Sbjct: 367 TSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWN 426
Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
L+ G G ++D+GT ++ PAY ++AF+ + I CYN+SG +
Sbjct: 427 LSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDEL 486
Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFD 482
P F+ G V P N+ I + C A +P S LSIIGN QQ+ I +D
Sbjct: 487 NFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQQQNFHILYD 546
Query: 483 GANGFVGFGPNVC 495
N +G+ P C
Sbjct: 547 TKNSRLGYAPMRC 559
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 160/440 (36%), Positives = 221/440 (50%), Gaps = 43/440 (9%)
Query: 80 NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
+++VHR + S T H H H + ++RD RV ++ RRL+G G AA
Sbjct: 61 TIQIVHRACLQSGDRKTVPDH---HPH-YTGILRRDHNRVRSIHRRLTGAGDTAAT---- 112
Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVF 198
G+ S EY V IG+G+P R+ ++ D+GSD+ WVQC+PC+ CY+Q +P+F
Sbjct: 113 ---IPASLGLAFHSLEYVVTIGIGTPARNFTVLFDTGSDLTWVQCKPCTDSCYQQQEPLF 169
Query: 199 DPADSASFSGVSCSSAVCDR--LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV 256
DP+ S+++ V C + C ++ C C Y V YGD S T+G LA E T+ +
Sbjct: 170 DPSKSSTYVDVPCGTPQCKIGGGQDLTCGGTTCEYSVKYGDQSVTRGNLAQEAFTLSPSA 229
Query: 257 --VKNVAIGCGHKNQGMFVGA------AGLLGLGGGSMSLVGQL-GGQTGGAFSYCLVSR 307
V GC H+ GA AGLLGLG G S++ Q G +G FSYCL R
Sbjct: 230 PPAAGVVFGCSHEYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYCLPPR 289
Query: 308 GTGSSGSLVFGREALP-VGAAWVPLVR-NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
G+ S+G L G A P ++ PLV N + S Y V L G+ V G +PI F +
Sbjct: 290 GS-SAGYLTIGAAAPPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYI- 347
Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN---LPRASGVSIFDTCYNLSGFVS 422
G V+D+GT +T +P AY RD F G LP V DTCY+++G
Sbjct: 348 -----GTVIDSGTVITHMPAAAYYVLRDEFRRHMGGYTMLPEGH-VESLDTCYDVTGHDV 401
Query: 423 VRVPTVSFYFSGGPVLTLPASNFLI--PVDDAGT----FCFAFAPSP-SGLSIIGNIQQE 475
V P V+ F GG + + AS L+ VD +G C AF P+ G IIGN+QQ
Sbjct: 402 VTAPPVALEFGGGARIDVDASGILLVFAVDASGQSLTLACLAFVPTNLPGFVIIGNMQQR 461
Query: 476 GIQISFDGANGFVGFGPNVC 495
+ FD +GFG N C
Sbjct: 462 AYNVVFDVEGRRIGFGANGC 481
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 226 bits (575), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 135/373 (36%), Positives = 192/373 (51%), Gaps = 24/373 (6%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASF 206
SG+ GSGEYF+ + +GSPP+ +++D+GSD+ W+QC PC C++Q+ P +DP DS SF
Sbjct: 187 SGVSLGSGEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSISF 246
Query: 207 SGVSCSSAVC------DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV---- 256
++C+ C D C Y YGD S T G ALET T+ T
Sbjct: 247 RNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTG 306
Query: 257 ------VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--G 308
V+NV GCGH N+G+F GAAGLLGLG G +S QL G +FSYCLV R
Sbjct: 307 KSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRDSD 366
Query: 309 TGSSGSLVFGREALPVGAA---WVPLVRNPRAP--SFYYVGLSGLGVGGMRIPISEDLFR 363
T S L+FG + + + L+ P +FYY+ + + VGG ++ I E+ +
Sbjct: 367 TSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWN 426
Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
L+ G G ++D+GT ++ PAY ++AF+ + I CYN+SG +
Sbjct: 427 LSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDEL 486
Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFD 482
P F+ G V P N+ I + C A +P S LSIIGN QQ+ I +D
Sbjct: 487 NFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQQQNFHILYD 546
Query: 483 GANGFVGFGPNVC 495
N +G+ P C
Sbjct: 547 TKNSRLGYAPMRC 559
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 225 bits (574), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 130/362 (35%), Positives = 188/362 (51%), Gaps = 28/362 (7%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
GEY + +G+G+PPR ++D+GSD++W QC PC C Q P FDPA S S++ + C+S
Sbjct: 87 GEYLMSMGIGTPPRYYSAILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSYAKLPCNS 146
Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG----RTVVKNVAIGCGHKNQ 269
+C+ L C+ C Y+ YGD + T G L+ ET T G R V +A GCG+ N
Sbjct: 147 PMCNALYYPLCYRNVCVYQYFYGDSANTAGVLSNETFTFGTNDTRVTVPRIAFGCGNLNA 206
Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL------- 322
G +G++G G G +SLV QLG FSYCL S + L FG A
Sbjct: 207 GSLFNGSGMVGFGRGPLSLVSQLGSPR---FSYCLTSFMSPVPSRLYFGAYATLNSTSAS 263
Query: 323 ---PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM-GDDGVVMDTGT 378
PV + P + NP P+ YY+ ++G+ VGG +PI +F + G GV++D+G+
Sbjct: 264 TGEPVQS--TPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADGTGGVIIDSGS 321
Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVS---IFDTCYNLSGFVS--VRVPTVSFYFS 433
+T L AY+ AF Q G LP + S + DTC+ V +P ++F+F
Sbjct: 322 TITYLARAAYDMVHQAFADQVG-LPLTNATSLADVLDTCFVWPPPPRKIVTMPELAFHFE 380
Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
G + LP N+++ D G C A A S G SIIG+ Q + + +D N + F P
Sbjct: 381 GA-NMELPLENYMLIDGDTGNLCLAIAASDDG-SIIGSFQHQNFHVLYDNENSLLSFTPA 438
Query: 494 VC 495
C
Sbjct: 439 TC 440
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 225 bits (574), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 142/409 (34%), Positives = 208/409 (50%), Gaps = 34/409 (8%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQD----FGTDVVSGMDQGSGEYFVRIGVGSPPR 167
+ D RV++L RR+ + + E + + SG + + Y +G+G+
Sbjct: 72 LSSDAARVSSLQRRIESYRSSSEGEEEEASKLALQVPITSGANLRTLNYVATVGLGAAEA 131
Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC--- 224
+ +V+D+ S++ WVQCQPC C+ Q DP+FDP+ S S++ V C+S+ CD L A
Sbjct: 132 T--VVVDTASELTWVQCQPCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALRVAMAAGT 189
Query: 225 --------HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM-FVGA 275
C Y +SY DGSY++G LA + L + ++ GCG NQG F G
Sbjct: 190 SPCADDNEQQPACSYALSYRDGSYSRGVLARDKLRLAGQDIEGFVFGCGTSNQGAPFGGT 249
Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG------REALPVGAAWV 329
+GL+GLG +SLV Q Q GG FSYCL R +GSSGSLV G R + P+ +
Sbjct: 250 SGLMGLGRSHVSLVSQTMDQFGGVFSYCLPMRESGSSGSLVLGDDSSAYRNSTPIVYTAM 309
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
P FY++ L+G+ VGG + F + V++D+GT +T L Y
Sbjct: 310 VSDSGPLQGPFYFLNLTGITVGGQE--VESPWFSAGR-----VIIDSGTIITTLVPSVYN 362
Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
A R F++Q P+A SI DTC+NL+G V+VP++ F F G + + + L V
Sbjct: 363 AVRAEFLSQLAEYPQAPAFSILDTCFNLTGLKEVQVPSLKFVFEGSVEVEVDSKGVLYFV 422
Query: 450 -DDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
DA C A A S SIIGN QQ+ +++ FD +GF C
Sbjct: 423 SSDASQVCLALASLKSEYDTSIIGNYQQKNLRVIFDTLGSQIGFAQETC 471
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 225 bits (574), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 146/416 (35%), Positives = 211/416 (50%), Gaps = 39/416 (9%)
Query: 108 FHARMQRDVKRVATLVRRLSG------------------GGADAAKHEVQD--FGTDVVS 147
F + D RVA L RL+ GGA H D +
Sbjct: 66 FSTVLTHDDARVAHLASRLAASDPPSRRPTSLRKQKKAAGGASGGHHLDDDSLASVPLSP 125
Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASF 206
G G G Y ++G+G+P S MV+D+GS + W+QC PC C++Q P+FDP S+++
Sbjct: 126 GTSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTY 185
Query: 207 SGVSCSSAVCDRLENA-----GCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNV 260
+ V CS++ CD L+ A C A C Y+ SYGD S++ G L+ +T++ G T +
Sbjct: 186 TSVRCSASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGYLSTDTVSFGSTSYPSF 245
Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE 320
GCG N+G+F +AGL+GL +SL+ QL G +FSYCL + S+G L G
Sbjct: 246 YYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPT--AASTGYLSIGPY 303
Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
++ P+ + S Y++ LSG+ VGG + +S ++ ++D+GT +
Sbjct: 304 NTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSP-----SEYSSLPTIIDSGTVI 358
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS-VRVPTVSFYFSGGPVLT 439
TRLPT + A A RA SI DTC+ G S +RVPTV F+GG +
Sbjct: 359 TRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFE--GQASQLRVPTVVMAFAGGASMK 416
Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L N LI VDD+ T C AFAP+ S +IIGN QQ+ + +D A +GF C
Sbjct: 417 LTTRNVLIDVDDSTT-CLAFAPTDS-TAIIGNTQQQTFSVIYDVAQSRIGFSAGGC 470
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 225 bits (573), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 146/431 (33%), Positives = 225/431 (52%), Gaps = 44/431 (10%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
++ EL+HRD SS + ++ QH +A +R + R L + ++ +
Sbjct: 28 FSFELIHRD---SSKSPLYKPAQNKFQHVVNA-ARRSINRANRLFKDSLSNTPESTVY-- 81
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
V+G GEY + VG+PP + Y V+D+GSDIVW+QC+PC QCYKQ+ P+F
Sbjct: 82 -------VNG-----GEYLMTYSVGTPPFNVYGVVDTGSDIVWLQCKPCEQCYKQTTPIF 129
Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVV 257
+P+ S+S+ + CSS +C + C+ C Y +++ D SY++G L++ETLT+ T
Sbjct: 130 NPSKSSSYKNIPCSSNLCQSVRYTSCNKQNSCEYTINFSDQSYSQGELSVETLTLDSTTG 189
Query: 258 KNVA-----IGCGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--GT 309
+V+ IGCGH N+GMF G +G++GLG G +SL QL GG FSYCL+ +
Sbjct: 190 HSVSFPKTVIGCGHNNRGMFQGETSGIVGLGIGPVSLTTQLKSSIGGKFSYCLLPLLVDS 249
Query: 310 GSSGSLVFGREALPVGAAWV--PLV-RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
+ L FG A+ G V P V ++P+A FYY+ L VG RI F +
Sbjct: 250 NKTSKLNFGDAAVVSGDGVVSTPFVKKDPQA--FYYLTLEAFSVGNKRIE-----FEVLD 302
Query: 367 MGDDG-VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLSGFVSVR 424
++G +++D+GT +T LP+ Y A VAQ L R + + + CY+++
Sbjct: 303 DSEEGNIILDSGTTLTLLPSHVYTNLESA-VAQLVKLDRVDDPNQLLNLCYSITS-DQYD 360
Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGA 484
P ++ +F G + P S F D G C AF S +G I GN+ Q + + +D
Sbjct: 361 FPIITAHFKGADIKLNPISTFAHVAD--GVVCLAFTSSQTG-PIFGNLAQLNLLVGYDLQ 417
Query: 485 NGFVGFGPNVC 495
V F P+ C
Sbjct: 418 QNIVSFKPSDC 428
>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 224 bits (572), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 151/430 (35%), Positives = 207/430 (48%), Gaps = 68/430 (15%)
Query: 76 EARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHAR-MQRDVKRVATLVRRLSGGGADAA 134
+ R +LE+VH+ S + H+ H + + +D RVA++ RL+ A +
Sbjct: 14 DQRASLEVVHKHGPCS------KLRPHKANSPSHTQILAQDESRVASIQSRLAKNLAGGS 67
Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQ 193
+ S GSG Y V +G+GSP R + D+GSD+ W QC+PC CY+Q
Sbjct: 68 NLKASKATLPSKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQ 127
Query: 194 SDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAGRCRYEVSYGDGSYTKGTLALE 248
+ +FDP+ S S+S VSC S C++LE+A GC + C Y + YGDGSY+ G A E
Sbjct: 128 REHIFDPSTSLSYSNVSCDSPSCEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFARE 187
Query: 249 TLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
L++ T V N GCG N+G+F G AGLLGL +SLV Q + G FSYCL
Sbjct: 188 KLSLTSTDVFNNFQFGCGQNNRGLFGGTAGLLGLARNPLSLVSQTAQKYGKVFSYCL-PS 246
Query: 308 GTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
+ S+G L FG S D
Sbjct: 247 SSSSTGYLSFG---------------------------------------SGD------- 260
Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
GD V T RLP Y + + F + PR GVSI DTCY+LS + +V+VP
Sbjct: 261 GDSKAVKFT----PRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDLSKYKTVKVPK 316
Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGAN 485
+ YFSGG + L A +I V C AFA ++IIGN+QQ+ I + +D A
Sbjct: 317 IILYFSGGAEMDL-APEGIIYVLKVSQVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAE 375
Query: 486 GFVGFGPNVC 495
G VGF P+ C
Sbjct: 376 GRVGFAPSGC 385
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 224 bits (571), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 159/428 (37%), Positives = 219/428 (51%), Gaps = 41/428 (9%)
Query: 91 SSSNTTNNMHYHRH----------QHSFHARMQRDVKRVATLVRRLS---GGGADAAKHE 137
SSS TT + HRH + + ++RD R + +LS G G D +
Sbjct: 49 SSSGTTVPLS-HRHGPCSPAPSTVEPTMAELLRRDQLRAKYIQAKLSVNSGSGTDGVQQS 107
Query: 138 VQ-DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP 196
T + S +D + Y + + +G+P +Q ++ID+GSD+ WV C ++ S
Sbjct: 108 AAITLPTTLGSALD--TLAYVITVSIGTPAMTQAVMIDTGSDVSWVHCH--ARAGAGSSL 163
Query: 197 VFDPADSASFSGVSCSSAVCDRLE--NAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIG 253
FDP S++++ SCSSA C RLE + GC C+Y V YGDGS T GT +TL +
Sbjct: 164 FFDPGKSSTYTPFSCSSAACTRLEGRDNGCSLNSTCQYTVRYGDGSNTTGTYGSDTLALN 223
Query: 254 RT-VVKNVAIGCGHKN---QGMFVGAA-GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG 308
T V+N GC + +G+ GL+GLGGG+ SLV Q G AFSYCL +
Sbjct: 224 STEKVENFQFGCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCLPAT- 282
Query: 309 TGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
T SSG L G G P+ R+ RAP+FY+V L G+ VGG + IS +F
Sbjct: 283 TRSSGFLTLGASTGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFAA---- 338
Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
G +MD+GT +TRLP AY A AF A PRA SI DTC++ +G +V +P V
Sbjct: 339 --GSIMDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSILDTCFDFTGQDNVSIPAV 396
Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL-SIIGNIQQEGIQISFDGANGF 487
FSGG V+ L A + C AFAP+ G+ SIIGN+QQ ++ D
Sbjct: 397 ELVFSGGAVVDLDADGIMY------GSCLAFAPATGGIGSIIGNVQQRTFEVLHDVGQSV 450
Query: 488 VGFGPNVC 495
+GF P C
Sbjct: 451 LGFRPGAC 458
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 224 bits (571), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 159/449 (35%), Positives = 224/449 (49%), Gaps = 41/449 (9%)
Query: 88 KMSSSSNTTNNMHYHRH--------QHSFHARMQRDVKRVATLVRRLSGGGADAA----- 134
K +S + + +H +R + S +D R+ T+ RR + G D
Sbjct: 66 KQPASLSPSLKLHMNRRAAEGGRTRKESVLDLADKDAVRIETMHRRAARSGGDRTPASPS 125
Query: 135 ----KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC 190
+ + V SG+ GSGEY + + VG+PPR M++D+GSD+ W+QC PC C
Sbjct: 126 SSPRRALSERMVATVESGVAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDC 185
Query: 191 YKQSDPVFDPADSASFSGVSCSSAVCDRLENA----GCH---AGRCRYEVSYGDGSYTKG 243
+ Q PVFDPA S+S+ V+C C + C C Y YGD S T G
Sbjct: 186 FDQVGPVFDPAASSSYRNVTCGDQRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTG 245
Query: 244 TLALETLTIGRTV------VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTG 297
LALE+ T+ T V +V GCGH N+G+F GAAGLLGLG G +S QL G
Sbjct: 246 DLALESFTVNLTAPGASRRVDDVVFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLRAVYG 305
Query: 298 GAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVR-------NPRAPSFYYVGLSGLGV 350
FSYCLV G+ + +VFG + AA P + + A +FYYV L G+ V
Sbjct: 306 HTFSYCLVDHGSDVASKVVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLV 365
Query: 351 GGMRIPISEDLF--RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG-NLPRASG 407
GG + IS D + + G G ++D+GT ++ PAY+ R AF+ + G + P
Sbjct: 366 GGELLNISSDTWGVGEGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIPD 425
Query: 408 VSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGL 466
+ CYN+SG VP +S F+ G V PA N+ I +D G C A +P +G+
Sbjct: 426 FPVLSPCYNVSGVDRPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGM 485
Query: 467 SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
SIIGN QQ+ + +D N +GF P C
Sbjct: 486 SIIGNFQQQNFHVVYDLKNNRLGFAPRRC 514
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 128/353 (36%), Positives = 196/353 (55%), Gaps = 13/353 (3%)
Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC 211
G GE+ V I +G+PP+ ++ID+GSD+ W+Q +PC C++Q+DP+FDP+ S++++ ++C
Sbjct: 21 GYGEFLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPCRACFEQADPIFDPSKSSTYNKIAC 80
Query: 212 SSAVC-DRLENAGCH-AGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQ 269
SS+ C D L C A C Y YGDGS T+G + ET+T T + V G N
Sbjct: 81 SSSACADLLGTQTCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAGEEVKFGASVYNT 140
Query: 270 GMF--VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV---SRGTGSSGSLVFGREALPV 324
G F G G+LGLG G +S+ QLG G FSYCLV S G+ +S ++ FG A+P
Sbjct: 141 GTFGDTGGEGILGLGQGPVSMPSQLGSVLGNKFSYCLVDWLSAGSETS-TMYFGDAAVPS 199
Query: 325 GAA-WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
G + P+V N P++YY+ + G+ VGG + I + ++ + G G ++D+GT +T L
Sbjct: 200 GEVQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGGSGGTIIDSGTTITYL 259
Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPAS 443
+ A A+ +Q P + + D C+N G S P ++ + G L LP +
Sbjct: 260 QQEVFNALVAAYTSQV-RYPTTTSATGLDLCFNTRGTGSPVFPAMTIHLD-GVHLELPTA 317
Query: 444 NFLIPVDDAGTFCFAFAPSPS-GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
N I + + C AFA + ++I GNIQQ+ I +D N +GF P C
Sbjct: 318 NTFISL-ETNIICLAFASALDFPIAIFGNIQQQNFDIVYDLDNMRIGFAPADC 369
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 153/441 (34%), Positives = 232/441 (52%), Gaps = 34/441 (7%)
Query: 69 SSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHAR--MQRDVKRVATLVRRL 126
S+ S E + L++VH+ S R H A+ + +D RV ++ +L
Sbjct: 73 STQVPSIENKAFLKVVHKHGPCSD---------LRQGHKAEAQYILLQDQSRVDSIHSKL 123
Query: 127 SGGGADAAKHEVQDFGTDVVSGMDQ---GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQ 183
S D+ +V+ + D GSG YFV +G+G+P + ++ D+GSD+ W Q
Sbjct: 124 S---KDSGLSDVKATAATTLPAKDGSIIGSGNYFVTVGLGTPKKDFSLIFDTGSDLTWTQ 180
Query: 184 CQPCSQ-CYKQSDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAGRCRYEVSYGD 237
C+PC + CY Q + +F+P+ S S++ +SC S +CD L +A C + C Y + YGD
Sbjct: 181 CEPCVKSCYNQKEAIFNPSQSTSYANISCGSTLCDSLASATGNIFNCASSTCVYGIQYGD 240
Query: 238 GSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT 296
S++ G E L++ T V + GCG N+G+F GAAGLLGLG +SLV Q +
Sbjct: 241 SSFSIGFFGKEKLSLTATDVFNDFYFGCGQNNKGLFGGAAGLLGLGRDKLSLVSQTAQRY 300
Query: 297 GGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP 356
FSYCL + S+G L FG A++ PL SFY + L+G+ VGG ++
Sbjct: 301 NKIFSYCL-PSSSSSTGFLTFGGST-SKSASFTPLATISGGSSFYGLDLTGISVGGRKLA 358
Query: 357 ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYN 416
IS +F G ++D+GT +TRLP AY A F P A +SI DTC++
Sbjct: 359 ISPSVFSTA-----GTIIDSGTVITRLPPAAYSALSSTFRKLMSQYPAAPALSILDTCFD 413
Query: 417 LSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQ 474
S ++ VP + +FSGG V+ + + + V+D C AFA S ++I GN+QQ
Sbjct: 414 FSNHDTISVPKIGLFFSGGVVVDIDKTG-IFYVNDLTQVCLAFAGNSDASDVAIFGNVQQ 472
Query: 475 EGIQISFDGANGFVGFGPNVC 495
+ +++ +DGA G VGF P C
Sbjct: 473 KTLEVVYDGAAGRVGFAPAGC 493
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 155/444 (34%), Positives = 223/444 (50%), Gaps = 38/444 (8%)
Query: 68 SSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLS 127
SS N A ++ LVHR ++S ++ SF ++ R + R S
Sbjct: 44 SSVNLEPSSATLSVPLVHRYGPCAASQYSD-----MPTPSFSETLRHSRARTNYIKSRAS 98
Query: 128 GGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC 187
G A T + +D S EY V +G G+P Q +++D+GSD+ WVQC PC
Sbjct: 99 TGMASTPDDAAVTVPTRLGGFVD--SLEYMVTLGFGTPSVPQVLLMDTGSDVSWVQCAPC 156
Query: 188 --SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN---AGCHAG--RCRYEVSYGDGSY 240
++CY Q DP+FDP+ S++++ ++C + C++L + GC +G +C Y V YGDGS
Sbjct: 157 NSTECYPQKDPLFDPSKSSTYAPIACGADACNKLGDHYRNGCTSGGTQCGYRVEYGDGSS 216
Query: 241 TKGTLALETLTIGRTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA 299
T+G + ET+T + VK+ GCGH +G GLLGLGG SLV Q GGA
Sbjct: 217 TRGVYSNETITFAPGITVKDFHFGCGHDQRGPSDKFDGLLGLGGAPESLVVQTASVYGGA 276
Query: 300 FSYCLVSRGTGSSGSLVFGREALPVGA------AWVPLVRNPRAPSFYYVGLSGLGVGGM 353
FSYCL + + +G L G P A + P+ P + Y V ++G+ VGG
Sbjct: 277 FSYCLPALNS-EAGFLALGVR--PSAATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGK 333
Query: 354 RIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDT 413
+ I FR G+++D+GT VT LP AY A A P + FDT
Sbjct: 334 PLDIPRSAFR------GGMLIDSGTIVTELPETAYNALNAALRKAFAAYPMVASED-FDT 386
Query: 414 CYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS-PS-GLSIIGN 471
CYN +G+ +V VP V+ FSGG + L N ++ D C AF S P GL IIGN
Sbjct: 387 CYNFTGYSNVTVPRVALTFSGGATIDLDVPNGILVKD-----CLAFRESGPDVGLGIIGN 441
Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
+ Q +++ +D +G VGF C
Sbjct: 442 VNQRTLEVLYDAGHGKVGFRAGAC 465
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 223 bits (568), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 151/432 (34%), Positives = 224/432 (51%), Gaps = 42/432 (9%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+ +EL+HRD S ++ H+ R ++ R+ T+V +D A+ +
Sbjct: 27 FTVELIHRDSPKSPMYNSSETHFDRIVNALRRSSHRN-----TVVLE-----SDTAEAPI 76
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
+ G GEY V I VG+PP S V D+GSD++W QC+PCS CY+Q+ P+F
Sbjct: 77 FNNG-----------GEYLVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQNAPMF 125
Query: 199 DPADSASFSGVSCSSAVCDRL-ENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV 256
DP+ S ++ V+CSS VC + + C C Y ++YGD S+++G LA++T+T+ T
Sbjct: 126 DPSKSTTYKNVACSSPVCSYSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQSTS 185
Query: 257 VKNVA-----IGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG 310
+ VA IGCGH N G F +G++GLG G SLV QLG TGG FSYCL+ GTG
Sbjct: 186 GRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLIPIGTG 245
Query: 311 S---SGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
S S L FG A G+ V P+ + + +FY + L + VG + E +L
Sbjct: 246 STNDSTKLNFGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKLG 305
Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF-DTCYNLSGFVSVR 424
G+ +++D+GT +T LP+ +F A ++Q+ +LP A S F D C+ +
Sbjct: 306 --GESNIIIDSGTTLTYLPSALLNSFGSA-ISQSMSLPHAQDPSEFLDYCFATTT-DDYE 361
Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDG 483
+P V+ +F G V L N + + D T C AF P + I GNI Q + +D
Sbjct: 362 MPPVTMHFEGADV-PLQRENLFVRLSD-DTICLAFGSFPDDNIFIYGNIAQSNFLVGYDI 419
Query: 484 ANGFVGFGPNVC 495
N V F P C
Sbjct: 420 KNLAVSFQPAHC 431
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 154/446 (34%), Positives = 229/446 (51%), Gaps = 48/446 (10%)
Query: 75 DEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAA 134
D R ++ L HR + ++ + + SF R++ D R ++R+ SG
Sbjct: 50 DPTRASVPLAHRHGPCAPKGSSAT---DKKKPSFAERLRSDRARADHILRKASG------ 100
Query: 135 KHEVQDFGTDVVSGMDQG---SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--SQ 189
+ + + G + G S EY V +G+G+P Q ++ID+GSD+ WVQC+PC S
Sbjct: 101 RRMMSEGGGASIPTYLGGFVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASD 160
Query: 190 CYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG----------RCRYEVSYGDGS 239
CY Q DP+FDP+ S++F+ + C+S C +L G G +C Y + YG+G+
Sbjct: 161 CYPQKDPLFDPSKSSTFATIPCASDACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGA 220
Query: 240 YTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGG 298
T+G + ETL +G + VVK+ GCG G + GLLGLGG SLV Q GG
Sbjct: 221 ITEGVYSTETLALGSSAVVKSFRFGCGSDQHGPYDKFDGLLGLGGAPESLVSQTASVYGG 280
Query: 299 AFSYCLVSRGTGSSGSLVFGREALP----VGAAWVPL-VRNPRAPSFYYVGLSGLGVGGM 353
AFSYCL +G +G L G G + P+ +P+ +FY V L+G+ VGG
Sbjct: 281 AFSYCLPPLNSG-AGFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGK 339
Query: 354 RIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF---VAQTGNLPRASGVSI 410
+ I +F G ++D+GT +T +PT AY+A R AF +A+ LP A S
Sbjct: 340 ALDIPPAVFA------KGNIVDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPAD--SA 391
Query: 411 FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSII 469
DTCYN +G +V VP V+ F GG + L + ++ V+D C AFA + G II
Sbjct: 392 LDTCYNFTGHGTVTVPKVALTFVGGATVDLDVPSGVL-VED----CLAFADAGDGSFGII 446
Query: 470 GNIQQEGIQISFDGANGFVGFGPNVC 495
GN+ I++ +D G +GF C
Sbjct: 447 GNVNTRTIEVLYDSGKGHLGFRAGAC 472
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 136/372 (36%), Positives = 194/372 (52%), Gaps = 24/372 (6%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASF 206
SG+ GSGEYF+ + VG+PP+ +++D+GSD+ W+QC PC C++QS P +DP DS+SF
Sbjct: 188 SGVSLGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSF 247
Query: 207 SGVSCSSAVC------DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV---- 256
+SC C D + C Y YGDGS T G ALET T+ T
Sbjct: 248 RNISCHDPRCQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGT 307
Query: 257 -----VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
V+NV GCGH N+G+F GAAGLLGLG G +S Q+ G +FSYCLV R + +
Sbjct: 308 SELKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNA 367
Query: 312 SGS--LVFGREALPVGAAWVPLV-----RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL 364
S S L+FG + + + ++ +FYYV + + V + I E+ + L
Sbjct: 368 SVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEETWHL 427
Query: 365 TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR 424
+ G G ++D+GT +T PAYE ++AFV + G+ CYN+SG +
Sbjct: 428 SSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLPPLKPCYNVSGIEKME 487
Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDG 483
+P F+ V P N+ I + D C A +P S LSIIGN QQ+ I +D
Sbjct: 488 LPDFGILFADEAVWNFPVENYFIWI-DPEVVCLAILGNPRSALSIIGNYQQQNFHILYDM 546
Query: 484 ANGFVGFGPNVC 495
+G+ P C
Sbjct: 547 KKSRLGYAPMKC 558
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 146/433 (33%), Positives = 223/433 (51%), Gaps = 45/433 (10%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+++EL+HRD S +++QH F +R + R A H
Sbjct: 28 FSVELIHRDSPKSPYYKPTE---NKYQH-FVDAARRSINR---------------ANHFF 68
Query: 139 QDFGTDVV-SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
+D T S + G Y + VG+PP Y + D+GSDIVW+QC+PC QCY Q+ P+
Sbjct: 69 KDSDTSTPESTVIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPI 128
Query: 198 FDPADSASFSGVSCSSAVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV 256
F+P+ S+S+ + C S +C + + C C+Y++SYGD S+++G L+++TL++ T
Sbjct: 129 FNPSKSSSYKNIPCLSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTS 188
Query: 257 VKNVA-----IGCGHKNQGMFVGA-AGLLGLGGGSMSLVGQLGGQTGGAFSYCLV---SR 307
V+ IGCG N G F GA +G++GLGGG +SL+ QLG GG FSYCLV ++
Sbjct: 189 GSPVSFPKTVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNK 248
Query: 308 GTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
+ +S L FG A+ G V PL++ + P FY++ L VG R+ +
Sbjct: 249 ESNASSILSFGDAAVVSGDGVVSTPLIK--KDPVFYFLTLQAFSVGNKRVEFGGS----S 302
Query: 366 QMGDD--GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLSGFVS 422
+ GDD +++D+GT +T +P+ Y A V L R + F CY+L
Sbjct: 303 EGGDDEGNIIIDSGTTLTLIPSDVYTNLESA-VVDLVKLDRVDDPNQQFSLCYSLKS-NE 360
Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
P ++ +F G + S F +P+ D G CFAF PSP SI GN+ Q+ + + +D
Sbjct: 361 YDFPIITAHFKGADIELHSISTF-VPITD-GIVCFAFQPSPQLGSIFGNLAQQNLLVGYD 418
Query: 483 GANGFVGFGPNVC 495
V F P C
Sbjct: 419 LQQKTVSFKPTDC 431
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 222 bits (565), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 149/432 (34%), Positives = 221/432 (51%), Gaps = 40/432 (9%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+ +L+HRD S R +++ H R V RV D ++ +
Sbjct: 31 FTADLIHRDSPKSPFYNPTETSSQRLRNAIH----RSVSRVFHFT--------DISQKDA 78
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
D + + SGEY + I +G+PP + D+GSD++W QC+PC CY Q DP+F
Sbjct: 79 SDNAPQI--DLTSNSGEYLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVDPLF 136
Query: 199 DPADSASFSGVSCSSAVCDRLEN-AGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRT 255
DP S+++ VSCSS+ C LEN A C C Y SYGD SYTKG +A++TLT+G T
Sbjct: 137 DPKASSTYKDVSCSSSQCTALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGST 196
Query: 256 -----VVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV--SR 307
+KN+ IGCGH N G F +G++GLGGG++SL+ QLG G FSYCLV +
Sbjct: 197 DTRPVQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTS 256
Query: 308 GTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRI--PISEDLFR 363
+ + FG A+ G V PL+ + +FYY+ L + VG + P S+
Sbjct: 257 ENDRTSKINFGTNAVVSGTGVVSTPLIAKSQE-TFYYLTLKSISVGSKEVQYPGSD---- 311
Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
+ G+ +++D+GT +T LPT Y DA + + + CY+ +G +
Sbjct: 312 -SGSGEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQTGLSLCYSATG--DL 368
Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDG 483
+VP ++ +F G V P++ F+ +D CFAF SPS SI GN+ Q + +D
Sbjct: 369 KVPAITMHFDGADVNLKPSNCFVQISEDL--VCFAFRGSPS-FSIYGNVAQMNFLVGYDT 425
Query: 484 ANGFVGFGPNVC 495
+ V F P C
Sbjct: 426 VSKTVSFKPTDC 437
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 221 bits (563), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 147/430 (34%), Positives = 218/430 (50%), Gaps = 39/430 (9%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+ +L+HRD S R +++ H R V RV + D
Sbjct: 31 FTADLIHRDSPKSPFYNPMETSSQRLRNAIH----RSVNRVFHFTEK------DNTPQPQ 80
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
D ++ SGEY + + +G+PP + D+GSD++W QC PC CY Q DP+F
Sbjct: 81 IDLTSN--------SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLF 132
Query: 199 DPADSASFSGVSCSSAVCDRLEN-AGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRT 255
DP S+++ VSCSS+ C LEN A C + C Y +SYGD SYTKG +A++TLT+G +
Sbjct: 133 DPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSS 192
Query: 256 -----VVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV--SR 307
+KN+ IGCGH N G F +G++GLGGG +SL+ QLG G FSYCLV +
Sbjct: 193 DTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTS 252
Query: 308 GTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
+ + FG A+ G+ V PL+ +FYY+ L + VG +I + +
Sbjct: 253 KKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQI---QYSGSDS 309
Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV 425
+ + +++D+GT +T LPT Y DA + + S CY+ +G ++V
Sbjct: 310 ESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATG--DLKV 367
Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
P ++ +F G V L +SN + V + CFAF SPS SI GN+ Q + +D +
Sbjct: 368 PVITMHFDGADV-KLDSSNAFVQVSE-DLVCFAFRGSPS-FSIYGNVAQMNFLVGYDTVS 424
Query: 486 GFVGFGPNVC 495
V F P C
Sbjct: 425 KTVSFKPTDC 434
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 221 bits (563), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 147/430 (34%), Positives = 218/430 (50%), Gaps = 39/430 (9%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+ +L+HRD S R +++ H R V RV + D
Sbjct: 31 FTADLIHRDSPKSPFYNPMETSSQRLRNAIH----RSVNRVFHFTEK------DNTPQPQ 80
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
D ++ SGEY + + +G+PP + D+GSD++W QC PC CY Q DP+F
Sbjct: 81 IDLTSN--------SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLF 132
Query: 199 DPADSASFSGVSCSSAVCDRLEN-AGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRT 255
DP S+++ VSCSS+ C LEN A C + C Y +SYGD SYTKG +A++TLT+G +
Sbjct: 133 DPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSS 192
Query: 256 -----VVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV--SR 307
+KN+ IGCGH N G F +G++GLGGG +SL+ QLG G FSYCLV +
Sbjct: 193 DTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTS 252
Query: 308 GTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
+ + FG A+ G+ V PL+ +FYY+ L + VG +I + +
Sbjct: 253 KKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQI---QYSGSDS 309
Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV 425
+ + +++D+GT +T LPT Y DA + + S CY+ +G ++V
Sbjct: 310 ESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATG--DLKV 367
Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
P ++ +F G V L +SN + V + CFAF SPS SI GN+ Q + +D +
Sbjct: 368 PVITMHFDGADV-KLDSSNAFVQVSE-DLVCFAFRGSPS-FSIYGNVAQMNFLVGYDTVS 424
Query: 486 GFVGFGPNVC 495
V F P C
Sbjct: 425 KTVSFKPTDC 434
>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 221 bits (562), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 156/438 (35%), Positives = 233/438 (53%), Gaps = 31/438 (7%)
Query: 73 SSDEARWNLELVHR----DKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSG 128
S+++ + +L++VH+ K+S + H + +D RV ++ RLS
Sbjct: 68 SNNDNKASLKVVHKHGPCSKLSQDEASAAPTHTEI--------LLQDQSRVKSIHSRLSN 119
Query: 129 GGADAAKH-EVQDFGT-DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP 186
K +V D T G GSG Y V +G+G+P + ++ D+GSDI W QCQP
Sbjct: 120 SKTSGGKDVKVTDSTTIPAKDGSTVGSGNYIVTVGLGTPKKDLSLIFDTGSDITWTQCQP 179
Query: 187 CSQ-CYKQSDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAGRCRYEVSYGDGSY 240
C++ CYKQ + +FDP+ S S++ +SCSS++C+ L +A GC + C Y + YGD S+
Sbjct: 180 CARSCYKQKEQIFDPSQSTSYTNISCSSSICNSLTSATGNTPGCASSACVYGIQYGDSSF 239
Query: 241 TKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA 299
+ G E LT+ T N+ GCG NQG+F G+AGLLGLG +S+V Q +
Sbjct: 240 SVGFFGTEKLTLTSTDAFNNIYFGCGQNNQGLFGGSAGLLGLGRDKLSVVSQTAQKYNKI 299
Query: 300 FSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISE 359
FSYCL + S+G L FG A A + PL PSFY + +G+ VGG ++ IS
Sbjct: 300 FSYCL-PSSSSSTGFLTFGGSA-SKNAKFTPLSTISAGPSFYGLDFTGISVGGKKLAISA 357
Query: 360 DLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSG 419
+F G ++D+GT +TRLP AY A R +F P +SI DTCY+ S
Sbjct: 358 SVFSTA-----GAIIDSGTVITRLPPAAYSALRASFRNLMSKYPMTKALSILDTCYDFSS 412
Query: 420 FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGI 477
+ ++ VP + F FS G + + A+ L C AFA + + I GN+QQ+ +
Sbjct: 413 YTTISVPKIGFSFSSGIEVDIDATGILY-ASSLSQVCLAFAGNSDATDVFIFGNVQQKTL 471
Query: 478 QISFDGANGFVGFGPNVC 495
++ +DG+ G VGF P C
Sbjct: 472 EVFYDGSAGKVGFAPGGC 489
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 135/390 (34%), Positives = 199/390 (51%), Gaps = 27/390 (6%)
Query: 133 AAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYK 192
+A H Q + VVSG GSG+YFV + +G+PP+ +V D+GSD+VWV+C C C +
Sbjct: 66 SALHTPQSLKSPVVSGASTGSGQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTR 125
Query: 193 QSD-PVFDPADSASFSGVSCSSAVCDRL---ENAGCHAGR----CRYEVSYGDGSYTKGT 244
+ F S +FS C + C + ++ C+ R CRYE SYGDGS T G
Sbjct: 126 HTPGSAFLARHSTTFSPNHCYDSACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKTSGF 185
Query: 245 LALETLTI----GRTV-VKNVAIGCGHKNQG------MFVGAAGLLGLGGGSMSLVGQLG 293
+ ET T+ GR +K +A GC + G F GA G++GLG G +SL QLG
Sbjct: 186 FSKETTTLNTSSGREAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLG 245
Query: 294 GQTGGAFSYCLVSRGTGSSGS--LVFGREALPVGAA-----WVPLVRNPRAPSFYYVGLS 346
+ G FSYCL+ S + L+ G V + PL NP +P+FYY+G+
Sbjct: 246 HRFGNKFSYCLMDHDISPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIE 305
Query: 347 GLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
+ V G+++PI+ ++ L ++G+ G ++D+GT +T LP PAY + A
Sbjct: 306 SVSVDGIKLPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAE 365
Query: 407 GVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD-DAGTFCFAFAPSPSG 465
FD C N+S R+P +SF G V + P N+ + D D +PSG
Sbjct: 366 PTPGFDLCVNVSEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDEDVKCLALQAVMTPSG 425
Query: 466 LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
S+IGN+ Q+G + FD +GF + C
Sbjct: 426 FSVIGNLMQQGFLLEFDKDRTRLGFSRHGC 455
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 133/427 (31%), Positives = 212/427 (49%), Gaps = 31/427 (7%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+ ++L+HRD S ++ + + R+ ++R + V A + +
Sbjct: 32 FTVDLIHRDSPLSP--------FYNSEETDLQRINNALRRSISRVHHFDPIAAASVSPKA 83
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
+ +DV S GEY + + +G+PP + D+GSD++W QC+PC +CYKQ DP+F
Sbjct: 84 AE--SDVTSNR----GEYLMSLSLGTPPFKIMGIADTGSDLIWTQCKPCERCYKQVDPLF 137
Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVK 258
DP S ++ SC + C L+ + C C+Y+ SYGD SYT G +A +T+T+ T
Sbjct: 138 DPKSSKTYRDFSCDARQCSLLDQSTCSGNICQYQYSYGDRSYTMGNVASDTITLDSTTGS 197
Query: 259 NVA-----IGCGHKNQGMFVGA-AGLLGLGGGSMSLVGQLGGQTGGAFSYCLV--SRGTG 310
V+ IGCGH+N G F +G++GLG G +SL+ Q+G GG FSYCLV S G
Sbjct: 198 PVSFPKTVIGCGHENDGTFSDKGSGIVGLGAGPLSLISQMGSSVGGKFSYCLVPLSSRAG 257
Query: 311 SSGSLVFGREALPVGAAW--VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
+S L FG A+ G PL+ + SFY++ L + VG RI + G
Sbjct: 258 NSSKLNFGSNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSL---GTG 314
Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
+ +++D+GT +T +P + A Q CY+ + ++VP +
Sbjct: 315 EGNIIIDSGTTLTIVPDDFFSNLSTAVGNQVEGRRAEDPSGFLSVCYSATS--DLKVPAI 372
Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
+ +F+G V P + F+ DD C AFA + SG+SI GN+ Q + ++ +
Sbjct: 373 TAHFTGADVKLKPINTFVQVSDDV--VCLAFASTTSGISIYGNVAQMNFLVEYNIQGKSL 430
Query: 489 GFGPNVC 495
F P C
Sbjct: 431 SFKPTDC 437
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 124/355 (34%), Positives = 192/355 (54%), Gaps = 22/355 (6%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCD 217
+ + +G+P ++D+GSD++W QC+PC++C+ Q P+FDP S+S+S V CSS +C+
Sbjct: 1 MELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCN 60
Query: 218 RLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGM-FV 273
L + C+ + C Y +YGD S T+G LA ET T + + GCG +N+G F
Sbjct: 61 ALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENEGDGFS 120
Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPV----GAAW 328
+GL+GLG G +SL+ QL FSYCL S + +S SL G A + GA+
Sbjct: 121 QGSGLVGLGRGPLSLISQLKET---KFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASL 177
Query: 329 -------VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
+ L+RNP PSFYY+ L G+ VG R+ + + F L + G G+++D+GT +T
Sbjct: 178 DGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTIT 237
Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTL 440
L A++ ++ F ++ SG + D C+ L ++ VP + F+F G L L
Sbjct: 238 YLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFKGAD-LEL 296
Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
P N+++ G C A S +G+SI GN+QQ+ + D V F P C
Sbjct: 297 PGENYMVADSSTGVLCLAMG-SSNGMSIFGNVQQQNFNVLHDLEKETVSFVPTEC 350
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 136/412 (33%), Positives = 201/412 (48%), Gaps = 26/412 (6%)
Query: 102 HRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIG 161
H + + ++Q + +A R++ + A V D T + SGEY V +
Sbjct: 35 HVDAGTSYTKLQLLSRAIARSKARVAALQSAAVLPPVVDPITAARVLVTASSGEYLVDLA 94
Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
+G+PP ++D+GSD++W QC PC C Q P FD SA++ + C S+ C L +
Sbjct: 95 IGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSS 154
Query: 222 AGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-----RTVVKNVAIGCGHKNQGMFVGAA 276
C C Y+ YGD + T G LA ET T G + N+A GCG N G ++
Sbjct: 155 PSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSS 214
Query: 277 GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA----------LPVGA 326
G++G G G +SLV QLG FSYCL S + + L FG A PV +
Sbjct: 215 GMVGFGRGPLSLVSQLGPSR---FSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQS 271
Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
P V NP P+ Y++ L + +G +PI +F + G GV++D+GT++T L
Sbjct: 272 --TPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQD 329
Query: 387 AYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGF--VSVRVPTVSFYFSGGPVLTLPAS 443
AYEA R V+ LP + I DTC+ V+V VP + F+F + LP
Sbjct: 330 AYEAVRRGLVSAIP-LPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLP-E 387
Query: 444 NFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
N+++ G C AP+ G +IIGN QQ+ + + +D N F+ F P C
Sbjct: 388 NYMLIASTTGYLCLVMAPTGVG-TIIGNYQQQNLHLLYDIGNSFLSFVPAPC 438
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 149/430 (34%), Positives = 220/430 (51%), Gaps = 37/430 (8%)
Query: 73 SSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGAD 132
SS + ++ L HR S ++ + + ++RD R + R+ SG
Sbjct: 27 SSSDGTSSVTLSHRYGPCSPADPNSGEKRPTDEE----LLRRDQLRADYIRRKFSGSNGT 82
Query: 133 AAKHEVQDFGTDVVS--GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--- 187
AA + Q V + G + EY + +G+GSP +Q +VID+GSD+ WVQC+PC
Sbjct: 83 AAGEDGQSSKVSVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAP 142
Query: 188 SQCYKQSDPVFDPADSASFSGVSCSSAVCDRL----ENAGCHA-GRCRYEVSYGDGSYTK 242
S C+ + +FDPA S++++ +CS+A C +L E GC A RC+Y V YGDGS T
Sbjct: 143 SPCHAHAGALFDPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTT 202
Query: 243 GTLALETLTI-GRTVVKNVAIGCGHKN--QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA 299
GT + + LT+ G VV+ GC H GM GL+GLGG + S V Q + G +
Sbjct: 203 GTYSSDVLTLSGSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYGKS 262
Query: 300 FSYCLVSRGTGSSGSLVFGREALPVGA-----AWVPLVRNPRAPSFYYVGLSGLGVGGMR 354
F YCL + SSG L G A G A P++R+ + P++Y+ L + VGG +
Sbjct: 263 FFYCLPAT-PASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKK 321
Query: 355 IPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTC 414
+ +S +F G ++D+GT +TRLP AY A AF A RA + I DTC
Sbjct: 322 LGLSPSVFAA------GSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTC 375
Query: 415 YNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNI 472
+N +G V +PTV+ F+GG V+ L A + C AFAP+ IGN+
Sbjct: 376 FNFTGLDKVSIPTVALVFAGGAVVDLDAHGIV------SGGCLAFAPTRDDKAFGTIGNV 429
Query: 473 QQEGIQISFD 482
QQ ++ +D
Sbjct: 430 QQRTFEVLYD 439
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 138/353 (39%), Positives = 188/353 (53%), Gaps = 19/353 (5%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSCSS 213
E+ V +G GSP ++ + ID+GSD+ W+QC PCS CYKQ DPVFDP SA++S V C
Sbjct: 160 EFVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGHCYKQHDPVFDPTKSATYSAVPCGH 219
Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV-VKNVAIGCGHKNQGMF 272
C ++G C Y+V+YGDGS T G L+ ETL++ T + A GCG N G F
Sbjct: 220 PQCAAAGGKCSNSGTCLYKVTYGDGSSTAGVLSHETLSLSSTRDLPGFAFGCGQTNLGEF 279
Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA------ 326
G GL+GLG G++SL Q G FSYCL S T + G L G P +
Sbjct: 280 GGVDGLVGLGRGALSLPSQAAATFGATFSYCLPSYDT-THGYLTMG-STTPAASNDDDDV 337
Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
+ +++ PS Y+V + + +GG +P+ +F DG + D+GT +T LP
Sbjct: 338 QYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFT-----RDGTLFDSGTILTYLPPE 392
Query: 387 AYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
AY + RD F A FDTCY+ +G ++ +P V+F FS G V L L
Sbjct: 393 AYASLRDRFKFTMTQYKPAPAYDPFDTCYDFTGHNAIFMPAVAFKFSDGAVFDLSPVAIL 452
Query: 447 IPVDDA--GTFCFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
I DD T C AF P PS + +IIGN QQ G ++ +D A +GFG C
Sbjct: 453 IYPDDTAPATGCLAFVPRPSTMPFNIIGNTQQRGTEVIYDVAAEKIGFGQFTC 505
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 141/401 (35%), Positives = 209/401 (52%), Gaps = 28/401 (6%)
Query: 108 FHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPR 167
F A + D R+A L RL A K V + SG G G Y R+G+G+P
Sbjct: 64 FSAFITHDAARIAGLASRL----ATKDKDWVAASSVPLASGASVGVGNYITRLGLGTPTT 119
Query: 168 SQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH- 225
+ MV+DSGS + W+QC PC+ C+ Q+ P++DP S++++ V CS+ C L+ A +
Sbjct: 120 TYVMVVDSGSSLTWLQCAPCAVSCHPQAGPLYDPRASSTYAAVPCSAPQCAELQAATLNP 179
Query: 226 -----AGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLL 279
+G C+Y+ SYGDGS++ G L+ +T+++ + GCG N G+F AAGL+
Sbjct: 180 SSCSGSGVCQYQASYGDGSFSFGYLSKDTVSLSSSGSFPGFYYGCGQDNVGLFGRAAGLI 239
Query: 280 GLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA---LPVGAAWVPLVRNPR 336
GL +SL+ QL G +F+YCL + S+G L FG + P ++ +V +
Sbjct: 240 GLARNKLSLLSQLAPSVGNSFAYCLPTSAAASAGYLSFGSNSDNKNPGKYSYTSMVSSSL 299
Query: 337 APSFYYVGLSGLGVGG--MRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
S Y+V L+G+ V G + +P SE G ++D+GT +TRLPTP Y A A
Sbjct: 300 DASLYFVSLAGMSVAGSPLAVPSSE-------YGSLPTIIDSGTVITRLPTPVYTALSKA 352
Query: 395 FVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
V P A SI TC+ + VP V+ F+GG L L N L+ V++ T
Sbjct: 353 -VGAALAAPSAPAYSILQTCFK-GQVAKLPVPAVNMAFAGGATLRLTPGNVLVDVNETTT 410
Query: 455 FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C AFAP+ S +IIGN QQ+ + +D +GF C
Sbjct: 411 -CLAFAPTDS-TAIIGNTQQQTFSVVYDVKGSRIGFAAGGC 449
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 219 bits (558), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 128/357 (35%), Positives = 186/357 (52%), Gaps = 20/357 (5%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
GEY + +G+G+P R ++D+GSD++W QC PC C Q P FDPA+S+++ + CS+
Sbjct: 90 GEYLMEMGIGTPARFYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPANSSTYRSLGCSA 149
Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG----RTVVKNVAIGCGHKNQ 269
C+ L C+ C Y+ YGD + T G LA ET T G R + ++ GCG+ N
Sbjct: 150 PACNALYYPLCYQKTCVYQYFYGDSASTAGVLANETFTFGTNDTRVTLPRISFGCGNLNA 209
Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL--PVGAA 327
G +G++G G GS+SLV QLG FSYCL S + L FG A A+
Sbjct: 210 GSLANGSGMVGFGRGSLSLVSQLGSP---RFSYCLTSFLSPVRSRLYFGAYATLNSTNAS 266
Query: 328 WV---PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM-GDDGVVMDTGTAVTRL 383
V P + NP P+ Y++ ++G+ VGG R+PI + + G G ++D+GT +T L
Sbjct: 267 TVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTIIDSGTTITYL 326
Query: 384 PTPAYEAFRDAFVA---QTGNLPRASGVSIFDTCYNLSGFV--SVRVPTVSFYFSGGPVL 438
PAY A R+AFV T L + S+ DTC+ SV +P + +F G
Sbjct: 327 AEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVTLPQLVLHFDGAD-W 385
Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
LP N+++ G C A A S G SIIG+ Q + + +D N + F P C
Sbjct: 386 ELPLQNYMLVDPSTGGLCLAMATSSDG-SIIGSYQHQNFNVLYDLENSLLSFVPAPC 441
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 219 bits (557), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 144/435 (33%), Positives = 215/435 (49%), Gaps = 40/435 (9%)
Query: 76 EARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAK 135
A + ELVHRD S + H R ++ M+R V RV R A +
Sbjct: 28 NAGFTTELVHRDSPKSPLYNSQQTHLQR----WNKAMRRSVSRVHHFQRT----AATVSP 79
Query: 136 HEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD 195
EV+ S + GEY + + +G+PP + D+GSD++W QC PC +CYKQ
Sbjct: 80 KEVE-------SEIIANGGEYLMSLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIA 132
Query: 196 PVFDPADSASFSGVSCSSAVCDRL-ENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTI- 252
P+FDP S ++ +SC + C L E++ C + + C+Y YGD S+T G LA++T+T+
Sbjct: 133 PLFDPKSSKTYRDLSCDTRQCQNLGESSSCSSEQLCQYSYYYGDRSFTNGNLAVDTVTLP 192
Query: 253 ----GRTVVKNVAIGCGHKNQGMFVGA-AGLLGLGGGSMSLVGQLGGQTGGAFSYCLV-- 305
G IGCG +N G F +G++GLGGG MSL+ Q+G GG FSYCLV
Sbjct: 193 STNGGPVYFPKTVIGCGRRNNGTFDKKDSGIIGLGGGPMSLISQMGSSVGGKFSYCLVPF 252
Query: 306 -SRGTGSSGSLVFGREALPVGAAW--VPLV-RNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
S G+S L FGR A+ G+ PL+ +NP +FYY+ L + VG +I
Sbjct: 253 SSESAGNSSKLHFGRNAVVSGSGVQSTPLISKNPD--TFYYLTLEAMSVGDKKIEFGG-- 308
Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLSGF 420
+ +++D+GT++T P + F A N R S + CY +
Sbjct: 309 -SSFGGSEGNIIIDSGTSLTLFPVNFFTEFATAVENAVINGERTQDASGLLSHCYRPTP- 366
Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQIS 480
++VP ++ +F+G V+ + F++ DD C AF + SG +I GN+ Q I
Sbjct: 367 -DLKVPVITAHFNGADVVLQTLNTFILISDDV--LCLAFNSTQSG-AIFGNVAQMNFLIG 422
Query: 481 FDGANGFVGFGPNVC 495
+D V F P C
Sbjct: 423 YDIQGKSVSFKPTDC 437
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 219 bits (557), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 144/407 (35%), Positives = 212/407 (52%), Gaps = 33/407 (8%)
Query: 108 FHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTD-----------VVSGMDQGSGEY 156
F A + D R+++L RL+ +A+ D D + G G G Y
Sbjct: 65 FTAVLTHDDARISSLAARLAK--TPSARATSLDADADAGLAGSLASVPLSPGASVGVGNY 122
Query: 157 FVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAV 215
R+G+G+P MV+D+GS + W+QC PC C++QS PVF+P S++++ V CS+
Sbjct: 123 VTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQ 182
Query: 216 CDRLENA-----GCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQ 269
C L +A C + C Y+ SYGD S++ G L+ +T++ G T + N GCG N+
Sbjct: 183 CSDLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGSTSLPNFYYGCGQDNE 242
Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV 329
G+F +AGL+GL +SL+ QL G +F+YCL S + SL P ++
Sbjct: 243 GLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSSSGYLSLGSYN---PGQYSYT 299
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
P+V + S Y++ LSG+ V G P+S + + ++D+GT +TRLPT Y
Sbjct: 300 PMVSSSLDDSLYFIKLSGMTVAGN--PLSVSSSAYSSL---PTIIDSGTVITRLPTSVYS 354
Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS-VRVPTVSFYFSGGPVLTLPASNFLIP 448
A A A RAS SI DTC+ G S V P V+ F+GG L L A N L+
Sbjct: 355 ALSKAVAAAMKGTSRASAYSILDTCFK--GQASRVSAPAVTMSFAGGAALKLSAQNLLVD 412
Query: 449 VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
VDD+ T C AFAP+ S +IIGN QQ+ + +D + +GF C
Sbjct: 413 VDDSTT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKSSRIGFAAGGC 457
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 218 bits (556), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 134/353 (37%), Positives = 197/353 (55%), Gaps = 25/353 (7%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--SQCYKQSDPVFDPADSASFSGVSCS 212
EY V +G G+P Q +++D+GSD+ WVQC PC ++CY Q DP+FDP+ S++++ ++C+
Sbjct: 130 EYVVTLGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTKCYPQKDPLFDPSKSSTYAPIACN 189
Query: 213 SAVCDRLEN---AGCHAG--RCRYEVSYGDGSYTKGTLALETLTIGRTV-VKNVAIGCGH 266
+ C +L + GC +G +C Y V Y DGS+++G + ETLT+ + V++ GCG
Sbjct: 190 TDACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGVYSNETLTLAPGITVEDFHFGCGR 249
Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA 326
+G GLLGLGG +SLV Q GGAFSYCL + + +G LV G +
Sbjct: 250 DQRGPSDKYDGLLGLGGAPVSLVVQTSSVYGGAFSYCLPALNS-EAGFLVLGSPPSGNKS 308
Query: 327 AWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
A+V P+ P +FY V ++G+ VGG + I + FR G+++D+GT T LP
Sbjct: 309 AFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQSAFR------GGMIIDSGTVDTELP 362
Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
AY A A P FDTCYN +G+ ++ VP V+F FSGG + L N
Sbjct: 363 ETAYNALEAALRKALKAYPLVPS-DDFDTCYNFTGYSNITVPRVAFTFSGGATIDLDVPN 421
Query: 445 FLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
++ V+D C AF S GL IIGN+ Q +++ +D G VGF C
Sbjct: 422 GIL-VND----CLAFQESGPDDGLGIIGNVNQRTLEVLYDAGRGNVGFRAGAC 469
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 218 bits (556), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 138/391 (35%), Positives = 202/391 (51%), Gaps = 27/391 (6%)
Query: 117 KRVATLVRR-LSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDS 175
+R T +R+ + GA + + G G G Y +G+G+P S MV+D+
Sbjct: 94 RRPTTSLRKPKAAAGASGGPLDDSLASVPLTPGTSVGVGNYVTELGLGTPATSYAMVVDT 153
Query: 176 GSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR---- 230
GS + W+QC PC C++Q P++DP S++++ V CS++ CD L+ A + C
Sbjct: 154 GSSLTWLQCSPCVVSCHRQVGPLYDPRASSTYATVPCSASQCDELQAATLNPSACSVRNV 213
Query: 231 --YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSL 288
Y+ SYGD S++ G L+ +T++ G N GCG N+G+F +AGL+GL +SL
Sbjct: 214 CIYQASYGDSSFSVGYLSRDTVSFGSGSYPNFYYGCGQDNEGLFGRSAGLIGLARNKLSL 273
Query: 289 VGQLGGQTGGAFSYCL---VSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGL 345
+ QL G +FSYCL S G S G G ++ P+ + S Y+V L
Sbjct: 274 LYQLAPSLGYSFSYCLPTPASTGYLSIGPYTSGHY------SYTPMASSSLDASLYFVTL 327
Query: 346 SGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA 405
SG+ VGG + +S + ++D+GT +TRLPT Y A A A + A
Sbjct: 328 SGMSVGGSPLAVSP-----AEYSSLPTIIDSGTVITRLPTAVYTALSKAVAAAMVGVQSA 382
Query: 406 SGVSIFDTCYNLSGFVS-VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS 464
SI DTC+ G S +RVP V+ F+GG L L N LI VDD+ T C AFAP+ S
Sbjct: 383 PAFSILDTCFQ--GQASQLRVPAVAMAFAGGATLKLATQNVLIDVDDSTT-CLAFAPTDS 439
Query: 465 GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+IIGN QQ+ + +D A +GF C
Sbjct: 440 -TTIIGNTQQQTFSVVYDVAQSRIGFAAGGC 469
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 218 bits (555), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 164/433 (37%), Positives = 228/433 (52%), Gaps = 40/433 (9%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
L L HR + S ++ S ++ D +R ++RR+SG +
Sbjct: 68 LRLTHRHGPCAPSRASS-----LAAPSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAA 122
Query: 141 FGTDVVS--GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS---QCYKQSD 195
V + G D G+ Y V +G+P +Q M +D+GSD+ WVQC+PCS CY Q D
Sbjct: 123 AAATVPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKD 182
Query: 196 PVFDPADSASFSGVSCSSAVCDRL---ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI 252
P+FDPA S+S++ V C VC L + C A +C Y VSYGDGS T G + +TLT+
Sbjct: 183 PLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL 242
Query: 253 -GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
+ V+ GCGH G+F G GLLGLG SLV Q G GG FSYCL ++ + +
Sbjct: 243 SASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPS-T 301
Query: 312 SGSLVFGREALPVGAA----WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
+G L G P GAA L+ +P AP++Y V L+G+ VGG ++ + F +
Sbjct: 302 AGYLTLGVGG-PSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV 360
Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN--LPRASGVSIFDTCYNLSGFVSVRV 425
+DTGT VTRLP AY A R AF + + P A I DTCYN +G+ +V +
Sbjct: 361 ------VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTL 414
Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTF-CFAFAPSPS--GLSIIGNIQQEGIQISFD 482
P V+ F G +TL A L +F C AFAPS S G++I+GN+QQ ++ D
Sbjct: 415 PNVALTFGSGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID 467
Query: 483 GANGFVGFGPNVC 495
G + VGF P+ C
Sbjct: 468 GTS--VGFKPSSC 478
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 218 bits (555), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 148/429 (34%), Positives = 225/429 (52%), Gaps = 38/429 (8%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+ ++L+HRD S ++ + RM+ ++R A + S DA+ +
Sbjct: 26 FTIDLIHRDSPKSP--------FYNSAETSSQRMRNAIRRSARSTLQFSND--DASPNSP 75
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
Q F T GEY + I +G+PP + D+GSD++W QC PC CY+Q+ P+F
Sbjct: 76 QSFIT-------SNRGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLF 128
Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIG--- 253
DP +S+++ VSCSS+ C LE+A C C Y ++YGD SYTKG +A++T+T+G
Sbjct: 129 DPKESSTYRKVSCSSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSG 188
Query: 254 -RTV-VKNVAIGCGHKNQGMFVGA-AGLLGLGGGSMSLVGQLGGQTGGAFSYCLV--SRG 308
R V ++N+ IGCGH+N G F A +G++GLGGGS SLV QL G FSYCLV +
Sbjct: 189 RRPVSLRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSE 248
Query: 309 TGSSGSLVFGREALPVGAAWVPLVRNPRAP-SFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
TG + + FG + G V + P ++Y++ L + VG +I + +F
Sbjct: 249 TGLTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIF---GT 305
Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLSGFVSVRVP 426
G+ +V+D+GT +T LP+ Y ++ VA T R I CY S S +VP
Sbjct: 306 GEGNIVIDSGTTLTLLPSNFYYEL-ESVVASTIKAERVQDPDGILSLCYRDSS--SFKVP 362
Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
++ +F GG V + F+ +D CFAFA + L+I GN+ Q + +D +G
Sbjct: 363 DITVHFKGGDVKLGNLNTFVAVSEDVS--CFAFAANEQ-LTIFGNLAQMNFLVGYDTVSG 419
Query: 487 FVGFGPNVC 495
V F C
Sbjct: 420 TVSFKKTDC 428
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 218 bits (554), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 141/380 (37%), Positives = 201/380 (52%), Gaps = 43/380 (11%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
V SG + Y +G+G + +++D+ S++ WVQC PC C+ Q P+FDP+ S
Sbjct: 132 VSSGARLRTLNYVATVGLGGGEAT--VIVDTASELTWVQCAPCESCHDQQGPLFDPSSSP 189
Query: 205 SFSGVSCSSAVCDRLEN-----AG-----CHAGR---CRYEVSYGDGSYTKGTLALETLT 251
S++ V C S CD L+ AG C AGR C Y +SY DGSY++G LA + L+
Sbjct: 190 SYAAVPCDSPSCDALQQQLATGAGAGAPPCDAGRPAACSYALSYRDGSYSRGVLAHDRLS 249
Query: 252 IGRTVVKNVAIGCGHKNQG-MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL-VSRGT 309
+ V+ GCG NQG F G +GL+GLG +SLV Q Q GG FSYCL +SR +
Sbjct: 250 LAGEVIDGFVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTVDQFGGVFSYCLPLSRES 309
Query: 310 GSSGSLVFG------REALPVGAAWV-----PLVRNPRAPSFYYVGLSGLGVGGMRIPIS 358
+SGSLV G R + PV + PL++ P FY V L+G+ VGG + +
Sbjct: 310 DASGSLVLGDDPSAYRNSTPVVYTSMVSNSDPLLQGP----FYLVNLTGITVGGQEVEST 365
Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLS 418
R ++D+GT +T L Y A R F++Q P+A G SI DTC+N++
Sbjct: 366 GFSAR--------AIVDSGTVITSLVPSVYNAVRAEFMSQLAEYPQAPGFSILDTCFNMT 417
Query: 419 GFVSVRVPTVSFYFSGGPVLTLPASNFLIPV-DDAGTFCFAFA--PSPSGLSIIGNIQQE 475
G V+VP+++ F GG + + + L V D+ C A A S SIIGN QQ+
Sbjct: 418 GLKEVQVPSLTLVFDGGAEVEVDSGGVLYFVSSDSSQVCLAVASLKSEDETSIIGNYQQK 477
Query: 476 GIQISFDGANGFVGFGPNVC 495
+++ FD + VGF C
Sbjct: 478 NLRVVFDTSASQVGFAQETC 497
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 153/447 (34%), Positives = 221/447 (49%), Gaps = 44/447 (9%)
Query: 78 RWNLELV--HRDKMSSSSNTTNNMHYHRH-----------QHSFHARMQRDVKRVATLVR 124
N E V R+ +SSS + T HRH + + ++RD R + R
Sbjct: 32 ELNSEAVCSERNAISSSLSGTTVALNHRHGPCSPVPSSKKRPTEEELLKRDQLRAEHIQR 91
Query: 125 RLS-----GGGADAAKHEVQD-FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSD 178
+ + G D + +V T + S +D + EY + +G+G+P +Q + ID+GSD
Sbjct: 92 KFAMNAAVDGAGDLQQSKVSSSVPTKLGSSLD--TLEYVISVGLGTPAVTQTVTIDTGSD 149
Query: 179 IVWVQCQPCSQ--CYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG----RCRYE 232
+ WVQC PC CY Q+ +FDPA S+++ VSC++A C +LE G G C+Y
Sbjct: 150 VSWVQCNPCPNPPCYAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYG 209
Query: 233 VSYGDGSYTKGTLALETLTI--GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVG 290
V YGDGS T GT + +TLT+ VK GC H G GL+GLGGG+ SLV
Sbjct: 210 VQYGDGSTTNGTYSRDTLTLSGASDAVKGFQFGCSHVESGFSDQTDGLMGLGGGAQSLVS 269
Query: 291 QLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGV 350
Q G +FSYCL +GSSG L G G ++R+ + P+FY L + V
Sbjct: 270 QTAAAYGNSFSYCLPPT-SGSSGFLTLGGGGGVSGFVTTRMLRSRQIPTFYGARLQDIAV 328
Query: 351 GGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI 410
GG ++ +S +F G V+D+GT +TRLP AY A AF A A SI
Sbjct: 329 GGKQLGLSPSVFAA------GSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSI 382
Query: 411 FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSI 468
DTC++ +G + +PTV+ FSGG + L + + C AFA + I
Sbjct: 383 LDTCFDFAGQTQISIPTVALVFSGGAAIDLDPNGIMY------GNCLAFAATGDDGTTGI 436
Query: 469 IGNIQQEGIQISFDGANGFVGFGPNVC 495
IGN+QQ ++ +D + +GF C
Sbjct: 437 IGNVQQRTFEVLYDVGSSTLGFRSGAC 463
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 163/433 (37%), Positives = 227/433 (52%), Gaps = 40/433 (9%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADA--AKHEV 138
L L HR + S ++ S ++ D +R ++RR+SG +K
Sbjct: 68 LRLTHRHGPCAPSRASS-----LAAPSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAA 122
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS---QCYKQSD 195
G D G+ Y V +G+P +Q M +D+GSD+ WVQC+PC+ CY Q D
Sbjct: 123 AVATVPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKD 182
Query: 196 PVFDPADSASFSGVSCSSAVCDRL---ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI 252
P+FDPA S+S++ V C VC L + C A +C Y VSYGDGS T G + +TLT+
Sbjct: 183 PLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL 242
Query: 253 -GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
+ V+ GCGH G+F G GLLGLG SLV Q G GG FSYCL ++ + +
Sbjct: 243 SASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPS-T 301
Query: 312 SGSLVFGREALPVGAA----WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
+G L G P GAA L+ +P AP++Y V L+G+ VGG ++ + F +
Sbjct: 302 AGYLTLGVGG-PSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV 360
Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN--LPRASGVSIFDTCYNLSGFVSVRV 425
+DTGT VTRLP AY A R AF + + P A I DTCYN +G+ +V +
Sbjct: 361 ------VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTL 414
Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTF-CFAFAPSPS--GLSIIGNIQQEGIQISFD 482
P V+ F G +TL A L +F C AFAPS S G++I+GN+QQ ++ D
Sbjct: 415 PNVALTFGSGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID 467
Query: 483 GANGFVGFGPNVC 495
G + VGF P+ C
Sbjct: 468 GTS--VGFKPSSC 478
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 132/362 (36%), Positives = 186/362 (51%), Gaps = 26/362 (7%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
GEY + + +G+PP ++D+GSD++W QC PC C Q P F PA SA++ V C S
Sbjct: 90 GEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRS 149
Query: 214 AVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIG-----RTVVKNVAIGCGHK 267
+C L C C Y+ YGD + T G LA ET T G + +V +VA GCG+
Sbjct: 150 PLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGNI 209
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL----- 322
N G ++G++GLG G +SLV QLG FSYCL S + L FG A
Sbjct: 210 NSGQLANSSGMVGLGRGPLSLVSQLGPSR---FSYCLTSFLSPEPSRLNFGVFATLNGTN 266
Query: 323 ------PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
PV + PLV N PS Y++ L G+ +G R+PI +F + G GV +D+
Sbjct: 267 ASSSGSPVQS--TPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDS 324
Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNL--SGFVSVRVPTVSFYFS 433
GT++T L AY+A R V+ LP + I +TC+ V+V VP + +F
Sbjct: 325 GTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETCFPWPPPPSVAVTVPDMELHFD 384
Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
GG +T+P N+++ G C A S +IIGN QQ+ + I +D AN + F P
Sbjct: 385 GGANMTVPPENYMLIDGATGFLCLAMIRS-GDATIIGNYQQQNMHILYDIANSLLSFVPA 443
Query: 494 VC 495
C
Sbjct: 444 PC 445
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 146/430 (33%), Positives = 225/430 (52%), Gaps = 43/430 (10%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+ L HRD + S ++ HY R ++F +R + R ATL+ R + GA
Sbjct: 30 FTTSLFHRDSLLSPLEFSSLSHYDRLTNAF----RRSLSRSATLLNRAATNGA------- 78
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
D+ + + GSGEY + + +G+PP + D+GSD++W QC PC +CYKQS P+F
Sbjct: 79 ----LDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIF 134
Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVV 257
DP S SFS V C+S C ++++ C A G C Y +YGD +YTKG L E +TIG + V
Sbjct: 135 DPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKITIGSSSV 194
Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA--FSYCLVSRGTGSSGSL 315
K+V IGCGH++ G F A+G++GLGGG +SLV Q+ +G + FSYCL + + ++G +
Sbjct: 195 KSV-IGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKI 253
Query: 316 VFGREALPVGAAWV--PLV-RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
FG+ A+ G V PL+ +NP ++YYV L + +G R + V
Sbjct: 254 NFGQNAVVSGPGVVSTPLISKNPV--TYYYVTLEAISIGNER--------HMASAKQGNV 303
Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV----SIFDTCYN--LSGFVSVRVP 426
++D+GT ++ LP Y D V+ + +A V + +D C++ ++ S +P
Sbjct: 304 IIDSGTTLSFLPKELY----DGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIP 359
Query: 427 TVSFYFSGGP-VLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
++ FSGG V LP + F ++ A IIGN+ I +D
Sbjct: 360 IITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEA 419
Query: 486 GFVGFGPNVC 495
+ F P VC
Sbjct: 420 KRLSFKPTVC 429
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 132/362 (36%), Positives = 186/362 (51%), Gaps = 26/362 (7%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
GEY + + +G+PP ++D+GSD++W QC PC C Q P F PA SA++ V C S
Sbjct: 90 GEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRS 149
Query: 214 AVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIG-----RTVVKNVAIGCGHK 267
+C L C C Y+ YGD + T G LA ET T G + +V +VA GCG+
Sbjct: 150 PLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGNI 209
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL----- 322
N G ++G++GLG G +SLV QLG FSYCL S + L FG A
Sbjct: 210 NSGQLANSSGMVGLGRGPLSLVSQLGPSR---FSYCLTSFLSPEPSRLNFGVFATLNGTN 266
Query: 323 ------PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
PV + PLV N PS Y++ L G+ +G R+PI +F + G GV +D+
Sbjct: 267 ASSSGSPVQS--TPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDS 324
Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNL--SGFVSVRVPTVSFYFS 433
GT++T L AY+A R V+ LP + I +TC+ V+V VP + +F
Sbjct: 325 GTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETCFPWPPPPSVAVTVPDMELHFD 384
Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
GG +T+P N+++ G C A S +IIGN QQ+ + I +D AN + F P
Sbjct: 385 GGANMTVPPENYMLIDGATGFLCLAMIRS-GDATIIGNYQQQNMHILYDIANSLLSFVPA 443
Query: 494 VC 495
C
Sbjct: 444 PC 445
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 145/392 (36%), Positives = 207/392 (52%), Gaps = 15/392 (3%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
+ +D RV ++ R S A + E+Q V SG+ G+G Y V++ +G+P S +
Sbjct: 2 LLQDQLRVKSMHARFSNKNAGSHFKEMQA-DIPVQSGIPLGAGNYLVKMALGTPKLSLSL 60
Query: 172 VIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSA----VCDRLENAGCHA 226
+D+GSDI W QC+PC CY+Q+ FDP S+S+ VSCSS+ + D GC +
Sbjct: 61 ALDTGSDITWTQCEPCVGSCYRQAQTKFDPRKSSSYKNVSCSSSSCRIITDSGGARGCVS 120
Query: 227 GRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGS 285
C Y+V YGDGSY+ G A E LTI + V+ N GCG +N G F AGLLGLG G
Sbjct: 121 STCIYKVQYGDGSYSVGFFATEKLTISPSDVISNFLFGCGQQNAGRFGRIAGLLGLGRGK 180
Query: 286 MSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGL 345
+SL Q + F+YCL S + S+G L G + +P + PL + FY + +
Sbjct: 181 LSLALQTSEKYNNLFTYCLPSFSSSSTGHLTLGGQ-VPKSVKFTPLSPAFKNTPFYGIDI 239
Query: 346 SGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA 405
GL VGG +PI +F + G ++D+GT +TRL Y A F + P+
Sbjct: 240 KGLSVGGHVLPIDASVF-----SNAGAIIDSGTVITRLQPTVYSALSSKFQQLMKDYPKT 294
Query: 406 SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS- 464
G SI DTCY+ SG S+ VP +SF+F GG + + L ++ C AFAP+
Sbjct: 295 DGFSILDTCYDFSGNESISVPRISFFFKGGVEVDIKFFGILTVINAWDKVCLAFAPNDDD 354
Query: 465 -GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ GN QQ+ + D A G +GF P+ C
Sbjct: 355 GDFVVFGNSQQQTYDVVHDLAKGRIGFAPSGC 386
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 127/373 (34%), Positives = 191/373 (51%), Gaps = 24/373 (6%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASF 206
SG G+GEYF+ + VG+PP+ ++++D+GSD+ W+QC PC C++Q+ P ++P +S+S+
Sbjct: 161 SGASLGTGEYFIDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGPHYNPNESSSY 220
Query: 207 SGVSCSSAVC------DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV---- 256
+SC C D L++ C Y Y DGS T G ALET T+ T
Sbjct: 221 RNISCYDPRCQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGK 280
Query: 257 -----VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--RGT 309
V +V GCGH N+G F GA GLLGLG G +S QL G +FSYCL T
Sbjct: 281 EKFKHVVDVMFGCGHWNKGFFHGAGGLLGLGRGPLSFPSQLQSIYGHSFSYCLTDLFSNT 340
Query: 310 GSSGSLVFGREALPV---GAAWVPLVRNPRAP--SFYYVGLSGLGVGGMRIPISEDLFRL 364
S L+FG + + + L+ P +FYY+ + + VGG + I E +
Sbjct: 341 SVSSKLIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHW 400
Query: 365 TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR 424
+ G G ++D+G+ +T P AY+ ++AF + A+ I CYN+SG + V
Sbjct: 401 SSEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQIAADDFIMSPCYNVSGAMQVE 460
Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFD 482
+P +F+ G V PA N+ + C A +P S L+IIGN+ Q+ I +D
Sbjct: 461 LPDYGIHFADGAVWNFPAENYFYQYEPDEVICLAILKTPNHSHLTIIGNLLQQNFHILYD 520
Query: 483 GANGFVGFGPNVC 495
+G+ P C
Sbjct: 521 VKRSRLGYSPRRC 533
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 123/379 (32%), Positives = 189/379 (49%), Gaps = 30/379 (7%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASF 206
SG G+GEYF+ + VG+PP+ ++++D+GSD+ W+QC PC C++Q+ + P DS+++
Sbjct: 162 SGASLGTGEYFLDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGSHYYPKDSSTY 221
Query: 207 SGVSCSSAVC------DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV---- 256
+SC C D L++ C Y Y DGS T G A ET T+ T
Sbjct: 222 RNISCYDPRCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGK 281
Query: 257 -----VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--RGT 309
V +V GCGH N+G F GA+GLLGLG G +S Q+ G +FSYCL T
Sbjct: 282 EKFKQVVDVMFGCGHWNKGFFYGASGLLGLGRGPISFPSQIQSIYGHSFSYCLTDLFSNT 341
Query: 310 GSSGSLVFGREALPV---GAAWVPLVRNPRAP--SFYYVGLSGLGVGGMRIPISEDLFRL 364
S L+FG + + + L+ P +FYY+ + + VGG + ISE +
Sbjct: 342 SVSSKLIFGEDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLDISEQTWHW 401
Query: 365 TQ-----MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSG 419
+ G ++D+G+ +T P AY+ ++AF + A+ + CYN+SG
Sbjct: 402 SSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQIAADDFVMSPCYNVSG 461
Query: 420 -FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEG 476
+ V +P +F+ G V PA N+ + C A +P S L+IIGN+ Q+
Sbjct: 462 AMMQVELPDFGIHFADGGVWNFPAENYFYQYEPDEVICLAIMKTPNHSHLTIIGNLLQQN 521
Query: 477 IQISFDGANGFVGFGPNVC 495
I +D +G+ P C
Sbjct: 522 FHILYDVKRSRLGYSPRRC 540
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 215 bits (547), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 140/430 (32%), Positives = 217/430 (50%), Gaps = 43/430 (10%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+ L HRD + S ++ HY R ++F +R + R A L+ R + GA +
Sbjct: 30 FTTSLFHRDSLLSPLEFSSLSHYDRLANAF----RRSLSRSAALLNRAATSGAVGLQ--- 82
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
S + GSGEY + + +G+PP + D+GSD+ W QC PC +CY+Q P+F
Sbjct: 83 --------SSIGPGSGEYLMSVSIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIF 134
Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVV 257
+P S SFS V C++ C +++ C G C Y +YGD +Y+KG L E +TIG + V
Sbjct: 135 NPLKSTSFSHVPCNTQTCHAVDDGHCGVQGVCDYSYTYGDRTYSKGDLGFEKITIGSSSV 194
Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA---FSYCLVSRGTGSSGS 314
K+V IGCGH + G F A+G++GLGGG +SLV Q+ QT G FSYCL + + ++G
Sbjct: 195 KSV-IGCGHASSGGFGFASGVIGLGGGQLSLVSQM-SQTSGISRRFSYCLPTLLSHANGK 252
Query: 315 LVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
+ FG A+ G V PL+ + ++YY+ L + +G R + V
Sbjct: 253 INFGENAVVSGPGVVSTPLI-SKNTVTYYYITLEAISIGNER--------HMAFAKQGNV 303
Query: 373 VMDTGTAVTRLPTPAYEAFRDAFV----AQTGNLPRASGVSIFDTCYN--LSGFVSVRVP 426
++D+GT +T LP Y+ + + A+ P S D C++ ++ S+ +P
Sbjct: 304 IIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHGS----LDLCFDDGINAAASLGIP 359
Query: 427 TVSFYFSGGP-VLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
++ +FSGG V LP + F D+ A + IIGN+ Q I +D
Sbjct: 360 VITAHFSGGANVNLLPINTFRKVADNVNCLTLKAASPTTEFGIIGNLAQANFLIGYDLEA 419
Query: 486 GFVGFGPNVC 495
+ F P VC
Sbjct: 420 KRLSFKPTVC 429
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 152/447 (34%), Positives = 221/447 (49%), Gaps = 44/447 (9%)
Query: 78 RWNLELV--HRDKMSSSSNTTNNMHYHRH-----------QHSFHARMQRDVKRVATLVR 124
N E V R+ +SSS + T HRH + + ++RD R + R
Sbjct: 32 ELNSEAVCSERNAISSSLSGTTVALNHRHGPCSPVPSSKKRPTEEELLKRDQLRAEHIQR 91
Query: 125 RLS-----GGGADAAKHEVQD-FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSD 178
+ + G D + +V T + S +D + EY + +G+G+P +Q + ID+GSD
Sbjct: 92 KFAMNAAVDGAGDLQQSKVSSSVPTKLGSSLD--TLEYVISVGLGTPAVTQTVTIDTGSD 149
Query: 179 IVWVQCQPCSQ--CYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG----RCRYE 232
+ WVQC PC C+ Q+ +FDPA S+++ VSC++A C +LE G G C+Y
Sbjct: 150 VSWVQCNPCPNPPCHAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYG 209
Query: 233 VSYGDGSYTKGTLALETLTI--GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVG 290
V YGDGS T GT + +TLT+ VK GC H G GL+GLGGG+ SLV
Sbjct: 210 VQYGDGSTTNGTYSRDTLTLSGASDAVKGFQFGCSHLESGFSDQTDGLMGLGGGAQSLVS 269
Query: 291 QLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGV 350
Q G +FSYCL +GSSG L G G ++R+ + P+FY L + V
Sbjct: 270 QTAAAYGNSFSYCLPPT-SGSSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAV 328
Query: 351 GGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI 410
GG ++ +S +F G V+D+GT +TRLP AY A AF A A SI
Sbjct: 329 GGKQLGLSPSVFAA------GSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSI 382
Query: 411 FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSI 468
DTC++ +G + +PTV+ FSGG + L + + C AFA + I
Sbjct: 383 LDTCFDFAGQTQISIPTVALVFSGGAAIDLDPNGIMY------GNCLAFAATGDDGTTGI 436
Query: 469 IGNIQQEGIQISFDGANGFVGFGPNVC 495
IGN+QQ ++ +D + +GF C
Sbjct: 437 IGNVQQRTFEVLYDVGSSTLGFRSGAC 463
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 214 bits (544), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 138/343 (40%), Positives = 182/343 (53%), Gaps = 31/343 (9%)
Query: 168 SQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA--- 222
SQ +V+D+ SDI WVQC PC QC+ Q DP++DPA S++F+ + C S C L ++
Sbjct: 168 SQTVVVDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGN 227
Query: 223 GCH--AGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGA-AGL 278
GC C+Y V+YGDG T GT +TLT+ T VVK+ GC H +G F AG+
Sbjct: 228 GCSPTTDECKYIVNYGDGKATTGTYVTDTLTMSPTIVVKDFRFGCSHAVRGSFSNQNAGI 287
Query: 279 LGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA----AWVPLVRN 334
L LGGG SL+ Q G AFSYC+ S+G L G PV A ++ PL++N
Sbjct: 288 LALGGGRGSLLEQTADAYGNAFSYCIPK--PSSAGFLSLGG---PVEASLKFSYTPLIKN 342
Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
AP+FY V L + V G ++ + F G VMD+G VT+LP Y A R A
Sbjct: 343 KHAPTFYIVHLEAIIVAGKQLAVPPTAFAT------GAVMDSGAVVTQLPPQVYAALRAA 396
Query: 395 F-VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL-PASNFLIPVDDA 452
F A P A+ V DTCY+ + F V+VP VS F+GG L L PAS L
Sbjct: 397 FRSAMAAYGPLAAPVRNLDTCYDFTRFPDVKVPKVSLVFAGGATLDLEPASIIL-----D 451
Query: 453 GTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
G FA P + IGN+QQ+ ++ +D G VGF C
Sbjct: 452 GCLAFAATPGEESVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 153/440 (34%), Positives = 217/440 (49%), Gaps = 53/440 (12%)
Query: 89 MSSSSNTTNNMHYHRH------------QHSFHARMQRDVKRVATLVRRLSGG--GADAA 134
+ SNT + HRH SF R++R+ R ++ R+S G G DA
Sbjct: 49 LDPGSNTVSVPLVHRHGPCAPTQLSSDKPSSFTDRLRRNRARSKYIMSRVSKGMMGDDAD 108
Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--SQCYK 192
G V S EY V +G+G+P SQ ++ID+GSD+ WVQCQPC + CY
Sbjct: 109 VSIPTHLGGSV------DSLEYVVTVGLGTPSVSQVLLIDTGSDLSWVQCQPCNSTTCYP 162
Query: 193 QSDPVFDPADSASFSGVSCSSAVCDRLEN----AGCHAG----RCRYEVSYGDGSYTKGT 244
Q DP+FDP+ S++++ + C++ C L + GC +G +C + ++YGDGS T+G
Sbjct: 163 QKDPLFDPSKSSTYAPIPCNTDACRDLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGV 222
Query: 245 LALETLTIGRTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC 303
+ ETL + V VK+ GCGH G GLLGLGG SLV Q GGAFSYC
Sbjct: 223 YSNETLALAPGVAVKDFRFGCGHDQDGANDKYDGLLGLGGAPESLVVQTASVYGGAFSYC 282
Query: 304 L------VSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPI 357
L V G G G + P++R +FY V ++G+ VGG I +
Sbjct: 283 LPALNNQVGFLALGGGGAPSGGVVNTSGFVFTPMIREEE--TFYVVNMTGITVGGEPIDV 340
Query: 358 SEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNL 417
F G+++D+GT VT L AY A + AF P + DTCY+
Sbjct: 341 PPSAFS------GGMIIDSGTVVTELQHTAYNALQAAFRKAMAAYPLVRNGEL-DTCYDF 393
Query: 418 SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQE 475
SG+ +V +P V+ FSGG + L N ++ +DD C AF S I+GN+ Q
Sbjct: 394 SGYSNVTLPKVALTFSGGATIDLDVPNGIL-LDD----CLAFQESGPDDQPGILGNVNQR 448
Query: 476 GIQISFDGANGFVGFGPNVC 495
+++ +D G VGF VC
Sbjct: 449 TLEVLYDAGRGRVGFRAAVC 468
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 130/356 (36%), Positives = 189/356 (53%), Gaps = 19/356 (5%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSC 211
+ Y V I +G+PP V+D+GSD++W QC PC +C+ Q P++ PA SA+++ VSC
Sbjct: 89 TATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSC 148
Query: 212 SSAVCDRLENAGCHAGR----CRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGH 266
S +C L++ C Y SYGDG+ T G LA ET T+G T V+ VA GCG
Sbjct: 149 RSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAFGCGT 208
Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-LPVG 325
+N G ++GL+G+G G +SLV QLG FSYC ++ L G A L
Sbjct: 209 ENLGSTDNSSGLVGMGRGPLSLVSQLGVTR---FSYCFTPFNATAASPLFLGSSARLSSA 265
Query: 326 AAWVPLVRNP-----RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
A P V +P R S+YY+ L G+ VG +PI +FRLT MGD GV++D+GT
Sbjct: 266 AKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTF 325
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
T L A+ A A ++ LP ASG + C+ + +V VP + +F G +
Sbjct: 326 TALEESAFVALARALASRV-RLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGAD-ME 383
Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L ++++ AG C S G+S++G++QQ+ I +D G + F P C
Sbjct: 384 LRRESYVVEDRSAGVACLGMV-SARGMSVLGSMQQQNTHILYDLERGILSFEPAKC 438
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 130/356 (36%), Positives = 189/356 (53%), Gaps = 19/356 (5%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSC 211
+ Y V I +G+PP V+D+GSD++W QC PC +C+ Q P++ PA SA+++ VSC
Sbjct: 89 TATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSC 148
Query: 212 SSAVCDRLENAGCHAGR----CRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGH 266
S +C L++ C Y SYGDG+ T G LA ET T+G T V+ VA GCG
Sbjct: 149 RSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAFGCGT 208
Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-LPVG 325
+N G ++GL+G+G G +SLV QLG FSYC ++ L G A L
Sbjct: 209 ENLGSTDNSSGLVGMGRGPLSLVSQLGVTR---FSYCFTPFNATAASPLFLGSSARLSSA 265
Query: 326 AAWVPLVRNP-----RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
A P V +P R S+YY+ L G+ VG +PI +FRLT MGD GV++D+GT
Sbjct: 266 AKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTF 325
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
T L A+ A A ++ LP ASG + C+ + +V VP + +F G +
Sbjct: 326 TALEERAFVALARALASRV-RLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGAD-ME 383
Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L ++++ AG C S G+S++G++QQ+ I +D G + F P C
Sbjct: 384 LRRESYVVEDRSAGVACLGMV-SARGMSVLGSMQQQNTHILYDLERGILSFEPAKC 438
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 213 bits (543), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 138/353 (39%), Positives = 185/353 (52%), Gaps = 15/353 (4%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSAS 205
+G G+ E+ V +G G+P ++ ++ D+GSD+ W+QC PCS CYKQ DP+FDP SA+
Sbjct: 111 TGTSLGTLEFVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSAT 170
Query: 206 FSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGC 264
+S V C C G C Y+V YGDGS T G L+ ETL++ + A GC
Sbjct: 171 YSAVPCGHPQCAAAGGKCSSNGTCLYKVQYGDGSSTAGVLSHETLSLTSARALPGFAFGC 230
Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
G N G F GL+GLG G +SL Q G AFSYCL S T S G L G
Sbjct: 231 GETNLGDFGDVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNT-SHGYLTIGTTTPAS 289
Query: 325 GA---AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
G+ + +++ PSFY+V L + VGG +P+ LF DG ++D+GT +T
Sbjct: 290 GSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFT-----RDGTLLDSGTVLT 344
Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
LP AY A RD F A FDTCY+ +G ++ +P VSF FS G L
Sbjct: 345 YLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFAGQNAIFMPLVSFKFSDGSSFDLS 404
Query: 442 ASNFLIPVDD--AGTFCFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGF 490
LI DD T C AF P PS + +I+GN QQ ++ +D A +GF
Sbjct: 405 PFGVLIFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTEMIYDVAAEKIGF 457
>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 127/334 (38%), Positives = 174/334 (52%), Gaps = 74/334 (22%)
Query: 62 ERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVAT 121
E IS+ S + + L HRD ++ ++ + F+ R+QRD RV
Sbjct: 84 ETETQISTLPVSETDPTMTMHLEHRDVLAFNATP---------EALFNLRLQRDAFRVEA 134
Query: 122 LVR-----RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSG 176
L + G + + F + V SG+ QGSGEYF R+GVG+PP+ YMV+D+G
Sbjct: 135 LSKMAAAAGGRRAGRNGTHAQGGGFSSSVTSGLAQGSGEYFTRLGVGTPPKYVYMVLDTG 194
Query: 177 SDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR-CRYEVSY 235
SD+VW+QC PC +CY Q+DPVFDP S SFS +SC S +C RL++ GC++ + C Y+V+Y
Sbjct: 195 SDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCRSPLCLRLDSPGCNSRQSCLYQVAY 254
Query: 236 GDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQ 295
GDGS+T G + ETLT T V VA+GCGH N+G+FVGAAGLLGLG
Sbjct: 255 GDGSFTFGEFSTETLTFRGTRVPKVALGCGHDNEGLFVGAAGLLGLG------------- 301
Query: 296 TGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
R PR L+ VGG R+
Sbjct: 302 -------------------------------------RQPR--------LNRPPVGGARV 316
Query: 356 P-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
I+ LF+L G+ GV++D+GT+VTRL AY
Sbjct: 317 AGITASLFKLDTAGNGGVIIDSGTSVTRLTRRAY 350
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 141/400 (35%), Positives = 205/400 (51%), Gaps = 36/400 (9%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMD------QGSGEYFVRIGVGSP 165
++RD RV ++ AKH + T V + M G Y V +G+G+P
Sbjct: 92 LRRDQLRVKSI----------RAKHSMNSSTTGVFNEMKTRVPTTHFGGGYAVTVGLGTP 141
Query: 166 PRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENA 222
+ ++ D+GSD+ W QC+PCS C+ Q+D FDP S S+ +SCSS C + E+A
Sbjct: 142 KKDFSLLFDTGSDLTWTQCEPCSGGCFPQNDEKFDPTKSTSYKNLSCSSEPCKSIGKESA 201
Query: 223 -GCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLL 279
GC + C Y V YG G YT G LA ETLTI + V +N IGCG +N G F G AGLL
Sbjct: 202 QGCSSSNSCLYGVKYGTG-YTVGFLATETLTITPSDVFENFVIGCGERNGGRFSGTAGLL 260
Query: 280 GLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPS 339
GLG ++L Q FSYCL + + S+G L FG + A + P+ + P
Sbjct: 261 GLGRSPVALPSQTSSTYKNLFSYCLPAS-SSSTGHLSFGG-GVSQAAKFTPITS--KIPE 316
Query: 340 FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQT 399
Y + +SG+ VGG ++PI +FR G ++D+GT +T LP+ A+ A AF
Sbjct: 317 LYGLDVSGISVGGRKLPIDPSVFRTA-----GTIIDSGTTLTYLPSTAHSALSSAFQEMM 371
Query: 400 GNLPRASGVSIFDTCYNLSGFV--SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCF 457
N G S CY+ S ++ +P +S +F GG + + S I + C
Sbjct: 372 TNYTLTKGTSGLQPCYDFSKHANDNITIPQISIFFEGGVEVDIDDSGIFIAANGLEEVCL 431
Query: 458 AFAP--SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
AF + + ++I GN+QQ+ ++ +D A G VGF P C
Sbjct: 432 AFKDNGNDTDVAIFGNVQQKTYEVVYDVAKGMVGFAPGGC 471
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 144/419 (34%), Positives = 207/419 (49%), Gaps = 46/419 (10%)
Query: 108 FHARMQRDVKRVATLVRRLSG-----------------------GGADAAKHEVQDFGTD 144
F A + D R+A L RL+ GG+ A+ V
Sbjct: 65 FSAVVTHDDARIAHLASRLANNHPTSPSSSSLLHGHRKKKAGGVGGSQASSSSVP----- 119
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADS 203
+ G G Y R+G+G+P S MV+D+GS + W+QC PCS C++Q+ PVFDP S
Sbjct: 120 LTPGASVAVGNYVTRLGLGTPATSYVMVVDTGSSLTWLQCSPCSVSCHRQAGPVFDPRAS 179
Query: 204 ASFSGVSCSSAVCDRLENAGCHAGRCR------YEVSYGDGSYTKGTLALETLTIGRTVV 257
+++ V CSS+ C L+ A + C Y+ SYGD SY+ G L+ +T++ G
Sbjct: 180 GTYAAVQCSSSECGELQAATLNPSACSVSNVCIYQASYGDSSYSVGYLSKDTVSFGSGSF 239
Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF 317
GCG N+G+F +AGL+GL +SL+ QL G AFSYCL + + ++G L
Sbjct: 240 PGFYYGCGQDNEGLFGRSAGLIGLAKNKLSLLYQLAPSLGYAFSYCLPTS-SAAAGYLSI 298
Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
G P ++ P+ + S Y+V LSG+ V G + + +R ++D+G
Sbjct: 299 GSYN-PGQYSYTPMASSSLDASLYFVTLSGISVAGAPLAVPPSEYRSLP-----TIIDSG 352
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGV-SIFDTCYNLSGFVSVRVPTVSFYFSGGP 436
T +TRLP Y A A A + + SI DTC+ S +RVP V F+GG
Sbjct: 353 TVITRLPPNVYTALSRAVAAAMASAAPRAPTYSILDTCFRGSA-AGLRVPRVDMAFAGGA 411
Query: 437 VLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L L N LI VDD+ T C AFAP+ G +IIGN QQ+ + +D A +GF C
Sbjct: 412 TLALSPGNVLIDVDDSTT-CLAFAPT-GGTAIIGNTQQQTFSVVYDVAQSRIGFAAGGC 468
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 137/395 (34%), Positives = 198/395 (50%), Gaps = 27/395 (6%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
+ +D RV + RLS + E+Q T + + + G Y V +G+G+P + +
Sbjct: 99 LLQDQLRVKSFQVRLSMNPSSGVFKEMQ---TTIPASIVPTGGAYVVTVGLGTPKKDFTL 155
Query: 172 VIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG-----CH 225
D+GSD+ W QC+PC C+ Q+ P FDP S S+ VSCSS C + C
Sbjct: 156 SFDTGSDLTWTQCEPCLGGCFPQNQPKFDPTTSTSYKNVSCSSEFCKLIAEGNYPAQDCI 215
Query: 226 AGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGG 284
+ C Y + YG G YT G LA ETL I + V KN GC +++G F G GLLGLG
Sbjct: 216 SNTCLYGIQYGSG-YTIGFLATETLAIASSDVFKNFLFGCSEESRGTFNGTTGLLGLGRS 274
Query: 285 SMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVG 344
++L Q + FSYCL + + S+G L FG E + A P+ +P+ Y +
Sbjct: 275 PIALPSQTTNKYKNLFSYCLPASPS-STGHLSFGVE-VSQAAKSTPI--SPKLKQLYGLN 330
Query: 345 LSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR 404
G+ V G +PI+ + R ++D+GT T LP+P Y A AF N
Sbjct: 331 TVGISVRGRELPINGSISR--------TIIDSGTTFTFLPSPTYSALGSAFREMMANYTL 382
Query: 405 ASGVSIFDTCYNLS--GFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP- 461
+G S F CY+ S G ++ +P +S +F GG + + S +IPV+ C AFA
Sbjct: 383 TNGTSSFQPCYDFSNIGNGTLTIPGISIFFEGGVEVEIDVSGIMIPVNGLKEVCLAFADT 442
Query: 462 -SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
S S +I GN QQ+ ++ +D A G VGF P C
Sbjct: 443 GSDSDFAIFGNYQQKTYEVIYDVAKGMVGFAPKGC 477
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 141/428 (32%), Positives = 213/428 (49%), Gaps = 48/428 (11%)
Query: 114 RDVKRVATLVRR-LSGGGADAAKHEVQDFGTDVV--------------------SGMDQG 152
RD+ R+ TL +R L+ + + + +VV SGM G
Sbjct: 92 RDLTRIQTLHKRVLAKKNQNTVSQKQKKKNKEVVTTPVASSVEEQAGQLVATLESGMTLG 151
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
SGEYF+ + VGSPP+ +++D+GSD+ W+QC PC C++Q+ +DP SAS+ ++C+
Sbjct: 152 SGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCHDCFQQNGAFYDPKASASYKNITCN 211
Query: 213 SAVCDRLENAG----CHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTV---------V 257
C+ + C + C Y YGD S T G A+ET T+ T V
Sbjct: 212 DPRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGSSELYNV 271
Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG--TGSSGSL 315
+N+ GCGH N+G+F GAAGLLGLG G +S QL G +FSYCLV R T S L
Sbjct: 272 ENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKL 331
Query: 316 VFGREALPVG------AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
+FG + + ++V N +FYYV + + V G + I E+ + ++ G
Sbjct: 332 IFGEDKDLLSHPNLNFTSFVARKEN-LVDTFYYVQIKSIIVAGEVLNIPEETWNISSDGA 390
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
G ++D+GT ++ PAYE ++ + G P I D C+N+SG S+++P +
Sbjct: 391 GGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIDSIQLPEL 450
Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGF 487
F+ G V P N I +++ C A +P S SIIGN QQ+ I +D
Sbjct: 451 GIAFADGAVWNFPTENSFIWLNE-DLVCLAILGTPKSAFSIIGNYQQQNFHILYDTKRSR 509
Query: 488 VGFGPNVC 495
+G+ P C
Sbjct: 510 LGYAPTKC 517
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 211 bits (538), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 146/433 (33%), Positives = 209/433 (48%), Gaps = 46/433 (10%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+++E++HRD S + R ++ H R V R + AAK
Sbjct: 29 FSVEMIHRDSSRSPFFRPTETQFQRVANAVH----RSVNRANHFHK-----AHKAAK--- 76
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
+ + Q GEY + VG PP Y +ID+GSD++W+QC+PC +CY Q+ +F
Sbjct: 77 --------ATITQNDGEYLISYSVGIPPFQLYGIIDTGSDMIWLQCKPCEKCYNQTTRIF 128
Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHAGR---CRYEVSYGDGSYTKGTLALETLTIGRT 255
DP+ S ++ + SS C +E+ C + C Y + YGDGSY++G L++ETLT+G T
Sbjct: 129 DPSKSNTYKILPFSSTTCQSVEDTSCSSDNRKMCEYTIYYGDGSYSQGDLSVETLTLGST 188
Query: 256 -----VVKNVAIGCGHKNQGMFVG-AAGLLGLGGGSMSLVGQL---GGQTGGAFSYCLVS 306
+ IGCG N F G ++G++GLG G +SL+ QL G FSYCL S
Sbjct: 189 NGSSVKFRRTVIGCGRNNTVSFEGKSSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLAS 248
Query: 307 RGTGSSGSLVFGREALPVGAAWV--PLV-RNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
SS L FG A+ G V P+V +P+ FYY+ L VG RI + FR
Sbjct: 249 MSNISS-KLNFGDAAVVSGDGTVSTPIVTHDPKV--FYYLTLEAFSVGNNRIEFTSSSFR 305
Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG-VSIFDTCYNLSGFVS 422
+ G+ +++D+GT +T LP Y A VA L R + CY S F
Sbjct: 306 FGEKGN--IIIDSGTTLTLLPNDIYSKLESA-VADLVELDRVKDPLKQLSLCYR-STFDE 361
Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
+ P + +FSG V L A N I V+ G C AF S G I GN+ Q+ + +D
Sbjct: 362 LNAPVIMAHFSGADV-KLNAVNTFIEVEQ-GVTCLAFISSKIG-PIFGNMAQQNFLVGYD 418
Query: 483 GANGFVGFGPNVC 495
V F P C
Sbjct: 419 LQKKIVSFKPTDC 431
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 211 bits (538), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 140/439 (31%), Positives = 220/439 (50%), Gaps = 31/439 (7%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
LEL RD ++ + +Q++ + +++ K V T + A + + +
Sbjct: 101 LELQIRD-LTRIQTLHKRVLEKNNQNTVSQKQKKNDKEVVT-----TTPVASSVEEQAGQ 154
Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
+ SGM GSGEYF+ + VGSPP+ +++D+GSD+ W+QC PC C++Q+ +DP
Sbjct: 155 LVATLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDP 214
Query: 201 ADSASFSGVSCSSAVCDRLENAG----CHAGR--CRYEVSYGDGSYTKGTLALETLTIGR 254
SAS+ ++C+ C+ + + C + C Y YGD S T G A+ET T+
Sbjct: 215 KASASYKNITCNDQRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNL 274
Query: 255 TV---------VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
T V+N+ GCGH N+G+F GAAGLLGLG G +S QL G +FSYCLV
Sbjct: 275 TTNGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLV 334
Query: 306 SRG--TGSSGSLVFGREALPVGAAWVPLV-----RNPRAPSFYYVGLSGLGVGGMRIPIS 358
R T S L+FG + + + + +FYYV + + V G + I
Sbjct: 335 DRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIP 394
Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNL 417
E+ + ++ G G ++D+GT ++ PAYE ++ + G P I D C+N+
Sbjct: 395 EETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNV 454
Query: 418 SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEG 476
SG +V++P + F+ G V P N I +++ C A +P S SIIGN QQ+
Sbjct: 455 SGIHNVQLPELGIAFADGAVWNFPTENSFIWLNE-DLVCLAMLGTPKSAFSIIGNYQQQN 513
Query: 477 IQISFDGANGFVGFGPNVC 495
I +D +G+ P C
Sbjct: 514 FHILYDTKRSRLGYAPTKC 532
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 211 bits (538), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 154/456 (33%), Positives = 222/456 (48%), Gaps = 66/456 (14%)
Query: 69 SSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRH-----------QHSFHARMQRDVK 117
++ S+ RW + SNT + HRH + S R++R
Sbjct: 41 AATCSTSRVRW---------LDEGSNTVSVPLVHRHGPCAPSTRSSDEPSLSERLRRSRA 91
Query: 118 RVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGS 177
R ++ R S H G S EY V +G+G+P SQ ++ID+GS
Sbjct: 92 RSKYIMSRASKSNVSIPTHL----------GGSVDSLEYVVTVGLGTPAVSQVLLIDTGS 141
Query: 178 DIVWVQCQPC--SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG----CHAG---- 227
D+ WVQC PC + CY Q DP+FDP+ S++++ + C++ C L G C +G
Sbjct: 142 DLSWVQCAPCNSTTCYPQKDPLFDPSRSSTYAPIPCNTDACRDLTRDGYGSDCTSGSGGG 201
Query: 228 -RCRYEVSYGDGSYTKGTLALETLTIGRTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGS 285
+C Y ++YGDGS T G + ETLT+ V VK+ GCGH G GLLGLGG
Sbjct: 202 AQCGYAITYGDGSQTTGVYSNETLTMAPGVTVKDFHFGCGHDQDGPNDKYDGLLGLGGAP 261
Query: 286 MSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV----GAAWVPLVRNPRAPSFY 341
SLV Q GGAFSYCL + +G L G PV G + P+VR + +FY
Sbjct: 262 ESLVVQTSSVYGGAFSYCLPA-ANDQAGFLALGA---PVNDASGFVFTPMVREQQ--TFY 315
Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
V ++G+ VGG I + F G+++D+GT VT L AY A + AF
Sbjct: 316 VVNMTGITVGGEPIDVPPSAFS------GGMIIDSGTVVTELQHTAYAALQAAFRKAMAA 369
Query: 402 LPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF-- 459
P + DTCYN +G +V VP V+ FSGG + L + ++ +D+ C AF
Sbjct: 370 YPLLPNGEL-DTCYNFTGHSNVTVPRVALTFSGGATVDLDVPDGIL-LDN----CLAFQE 423
Query: 460 APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
A + I+GN+ Q +++ +D +G VGFG + C
Sbjct: 424 AGPDNQPGILGNVNQRTLEVLYDVGHGRVGFGADAC 459
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 211 bits (538), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 158/398 (39%), Positives = 213/398 (53%), Gaps = 35/398 (8%)
Query: 116 VKRVATLVRRLSGG--GADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
V+RV L L G G H G D G+ Y V +G+P +Q M +
Sbjct: 6 VRRVVLLSSLLCAGALGFLPCSHAAAVATVPASWGYDIGTLNYVVTASLGTPGVAQTMEV 65
Query: 174 DSGSDIVWVQCQPCS---QCYKQSDPVFDPADSASFSGVSCSSAVCDRL---ENAGCHAG 227
D+GSD+ WVQC+PC+ CY Q DP+FDPA S+S++ V C VC L + C A
Sbjct: 66 DTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSAA 125
Query: 228 RCRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSM 286
+C Y VSYGDGS T G + +TLT+ + V+ GCGH G+F G GLLGLG
Sbjct: 126 QCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQP 185
Query: 287 SLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA----WVPLVRNPRAPSFYY 342
SLV Q G GG FSYCL ++ + ++G L G P GAA L+ +P AP++Y
Sbjct: 186 SLVEQTAGTYGGVFSYCLPTKPS-TAGYLTLGVGG-PSGAAPGFSTTQLLPSPNAPTYYV 243
Query: 343 VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN- 401
V L+G+ VGG ++ + F + +DTGT VTRLP AY A R AF + +
Sbjct: 244 VMLTGISVGGQQLSVPASAFAGGTV------VDTGTVVTRLPPTAYAALRSAFRSGMASY 297
Query: 402 -LPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF-CFAF 459
P A I DTCYN +G+ +V +P V+ F G +TL A L +F C AF
Sbjct: 298 GYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL-------SFGCLAF 350
Query: 460 APSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
APS S G++I+GN+QQ ++ DG + VGF P+ C
Sbjct: 351 APSGSDGGMAILGNVQQRSFEVRIDGTS--VGFKPSSC 386
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 211 bits (537), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 140/436 (32%), Positives = 210/436 (48%), Gaps = 44/436 (10%)
Query: 103 RHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDV----------------- 145
R HS +D+ R+ TL R + + + +D+
Sbjct: 90 RTTHSVVDLQIQDLTRIKTLHARFNKSKKQKNEKVRKKITSDISLVGAPEVSPGKLIATL 149
Query: 146 VSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSAS 205
SGM GSGEYF+ + VG+PP+ +++D+GSD+ W+QC PC C+ Q+ +DP SAS
Sbjct: 150 ESGMTLGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYDPKTSAS 209
Query: 206 FSGVSCSSAVCDRLENAG----CHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTV--- 256
F ++C+ C + + C + C Y YGD S T G A+ET T+ T
Sbjct: 210 FKNITCNDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEG 269
Query: 257 ------VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--G 308
V N+ GCGH N+G+F GA+GLLGLG G +S QL G +FSYCLV R
Sbjct: 270 GSSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSN 329
Query: 309 TGSSGSLVFGREALPVGAAWVPLV-----RNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
T S L+FG + + + + +FYY+ + + VGG + I E+ +
Sbjct: 330 TNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWN 389
Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG-NLPRASGVSIFDTCYNLSGFV- 421
++ GD G ++D+GT ++ PAYE ++ F + N P + D C+N+SG
Sbjct: 390 ISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDPCFNVSGIEE 449
Query: 422 -SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQI 479
++ +P + F G V PA N I + + C A +P S SIIGN QQ+ I
Sbjct: 450 NNIHLPELGIAFVDGTVWNFPAENSFIWLSE-DLVCLAILGTPKSTFSIIGNYQQQNFHI 508
Query: 480 SFDGANGFVGFGPNVC 495
+D +GF P C
Sbjct: 509 LYDTKRSRLGFTPTKC 524
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 147/435 (33%), Positives = 210/435 (48%), Gaps = 50/435 (11%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+ +EL+HRD S HYHR VA +RR + H
Sbjct: 30 FTVELIHRDSPKSPMYNPLENHYHR---------------VADTLRR-------SISHNT 67
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
V + + GEY +++ VG+PP V D+GSDI+W QC+PC+ CY+Q P+F
Sbjct: 68 GLVTNTVEAPIYNNRGEYLMKLSVGTPPFPIIAVADTGSDIIWTQCEPCTNCYQQDLPMF 127
Query: 199 DPADSASFSGVSCSSAVCDRL--ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT- 255
+P+ S ++ VSCSS VC +N+ C Y +SYGD S+++G A++TLT+G T
Sbjct: 128 NPSKSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTS 187
Query: 256 ----VVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT- 309
AIGCGH N G F +G++GLG G SL+ Q+G GG FSYCL G
Sbjct: 188 GRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGND 247
Query: 310 -GSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
G S L FG A G+ V P+ + + SFY + L + VG + F T
Sbjct: 248 DGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVG------RNNTFYSTA 301
Query: 367 M----GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF-DTCYNLSGFV 421
G +++D+GT +T LP Y F A ++ + NL R + F + C+ +
Sbjct: 302 NSILGGKANIIIDSGTTLTLLPVDLYHNFAKA-ISNSINLQRTDDPNQFLEYCFETTT-D 359
Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA-PSPSGLSIIGNIQQEGIQIS 480
+VP ++ +F G L L N LI V D C AFA + +SI GNI Q +
Sbjct: 360 DYKVPFIAMHFEGAN-LRLQRENVLIRVSD-NVICLAFAGAQDNDISIYGNIAQINFLVG 417
Query: 481 FDGANGFVGFGPNVC 495
+D N + F P C
Sbjct: 418 YDVTNMSLSFKPMNC 432
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 210 bits (534), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 156/480 (32%), Positives = 227/480 (47%), Gaps = 38/480 (7%)
Query: 40 VNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNM 99
E+I+G R AK + + ++ + S LEL H SS+ T +
Sbjct: 67 ARETIQGRRYAQAKQAGFLAGEDKKAAEEPAARRSRSTTAVLELKHH----SSTATVPDH 122
Query: 100 HYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVV--SGMDQGSGEYF 157
R ++ H + D R A+L R + + +V SG+ + Y
Sbjct: 123 PAARERYLKHL-LAADSARAASLQLRKPKPASSTTTTQASAAAAEVPLGSGIRYQTLNYV 181
Query: 158 VRIGVGSP-PRSQYMVIDSGSDIVWVQCQPC--SQCYKQSDPVFDPADSASFSGVSCSSA 214
I +G ++ +++D+GSD+ WVQC+PC S CY Q DP+FDPA S +F+ V C S
Sbjct: 182 TTIALGGGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDPLFDPAASPTFAAVPCGSP 241
Query: 215 VCDR------------LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV-VKNVA 261
C +AG RC Y +SYGDGS+++G LA +TL +G T +
Sbjct: 242 ACAASLKDATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLAQDTLGLGTTTKLDGFV 301
Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG--- 318
GCG N+G+F G AGL+GLG +SLV Q + GG FSYCL + T S+GSL G
Sbjct: 302 FGCGLSNRGLFGGTAGLMGLGRTDLSLVSQTAARFGGVFSYCLPAT-TTSTGSLSLGPGP 360
Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
+ P A+ ++ +P P FY++ ++G V G V++D+GT
Sbjct: 361 SSSFP-NMAYTRMIADPTQPPFYFINITGAAV------GGGAALTAPGFGAGNVLVDSGT 413
Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
+TRL Y+A R F A+ P A G SI D CY+L+G V VP ++ GG +
Sbjct: 414 VITRLAPSVYKAVRAEF-ARRFEYPAAPGFSILDACYDLTGRDEVNVPLLTLTLEGGAQV 472
Query: 439 TLPASNFLIPV-DDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
T+ A+ L V D C A A P IIGN QQ ++ +D +GF C
Sbjct: 473 TVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIGNYQQRNKRVVYDTVGSRLGFADEDC 532
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 135/357 (37%), Positives = 189/357 (52%), Gaps = 23/357 (6%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
EY + + +G+PP + D+GSD+ W QCQPC C+ Q PV+DP+ S++FS V CSSA
Sbjct: 65 EYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSA 124
Query: 215 VCD---RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV------VKNVAIGCG 265
C R N + CRY SY DG+Y+ G L ETLTIG +V V +VA GCG
Sbjct: 125 TCLPTWRSRNCSNPSSPCRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGSVAFGCG 184
Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF--GREALP 323
N G + + G +GLG G++SL+ QLG G FSYCL + S F L
Sbjct: 185 TDNGGDSLNSTGTVGLGRGTLSLLAQLG---VGKFSYCLTDFFNSTMDSPFFLGTLAELA 241
Query: 324 VGAAWV---PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
G V PL+++P PS Y+V L G+ +G +R+PI F L G+ G+++D+GT
Sbjct: 242 PGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADGNGGMMVDSGTTF 301
Query: 381 TRLPTPAYEAFRDAF--VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
T L A FR+ VAQ P + S+ C+ S +P + +F+GG +
Sbjct: 302 TIL---AKSGFREVVDRVAQLLGQPPVNASSLDSPCFP-SPDGEPFMPDLVLHFAGGADM 357
Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L N++ +D +FC SPS S +GN QQ+ IQ+ FD G + F P C
Sbjct: 358 RLHRDNYMSYNEDDSSFCLNIVGSPSTWSRLGNFQQQNIQMLFDMTVGQLSFLPTDC 414
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 132/391 (33%), Positives = 208/391 (53%), Gaps = 31/391 (7%)
Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQS 194
K+ V F + VV+ + Q EY+V + VG+P +++D+GSD+ W+QC PC C
Sbjct: 119 KNTVTGFTSPVVT-LGQAGLEYYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPAL 177
Query: 195 DPVFDPADSASFSGVSCSSAVCDR----LENAGCHAGR-CRYEVSYGDGSYTKGTLALET 249
P F+P S+SF + C+S+ C ++ +GR C + + YGDGS + G LA+ET
Sbjct: 178 RPPFNPRHSSSFFKLPCASSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMET 237
Query: 250 LT-------IGRTV-VKNVAIGCGH-KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAF 300
+ G V + N+ +GC +G+ GA+GLLG+ +S QL + F
Sbjct: 238 IAGNTPNFGDGEPVKLSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKF 297
Query: 301 SYCLVSR--GTGSSGSLVFGR-EALPVGAAWVPLVRNPRAPS----FYYVGLSGLGVGGM 353
S+C + SSG + FG + + + PLV+NP PS +YYVGL G+ V
Sbjct: 298 SHCFPDKIAHLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDES 357
Query: 354 RIPISEDLFRLTQM-GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD 412
R+P+S F + ++ G G ++D+GTA T L PA++A R F+A+T +L + S F
Sbjct: 358 RLPLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFT 417
Query: 413 TCYNLS----GFVSVRVPTVSFYFSGGPVLTLPASNFLIPV---DDAGTFCFAFAPSPS- 464
CYN++ S +P+++ +F GG + LP ++ LIPV ++ T C AF S
Sbjct: 418 PCYNITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFLMSGDI 477
Query: 465 GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+IIGN QQ+ + + +D +G P C
Sbjct: 478 PFNIIGNYQQQNLWVEYDLEKLRLGIAPAQC 508
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 209 bits (531), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 136/363 (37%), Positives = 192/363 (52%), Gaps = 20/363 (5%)
Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG 208
+ G EY + + +G+PP + D+GSD+ W QCQPC C+ Q P++D A S+SFS
Sbjct: 86 LRSGQAEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPIYDTAVSSSFSP 145
Query: 209 VSCSSAVCDRL---ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI---GRTVVKNVAI 262
V C+SA C + N + CRY +YGDG+Y+ G L ETLT V +A
Sbjct: 146 VPCASATCLPIWSSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPGAPGVSVGGIAF 205
Query: 263 GCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS-LVFG--- 318
GCG N G+ + G +GLG GS+SLV QLG G FSYCL S GS ++FG
Sbjct: 206 GCGVDNGGLSYNSTGTVGLGRGSLSLVAQLG---VGKFSYCLTDFFNTSLGSPVLFGALA 262
Query: 319 REALPVGAAWV---PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMD 375
A P A V PLV++P P++YYV L G+ +G R+PI F L G G+++D
Sbjct: 263 ELAAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMIVD 322
Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCY-NLSGFVSV-RVPTVSFYFS 433
+GT T L A+ D VA P + S+ C+ +G + +P + +F+
Sbjct: 323 SGTTFTFLVESAFRVVVD-HVAGVLRQPVVNASSLDSPCFPAATGEQQLPAMPDMVLHFA 381
Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGP 492
GG + L N++ + +FC A SPS +SI+GN QQ+ IQ+ FD G + F P
Sbjct: 382 GGADMRLHRDNYMSFNQEESSFCLNIAGSPSADVSILGNFQQQNIQMLFDITVGQLSFMP 441
Query: 493 NVC 495
C
Sbjct: 442 TDC 444
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 209 bits (531), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 143/415 (34%), Positives = 209/415 (50%), Gaps = 39/415 (9%)
Query: 102 HRHQHSFHARMQRDVKRVATL--VRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVR 159
+ + ++R RVATL + L+ G A A + +D GEY +
Sbjct: 44 YTEEQLLSRALRRSSARVATLQSLAALAPGDAITAAR-ILVLASD---------GEYLME 93
Query: 160 IGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL 219
+G+G+P R ++D+GSD++W QC PC C Q P FDPA SA++ + C+S C+ L
Sbjct: 94 MGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACNAL 153
Query: 220 ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG----RTVVKNVAIGCGHKNQGMFVGA 275
C+ C Y+ YGD + T G LA ET T G R + ++ GCG+ N G+
Sbjct: 154 YYPLCYQKVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISFGCGNLNAGLLANG 213
Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL---------PVGA 326
+G++G G GS+SLV QLG FSYCL S + L FG A PV +
Sbjct: 214 SGMVGFGRGSLSLVSQLGSPR---FSYCLTSFLSPVPSRLYFGVYATLNSTNASSEPVQS 270
Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM-GDDGVVMDTGTAVTRLPT 385
P V NP P+ Y++ ++G+ VGG +PI +F + G G ++D+GT +T L
Sbjct: 271 --TPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAE 328
Query: 386 PAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFV--SVRVPTVSFYFSGGPVLTLPA 442
PAY+A R AF +Q T L + S+ DTC+ SV +P + +F G LP
Sbjct: 329 PAYDAVRAAFASQITLPLLNVTDASVLDTCFQWPPPPRQSVTLPQLVLHFDGAD-WELPL 387
Query: 443 SNFLIPVDDA--GTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
N+++ VD + G C A A S S SIIG+ Q + + +D N + F P C
Sbjct: 388 QNYML-VDPSTGGGLCLAMA-SSSDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 208 bits (530), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 147/435 (33%), Positives = 209/435 (48%), Gaps = 50/435 (11%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+ +EL+HRD S HYHR VA +RR + H
Sbjct: 30 FTVELIHRDSPKSPMYNPLENHYHR---------------VADTLRR-------SISHNT 67
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
V + + GEY +++ VG+PP V D+GSDI+W QC PC+ CY+Q P+F
Sbjct: 68 GLVTNTVEAPIYNNRGEYLMKLSVGTPPFPIIAVADTGSDIIWTQCVPCTNCYQQDLPMF 127
Query: 199 DPADSASFSGVSCSSAVCDRL--ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT- 255
+P+ S ++ VSCSS VC +N+ C Y +SYGD S+++G A++TLT+G T
Sbjct: 128 NPSKSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTS 187
Query: 256 ----VVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT- 309
AIGCGH N G F +G++GLG G SL+ Q+G GG FSYCL G
Sbjct: 188 GRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGND 247
Query: 310 -GSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
G S L FG A G+ V P+ + + SFY + L + VG + F T
Sbjct: 248 DGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVG------RNNTFYSTA 301
Query: 367 M----GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF-DTCYNLSGFV 421
G +++D+GT +T LP Y F A ++ + NL R + F + C+ +
Sbjct: 302 NSILGGKANIIIDSGTTLTLLPVDLYHNFAKA-ISNSINLQRTDDPNQFLEYCFETTT-D 359
Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA-PSPSGLSIIGNIQQEGIQIS 480
+VP ++ +F G L L N LI V D C AFA + +SI GNI Q +
Sbjct: 360 DYKVPFIAMHFEGAN-LRLQRENVLIRVSD-NVICLAFAGAQDNDISIYGNIAQINFLVG 417
Query: 481 FDGANGFVGFGPNVC 495
+D N + F P C
Sbjct: 418 YDVTNMSLSFKPMNC 432
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 208 bits (529), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 137/358 (38%), Positives = 195/358 (54%), Gaps = 17/358 (4%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSAS 205
+G + + E+ V +G G+P ++ +++D+GSD+ W+QC+PCS CY+Q DP FDPA S+S
Sbjct: 128 TGTNLDTLEFVVVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDPDFDPAKSSS 187
Query: 206 FSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGC 264
++ V C + VC C+ C Y V YGDGS T G L+ +TLT + GC
Sbjct: 188 YAAVPCGTPVC-AAAGGMCNGTTCLYGVQYGDGSSTTGVLSRDTLTFNSSSKFTGFTFGC 246
Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG--REAL 322
G KN G F GLLGLG G +SL Q GG FSYCL S T + G L G +
Sbjct: 247 GEKNIGDFGEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPSYNT-TPGYLNIGATKPTS 305
Query: 323 PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
V + +++ P+ PSFY++ L + +GG +P+ +F T G ++D+GT +T
Sbjct: 306 TVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVFTKT-----GTLLDSGTILTY 360
Query: 383 LPTPAYEAFRDAF-VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
LP PAY + RD F GN P A DTCY+ +G ++ +P VSF FS G V L
Sbjct: 361 LPPPAYTSLRDRFKFTMQGNKP-APPYEPLDTCYDFTGQGAIVIPAVSFNFSDGAVFDLD 419
Query: 442 ASNFLIPVDDAGTF--CFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+I DDA C AF P+ + SI+GN QQ ++ +D + +GF P C
Sbjct: 420 FYGIMIFPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEVIYDVPSQKIGFIPISC 477
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 207 bits (528), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 136/419 (32%), Positives = 211/419 (50%), Gaps = 30/419 (7%)
Query: 106 HSFHARMQRDVK-RVATLVRRLSGGGADAAKHEVQ--DFGTDVVSGMDQGSGEYFVRIGV 162
+ HAR ++ K R + ++++ + EV + SGM GSGEYF+ + V
Sbjct: 109 QTLHARFKKSKKQRNEKVKKKITSDISLVGAPEVSPGKLIATLESGMTLGSGEYFMDVLV 168
Query: 163 GSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN- 221
G+PP+ +++D+GSD+ W+QC PC C+ Q++ +DP SASF ++C+ C + +
Sbjct: 169 GTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNEAFYDPKTSASFKNITCNDPRCSLISSP 228
Query: 222 ---AGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTV---------VKNVAIGCGHK 267
C + C Y YGD S T G A+ET T+ T V+N+ GCGH
Sbjct: 229 EPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYKVENMMFGCGHW 288
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG--TGSSGSLVFGREALPVG 325
N+G+F GA+GLLGLG G +S QL G +FSYCLV R T S L+FG + +
Sbjct: 289 NRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLN 348
Query: 326 AAWVPLV-----RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
+ + +FYY+ + + VGG + I E+ + ++ G G ++D+GT +
Sbjct: 349 HTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPEETWNISPDGAGGTIIDSGTTL 408
Query: 381 TRLPTPAYEAFRDAFVAQTG-NLPRASGVSIFDTCYNLSGFV--SVRVPTVSFYFSGGPV 437
+ PAYE ++ F + N + D C+N+SG ++ +P + F+ G V
Sbjct: 409 SYFAEPAYEIIKNKFAEKMKENYLVFRDFPVLDPCFNVSGIEENNIHLPELGIAFADGAV 468
Query: 438 LTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
PA N I + + C A +P S SIIGN QQ+ I +D +GF P C
Sbjct: 469 WNFPAENSFIWLSE-DLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKMSRLGFTPTKC 526
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 207 bits (528), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 133/374 (35%), Positives = 192/374 (51%), Gaps = 23/374 (6%)
Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
F T +VSG GSG+YFV +G+P + ++++D+GSD+ +VQC PC CY+Q P++ P
Sbjct: 19 FRTPLVSGTTLGSGQYFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQDGPLYQP 78
Query: 201 ADSASFSGVSCSSAVCDRLE---NAGCHA--------GRCRYEVSYGDGSYTKGTLALET 249
++S++F+ V C SA C + A C + G C YE YGD S T G A ET
Sbjct: 79 SNSSTFTPVPCDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYET 138
Query: 250 LTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT 309
T+G V +VA GCG++NQG FV A G+LGLG G++S Q G F+YCL S +
Sbjct: 139 ATVGGIRVNHVAFGCGNRNQGSFVSAGGVLGLGQGALSFTSQAGYAFENKFAYCLTSYLS 198
Query: 310 GSS--GSLVFGREALPV--GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
+S SL+FG + + + PLV NP PS YYV + + GG + I + +++
Sbjct: 199 PTSVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDSAWKID 258
Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAF---VAQTGNLPRASGVSIFDTCYNLSGFVS 422
+G+ G + D+GT VT AY AF V P G+ + C N+SG
Sbjct: 259 SVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLPL---CVNVSGIDH 315
Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS-GLSIIGNIQQEGIQISF 481
P+ + F G N+ I V C A S S G ++IGNI Q+ + +
Sbjct: 316 PIYPSFTIEFDQGATYRPNQGNYFIEV-SPNIDCLAMLESSSDGFNVIGNIIQQNYLVQY 374
Query: 482 DGANGFVGFGPNVC 495
D +GF C
Sbjct: 375 DREEHRIGFAHANC 388
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 207 bits (528), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 141/402 (35%), Positives = 203/402 (50%), Gaps = 31/402 (7%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVS--GMDQGSGEYFVRIGVGSPPRSQ 169
++RD RV + R+++ A G +++ G + Y + +G+P
Sbjct: 99 LRRDQDRVDAIRRKVT------ASSNKPKGGVSLLANWGKSLSTTNYVASLRLGTPATEL 152
Query: 170 YMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLE-------NA 222
+ +D+GSD WVQC+PC+ CY+Q DPVFDP S+++S V C + C L +
Sbjct: 153 VVELDTGSDQSWVQCKPCADCYEQRDPVFDPTASSTYSAVPCGARECQELASSSSSRNCS 212
Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-------VVKNVAIGCGHKNQGMFVGA 275
+ C YEVSY D S+T G LA +TLT+ + V GCGH N G F
Sbjct: 213 SDNNKNCPYEVSYDDDSHTVGDLARDTLTLSPSPSPSPADTVPGFVFGCGHSNAGTFGEV 272
Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNP 335
GLLGLG G SL Q+ + G AFSYCL S + ++G L FG A A + +V
Sbjct: 273 DGLLGLGLGKASLPSQVAARYGAAFSYCLPSSPS-AAGYLSFGGAAARANAQFTEMVTG- 330
Query: 336 RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF 395
+ P+ YY+ L+G+ V G I + F G ++D+GTA +RLP AY A R +F
Sbjct: 331 QDPTSYYLNLTGIVVAGRAIKVPASAFATAA----GTIIDSGTAFSRLPPSAYAALRSSF 386
Query: 396 VAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
+ G RA IFDTCY+ +G +VR+P V F+ G + L S L +D
Sbjct: 387 RSAMGRYRYKRAPSSPIFDTCYDFTGHETVRIPAVELVFADGATVHLHPSGVLYTWNDVA 446
Query: 454 TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C AF P+ L I+GN QQ + + +D + +GFG C
Sbjct: 447 QTCLAFVPN-HDLGILGNTQQRTLAVIYDVGSQRIGFGRKGC 487
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 207 bits (528), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 139/398 (34%), Positives = 207/398 (52%), Gaps = 36/398 (9%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAK-HEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
M+R V R + +R LSG A + + H VQ EY + + +G PP
Sbjct: 42 MRRAVHR--SRLRALSGYDATSPRLHSVQV--------------EYLMELAIGKPPVPFV 85
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC-HAGRC 229
+ D+GSD+ W QCQPC C+ Q PV+DP+ S++FS + CSSA C + + C + C
Sbjct: 86 ALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPLPCSSATCLPIWSRNCTPSSLC 145
Query: 230 RYEVSYGDGSYTKGTLALETLTIGRT----VVKNVAIGCGHKNQGMFVGAAGLLGLGGGS 285
RY +YGDG+Y+ G L ETLT+G + V VA GCG N G + + G +GLG G+
Sbjct: 146 RYRYAYGDGAYSAGILGTETLTLGPSSAPVSVGGVAFGCGTDNGGDSLNSTGTVGLGRGT 205
Query: 286 MSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREA-LPVGAAWV---PLVRNPRAPSF 340
+SL+ QLG G FSYCL + + G A L G + V PL+++P+ PS
Sbjct: 206 LSLLAQLG---VGKFSYCLTDFFNSALDSPFLLGTLAELAPGPSTVQSTPLLQSPQNPSR 262
Query: 341 YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF--VAQ 398
Y+V L G+ +G +R+PI F L G G+++D+GT T L A FR+ VA+
Sbjct: 263 YFVSLQGISLGDVRLPIPNGTFDLRGDGTGGMIVDSGTTFTIL---AESGFREVVGRVAR 319
Query: 399 TGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFA 458
P + S+ C+ +P + +F+GG + L N++ ++ +FC
Sbjct: 320 VLGQPPVNASSLDAPCFPAPAGEPPYMPDLVLHFAGGADMRLYRDNYMSYNEEDSSFCLN 379
Query: 459 FA-PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
A +P S++GN QQ+ IQ+ FD G + F P C
Sbjct: 380 IAGTTPESTSVLGNFQQQNIQMLFDTTVGQLSFLPTDC 417
>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
Length = 409
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 150/425 (35%), Positives = 211/425 (49%), Gaps = 61/425 (14%)
Query: 112 MQRDVKRVATLVRRLSGGGADAA-KHEVQDFG-TDVVSGMDQGSGEYFVRIGVGSPPR-- 167
+ D RVA + +RL+G D A H+ + G T VVS + +G G+G P
Sbjct: 5 LDADQLRVAYIQKRLAGDTGDGADPHKFVEGGDTHVVSSLQVATGA-----GIGQKPHLT 59
Query: 168 --------------------SQYMVIDSGSDIVWVQCQPCSQ--CYKQSDPVFDPADSAS 205
SQ ++IDSGSD+ WVQCQPC C+ Q DP+FDPA S +
Sbjct: 60 TTRLGTTATTNSAPDGTSAVSQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTT 119
Query: 206 FSGVSCSSAVCDRL--ENAGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVA 261
++ V CSSA C RL GC A +C++ ++Y +G+ GT + + LT+G VV+
Sbjct: 120 YAAVPCSSAACARLGPYRRGCLANSQCQFGITYANGATATGTYSSDDLTLGPYDVVRGFL 179
Query: 262 IGCGHKNQG--MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR 319
GC H +QG AG L LGGGS S V Q Q FSYC V T S G ++FG
Sbjct: 180 FGCAHADQGSTFSYDVAGTLALGGGSQSFVQQTASQYSRVFSYC-VPPSTSSFGFIMFGV 238
Query: 320 EALPVGAAWVP-LVRNP------RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
P AA VP V P +P+FY V L + V G +P+ +F +
Sbjct: 239 P--PQRAALVPTFVSTPLLSSSTMSPTFYRVLLRSIIVAGRPLPVPPTVFSASS------ 290
Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYF 432
V+D+ T ++R+P AY+A R AF + A VSI DTCY+ SG S+ +P+++ F
Sbjct: 291 VIDSATVISRIPPTAYQALRAAFRSAMTMYRPAPPVSILDTCYDFSGVRSITLPSIALVF 350
Query: 433 SGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGF 490
GG + L A+ L+ C AFAP+ S IGN+QQ +++ +D + F
Sbjct: 351 DGGATVNLDAAGILL------QGCLAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRF 404
Query: 491 GPNVC 495
C
Sbjct: 405 RSAAC 409
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 144/392 (36%), Positives = 201/392 (51%), Gaps = 32/392 (8%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
++ D R + R+LSG D + T + S +D + EY + +G+GSP +Q M
Sbjct: 89 LEHDQLRAKYIQRKLSG--TDGLQPLDLTVPTTLGSALD--TMEYVITVGIGSPAVTQTM 144
Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN--AGCHAGRC 229
+ID+GSD+ WV+C +FDP+ S +++ SCSSA C +L N GC C
Sbjct: 145 MIDTGSDVSWVRCNSTDGL-----TLFDPSKSTTYAPFSCSSAACAQLGNNGDGCSNSGC 199
Query: 230 RYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAA--GLLGLGGGSM 286
+Y V YGDGS T GT + +TL + + V + GC H + F G GL+GLGG +
Sbjct: 200 QYRVQYGDGSNTTGTYSSDTLALSASDTVTDFHFGCSHHEED-FDGEKIDGLMGLGGDAQ 258
Query: 287 SLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR-EALPVGAAWVPLVRNPRAPSFYYVGL 345
SLV Q G +FSYCL +SG L FG G P++R P+AP+ Y V L
Sbjct: 259 SLVSQTAATYGKSFSYCLPPTNR-TSGFLTFGAPNGTSGGFVTTPMLRWPKAPTLYGVLL 317
Query: 346 SGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL--P 403
+ VGG + I + +G VMD+GT +T LP AY A AF + L
Sbjct: 318 QDISVGGTPLGIQPSVL------SNGSVMDSGTVITWLPRRAYSALSSAFRSSMTRLRHQ 371
Query: 404 RASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP 463
RA+ + I DTCY+ +G V+V +P VS GG V+ L + +I C AFA +
Sbjct: 372 RAAPLGILDTCYDFTGLVNVSIPAVSLVLDGGAVVDLDGNGIMI------QDCLAFAAT- 424
Query: 464 SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
SG SIIGN+QQ ++ D G GF C
Sbjct: 425 SGDSIIGNVQQRTFEVLHDVGQGVFGFRSGAC 456
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 130/391 (33%), Positives = 208/391 (53%), Gaps = 31/391 (7%)
Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQS 194
K+ + F + VV+ + Q EY+V + +G+P +++D+GSD+ W+QC PC C
Sbjct: 118 KNALTGFTSPVVT-LGQAGLEYYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPAL 176
Query: 195 DPVFDPADSASFSGVSCSSAVCDR----LENAGCHAGR-CRYEVSYGDGSYTKGTLALET 249
P F+P S+SF + C+S+ C ++ +GR C + + YGDGS + G LA+ET
Sbjct: 177 RPPFNPRHSSSFFKLPCASSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMET 236
Query: 250 LT-------IGRTV-VKNVAIGCGH-KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAF 300
+ G V + N+ +GC +G+ GA+GLLG+ +S QL + F
Sbjct: 237 IAGNTPNFGDGEPVKLSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKF 296
Query: 301 SYCLVSR--GTGSSGSLVFGR-EALPVGAAWVPLVRNPRAPS----FYYVGLSGLGVGGM 353
S+C + SSG + FG + + + PLV+NP PS +YYVGL G+ V
Sbjct: 297 SHCFPDKIAHLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDES 356
Query: 354 RIPISEDLFRLTQM-GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD 412
R+P+S F + ++ G G ++D+GTA T L PA++A R F+A+T +L + S F
Sbjct: 357 RLPLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFT 416
Query: 413 TCYNLS----GFVSVRVPTVSFYFSGGPVLTLPASNFLIPV---DDAGTFCFAFAPSPS- 464
CYN++ S +P+++ +F GG + LP ++ LIPV ++ T C AF S
Sbjct: 417 PCYNITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQMSGDI 476
Query: 465 GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+IIGN QQ+ + + +D +G P C
Sbjct: 477 PFNIIGNYQQQNLWVEYDLEKLRLGIAPAQC 507
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 143/415 (34%), Positives = 208/415 (50%), Gaps = 39/415 (9%)
Query: 102 HRHQHSFHARMQRDVKRVATL--VRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVR 159
+ + ++R RVATL + L+ G A A + +D GEY +
Sbjct: 44 YTEEQLLSRALRRSSARVATLQSLAALAPGDAITAAR-ILVLASD---------GEYLME 93
Query: 160 IGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL 219
+G+G+P R ++D+GSD++W QC PC C Q P FDPA SA++ + C+S C+ L
Sbjct: 94 MGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACNAL 153
Query: 220 ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG----RTVVKNVAIGCGHKNQGMFVGA 275
C+ C Y+ YGD + T G LA ET T G R + ++ GCG+ N G
Sbjct: 154 YYPLCYQKVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISFGCGNLNAGSLANG 213
Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL---------PVGA 326
+G++G G GS+SLV QLG FSYCL S + L FG A PV +
Sbjct: 214 SGMVGFGRGSLSLVSQLGSPR---FSYCLTSFLSPVPSRLYFGVYATLNSTNASSEPVQS 270
Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM-GDDGVVMDTGTAVTRLPT 385
P V NP P+ Y++ ++G+ VGG +PI +F + G G ++D+GT +T L
Sbjct: 271 --TPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAE 328
Query: 386 PAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFV--SVRVPTVSFYFSGGPVLTLPA 442
PAY+A R AF +Q T L + S+ DTC+ SV +P + +F G LP
Sbjct: 329 PAYDAVRAAFASQITLPLLNVTDASVLDTCFQWPPPPRQSVTLPQLVLHFDGAD-WELPL 387
Query: 443 SNFLIPVDDA--GTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
N+++ VD + G C A A S S SIIG+ Q + + +D N + F P C
Sbjct: 388 QNYML-VDPSTGGGLCLAMA-SSSDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 206 bits (523), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 153/451 (33%), Positives = 226/451 (50%), Gaps = 38/451 (8%)
Query: 60 LFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRV 119
+F H + S +S++ ++ +L+ RD S + + R Q +FH R + R
Sbjct: 16 IFFIHFSGLSHTEASNKGGFSTDLISRDSPLSPFYNPSETQFDRLQKAFH----RSISRA 71
Query: 120 ATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDI 179
R +G ++ + V +GEY + I +G+PP S + + D+GSD+
Sbjct: 72 NHF--RANGVSTNSIQSPVI-----------SNNGEYLMNISLGTPPVSMHGIADTGSDL 118
Query: 180 VWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL-ENAGC-HAGRCRYEVSYGD 237
+W QC+PC CY+Q +P+FDPA S ++ +SC C L GC C Y SYGD
Sbjct: 119 LWRQCKPCDSCYEQIEPIFDPAKSKTYQILSCEGKSCSNLGGQGGCSDDNTCIYSYSYGD 178
Query: 238 GSYTKGTLALETLTIGRTV-----VKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQ 291
GS+T G LA++TLTIG T V V GCGH N G F + +GL+GLGGG +S++ Q
Sbjct: 179 GSHTSGDLAVDTLTIGSTTGRPVSVPKVVFGCGHNNGGTFELHGSGLVGLGGGPLSMISQ 238
Query: 292 LGGQTGGAFSYCLVSRGTGSSGS--LVFGREALPVGAAWVPLVRNPRAP-SFYYVGLSGL 348
L GG FSYCLV G S S + FG + GA V R P +FYY+ L +
Sbjct: 239 LRPLIGGRFSYCLVPLGNDPSVSSKMHFGSRGIVSGAGAVSTPLASRQPDTFYYLTLESM 298
Query: 349 GVGGMRIP---ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA 405
VG ++ S+ L + +++D+GT +T LP Y V+ G P
Sbjct: 299 SVGSKKLAYKGFSKVGSPLADADEGNIIIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVR 358
Query: 406 SGVSIFDTCY-NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS 464
++F CY NLSG +R+PT++ +F G + P + F+ +D FCFA P S
Sbjct: 359 DPNNVFSLCYSNLSG---LRIPTITAHFVGADLELKPLNTFVQVQEDL--FCFAMIPV-S 412
Query: 465 GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L+I GN+ Q + +D + V F P C
Sbjct: 413 DLAIFGNLAQMNFLVGYDLKSRTVSFKPTDC 443
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 206 bits (523), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 141/401 (35%), Positives = 204/401 (50%), Gaps = 37/401 (9%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAK-HEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
M+R R + +R LSG A++ + H VQ EY + + +G+PP
Sbjct: 48 MRRAAHR--SRLRALSGYDANSPRLHSVQV--------------EYLMELAIGTPPVPFV 91
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC---DRLENAGCHAG 227
+ D+GSD+ W QCQPC C+ Q PV+DP+ S++FS V CSSA C R N +
Sbjct: 92 ALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSATCLPVLRSRNCSTPSS 151
Query: 228 RCRYEVSYGDGSYTKGTLALETLTIGRTV------VKNVAIGCGHKNQGMFVGAAGLLGL 281
CRY SY DG+Y+ G L ETLT+G +V V +VA GCG N G + + G +GL
Sbjct: 152 LCRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVSVSDVAFGCGTDNGGDSLNSTGTVGL 211
Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVS--RGTGSSGSLVFGREALPVGAAWV---PLVRNPR 336
G G++SL+ QLG G FSYCL T S L+ L G V PL+++P
Sbjct: 212 GRGTLSLLAQLG---VGKFSYCLTDFFNSTLDSPFLLGTLAELAPGPGAVQSTPLLQSPL 268
Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV 396
PS Y V L G+ +G +R+PI F L G+V+D+GT + LP + D V
Sbjct: 269 NPSRYVVSLQGITLGDVRLPIPNKTFDLHANSTGGMVVDSGTTFSILPESGFRVVVD-HV 327
Query: 397 AQTGNLPRASGVSIFDTCYNL-SGFVSVR-VPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
AQ P + S+ C+ +G + +P + +F+GG + L N++ + +
Sbjct: 328 AQVLGQPPVNASSLDSPCFPAPAGERQLPFMPDLVLHFAGGADMRLHRDNYMSYNQEDSS 387
Query: 455 FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
FC + S S++GN QQ+ IQ+ FD G + F P C
Sbjct: 388 FCLNIVGTTSTWSMLGNFQQQNIQMLFDMTVGQLSFLPTDC 428
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 129/344 (37%), Positives = 189/344 (54%), Gaps = 20/344 (5%)
Query: 160 IGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
+G+G+P MV+D+GS + W+QC PC C++QS PVF+P S++++ V CS+ C
Sbjct: 1 MGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSD 60
Query: 219 LENA-----GCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
L +A C + C Y+ SYGD S++ G L+ +T++ G T + N GCG N+G+F
Sbjct: 61 LPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGSTSLPNFYYGCGQDNEGLF 120
Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLV 332
+AGL+GL +SL+ QL G +F+YCL S + SL P ++ P+V
Sbjct: 121 GRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSSSGYLSLGSYN---PGQYSYTPMV 177
Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
+ S Y++ LSG+ V G P+S + + ++D+GT +TRLPT Y A
Sbjct: 178 SSSLDDSLYFIKLSGMTVAGN--PLSVSSSAYSSL---PTIIDSGTVITRLPTSVYSALS 232
Query: 393 DAFVAQTGNLPRASGVSIFDTCYNLSGFVS-VRVPTVSFYFSGGPVLTLPASNFLIPVDD 451
A A RAS SI DTC+ G S V P V+ F+GG L L A N L+ VDD
Sbjct: 233 KAVAAAMKGTSRASAYSILDTCFK--GQASRVSAPAVTMSFAGGAALKLSAQNLLVDVDD 290
Query: 452 AGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ T C AFAP+ S +IIGN QQ+ + +D + +GF C
Sbjct: 291 STT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKSSRIGFAAGGC 332
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 205 bits (522), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 126/353 (35%), Positives = 187/353 (52%), Gaps = 37/353 (10%)
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC------ 224
+++D+GSD+ WVQC+PCS CY Q DP+FDP+ SAS++ V C+++ C+ A
Sbjct: 179 VIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSC 238
Query: 225 ----------HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVG 274
+ RC Y ++YGDGS+++G LA +T+ +G V GCG N+G+F G
Sbjct: 239 ATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGASVDGFVFGCGLSNRGLFGG 298
Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG-SSGSLVFG------REALPVGAA 327
AGL+GLG +SLV Q + GG FSYCL + +G ++GSL G R A PV +
Sbjct: 299 TAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGDTSSYRNATPV--S 356
Query: 328 WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPA 387
+ ++ +P P FY++ ++G VGG + +G V++D+GT +TRL
Sbjct: 357 YTRMIADPAQPPFYFMNVTGASVGGAAV-------AAAGLGAANVLLDSGTVITRLAPSV 409
Query: 388 YEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNF 445
Y A R F Q G P A S+ D CYNL+G V+VP ++ GG +T+ A+
Sbjct: 410 YRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEGGADMTVDAAGM 469
Query: 446 L-IPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L + D C A A IIGN QQ+ ++ +D +GF C
Sbjct: 470 LFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 522
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 124/353 (35%), Positives = 184/353 (52%), Gaps = 37/353 (10%)
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC------ 224
+++D+GSD+ WVQC+PCS CY Q DP+FDP+ SAS++ V C+++ C+ A
Sbjct: 178 VIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSC 237
Query: 225 ----------HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVG 274
+ RC Y ++YGDGS+++G LA +T+ +G V GCG N+G+F G
Sbjct: 238 ATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGASVDGFVFGCGLSNRGLFGG 297
Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG-SSGSLVFG------REALPVGAA 327
AGL+GLG +SLV Q + GG FSYCL + +G ++GSL G R A PV +
Sbjct: 298 TAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGDTSSYRNATPV--S 355
Query: 328 WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPA 387
+ ++ +P P FY++ ++G V +G V++D+GT +TRL
Sbjct: 356 YTRMIADPAQPPFYFMNVTGASV-------GGAAVAAAGLGAANVLLDSGTVITRLAPSV 408
Query: 388 YEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNF 445
Y A R F Q G P A S+ D CYNL+G V+VP ++ GG +T+ A+
Sbjct: 409 YRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEGGADMTVDAAGM 468
Query: 446 L-IPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L + D C A A IIGN QQ+ ++ +D +GF C
Sbjct: 469 LFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 521
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 205 bits (521), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 133/367 (36%), Positives = 194/367 (52%), Gaps = 28/367 (7%)
Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSG 208
+ G+G Y + + VG+PP + +ID+GSD+ W QC PC+ C+ Q P++DPA S++FS
Sbjct: 90 ENGAGAYHMILSVGTPPLAFPAIIDTGSDLTWTQCAPCTTACFAQPTPLYDPARSSTFSK 149
Query: 209 VSCSSAVCDRLENA--GCHAGRCRYEVSYGDGSYTKGTLALETLTI--------GRTVVK 258
+ C+S +C L +A C+A C Y+ Y G +T G LA +TL I +
Sbjct: 150 LPCASPLCQALPSAFRACNATGCVYDYRYAVG-FTAGYLAADTLAIGDGDGDGDASSSFA 208
Query: 259 NVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG 318
VA GC N G GA+G++GLG ++SL+ Q+G G FSYCL S + ++FG
Sbjct: 209 GVAFGCSTANGGDMDGASGIVGLGRSALSLLSQIGV---GRFSYCLRSDADAGASPILFG 265
Query: 319 REALPVG--AAWVPLVRNP-----RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
A G L+RNP RAP +YYV L+G+ VG +P++ F T G G
Sbjct: 266 ALANVTGDKVQSTALLRNPVAARRRAP-YYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGG 324
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQT-GNLPRASGVSI-FDTCYNLSGFVSVRVPTVS 429
V++D+GT T L Y R AF++QT G L R SG FD C+ +G VP +
Sbjct: 325 VIVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDLCFE-AGAADTPVPRLV 383
Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTF-CFAFAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
F F+GG +P ++ VD+ G C P+ G+S+IGN+ Q + + +D
Sbjct: 384 FRFAGGAEYAVPRQSYFDAVDEGGRVACLLVLPT-RGVSVIGNVMQMDLHVLYDLDGATF 442
Query: 489 GFGPNVC 495
F P C
Sbjct: 443 SFAPADC 449
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 205 bits (521), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 138/389 (35%), Positives = 191/389 (49%), Gaps = 30/389 (7%)
Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQS 194
+ V + VVSG GSG+YFV + +G PP+S ++ D+GSD+VWV+C C C S
Sbjct: 62 RKPVPFVKSPVVSGASSGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHS 121
Query: 195 DP-VFDPADSASFSGVSCSSAVCDRLENAG----CHAGR----CRYEVSYGDGSYTKGTL 245
VF P S++FS C VC + G C+ R C YE Y DGS T G
Sbjct: 122 PATVFFPRHSSTFSPAHCYDPVCRLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLF 181
Query: 246 ALETLTI-----GRTVVKNVAIGCGHKNQGM------FVGAAGLLGLGGGSMSLVGQLGG 294
A ET ++ +K+VA GCG + G F GA G++GLG G +S QLG
Sbjct: 182 ARETTSLKTSSGKEAKLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGR 241
Query: 295 QTGGAFSYCLV--SRGTGSSGSLVFGREALPVGAA-WVPLVRNPRAPSFYYVGLSGLGVG 351
+ G FSYCL+ + + L+ G V + PL+ NP +P+FYYV L + V
Sbjct: 242 RFGNKFSYCLMDYTLSPPPTSYLIIGDGGDAVSKLFFTPLLTNPLSPTFYYVKLKSVFVN 301
Query: 352 GMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI- 410
G ++ I ++ + G+ G VMD+GT + L PAY A V Q LP A ++
Sbjct: 302 GAKLRIDPSIWEIDDSGNGGTVMDSGTTLAFLADPAYRLVIAA-VKQRIKLPNADELTPG 360
Query: 411 FDTCYNLSGFVSVR--VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF--APSPSGL 466
FD C N+SG +P + F FSGG V P N+ I ++ C A G
Sbjct: 361 FDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQ-CLAIQSVDPKVGF 419
Query: 467 SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
S+IGN+ Q+G FD +GF C
Sbjct: 420 SVIGNLMQQGFLFEFDRDRSRLGFSRRGC 448
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 204 bits (520), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 120/338 (35%), Positives = 188/338 (55%), Gaps = 20/338 (5%)
Query: 171 MVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH---- 225
M++D+GS + W+QCQPC+ C+ Q+DP++DP+ S ++ +SC+S C RL+ A +
Sbjct: 1 MILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLC 60
Query: 226 ---AGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGL 281
+ C Y SYGD S++ G L+ + LT+ + + GCG NQG+F AAG++GL
Sbjct: 61 ETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQTLPQFTYGCGQDNQGLFGRAAGIIGL 120
Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL-PVGAAWVPLVRNPRAPSF 340
+S++ QL + G AFSYCL + +GSSG ++ P + P++ + + PS
Sbjct: 121 ARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSL 180
Query: 341 YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVA-QT 399
Y++ L+ + V G + ++ ++R+ + +D+GT +TRLP Y A R AFV +
Sbjct: 181 YFLRLTAITVSGRPLDLAAAMYRVPTL------IDSGTVITRLPMSMYAALRQAFVKIMS 234
Query: 400 GNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF 459
+A SI DTC+ S VP + F GG LTL A + LI D G C AF
Sbjct: 235 TKYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLRAPSILIEADK-GITCLAF 293
Query: 460 APS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
A S + ++IIGN QQ+ I++D + +GF P C
Sbjct: 294 AGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 331
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 204 bits (519), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 130/359 (36%), Positives = 192/359 (53%), Gaps = 17/359 (4%)
Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG 208
++ G G Y + I VG+P + +V D+GSD++W QC PC++C++Q P F PA S++FS
Sbjct: 79 LENGVGGYNMNISVGTPLLTFPVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSK 138
Query: 209 VSCSSAVCDRLENA--GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH 266
+ C+S+ C L N+ C+A C Y YG G YT G LA ETL +G +VA GC
Sbjct: 139 LPCTSSFCQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATETLKVGDASFPSVAFGCST 197
Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-LPVG 325
+N G+ +G+ GLG G++SL+ QLG G FSYCL S + ++FG A L G
Sbjct: 198 EN-GVGNSTSGIAGLGRGALSLIPQLG---VGRFSYCLRSGSAAGASPILFGSLANLTDG 253
Query: 326 AAW-VPLVRNPRA-PSFYYVGLSGLGVGGMRIPISEDLFRLTQMG-DDGVVMDTGTAVTR 382
P V NP PS+YYV L+G+ VG +P++ F TQ G G ++D+GT +T
Sbjct: 254 NVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTY 313
Query: 383 LPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLS-GFVSVRVPTVSFYFSGGPVLTLP 441
L YE + AF++QT N+ +G D C+ + G + VP++ F GG +P
Sbjct: 314 LAKDGYEMVKQAFLSQTANVTTVNGTRGLDLCFKSTGGGGGIAVPSLVLRFDGGAEYAVP 373
Query: 442 ASNFLIPVDDAGTF---CFAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ D G+ C P+ +S+IGN+ Q + + +D G F P C
Sbjct: 374 TYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFSPADC 432
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 204 bits (519), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 122/380 (32%), Positives = 195/380 (51%), Gaps = 29/380 (7%)
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC-YKQSDPV 197
F + V+SG GSG+YFV + +G+PP++ +V D+GSD++WV+C PC C ++
Sbjct: 69 NSFRSPVISGASSGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSPGSA 128
Query: 198 FDPADSASFSGVSCSSAVCDRLENA---GCHAGR----CRYEVSYGDGSYTKGTLALETL 250
F S ++S + C S C + + C+ R CRY+ +Y D S T G + E L
Sbjct: 129 FFARHSTTYSAIHCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEAL 188
Query: 251 TIGRTVVK-----NVAIGCGHKNQG------MFVGAAGLLGLGGGSMSLVGQLGGQTGGA 299
T+ + K ++ GCG + G F GA G++GLG +S QLG + G
Sbjct: 189 TLNTSTGKVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSK 248
Query: 300 FSYCLVS---RGTGSSGSLVFGREALPVGA----AWVPLVRNPRAPSFYYVGLSGLGVGG 352
FSYCL+ +S + G + + V ++ PL+ NP +P+FYY+ + G+ V G
Sbjct: 249 FSYCLMDYTLSPPPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNG 308
Query: 353 MRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD 412
+++PI+ ++ + +G+ G ++D+GT +T + PAY AF + A FD
Sbjct: 309 VKLPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPGFD 368
Query: 413 TCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP--SPSGLSIIG 470
C N+SG +P +SF +GG V + P N+ I D C A P G S++G
Sbjct: 369 LCMNVSGVTRPALPRMSFNLAGGSVFSPPPRNYFIETGDQ-IKCLAVQPVSQDGGFSVLG 427
Query: 471 NIQQEGIQISFDGANGFVGF 490
N+ Q+G + FD +GF
Sbjct: 428 NLMQQGFLLEFDRDKSRLGF 447
>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 204 bits (519), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 146/458 (31%), Positives = 225/458 (49%), Gaps = 53/458 (11%)
Query: 59 ELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKR 118
+ F + ++ + S + W L H S + ++ N S + D +R
Sbjct: 44 KTFCSGHKVAPGDVPSPNSTWA-PLHHLYGPCSPAPSSANSTAADVAASMADMVDDDQRR 102
Query: 119 VATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPR----------- 167
+ +RL+G A + Q + + +G+Y G+GS P
Sbjct: 103 ADYIQKRLTG-----ATDDKQPMAFSSRTSQYEKNGQYATNGGLGSVPHLKSLSTTATTN 157
Query: 168 ---------SQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVC 216
+Q ++IDSGSD+ WVQC+PC C++Q DP+FDPA S +++ V C+SA C
Sbjct: 158 SAPDGTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAAC 217
Query: 217 DRL--ENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQG-- 270
+L GC A +C++ ++YGDGS GT + + LT+G V++ GC H ++G
Sbjct: 218 AQLGPYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSA 277
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG----REALPVGA 326
AG L LGGGS SLV Q + G FSYCL + S G LV G R L
Sbjct: 278 FDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTAS-SLGFLVLGVPPERAQLIPSF 336
Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
PL+ + AP+FY V L + V G + + +F + V+D+ T ++RLP
Sbjct: 337 VSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASS------VIDSSTIISRLPPT 390
Query: 387 AYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
AY+A R AF + A VSI DTCY+ +G S+ +P+++ F GG + L A+ L
Sbjct: 391 AYQALRAAFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGIL 450
Query: 447 IPVDDAGTFCFAFAPSPSGL--SIIGNIQQEGIQISFD 482
+ G+ C AFAP+ S IGN+QQ+ +++ +D
Sbjct: 451 L-----GS-CLAFAPTASDRMPGFIGNVQQKTLEVVYD 482
>gi|297737850|emb|CBI27051.3| unnamed protein product [Vitis vinifera]
Length = 256
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 110/216 (50%), Positives = 152/216 (70%), Gaps = 5/216 (2%)
Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQS 194
K + T +VSG QGSGEYF R+G+GSPP+ YMV+D+GSD+ WVQC PC+ CY+Q+
Sbjct: 32 KTIAEALETPLVSGASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQA 91
Query: 195 DPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI-G 253
DP+F+P+ S+S++ ++C + C L+ + C C YEVSYGDGSYT G A ET+T+ G
Sbjct: 92 DPIFEPSFSSSYAPLTCETHQCKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETITLDG 151
Query: 254 RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSG 313
+ NVAIGCGH N+G+FVGAAGLLGLGGGS+S Q+ +FSYCLV+R T S+
Sbjct: 152 SASLNNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQIN---ASSFSYCLVNRDTDSAS 208
Query: 314 SLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLG 349
+L F +P + PL+RN + +FYY+G++G+G
Sbjct: 209 TLEF-NSPIPSHSVTAPLLRNNQLDTFYYLGMTGIG 243
>gi|14532550|gb|AAK64003.1| AT3g61820/F15G16_210 [Arabidopsis thaliana]
Length = 362
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 119/275 (43%), Positives = 167/275 (60%), Gaps = 20/275 (7%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLS-GGGADAAKHEVQ 139
+ L H D +SS S+ + F+ R+QRD RV ++ + G +A K +
Sbjct: 63 VHLSHVDALSSFSDAS-------PADLFNLRLQRDSLRVKSITSLAAVSTGRNATKRTPR 115
Query: 140 D---FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP 196
F V+SG+ QGSGEYF+R+GVG+P + YMV+D+GSD+VW+QC PC CY Q+D
Sbjct: 116 TAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDA 175
Query: 197 VFDPADSASFSGVSCSSAVCDRLENAG-CHAGR---CRYEVSYGDGSYTKGTLALETLTI 252
+FDP S +F+ V C S +C RL+++ C R C Y+VSYGDGS+T+G + ETLT
Sbjct: 176 IFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTF 235
Query: 253 GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----- 307
V +V +GCGH N+G+FVGAAGLLGLG G +S Q + G FSYCLV R
Sbjct: 236 HGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGS 295
Query: 308 GTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYY 342
+ ++VFG A+P + + PL+ NP+ +FYY
Sbjct: 296 SSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYY 330
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 203 bits (517), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 129/360 (35%), Positives = 191/360 (53%), Gaps = 18/360 (5%)
Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG 208
++ G G Y + I VG+P + +V D+GSD++W QC PC++C++Q P F PA S++FS
Sbjct: 79 LENGVGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSK 138
Query: 209 VSCSSAVCDRLENA--GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH 266
+ C+S+ C L N+ C+A C Y YG G YT G LA ETL +G +VA GC
Sbjct: 139 LPCTSSFCQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATETLKVGDASFPSVAFGCST 197
Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-LPVG 325
+N G+ +G+ GLG G++SL+ QLG G FSYCL S + ++FG A L G
Sbjct: 198 EN-GVGNSTSGIAGLGRGALSLIPQLG---VGRFSYCLRSGSAAGASPILFGSLANLTDG 253
Query: 326 AAW-VPLVRNPRA-PSFYYVGLSGLGVGGMRIPISEDLFRLTQMG-DDGVVMDTGTAVTR 382
P V NP PS+YYV L+G+ VG +P++ F TQ G G ++D+GT +T
Sbjct: 254 NVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTY 313
Query: 383 LPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYN--LSGFVSVRVPTVSFYFSGGPVLTL 440
L YE + AF++QT ++ +G D C+ G + VP++ F GG +
Sbjct: 314 LAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFDGGAEYAV 373
Query: 441 PASNFLIPVDDAGTF---CFAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
P + D G+ C P+ +S+IGN+ Q + + +D G F P C
Sbjct: 374 PTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFAPADC 433
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 145/428 (33%), Positives = 217/428 (50%), Gaps = 51/428 (11%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
+ LVHR + + + + SF +R R + +VR G H
Sbjct: 22 VPLVHRHGPCAPAPSLST-----DTRSFADIFRRSRARPSYIVR---GKKVSVPAH---- 69
Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVF 198
GT V+S EY VR+ G+P Q +VID+GSD+ W+QC+PCS QC+ Q DP++
Sbjct: 70 LGTSVMSL------EYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLY 123
Query: 199 DPADSASFSGVSCSSAVCDRLE----NAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIG 253
DP+ S+++S V C+S VC +L +GC +G+ C + +SY DG+ T G + + LT+
Sbjct: 124 DPSHSSTYSAVPCASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTLA 183
Query: 254 R-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS 312
+V+N GCGH + G+LGLG L LG + GG FSYCL S +
Sbjct: 184 PGAIVQNFYFGCGHGKHAVRGLFDGVLGLG----RLRESLGARYGGVFSYCLPSV-SSKP 238
Query: 313 GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
G L G P G + P+ P P+F V L+G+ VGG ++ + F G+
Sbjct: 239 GFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAF------SGGM 292
Query: 373 VMDTGTAVTRLPTPAYEAFRDAF---VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVS 429
++D+GT +T L + AY A R AF + LP DTCYNL+G+ +V VP ++
Sbjct: 293 IVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGD----LDTCYNLTGYKNVVVPKIA 348
Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS-PSGLS-IIGNIQQEGIQISFDGANGF 487
F+GG + L N ++ V+ C AFA S P G + ++GN+ Q ++ FD +
Sbjct: 349 LTFTGGATINLDVPNGIL-VNG----CLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSK 403
Query: 488 VGFGPNVC 495
GF C
Sbjct: 404 FGFRAKAC 411
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 202 bits (514), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 137/448 (30%), Positives = 209/448 (46%), Gaps = 48/448 (10%)
Query: 68 SSSNTSSDEAR-----WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATL 122
S S SS EAR ++++L+HRD SS ++ + R+ R +
Sbjct: 13 SLSTLSSREAREGLRGFSVDLIHRDSPSSP--------FYNPSLTPSERIINAALRSMSR 64
Query: 123 VRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWV 182
++R+S + E S + GEY +R +GSPP + ++D+GS ++W+
Sbjct: 65 LQRVSHFLDENKLPE---------SLLIPDKGEYLMRFYIGSPPVERLAMVDTGSSLIWL 115
Query: 183 QCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA--GC-HAGRCRYEVSYGDGS 239
QC PC C+ Q P+F+P S+++ +C S C L+ + C G+C Y + YGD S
Sbjct: 116 QCSPCHNCFPQETPLFEPLKSSTYKYATCDSQPCTLLQPSQRDCGKLGQCIYGIMYGDKS 175
Query: 240 YTKGTLALETLTIGRT------VVKNVAIGCGHKNQGMFVGA---AGLLGLGGGSMSLVG 290
++ G L ETL+ G T N GCG N + G+ GLG G +SLV
Sbjct: 176 FSVGILGTETLSFGSTGGAQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVS 235
Query: 291 QLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV--GAAWVPLVRNPRAPSFYYVGLSGL 348
QLG Q G FSYCL+ + S+ L FG EA+ G PL+ P P++Y++ L +
Sbjct: 236 QLGAQIGHKFSYCLLPYDSTSTSKLKFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAV 295
Query: 349 GVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV 408
+G + T D +V+D+GT +T L Y F + G
Sbjct: 296 TIGQKVVS--------TGQTDGNIVIDSGTPLTYLENTFYNNFVASLQETLGVKLLQDLP 347
Query: 409 SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS-GLS 467
S TC+ ++ +P ++F F+G V P N LIP+ D+ C A PS G+S
Sbjct: 348 SPLKTCF--PNRANLAIPDIAFQFTGASVALRP-KNVLIPLTDSNILCLAVVPSSGIGIS 404
Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ G+I Q Q+ +D V F P C
Sbjct: 405 LFGSIAQYDFQVEYDLEGKKVSFAPTDC 432
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 202 bits (514), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 143/416 (34%), Positives = 211/416 (50%), Gaps = 52/416 (12%)
Query: 102 HRHQHSFHA-RMQRDVKRVATLVRR--------LSGGGADAAKHEVQDFGTDVVSGMDQG 152
HRH A + D + A + RR + G H GT V+S
Sbjct: 60 HRHGPCAPAPSLSTDTRSFADIFRRSRARPSYIVRGKKVSVPAH----LGTSVMSL---- 111
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVS 210
EY VR+ G+P Q +VID+GSD+ W+QC+PCS QC+ Q DP++DP+ S+++S V
Sbjct: 112 --EYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVP 169
Query: 211 CSSAVCDRLE----NAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGC 264
C+S VC +L +GC +G+ C + +SY DG+ T G + + LT+ +V+N GC
Sbjct: 170 CASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTLAPGAIVQNFYFGC 229
Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
GH + G+LGLG L LG + GG FSYCL S + G L G P
Sbjct: 230 GHGKHAVRGLFDGVLGLG----RLRESLGARYGGVFSYCLPSV-SSKPGFLALGAGKNPS 284
Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
G + P+ P P+F V L+G+ VGG ++ + F G+++D+GT +T L
Sbjct: 285 GFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAF------SGGMIVDSGTVITGLQ 338
Query: 385 TPAYEAFRDAF---VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
+ AY A R AF + LP DTCYNL+G+ +V VP ++ F+GG + L
Sbjct: 339 STAYRALRSAFRKAMEAYRLLPNGD----LDTCYNLTGYKNVVVPKIALTFTGGATINLD 394
Query: 442 ASNFLIPVDDAGTFCFAFAPS-PSGLS-IIGNIQQEGIQISFDGANGFVGFGPNVC 495
N ++ V+ C AFA S P G + ++GN+ Q ++ FD + GF C
Sbjct: 395 VPNGIL-VNG----CLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 445
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 202 bits (514), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 146/428 (34%), Positives = 209/428 (48%), Gaps = 33/428 (7%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+++E++HRD S + + +RVA VRR G K V
Sbjct: 31 FSVEMIHRDSSRSP---------------LYRPTETPFQRVANAVRRSINRGNHFKKAFV 75
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
+ S + GEY +R VGSPP ++D+GSDI+W+QC+PC CYKQ+ P+F
Sbjct: 76 STDSAE--STVVASQGEYLMRYSVGSPPFQVLGIVDTGSDILWLQCEPCEDCYKQTTPIF 133
Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVV 257
DP+ S ++ + CSS C+ L N C + C Y + YGDGS++ G L++ETLT+G T
Sbjct: 134 DPSKSKTYKTLPCSSNTCESLRNTACSSDNVCEYSIDYGDGSHSDGDLSVETLTLGSTDG 193
Query: 258 KNV-----AIGCGHKNQGMFVGAAGLLGLGGGS-MSLVGQLGGQTGGAFSYCL--VSRGT 309
+V IGCGH N G F + GG +SL+ QL GG FSYCL + +
Sbjct: 194 SSVHFPKTVIGCGHNNGGTFQEEGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPIFSES 253
Query: 310 GSSGSLVFGREALPVGAAWVPLVRNP-RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
SS L FG A+ G V +P FY++ L VG RI S + G
Sbjct: 254 NSSSKLNFGDAAVVSGRGTVSTPLDPLNGQVFYFLTLEAFSVGDNRIEFSGSSSSGSGSG 313
Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLSGFVSVRVPT 427
D +++D+GT +T LP Y A V+ L RA S + CY + + +P
Sbjct: 314 DGNIIIDSGTTLTLLPQEDYLNLESA-VSDVIKLERARDPSKLLSLCYKTTS-DELDLPV 371
Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGF 487
++ +F G V P S F +PV + G CFAF S G +I GN+ Q+ + + +D
Sbjct: 372 ITAHFKGADVELNPISTF-VPV-EKGVVCFAFISSKIG-AIFGNLAQQNLLVGYDLVKKT 428
Query: 488 VGFGPNVC 495
V F P C
Sbjct: 429 VSFKPTDC 436
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 136/369 (36%), Positives = 193/369 (52%), Gaps = 28/369 (7%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
SG Y + I +GSPP+ ++D+GSD+VW+QC+PCSQCY QSDP++DP+ S++F+ SCS
Sbjct: 1 SGAYTMEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSDPIYDPSASSTFAKTSCS 60
Query: 213 SAVCDRLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIGCG 265
++ C L +GC A C Y YGD S T+G ALETLT+ N GCG
Sbjct: 61 TSSCQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGCG 120
Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS--LVFGREALP 323
N G F GAAG++GLG G +SL QLG FSYCLV SS + L+FG A
Sbjct: 121 RLNSGSFGGAAGIVGLGQGKISLSTQLGSAINNKFSYCLVDFDDDSSKTSPLIFGSSAST 180
Query: 324 -VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF-------------RLTQMGD 369
GA P++ N ++Y+VGL G+ VGG ++ ++ R ++
Sbjct: 181 GSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRALEVNS 240
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTV 428
G + D+GT +T L Y + AF A + +LP S FD CY++S + + P +
Sbjct: 241 GGTIFDSGTTLTLLDDAVYSKVKSAF-ASSVSLPTVDASSSGFDLCYDVSKSKNFKFPAL 299
Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTF-CFAF-APSPSGLSIIGNIQQEGIQISFDGANG 486
+ F G + P N+ + VD A T C A GL IIGN+ Q+ + +D
Sbjct: 300 TLAFKGTK-FSPPQKNYFVIVDTAETVACLAMGGSGSLGLGIIGNLMQQNYHVVYDRGTS 358
Query: 487 FVGFGPNVC 495
+ P C
Sbjct: 359 TISMSPAQC 367
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 144/400 (36%), Positives = 215/400 (53%), Gaps = 22/400 (5%)
Query: 101 YHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRI 160
+ ++ + M ++ A +R L ++ QD +V + GSGEY +++
Sbjct: 66 FRPPNRTWESLMSEKIRGDANRLRFLK----RTSRSSKQDANANV--PVRSGSGEYIIQV 119
Query: 161 GVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLE 220
G+P +S Y +ID+GSD+ W+ C+ C C+ + P+FDPA S+S+ +C S C +
Sbjct: 120 DFGTPKQSMYTLIDTGSDVAWIPCKQCQGCHSTA-PIFDPAKSSSYKPFACDSQPCQEIS 178
Query: 221 NAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLG 280
+C++EVSYGDG+ GTLA + +T+G + N + GC + GL+G
Sbjct: 179 GNCGGNSKCQFEVSYGDGTQVDGTLASDAITLGSQYLPNFSFGCAESLSEDTSPSPGLMG 238
Query: 281 LGGGSMSLVGQLGGQT--GGAFSYCLVSRGTGSSGSLVFGREALPVGAA--WVPLVRNPR 336
LGGGS+SL+ Q GG FSYCL + SSGSLV G+EA ++ + L+++P
Sbjct: 239 LGGGSLSLLTQAPTAELFGGTFSYCL-PSSSTSSGSLVLGKEAAVSSSSLKFTTLIKDPS 297
Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD-DGVVMDTGTAVTRLPTPAYEAFRDAF 395
P+FY+V L + VG RI + T + G ++D+GT +T L AY A RDAF
Sbjct: 298 IPTFYFVTLKAISVGNTRISVPG-----TNIASGGGTIIDSGTTITHLVPSAYTALRDAF 352
Query: 396 VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF 455
Q +L + + V DTCY+LS SV VPT++ + L LP N LI ++G
Sbjct: 353 RQQLSSL-QPTPVEDMDTCYDLSS-SSVDVPTITLHLDRNVDLVLPKENILI-TQESGLA 409
Query: 456 CFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C AF+ S SIIGN+QQ+ +I FD N VGF C
Sbjct: 410 CLAFS-STDSRSIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448
>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
Length = 487
Score = 201 bits (511), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 138/348 (39%), Positives = 178/348 (51%), Gaps = 27/348 (7%)
Query: 161 GVGSPPRSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
+ P +Q M ID+ D+ W+QC PC +CY Q + +FDP S + + V C SA C
Sbjct: 154 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 213
Query: 219 L--ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-RTVVKNVAIGCGHKNQGMF-VG 274
L AGC +C+Y V YGDG T GT ++ LT+ TVV N GC H +G F
Sbjct: 214 LGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSAS 273
Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA---AWVPL 331
+G + LGGG SL+ Q G AFSYC+ SSG L G A GA A PL
Sbjct: 274 TSGTMSLGGGRQSLLSQTAATFGNAFSYCVPD--PSSSGFLSLGGPADGGGAGRFARTPL 331
Query: 332 VRNPRA-PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
VRNP P+ Y V L G+ VGG R+ + +F G VMD+ +T+LP AY A
Sbjct: 332 VRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVF------AGGAVMDSSVIITQLPPTAYRA 385
Query: 391 FRDAFVAQTGNLPR-ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
R AF + PR A G + DTCY+ F SV VP VS F GG V+ L A ++
Sbjct: 386 LRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV-- 443
Query: 450 DDAGTFCFAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C AF P+P L IGN+QQ+ ++ +D G VGF C
Sbjct: 444 ----EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 487
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 134/388 (34%), Positives = 199/388 (51%), Gaps = 37/388 (9%)
Query: 143 TDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP--VFDP 200
+ ++SG GSG+YFV I +GSPP++ +V D+GSD+ WV+C C P F
Sbjct: 70 SPLMSGASSGSGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLA 129
Query: 201 ADSASFSGVSCSSAVCDRLENAG---CHAGR----CRYEVSYGDGSYTKGTLALETLTI- 252
S +FS C S++C + C+ R CRYE Y DGS T G + ET T+
Sbjct: 130 RHSTTFSPTHCFSSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLN 189
Query: 253 ---GRTV-VKNVAIGCGHKNQG------MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
GR + +K++A GCG G F GA+G++GLG G +S QLG + G +FSY
Sbjct: 190 TSSGREMKLKSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRSFSY 249
Query: 303 CLVSRGTGSS-------GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
CL+ G +V ++ ++ PL+ NP AP+FYY+ + G+ V G+++
Sbjct: 250 CLLDYTLSPPPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKL 309
Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR-----ASGVSI 410
I ++ L ++G+ G V+D+GT +T L PAY AF + LP AS S
Sbjct: 310 HIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREV-KLPSPTPGGASTRSG 368
Query: 411 FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP--SPSG-LS 467
FD C N++G R P +S G + + P N+ I + + G C A P + SG S
Sbjct: 369 FDLCVNVTGVSRPRFPRLSLELGGESLYSPPPRNYFIDISE-GIKCLAIQPVEAESGRFS 427
Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
+IGN+ Q+G + FD +GF C
Sbjct: 428 VIGNLMQQGFLLEFDRGKSRLGFSRRGC 455
>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
Length = 471
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 138/348 (39%), Positives = 178/348 (51%), Gaps = 27/348 (7%)
Query: 161 GVGSPPRSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
+ P +Q M ID+ D+ W+QC PC +CY Q + +FDP S + + V C SA C
Sbjct: 138 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 197
Query: 219 L--ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-RTVVKNVAIGCGHKNQGMF-VG 274
L AGC +C+Y V YGDG T GT ++ LT+ TVV N GC H +G F
Sbjct: 198 LGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSAS 257
Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA---AWVPL 331
+G + LGGG SL+ Q G AFSYC+ SSG L G A GA A PL
Sbjct: 258 TSGTMSLGGGRQSLLSQTAATFGNAFSYCVPD--PSSSGFLSLGGPADGGGAGRFARTPL 315
Query: 332 VRNPRA-PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
VRNP P+ Y V L G+ VGG R+ + +F G VMD+ +T+LP AY A
Sbjct: 316 VRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAYRA 369
Query: 391 FRDAFVAQTGNLPR-ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
R AF + PR A G + DTCY+ F SV VP VS F GG V+ L A ++
Sbjct: 370 LRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV-- 427
Query: 450 DDAGTFCFAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C AF P+P L IGN+QQ+ ++ +D G VGF C
Sbjct: 428 ----EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 471
>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 475
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 132/340 (38%), Positives = 164/340 (48%), Gaps = 25/340 (7%)
Query: 169 QYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENAGC 224
Q M ID+ D+ W+QC PC QCY Q DP+FDP S++ + V C S C L GC
Sbjct: 148 QTMAIDTTVDVPWIQCAPCPIPQCYPQRDPLFDPTTSSTAAAVRCRSPACRSLGPYGNGC 207
Query: 225 HA----GRCRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGMFVG-AAGL 278
CRY + Y D T GT +TLTI G T V+N GC H +G F AG
Sbjct: 208 SNRSANAECRYLIEYSDDRATAGTYMTDTLTISGTTAVRNFRFGCSHAVRGRFSDLTAGT 267
Query: 279 LGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA---AWVPLVRNP 335
+ LGGG+ SL+ Q G AFSYC+ +SG L G A A PLVR+
Sbjct: 268 MSLGGGAQSLLAQTARSLGNAFSYCVPQ--ASASGFLSIGGPATTNSTTVFATTPLVRSA 325
Query: 336 RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF 395
PS Y V L G+ V G R+ I F G VMD+ +T+LP AY A R AF
Sbjct: 326 INPSLYLVRLQGIVVAGRRLGIPPVAF------SAGAVMDSSAVITQLPPTAYRALRRAF 379
Query: 396 VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF 455
PR+ DTCY+ G +VRVP VS F GG V+ L +I G
Sbjct: 380 RNAMRAYPRSGATGTLDTCYDFLGLTNVRVPAVSLVFGGGAVVVLDPPAVMI----GGCL 435
Query: 456 CFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
F S L IGN+QQ+ ++ +D A G VGF C
Sbjct: 436 AFTATSSDLALGFIGNVQQQTHEVLYDVAAGGVGFRRGAC 475
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 130/362 (35%), Positives = 188/362 (51%), Gaps = 32/362 (8%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
EY V + +G+PP+ + +D+GSD++W QCQPC C+ Q+ P FDP+ S++ S SC S
Sbjct: 81 EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDST 140
Query: 215 VCDRLENAGCHAGR------CRYEVSYGDGSYTKGTLALETLTI--GRTVVKNVAIGCGH 266
+C L A C + + C Y SYGD S T G L ++ T V VA GCG
Sbjct: 141 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGL 200
Query: 267 KNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF-------- 317
N G+F G+ G G G +SL QL G FS+C + +++
Sbjct: 201 FNNGVFKSNETGIAGFGRGPLSLPSQL---KVGNFSHCFTAVNGLKPSTVLLDLPADLYK 257
Query: 318 -GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
GR A+ PL++NP P+FYY+ L G+ VG R+P+ E F L + G G ++D+
Sbjct: 258 SGRGAV----QSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTL-KNGTGGTIIDS 312
Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR--VPTVSFYFSG 434
GTA+T LPT Y RDAF AQ LP SG + D + LS + + VP + +F G
Sbjct: 313 GTAMTSLPTRVYRLVRDAFAAQV-KLPVVSGNTT-DPYFCLSAPLRAKPYVPKLVLHFEG 370
Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGPN 493
+ LP N++ V+DAG+ A G ++ IGN QQ+ + + +D N + F P
Sbjct: 371 A-TMDLPRENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLSFVPA 429
Query: 494 VC 495
C
Sbjct: 430 QC 431
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 130/362 (35%), Positives = 188/362 (51%), Gaps = 32/362 (8%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
EY V + +G+PP+ + +D+GSD++W QCQPC C+ Q+ P FDP+ S++ S SC S
Sbjct: 81 EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDST 140
Query: 215 VCDRLENAGCHAGR------CRYEVSYGDGSYTKGTLALETLTI--GRTVVKNVAIGCGH 266
+C L A C + + C Y SYGD S T G L ++ T V VA GCG
Sbjct: 141 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGL 200
Query: 267 KNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF-------- 317
N G+F G+ G G G +SL QL G FS+C + +++
Sbjct: 201 FNNGVFKSNETGIAGFGRGPLSLPSQL---KVGNFSHCFTAVNGLKPSTVLLDLPADLYK 257
Query: 318 -GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
GR A+ PL++NP P+FYY+ L G+ VG R+P+ E F L + G G ++D+
Sbjct: 258 SGRGAV----QSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFAL-KNGTGGTIIDS 312
Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR--VPTVSFYFSG 434
GTA+T LPT Y RDAF AQ LP SG + D + LS + + VP + +F G
Sbjct: 313 GTAMTSLPTRVYRLVRDAFAAQV-KLPVVSGNTT-DPYFCLSAPLRAKPYVPKLVLHFEG 370
Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGPN 493
+ LP N++ V+DAG+ A G ++ IGN QQ+ + + +D N + F P
Sbjct: 371 A-TMDLPRENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLSFVPA 429
Query: 494 VC 495
C
Sbjct: 430 QC 431
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 153/447 (34%), Positives = 223/447 (49%), Gaps = 43/447 (9%)
Query: 69 SSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSG 128
++ +SD +R ++ L++R + ++ ++ S ++RD R ++R+ SG
Sbjct: 46 AAQVTSDPSRASMPLMYRHGPCAPASAAAT-----NRPSPAEMLRRDRARRNHILRKASG 100
Query: 129 GGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC- 187
+ G S +Y V +G G+P Q ++ID+GSD+ WVQCQPC
Sbjct: 101 ------RRITLGVSIPTSLGAFVDSLQYVVTLGFGTPAVPQVLLIDTGSDLSWVQCQPCN 154
Query: 188 -SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLE---------NAGCHAGRCRYEVSYGD 237
S CY Q DPVFDP+ S++++ V C S C L+ N+ A C+Y + YG+
Sbjct: 155 SSTCYPQKDPVFDPSASSTYAPVPCGSEACRDLDPDSYANGCTNSSSGASLCQYGIQYGN 214
Query: 238 GSYTKGTLALETLTI---GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGG 294
G T G + ETLT+ TVV N + GCG +G+F GLLGLGG SLV Q G
Sbjct: 215 GDTTVGVYSTETLTLSPEAATVVNNFSFGCGLVQKGVFDLFDGLLGLGGAPESLVSQTTG 274
Query: 295 QTGGAFSYCLVSRGTGSSGSLVFGREAL----PVGAAWVPLVRNPRAPSFYYVGLSGLGV 350
GGAFSYCL + G ++G L G A G + PL +FY V L+G+ V
Sbjct: 275 TYGGAFSYCLPA-GNSTAGFLALGAPATGGNNTAGFQFTPL--QVVETTFYLVKLTGISV 331
Query: 351 GGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP--RASGV 408
GG ++ I +F G+++D+GT VT LP AY A R AF + P +
Sbjct: 332 GGKQLDIEPTVFA------GGMIIDSGTIVTGLPETAYSALRTAFRSAMSAYPLLPPNDD 385
Query: 409 SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSI 468
DTCY+ +G +V VPTV+ F GG + L + ++ +D G F S I
Sbjct: 386 EDLDTCYDFTGNTNVTVPTVALTFEGGVTIDLDVPSGVL-LD--GCLAFVAGASDGDTGI 442
Query: 469 IGNIQQEGIQISFDGANGFVGFGPNVC 495
IGN+ Q ++ +D A G VGF C
Sbjct: 443 IGNVNQRTFEVLYDSARGHVGFRAGAC 469
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 127/364 (34%), Positives = 185/364 (50%), Gaps = 29/364 (7%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
EY V + +G+PP+ +++D+GSD+VW QC+PC C+ ++ DP++S++F + CSS
Sbjct: 414 EYLVHLAIGTPPQPVQLILDTGSDLVWTQCRPCPVCFSRALGPLDPSNSSTFDVLPCSSP 473
Query: 215 VCDRLENAGCHAGR-----CRYEVSYGDGSYTKGTLALETLTI------GRTVVKNVAIG 263
VCD L + C C Y +Y DGS T G L ET T G+ V ++A G
Sbjct: 474 VCDNLTWSSCGKHNWGNQTCVYVYAYADGSITTGHLDAETFTFAAADGTGQATVPDLAFG 533
Query: 264 CGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL 322
CG N G+F G+ G G G++SL QL FS+C + S++ G A
Sbjct: 534 CGLFNNGIFTSNETGIAGFGRGALSLPSQLKVDN---FSHCFTAITGSEPSSVLLGLPAN 590
Query: 323 PVGAA-----WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
A PLV+N + YY+ L G+ VG R+PI E F L Q G G ++D+G
Sbjct: 591 LYSDADGAVQSTPLVQNFSSLRAYYLSLKGITVGSTRLPIPESTFALKQDGTGGTIIDSG 650
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLP--RASGVSIFDTCYNLSGFVSVR--VPTVSFYFS 433
T +T LP AY+ DAF AQ LP A+ S+ C++ S + VP + +F
Sbjct: 651 TGMTTLPQDAYKLVHDAFTAQV-RLPVDNATSSSLSRLCFSFSVPRRAKPDVPKLVLHFE 709
Query: 434 GGPVLTLPASNFLIPVDDAG--TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFG 491
G L LP N++ +DAG C A + L+IIGN QQ+ + + +D + F
Sbjct: 710 GA-TLDLPRENYMFEFEDAGGSVTCLAIN-AGDDLTIIGNYQQQNLHVLYDLVRNMLSFV 767
Query: 492 PNVC 495
P C
Sbjct: 768 PAQC 771
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 134/436 (30%), Positives = 207/436 (47%), Gaps = 49/436 (11%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+++EL+HRD + S + Q + RR K+ +
Sbjct: 28 FSVELIHRDSLKSP---------------LYKPTQNKYQYFVDAARRSINRANHFYKYSL 72
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
+ V GEY + VG+PP Y ++D+GSDIVW+QC+PC +CY Q+ P+F
Sbjct: 73 ANIPQSTVI---PDIGEYLMTYSVGTPPFKLYGIVDTGSDIVWLQCEPCQECYNQTTPMF 129
Query: 199 DPADSASFSGVSCSSAVCDRLENAGCH-AGRCRYEVSYGDGSYTKGTLALETLTI----G 253
+P+ S+S+ + C S +C +E+ C+ C Y YGD S++ G L+++TLT+ G
Sbjct: 130 NPSKSSSYKNIPCPSKLCQSMEDTSCNDKNYCEYSTYYGDNSHSGGDLSVDTLTLESTNG 189
Query: 254 RTV-VKNVAIGCGHKNQGMFVGA-AGLLGLGGGSMSLVGQLGGQTGGAFSYCL------V 305
TV N+ IGCG N + GA +G++G G G S + QLG TGG FSYCL
Sbjct: 190 LTVSFPNIVIGCGTNNILSYEGASSGIVGFGSGPASFITQLGSSTGGKFSYCLTPLFSVT 249
Query: 306 SRGTGSSGSLVFGREALPVGAAWVP---LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
+ + ++ L FG A G V L ++P +FYY+ L VG R+ I
Sbjct: 250 NIQSNATSKLNFGDAATVSGDGVVTTPILKKDPE--TFYYLTLEAFSVGNRRVEIGG--- 304
Query: 363 RLTQMGDD--GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG-VSIFDTCYNLSG 419
GD+ +++D+GT +T L Y +F ++ V L R + CY++
Sbjct: 305 --VPNGDNEGNIIIDSGTTLTSLTKDDY-SFLESAVVDLVKLERVDDPTQTLNLCYSVKA 361
Query: 420 FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQI 479
P ++ +F G V P S F+ D G FC AF S +I GN+ Q+ + +
Sbjct: 362 -EGYDFPIITMHFKGADVDLHPISTFVSVAD--GVFCLAFESSQDH-AIFGNLAQQNLMV 417
Query: 480 SFDGANGFVGFGPNVC 495
+D V F P+ C
Sbjct: 418 GYDLQQKIVSFKPSDC 433
>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
Length = 720
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 145/454 (31%), Positives = 222/454 (48%), Gaps = 53/454 (11%)
Query: 59 ELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKR 118
+ F + ++ + S + W L H S + ++ N S + D +R
Sbjct: 44 KTFCSGHKVAPGDVPSPNSTWA-PLHHLYGPCSPAPSSANSTAADVAASMADMVDDDQRR 102
Query: 119 VATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPR----------- 167
+ +RL+G A + Q + + +G+Y G+GS P
Sbjct: 103 ADYIQKRLTG-----ATDDKQPMAFSSRTSQYEKNGQYATNGGLGSVPHLKSLSTTATTN 157
Query: 168 ---------SQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVC 216
+Q ++IDSGSD+ WVQC+PC C++Q DP+FDPA S +++ V C+SA C
Sbjct: 158 SAPDGTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAAC 217
Query: 217 DRL--ENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQG-- 270
+L GC A +C++ ++YGDGS GT + + LT+G V++ GC H ++G
Sbjct: 218 AQLGPYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSA 277
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG----REALPVGA 326
AG L LGGGS SLV Q + G FSYCL + S G LV G R L
Sbjct: 278 FDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTAS-SLGFLVLGVPPERAQLIPSF 336
Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
PL+ + AP+FY V L + V G + + +F + V+D+ T ++RLP
Sbjct: 337 VSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASS------VIDSSTIISRLPPT 390
Query: 387 AYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
AY+A R AF + A VSI DTCY+ +G S+ +P+++ F GG + L A+ L
Sbjct: 391 AYQALRAAFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGIL 450
Query: 447 IPVDDAGTFCFAFAPSPSGL--SIIGNIQQEGIQ 478
+ G+ C AFAP+ S IGN+QQ+ ++
Sbjct: 451 L-----GS-CLAFAPTASDRMPGFIGNVQQKTLE 478
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 84/283 (29%), Positives = 125/283 (44%), Gaps = 51/283 (18%)
Query: 223 GCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGL 281
GC A +C++ ++YGDGS GT + + LT+G V QG+ + A
Sbjct: 479 GCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVDR---------QGLPLRTAT---- 525
Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVP-LVRNP----- 335
Q G FSYC + S G + G P AA VP V P
Sbjct: 526 -------------QYGRVFSYC-IPPSPSSLGFITLGVP--PQRAALVPTFVSTPLLSSS 569
Query: 336 -RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
P+FY V L + V G +P+ +F + V+ + T ++RLP AY+A R A
Sbjct: 570 SMPPTFYRVLLRAIIVAGRPLPVPPTVFSTSS------VIASTTVISRLPPTAYQALRAA 623
Query: 395 FVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
F A VSI DTCY+ +G S+ +P+++ F GG + L A+ L+
Sbjct: 624 FRRAMTMYRTAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL------Q 677
Query: 455 FCFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C AFAP+ + IGN+QQ +++ +D + F C
Sbjct: 678 GCLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 720
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 130/380 (34%), Positives = 198/380 (52%), Gaps = 35/380 (9%)
Query: 143 TDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC-YKQSDPVFDPA 201
+ ++SG GSG+YFV I +G+PP+S +V D+GSD+VWV+C C C + F P
Sbjct: 75 SPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPR 134
Query: 202 DSASFSGVSCSSAVCDRLENAGCHA-------GRCRYEVSYGDGS-----YTKGTLALET 249
S+SFS C C L +A H CR+ SY DGS ++K T L++
Sbjct: 135 HSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKS 194
Query: 250 LTIGRTVVKNVAIGCGHKNQG------MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC 303
L+ +K ++ GCG + G F GA G++GLG GS+S QLG + G FSYC
Sbjct: 195 LSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYC 254
Query: 304 LVSRGTGSSGSLVF----GREALPVGAA----WVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
L+ + G +LP+ A + PL NP +P+FYY+ + + + G+++
Sbjct: 255 LMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKL 314
Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTC 414
PI+ ++ + + G+ G V+D+GT +T L AYE + V + LP A+ ++ FD C
Sbjct: 315 PINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKS-VRRRVKLPNAAELTPGFDLC 373
Query: 415 YNLSGFVSVR--VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF--APSPSGLSIIG 470
N SG S R +P + F GG V P N+ + ++ G C A S +G S+IG
Sbjct: 374 VNASG-ESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEE-GVMCLAIRAVESGNGFSVIG 431
Query: 471 NIQQEGIQISFDGANGFVGF 490
N+ Q+G + FD +GF
Sbjct: 432 NLMQQGFLLEFDKEESRLGF 451
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 137/373 (36%), Positives = 192/373 (51%), Gaps = 30/373 (8%)
Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG 208
+ G EY + + +G+PP + D+GSD+ W QC+PC C+ Q P++D A SASFS
Sbjct: 88 LRSGQAEYLMELAIGTPPVPFVALADTGSDLTWTQCKPCKLCFPQDTPIYDTAASASFSP 147
Query: 209 VSCSSAVCDRL--ENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRT-------- 255
V C+SA C + + C A CRY +Y DG+Y+ G L ETLT +
Sbjct: 148 VPCASATCLPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPG 207
Query: 256 -VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS 314
V VA GCG N G+ + G +GLG GS+SLV QLG G FSYCL S GS
Sbjct: 208 VSVGGVAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLG---VGKFSYCLTDFFNTSLGS 264
Query: 315 -LVFG---REALP--VGAAWV---PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
++FG A P +G A V PLV+ P PS YYV L G+ +G R+PI F L
Sbjct: 265 PVLFGSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPNGTFDLR 324
Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS--V 423
G G+++D+GT T L A+ + VA N P + S+ C+ +
Sbjct: 325 DDGSGGMIVDSGTIFTVLVESAFRVVVN-HVAGVLNQPVVNASSLDSPCFPATAGEQQLP 383
Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL-SIIGNIQQEGIQISFD 482
+P + +F+GG + L N++ ++ +FC A +PS SI+GN QQ+ IQ+ FD
Sbjct: 384 DMPDMLLHFAGGADMRLHRDNYMSFNQESSSFCLNIAGAPSAYGSILGNFQQQNIQMLFD 443
Query: 483 GANGFVGFGPNVC 495
G + F P C
Sbjct: 444 ITVGQLSFVPTDC 456
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 142/434 (32%), Positives = 213/434 (49%), Gaps = 49/434 (11%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+++E++HRD S F + + +RVA V R + A H
Sbjct: 29 FSVEMIHRDSSRSP---------------FFSPTETQFQRVANAVHR----SINRANHLN 69
Query: 139 QDF------GTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYK 192
Q F T V+S + GEY + VG+P + ++D+GSDI+W+QCQPC +CY+
Sbjct: 70 QSFVSPNSPETTVISAL----GEYLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYE 125
Query: 193 QSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLT 251
Q+ P+FD + S ++ + C S C ++ C + + C Y + Y DGS + G L++ETLT
Sbjct: 126 QTTPIFDSSKSQTYKTLPCPSNTCQSVQGTFCSSRKHCLYSIHYVDGSQSLGDLSVETLT 185
Query: 252 IGRT-----VVKNVAIGCGHKNQ-GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
+G T IGCG N G+ +G++GLG G MSL+ QL TGG FSYCLV
Sbjct: 186 LGSTNGSPVQFPGTVIGCGRYNAIGIEEKNSGIVGLGRGPMSLITQLSPSTGGKFSYCLV 245
Query: 306 SRGTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
+ +S L FG A+ G V PL FY++ L VG RI
Sbjct: 246 PGLSTASSKLNFGNAAVVSGRGTVSTPLFSK-NGLVFYFLTLEAFSVGRNRIEFGSP--- 301
Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLS-GFV 421
G +++D+GT +T LP Y +A VA+T L R + + CY ++ +
Sbjct: 302 -GSGGKGNIIIDSGTTLTALPNGVYSKL-EAAVAKTVILQRVRDPNQVLGLCYKVTPDKL 359
Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISF 481
VP ++ +FSG V TL A N + V D CFAF P+ +G ++ GN+ Q+ + + +
Sbjct: 360 DASVPVITAHFSGADV-TLNAINTFVQVAD-DVVCFAFQPTETG-AVFGNLAQQNLLVGY 416
Query: 482 DGANGFVGFGPNVC 495
D V F C
Sbjct: 417 DLQMNTVSFKHTDC 430
>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 465
Score = 199 bits (506), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 143/396 (36%), Positives = 208/396 (52%), Gaps = 16/396 (4%)
Query: 107 SFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPP 166
S AR+ + TL+ G + ++ + + G G G Y R+G+G+P
Sbjct: 78 SLAARLAKTPSSRPTLLDESRAGSSSSSPDDESLASVPLGPGTSVGVGNYVTRMGLGTPA 137
Query: 167 RSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVS-----CSSAVCDRLE 220
+S MV+D+GS + W+QC PC C++QS PVF+P S+S++ VS CS L
Sbjct: 138 KSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQQCSDLTTATLN 197
Query: 221 NAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLL 279
A C C Y+ SYGD S++ G L+ +T++ G T V N GCG N+G+F +AGL+
Sbjct: 198 PASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLI 257
Query: 280 GLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPS 339
GL +SL+ QL G +FSYCL + + SSG L G P ++ P+ + S
Sbjct: 258 GLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIGSYN-PGQYSYTPMASSSLDDS 316
Query: 340 FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQT 399
Y++ ++G+ V G P+S + + ++D+GT +TRLPT Y A A
Sbjct: 317 LYFIKMTGIKVAGK--PLSVSSSAYSSL---PTIIDSGTVITRLPTGVYSALSKAVAGAM 371
Query: 400 GNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF 459
PRAS SI DTC+ +RVP V+ F+GG L L A N L+ VD A T C AF
Sbjct: 372 KGTPRASAFSILDTCFQGQA-ARLRVPEVTMAFAGGAALKLAARNLLVDVDSATT-CLAF 429
Query: 460 APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
AP+ S +IIGN QQ+ + +D N +GF C
Sbjct: 430 APARSA-AIIGNTQQQTFSVVYDVKNSKIGFAAAGC 464
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 199 bits (506), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 154/441 (34%), Positives = 217/441 (49%), Gaps = 40/441 (9%)
Query: 73 SSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGAD 132
+SD R ++ L HR + + T++ S R++RD R + R+ G
Sbjct: 54 TSDPNRASMPLAHRHGPCAPATTSS-------WPSLAERLRRDRARRDHITRKAKASGRT 106
Query: 133 AAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--SQC 190
+V T + + +D S EY V +G+G+P Q ++ID+GSD+ WVQC+PC S C
Sbjct: 107 TTLSDVS-IPTSLGAAVD--SLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNSSSC 163
Query: 191 YKQSDPVFDPADSASFSGVSCSSAVCDRL----ENAGCH----AGRCRYEVSYGDGSYTK 242
Y Q DP++DP S++++ V C S C L + GC C+Y + YG+ T
Sbjct: 164 YPQKDPLYDPTASSTYAPVPCDSKACKDLVPDAYDHGCTNSSGTSLCQYGIEYGNRDTTV 223
Query: 243 GTLALETLTIGRTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFS 301
G + ETLT+ V VK+ GCG QG F GLLGLGG SLV Q GGAFS
Sbjct: 224 GVYSTETLTLSPQVSVKDFGFGCGLVQQGTFDLFDGLLGLGGAPESLVSQTAETYGGAFS 283
Query: 302 YCLVSRGTGSSGSLVFGREAL---PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPIS 358
YCL G ++G L G G + PL P +FY V L+G+ VGG + I
Sbjct: 284 YCL-PPGNSTTGFLALGAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIP 342
Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP--RASGVSIFDTCYN 416
+ G+++D+GT +T LP AY A R AF P + + DTCYN
Sbjct: 343 PTVLS------GGMIIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCYN 396
Query: 417 LSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQ 474
+G +V VPTV+ F GG + L + ++ D C AFA S + IIGN+ Q
Sbjct: 397 FTGIANVTVPTVALTFDGGATIDLDVPSGVLIQD-----CLAFAGGASDGDVGIIGNVNQ 451
Query: 475 EGIQISFDGANGFVGFGPNVC 495
++ +D G VGF P C
Sbjct: 452 RTFEVLYDSGRGHVGFRPGAC 472
>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
Length = 280
Score = 199 bits (506), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 114/231 (49%), Positives = 147/231 (63%), Gaps = 28/231 (12%)
Query: 91 SSSNTTNNMHYH-RHQHSFHA--------RMQRDVKRVATLVRRLSGGGADAAKHEVQDF 141
+SS +T ++ H R S HA R+ RD RV + +L+ Q+F
Sbjct: 64 TSSTSTLSLQLHSRASLSSHADYKSLTLSRLDRDSARVKYITTKLN-----------QNF 112
Query: 142 GTD-----VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP 196
TD ++SG QGSGEYF RIG+G PP YMV+D+GSDI WVQC PC+ CY+Q+DP
Sbjct: 113 NTDKLSGPIISGTSQGSGEYFSRIGIGEPPSQAYMVLDTGSDISWVQCAPCADCYRQADP 172
Query: 197 VFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV 256
+F+P SAS++ +SC +A C L+ + C G C Y+VSYGDGSYT G ET+TIG
Sbjct: 173 IFEPTASASYAPLSCEAAQCRYLDQSQCRNGNCLYQVSYGDGSYTVGDFVTETVTIGVNK 232
Query: 257 VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
VKNVA+GCGH N+G+FVGAAGL+GLGGG +S QL +FSYCLV R
Sbjct: 233 VKNVALGCGHNNEGLFVGAAGLIGLGGGPLSFPAQLNST---SFSYCLVDR 280
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 199 bits (506), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 135/430 (31%), Positives = 212/430 (49%), Gaps = 38/430 (8%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
++++L+HRD S + R +F +R V RV R + +D + +
Sbjct: 32 FSVDLIHRDSPHSPFFDPSKTQAERLTDAF----RRSVSRVGRF--RPTAMTSDGIQSRI 85
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
+GEY + + +G+PP ++D+GSD+ W QC+PC+ CYKQ P+F
Sbjct: 86 V-----------PSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLF 134
Query: 199 DPADSASFSGVSCSSAVCDRL-ENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV 256
DP +S+++ SC ++ C L ++ C +C + SY DGS+T G LA ETLT+ T
Sbjct: 135 DPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTA 194
Query: 257 VKNV-----AIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG 310
K V A GCGH + G+F ++G++GLGGG +SL+ QL G FSYCL+ T
Sbjct: 195 GKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTD 254
Query: 311 SSGS--LVFGREALPVGAAWV--PLVRNPRAP-SFYYVGLSGLGVGGMRIPISEDLFRLT 365
SS S + FG G V PLV+ ++P +FYY+ L G+ VG R+P + + T
Sbjct: 255 SSISSRINFGASGRVSGYGTVSTPLVQ--KSPDTFYYLTLEGISVGKKRLPY-KGYSKKT 311
Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV 425
++ + +++D+GT T LP Y + IF CYN + +
Sbjct: 312 EVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTA--EINA 369
Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
P ++ +F V P + F+ +D CF AP+ S + ++GN+ Q + FD
Sbjct: 370 PIITAHFKDANVELQPLNTFMRMQEDL--VCFTVAPT-SDIGVLGNLAQVNFLVGFDLRK 426
Query: 486 GFVGFGPNVC 495
V F C
Sbjct: 427 KRVSFKAADC 436
>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
gi|223975971|gb|ACN32173.1| unknown [Zea mays]
gi|224034191|gb|ACN36171.1| unknown [Zea mays]
gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
Length = 465
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 143/396 (36%), Positives = 208/396 (52%), Gaps = 16/396 (4%)
Query: 107 SFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPP 166
S AR+ + TL+ G + ++ + + G G G Y R+G+G+P
Sbjct: 78 SLAARLAKTPSSRPTLLDESRAGSSSSSPDDESLASVPLGPGTSVGVGNYVTRMGLGTPA 137
Query: 167 RSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVS-----CSSAVCDRLE 220
+S MV+D+GS + W+QC PC C++QS PVF+P S+S++ VS CS L
Sbjct: 138 KSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQQCSDLTTATLN 197
Query: 221 NAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLL 279
A C C Y+ SYGD S++ G L+ +T++ G T V N GCG N+G+F +AGL+
Sbjct: 198 PASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLI 257
Query: 280 GLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPS 339
GL +SL+ QL G +FSYCL + + SSG L G P ++ P+ + S
Sbjct: 258 GLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIGSYN-PGQYSYTPMASSSLDDS 316
Query: 340 FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQT 399
Y++ ++G+ V G P+S + + ++D+GT +TRLPT Y A A
Sbjct: 317 LYFIKMTGIKVAGK--PLSVSSSAYSSL---PTIIDSGTVITRLPTGVYSALSKAVAGAM 371
Query: 400 GNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF 459
PRAS SI DTC+ +RVP V+ F+GG L L A N L+ VD A T C AF
Sbjct: 372 KGTPRASAFSILDTCFQGQA-ARLRVPEVTMAFAGGAALKLAARNLLVDVDSATT-CLAF 429
Query: 460 APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
AP+ S +IIGN QQ+ + +D N +GF C
Sbjct: 430 APARSA-AIIGNTQQQTFSVVYDVKNSKIGFAAGGC 464
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 133/428 (31%), Positives = 212/428 (49%), Gaps = 41/428 (9%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
+E++HRD S + R + H R + RV + S
Sbjct: 30 IEMIHRDFSKSPLYHPTVTKFQRAYNVVH----RSINRVNYFTKEFSLNKNQP------- 78
Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
VS + GEY + VG+PP Y +D+GS+IVW+QCQPC+ C+ Q+ P+F+P
Sbjct: 79 -----VSTLTPELGEYLISYSVGTPPFKVYGFMDTGSNIVWLQCQPCNTCFNQTSPIFNP 133
Query: 201 ADSASFSGVSCSSAVCDRLENA--GCHAG--RCRYEVSYGDGSYTKGTLALETLTIGRT- 255
+ S+S+ + C+S+ C + C G C Y ++YG + ++G L+ ++LT+ T
Sbjct: 134 SKSSSYKNIPCTSSTCKDTNDTHISCSNGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTS 193
Query: 256 ----VVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQT-GGAFSYCLV--SR 307
+ N+ IGCGH N ++G++G+G G MSL+ Q+G + G FSYCL+ +
Sbjct: 194 GSSVLFPNIVIGCGHINVLQDNSQSSGVVGMGRGPMSLIKQVGSSSVGSKFSYCLIPYNS 253
Query: 308 GTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
+ SS L+FG + + G V P+V+ ++Y++ L VG RI E T
Sbjct: 254 DSNSSSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRIEYGERSNAST 313
Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLSGFVSVR 424
Q +++D+GT +T LP + + ++VAQ LPR CYN +G +
Sbjct: 314 Q----NILIDSGTPLTMLPN-LFLSKLVSYVAQEVKLPRIEPPDHHLSLCYNTTG-KQLN 367
Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGA 484
VP ++ +F+G V L ++ P +D G CF F S +GL I GNI Q + I +D
Sbjct: 368 VPDITAHFNGADV-KLNSNGTFFPFED-GIMCFGFI-SSNGLEIFGNIAQNNLLIDYDLE 424
Query: 485 NGFVGFGP 492
+ F P
Sbjct: 425 KEIISFKP 432
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 198 bits (503), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 120/350 (34%), Positives = 174/350 (49%), Gaps = 16/350 (4%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
Y R G+G+P ++ + ID +D WV C C+ C S P F P S+++ V C S
Sbjct: 101 NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGC-AASSPSFSPTQSSTYRTVPCGSP 159
Query: 215 VCDRLENAGCHAG---RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
C ++ + C AG C + ++Y ++ + L ++L + VV + GC G
Sbjct: 160 QCAQVPSPSCPAGVGSSCGFNLTYAASTF-QAVLGQDSLALENNVVVSYTFGCLRVVSGN 218
Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVP 330
V GL+G G G +S + Q G FSYCL + R + SG+L G P P
Sbjct: 219 SVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGPIGQPKRIKTTP 278
Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
L+ NP PS YYV + G+ VG + + + + G ++D GT TRL P Y A
Sbjct: 279 LLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTRLAAPVYAA 338
Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
RDAF + P A + FDTCYN V+V VPTV+F F+G +TLP N +I
Sbjct: 339 VRDAFRGRV-RTPVAPPLGGFDTCYN----VTVSVPTVTFMFAGAVAVTLPEENVMIHSS 393
Query: 451 DAGTFCFAFAPSPS-----GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
G C A A PS L+++ ++QQ+ ++ FD ANG VGF +C
Sbjct: 394 SGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELC 443
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 198 bits (503), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 129/362 (35%), Positives = 184/362 (50%), Gaps = 17/362 (4%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
V SG G Y VR +G+PP+ +MV+D+ +D VW+ C CS C + F+ S+
Sbjct: 93 VASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGC-SNASTSFNTNSSS 151
Query: 205 SFSGVSCSSAVCDRLENAGCHAGR-----CRYEVSYGDGSYTKGTLALETLTIGRTVVKN 259
++S VSCS+A C + C + C + SYG S +L +TLT+ V+ N
Sbjct: 152 TYSTVSCSTAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPDVIPN 211
Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFG 318
+ GC + G + GL+GLG G MSLV Q G FSYCL S R SGSL G
Sbjct: 212 FSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLG 271
Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
P + PL+RNPR PS YYV L+G+ VG +++P+ G ++D+GT
Sbjct: 272 LLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGT 331
Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
+TR P YEA RD F Q N+ S + FDTC++ P ++ + + L
Sbjct: 332 VITRFAQPVYEAIRDEFRKQV-NVSSFSTLGAFDTCFSADN--ENVAPKITLHMTSLD-L 387
Query: 439 TLPASNFLIPVDDAGTF-CFAFA----PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
LP N LI AGT C + A + + L++I N+QQ+ ++I FD N +G P
Sbjct: 388 KLPMENTLI-HSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPE 446
Query: 494 VC 495
C
Sbjct: 447 PC 448
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 198 bits (503), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 137/407 (33%), Positives = 199/407 (48%), Gaps = 28/407 (6%)
Query: 112 MQRDVKRVATL--VRRLSGGGADAAKHEVQDFGTDV-VSGMDQGSGEYFVRIGVGSPPRS 168
MQR R A L VR + + K++ Q VS G EY V + +G+PP+
Sbjct: 55 MQRSKARAAALSAVRNRAASARFSGKNDDQRTTPPTGVSVRPSGDLEYVVDLAIGTPPQP 114
Query: 169 QYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH-AG 227
++D+GSD++W QC PC+ C Q DP+F P +SAS+ + C+ +C + + GC
Sbjct: 115 VSALLDTGSDLIWTQCAPCASCLAQPDPLFAPGESASYEPMRCAGQLCSDILHHGCEMPD 174
Query: 228 RCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
C Y +YGDG+ T G A E T R + + GCG N G +G++G G
Sbjct: 175 TCTYRYNYGDGTMTMGVYATERFTFTSSGGDRLMTVPLGFGCGSMNVGSLNNGSGIVGFG 234
Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV------GAAWVPLVRNPR 336
+SLV QL + FSYCL S G+G +L+FG + V PL+++ +
Sbjct: 235 RNPLSLVSQLSIRR---FSYCLTSYGSGRKSTLLFGSLSGGVYGDATGPVQTTPLLQSLQ 291
Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV 396
P+FYYV L+GL VG R+ I E F L G GV++D+GTA+T LP AF
Sbjct: 292 NPTFYYVHLAGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPGAVLAEVVRAFR 351
Query: 397 AQTGNLPRASGVSIFD-TCYNL-------SGFVSVRVPTVSFYFSGGPVLTLPASNFLIP 448
Q LP A+G + D C+ + S V VP + F+F L LP N+++
Sbjct: 352 QQL-RLPFANGGNPEDGVCFLVPAAWRRSSSTSQVPVPRMVFHFQDAD-LDLPRRNYVLD 409
Query: 449 VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
G C A S S IGN+ Q+ +++ +D + F P C
Sbjct: 410 DHRKGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSFAPAQC 456
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 198 bits (503), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 133/381 (34%), Positives = 189/381 (49%), Gaps = 30/381 (7%)
Query: 143 TDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP-VFDPA 201
+ VVSG GSG+YFV + +G PP+S ++ D+GSD+VWV+C C C S VF P
Sbjct: 71 SPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPR 130
Query: 202 DSASFSGVSCSSAVCDRLENAG----CHAGR----CRYEVSYGDGSYTKGTLALETLTI- 252
S++FS C VC + C+ R C YE Y DGS T G A ET ++
Sbjct: 131 HSSTFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLK 190
Query: 253 ----GRTVVKNVAIGCGHKNQGM------FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
+K+VA GCG + G F GA G++GLG G +S QLG + G FSY
Sbjct: 191 TSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSY 250
Query: 303 CLV--SRGTGSSGSLVFGREALPVGAA-WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISE 359
CL+ + + L+ G + + PL+ NP +P+FYYV L + V G ++ I
Sbjct: 251 CLMDYTLSPPPTSYLIIGNGGDGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDP 310
Query: 360 DLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLS 418
++ + G+ G V+D+GT + L PAY + A V + LP A ++ FD C N+S
Sbjct: 311 SIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAA-VRRRVKLPIADALTPGFDLCVNVS 369
Query: 419 GFVSVR--VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF--APSPSGLSIIGNIQQ 474
G +P + F FSGG V P N+ I ++ C A G S+IGN+ Q
Sbjct: 370 GVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQ-CLAIQSVDPKVGFSVIGNLMQ 428
Query: 475 EGIQISFDGANGFVGFGPNVC 495
+G FD +GF C
Sbjct: 429 QGFLFEFDRDRSRLGFSRRGC 449
>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
Length = 629
Score = 198 bits (503), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 137/398 (34%), Positives = 204/398 (51%), Gaps = 52/398 (13%)
Query: 115 DVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPR------- 167
D +R + +RL+G A + Q + + +G+Y G+GS P
Sbjct: 8 DQRRADYIQKRLTG-----ATDDKQPMAFSSRTSQYEKNGQYATNGGLGSVPHLKSLSTT 62
Query: 168 -------------SQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCS 212
+Q ++IDSGSD+ WVQC+PC C++Q DP+FDPA S +++ V C+
Sbjct: 63 ATTNSAPDGTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCT 122
Query: 213 SAVCDRL--ENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKN 268
SA C +L GC A +C++ ++YGDGS GT + + LT+G V++ GC H +
Sbjct: 123 SAACAQLGPYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHAD 182
Query: 269 QG--MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG----REAL 322
+G AG L LGGGS SLV Q + G FSYCL + S G LV G R L
Sbjct: 183 RGSAFDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTAS-SLGFLVLGVPPERAQL 241
Query: 323 PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
PL+ + AP+FY V L + V G + + +F + V+D+ T ++R
Sbjct: 242 IPSFVSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASS------VIDSSTIISR 295
Query: 383 LPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPA 442
LP AY+A R AF + A VSI DTCY+ +G S+ +P+++ F GG + L A
Sbjct: 296 LPPTAYQALRAAFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDA 355
Query: 443 SNFLIPVDDAGTFCFAFAPSPSGL--SIIGNIQQEGIQ 478
+ L+ G+ C AFAP+ S IGN+QQ+ ++
Sbjct: 356 AGILL-----GS-CLAFAPTASDRMPGFIGNVQQKTLE 387
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 84/283 (29%), Positives = 125/283 (44%), Gaps = 51/283 (18%)
Query: 223 GCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGL 281
GC A +C++ ++YGDGS GT + + LT+G V QG+ + A
Sbjct: 388 GCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVDR---------QGLPLRTAT---- 434
Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVP-LVRNP----- 335
Q G FSYC + S G + G P AA VP V P
Sbjct: 435 -------------QYGRVFSYC-IPPSPSSLGFITLGVP--PQRAALVPTFVSTPLLSSS 478
Query: 336 -RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
P+FY V L + V G +P+ +F + V+ + T ++RLP AY+A R A
Sbjct: 479 SMPPTFYRVLLRAIIVAGRPLPVPPTVFSTSS------VIASTTVISRLPPTAYQALRAA 532
Query: 395 FVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
F A VSI DTCY+ +G S+ +P+++ F GG + L A+ L+
Sbjct: 533 FRRAMTMYRTAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL------Q 586
Query: 455 FCFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C AFAP+ + IGN+QQ +++ +D + F C
Sbjct: 587 GCLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 629
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 198 bits (503), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 120/350 (34%), Positives = 174/350 (49%), Gaps = 16/350 (4%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
Y R G+G+P ++ + ID +D WV C C+ C S P F P S+++ V C S
Sbjct: 82 NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGC-AASSPSFSPTQSSTYRTVPCGSP 140
Query: 215 VCDRLENAGCHAG---RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
C ++ + C AG C + ++Y ++ + L ++L + VV + GC G
Sbjct: 141 QCAQVPSPSCPAGVGSSCGFNLTYAASTF-QAVLGQDSLALENNVVVSYTFGCLRVVSGN 199
Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVP 330
V GL+G G G +S + Q G FSYCL + R + SG+L G P P
Sbjct: 200 SVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGPIGQPKRIKTTP 259
Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
L+ NP PS YYV + G+ VG + + + + G ++D GT TRL P Y A
Sbjct: 260 LLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTRLAAPVYAA 319
Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
RDAF + P A + FDTCYN V+V VPTV+F F+G +TLP N +I
Sbjct: 320 VRDAFRGRV-RTPVAPPLGGFDTCYN----VTVSVPTVTFMFAGAVAVTLPEENVMIHSS 374
Query: 451 DAGTFCFAFAPSPS-----GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
G C A A PS L+++ ++QQ+ ++ FD ANG VGF +C
Sbjct: 375 SGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELC 424
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 197 bits (502), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 135/349 (38%), Positives = 196/349 (56%), Gaps = 16/349 (4%)
Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC 211
GSGEY +++ G+P +S Y +ID+GSD+ W+ C+ C C+ + P+FDPA S+S+ +C
Sbjct: 111 GSGEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCHSTA-PIFDPAKSSSYKPFAC 169
Query: 212 SSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
S C + +C++EV YGDG+ GTLA + +T+G + N + GC
Sbjct: 170 DSQPCQEISGNCGGNSKCQFEVLYGDGTQVDGTLASDAITLGSQYLPNFSFGCAESLSED 229
Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQT--GGAFSYCLVSRGTGSSGSLVFGREALPVGAA-- 327
+ GL+GLGGGS+SL+ Q GG FSYCL + SSGSLV G+EA ++
Sbjct: 230 TYSSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCL-PSSSTSSGSLVLGKEAAVSSSSLK 288
Query: 328 WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD-DGVVMDTGTAVTRLPTP 386
+ L+++P P+FY+V L + VG RI + T + G ++D+GT +T L
Sbjct: 289 FTTLIKDPSFPTFYFVTLKAISVGNTRISVPA-----TNIASGGGTIIDSGTTITYLVPS 343
Query: 387 AYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
AY+ RDAF Q +L + + V DTCY+LS SV VPT++ + L LP N L
Sbjct: 344 AYKDLRDAFRQQLSSL-QPTPVEDMDTCYDLSS-SSVDVPTITLHLDRNVDLVLPKENIL 401
Query: 447 IPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
I ++G C AF+ S SIIGN+QQ+ +I FD N VGF C
Sbjct: 402 I-TQESGLSCLAFS-STDSRSIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 197 bits (502), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 127/349 (36%), Positives = 179/349 (51%), Gaps = 15/349 (4%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
S Y VR +G+P + + +D+ +D WV C C C S +FDP+ S+S + C
Sbjct: 88 SPTYIVRANIGTPAQPMLVALDTSNDAAWVPCSGCVGC--ASSVLFDPSKSSSSRNLQCD 145
Query: 213 SAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
+ C + N C AG+ C + ++YG GS + +L +TLT+ V+K+ GC K G
Sbjct: 146 APQCKQAPNPTCTAGKSCGFNMTYG-GSTIEASLTQDTLTLANDVIKSYTFGCISKATGT 204
Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV-SRGTGSSGSLVFGREALPVGAAWVP 330
+ A GL+GLG G +SL+ Q FSYCL S+ + SGSL G + PV P
Sbjct: 205 SLPAQGLMGLGRGPLSLISQTQNLYMSTFSYCLPNSKSSNFSGSLRLGPKYQPVRIKTTP 264
Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
L++NPR S YYV L G+ VG + I G + D+GT TRL PAY A
Sbjct: 265 LLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGAGTIFDSGTVFTRLVEPAYVA 324
Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
R+ F + N A+ + FDTCY SG SV P+V+F F+G V TLP N LI
Sbjct: 325 VRNEFRRRIKNA-NATSLGGFDTCY--SG--SVVYPSVTFMFAGMNV-TLPPDNLLIHSS 378
Query: 451 DAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
T C A A +P S L++I ++QQ+ ++ D N +G C
Sbjct: 379 SGSTSCLAMAAAPNNVNSVLNVIASMQQQNHRVLIDLPNSRLGISRETC 427
>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 324
Score = 197 bits (502), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 142/341 (41%), Positives = 192/341 (56%), Gaps = 33/341 (9%)
Query: 171 MVIDSGSDIVWVQCQPCS---QCYKQSDPVFDPADSASFSGVSCSSAVCDRL---ENAGC 224
M +D+GSD+ WVQC+PC+ CY Q DP+FDPA S+S++ V C VC L + C
Sbjct: 1 MEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASAC 60
Query: 225 HAGRCRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGG 283
A +C Y VSYGDGS T G + +TLT+ + V+ GCGH G+F G GLLGLG
Sbjct: 61 SAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGR 120
Query: 284 GSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA----WVPLVRNPRAPS 339
SLV Q G GG FSYCL ++ + ++G L G P GAA L+ +P AP+
Sbjct: 121 EQPSLVEQTAGTYGGVFSYCLPTKPS-TAGYLTLGVGG-PSGAAPGFSTTQLLPSPNAPT 178
Query: 340 FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQT 399
+Y V L+G+ VGG ++ + F + +DTGT VTRLP AY A R AF +
Sbjct: 179 YYVVMLTGISVGGQQLSVPASAFAGGTV------VDTGTVVTRLPPTAYAALRSAFRSGM 232
Query: 400 GN--LPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF-C 456
+ P A I DTCYN +G+ +V +P V+ F G +TL A L +F C
Sbjct: 233 ASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL-------SFGC 285
Query: 457 FAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
AFAPS S G++I+GN+QQ ++ DG + VGF P+ C
Sbjct: 286 LAFAPSGSDGGMAILGNVQQRSFEVRIDGTS--VGFKPSSC 324
>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
gi|238015146|gb|ACR38608.1| unknown [Zea mays]
gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
Length = 467
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 136/355 (38%), Positives = 190/355 (53%), Gaps = 16/355 (4%)
Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDP-----A 201
G G G Y R+G+G+P +S MV+D+GS + W+QC PC C++QS PVF+P
Sbjct: 121 GTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSY 180
Query: 202 DSASFSGVSCSSAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNV 260
S S S CS L A C C Y+ SYGD S++ G L+ +T++ G T V N
Sbjct: 181 TSVSCSAQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTSVPNF 240
Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE 320
GCG N+G+F +AGL+GL +SL+ QL G +FSYCL + + SSG L G
Sbjct: 241 YYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIGSY 300
Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
P ++ P+ + S Y++ ++G+ V G P+S + + ++D+GT +
Sbjct: 301 N-PGQYSYTPMASSSLDDSLYFIKMTGIKVAGK--PLSVSSSAYSSL---PTIIDSGTVI 354
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
TRLPT Y A A PRAS SI DTC+ +RVP V+ F+GG L L
Sbjct: 355 TRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQA-ARLRVPEVTMAFAGGAALKL 413
Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
A N L+ VD A T C AFAP+ S +IIGN QQ+ + +D N +GF C
Sbjct: 414 AARNLLVDVDSATT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKNSKIGFAAGGC 466
>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
gi|194706308|gb|ACF87238.1| unknown [Zea mays]
Length = 467
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 136/355 (38%), Positives = 190/355 (53%), Gaps = 16/355 (4%)
Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDP-----A 201
G G G Y R+G+G+P +S MV+D+GS + W+QC PC C++QS PVF+P
Sbjct: 121 GTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSY 180
Query: 202 DSASFSGVSCSSAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNV 260
S S S CS L A C C Y+ SYGD S++ G L+ +T++ G T V N
Sbjct: 181 TSVSCSAQQCSDLTTATLSPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTSVPNF 240
Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE 320
GCG N+G+F +AGL+GL +SL+ QL G +FSYCL + + SSG L G
Sbjct: 241 YYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIGSY 300
Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
P ++ P+ + S Y++ ++G+ V G P+S + + ++D+GT +
Sbjct: 301 N-PGQYSYTPMASSSLDDSLYFIKMTGIKVAGK--PLSVSSSAYSSL---PTIIDSGTVI 354
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
TRLPT Y A A PRAS SI DTC+ +RVP V+ F+GG L L
Sbjct: 355 TRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQA-ARLRVPEVTMAFAGGAALKL 413
Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
A N L+ VD A T C AFAP+ S +IIGN QQ+ + +D N +GF C
Sbjct: 414 AARNLLVDVDSATT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKNSKIGFAAGGC 466
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 131/411 (31%), Positives = 191/411 (46%), Gaps = 27/411 (6%)
Query: 106 HSFHARMQRDVKRVATLVR----RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIG 161
H+ H ++ + L R RL + AA V V SG Q Y VR G
Sbjct: 27 HNVHPPSSSPLESIIALAREDDARLLFLSSKAASTGVSS--APVASG--QSPPSYVVRAG 82
Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
+GSP + + +D+ +D W C PC C S +F PA+S S++ + CSS +C L+
Sbjct: 83 LGSPAQPILLALDTSADATWAHCSPCGTC-PSSGSLFAPANSTSYAPLPCSSTMCTVLQG 141
Query: 222 AGCHA----------GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
C A C + + D S+ + +LA + L +G+ + N A GC G
Sbjct: 142 QPCPAQDPYDSSAPLPMCAFTKPFADASF-QASLASDWLHLGKDAIPNYAFGCVSAVSGP 200
Query: 272 F--VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAW 328
+ GLLGLG G M+L+ Q+G G FSYCL S + SGSL G P G +
Sbjct: 201 TANLPKQGLLGLGRGPMALLSQVGNMYNGVFSYCLPSYKSYYFSGSLRLGAAGQPRGVRY 260
Query: 329 VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
P+++NP S YYV ++GL VG + + F G V+D+GT +TR P Y
Sbjct: 261 TPMLKNPNRSSLYYVNVTGLSVGRAPVKVPAGSFAFDPATGAGTVVDSGTVITRWTPPVY 320
Query: 389 EAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIP 448
A R+ F + + FDTC+N + P V+ + GG L LP N LI
Sbjct: 321 AALREEFRRHVAAPSGYTSLGAFDTCFNTDEVAAGVAPAVTVHMDGGLDLALPMENTLIH 380
Query: 449 VDDAGTFCFAFAPSPSG----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C A A +P ++++ N+QQ+ +++ FD AN VGF C
Sbjct: 381 SSATPLACLAMAEAPQNVNAVVNVLANLQQQNLRVVFDVANSRVGFARESC 431
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 197 bits (500), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 129/362 (35%), Positives = 184/362 (50%), Gaps = 17/362 (4%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
V SG G Y VR +G+PP+ +MV+D+ +D VW+ C CS C + F+ S+
Sbjct: 19 VASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGC-SNASTSFNTNSSS 77
Query: 205 SFSGVSCSSAVCDRLENAGCHAGR-----CRYEVSYGDGSYTKGTLALETLTIGRTVVKN 259
++S VSCS+A C + C + C + SYG S +L +TLT+ V+ N
Sbjct: 78 TYSTVSCSTAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPDVIPN 137
Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFG 318
+ GC + G + GL+GLG G MSLV Q G FSYCL S R SGSL G
Sbjct: 138 FSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLG 197
Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
P + PL+RNPR PS YYV L+G+ VG +++P+ G ++D+GT
Sbjct: 198 LLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGT 257
Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
+TR P YEA RD F Q N+ S + FDTC++ P ++ + + L
Sbjct: 258 VITRFAQPVYEAIRDEFRKQV-NVSSFSTLGAFDTCFSADN--ENVAPKITLHMTSLD-L 313
Query: 439 TLPASNFLIPVDDAGTF-CFAFA----PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
LP N LI AGT C + A + + L++I N+QQ+ ++I FD N +G P
Sbjct: 314 KLPMENTLI-HSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPE 372
Query: 494 VC 495
C
Sbjct: 373 PC 374
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 119/367 (32%), Positives = 172/367 (46%), Gaps = 35/367 (9%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+ EY VR+ VG+P R + +D+GSD+VW QC PC C+ Q PV DPA S++++ + C
Sbjct: 81 TNEYLVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAASSTYAALPCG 140
Query: 213 SAVCDRLENAGC------HAGRCRYEVSYGDGSYTKGTLALETLTIGRT-------VVKN 259
+A C L C + C Y YGD S T G +A + T G + +
Sbjct: 141 AARCRALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTRR 200
Query: 260 VAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG 318
+ GCGH N+G+F G+ G G G SL QL + FSYC S S + G
Sbjct: 201 LTFGCGHLNKGVFQSNETGIAGFGRGRWSLPSQLNVTS---FSYCFTSMFESKSSLVTLG 257
Query: 319 -------REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
A P+++NP PS Y++ L G+ VG R+P+ E FR T
Sbjct: 258 GSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFRST------ 311
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR---VPTV 428
++D+G ++T LP YEA + F AQ G P S D C+ L R VP++
Sbjct: 312 -IIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDLCFALPVTALWRRPAVPSL 370
Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
+ + G LP SN++ A C +P ++IGN QQ+ + +D N +
Sbjct: 371 TLHLEGAD-WELPRSNYVFEDLGARVMCIVLDAAPGEQTVIGNFQQQNTHVVYDLENDRL 429
Query: 489 GFGPNVC 495
F P C
Sbjct: 430 SFAPARC 436
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 141/444 (31%), Positives = 213/444 (47%), Gaps = 40/444 (9%)
Query: 68 SSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLS 127
+S S + + L+HRD S N ++ R Q SFH + R R +
Sbjct: 22 TSLTASMNNGSFTASLIHRDSPISPLYNPKNTYFDRLQSSFHRSISR--------ANRFT 73
Query: 128 GGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC 187
AAK D++ G GEYF+RI +G+PP ++ D+GSD++WVQCQPC
Sbjct: 74 PNSVSAAK----TLEYDII----PGGGEYFMRISIGTPPIEVLVIADTGSDLIWVQCQPC 125
Query: 188 SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN--AGCHA----GRCRYEVSYGDGSYT 241
+CYKQ P+F+P S+++ V C + C+ L + C A C Y SYGD S+T
Sbjct: 126 QECYKQKSPIFNPKQSSTYRRVLCETRYCNALNSDMRACSAHGFFKACGYSYSYGDHSFT 185
Query: 242 KGTLALETLTIGRT--VVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGG 298
G LA E IG T ++ +A GCG+ N G F +G++GLGGGS+SL+ QLG +
Sbjct: 186 MGYLATERFIIGSTNNSIQELAFGCGNSNGGNFDEVGSGIVGLGGGSLSLISQLGTKIDN 245
Query: 299 AFSYCLV---SRGTGSSGSLVFGREALPVGA---AWVPLV-RNPRAPSFYYVGLSGLGVG 351
FSYCLV + S G +VFG + G+ PLV + P +FYY+ L + VG
Sbjct: 246 KFSYCLVPILEKSNFSLGKIVFGDNSFISGSDTYVSTPLVSKEPE--TFYYLTLEAISVG 303
Query: 352 GMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF 411
R+ E+ + +++D+GT +T L + Y + IF
Sbjct: 304 NERLAY-ENSRNDGNVEKGNIIIDSGTTLTFLDSKLYNKLELVLEKAVEGERVSDPNGIF 362
Query: 412 DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGN 471
C+ + + +P ++ +F+ V P + F +D CF PS +G++I GN
Sbjct: 363 SICFRDK--IGIELPIITVHFTDADVELKPINTFAKAEEDL--LCFTMIPS-NGIAIFGN 417
Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
+ Q + +D V F P C
Sbjct: 418 LAQMNFLVGYDLDKNCVSFMPTDC 441
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 120/341 (35%), Positives = 171/341 (50%), Gaps = 26/341 (7%)
Query: 173 IDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYE 232
+D+GSD++W QC PC C Q P FD SA++ + C S+ C L + C C Y+
Sbjct: 1 MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKKMCVYQ 60
Query: 233 VSYGDGSYTKGTLALETLTIG-----RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMS 287
YGD + T G LA ET T G + N+A GCG N G ++G++G G G +S
Sbjct: 61 YYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLS 120
Query: 288 LVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA----------LPVGAAWVPLVRNPRA 337
LV QLG FSYCL S + + L FG A PV + P V NP
Sbjct: 121 LVSQLGPS---RFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQS--TPFVINPAL 175
Query: 338 PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVA 397
P+ Y++ L + +G +PI +F + G GV++D+GT++T L AYEA R V+
Sbjct: 176 PNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVS 235
Query: 398 QTGNLPRASGVSI-FDTCYNL--SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
LP + I DTC+ V+V VP + F+F + LP N+++ G
Sbjct: 236 AIP-LPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLP-ENYMLIASTTGY 293
Query: 455 FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C AP+ G +IIGN QQ+ + + +D N F+ F P C
Sbjct: 294 LCLVMAPTGVG-TIIGNYQQQNLHLLYDIGNSFLSFVPAPC 333
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 136/385 (35%), Positives = 192/385 (49%), Gaps = 38/385 (9%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ----PCSQCYKQS---DPVFD 199
SG G G+Y V + G+PP+ ++ D+GSD++W+QC P + C K++ P F
Sbjct: 45 SGAFLGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFV 104
Query: 200 PADSASFSGVSCSSAVCDRLENAGCHAGRCR--------YEVSYGDGSYTKGTLALETLT 251
+ SA+ S V CS+A C + H C Y Y DGS T G LA +T T
Sbjct: 105 ASKSATLSVVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLARDTAT 164
Query: 252 I-----GRTVVKNVAIGCGHKNQG-MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
I G V+ VA GCG +NQG F G G++GLG G +S Q G FSYCL+
Sbjct: 165 ISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLL 224
Query: 306 SRGTG----SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
G SS L GR A+ PLV NP AP+FYYVG+ + VG +P+
Sbjct: 225 DLEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSE 284
Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF----DTCYNL 417
+ + +G+ G V+D+G+ +T L AY AF A +LPR + F + CYN+
Sbjct: 285 WAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASV-HLPRIPSSATFFQGLELCYNV 343
Query: 418 SGFVSVR-----VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP--SPSGLSIIG 470
S S+ P ++ F+ G L LP N+L+ V D C A P SP +++G
Sbjct: 344 SSSSSLAPANGGFPRLTIDFAQGLSLELPTGNYLVDVAD-DVKCLAIRPTLSPFAFNVLG 402
Query: 471 NIQQEGIQISFDGANGFVGFGPNVC 495
N+ Q+G + FD A+ +GF C
Sbjct: 403 NLMQQGYHVEFDRASARIGFARTEC 427
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 150/448 (33%), Positives = 220/448 (49%), Gaps = 41/448 (9%)
Query: 60 LFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRV 119
LF H I S+ + + + +L+HRD S R +++ H R RV
Sbjct: 14 LFSSH--ILSNVNAKPKLGFTTDLIHRDSPKSPFYNPAETPSQRIRNAIH----RSFNRV 67
Query: 120 ATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDI 179
+ DA+ + Q TD+ GEY + + +G+PP V D+GS++
Sbjct: 68 SHFTDL---SEMDASLNSPQ---TDIT----PCGGEYLMNLSLGTPPSPIMAVADTGSNL 117
Query: 180 VWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN-AGC--HAGRCRYEVSYG 236
+W QC+PC CY Q DP+FDP S+++ VSCSS+ C LEN A C C Y VSY
Sbjct: 118 IWTQCKPCDDCYTQVDPLFDPKASSTYKDVSCSSSQCTALENQASCSTEDKTCSYLVSYA 177
Query: 237 DGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQGMFVGA-AGLLGLGGGSMSLVG 290
DGSYT G A++TLT+G T +KN+ IGCG N F +G++GLGGG++SL+
Sbjct: 178 DGSYTMGKFAVDTLTLGSTDNRPVQLKNIIIGCGQNNAVTFRNKSSGVVGLGGGAVSLIK 237
Query: 291 QLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGL 348
QLG G FSYCLV +S + FG A+ G V PLV R +FYY+ L +
Sbjct: 238 QLGDSIDGKFSYCLVPENDQTS-KINFGTNAVVSGPGTVSTPLVVKSRD-TFYYLTLKSI 295
Query: 349 GVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV 408
VG + + + +V+D+GT +T LP Y +A VA N ++
Sbjct: 296 SVGSKNMQTPDSNIK------GNMVIDSGTTLTLLPVKYYIEIENA-VASLINADKSKDE 348
Query: 409 SIFDT-CYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS 467
I + CYN + + +P ++ +F G V P ++F +D C AF S
Sbjct: 349 RIGSSLCYNATA--DLNIPVITMHFEGADVKLYPYNSFFKVTEDL--VCLAFGMSFYRNG 404
Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
I GN+ Q+ + +D A+ + F P C
Sbjct: 405 IYGNVAQKNFLVGYDTASKTMSFKPTDC 432
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 135/416 (32%), Positives = 190/416 (45%), Gaps = 33/416 (7%)
Query: 106 HSFHARMQRDVKRVATLVR----RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIG 161
H+ H ++ + L R RL + AA V V SG Q Y VR G
Sbjct: 29 HNVHPSSPSPLESIIALARDDDARLLFLSSKAATAGVSS--APVASG--QAPPSYVVRAG 84
Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
+GSP + + +D+ +D W C PC C S +F PA+S+S++ + CSS+ C +
Sbjct: 85 LGSPSQQLLLALDTSADATWAHCSPCGTCPSSS--LFAPANSSSYASLPCSSSWCPLFQG 142
Query: 222 AGCHAGR--------------CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK 267
C A + C + + D S+ + LA +TL +G+ + N GC
Sbjct: 143 QACPAPQGGGDAAPPPATLPTCAFSKPFADASF-QAALASDTLRLGKDAIPNYTFGCVSS 201
Query: 268 NQGMFVGAA--GLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREA-LP 323
G GLLGLG G M+L+ Q G G FSYCL S R SGSL G P
Sbjct: 202 VTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYRSYYFSGSLRLGAGGGQP 261
Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
+ P++RNP S YYV ++GL VG + + F G V+D+GT +TR
Sbjct: 262 RSVRYTPMLRNPHRSSLYYVNVTGLSVGHAWVKVPAGSFAFDAATGAGTVVDSGTVITRW 321
Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPAS 443
P Y A R+ F Q + + FDTC+N + P V+ + GG L LP
Sbjct: 322 TAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTVHMDGGVDLALPME 381
Query: 444 NFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
N LI C A A +P S +++I N+QQ+ I++ FD AN VGF C
Sbjct: 382 NTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSRVGFAKESC 437
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 132/446 (29%), Positives = 201/446 (45%), Gaps = 46/446 (10%)
Query: 67 ISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRL 126
+SS S + ++++L+HRD S +++ + R+ R + R
Sbjct: 17 VSSREVSEGQRGFSIDLIHRDSPLSP--------FYKPSLTPSDRIINTALRSIYQLNRA 68
Query: 127 SGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP 186
S + K + GEY +R +G+PP + + D+ SD++WVQC P
Sbjct: 69 SHSDLNEKK--------TLERVRIPNHGEYLMRFYIGTPPVERLAIADTASDLIWVQCSP 120
Query: 187 CSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH--AGRCRYEVSYGDGSYTKGT 244
C C+ Q P+F+P S++F+ +SC S C C C Y +YGDGS TKG
Sbjct: 121 CETCFPQDTPLFEPHKSSTFANLSCDSQPCTSSNIYYCPLVGNLCLYTNTYGDGSSTKGV 180
Query: 245 LALETLTIGRTVV--KNVAIGCGHKNQGMFV---GAAGLLGLGGGSMSLVGQLGGQTGGA 299
L E++ G V GCG N M G++GLG G +SLV QLG Q G
Sbjct: 181 LCTESIHFGSQTVTFPKTIFGCGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQIGHK 240
Query: 300 FSYCLVSRGTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPI 357
FSYCL+ + S+ L FG + G V PL+ +P PS+Y++ L G+ +G + +
Sbjct: 241 FSYCLLPFTSTSTIKLKFGNDTTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQV 300
Query: 358 SEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF----RDAF-VAQTG-NLPRASGVSIF 411
R T + +++D GT +T L Y F R+A +++T ++P F
Sbjct: 301 -----RTTDHTNGNIIIDLGTVLTYLEVNFYHNFVTLLREALGISETKDDIPYP-----F 350
Query: 412 DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSII 469
D C+ ++ P + F F+G V P N DD C A P G S+
Sbjct: 351 DFCF--PNQANITFPKIVFQFTGAKVFLSP-KNLFFRFDDLNMICLAVLPDFYAKGFSVF 407
Query: 470 GNIQQEGIQISFDGANGFVGFGPNVC 495
GN+ Q Q+ +D V F P C
Sbjct: 408 GNLAQVDFQVEYDRKGKKVSFAPADC 433
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 131/360 (36%), Positives = 182/360 (50%), Gaps = 28/360 (7%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
+Y + + +G+PP Y +D+GSD++W+QC PC+ CYKQ +P+FDP S+++S ++ S
Sbjct: 58 DYLMELSIGTPPVKTYAQVDTGSDLIWLQCIPCTNCYKQLNPMFDPQSSSTYSNIAYGSE 117
Query: 215 VCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAI-----GCGHK 267
C +L + C + C Y SY D S T+G LA ETLT+ T K VA+ GCGH
Sbjct: 118 SCSKLYSTSCSPDQNNCNYTYSYEDDSITEGVLAQETLTLTSTTGKPVALKGVIFGCGHN 177
Query: 268 NQGMFVGAA-GLLGLGGGSMSLVGQLGGQTGGA-FSYCLVSRGTGSS--GSLVFGR--EA 321
N G+F G++GLG G +SLV Q+G GG FS CLV T S + FG+ E
Sbjct: 178 NNGVFNDKEMGIIGLGRGPLSLVSQIGSSFGGKMFSQCLVPFHTNPSITSPMSFGKGSEV 237
Query: 322 LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
L G PLV +FY+V L G+ V + +P + D L + +V+D+GT T
Sbjct: 238 LGNGVVSTPLVSKNTHQAFYFVTLLGISVEDINLPFN-DGSSLEPITKGNMVIDSGTPTT 296
Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSI-----FDTCYNLSGFVSVRVPTVSFYFSGGP 436
LP E F V + N + I + CY +++ T++ +F G
Sbjct: 297 LLP----EDFYHRLVEEVRNKVALDPIPIDPTLGYQLCYRTP--TNLKGTTLTAHFEGAD 350
Query: 437 VLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
VL P F IPV D G FCFAF + S I GN Q I FD V F C
Sbjct: 351 VLLTPTQIF-IPVQD-GIFCFAFTSTFSNEYGIYGNHAQSNYLIGFDLEKQLVSFKATDC 408
>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
distachyon]
Length = 836
Score = 195 bits (495), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 157/405 (38%), Positives = 207/405 (51%), Gaps = 29/405 (7%)
Query: 107 SFHARMQRDVKRVATLVRRLSG----GGAD--AAKHEVQDFGTDVVSGMDQGSGEYFVRI 160
SF ++ D +R + RR+SG GG A + G G+ +Y V +
Sbjct: 445 SFAEVLRADERRAEYIQRRMSGAKGPGGLQQFTAASSSKSVTIPANIGHSIGTLQYVVTV 504
Query: 161 GVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYK--QSDPVFDPADSASFSGVSCSSAVCDR 218
+G+P +Q + +D+GSD+ WVQC PC+ Q D +FDPA S+S+S V C++ C
Sbjct: 505 SLGTPGVAQTVEVDTGSDVSWVQCAPCAAPACYAQKDQLFDPAKSSSYSAVPCAADACSE 564
Query: 219 LEN--AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVG 274
L GC AG +C Y VSYGDGS T G +TLT+ V GCGH G+F G
Sbjct: 565 LSTYGHGCAAGSQCGYVVSYGDGSNTTGVYGSDTLTLTDADAVTGFLFGCGHAQAGLFAG 624
Query: 275 AAGLLGLGGGSMSLVGQLGGQT-GGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVR 333
GLL LG MSL Q G GG FSYCL + S+G L G + G A L+
Sbjct: 625 IDGLLALGRKGMSLTSQTSGAYGGGVFSYCLPPSPS-STGFLTLGGPSSASGFATTGLLT 683
Query: 334 NPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
P+FY V L+G+GVGG ++ + F G V+DTGT +TRLP AY A R
Sbjct: 684 AWDVPTFYMVMLTGIGVGGQQLSGVPASAFA------GGTVVDTGTVITRLPPTAYAALR 737
Query: 393 DAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
AF A P A I DTCYN + + +V +PTVS FSGG L L A FL
Sbjct: 738 AAFRAAMAPYGYPAAPATGILDTCYNFTDYGTVTLPTVSLTFSGGATLKLDAPGFL---- 793
Query: 451 DAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+G FA +I+GN+QQ + FDG++ VGF P+ C
Sbjct: 794 SSGCLAFATNSGDGDPAILGNVQQRSFAVRFDGSS--VGFMPHSC 836
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 194 bits (494), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 134/416 (32%), Positives = 190/416 (45%), Gaps = 33/416 (7%)
Query: 106 HSFHARMQRDVKRVATLVR----RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIG 161
H+ H ++ + L R RL + AA V V SG Q Y VR G
Sbjct: 31 HNVHPSSPSPLESIIALARDDDARLLFLSSKAATAGVSS--APVASG--QAPPSYVVRAG 86
Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
+GSP + + +D+ +D W C PC C S +F PA+S+S++ + CSS+ C +
Sbjct: 87 LGSPSQQLLLALDTSADATWAHCSPCGTCPSSS--LFAPANSSSYASLPCSSSWCPLFQG 144
Query: 222 AGCHAGR--------------CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK 267
C A + C + + D S+ + LA +TL +G+ + N GC
Sbjct: 145 QACPAPQGGGDAAPPPATLPTCAFSKPFADASF-QAALASDTLRLGKDAIPNYTFGCVSS 203
Query: 268 NQGMFVGAA--GLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREA-LP 323
G GLLGLG G M+L+ Q G G FSYCL S R SGSL G P
Sbjct: 204 VTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYRSYYFSGSLRLGAGGGQP 263
Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
+ P++RNP S YYV ++GL VG + + F G V+D+GT +TR
Sbjct: 264 RSVRYTPMLRNPHRSSLYYVNVTGLSVGRAWVKVPAGSFAFDAATGAGTVVDSGTVITRW 323
Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPAS 443
P Y A R+ F Q + + FDTC+N + P V+ + GG L LP
Sbjct: 324 TAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTVHMDGGVDLALPME 383
Query: 444 NFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
N LI C A A +P S +++I N+QQ+ I++ FD AN +GF C
Sbjct: 384 NTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSRIGFAKESC 439
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 194 bits (494), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 139/427 (32%), Positives = 204/427 (47%), Gaps = 54/427 (12%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+ LEL+HRD S F+ Q +R+A VRR K+ +
Sbjct: 29 FTLELIHRDSSKSP---------------FYQPTQNKYERIANAVRRSINRVNHFYKYSL 73
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
+ S ++ GEY + +G+PP + +D+GSD+VW+QC+PC QCY Q P+F
Sbjct: 74 T---STPQSTVNSDKGEYLMSYSIGTPPFKVFGFVDTGSDLVWLQCEPCKQCYPQITPIF 130
Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVK 258
DP+ S+S+ + C S C + C +G L++ETLT+ T
Sbjct: 131 DPSLSSSYQNIPCLSDTCHSMRTTSCDV---------------RGYLSVETLTLDSTTGY 175
Query: 259 NVA-----IGCGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS 312
+V+ IGCG++N G F G ++G++GLG G MSL QLG GG FSYCL S+
Sbjct: 176 SVSFPKTMIGCGYRNTGTFHGPSSGIVGLGSGPMSLPSQLGTSIGGKFSYCLGPWLPNST 235
Query: 313 GSLVFGREALPV--GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
L FG A+ GA P+V+ A S YY+ L VG I T G++
Sbjct: 236 SKLNFGDAAIVYGDGAMTTPIVKK-DAQSGYYLTLEAFSVGNKLIEFGGP----TYGGNE 290
Query: 371 G-VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLSGFVSVRVPTV 428
G +++D+GT T LP Y F A VA+ NL + F CYN++ + P +
Sbjct: 291 GNILIDSGTTFTFLPYDVYYRFESA-VAEYINLEHVEDPNGTFKLCYNVA-YHGFEAPLI 348
Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
+ +F G + S F I V D G C AF PS + +I GN+ Q+ + + ++ V
Sbjct: 349 TAHFKGADIKLYYISTF-IKVSD-GIACLAFIPSQT--AIFGNVAQQNLLVGYNLVQNTV 404
Query: 489 GFGPNVC 495
F P C
Sbjct: 405 TFKPVDC 411
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 194 bits (493), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 129/361 (35%), Positives = 181/361 (50%), Gaps = 35/361 (9%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
EY + + +G+PP Y D+GSD+VW QC PC++CYKQ +P+FDP S+S++ ++C +
Sbjct: 59 EYLMELSIGTPPIKIYAEADTGSDLVWFQCIPCTKCYKQQNPMFDPRSSSSYTNITCGTE 118
Query: 215 VCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA-----IGCGHK 267
C++L+++ C + C Y SY D S T+G LA ETLT+ T + VA GCGH
Sbjct: 119 SCNKLDSSLCSTDQKTCNYTYSYADNSITQGVLAQETLTLTSTTGEPVAFQGIIFGCGHN 178
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLG---GQTGGAFSYCLVSRGTGSS--GSLVFGR--E 320
N G GL+GLG G +SL+ Q+G G G FS CLV T S + FG+ E
Sbjct: 179 NSGFNDREMGLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPFNTDPSITSQMNFGKGSE 238
Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
L G PL+ + + Y+ L G+ V + +P S L + +++D+GT +
Sbjct: 239 VLGNGTVSTPLIS--KDGTGYFATLLGISVEDINLPFSNGS-SLGTITKGNILIDSGTTI 295
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCY----NLSGFVSVRVPTVSFYFSG 434
T LP E F + Q N I ++ CY NL+G PT++ +F G
Sbjct: 296 TYLP----EEFYHRLIEQVRNKVALEPFRIDGYELCYQTPTNLNG------PTLTIHFEG 345
Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNV 494
G VL PA F IPV D FCFA + GN Q I FD V F
Sbjct: 346 GDVLLTPAQMF-IPVQD-DNFCFAVFDTNEEYVTYGNYAQSNYLIGFDLERQVVSFKATD 403
Query: 495 C 495
C
Sbjct: 404 C 404
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 136/433 (31%), Positives = 208/433 (48%), Gaps = 41/433 (9%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+ +EL++RD S F+ + +R+ + VRR + +
Sbjct: 29 FTVELINRDSPKSP---------------FYNPRETPTQRIVSAVRRSMSRVHHFSPTKN 73
Query: 139 QDFGTDVV-SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
D TD S M GEY ++ +G+P + D+GSD++W QC+PC QCY+Q P+
Sbjct: 74 SDIFTDTAQSEMISNQGEYLMKFSLGTPAFDILAIADTGSDLIWTQCKPCDQCYEQDAPL 133
Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAGR----CRYEVSYGDGSYTKGTLALETLTIG 253
FDP S+++ +SCS+ CD L+ +G C Y SYGD S+T G +A +T+T+G
Sbjct: 134 FDPKSSSTYRDISCSTKQCDLLKEGASCSGEGNKTCHYSYSYGDRSFTSGNVAADTITLG 193
Query: 254 RT-----VVKNVAIGCGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV-- 305
T ++ IGCGH N G F +G++GLGGG +SL+ QLG G FSYCLV
Sbjct: 194 STSGRPVLLPKAIIGCGHNNGGSFTEKGSGIVGLGGGPISLISQLGSTIDGKFSYCLVPL 253
Query: 306 SRGTGSSGSLVFGREALPVGAAW--VPLV-RNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
S +S L FG + G PL+ ++P +FY++ L + VG RI F
Sbjct: 254 SSNATNSSKLNFGSNGIVSGGGVQSTPLISKDPD--TFYFLTLEAVSVGSERIKFPGSSF 311
Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS 422
++ +++D+GT +T P + A P I CY++
Sbjct: 312 GTSE---GNIIIDSGTTLTLFPEDFFSELSSAVQDAVAGTPVEDPSGILSLCYSIDA--D 366
Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
++ P+++ +F G V P + F + V D CFAF P SG +I GN+ Q + +D
Sbjct: 367 LKFPSITAHFDGADVKLNPLNTF-VQVSDT-VLCFAFNPINSG-AIFGNLAQMNFLVGYD 423
Query: 483 GANGFVGFGPNVC 495
V F P C
Sbjct: 424 LEGKTVSFKPTDC 436
>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 450
Score = 193 bits (491), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 150/445 (33%), Positives = 217/445 (48%), Gaps = 34/445 (7%)
Query: 66 NISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRR 125
NIS SN+ + +++E++HRD S +RH + +RVA +RR
Sbjct: 22 NISFSNSKVLNSGFSVEMIHRDSSRSP--------LYRH-------TETPFQRVANAMRR 66
Query: 126 LSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ 185
K S + GEY + VG+PP V+D+GS I W+QCQ
Sbjct: 67 SINRANHFNKKSFVASTNTAESTVKASQGEYLMSYSVGTPPFEILGVVDTGSGITWMQCQ 126
Query: 186 PCSQCYKQSDPVFDPADSASFSGVSCSSAVCDR-LENAGCHAGR--CRYEVSYGDGSYTK 242
C CY+Q+ P+FDP+ S ++ + CSS +C + C + + C+Y + YGDGS+++
Sbjct: 127 RCEDCYEQTTPIFDPSKSKTYKTLPCSSNMCQSVISTPSCSSDKIGCKYTIKYGDGSHSQ 186
Query: 243 GTLALETLTIGRT-----VVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQT 296
G L++ETLT+G T N IGCGH N+G F +G++GLGGG +SL+ QL
Sbjct: 187 GDLSVETLTLGSTNGSSVQFPNTVIGCGHNNKGTFQGEGSGVVGLGGGPVSLISQLSSSI 246
Query: 297 GGAFSYCLVS--RGTGSSGSLVFGREALP--VGAAWVPLVRNPRAPSFYYVGLSGLGVGG 352
GG FSYCL + SS L FG A+ +GA PLV + FYY+ L VG
Sbjct: 247 GGKFSYCLAPMFSQSNSSSKLNFGDAAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGD 306
Query: 353 MRIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF 411
RI + + G+ +++D+GT +T LP Y A VA R S S F
Sbjct: 307 KRIEFVGGSSSSGSSNGEGNIIIDSGTTLTLLPQEDYSNLESA-VADAIQANRVSDPSNF 365
Query: 412 -DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIG 470
CY + + VP ++ +F G V P S F+ + G CFAF S +SI G
Sbjct: 366 LSLCYQTTPSGQLDVPVITAHFKGADVELNPISTFVQVAE--GVVCFAFH-SSEVVSIFG 422
Query: 471 NIQQEGIQISFDGANGFVGFGPNVC 495
N+ Q + + +D V F P C
Sbjct: 423 NLAQLNLLVGYDLMEQTVSFKPTDC 447
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 193 bits (491), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 135/357 (37%), Positives = 182/357 (50%), Gaps = 15/357 (4%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSAS 205
+G + + E+ V +G GSP ++ + D+GSD+ W+QCQPCS CYKQ DPVFDPA S+S
Sbjct: 103 TGTNLKTPEFVVVVGFGSPAQTSATMFDTGSDLSWIQCQPCSGHCYKQHDPVFDPAKSSS 162
Query: 206 FSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV-VKNVAIGC 264
++ V C + C C+ C Y V YGDGS T G LA ETLT + GC
Sbjct: 163 YAVVPCGTTEC-AAAGGECNGTTCVYGVEYGDGSSTTGVLARETLTFSSSSEFTGFIFGC 221
Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP- 323
G N G F GLLGLG GS+SL Q GG FSYCL S T + G L G +
Sbjct: 222 GETNLGDFGEVDGLLGLGRGSLSLSSQAAPAFGGIFSYCLPSYNT-TPGYLSIGATPVTG 280
Query: 324 -VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
+ + +V P PSFY++ L + +GG +P+ F T G ++D+GT +T
Sbjct: 281 QIPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEFTKT-----GTLLDSGTILTY 335
Query: 383 LPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPA 442
LP PAY A RD F A DTCY+ +G + +P VSF FS G V L
Sbjct: 336 LPPPAYTALRDRFKFTMQGSKPAPPYDELDTCYDFTGQSGILIPGVSFNFSDGAVFNLNF 395
Query: 443 SNFLIPVDDA--GTFCFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ DD C AF P+ + S++G+ Q ++ +D +GF P C
Sbjct: 396 FGIMTFPDDTKPAVGCLAFVSRPADMPFSVVGSTTQRSAEVIYDVPAQKIGFIPASC 452
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 193 bits (491), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 127/362 (35%), Positives = 180/362 (49%), Gaps = 18/362 (4%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
V SG G Y VR +G+PP+ +MV+D+ +D VW+ C CS C + F+ S+
Sbjct: 94 VASGNQLHIGNYVVRARLGTPPQLMFMVLDTSNDAVWLPCSGCSGC-SNASTSFNTNSSS 152
Query: 205 SFSGVSCSSAVCDRLENAGCHAGR-----CRYEVSYGDGSYTKGTLALETLTIGRTVVKN 259
++S VSCS+ C + C + C + SYG S L +TLT+ V+ N
Sbjct: 153 TYSTVSCSTTQCTQARGLTCPSSTPQPSICSFNQSYGGDSSFSANLVQDTLTLSPDVIPN 212
Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFG 318
+ GC + G + GL+GLG G MSLV Q G FSYCL S R SGSL G
Sbjct: 213 FSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLG 272
Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
P + PL+RNPR PS YYV L+G+ VG +++P+ G ++D+GT
Sbjct: 273 LLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDSNSGAGTIIDSGT 332
Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
+TR P YEA RD F Q S + FDTC++ P ++ + + L
Sbjct: 333 VITRFAQPVYEAIRDEFRKQVNG--SFSTLGAFDTCFSADN--ENVTPKITLHMTSLD-L 387
Query: 439 TLPASNFLIPVDDAGTF-CFAFA----PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
LP N LI AGT C + A + + L++I N+QQ+ ++I FD N +G P
Sbjct: 388 KLPMENTLI-HSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPE 446
Query: 494 VC 495
C
Sbjct: 447 PC 448
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 134/423 (31%), Positives = 194/423 (45%), Gaps = 32/423 (7%)
Query: 80 NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
NL++ H S + + + A+ Q ++ +++LV R S + + VQ
Sbjct: 33 NLQVFHVYSPCSPFWPSKPLKWEESVLQMQAKDQARLQFLSSLVARKSVVPIASGRQIVQ 92
Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
S Y VR +G+P ++ + +D+ +D W+ PCS C S VF+
Sbjct: 93 -------------SPTYIVRAKIGTPAQTMLLAMDTSNDAAWI---PCSGCVGCSSTVFN 136
Query: 200 PADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKN 259
S +F V C + C ++ N+ C C + ++YG S L+ + +T+ + +
Sbjct: 137 NVKSTTFKTVGCEAPQCKQVPNSKCGGSACAFNMTYGSSSIA-ANLSQDVVTLATDSIPS 195
Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFG 318
GC + G + GLLGLG G MSL+ Q FSYCL S R SGSL G
Sbjct: 196 YTFGCLTEATGSSIPPQGLLGLGRGPMSLLSQTQNLYQSTFSYCLPSFRSLNFSGSLRLG 255
Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
P PL++NPR S YYV L + VG + I G + D+GT
Sbjct: 256 PVGQPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTIFDSGT 315
Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSVRVPTVSFYFSGGP 436
TRL PAY A RDAF + GN A+ S+ FDTCY + PT++F FSG
Sbjct: 316 VFTRLVAPAYTAVRDAFRKRVGN---ATVTSLGGFDTCYT----SPIVAPTITFMFSGMN 368
Query: 437 VLTLPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGP 492
V TLP N LI + C A A +P S L++I N+QQ+ +I FD N +G
Sbjct: 369 V-TLPPDNLLIHSTASSITCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRLGVAR 427
Query: 493 NVC 495
C
Sbjct: 428 EPC 430
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 137/385 (35%), Positives = 191/385 (49%), Gaps = 38/385 (9%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ----PCSQCYKQS---DPVFD 199
SG G G+Y V + G+PP+ ++ D+GSD++W+QC P + C K++ P F
Sbjct: 44 SGAFLGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFV 103
Query: 200 PADSASFSGVSCSSAVC-----DRLENAGCHAGR---CRYEVSYGDGSYTKGTLALETLT 251
+ SA+ S V CS+A C R C C Y Y DGS T G LA +T T
Sbjct: 104 ASKSATLSVVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLARDTAT 163
Query: 252 I-----GRTVVKNVAIGCGHKNQG-MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
I G V+ VA GCG +NQG F G G++GLG G +S Q G FSYCL+
Sbjct: 164 ISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLL 223
Query: 306 SRGTG----SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
G SS L GR A+ PLV NP AP+FYYVG+ + VG +P+
Sbjct: 224 DLEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSE 283
Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF----DTCYNL 417
+ + +G+ G V+D+G+ +T L AY AF A +LPR + F + CYN+
Sbjct: 284 WAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASV-HLPRIPSSATFFQGLELCYNV 342
Query: 418 SGFVSVR-----VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP--SPSGLSIIG 470
S S P ++ F+ G L LP N+L+ V D C A P SP +++G
Sbjct: 343 SSSSSSAPANGGFPRLTIDFAQGLSLELPTGNYLVDVAD-DVKCLAIRPTLSPFAFNVLG 401
Query: 471 NIQQEGIQISFDGANGFVGFGPNVC 495
N+ Q+G + FD A+ +GF C
Sbjct: 402 NLMQQGYHVEFDRASARIGFARTEC 426
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 148/433 (34%), Positives = 214/433 (49%), Gaps = 39/433 (9%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+++E++HRD S Y+R + +RVA +RR K +
Sbjct: 32 FSVEIIHRDSSRSP--------YYRP-------TETQFQRVANALRRSINRANHFNKPNL 76
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
S + GEY + VG+PP ++D+GSDI+W+QCQPC CY Q+ P+F
Sbjct: 77 VASTNTAESTVIASQGEYLMSYSVGTPPFQILGIVDTGSDIIWLQCQPCEDCYNQTTPIF 136
Query: 199 DPADSASFSGVSCSSAVCDRLENAG-CHAG--RCRYEVSYGDGSYTKGTLALETLTIGRT 255
DP+ S ++ + CSS +C +++A C + C Y ++YGD S+++G L++ETLT+G T
Sbjct: 137 DPSQSKTYKTLPCSSNICQSVQSAASCSSNNDECEYTITYGDNSHSQGDLSVETLTLGST 196
Query: 256 VVKNV-----AIGCGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--R 307
+V IGCGH N+G F +G++GLGGG +SL+ QL GG FSYCL
Sbjct: 197 DGSSVQFPKTVIGCGHNNKGTFQREGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPLFS 256
Query: 308 GTGSSGSLVFGREALPVGAAWVPLVRNPRAPS----FYYVGLSGLGVGGMRIPISEDLFR 363
+ SS L FG EA+ G V P P FY++ L VG RI F
Sbjct: 257 QSNSSSKLNFGDEAVVSGRG---TVSTPIVPKNGLGFYFLTLEAFSVGDNRIEFGSSSFE 313
Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF-DTCYNLSGFVS 422
+ + +++D+GT +T LP Y A VA L R S F CY +
Sbjct: 314 SSGGEGN-IIIDSGTTLTILPEDDYLNLESA-VADAIELERVEDPSKFLRLCYRTTSSDE 371
Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
+ VP ++ +F G V P S F I VD+ G CFAF S G I GN+ Q+ + + +D
Sbjct: 372 LNVPVITAHFKGADVELNPISTF-IEVDE-GVVCFAFRSSKIG-PIFGNLAQQNLLVGYD 428
Query: 483 GANGFVGFGPNVC 495
V F P C
Sbjct: 429 LVKQTVSFKPTDC 441
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 141/418 (33%), Positives = 205/418 (49%), Gaps = 27/418 (6%)
Query: 95 TTNNMHYHRHQHSFHARMQRDVKRVATLVRR--LSGGGADAAKHEVQDFGTDVVSGMDQG 152
TT+ + F+ + +R+ RR L G A + D +DV+SG
Sbjct: 35 TTDFISRDSPHSPFYNPSETKYQRLQKAFRRSILRGNHFRAMRASPNDIQSDVISG---- 90
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
G Y + I +G+PP + D+GSD++W QC PC CY+Q +P+FDP +S ++ + C
Sbjct: 91 GGAYLMNISLGTPPVPMLGIADTGSDLIWRQCLPCPNCYEQVEPLFDPKESETYKTLDCD 150
Query: 213 SAVCDRLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCG 265
+ C L G C Y SYGD SYT+G L+ +TLTIG T +A GCG
Sbjct: 151 NEFCQDLGQQGSCDDDNTCTYSYSYGDRSYTRGDLSSDTLTIGSTEGDPASFPGIAFGCG 210
Query: 266 HKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS--SGSLVFGREAL 322
H N G F GL+GLGGG +SLV QL + GG FSYCLV + S S + FG+ +
Sbjct: 211 HDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVGGQFSYCLVPLSSDSTVSSKINFGKSGV 270
Query: 323 PVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIP---ISEDLFRLTQMGDDGVVMDTG 377
G+ V PL++ +FYY+ L GL VG + SE+ + + +++D+G
Sbjct: 271 VSGSGTVSTPLIKG-TPDTFYYLTLEGLSVGSETVAFKGFSENKSSPAAVEEGNIIIDSG 329
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPV 437
T +T LP Y A G IF CY S ++ +PT++ +F+G V
Sbjct: 330 TTLTLLPQDFYTDVESALTNAIGGQTTTDPNGIFSLCY--SSVNNLEIPTITAHFTGADV 387
Query: 438 LTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
P + F+ +D CF+ PS S L+I GN+ Q + +D N V F C
Sbjct: 388 QLPPLNTFVQVQEDL--VCFSMIPS-SNLAIFGNLAQINFLVGYDLKNNKVSFKQTDC 442
>gi|194699094|gb|ACF83631.1| unknown [Zea mays]
gi|413938606|gb|AFW73157.1| hypothetical protein ZEAMMB73_333672 [Zea mays]
Length = 452
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 151/429 (35%), Positives = 210/429 (48%), Gaps = 58/429 (13%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
L L HR + S ++ S ++ D +R ++RR+SG +
Sbjct: 68 LRLTHRHGPCAPSRASS-----LAAPSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAA 122
Query: 141 FGTDVVS--GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS---QCYKQSD 195
V + G D G+ Y V +G+P +Q M +D+GSD+ WVQC+PCS CY Q D
Sbjct: 123 AAATVPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKD 182
Query: 196 PVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT 255
P+FDPA S+S++ V C VC L G +A G
Sbjct: 183 PLFDPAQSSSYAAVPCGGPVCAGL---GIYAASACSAAQCG------------------- 220
Query: 256 VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSL 315
V+ GCGH G+F G GLLGLG SLV Q G GG FSYCL ++ + ++G L
Sbjct: 221 AVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPS-TAGYL 279
Query: 316 VFGREALPVGAA----WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
G P GAA L+ +P AP++Y V L+G+ VGG ++ + F +
Sbjct: 280 TLGVGG-PSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV---- 334
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGN--LPRASGVSIFDTCYNLSGFVSVRVPTVS 429
+DTGT VTRLP AY A R AF + + P A I DTCYN +G+ +V +P V+
Sbjct: 335 --VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVA 392
Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTF-CFAFAPSPS--GLSIIGNIQQEGIQISFDGANG 486
F G +TL A L +F C AFAPS S G++I+GN+QQ ++ DG +
Sbjct: 393 LTFGSGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRIDGTS- 444
Query: 487 FVGFGPNVC 495
VGF P+ C
Sbjct: 445 -VGFKPSSC 452
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 192 bits (489), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 121/350 (34%), Positives = 170/350 (48%), Gaps = 16/350 (4%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
Y VR+ +G+P + +MV+D+ +D WV PCS C S F P S + + CS
Sbjct: 96 ANYVVRVKLGTPGQQMFMVLDTSNDAAWV---PCSGCTGFSSTTFLPNASTTLGSLDCSG 152
Query: 214 AVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
A C ++ C A C + SYG S TL + +T+ V+ GC + G
Sbjct: 153 AQCSQVRGFSCPATGSSACLFNQSYGGDSSLTATLVQDAITLANDVIPGFTFGCINAVSG 212
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV 329
+ GLLGLG G +SL+ Q G G FSYCL S + SGSL G P
Sbjct: 213 GSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTT 272
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
PL+RNP PS YYV L+G+ VG +++PI + G ++D+GT +TR P Y
Sbjct: 273 PLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYF 332
Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
A RD F Q N P S + FDTC+ + P ++ +F G L LP N LI
Sbjct: 333 AIRDEFRKQV-NGP-ISSLGAFDTCFAATN--EAEAPAITLHFEGL-NLVLPMENSLIHS 387
Query: 450 DDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C + A +P S L++I N+QQ+ ++I FD N +G +C
Sbjct: 388 SSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRLGIARELC 437
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 192 bits (489), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 120/350 (34%), Positives = 170/350 (48%), Gaps = 16/350 (4%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
Y VR+ +G+P + +MV+D+ +D WV C C+ C S F P S + + CS
Sbjct: 96 ANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGC---SSTTFLPNASTTLGSLDCSG 152
Query: 214 AVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
A C ++ C A C + SYG S TL + +T+ V+ GC + G
Sbjct: 153 AQCSQVRGFSCPATGSSACLFNQSYGGDSSLTATLVQDAITLANDVIPGFTFGCINAVSG 212
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV 329
+ GLLGLG G +SL+ Q G G FSYCL S + SGSL G P
Sbjct: 213 GSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTT 272
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
PL+RNP PS YYV L+G+ VG +++PI + G ++D+GT +TR P Y
Sbjct: 273 PLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYF 332
Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
A RD F Q N P S + FDTC+ + P ++ +F G L LP N LI
Sbjct: 333 AIRDEFRKQV-NGP-ISSLGAFDTCFAATN--EAEAPAITLHFEGL-NLVLPMENSLIHS 387
Query: 450 DDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C + A +P S L++I N+QQ+ ++I FD N +G +C
Sbjct: 388 SSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRLGIARELC 437
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 192 bits (488), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 122/379 (32%), Positives = 178/379 (46%), Gaps = 49/379 (12%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+ EY V + VG+PPR + +D+GSD+VW QC PC C+ Q P+ DPA S++++ + C
Sbjct: 89 TNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQGLPLLDPAASSTYAALPCG 148
Query: 213 SAVCDRLENAGCHAG----------RCRYEVSYGDGSYTKGTLALETLTIG--------R 254
+ C L C G C Y YGD S T G +A + T G R
Sbjct: 149 APRCRALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGDSR 208
Query: 255 TVVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC---------- 303
+ + GCGH N+G+F G+ G G G SL QL T FSYC
Sbjct: 209 LPTRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNVTT---FSYCFTSMFESKSS 265
Query: 304 LVSRGTGSSGSLVFGREALPVGAA-WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
LV+ G + +L++ A G PL++NP PS Y++ L G+ VG R+ + E
Sbjct: 266 LVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGKTRLAVPEAKL 325
Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV---SIFDTCYNLSG 419
R T ++D+G ++T LP YEA + F AQ G P +GV S D C+ L
Sbjct: 326 RST-------IIDSGASITTLPEAVYEAVKAEFAAQVGLPP--TGVVEGSALDLCFALPV 376
Query: 420 FVSVR---VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEG 476
R VP+++ + G LP N++ A C +P ++IGN QQ+
Sbjct: 377 TALWRRPPVPSLTLHLDGAD-WELPRGNYVFEDLAARVMCVVLDAAPGDQTVIGNFQQQN 435
Query: 477 IQISFDGANGFVGFGPNVC 495
+ +D N ++ F P C
Sbjct: 436 THVVYDLENDWLSFAPARC 454
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 192 bits (487), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 150/477 (31%), Positives = 236/477 (49%), Gaps = 60/477 (12%)
Query: 54 MSQYNELFERHNNISSSNTSSDEARWNLELVHRD-KMSSSSNTTNNMHYHRHQHSFHARM 112
M+ ++ L I +S+ ++ R L +H D ++++S + H+H+ AR
Sbjct: 1 MASFSVLLILACTILASDAAA-AVRVGLTRIHADPEVTASEFVRGALRRDMHRHARFARE 59
Query: 113 QRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMV 172
Q L+ A AA V G + G GEY + + +G+PP S +
Sbjct: 60 Q------------LAPSSAAAAGLTV---GAPTQKDLRNG-GEYIMTLSIGTPPLSYRAI 103
Query: 173 IDSGSDIVWVQCQPC--------SQCYKQSDPVFDPADSASFSGVSCSS--AVCDRLENA 222
D+GSD++W QC PC +QC+KQS +++P+ S +F + C+S ++C +
Sbjct: 104 ADTGSDLIWTQCAPCGDTVTDTDNQCFKQSGCLYNPSSSTTFGVLPCNSPLSMCAAMAGP 163
Query: 223 GCHAG-RCRYEVSYGDGSYTKGTLALETLTIGRTV------VKNVAIGCGHKNQGMFVGA 275
G C Y +YG G +T G ++ET T G + V N+A GC + + + G+
Sbjct: 164 SPPPGCACMYNQTYGTG-WTAGVQSVETFTFGSSSTPPAVRVPNIAFGCSNASSNDWNGS 222
Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREAL-------PVGAA 327
AGL+GLG GSMSLV QLG GAFSYCL + S+ +L+ G A PV +
Sbjct: 223 AGLVGLGRGSMSLVSQLG---AGAFSYCLTPFQDANSTSTLLLGPSAAAALKGTGPVRS- 278
Query: 328 WVPLVRNP-RAP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
P V P +AP ++YY+ L+G+ VG + I D F L G G+++D+GT +T L
Sbjct: 279 -TPFVAGPSKAPMSTYYYLNLTGISVGETALAIPPDAFSLRADGTGGLIIDSGTTITTLV 337
Query: 385 TPAYEAFRDAFVA-QTGNLPRASGV---SIFDTCYNLSGFV-SVRVPTVSFYFSGGPVLT 439
AY+ R A + LP A G + D C+ L +P+++ +F GG +
Sbjct: 338 DSAYQQVRAAVRSLLVTRLPLAHGPDHSTGLDLCFALKASTPPPAMPSMTLHFEGGADMV 397
Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
LP N++I +G +C A G +S++GN QQ+ I + +D + F P VC
Sbjct: 398 LPVENYMI--LGSGVWCLAMRNQTVGAMSMVGNYQQQNIHVLYDVRKETLSFAPAVC 452
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 130/356 (36%), Positives = 183/356 (51%), Gaps = 25/356 (7%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--SQCYKQSDPVFDPADSASFSGVS 210
S EY +G+G+P Q +++D+GS + WVQC+PC SQCY Q P+FDP S+S+S V
Sbjct: 126 SQEYVATVGLGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTSSSYSPVP 185
Query: 211 CSSAVCDRL----ENAGCHAGR---CRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAI 262
C S C L + GC + C YE+ YG G+ G + + LT+G +VK
Sbjct: 186 CDSQECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALTLGPGAIVKRFHF 245
Query: 263 GCGHKNQ-GMFVGAAGLLGLGGGSMSLVGQLGGQTGG-AFSYCLVSRGTGSSGSLVFGRE 320
GCGH Q G F A G+LGLG SL Q + GG FS+CL G S+G L G
Sbjct: 246 GCGHHQQRGKFDMADGVLGLGRLPQSLAWQASARRGGGVFSHCLPPTGV-STGFLALGAP 304
Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
+ PL+ P FY + + + V G + I +FR +GV+ D+GT +
Sbjct: 305 HDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVFR------EGVITDSGTVL 358
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
+ L AY A R AF + P A V DTC+N +G+ +V VPTVS F GG + L
Sbjct: 359 SALQETAYTALRTAFRSAMAEYPLAPPVGHLDTCFNFTGYDNVTVPTVSLTFRGGATVHL 418
Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGLS-IIGNIQQEGIQISFDGANGFVGFGPNVC 495
AS+ ++ +D C AF S + +IG++ Q I++ +D VGF C
Sbjct: 419 DASSGVL-MDG----CLAFWSSGDEYTGLIGSVSQRTIEVLYDMPGRKVGFRTGAC 469
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 121/349 (34%), Positives = 176/349 (50%), Gaps = 15/349 (4%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
S Y VR +G+P ++ + +D+ +D W+ C C C S +FDP+ S+S + C
Sbjct: 85 SPTYIVRANIGTPAQAMLVALDTSNDAAWIPCSGCVGC--SSSVLFDPSKSSSSRTLQCE 142
Query: 213 SAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
+ C + N C + C + ++YG GS + L +TLT+ V+ N GC +K G
Sbjct: 143 APQCKQAPNPSCTVSKSCGFNMTYG-GSAIEAYLTQDTLTLATDVIPNYTFGCINKASGT 201
Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV-SRGTGSSGSLVFGREALPVGAAWVP 330
+ A GL+GLG G +SL+ Q FSYCL S+ + SGSL G + P+ P
Sbjct: 202 SLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRLGPKNQPIRIKTTP 261
Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
L++NPR S YYV L G+ VG + I G + D+GT TRL PAY A
Sbjct: 262 LLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEPAYVA 321
Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
R+ F + N A+ + FDTCY SG SV P+V+F F+G V TLP N LI
Sbjct: 322 MRNEFRRRVKNA-NATSLGGFDTCY--SG--SVVFPSVTFMFAGMNV-TLPPDNLLIHSS 375
Query: 451 DAGTFCFAFAPSPSG----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C A A +P+ L++I ++QQ+ ++ D N +G C
Sbjct: 376 AGNLSCLAMAAAPTNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETC 424
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 191 bits (486), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 135/404 (33%), Positives = 210/404 (51%), Gaps = 36/404 (8%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
++RD+ R A R L+ G + D+ +G GEY + + +G+PP S
Sbjct: 52 LRRDMHRHARFTRELASSGDRTVAAPTRK---DLPNG-----GEYIMTLAIGTPPLSYPA 103
Query: 172 VIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAV--CDRLENAGCHAG- 227
+ D+GSD++W QC PC SQC+KQ+ ++P+ S +F + C+S+V C L G
Sbjct: 104 IADTGSDLIWTQCAPCGSQCFKQAGQPYNPSSSTTFGVLPCNSSVSMCAALAGPSPPPGC 163
Query: 228 RCRYEVSYGDGSYTKGTLALETLTIG-----RTVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
C Y +YG G +T G ++ET T G +T V +A GC + + + G+AGL+GLG
Sbjct: 164 SCMYNQTYGTG-WTAGIQSVETFTFGSTPADQTRVPGIAFGCSNASSDDWNGSAGLVGLG 222
Query: 283 GGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALP--VGAAWVPLVRNP-RAP 338
GSMSLV QLG G FSYCL + S+ +L+ G A G P V +P +AP
Sbjct: 223 RGSMSLVSQLG---AGMFSYCLTPFQDANSTSTLLLGPSAALNGTGVLTTPFVASPSKAP 279
Query: 339 --SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV 396
++YY+ L+G+ +G + I + F L G G+++D+GT +T L AY+ R A +
Sbjct: 280 MSTYYYLNLTGISIGTTALSIPPNAFALRTDGTGGLIIDSGTTITSLVDAAYQQVRAA-I 338
Query: 397 AQTGNLPRASGVSI--FDTCYNLSGFVSV--RVPTVSFYFSGGPVLTLPASNFLIPVDDA 452
LP A G D C+ L+ S +P+++F+F G ++ LP N++I +
Sbjct: 339 ESLVTLPVADGSDSTGLDLCFALTSETSTPPSMPSMTFHFDGADMV-LPVDNYMI--LGS 395
Query: 453 GTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
G +C A G +S GN QQ+ + + +D + F P C
Sbjct: 396 GVWCLAMRNQTVGAMSTFGNYQQQNVHLLYDIHEETLSFAPAKC 439
>gi|358346726|ref|XP_003637416.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
gi|355503351|gb|AES84554.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
Length = 165
Score = 191 bits (485), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 93/165 (56%), Positives = 117/165 (70%)
Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
L RNP+ ++YYVGL G+ VGG + I E F + G+ G+++D+GTAVTRL + Y
Sbjct: 1 LRRNPQLDTYYYVGLVGISVGGELLAIPETSFEVDSAGNGGIIVDSGTAVTRLQSDVYNV 60
Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
RDAFV T +L + VS+FDTCY+LS SV VPTV+F+F G VL LPA N+L+PVD
Sbjct: 61 VRDAFVKGTKDLLATNEVSLFDTCYDLSSKTSVEVPTVAFHFGEGKVLVLPAKNYLVPVD 120
Query: 451 DAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
GTFCFAFAP+ S LSIIGNIQQ+G ++SFD AN VGF PN C
Sbjct: 121 SVGTFCFAFAPTMSSLSIIGNIQQQGTRVSFDLANSLVGFSPNRC 165
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 191 bits (485), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 128/384 (33%), Positives = 176/384 (45%), Gaps = 47/384 (12%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQ-SDPVFDPADSASFSGVSC 211
+ EY V + VG+PPR + +D+GSD+VW QC PC C+ Q + PV DPA S++ + V C
Sbjct: 91 TNEYLVHLSVGTPPRPVALTLDTGSDLVWTQCAPCLNCFDQGAIPVLDPAASSTHAAVRC 150
Query: 212 SSAVCDRLENAGCHAG-------RCRYEVSYGDGSYTKGTLALETLTIGR--------TV 256
+ VC L C G C Y YGD S T G LA + T G
Sbjct: 151 DAPVCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGDNADGGGVS 210
Query: 257 VKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSL 315
+ + GCGH N+G+F G+ G G G SL QLG + FSYC S +S +
Sbjct: 211 ERRLTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQLGVTS---FSYCFTSMFESTSSLV 267
Query: 316 VFGREA----LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
G L PL+R+P PS Y++ L + VG RIPI E R ++ +
Sbjct: 268 TLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPE---RRQRLREAS 324
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNL-------SGF---- 420
++D+G ++T LP YEA + FVAQ G A S D C+ L S F
Sbjct: 325 AIIDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSALDLCFALPSAAAPKSAFGWRW 384
Query: 421 ------VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG---LSIIGN 471
+ VRVP + F+ GG LP N++ A C + G +IGN
Sbjct: 385 RGRGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCLVLDAATGGGDQTVVIGN 444
Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
QQ+ + +D N + F P C
Sbjct: 445 YQQQNTHVVYDLENDVLSFAPARC 468
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 191 bits (484), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 122/349 (34%), Positives = 175/349 (50%), Gaps = 15/349 (4%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
S Y VR +G+P + + +D+ +D W+ C C C S +FDP+ S+S + C
Sbjct: 85 SPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGC--SSSVLFDPSKSSSSRTLQCE 142
Query: 213 SAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
+ C + N C + C + ++YG GS + L +TLT+ V+ N GC +K G
Sbjct: 143 APQCKQAPNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTLASDVIPNYTFGCINKASGT 201
Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV-SRGTGSSGSLVFGREALPVGAAWVP 330
+ A GL+GLG G +SL+ Q FSYCL S+ + SGSL G + P+ P
Sbjct: 202 SLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRLGPKNQPIRIKTTP 261
Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
L++NPR S YYV L G+ VG + I G + D+GT TRL PAY A
Sbjct: 262 LLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEPAYVA 321
Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
R+ F + N A+ + FDTCY SG SV P+V+F F+G V TLP N LI
Sbjct: 322 VRNEFRRRVKNA-NATSLGGFDTCY--SG--SVVFPSVTFMFAGMNV-TLPPDNLLIHSS 375
Query: 451 DAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C A A +P S L++I ++QQ+ ++ D N +G C
Sbjct: 376 AGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETC 424
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 142/435 (32%), Positives = 218/435 (50%), Gaps = 46/435 (10%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+++EL+HRD SS + +++QH A + R + RV S + A+ E
Sbjct: 28 FSIELIHRD---SSKSPFYKPTQNKYQHVVDA-VHRSINRV-----NHSNKNSLASTPE- 77
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
S + G+Y + VG+PP Y ++D+GSDIVW+QC+PC QCY Q+ P F
Sbjct: 78 --------STVISYEGDYIMSYSVGTPPIKSYGIVDTGSDIVWLQCEPCEQCYNQTTPKF 129
Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVV 257
+P+ S+S+ +SCSS +C + + C+ + C Y ++YG+ S+++G L+LETLT+ T
Sbjct: 130 NPSKSSSYKNISCSSKLCQSVRDTSCNDKKNCEYSINYGNQSHSQGDLSLETLTLESTTG 189
Query: 258 KNVA-----IGCGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG--- 308
+ V+ IGCG N G F ++G++GLGGG SL+ QLG GG FSYCLV
Sbjct: 190 RPVSFPKTVIGCGTNNIGSFKRVSSGVVGLGGGPASLITQLGPSIGGKFSYCLVRMSITL 249
Query: 309 ---TGSSGSLVFGREALPVG--AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
+ S L FG A+ G P+V+ + FYY+ + VG R+ F
Sbjct: 250 KNMSMGSSKLNFGDVAIVSGHNVLSTPIVKKDHS-FFYYLTIEAFSVGDKRVE-----FA 303
Query: 364 LTQMG--DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLSGF 420
+ G + +++D+ T VT +P+ Y A V L R + F CYN+S
Sbjct: 304 GSSKGVEEGNIIIDSSTIVTFVPSDVYTKLNSAIVDLV-TLERVDDPNQQFSLCYNVSSD 362
Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQIS 480
P ++ +F G +L + F+ D CFAFAPS G +I G+ Q+ +
Sbjct: 363 EEYDFPYMTAHFKGADILLYATNTFVEVARDV--LCFAFAPSNGG-AIFGSFSQQDFMVG 419
Query: 481 FDGANGFVGFGPNVC 495
+D V F C
Sbjct: 420 YDLQQKTVSFKSVDC 434
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 122/349 (34%), Positives = 175/349 (50%), Gaps = 15/349 (4%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
S Y VR +G+P + + +D+ +D W+ C C C S +FDP+ S+S + C
Sbjct: 85 SPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGC--SSSVLFDPSKSSSSRTLQCE 142
Query: 213 SAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
+ C + N C + C + ++YG GS + L +TLT+ V+ N GC +K G
Sbjct: 143 APQCKQAPNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTLASDVIPNYTFGCINKASGT 201
Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV-SRGTGSSGSLVFGREALPVGAAWVP 330
+ A GL+GLG G +SL+ Q FSYCL S+ + SGSL G + P+ P
Sbjct: 202 SLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRLGPKNQPIRIKTTP 261
Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
L++NPR S YYV L G+ VG + I G + D+GT TRL PAY A
Sbjct: 262 LLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEPAYVA 321
Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
R+ F + N A+ + FDTCY SG SV P+V+F F+G V TLP N LI
Sbjct: 322 VRNEFRRRVKNA-NATSLGGFDTCY--SG--SVVFPSVTFMFAGMNV-TLPPDNLLIHSS 375
Query: 451 DAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C A A +P S L++I ++QQ+ ++ D N +G C
Sbjct: 376 AGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETC 424
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 136/420 (32%), Positives = 206/420 (49%), Gaps = 33/420 (7%)
Query: 102 HRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGS----GEYF 157
H + + RD R +R G D + + G VS + GEY
Sbjct: 54 HSDPDTTAPQFVRDALRRDMHRQRSRSFGRDRDRELAESDGRTTVSARTRKDLPNGGEYL 113
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAV- 215
+ + +G+PP V D+GSD++W QC PC +QC++Q P+++PA S +FS + C+S++
Sbjct: 114 MTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNSSLS 173
Query: 216 -CDRLENAGCHAGRC--RYEVSYGDGSYTKGTLALETLTIGRTV-----VKNVAIGCGHK 267
C C Y +YG G +T G ET T G + V VA GC +
Sbjct: 174 MCAGALAGAAPPPGCACMYNQTYGTG-WTAGVQGSETFTFGSSAADQARVPGVAFGCSNA 232
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALP--V 324
+ + G+AGL+GLG GS+SLV QLG G FSYCL + T S+ +L+ G A
Sbjct: 233 SSSDWNGSAGLVGLGRGSLSLVSQLGA---GRFSYCLTPFQDTNSTSTLLLGPSAALNGT 289
Query: 325 GAAWVPLVRNP-RAP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
G P V +P RAP ++YY+ L+G+ +G +PIS F L G G+++D+GT +T
Sbjct: 290 GVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSGTTIT 349
Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSVR---VPTVSFYFSGGP 436
L AY+ R A + LP G D C+ L S +P+++ +F G
Sbjct: 350 SLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLHFDGAD 409
Query: 437 VLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
++ LPA +++I +G +C A G +S GN QQ+ + I +D + F P C
Sbjct: 410 MV-LPADSYMI--SGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVREETLSFAPAKC 466
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 137/397 (34%), Positives = 205/397 (51%), Gaps = 43/397 (10%)
Query: 138 VQDF-GTD------VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC 190
+QDF G D +VSG GSG+YFV + VG+P + +++D+GSD+ W+QC P +
Sbjct: 34 IQDFQGEDPALFSRLVSGSSIGSGQYFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTT 93
Query: 191 YKQSD---PVFDPADSASFSGVSCSSAVCDRLE---NAGC---HAGRCRYEVSYGDGSYT 241
S P +D + S+S+ + C+ C L + C C Y Y D S T
Sbjct: 94 ANSSSPPAPWYDKSSSSSYREIPCTDDECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRT 153
Query: 242 KGTLALETLTIG---------------RTVVKNVAIGCGHKNQGM-FVGAAGLLGLGGGS 285
G LA ET+++ R +KNVA+GC ++ G F+GA+G+LGLG G
Sbjct: 154 TGILAYETISMKSRKRSGKRAGNHKTRRIRIKNVALGCSRESVGASFLGASGVLGLGQGP 213
Query: 286 MSLVGQLGGQT-GGAFSYCLVS--RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYY 342
+SL Q GG FSYCLV RG+ +S LV GR A P+VRNP A SFYY
Sbjct: 214 ISLATQTRHTALGGIFSYCLVDYLRGSNASSFLVMGRTHW-RKLAHTPIVRNPAAQSFYY 272
Query: 343 VGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
V ++G+ V G + I+ + + G+ G + D+GT ++ L PAY A A
Sbjct: 273 VNVTGVAVDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASI-Y 331
Query: 402 LPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA 460
LPRA + F+ CYN++ + +P + F GG V+ LP +N+++ V + C A
Sbjct: 332 LPRAQEIPEGFELCYNVTR-MEKGMPKLGVEFQGGAVMELPWNNYMVLVAE-NVQCVALQ 389
Query: 461 P--SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ +G +I+GN+ Q+ I +D A +GF + C
Sbjct: 390 KVTTTNGSNILGNLLQQDHHIEYDLAKARIGFKWSPC 426
>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 445
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 139/418 (33%), Positives = 206/418 (49%), Gaps = 27/418 (6%)
Query: 95 TTNNMHYHRHQHSFHARMQRDVKRVATLVRR--LSGGGADAAKHEVQDFGTDVVSGMDQG 152
TT+ + + F+ + +R+ RR L G A + D ++V+SG
Sbjct: 35 TTDFISRDSPRSPFYNPSETKYQRLQKAFRRSILRGNHFRAIRASPNDIQSNVISG---- 90
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
G Y + I +G+PP S + D+GSD++W QC PC CYKQ +P+FDP S ++ + C+
Sbjct: 91 GGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKKSKTYKTLGCN 150
Query: 213 SAVCDRLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCG 265
+ C L G C SYGD SYT+ L+ ET TIG T +A GCG
Sbjct: 151 NDFCQDLGQQGSCGDDNTCTSSYSYGDQSYTRRDLSSETFTIGSTEGDPASFPGLAFGCG 210
Query: 266 HKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS--LVFGREAL 322
H N G F +GL+GLGGG +SLV QL + GG FSYCLV + S+ S + FG+ A+
Sbjct: 211 HSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLVPLSSDSTASSKINFGKSAV 270
Query: 323 PVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIP---ISEDLFRLTQMGDDGVVMDTG 377
G+ V PL++ +FYY+ L G+ +G ++ S++ + +++D+G
Sbjct: 271 VSGSGTVSTPLIKG-TPDTFYYLTLEGMSLGSEKVAFKGFSKNKSSPAAAEESNIIIDSG 329
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPV 437
T +T LP Y A G F CY SG + +PT++ +F G V
Sbjct: 330 TTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCY--SGVKKLEIPTITAHFIGADV 387
Query: 438 LTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
P + F+ +D CF+ PS S L+I GN+ Q + +D N V F P C
Sbjct: 388 QLPPLNTFVQAQEDL--VCFSMIPS-SNLAIFGNLSQMNFLVGYDLKNNKVSFKPTDC 442
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 118/350 (33%), Positives = 175/350 (50%), Gaps = 16/350 (4%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
G Y VR+ +G+P + +MV+D+ D WV C C+ C S P F P S++++ + CS
Sbjct: 97 GNYVVRVKLGTPGQLMFMVLDTSRDAAWVPCADCAGC---SSPTFSPNTSSTYASLQCSV 153
Query: 214 AVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
C ++ C C + +YG S L+ ++L + + + + GC + G
Sbjct: 154 PQCTQVRGLSCPTTGTAACFFNQTYGGDSSFSAMLSQDSLGLAVDTLPSYSFGCVNAVSG 213
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV 329
+ GLLGLG G MSL+ Q G G FSYC S + SGSL G P
Sbjct: 214 STLPPQGLLGLGRGPMSLLSQSGSLYSGVFSYCFPSFKSYYFSGSLRLGPLGQPKNIRTT 273
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
PL+RNP P+ YYV L+G+ VG + +P++ +L G ++D+GT +TR P Y
Sbjct: 274 PLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTGAGTIIDSGTVITRFVEPVYA 333
Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
A RD F Q P A+ + FDTC+ + P V+F+F+G L LP N LI
Sbjct: 334 AIRDEFRKQVKG-PFAT-IGAFDTCFAATN--EDIAPPVTFHFTGMD-LKLPLENTLIHS 388
Query: 450 DDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C A A +P S L++I N+QQ+ ++I FD N +G +C
Sbjct: 389 SAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFDVTNSRLGIARELC 438
>gi|413938616|gb|AFW73167.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 452
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 150/429 (34%), Positives = 209/429 (48%), Gaps = 58/429 (13%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADA--AKHEV 138
L L HR + S ++ S ++ D +R ++RR+SG +K
Sbjct: 68 LRLTHRHGPCAPSRASS-----LAAPSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAA 122
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS---QCYKQSD 195
G D G+ Y V +G+P +Q M +D+GSD+ WVQC+PC+ CY Q D
Sbjct: 123 AVATVPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKD 182
Query: 196 PVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT 255
P+FDPA S+S++ V C VC L G +A G
Sbjct: 183 PLFDPAQSSSYAAVPCGGPVCAGL---GIYAASACSAAQCG------------------- 220
Query: 256 VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSL 315
V+ GCGH G+F G GLLGLG SLV Q G GG FSYCL ++ + ++G L
Sbjct: 221 AVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPS-TAGYL 279
Query: 316 VFGREALPVGAA----WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
G P GAA L+ +P AP++Y V L+G+ VGG ++ + F +
Sbjct: 280 TLGVGG-PSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV---- 334
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGN--LPRASGVSIFDTCYNLSGFVSVRVPTVS 429
+DTGT VTRLP AY A R AF + + P A I DTCYN +G+ +V +P V+
Sbjct: 335 --VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVA 392
Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTF-CFAFAPSPS--GLSIIGNIQQEGIQISFDGANG 486
F G +TL A L +F C AFAPS S G++I+GN+QQ ++ DG +
Sbjct: 393 LTFGSGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRIDGTS- 444
Query: 487 FVGFGPNVC 495
VGF P+ C
Sbjct: 445 -VGFKPSSC 452
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 128/382 (33%), Positives = 183/382 (47%), Gaps = 32/382 (8%)
Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSP-PRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
+G V + SGEY + +G+P P+ + +D+GSD+VW QC PC C+ Q P+FD
Sbjct: 72 YGQPVTATAVPSSGEYLIHFNIGTPRPQRVALTMDTGSDLVWTQCTPCPVCFDQPFPLFD 131
Query: 200 PADSASFSGVSCSSAVCDR---LENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTI-- 252
P+ S++F V+C +C L + C RC Y SYGD S T G + +T T
Sbjct: 132 PSVSSTFRAVACPDPICRPSSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFTFMS 191
Query: 253 ------GRTVVKNVAIGCGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
V +A GCG N G+F +G+ G G G +SL QL G FSYCL
Sbjct: 192 PNGEGAPPVAVSGLAFGCGDYNTGVFASNESGIAGFGRGPLSLPSQL---RVGRFSYCLT 248
Query: 306 SRGTGSSG--SLVF------GREALPVGA-AWVPLVRNPRAPSFYYVGLSGLGVGGMRIP 356
S S S VF G A G P++ +P P+FYY+ L G+ VG R+P
Sbjct: 249 SHDETESNKTSAVFLGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKTRLP 308
Query: 357 ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDT--C 414
+ +F L + G G V+D+GT VT P +E ++ FVAQ LPR S C
Sbjct: 309 VDSSVFALKKDGSGGTVIDSGTGVTTFPAAVFEQLKNEFVAQL-PLPRYDNTSEVGNLLC 367
Query: 415 YNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQ 473
+ G V VP + F+ + + LP N++ D+G C + + +IGN Q
Sbjct: 368 FQRPKGGKQVPVPKLIFHLASAD-MDLPRENYIPEDTDSGVMCLMINGAEVDMVLIGNFQ 426
Query: 474 QEGIQISFDGANGFVGFGPNVC 495
Q+ + I +D N + F C
Sbjct: 427 QQNMHIVYDVENSKLLFASAQC 448
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 132/430 (30%), Positives = 208/430 (48%), Gaps = 55/430 (12%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+ L HRD + S ++ HY R ++F +R + R A L+ R + GA + +
Sbjct: 30 FTTSLFHRDSLLSPLEFSSLSHYDRLANAF----RRSLSRSAALLNRAATSGAVGLQSSI 85
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
+G+PP + D+GSD+ W QC PC +CY+Q P+F
Sbjct: 86 -----------------------IGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIF 122
Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVV 257
+P S SFS V C++ C +++ C G C Y +YGD +Y+KG L E +TIG + V
Sbjct: 123 NPLKSTSFSHVPCNTQTCHAVDDGHCGVQGVCDYSYTYGDRTYSKGDLGFEKITIGSSSV 182
Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA---FSYCLVSRGTGSSGS 314
K+V IGCGH + G F A+G++GLGGG +SLV Q+ QT G FSYCL + + ++G
Sbjct: 183 KSV-IGCGHASSGGFGFASGVIGLGGGQLSLVSQM-SQTSGISRRFSYCLPTLLSHANGK 240
Query: 315 LVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
+ FG+ A+ G V PL+ + ++YY+ L + +G R + V
Sbjct: 241 INFGQNAVVSGPGVVSTPLI-SKNTVTYYYITLEAISIGNER--------HMAFAKQGNV 291
Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV----SIFDTCYN--LSGFVSVRVP 426
++D+GT ++ LP Y D V+ + +A V + +D C++ ++ S +P
Sbjct: 292 IIDSGTTLSFLPKELY----DGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIP 347
Query: 427 TVSFYFSGGP-VLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
++ FSGG V LP + F ++ A IIGN+ I +D
Sbjct: 348 IITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEA 407
Query: 486 GFVGFGPNVC 495
+ F P VC
Sbjct: 408 KRLSFKPTVC 417
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 449
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 145/447 (32%), Positives = 223/447 (49%), Gaps = 53/447 (11%)
Query: 76 EAR---WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGAD 132
EAR ++ L+HRD S + ++ R ++SFH + R R
Sbjct: 26 EARNAGFSANLIHRDSSVSPLYNPRDTYFDRLRNSFHRSISR--------ANRFKPNSI- 76
Query: 133 AAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYK 192
+A+ VQ +D+V G GEY +RI +G+P + D+GSD++WVQCQPC CYK
Sbjct: 77 SARALVQ---SDIVPG----GGEYLMRISIGNPQVEILAIADTGSDLIWVQCQPCEMCYK 129
Query: 193 QSDPVFDPADSASFSGVSCSSAVCDRL--ENAGCHA----GRCRYEVSYGDGSYTKGTLA 246
Q+ P+FDP S+S+ V C + C++L E C A C Y SYGD S++ G LA
Sbjct: 130 QNSPIFDPRRSSSYRNVLCGNEFCNKLDGEARSCDARGFVKTCGYTYSYGDQSFSDGHLA 189
Query: 247 LETLTIGRT---------VVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQT 296
+E IG T + VA GCG KN G F +G++GLGGGSMSLV QLG +
Sbjct: 190 IERFGIGSTNSNTSAAIAYFQEVAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLGPKL 249
Query: 297 GGAFSYCLV--SRGTGSSGSLVFGREALPVGAAW----VPLVRNPRAP-SFYYVGLSGLG 349
G FSYCLV S + + + FG + G+ + PL+ P+ P ++YY+ L +
Sbjct: 250 SGKFSYCLVPTSEQSNYTSKINFGNDINISGSNYNVVSTPLL--PKKPETYYYLTLEAIS 307
Query: 350 VGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS 409
V R+P + + G+ +++D+GT +T L + + D+ V + R S
Sbjct: 308 VENKRLPYTNLWNGEVEKGN--IIIDSGTTLTFLDSEFFNNL-DSAVEEAVKGERVSDPH 364
Query: 410 -IFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSI 468
+F+ C+ ++ +P ++ +F+G V P + F +D CF PS + ++I
Sbjct: 365 GLFNICFKDEK--AIELPIITAHFTGADVELQPVNTFAKVEED--LLCFTMIPS-NDIAI 419
Query: 469 IGNIQQEGIQISFDGANGFVGFGPNVC 495
GN+ Q + +D V F P C
Sbjct: 420 FGNLAQMNFLVGYDLEKKAVSFLPTDC 446
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 138/419 (32%), Positives = 207/419 (49%), Gaps = 36/419 (8%)
Query: 105 QHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGS 164
+HS R RD R+A L + A +V + ++ G+G Y + I +G+
Sbjct: 42 KHSEAVR--RDGHRLAFLSYAATAAAGKATTTGTNSSSVNVQAQLENGAGAYNMNISLGT 99
Query: 165 PPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD--PVFDPADSASFSGVSCSSAVCDRLENA 222
PP +++D+GS+++W QC PC++C+ + PV PA S++FS + C+ + C L +
Sbjct: 100 PPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNGSFCQYLPTS 159
Query: 223 G----CHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAG 277
C+A C Y +YG G YT G LA ETLT+G VA GC +N ++G
Sbjct: 160 SRPRTCNATAACAYNYTYGSG-YTAGYLATETLTVGDGTFPKVAFGCSTENG--VDNSSG 216
Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV---PLVR 333
++GLG G +SLV QL G FSYCL S G + ++FG A + V PL++
Sbjct: 217 IVGLGRGPLSLVSQLA---VGRFSYCLRSDMADGGASPILFGSLAKLTEGSVVQSTPLLK 273
Query: 334 NP--RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG-DDGVVMDTGTAVTRLPTPAYEA 390
NP + + YYV L+G+ V +P++ F TQ G G ++D+GT +T L Y
Sbjct: 274 NPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYAM 333
Query: 391 FRDAFVAQTGNL----PRASGVSIFDTCYNLS---GFVSVRVPTVSFYFSGGPVLTLPAS 443
+ AF +Q NL P + D CY S G +VRVP ++ F+GG +P
Sbjct: 334 VKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALRFAGGAKYNVPVQ 393
Query: 444 NFL--IPVDDAGTF---CFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
N+ + D G C P+ L SIIGN+ Q + + +D G F P C
Sbjct: 394 NYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMHLLYDIDGGMFSFAPADC 452
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 138/419 (32%), Positives = 207/419 (49%), Gaps = 36/419 (8%)
Query: 105 QHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGS 164
+HS R RD R+A L + A +V + ++ G+G Y + I +G+
Sbjct: 42 KHSEAVR--RDGHRLAFLSYAATAAAGKATTTGTNSSSVNVQAQLENGAGAYNMNISLGT 99
Query: 165 PPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD--PVFDPADSASFSGVSCSSAVCDRLENA 222
PP +++D+GS+++W QC PC++C+ + PV PA S++FS + C+ + C L +
Sbjct: 100 PPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNGSFCQYLPTS 159
Query: 223 G----CHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAG 277
C+A C Y +YG G YT G LA ETLT+G VA GC +N ++G
Sbjct: 160 SRPRTCNATAACAYNYTYGSG-YTAGYLATETLTVGDGTFPKVAFGCSTENG--VDNSSG 216
Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV---PLVR 333
++GLG G +SLV QL G FSYCL S G + ++FG A + V PL++
Sbjct: 217 IVGLGRGPLSLVSQLA---VGRFSYCLRSDMADGGASPILFGSLAKLTERSVVQSTPLLK 273
Query: 334 NP--RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG-DDGVVMDTGTAVTRLPTPAYEA 390
NP + + YYV L+G+ V +P++ F TQ G G ++D+GT +T L Y
Sbjct: 274 NPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYAM 333
Query: 391 FRDAFVAQTGNL----PRASGVSIFDTCYNLS---GFVSVRVPTVSFYFSGGPVLTLPAS 443
+ AF +Q NL P + D CY S G +VRVP ++ F+GG +P
Sbjct: 334 VKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALRFAGGAKYNVPVQ 393
Query: 444 NFL--IPVDDAGTF---CFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
N+ + D G C P+ L SIIGN+ Q + + +D G F P C
Sbjct: 394 NYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMHLLYDIDGGMFSFAPADC 452
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 137/424 (32%), Positives = 203/424 (47%), Gaps = 43/424 (10%)
Query: 106 HSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGS-------GEYFV 158
H+ HA R + L+RR++ + + G + MD GS EY V
Sbjct: 31 HATHADAGRGLS-TRELLRRMAARSKARSARLLS--GRAASARMDPGSYTDGVPDTEYLV 87
Query: 159 RIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
+ +G+PP+ +++D+GSD+ W QC PC C++QS P F+P+ S +FS + C +C
Sbjct: 88 HMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRD 147
Query: 219 LENAGCHA-----GRCRYEVSYGDGSYTKGTLALETLT-------IGRTVVKNVAIGCGH 266
L + C G C Y +Y D S T G L +T + IG V ++ GCG
Sbjct: 148 LTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGL 207
Query: 267 KNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF-------G 318
N G+FV G+ G G++S+ QL FSYC + TGS S VF
Sbjct: 208 FNNGIFVSNETGIAGFSRGALSMPAQLKVDN---FSYCFTAI-TGSEPSPVFLGVPPNLY 263
Query: 319 REALPVGAAWV---PLVR-NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
+A G V L+R + YY+ L G+ VG R+PI E +F L + G G ++
Sbjct: 264 SDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIV 323
Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSG 434
D+GT +T LP Y DAFVAQT S S+ C+++ VP + +F G
Sbjct: 324 DSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEG 383
Query: 435 GPVLTLPASNFLIPVDDAGTF---CFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFG 491
L LP N++ +++AG C A + LS+IGN QQ+ + + +D AN + F
Sbjct: 384 A-TLDLPRENYMFEIEEAGGIRLTCLAIN-AGEDLSVIGNFQQQNMHVLYDLANDMLSFV 441
Query: 492 PNVC 495
P C
Sbjct: 442 PARC 445
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 122/360 (33%), Positives = 177/360 (49%), Gaps = 29/360 (8%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+GEY +R +G+PP + D+GSD++WVQC PC+ C+ QS P+F P S++F +C
Sbjct: 87 NGEYLMRFYIGTPPVERLATADTGSDLIWVQCSPCASCFPQSTPLFQPLKSSTFMPTTCR 146
Query: 213 SAVCDRL--ENAGC-HAGRCRYEVSYGDG-SYTKGTLALETLT------IGRTVVKNVAI 262
S C L E GC +G C Y YGD S+++G L+ ETL + N
Sbjct: 147 SQPCTLLLPEQKGCGKSGECIYTYKYGDQYSFSEGLLSTETLRFDSQGGVQTVAFPNSFF 206
Query: 263 GCG-HKNQGMF--VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR 319
GCG + N +F G++GLG G +SLV Q+G Q G FSYCL+ G+ S+ L FG
Sbjct: 207 GCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQIGHKFSYCLLPLGSTSTSKLKFGN 266
Query: 320 EALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
E++ G V P++ P P++Y++ L + V +P T D V++D+G
Sbjct: 267 ESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQKTVP--------TGSTDGNVIIDSG 318
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGP 436
T +T L Y F + +S C+ FV P ++F F+G
Sbjct: 319 TLLTYLGESFYYNFAASLQESLAVELVQDVLSPLPFCFPYRDNFV---FPEIAFQFTGAR 375
Query: 437 VLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
V PA N + +D T C APS SG+SI G+ Q Q+ +D V F P C
Sbjct: 376 VSLKPA-NLFVMTEDRNTVCLMIAPSSVSGISIFGSFSQIDFQVEYDLEGKKVSFQPTDC 434
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 130/367 (35%), Positives = 193/367 (52%), Gaps = 36/367 (9%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSC 211
GEY + + +G+PP S + D+GSD++W QC PCS QC+ Q P+++PA S +F + C
Sbjct: 90 GEYLMTLSIGTPPLSYPAIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLPC 149
Query: 212 SSAVCDRLENAGCHAGR-------CRYEVSYGDGSYTKGTLALETLTIGRTV-----VKN 259
+S++ AG AG+ C Y +YG G +T G ET T G V
Sbjct: 150 NSSLS---MCAGVLAGKAPPPGCACMYNQTYGTG-WTAGVQGSETFTFGSAAADQARVPG 205
Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFG 318
+A GC + + + G+AGL+GLG GS+SLV QLG G FSYCL + T S+ +L+ G
Sbjct: 206 IAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQLG---AGRFSYCLTPFQDTNSTSTLLLG 262
Query: 319 REALP--VGAAWVPLVRNP-RAP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVV 373
A G P V +P +AP ++YY+ L+G+ +G + IS D F L G G++
Sbjct: 263 PSAALNGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADGTGGLI 322
Query: 374 MDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSV--RVPTVS 429
+D+GT +T L AY+ R A V LP G D CY L S +P+++
Sbjct: 323 IDSGTTITSLVNAAYQQVRAA-VQSLVTLPAIDGSDSTGLDLCYALPTPTSAPPAMPSMT 381
Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFV 488
+F G ++ LPA +++I +G +C A G +S GN QQ+ + I +D N +
Sbjct: 382 LHFDGADMV-LPADSYMI--SGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVRNEML 438
Query: 489 GFGPNVC 495
F P C
Sbjct: 439 SFAPAKC 445
>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 402
Score = 189 bits (479), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 101/282 (35%), Positives = 154/282 (54%), Gaps = 18/282 (6%)
Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
G A C Y ++YGDGS+T+G L E L G +VK+ GCG N+G+F G +GL+GLG
Sbjct: 127 GSAAPICNYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMGLG 186
Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG------REALPVGAAWVPLVRNPR 336
+SL+ Q G GG FSYCL S SGSL+ G R + P+ ++ ++ NP+
Sbjct: 187 RSDLSLISQTSGIFGGVFSYCLPSTERKGSGSLILGGNSSVYRNSSPI--SYAKMIENPQ 244
Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV 396
+FY++ L+G+ +GG+ + + +G +++D+GT +TRLP Y+A + F+
Sbjct: 245 LYNFYFINLTGISIGGVAL-------QAPSVGPSRILVDSGTVITRLPPTIYKALKAEFL 297
Query: 397 AQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN-FLIPVDDAGTF 455
Q P A SI DTC+NLS + V +PT+ +F G LT+ + F DA
Sbjct: 298 KQFTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQV 357
Query: 456 CFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C A A ++I+GN QQ+ +++ +D VGF C
Sbjct: 358 CLALASLEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETC 399
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 188 bits (478), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 121/356 (33%), Positives = 175/356 (49%), Gaps = 22/356 (6%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
G Y + + +G+PP Y + D+GSD+ W C PC++CYKQ +P+FDP S S+ +SC S
Sbjct: 23 GHYLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRNPIFDPQKSTSYRNISCDS 82
Query: 214 AVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTV-----VKNVAIGCGHK 267
+C +L+ C + C Y +Y + T+G LA ET+T+ T +K + GCGH
Sbjct: 83 KLCHKLDTGVCSPQKHCNYTYAYASAAITQGVLAQETITLSSTKGESVPLKGIVFGCGHN 142
Query: 268 NQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGA-FSYCLVSRGT----GSSGSLVFGREA 321
N G F G++GLGGG +S + Q+G GG FS CLV T S SL G E
Sbjct: 143 NTGGFNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVPFHTDVSVSSKMSLGKGSEV 202
Query: 322 LPVGAAWVPLV-RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
G PLV + + P Y+V L G+ VG + + + + G+ V +D+GT
Sbjct: 203 SGKGVVSTPLVAKQDKTP--YFVTLLGISVGNTYLHFNGSSSQSVEKGN--VFLDSGTPP 258
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
T LPT Y+ ++ P + + + CY ++R P ++ +F GG V
Sbjct: 259 TILPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQLCYRTKN--NLRGPVLTAHFEGGDVKL 316
Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
LP F+ P D G FC F + S + GN Q I FD V F P C
Sbjct: 317 LPTQTFVSPKD--GVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPMDC 370
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 188 bits (478), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 139/397 (35%), Positives = 207/397 (52%), Gaps = 43/397 (10%)
Query: 138 VQDF-GTD------VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC 190
+QDF G D +VSG GSG+YFV + VG+P + ++ID+GSD+ W+QC P +
Sbjct: 2 IQDFQGEDPALFSRLVSGSSIGSGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTT 61
Query: 191 YKQSD---PVFDPADSASFSGVSCSSAVCDRLE---NAGC---HAGRCRYEVSYGDGSYT 241
S P +D + S+S+ + C+ C L + C C Y Y D S T
Sbjct: 62 ANSSSPPAPWYDKSSSSSYREIPCTDDECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRT 121
Query: 242 KGTLALETLTIG--------------RTV-VKNVAIGCGHKNQGM-FVGAAGLLGLGGGS 285
G LA ET+++ RT+ +KNVA+GC ++ G F+GA+G+LGLG G
Sbjct: 122 TGILAYETISMKSRKRSGKRAGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGP 181
Query: 286 MSLVGQLGGQT-GGAFSYCLVS--RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYY 342
+SL Q GG FSYCLV RG+ +S LV GR A P+VRNP A SFYY
Sbjct: 182 ISLATQTRHTALGGIFSYCLVDYLRGSNASSFLVMGRTRW-RKLAHTPIVRNPAAQSFYY 240
Query: 343 VGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
V ++G+ V G + I+ + + G+ G + D+GT ++ L PAY A A
Sbjct: 241 VNVTGVAVDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASI-Y 299
Query: 402 LPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA 460
LPRA + F+ CYN++ + +P + F GG V+ LP +N+++ V + C A
Sbjct: 300 LPRAQEIPEGFELCYNVTR-MEKGMPKLGVEFQGGAVMELPWNNYMVLVAE-NVQCVALQ 357
Query: 461 P--SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ +G +I+GN+ Q+ I +D A +GF + C
Sbjct: 358 KVTTTNGSNILGNLLQQDHHIEYDLAKARIGFKWSPC 394
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 188 bits (478), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 137/424 (32%), Positives = 203/424 (47%), Gaps = 43/424 (10%)
Query: 106 HSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGS-------GEYFV 158
H+ HA R + L+RR++ + + G + MD GS EY V
Sbjct: 57 HATHADAGRGLS-TRELLRRMAARSKARSARLLS--GRAASARMDPGSYTDGVPDTEYLV 113
Query: 159 RIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
+ +G+PP+ +++D+GSD+ W QC PC C++QS P F+P+ S +FS + C +C
Sbjct: 114 HMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRD 173
Query: 219 LENAGCHA-----GRCRYEVSYGDGSYTKGTLALETLT-------IGRTVVKNVAIGCGH 266
L + C G C Y +Y D S T G L +T + IG V ++ GCG
Sbjct: 174 LTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGL 233
Query: 267 KNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF-------G 318
N G+FV G+ G G++S+ QL FSYC + TGS S VF
Sbjct: 234 FNNGIFVSNETGIAGFSRGALSMPAQLKVDN---FSYCFTAI-TGSEPSPVFLGVPPNLY 289
Query: 319 REALPVGAAWV---PLVR-NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
+A G V L+R + YY+ L G+ VG R+PI E +F L + G G ++
Sbjct: 290 SDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIV 349
Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSG 434
D+GT +T LP Y DAFVAQT S S+ C+++ VP + +F G
Sbjct: 350 DSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEG 409
Query: 435 GPVLTLPASNFLIPVDDAGTF---CFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFG 491
L LP N++ +++AG C A + LS+IGN QQ+ + + +D AN + F
Sbjct: 410 A-TLDLPRENYMFEIEEAGGIRLTCLAIN-AGEDLSVIGNFQQQNMHVLYDLANDMLSFV 467
Query: 492 PNVC 495
P C
Sbjct: 468 PARC 471
>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 188 bits (478), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 133/433 (30%), Positives = 204/433 (47%), Gaps = 55/433 (12%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
LEL RD ++ + +Q++ + +++ K V T + A + + +
Sbjct: 101 LELQIRD-LTRIQTLHKRVLEKNNQNTVSQKQKKNDKEVVT-----TTPVASSVEEQAGQ 154
Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
+ SGM GSGEYF+ + VGSPP+ +++D+GSD+ W+QC PC C++Q+D
Sbjct: 155 LVATLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQND----- 209
Query: 201 ADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV---- 256
C Y YGD S T G A+ET T+ T
Sbjct: 210 -------------------------NQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGS 244
Query: 257 -----VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG--T 309
V+N+ GCGH N+G+F GAAGLLGLG G +S QL G +FSYCLV R T
Sbjct: 245 SELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDT 304
Query: 310 GSSGSLVFGREALPVGAAWVPLV-----RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL 364
S L+FG + + + + +FYYV + + V G + I E+ + +
Sbjct: 305 NVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNI 364
Query: 365 TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFVSV 423
+ G G ++D+GT ++ PAYE ++ + G P I D C+N+SG +V
Sbjct: 365 SSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNV 424
Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFD 482
++P + F+ G V P N I +++ C A +P S SIIGN QQ+ I +D
Sbjct: 425 QLPELGIAFADGAVWNFPTENSFIWLNE-DLVCLAMLGTPKSAFSIIGNYQQQNFHILYD 483
Query: 483 GANGFVGFGPNVC 495
+G+ P C
Sbjct: 484 TKRSRLGYAPTKC 496
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 188 bits (478), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 134/354 (37%), Positives = 185/354 (52%), Gaps = 23/354 (6%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS---QCYKQSDPVFDPADSASFSGVSC 211
E+ V +G+G+P + ++ D+GSD+ WVQCQPC C+ Q DP+FDP+ S++++ V C
Sbjct: 148 EFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHC 207
Query: 212 SSAVCDRLENAG--CHAGR--CRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGH 266
C AG C C Y V YGDGS T G L+ +TL + + + GCG
Sbjct: 208 GEPQC---AAAGGLCSEDNTTCLYLVHYGDGSSTTGVLSRDTLALTSSRALAGFPFGCGT 264
Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG-REALPVG 325
+N G F GLLGLG G +SL Q G FSYCL S + ++G L G A G
Sbjct: 265 RNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNS-TTGYLTIGATPATDTG 323
Query: 326 AA-WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
AA + ++R P+ PSFY+V L + +GG +P+ +F G ++D+GT +T LP
Sbjct: 324 AAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFT-----RGGTLLDSGTVLTYLP 378
Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
AYE RD F A + D CY+ +G V VP VSF F G V L
Sbjct: 379 AQAYELLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVIVPAVSFRFGDGAVFELDFFG 438
Query: 445 FLIPVDDAGTFCFAFAPSPSG---LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+I +D+ C AFA +G LSIIGN QQ ++ +D A +GF P C
Sbjct: 439 VMIFLDE-NVGCLAFAAMDAGGLPLSIIGNTQQRSAEVIYDVAAEKIGFVPASC 491
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 188 bits (477), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 125/368 (33%), Positives = 183/368 (49%), Gaps = 33/368 (8%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
EY V + +G+PP+ +++D+GSD+ W QC PC C++QS P F+P+ S +FS + C
Sbjct: 110 EYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLR 169
Query: 215 VCDRLENAGCHA-----GRCRYEVSYGDGSYTKGTLALETLT-------IGRTVVKNVAI 262
+C L + C G C Y +Y D S T G L +T + IG V ++
Sbjct: 170 ICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTF 229
Query: 263 GCGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF---- 317
GCG N G+FV G+ G G++S+ QL FSYC + TGS S VF
Sbjct: 230 GCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDN---FSYCFTAI-TGSEPSPVFLGVP 285
Query: 318 ---GREALPVGAAWV---PLVR-NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
+A G V L+R + YY+ L G+ VG R+PI E +F L + G
Sbjct: 286 PNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTG 345
Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSF 430
G ++D+GT +T LP Y DAFVAQT S S+ C+++ VP +
Sbjct: 346 GTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVL 405
Query: 431 YFSGGPVLTLPASNFLIPVDDAGTF---CFAFAPSPSGLSIIGNIQQEGIQISFDGANGF 487
+F G L LP N++ +++AG C A + LS+IGN QQ+ + + +D AN
Sbjct: 406 HFEGA-TLDLPRENYMFEIEEAGGIRLTCLAIN-AGEDLSVIGNFQQQNMHVLYDLANDM 463
Query: 488 VGFGPNVC 495
+ F P C
Sbjct: 464 LSFVPARC 471
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 188 bits (477), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 133/432 (30%), Positives = 201/432 (46%), Gaps = 38/432 (8%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+++EL+H S T H+ R ++ M+ RV L S V
Sbjct: 26 FSVELIHPISSKSPFYNTAESHFQRMSNN----MKHSTNRVHYLNHVFSFPPNKVPNIVV 81
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
F G G Y + +G+PP Y V+D+ +D +W QC PC C+ + P+F
Sbjct: 82 SPF---------MGDG-YIISFLIGTPPFQLYGVMDTANDNIWFQCNPCKPCFNTTSPMF 131
Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHAGR---CRYEVSYGDGSYTKGTLALETLTIGRT 255
DP+ S+++ + CSS C +EN C + C Y +YG +Y++G L+++TLT+
Sbjct: 132 DPSKSSTYKTIPCSSPKCKNVENTHCSSDDKKVCEYSFTYGGEAYSQGDLSIDTLTLNSN 191
Query: 256 -----VVKNVAIGCGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--R 307
KN+ IGCGH+N+G G +G +GLG G +S + QL GG FSYCLV
Sbjct: 192 NDTPISFKNIVIGCGHRNKGPLEGYVSGNIGLGRGPLSFISQLNSSIGGKFSYCLVPLFS 251
Query: 308 GTGSSGSLVFGREALP--VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
G SG L FG +++ VG P+ Y L+ L VG I +
Sbjct: 252 NEGISGKLHFGDKSVVSGVGTVSTPITAGEIG---YSTTLNALSVGDHIIKFENSTSKND 308
Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA-SGVSIFDTCYNLSGFVSVR 424
+G+ ++D+GT +T LP Y ++ V L RA S F CY + ++
Sbjct: 309 NLGN--TIIDSGTTLTILPENVYSRL-ESIVTSMVKLERAKSPNQQFKLCYKAT-LKNLD 364
Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL-SIIGNIQQEGIQISFDG 483
VP ++ +F+G V L + N P+D CFAF + +IIGNI Q+ + FD
Sbjct: 365 VPIITAHFNGADV-HLNSLNTFYPIDHE-VVCFAFVSVGNFPGTIIGNIAQQNFLVGFDL 422
Query: 484 ANGFVGFGPNVC 495
+ F P C
Sbjct: 423 QKNIISFKPTDC 434
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 188 bits (477), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 139/419 (33%), Positives = 208/419 (49%), Gaps = 44/419 (10%)
Query: 84 VHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGT 143
R ++ + T ++ R H H +R++ L RL + +A+ +Q
Sbjct: 26 ARRSFRATMTRTEPAINLTRAAHKSH-------QRLSMLAARLDDAASGSAQTPLQ---- 74
Query: 144 DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADS 203
+D G G Y + +G+PP+ + D+GSD++W +C C++C Q P + P S
Sbjct: 75 -----LDSGGGAYDMTFSIGTPPQELSALADTGSDLIWAKCGACTRCVPQGSPSYYPNKS 129
Query: 204 ASFSGVSCSSAVCDRLENAGCHAG--RCRYEVSYGDGS----YTKGTLALETLTIGRTVV 257
+SFS + CS ++C L ++ C AG C Y+ SYG S YT+G L ET T+G V
Sbjct: 130 SSFSKLPCSGSLCSDLPSSQCSAGGAECDYKYSYGLASDPHHYTQGYLGSETFTLGSDAV 189
Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF 317
+ GC ++G + +GL+GLG G +SLV QL GAFSYCL S +S L+F
Sbjct: 190 PGIGFGCTTMSEGGYGSGSGLVGLGRGPLSLVSQL---NVGAFSYCLTSDAAKTS-PLLF 245
Query: 318 GREALP-VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
G AL G PL+R + +Y V L + +G G G++ D+
Sbjct: 246 GSGALTGAGVQSTPLLRT--STYYYTVNLESISIGAATT---------AGTGSSGIIFDS 294
Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGP 436
GT V L PAY ++A ++QT NL ASG ++ C+ SG V P++ +F GG
Sbjct: 295 GTTVAFLAEPAYTLAKEAVLSQTTNLTMASGRDGYEVCFQTSGAV---FPSMVLHFDGGD 351
Query: 437 VLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ LP N+ VDD+ + C+ SPS LSI+GNI Q I +D + F P C
Sbjct: 352 -MDLPTENYFGAVDDSVS-CWIVQKSPS-LSIVGNIMQMNYHIRYDVEKSMLSFQPANC 407
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 187 bits (476), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 139/435 (31%), Positives = 200/435 (45%), Gaps = 51/435 (11%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKH-- 136
+++E++HRD S F+ + +RV VRR + A H
Sbjct: 27 FSVEIIHRDSSRSP---------------FYRATETQFQRVTNAVRR----SMNRANHFN 67
Query: 137 EVQDFGTDVVSGMDQ-GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD 195
++ + V S + G+Y + +G+PP Y ++D+ SDI+WVQCQ C CY +
Sbjct: 68 QISVYSNAVESPVTLLDDGDYLMSYSLGTPPFPVYGIVDTASDIIWVQCQLCETCYNDTS 127
Query: 196 PVFDPADSASFSGVSCSSAVCDRLENAGCHAGR---CRYEVSYGDGSYTKGTLALETLTI 252
P+FDP+ S ++ + CSS C ++ C + C + V+Y DGS+++G L +ET+T+
Sbjct: 128 PMFDPSYSKTYKNLPCSSTTCKSVQGTSCSSDERKICEHTVNYKDGSHSQGDLIVETVTL 187
Query: 253 G----------RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
G RTV IGC +N + + G++GLGGG +SLV QL FSY
Sbjct: 188 GSYNDPFVHFPRTV-----IGC-IRNTNVSFDSIGIVGLGGGPVSLVPQLSSSISKKFSY 241
Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPL-VRNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
CL SS L FG A+ G V + FYY+ L VG RI
Sbjct: 242 CLAPISDRSS-KLKFGDAAMVSGDGTVSTRIVFKDWKKFYYLTLEAFSVGNNRIEFRSS- 299
Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG-VSIFDTCYNLSGF 420
G +++D+GT T LP Y A VA L RA + F CY S +
Sbjct: 300 -SSRSSGKGNIIIDSGTTFTVLPDDVYSKLESA-VADVVKLERAEDPLKQFSLCYK-STY 356
Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQIS 480
V VP ++ +FSG V L A N I V C AF S SG +I GN+ Q+ +
Sbjct: 357 DKVDVPVITAHFSGADV-KLNALNTFI-VASHRVVCLAFLSSQSG-AIFGNLAQQNFLVG 413
Query: 481 FDGANGFVGFGPNVC 495
+D V F P C
Sbjct: 414 YDLQRKIVSFKPTDC 428
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 187 bits (476), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 137/454 (30%), Positives = 209/454 (46%), Gaps = 52/454 (11%)
Query: 64 HNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLV 123
H+ S + S + +++ L+HR+ S + R +++ R +R+
Sbjct: 14 HSIASFAEASKTLSGFSINLIHRESPLSPFYNPSLTPSERIKNTVLRSFARSKRRL---- 69
Query: 124 RRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQ 183
RLS + D ++ D+ EY +R +G+PP ++ + D+GSD++WVQ
Sbjct: 70 -RLS---------QNDDRSPGTITIPDEPITEYLMRFYIGTPPVERFAIADTGSDLIWVQ 119
Query: 184 CQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA--GC--HAGRCRYEVSYGDGS 239
C PC +C Q+ P+FDP S++F V C S C L + C +G+C Y+ YGD +
Sbjct: 120 CAPCEKCVPQNAPLFDPRKSSTFKTVPCDSQPCTLLPPSQRACVGKSGQCYYQYIYGDHT 179
Query: 240 YTKGTLALETLTIGRTVVKNVAI-------GCGHKNQGMFVGAA---GLLGLGGGSMSLV 289
G L E++ G KN AI GC N + GL+GLG G +SL+
Sbjct: 180 LVSGILGFESINFGS---KNNAIKFPKLTFGCTFSNNDTVDESKRNMGLVGLGVGPLSLI 236
Query: 290 GQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP---VGAAWVPLVRNPRAPSFYYVGLS 346
QLG Q G FSYC + S+ + FG +A+ G PL+ PS+YY+ L
Sbjct: 237 SQLGYQIGRKFSYCFPPLSSNSTSKMRFGNDAIVKQIKGVVSTPLIIKSIGPSYYYLNLE 296
Query: 347 GLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
G+ +G ++ SE D +++D+GT+ T L ++F + FVA +
Sbjct: 297 GVSIGNKKVKTSE------SQTDGNILIDSGTSFTILK----QSFYNKFVALVKEVYGVE 346
Query: 407 GVSI----FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP- 461
V I ++ C+ G R P V F F+G V + ASN L +D C P
Sbjct: 347 AVKIPPLVYNFCFENKG-KRKRFPDVVFLFTGAKV-RVDASN-LFEAEDNNLLCMVALPT 403
Query: 462 SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
S SI GN Q G Q+ +D G V F P C
Sbjct: 404 SDEDDSIFGNHAQIGYQVEYDLQGGMVSFAPADC 437
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 136/432 (31%), Positives = 210/432 (48%), Gaps = 40/432 (9%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
++++L+HRD S + R +FH R RV R S +D +
Sbjct: 32 FSVDLIHRDSPHSPFFDPSKTRTERLTDAFH----RSASRVGRF--RQSAMTSDGIQ--- 82
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
S + +GEY + + +G+PP ++D+GSD+ W QC+PC+ CYKQ P F
Sbjct: 83 --------SRLVPSAGEYIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPFF 134
Query: 199 DPADSASFSGVSCSSAVCDRLEN-AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGRTV 256
DP +S+++ SC ++ C L N C G +C + SY DGS+T G LA+ETLT+ T
Sbjct: 135 DPKNSSTYRDSSCGTSFCLALGNDRSCRNGKKCTFMYSYADGSFTGGNLAVETLTVASTA 194
Query: 257 VKNV-----AIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG 310
K V A GC H++ G+F ++G++GLG +S++ QL G FSYCL+ T
Sbjct: 195 GKPVSFPGFAFGCVHRSGGIFDEHSSGIVGLGVAELSMISQLKSTINGRFSYCLLPVFTD 254
Query: 311 SSGS--LVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
SS S + FGR + GA V PLV +Y + L G VG R+ + + +
Sbjct: 255 SSMSSRINFGRSGIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSY-KGFSKKAE 313
Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA---SGVSIFDTCYNLSGFVSV 423
+ + +++D+GT T LP Y ++ VA + R +G+S CYN + +
Sbjct: 314 VEEGNIIVDSGTTYTYLPLEFYVKLEES-VAHSIKGKRVRDPNGIS--SLCYNTT-VDQI 369
Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDG 483
P ++ +F V P + FL +D CF P+ S + I+GN+ Q + FD
Sbjct: 370 DAPIITAHFKDANVELQPWNTFLRMQEDL--VCFTVLPT-SDIGILGNLAQVNFLVGFDL 426
Query: 484 ANGFVGFGPNVC 495
V F C
Sbjct: 427 RKKRVSFKAADC 438
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 130/367 (35%), Positives = 186/367 (50%), Gaps = 30/367 (8%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSC 211
+ Y V +G+PP + V+D+GSD++W QC PC +C+ Q P++ PA S +++ VSC
Sbjct: 97 TATYLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVTYANVSC 156
Query: 212 SSAVCDRLEN-------------AGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVV 257
S +CD L + G C Y SYGDGS T G LA ET T G T V
Sbjct: 157 GSRLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTFGAGTTV 216
Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLV 316
++A GCG N G ++GL+G+G G +SLV QLG FSYC T +S L
Sbjct: 217 HDLAFGCGTDNLGGTDNSSGLVGMGRGPLSLVSQLGVTK---FSYCFTPFNDTTTSSPLF 273
Query: 317 FGREA-LPVGAAWVPLVRNPRAP---SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
G A L A P V +P P S+YY+ L G+ VG +PI +FRLT G G+
Sbjct: 274 LGSSASLSPAAKSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPIDPAVFRLTASGRGGL 333
Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLS---GFVSVRVPTV 428
++D+GT T L A+ A A+ LP ASG + C+ G +V VP +
Sbjct: 334 IIDSGTTFTALEERAFVVLARAVAARV-ALPLASGAHLGLSVCFAAPQGRGPEAVDVPRL 392
Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
+F G + LP S+ ++ AG C S G+S++G++QQ+ + + +D +
Sbjct: 393 VLHFDGAD-MELPRSSAVVEDRVAGVACLGIV-SARGMSVLGSMQQQNMHVRYDVGRDVL 450
Query: 489 GFGPNVC 495
F P C
Sbjct: 451 SFEPANC 457
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 125/383 (32%), Positives = 179/383 (46%), Gaps = 42/383 (10%)
Query: 145 VVSGMDQGSG----EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQ-SDPVFD 199
V +G+ G G EY + + VG+PPR + +D+GSD+VW QC PC C++Q + PV D
Sbjct: 75 VRAGLGAGGGIVTNEYLMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAAPVLD 134
Query: 200 PADSASFSGVSCSSAVCDRLENAGCHAGR------CRYEVSYGDGSYTKGTLALETLTI- 252
PA S++ + + C + +C L C GR C Y YGD S T G LA ++ T
Sbjct: 135 PAASSTHAALPCDAPLCRALPFTSC-GGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFG 193
Query: 253 -----GRTVVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS 306
G + V GCGH N+G+F G+ G G G SL QL + FSYC S
Sbjct: 194 GDDNAGGLAARRVTFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNVTS---FSYCFTS 250
Query: 307 R-GTGSSGSLVFGREALPV----------GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
T SS + G A + L++NP PS Y+V L G+ VGG R+
Sbjct: 251 MFDTKSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARV 310
Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCY 415
+ E R ++D+G ++T LP YEA + FV+Q G A+G + D C+
Sbjct: 311 AVPESRLR------SSTIIDSGASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAALDLCF 364
Query: 416 NLSGFVSVR---VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNI 472
L R VP ++ + GG LP N++ A C + +IGN
Sbjct: 365 ALPVAALWRRPAVPALTLHLDGGADWELPRGNYVFEDYAARVLCVVLDAAAGEQVVIGNY 424
Query: 473 QQEGIQISFDGANGFVGFGPNVC 495
QQ+ + +D N + F P C
Sbjct: 425 QQQNTHVVYDLENDVLSFAPARC 447
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 137/396 (34%), Positives = 210/396 (53%), Gaps = 31/396 (7%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
+R R ATL+ L+ + S + SGE+ + I +G+PP +
Sbjct: 57 FRRSFSRSATLLTHLTSVSTACIR-----------SPIIPDSGEFLMSIFIGTPPVNVIA 105
Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC--HAGRC 229
+ D+GSD+ W QC PC +C+ QS P+F+P S+S+ VSC+S C LE+ C C
Sbjct: 106 IADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSC 165
Query: 230 RYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAA-GLLGLGGGSMSL 288
Y SYGD S+T G LA + +TIG + IGCGH+N G F G G++GLGGGS+SL
Sbjct: 166 SYGYSYGDRSFTYGDLASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSL 225
Query: 289 VGQLGGQTG--GAFSYCLVSRGTGS--SGSLVFGREALPVGAAWV--PLVRNPRAP-SFY 341
V Q+ G FSYCL + + + +G++ FGR+A+ G V PLV PR+P +FY
Sbjct: 226 VSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLV--PRSPDTFY 283
Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRD--AFVAQT 399
++ L + VG R + + +T G+ +++D+GT +T LP Y A V +
Sbjct: 284 FLTLEAISVGKKRFKAANGISAMTNHGN--IIIDSGTTLTLLPRSLYYGVFSTLARVIKA 341
Query: 400 GNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF 459
+ SG I + CY+ + +P ++ +F+GG + L N PV D T C F
Sbjct: 342 KRVDDPSG--ILELCYSAGQVDDLNIPIITAHFAGGADVKLLPVNTFAPVADNVT-CLTF 398
Query: 460 APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
AP+ + ++I GN+ Q ++ +D N + F P +C
Sbjct: 399 APA-TQVAIFGNLAQINFEVGYDLGNKRLSFEPKLC 433
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 127/365 (34%), Positives = 191/365 (52%), Gaps = 30/365 (8%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCS 212
GEY + + +G+PP V D+GSD++W QC PC +QC++Q P+++PA S +FS + C+
Sbjct: 112 GEYLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCN 171
Query: 213 SAV--CDRLENAGCHAGRC--RYEVSYGDGSYTKGTLALETLTIGRTV-----VKNVAIG 263
S++ C C Y +YG G +T G ET T G + V VA G
Sbjct: 172 SSLSMCAGALAGAAPPPGCACMYYQTYGTG-WTAGVQGSETFTFGSSAADQARVPGVAFG 230
Query: 264 CGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREAL 322
C + + + G+AGL+GLG GS+SLV QLG G FSYCL + T S+ +L+ G A
Sbjct: 231 CSNASSSDWNGSAGLVGLGRGSLSLVSQLGA---GRFSYCLTPFQDTNSTSTLLLGPSAA 287
Query: 323 P--VGAAWVPLVRNP-RAP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
G P V +P RAP ++YY+ L+G+ +G +PIS F L G G+++D+G
Sbjct: 288 LNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSG 347
Query: 378 TAVTRLPTPAYEAFRDAFVAQ-TGNLPRASGVSI--FDTCYNLSGFVSVR---VPTVSFY 431
T +T L AY+ R A +Q LP G D C+ L S +P+++ +
Sbjct: 348 TTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLH 407
Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGF 490
F G ++ LPA +++I +G +C A G +S GN QQ+ + I +D + F
Sbjct: 408 FDGADMV-LPADSYMI--SGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVREETLSF 464
Query: 491 GPNVC 495
P C
Sbjct: 465 APAKC 469
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 127/433 (29%), Positives = 194/433 (44%), Gaps = 39/433 (9%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+++EL+H D S R + + +KR L S D K +
Sbjct: 27 FSVELIHPDSSRSPFYNIRETQLQRISNV----VTHSIKRAHYLNHVFSLSHNDLPKPTI 82
Query: 139 QDFGTDVVSGMDQGSGEYFV-RIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
+ +G Y+V +G+PP Y V+D+GSD +W QC+PC C Q+ P+
Sbjct: 83 IPY-----------AGSYYVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKPCLNQTSPI 131
Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAGR---CRYEVSYGDGSYTKGTLALETLTIGR 254
F+P+ S+++ + CSS +C R E C + R C YE++Y D S ++G ++ +TLT+
Sbjct: 132 FNPSKSSTYKNIRCSSPICKRGEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNS 191
Query: 255 T-----VVKNVAIGCGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG 308
+ IGCGHKN G A+G++G G G+ S+V QLG GG FSYCL S
Sbjct: 192 NDGSPISFPKIVIGCGHKNSLTTEGLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLASLF 251
Query: 309 TGS--SGSLVFGREALPVGAAWVPLVRNPRAPSF----YYVGLSGLGVGGMRIPISEDLF 362
+ + S L FG A+ G +V P SF Y+ L VG I + +
Sbjct: 252 SKANISSKLYFGDMAVVSGHG---VVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDS-- 306
Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS 422
L + V+D+G+ +T+LP Y A ++ CY +
Sbjct: 307 SLIPDNEGNAVIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYKTT-LKK 365
Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
VP ++ +F G V L A N I ++ CFAF S + GNI Q+ + +D
Sbjct: 366 YEVPIITAHFRGADV-KLNAFNTFIQMNHE-VMCFAFNSSAFPWVVYGNIAQQNFLVGYD 423
Query: 483 GANGFVGFGPNVC 495
+ F P C
Sbjct: 424 TLKNIISFKPTNC 436
>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 126/347 (36%), Positives = 176/347 (50%), Gaps = 16/347 (4%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
Y VR +G+PP+ + +D+ +D W+ C C+ C + F+PA S S+ V C S
Sbjct: 108 YVVRARLGTPPQQLLLAVDTSNDAAWIPCSGCAGCPTTTP--FNPAASKSYRAVPCGSPA 165
Query: 216 CDRLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFV 273
C R N C + C + ++Y D S + L+ ++L + VVK+ GC K G
Sbjct: 166 CSRAPNPSCSLNTKSCGFSLTYADSSL-EAALSQDSLAVANDVVKSYTFGCLQKATGTAT 224
Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLV 332
GLLGLG G +S + Q G FSYCL S + SG+L GR+ P+ PL+
Sbjct: 225 PPQGLLGLGRGPLSFLSQTKDMYEGTFSYCLPSFKSLNFSGTLRLGRKGQPLRIKTTPLL 284
Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
NP S YYV ++G+ VG +PI G V+D+GT TRL PAY A R
Sbjct: 285 VNPHRSSLYYVSMTGIRVGKKVVPIPPAALAFDPATGAGTVLDSGTMFTRLVAPAYVAVR 344
Query: 393 DAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA 452
D + P +S + FDTCYN +V+ P V+F F+G V TLPA N +I
Sbjct: 345 DEVRRRIRGAPLSS-LGGFDTCYN----TTVKWPPVTFMFTGMQV-TLPADNLVIHSTYG 398
Query: 453 GTFCFAFAPSPSG----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
T C A A +P G L++I ++QQ+ +I FD NG VGF C
Sbjct: 399 TTSCLAMAAAPDGVNTVLNVIASMQQQNHRILFDVPNGRVGFAREQC 445
>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
Length = 418
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 138/411 (33%), Positives = 202/411 (49%), Gaps = 40/411 (9%)
Query: 98 NMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYF 157
M H +F R +R++ L RL A +A+ +Q MD G G Y
Sbjct: 32 TMTRHEPTINFTRAAHRSRERLSILATRLGAASAGSAQSPLQ---------MDSGGGAYD 82
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCD 217
+ +G+PP++ + D+GSD++W +C C +C + + P S+SFS + CSSA+C
Sbjct: 83 MTFSMGTPPQTLSALADTGSDLIWAKCGACKRCAPRGSASYYPTKSSSFSKLPCSSALCR 142
Query: 218 RLEN---AGCHAGR-----CRYEVSYGDGS----YTKGTLALETLTIGRTVVKNVAIGCG 265
LE+ A C R C Y SYG S YT+G + ET T+G V+ + GC
Sbjct: 143 TLESQSLATCGGTRARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTLGSDAVQGIGFGCT 202
Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP-V 324
++G + +GL+GLG G +SLV QL GAFSYCL S + SS L+FG AL
Sbjct: 203 TMSEGGYGSGSGLVGLGRGKLSLVRQL---KVGAFSYCLTSDPSTSS-PLLFGAGALTGP 258
Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
G PLV N + +FY V L + +G + P G G++ D+GT +T L
Sbjct: 259 GVQSTPLV-NLKTSTFYTVNLDSISIGAAKTP---------GTGRHGIIFDSGTTLTFLA 308
Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
PAY ++QT NL R G ++ C+ SG P++ +F GG + L N
Sbjct: 309 EPAYTLAEAGLLSQTTNLTRVPGTDGYEVCFQTSG--GAVFPSMVLHFDGGD-MALKTEN 365
Query: 445 FLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ V+D+ + C+ SPS +SI+GNI Q I +D + F P C
Sbjct: 366 YFGAVNDSVS-CWLVQKSPSEMSIVGNIMQMDYHIRYDLDKSVLSFQPTNC 415
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 185 bits (470), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 133/411 (32%), Positives = 194/411 (47%), Gaps = 39/411 (9%)
Query: 112 MQRDVKRVATL--VR---RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPP 166
M+R R A L VR R SG K+E Q V+ G EY V + +G+PP
Sbjct: 54 MRRSKARAAALSAVRNRARFSG------KNE-QQTPAGVLPVRPSGDLEYVVDLAIGTPP 106
Query: 167 RSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH- 225
+ ++D+GSD++W QC PC+ C Q DP+F P SAS+ + C+ +C + + C
Sbjct: 107 QPVSALLDTGSDLIWTQCAPCASCLSQPDPLFAPGQSASYEPMRCAGTLCSDILHHSCER 166
Query: 226 AGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKN-------VAIGCGHKNQGMFVGAAGL 278
C Y +YGDG+ T G A E T + + GCG N G +G+
Sbjct: 167 PDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLGFGCGSVNVGSLNNGSGI 226
Query: 279 LGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV------GAAWVPLV 332
+G G +SLV QL + FSYCL S + +L+FG + V PL+
Sbjct: 227 VGFGRNPLSLVSQLSIRR---FSYCLTSYASRRQSTLLFGSLSDGVYGDATGRVQTTPLL 283
Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
++P+ P+FYYV +GL VG R+ I E F L G GV++D+GTA+T LP
Sbjct: 284 QSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPAAVLAEVV 343
Query: 393 DAFVAQTGNLPRASGVSIFD-TCYNL-------SGFVSVRVPTVSFYFSGGPVLTLPASN 444
AF Q LP A+G + D C+ + S + VP + +F G L LP N
Sbjct: 344 RAFRQQL-RLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVPRMVLHFQGAD-LDLPRRN 401
Query: 445 FLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+++ G C A S S IGN+ Q+ +++ +D + P C
Sbjct: 402 YVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSIAPARC 452
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 185 bits (469), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 130/351 (37%), Positives = 182/351 (51%), Gaps = 17/351 (4%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS---QCYKQSDPVFDPADSASFSGVSC 211
E+ V +G+G+P + ++ D+GSD+ WVQCQPC C+ Q DP+FDP+ S++++ V C
Sbjct: 143 EFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHC 202
Query: 212 SSAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQ 269
C + C Y V YGDGS T G L+ +TL + + + GCG +N
Sbjct: 203 GEPQCAAAGDLCSEDNTTCLYLVRYGDGSSTTGVLSRDTLALTSSRALTGFPFGCGTRNL 262
Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG-REALPVGAA- 327
G F GLLGLG G +SL Q G FSYCL S + ++G L G A GAA
Sbjct: 263 GDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNS-TTGYLTIGATPATDTGAAQ 321
Query: 328 WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPA 387
+ ++R P+ PSFY+V L + +GG +P+ +F G ++D+GT +T LP A
Sbjct: 322 YTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFT-----RGGTLLDSGTVLTYLPAQA 376
Query: 388 YEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
Y RD F A + D CY+ +G V VP VSF F G V L +I
Sbjct: 377 YALLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVVVPAVSFRFGDGAVFELDFFGVMI 436
Query: 448 PVDDAGTFCFAFAPSPSG---LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+D+ C AFA +G LSIIGN QQ ++ +D A +GF P C
Sbjct: 437 FLDE-NVGCLAFAAMDTGGLPLSIIGNTQQRSAEVIYDVAAEKIGFVPASC 486
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 185 bits (469), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 134/424 (31%), Positives = 193/424 (45%), Gaps = 53/424 (12%)
Query: 78 RWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHE 137
R++++L+HRD S + + R+ R +R +S A + +
Sbjct: 34 RFSIDLIHRDSPKSP--------LYNPSETPAERLDRFFRRF------MSFSEASISPNT 79
Query: 138 VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
+ + +GEY ++I +G+PP Y + D+GSD++W QC PC CYKQ +P+
Sbjct: 80 PE-------PPVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPM 132
Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGR- 254
FDP+ S SF VSC S C L+ C + C + YGDGS +G +A ETLT+
Sbjct: 133 FDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSN 192
Query: 255 ----TVVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSR 307
T + N+ GCGH N G F GL G GG +SL Q+ +G FS CLV
Sbjct: 193 SGQPTSILNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPF 252
Query: 308 GTGSS--GSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
T S ++FG EA G+ V PLV P++Y+V L G+ VG P S
Sbjct: 253 RTDPSITSKIIFGPEAEVSGSDVVSTPLVTK-DDPTYYFVTLDGISVGDKLFPFSSS--- 308
Query: 364 LTQMGDDG-VVMDTGTAVTRLPTPAY----EAFRDAFVAQTGNLPRASGVSIFDTCYNLS 418
+ M G V +D GT T LP Y + ++A + P CY +
Sbjct: 309 -SPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQP----QLCYRSA 363
Query: 419 GFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQ 478
+ P ++ +F G V P + F+ P + G +CFA P I GN Q
Sbjct: 364 TLID--GPILTAHFDGADVQLKPLNTFISPKE--GVYCFAMQPIDGDTGIFGNFVQMNFL 419
Query: 479 ISFD 482
I FD
Sbjct: 420 IGFD 423
>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
Length = 393
Score = 185 bits (469), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 125/385 (32%), Positives = 180/385 (46%), Gaps = 78/385 (20%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVS--GMDQGSGEYFVRIGVGSPPRSQ 169
++RD R + R+ SG AA + Q V + G + EY + +G+GSP +Q
Sbjct: 60 LRRDQLRADYIRRKFSGSNGTAAGEDGQSSKVSVPTTLGSSLDTLEYVISVGLGSPAVTQ 119
Query: 170 YMVIDSGSDIVWVQCQPC---SQCYKQSDPVFDPADSASFSGVSCSSAVCDRL----ENA 222
+VID+GSD+ WVQC+PC S C+ + +FDPA S++++ +CS+A C +L E
Sbjct: 120 RVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCSAAACAQLGDSGEAN 179
Query: 223 GCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKN--QGMFVGAAGLL 279
GC A RC+Y V YGDGS T GT GC H GM GL+
Sbjct: 180 GCDAKSRCQYIVKYGDGSNTTGT--------------GFQFGCSHAELGAGMDDKTDGLI 225
Query: 280 GLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPS 339
GLGG + SLV Q R+ + P+
Sbjct: 226 GLGGDAQSLVSQTA--------------------------------------ARSKKVPT 247
Query: 340 FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQT 399
+Y+ L + VGG ++ +S +F G ++D+GT +TRLP AY A AF A
Sbjct: 248 YYFAALEDIAVGGKKLGLSPSVFAA------GSLVDSGTVITRLPPAAYAALSSAFRAGM 301
Query: 400 GNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF 459
RA + I DTC+N +G V +PTV+ F+GG V+ L A + C AF
Sbjct: 302 TRYARAEPLGILDTCFNFTGLDKVSIPTVALVFAGGAVVDLDAHGIV------SGGCLAF 355
Query: 460 APS--PSGLSIIGNIQQEGIQISFD 482
AP+ IGN+QQ ++ +D
Sbjct: 356 APTRDDKAFGTIGNVQQRTFEVLYD 380
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 184 bits (468), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 121/360 (33%), Positives = 179/360 (49%), Gaps = 34/360 (9%)
Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG 208
++ G G Y + I VG+P + +V D+GSD++W QC PC++C++Q P F PA S++FS
Sbjct: 79 LENGVGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSK 138
Query: 209 VSCSSAVCDRLENA--GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH 266
+ C+S+ C L N+ C+A C Y YG G YT G LA ETL +G +VA GC
Sbjct: 139 LPCTSSFCQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATETLKVGDASFPSVAFGCST 197
Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-LPVG 325
+N GLG + + G FSYCL S + ++FG A L G
Sbjct: 198 EN-----------GLGQLDLGV---------GRFSYCLRSGSAAGASPILFGSLANLTDG 237
Query: 326 AAW-VPLVRNPRA-PSFYYVGLSGLGVGGMRIPISEDLFRLTQMG-DDGVVMDTGTAVTR 382
P V NP PS+YYV L+G+ VG +P++ F TQ G G ++D+GT +T
Sbjct: 238 NVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTY 297
Query: 383 LPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYN--LSGFVSVRVPTVSFYFSGGPVLTL 440
L YE + AF++QT ++ +G D C+ G + VP++ F GG +
Sbjct: 298 LAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFDGGAEYAV 357
Query: 441 PASNFLIPVDDAGTF---CFAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
P + D G+ C P+ +S+IGN+ Q + + +D G F P C
Sbjct: 358 PTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFAPADC 417
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 184 bits (467), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 126/405 (31%), Positives = 190/405 (46%), Gaps = 26/405 (6%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
+QR R A L GG A+ + Q+ + G EY V + VG+PP+
Sbjct: 60 VQRSKARAAALSVARLGGSNKGARQQDQNQQQPGLPVRPSGDLEYLVDLAVGTPPQPVSA 119
Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH-AGRCR 230
++D+GSD++W QC PC+ C Q DP+F P S+S+ + C+ +C+ + + C C
Sbjct: 120 LLDTGSDLIWTQCAPCASCLPQPDPIFSPGASSSYEPMRCAGELCNDILHHSCQRPDTCT 179
Query: 231 YEVSYGDGSYTKGTLALETLTIGR--------TVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
Y SYGDG+ T+G A E T + + GCG N+G +G++G G
Sbjct: 180 YRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPLGFGCGTMNKGSLNNGSGIVGFG 239
Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR------EALPVGAAWVPLVRNPR 336
+SLV QL + FSYCL +G +L+FG +A L+R+ +
Sbjct: 240 RAPLSLVSQLAIRR---FSYCLTPYASGRKSTLLFGSLRGGVYDAATATVQTTRLLRSRQ 296
Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV 396
P+FYYV +G+ VG R+ I F L G G ++D+GTA+T P P AF
Sbjct: 297 NPTFYYVPFTGVTVGARRLRIPISAFALRPDGSGGAIVDSGTALTLFPAPVLAEVVRAFR 356
Query: 397 AQTGNLP-RASGVSIFD--TCYNLSGFVSVR---VPTVSFYFSGGPVLTLPASNFLIPVD 450
+Q LP A+G S D C+ + R VP + F+ G L LP N+++
Sbjct: 357 SQL-RLPFAANGSSGPDDGVCFAAAASRVPRPAVVPRMVFHLQGAD-LDLPRRNYVLDDQ 414
Query: 451 DAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
G C A S + IGN Q+ +++ +D + F P C
Sbjct: 415 RKGNLCLLLADSGDSGTTIGNFVQQDMRVLYDLEADTLSFAPAQC 459
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 184 bits (467), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 133/424 (31%), Positives = 192/424 (45%), Gaps = 53/424 (12%)
Query: 78 RWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHE 137
R++++L+HRD S + + R+ R +R +S A + +
Sbjct: 34 RFSIDLIHRDSPKSP--------LYNPSETPAERLDRFFRRF------MSFSEASISPNT 79
Query: 138 VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
+ + +GEY ++I +G+PP Y + D+GSD++W QC PC CYKQ +P+
Sbjct: 80 PE-------PPVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPM 132
Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRT 255
FDP+ S SF VSC S C L+ C + C + YGDGS +G +A ETLT+
Sbjct: 133 FDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSN 192
Query: 256 -----VVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSR 307
+ N+ GCGH N G F GL G GG +SL Q+ +G FS CLV
Sbjct: 193 SGQPXSIXNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPF 252
Query: 308 GTGSS--GSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
T S ++FG EA G+ V PLV P++Y+V L G+ VG P S
Sbjct: 253 RTDPSITSKIIFGPEAEVSGSXVVSTPLVTK-DDPTYYFVTLDGISVGDKLFPFSSS--- 308
Query: 364 LTQMGDDG-VVMDTGTAVTRLPTPAY----EAFRDAFVAQTGNLPRASGVSIFDTCYNLS 418
+ M G V +D GT T LP Y + ++A + P CY +
Sbjct: 309 -SPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQP----QLCYRSA 363
Query: 419 GFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQ 478
+ P ++ +F G V P + F+ P + G +CFA P I GN Q
Sbjct: 364 TLID--GPILTAHFDGADVQLKPLNTFISPKE--GVYCFAMQPIDGDTGIFGNFVQMNFL 419
Query: 479 ISFD 482
I FD
Sbjct: 420 IGFD 423
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 120/350 (34%), Positives = 171/350 (48%), Gaps = 15/350 (4%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
G Y VR+ +G+P ++ YMV+D+ +D W C C C S F +S++F+ + CS
Sbjct: 93 GNYVVRVQLGTPGQTMYMVLDTSNDAAWAPCSGCIGC--SSTTTFSAQNSSTFATLDCSK 150
Query: 214 AVCDRLENAGCHAG---RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
C + C C + +YG S TL ++L +G V+ N + GC G
Sbjct: 151 PECTQARGLSCPTTGNVDCLFNQTYGGDSTFSATLVQDSLHLGPNVIPNFSFGCISSASG 210
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV 329
+ GL+GLG G +SL+ Q G G FSYCL S + SGSL G P
Sbjct: 211 SSIPPQGLMGLGRGPLSLISQSGSLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKAIRTT 270
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
PL+ NP PS YYV L+G+ VG + +PIS +L G ++D+GT +TR Y
Sbjct: 271 PLLHNPHRPSLYYVNLTGISVGRVLVPISPELLAFDPNTGAGTIIDSGTVITRFVPAIYT 330
Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
A RD F Q G S + FDTC+ + VS P ++ + SG L LP N LI
Sbjct: 331 AVRDEFRKQVGG--SFSPLGAFDTCFATNNEVS--APAITLHLSGLD-LKLPMENSLIHS 385
Query: 450 DDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C A A +P S +++I N+QQ+ +I FD N +G +C
Sbjct: 386 SAGSLACLAMAAAPNNVNSVVNVIANLQQQNHRILFDINNSKLGIARELC 435
>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
Length = 431
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 139/408 (34%), Positives = 191/408 (46%), Gaps = 51/408 (12%)
Query: 106 HSFHARMQRDVK-RVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGS 164
H R R K RVA L RL+G D + D+G Y V IG+G+
Sbjct: 54 HDMWRRSARASKARVARLEARLTG-----------DMSVPLARISDEG---YTVTIGIGT 99
Query: 165 PPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG- 223
PP+ ++ D+ SD+ W QC + KQ +P+FDPA S+SF+ V+CSS +C +N G
Sbjct: 100 PPQLHTLIADTASDLTWTQCNLFNDTAKQVEPLFDPAKSSSFAFVTCSSKLCTE-DNPGT 158
Query: 224 --CHAGRCRYEVSYGDGSYTKGTLALETLTI---GRTVVKNVAIGCGHKNQGMFVGAAGL 278
C CRY Y G LA E+ T+ + + + GCG G +GA+G+
Sbjct: 159 KRCSNKTCRYVYPYVSVE-AAGVLAYESFTLSDNNQHICMSFGFGCGALTDGNLLGASGI 217
Query: 279 LGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVR----N 334
LG+ +S+V QL FSYCL S L FG AW L R
Sbjct: 218 LGMSPAILSMVSQLAIPK---FSYCLTPYTDRKSSPLFFG--------AWADLGRYKTTG 266
Query: 335 PRAPS---FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
P S +YYV L GL +G R+ + F L Q G V+D G V +L PA+ A
Sbjct: 267 PIQKSLTFYYYVPLVGLSLGTRRLDVPAATFALKQ---GGTVVDLGCTVGQLAEPAFTAL 323
Query: 392 RDAFVAQTGNLPRAS-GVSIFDTCYNLSGFV---SVRVPTVSFYFSGGPVLTLPASNFLI 447
++A V T NLP + V + C+ L V +V+ P + YF GG + LP N+
Sbjct: 324 KEA-VLHTLNLPLTNRTVKDYKVCFALPSGVAMGAVQTPPLVLYFDGGADMVLPRDNYF- 381
Query: 448 PVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
AG C A P G+SIIGN+QQ+ + FD + F P +C
Sbjct: 382 QEPTAGLMCLALVPG-GGMSIIGNVQQQNFHLLFDVHDSKFLFAPTIC 428
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 115/352 (32%), Positives = 167/352 (47%), Gaps = 36/352 (10%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
EY +++ +G+PP V+D+GS+ +W QC PC CY Q+ P+FDP+ S++F + C +
Sbjct: 58 EYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCDT- 116
Query: 215 VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQ 269
H C YE+ YG SYTKGTL ET+TI T V+ IGCG N
Sbjct: 117 ----------HDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRNNS 166
Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV 329
G G AG++GL G SL+ Q+GG+ G SYC +GT + FG A+ G V
Sbjct: 167 GFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGKGT---SKINFGANAIVAGDGVV 223
Query: 330 P---LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
V+ + P FYY+ L + VG RI F + +V+D+G+ +T P
Sbjct: 224 STTVFVKTAK-PGFYYLNLDAVSVGNTRIETVGTPFHALK---GNIVIDSGSTLTYFPES 279
Query: 387 AYEAFRDAF--VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
R A V PR+ + + ++ P ++ +FSGG L L N
Sbjct: 280 YCNLVRKAVEQVVTAVRFPRSDILCYYSKTIDI-------FPVITMHFSGGADLVLDKYN 332
Query: 445 FLIPVDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ + G FC A SP +I GN Q + +D ++ V F P C
Sbjct: 333 MYVASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNC 384
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 115/352 (32%), Positives = 167/352 (47%), Gaps = 36/352 (10%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
EY +++ +G+PP V+D+GS+ +W QC PC CY Q+ P+FDP+ S++F + C +
Sbjct: 64 EYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCDT- 122
Query: 215 VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQ 269
H C YE+ YG SYTKGTL ET+TI T V+ IGCG N
Sbjct: 123 ----------HDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRNNS 172
Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV 329
G G AG++GL G SL+ Q+GG+ G SYC +GT + FG A+ G V
Sbjct: 173 GFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGKGT---SKINFGANAIVAGDGVV 229
Query: 330 P---LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
V+ + P FYY+ L + VG RI F + +V+D+G+ +T P
Sbjct: 230 STTVFVKTAK-PGFYYLNLDAVSVGNTRIETVGTPFHALK---GNIVIDSGSTLTYFPES 285
Query: 387 AYEAFRDAF--VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
R A V PR+ + + ++ P ++ +FSGG L L N
Sbjct: 286 YCNLVRKAVEQVVTAVRFPRSDILCYYSKTIDI-------FPVITMHFSGGADLVLDKYN 338
Query: 445 FLIPVDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ + G FC A SP +I GN Q + +D ++ V F P C
Sbjct: 339 MYVASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNC 390
>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
Length = 414
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 124/350 (35%), Positives = 174/350 (49%), Gaps = 18/350 (5%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
S Y VR +G+PP++ + +D+ +D W+ C C C + +F P S +F VSC+
Sbjct: 75 SPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGC---ASTLFAPEKSTTFKNVSCA 131
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
+ C ++ N GC C + ++YG S L +T+T+ V + GC K G
Sbjct: 132 APECKQVPNPGCGVSSCNFNLTYGSSSIA-ANLVQDTITLATDPVPSYTFGCVSKTTGTS 190
Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPL 331
GLLGLG G +SL+ Q FSYCL S + SGSL G A P + PL
Sbjct: 191 APPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVAQPKRIKYTPL 250
Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
++NPR S YYV L + VG + I G + D+GT TRL P Y A
Sbjct: 251 LKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTVFTRLVAPVYVAV 310
Query: 392 RDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
RD F + G P+ + S+ FDTCYN V + VPT++F F+G V TLP N LI
Sbjct: 311 RDEFRRRVG--PKLTVTSLGGFDTCYN----VPIVVPTITFIFTGMNV-TLPQDNILIHS 363
Query: 450 DDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
T C A A +P S L++I N+QQ+ ++ +D N VG +C
Sbjct: 364 TAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPNSRVGVARELC 413
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 119/350 (34%), Positives = 176/350 (50%), Gaps = 36/350 (10%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSC 211
+ Y V I +G+PP V+D+GSD++W QC PC +C+ Q P++ PA SA+++ VSC
Sbjct: 89 TATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSC 148
Query: 212 SSAVCDRLENAGCHAGR----CRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGH 266
S +C L++ C Y SYGDG+ T G LA ET T+G T V+ VA GCG
Sbjct: 149 RSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAFGCGT 208
Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA 326
+N G ++GL+G+G G +SLV QLG V+R S +
Sbjct: 209 ENLGSTDNSSGLVGMGRGPLSLVSQLG-----------VTRPRRSCRARAA------ARG 251
Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
P +P L G+ VG +PI +FRLT MGD GV++D+GT T L
Sbjct: 252 GGAPTTTSP---------LEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALEER 302
Query: 387 AYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNF 445
A+ A A ++ LP ASG + C+ + +V VP + +F G + L ++
Sbjct: 303 AFVALARALASRV-RLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGAD-MELRRESY 360
Query: 446 LIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
++ AG C S G+S++G++QQ+ I +D G + F P C
Sbjct: 361 VVEDRSAGVACLGMV-SARGMSVLGSMQQQNTHILYDLERGILSFEPAKC 409
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 127/360 (35%), Positives = 179/360 (49%), Gaps = 34/360 (9%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
EY V + +G+PP+ + +D+GSD++W QCQPC C+ Q+ P FDP+ S++ S SC S
Sbjct: 34 EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDST 93
Query: 215 VCDRLENAGCHAGR------CRYEVSYGDGSYTKGTLALETLTI--GRTVVKNVAIGCGH 266
+C L A C + + C Y SYGD S T G L ++ T V VA GCG
Sbjct: 94 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGL 153
Query: 267 KNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV- 324
N G+F G+ G G G +SL QL G FS+C + TG+ S V LP
Sbjct: 154 FNNGVFKSNETGIAGFGRGPLSLPSQL---KVGNFSHCFTTI-TGAIPSTVL--LDLPAD 207
Query: 325 -------GAAWVPLV---RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
PL+ +N P+ YY+ L G+ VG R+P+ E F LT G G ++
Sbjct: 208 LFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTN-GTGGTII 266
Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFS 433
D+GT++T LP Y+ RD F AQ LP G + TC++ VP + +F
Sbjct: 267 DSGTSITSLPPQVYQVVRDEFAAQI-KLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFE 325
Query: 434 GGPVLTLPASNFLIPV-DDAGT--FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
G + LP N++ V DDAG C A +IIGN QQ+ + + +D N + F
Sbjct: 326 GA-TMDLPRENYVFEVPDDAGNSIICLAINKG-DETTIIGNFQQQNMHVLYDLQNNMLSF 383
>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
Length = 389
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 102/293 (34%), Positives = 158/293 (53%), Gaps = 24/293 (8%)
Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVK 258
+ +A S +S VC G A C Y ++YGDGS+T+G L E L G +VK
Sbjct: 52 EDVSNAQIPVTSGNSGVC------GSAAPICNYAINYGDGSFTRGELGHEKLKFGTILVK 105
Query: 259 NVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG 318
+ GCG N+G+F G +GL+GLG +SL+ Q G GG FSYCL S SGSL+ G
Sbjct: 106 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLISQTSGIFGGVFSYCLPSTERKGSGSLILG 165
Query: 319 ------REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
R + P+ ++ ++ NP+ +FY++ L+G+ +GG+ + + +G +
Sbjct: 166 GNSSVYRNSSPI--SYAKMIENPQLYNFYFINLTGISIGGVAL-------QAPSVGPSRI 216
Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYF 432
++D+GT +TRLP Y+A + F+ Q P A SI DTC+NLS + V +PT+ +F
Sbjct: 217 LVDSGTVITRLPPTIYKALKAEFLKQFTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHF 276
Query: 433 SGGPVLTLPASN-FLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFD 482
G LT+ + F DA C A A ++I+GN QQ+ +++ +D
Sbjct: 277 EGNAELTVDVTGVFYFVKSDASQVCLALASLEYQDEVAILGNYQQKNLRVIYD 329
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 117/349 (33%), Positives = 168/349 (48%), Gaps = 31/349 (8%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
Y +++ VG+PP +ID+GS+I W QC PC CY+Q+ P+FDP+ S++F
Sbjct: 65 YLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTF--------- 115
Query: 216 CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQG 270
+ C C YEV Y D +YT GTLA ET+T+ T V+ IGCGH N
Sbjct: 116 ----KEKRCDGHSCPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETIIGCGHNNSW 171
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV- 329
+G++GL G SL+ Q+GG+ G SYC +GT + FG A+ G V
Sbjct: 172 FKPSFSGMVGLNWGPSSLITQMGGEYPGLMSYCFSGQGT---SKINFGANAIVAGDGVVS 228
Query: 330 -PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
+ P FYY+ L + VG RI F + +V+D+GT +T P
Sbjct: 229 TTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTFHALE---GNIVIDSGTTLTYFPVSYC 285
Query: 389 EAFRDAFVAQTGNLPRASGVSIFDT-CYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
R A V RA+ + D CYN S + + P ++ +FSGG L L N +
Sbjct: 286 NLVRQA-VEHVVTAVRAADPTGNDMLCYN-SDTIDI-FPVITMHFSGGVDLVLDKYNMYM 342
Query: 448 PVDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
++ G FC A SP+ +I GN Q + +D ++ V F P C
Sbjct: 343 ESNNGGVFCLAIICNSPTQEAIFGNRAQNNFLVGYDSSSLLVSFSPTNC 391
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 121/348 (34%), Positives = 165/348 (47%), Gaps = 15/348 (4%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
S Y V+ VG+PP++ M +D+ D W+ C+ C C S VF+ S +F + C
Sbjct: 32 SPSYIVKAKVGTPPQTLLMALDNSYDAAWIPCKGCVGC---SSTVFNTVKSTTFKTLGCG 88
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
+ C ++ N C C + +YG S L +T+ + V A GC K G
Sbjct: 89 APQCKQVPNPICGGSTCTWNTTYGS-STILSNLTRDTIALSMDPVPYYAFGCIQKATGSS 147
Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPL 331
V GLLG G G +S + Q FSYCL S R SGSL G P PL
Sbjct: 148 VPPQGLLGFGRGPLSFLSQTQNLYKSTFSYCLPSFRTLNFSGSLRLGPVGQPPRIKTTPL 207
Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
++NPR S YYV L+G+ VG + I G + D+GT TRL PAY A
Sbjct: 208 LKNPRRSSLYYVKLNGIRVGRKIVDIPRSALAFNPTTGAGTIFDSGTVFTRLVAPAYIAV 267
Query: 392 RDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDD 451
R+ F + GN S + FDTCY+ V + PT++F FSG V T+P N LI
Sbjct: 268 RNEFRKRVGNA-TVSSLGGFDTCYS----VPIVPPTITFMFSGMNV-TMPPENLLIHSTA 321
Query: 452 AGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
T C A A +P S L++I ++QQ+ +I FD N +G C
Sbjct: 322 GVTSCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRLGVAREQC 369
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 120/345 (34%), Positives = 166/345 (48%), Gaps = 35/345 (10%)
Query: 168 SQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDRL---ENA 222
+Q MV+D+ SD+ WVQC PC CY Q D ++DP S+S SC+S C +L N
Sbjct: 168 TQTMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANG 227
Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGMFV---GAAGL 278
+ +C+Y V Y DG+ T GT + LTI T V++ GC H QG F AAG+
Sbjct: 228 CTNNNQCQYRVRYPDGTSTAGTYISDLLTITPATAVRSFQFGCSHGVQGSFSFGSSAAGI 287
Query: 279 LGLGGGSMSLVGQLGGQTGGAFSYCL---VSRGTGSSGSLVFGREALPVGAAW----VPL 331
+ LGGG SLV Q G FS+C RG F +P AAW P+
Sbjct: 288 MALGGGPESLVSQTAATYGRVFSHCFPPPTRRG--------FFTLGVPRVAAWRYVLTPM 339
Query: 332 VRNPR-APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
++NP P+FY V L + V G RI + +F G +D+ TA+TRLP AY+A
Sbjct: 340 LKNPAIPPTFYMVRLEAIAVAGQRIAVPPTVFAA------GAALDSRTAITRLPPTAYQA 393
Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
R AF + A DTCY+++G S +P ++ F + L S L
Sbjct: 394 LRQAFRDRMAMYQPAPPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLF--- 450
Query: 451 DAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
G F P+ IIGNIQ + +++ ++ VGF C
Sbjct: 451 -QGCLAFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 494
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 182 bits (461), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 132/426 (30%), Positives = 205/426 (48%), Gaps = 49/426 (11%)
Query: 109 HARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVV--SGMDQGSGEYFVRIGVGSPP 166
H ++R ++R RL+G G A+ E VV + + GEY V++G+G+PP
Sbjct: 45 HELLRRAIQRSRY---RLAGIGM--ARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPP 99
Query: 167 RSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC-- 224
ID+ SD++W QCQPC+ CY Q DP+F+P S++++ + CSS CD L+ C
Sbjct: 100 YKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGH 159
Query: 225 -HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG--MFVGAAGLLGL 281
C+Y +Y + T+GTLA++ L IG + VA GC + G A+G++GL
Sbjct: 160 DDDESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRGVAFGCSTSSTGGAPPPQASGVVGL 219
Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAW----VPLVRNPRA 337
G G +SLV QL + F+YCL + G LV G +A A VP+ R+PR
Sbjct: 220 GRGPLSLVSQLSVRR---FAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRY 276
Query: 338 PSFYYVGLSGLGVGGMRIPISEDLFRL--------------------TQMGDD---GVVM 374
PS+YY+ L GL +G + + +GD G+++
Sbjct: 277 PSYYYLNLDGLLIGDRTMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMII 336
Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLS---GFVSVRVPTVSF 430
D + +T L Y+ + + LPR +G S+ D C+ L F V VP V+
Sbjct: 337 DIASTITFLEASLYDELVNDLEVEI-RLPRGTGSSLGLDLCFILPDGVAFDRVYVPAVAL 395
Query: 431 YFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVG 489
F G L L + ++G C + +G +SI+GN QQ+ +Q+ ++ G V
Sbjct: 396 AFDGR-WLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVLYNLRRGRVT 454
Query: 490 FGPNVC 495
F + C
Sbjct: 455 FVQSPC 460
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 182 bits (461), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 132/426 (30%), Positives = 205/426 (48%), Gaps = 49/426 (11%)
Query: 109 HARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVV--SGMDQGSGEYFVRIGVGSPP 166
H ++R ++R RL+G G A+ E VV + + GEY V++G+G+PP
Sbjct: 45 HELLRRAIQRSRY---RLAGIGM--ARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPP 99
Query: 167 RSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC-- 224
ID+ SD++W QCQPC+ CY Q DP+F+P S++++ + CSS CD L+ C
Sbjct: 100 YKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGH 159
Query: 225 -HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG--MFVGAAGLLGL 281
C+Y +Y + T+GTLA++ L IG + VA GC + G A+G++GL
Sbjct: 160 DDDESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRGVAFGCSTSSTGGAPPPQASGVVGL 219
Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA----WVPLVRNPRA 337
G G +SLV QL + F+YCL + G LV G +A A VP+ R+PR
Sbjct: 220 GRGPLSLVSQLSVRR---FAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRY 276
Query: 338 PSFYYVGLSGLGVGGMRIPISEDLFRL--------------------TQMGDD---GVVM 374
PS+YY+ L GL +G + + +GD G+++
Sbjct: 277 PSYYYLNLDGLLIGDRAMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMII 336
Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLS---GFVSVRVPTVSF 430
D + +T L Y+ + + LPR +G S+ D C+ L F V VP V+
Sbjct: 337 DIASTITFLEASLYDELVNDLEVEI-RLPRGTGSSLGLDLCFILPDGVAFDRVYVPAVAL 395
Query: 431 YFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVG 489
F G L L + ++G C + +G +SI+GN QQ+ +Q+ ++ G V
Sbjct: 396 AFDGR-WLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVLYNLRRGRVT 454
Query: 490 FGPNVC 495
F + C
Sbjct: 455 FVQSPC 460
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 181 bits (460), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 120/345 (34%), Positives = 166/345 (48%), Gaps = 35/345 (10%)
Query: 168 SQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDRL---ENA 222
+Q MV+D+ SD+ WVQC PC CY Q D ++DP S+S SC+S C +L N
Sbjct: 143 TQTMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANG 202
Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGMFV---GAAGL 278
+ +C+Y V Y DG+ T GT + LTI T V++ GC H QG F AAG+
Sbjct: 203 CTNNNQCQYRVRYPDGTSTAGTYISDLLTITPATAVRSFQFGCSHGVQGSFSFGSSAAGI 262
Query: 279 LGLGGGSMSLVGQLGGQTGGAFSYCL---VSRGTGSSGSLVFGREALPVGAAW----VPL 331
+ LGGG SLV Q G FS+C RG F +P AAW P+
Sbjct: 263 MALGGGPESLVSQTAATYGRVFSHCFPPPTRRG--------FFTLGVPRVAAWRYVLTPM 314
Query: 332 VRNPR-APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
++NP P+FY V L + V G RI + +F G +D+ TA+TRLP AY+A
Sbjct: 315 LKNPAIPPTFYMVRLEAIAVAGQRIAVPPTVFAA------GAALDSRTAITRLPPTAYQA 368
Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
R AF + A DTCY+++G S +P ++ F + L S L
Sbjct: 369 LRQAFRDRMAMYQPAPPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLF--- 425
Query: 451 DAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
G F P+ IIGNIQ + +++ ++ VGF C
Sbjct: 426 -QGCLAFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 469
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 181 bits (459), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 122/342 (35%), Positives = 174/342 (50%), Gaps = 34/342 (9%)
Query: 169 QYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENAGC 224
Q +V+DS SD+ WVQC PC C+ Q D +DP+ S S + SCSS C L GC
Sbjct: 159 QTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTALGPYANGC 218
Query: 225 HAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQGMF-VGAAGLLGLG 282
+C+Y V Y DGS T G + LT+ V GC H QG F AAG++ LG
Sbjct: 219 ANNQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFKFGCSHAEQGSFDARAAGIMALG 278
Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA----WVPLVRNPRAP 338
GG SL+ Q + G AFSYC+ + + SG G +P A+ P+VR +A
Sbjct: 279 GGPESLLSQTASRYGNAFSYCIPATAS-DSGFFTLG---VPRRASSRYVVTPMVRFRQAA 334
Query: 339 SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ 398
+FY V L + VGG R+ ++ +F G V+D+ TA+TRLP AY+A R AF +
Sbjct: 335 TFYGVLLRTITVGGQRLGVAPAVFAA------GSVLDSRTAITRLPPTAYQALRSAFRSS 388
Query: 399 TGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF--- 455
A DTCY+ +G V++R+P +S F N ++P+D +G
Sbjct: 389 MTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFD---------RNAVLPLDPSGILFND 439
Query: 456 CFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C AF + ++G++QQ+ I++ +D G VGF C
Sbjct: 440 CLAFTSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 481
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 181 bits (459), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 135/412 (32%), Positives = 207/412 (50%), Gaps = 47/412 (11%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
++RD+ R L+ QD T +GEY + + +G+PP
Sbjct: 57 LRRDMHRHNARKLALAASSGATVSAPTQDSPT---------AGEYLMALAIGTPPLPYQA 107
Query: 172 VIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSA--VCDRL-------EN 221
+ D+GSD++W QC PC SQC++Q P+++P+ S +F+ + C+S+ VC
Sbjct: 108 IADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPP 167
Query: 222 AGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV-----VKNVAIGCGHKNQGMFVGAA 276
GC C Y V+YG G +T ET T G T V +A GC + G +A
Sbjct: 168 PGC---ACTYNVTYGSG-WTSVFQGSETFTFGSTPAGHARVPGIAFGCSTASSGFNASSA 223
Query: 277 -GLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV---PL 331
GL+GLG G +SLV QLG FSYCL + T S+ +L+ G A G A V P
Sbjct: 224 SGLVGLGRGRLSLVSQLGVP---KFSYCLTPYQDTNSTSTLLLGPSASLNGTAGVSSTPF 280
Query: 332 VRNPR-AP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
V +P AP +FYY+ L+G+ +G + I D F L G G+++D+GT +T L AY
Sbjct: 281 VASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGTGGLIIDSGTTITLLGNTAY 340
Query: 389 EAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSV--RVPTVSFYFSGGPVLTLPASN 444
+ R A V+ LP G + D C+ L S +P+++ +F+G ++ LPA +
Sbjct: 341 QQVRAAVVSLV-TLPTTDGSADTGLDLCFMLPSSTSAPPAMPSMTLHFNGADMV-LPADS 398
Query: 445 FLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+++ DD+G +C A G ++I+GN QQ+ + I +D + F P C
Sbjct: 399 YMM-SDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYDIGQETLSFAPAKC 449
>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
gi|194690728|gb|ACF79448.1| unknown [Zea mays]
Length = 431
Score = 181 bits (459), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 119/360 (33%), Positives = 171/360 (47%), Gaps = 18/360 (5%)
Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
Q Y VR G+G+P + + +D+ +D W C PC C S F PA S+S++ +
Sbjct: 74 QTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLP 131
Query: 211 CSSAVCDRLENAGCHAGR--------CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAI 262
C+S C E C A + C + + D S+ + +L +TL +G+ + A
Sbjct: 132 CASDWCPLFEGQPCPANQDASAPLPACAFSKPFADTSF-QASLGSDTLRLGKDAIAGYAF 190
Query: 263 GCGHKNQG--MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGR 319
GC G + GLLGLG G MSL+ Q G + G FSYCL S R SGSL G
Sbjct: 191 GCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGA 250
Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
P + PL+ NP PS YYV ++GL VG + + F G V+D+GT
Sbjct: 251 AGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTV 310
Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
+TR P Y A R+ F Q + + FDTC+N + P V+ + GG LT
Sbjct: 311 ITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLT 370
Query: 440 LPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
LP N LI C A A +P + ++++ N+QQ+ +++ D A VGF C
Sbjct: 371 LPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430
>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
gi|194703964|gb|ACF86066.1| unknown [Zea mays]
gi|219886221|gb|ACL53485.1| unknown [Zea mays]
gi|219886359|gb|ACL53554.1| unknown [Zea mays]
gi|223950085|gb|ACN29126.1| unknown [Zea mays]
gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 431
Score = 181 bits (458), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 119/360 (33%), Positives = 171/360 (47%), Gaps = 18/360 (5%)
Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
Q Y VR G+G+P + + +D+ +D W C PC C S F PA S+S++ +
Sbjct: 74 QTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLP 131
Query: 211 CSSAVCDRLENAGCHAGR--------CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAI 262
C+S C E C A + C + + D S+ + +L +TL +G+ + A
Sbjct: 132 CASDWCPLFEGQPCPANQDASAPLPACAFSKPFADTSF-QASLGSDTLRLGKDAIAGYAF 190
Query: 263 GCGHKNQG--MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGR 319
GC G + GLLGLG G MSL+ Q G + G FSYCL S R SGSL G
Sbjct: 191 GCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGA 250
Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
P + PL+ NP PS YYV ++GL VG + + F G V+D+GT
Sbjct: 251 AGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTV 310
Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
+TR P Y A R+ F Q + + FDTC+N + P V+ + GG LT
Sbjct: 311 ITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLT 370
Query: 440 LPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
LP N LI C A A +P + ++++ N+QQ+ +++ D A VGF C
Sbjct: 371 LPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 181 bits (458), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 128/371 (34%), Positives = 196/371 (52%), Gaps = 38/371 (10%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSC 211
+GEY + + +G+PP + D+GSD++W QC PC SQC++Q P+++P+ S +F+ + C
Sbjct: 87 AGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPC 146
Query: 212 SSA--VCDRL-------ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV-----V 257
+S+ VC GC C Y V+YG G +T ET T G T V
Sbjct: 147 NSSLSVCAAALAGTGTAPPPGC---ACTYNVTYGSG-WTSVFQGSETFTFGSTPAGQSRV 202
Query: 258 KNVAIGCGHKNQGMFVGAA-GLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSL 315
+A GC + G +A GL+GLG G +SLV QLG FSYCL + T S+ +L
Sbjct: 203 PGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVP---KFSYCLTPYQDTNSTSTL 259
Query: 316 VFGREALPVGAAWV---PLVRNPR-AP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
+ G A G A V P V +P AP +FYY+ L+G+ +G + I D F L G
Sbjct: 260 LLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFLLNADGT 319
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSV--RV 425
G+++D+GT +T L AY+ R A V+ LP G + D C+ L S +
Sbjct: 320 GGLIIDSGTTITLLGNTAYQQVRAAVVSLV-TLPTTDGSAATGLDLCFMLPSSTSAPPAM 378
Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGA 484
P+++ +F+G ++ LPA ++++ DD+G +C A G ++I+GN QQ+ + I +D
Sbjct: 379 PSMTLHFNGADMV-LPADSYMM-SDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYDIG 436
Query: 485 NGFVGFGPNVC 495
+ F P C
Sbjct: 437 QETLSFAPAKC 447
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 139/447 (31%), Positives = 214/447 (47%), Gaps = 61/447 (13%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
L+L+HRD S +T N R Q SF + R + V D
Sbjct: 29 LDLIHRDSPLSPLHTPNLTFSDRLQASFLRAISRQSRHV--------------------D 68
Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
F TD++ GEY + + +G+PP + D+GSD+ W+Q +PC QCY Q P+FDP
Sbjct: 69 FQTDLLPS----GGEYMMNLSIGTPPFPILAIADTGSDLTWLQSKPCDQCYPQKGPIFDP 124
Query: 201 ADSASFSGVSCSSAVCDRLENAG---CHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV 257
++S +F + C++A C+ L+ + C Y SYGD SYT G LA +T+T+G V
Sbjct: 125 SNSTTFHKLPCTTAPCNALDESARSCTDPTTCGYTYSYGDHSYTTGYLASDTVTVGNASV 184
Query: 258 --KNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV--------- 305
+NVA GCG +N G F +G++GLGGG++S V QLG G FSYCL+
Sbjct: 185 QIRNVAFGCGTRNGGNFDEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCLLPLENEISSQ 244
Query: 306 SRGTGSSGSLVFGREAL-------PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI--- 355
+ ++ +VFG + V A PLV N ++YY+ + + VG ++
Sbjct: 245 PSDSPATSRIVFGDNPVFSSSSTNGVVFATTPLV-NKEPSTYYYLTIEAITVGRKKLLYS 303
Query: 356 -----PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV-- 408
S D + + + +++D+GT +T L Y A A V + + R + V
Sbjct: 304 SSSSKTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEI-KMERVNDVKN 362
Query: 409 SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSI 468
S+F C+ SG V +P + +F GG + L N + ++ G CF P+ + + I
Sbjct: 363 SMFSLCFK-SGKEEVELPLMKVHFRGGADVELKPVNTFVRAEE-GLVCFTMLPT-NDVGI 419
Query: 469 IGNIQQEGIQISFDGANGFVGFGPNVC 495
GN+ Q + +D V F P C
Sbjct: 420 YGNLAQMNFVVGYDLGKRTVSFLPADC 446
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 131/410 (31%), Positives = 192/410 (46%), Gaps = 35/410 (8%)
Query: 112 MQRDVKRVATLVRRLSGGG----ADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPR 167
MQR R A L +GGG A+ ++ G V + G EY + + VG+PP+
Sbjct: 53 MQRSKARAAALSVVRNGGGFYGSIAQAREREREPGMAVRA---SGDLEYVLDLAVGTPPQ 109
Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC-DRLENAGCHA 226
++D+GSD++W QC C+ C +Q DP+F P S+S+ + C+ +C D L ++
Sbjct: 110 PITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQLCGDILHHSCVRP 169
Query: 227 GRCRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
C Y SYGDG+ T G A E T G T + GCG N G A+G++G G
Sbjct: 170 DTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGCGTMNVGSLNNASGIVGFG 229
Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG--------AAWVPLVRN 334
+SLV QL + FSYCL + +L FG A VG P++++
Sbjct: 230 RDPLSLVSQLSIRR---FSYCLTPYASSRKSTLQFGSLA-DVGLYDDATGPVQTTPILQS 285
Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
+ P+FYYV +G+ VG R+ I F L G GV++D+GTA+T P A
Sbjct: 286 AQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSGTALTLFPVAVLAEVVRA 345
Query: 395 FVAQTGNLPRASGVSIFD-TCYNLSGFV--------SVRVPTVSFYFSGGPVLTLPASNF 445
F +Q LP A+G S D C+ V VP + F+F G L LP N+
Sbjct: 346 FRSQL-RLPFANGSSPDDGVCFAAPAVAAGGGRMARQVAVPRMVFHFQGAD-LDLPRENY 403
Query: 446 LIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
++ G C S + IGN Q+ +++ +D + F P C
Sbjct: 404 VLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLERETLSFAPVEC 453
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 131/410 (31%), Positives = 192/410 (46%), Gaps = 35/410 (8%)
Query: 112 MQRDVKRVATLVRRLSGGG----ADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPR 167
MQR R A L +GGG A+ ++ G V + G EY + + VG+PP+
Sbjct: 53 MQRSKARAAALSVVRNGGGFYGSIAQAREREREPGMAVRA---SGDLEYVLDLAVGTPPQ 109
Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC-DRLENAGCHA 226
++D+GSD++W QC C+ C +Q DP+F P S+S+ + C+ +C D L ++
Sbjct: 110 PITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQLCGDILHHSCVRP 169
Query: 227 GRCRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
C Y SYGDG+ T G A E T G T + GCG N G A+G++G G
Sbjct: 170 DTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGCGTMNVGSLNNASGIVGFG 229
Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG--------AAWVPLVRN 334
+SLV QL + FSYCL + +L FG A VG P++++
Sbjct: 230 RDPLSLVSQLSIRR---FSYCLTPYASSRKSTLQFGSLA-DVGLYDDATGPVQTTPILQS 285
Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
+ P+FYYV +G+ VG R+ I F L G GV++D+GTA+T P A
Sbjct: 286 AQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSGTALTLFPAAVLAEVVRA 345
Query: 395 FVAQTGNLPRASGVSIFD-TCYNLSGFV--------SVRVPTVSFYFSGGPVLTLPASNF 445
F +Q LP A+G S D C+ V VP + F+F G L LP N+
Sbjct: 346 FRSQL-RLPFANGSSPDDGVCFAAPAVAAGGGRMARQVAVPRMVFHFQGAD-LDLPRENY 403
Query: 446 LIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
++ G C S + IGN Q+ +++ +D + F P C
Sbjct: 404 VLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLERETLSFAPVEC 453
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 130/445 (29%), Positives = 211/445 (47%), Gaps = 50/445 (11%)
Query: 69 SSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSG 128
+SN+S++ +EL+HRD S Y+ H H+ R+ R + RR +
Sbjct: 19 ASNSSANRENLTVELIHRDSPHSP-------LYNPH-HTVSDRLNAAFLRSISRSRRFTT 70
Query: 129 GGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS 188
TD+ SG+ GEYF+ I +G+PP + + D+GSD+ WVQC+PC
Sbjct: 71 K-------------TDLQSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQ 117
Query: 189 QCYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENAGCHAGR--CRYEVSYGDGSYTKGT 244
QCYKQ+ P+FD S+++ SC S C L GC + C+Y SYGD S+TKG
Sbjct: 118 QCYKQNSPLFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGD 177
Query: 245 LALETLTIGRTVVKN-----VAIGCGHKNQGMFVGAAGLLGLGGGS-MSLVGQLGGQTGG 298
+A ET++I + + GCG+ N G F + GG +SLV QLG G
Sbjct: 178 VATETISIDSSSGSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGK 237
Query: 299 AFSYCLVSRGTGSSGSLV--FGREALPVGAA------WVPLV-RNPRAPSFYYVGLSGLG 349
FSYCL ++G+ V G ++P + PL+ ++P ++Y++ L +
Sbjct: 238 KFSYCLSHTAATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPE--TYYFLTLEAVT 295
Query: 350 VGGMRIPISEDLFRLTQMGDD---GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
VG ++P + + L +++D+GT +T L + Y+ F A R S
Sbjct: 296 VGKTKLPYTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVS 355
Query: 407 GVS-IFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG 465
+ C+ SG + +P ++ +F+ V P + F+ +D T C + P+ +
Sbjct: 356 DPQGLLTHCFK-SGDKEIGLPAITMHFTNADVKLSPINAFVKLNED--TVCLSMIPT-TE 411
Query: 466 LSIIGNIQQEGIQISFDGANGFVGF 490
++I GN+ Q + +D V F
Sbjct: 412 VAIYGNMVQMDFLVGYDLETKTVSF 436
>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 123/353 (34%), Positives = 173/353 (49%), Gaps = 20/353 (5%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
S Y VR +GSPP++ + +D+ +D W+ C C C + +F P S +F VSC
Sbjct: 95 SPTYIVRAKIGSPPQTLLLAMDTSNDAAWIPCTACDGC---TSTLFAPEKSTTFKNVSCG 151
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
S C+++ N C C + ++YG S + +T+T+ + + GC K G
Sbjct: 152 SPQCNQVPNPSCGTSACTFNLTYGSSSIAANVVQ-DTVTLATDPIPDYTFGCVAKTTGAS 210
Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPL 331
GLLGLG G +SL+ Q FSYCL S + SGSL G A P+ + PL
Sbjct: 211 APPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVAQPIRIKYTPL 270
Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
++NPR S YYV L + VG + I + G V D+GT TRL PAY A
Sbjct: 271 LKNPRRSSLYYVNLVAIRVGRKVVDIPPEALAFNAATGAGTVFDSGTVFTRLVAPAYTAV 330
Query: 392 RDAF-----VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
RD F +A NL + + FDTCY V + PT++F FSG V TLP N L
Sbjct: 331 RDEFQRRVAIAAKANL-TVTSLGGFDTCYT----VPIVAPTITFMFSGMNV-TLPEDNIL 384
Query: 447 IPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
I T C A A +P S L++I N+QQ+ ++ +D N +G +C
Sbjct: 385 IHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHRVLYDVPNSRLGVARELC 437
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 128/371 (34%), Positives = 196/371 (52%), Gaps = 38/371 (10%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSC 211
+GEY + + +G+PP + D+GSD++W QC PC SQC++Q P+++P+ S +F+ + C
Sbjct: 29 AGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPC 88
Query: 212 SSA--VCDRL-------ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV-----V 257
+S+ VC GC C Y V+YG G +T ET T G T V
Sbjct: 89 NSSLSVCAAALAGTGTAPPPGC---ACTYNVTYGSG-WTSVFQGSETFTFGSTPAGHARV 144
Query: 258 KNVAIGCGHKNQGMFVGAA-GLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSL 315
+A GC + G +A GL+GLG G +SLV QLG FSYCL + T S+ +L
Sbjct: 145 PGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVP---KFSYCLTPYQDTNSTSTL 201
Query: 316 VFGREALPVGAAWV---PLVRNPR-AP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
+ G A G A V P V +P AP +FYY+ L+G+ +G + I D F L G
Sbjct: 202 LLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGT 261
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSV--RV 425
G+++D+GT +T L AY+ R A V+ LP G + D C+ L S +
Sbjct: 262 GGLIIDSGTTITLLGNTAYQQVRAAVVSLV-TLPTTDGSADTGLDLCFMLPSSTSAPPAM 320
Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGA 484
P+++ +F+G ++ LPA ++++ DD+G +C A G ++I+GN QQ+ + I +D
Sbjct: 321 PSMTLHFNGADMV-LPADSYMM-SDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYDIG 378
Query: 485 NGFVGFGPNVC 495
+ F P C
Sbjct: 379 QETLSFAPAKC 389
>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 431
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 119/360 (33%), Positives = 170/360 (47%), Gaps = 18/360 (5%)
Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
Q Y VR G+G+P + + +D+ +D W C PC C S F PA S+S++ +
Sbjct: 74 QTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLP 131
Query: 211 CSSAVCDRLENAGCHAGR--------CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAI 262
C+S C E C A + C + + D S+ + +L +TL +G+ + A
Sbjct: 132 CASDWCPLFEGQPCPANQDASAPLPACAFSKPFADTSF-QASLGSDTLRLGKDAIAGYAF 190
Query: 263 GCGHKNQG--MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGR 319
GC G + GLLGLG G MSL+ Q G G FSYCL S R SGSL G
Sbjct: 191 GCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNGVFSYCLPSYRSYYFSGSLRLGA 250
Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
P + PL+ NP PS YYV ++GL VG + + F G V+D+GT
Sbjct: 251 AGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTV 310
Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
+TR P Y A R+ F Q + + FDTC+N + P V+ + GG LT
Sbjct: 311 ITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLT 370
Query: 440 LPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
LP N LI C A A +P + ++++ N+QQ+ +++ D A VGF C
Sbjct: 371 LPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 129/410 (31%), Positives = 188/410 (45%), Gaps = 31/410 (7%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQ--DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQ 169
MQR R A L SG G K Q V G EY + + +G+PP+
Sbjct: 57 MQRSKARAAALSVARSGSGRVPGKSAQQGEQHQQPGVPVRPSGDLEYLIDLAIGTPPQPV 116
Query: 170 YMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH-AGR 228
++D+GSD++W QC PC+ C Q DP+F PA S+S+ + CS +C+ + + C
Sbjct: 117 SALLDTGSDLIWTQCAPCASCLAQPDPLFAPAASSSYVPMRCSGQLCNDILHHSCQRPDT 176
Query: 229 CRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGG 284
C Y +YGDG+ T G A E T G + + GCG N G +G++G G
Sbjct: 177 CTYRYNYGDGTTTLGVYATERFTFASSSGEKLSVPLGFGCGTMNVGSLNNGSGIVGFGRD 236
Query: 285 SMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR---------EALPVGAAWVPLVRNP 335
+SLV QL + FSYCL + +L+FG +A L+++
Sbjct: 237 PLSLVSQLSIRR---FSYCLTPYTSTRKSTLMFGSLSDGVFEGDDAATGQVQTTRLLQSR 293
Query: 336 RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF 395
+ P+FYYV +G+ VG R+ I F L G GV++D+GTA+T P AF
Sbjct: 294 QNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDGSGGVIVDSGTALTLFPAAVLTEVLRAF 353
Query: 396 VAQTGNLPRASGVSIFD-TCY---------NLSGFVSVRVPTVSFYFSGGPVLTLPASNF 445
AQ LP S S D C+ S V VP ++F+F G L LP N+
Sbjct: 354 RAQL-RLPFTSSSSPDDGVCFATPMAAGGRRASAATVVSVPRMAFHFQGAD-LELPRRNY 411
Query: 446 LIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
++ G+ C A S + IGN Q+ +++ +D + F P C
Sbjct: 412 VLDDPRRGSLCILLADSGDSGATIGNFVQQDMRVLYDLEAETLSFAPAQC 461
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 124/347 (35%), Positives = 170/347 (48%), Gaps = 14/347 (4%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
Y VR +G+PP+ + +D+ +D W+ C C+ C S P FDPA S S+ V C S +
Sbjct: 110 YVVRARLGTPPQQLLLAVDTSNDAAWIPCAGCAGCPTSSAPPFDPAASTSYRSVPCGSPL 169
Query: 216 CDRLENAGCHAG--RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFV 273
C + NA C G C + ++Y D S + L+ ++L + VK GC K G
Sbjct: 170 CAQAPNAACPPGGKACGFSLTYADSSL-QAALSQDSLAVAGDAVKTYTFGCLQKATGTAA 228
Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLV 332
GLLGLG G +S + Q G FSYCL S + SG+L GR P PL+
Sbjct: 229 PPQGLLGLGRGPLSFLSQTRDMYQGTFSYCLPSFKSLNFSGTLRLGRNGQPPRIKTTPLL 288
Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
NP S YYV ++G+ VG +PI G V+D+GT TRL PAY A R
Sbjct: 289 ANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVLDSGTMFTRLVAPAYVAVR 348
Query: 393 DAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA 452
D + G S + FDTC+N + +V P V+ F G V TLP N +I
Sbjct: 349 DEVRRRVGA--PVSSLGGFDTCFNTT---AVAWPPVTLLFDGMQV-TLPEENVVIHSTYG 402
Query: 453 GTFCFAFAPSPSG----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C A A +P G L++I ++QQ+ ++ FD NG VGF C
Sbjct: 403 TISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 449
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 124/363 (34%), Positives = 179/363 (49%), Gaps = 37/363 (10%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+GEY + + +G+PP + + D+GSD++WVQC PC C+ Q P+F+P S++F +C
Sbjct: 89 NGEYLMTLYIGTPPVERLAIADTGSDLIWVQCSPCQNCFPQDTPLFEPLKSSTFKAATCD 148
Query: 213 SAVCDRLENAGCHAGR---CRYEVSYGDGSYTKGTLALETLTIGRT------VVKNVAIG 263
S C + + G+ C Y SYGD S+T G + ETL+ G T + G
Sbjct: 149 SQPCTSVPPSQRQCGKVGQCIYSYSYGDKSFTVGVVGTETLSFGSTGDAQTVSFPSSIFG 208
Query: 264 CGHKNQGMFVGA---AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE 320
CG N F + GL+GLGGG +SLV QLG Q G FSYCL+ + S+ L FG E
Sbjct: 209 CGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQIGYKFSYCLLPFSSNSTSKLKFGSE 268
Query: 321 ALPV--GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
A+ G PL+ P PSFY++ L + +G +P T D +++D+GT
Sbjct: 269 AIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQKVVP--------TGRTDGNIIIDSGT 320
Query: 379 AVTRLPTPAYEAFRDAF-----VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFS 433
+T L Y F + V +LP F C+ + + +P ++F F+
Sbjct: 321 VLTYLEQTFYNNFVASLQEVLSVESAQDLPFP-----FKFCF---PYRDMTIPVIAFQFT 372
Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGP 492
G V P N LI + D C A PS SG+SI GN+ Q Q+ +D V F P
Sbjct: 373 GASVALQP-KNLLIKLQDRNMLCLAVVPSSLSGISIFGNVAQFDFQVVYDLEGKKVSFAP 431
Query: 493 NVC 495
C
Sbjct: 432 TDC 434
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 179 bits (455), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 126/371 (33%), Positives = 192/371 (51%), Gaps = 29/371 (7%)
Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG 208
+D +G Y + + +G+PP + ++ D+GS ++W QC PC++C + P F PA S++FS
Sbjct: 83 LDNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSK 142
Query: 209 VSCSSAVCDRLENA--GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH 266
+ C+S++C L + C+A C Y YG G +T G LA ETL +G VA GC
Sbjct: 143 LPCASSLCQFLTSPYLTCNATGCVYYYPYGMG-FTAGYLATETLHVGGASFPGVAFGCST 201
Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG- 325
+N G+ ++G++GLG +SLV Q+G G FSYCL S ++FG A G
Sbjct: 202 EN-GVGNSSSGIVGLGRSPLSLVSQVG---VGRFSYCLRSDADAGDSPILFGSLAKVTGG 257
Query: 326 -AAWVPLVRNPRAP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVV----MDTGT 378
PL+ NP P S+YYV L+G+ VG +P++ F T+ G+V +D+GT
Sbjct: 258 NVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGAGLVGGTIVDSGT 317
Query: 379 AVTRLPTPAYEAFRDAFVAQ--TGNL-PRASGVSI-FDTCYNLS---GFVSVRVPTVSFY 431
+T L Y + AF++Q T NL +G FD C++ + G V VPT+
Sbjct: 318 TLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGGGSGVPVPTLVLR 377
Query: 432 FSGGPVLTLPASNF--LIPVDD---AGTFCFAFAPSPSGL--SIIGNIQQEGIQISFDGA 484
F+GG + ++ ++ VD A C P+ L SIIGN+ Q + + +D
Sbjct: 378 FAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPASEKLSISIIGNVMQMDLHVLYDLD 437
Query: 485 NGFVGFGPNVC 495
G F P C
Sbjct: 438 GGMFSFAPADC 448
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 122/385 (31%), Positives = 184/385 (47%), Gaps = 21/385 (5%)
Query: 122 LVRRLSGGGADAAKHEVQDFGTDVVS--GMDQG--SGEYFVRIGVGSPPRSQYMVIDSGS 177
L+RR++ A + T VS D G EY + + +G+PP+ + +D+GS
Sbjct: 53 LMRRMALRSKARAPRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGS 112
Query: 178 DIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR----CRYEV 233
D+VW QCQPC+ C+ QS P +D + S++F+ SC S C + + C +
Sbjct: 113 DLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAFSY 172
Query: 234 SYGDGSYTKGTLALETLT-IGRTVVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQ 291
SYGD S T G L +ET++ + V V GCG N G+F G+ G G G +SL Q
Sbjct: 173 SYGDKSATIGFLDVETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQ 232
Query: 292 LGGQTGGAFSYCLVSRGTGSSGSLVFGREA--LPVGAAWV---PLVRNPRAPSFYYVGLS 346
L G FS+C + +++F A G V PL++NP P+FYY+ L
Sbjct: 233 L---KVGNFSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLK 289
Query: 347 GLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
G+ VG R+P+ E F L + G G ++D+GTA T LP Y D F A S
Sbjct: 290 GITVGSTRLPVPESAFAL-KNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPS 348
Query: 407 GVSIFDTCYNLSGF-VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG 465
+ C++ + VP + +F G + LP N++ D G A
Sbjct: 349 NETGPLLCFSAPPLGKAPHVPKLVLHFEGA-TMHLPRENYVFEAKDGGNCSICLAIIEGE 407
Query: 466 LSIIGNIQQEGIQISFDGANGFVGF 490
++IIGN QQ+ + + +D N + F
Sbjct: 408 MTIIGNFQQQNMHVLYDLKNSKLSF 432
>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 460
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 145/431 (33%), Positives = 198/431 (45%), Gaps = 45/431 (10%)
Query: 94 NTTNNMHYHRHQHSF-HARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQ- 151
N+ N+ SF A+ RD RV L SG G G + SG
Sbjct: 41 NSNNDAAPSSSWTSFIAAQTSRDTSRVLYLSSLASGFG-----------GAPLASGRQLL 89
Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC 211
+ Y VR +G+PP+ + +D+ +D WV C C C + P F+PA SA+F V C
Sbjct: 90 HTPTYLVRASLGTPPQRLLLAVDTSNDAAWVPCAGCHGC-PTTAPSFNPASSATFRPVPC 148
Query: 212 SSAVCDRLENAGCHA-----GRCRYEVSYGDGSYTKGTLALETLTIGRT--VVKNVAIGC 264
+ C + N C + C + +SYGD S TL+ + L + V+K GC
Sbjct: 149 GAPPCSQAPNPSCTSLAKSKNSCGFSLSYGDSSL-DATLSQDNLAVTANGGVIKGYTFGC 207
Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS---RGTGSSGSLVFGREA 321
K+ G A GLLGLG G + V Q G G FSYCL S SGSL GR+
Sbjct: 208 LTKSNGSAAPAQGLLGLGRGPLGFVAQTKGIYEGTFSYCLPSYYRSAANFSGSLTLGRKG 267
Query: 322 LPVGAAW--VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
P PL+ +P PS YYV ++G+ +G +PI G V+D+GT
Sbjct: 268 QPAPEKMKTTPLLASPHRPSLYYVAMTGVRIGKKSVPIPPSALAFDAATGAGTVLDSGTM 327
Query: 380 VTRLPTPAYEAFRDAFVAQ-TGNL----PRASGVSI-----FDTCYNLSGFVSVRVPTVS 429
RL PAY A RD + G+L + VS+ FDTCYN+S +V P V+
Sbjct: 328 FARLAQPAYAAVRDEVRRRVAGSLRRRGGGGASVSVSSLGGFDTCYNVS---TVAWPAVT 384
Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-----SGLSIIGNIQQEGIQISFDGA 484
F GG + LP N +I T C A A SP + L++IG++QQ+ ++ FD
Sbjct: 385 LVFGGGMEVRLPEENVVIRSTYGSTSCLAMAASPADGVNAALNVIGSLQQQNHRVLFDVP 444
Query: 485 NGFVGFGPNVC 495
N VGF C
Sbjct: 445 NARVGFARERC 455
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 126/361 (34%), Positives = 175/361 (48%), Gaps = 24/361 (6%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYK-QSDPVFDPADSASFSGVSCSS 213
Y R +G+PP++ + ID +D WV C C C S P FDP S+++ V C +
Sbjct: 99 SYVARARLGTPPQTLLVAIDPSNDAAWVPCSACLGCAPGASSPSFDPTQSSTYRPVRCGA 158
Query: 214 AVCDRLENA--GCHAG---RCRYEVSYGDGSYTKGTLALETLTI----GRTVVKN-VAIG 263
C ++ A C AG C + +SY S L + L++ G V + G
Sbjct: 159 PQCAQVPPATPSCPAGPGASCAFNLSYAS-STLHAVLGQDALSLSDSNGAAVPDDHYTFG 217
Query: 264 CGH--KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGRE 320
C G V GL+G G G +S + Q G FSYCL S + + SG+L G
Sbjct: 218 CLRVVTGSGGSVPPQGLVGFGRGPLSFLSQTKATYGSIFSYCLPSYKSSNFSGTLRLGPA 277
Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL-TQMGDDGVVMDTGTA 379
P PL+ NP PS YYV + G+ V G +PI L G G ++D GT
Sbjct: 278 GQPRRIKTTPLLSNPHRPSLYYVAMVGVRVNGKAVPIPASALALDAATGRGGTIVDAGTM 337
Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
TRL PAY A R+AF + + P A + FDTCY ++G S VP V+F F+GG +T
Sbjct: 338 FTRLSPPAYAALRNAF-RRGVSAPAAPALGGFDTCYYVNGTKS--VPAVAFVFAGGARVT 394
Query: 440 LPASNFLIPVDDAGTFCFAFAPSPS-----GLSIIGNIQQEGIQISFDGANGFVGFGPNV 494
LP N +I G C A A PS GL+++ ++QQ+ ++ FD NG VGF +
Sbjct: 395 LPEENVVISSTSGGVACLAMAAGPSDGVNAGLNVLASMQQQNHRVVFDVGNGRVGFSREL 454
Query: 495 C 495
C
Sbjct: 455 C 455
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 122/357 (34%), Positives = 175/357 (49%), Gaps = 24/357 (6%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
G++ + I +G+PP ++D+GSD++W+QC PC CYKQ P+FDP S++++ +SC S
Sbjct: 66 GQHLMEIYIGTPPIKITGLVDTGSDLIWIQCAPCLGCYKQIKPMFDPLKSSTYNNISCDS 125
Query: 214 AVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAI-----GCGHK 267
+C +L+ C RC Y YGD S TKG LA +T T K V++ GCGH
Sbjct: 126 PLCHKLDTGVCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKPVSLSRFLFGCGHN 185
Query: 268 NQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGG-AFSYCLVSRGT--GSSGSLVFGR--EA 321
N G F GL+GLGGG SL+ Q+G GG FS CLV T S + FG+ +
Sbjct: 186 NTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSRMSFGKGSQV 245
Query: 322 LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
L G PLV + S Y+V L G+ V P++ + +G +++D+GT
Sbjct: 246 LGNGVVTTPLVPREKDTS-YFVTLLGISVEDTYFPMN------STIGKANMLVDSGTPPI 298
Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
LP Y+ + P S+ CY +++ PT++F+F G VL
Sbjct: 299 LLPQQLYDKVFAEVRNKVALKPITDDPSLGTQLCYRTQ--TNLKGPTLTFHFVGANVLLT 356
Query: 441 PASNFLIPVDDA-GTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
P F+ P G FC A + + S + GN Q I FD V F P C
Sbjct: 357 PIQTFIPPTPQTKGIFCLAIYNRTNSDPGVYGNFAQSNYLIGFDLDRQVVSFKPTDC 413
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 118/349 (33%), Positives = 162/349 (46%), Gaps = 31/349 (8%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
Y +++ VG+PP VID+GS+I W QC PC CYKQ+ P+FDP+ S++F
Sbjct: 380 YLMKLQVGTPPFEIEAVIDTGSEITWTQCLPCVHCYKQNAPIFDPSKSSTF--------- 430
Query: 216 CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQG 270
+ CH C YEV Y D +YTKGTLA +T+TI T V+ IGCG N
Sbjct: 431 ----KEKRCHDHSCPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAETIIGCGRNNSW 486
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVP 330
G +GL G +SL+ Q+GG+ G SYC GT + FG A+ G V
Sbjct: 487 FRPSFEGFVGLNWGPLSLITQMGGEYPGLMSYCFAGNGT---SKINFGTNAIVGGGGVVS 543
Query: 331 ---LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPA 387
V R P FYY+ L + VG RI E L + +V+D+GT +T P
Sbjct: 544 TTMFVTTAR-PGFYYLNLDAVSVGDTRI---ETLGTPFHALEGNIVIDSGTTLTYFPESY 599
Query: 388 YEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
R A +P A CY + + P ++ +FSGG L L N +
Sbjct: 600 CNLVRQAVEHVVPAVPAADPTGNDLLCYYSN--TTEIFPVITMHFSGGADLVLDKYNMFM 657
Query: 448 PVDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
G FC A +P+ +I GN Q + +D ++ V F P C
Sbjct: 658 ESYSGGLFCLAIICNNPTQEAIFGNRAQNNFLVGYDSSSLLVSFKPTNC 706
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 101/336 (30%), Positives = 153/336 (45%), Gaps = 49/336 (14%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
EY +++ +G+PP V+D+GS+++W QC PC CY Q P+FDP+ S++F C
Sbjct: 64 EYLMKLQIGTPPFEVEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFKETRC--- 120
Query: 215 VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQ 269
N H+ C Y++ Y D SYT+GTLA ET+TI T V+ IGC N
Sbjct: 121 ------NTPDHS--CPYKLVYDDKSYTQGTLATETVTIHSTSGVPFVMPETIIGCSRNNS 172
Query: 270 G--MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA 327
G ++G++GL GS+SL+ Q+GG G G + +F + A
Sbjct: 173 GSGFRPSSSGIVGLSRGSLSLISQMGG----------AYPGDGVVSTTMFAKTA------ 216
Query: 328 WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPA 387
YY+ L + VG RI F + +V+D+GT +T P
Sbjct: 217 ---------KRGQYYLNLDAVSVGDTRIETVGTPFHAL---NGNIVIDSGTPLTYFPVSY 264
Query: 388 YEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
R A V + R S D S + + P ++ +FSGG L L N +
Sbjct: 265 CNLVRKA-VERVVTADRVVDPSRNDMLCYYSNTIEI-FPVITVHFSGGADLVLDKYNMYM 322
Query: 448 PVDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFD 482
++ G FC A +P+ ++I GN Q + +D
Sbjct: 323 ELNRGGVFCLAIICNNPTQVAIFGNRAQNNFLVGYD 358
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 121/342 (35%), Positives = 174/342 (50%), Gaps = 34/342 (9%)
Query: 169 QYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENAGC 224
Q +V+DS SD+ WVQC PC C+ Q D +DP+ S + + SCSS C L GC
Sbjct: 29 QTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTALGPYANGC 88
Query: 225 HAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQGMF-VGAAGLLGLG 282
+C+Y V Y DGS T G + LT+ V GC H QG F AAG++ LG
Sbjct: 89 ANNQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFKFGCSHAEQGSFDARAAGIMALG 148
Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA----WVPLVRNPRAP 338
GG SL+ Q + G AFSYC+ + + SG G +P A+ P+VR +A
Sbjct: 149 GGPESLLSQTASRYGNAFSYCIPATAS-DSGFFTLG---VPRRASSRYVVTPMVRFRQAA 204
Query: 339 SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ 398
+FY V L + VGG R+ ++ +F G V+D+ TA+TRLP AY+A R AF +
Sbjct: 205 TFYGVLLRTITVGGQRLGVAPAVFAA------GSVLDSRTAITRLPPTAYQALRAAFRSS 258
Query: 399 TGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF--- 455
A DTCY+ +G V++R+P +S F N ++P+D +G
Sbjct: 259 MTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFD---------RNAVLPLDPSGILFND 309
Query: 456 CFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C AF + ++G++QQ+ I++ +D G VGF C
Sbjct: 310 CLAFTSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 351
>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
Length = 287
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 112/281 (39%), Positives = 159/281 (56%), Gaps = 16/281 (5%)
Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTIG-RTVVKNVAIGCGHKNQGMFVGAAGLLGL 281
GC G C Y V YGDGSYT G A++TLT+ +K GCG +N+G+F AAGLLGL
Sbjct: 15 GCSGGHCLYGVQYGDGSYTIGFFAMDTLTLSSHDAIKGFRFGCGERNEGLFGEAAGLLGL 74
Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV---PLVRNPRAP 338
G G SL Q + GG F++C +R +G +G L FG + P +A + P++ + P
Sbjct: 75 GRGKTSLPVQTYDKYGGVFAHCFPARSSG-TGYLEFGPGSSPAVSAKLSTTPMLID-TGP 132
Query: 339 SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ 398
+FYYVG++G+ VGG +PI + +F G ++D+GT +TRLP AY + R AF A
Sbjct: 133 TFYYVGMTGIRVGGKLLPIPQSVFAAA-----GTIVDSGTVITRLPPAAYSSLRSAFAAS 187
Query: 399 TG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFC 456
RA +S+ DTCY+L+G V +PTVS F GG L + AS +I C
Sbjct: 188 MAARGYKRAPALSLLDTCYDLTGASEVAIPTVSLLFQGGVSLDVDASG-IIYAASVSQAC 246
Query: 457 FAFAPSPSG--LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
FA + + ++I+GN Q + + +D A+ VGF P C
Sbjct: 247 LGFAGNEAADDVAIVGNTQLKTFGVVYDIASKVVGFCPGAC 287
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 107/307 (34%), Positives = 168/307 (54%), Gaps = 28/307 (9%)
Query: 107 SFHARMQRDVKRVATLVRRLSGGGADAAKHEV--------QDFGTDVVSGMDQGSGEYFV 158
SF + D RV TL RL+ K + + + G GSG Y+V
Sbjct: 61 SFSDVLAWDDARVKTLNSRLTRKDTRFPKSVLTKKDIRFPKSVSVPLNPGASIGSGNYYV 120
Query: 159 RIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSCSSAVCD 217
++G GSP R M++D+GS + W+QC+PC C+ Q+DP+FDP+ S ++ +SC+S+ C
Sbjct: 121 KVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSSQCS 180
Query: 218 RLENAGCH-------AGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQ 269
L +A + + C Y SYGD SY+ G L+ + LT+ + + GCG +
Sbjct: 181 SLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQTLPGFVYGCGQDSD 240
Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAW- 328
G+F AAG+LGLG +S++GQ+ + G AFSYCL +RG G G L G+ +L G+A+
Sbjct: 241 GLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGG--GFLSIGKASL-AGSAYK 297
Query: 329 -VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPA 387
P+ +P PS Y++ L+ + VGG + ++ +R+ ++D+GT +TRLP
Sbjct: 298 FTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVP------TIIDSGTVITRLPMSV 351
Query: 388 YEAFRDA 394
Y F+ A
Sbjct: 352 YTPFQQA 358
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 135/438 (30%), Positives = 212/438 (48%), Gaps = 40/438 (9%)
Query: 73 SSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGAD 132
+S + ++L L+HRD S N+ + R +++F R + RV +
Sbjct: 28 ASPDPGFSLNLIHRDSPLSPLYNPNHTDFDRLRNAF----SRSISRVNVFKTK------- 76
Query: 133 AAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYK 192
++ F D+V GEYF+++ +G+P ++ D+GSD+ WVQC PC CY+
Sbjct: 77 --AVDINSFQNDLV----PNGGEYFMKMSIGTPLVEVIVIADTGSDLTWVQCLPCDPCYR 130
Query: 193 QSDPVFDPADSASFSGVSCSSAVCDRLE--NAGC--HAGRCRYEVSYGDGSYTKGTLALE 248
Q P+FDP+ S+S+ + C S C+ L+ C C Y SYGD SYT G LA E
Sbjct: 131 QKSPLFDPSRSSSYRHMLCGSRFCNALDVSEQACTMDTNICEYHYSYGDKSYTNGNLATE 190
Query: 249 TLTIGRTVVKNVAI-----GCGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
TIG T + V + GCG N G F +G++GLGGG++SLV QL G FSY
Sbjct: 191 KFTIGSTSSRPVHLSPIVFGCGTGNGGTFDELGSGIVGLGGGALSLVSQLSSIIKGKFSY 250
Query: 303 CLV--SRGTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPIS 358
CLV S + + + FG +++ G V PLV + ++YYV L + VG R+P +
Sbjct: 251 CLVPLSEQSNVTSKIKFGTDSVISGPQVVSTPLVSK-QPDTYYYVTLEAISVGNKRLPYT 309
Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNL 417
L + V++D+GT +T L + + + + +T R S +F C+
Sbjct: 310 NGLLN-GNVEKGNVIIDSGTTLTFLDSEFFTEL-ERVLEETVKAERVSDPRGLFSVCFRS 367
Query: 418 SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGI 477
+G + +P ++ +F+ V P + F+ D CF S + + I GN+ Q
Sbjct: 368 AG--DIDLPVIAVHFNDADVKLQPLNTFVKA--DEDLLCFTMI-SSNQIGIFGNLAQMDF 422
Query: 478 QISFDGANGFVGFGPNVC 495
+ +D V F P C
Sbjct: 423 LVGYDLEKRTVSFKPTDC 440
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 136/453 (30%), Positives = 207/453 (45%), Gaps = 56/453 (12%)
Query: 85 HRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTD 144
H D ++ T +++ H+ A +QR R+A++ RL +++++V
Sbjct: 25 HLDIARVDASDTESLNLTDHELLRRA-IQRSRDRLASIAPRLL---PTSSRNKVVVAEAP 80
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
V+S GEY V++G+G+P ID+ SD++W QCQPC +CYKQ DPVF+P S
Sbjct: 81 VLSA----GGEYLVKLGLGTPQHCFTAAIDTASDLIWTQCQPCVKCYKQLDPVFNPVAST 136
Query: 205 SFSGVSCSSAVCDRLENAGC-------HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV 257
S++ V C+S CD L+ C C+Y SYG + T+G LA++ L IG V
Sbjct: 137 SYAVVPCNSDTCDELDTHRCARDGDSDDEDACQYTYSYGGNATTRGILAVDRLAIGDDVF 196
Query: 258 KNVAIGCGHKNQ-GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLV 316
+ V GC + G +G++GLG G++SLV QL + F YCL + S+G LV
Sbjct: 197 RGVVFGCSSSSVGGPPPQVSGVVGLGRGALSLVSQLSVRR---FMYCLPPPVSRSAGRLV 253
Query: 317 FGREALPV-----GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPI-SEDLFRLTQMGDD 370
G +A VP+ R PS+YY+ L G+ +G + S + T G
Sbjct: 254 LGADAAATVRNASERVVVPMSTGSRYPSYYYLNLDGISIGDRAMSFRSRNRMNATTPGTA 313
Query: 371 ------------------------GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
G+++D + +T L YE D + LPR S
Sbjct: 314 AGAPASPVSGSGDGDGSGTGPDAYGMIIDIASTITFLEESLYEEMVDDLEEEI-RLPRGS 372
Query: 407 GVSI-FDTCYNLSGFVS---VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS 462
G + D C+ L V V P VS F G L L + +G C +
Sbjct: 373 GSDLGLDLCFILPEGVPMSRVYAPPVSLAFEGV-WLRLDKEQMFVEDRASGMMCLMVGKT 431
Query: 463 PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
G+SI+GN QQ+ +Q+ ++ G + F C
Sbjct: 432 -DGVSILGNYQQQNMQVMYNLRRGRITFIKTAC 463
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 178 bits (452), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 126/412 (30%), Positives = 195/412 (47%), Gaps = 25/412 (6%)
Query: 103 RHQHSFHARMQ---RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVR 159
RH + A ++ R +R+A + +LSG A D V+ + +
Sbjct: 51 RHDNWRRAALESNARQARRLAKALDKLSGAAPGAPAAAATDIAAADVTISPYAHQGHSLT 110
Query: 160 IGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCD-- 217
+GVG+PP+ +++D GSD++W QC KQ +PVFD A S+SFS + C S +C+
Sbjct: 111 VGVGTPPQPSKVILDLGSDLLWTQCSLVGPTAKQLEPVFDAARSSSFSVLPCDSKLCEAG 170
Query: 218 RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG--RTVVKNVAIGCGHKNQGMFVGA 275
N C +C YE YG + T G LA ET T G V N+ GCG G A
Sbjct: 171 TFTNKTCTDRKCAYENDYGIMTAT-GVLATETFTFGAHHGVSANLTFGCGKLANGTIAEA 229
Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA------LPVGAAWV 329
+G+LGL G +S++ QL FSYCL + ++FG A +
Sbjct: 230 SGILGLSPGPLSMLKQLAITK---FSYCLTPFADRKTSPVMFGAMADLGKYKTTGKVQTI 286
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
PL++NP +YYV + G+ VG R+ + ++ + G G V+D+ T + L PA+
Sbjct: 287 PLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLDSATTLAYLVEPAFT 346
Query: 390 AFRDAFVAQTGNLPRAS-GVSIFDTCYNLSGFVS---VRVPTVSFYFSGGPVLTLPASNF 445
+ A V + LP A+ V + C+ L +S V+VP + +F G ++LP N+
Sbjct: 347 ELKKA-VMEGIKLPVANRSVDDYPVCFELPRGMSMEGVQVPPLVLHFDGDAEMSLPRDNY 405
Query: 446 LIPVDDAGTFCFAF--APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
G C A AP ++IGN+QQ+ + + +D N + P C
Sbjct: 406 FQE-PSPGMMCLAVMQAPFEGAPNVIGNVQQQNMHVLYDVGNRKFSYAPTKC 456
>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 178 bits (452), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 135/420 (32%), Positives = 192/420 (45%), Gaps = 26/420 (6%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
L++ H S + M + + A+ Q ++ ++LV R S +A+ +Q
Sbjct: 35 LKVFHIFSQCSPFKPSKPMSWEESVLNLQAKDQARMQYFSSLVARKSVVPIASARQIIQ- 93
Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
S Y V+ G+PP++ + +D+ SD W+ C C C S P F P
Sbjct: 94 ------------SPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGC-STSKP-FAP 139
Query: 201 ADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNV 260
S SF VSC S C ++ N C C + +YG S ++ +TLT+ +
Sbjct: 140 IKSTSFRNVSCGSPHCKQVPNPTCGGSACAFNFTYGSSSIA-ASVVQDTLTLATDPIPGY 198
Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGR 319
GC +K G GLLGLG G +SL+ Q FSYCL S + SGSL G
Sbjct: 199 TFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKSINFSGSLRLGP 258
Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
P + PL+RNPR S YYV L + VG + I G + D+GT
Sbjct: 259 VYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTV 318
Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
TRL P Y A R+ F + G + + FDTCYN V + VPT++F FSG V T
Sbjct: 319 FTRLAEPVYTAVRNEFRRRVGPKLPVTTLGGFDTCYN----VPIVVPTITFLFSGMNV-T 373
Query: 440 LPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
LP N +I T C A A +P S L++I N+QQ+ ++ FD N +G +C
Sbjct: 374 LPPDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIGIARELC 433
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 142/446 (31%), Positives = 206/446 (46%), Gaps = 58/446 (13%)
Query: 68 SSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLS 127
S + T + ++ +L+H++ +S +NN H ++ + SF+ ++ + + R S
Sbjct: 19 SQTPTEAYNKGFSFKLIHKNSPNSPFYKSNNFHKNKLR-SFYQVPKKSFVQKSPYTRVTS 77
Query: 128 GGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC 187
+G+Y +++ +GSPP Y ++D+GSD+VW QC PC
Sbjct: 78 N------------------------NGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCTPC 113
Query: 188 SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLAL 247
CY+Q P+F+P S ++S + C S C + C Y SY D S TKG LA
Sbjct: 114 GGCYRQKSPMFEPLRSKTYSPIPCESEQCSFFGYSCSPQKMCAYSYSYADSSVTKGVLAR 173
Query: 248 ETLTIGRT-----VVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGA-F 300
E +T T VV ++ GCGH N G F G++G+GGG +SLV Q+G G F
Sbjct: 174 EAITFSSTDGDPVVVGDIIFGCGHSNSGTFNENDMGIIGMGGGPLSLVSQIGTLYGSKRF 233
Query: 301 SYCLVSRGTG--SSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGG--MR 354
S CLV T +SG++ FG E+ G V PL S Y V L G+ VG +R
Sbjct: 234 SQCLVPFHTDAHTSGTINFGEESDVSGEGVVTTPLASEEGQTS-YLVTLEGISVGDTFVR 292
Query: 355 IPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDT 413
SE L + +++D+GT T +P YE + Q+ LP +
Sbjct: 293 FNSSETLSK------GNIMIDSGTPATYIPQEFYERLVEELKVQSSLLPIEDDPDLGTQL 346
Query: 414 CY----NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSII 469
CY NL G P ++ +F G V LP F+ P D G FCFA A S G I
Sbjct: 347 CYRSETNLEG------PILTAHFEGADVQLLPIQTFIPPKD--GVFCFAMAGSTDGDYIF 398
Query: 470 GNIQQEGIQISFDGANGFVGFGPNVC 495
GN Q I + FD + F P C
Sbjct: 399 GNFAQSNILMGFDLDRKTISFKPTDC 424
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 116/364 (31%), Positives = 172/364 (47%), Gaps = 27/364 (7%)
Query: 152 GSGEYFVRIGVGSP-PRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
G EY + G+G+P P+ + +D+GSD+VW QC+PC C+ Q P FD + S + GV
Sbjct: 88 GYTEYLIHFGIGTPRPQQVALEVDTGSDVVWTQCRPCFDCFTQPLPRFDTSASDTVHGVL 147
Query: 211 CSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIGCG 265
C+ +C L C G C Y+V+YGD S T G LA ++ T G+ V ++ GCG
Sbjct: 148 CTDPICRALRPHACFLGGCTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVFGCG 207
Query: 266 HKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG---REA 321
N G F G+ G G G +SL QLG + FSYC + S + G +
Sbjct: 208 QYNTGNFHSNETGIAGFGRGPLSLPRQLGVSS---FSYCFTTIFESKSTPVFLGGAPADG 264
Query: 322 LPVGAAWVPLVRN---PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
L A P++ P P +YY+ L G+ VG R+ + E F + G G ++D+GT
Sbjct: 265 LRAHATG-PILSTPFLPNHPEYYYLSLKGITVGKTRLAVPESAFVVKADGSGGTIIDSGT 323
Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRAS-------GVSIFDTCYNLSGFVSVRVPTVSFY 431
A+T P + + +AFVAQ LP S + F T ++ V VP ++ +
Sbjct: 324 AITAFPRAVFRSLWEAFVAQV-PLPHTSYNDTGEPTLQCFST-ESVPDASKVPVPKMTLH 381
Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFG 491
G LP N++ D+ C ++IGN QQ+ + I D A +
Sbjct: 382 LEGAD-WELPRENYMAEYPDSDQLCVVVLAGDDDRTMIGNFQQQNMHIVHDLAGNKLVIE 440
Query: 492 PNVC 495
P C
Sbjct: 441 PAQC 444
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 178 bits (451), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 120/357 (33%), Positives = 176/357 (49%), Gaps = 25/357 (7%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
G Y + + +G+PP Y + D+GSD+ W C PC+ CYKQ +P+FDP S ++ +SC S
Sbjct: 70 GHYLMELSIGTPPFKIYGIADTGSDLTWTSCVPCNNCYKQRNPMFDPQKSTTYRNISCDS 129
Query: 214 AVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTI----GRTV-VKNVAIGCGHK 267
+C +L+ C RC Y +Y + T+G LA ET+T+ G++V +K + GCGH
Sbjct: 130 KLCHKLDTGVCSPQKRCNYTYAYASAAITRGVLAQETITLSSTKGKSVPLKGIVFGCGHN 189
Query: 268 NQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGA-FSYCLVSRGTGSSGS--LVFGREALP 323
N G F G++GLGGG +SL+ Q+G GG FS CLV T S S + FG+ +
Sbjct: 190 NTGGFNDHEMGIIGLGGGPVSLISQMGSSFGGKRFSQCLVPFHTDVSVSSKMSFGKGSKV 249
Query: 324 VGAAWV--PLV-RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG-VVMDTGTA 379
G V PLV + + P Y+V L G+ V + + +Q + G + +D+GT
Sbjct: 250 SGKGVVSTPLVAKQDKTP--YFVTLLGISVENTYLHFNGS----SQNVEKGNMFLDSGTP 303
Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
T LPT Y+ ++ P + CY ++R P ++ +F G V
Sbjct: 304 PTILPTQLYDQVVAQVRSEVAMKPVTDDPDLGPQLCYRTKN--NLRGPVLTAHFEGADVK 361
Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
P F+ P D G FC F + S + GN Q I FD V F P C
Sbjct: 362 LSPTQTFISPKD--GVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPKDC 416
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 177 bits (450), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 104/311 (33%), Positives = 153/311 (49%), Gaps = 20/311 (6%)
Query: 102 HRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIG 161
H + + ++Q + +A R++ + A V D T + SGEY V +
Sbjct: 35 HVDAGTSYTKLQLLSRAIARSKARVAALQSAAVLPPVVDPITAARVLVTASSGEYLVDLA 94
Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
+G+PP ++D+GSD++W QC PC C Q P FD SA++ + C S+ C L +
Sbjct: 95 IGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSS 154
Query: 222 AGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-----RTVVKNVAIGCGHKNQGMFVGAA 276
C C Y+ YGD + T G LA ET T G + N+A GCG N G ++
Sbjct: 155 PSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSS 214
Query: 277 GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA----------LPVGA 326
G++G G G +SLV QLG FSYCL S + + L FG A PV +
Sbjct: 215 GMVGFGRGPLSLVSQLGPSR---FSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQS 271
Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
P V NP P+ Y++ L + +G +PI +F + G GV++D+GT++T L
Sbjct: 272 --TPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQD 329
Query: 387 AYEAFRDAFVA 397
AYEA R V+
Sbjct: 330 AYEAVRRGLVS 340
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 122/385 (31%), Positives = 183/385 (47%), Gaps = 21/385 (5%)
Query: 122 LVRRLSGGGADAAKHEVQDFGTDVVS--GMDQG--SGEYFVRIGVGSPPRSQYMVIDSGS 177
L+RR++ A + T VS D G EY + + +G+PP+ + +D+GS
Sbjct: 53 LMRRMALRSKARAPRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGS 112
Query: 178 DIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR----CRYEV 233
+VW QCQPC+ C+ QS P +D + S++F+ SC S C + + C Y
Sbjct: 113 VLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAYSY 172
Query: 234 SYGDGSYTKGTLALETLT-IGRTVVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQ 291
SYGD S T G L +ET++ + V V GCG N G+F G+ G G G +SL Q
Sbjct: 173 SYGDKSATIGFLDVETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQ 232
Query: 292 LGGQTGGAFSYCLVSRGTGSSGSLVFGREA--LPVGAAWV---PLVRNPRAPSFYYVGLS 346
L G FS+C + +++F A G V PL++NP P+FYY+ L
Sbjct: 233 L---KVGNFSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLK 289
Query: 347 GLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
G+ VG R+P+ E F L + G G ++D+GTA T LP Y D F A S
Sbjct: 290 GITVGSTRLPVPESAFAL-KNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPS 348
Query: 407 GVSIFDTCYNLSGF-VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG 465
+ C++ + VP + +F G + LP N++ D G A
Sbjct: 349 NETGPLLCFSAPPLGKAPHVPKLVLHFEGA-TMHLPRENYVFEAKDGGNCSICLAIIEGE 407
Query: 466 LSIIGNIQQEGIQISFDGANGFVGF 490
++IIGN QQ+ + + +D N + F
Sbjct: 408 MTIIGNFQQQNMHVLYDLKNSKLSF 432
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 125/389 (32%), Positives = 190/389 (48%), Gaps = 30/389 (7%)
Query: 122 LVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVW 181
L+R+ + + + +QD V + ++ G+Y + + +G+PP +D+GSD++W
Sbjct: 37 LIRK----SSHLSSNNIQDI---VQAPINAYIGQYLMELYIGTPPIKISGTVDTGSDLIW 89
Query: 182 VQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSY 240
VQC PC CY Q +P+FDP S++++ +SC S +C + C RC Y Y D S
Sbjct: 90 VQCVPCLGCYNQINPMFDPLKSSTYTNISCDSPLCYKPYIGECSPEKRCDYTYGYADSSL 149
Query: 241 TKGTLALETLTI----GRTV-VKNVAIGCGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGG 294
TKG LA ET+T+ G+ + ++ + GCGH N G F GL+GLGGG SLV Q+G
Sbjct: 150 TKGVLAQETVTLTSNTGKPISLQGILFGCGHNNTGNFNDHEMGLIGLGGGPTSLVSQIGP 209
Query: 295 QTGG-AFSYCLVSRGTG--SSGSLVFGR--EALPVGAAWVPLVRNPRAPSFYYVGLSGLG 349
GG FS CLV T S + FG+ E L G PLV+ + + YYV L G+
Sbjct: 210 LFGGKKFSQCLVPFLTDITISSQMSFGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGIS 269
Query: 350 VGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS 409
V +P++ + + +++D+GT LP Y+ + P S
Sbjct: 270 VEDTYLPMNSTIEK------GNMLVDSGTPPNILPQQLYDRVYVEVKNKVPLEPITDDPS 323
Query: 410 I-FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFAP-SPSGL 466
+ CY +++ PT++++F G +L P F+ P + G FC A + S
Sbjct: 324 LGPQLCYRTQ--TNLKGPTLTYHFEGANLLLTPIQTFIPPTPETKGVFCLAITNCANSDP 381
Query: 467 SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
I GN Q I FD V F P C
Sbjct: 382 GIYGNFAQTNYLIGFDLDRQIVSFKPTDC 410
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 122/362 (33%), Positives = 175/362 (48%), Gaps = 39/362 (10%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS- 213
EY V + +G+PP+ + +D+GSD++W QC+PC C+ Q P FD + S++ + + C S
Sbjct: 34 EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSCFDQPLPYFDTSRSSTNALLPCEST 93
Query: 214 --------AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLT-IGRTVVKNVAIGC 264
VC +L C Y SYGD S T G LA + T + T + V GC
Sbjct: 94 QCKLDPTVTVCVKLNQT---VQTCAYYTSYGDNSVTIGLLAADKFTFVAGTSLPGVTFGC 150
Query: 265 GHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP 323
G N G+F G+ G G G +SL QL G FS+C + TG+ S V LP
Sbjct: 151 GLNNTGVFNSNETGIAGFGRGPLSLPSQL---KVGNFSHCFTTI-TGAIPSTVLLD--LP 204
Query: 324 V--------GAAWVPLV---RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
PL+ +N P+ YY+ L G+ VG R+P+ E F LT G G
Sbjct: 205 ADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTN-GTGGT 263
Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFY 431
++D+GT++T LP Y+ RD F AQ LP G + TC++ VP + +
Sbjct: 264 IIDSGTSITSLPPQVYQVVRDEFAAQI-KLPVVPGNATGHYTCFSAPSQAKPDVPKLVLH 322
Query: 432 FSGGPVLTLPASNFLIPV-DDAGT--FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
F G + LP N++ V DDAG C A +IIGN QQ+ + + +D N +
Sbjct: 323 FEGA-TMDLPRENYVFEVPDDAGNSIICLAINKG-DETTIIGNFQQQNMHVLYDLQNNML 380
Query: 489 GF 490
F
Sbjct: 381 SF 382
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 132/446 (29%), Positives = 213/446 (47%), Gaps = 52/446 (11%)
Query: 69 SSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSG 128
+S +S+ ++EL+HRD S + QH+ R+ A +R +S
Sbjct: 19 TSTSSAHRKNLSVELIHRDSPHSP--------LYNPQHTVSDRLN------AAFLRSISR 64
Query: 129 GGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS 188
+ K TD+ SG+ GEYF+ I +G+PP + D+GSD+ WVQC+PC
Sbjct: 65 SRRFSTK-------TDLQSGLISNGGEYFMSISIGTPPSKFLAIADTGSDLTWVQCKPCQ 117
Query: 189 QCYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENAGCHAGR--CRYEVSYGDGSYTKGT 244
QCYKQ+ P+FD S+++ SC S C+ L GC R C+Y SYGD S+TKG
Sbjct: 118 QCYKQNTPLFDKKKSSTYKTESCDSITCNALSEHEEGCDESRNACKYRYSYGDESFTKGE 177
Query: 245 LALETLTIGRTVVKNV-----AIGCGHKNQGMFVGAAGLLGLGGGS-MSLVGQLGGQTGG 298
+A ET++I + V A GCG+ N G F + GG +SLV QLG G
Sbjct: 178 VATETISIDSSSGSPVSFPGTAFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGK 237
Query: 299 AFSYCLVSRGTGSSGSLVFG--------REALPVGAAWVPLV-RNPRAPSFYYVGLSGLG 349
FSYCL ++G+ V + + PL+ ++P ++Y++ L +
Sbjct: 238 KFSYCLSHTSATTNGTSVINLGTNSMTSKPSKDSAILTTPLIQKDPE--TYYFLTLEAIT 295
Query: 350 VGGMRIPIS----EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA 405
VG ++P + L R ++ + +++D+GT +T L + Y+ F R
Sbjct: 296 VGKTKLPYTGGGGYSLNRKSKKTGN-IIIDSGTTLTLLDSGFYDDFGAVVEESVTGAKRV 354
Query: 406 SGVS-IFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS 464
S I C+ SG + +PT++ +F+G V P ++F+ +D C + P+ +
Sbjct: 355 SDPQGILTHCFK-SGDKEIGLPTITMHFTGADVKLSPINSFVKLSEDI--VCLSMIPT-T 410
Query: 465 GLSIIGNIQQEGIQISFDGANGFVGF 490
++I GN+ Q + +D V F
Sbjct: 411 EVAIYGNMVQMDFLVGYDLETKTVSF 436
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 102/306 (33%), Positives = 159/306 (51%), Gaps = 21/306 (6%)
Query: 59 ELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKR 118
++ +R + S E+R + + S + +++HR H+ V+
Sbjct: 52 QILQRKQQLGSLGCLHPESRQEKGAIMLEMKDRSYCSKKKVNWHRKLHNQLTLDDLHVRS 111
Query: 119 VATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSD 178
+ +R++ + EV + SG++ + Y V + +G + ++ID+GSD
Sbjct: 112 MQNRLRKM----VSSHSVEVSQIQIPLASGVNFQTLNYIVTMELGG--QDMTVIIDTGSD 165
Query: 179 IVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLE----NAG-CHAG--RCRY 231
+ WVQC+PC CY Q PVF P+ S+S+ + C+S+ C L+ NAG C + C Y
Sbjct: 166 LTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSY 225
Query: 232 EVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQ 291
V+YGDGSYT G L E L+ G V N GCG N+G+F G +GL+GLG ++SL+ Q
Sbjct: 226 AVNYGDGSYTNGELGAEHLSFGGISVSNFVFGCGKNNKGLFGGVSGLMGLGRSNLSLISQ 285
Query: 292 LGGQTGGAFSYCLVSRGTGSSGSLVFGREA------LPVGAAWVPLVRNPRAPSFYYVGL 345
GG FSYCL G+SGSL G E+ P+ A+ +V NP+ +FY + L
Sbjct: 286 TNSTFGGVFSYCLPPTDAGASGSLAMGNESSVFKNLTPI--AYTRMVPNPQLSNFYMLNL 343
Query: 346 SGLGVG 351
+G+ VG
Sbjct: 344 TGIDVG 349
>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
Length = 434
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 134/420 (31%), Positives = 191/420 (45%), Gaps = 26/420 (6%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
L++ H S + M + + A+ Q ++ ++LV R S +A+ +Q
Sbjct: 35 LKVFHIFSQCSPFKPSKPMSWEESVLNLQAKDQARMQYFSSLVARKSVVPIASARQIIQ- 93
Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
S Y V+ G+PP++ + +D+ SD W+ C C C S P F P
Sbjct: 94 ------------SPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGC-STSKP-FAP 139
Query: 201 ADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNV 260
S SF VSC S C ++ N C C + +YG S ++ +TLT+ +
Sbjct: 140 IKSTSFRNVSCGSPHCKQVPNPTCGGSACAFNFTYGSSSIA-ASVVQDTLTLAADPIPGY 198
Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGR 319
GC +K G GLLGLG G +SL+ Q FSYCL S + SGSL G
Sbjct: 199 TFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKSINFSGSLRLGP 258
Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
P + PL+RNPR S YYV L + VG + I G + D+GT
Sbjct: 259 VYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTV 318
Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
TRL P Y A R+ F + G + + FDTCYN V + VPT++F FSG V
Sbjct: 319 FTRLAEPVYTAVRNEFRRRVGPKLPVTTLGGFDTCYN----VPIVVPTITFLFSGMNV-A 373
Query: 440 LPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
LP N +I T C A A +P S L++I N+QQ+ ++ FD N +G +C
Sbjct: 374 LPPDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIGIARELC 433
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 113/348 (32%), Positives = 170/348 (48%), Gaps = 17/348 (4%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
EY + + +G+PP+ + +D+GS +VW QCQPC+ C+ QS P +D + S++F+ SC S
Sbjct: 34 EYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDST 93
Query: 215 VCDRLENAGCHAGR----CRYEVSYGDGSYTKGTLALETLT-IGRTVVKNVAIGCGHKNQ 269
C + + C Y SYGD S T G L +ET++ + V V GCG N
Sbjct: 94 QCKLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVAGASVPGVVFGCGLNNT 153
Query: 270 GMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA--LPVGA 326
G+F G+ G G G +SL QL G FS+C + +++F A G
Sbjct: 154 GIFRSNETGIAGFGRGPLSLPSQL---KVGNFSHCFTAVSGRKPSTVLFDLPADLYKNGR 210
Query: 327 AWV---PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
V PL++NP P+FYY+ L G+ VG R+P+ E F L + G G ++D+GTA T L
Sbjct: 211 GTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFAL-KNGTGGTIIDSGTAFTSL 269
Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGF-VSVRVPTVSFYFSGGPVLTLPA 442
P Y D F A S + C++ + VP + +F G + LP
Sbjct: 270 PPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFEGA-TMHLPR 328
Query: 443 SNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
N++ D G A ++IIGN QQ+ + + +D N + F
Sbjct: 329 ENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSF 376
>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 441
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 140/433 (32%), Positives = 203/433 (46%), Gaps = 27/433 (6%)
Query: 68 SSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLS 127
S+ N ++D + L++ H S + + + + A+ Q ++ +++LV R S
Sbjct: 29 SNCNPAADRSS-TLQVFHIFSPCSPFRPSKPLSWADNVLQMQAKDQARLQFLSSLVARRS 87
Query: 128 GGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC 187
+A+ +Q S + VR +G+P ++ + +D+ +D W+ C C
Sbjct: 88 FVPIASARQLIQ-------------SPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGC 134
Query: 188 SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLAL 247
C S VF S+SF + C S C+++ N C C + ++YG S L
Sbjct: 135 IGC--PSTTVFSSDKSSSFRPLPCQSPQCNQVPNPSCSGSACGFNLTYG-SSTVAADLVQ 191
Query: 248 ETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS- 306
+ LT+ V + GC K G V GLLGLG G +SL+GQ FSYCL S
Sbjct: 192 DNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSF 251
Query: 307 RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
+ SGSL G A P+ + PL+RNPR S YYV L + VG + I
Sbjct: 252 KSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNS 311
Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
G V+D+GT TRL PAY A RD F + G S + FDTCY V + P
Sbjct: 312 ATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYT----VPIISP 367
Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFD 482
T++F F+G V TLP NFLI T C A A +P S L++I ++QQ+ +I FD
Sbjct: 368 TITFMFAGMNV-TLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFD 426
Query: 483 GANGFVGFGPNVC 495
N VG C
Sbjct: 427 IPNSRVGVARESC 439
>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
Length = 451
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 134/438 (30%), Positives = 190/438 (43%), Gaps = 24/438 (5%)
Query: 74 SDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVR----RLSGG 129
S R + L+ R S S + + A+ D R ATL
Sbjct: 20 STALRSSTLLLARSPQSVSLSAVPGTPVTAWAATLAAQTASDAARAATLATGPRDPPPAS 79
Query: 130 GADAAKHEVQDFGTDVVSGMDQGS-GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS 188
DAAK + + G S Y R +G+P ++ + ID +D WV C +
Sbjct: 80 AVDAAKKGPRRSFVPIAPGRQLLSIPSYVARARLGTPAQALLVAIDPSNDAAWVPCA--A 137
Query: 189 QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG---RCRYEVSYGDGSYTKGTL 245
P FDP S+++ V C + C + C G C + +SY ++ + L
Sbjct: 138 CAGCARAPSFDPTRSSTYRPVRCGAPQCSQAPAPSCPGGLGSSCAFNLSYAASTF-QALL 196
Query: 246 ALETLTIGRTV--VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC 303
+ L + V V GC H G V GL+G G G +S Q G FSYC
Sbjct: 197 GQDALALHDDVDAVAAYTFGCLHVVTGGSVPPQGLVGFGRGPLSFPSQTKDVYGSVFSYC 256
Query: 304 LVS-RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
L S + + SG+L G P PL+ NP PS YYV + G+ VGG +P+
Sbjct: 257 LPSYKSSNFSGTLRLGPAGQPKRIKTTPLLSNPHRPSLYYVNMVGIRVGGRPVPVPASAL 316
Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS 422
G ++D GT TRL P Y A RD F ++ P A + FDTCYN V+
Sbjct: 317 AFDPTSGRGTIVDAGTMFTRLSAPVYAAVRDVFRSRV-RAPVAGPLGGFDTCYN----VT 371
Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-----SGLSIIGNIQQEGI 477
+ VPTV+F F G +TLP N +I G C A A P + L+++ ++QQ+
Sbjct: 372 ISVPTVTFSFDGRVSVTLPEENVVIRSSSGGIACLAMAAGPPDGVDAALNVLASMQQQNH 431
Query: 478 QISFDGANGFVGFGPNVC 495
++ FD ANG VGF +C
Sbjct: 432 RVLFDVANGRVGFSRELC 449
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 126/416 (30%), Positives = 195/416 (46%), Gaps = 57/416 (13%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
++++L+HRD S + R +F +R V RV R + +D + +
Sbjct: 32 FSVDLIHRDSPHSPFFDPSKTQAERLTDAF----RRSVSRVGRF--RPTAMTSDGIQSRI 85
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
+GEY + + +G+PP ++D+GSD+ W QC+PC+ CYKQ P+F
Sbjct: 86 V-----------PSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLF 134
Query: 199 DPADSASFSGVSCSSAVCDRL-ENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV 256
DP +S+++ SC ++ C L ++ C +C + SY DGS+T G LA ETLT+ T
Sbjct: 135 DPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTA 194
Query: 257 VKNV-----AIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG 310
K V A GCGH + G+F ++G++GLGGG +SL+ QL G FSYCL+ T
Sbjct: 195 GKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTD 254
Query: 311 SSGS--LVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
SS S + FG G V PL R P Y G S + T+
Sbjct: 255 SSISSRINFGASGRVSGYGTVSTPL----RLP---YKGYS----------------KKTE 291
Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
+ + +++D+GT T LP Y + IF CYN + + P
Sbjct: 292 VEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTA--EINAP 349
Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
++ +F V P + F+ +D CF AP+ S + ++GN+ Q + FD
Sbjct: 350 IITAHFKDANVELQPLNTFMRMQEDL--VCFTVAPT-SDIGVLGNLAQVNFLVGFD 402
Score = 39.7 bits (91), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 34/134 (25%), Positives = 58/134 (43%), Gaps = 6/134 (4%)
Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLSGFV 421
+ ++ + +++D+GT T LP Y ++ VA + R + I CYN +
Sbjct: 411 KKAEVEEGNIIVDSGTTYTYLPLEFYVKLEES-VAHSIKGKRVRDPNGISSLCYNTT-VD 468
Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISF 481
+ P ++ +F V P + FL +D CF P+ S + I+GN+ Q + F
Sbjct: 469 QIDAPIITAHFKDANVELQPWNTFLRMQEDL--VCFTVLPT-SDIGILGNLAQVNFLVGF 525
Query: 482 DGANGFVGFGPNVC 495
D V F C
Sbjct: 526 DLRKKRVSFKAADC 539
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 136/433 (31%), Positives = 208/433 (48%), Gaps = 63/433 (14%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+++EL+HRD SS + +++QH +A +R + R A H
Sbjct: 28 FSVELIHRD---SSKSPLYQPTQNKYQHIVNA-ARRSINR---------------ANHFY 68
Query: 139 QDFGTDVV-SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
+ T+ S + GEY + VG+PP Y + D+GSDIVW+QC+PC +CY Q+ P
Sbjct: 69 KTALTNTPQSTVIPDHGEYLMTYSVGTPPFKLYGIADTGSDIVWLQCEPCKECYNQTTPK 128
Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV 257
F P+ S+++ + CSS +C S G+ + TL LE+ T
Sbjct: 129 FKPSKSSTYKNIPCSSDLCK----------------SGQQGNLSVDTLTLESSTGHPISF 172
Query: 258 KNVAIGCGHKNQGMFVGA-AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--GTGSSGS 314
IGCG N F GA +G++GLGGG SL+ QLG FSYCL+ + ++
Sbjct: 173 PKTVIGCGTDNTVSFEGASSGIVGLGGGPASLITQLGSSIDAKFSYCLLPNPVESNTTSK 232
Query: 315 LVFGREALPVGAAWV--PLVRNPRAP-SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
L FG A+ G V P+V+ + P FYY+ L VG RI + G +G
Sbjct: 233 LNFGDTAVVSGDGVVSTPIVK--KDPIVFYYLTLEAFSVGNKRIEFEGS----SNGGHEG 286
Query: 372 -VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLS--GFVSVRVPT 427
+++D+GT +T +PT Y A V + L R + + +F+ CY+++ G+ P
Sbjct: 287 NIIIDSGTTLTVIPTDVYNNLESA-VLELVKLKRVNDPTRLFNLCYSVTSDGY---DFPI 342
Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS----PSG-LSIIGNIQQEGIQISFD 482
++ +F G V P S F+ D G C AFA + PS +SI GN+ Q+ + + +D
Sbjct: 343 ITTHFKGADVKLHPISTFVDVAD--GIVCLAFATTSAFIPSDVVSIFGNLAQQNLLVGYD 400
Query: 483 GANGFVGFGPNVC 495
V F P C
Sbjct: 401 LQQKIVSFKPTDC 413
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 124/353 (35%), Positives = 176/353 (49%), Gaps = 40/353 (11%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
EY V + +G+PP+ + +D+GSD++W QCQPC C+ Q+ P FDP+ S++ S SC S
Sbjct: 88 EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDST 147
Query: 215 VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFV- 273
+C L A R G G+ G VA GCG N G+F
Sbjct: 148 LCQGLPVASLP--RSDKFTFVGAGASVPG----------------VAFGCGLFNNGVFKS 189
Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV--------G 325
G+ G G G +SL QL G FS+C + TG+ S V LP
Sbjct: 190 NETGIAGFGRGPLSLPSQL---KVGNFSHCFTTI-TGAIPSTVL--LDLPADLFSNGQGA 243
Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
PL++NP P+FYY+ L G+ VG R+P+ E F L + G G ++D+GTA+T LPT
Sbjct: 244 VQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFAL-KNGTGGTIIDSGTAMTSLPT 302
Query: 386 PAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR--VPTVSFYFSGGPVLTLPAS 443
Y RDAF AQ LP SG + D + LS + + VP + +F G + LP
Sbjct: 303 RVYRLVRDAFAAQV-KLPVVSG-NTTDPYFCLSAPLRAKPYVPKLVLHFEGA-TMDLPRE 359
Query: 444 NFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
N++ V+DAG+ A G ++ IGN QQ+ + + +D N + F P C
Sbjct: 360 NYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQC 412
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 175 bits (444), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 114/312 (36%), Positives = 163/312 (52%), Gaps = 31/312 (9%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
EY V + +G+PP+ + +D+GSD++W QCQPC C+ Q+ P FDP+ S++ S SC S
Sbjct: 81 EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDST 140
Query: 215 VCDRLENAGCHAGR------CRYEVSYGDGSYTKGTLALETLTI--GRTVVKNVAIGCGH 266
+C L A C + + C Y SYGD S T G L ++ T V VA GCG
Sbjct: 141 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGL 200
Query: 267 KNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF-------- 317
N G+F G+ G G G +SL QL G FS+C + +++
Sbjct: 201 FNNGVFKSNETGIAGFGRGPLSLPSQL---KVGNFSHCFTAVNGLKPSTVLLDLPADLYK 257
Query: 318 -GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
GR A+ PL++NP P+FYY+ L G+ VG R+P+ E F L + G G ++D+
Sbjct: 258 SGRGAV----QSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFAL-KNGTGGTIIDS 312
Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR--VPTVSFYFSG 434
GTA+T LPT Y RDAF AQ LP SG + D + LS + + VP + +F G
Sbjct: 313 GTAMTSLPTRVYRLVRDAFAAQV-KLPVVSG-NTTDPYFCLSAPLRAKPYVPKLVLHFEG 370
Query: 435 GPVLTLPASNFL 446
+ LP N++
Sbjct: 371 A-TMDLPRENYV 381
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 116/360 (32%), Positives = 183/360 (50%), Gaps = 32/360 (8%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--SQCYKQSDPVFDPADSASFSGVS 210
+G Y +RI +G+P + + D+GSD+ WVQC PC ++C+ Q+ P++DP +S++F+ +
Sbjct: 93 NGNYLMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFTLLP 152
Query: 211 CSSAVCDRL---ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV---KNVAIGC 264
C S C +L + G C Y +YGD SY+ G L+ +++ + + + GC
Sbjct: 153 CDSQPCTQLPYSQYVCSDYGDCIYAYTYGDNSYSYGGLSSDSIRLMLLQLHYNSKICFGC 212
Query: 265 GHKNQGMFVG-----AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR 319
G +N+ F G++GLG G +SLV QLG + G FSYCL+ + S+ L FG
Sbjct: 213 GFQNK--FTADKSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCLLPFSSNSNSKLKFGE 270
Query: 320 EALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
A+ G V PL+ P P FYY+ L G+ VG + T D +++D+G
Sbjct: 271 AAIVQGNGVVSTPLIIKPDLP-FYYLNLEGITVGAKTVK--------TGQTDGNIIIDSG 321
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGP 436
+ +T L Y F + V +T + + FD C+ +S P V F+F+GG
Sbjct: 322 STLTYLEESFYNEFV-SLVKETVAVEEDQYIPYPFDFCFTYKEGMSTP-PDVVFHFTGGD 379
Query: 437 VLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
V+ P N L+ ++D C PS G++I GN+ Q + +D G V F P C
Sbjct: 380 VVLKPM-NTLVLIED-NLICSTVVPSHFDGIAIFGNLGQIDFHVGYDIQGGKVSFAPTDC 437
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 174 bits (442), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 139/423 (32%), Positives = 193/423 (45%), Gaps = 46/423 (10%)
Query: 105 QHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGS 164
Q +QR + R +V R GG AD A V V G GEY V++G G+
Sbjct: 47 QELIRRAVQRSLDRPG-IVARSGGGAADEAGKAVASEAPLV-----PGGGEYLVKLGTGT 100
Query: 165 PPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC 224
P ID+ SD+VW+QCQPC CY+Q DPVF+P S+S++ V C+S C +L+ C
Sbjct: 101 PQHFFSAAIDTASDLVWMQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRC 160
Query: 225 HA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQ-GMFVGAAGLLG 280
H G C+Y Y TKGTLA++ L IG V V GC + G A+GL+G
Sbjct: 161 HEDDDGACQYTYKYSGHGVTKGTLAIDKLAIGGDVFHAVVFGCSDSSVGGPAAQASGLVG 220
Query: 281 LGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV----GAAWVPLVRNPR 336
LG G +SLV QL F YCL + +SG LV G A V V + + R
Sbjct: 221 LGRGPLSLVSQLSVHR---FMYCLPPPMSRTSGKLVLGAGADAVRNMSDRVTVTMSSSTR 277
Query: 337 APSFYYVGLSGLGVGGMRIPISED-------------------LFRLTQMGDDGVVMDTG 377
PS+YY+ L GL VG + + + G+++D
Sbjct: 278 YPSYYYLNLDGLAVGDQTPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVA 337
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRAS-GVSI-FDTCYNLS---GFVSVRVPTVSFYF 432
+ ++ L T Y+ D + LPRA+ + + D C+ L G V VPTVS F
Sbjct: 338 STISFLETSLYDELADDLEEEI-RLPRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSF 396
Query: 433 SGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGP 492
G L L V D C + SG+SI+GN Q + +++ F+ G + F
Sbjct: 397 DGR-WLELDRDRLF--VTDGRMMCLMIGRT-SGVSILGNFQLQNMRVLFNLRRGKITFAK 452
Query: 493 NVC 495
C
Sbjct: 453 ASC 455
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 174 bits (441), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 126/356 (35%), Positives = 180/356 (50%), Gaps = 22/356 (6%)
Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSAS 205
G S EY + G+P Q +VID+GSD+ W+QC+PCS QC Q DP+FDP+ S++
Sbjct: 104 GTSVKSLEYVATVSFGTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKDPLFDPSHSST 163
Query: 206 FSGVSCSSAVCDRLE----NAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGR-TVVKN 259
+S V C+S C +L +GC G+ C + +SY DG+ T G + LT+ +VK+
Sbjct: 164 YSAVPCASGECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGVYGKDKLTLAPGAIVKD 223
Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR 319
GCGH + GLLGLG S SL Q GG FSYCL + + G L FG
Sbjct: 224 FYFGCGHSKSSLPGLFDGLLGLGRLSESLGAQY--GGGGGFSYCLPAVNS-KPGFLAFGA 280
Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
P G + P+ R P P+F V L+G+ VGG ++ + F G+++D+GT
Sbjct: 281 GRNPSGFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAF------SGGMIVDSGTV 334
Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
VT L + Y A R AF G DTCY+L+G+ +V VP ++ FSGG +
Sbjct: 335 VTVLQSTVYRALRAAFREAMKAYRLVHG--DLDTCYDLTGYKNVVVPKIALTFSGGATIN 392
Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L N ++ G FA ++GN+ Q ++ FD + GF C
Sbjct: 393 LDVPNGIL---VNGCLAFAETGKDGTAGVLGNVNQRTFEVLFDTSASKFGFRAKAC 445
>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
gi|255638149|gb|ACU19388.1| unknown [Glycine max]
Length = 437
Score = 174 bits (441), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 136/439 (30%), Positives = 197/439 (44%), Gaps = 28/439 (6%)
Query: 64 HNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLV 123
HN + D L++ H S + M + A+ Q ++ +++LV
Sbjct: 19 HNPKCDATHQHDHDGSTLQVFHVFSPCSPFRPSKPMSWEESVLKLQAKDQARMQYLSSLV 78
Query: 124 RRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQ 183
R S + + Q S Y V+ +G+P ++ + +D+ +D WV
Sbjct: 79 ARRSIVPIASGRQITQ-------------SPTYIVKAKIGTPAQTLLLAMDTSNDASWVP 125
Query: 184 CQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKG 243
C C C + F PA S +F V C ++ C ++ N C C + +YG S
Sbjct: 126 CTACVGCSTTTP--FAPAKSTTFKKVGCGASQCKQVRNPTCDGSACAFNFTYGTSS-VAA 182
Query: 244 TLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC 303
+L +T+T+ V A GC K G V GLLGLG G +SL+ Q FSYC
Sbjct: 183 SLVQDTVTLATDPVPAYAFGCIQKVTGSSVPPQGLLGLGRGPLSLLAQTQKLYQSTFSYC 242
Query: 304 LVSRGTGS-SGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
L S T + SGSL G A P + PL++NPR S YYV L + VG + I +
Sbjct: 243 LPSFKTLNFSGSLRLGPVAQPKRIKFTPLLKNPRRSSLYYVNLVAIRVGRRIVDIPPEAL 302
Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGF 420
G V D+GT TRL PAY A R+ F + + + S+ FDTCY
Sbjct: 303 AFNANTGAGTVFDSGTVFTRLVEPAYNAVRNEFRRRIAVHKKLTVTSLGGFDTCYT---- 358
Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEG 476
+ PT++F FSG V TLP N LI C A AP+P S L++I N+QQ+
Sbjct: 359 APIVAPTITFMFSGMNV-TLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQN 417
Query: 477 IQISFDGANGFVGFGPNVC 495
++ FD N +G +C
Sbjct: 418 HRVLFDVPNSRLGVARELC 436
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 108/348 (31%), Positives = 163/348 (46%), Gaps = 29/348 (8%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
Y +++ VG+PP ID+GSD++W QC PC+ CY Q P+FDP++S++F
Sbjct: 61 YLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTF--------- 111
Query: 216 CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQG 270
+ C+ C Y++ Y D +Y+KGTLA ET+TI T V+ IGCGH +
Sbjct: 112 ----KEKRCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSW 167
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV- 329
+G++GL G SL+ Q+GG+ G SYC S+GT + FG A+ G V
Sbjct: 168 FKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGT---SKINFGTNAIVAGDGVVS 224
Query: 330 -PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
+ P YY+ L + VG + F + +++D+GT +T P
Sbjct: 225 TTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALE---GNIIIDSGTTLTYFPVSYC 281
Query: 389 EAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIP 448
R+A + A CY + + + P ++ +FSGG L L N I
Sbjct: 282 NLVREAVDHYVTAVRTADPTGNDMLCY-YTDTIDI-FPVITMHFSGGADLVLDKYNMYIE 339
Query: 449 VDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
GTFC A +P +I GN Q + +D ++ V F P C
Sbjct: 340 TITRGTFCLAIICNNPPQDAIFGNRAQNNFLVGYDSSSLLVSFSPTNC 387
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 120/370 (32%), Positives = 184/370 (49%), Gaps = 33/370 (8%)
Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ----CYKQSDPVFDPADSAS 205
DQG + + +G+G+PP+ + +++D+GSD++W QC+ S S PV+DP +S++
Sbjct: 88 DQG---HSLTVGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESST 144
Query: 206 FSGVSCSSAVCD--RLENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIG--RTVVKNV 260
F+ + CS +C + C + RC YE YG + G LA ET T G R V +
Sbjct: 145 FAFLPCSDRLCQEGQFSFKNCTSKNRCVYEDVYGSAAAV-GVLASETFTFGARRAVSLRL 203
Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG-- 318
GCG + G +GA G+LGL S+SL+ QL Q FSYCL + L+FG
Sbjct: 204 GFGCGALSAGSLIGATGILGLSPESLSLITQLKIQR---FSYCLTPFADKKTSPLLFGAM 260
Query: 319 ----REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
R +V NP +YYV L G+ +G R+ + + G G ++
Sbjct: 261 ADLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIV 320
Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS-GVSIFDTCYNL------SGFVSVRVPT 427
D+G+ V L A+EA ++A V LP A+ V ++ C+ L + +V+VP
Sbjct: 321 DSGSTVAYLVEAAFEAVKEA-VMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPP 379
Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFDGAN 485
+ +F GG + LP N+ AG C A + SG+SIIGN+QQ+ + + FD +
Sbjct: 380 LVLHFDGGAAMVLPRDNYFQE-PRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQH 438
Query: 486 GFVGFGPNVC 495
F P C
Sbjct: 439 HKFSFAPTQC 448
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 129/412 (31%), Positives = 192/412 (46%), Gaps = 31/412 (7%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
L+++H S + + + A+ ++ + +LV R S + + +Q
Sbjct: 31 LQVIHVFSPCSPFRPSKPLSWEESVLQMQAKDTTRLQFLDSLVARKSIVPIASGRQIIQ- 89
Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
S Y VR +G+PP++ + +D+ +D W+ C C C + +F P
Sbjct: 90 ------------SPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGC---ASTLFAP 134
Query: 201 ADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNV 260
S +F VSC++ C ++ N GC + ++YG S L +T+T+ V +
Sbjct: 135 EKSTTFKNVSCAAPECKQVPNPGCGVSSRNFNLTYGSSSIA-ANLVQDTITLATDPVPSY 193
Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGR 319
GC K G GLLGLG G +SL+ Q FSYCL S + SGSL G
Sbjct: 194 TFGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGP 253
Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
A P + PL++NPR S YYV L + VG + I G + D+GT
Sbjct: 254 VAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTV 313
Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSVRVPTVSFYFSGGPV 437
TRL P Y A RD F + G P+ + S+ FDTCYN V + VPT++F F+G V
Sbjct: 314 FTRLVAPVYVAVRDEFRRRVG--PKLTVTSLGGFDTCYN----VPIVVPTITFIFTGMNV 367
Query: 438 LTLPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGAN 485
TLP N LI T C A A +P S L++I N+QQ+ ++ +D N
Sbjct: 368 -TLPQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPN 418
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 118/364 (32%), Positives = 179/364 (49%), Gaps = 29/364 (7%)
Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG 208
+ G EY + + +G+PP + D+GSD+ W QC+PC C+ Q P++D S+SFS
Sbjct: 76 LRSGQAEYLMELAIGTPPVPFIALADTGSDLTWTQCKPCKLCFGQDTPIYDTTTSSSFSP 135
Query: 209 VSCSSAVCDRLENAGCH--AGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH 266
+ CSSA C + ++ C + CRY +Y DG+Y+ + V +A GCG
Sbjct: 136 LPCSSATCLPIWSSRCSTPSATCRYRYAYDDGAYSPECAGIS--------VGGIAFGCGV 187
Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVG 325
N G+ + G +GLG GS+SLV QLG G FSYCL T S + FG A
Sbjct: 188 DNGGLSYNSTGTVGLGRGSLSLVAQLG---VGKFSYCLTDFFNTSLSSPVFFGSLAELAA 244
Query: 326 AAW---------VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT-QMGDDGVVMD 375
++ PLV++P PS YYV L G+ +G R+PI F L G G+++D
Sbjct: 245 SSASADAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPNGTFDLNDDDGSGGMIVD 304
Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCY--NLSGFVSV-RVPTVSFYF 432
+GT T L + D G P + S+ C+ +G + +P + +F
Sbjct: 305 SGTIFTILVETGFRVVVDHVAGVLGQ-PVVNASSLDRPCFPAPAAGVQELPDMPDMVLHF 363
Query: 433 SGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL-SIIGNIQQEGIQISFDGANGFVGFG 491
+GG + L N++ ++ +FC + S S++GN QQ+ IQ+ FD G + F
Sbjct: 364 AGGADMRLHRDNYMSFNEEESSFCLNIVGTESASGSVLGNFQQQNIQMLFDITVGQLSFM 423
Query: 492 PNVC 495
P C
Sbjct: 424 PTDC 427
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 108/348 (31%), Positives = 163/348 (46%), Gaps = 29/348 (8%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
Y +++ VG+PP ID+GSD++W QC PC+ CY Q P+FDP++S++F
Sbjct: 61 YLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTF--------- 111
Query: 216 CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQG 270
+ C+ C Y++ Y D +Y+KGTLA ET+TI T V+ IGCGH +
Sbjct: 112 ----KEKRCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSW 167
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV- 329
+G++GL G SL+ Q+GG+ G SYC S+GT + FG A+ G V
Sbjct: 168 FKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGT---SKINFGTNAIVAGDGVVS 224
Query: 330 -PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
+ P YY+ L + VG + F + +++D+GT +T P
Sbjct: 225 TTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALE---GNIIIDSGTTLTYFPVSYC 281
Query: 389 EAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIP 448
R+A + A CY + + + P ++ +FSGG L L N I
Sbjct: 282 NLVREAVDHYVTAVRTADPTGNDMLCY-YTDTIDI-FPVITMHFSGGADLVLDKYNMYIE 339
Query: 449 VDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
GTFC A +P +I GN Q + +D ++ V F P C
Sbjct: 340 TITRGTFCLAIICNNPPQDAIFGNRAQNNFLVGYDSSSLLVFFSPTNC 387
>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 438
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 121/353 (34%), Positives = 170/353 (48%), Gaps = 20/353 (5%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
S Y VR +G+PP++ + ID+ +D W+ C C C + +F P S +F VSC
Sbjct: 94 SPTYIVRAKIGTPPQTLLLAIDTSNDAAWIPCTACDGC---TSTLFAPEKSTTFKNVSCG 150
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
S C+++ + C C + ++YG S + +T+T+ + GC K G
Sbjct: 151 SPECNKVPSPSCGTSACTFNLTYGSSSIAANVVQ-DTVTLATDPIPGYTFGCVAKTTGPS 209
Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPL 331
GLLGLG G +SL+ Q FSYCL S + SGSL G A P+ + PL
Sbjct: 210 TPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVAQPIRIKYTPL 269
Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
++NPR S YYV L + VG + I G V D+GT TRL P Y A
Sbjct: 270 LKNPRRSSLYYVNLFAIRVGRKIVDIPPAALAFNAATGAGTVFDSGTVFTRLVAPVYTAV 329
Query: 392 RDAF-----VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
RD F +A NL + + FDTCY V + PT++F FSG V TLP N L
Sbjct: 330 RDEFRRRVAMAAKANL-TVTSLGGFDTCYT----VPIVAPTITFMFSGMNV-TLPQDNIL 383
Query: 447 IPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
I T C A A +P S L++I N+QQ+ ++ +D N +G +C
Sbjct: 384 IHSTAGSTSCLAMASAPDNVNSVLNVIANMQQQNHRVLYDVPNSRLGVARELC 436
>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 491
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 134/355 (37%), Positives = 169/355 (47%), Gaps = 48/355 (13%)
Query: 168 SQYMVIDSGSDIVWVQCQPC--SQCYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENAG 223
SQ M ID+ D+ W+QC PC QCY Q + FDP S++ + V C S C L G
Sbjct: 158 SQTMAIDTTEDVPWIQCLPCLIPQCYPQRNAFFDPRRSSTGAPVRCGSRACRTLGGYANG 217
Query: 224 CH----AGRCRYEVSYGDGSYTKGTLALETLTIG-RTVVKNVAIGCGHKNQGMFVG-AAG 277
C G C Y + Y D T GT +TLTI T N GC H +G F A+G
Sbjct: 218 CSKPNSTGDCLYRIEYSDHRLTLGTYMTDTLTISPSTTFLNFRFGCSHAVRGKFSAQASG 277
Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLV---SRGTGSSGSLVFGREALPVGA-AWVPLVR 333
+ LGGG SL+ Q G AFSYC+ + G S G V G + GA A PLVR
Sbjct: 278 TMSLGGGPQSLLSQTARAYGNAFSYCVPGPSAAGFLSIGGPVNGDDGGGSGAFATTPLVR 337
Query: 334 --NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
N P+ Y V L G+ V G R+ + +F G VMD+ +T+LP AY A
Sbjct: 338 SANVINPTIYVVRLQGIEVAGRRLNVPPVVF------SGGTVMDSSAVITQLPPTAYRAL 391
Query: 392 RDAF---------VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPA 442
R AF A TGNL DTC++ G V VPTVS F GG V+ L
Sbjct: 392 RLAFRNAMRAYKTRAPTGNL---------DTCFDFVGVSKVTVPTVSLVFDGGAVIELGL 442
Query: 443 SNFLIPVDDAGTFCFAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ L+ C AFAP + L IGN+QQ+ ++ +D A G VGF C
Sbjct: 443 LSVLL------DSCLAFAPMAADFALGFIGNVQQQTHEVLYDVAGGAVGFRHGAC 491
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 123/371 (33%), Positives = 193/371 (52%), Gaps = 39/371 (10%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSC 211
+GEY + + +G+PP S + D+GSD++W QC PC SQC++Q P+++P+ S +F+ + C
Sbjct: 83 AGEYLMTLAIGTPPVSYQAIADTGSDLIWTQCAPCSSQCFQQPTPLYNPSSSTTFAVLPC 142
Query: 212 SS-------AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG------RTVVK 258
+S A+ GC C Y ++YG G +T ET T G +T V
Sbjct: 143 NSSLSMCAAALAGTTPPPGC---TCMYNMTYGSG-WTSVYQGSETFTFGSSTPANQTGVP 198
Query: 259 NVAIGCGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLV 316
+A GC + + G A+GL+GLG GS+SLV QLG FSYCL + T S+ +L+
Sbjct: 199 GIAFGCSNASGGFNTSSASGLVGLGRGSLSLVSQLGVP---KFSYCLTPYQDTNSTSTLL 255
Query: 317 FGREAL---PVGAAWVPLVRNPR-AP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
G A G + P V +P AP ++YY+ L+G+ +G + I L G
Sbjct: 256 LGPSASLNDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSLKADGTG 315
Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI---FDTCYNLSGFVSV--RV 425
G ++D+GT +T L AY+ R A V+ LP G S D C+ L S +
Sbjct: 316 GFIIDSGTTITLLGNTAYQQVRAAVVSLV-TLPTTDGGSAATGLDLCFELPSSTSAPPTM 374
Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA-PSPSGLSIIGNIQQEGIQISFDGA 484
P+++ +F G ++ LPA ++++ D+ +C A + G+SI+GN QQ+ + I +D
Sbjct: 375 PSMTLHFDGADMV-LPADSYMM--LDSNLWCLAMQNQTDGGVSILGNYQQQNMHILYDVG 431
Query: 485 NGFVGFGPNVC 495
+ F P C
Sbjct: 432 QETLTFAPAKC 442
>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 460
Score = 172 bits (435), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 131/408 (32%), Positives = 194/408 (47%), Gaps = 43/408 (10%)
Query: 102 HRHQHSFHARMQRDVKRVATL---VRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFV 158
H S RD RV+ + + + G H F D G + V
Sbjct: 80 HSQPPSPQEIFGRDESRVSFINSKCNQYTSGNLKNHAHNNNLFDED---------GNFLV 130
Query: 159 RIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
+ G+P +++D+GS I W QC+ C C + S+ FD + S+++S SC +
Sbjct: 131 DVAFGTPXTEIXLILDTGSSITWTQCKACVNCLQDSNRYFDSSASSTYSFGSC---IPST 187
Query: 219 LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMF-VGAA 276
+EN Y ++YGD S + G +T+T+ + V + GCG N+G F G
Sbjct: 188 VEN--------NYNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGCGRNNKGDFGSGVD 239
Query: 277 GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA--WVPLVRN 334
G+LGLG G +S V Q + FSYCL S GSL+FG +A ++ + LV
Sbjct: 240 GMLGLGQGQLSTVSQTASKFNKVFSYCLPEE--DSIGSLLFGEKATSQSSSLKFTSLVNG 297
Query: 335 P---RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
P + +Y+V LS + VG R+ I +F G ++D+ T +TRLP AY A
Sbjct: 298 PGTLQESGYYFVNLSDISVGNERLNIPSSVF-----ASPGTIIDSRTVITRLPQRAYSAL 352
Query: 392 RDAFVAQTGNLPRASGV----SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
+ AF P ++G I DTCYNLSG V +P + +F GG + L +N ++
Sbjct: 353 KAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGGADVRLNGTN-IV 411
Query: 448 PVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
DA C AFA + S L+IIGN QQ + + +D +GFG N C
Sbjct: 412 WGSDASRLCLAFAGT-SELTIIGNRQQLSLTVLYDIQGRRIGFGGNGC 458
>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
vinifera]
Length = 437
Score = 172 bits (435), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 131/430 (30%), Positives = 203/430 (47%), Gaps = 27/430 (6%)
Query: 71 NTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGG 130
N + + L+++H S + + A+ + ++ +++LV R S
Sbjct: 29 NCETPDQGSTLQVLHVYSPCSPFRPKEPLSWEESVLQMQAKDKARLQFLSSLVARKSVVP 88
Query: 131 ADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC 190
+ + VQ+ Y VR +G+P ++ M +D+ SD+ W+ PC+ C
Sbjct: 89 IASGRQIVQN-------------PTYIVRAKIGTPAQTMLMAMDTSSDVAWI---PCNGC 132
Query: 191 YKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETL 250
S +F+ S ++ + C +A C ++ C G C + ++YG GS L+ +T+
Sbjct: 133 LGCSSTLFNSPASTTYKSLGCQAAQCKQVPKPTCGGGVCSFNLTYG-GSSLAANLSQDTI 191
Query: 251 TIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGT 309
T+ V + GC K G + A GLLGLG G +SL+ Q FSYCL S +
Sbjct: 192 TLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSL 251
Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
SGSL G P + PL++NPR PS Y+V L + VG + + F
Sbjct: 252 NFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTG 311
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVS 429
G + D+GT TRL TPAY A RDAF + G + + FDTCY V + PT++
Sbjct: 312 AGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTSLGGFDTCYT----VPIAAPTIT 367
Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGAN 485
F F+G V TLP N LI T C A A +P S L++I N+QQ+ ++ +D N
Sbjct: 368 FMFTGMNV-TLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPN 426
Query: 486 GFVGFGPNVC 495
+G +C
Sbjct: 427 SRLGVARELC 436
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 127/436 (29%), Positives = 203/436 (46%), Gaps = 49/436 (11%)
Query: 80 NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
++EL+HRD S N R +A R + R L LS
Sbjct: 27 SVELIHRDSPLSPLYNPKNTVTDR----LNAAFLRSISRSRRLNNILSQ----------- 71
Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
TD+ SG+ GE+F+ I +G+PP + + D+GSD+ WVQC+PC QCYK++ P+FD
Sbjct: 72 ---TDLQSGLIGADGEFFMSITIGTPPMKVFAIADTGSDLTWVQCKPCQQCYKENGPIFD 128
Query: 200 PADSASFSGVSCSSAVCDRLENA--GCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRT 255
S+++ C S C L ++ GC + C+Y SYGD S++KG +A ET++I
Sbjct: 129 KKKSSTYKSEPCDSRNCHALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSA 188
Query: 256 VVKNVA-----IGCGHKNQGMFVGAAGLLGLGGGS-MSLVGQLGGQTGGAFSYCLVSRGT 309
V+ GCG+ N G F + GG +SL+ QLG FSYCL +
Sbjct: 189 SGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSA 248
Query: 310 GSSGSLV--FGREALP------VGAAWVPLV-RNPRAPSFYYVGLSGLGVGGMRIPISED 360
++G+ V G ++P G PLV + PR ++YY+ L + VG +IP +
Sbjct: 249 TTNGTSVINLGTNSIPSSLSKDSGVISTPLVDKEPR--TYYYLTLEAISVGKKKIPYTGS 306
Query: 361 LFRLTQMG-----DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTC 414
+ G +++D+GT +T L + ++ F A R S + C
Sbjct: 307 SYNPNDGGIFSETSGNIIIDSGTTLTLLDSGFFDKFGAAVEELVTGAKRVSDPQGLLSHC 366
Query: 415 YNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQ 474
+ SG + +P ++ +F+G V P + F+ +D C + P+ + ++I GN Q
Sbjct: 367 FK-SGSAEIGLPEITVHFTGADVRLSPINAFVKVSEDM--VCLSMVPT-TEVAIYGNFAQ 422
Query: 475 EGIQISFDGANGFVGF 490
+ +D V F
Sbjct: 423 MDFLVGYDLETRTVSF 438
>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
sativus]
Length = 364
Score = 171 bits (434), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 126/348 (36%), Positives = 170/348 (48%), Gaps = 13/348 (3%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
S + VR +G+P ++ + +D+ +D W+ C C C S VF S+SF + C
Sbjct: 23 SPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGC--PSTTVFSSDKSSSFRPLPCQ 80
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
S C+++ N C C + ++YG S L + LT+ V + GC K G
Sbjct: 81 SPQCNQVPNPSCSGSACGFNLTYG-SSTVAADLVQDNLTLATDSVPSYTFGCIRKATGSS 139
Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPL 331
V GLLGLG G +SL+GQ FSYCL S + SGSL G A P+ + PL
Sbjct: 140 VPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPL 199
Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
+RNPR S YYV L + VG + I G V+D+GT TRL PAY A
Sbjct: 200 LRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAV 259
Query: 392 RDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDD 451
RD F + G S + FDTCY V + PT++F F+G V TLP NFLI
Sbjct: 260 RDEFRRRVGRNVTVSSLGGFDTCYT----VPIISPTITFMFAGMNV-TLPPDNFLIHSTS 314
Query: 452 AGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
T C A A +P S L++I ++QQ+ +I FD N VG C
Sbjct: 315 GSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVARESC 362
>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 506
Score = 171 bits (434), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 142/431 (32%), Positives = 191/431 (44%), Gaps = 60/431 (13%)
Query: 107 SFHARMQRDVKRVATLVRRLSGGGA---DAAKHEVQDFGTDVVSG-----------MDQG 152
+ A +Q D R + R+LSG A DA + Q T V S D
Sbjct: 94 TLSATLQWDEHRAGHIQRKLSGNAAPMDDAGEETPQS--TQVTSSPAANVNVGKSSTDSA 151
Query: 153 SGEYFVRIGVGS------PPRSQYMVIDSGSDIVWVQCQPCSQ--CYKQSDPVFDPADSA 204
+ V G P +Q MV+D+ SD+ WVQC PC Q CY QSD ++DP S
Sbjct: 152 FEQGIVPAATGPGGQKKLPGVAQSMVVDTASDVPWVQCAPCPQPQCYAQSDVLYDPTKSI 211
Query: 205 SFSGVSCSSAVCDRLEN--AGC----HAGRCRYEVSYGDGSYTKGTLALETLTIG---RT 255
+ CSS C L GC + G C+Y V Y DGS T GT + LT+ +
Sbjct: 212 LSAPFPCSSPQCRSLGRYANGCTGAGNTGTCQYRVLYPDGSGTSGTYVSDLLTLNADPKG 271
Query: 256 VVKNVAIGCGHK--NQGMFVG-AAGLLGLGGGSMSLVGQLGG--QTGGAFSYCLVSRGTG 310
V GC H G F AG + LG G+ SL Q G G FSYCL G+
Sbjct: 272 AVSKFQFGCSHALLRPGSFNNKTAGFMALGRGAQSLSSQTKGTFSKGNVFSYCLPPTGS- 330
Query: 311 SSGSLVFGREALPVGAA----WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
G L G +P AA P++++ AP Y V L G+ V G R+P+ +F
Sbjct: 331 HKGFLSLG---VPQHAASRYAVTPMLKSKMAPMIYMVRLIGIDVAGQRLPVPPAVFAAN- 386
Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
MD+ T +TRLP AY A R AF AQ + DTCY+ +G VR+P
Sbjct: 387 -----AAMDSRTIITRLPPTAYMALRAAFRAQMRAYRAVAPKGQLDTCYDFTGVPMVRLP 441
Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL--SIIGNIQQEGIQISFDGA 484
V+ F + L S ++ C AFAP+ + IIGN+QQ+ +++ ++
Sbjct: 442 KVTLVFDRNAAVELDPSGVML------DSCLAFAPNANDFMPGIIGNVQQQTLEVLYNVD 495
Query: 485 NGFVGFGPNVC 495
VGF C
Sbjct: 496 GASVGFRRAAC 506
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 171 bits (434), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 134/421 (31%), Positives = 193/421 (45%), Gaps = 45/421 (10%)
Query: 109 HARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSP-PR 167
H ++R V R + L D A D G G D GS EY + +G+G+P P+
Sbjct: 52 HELLRRMVARSKARLASLRSSACDTALTAPVDHG-----GSDVGSSEYLIHLGIGTPRPQ 106
Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDR---LENAGC 224
+ +D+GSD+VW QC C+ C+ Q PVF + S +FS V CS +C L +GC
Sbjct: 107 RVVLHLDTGSDLVWTQCA-CTVCFDQPVPVFRASVSHTFSRVPCSDPLCGHAVYLPLSGC 165
Query: 225 HAG--RCRYEVSYGDGSYTKGTLALETLTIGR-------TVVKNVAIGCGHKNQGMFV-G 274
A C Y Y D S T G +A +T T V N+ GCG N G+F
Sbjct: 166 AARDRSCFYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPNIRFGCGMMNYGLFTPN 225
Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA-AWVPLVR 333
+G+ G G G +SL QL + FSYC + ++ G E + A A P+
Sbjct: 226 QSGIAGFGTGPLSLPSQLKVRR---FSYCFTAMEESRVSPVILGGEPENIEAHATGPIQS 282
Query: 334 NPRAP----------SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
P AP FY++ L G+ VG R+P + F L G G +D+GTA+T
Sbjct: 283 TPFAPGPAGAPVGSQPFYFLSLRGVTVGETRLPFNASTFALKGDGSGGTFIDSGTAITFF 342
Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSIFDT--CYNLSGFVSV-RVPTVSFYFSGGPVLTL 440
P + + R+AFVAQ LP A G + D C+++ VP + + G L
Sbjct: 343 PQAVFRSLREAFVAQV-PLPVAKGYTDPDNLLCFSVPAKKKAPAVPKLILHLEGAD-WEL 400
Query: 441 PASNFLIPVDDAGT-----FCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNV 494
P N+++ DD G+ C + S +IIGN QQ+ + I +D + + F P
Sbjct: 401 PRENYVLDNDDDGSGAGRKLCVVILSAGNSNGTIIGNFQQQNMHIVYDLESNKMVFAPAR 460
Query: 495 C 495
C
Sbjct: 461 C 461
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 171 bits (433), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 154/536 (28%), Positives = 232/536 (43%), Gaps = 89/536 (16%)
Query: 16 LHLLCSIITTSTSAASDTHFQILNVNES-------IKGSRTDHAKMSQYNELFERHNNIS 68
L +LC + A +D + V S KG R H ++ Y+ + +N
Sbjct: 7 LLILCIATSLLADAGADDQVNYVVVETSSLKPSAVCKGHRV-HPSVNNYSSSWTPLSNPH 65
Query: 69 SSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARM--------QRDVKRVA 120
+ S E ++ S+SS + + + +H+ + R ++
Sbjct: 66 GPCSPSWEEGAAMDY------SASSMVDDMLRWDQHRAGYIQRKLSGNVSHEDTEISDST 119
Query: 121 TLVRRLSGGGAD-----------AAKHEVQDFGTDVVSGMDQ--------GSGEYFVRIG 161
T + ++GGGA AK + QD VV + GS +R G
Sbjct: 120 TTLESVNGGGAGDFSMGDDGTGGMAKAQQQDTHHQVVEELSSAADPAATGGSRRSRLRPG 179
Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPC--SQCYKQSDPVFDPADSASFSGVSCSSAVCDRL 219
V Q M++D+ SD+ WVQC PC SQCY Q+D ++DP+ S S +CSS C +L
Sbjct: 180 V-----RQLMLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQL 234
Query: 220 --ENAGCH-----AGRCRYEVSYGDGSYTKGTLALETLTIGRTV-VKNVAIGCGHKNQGM 271
GC AG+C+Y V Y DGS T GTL + L++ T V GC H +G
Sbjct: 235 GPYANGCSSSSNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTSQVPKFEFGCSHAARGS 294
Query: 272 FV--GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA--- 326
F AG++ LG G SLV Q + G FSYC + G V G +P +
Sbjct: 295 FSRSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCFPPTAS-HKGFFVLG---VPRRSSSR 350
Query: 327 -AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
A P+++ P Y V L + V G R+ + +F G +D+ T +TRLP
Sbjct: 351 YAVTPMLKTPM---LYQVRLEAIAVAGQRLDVPPTVFAA------GAALDSRTVITRLPP 401
Query: 386 PAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNF 445
AY+A R AF + A+ DTCY+ +G S+ +PT+S F +
Sbjct: 402 TAYQALRSAFRDKMSMYRPAAANGQLDTCYDFTGVSSIMLPTISLVFD--------RTGA 453
Query: 446 LIPVDDAGTF---CFAFAPSP---SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ +D +G C AFA + IIG +Q + I++ ++ A G VGF C
Sbjct: 454 GVQLDPSGVLFGSCLAFASTAGDDRATGIIGFLQLQTIEVLYNVAGGSVGFRRGAC 509
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 125/439 (28%), Positives = 204/439 (46%), Gaps = 53/439 (12%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+++EL+HRD S + Q + R+ R + RR + H++
Sbjct: 26 FSVELIHRDSPLSP--------IYNPQITVTDRLNAAFLRSVSRSRRFN--------HQL 69
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
TD+ SG+ GE+F+ I +G+PP + + D+GSD+ WVQC+PC QCYK++ P+F
Sbjct: 70 SQ--TDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIF 127
Query: 199 DPADSASFSGVSCSSAVCDRLENA--GCHAGR--CRYEVSYGDGSYTKGTLALETLTIGR 254
D S+++ C S C L + GC C+Y SYGD S++KG +A ET++I
Sbjct: 128 DKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDS 187
Query: 255 TVVKNVA-----IGCGHKNQGMFVGAAGLLGLGGGS-MSLVGQLGGQTGGAFSYCLVSRG 308
V+ GCG+ N G F + GG +SL+ QLG FSYCL +
Sbjct: 188 ASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKS 247
Query: 309 TGSSGSLV--FGREALP------VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISED 360
++G+ V G ++P G PLV + ++YY+ L + VG +IP +
Sbjct: 248 ATTNGTSVINLGTNSIPSSLSKDSGVVSTPLV-DKEPLTYYYLTLEAISVGKKKIPYTGS 306
Query: 361 LFRLTQMGDDG--------VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IF 411
+ DDG +++D+GT +T L ++ F A R S +
Sbjct: 307 SY---NPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLL 363
Query: 412 DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGN 471
C+ SG + +P ++ +F+G V P + F+ +D C + P+ + ++I GN
Sbjct: 364 SHCFK-SGSAEIGLPEITVHFTGADVRLSPINAFVKLSEDM--VCLSMVPT-TEVAIYGN 419
Query: 472 IQQEGIQISFDGANGFVGF 490
Q + +D V F
Sbjct: 420 FAQMDFLVGYDLETRTVSF 438
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 138/437 (31%), Positives = 200/437 (45%), Gaps = 56/437 (12%)
Query: 90 SSSSNTTNNMHYHRHQHSFHARMQRD-VKRVATLVRRLSGGGADAAKHEVQDFGTDVVSG 148
+ S+ + + +H+ + R D V +++ ++S G K Q GT V
Sbjct: 80 APPSSVAETLRWDQHRAGYIQRKLEDQVPITRSVITQVSHQGVVQPKVGTQGQGTGV--- 136
Query: 149 MDQGSGEYFVRIGVGSPPR------SQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDP 200
Q +GE VG P +Q MVID+ SD+ WVQC PC C+ Q+D ++DP
Sbjct: 137 --QPAGE-----PVGDAPTGGSGGVAQTMVIDTASDVPWVQCAPCPAPHCHAQTDVLYDP 189
Query: 201 ADSASFSGVSCSSAVCDRL---ENAGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGR-- 254
+ S+S + CSS C L N AG +C+Y V Y DGS + GT + LT+
Sbjct: 190 SKSSSSAAFPCSSPACRNLGPYANGCTPAGDQCQYRVQYPDGSASAGTYISDVLTLNPAK 249
Query: 255 --TVVKNVAIGCGHK--NQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT 309
+ + GC H G F +G++ LG G+ SL Q G FSYCL
Sbjct: 250 PASAISEFRFGCSHALLQPGSFSNKTSGIMALGRGAQSLPTQTKATYGDVFSYCLPPTPV 309
Query: 310 GSSGSLVFGREALPVGA-AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
SG + G + A P++R+ AP Y V L + V G R+P+ +F
Sbjct: 310 -HSGFFILGVPRVAASRYAVTPMLRSKAAPMLYLVRLIAIEVAGKRLPVPPAVFAA---- 364
Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLS-----GFVSV 423
G VMD+ T VTRLP AY A R AFVA+ A+ DTCY+ S G V
Sbjct: 365 --GAVMDSRTIVTRLPPTAYMALRAAFVAEMRAYRAAAPKEHLDTCYDFSGAAPGGGGGV 422
Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF---CFAFAPSPSG--LSIIGNIQQEGIQ 478
++P ++ F G N + +D +G C AFAP+ IIGN+QQ+ ++
Sbjct: 423 KLPKITLVFDG--------PNGAVELDPSGVLLDGCLAFAPNTDDQMTGIIGNVQQQALE 474
Query: 479 ISFDGANGFVGFGPNVC 495
+ ++ VGF C
Sbjct: 475 VLYNVDGATVGFRRGAC 491
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 100/295 (33%), Positives = 137/295 (46%), Gaps = 28/295 (9%)
Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC 211
+ EY V + VG+PPR + +D+GSD+VW QC PC C+ Q P+ DPA S++++ + C
Sbjct: 82 ATNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFDQGIPLLDPAASSTYAALPC 141
Query: 212 SSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT----------VVKNVA 261
+ C L C C Y YGD S T G +A + T G + +
Sbjct: 142 GAPRCRALPFTSCGGRSCVYVYHYGDKSVTVGKIATDRFTFGDNGRRNGDGSLPATRRLT 201
Query: 262 IGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG-- 318
GCGH N+G+F G+ G G G SL QL + FSYC S S + G
Sbjct: 202 FGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNATS---FSYCFTSMFDSKSSIVTLGGA 258
Query: 319 -----REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVV 373
A PL +NP PS Y++ L G+ VG R+P+ E FR T +
Sbjct: 259 PAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETKFRST-------I 311
Query: 374 MDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
+D+G ++T LP YEA + F AQ G P S D C+ L R P V
Sbjct: 312 IDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDVCFALPVSALWRRPAV 366
>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
gi|194688798|gb|ACF78483.1| unknown [Zea mays]
gi|194703430|gb|ACF85799.1| unknown [Zea mays]
gi|194707192|gb|ACF87680.1| unknown [Zea mays]
gi|223944599|gb|ACN26383.1| unknown [Zea mays]
gi|223948667|gb|ACN28417.1| unknown [Zea mays]
gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 450
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 122/347 (35%), Positives = 169/347 (48%), Gaps = 18/347 (5%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
Y VR +G+PP+ + +D+ +D W+ C C+ C S FDPA SAS+ V C S +
Sbjct: 112 YVVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAPFDPASSASYRTVPCGSPL 171
Query: 216 CDRLENAGCHAG--RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFV 273
C + NA C G C + ++Y D S + L+ ++L + VK GC + G
Sbjct: 172 CAQAPNAACPPGGKACGFSLTYADSSL-QAALSQDSLAVAGNAVKAYTFGCLQRATGTAA 230
Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLV 332
GLLGLG G +S + Q FSYCL S + SG+L GR P PL+
Sbjct: 231 PPQGLLGLGRGPLSFLSQTKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQPQRIKTTPLL 290
Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
NP S YYV ++G+ VG +PI G V+D+GT TRL PAY A R
Sbjct: 291 ANPHRSSLYYVNMTGIRVGRKVVPIPA----FDPATGAGTVLDSGTMFTRLVAPAYVAVR 346
Query: 393 DAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA 452
D + G S + FDTC+N + +V P V+ F G V TLP N +I
Sbjct: 347 DEVRRRVGA--PVSSLGGFDTCFNTT---AVAWPPVTLLFDGMQV-TLPEENVVIHSTYG 400
Query: 453 GTFCFAFAPSPSG----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C A A +P G L++I ++QQ+ ++ FD NG VGF C
Sbjct: 401 TISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 447
>gi|115466078|ref|NP_001056638.1| Os06g0121500 [Oryza sativa Japonica Group]
gi|113594678|dbj|BAF18552.1| Os06g0121500 [Oryza sativa Japonica Group]
Length = 442
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 118/344 (34%), Positives = 157/344 (45%), Gaps = 66/344 (19%)
Query: 161 GVGSPPRSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
+ P +Q M ID+ D+ W+QC PC +CY Q + +FDP S + + V C SA C
Sbjct: 156 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 215
Query: 219 L--ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-RTVVKNVAIGCGHKNQGMFVGA 275
L AGC +C+Y V YGDG T GT ++ LT+ TVV N GC H +G F
Sbjct: 216 LGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNF--- 272
Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNP 335
+ S+ +F R PLVRNP
Sbjct: 273 ---------------------------------SASTSGTMFAR---------TPLVRNP 290
Query: 336 RA-PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
P+ Y V L G+ VGG R+ + +F G VMD+ +T+LP AY A R A
Sbjct: 291 SIIPTLYLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLA 344
Query: 395 FVAQTGNLPR-ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
F + PR A G + DTCY+ F SV VP VS F GG V+ L A ++
Sbjct: 345 FRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------ 398
Query: 454 TFCFAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C AF P+P L IGN+QQ+ ++ +D G VGF C
Sbjct: 399 EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 442
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 123/375 (32%), Positives = 188/375 (50%), Gaps = 38/375 (10%)
Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-------CYKQSDPVFDPAD 202
DQG + + +G+G+PP+ + +++D+GSD++W QC S+ +Q +P+++P
Sbjct: 81 DQG---HSLTVGIGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQREPLYEPRR 137
Query: 203 SASFSGVSCSSAVCD--RLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIG--RTVV 257
S+SF+ + CS +C + C RC Y+ YG G LA ET T G V
Sbjct: 138 SSSFAYLPCSDRLCQEGQFSYKNCARNNRCMYDELYGSAE-AGGVLASETFTFGVNAKVS 196
Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF 317
+ GCG + G VGA+GL+GL G MSLV QL + FSYCL + L+F
Sbjct: 197 LPLGFGCGALSAGDLVGASGLMGLSPGIMSLVSQL---SVPRFSYCLTPFAERKTSPLLF 253
Query: 318 G-----REALPVGAAWVP-LVRNPRAPS-FYYVGLSGLGVGGMRIPI-SEDLFRLTQMGD 369
G R G ++RNP + +YYV L GL +G R+ + + L + G
Sbjct: 254 GAMADLRRYRTTGTVQTTSILRNPAMETAYYYVPLVGLSLGTKRLDVPATSLGMIKPDGS 313
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI----FDTCYNLSGFV---S 422
G ++D+G+ ++ L A+ A + A V + LP A+G ++ C+ L V +
Sbjct: 314 GGTIVDSGSTMSYLEETAFRAVKKA-VVEAVRLPVANGTDEDYDDYELCFALPTGVAMEA 372
Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS--GLSIIGNIQQEGIQIS 480
V+ P + +F GG +TLP N+ AG C A SP G+SIIGN+QQ+ + +
Sbjct: 373 VKTPPLVLHFDGGAAMTLPRDNYF-QEPRAGLMCLAVGTSPDGFGVSIIGNVQQQNMHVL 431
Query: 481 FDGANGFVGFGPNVC 495
FD N F P C
Sbjct: 432 FDVRNQKFSFAPTKC 446
>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
Length = 453
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 140/412 (33%), Positives = 208/412 (50%), Gaps = 40/412 (9%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGT--DVVSGMDQGSGEYFVRIGVGSPPRSQ 169
++RD+ R A R L+ + ++ T D+ +G GEY + + +G+PP+S
Sbjct: 51 LRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNG-----GEYIMTLAIGTPPQSY 105
Query: 170 YMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSASFSGVSCSSAV--CD---RLENAG 223
+ D+GSD+VW QC PC + C+KQ P+++P+ S +F + CSSA+ C RL A
Sbjct: 106 PAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAGAT 165
Query: 224 CHAG-RCRYEVSYGDGSYTKGTLALETLTIG-----RTVVKNVAIGCGHKNQGMFVGAAG 277
G CRY +YG G +T G ET T G + V +A GC + + + G+AG
Sbjct: 166 PPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIAFGCSNASSDDWNGSAG 224
Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALP-----VGAAWVPL 331
L+GLG G +SLV QL G FSYCL + T S +L+ G A G P
Sbjct: 225 LVGLGRGGLSLVSQLAA---GMFSYCLTPFQDTKSKSTLLLGPAAAAAALNGTGVRSTPF 281
Query: 332 VRNPRAP---SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
V +P P ++YY+ L+G+ VG +PI F L G G+++D+GT +T L AY
Sbjct: 282 VPSPSKPPMSTYYYLNLTGISVGAAALPIPPGAFALRADGTGGLIIDSGTTITSLVDAAY 341
Query: 389 EAFRDAFVAQTGNLPRASGVSI--FDTCYNL--SGFVSVRVPTVSFYFSGGPVLTLPASN 444
+ R A V LP G + D C+ L S +P+++ +F GG + LP N
Sbjct: 342 KRVRAA-VRSLVKLPVTDGSNATGLDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVEN 400
Query: 445 FLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
++I D G +C A G LS +GN QQ+ + I +D + F P C
Sbjct: 401 YMI--LDGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKETLSFAPAKC 450
>gi|356507997|ref|XP_003522749.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 440
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 124/359 (34%), Positives = 175/359 (48%), Gaps = 16/359 (4%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
+ SG G Y VR+ +G+P + +MV+D+ +D +V C C+ C SD F P S
Sbjct: 89 IASGQTFNIGNYVVRVKLGTPGQLLFMVLDTSTDEAFVPCSGCTGC---SDTTFSPKAST 145
Query: 205 SFSGVSCSSAVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA 261
S+ + CS C ++ C A G C + SY S++ TL ++L + V+ N +
Sbjct: 146 SYGPLDCSVPQCGQVRGLSCPATGTGACSFNQSYAGSSFS-ATLVQDSLRLATDVIPNYS 204
Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGRE 320
GC + G V A GLLGLG G +SL+ Q G G FSYCL S + SGSL G
Sbjct: 205 FGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFSYCLPSFKSYYFSGSLKLGPV 264
Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
P PL+R+P PS YYV +G+ VG + +P + G ++D+GT +
Sbjct: 265 GQPKSIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFPSEYLGFNPNTGSGTIIDSGTVI 324
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
TR P Y A R+ F Q G S + FDTC+ + P ++ +F G L L
Sbjct: 325 TRFVEPVYNAVREEFRKQVGGTTFTS-IGAFDTCFVKT--YETLAPPITLHFEGLD-LKL 380
Query: 441 PASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
P N LI C A A +P S L++I N QQ+ ++I FD N VG VC
Sbjct: 381 PLENSLIHSSAGSLACLAMAAAPDNVNSVLNVIANFQQQNLRILFDTVNNKVGIAREVC 439
>gi|55296937|dbj|BAD68388.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|218197467|gb|EEC79894.1| hypothetical protein OsI_21421 [Oryza sativa Indica Group]
Length = 424
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 118/344 (34%), Positives = 157/344 (45%), Gaps = 66/344 (19%)
Query: 161 GVGSPPRSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
+ P +Q M ID+ D+ W+QC PC +CY Q + +FDP S + + V C SA C
Sbjct: 138 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 197
Query: 219 L--ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-RTVVKNVAIGCGHKNQGMFVGA 275
L AGC +C+Y V YGDG T GT ++ LT+ TVV N GC H +G F
Sbjct: 198 LGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNF--- 254
Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNP 335
+ S+ +F R PLVRNP
Sbjct: 255 ---------------------------------SASTSGTMFAR---------TPLVRNP 272
Query: 336 RA-PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
P+ Y V L G+ VGG R+ + +F G VMD+ +T+LP AY A R A
Sbjct: 273 SIIPTLYLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLA 326
Query: 395 FVAQTGNLPR-ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
F + PR A G + DTCY+ F SV VP VS F GG V+ L A ++
Sbjct: 327 FRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------ 380
Query: 454 TFCFAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C AF P+P L IGN+QQ+ ++ +D G VGF C
Sbjct: 381 EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 424
>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
Length = 372
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 120/345 (34%), Positives = 175/345 (50%), Gaps = 14/345 (4%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
Y VR +G+P ++ M +D+ SD+ W+ PC+ C S +F+ S ++ + C +A
Sbjct: 36 YIVRAKIGTPAQTMLMAMDTSSDVAWI---PCNGCLGCSSTLFNSPASTTYKSLGCQAAQ 92
Query: 216 CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGA 275
C ++ C G C + ++YG GS L+ +T+T+ V + GC K G + A
Sbjct: 93 CKQVPKPTCGGGVCSFNLTYG-GSSLAANLSQDTITLATDAVPGYSFGCIQKATGGSLPA 151
Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLVRN 334
GLLGLG G +SL+ Q FSYCL S + SGSL G P + PL++N
Sbjct: 152 QGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKRIKYTPLLKN 211
Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
PR PS Y+V L + VG + + F G + D+GT TRL TPAY A RDA
Sbjct: 212 PRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDA 271
Query: 395 FVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
F + G + + FDTCY V + PT++F F+G V TLP N LI T
Sbjct: 272 FRNRVGRNLTVTSLGGFDTCYT----VPIAAPTITFMFTGMNV-TLPPDNLLIHSTAGST 326
Query: 455 FCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C A A +P S L++I N+QQ+ ++ +D N +G +C
Sbjct: 327 TCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELC 371
>gi|55296886|dbj|BAD68338.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|55296941|dbj|BAD68392.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
Length = 424
Score = 169 bits (429), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 118/344 (34%), Positives = 157/344 (45%), Gaps = 66/344 (19%)
Query: 161 GVGSPPRSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
+ P +Q M ID+ D+ W+QC PC +CY Q + +FDP S + + V C SA C
Sbjct: 138 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 197
Query: 219 L--ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-RTVVKNVAIGCGHKNQGMFVGA 275
L AGC +C+Y V YGDG T GT ++ LT+ TVV N GC H +G F
Sbjct: 198 LGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNF--- 254
Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNP 335
+ S+ +F R PLVRNP
Sbjct: 255 ---------------------------------SASTSGTMFAR---------TPLVRNP 272
Query: 336 RA-PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
P+ Y V L G+ VGG R+ + +F G VMD+ +T+LP AY A R A
Sbjct: 273 SIIPTLYLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLA 326
Query: 395 FVAQTGNLPR-ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
F + PR A G + DTCY+ F SV VP VS F GG V+ L A ++
Sbjct: 327 FRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------ 380
Query: 454 TFCFAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C AF P+P L IGN+QQ+ ++ +D G VGF C
Sbjct: 381 EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 424
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 123/400 (30%), Positives = 180/400 (45%), Gaps = 36/400 (9%)
Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
R R A L SG A A V TDV S EY + + +G+P RSQ +V+
Sbjct: 58 RSRARAANLCP-YSGATARPATAPVGRANTDVNS-------EYLIHLSIGAP-RSQPVVL 108
Query: 174 --DSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRY 231
D+GSD+VW QC+PC++C+ Q P FD A S + V+CS +C+ GC C Y
Sbjct: 109 TLDTGSDVVWTQCEPCAECFTQPLPRFDTAASNTVRSVACSDPLCNAHSEHGCFLHGCTY 168
Query: 232 EVSYGDGSYTKGTLALETLTI------GRTVVKNVAIGCGHKNQGMFVGA-AGLLGLGGG 284
YGDGS + G ++ T G+ V ++ GCG N G F+ G+ G G G
Sbjct: 169 VSGYGDGSLSFGHFLRDSFTFDDGKGGGKVTVPDIGFGCGMYNAGRFLQTETGIAGFGRG 228
Query: 285 SMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSF---- 340
+SL QL + FSYC +R S + G A P++ P S
Sbjct: 229 PLSLPSQLKVRQ---FSYCFTTRFEAKSSPVFLGGAGDLKAHATGPILSTPFVRSLPPGT 285
Query: 341 ----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV 396
Y + G+ VG R+P+ E + G +D+GT +T P + + AF+
Sbjct: 286 DNSHYVLSFKGVTVGKTRLPVPE----IKADGSGATFIDSGTDITTFPDAVFRQLKSAFI 341
Query: 397 AQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFC 456
AQ LP D C++ G + +P + F+ G LP N++ ++G C
Sbjct: 342 AQAA-LPVNKTADEDDICFSWDGKKTAAMPKLVFHLEGAD-WDLPRENYVTEDRESGQVC 399
Query: 457 FAFAPS-PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
A + S ++IGN QQ+ I +D A G + P C
Sbjct: 400 VAVSTSGQMDRTLIGNFQQQNTHIVYDLAAGKLLLVPAQC 439
>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 453
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 140/412 (33%), Positives = 208/412 (50%), Gaps = 40/412 (9%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGT--DVVSGMDQGSGEYFVRIGVGSPPRSQ 169
++RD+ R A R L+ + ++ T D+ +G GEY + + +G+PP+S
Sbjct: 51 LRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNG-----GEYIMTLAIGTPPQSY 105
Query: 170 YMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSASFSGVSCSSAV--CD---RLENAG 223
+ D+GSD+VW QC PC + C+KQ P+++P+ S +F + CSSA+ C RL A
Sbjct: 106 PAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAGAT 165
Query: 224 CHAG-RCRYEVSYGDGSYTKGTLALETLTIG-----RTVVKNVAIGCGHKNQGMFVGAAG 277
G CRY +YG G +T G ET T G + V +A GC + + + G+AG
Sbjct: 166 PPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIAFGCSNASSDDWNGSAG 224
Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALP-----VGAAWVPL 331
L+GLG G +SLV QL G FSYCL + T S +L+ G A G P
Sbjct: 225 LVGLGRGGLSLVSQLAA---GMFSYCLTPFQDTKSKSTLLLGPAAAAAALNGTGVRSTPF 281
Query: 332 VRNPRAP---SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
V +P P ++YY+ L+G+ VG +PI F L G G+++D+GT +T L AY
Sbjct: 282 VPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGTGGLIIDSGTTITSLVDAAY 341
Query: 389 EAFRDAFVAQTGNLPRASGVSI--FDTCYNL--SGFVSVRVPTVSFYFSGGPVLTLPASN 444
+ R A V LP G + D C+ L S +P+++ +F GG + LP N
Sbjct: 342 KRVRAA-VRSLVKLPVTDGSNATGLDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVEN 400
Query: 445 FLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
++I D G +C A G LS +GN QQ+ + I +D + F P C
Sbjct: 401 YMI--LDGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKETLSFAPAKC 450
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 106/298 (35%), Positives = 146/298 (48%), Gaps = 12/298 (4%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
Y VR+ +G+P + +MV+D+ +D WV C C+ C S F P S + + CS
Sbjct: 43 ANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGC---SSTTFLPNASTTLGSLDCSE 99
Query: 214 AVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
A C ++ C A C + SYG S TL + +T+ V+ GC + G
Sbjct: 100 AQCSQVRGFSCPATGSSACLFNQSYGGDSSLAATLVQDAITLANDVIPGFTFGCINAVSG 159
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV 329
+ GLLGLG G +SL+ Q G G FSYCL S + SGSL G P
Sbjct: 160 GSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTT 219
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
PL+RNP PS YYV L+G+ VG +++PI + G ++D+GT +TR P Y
Sbjct: 220 PLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYF 279
Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
A RD F Q N P +S + FDTC+ + P V+ +F G L LP N LI
Sbjct: 280 AIRDEFRKQV-NGPISS-LGAFDTCFAATN--EAEAPAVTLHFEGL-NLVLPMENSLI 332
>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
Length = 458
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 140/412 (33%), Positives = 208/412 (50%), Gaps = 40/412 (9%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGT--DVVSGMDQGSGEYFVRIGVGSPPRSQ 169
++RD+ R A R L+ + ++ T D+ +G GEY + + +G+PP+S
Sbjct: 56 LRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNG-----GEYIMTLAIGTPPQSY 110
Query: 170 YMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSASFSGVSCSSAV--CD---RLENAG 223
+ D+GSD+VW QC PC + C+KQ P+++P+ S +F + CSSA+ C RL A
Sbjct: 111 PAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAGAT 170
Query: 224 CHAG-RCRYEVSYGDGSYTKGTLALETLTIG-----RTVVKNVAIGCGHKNQGMFVGAAG 277
G CRY +YG G +T G ET T G + V +A GC + + + G+AG
Sbjct: 171 PPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIAFGCSNASSDDWNGSAG 229
Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALP-----VGAAWVPL 331
L+GLG G +SLV QL G FSYCL + T S +L+ G A G P
Sbjct: 230 LVGLGRGGLSLVSQLAA---GMFSYCLTPFQDTKSKSTLLLGPAAAAAALNGTGVRSTPF 286
Query: 332 VRNPRAP---SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
V +P P ++YY+ L+G+ VG +PI F L G G+++D+GT +T L AY
Sbjct: 287 VPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGTGGLIIDSGTTITSLVDAAY 346
Query: 389 EAFRDAFVAQTGNLPRASGVSI--FDTCYNL--SGFVSVRVPTVSFYFSGGPVLTLPASN 444
+ R A V LP G + D C+ L S +P+++ +F GG + LP N
Sbjct: 347 KRVRAA-VRSLVKLPVTDGSNATGLDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVEN 405
Query: 445 FLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
++I D G +C A G LS +GN QQ+ + I +D + F P C
Sbjct: 406 YMI--LDGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKETLSFAPAKC 455
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 119/366 (32%), Positives = 190/366 (51%), Gaps = 36/366 (9%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSCS 212
GE+ + + +G+PP + D+GSD++W QC PCS QC++Q P+++P+ S +FS + C+
Sbjct: 83 GEFLMTLAIGTPPLPFLAIADTGSDLIWTQCAPCSRQCFQQPTPLYNPSSSTTFSALPCN 142
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV------VKNVAIGCGH 266
S++ L C C Y ++YG G +T ET T G + V +A GC +
Sbjct: 143 SSL--GLCAPAC---ACMYNMTYGSG-WTYVFQGTETFTFGSSTPADQVRVPGIAFGCSN 196
Query: 267 KNQGMFVGAA-GLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPV 324
+ G +A GL+GLG GS+SLV QLG FSYCL + T S+ +L+ G A
Sbjct: 197 ASSGFNASSASGLVGLGRGSLSLVSQLGAP---KFSYCLTPYQDTNSTSTLLLGPSASLN 253
Query: 325 GAAWV---PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
V P V +P + +YY+ L+G+ +G +PI + F L G G+++D+GT +T
Sbjct: 254 DTGVVSSTPFVASPSS-IYYYLNLTGISLGTTALPIPPNAFSLKADGTGGLIIDSGTTIT 312
Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSV--RVPTVSFYFSGGPV 437
L AY+ R A ++ LP G + D C+ L S +P+++ +F G +
Sbjct: 313 MLGNTAYQQVRAAVLSLV-TLPTTDGSAATGLDLCFELPSSTSAPPSMPSMTLHFDGADM 371
Query: 438 LTLPASNFLI----PVDDAGTFCFAFAPSPSG----LSIIGNIQQEGIQISFDGANGFVG 489
+ LPA N+++ P D+ +C A +SI+GN QQ+ + I +D +
Sbjct: 372 V-LPADNYMMSLSDPDSDSSLWCLAMQNQTDTDGVVVSILGNYQQQNMHILYDVGKETLS 430
Query: 490 FGPNVC 495
F P C
Sbjct: 431 FAPAKC 436
>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
Length = 425
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 122/348 (35%), Positives = 165/348 (47%), Gaps = 15/348 (4%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
S Y V+ VG+P ++ M +D+ +D W+ C C C S VF+ S +F + C
Sbjct: 87 SPTYIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGC---SSTVFNSVTSTTFKTLGCD 143
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
+ C ++ N C C + +YG GS L +T+ + +V GC K G
Sbjct: 144 APQCKQVPNPTCGGSTCTWNTTYG-GSTILSNLTRDTIALSTDIVPGYTFGCIQKTTGSS 202
Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPL 331
V GLLGLG G +S + Q FSYCL S R SG+L G P+ PL
Sbjct: 203 VPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFRTLNFSGTLRLGPAGQPLRIKTTPL 262
Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
++NPR S YYV L G+ VG + I G + D+GT TRL P Y A
Sbjct: 263 LKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVFTRLVAPVYTAV 322
Query: 392 RDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDD 451
RD F + GN S + FDTCY +G + PT++F FSG V TLP N LI
Sbjct: 323 RDEFRKRVGNA-IVSSLGGFDTCY--TG--PIVAPTMTFMFSGMNV-TLPTDNLLIRSTA 376
Query: 452 AGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
T C A A +P S L++I N+QQ+ +I FD N +G C
Sbjct: 377 GSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRIGVAREPC 424
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 118/342 (34%), Positives = 171/342 (50%), Gaps = 33/342 (9%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
G + V + G+PP+ +++D+GS I W QC+PC +C K S FDP+ S ++S SC
Sbjct: 160 GNFLVDVAFGTPPQKFTLILDTGSSITWTQCKPCVRCLKASRRHFDPSASLTYSLGSCIP 219
Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMF 272
+ Y ++YGD S + G +T+T+ + V GCG N+G F
Sbjct: 220 STVGN-----------TYNMTYGDKSTSVGNYGCDTMTLEHSDVFPKFQFGCGRNNEGDF 268
Query: 273 -VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA--WV 329
GA G+LGLG G +S V Q + FSYCL S GSL+FG +A ++ +
Sbjct: 269 GSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPEE--DSIGSLLFGEKATSQSSSLKFT 326
Query: 330 PLVRNP-----RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
LV P +Y+V L + VG R+ I +F G ++D+GT +TRLP
Sbjct: 327 SLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVF-----ASPGTIIDSGTVITRLP 381
Query: 385 TPAYEAFRDAFVAQTGNLPRASGV----SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
AY A + AF P ++G I DTCYNLSG V +P + +F G + L
Sbjct: 382 QRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRL 441
Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
+I +DA C AFA + S L+IIGN QQ + + +D
Sbjct: 442 NGKR-VIWGNDASRLCLAFAGN-SELTIIGNRQQVSLTVLYD 481
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 106/298 (35%), Positives = 146/298 (48%), Gaps = 12/298 (4%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
Y VR+ +G+P + +MV+D+ +D WV C C+ C S F P S + + CS
Sbjct: 43 ANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGC---SSTTFLPNASTTLGSLDCSE 99
Query: 214 AVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
A C ++ C A C + SYG S TL + +T+ V+ GC + G
Sbjct: 100 AQCSQVRGFSCPATGSSACLFNQSYGGDSSLAATLVQDAITLANDVIPGFTFGCINAVSG 159
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV 329
+ GLLGLG G +SL+ Q G G FSYCL S + SGSL G P
Sbjct: 160 GSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTT 219
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
PL+RNP PS YYV L+G+ VG +++PI + G ++D+GT +TR P Y
Sbjct: 220 PLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYF 279
Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
A RD F Q N P +S + FDTC+ + P V+ +F G L LP N LI
Sbjct: 280 AIRDEFRKQV-NGPISS-LGAFDTCFAETN--EAEAPAVTLHFEGL-NLVLPMENSLI 332
>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
Length = 425
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 122/348 (35%), Positives = 165/348 (47%), Gaps = 15/348 (4%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
S Y V+ VG+P ++ M +D+ +D W+ C C C S VF+ S +F + C
Sbjct: 87 SPTYIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGC---SSTVFNSVTSTTFKTLGCD 143
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
+ C ++ N C C + +YG GS L +T+ + +V GC K G
Sbjct: 144 APQCKQVPNPTCGGSTCTWNTTYG-GSTILSNLTRDTIALSTDIVPGYTFGCIQKTTGSS 202
Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPL 331
V GLLGLG G +S + Q FSYCL S R SG+L G P+ PL
Sbjct: 203 VPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFRTLNFSGTLRLGPAGQPLRIKTTPL 262
Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
++NPR S YYV L G+ VG + I G + D+GT TRL P Y A
Sbjct: 263 LKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVFTRLVAPVYTAV 322
Query: 392 RDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDD 451
RD F + GN S + FDTCY +G + PT++F FSG V TLP N LI
Sbjct: 323 RDEFRKRVGNA-IVSSLGGFDTCY--TG--PIVAPTMTFMFSGMNV-TLPPDNLLIRSTA 376
Query: 452 AGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
T C A A +P S L++I N+QQ+ +I FD N +G C
Sbjct: 377 GSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRIGVAREPC 424
>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 121/347 (34%), Positives = 169/347 (48%), Gaps = 18/347 (5%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
Y VR +G+PP+ + +D+ +D W+ C C+ C S FDPA SAS+ V C S +
Sbjct: 112 YVVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAPFDPAASASYRTVPCGSPL 171
Query: 216 CDRLENAGCHAG--RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFV 273
C + NA C G C + ++Y D S + L+ ++L + VK GC + G
Sbjct: 172 CAQAPNAACPPGGKACGFSLTYADSSL-QAALSQDSLAVAGNAVKAYTFGCLQRATGTAA 230
Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLV 332
GLLGLG G +S + Q FSYCL S + SG+L GR P PL+
Sbjct: 231 PPQGLLGLGRGPLSFLSQTKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQPQRIKTTPLL 290
Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
NP S YYV ++G+ VG +PI G V+D+GT TRL PAY A R
Sbjct: 291 ANPHRSSLYYVNMTGVRVGRKVVPIPA----FDPATGAGTVLDSGTMFTRLVAPAYVAVR 346
Query: 393 DAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA 452
D + G S + FDTC+N + +V P ++ F G V TLP N +I
Sbjct: 347 DEVRRRVGA--PVSSLGGFDTCFNTT---AVAWPPMTLLFDGMQV-TLPEENVVIHSTYG 400
Query: 453 GTFCFAFAPSPSG----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C A A +P G L++I ++QQ+ ++ FD NG VGF C
Sbjct: 401 TISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 447
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 168 bits (425), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 118/360 (32%), Positives = 173/360 (48%), Gaps = 37/360 (10%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
G + V + G+PP+ +++D+GS I W QC+ C C K S FD S+++S SC
Sbjct: 125 GNFLVDVAFGTPPQKFKLILDTGSSITWTQCKACVHCLKDSHRHFDSLASSTYSFGSCIP 184
Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMF 272
+ Y ++YGD S + G +T+T+ + V + GCG N+G F
Sbjct: 185 STVGNT-----------YNMTYGDKSTSVGNYGCDTMTLEPSDVFQKFQFGCGRNNEGDF 233
Query: 273 -VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA--WV 329
GA G+LGLG G +S V Q + FSYCL S GSL+FG +A ++ +
Sbjct: 234 GSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPEE--NSIGSLLFGEKATSQSSSLKFT 291
Query: 330 PLVRNP-----RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
LV P +Y+V L + VG R+ I +F G ++D+GT +TRLP
Sbjct: 292 SLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVF-----ASPGTIIDSGTVITRLP 346
Query: 385 TPAYEAFRDAFVAQTGNLPRASGV----SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
AY A + AF P ++G + DTCYNLSG V +P +F G + L
Sbjct: 347 QRAYSALKAAFKKAMAKYPLSNGRRKENDMLDTCYNLSGRKDVLLPEXVLHFGDGADVRL 406
Query: 441 PASNFLIPVDDAGTFCFAFAPSPSG-----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
++ +DA C AFA + L+IIGN QQ + + +D +GFG N C
Sbjct: 407 NGKR-VVWGNDASRLCLAFAGNSKSTMNPELTIIGNRQQVSLTVLYDIRGRRIGFGGNGC 465
>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
Length = 449
Score = 168 bits (425), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 124/348 (35%), Positives = 167/348 (47%), Gaps = 16/348 (4%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
Y VR +G+P + + +D+ +D W+ C C+ C S F+PA SAS+ V C S
Sbjct: 107 YVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSP--FNPAASASYRPVPCGSPQ 164
Query: 216 CDRLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFV 273
C N C +A C + +SY D S + L+ +TL + VVK GC + G
Sbjct: 165 CVLAPNPSCSPNAKSCGFSLSYADSSL-QAALSQDTLAVAGDVVKAYTFGCLQRATGTAA 223
Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLV 332
GLLGLG G +S + Q G FSYCL S + SG+L GR P PL+
Sbjct: 224 PPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSGTLRLGRNGQPRRIKTTPLL 283
Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
NP S YYV ++G+ VG + I G V+D+GT TRL P Y A R
Sbjct: 284 ANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAPVYLALR 343
Query: 393 DAFVAQTGNLPRA-SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDD 451
D + G A S + FDTCYN +V P V+ F G V TLP N +I
Sbjct: 344 DEVRRRVGAGAAAVSSLGGFDTCYN----TTVAWPPVTLLFDGMQV-TLPEENVVIHTTY 398
Query: 452 AGTFCFAFAPSPSG----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
T C A A +P G L++I ++QQ+ ++ FD NG VGF C
Sbjct: 399 GTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESC 446
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 121/369 (32%), Positives = 184/369 (49%), Gaps = 42/369 (11%)
Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGV 209
D+G + V VG PP Q + ID+GSD++WVQC+PC+ C++QS P+FDP+ S+++ +
Sbjct: 86 DRGQA-FLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDL 144
Query: 210 SCSSAVC-DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIG 263
S S +C + + H +C Y SY DGS + G LA E + G V +V G
Sbjct: 145 SYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFG 204
Query: 264 CGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCL--VSRGTGSSGSLVFGRE 320
CGH N+G F G +G+LGL G S+V +LG + FSYC+ + + LV G +
Sbjct: 205 CGHSNRGRFDGQQSGILGLSAGDQSIVSRLGSR----FSYCIGDLFDPHYTHNQLVLG-D 259
Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
+ + + P FYYV L G+ VG R+ I+ ++F+ T+ G GVVMD+GT
Sbjct: 260 GVKMEGSSTPF---HTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTA 316
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDT-----CY------NLSGFVSVRVPTV 428
T L + D + L R I+ T CY +L GF P +
Sbjct: 317 TFLAKDGF----DPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGF-----PEL 367
Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANG 486
+F+F+ G L L A++ + + FC A S + S+IG + Q+ +++D
Sbjct: 368 AFHFAEGADLVLDANSLFVQ-KNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGK 426
Query: 487 FVGFGPNVC 495
V F C
Sbjct: 427 RVYFQRTDC 435
>gi|255545932|ref|XP_002514026.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547112|gb|EEF48609.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 437
Score = 167 bits (424), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 121/350 (34%), Positives = 173/350 (49%), Gaps = 16/350 (4%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
G Y VR+ +G+P + +MV+D+ +D WV C C+ C + S+++ + CS
Sbjct: 95 GNYVVRVKLGTPGQFMFMVLDTSNDAAWVPCSGCTGCSSTTFST---NTSSTYGSLDCSM 151
Query: 214 AVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
A C ++ C A C + SYG S TL ++L + V+ N A GC + G
Sbjct: 152 AQCTQVRGFSCPATGSSSCVFNQSYGGDSSFSATLVEDSLRLVNDVIPNFAFGCINSISG 211
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV 329
V GLLGLG G +SL+ Q G G FSYCL S + SGSL G P +
Sbjct: 212 GSVPPQGLLGLGRGPLSLIAQSGSLYSGLFSYCLPSFKSYYFSGSLKLGPAGQPKSIRYT 271
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
PL+RNP PS YYV L+G+ VG +PI+ +L G ++D+GT +TR P Y
Sbjct: 272 PLLRNPHRPSLYYVNLTGVSVGRTLVPIAPELLAFNPNTGAGTIIDSGTVITRFVQPIYT 331
Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
A RD F Q S + FDTC+ + P V+ +F+G L LP N LI
Sbjct: 332 AIRDEFRKQVAG--PFSSLGAFDTCFAATN--EAVAPAVTLHFTGL-NLVLPMENSLIHS 386
Query: 450 DDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C A A +P S L++I N+QQ+ +++ FD N +G +C
Sbjct: 387 SAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRLLFDVPNSRLGIARELC 436
>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
Length = 396
Score = 167 bits (424), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 124/348 (35%), Positives = 167/348 (47%), Gaps = 16/348 (4%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
Y VR +G+P + + +D+ +D W+ C C+ C S F+PA SAS+ V C S
Sbjct: 54 YVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSP--FNPAASASYRPVPCGSPQ 111
Query: 216 CDRLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFV 273
C N C +A C + +SY D S + L+ +TL + VVK GC + G
Sbjct: 112 CVLAPNPSCSPNAKSCGFSLSYADSSL-QAALSQDTLAVAGDVVKAYTFGCLQRATGTAA 170
Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLV 332
GLLGLG G +S + Q G FSYCL S + SG+L GR P PL+
Sbjct: 171 PPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSGTLRLGRNGQPRRIKTTPLL 230
Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
NP S YYV ++G+ VG + I G V+D+GT TRL P Y A R
Sbjct: 231 ANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAPVYLALR 290
Query: 393 DAFVAQTGNLPRA-SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDD 451
D + G A S + FDTCYN +V P V+ F G V TLP N +I
Sbjct: 291 DEVRRRVGAGAAAVSSLGGFDTCYN----TTVAWPPVTLLFDGMQV-TLPEENVVIHTTY 345
Query: 452 AGTFCFAFAPSPSG----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
T C A A +P G L++I ++QQ+ ++ FD NG VGF C
Sbjct: 346 GTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESC 393
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 121/369 (32%), Positives = 184/369 (49%), Gaps = 42/369 (11%)
Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGV 209
D+G + V VG PP Q + ID+GSD++WVQC+PC+ C++QS P+FDP+ S+++ +
Sbjct: 54 DRGQA-FLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDL 112
Query: 210 SCSSAVC-DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIG 263
S S +C + + H +C Y SY DGS + G LA E + G V +V G
Sbjct: 113 SYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFG 172
Query: 264 CGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCL--VSRGTGSSGSLVFGRE 320
CGH N+G F G +G+LGL G S+V +LG + FSYC+ + + LV G +
Sbjct: 173 CGHSNRGRFDGQQSGILGLSAGDQSIVSRLGSR----FSYCIGDLFDPHYTHNQLVLG-D 227
Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
+ + + P FYYV L G+ VG R+ I+ ++F+ T+ G GVVMD+GT
Sbjct: 228 GVKMEGSSTPF---HTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTA 284
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDT-----CY------NLSGFVSVRVPTV 428
T L + D + L R I+ T CY +L GF P +
Sbjct: 285 TFLAKDGF----DPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGF-----PEL 335
Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANG 486
+F+F+ G L L A++ + + FC A S + S+IG + Q+ +++D
Sbjct: 336 AFHFAEGADLVLDANSLFVQ-KNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGK 394
Query: 487 FVGFGPNVC 495
V F C
Sbjct: 395 RVYFQRTDC 403
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 121/369 (32%), Positives = 184/369 (49%), Gaps = 42/369 (11%)
Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGV 209
D+G + V VG PP Q + ID+GSD++WVQC+PC+ C++QS P+FDP+ S+++ +
Sbjct: 54 DRGQA-FLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDL 112
Query: 210 SCSSAVC-DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIG 263
S S +C + + H +C Y SY DGS + G LA E + G V +V G
Sbjct: 113 SYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFG 172
Query: 264 CGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCL--VSRGTGSSGSLVFGRE 320
CGH N+G F G +G+LGL G S+V +LG + FSYC+ + + LV G +
Sbjct: 173 CGHSNRGRFDGQQSGILGLSAGDQSIVSRLGSR----FSYCIGDLFDPHYTHNQLVLG-D 227
Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
+ + + P FYYV L G+ VG R+ I+ ++F+ T+ G GVVMD+GT
Sbjct: 228 GVKMEGSSTPF---HTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTA 284
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDT-----CY------NLSGFVSVRVPTV 428
T L + D + L R I+ T CY +L GF P +
Sbjct: 285 TFLAKDGF----DPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGF-----PEL 335
Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANG 486
+F+F+ G L L A++ + + FC A S + S+IG + Q+ +++D
Sbjct: 336 AFHFAEGADLVLDANSLFVQ-KNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGK 394
Query: 487 FVGFGPNVC 495
V F C
Sbjct: 395 RVYFQRTDC 403
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 118/354 (33%), Positives = 173/354 (48%), Gaps = 26/354 (7%)
Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC--YKQSDPVFDPADSASFSGV 209
G GEY + + +G+PP+ +ID+GSD+VW++C C C + +F S+S+ +
Sbjct: 1 GEGEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKL 60
Query: 210 SCSSAVCDRLENAG----CHAGRCRYEVSYGDGSYTKGTLALETLTI--------GRTVV 257
C+S C + +AG C C+Y+ YGDGS T G + + ++ R+
Sbjct: 61 PCNSTHCSGMSSAGIGPRCEE-TCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFF 119
Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS--L 315
GCG K +G + GL+GLG S SL+ QLG + G FSYCLVS + S L
Sbjct: 120 DGFLFGCGRKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFL 179
Query: 316 VFGREALPVGAAWV--PLVRNPRAP-SFYYVGLSGLGVGGMRIPI-SEDLFRLTQMGD-- 369
G A G V P++ + YYV L + VGG+ + + ++ T +G
Sbjct: 180 FLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGHNTSVGPFL 239
Query: 370 -DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
+ V+D+GT T L P YEA R + Q LP + D C+N SG S P+V
Sbjct: 240 ANKTVIDSGTTYTLLTPPVYEAMRKSIEEQV-ILPTLGNSAGLDLCFNSSGDTSYGFPSV 298
Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
+FYF+ L LP N + V C + S LSIIGN+QQ+ I +D
Sbjct: 299 TFYFANQVQLVLPFEN-IFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHILYD 351
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 121/376 (32%), Positives = 182/376 (48%), Gaps = 42/376 (11%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
+ +++G+GS ++ +ID+GS+ V VQC +S PVFDPA S S+ V C S +
Sbjct: 100 FSMQLGIGSLQKNLSAIIDTGSEAVLVQCG------SRSRPVFDPAASQSYRQVPCISQL 153
Query: 216 CDRLENAGCH---------AGRCRYEVSYGDGSYTKGTLALETLTIGRT-------VVKN 259
C ++ + + C Y +SYGD + G + + + + T ++
Sbjct: 154 CLAVQQQTSNGSSQPCVNSSATCTYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQAVQFRD 213
Query: 260 VAIGCGHKNQGMFV--GAAGLLGLGGGSMSLVGQLGGQTGGA-FSYCLVSR--GTGSSGS 314
VA GC H QG V G+ G++G G++SL QL + GG+ FSYC S+ ++G
Sbjct: 214 VAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGV 273
Query: 315 LVFGREALPVGAA-WVPLVRNPRAPS---FYYVGLSGLGVGGMRIPISEDLFRLT-QMGD 369
+ G L + PL+ NP P+ YYVGL+ + V G + I E F+L GD
Sbjct: 274 IFLGDSGLSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGD 333
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVA--QTGNLPRASGVSIFDTCYNLSGFVSVR-VP 426
G V+D+GT TR+ AY AFR+AF A ++G + + FD CYN+S S+ VP
Sbjct: 334 GGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGSSLPGVP 393
Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAG---TFCFAFAPSPSG----LSIIGNIQQEGIQI 479
V L L + +PV AG T C A S ++++GN QQ +
Sbjct: 394 EVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLV 453
Query: 480 SFDGANGFVGFGPNVC 495
+D VGF C
Sbjct: 454 EYDNERSRVGFERADC 469
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 113/357 (31%), Positives = 164/357 (45%), Gaps = 30/357 (8%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
GEY +R +G+P + + D+GSD+ W+QC PC CY Q P+FDP S+++ V C S
Sbjct: 86 GEYLMRFSLGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQEAPLFDPTQSSTYVDVPCES 145
Query: 214 AVCDRLENAGCHAG---RCRYEVSYGDGSYTKGTLALETLTI--------GRTVVKNVAI 262
C G +C Y YG S+T G L +T++ G T K+V
Sbjct: 146 QPCTLFPQNQRECGSSKQCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKSV-F 204
Query: 263 GCGHKNQGMF---VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR 319
GC + F A G +GLG G +SL QLG Q G FSYC+V + S+G L FG
Sbjct: 205 GCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIGHKFSYCMVPFSSTSTGKLKFGS 264
Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
A P + NP PS+Y + L G+ VG ++ LT +++D+
Sbjct: 265 MAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKV--------LTGQIGGNIIIDSVPI 316
Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGV-SIFDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
+T L Y F + V + N+ A + F+ C + ++ P F+F+G V+
Sbjct: 317 LTHLEQGIYTDFISS-VKEAINVEVAEDAPTPFEYC--VRNPTNLNFPEFVFHFTGADVV 373
Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
P + F+ D C PS G+SI GN Q Q+ +D V F P C
Sbjct: 374 LGPKNMFI--ALDNNLVCMTVVPS-KGISIFGNWAQVNFQVEYDLGEKKVSFAPTNC 427
>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
Length = 460
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/347 (33%), Positives = 173/347 (49%), Gaps = 27/347 (7%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
G + V +G G+P + ++ID+GSD W+QC CS + F+P+ S+S+S SC
Sbjct: 127 GLFLVNVGFGTPQQKFNLIIDTGSDTTWIQCNSCSLGNCHNKKTFNPSLSSSYSNRSCIP 186
Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFV 273
+ Y + Y D SY+KG + +T+ V GCG G F
Sbjct: 187 ST------------DTNYTMKYEDNSYSKGVFVCDEVTLKPDVFPKFQFGCGDSGGGEFG 234
Query: 274 GAAGLLGLGGGSM-SLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAW-VPL 331
A+G+LGL G SL+ Q + FSYC + + GSL+FG +A+ +
Sbjct: 235 TASGVLGLAKGEQYSLISQTASKFKKKFSYCFPPK-EHTLGSLLFGEKAISASPSLKFTQ 293
Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
+ NP + Y+V L G+ V R+ +S LF G ++D+GT +TRLPT AYEA
Sbjct: 294 LLNPPSGLGYFVELIGISVAKKRLNVSSSLF-----ASPGTIIDSGTVITRLPTAAYEAL 348
Query: 392 RDAFVAQTGNLPRASGV---SIFDTCYNLSGF--VSVRVPTVSFYFSGGPVLTLPASNFL 446
R AF + + P S + DTCYNL G ++++P + +F G ++L S L
Sbjct: 349 RTAFQQEMLHCPSISPPPQEKLLDTCYNLKGCGGRNIKLPEIVLHFVGEVDVSLHPSGIL 408
Query: 447 IPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFG 491
D C AFA +PS ++IIGN QQ +++ +D G +GFG
Sbjct: 409 WANGDLTQACLAFARKSNPSHVTIIGNRQQVSLKVVYDIEGGRLGFG 455
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 130/408 (31%), Positives = 197/408 (48%), Gaps = 39/408 (9%)
Query: 103 RHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGV 162
+ + A + + RV + R A+++ TDV S + G Y + I V
Sbjct: 7 KRSEAIRALVAKSHARVRWMAAR-----ANSSSWSSMAGTTDVESPLHPDGGGYVMDISV 61
Query: 163 GSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA 222
G+P + + D+GSD+VWVQ +PC+ C + +FDP S++F + CSS +C L +
Sbjct: 62 GTPGKRFRAIADTGSDLVWVQSEPCTGCSGGT--IFDPRQSSTFREMDCSSQLCAELPGS 119
Query: 223 GCHAGR--CRYEVSYG----DGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAA 276
C G C Y YG +G + + T++L T + G + A+GCG N G F G
Sbjct: 120 -CEPGSSTCSYSYEYGSGETEGEFARDTISLGTTSDGSQKFPSFAVGCGMVNSG-FDGVD 177
Query: 277 GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS-LVFGREALPVGAAWVPLVRNP 335
GL+GLG G +SL QL FSYCLV + S S L+FG A G P
Sbjct: 178 GLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITP 237
Query: 336 RA---PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG-VVMDTGTAVTRLPTPAYEAF 391
+ P++Y + ++G+ V G MG G ++D+GT +T +P+ Y
Sbjct: 238 PSDTYPTYYLLTVNGIAVAGQ------------TMGSPGTTIIDSGTTLTYVPSGVYGRV 285
Query: 392 RDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
+ LPR G S+ D CY+ S + + P ++ +G +T P+SN+ + VD
Sbjct: 286 LSRMESMV-TLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAGA-TMTPPSSNYFLVVD 343
Query: 451 DAG-TFCFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
D+G T C A S SGL SIIGN+ Q+G I +D + + F C
Sbjct: 344 DSGDTVCLAMG-SASGLPVSIIGNVMQQGYHILYDRGSSELSFVQAKC 390
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 127/369 (34%), Positives = 176/369 (47%), Gaps = 32/369 (8%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
V + + +G+Y +++ +G+PP Y ++D+GSD+VW QC PC CY+Q P+F+P S
Sbjct: 39 VFTRVTSNNGDYLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPCQGCYRQKSPMFEPLRSN 98
Query: 205 SFSGVSCSSAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT-----VVK 258
+++ + C S C+ L C + C Y +Y D S TKG LA ET+T T VV
Sbjct: 99 TYTPIPCDSEECNSLFGHSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPVVVG 158
Query: 259 NVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGA-FSYCLVS--RGTGSSGS 314
++ GCGH N G F G++GLGGG +SLV Q G G FS CLV + G+
Sbjct: 159 DIVFGCGHSNSGTFNENDMGIIGLGGGPLSLVSQFGNLYGSKRFSQCLVPFHADPHTLGT 218
Query: 315 LVFG--REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
+ FG + G A PLV + Y V L G+ VG + F ++M G
Sbjct: 219 ISFGDASDVSGEGVAATPLVSE-EGQTPYLVTLEGISVGDTFVS-----FNSSEMLSKGN 272
Query: 373 VM-DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCY----NLSGFVSVRVP 426
+M D+GT T LP Y+ Q+ LP + CY NL G P
Sbjct: 273 IMIDSGTPATYLPQEFYDRLVKELKVQSNMLPIDDDPDLGTQLCYRSETNLEG------P 326
Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
+ +F G V +P F+ P D G FCFA A + G I GN Q + I FD
Sbjct: 327 ILIAHFEGADVQLMPIQTFIPPKD--GVFCFAMAGTTDGEYIFGNFAQSNVLIGFDLDRK 384
Query: 487 FVGFGPNVC 495
V F C
Sbjct: 385 TVSFKATDC 393
>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 121/352 (34%), Positives = 174/352 (49%), Gaps = 18/352 (5%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
S Y V++ +G+P + + +D+ SD+ W+ C C C S+ F PA S SF VSCS
Sbjct: 96 STTYIVKVLIGTPAQPLLLAMDTSSDVAWIPCSGCVGC--PSNTAFSPAKSTSFKNVSCS 153
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG-- 270
+ C ++ N C A C + ++YG S L+ +T+ + +K GC +K G
Sbjct: 154 APQCKQVPNPACGARACSFNLTYGSSSIA-ANLSQDTIRLAADPIKAFTFGCVNKVAGGG 212
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV 329
GLLGLG G +SL+ Q FSYCL S R SGSL G + P +
Sbjct: 213 TIPPPQGLLGLGRGPLSLMSQAQSVYKSTFSYCLPSFRSLTFSGSLRLGPTSQPQRVKYT 272
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
L+RNPR S YYV L + VG + + G + D+GT TRL P YE
Sbjct: 273 QLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYE 332
Query: 390 AFRDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
A R+ F + P A S+ FDTCY SG V+VPT++F F G +T+PA N ++
Sbjct: 333 AVRNEFRKRVKP-PTAVVTSLGGFDTCY--SG--QVKVPTITFMFKGV-NMTMPADNLML 386
Query: 448 PVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
T C A A +P S +++I ++QQ+ ++ D NG +G C
Sbjct: 387 HSTAGSTSCLAMASAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERC 438
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 116/354 (32%), Positives = 172/354 (48%), Gaps = 26/354 (7%)
Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC--YKQSDPVFDPADSASFSGV 209
G GEY + + +G+PP+ +ID+GSD+VW++C C C + +F S+S+ +
Sbjct: 1 GEGEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKL 60
Query: 210 SCSSAVCDRLENAG----CHAGRCRYEVSYGDGSYTKGTLALETLTI--------GRTVV 257
C+S C + +AG C C+Y+ YGDGS T G + + ++ R+
Sbjct: 61 PCNSTHCSGMSSAGIGPRCEE-TCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFF 119
Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS--L 315
GC K +G + GL+GLG S SL+ QLG + G FSYCLVS + S L
Sbjct: 120 DGFLFGCARKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFL 179
Query: 316 VFGREALPVGAAWV--PLVRNPRAP-SFYYVGLSGLGVGGMRIPI-SEDLFRLTQMGD-- 369
G A G V P++ + YYV L + +GG+ + + ++ T +G
Sbjct: 180 FLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYDKESGHNTSVGPFL 239
Query: 370 -DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
+ V+D+GT T L P YEA R + Q LP + D C+N SG S P+V
Sbjct: 240 ANKTVIDSGTTYTLLTPPVYEAMRKSIEEQV-ILPTLGNSAGLDLCFNSSGDTSYGFPSV 298
Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
+FYF+ L LP N + V C + S LSIIGN+QQ+ I +D
Sbjct: 299 TFYFANQVQLVLPFEN-IFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHILYD 351
>gi|356515690|ref|XP_003526531.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 164 bits (416), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 123/359 (34%), Positives = 173/359 (48%), Gaps = 16/359 (4%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
+ SG G Y VR+ +G+P + +MV+D+ +D +V C C+ C SD F P S
Sbjct: 88 IASGQAFNIGNYVVRVKLGTPGQLLFMVLDTSTDEAFVPCSGCTGC---SDTTFSPKAST 144
Query: 205 SFSGVSCSSAVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA 261
S+ + CS C ++ C A G C + SY S++ TL + L + V+ +
Sbjct: 145 SYGPLDCSVPQCGQVRGLSCPATGTGACSFNQSYAGSSFS-ATLVQDALRLATDVIPYYS 203
Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGRE 320
GC + G V A GLLGLG G +SL+ Q G G FSYCL S + SGSL G
Sbjct: 204 FGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFSYCLPSFKSYYFSGSLKLGPV 263
Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
P PL+R+P PS YYV +G+ VG + +P + G ++D+GT +
Sbjct: 264 GQPKSIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFPSEYLGFNPNTGSGTIIDSGTVI 323
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
TR P Y A R+ F Q G S + FDTC+ + P ++ +F G L L
Sbjct: 324 TRFVEPVYNAVREEFRKQVGGTTFTS-IGAFDTCFVKT--YETLAPPITLHFEGLD-LKL 379
Query: 441 PASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
P N LI C A A +P S L++I N QQ+ ++I FD N VG VC
Sbjct: 380 PLENSLIHSSAGSLACLAMAAAPDNVNSVLNVIANFQQQNLRILFDIVNNKVGIAREVC 438
>gi|359476206|ref|XP_002262837.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 462
Score = 164 bits (415), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 124/389 (31%), Positives = 190/389 (48%), Gaps = 32/389 (8%)
Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
+D RV ++ R+ G + E +D G+ G + V +G G P ++ ++I
Sbjct: 90 QDRSRVRSINARILG---QYSTEESKDGGSPESMHSLNEDGFFLVNVGFGKPQQNLNLII 146
Query: 174 DSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRY 231
D+GSD W++C CS C+ + P F+P+ S+S+S SC + + Y
Sbjct: 147 DTGSDTTWIRCNSCSLGNCHNKKIPTFNPSLSSSYSNRSCIPST------------KTNY 194
Query: 232 EVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSM-SLVG 290
++Y D SY+KG + +T+ V GCG G F A+G+LGL G SL+
Sbjct: 195 TMNYEDNSYSKGVFVCDEVTLKPDVFPKFQFGCGDSGGGDFGSASGVLGLAQGEQYSLIS 254
Query: 291 QLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAW-VPLVRNPRAPSFYYVGLSGLG 349
Q + FSYC + GSL+FG +A+ + + NP + S Y+V L G+
Sbjct: 255 QTASKFKKKFSYCF-PHNENTRGSLLFGEKAISASPSLKFTRLLNPSSGSVYFVELIGIS 313
Query: 350 VGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV- 408
V R+ +S LF G ++D+GT +T LPT AYEA R AF + + P S
Sbjct: 314 VAKKRLNVSSSLF-----ASPGTIIDSGTVITHLPTAAYEALRTAFQQEMLHCPSVSPPP 368
Query: 409 --SIFDTCYNLSGF--VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS-- 462
DTCYNL G ++++P + +F G ++L S L D C AFA
Sbjct: 369 QEKPLDTCYNLKGCGGRNIKLPEIVLHFVGEVDVSLHPSGILWANGDLTQACLAFARKSH 428
Query: 463 PSGLSIIGNIQQEGIQISFDGANGFVGFG 491
PS ++IIGN QQ +++ +D G +GFG
Sbjct: 429 PSHVTIIGNRQQVSLKVVYDIEGGRLGFG 457
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 122/388 (31%), Positives = 184/388 (47%), Gaps = 35/388 (9%)
Query: 137 EVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC-----QPCSQCY 191
E F + SG G+G+YFVR+ VG+P + +V D+GSD+ WV+C S
Sbjct: 85 ESSAFAMPLTSGAYTGTGQYFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAA 144
Query: 192 KQSDPVFDPADSASFSGVSCSSAVCD-----RLENAGCHAGRCRYEVSYGDGSYTKGTLA 246
VF PA S S+S + C S C L N C Y+ Y D S +G +
Sbjct: 145 SPPQRVFRPAGSKSWSPLPCDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVG 204
Query: 247 LETLTIG--------RTVVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTG 297
L++ T+ + ++ V +GC G F + G+L LG ++S + + G
Sbjct: 205 LDSATVSLSGNDGTRKAKLQEVVLGCTTSYDGQSFKSSDGVLSLGNSNISFASRAASRFG 264
Query: 298 GAFSYCLVSR--GTGSSGSLVFGREALPVGAAW----VPLV--RNPRAPSFYYVGLSGLG 349
G FSYCLV ++ L FG G PLV + R FY+V + +
Sbjct: 265 GRFSYCLVDHLAPRNATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVT 324
Query: 350 VGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS 409
V G R+ I D++ + G G ++D+GT++T L TPAY+A A Q +PR + +
Sbjct: 325 VAGERLEILPDVWDFRKNG--GAILDSGTSLTILATPAYDAVVKAISKQFAGVPRVN-MD 381
Query: 410 IFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFAP-SPSGLS 467
F+ CYN +G VS +P + F+G L P +++I D A G C + G+S
Sbjct: 382 PFEYCYNWTG-VSAEIPRMELRFAGAATLAPPGKSYVI--DTAPGVKCIGVVEGAWPGVS 438
Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
+IGNI Q+ FD AN ++ F + C
Sbjct: 439 VIGNILQQEHLWEFDLANRWLRFKQSRC 466
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 120/369 (32%), Positives = 181/369 (49%), Gaps = 42/369 (11%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCD 217
+++G+GS ++ +ID+GS+ V VQC +S PVFDPA S S+ V C S +C
Sbjct: 1 MQLGIGSLQKNLSAIIDTGSEAVLVQCG------SRSRPVFDPAASQSYRQVPCISQLCL 54
Query: 218 RLENAGCH---------AGRCRYEVSYGDGSYTKGTLALETLTIGRT-------VVKNVA 261
++ + + C Y +SYGD + G + + + + T ++VA
Sbjct: 55 AVQQQTSNGSSQPCVNSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRDVA 114
Query: 262 IGCGHKNQGMFV--GAAGLLGLGGGSMSLVGQLGGQTGGA-FSYCLVSR--GTGSSGSLV 316
GC H QG V G+ G++G G++SL QL + GG+ FSYC S+ ++G +
Sbjct: 115 FGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIF 174
Query: 317 FGREALPVG-AAWVPLVRNPRAPS---FYYVGLSGLGVGGMRIPISEDLFRLT-QMGDDG 371
G L ++ PL+ NP P+ YYVGL+ + V G + I E F+L GD G
Sbjct: 175 LGDSGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGG 234
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVA--QTGNLPRASGVSIFDTCYNLSGFVSVR-VPTV 428
V+D+GT TR+ AY AFR+AF A ++G + + FD CYN+S S+ VP V
Sbjct: 235 TVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEV 294
Query: 429 SFYFSGGPVLTLPASNFLIPVDDAG---TFCFAFAPSPSG----LSIIGNIQQEGIQISF 481
L L + +PV AG T C A S ++++GN QQ + +
Sbjct: 295 RLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLVEY 354
Query: 482 DGANGFVGF 490
D VGF
Sbjct: 355 DNERSRVGF 363
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 125/364 (34%), Positives = 183/364 (50%), Gaps = 36/364 (9%)
Query: 143 TDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPAD 202
TDV S + G Y + I VG+P + + D+GSD+VWVQ +PC+ C + +FDP
Sbjct: 42 TDVESPLHPDGGGYVMDISVGTPGKRFRAIADTGSDLVWVQSEPCTGCSGGT--IFDPRQ 99
Query: 203 SASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRT----- 255
S++F + CSS +C L + C G C Y YG G T+G A +T+++G T
Sbjct: 100 SSTFREMDCSSQLCTELPGS-CEPGSSACSYSYEYGSGE-TEGEFARDTISLGTTSGGSQ 157
Query: 256 VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS- 314
+ A+GCG N G F G GL+GLG G +SL QL FSYCLV + S S
Sbjct: 158 KFPSFAVGCGMVNSG-FDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSP 216
Query: 315 LVFGREALPVGAAWVPLVRNPRA---PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
L+FG A G P + P++Y + ++G+ V G MG G
Sbjct: 217 LLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQ------------TMGSPG 264
Query: 372 -VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVS 429
++D+GT +T +P+ Y + LPR G S+ D CY+ S + + P ++
Sbjct: 265 TTIIDSGTTLTYVPSGVYGRVLSRMESMV-TLPRVDGSSMGLDLCYDRSSNRNYKFPALT 323
Query: 430 FYFSGGPVLTLPASNFLIPVDDAG-TFCFAFAPSPSGL--SIIGNIQQEGIQISFDGANG 486
+G +T P+SN+ + VDD+G T C A S GL SIIGN+ Q+G I +D +
Sbjct: 324 IRLAGA-TMTPPSSNYFLVVDDSGDTVCLAMG-SAGGLPVSIIGNVMQQGYHILYDRGSS 381
Query: 487 FVGF 490
+ F
Sbjct: 382 ELSF 385
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 115/370 (31%), Positives = 188/370 (50%), Gaps = 36/370 (9%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC-S 212
GEY+ I +GSP + +++D+GS++ W+QC PC C D ++D A SAS+ V+C +
Sbjct: 98 GEYYTSIKLGSPGQEAILIVDTGSELTWLQCLPCKVCAPSVDTIYDAARSASYRPVTCNN 157
Query: 213 SAVCDRLEN---AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGRTV------VKNVAI 262
S +C A C G +C++ YGDGS++ G+L+ +TL + V V++ A
Sbjct: 158 SQLCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAF 217
Query: 263 GCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT--GSSGSLVFGR 319
GC + + GA+G+LGL G M+L QLG + G FS+C R + S+G + FG
Sbjct: 218 GCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSHLNSTGVVFFGN 277
Query: 320 EALP---VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
LP V V L + FY+V L G+ + S +L L + V++D+
Sbjct: 278 AELPHEQVQYTSVALTNSELQRKFYHVALKGVSIN------SHELVFLPR--GSVVILDS 329
Query: 377 GTAVTRLPTPAYEAFRDAFVA-QTGNLPRASGVSIFD--TCYNLSG----FVSVRVPTVS 429
G++ + P + R+AF+ + +L G S D TC+ +S + +P++S
Sbjct: 330 GSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLPSLS 389
Query: 430 FYFSGGPVLTLPASNFLIPV---DDAGTFCFAFAP-SPSGLSIIGNIQQEGIQISFDGAN 485
F G + +P+ L+PV + CFAF P+ +++IGN QQ+ + + +D
Sbjct: 390 LVFEDGVTIGIPSIGVLLPVARFQNHVKMCFAFEDGGPNPVNVIGNYQQQNLWVEYDIQR 449
Query: 486 GFVGFGPNVC 495
VGF C
Sbjct: 450 SRVGFARASC 459
>gi|125602787|gb|EAZ42112.1| hypothetical protein OsJ_26672 [Oryza sativa Japonica Group]
Length = 477
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 110/346 (31%), Positives = 159/346 (45%), Gaps = 70/346 (20%)
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC------ 224
+++D+GSD+ WVQC+PCS CY Q DP+FDP+ SAS++ V C+++ C+ A
Sbjct: 178 VIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSC 237
Query: 225 ----------HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVG 274
+ RC Y ++YGDGS+++G LA +T+ +G V GCG N+G+F G
Sbjct: 238 ATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGASVDGFVFGCGLSNRGLFGG 297
Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRN 334
AGL+GLG G L G LP GA
Sbjct: 298 TAGLMGLGPD-----GALAG---------------------------LPDGAP------- 318
Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
P FY++ ++G V +G V++D+GT +TRL Y A R
Sbjct: 319 ---PPFYFMNVTGASV-------GGAAVAAAGLGAANVLLDSGTVITRLAPSVYRAVRAE 368
Query: 395 FVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL-IPVDD 451
F Q G P A S+ D CYNL+G V+VP ++ GG +T+ A+ L + D
Sbjct: 369 FARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEGGADMTVDAAGMLFMARKD 428
Query: 452 AGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C A A IIGN QQ+ ++ +D +GF C
Sbjct: 429 GSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 474
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 118/370 (31%), Positives = 180/370 (48%), Gaps = 36/370 (9%)
Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQ----SDPVFDPADSAS 205
DQG + + +G+ P + +++D+GSD++W QC+ S S PV+DP +S++
Sbjct: 13 DQG---HSLTVGIVQPRK---LIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGESST 66
Query: 206 FSGVSCSSAVCD--RLENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIG--RTVVKNV 260
F+ + CS +C + C + RC YE YG + G LA ET T G R V +
Sbjct: 67 FAFLPCSDRLCQEGQFSFKNCTSKNRCVYEDVYGSAAAV-GVLASETFTFGARRAVSLRL 125
Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG-- 318
GCG + G +GA G+LGL S+SL+ QL Q FSYCL + L+FG
Sbjct: 126 GFGCGALSAGSLIGATGILGLSPESLSLITQLKIQR---FSYCLTPFADKKTSPLLFGAM 182
Query: 319 ----REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
R +V NP +YYV L G+ +G R+ + + G G ++
Sbjct: 183 ADLSRHKTTRPIQTTAIVSNPVETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIV 242
Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS-GVSIFDTCYNL------SGFVSVRVPT 427
D+G+ V L A+EA ++A V LP A+ V ++ C+ L + +V+VP
Sbjct: 243 DSGSTVAYLVEAAFEAVKEA-VMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPP 301
Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFDGAN 485
+ +F GG + LP N+ AG C A + SG+SIIGN+QQ+ + + FD +
Sbjct: 302 LVLHFDGGAAMVLPRDNYFQE-PRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQH 360
Query: 486 GFVGFGPNVC 495
F P C
Sbjct: 361 HKFSFAPTQC 370
>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
Length = 453
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 121/375 (32%), Positives = 176/375 (46%), Gaps = 44/375 (11%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
GEY V++G+G+P ID+ SD+VW+QCQPC CY+Q DP+F+P S+S++ V CSS
Sbjct: 86 GEYLVKLGIGTPQHYFSAAIDTASDLVWLQCQPCVSCYRQLDPIFNPRLSSSYAVVPCSS 145
Query: 214 AVCDRLENAGCHAGR---CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQ- 269
C +L+ C CRY Y + T GTLA++ L +G V V +GC +
Sbjct: 146 DTCSQLDGHRCDEDDDQACRYNYKYSGNAVTNGTLAIDKLAVGGNVFHAVVLGCSDSSVG 205
Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA--- 326
G A+GL+GL G +SL+ QL + F YCL + + G LV G A GA
Sbjct: 206 GPPPQASGLVGLARGPLSLLSQLSVRR---FMYCLPPPMSRTPGKLVLGAGA---GADAV 259
Query: 327 ------AWVPLVRNPRAPSFYYVGLSGLGVGG-----MRIPISEDLFRLTQMGDD----- 370
V + + R PS+YY+ GL VG +R P S G
Sbjct: 260 RNVSDRVTVTMSSSTRYPSYYYLNFDGLAVGDQTPGTIRRPTSPPATGGGVGGGGGDGGS 319
Query: 371 -----GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLS---GF 420
G+++D + ++ L Y+ D + LPRA+ + D C+ L G
Sbjct: 320 GANAYGMIVDVASTISFLEASLYDELADDLEEEI-RLPRATPSTRLGLDLCFILPEGVGI 378
Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQIS 480
V VPTVS F G L L + +D C + SG+SI+GN QQ+ + +
Sbjct: 379 DRVYVPTVSMSFDGR-WLELERDRLFL--EDGRMMCLMIGRT-SGVSILGNYQQQNMHVL 434
Query: 481 FDGANGFVGFGPNVC 495
++ G + F C
Sbjct: 435 YNLRRGKITFAKASC 449
>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
Length = 328
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 86/214 (40%), Positives = 128/214 (59%), Gaps = 23/214 (10%)
Query: 163 GSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC-DRLEN 221
GSP + +++D+GSD+ WVQC+PCS CY Q DP+FDPA SA+++ V C+++ C D L
Sbjct: 103 GSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACADSLRA 162
Query: 222 A----------GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
A G + +C Y ++YGDGS+++G LA +T+ +G + GCG N+G+
Sbjct: 163 ATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGASLGGFVFGCGLSNRGL 222
Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG-SSGSLVFG---------REA 321
F G AGL+GLG +SLV Q + GG FSYCL + +G +SGSL G R
Sbjct: 223 FGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSLSLGGGDDAASSYRNT 282
Query: 322 LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
PV A+ ++ +P P FY++ ++G VGG +
Sbjct: 283 TPV--AYTRMIADPAQPPFYFLNVTGAAVGGTAL 314
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 128/414 (30%), Positives = 193/414 (46%), Gaps = 30/414 (7%)
Query: 92 SSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVS-GMD 150
+ TT +++ + H R+ R + + + S + + ++ TD V MD
Sbjct: 40 TDTTTAAINFTQAALESHRRLSFLASRSSQVDKPQSSSASQLSNND-----TDTVPLRMD 94
Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
G G Y + +G+PP+ + D+GSD++W +C + P S++F+ +
Sbjct: 95 GGGGAYDMEFSIGTPPQKLTALADTGSDLIWTKCDAGGGAAWGGSSSYHPNASSTFTRLP 154
Query: 211 CSSAVCDRLEN---AGCHAG--RCRYEVSYG---DGSYTKGTLALETLTIGRTVVKNVAI 262
CS +C L + A C AG C Y+ +YG D +T+G L ET T+G V V
Sbjct: 155 CSDRLCAALRSYSLARCAAGGAECDYKYAYGLGDDPDFTQGFLGSETFTLGGDAVPGVGF 214
Query: 263 GCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL 322
GC +G + AGL+GLG G +SLV QL G F YCL + + +S L+FG A
Sbjct: 215 GCTTALEGDYGEGAGLVGLGRGPLSLVSQL---DAGTFMYCLTADASKAS-PLLFGALAT 270
Query: 323 PVGA-AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
GA A V + +FY V L + +G VV D+GT +T
Sbjct: 271 MTGAGAGVQSTGLLASTTFYAVNLRSITIGSATTAGVGGPGG--------VVFDSGTTLT 322
Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
L PAY + AF++QT +L G F+ CY + +P + +F GG + LP
Sbjct: 323 YLAEPAYTEAKAAFLSQTTSLTPVEGRYGFEACYEKPDSARL-IPAMVLHFDGGADMALP 381
Query: 442 ASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+N+++ VDD G C+ SPS LSIIGNI Q + D + F P C
Sbjct: 382 VANYVVEVDD-GVVCWVVQRSPS-LSIIGNIMQMNYLVLHDVRKSVLSFQPANC 433
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 114/367 (31%), Positives = 179/367 (48%), Gaps = 32/367 (8%)
Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC 211
G + + + +G+PP+ + +++D+GSD++W QC+ + P++DPA S+SF+ C
Sbjct: 85 GRLHHTLTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQHREKPLYDPAKSSSFAAAPC 144
Query: 212 SSAVCD--RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG--RTVVKNVAIGCGHK 267
+C+ C +C Y +YG + TKG LA ET T G R V ++ GCG
Sbjct: 145 DGRLCETGSFNTKNCSRNKCIYTYNYGSAT-TKGELASETFTFGEHRRVSVSLDFGCGKL 203
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL---VSRGT------GSSGSLVFG 318
G GA+G+LG+ +SLV QL FSYCL + R T G+ L
Sbjct: 204 TSGSLPGASGILGISPDRLSLVSQLQIPR---FSYCLTPFLDRNTTSHIFFGAMADLSKY 260
Query: 319 REALPVGAAWVPLVRNPRAPS-FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
R P+ LV NP + +YYV L G+ VG R+ + F + + G G +D+G
Sbjct: 261 RTTGPIQT--TSLVTNPDGSNYYYYVPLIGISVGTKRLNVPVSSFAIGRDGSGGTFVDSG 318
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS---IFDTCYNLS----GFV--SVRVPTV 428
LP+ EA ++A V + LP + ++ C+ L G V +V+VP +
Sbjct: 319 DTTGMLPSVVMEALKEAMV-EAVKLPVVNATDHGYEYELCFQLPRNGGGAVETAVQVPPL 377
Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
++F GG + L ++++ V AG C + G +IIGN QQ+ + + FD N
Sbjct: 378 VYHFDGGAAMLLRRDSYMVEV-SAGRMCLVISSGARG-AIIGNYQQQNMHVLFDVENHEF 435
Query: 489 GFGPNVC 495
F P C
Sbjct: 436 SFAPTQC 442
>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
vinifera]
Length = 451
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 131/444 (29%), Positives = 203/444 (45%), Gaps = 41/444 (9%)
Query: 71 NTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGG 130
N + + L+++H S + + A+ + ++ +++LV R S
Sbjct: 29 NCETPDQGSTLQVLHVYSPCSPFRPKEPLSWEESVLQMQAKDKARLQFLSSLVARKSVVP 88
Query: 131 ADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC 190
+ + VQ+ Y VR +G+P ++ M +D+ SD+ W+ PC+ C
Sbjct: 89 IASGRQIVQN-------------PTYIVRAKIGTPAQTMLMAMDTSSDVAWI---PCNGC 132
Query: 191 YKQSDPVFDPADSASFSGVSCSSAVCDRL--------------ENAGCHAGRCRYEVSYG 236
S +F+ S ++ + C +A C ++ C G C + ++YG
Sbjct: 133 LGCSSTLFNSPASTTYKSLGCQAAQCKQVLHLLSPLLTSPSVVPKPTCGGGVCSFNLTYG 192
Query: 237 DGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT 296
GS L+ +T+T+ V + GC K G + A GLLGLG G +SL+ Q
Sbjct: 193 -GSSLAANLSQDTITLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLY 251
Query: 297 GGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
FSYCL S + SGSL G P + PL++NPR PS Y+V L + VG +
Sbjct: 252 QSTFSYCLPSFKSLNFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVV 311
Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCY 415
+ F G + D+GT TRL TPAY A RDAF + G + + FDTCY
Sbjct: 312 DVPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTSLGGFDTCY 371
Query: 416 NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGN 471
V + PT++F F+G V TLP N LI T C A A +P S L++I N
Sbjct: 372 T----VPIAAPTITFMFTGMNV-TLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIAN 426
Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
+QQ+ ++ +D N +G +C
Sbjct: 427 LQQQNHRLLYDVPNSRLGVARELC 450
>gi|357465299|ref|XP_003602931.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355491979|gb|AES73182.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 120/361 (33%), Positives = 179/361 (49%), Gaps = 20/361 (5%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
+ SG G Y VR+ +G+P + +MV+D+ +D ++ P S C S F P S
Sbjct: 87 IASGQAFNIGNYIVRVKIGTPGQLLFMVLDTSTDEAFI---PSSGCIGCSATTFSPNAST 143
Query: 205 SFSGVSCSSAVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA 261
S+ + CS C ++ C A G C + SY +Y+ TL ++L + V+ + +
Sbjct: 144 SYVPLECSVPQCSQVRGLSCPATGSGACSFNKSYAGSTYS-ATLVQDSLRLATDVIPSYS 202
Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGRE 320
G + G + A GLLGLG G +SL+ Q G G FSYCL S + SGSL G
Sbjct: 203 FGSINAISGSSIPAQGLLGLGRGPLSLLSQTGSLYSGVFSYCLPSFKSYYFSGSLKLGPV 262
Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
P PL+RNPR PS Y+V L+G+ VG + +P ++L G ++D+GT +
Sbjct: 263 GQPKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFPKELLAFDVNTGSGTIIDSGTVI 322
Query: 381 TRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
TR P Y A RD F Q TG S + FDTC+ + + ++ P ++ +F+ L
Sbjct: 323 TRFVEPVYNAVRDEFRKQVTGPF---SSLGAFDTCF-VKNYETL-APAITLHFTDLD-LK 376
Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSG-----LSIIGNIQQEGIQISFDGANGFVGFGPNV 494
LP N LI C A A +P L++I N QQ+ +++ FD N VG +
Sbjct: 377 LPLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQNLRVLFDTVNNKVGIAREL 436
Query: 495 C 495
C
Sbjct: 437 C 437
>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
Length = 439
Score = 162 bits (409), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 120/353 (33%), Positives = 173/353 (49%), Gaps = 20/353 (5%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
S Y V+ +G+P + + +D+ SD+ W+ C C C S+ F PA S SF VSCS
Sbjct: 96 STTYIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGC--PSNTAFSPAKSTSFKNVSCS 153
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG-- 270
+ C ++ N C A C + ++YG S L+ +T+ + +K GC +K G
Sbjct: 154 APQCKQVPNPTCGARACSFNLTYGSSSIA-ANLSQDTIRLAADPIKAFTFGCVNKVAGGG 212
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV 329
GLLGLG G +SL+ Q FSYCL S R SGSL G + P +
Sbjct: 213 TIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLGPTSQPQRVKYT 272
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
L+RNPR S YYV L + VG + + G + D+GT TRL P YE
Sbjct: 273 QLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYE 332
Query: 390 AFRDAFVAQTGNLPRASGVSI---FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
A R+ F + P + V+ FDTCY SG V+VPT++F F G +T+PA N +
Sbjct: 333 AVRNEFRKRVK--PTTAVVTSLGGFDTCY--SG--QVKVPTITFMFKGV-NMTMPADNLM 385
Query: 447 IPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ T C A A +P S +++I ++QQ+ ++ D NG +G C
Sbjct: 386 LHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERC 438
>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 455
Score = 162 bits (409), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 120/353 (33%), Positives = 173/353 (49%), Gaps = 20/353 (5%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
S Y V+ +G+P + + +D+ SD+ W+ C C C S+ F PA S SF VSCS
Sbjct: 112 STTYIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGC--PSNTAFSPAKSTSFKNVSCS 169
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG-- 270
+ C ++ N C A C + ++YG S L+ +T+ + +K GC +K G
Sbjct: 170 APQCKQVPNPTCGARACSFNLTYGSSSIA-ANLSQDTIRLAADPIKAFTFGCVNKVAGGG 228
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV 329
GLLGLG G +SL+ Q FSYCL S R SGSL G + P +
Sbjct: 229 TIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLGPTSQPQRVKYT 288
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
L+RNPR S YYV L + VG + + G + D+GT TRL P YE
Sbjct: 289 QLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYE 348
Query: 390 AFRDAFVAQTGNLPRASGVSI---FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
A R+ F + P + V+ FDTCY SG V+VPT++F F G +T+PA N +
Sbjct: 349 AVRNEFRKRVK--PTTAVVTSLGGFDTCY--SG--QVKVPTITFMFKGV-NMTMPADNLM 401
Query: 447 IPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ T C A A +P S +++I ++QQ+ ++ D NG +G C
Sbjct: 402 LHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERC 454
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 161 bits (408), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 145/479 (30%), Positives = 217/479 (45%), Gaps = 66/479 (13%)
Query: 54 MSQYNELFERHNNISSSNTSSDEARWNLELVHRDKM--SSSSNTTNNMHYHRH------- 104
M Y+ L R + S S + + K+ SSSS T ++ HRH
Sbjct: 15 MITYHALVARAGDEKSYKVLSASSLKPGAVCAEPKVRDSSSSGATVPLN-HRHGPCSPVP 73
Query: 105 -----QHSFHARMQRDVKRVATLVRRLSG------GGADAAKHEVQDFGTDVVSGMDQGS 153
Q +F ++RD R + R+ S GG ++ V + G +
Sbjct: 74 SGKKKQPTFTELLRRDQLRANYIQRQFSDEHYPRTGGLQQSEATVP-----IALGSLLNT 128
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
EY + + +GSP + M ID+GSD+ W++C+ ++DP S++++ SCS+
Sbjct: 129 LEYVITVSIGSPAVAXTMFIDTGSDVSWLRCK---------SRLYDPGTSSTYAPFSCSA 179
Query: 214 AVCDRL--ENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT---VVKNVAIGCGHK 267
C +L GC +G C Y V YGDGS T GT +TLT+ T ++ GC
Sbjct: 180 PACAQLGRRGTGCSSGSTCVYSVKYGDGSNTTGTYGSDTLTLAGTSEPLISGFQFGCSAV 239
Query: 268 NQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG--REALPV 324
G GL+GLGG + S V Q G AFSYCL SSG L G +
Sbjct: 240 EHGFEEDNTDGLMGLGGDAQSFVSQTAATYGSAFSYCLPPTWN-SSGFLTLGAPSSSTSA 298
Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
+ P++R+ +A +FY + L G+ VGG + I +F G ++D+GT +TRLP
Sbjct: 299 AFSTTPMLRSKQAATFYGLLLRGISVGGKTLEIPSSVF------SAGSIVDSGTVITRLP 352
Query: 385 TPAYEAFRDAF---VAQTGNLPRASGVSIFDTCYNLSGF---VSVRVPTVSFYFSGGPVL 438
AY A AF +A+ P A+ + DTC++ +G + VP+V+ GG V+
Sbjct: 353 PTAYGALSAAFRDGMARYQYQP-AAPRGLLDTCFDFTGHGEGNNFTVPSVALVLDGGAVV 411
Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L + V D C AFA + IIGN+QQ ++ +D GF P C
Sbjct: 412 DLHPNGI---VQDG---CLAFAATDDDGRTGIIGNVQQRTFEVLYDVGQSVFGFRPGAC 464
>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
Length = 423
Score = 161 bits (408), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 109/350 (31%), Positives = 159/350 (45%), Gaps = 38/350 (10%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
Y R G+G+P ++ + ID +D WV C C+ C S P F P S+++ V C S
Sbjct: 101 NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGC-AASSPSFSPTQSSTYRTVPCGSP 159
Query: 215 VCDRLENAGCHAG---RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
C ++ + C AG C + ++Y ++ + L ++L + VV + GC G
Sbjct: 160 QCAQVPSPSCPAGVGSSCGFNLTYAASTF-QAVLGQDSLALENNVVVSYTFGCLRVVNGN 218
Query: 272 FVGAAGLLGLGG-GSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVP 330
AAG L ++ LV G G P P
Sbjct: 219 SRAAAGAHRLRPRAALLLVADQGH----------------------LGPIGQPKRIKTTP 256
Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
L+ NP PS YYV + G+ VG + + + + G ++D GT TRL P Y A
Sbjct: 257 LLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTRLAAPVYAA 316
Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
RDAF + P A + FDTCYN V+V VPTV+F F+G +TLP N +I
Sbjct: 317 VRDAFRGRV-RTPVAPPLGGFDTCYN----VTVSVPTVTFMFAGAVAVTLPEENVMIHSS 371
Query: 451 DAGTFCFAFAPSPS-----GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
G C A A PS L+++ ++QQ+ ++ FD ANG VGF +C
Sbjct: 372 SGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELC 421
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 161 bits (407), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 113/370 (30%), Positives = 187/370 (50%), Gaps = 36/370 (9%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC-S 212
GEY+ I +GSP + +++D+GS++ W++C PC C D ++D A S S+ V+C +
Sbjct: 98 GEYYTSIKLGSPGQEAILIVDTGSELTWLKCLPCKVCAPSVDTIYDAARSVSYKPVTCNN 157
Query: 213 SAVCDRLEN---AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGRTV------VKNVAI 262
S +C A C G +C++ YGDGS++ G+L+ +TL + V V++ A
Sbjct: 158 SQLCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAF 217
Query: 263 GCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT--GSSGSLVFGR 319
GC + + GA+G+LGL G M+L QLG + G FS+C R + S+G + FG
Sbjct: 218 GCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSHLNSTGVVFFGN 277
Query: 320 EALP---VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
LP V V L + FY+V L G+ + S +L L + V++D+
Sbjct: 278 AELPHEQVQYTSVALTNSELQRKFYHVALKGVSIN------SHELVLLPR--GSVVILDS 329
Query: 377 GTAVTRLPTPAYEAFRDAFVA-QTGNLPRASGVSIFD--TCYNLSG----FVSVRVPTVS 429
G++ + P + R+AF+ + +L G S D TC+ +S + +P++S
Sbjct: 330 GSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLPSLS 389
Query: 430 FYFSGGPVLTLPASNFLIPV---DDAGTFCFAFAP-SPSGLSIIGNIQQEGIQISFDGAN 485
F G + +P+ L+PV + CFAF P+ +++IGN QQ+ + + +D
Sbjct: 390 LVFEDGVTIGIPSIGVLLPVARYQNHVKMCFAFEDGGPNPVNVIGNYQQQNLWVEYDIQR 449
Query: 486 GFVGFGPNVC 495
VGF C
Sbjct: 450 SRVGFARASC 459
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 161 bits (407), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 90/217 (41%), Positives = 123/217 (56%), Gaps = 13/217 (5%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
+Y + + +G+PP Y D+GSD++W+QC PC+ CYKQ +P+FD S++FS ++C S
Sbjct: 58 DYLMELSIGTPPVKIYAQADTGSDLIWLQCIPCTNCYKQLNPMFDSQSSSTFSNIACGSE 117
Query: 215 VCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTV-----VKNVAIGCGHK 267
C +L + C + C+Y SY DGS T+G LA ETLT+ T K V GCGH
Sbjct: 118 SCSKLYSTSCSPDQINCKYNYSYVDGSETQGVLAQETLTLTSTTGEPVAFKGVIFGCGHN 177
Query: 268 NQGMFVGAA-GLLGLGGGSMSLVGQLGGQTGG-AFSYCLVSRGTGSSGS--LVFGR--EA 321
N G F G++GLG G +SLV Q+G GG FS CLV T S S + FG+ E
Sbjct: 178 NNGAFNDKEMGIIGLGRGPLSLVSQIGSSLGGNMFSQCLVPFNTNPSISSPMSFGKGSEV 237
Query: 322 LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPIS 358
L G PLV SFY+V L G+ V + +P +
Sbjct: 238 LGNGVVSTPLVSKTTYQSFYFVTLLGISVEDINLPFN 274
>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 756
Score = 161 bits (407), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 111/343 (32%), Positives = 159/343 (46%), Gaps = 37/343 (10%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
Y +++ VG+PP ID+GSD++W QC PC CY Q DP+FDP+ S++F+
Sbjct: 82 YLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDCYSQFDPIFDPSKSSTFN-------- 133
Query: 216 CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCG----- 265
CH C YE+ Y D +Y+KG LA ET+TI T V+ IGCG
Sbjct: 134 -----EQRCHGKSCHYEIIYEDNTYSKGILATETVTIHSTSGEPFVMAETTIGCGLHNTD 188
Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
N G ++G++GL G SL+ Q+ G SYC +GT + FG A+ G
Sbjct: 189 LDNSGFASSSSGIVGLNMGPRSLISQMDLPYPGLISYCFSGQGT---SKINFGTNAIVAG 245
Query: 326 AAWVPL-VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
V + + FYY+ L + V RI E L D +V+D+G+ VT P
Sbjct: 246 DGTVAADMFIKKDNPFYYLNLDAVSVEDNRI---ETLGTPFHAEDGNIVIDSGSTVTYFP 302
Query: 385 TPAYEAFRDAF--VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPA 442
R A V +P SG + CY S + + P ++ +FSGG L L
Sbjct: 303 VSYCNLVRKAVEQVVTAVRVPDPSGNDML--CY-FSETIDI-FPVITMHFSGGADLVLDK 358
Query: 443 SNFLIPVDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGA 484
N + + G FC A SP+ +I GN Q + +D +
Sbjct: 359 YNMYMESNSGGLFCLAIICNSPTQEAIFGNRAQNNFLVGYDSS 401
Score = 154 bits (390), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 107/354 (30%), Positives = 158/354 (44%), Gaps = 37/354 (10%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
Y +++ VG+PP ID+GSDI+W QC PC CY Q P+FDP+ S++F
Sbjct: 421 YLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAPIFDPSKSSTF--------- 471
Query: 216 CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKN-- 268
C+ C YE+ Y D +Y+KG LA ET+TI T V+ IGCG N
Sbjct: 472 ----REQRCNGNSCHYEIIYADKTYSKGILATETVTIPSTSGEPFVMAETKIGCGLDNTN 527
Query: 269 ---QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
G ++G++GL G +SL+ Q+ G SYC +GT + FG A+ G
Sbjct: 528 LQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLISYCFSGQGT---SKINFGTNAIVAG 584
Query: 326 AAWVPL-VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
V + + FYY+ L + V I F D + +D+GT +T P
Sbjct: 585 DGTVAADMFIKKDNPFYYLNLDAVSVEDNLIATLGTPFHAE---DGNIFIDSGTTLTYFP 641
Query: 385 TPAYEAFRDAF--VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPA 442
R+A V +P ++ CY S + + P ++ +FSGG L L
Sbjct: 642 MSYCNLVREAVEQVVTAVKVPDMGSDNLL--CY-YSDTIDI-FPVITMHFSGGADLVLDK 697
Query: 443 SNFLIPVDDAGTFCFAF-APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
N + G FC A PS ++ GN Q + +D ++ + F P C
Sbjct: 698 YNMYLETITGGIFCLAIGCNDPSMPAVFGNRAQNNFLVGYDPSSNVISFSPTNC 751
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 137/450 (30%), Positives = 207/450 (46%), Gaps = 52/450 (11%)
Query: 77 ARWNLELVHRDKMSS-SSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAK 135
AR LELV +S S +++H H + S A +R RR + GA A
Sbjct: 36 ARPRLELVPAAPGASLSDRARDDLHRHAYIRSQLASSRRG--------RRAAEVGASA-- 85
Query: 136 HEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD 195
F + SG G+G+YFVR VG+P + +V D+GSD+ WV+C+
Sbjct: 86 -----FAMPLSSGAYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGA 140
Query: 196 P----VFDPADSASFSGVSCSSAVCD-----RLENAGCHAGRCRYEVSYGDGSYTKGTLA 246
VF A S S++ ++CSS C L N A C Y+ Y DGS +G +
Sbjct: 141 GSPARVFRTAASKSWAPIACSSDTCTSYVPFSLANCSSPASPCAYDYRYRDGSAARGVVG 200
Query: 247 LETLTIG----------------RTVVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLV 289
++ TI R ++ V +GC G F + G+L LG ++S
Sbjct: 201 TDSATIALSSGSGRGGGDSSGGRRAKLQGVVLGCAATYDGQSFQSSDGVLSLGNSNISFA 260
Query: 290 GQLGGQTGGAFSYCLVSR--GTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSG 347
+ + GG FSYCLV ++ L FG A AA PL+ + R FY V +
Sbjct: 261 SRAAARFGGRFSYCLVDHLAPRNATSYLTFGPGAT-APAAQTPLLLDRRMTPFYAVTVDA 319
Query: 348 LGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG 407
+ V G + I D++ + + G G ++D+GT++T L TPAY A A LPR +
Sbjct: 320 VYVAGEALDIPADVWDVDRNG--GAILDSGTSLTILATPAYRAVVTALSKHLAGLPRVT- 376
Query: 408 VSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFAP-SPSG 465
+ F+ CYN + ++ +P + +F+G L PA +++I D A G C S G
Sbjct: 377 MDPFEYCYNWTDAGALEIPKMEVHFAGSARLEPPAKSYVI--DAAPGVKCIGVQEGSWPG 434
Query: 466 LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+S+IGNI Q+ FD + ++ F C
Sbjct: 435 VSVIGNILQQEHLWEFDLRDRWLRFKHTRC 464
>gi|302143530|emb|CBI22091.3| unnamed protein product [Vitis vinifera]
Length = 360
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 103/284 (36%), Positives = 145/284 (51%), Gaps = 17/284 (5%)
Query: 229 CRYEVSYGDGSYTKGTLALETLTIGRTV---------VKNVAIGCGHKNQGMFVGAAGLL 279
C Y YGD S T G ALET T+ T+ V+NV GCGH N+G+F GAAGLL
Sbjct: 74 CPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRVENVMFGCGHWNRGLFHGAAGLL 133
Query: 280 GLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS--SGSLVFGREALPVGAA---WVPLVRN 334
GLG G +S QL G +FSYCLV R + + S L+FG + + + LV
Sbjct: 134 GLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDANVSSKLIFGEDKDLLSHPELNFTTLVAG 193
Query: 335 PRAP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
P +FYYV + + VGG + I E+ +++ G G ++D+GT ++ PAY+ +
Sbjct: 194 KENPVDTFYYVQIKSIVVGGEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIK 253
Query: 393 DAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA 452
+AF+A+ P + + CYN++G +P FS G V P N+ I ++
Sbjct: 254 EAFMAKVKGYPVVKDFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPR 313
Query: 453 GTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C A PS LSIIGN QQ+ I +D +GF P C
Sbjct: 314 EVVCLAILGTPPSALSIIGNYQQQNFHILYDTKKSRLGFAPTKC 357
>gi|356500756|ref|XP_003519197.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 451
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 133/397 (33%), Positives = 190/397 (47%), Gaps = 33/397 (8%)
Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
+D +RV L DA+ + SG G G Y VR+ +GSP + +MV+
Sbjct: 72 KDPERVVYL------SSLDASLRRKPISAAPIASGQAFGIGSYVVRVKLGSPNQLFFMVL 125
Query: 174 DSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG-VSCSSAVCDRLENA-GCH---AGR 228
D+ +D WV C C+ C S + P S ++ G V+C + C + A C +
Sbjct: 126 DTSTDEAWVPCTGCTGC-SSSSTYYSPQASTTYGGAVACYAPRCAQARGALPCPYTGSKA 184
Query: 229 CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSL 288
C + SY GS TL ++L +G + + A GC + G + A GLLGLG G +SL
Sbjct: 185 CTFNQSYA-GSTFSATLVQDSLRLGIDTLPSYAFGCVNSASGWTLPAQGLLGLGRGPLSL 243
Query: 289 VGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSG 347
Q G FSYCL S + + SGSL G P PL++NPR PS YYV L+G
Sbjct: 244 PSQSSKLYSGIFSYCLPSFQSSYFSGSLKLGPTGQPRRIRTTPLLQNPRRPSLYYVNLTG 303
Query: 348 LGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG 407
+ VG +++P+ + G ++D+GT +TR P Y A RD F Q + G
Sbjct: 304 VTVGRVKVPLPIEYLAFDPNKGSGTILDSGTVITRFVGPVYSAIRDEFRNQVKGPFFSRG 363
Query: 408 VSIFDTCY-----NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS 462
FDTC+ NL+ + +R F+G V TLP N LI G C A A +
Sbjct: 364 G--FDTCFVKTYENLTPLIKLR-------FTGLDV-TLPYENTLIHTAYGGMACLAMAAA 413
Query: 463 P----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
P S L++I N QQ+ +++ FD N VG +C
Sbjct: 414 PNNVNSVLNVIANYQQQNLRVLFDTVNNRVGIARELC 450
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 114/371 (30%), Positives = 183/371 (49%), Gaps = 34/371 (9%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCD 217
++ +G+PPR +++D+ S++ WVQ C+ C P F+P S+SF C+S+VC
Sbjct: 1 MQTKIGTPPREVLLLVDTASELTWVQGTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVCL 60
Query: 218 RLENAGCHA------GRCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIGCGH 266
G + G C ++V+Y DGS G +A E ++ + + +V GC
Sbjct: 61 GRSKLGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFGCAS 120
Query: 267 KNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGA----FSYCLVSRGT--GSSGSLVFGR 319
K+ V ++G LGL GS S Q+G ++ FSYC +R SSG ++FG
Sbjct: 121 KDLQRPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSSGVIIFGD 180
Query: 320 EALPVGA-AWVPLVRNPRAPS---FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMD 375
+P ++ L + P S FYYVGL G+ VGG + I F++ ++G+ G D
Sbjct: 181 SGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRLGNGGTYFD 240
Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF-DTCYNLSGFVSVRVPT---VSFY 431
+GT V+ L PA+ A +AF + +L R SG + CY+++ R+PT V+ +
Sbjct: 241 SGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAA-GDARLPTAPLVTLH 299
Query: 432 FSGGPVLTLPASNFLIPV---DDAGTFCFAF----APSPSGLSIIGNIQQEGIQISFDGA 484
F + L ++ +P+ T C AF A + G+++IGN QQ+ I D
Sbjct: 300 FKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQQQDYLIEHDLE 359
Query: 485 NGFVGFGPNVC 495
+GF P C
Sbjct: 360 RSRIGFAPANC 370
>gi|218200805|gb|EEC83232.1| hypothetical protein OsI_28526 [Oryza sativa Indica Group]
Length = 450
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 111/352 (31%), Positives = 167/352 (47%), Gaps = 55/352 (15%)
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC------ 224
+++D+GSD+ WVQC+PCS CY Q DP+FDP+ SAS++ V C+++ C+ A
Sbjct: 124 VIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSC 183
Query: 225 ----------HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVG 274
+ RC Y ++YGDGS+++G LA +T+ +G V GCG N+
Sbjct: 184 ATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGASVDGFVFGCGLSNR----- 238
Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG------REALPVGAAW 328
GL G + S G +G A +GSL G R A PV ++
Sbjct: 239 --GLRRPGSAASSPTASPPGTSGDA------------AGSLSLGGDTSSYRNATPV--SY 282
Query: 329 VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
++ +P P FY++ ++G VGG + +G V++D+GT +TRL Y
Sbjct: 283 TRMIADPAQPPFYFMNVTGASVGGAAV-------AAAGLGAANVLLDSGTVITRLAPSVY 335
Query: 389 EAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
A R F Q G P A S+ D CYNL+G V+VP ++ G +T+ A+ L
Sbjct: 336 RAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEAGADMTVDAAGML 395
Query: 447 -IPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ D C A A IIGN QQ+ ++ +D +GF C
Sbjct: 396 FMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 447
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 112/355 (31%), Positives = 160/355 (45%), Gaps = 38/355 (10%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
Y +R+ +G+PP ID+GSD++W QC PC CY Q P+FDP+ S++F
Sbjct: 61 YLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCPNCYTQFAPIFDPSKSSTF--------- 111
Query: 216 CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQG 270
+ CH C YE+ Y D SY+ G LA ET+TI T V+ +IGCG N
Sbjct: 112 ----KEKRCHGNSCPYEIIYADESYSTGILATETVTIQSTSGEPFVMAETSIGCGLNNSN 167
Query: 271 MFV-----GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
+ ++G++GL G SL+ Q+ G SYC S+GT + FG A+ G
Sbjct: 168 LMTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLISYCFSSQGT---SKINFGTNAVVAG 224
Query: 326 AAWVPL-VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
V + + FYY+ L + VG RI E L D + +D+GT T LP
Sbjct: 225 DGTVAADMFIKKDQPFYYLNLDAVSVGDKRI---ETLGTPFHAQDGNIFIDSGTTYTYLP 281
Query: 385 TP---AYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
T A V +P S ++ CYN P ++ +F+GG L L
Sbjct: 282 TSYCNLVREAVAASVVAANQVPDPSSENLL--CYNWDTM--EIFPVITLHFAGGADLVLD 337
Query: 442 ASNFLIPVDDAGTFCFAF-APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
N + GTFC A PS +I GN + + +D + + F P C
Sbjct: 338 KYNMYVETITGGTFCLAIGCVDPSMPAIFGNRAHNNLLVGYDSSTLVISFSPTNC 392
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 118/370 (31%), Positives = 176/370 (47%), Gaps = 43/370 (11%)
Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQS--DPV 197
DF DV + + +FV VG PP Q+ ++D+GS ++W+QC PC C PV
Sbjct: 54 DFQVDVHQAIK--TSLFFVNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHCSSNHMIHPV 111
Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI----G 253
F+PA S++F SC C N C + +C YE Y G+ +KG LA E LT G
Sbjct: 112 FNPALSSTFVECSCDDRFCRYAPNGHCSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNG 171
Query: 254 RTVV-KNVAIGCGHKN-QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
TVV + +A GCGH+N + + G+LGLG SL QLG + FSYC+ +
Sbjct: 172 NTVVTQPIAFGCGHENGEQLESEFTGILGLGAKPTSLAVQLGSK----FSYCIGDLANKN 227
Query: 312 SG--SLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
G LV G +A +G P + YY+ L G+ VG ++ I +F+ +
Sbjct: 228 YGYNQLVLGEDADILGDP-TP-IEFETENGIYYMNLEGISVGDKQLNIEPVVFK-RRGSR 284
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD-TCYN------LSGFVS 422
GV++DTGT T L AY + + P+ D CY+ L GF
Sbjct: 285 TGVILDTGTLYTWLADIAYRELYNEIKSILD--PKLERFWFRDFLCYHGRVNEELIGF-- 340
Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT----FCFAFAPSPS------GLSIIGNI 472
P V+F+F+GG L + A++ P+ ++ T FC + P+ + IG +
Sbjct: 341 ---PVVTFHFAGGAELAMEATSMFYPMTESDTYHNVFCMSVRPTTEHGGEYKDFTAIGLM 397
Query: 473 QQEGIQISFD 482
Q+ I++D
Sbjct: 398 AQQYYNIAYD 407
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 157 bits (398), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 122/394 (30%), Positives = 187/394 (47%), Gaps = 40/394 (10%)
Query: 133 AAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYK 192
A E F + SG G+G+YFV+ VG+P + +V D+GSD+ WV+C+
Sbjct: 87 APMPEASAFAMPLTSGAYTGTGQYFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSP 146
Query: 193 QSDP-----VFDPADSASFSGVSCSSAVCDR---LENAGCHAGR-----CRYEVSYGDGS 239
+ P VF PA+S S++ + CSS C A C AG C Y+ Y D S
Sbjct: 147 DASPLASPRVFRPANSKSWAPIPCSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKS 206
Query: 240 YTKGTLALETLTIG--------RTVVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVG 290
+G + + TI + ++ V +GC G F + G+L LG ++S
Sbjct: 207 SARGVVGTDAATIALSGSGSDRKAKLQEVVLGCTTSYDGQSFQSSDGVLSLGNSNISFAS 266
Query: 291 QLGGQTGGAFSYCLVSR--GTGSSGSLVFGREALPVGAA----WVPLVRNPRAPSFYYVG 344
+ + GG FSYCLV ++ L FG PVGAA PL+ + + FY V
Sbjct: 267 RAAARFGGRFSYCLVDHLAPRNATSYLTFG----PVGAAHSPSRTPLLLDAQVAPFYAVT 322
Query: 345 LSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR 404
+ + V G + I +++ + + G G ++D+GT++T L TPAY+A A Q +PR
Sbjct: 323 VDAVSVAGKALNIPAEVWDVKKNG--GAILDSGTSLTILATPAYKAVVAALSKQLARVPR 380
Query: 405 ASGVSIFDTCYNLSGF-VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFAPS 462
+ + F+ CYN + VP + F+G L P +++I D A G C
Sbjct: 381 VT-MDPFEYCYNWTATRRPPAVPRLEVRFAGSARLRPPTKSYVI--DAAPGVKCIGLQEG 437
Query: 463 P-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
G+S+IGNI Q+ FD AN ++ F + C
Sbjct: 438 VWPGVSVIGNILQQEHLWEFDLANRWLRFQESRC 471
>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 114/356 (32%), Positives = 169/356 (47%), Gaps = 30/356 (8%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
Y +G+PP+ VID ++VW QC+ CS+C++Q P+FDP S ++ C +
Sbjct: 50 NYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTP 109
Query: 215 VCDRL--ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC-GHKNQGM 271
+C+ + ++ C C Y+ S G T G + +T +G T ++A GC +
Sbjct: 110 LCESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAVG-TAKASLAFGCVVASDIDT 167
Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG---AAW 328
G +G++GLG SLV Q G AFSYCL G + +L G A G AA
Sbjct: 168 MGGPSGIVGLGRTPWSLVTQTG---VAAFSYCLAPHDAGRNSALFLGSSAKLAGGGKAAS 224
Query: 329 VPLV----RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
P V ++Y V L GL G IP+ V++DT + ++ L
Sbjct: 225 TPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPS--------GSTVLLDTFSPISFLV 276
Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
AY+A + A A G P A+ V FD C+ SG S P + F F GG +T+PA+N
Sbjct: 277 DGAYQAVKKAVTAAVGAPPMATPVEPFDLCFPKSG-ASGAAPDLVFTFRGGAAMTVPATN 335
Query: 445 FLIPVDDAGTFCFAFAPSP-----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+L+ + GT C A S + LS++G++QQE I FD + F P C
Sbjct: 336 YLLDYKN-GTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADC 390
>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 147/462 (31%), Positives = 208/462 (45%), Gaps = 83/462 (17%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARM----QRDVKRVATLVR---RLSGGGA 131
+++E +HRD S +H + AR+ +R R A L R R+ A
Sbjct: 35 FSVEFIHRDSARSP--------FHDPSLTAPARVLEAARRSTVRAAALSRSYVRVDAPSA 86
Query: 132 DAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-----P 186
D VS + EY + + +G+PP + D+GSD++W+ C P
Sbjct: 87 DG-----------FVSELTSTPFEYLMAVNIGTPPTRMVAIADTGSDLIWLNCSYGGDGP 135
Query: 187 CSQCYKQSDP-----VFDPADSASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSY 240
+ +D FDP+ S +F V C S C L A C A +CRY SYGDGS+
Sbjct: 136 GLAAARDADAQPPGVQFDPSKSTTFRLVDCDSVACSELPEASCGADSKCRYSYSYGDGSH 195
Query: 241 TKGTLALETLTIG----------RTVVKNVAIGCGHKNQGMFVGAA---GLLGLGGGSMS 287
T G L+ ET T T V NV GC FVG++ GL+GLGGG +S
Sbjct: 196 TSGVLSTETFTFADAPGARGDGTTTRVANVNFGCSTT----FVGSSVGDGLVGLGGGDLS 251
Query: 288 LVGQLGGQT--GGAFSYCLVSRGTGSSGSLVFGREALPV--GAAWVPLVRNPRAPSFYYV 343
LV QLG T G FSYCLV +S +L FG A GA PL+ + + ++Y V
Sbjct: 252 LVSQLGADTSLGRRFSYCLVPYSVKASSALNFGPRAAVTDPGAVTTPLIPS-QVKAYYIV 310
Query: 344 GLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNL 402
L + VG + +++D+GT +T LP EA D V + TG +
Sbjct: 311 ELRSVKVGNKTFEAPD---------RSPLIVDSGTTLTFLP----EALVDPLVKELTGRI 357
Query: 403 ---PRASGVSIFDTCYNLSGF----VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF 455
P S + C+++SG V+ +P V+ GG +TL A N + V + GT
Sbjct: 358 KLPPAQSPERLLPLCFDVSGVREGQVAAMIPDVTVGLGGGAAVTLKAENTFVEVQE-GTL 416
Query: 456 CFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C A + SIIGNI Q+ + + +D G V F P C
Sbjct: 417 CLAVSAMSEQFPASIIGNIAQQNMHVGYDLDKGTVTFAPAAC 458
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 128/377 (33%), Positives = 179/377 (47%), Gaps = 36/377 (9%)
Query: 129 GGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS 188
GGA A V + V S + S EY + + VG+PP + D+GSD+VWV C
Sbjct: 73 GGASPAPGPVPEADGGVESKIITRSFEYLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNG 132
Query: 189 QCYKQSD--PVFDPADSASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTL 245
SD VF P+ S ++S +SC SA C L A C A C+Y+ +YGDGS T G L
Sbjct: 133 GGGGASDGAVVFHPSRSTTYSLLSCQSAACQALSQASCDADSECQYQYAYGDGSRTIGVL 192
Query: 246 ALETLTI--------GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTG 297
+ ET + G+ V V+ GC + G F + GL+GLG G++SLV QLG
Sbjct: 193 STETFSFAAAGGGGEGQVRVPRVSFGCSTGSAGSF-RSDGLVGLGAGALSLVSQLGAAAR 251
Query: 298 GA--FSYCLVS--RGTGSSGSLVFGREAL--PVGAAWVPLVRNPRAPSFYYVGLSGLGVG 351
A FSYCLV SS +L FG A+ GAA PLV + S+Y V L + V
Sbjct: 252 IARRFSYCLVPPYAAANSSSTLSFGARAVVSDPGAASTPLVPS-EVDSYYTVALESVAVA 310
Query: 352 GMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-I 410
G + +++D+GT +T L PA A + + LPRA +
Sbjct: 311 GQDV---------ASANSSRIIVDSGTTLTFL-DPALLRPLVAELERRIRLPRAQPPEQL 360
Query: 411 FDTCYNLSGFVSVR---VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP--SPSG 465
CY++ G +P V+ F GG +TL N +++ GT C P
Sbjct: 361 LQLCYDVQGKSQAEDFGIPDVTLRFGGGASVTLRPENTFSLLEE-GTLCLVLVPVSESQP 419
Query: 466 LSIIGNIQQEGIQISFD 482
+SI+GNI Q+ + +D
Sbjct: 420 VSILGNIAQQNFHVGYD 436
>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like [Glycine max]
Length = 444
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 131/439 (29%), Positives = 189/439 (43%), Gaps = 29/439 (6%)
Query: 64 HNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLV 123
HN + D L++ H S + M + A+ Q ++ ++ LV
Sbjct: 27 HNPKCDAAYQHDHDGSTLQVFHVFSPCSPFRPSKPMSWEESVLQLQAKDQARMQYLSNLV 86
Query: 124 RRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQ 183
R S + + Q S Y VR G+P ++ + +D+ +D WV
Sbjct: 87 ARRSIVPIASGRQITQ-------------SPTYIVRAKFGTPAQTLLLAMDTSNDAAWVP 133
Query: 184 CQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKG 243
C C C + F P S +F V C ++ C ++ N C C + +YG S
Sbjct: 134 CTACVGCSTTTP--FAPPKSTTFKKVGCGASQCKQVRNPTCDGSACAFNFTYGTSS-VAA 190
Query: 244 TLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC 303
+L +T+T+ V GC K G + GLLGLG G +SL+ Q FSYC
Sbjct: 191 SLVQDTVTLATDPVPAYTFGCIQKATGSSLPPQGLLGLGRGPLSLLAQTQKLYQSTFSYC 250
Query: 304 LVSRGTGS-SGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
L S T + SG A P + P +NPR S YYV L + VG + I +
Sbjct: 251 LPSFKTLNFSGHXDLXPVAQPRDQVY-PSFKNPRRSSLYYVNLVAIRVGRRIVDIPPEAL 309
Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGF 420
G V D+GT TRL PAY A R+ F + + + S+ FDTCY
Sbjct: 310 AFNPXTGAGTVFDSGTVFTRLVEPAYTAVRNEFRRRVSVHKKLTVTSLGGFDTCYT---- 365
Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEG 476
V + PT++F FSG V TLP N LI C A AP+P S L++I N+QQ+
Sbjct: 366 VPIVAPTITFMFSGMNV-TLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQN 424
Query: 477 IQISFDGANGFVGFGPNVC 495
++ FD N +G +C
Sbjct: 425 HRVLFDVPNSRLGVARELC 443
>gi|388516465|gb|AFK46294.1| unknown [Medicago truncatula]
Length = 434
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 117/352 (33%), Positives = 175/352 (49%), Gaps = 20/352 (5%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
+ SG G Y VR+ +G+P + +MV+D+ +D ++ P S C S F P S
Sbjct: 87 IASGQAFNIGNYIVRVKIGTPGQLLFMVLDTSTDEAFI---PSSGCIGCSATTFSPNAST 143
Query: 205 SFSGVSCSSAVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA 261
S+ + CS C ++ C A G C + SY +Y+ TL ++L + V+ + +
Sbjct: 144 SYVPLECSVPQCSQVRGLSCPATGSGACSFNKSYAGSTYS-ATLVQDSLRLATDVIPSYS 202
Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGRE 320
G + G + A GLLGLG G +SL+ Q G G FSYCL S + SGSL G
Sbjct: 203 FGSINAISGSSIPAQGLLGLGRGPLSLLSQTGSLYSGVFSYCLPSFKSYYFSGSLKLGPV 262
Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
P PL+RNPR PS Y+V L+G+ VG + +P ++L G ++D+GT +
Sbjct: 263 GQPKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFPKELLAFDVNTGSGTIIDSGTVI 322
Query: 381 TRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
TR P Y A RD F Q TG S + FDTC+ + + ++ P ++ +F+ L
Sbjct: 323 TRFVEPVYNAVRDEFRKQVTGPF---SSLGAFDTCF-VKNYETL-APAITLHFTDLD-LK 376
Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSG-----LSIIGNIQQEGIQISFDGANG 486
LP N LI C A A +P L++I N QQ+ +++ FD N
Sbjct: 377 LPLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQNLRVLFDTVNN 428
>gi|84453222|dbj|BAE71208.1| hypothetical protein [Trifolium pratense]
gi|84453226|dbj|BAE71210.1| hypothetical protein [Trifolium pratense]
Length = 437
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 123/360 (34%), Positives = 176/360 (48%), Gaps = 19/360 (5%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
+ SG G Y VR+ +G+P + +MV+D+ +D +V P S C S F P S
Sbjct: 87 IASGQTFNIGNYVVRVKIGTPGQLLFMVLDTSTDEAFV---PSSGCIGCSATTFYPNVST 143
Query: 205 SFSGVSCSSAVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA 261
SF + CS C ++ C A G C + SY GS TL ++L + V+ + +
Sbjct: 144 SFVPLDCSVPQCGQVRGLSCPATGSGACSFNQSYA-GSTFSATLVQDSLRLATDVIPSYS 202
Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGRE 320
G + G V A GLLGLG G +SL+ Q G G FSYCL S + SGSL G
Sbjct: 203 FGSINAISGSSVPAQGLLGLGRGPLSLLSQSGAIYSGVFSYCLPSFKSYYFSGSLKLGPV 262
Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
P PL+ NP PS YYV L+ + VG + +P+ +L G ++D+GT +
Sbjct: 263 GQPKSIRTTPLLHNPHRPSLYYVNLTAISVGRVYVPLPSELLAFNPSTGAGTIIDSGTVI 322
Query: 381 TRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
TR P Y A RD F Q TG S + FDTC+ + + ++ P ++ +F+ L
Sbjct: 323 TRFVEPIYNAVRDEFRKQVTGPF---SSLGAFDTCF-VKNYETL-APAITLHFTDLD-LK 376
Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSG----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
LP N LI C A A +PS L++I N QQ+ +++ FD N VG +C
Sbjct: 377 LPLENSLIHSSSGSLACLAMAAAPSNVNSVLNVIANFQQQNLRVLFDTVNNKVGIARELC 436
>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
Length = 376
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 128/234 (54%), Gaps = 12/234 (5%)
Query: 65 NNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVR 124
+++ S + D+ R +LE++H+ S + R Q + +D RV ++
Sbjct: 52 SSVCSPSPKGDDKRASLEVIHKHGPCSKLSQDKGRSPSRTQM-----LDQDESRVNSIRS 106
Query: 125 RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC 184
RL+ AD K + SG G+G Y V +G+G+P R + D+GSD+ W QC
Sbjct: 107 RLAKNPADGGKLKGSKVTLPSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQC 166
Query: 185 QPCSQ-CYKQSDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAGRCRYEVSYGDG 238
+PC++ CY Q +P+F+P+ S S++ +SCSS CD L++ C A C Y + YGD
Sbjct: 167 EPCARYCYHQQEPIFNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYGDQ 226
Query: 239 SYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQ 291
SY+ G A + L + T V N GCG N+G+FVG AGL+GLG ++SL+ +
Sbjct: 227 SYSVGFFAQDKLALTSTDVFNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLMSK 280
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 53/95 (55%), Gaps = 3/95 (3%)
Query: 403 PRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA-- 460
P+A+ SI DTCY+ S + +V VP ++ YFS G + L S + + C AFA
Sbjct: 282 PKAAPASILDTCYDFSQYDTVDVPKINLYFSDGAEMDLDPSGIFY-ILNISQVCLAFAGN 340
Query: 461 PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ ++I+GN+QQ+ + +D A G +GF P C
Sbjct: 341 SDATDIAILGNVQQKTFDVVYDVAGGRIGFAPGGC 375
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 155 bits (391), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 124/421 (29%), Positives = 188/421 (44%), Gaps = 45/421 (10%)
Query: 103 RHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGV 162
Q A RD R +++ + GG D + D G YF ++ +
Sbjct: 39 NQQVELEALRARDRARHGRILQGVVGGVVDFSVQGTSD---------PYFVGLYFTKVKL 89
Query: 163 GSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCD 217
GSP + Y+ ID+GSDI+W+ C CS C S FD A S++ + VSC+ +C
Sbjct: 90 GSPAKDFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALVSCADPICS 149
Query: 218 ---RLENAGC--HAGRCRYEVSYGDGS-----YTKGTLALETLTIGRTVVKN----VAIG 263
+ +GC A +C Y YGDGS Y T+ +T+ +G+++V N + G
Sbjct: 150 YAVQTATSGCSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSMVANSSSTIVFG 209
Query: 264 CGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVF 317
C G G+ G G G++S++ QL G T FS+CL G G LV
Sbjct: 210 CSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCL-KGGENGGGVLVL 268
Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
G E L + PLV P P Y + L + V G +PI ++F T + G ++D+G
Sbjct: 269 G-EILEPSIVYSPLV--PSLPH-YNLNLQSIAVNGQLLPIDSNVFATTN--NQGTIVDSG 322
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPV 437
T + L AY F DA A + +S + CY +S V P VS F GG
Sbjct: 323 TTLAYLVQEAYNPFVDAITAAVSQFSKPI-ISKGNQCYLVSNSVGDIFPQVSLNFMGGAS 381
Query: 438 LTLPASNFLIP---VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNV 494
+ L ++L+ +D A +C F G +I+G++ + +D AN +G+
Sbjct: 382 MVLNPEHYLMHYGFLDSAAMWCIGFQKVERGFTILGDLVLKDKIFVYDLANQRIGWADYN 441
Query: 495 C 495
C
Sbjct: 442 C 442
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 127/445 (28%), Positives = 204/445 (45%), Gaps = 76/445 (17%)
Query: 72 TSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVR------R 125
T + +N+EL+H +SS S F+ + ++R+++++ R
Sbjct: 20 TKTQNHGFNVELIH--PISSRS-------------PFYNPKETQIQRISSILNYSINRVR 64
Query: 126 LSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ 185
+ +++QD + S M G Y + +G+PP Y +ID+G+D +W QC+
Sbjct: 65 YLNHVFSFSPNKIQD--VPLSSFMGAG---YVMSYSIGTPPFQLYSLIDTGNDNIWFQCK 119
Query: 186 PCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTL 245
PC C Q+ P+F P+ S+++ + C+S +C +NA DG Y L
Sbjct: 120 PCKPCLNQTSPMFHPSKSSTYKTIPCTSPIC---KNA--------------DGHY----L 158
Query: 246 ALETLTIGRT-----VVKNVAIGCGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGA 299
++TLT+ KN+ IGCGH+NQG G +G +GL G +S + QL GG
Sbjct: 159 GVDTLTLNSNNGTPISFKNIVIGCGHRNQGPLEGYVSGNIGLARGPLSFISQLNSSIGGK 218
Query: 300 FSYCLVSRGTGS--SGSLVFGREALP--VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
FSYCLV + S L FG ++ +G P+ + + Y+V L VG
Sbjct: 219 FSYCLVPLFSKENVSSKLHFGDKSTVSGLGTVSTPI----KEENGYFVSLEAFSVG---- 270
Query: 356 PISEDLFRLTQMGDDG-VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDT 413
+ + +L + G ++D+GT +T LP Y ++ V L R S F+
Sbjct: 271 ---DHIIKLENSDNRGNSIIDSGTTMTILPKDVYSRL-ESVVLDMVKLKRVKDPSQQFNL 326
Query: 414 CYN-LSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP--SPSGLSIIG 470
CY S + +V ++ +FSG V L A N P+ D CFAF + S L+I G
Sbjct: 327 CYQTTSTTLLTKVLIITAHFSGSEV-HLNALNTFYPITDE-VICFAFVSGGNFSSLAIFG 384
Query: 471 NIQQEGIQISFDGANGFVGFGPNVC 495
N+ Q+ + FD + F P C
Sbjct: 385 NVVQQNFLVGFDLNKKTISFKPTDC 409
>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 113/356 (31%), Positives = 166/356 (46%), Gaps = 30/356 (8%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
Y +G+PP+ VID ++VW QC+ C +C++Q P+FDP S ++ C +
Sbjct: 50 NYVANFTIGTPPQPASAVIDLAGELVWTQCKQCGRCFEQGTPLFDPTASNTYRAEPCGTP 109
Query: 215 VCDRLEN--AGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC-GHKNQGM 271
+C+ + + C C YE S G T G + +T +G T ++A GC +
Sbjct: 110 LCESIPSDVRNCSGNVCAYEASTNAGD-TGGKVGTDTFAVG-TAKASLAFGCVVASDIDT 167
Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG---AAW 328
G +G++GLG SLV Q G AFSYCL G + +L G A G AA
Sbjct: 168 MGGPSGIVGLGRTPWSLVTQTG---VAAFSYCLAPHDAGKNSALFLGSSAKLAGGGKAAS 224
Query: 329 VPLV----RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
P V ++Y V L GL G IP+ V++DT + ++ L
Sbjct: 225 TPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPS--------GSTVLLDTFSPISFLV 276
Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
AY+A + A G P A+ V FD C+ SG S P + F F GG +T+PA+N
Sbjct: 277 DGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSG-ASGAAPDLVFTFRGGAAMTVPATN 335
Query: 445 FLIPVDDAGTFCFAFAPSP-----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+L+ + GT C A S + LS++G++QQE I FD + F P C
Sbjct: 336 YLLDYKN-GTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADC 390
>gi|296087864|emb|CBI35120.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 114/328 (34%), Positives = 165/328 (50%), Gaps = 14/328 (4%)
Query: 173 IDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYE 232
+D+ SD+ W+ PC+ C S +F+ S ++ + C +A C ++ C G C +
Sbjct: 1 MDTSSDVAWI---PCNGCLGCSSTLFNSPASTTYKSLGCQAAQCKQVPKPTCGGGVCSFN 57
Query: 233 VSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQL 292
++YG GS L+ +T+T+ V + GC K G + A GLLGLG G +SL+ Q
Sbjct: 58 LTYG-GSSLAANLSQDTITLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQT 116
Query: 293 GGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVG 351
FSYCL S + SGSL G P + PL++NPR PS Y+V L + VG
Sbjct: 117 QNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVG 176
Query: 352 GMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF 411
+ + F G + D+GT TRL TPAY A RDAF + G + + F
Sbjct: 177 RRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTSLGGF 236
Query: 412 DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP----SGLS 467
DTCY V + PT++F F+G V TLP N LI T C A A +P S L+
Sbjct: 237 DTCYT----VPIAAPTITFMFTGMNV-TLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLN 291
Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
+I N+QQ+ ++ +D N +G +C
Sbjct: 292 VIANLQQQNHRLLYDVPNSRLGVARELC 319
>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
Length = 417
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 109/344 (31%), Positives = 168/344 (48%), Gaps = 44/344 (12%)
Query: 109 HARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVV--SGMDQGSGEYFVRIGVGSPP 166
H ++R ++R RL+G G A+ E VV + + GEY V++G+G+PP
Sbjct: 45 HELLRRAIQRSRY---RLAGIGM--ARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPP 99
Query: 167 RSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC-- 224
ID+ SD++W QCQPC+ CY Q DP+F+P S++++ + CSS CD L+ C
Sbjct: 100 YKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGH 159
Query: 225 -HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG--MFVGAAGLLGL 281
C+Y +Y + T+GTLA++ L IG + VA GC + G A+G++GL
Sbjct: 160 DDDESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRGVAFGCSTSSTGGAPPPQASGVVGL 219
Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAW----VPLVRNPRA 337
G G +SLV QL + F+YCL + G LV G +A A VP+ R+PR
Sbjct: 220 GRGPLSLVSQLSVRR---FAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRY 276
Query: 338 PSFYYVGLSGLGVGGMRIPISEDLFRL--------------------TQMGDD---GVVM 374
PS+YY+ L GL +G + + +GD G+++
Sbjct: 277 PSYYYLNLDGLLIGDRTMSLPPTTTTTATATATATAPAPTPSPNATAVAVGDANRYGMII 336
Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNL 417
D + +T L Y+ + + LPR +G S+ D C+ L
Sbjct: 337 DIASTITFLEASLYDELVNDLEVEI-RLPRGTGSSLGLDLCFIL 379
>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
Length = 394
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 113/356 (31%), Positives = 167/356 (46%), Gaps = 30/356 (8%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
Y +G+PP+ VID ++VW QC+ CS+C++Q P+FDP S ++ C +
Sbjct: 50 NYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTP 109
Query: 215 VCDRL--ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC-GHKNQGM 271
+C+ + ++ C C Y+ S G T G + +T +G T ++A GC +
Sbjct: 110 LCESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAVG-TAKASLAFGCVVASDIDT 167
Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG---AAW 328
G +G++GLG SLV Q G AFSYCL G + +L G A G AA
Sbjct: 168 MGGPSGIVGLGRTPWSLVTQTG---VAAFSYCLAPHDAGKNSALFLGSSAKLAGGGKAAS 224
Query: 329 VPLV----RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
P V ++Y V L GL G IP+ V++DT + ++ L
Sbjct: 225 TPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPS--------GSTVLLDTFSPISFLV 276
Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
AY+A + A G P A+ V FD C+ SG S P + F F GG +T+ ASN
Sbjct: 277 DGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSG-ASGAAPDLVFTFRGGAAMTVAASN 335
Query: 445 FLIPVDDAGTFCFAFAPSP-----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+L+ + GT C A S + LS++G++QQE I FD + F P C
Sbjct: 336 YLLDYKN-GTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADC 390
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 112/322 (34%), Positives = 158/322 (49%), Gaps = 23/322 (7%)
Query: 142 GTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPA 201
GT Q G+Y ++ +G PP + +D+GSD++WV+C PC+ C P++DPA
Sbjct: 73 GTKAPVTKSQKGGKYIMQFSIGEPPLLIWAEVDTGSDLMWVKCSPCNGCNPPPSPLYDPA 132
Query: 202 DSASFSGVSCSSAVCDRLENAGCHAGRCR-------YEVSYG-DGSY-TKGTLALETLTI 252
S S + CSS +C L + +C Y +YG G + T+G L ET T
Sbjct: 133 RSRSSGKLPCSSQLCQALGRGRIISDQCSDDPPLCGYHYAYGHSGDHSTQGVLGTETFTF 192
Query: 253 GR-TVVKNVAIGCGHKNQG-MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG 310
G V NV+ G G F G AGL+GLG G +SLV QLG G F+YCL +
Sbjct: 193 GDGYVANNVSFGRSDTIDGSQFGGTAGLVGLGRGHLSLVSQLG---AGRFAYCLAADPNV 249
Query: 311 SSGSLVFGREALPVGAAWV---PLVRNPRA--PSFYYVGLSGLGVGGMRIPISEDLFRLT 365
S L AL A V PLV NP+ + YYV L G+ VGG R+PI + F +
Sbjct: 250 YSTILFGSLAALDTSAGDVSSTPLVTNPKPDRDTHYYVNLQGISVGGSRLPIKDGTFAIN 309
Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV-R 424
G GV D+G T L AY+ R A ++ L +G DTC+ + +V +
Sbjct: 310 SDGSGGVFFDSGAIDTSLKDAAYQVVRQAITSEIQRLGYDAG---DDTCFVAANQQAVAQ 366
Query: 425 VPTVSFYFSGGPVLTLPASNFL 446
+P + +F G ++L N+L
Sbjct: 367 MPPLVLHFDDGADMSLNGRNYL 388
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 122/436 (27%), Positives = 188/436 (43%), Gaps = 28/436 (6%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
L L+H + +H + + F+ D +R+ LV + A
Sbjct: 15 LSLIHFAISKPDGFSLEIVHRYSRESPFYPGNITDYERITRLVELSKIRAHNLAITTSSG 74
Query: 141 FGTDVVS-GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
F + + Q Y V++ +GSP Y+V D+GS + W QC+PC++ ++Q P+F+
Sbjct: 75 FSPEAFRLRISQDDTCYLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTRRFRQLPPIFN 134
Query: 200 PADSASFSGVSCSSAVCDRLENA-GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVK 258
S ++ + C C +N C +C Y ++Y GS T G A + L
Sbjct: 135 STASRTYRDLPCQHQFCTNNQNVFQCRDDKCVYRIAYAGGSATAGVAAQDILQSAENDRI 194
Query: 259 NVAIGCGHKNQGM-----FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL----VSRGT 309
GC NQ G++GL +SL+ Q+ T FSYCL +S +
Sbjct: 195 PFYFGCSRDNQNFSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYCLNLFDLSSPS 254
Query: 310 GSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
++ L FG + ++ P V +PR Y++ L + V G R+ I F L
Sbjct: 255 HATSLLRFGNDIRKSRRKYLSTPFV-SPRGMPNYFLNLIDVSVAGNRMQIPPGTFALKPD 313
Query: 368 GDDGVVMDTGTAVTRLPTPAY----EAFRDAFVA---QTGNLPRASGVSIFDTCYNLSGF 420
G G ++D+GTAVT + AY AF++ F Q N+ + SG CY G
Sbjct: 314 GTGGTIIDSGTAVTYISQTAYFPVITAFKNYFDQHGFQRVNI-QLSGY----ICYKQQGH 368
Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP-SPSGLSIIGNIQQEGIQI 479
P+++F+F G P +L V D G FC A P SP +IIG + Q Q
Sbjct: 369 TFHNYPSMAFHFQGADFFVEPEYVYLT-VQDRGAFCVALQPISPQQRTIIGALNQANTQF 427
Query: 480 SFDGANGFVGFGPNVC 495
+D AN + F P C
Sbjct: 428 IYDAANRQLLFTPENC 443
>gi|125595873|gb|EAZ35653.1| hypothetical protein OsJ_19940 [Oryza sativa Japonica Group]
Length = 468
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 122/345 (35%), Positives = 160/345 (46%), Gaps = 40/345 (11%)
Query: 161 GVGSPPRSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
+ P +Q M ID+ D+ W+QC PC +CY Q + +FDP S + + V C SA C
Sbjct: 154 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 213
Query: 219 LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF-VGAAG 277
L RY G + + R + C H +G F +G
Sbjct: 214 LG---------RY------GRWLLQQPVPVLRRLRRRQGQPRGRTC-HAVRGNFSASTSG 257
Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA---AWVPLVRN 334
+ LGGG SL+ Q G AFSYC+ SSG L G A GA A PLVRN
Sbjct: 258 TMSLGGGRQSLLSQTAATFGNAFSYCVPD--PSSSGFLSLGGPADGGGAGRFARTPLVRN 315
Query: 335 PRA-PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRD 393
P P+ Y V L G+ VGG R+ + +F G VMD+ +T+LP AY A R
Sbjct: 316 PSIIPTLYLVRLRGIEVGGRRLNVPPVVF------AGGAVMDSSVIITQLPPTAYRALRL 369
Query: 394 AFVAQTGNLPR-ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA 452
AF + PR A G + DTCY+ F SV VP VS F GG V+ L A ++
Sbjct: 370 AFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV----- 424
Query: 453 GTFCFAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C AF P+P L IGN+QQ+ ++ +D G VGF C
Sbjct: 425 -EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 468
>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 364
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 128/353 (36%), Positives = 172/353 (48%), Gaps = 35/353 (9%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP-ADSASFSGVSC 211
+G+Y +++ +G+PP Y ++D+ SD+VW QC PC CYKQ +P+FDP + SF SC
Sbjct: 28 NGDYLMKLTLGTPPVDVYGLVDTDSDLVWAQCTPCQGCYKQKNPMFDPLKECNSFFDHSC 87
Query: 212 SSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVAIGCGHK 267
S E A C Y +Y D S TKG LA E T G+ +V+++ GCGH
Sbjct: 88 SP------EKA------CDYVYAYADDSATKGMLAKEIATFSSTDGKPIVESIIFGCGHN 135
Query: 268 NQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGA-FSYCLVS--RGTGSSGSLVFGREALP 323
N G+F GL+GLGGG +SLV Q+G G FS CLV +SG++ G EA
Sbjct: 136 NTGVFNENDMGLIGLGGGPLSLVSQMGNLYGSKRFSQCLVPFHADPHTSGTISLG-EASD 194
Query: 324 V---GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM-DTGTA 379
V G PLV + Y V L G+ VG +P F ++M G +M D+GT
Sbjct: 195 VSGEGVVTTPLVSE-EGQTPYLVTLEGISVGDTFVP-----FNSSEMLSKGNIMIDSGTP 248
Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
T LP Y+ + Q NLP T ++ P ++ +F G V
Sbjct: 249 ETYLPQEFYDRLVEELKVQI-NLPPIHVDPDLGTQLCYKSETNLEGPILTAHFEGADVKL 307
Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGP 492
LP F+ P D G FCFA + GL I GN Q + I FD V F P
Sbjct: 308 LPLQTFIPPKD--GVFCFAMTGTTDGLYIFGNFAQSNVLIGFDLDKRIVFFKP 358
>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 105/371 (28%), Positives = 171/371 (46%), Gaps = 36/371 (9%)
Query: 156 YFVRIGVGSPP--------RSQYMVIDSGSDIVWVQCQPC----SQCYKQSDPVFDPADS 203
+ ++GVGS ++ Y ID+G+++ W+QC+ C + C+ DP + + S
Sbjct: 80 FLAQVGVGSFQEKSHRTHFKTYYFQIDTGNELSWIQCEGCQNKGNMCFPHKDPPYTSSQS 139
Query: 204 ASFSGVSCSS-AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI-----GRTVV 257
S+ VSC+ + C E C G C Y V+YG GSYT G LA ET T T +
Sbjct: 140 KSYKPVSCNQHSFC---EPNQCKEGLCAYNVTYGPGSYTSGNLANETFTFYSNHGKHTAL 196
Query: 258 KNVAIGCGHKNQGMFVG-------AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG 310
K+++ GC ++ M +G+LG+G G S + QLG + G FSYC+ + T
Sbjct: 197 KSISFGCSTDSRNMIYAFLLDKNPVSGVLGMGWGPRSFLAQLGSISHGKFSYCITANNTH 256
Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
++ L FG+ + + + + Y+V L G+ V G+++ I++ + + G
Sbjct: 257 NT-YLRFGKHVVKSKNLQTTKIMQVKPSAAYHVNLLGISVNGVKLNITKTDLAVRKDGSR 315
Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAF---VAQTGNLPRASGVSIF-DTCYN-LSGFVSVRV 425
G ++D GT T L P ++ A ++ NL R + D CY LS +
Sbjct: 316 GCIIDAGTLATLLVKPIFDTLHTALSNHLSSNQNLKRWVIHKLHKDLCYEQLSDAGRKNL 375
Query: 426 PTVSFYFSGGPVLTLPASNFLI-PVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGA 484
P V+F+ + P + FL + FC + S +IIG QQ + +D
Sbjct: 376 PVVTFHLENADLEVKPEAIFLFREFEGKNVFCLSML-SDDSKTIIGAYQQMKQKFVYDTK 434
Query: 485 NGFVGFGPNVC 495
+ FGP C
Sbjct: 435 ARVLSFGPEDC 445
>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 414
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 113/340 (33%), Positives = 160/340 (47%), Gaps = 37/340 (10%)
Query: 161 GVGSPPRSQYMVIDSGSD-IVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL 219
G PP Q ++ + D I W QC+PC +C K S FDP+ S ++S SC +
Sbjct: 79 GHSQPPSPQEILAEMNPDSITWTQCKPCVRCLKDSHRHFDPSASLTYSLGSCIPSTVGN- 137
Query: 220 ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMF-VGAAG 277
Y ++YGD S + G +T+T+ + V GCG N+G F GA G
Sbjct: 138 ----------TYNMTYGDKSTSVGNYGCDTMTLEPSDVFPKFQFGCGRNNEGDFGSGADG 187
Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA-AWVPLVRNP- 335
+LGLG G +S V Q + FSYCL S GSL+FG +A + + LV P
Sbjct: 188 MLGLGQGQLSTVSQTASKFKKVFSYCLPEE--DSIGSLLFGEKATSQSSLKFTSLVNGPG 245
Query: 336 ----RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
+Y+V L + VG R+ + +F G ++D+GT +T LP AY A
Sbjct: 246 TSGLEESGYYFVKLLDISVGNKRLNVPSSVF-----ASPGTIIDSGTVITCLPQRAYSAL 300
Query: 392 RDAFVAQTGNLPRASGV----SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
AF P ++G I DTCYNLSG V +P + +F G + L +I
Sbjct: 301 TAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKR-VI 359
Query: 448 PVDDAGTFCFAFAPSP-----SGLSIIGNIQQEGIQISFD 482
+DA C AFA + S L+IIGN QQ + + +D
Sbjct: 360 WGNDASRLCLAFAGNSKSTMNSELTIIGNRQQVSLTVLYD 399
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 112/348 (32%), Positives = 166/348 (47%), Gaps = 34/348 (9%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS-- 213
+ V I +GSPP +Q + +D+ SD++W+QC+PC CY QS P+FDP+ S + SC +
Sbjct: 85 FLVNISIGSPPVTQLLHMDTASDLLWLQCRPCINCYAQSLPIFDPSRSYTHRNESCRTSQ 144
Query: 214 -AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-------RTVVKNVAIGCG 265
++ NA + C Y + Y DG+ +KG LA E L + +V GCG
Sbjct: 145 YSMPSLRFNAKTRS--CEYSMRYMDGTGSKGILAKEMLMFNTIYDESSSAALHDVVFGCG 202
Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS--SGSLVFGREALP 323
H N G + G+LGLG G SLV + G + FSYC S S LV G +
Sbjct: 203 HDNYGEPLVGTGILGLGYGEFSLVHRFGTK----FSYCFGSLDDPSYPHNVLVLGDDGAN 258
Query: 324 VGAAWVPL-VRNPRAPSFYYVGLSGLGVGGMRIPISEDLF-RLTQMGDDGVVMDTGTAVT 381
+ PL + N FYYV + + V G+ +PI +F R Q G G ++DTG ++T
Sbjct: 259 ILGDTTPLEIYN----GFYYVTIEAISVDGIILPIDPWVFNRNHQTGLGGTIIDTGNSLT 314
Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSIFDT----CYN---LSGFVSVRVPTVSFYFSG 434
L AY+ ++ A+ V+ D CYN V P V+F+FS
Sbjct: 315 SLVEEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVECYNGNLERDLVESGFPIVTFHFSD 374
Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
G L+L + + + FC A +P ++ IG Q+ I +D
Sbjct: 375 GAELSLDVKSVFMKL-SPNVFCLAV--TPGNMNSIGATAQQSYNIGYD 419
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 118/368 (32%), Positives = 169/368 (45%), Gaps = 66/368 (17%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ--PCSQCYKQSDPVFDPADSASFSGVSCS 212
EY V + G+PP+ + +D+GSDI W QC+ P S C+ Q+ P+FDP+ S+SF+ + CS
Sbjct: 87 EYLVHLAAGTPPQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCS 146
Query: 213 SAVCDRLENAG----CHAGRCRYEVSYGDGSYTKGTLALETLTIGR-------TVVKNVA 261
S C+ G + C Y +SYGDGS ++G + E T V +
Sbjct: 147 SPACETTPPCGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAVPGLV 206
Query: 262 IGCGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE 320
GCGH N+G+F G+ G G GS+SL QL G FS+C + TGS S V
Sbjct: 207 FGCGHANRGVFTSNETGIAGFGRGSLSLPSQL---KVGNFSHCFTTI-TGSKTSAVL--- 259
Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT---QMGDDGVVMDTG 377
LG+ G+ P + L R + ++G
Sbjct: 260 ---------------------------LGLPGVAPPSASPLGRRRGSYRCRSTPRSSNSG 292
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD-TCYN--LSGFVSVRVPTVSFYFSG 434
T++T LP Y A R+ F AQ LP G + TC++ L G VPT++ +F G
Sbjct: 293 TSITSLPPRTYRAVREEFAAQV-KLPVVPGNATDPFTCFSAPLRG-PKPDVPTMALHFEG 350
Query: 435 GPVLTLPASNFLIPV---DDAGT----FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGF 487
+ LP N++ V DDAG C A G I+GNIQQ+ + + +D N
Sbjct: 351 A-TMRLPQENYVFEVVDDDDAGNSSRIICLAVI--EGGEIILGNIQQQNMHVLYDLQNSK 407
Query: 488 VGFGPNVC 495
+ F P C
Sbjct: 408 LSFVPAQC 415
>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
Length = 486
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 136/431 (31%), Positives = 199/431 (46%), Gaps = 60/431 (13%)
Query: 103 RHQHSFHARMQRDVKRVATLVRRLSGG--GADAAK---------HEVQDFGTDVVS---G 148
R + S +++D RV + RR+SG GA A+K E Q +S G
Sbjct: 78 RTKPSLADVLRQDRLRVHHIHRRVSGSSRGARASKGSFKEPVSVEETQLHHQAAISVEVG 137
Query: 149 MDQGSGEYFVRI-------GVGSPPRSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFD 199
Q S E I G SPP + +V+D+ D+ W++C PC+ QC +D
Sbjct: 138 TSQTSSEPSSGIHPAAATDGSSSPPVT--VVLDTAGDVPWMRCVPCTFAQCAD-----YD 190
Query: 200 PADSASFSGVSCSSAVCDRLEN--AGCHA-GRCRYEV-SYGDGSYTKGTLALETLTIGR- 254
P S+++S C+S+ C +L GC A G+C+Y V + GD T GT + + LTI
Sbjct: 191 PTRSSTYSAFPCNSSACKQLGRYANGCDANGQCQYMVVTAGDSFTTSGTYSSDVLTINSG 250
Query: 255 TVVKNVAIGCGHKNQGMFVGAA-GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSG 313
V+ GC QG F A G++ LG G SL+ Q G AFSYCL T
Sbjct: 251 DRVEGFRFGCSQNEQGSFENQADGIMALGRGVQSLMAQTSSTYGDAFSYCLPPTETTKG- 309
Query: 314 SLVFGREALPVGAAW----VPLVR-----NPRAPSFYYVGLSGLGVGGMRIPISEDLFRL 364
F + +P+GA++ P+++ + A + Y L + V G + + ++F
Sbjct: 310 ---FFQIGVPIGASYRFVTTPMLKERGGASAAAATLYRALLLAITVDGKELNVPAEVFAA 366
Query: 365 TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR 424
G VMD+ T +TRLP AY A R AF + A DTCY+L+G R
Sbjct: 367 ------GTVMDSRTIITRLPVTAYGALRAAFRNRM-RYRVAPPQEELDTCYDLTGVRYPR 419
Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGA 484
+P ++ F G V+ + S L+ G FA S SI+GN+QQ+ IQ+ D
Sbjct: 420 LPRIALVFDGNAVVEMDRSGILL----NGCLAFASNDDDSSPSILGNVQQQTIQVLHDVG 475
Query: 485 NGFVGFGPNVC 495
G +GF C
Sbjct: 476 GGRIGFRSAAC 486
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 139/452 (30%), Positives = 196/452 (43%), Gaps = 49/452 (10%)
Query: 69 SSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSG 128
+++ ++ E ++++ +HRD S H H+ R R L R SG
Sbjct: 23 TASAAAGEGGFSVDFIHRDSARSPYR-----HPALSPHARALAAARRSLRGEVLGRSYSG 77
Query: 129 GGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS 188
AA D G V S + S EY + + VG+PP + D+GSD+VWV C
Sbjct: 78 ASPAAAPVSAADGG--VESKIITRSFEYLMYVNVGTPPTQLLAIADTGSDLVWVNCSSSG 135
Query: 189 QCYKQSDP----VFDPADSASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSYTKG 243
+D VF P S+++S +SC S C L A C A C+Y+ SYGDGS T G
Sbjct: 136 GGLADADAGGNVVFQPTRSSTYSQLSCQSNACQALSQASCDADSECQYQYSYGDGSRTIG 195
Query: 244 TLALETLTI------GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT- 296
L+ ET + G+ V V GC + G F + GL+GLG G+ SLV QLG T
Sbjct: 196 VLSTETFSFVDGGGKGQVRVPRVNFGCSTASAGTFR-SDGLVGLGAGAFSLVSQLGATTH 254
Query: 297 -GGAFSYCLV-SRGTGSSGSLVFGREAL--PVGAAWVPLVRNPRAPSFYYVGLSGLGVGG 352
SYCL+ S SS +L FG A+ GAA PLV + S+Y V L + VGG
Sbjct: 255 IDRKLSYCLIPSYDANSSSTLNFGSRAVVSEPGAASTPLVPS-DVDSYYTVALESVAVGG 313
Query: 353 MRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT----PAYEAFRDAFVAQTGNLPRASGV 408
+ D +++D+GT +T L P Q P
Sbjct: 314 QEVATH----------DSRIIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQPPE---- 359
Query: 409 SIFDTCYNLSGFVSVR---VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP--SP 463
+ CY++ G +P V+ F GG +TL N + + GT C P
Sbjct: 360 QLLQLCYDVQGKSETDNFGIPDVTLRFGGGAAVTLRPENTFSLLQE-GTLCLVLVPVSES 418
Query: 464 SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+SI+GNI Q+ + +D V F C
Sbjct: 419 QPVSILGNIAQQNFHVGYDLDARTVTFAAADC 450
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 125/437 (28%), Positives = 201/437 (45%), Gaps = 45/437 (10%)
Query: 67 ISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRL 126
ISS+ ++ +R +L+HR+ N R + + ++R + + ++ L
Sbjct: 26 ISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVEDRSKREQTSSIER-FDFLESKIKEL 84
Query: 127 SGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP 186
G +A + ++GSG + V + +GSPP +Q +V+D+GS ++WVQC P
Sbjct: 85 KSVGNEARSSLIP---------FNRGSG-FLVNLSIGSPPVTQLVVVDTGSSLLWVQCLP 134
Query: 187 CSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTL 245
C C++QS FDP S SF + C + + C+ + Y++ Y G ++G L
Sbjct: 135 CINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYKLRYLGGDSSQGIL 194
Query: 246 A-----LETLTIGRTVVKNVAIGCGHKNQGMFVGAA--GLLGLGG-GSMSLVGQLGGQTG 297
A ETL G+ N+ GCGH N A G+ GLG +++ QLG +
Sbjct: 195 AKESLLFETLDEGKIKKSNITFGCGHMNIKTNNDDAYNGVFGLGAYPHITMATQLGNK-- 252
Query: 298 GAFSYCL--VSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSF--YYVGLSGLGVGGM 353
FSYC+ ++ + LV G+ +++ P F YYV L + VG
Sbjct: 253 --FSYCIGDINNPLYTHNHLVLGQ------GSYIEGDSTPLQIHFGHYYVTLQSISVGSK 304
Query: 354 RIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV-AQTGNLPRASGVSIFD 412
+ I + F+++ G GV++D+G T+L +E D V G L R F+
Sbjct: 305 TLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFE 364
Query: 413 -TCYNLSGFVS---VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS---G 465
C+ G VS V P V+F+F+GG L L + + L FC A PS S
Sbjct: 365 GLCF--KGVVSRDLVGFPAVTFHFAGGADLVLESGS-LFRQHGGDRFCLAILPSNSELLN 421
Query: 466 LSIIGNIQQEGIQISFD 482
LS+IG + Q+ + FD
Sbjct: 422 LSVIGILAQQNYNVGFD 438
>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 252
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 91/238 (38%), Positives = 133/238 (55%), Gaps = 25/238 (10%)
Query: 100 HYHRHQHSFHARMQRD-------VKRVATLVRRLSGGGADAAKHEVQDFGTDV--VSGMD 150
H + ++ R+Q+ V+ + +RR+ A+ H V+ T + SG++
Sbjct: 6 HCSEKKIDWNRRLQKQLILDDLRVRSMQNRIRRV------ASTHNVEASQTQIPLSSGIN 59
Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
+ Y V +G+GS ++ ++ID+ SD+ WVQC+PC CY Q P+F P+ S+S+ VS
Sbjct: 60 LQTLNYIVTMGLGS--KNMTVIIDTRSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVS 117
Query: 211 CSSAVCDRLENAGCHAG--------RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAI 262
C+S+ C L+ A + G C Y V+YGDGSYT G L +E L+ G V +
Sbjct: 118 CNSSTCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGDLGVEALSFGGVSVSDFVF 177
Query: 263 GCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE 320
GCG N+G+F G +GL+GLG +SLV Q GG FSYCL + GSSGSLV G E
Sbjct: 178 GCGRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTEAGSSGSLVMGNE 235
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 110/346 (31%), Positives = 161/346 (46%), Gaps = 30/346 (8%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
+ V I +GSPP +Q + +D+ SD++W+QC PC CY QS P+FDP+ S + +C ++
Sbjct: 85 FLVNISIGSPPITQLLHMDTASDLLWIQCLPCINCYAQSLPIFDPSRSYTHRNETCRTSQ 144
Query: 216 CDRLE-NAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-------RTVVKNVAIGCGHK 267
+ C Y + Y D + +KG LA E L + +V GCGH
Sbjct: 145 YSMPSLKFNANTRSCEYSMRYVDDTGSKGILAREMLLFNTIYDESSSAALHDVVFGCGHD 204
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS--SGSLVFGREALPVG 325
N G + G+LGLG G SLV + G + FSYC S S LV G + +
Sbjct: 205 NYGEPLVGTGILGLGYGEFSLVHRFGKK----FSYCFGSLDDPSYPHNVLVLGDDGANIL 260
Query: 326 AAWVPL-VRNPRAPSFYYVGLSGLGVGGMRIPISEDLF-RLTQMGDDGVVMDTGTAVTRL 383
PL + N FYYV + + V G+ +PI +F R Q G G ++DTG ++T L
Sbjct: 261 GDTTPLEIHN----GFYYVTIEAISVDGIILPIDPRVFNRNHQTGLGGTIIDTGNSLTSL 316
Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSIFDT----CYN---LSGFVSVRVPTVSFYFSGGP 436
AY+ ++ A+ VS D CYN V P V+F+FS G
Sbjct: 317 VEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECYNGNFERDLVESGFPIVTFHFSEGA 376
Query: 437 VLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
L+L + + + FC A +P L+ IG Q+ I +D
Sbjct: 377 ELSLDVKSLFMKL-SPNVFCLAV--TPGNLNSIGATAQQSYNIGYD 419
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 124/378 (32%), Positives = 187/378 (49%), Gaps = 38/378 (10%)
Query: 144 DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC---SQCYKQSDPVFDP 200
DV + + + + +Y +GSPP+ +ID+GSD++W QC C KQ P ++
Sbjct: 74 DVSAQVHRATRQYIASYLIGSPPQRTEALIDTGSDLIWTQCATTCLPKSCAKQGLPYYNL 133
Query: 201 ADSASFSGVSCSSAVCDRLENAGCHA----GRCRYEVSYGDGSYTKGTLALETLTIGRTV 256
+ S++F V C+ N G H G C + SYG G G+L E+ +
Sbjct: 134 SQSSTFVPVPCADKAGFCAAN-GVHLCGLDGSCTFIASYGAGRVI-GSLGTESFAF-ESG 190
Query: 257 VKNVAIGC---GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--RGTGS 311
++A GC G A+GL+GLG G +SLV Q+G FSYCL +G+
Sbjct: 191 TTSLAFGCVSLTRITSGALNDASGLIGLGRGRLSLVSQIGATR---FSYCLTPYFHSSGA 247
Query: 312 SGSL-VFGREALPVGAAWVPLVRNPR---APSFYYVGLSGLGVGGMRIP-ISEDLFRLTQ 366
S L V +L G A +P V++P+ +FYY+ L G+ VG R+P ++ F+L Q
Sbjct: 248 SSHLFVGASASLGGGGASMPFVKSPKDYPYSTFYYLPLEGITVGKTRLPAVNSTTFQLRQ 307
Query: 367 MGD----DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN-----LPRASGVSIFDTCYNL 417
+ GV++DTG+ +T+L + AYEA ++ AQ GN P SG+ + C
Sbjct: 308 LFKGYWAGGVIIDTGSPLTQLASHAYEALKEEVAAQLGNGSLVPAPEDSGLEL---CVAR 364
Query: 418 SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGI 477
GF V VP + F+F GG + +PA+++ PVD A C SIIGN QQ+ +
Sbjct: 365 EGFQKV-VPALVFHFGGGADMAVPAASYWAPVDKAAA-CMMILEGGYD-SIIGNFQQQDM 421
Query: 478 QISFDGANGFVGFGPNVC 495
+ +D G F C
Sbjct: 422 HLLYDLRRGRFSFQTADC 439
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 151 bits (381), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 135/447 (30%), Positives = 199/447 (44%), Gaps = 67/447 (14%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
LEL H D + S RM+R +R T R S G A A H +
Sbjct: 26 LELTHVDA--------------KQNCSTEERMRRATER--THRRLASMGEASAPVHWAES 69
Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ--CYKQSDPVF 198
+Y +G PP+ +ID+GS+++W QC C C+ Q+ +
Sbjct: 70 --------------QYIAEYLIGDPPQQAEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFY 115
Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTV 256
DP+ S + V+C+ C C C +YG G G L E T +
Sbjct: 116 DPSRSRTARPVACNDTACALGSETRCARDNKACAVLTAYGAG-VIGGVLGTEAFTF-QPQ 173
Query: 257 VKNV--AIGCGHKNQ---GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV---SRG 308
+NV A GC + G GA+G++GLG G++SLV QLG FSYCL S+
Sbjct: 174 SENVSLAFGCIAATRLTPGSLDGASGIIGLGRGNLSLVSQLGDNK---FSYCLTPYFSQS 230
Query: 309 TGSSGSLVFGREALPVG---AAWVPLVRNPRA---PSFYYVGLSGLGVGGMRIPISEDLF 362
T +S V L G A VP ++NP +FYY+ L+G+ VG ++ + E F
Sbjct: 231 TNTSRLFVGASAGLSSGGAPATSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAF 290
Query: 363 RLTQMGD---DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN--LPRASGVSIFDTCYNL 417
L Q+ G ++D+G+ T L AY+A RD V Q G +P +G D C +
Sbjct: 291 DLRQVATGLWAGTLIDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEGLDLCAAV 350
Query: 418 S-GFVSVRVPTVSFYF-SGGPVLTLPASNFLIPVDDAGTFCFAFA---PSPS----GLSI 468
+ G V VP + +F SGG + +P N+ PVDD+ F+ P+ + +I
Sbjct: 351 AHGDVGKLVPPLVLHFGSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPMNETTI 410
Query: 469 IGNIQQEGIQISFDGANGFVGFGPNVC 495
IGN Q+ + + +D G + F P C
Sbjct: 411 IGNYMQQDMHLLYDLEKGMLSFQPADC 437
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 151 bits (381), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 129/450 (28%), Positives = 200/450 (44%), Gaps = 78/450 (17%)
Query: 110 ARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQ 169
ARM R+ R+A + R G A F + SG G+G+YFVR VG+P +
Sbjct: 47 ARMDRE--RMAFISSR----GRRRAAETASAFAMPLSSGAYTGTGQYFVRFRVGTPAQPF 100
Query: 170 YMVIDSGSDIVWVQCQ------------------PCSQCYKQSDPVFDPADSASFSGVSC 211
+V D+GSD+ WV+C P +++ F P S +++ + C
Sbjct: 101 LLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRT---FRPDKSRTWAPIPC 157
Query: 212 SSAVCDR-----LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-------RTVVKN 259
SSA C L A C Y+ Y DGS +GT+ +++ TI + ++
Sbjct: 158 SSATCRESLPFSLAACATPANPCAYDYRYKDGSAARGTVGVDSATIALSGRAARKAKLRG 217
Query: 260 VAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--GTGSSGSLV 316
V +GC G F+ + G+L LG ++S + + GG FSYCLV ++ L
Sbjct: 218 VVLGCTTSYNGQSFLASDGVLSLGYSNISFASRAASRFGGRFSYCLVDHLAPRNATSYLT 277
Query: 317 FG--------REALPVGAA-----------------WVPLVRNPRAPSFYYVGLSGLGVG 351
FG R + + + PLV + R FY V + G+ V
Sbjct: 278 FGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVLDHRTRPFYAVTVKGVSVA 337
Query: 352 GMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF 411
G + I ++ + Q G G ++D+GT++T L PAY A A + LPR + + F
Sbjct: 338 GELLKIPRAVWDVEQGG--GAILDSGTSLTMLAKPAYRAVVAALSKRLAGLPRVT-MDPF 394
Query: 412 DTCYNLSGF----VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFAPSP-SG 465
D CYN + V+ +P ++ +F+G L PA +++I D A G C P G
Sbjct: 395 DYCYNWTSPSGSDVAAPLPMLAVHFAGSARLEPPAKSYVI--DAAPGVKCIGLQEGPWPG 452
Query: 466 LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
LS+IGNI Q+ +D N + F + C
Sbjct: 453 LSVIGNILQQEHLWEYDLKNRRLRFKRSRC 482
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 151 bits (381), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 123/362 (33%), Positives = 179/362 (49%), Gaps = 33/362 (9%)
Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC--QPCSQCYKQSDPVFDPADSASF 206
MD G Y + +G+PP+ + D+GSD++W +C + C Q P + P S++F
Sbjct: 84 MDDSGGAYDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTF 143
Query: 207 SGVSCSSAVCDRLEN---AGCHAG--RCRYEVSYG----DGSYTKGTLALETLTIGRTVV 257
+ + CS +C L + A C A C Y SYG D YT+G LA ET T+G V
Sbjct: 144 AKLPCSDRLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTLGADAV 203
Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF 317
+V GC ++G + +GL+GLG G +SLV QL T F YCL S + +S L+F
Sbjct: 204 PSVRFGCTTASEGGYGSGSGLVGLGRGPLSLVSQLNAST---FMYCLTSDASKAS-PLLF 259
Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMDT 376
G A GA V + +FY V L + +G P + E +GVV D+
Sbjct: 260 GSLASLTGAQ-VQSTGLLASTTFYAVNLRSISIGSATTPGVGE---------PEGVVFDS 309
Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSG---FVSVRVPTVSFYFS 433
GT +T L PAY + AF++QT +L + F+ C+ + VPT+ +F
Sbjct: 310 GTTLTYLAEPAYSEAKAAFLSQT-SLDQVEDTDGFEACFQKPANGRLSNAAVPTMVLHFD 368
Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
G + LP +N+++ V+D G C+ SPS LSIIGNI Q + D + F P
Sbjct: 369 GAD-MALPVANYVVEVED-GVVCWIVQRSPS-LSIIGNIMQVNYLVLHDVHRSVLSFQPA 425
Query: 494 VC 495
C
Sbjct: 426 NC 427
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 116/376 (30%), Positives = 173/376 (46%), Gaps = 34/376 (9%)
Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ------PCSQCYKQS---DPVFDP 200
D G G+YFV VG+P + +V D+GSD+ W+ C+ CS + VF
Sbjct: 77 DYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHA 136
Query: 201 ADSASFSGVSCSSAVCD-------RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI- 252
S+SF + C + +C L N C Y+ Y DGS G A ET+T+
Sbjct: 137 NLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVE 196
Query: 253 ---GRTV-VKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
GR + + NV IGC QG F A G++GLG S + + GG FSYCLV
Sbjct: 197 LKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDH 256
Query: 308 GTGS--SGSLVFG----REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
+ S L FG +EAL + LV SFY V + G+ +GG + I ++
Sbjct: 257 LSHKNVSNYLTFGSSRSKEALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAMLKIPSEV 315
Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS-GVSIFDTCYNLSGF 420
+ + G G ++D+G+++T L PAY+ A + + + C+N +GF
Sbjct: 316 WDVK--GAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGF 373
Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP-SPSGLSIIGNIQQEGIQI 479
VP + F+F+ G P +++I D G C F + G S++GNI Q+
Sbjct: 374 EESLVPRLVFHFADGAEFEPPVKSYVISAAD-GVRCLGFVSVAWPGTSVVGNIMQQNHLW 432
Query: 480 SFDGANGFVGFGPNVC 495
FD +GF P+ C
Sbjct: 433 EFDLGLKKLGFAPSSC 448
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 126/450 (28%), Positives = 191/450 (42%), Gaps = 67/450 (14%)
Query: 81 LELVHR--DKMSSSSNTTNNMH----YHRHQHSFHARMQRDVKRVATLVRRLSGGGADAA 134
LELVHR ++ + + + + + RM + V+ R G
Sbjct: 35 LELVHRHHERFAGGGGDVDRVEAVKGFVKRDKLRRQRMNQRWGVVSNYDSRRKGFEMTTT 94
Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQS 194
EV+ + SG D GEYF + VGSP + ++V+D+GS+ W+ C
Sbjct: 95 PAEVE---MPMHSGRDDALGEYFAEVKVGSPGQRFWLVVDTGSEFTWLNC---------- 141
Query: 195 DPVFDPADSASFSGVSCSSAVCD-------RLENAGCHAGRCRYEVSYGDGSYTKGTLAL 247
S SF V+C+S C L + C Y++SY DGS KG
Sbjct: 142 --------SKSFEAVTCASRKCKVDLSELFSLSVCPKPSDPCLYDISYADGSSAKGFFGT 193
Query: 248 ETLTIGRT-----VVKNVAIGCGHKNQGMFVGA------AGLLGLGGGSMSLVGQLGGQT 296
+++T+G T + N+ IGC + M G G+LGLG S + + +
Sbjct: 194 DSITVGLTNGKQGKLNNLTIGC---TKSMLNGVNFNEETGGILGLGFAKDSFIDKAANKY 250
Query: 297 GGAFSYCLVSRGTGSSGSLVFGREALPVG----AAWVPLVRNPRA---PSFYYVGLSGLG 349
G FSYCLV + S S L +G A + +R P FY V + G+
Sbjct: 251 GAKFSYCLVDHLSHRSVS-----SNLTIGGHHNAKLLGEIRRTELILFPPFYGVNVVGIS 305
Query: 350 VGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS 409
+GG + I ++ G G ++D+GT +T L PAYEA +A + R +G
Sbjct: 306 IGGQMLKIPPQVWDFNAEG--GTLIDSGTTLTSLLLPAYEAVFEALTKSLTKVKRVTGED 363
Query: 410 I--FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP--SPSG 465
+ C++ GF VP + F+F+GG P +++I V C P G
Sbjct: 364 FDALEFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAPL-VKCIGIVPIDGIGG 422
Query: 466 LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
S+IGNI Q+ FD + VGF P+ C
Sbjct: 423 ASVIGNIMQQNHLWEFDLSTNTVGFAPSTC 452
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 123/421 (29%), Positives = 185/421 (43%), Gaps = 45/421 (10%)
Query: 103 RHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGV 162
Q A RD R +++ + GG D + D G YF ++ +
Sbjct: 39 NQQVELEALRARDRARHGRILQGVVGGVVDFSVQGTSD---------PYFVGLYFTKVKL 89
Query: 163 GSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCD 217
GSP + Y+ ID+GSDI+W+ C CS C S FD A S++ + VSC +C
Sbjct: 90 GSPAKEFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALVSCGDPICS 149
Query: 218 ---RLENAGC--HAGRCRYEVSYGDGS-----YTKGTLALETLTIGRTVVKN----VAIG 263
+ + C A +C Y YGDGS Y T+ +T+ +G++VV N + G
Sbjct: 150 YAVQTATSECSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSVVANSSSTIIFG 209
Query: 264 CGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVF 317
C G G+ G G G++S++ QL G T FS+CL G G LV
Sbjct: 210 CSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCL-KGGENGGGVLVL 268
Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
G E L + PLV P P Y + L + V G +PI ++F T + G ++D+G
Sbjct: 269 G-EILEPSIVYSPLV--PSQPH-YNLNLQSIAVNGQLLPIDSNVFATTN--NQGTIVDSG 322
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPV 437
T + L AY F A A + +S + CY +S V P VS F GG
Sbjct: 323 TTLAYLVQEAYNPFVKAITAAVSQFSKPI-ISKGNQCYLVSNSVGDIFPQVSLNFMGGAS 381
Query: 438 LTLPASNFLIP---VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNV 494
+ L ++L+ +D A +C F G +I+G++ + +D AN +G+
Sbjct: 382 MVLNPEHYLMHYGFLDGAAMWCIGFQKVEQGFTILGDLVLKDKIFVYDLANQRIGWADYD 441
Query: 495 C 495
C
Sbjct: 442 C 442
>gi|298204765|emb|CBI25263.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 85/276 (30%), Positives = 131/276 (47%), Gaps = 49/276 (17%)
Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
G A C Y ++YGDGS+T+G L E L G +VK+ GCG N+G+F G +GL+GLG
Sbjct: 127 GSAAPICNYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMGLG 186
Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYY 342
+SL+ Q NP+ +FY+
Sbjct: 187 RSDLSLISQTS---------------------------------------ENPQLYNFYF 207
Query: 343 VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL 402
+ L+G+ +GG+ + + +G +++D+GT +TRLP Y+A + F+ Q
Sbjct: 208 INLTGISIGGVAL-------QAPSVGPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGF 260
Query: 403 PRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN-FLIPVDDAGTFCFAFA- 460
P A SI DTC+NLS + V +PT+ +F G LT+ + F DA C A A
Sbjct: 261 PPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQVCLALAS 320
Query: 461 -PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
++I+GN QQ+ +++ +D VGF C
Sbjct: 321 LEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETC 356
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 118/368 (32%), Positives = 175/368 (47%), Gaps = 29/368 (7%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASF 206
SG G+G+YFV++ VG+P + +V D+GSD+ WV+C S + VF P S S+
Sbjct: 107 SGAYSGTGQYFVKLRVGTPVQEFTLVADTGSDLTWVKCAGASPPGR----VFRPKTSRSW 162
Query: 207 SGVSCSSAVCD-----RLENAGCHAGRCRYEVSYGDGSY-TKGTLALETLTI----GRTV 256
+ + CSS C L N A C Y+ Y +GS +G + E+ TI G+
Sbjct: 163 APIPCSSDTCKLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGGKVA 222
Query: 257 -VKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--GTGSS 312
+K+V +GC + G F A G+L LG +S Q + GG+FSYCLV ++
Sbjct: 223 QLKDVVLGCSSSHDGQSFRSADGVLSLGNAKISFATQAAARFGGSFSYCLVDHLAPRNAT 282
Query: 313 GSLVFGREALP-VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
G L FG +P A L +P P FY V + + V G + I +++ G
Sbjct: 283 GYLAFGPGQVPRTPATQTKLFLDPEMP-FYGVKVDAIHVAGKALDIPAEVW---DAKSGG 338
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGF---VSVRVPTV 428
V++D+G +T L PAY+A A +P+ S F+ CYN + +P +
Sbjct: 339 VILDSGNTLTVLAAPAYKAVVAALSKHLDGVPKVS-FPPFEHCYNWTARRPGAPEIIPKL 397
Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGF 487
+ F+G L PA +++I V G C GLS+IGNI Q+ FD N
Sbjct: 398 AVQFAGSARLEPPAKSYVIDVKP-GVKCIGVQEGEWPGLSVIGNIMQQEHLWEFDLKNMQ 456
Query: 488 VGFGPNVC 495
V F + C
Sbjct: 457 VRFKQSNC 464
>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
Length = 494
Score = 149 bits (376), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 118/404 (29%), Positives = 185/404 (45%), Gaps = 55/404 (13%)
Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ----------- 189
F + SG G+G+YFVR VG+P + ++ D+GSD+ WV+C+ +
Sbjct: 95 FAMPLSSGAYTGTGQYFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPA 154
Query: 190 ----CYKQSDPVFDPADSASFSGVSCSSAVCD-----RLENAGCHAGRCRYEVSYGDGSY 240
VF P DS ++S + CSS C L N C Y+ Y D S
Sbjct: 155 AAPSPAVAPPRVFRPGDSKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRYNDNSA 214
Query: 241 TKGTLALETLTIG-------------RTVVKNVAIGC--GHKNQGMFVGAAGLLGLGGGS 285
+G + ++ T+ + ++ V +GC H QG F + G+L LG +
Sbjct: 215 ARGVVGTDSATVALSGGRGGGGGGDRKAKLQGVVLGCTTAHAGQG-FEASDGVLSLGYSN 273
Query: 286 MSLVGQLGGQTGGAFSYCLVSR--GTGSSGSLVFG------REALPVGAAWVPLVRNPRA 337
+S + + GG FSYCLV ++ L FG + P + PL+ + R
Sbjct: 274 ISFASRAASRFGGRFSYCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDARV 333
Query: 338 PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVA 397
FY V + + V G+ + I +++ + G G ++D+GT++T L TPAY+A A
Sbjct: 334 RPFYAVAVDSVSVDGVALDIPAEVWDVGSNG--GTIIDSGTSLTVLATPAYKAVVAALSE 391
Query: 398 QTGNLPRASGVSIFDTCYNLS----GFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA- 452
Q LPR + + FD CYN + G + VP ++ F+G L PA +++I D A
Sbjct: 392 QLAGLPRVA-MDPFDYCYNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKSYVI--DAAP 448
Query: 453 GTFCFAFAP-SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
G C + G+S+IGNI Q+ FD N ++ F C
Sbjct: 449 GVKCIGVQEGAWPGVSVIGNILQQEHLWEFDLNNRWLRFRQTSC 492
>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
Length = 373
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 113/358 (31%), Positives = 165/358 (46%), Gaps = 34/358 (9%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
Y + +G+PP+ +I + VW QC PC +C+KQ P+F+ + S+++ C +A+
Sbjct: 28 YMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLFNRSASSTYRPEPCGTAL 87
Query: 216 CDRLENAGCHA-GRCRYEVS--YGDGSYTKGTLALETLTIGRTVVKNVAIGCG-HKNQGM 271
C+ + + C G C YEV +GD S GT +T IG T ++A GC N
Sbjct: 88 CESVPASTCSGDGVCSYEVETMFGDTSGIGGT---DTFAIG-TATASLAFGCAMDSNIKQ 143
Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG-TGSSGSLVFGREALPVG---AA 327
+GA+G++GLG SLVGQ+ AFSYCL G G +L+ G A G AA
Sbjct: 144 LLGASGVVGLGRTPWSLVGQMNAT---AFSYCLAPHGAAGKKSALLLGASAKLAGGKSAA 200
Query: 328 WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPA 387
PLV S Y + L G+ G D+ V++DT V+ L A
Sbjct: 201 TTPLVNTSDDSSDYMIHLEGIKFG--------DVIIAPPPNGSVVLVDTIFGVSFLVDAA 252
Query: 388 YEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV-----SVRVPTVSFYFSGGPVLTLPA 442
++A + A G P A+ FD C+ + S+ +P V F G LT+P
Sbjct: 253 FQAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQGAAALTVPP 312
Query: 443 SNFLIPVDDAGTFCFAFAPSP-----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
S ++ + GT C A S + LSI+G + QE I FD + F P C
Sbjct: 313 SKYMYDAGN-GTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEPADC 369
>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 121/407 (29%), Positives = 188/407 (46%), Gaps = 30/407 (7%)
Query: 95 TTNNMHYHRHQHSFH----ARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMD 150
+TN +H H + + +D +TL R + DF V +
Sbjct: 44 STNLIHIHSPSSPYKNVKAESLAKDTALESTLSRHAYLRARQQKALQPADF---VPPPLI 100
Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
+ + + +G+PP + Y+V+D+GSD+ W+QC+PC CYKQ DP+++ S S++ +
Sbjct: 101 RDKSAFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEML 160
Query: 211 CSSAVCDRL--ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIG 263
C+ C L E +G C Y+ SY DGS T G L+ E + V G
Sbjct: 161 CNEPPCLSLGREGQCSDSGSCLYQTSYADGSRTSGLLSYEKVAFTSHYSDEDKTAQVGFG 220
Query: 264 CGHKNQGMFVGA--AGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGS-LVFG 318
CG +N + G+LGLG G +SLV QL G+ +F+YC + ++G LVFG
Sbjct: 221 CGLQNLNFVTSSRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNLSNPNAGGFLVFG 280
Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVG--GMRIPISEDLFRLTQMGDDGVVMDT 376
+A + P+V FYYV L G+G+G R+ I+ F G GV++D+
Sbjct: 281 -DATYLNGDMTPMV----IAEFYYVNLLGIGLGVEEPRLDINSSSFERKPDGSGGVIIDS 335
Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYN-LSGFVSVRVPTVSFYFSGG 435
G+ ++ P YE R+A V + S ++ C+ G PT+ Y
Sbjct: 336 GSTLSIFPPEVYEVVRNAVVDKLKKGYNISPLTSSPDCFEGKIGRDLPLFPTLVLYLEST 395
Query: 436 PVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
+L S FL D+ FC F S GLSIIG + Q+ + ++
Sbjct: 396 GILNDRWSIFLQRYDE--LFCLGFT-SGEGLSIIGTLAQQSYKFGYN 439
>gi|449467979|ref|XP_004151699.1| PREDICTED: probable aspartic protease At2g35615-like, partial
[Cucumis sativus]
Length = 209
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 79/195 (40%), Positives = 114/195 (58%), Gaps = 17/195 (8%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+ L HRD + S ++ HY R ++F +R + R ATL+ R + GA
Sbjct: 30 FTTSLFHRDSLLSPLEFSSLSHYDRLTNAF----RRSLSRSATLLNRAATNGA------- 78
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
D+ + + GSGEY + + +G+PP + D+GSD++W QC PC +CYKQS P+F
Sbjct: 79 ----LDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIF 134
Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVV 257
DP S SFS V C+S C ++++ C A G C Y +YGD +YTKG L E +TIG + V
Sbjct: 135 DPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKITIGSSSV 194
Query: 258 KNVAIGCGHKNQGMF 272
K+V IGCGH++ G F
Sbjct: 195 KSV-IGCGHESGGGF 208
>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
Length = 469
Score = 148 bits (373), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 126/381 (33%), Positives = 183/381 (48%), Gaps = 33/381 (8%)
Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSP-PRSQYMVIDSGSDIVWVQCQPCSQCYKQ 193
K + Q G + SG + + I VG+P ++ ++D S VW QC PC+
Sbjct: 70 KQQQQQLGGEAASG---AAPPLVINITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGC 126
Query: 194 SDP---VFDPADSASFSGVSCSSAVCDRLENAGCHAG----------RC-RYEVSYG-DG 238
P F P SA+FS + CSS +C + C RC Y ++YG
Sbjct: 127 LPPPATAFRPNGSATFSPLPCSSDMCLPVLRETCGRAGAAANATAGARCDSYSLTYGGSA 186
Query: 239 SYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGG 298
+ T G LA +T T G T V V GC + G F GA+G++G+G G++SL+ QL G
Sbjct: 187 ANTSGYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQL---QFG 243
Query: 299 AFSYCLVSRGTGSSGS----LVFGREALPVGA--AWVPLVRNPRAPSFYYVGLSGLGVGG 352
FSY L++ GS + FG +A+P PL+ + P FYYV L+G+ V G
Sbjct: 244 KFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGQSTPLLSSTLYPDFYYVNLTGVRVDG 303
Query: 353 MRI-PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI- 410
R+ I F L G GV++ + T VT L AY+ R A ++ G LP +G +
Sbjct: 304 NRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIG-LPAVNGSAAL 362
Query: 411 -FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSII 469
D CYN S V+VP ++ F GG + L A+N+ +D G C PS G S++
Sbjct: 363 ELDLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGG-SVL 421
Query: 470 GNIQQEGIQISFDGANGFVGF 490
G + Q G + +D G + F
Sbjct: 422 GTLLQTGTNMIYDVDAGRLTF 442
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 148 bits (373), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 121/409 (29%), Positives = 189/409 (46%), Gaps = 34/409 (8%)
Query: 95 TTNNMHYHRHQHSFH----ARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMD 150
+TN +H H + + +D +TL R + DF V +
Sbjct: 31 STNLIHIHSPSSPYKNVKAESLAKDTALESTLSRHAYLRARQQKALQPADF---VPPPLI 87
Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
+ + + +G+PP + Y+V+D+GSD+ W+QC+PC CYKQ DP+++ S S++ +
Sbjct: 88 RDKSAFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEML 147
Query: 211 CSSAVCDRLENAG--CHAGRCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIG 263
C+ C L G +G C Y+ +Y DG+ T G L+ E + V G
Sbjct: 148 CNEPPCVSLGREGQCSDSGSCLYQTAYADGARTSGLLSYEKVAFTSHYSDEDKTAQVGFG 207
Query: 264 CGHKNQGMFVGA--AGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGS-LVFG 318
CG +N G+LGLG G +SLV QL G+ +F+YC + ++G LVFG
Sbjct: 208 CGLQNLNFITSNRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNISNPNAGGFLVFG 267
Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGL--SGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
+A + P+V FYYV L GLGVG R+ I+ F G GV++D+
Sbjct: 268 -DATYLNGDMTPMV----IAEFYYVNLLGIGLGVGEPRLDINSSSFERKPDGSGGVIIDS 322
Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV---PTVSFYFS 433
G+ ++ P YE R+A V + S ++ C+ G + + PT+ Y
Sbjct: 323 GSTLSVFPPEVYEVVRNAVVDKLKKGYNISPLTSSPDCFE--GKIERDLPLFPTLVLYLE 380
Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
+L S FL D+ FC F S GLSIIG + Q+ + ++
Sbjct: 381 STGILNDRWSIFLQRYDE--LFCLGFT-SGEGLSIIGTLAQQSYKFGYN 426
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 148 bits (373), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 116/367 (31%), Positives = 175/367 (47%), Gaps = 37/367 (10%)
Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQS--DPV 197
+F DV + + + V VG PP Q ++D+GS ++W+QCQPC C PV
Sbjct: 82 NFQVDVEQAIK--TSLFLVNFSVGQPPVPQLTIMDTGSSLLWIQCQPCKHCSSDHMIHPV 139
Query: 198 FDPADSASFSGVSCSSAVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTI---- 252
F+PA S++F SC C N C + +C YE Y G+ +KG LA E LT
Sbjct: 140 FNPALSSTFVECSCDDRFCRYAPNGHCGSSNKCVYEQVYISGTGSKGVLAKERLTFTTPN 199
Query: 253 GRTVV-KNVAIGCGHKN-QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG 310
G TVV + +A GCG++N + + G+LGLG SL QLG + FSYC+
Sbjct: 200 GNTVVTQPIAFGCGYENGEQLESHFTGILGLGAKPTSLAVQLGSK----FSYCIGDLANK 255
Query: 311 SSG--SLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
+ G LV G +A +G P + S YY+ L G+ VG ++ I +F+ +
Sbjct: 256 NYGYNQLVLGEDADILGDP-TP-IEFETENSIYYMNLEGISVGDTQLNIEPVVFK-RRGP 312
Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD-TCYNLSGFVSVRV-- 425
GV++D+GT T L AY + + P+ D CY+ G VS +
Sbjct: 313 RTGVILDSGTLYTWLADIAYRELYNEIKSILD--PKLERFWFRDFLCYH--GRVSEELIG 368
Query: 426 -PTVSFYFSGGPVLTLPASNFLIPVDDAGT---FCFAFAPSPS------GLSIIGNIQQE 475
P V+F+F+GG L + A++ P+ + T FC + P+ + IG + Q+
Sbjct: 369 FPVVTFHFAGGAELAMEATSMFYPLSEPNTFNVFCMSVKPTKEHGGEYKEFTAIGLMAQQ 428
Query: 476 GIQISFD 482
I +D
Sbjct: 429 YYNIGYD 435
>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
Length = 334
Score = 148 bits (373), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 110/342 (32%), Positives = 150/342 (43%), Gaps = 57/342 (16%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+GEY ++I +G+PP Y + D+GSD++W QC PC CYKQ +P+FDP+ S SF VSC
Sbjct: 21 NGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCE 80
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
S C L+ T + N+ GCGH N G F
Sbjct: 81 SQQCRLLDTP--------------------------------TSILNIVFGCGHNNSGTF 108
Query: 273 -VGAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSS--GSLVFGREALPVGAA 327
GL G GG +SL Q+ +G FS CLV T S ++FG EA G+
Sbjct: 109 NENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSD 168
Query: 328 WV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG-VVMDTGTAVTRLP 384
V PLV P++Y+V L G+ VG P S + M G V +D GT T LP
Sbjct: 169 VVSTPLVTK-DDPTYYFVTLDGISVGDKLFPFSSS----SPMATKGNVFIDAGTPPTLLP 223
Query: 385 TPAY----EAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
Y + ++A + P CY + + P ++ +F G V
Sbjct: 224 RDFYNRLVQGVKEAIPMEPVQDPDLQP----QLCYRSATLID--GPILTAHFDGADVQLK 277
Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
P + F+ P + G +CFA P I GN Q I FD
Sbjct: 278 PLNTFISPKE--GVYCFAMQPIDGDTGIFGNFVQMNFLIGFD 317
>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
Length = 469
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 126/381 (33%), Positives = 183/381 (48%), Gaps = 33/381 (8%)
Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSP-PRSQYMVIDSGSDIVWVQCQPCSQCYKQ 193
K + Q G + SG + + I VG+P ++ ++D S VW QC PC+
Sbjct: 70 KQQQQQLGGEAASG---AAPPLVINITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGC 126
Query: 194 SDP---VFDPADSASFSGVSCSSAVCDRLENAGCHAG----------RC-RYEVSYG-DG 238
P F P SA+FS + CSS +C + C RC Y ++YG
Sbjct: 127 LPPPATAFRPNGSATFSPLPCSSDMCLPVLRETCGRAGAAANATAGARCDSYSLTYGGSA 186
Query: 239 SYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGG 298
+ T G LA +T T G T V V GC + G F GA+G++G+G G++SL+ QL G
Sbjct: 187 ANTSGYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQL---QFG 243
Query: 299 AFSYCLVSRGTGSSGS----LVFGREALPVGAAW--VPLVRNPRAPSFYYVGLSGLGVGG 352
FSY L++ GS + FG +A+P PL+ + P FYYV L+G+ V G
Sbjct: 244 KFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTGVRVDG 303
Query: 353 MRI-PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI- 410
R+ I F L G GV++ + T VT L AY+ R A ++ G LP +G +
Sbjct: 304 NRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIG-LPAVNGSAAL 362
Query: 411 -FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSII 469
D CYN S V+VP ++ F GG + L A+N+ +D G C PS G S++
Sbjct: 363 ELDLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGG-SVL 421
Query: 470 GNIQQEGIQISFDGANGFVGF 490
G + Q G + +D G + F
Sbjct: 422 GTLLQTGTNMIYDVDAGRLTF 442
>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
Length = 446
Score = 147 bits (372), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 117/360 (32%), Positives = 172/360 (47%), Gaps = 24/360 (6%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ--CYKQSDPVFDPADSASFSGVSCS 212
+Y +G PP+ +ID+GSD+VW QC C + C +Q+ P ++ + S++F+ V C+
Sbjct: 89 QYVAEYLIGDPPQRAEALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPVPCA 148
Query: 213 SAVC---DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC---GH 266
+ +C D + + A C YG G GTL E ++ +A GC
Sbjct: 149 ARICAANDDIIHFCDLAAGCSVIAGYGAG-VVAGTLGTEAFAF-QSGTAELAFGCVTFTR 206
Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--RGTGSSGSLVFGREALPV 324
QG GA+GL+GLG G +SLV Q G FSYCL G++G L G A
Sbjct: 207 IVQGALHGASGLIGLGRGRLSLVSQTGATK---FSYCLTPYFHNNGATGHLFVGASASLG 263
Query: 325 GAAWV---PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG----DDGVVMDTG 377
G V V+ P+ FYY+ L GL VG R+PI +F L ++ GV++D+G
Sbjct: 264 GHGDVMTTQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSGGVIIDSG 323
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGF-VSVRVPTVSFYFSGGP 436
+ T L AY+A A+ A D ++ V VP V F+F GG
Sbjct: 324 SPFTSLVHDAYDALASELAARLNGSLVAPPPDADDGALCVARRDVGRVVPAVVFHFRGGA 383
Query: 437 VLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ +PA ++ PVD A + P S+IGN QQ+ +++ +D ANG F P C
Sbjct: 384 DMAVPAESYWAPVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDFSFQPADC 443
>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 488
Score = 147 bits (372), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 108/339 (31%), Positives = 160/339 (47%), Gaps = 21/339 (6%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+G Y G+G+PP+ +D SD+VW C + F+P S + + V C+
Sbjct: 97 AGMYVFSYGIGTPPQQVSGALDISSDLVWTACGATAP--------FNPVRSTTVADVPCT 148
Query: 213 SAVCDRLENAGCHAG--RCRYEVSYGDGSY-TKGTLALETLTIGRTVVKNVAIGCGHKNQ 269
C + C AG C Y YG G+ T G L E T G T + V GCG KN
Sbjct: 149 DDACQQFAPQTCGAGASECAYTYMYGGGAANTTGLLGTEAFTFGDTRIDGVVFGCGLKNV 208
Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLV-FGREALPVGAAW 328
G F G +G++GLG G++SLV QL FSY + + S + FG +A P +
Sbjct: 209 GDFSGVSGVIGLGRGNLSLVSQLQVDR---FSYHFAPDDSVDTQSFILFGDDATPQTSHT 265
Query: 329 VP--LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL-TQMGDDGVVMDTGTAVTRLPT 385
+ L+ + PS YYV L+G+ V G + I F L + G GV + VT L
Sbjct: 266 LSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLVTVLEE 325
Query: 386 PAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
AY+ R A ++ G LP +G ++ D CY +VP+++ F+GG V+ L N
Sbjct: 326 AAYKPLRQAVASKIG-LPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGAVMELELGN 384
Query: 445 FLIPVDDAGTFCFAFAPSPSGL-SIIGNIQQEGIQISFD 482
+ G C PS +G S++G++ Q G + +D
Sbjct: 385 YFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYD 423
>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
Length = 453
Score = 147 bits (372), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 122/396 (30%), Positives = 181/396 (45%), Gaps = 36/396 (9%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
+QR R++ L R A Q + + +GSG+Y + G+G+P
Sbjct: 55 VQRSRSRLSMLAARAVSNAGAAPGESAQ-------TPLKKGSGDYAMSFGIGTPATGLSG 107
Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH------ 225
D+GSD++W +C C++C + P + P S+S + V+C C L C
Sbjct: 108 EADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGG 167
Query: 226 --AGRCRYEVSYGDGS----YTKGTLALETLTIG--RTVVKNVAIGCGHKNQGMFVGAAG 277
+G C Y +YG+ YT+G L ET T G +A GC +++G F +G
Sbjct: 168 SGSGNCSYHYAYGNARDTHHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSG 227
Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA-----AWVPLV 332
L+GLG G +SLV QL + AF Y L S + S + FG A G PL+
Sbjct: 228 LVGLGRGKLSLVTQLNVE---AFGYRLSSDLSAPS-PISFGSLADVTGGNGDSFMSTPLL 283
Query: 333 RNP--RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ-MGDDGVVMDTGTAVTRLPTPAYE 389
NP + FYYVGL+G+ VGG + I F + G GV+ D+GT +T LP PAY
Sbjct: 284 TNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYT 343
Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
RD ++Q G + D G + P++ +F GG + L N+L +
Sbjct: 344 LVRDELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQM 403
Query: 450 ---DDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
+ C++ S L+IIGNI Q + FD
Sbjct: 404 QGQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFD 439
>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
Length = 453
Score = 147 bits (372), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 122/396 (30%), Positives = 181/396 (45%), Gaps = 36/396 (9%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
+QR R++ L R A Q + + +GSG+Y + G+G+P
Sbjct: 55 VQRSRSRLSMLAARAVSNAGAAPGESAQ-------TPLKKGSGDYAMSFGIGTPATGLSG 107
Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH------ 225
D+GSD++W +C C++C + P + P S+S + V+C C L C
Sbjct: 108 EADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGG 167
Query: 226 --AGRCRYEVSYGDGS----YTKGTLALETLTIG--RTVVKNVAIGCGHKNQGMFVGAAG 277
+G C Y +YG+ YT+G L ET T G +A GC +++G F +G
Sbjct: 168 SGSGNCSYHYAYGNARDTHHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSG 227
Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA-----AWVPLV 332
L+GLG G +SLV QL + AF Y L S + S + FG A G PL+
Sbjct: 228 LVGLGRGKLSLVTQLNVE---AFGYRLSSDLSAPS-PISFGSLADVTGGNGDSFMSTPLL 283
Query: 333 RNP--RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ-MGDDGVVMDTGTAVTRLPTPAYE 389
NP + FYYVGL+G+ VGG + I F + G GV+ D+GT +T LP PAY
Sbjct: 284 TNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYT 343
Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
RD ++Q G + D G + P++ +F GG + L N+L +
Sbjct: 344 LVRDELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQM 403
Query: 450 ---DDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
+ C++ S L+IIGNI Q + FD
Sbjct: 404 QGQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFD 439
>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
Length = 405
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 113/365 (30%), Positives = 167/365 (45%), Gaps = 41/365 (11%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
G Y +G+PP+ V+D ++VW QC PC C++Q P+FDP S++F G+ C S
Sbjct: 55 GLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGS 114
Query: 214 AVCDRLENA--GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC---GHKN 268
+C+ + + C + C YE G T G +T IG + + GC K
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGMAGTDTFAIG-AAKETLGFGCVVMTDKR 172
Query: 269 QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA-- 326
G +G++GLG SLV Q+ AFSYCL + SSG+L G A +
Sbjct: 173 LKTIGGPSGIVGLGRTPWSLVTQMNVT---AFSYCLAGK---SSGALFLGATAKQLAGGK 226
Query: 327 -AWVPLVRNPRAPS-------FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
+ P V A S +Y V L+G+ GG + + V++DT +
Sbjct: 227 NSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKAGGAPL-------QAASSSGSTVLLDTVS 279
Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
+ L AY+A + A A G P AS +D C+ S V+ P + F F GG L
Sbjct: 280 RASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCF--SKAVAGDAPELVFTFDGGAAL 337
Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPS--------GLSIIGNIQQEGIQISFDGANGFVGF 490
T+P +N+L+ + GT C S S G SI+G++QQE + + FD + F
Sbjct: 338 TVPPANYLLASGN-GTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLSF 396
Query: 491 GPNVC 495
P C
Sbjct: 397 KPADC 401
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 115/376 (30%), Positives = 172/376 (45%), Gaps = 34/376 (9%)
Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ------PCSQCYKQS---DPVFDP 200
D G G+Y V VG+P + +V D+GSD+ W+ C+ CS + VF
Sbjct: 77 DYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHA 136
Query: 201 ADSASFSGVSCSSAVCD-------RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI- 252
S+SF + C + +C L N C Y+ Y DGS G A ET+T+
Sbjct: 137 NLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVE 196
Query: 253 ---GRTV-VKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
GR + + NV IGC QG F A G++GLG S + + GG FSYCLV
Sbjct: 197 LKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDH 256
Query: 308 GTGS--SGSLVFG----REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
+ S L FG +EAL + LV SFY V + G+ +GG + I ++
Sbjct: 257 LSHKNVSNYLTFGSSRSKEALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAMLKIPSEV 315
Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS-GVSIFDTCYNLSGF 420
+ + G G ++D+G+++T L PAY+ A + + + C+N +GF
Sbjct: 316 WDVK--GAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGF 373
Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP-SPSGLSIIGNIQQEGIQI 479
VP + F+F+ G P +++I D G C F + G S++GNI Q+
Sbjct: 374 EESLVPRLVFHFADGAEFEPPVKSYVISAAD-GVRCLGFVSVAWPGTSVVGNIMQQNHLW 432
Query: 480 SFDGANGFVGFGPNVC 495
FD +GF P+ C
Sbjct: 433 EFDLGLKKLGFAPSSC 448
>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
gi|219886805|gb|ACL53777.1| unknown [Zea mays]
gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
Length = 440
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 132/452 (29%), Positives = 198/452 (43%), Gaps = 74/452 (16%)
Query: 80 NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRL-SGGGADAAKHEV 138
LEL H D + ++ R++R +R RRL S GG A H
Sbjct: 24 RLELTHVDA--------------KEHYTVEERVRRATERTH---RRLASMGGVTAPIH-- 64
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPV 197
+G G +Y +G PP+ +ID+GS+++W QC C C++Q+ P
Sbjct: 65 --WG---------GQSQYIAEYLIGDPPQRAEAIIDTGSNLIWTQCSRCRPTCFRQNLPY 113
Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRT 255
+DP+ S + V C+ A C C + C YG G+ GTLA E LT
Sbjct: 114 YDPSRSRAARAVGCNDAACALGSETQCLSDNKTCAVVTGYGAGN-IAGTLATENLTFQSE 172
Query: 256 VVKNVAIGC---GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS------ 306
V ++ GC + G GA+G++GLG G +SL QLG FSYCL
Sbjct: 173 TV-SLVFGCIVVTKLSPGSLNGASGIIGLGRGKLSLPSQLGDTR---FSYCLTPYFEDTI 228
Query: 307 ----RGTGSSGSLVFGR-EALPVGAAWVPLVRNPRA---PSFYYVGLSGLGVGGMRIPIS 358
G+S L+ G + PV VP VR+P +FYY+ L+G+ G +++ +
Sbjct: 229 EPSHMVVGASAGLINGSASSTPVTT--VPFVRSPSDDPFSTFYYLPLTGITAGKVKLAVP 286
Query: 359 EDLFRLTQMGD---DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN--LPRASGVSIFDT 413
F L Q+ G +D+G +T L AY+A R Q G + +G + FD
Sbjct: 287 SAAFDLRQVAPGMWTGTFIDSGAPLTSLVDVAYQALRAELARQLGAALVQPLAGTTGFDL 346
Query: 414 CYNLSGFVSVRVPTVSFYFSG----GPVLTLPASNFLIPVDDAGTFCFAFAP------SP 463
C L + VP + +F G G L +P +N+ PVD A F+
Sbjct: 347 CVALKDAERL-VPPLVLHFGGGSGTGTDLVVPPANYWAPVDSATACMVVFSSVDRKSLPM 405
Query: 464 SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ ++IGN Q+ + + +D A G + F P C
Sbjct: 406 NETTVIGNYMQQNMHVLYDLAGGVLSFQPADC 437
>gi|3641868|emb|CAA09458.1| hypothetical protein [Cicer arietinum]
Length = 110
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 71/109 (65%), Positives = 81/109 (74%)
Query: 387 AYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
AYE+ RDAF T NL A GV+IFDTCY+LS SVRVPTVSF+F V LPA N+L
Sbjct: 2 AYESVRDAFKRLTQNLRSAEGVAIFDTCYDLSSLRSVRVPTVSFHFGNDRVWDLPAKNYL 61
Query: 447 IPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
IPVD GTFCFAFAP+ S LSIIGN+QQ+G ++SFD AN VGF PN C
Sbjct: 62 IPVDSDGTFCFAFAPTSSSLSIIGNVQQQGTRVSFDIANSLVGFSPNKC 110
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 119/377 (31%), Positives = 182/377 (48%), Gaps = 52/377 (13%)
Query: 146 VSGMDQG--SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVF 198
+SG D +G Y+ RI +G+PP+ Y+ +D+GSD+ WV C PC+ C + S+ +F
Sbjct: 36 ISGDDDTFTTGLYYTRIYLGTPPQQFYVHVDTGSDVAWVNCVPCTNCKRASNVALPISIF 95
Query: 199 DPADSASFSGVSCSSAVCDRLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV 256
DP S S + +SC+ C N+ C ++ C Y YGDGS T G L + L+ +
Sbjct: 96 DPEKSTSKTSISCTDEECYLASNSKCSFNSMSCPYSTLYGDGSSTAGYLINDVLSFNQVP 155
Query: 257 VKN---------VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLV 305
N + GCG G ++ GL+G G +SL QL Q + F++CL
Sbjct: 156 SGNSTATSGTARLTFGCGSNQTGTWL-TDGLVGFGQAEVSLPSQLSKQNVSVNIFAHCLQ 214
Query: 306 SRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
G SG+LV G P G + P+V P+ S Y V L +GV G + + F L+
Sbjct: 215 GDNKG-SGTLVIGHIREP-GLVYTPIV--PKQ-SHYNVELLNIGVSGTNV-TTPTAFDLS 268
Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAF----RDAFVAQTGNLPRASGVSIFDTCYNLSGFV 421
G GV+MD+GT +T L PAY+ F RD ++G LP A F + G+
Sbjct: 269 NSG--GVIMDSGTTLTYLVQPAYDQFQAKVRDCM--RSGVLPVA-----FQFFCTIEGY- 318
Query: 422 SVRVPTVSFYFSGGPVLTLPASNFL---IPVDDAGTFCFAFAPSPS-----GLSIIGNIQ 473
P V+ YF+GG + L S++L + +CF++ S S +I G+
Sbjct: 319 ---FPNVTLYFAGGAAMLLSPSSYLYKEMLTTGLSAYCFSWLESTSVYGYLSYTIFGDNV 375
Query: 474 QEGIQISFDGANGFVGF 490
+ + +D N +G+
Sbjct: 376 LKDQLVVYDNVNNRIGW 392
>gi|125555051|gb|EAZ00657.1| hypothetical protein OsI_22678 [Oryza sativa Indica Group]
Length = 435
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 112/371 (30%), Positives = 166/371 (44%), Gaps = 46/371 (12%)
Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC---SQCYKQSDPVFDPADSASFSG 208
G+ EY V G G+P + + D+ + ++C+PC + C DP F+P+ S+SF+
Sbjct: 84 GALEYRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGGAPC----DPAFEPSRSSSFAA 139
Query: 209 VSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV-KNVAIGCGH- 266
+ C S C C C + + +G+ + GTL +TLT+ + GC
Sbjct: 140 IPCGSPEC----AVECTGASCPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEV 195
Query: 267 -KNQGMFVGAAGLLGLGGGSMSLVGQL----GGQTGGAFSYCLVSRGTGSSGSLVFGREA 321
+ F GA GL+ L S SL ++ + AFSYCL S SS R
Sbjct: 196 GADADTFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSS------RGF 249
Query: 322 LPVGAA----------WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
L +GA+ + P+ NP P+ Y+V L G+ VGG +P+ +F G
Sbjct: 250 LSIGASRPEYSGGDIKYAPMSSNPNHPNSYFVELVGISVGGEDLPVPPAVF-----AAHG 304
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
+++ T T L AY A RDAF P A + DTCYNL+G S+ VPTV+
Sbjct: 305 TLLEAATEFTFLAPAAYAALRDAFRRDMAPYPAAPPFRVLDTCYNLTGLASLAVPTVALR 364
Query: 432 FSGGPVLTLPASNFLIPVDDAGTFC-------FAFAPSPSGLSIIGNIQQEGIQISFDGA 484
F+GG L L + D + F A +S+IG + Q ++ +D
Sbjct: 365 FAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLR 424
Query: 485 NGFVGFGPNVC 495
G VGF P C
Sbjct: 425 GGRVGFIPGRC 435
>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
Length = 405
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 112/365 (30%), Positives = 167/365 (45%), Gaps = 41/365 (11%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
G Y +G+PP+ V+D ++VW QC PC C++Q P+FDP S++F G+ C S
Sbjct: 55 GLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGS 114
Query: 214 AVCDRLENA--GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC---GHKN 268
+C+ + + C + C YE G T G +T IG + + GC K
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGKAGTDTFAIG-AAKETLGFGCVVMTDKR 172
Query: 269 QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA-- 326
G +G++GLG SLV Q+ AFSYCL + SSG+L G A +
Sbjct: 173 LKTIGGPSGIVGLGRTPWSLVTQMNVT---AFSYCLAGK---SSGALFLGATAKQLAGGK 226
Query: 327 -AWVPLVRNPRAPS-------FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
+ P V A S +Y V L+G+ GG + + V++DT +
Sbjct: 227 NSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPL-------QAASSSGSTVLLDTVS 279
Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
+ L AY+A + A A G P AS +D C+ + V+ P + F F GG L
Sbjct: 280 RASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFPKA--VAGDAPELVFTFDGGAAL 337
Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPS--------GLSIIGNIQQEGIQISFDGANGFVGF 490
T+P +N+L+ + GT C S S G SI+G++QQE + + FD + F
Sbjct: 338 TVPPANYLLASGN-GTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLSF 396
Query: 491 GPNVC 495
P C
Sbjct: 397 KPADC 401
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 115/376 (30%), Positives = 172/376 (45%), Gaps = 34/376 (9%)
Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ------PCSQCYKQS---DPVFDP 200
D G G+Y V VG+P + +V D+GSD+ W+ C+ CS + VF
Sbjct: 6 DYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHA 65
Query: 201 ADSASFSGVSCSSAVCD-------RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI- 252
S+SF + C + +C L N C Y+ Y DGS G A ET+T+
Sbjct: 66 NLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVE 125
Query: 253 ---GRTV-VKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
GR + + NV IGC QG F A G++GLG S + + GG FSYCLV
Sbjct: 126 LKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDH 185
Query: 308 GTGS--SGSLVFG----REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
+ S L FG +EAL + LV SFY V + G+ +GG + I ++
Sbjct: 186 LSHKNVSNYLTFGSSRSKEALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAMLKIPSEV 244
Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS-GVSIFDTCYNLSGF 420
+ + G G ++D+G+++T L PAY+ A + + + C+N +GF
Sbjct: 245 WDVK--GAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGF 302
Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP-SPSGLSIIGNIQQEGIQI 479
VP + F+F+ G P +++I D G C F + G S++GNI Q+
Sbjct: 303 EESLVPRLVFHFADGAEFEPPVKSYVISAAD-GVRCLGFVSVAWPGTSVVGNIMQQNHLW 361
Query: 480 SFDGANGFVGFGPNVC 495
FD +GF P+ C
Sbjct: 362 EFDLGLKKLGFAPSSC 377
>gi|110739922|dbj|BAF01866.1| chloroplast nucleoid DNA binding protein like [Arabidopsis
thaliana]
Length = 142
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 74/139 (53%), Positives = 97/139 (69%), Gaps = 1/139 (0%)
Query: 357 ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYN 416
++ LF+L Q+G+ GV++D+GT+VTRL PAY A RDAF L RA S+FDTC++
Sbjct: 4 VTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFD 63
Query: 417 LSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEG 476
LS V+VPTV +F G V +LPA+N+LIPVD G FCFAFA + GLSIIGNIQQ+G
Sbjct: 64 LSNMNEVKVPTVVLHFRGADV-SLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQG 122
Query: 477 IQISFDGANGFVGFGPNVC 495
++ +D A+ VGF P C
Sbjct: 123 FRVVYDLASSRVGFAPGGC 141
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 118/395 (29%), Positives = 177/395 (44%), Gaps = 49/395 (12%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP-------- 196
+ SG G G+YFVR VG+P + +V D+GSD+ WV+C+ + P
Sbjct: 86 LTSGAYTGIGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPG 145
Query: 197 -VFDPADSASFSGVSCSSAVCDR-----LENAGCHAGRCRYEVSYGDGSYTKGTLALETL 250
F P DS +++ +SC+S C + L C Y+ Y DGS +GT+ E+
Sbjct: 146 RAFRPEDSRTWAPISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESA 205
Query: 251 TIG-------RTVVKNVAIGCGHKNQG-MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
TI + +K + +GC G F + G+L LG +S + GG FSY
Sbjct: 206 TIALSGREERKAKLKGLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFGGRFSY 265
Query: 303 CLVSRGTGSSGS--LVFG--------------REALPVGAAWVPLVRNPRAPSFYYVGLS 346
CLV + + + L FG A A PL+ + R FY V L
Sbjct: 266 CLVDHLSPRNATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLK 325
Query: 347 GLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
+ V G + I ++ + G GV++D+GT++T L PAY A A LPR +
Sbjct: 326 AISVAGEFLKIPRAVWDVEAGG--GVILDSGTSLTVLAKPAYRAVVAALSKGLAGLPRVT 383
Query: 407 GVSIFDTCYNLSGF----VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFAP 461
+ F+ CYN + V VP ++ +F+G L P +++I D A G C
Sbjct: 384 -MDPFEYCYNWTSPSGKDADVAVPKMAVHFAGAARLEPPGKSYVI--DAAPGVKCIGLQE 440
Query: 462 SP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
P G+S+IGNI Q+ FD N + F + C
Sbjct: 441 GPWPGISVIGNILQQEHLWEFDIKNRRLKFQRSRC 475
>gi|54290724|dbj|BAD62394.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 523
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 111/371 (29%), Positives = 165/371 (44%), Gaps = 46/371 (12%)
Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC---SQCYKQSDPVFDPADSASFSG 208
G+ EY V G G+P + + D+ + ++C+PC + C DP F+P+ S+SF+
Sbjct: 172 GALEYRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGGAPC----DPAFEPSRSSSFAA 227
Query: 209 VSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV-KNVAIGCGH- 266
+ C S C C C + + +G+ + GTL +TLT+ + GC
Sbjct: 228 IPCGSPEC----AVECTGASCPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEV 283
Query: 267 -KNQGMFVGAAGLLGLGGGSMSLVGQL----GGQTGGAFSYCLVSRGTGSSGSLVFGREA 321
+ F GA GL+ L S SL ++ + AFSYCL S SS R
Sbjct: 284 GADADTFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSS------RGF 337
Query: 322 LPVGAA----------WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
L +GA+ + P+ NP P+ Y+V L G+ VGG +P+ +F G
Sbjct: 338 LSIGASRPEYSGGDIKYAPMSSNPNHPNSYFVDLVGISVGGEDLPVPPAVF-----AAHG 392
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
+++ T T L AY A RDAF P A + DTCYNL+G S+ VP V+
Sbjct: 393 TLLEAATEFTFLAPAAYAALRDAFRKDMAPYPAAPPFRVLDTCYNLTGLASLAVPAVALR 452
Query: 432 FSGGPVLTLPASNFLIPVDDAGTFC-------FAFAPSPSGLSIIGNIQQEGIQISFDGA 484
F+GG L L + D + F A +S+IG + Q ++ +D
Sbjct: 453 FAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLR 512
Query: 485 NGFVGFGPNVC 495
G VGF P C
Sbjct: 513 GGRVGFIPGRC 523
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 145 bits (366), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 124/450 (27%), Positives = 201/450 (44%), Gaps = 58/450 (12%)
Query: 67 ISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRL 126
ISS+ ++ +R +L+HR+ N R + + ++R + + ++ L
Sbjct: 26 ISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVEDRSKREQTSSIER-FDFLESKIKEL 84
Query: 127 SGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP 186
G +A + ++GSG + V + +GSPP +Q +V+D+GS ++WVQC P
Sbjct: 85 KSVGNEARSSLIP---------FNRGSG-FLVNLSIGSPPVTQLVVVDTGSSLLWVQCLP 134
Query: 187 CSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTL 245
C C++QS FDP S SF + C + + C+ + Y++ Y G ++G L
Sbjct: 135 CINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYKLRYLGGDSSQGIL 194
Query: 246 ALETL------------------TIGRTVVKNVAIGCGHKNQGMFVGAA--GLLGLGG-G 284
A E+L I + N+ GCGH N A G+ GLG
Sbjct: 195 AKESLLFETLDEGRVFQYNAISTQISKIKKSNITFGCGHMNIKTNNDDAYNGVFGLGAYP 254
Query: 285 SMSLVGQLGGQTGGAFSYCL--VSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSF-- 340
+++ QLG + FSYC+ ++ + LV G+ +++ P F
Sbjct: 255 HITMATQLGNK----FSYCIGDINNPLYTHNHLVLGQ------GSYIEGDSTPLQIHFGH 304
Query: 341 YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV-AQT 399
YYV L + VG + I + F+++ G GV++D+G T+L +E D V
Sbjct: 305 YYVTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMK 364
Query: 400 GNLPRASGVSIFD-TCYNLSGFVS---VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF 455
G L R F+ C+ G VS V P V+F+F+GG L L + + L F
Sbjct: 365 GLLERIPTQRKFEGLCF--KGVVSRDLVGFPAVTFHFAGGADLVLESGS-LFRQHGGDRF 421
Query: 456 CFAFAPSPS---GLSIIGNIQQEGIQISFD 482
C A PS S LS+IG + Q+ + FD
Sbjct: 422 CLAILPSNSELLNLSVIGILAQQNYNVGFD 451
>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
Length = 492
Score = 145 bits (366), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 107/343 (31%), Positives = 160/343 (46%), Gaps = 25/343 (7%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+G Y G+G+PP+ +D SD+VW C + F+P S + + V C+
Sbjct: 97 AGMYVFSYGIGTPPQQVSGALDISSDLVWTACGATAP--------FNPVRSTTVADVPCT 148
Query: 213 SAVCDRLENAGCHAG------RCRYEVSYGDGSY-TKGTLALETLTIGRTVVKNVAIGCG 265
C + C AG C Y YG G+ T G L E T G T + V GCG
Sbjct: 149 DDACQQFAPQTCGAGAGAGSSECAYTYMYGGGAANTTGLLGTEAFTFGDTRIDGVVFGCG 208
Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLV-FGREALPV 324
+N G F G +G++GLG G++SLV QL FSY + + S + FG +A P
Sbjct: 209 LQNVGDFSGVSGVIGLGRGNLSLVSQLQVDR---FSYHFAPDDSVDTQSFILFGDDATPQ 265
Query: 325 GAAWVP--LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL-TQMGDDGVVMDTGTAVT 381
+ + L+ + PS YYV L+G+ V G + I F L + G GV + VT
Sbjct: 266 TSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLVT 325
Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
L AY+ R A ++ G LP +G ++ D CY +VP+++ F+GG V+ L
Sbjct: 326 VLEEAAYKPLRQAVASKIG-LPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGAVMEL 384
Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGL-SIIGNIQQEGIQISFD 482
N+ G C PS +G S++G++ Q G + +D
Sbjct: 385 ELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYD 427
>gi|125596976|gb|EAZ36756.1| hypothetical protein OsJ_21092 [Oryza sativa Japonica Group]
Length = 435
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 111/371 (29%), Positives = 165/371 (44%), Gaps = 46/371 (12%)
Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC---SQCYKQSDPVFDPADSASFSG 208
G+ EY V G G+P + + D+ + ++C+PC + C DP F+P+ S+SF+
Sbjct: 84 GALEYRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGGAPC----DPAFEPSRSSSFAA 139
Query: 209 VSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV-KNVAIGCGH- 266
+ C S C C C + + +G+ + GTL +TLT+ + GC
Sbjct: 140 IPCGSPEC----AVECTGASCPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEV 195
Query: 267 -KNQGMFVGAAGLLGLGGGSMSLVGQL----GGQTGGAFSYCLVSRGTGSSGSLVFGREA 321
+ F GA GL+ L S SL ++ + AFSYCL S SS R
Sbjct: 196 GADADTFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSS------RGF 249
Query: 322 LPVGAA----------WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
L +GA+ + P+ NP P+ Y+V L G+ VGG +P+ +F G
Sbjct: 250 LSIGASRPEYSGGDIKYAPMSSNPNHPNSYFVDLVGISVGGEDLPVPPAVF-----AAHG 304
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
+++ T T L AY A RDAF P A + DTCYNL+G S+ VP V+
Sbjct: 305 TLLEAATEFTFLAPAAYAALRDAFRKDMAPYPAAPPFRVLDTCYNLTGLASLAVPAVALR 364
Query: 432 FSGGPVLTLPASNFLIPVDDAGTFC-------FAFAPSPSGLSIIGNIQQEGIQISFDGA 484
F+GG L L + D + F A +S+IG + Q ++ +D
Sbjct: 365 FAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLR 424
Query: 485 NGFVGFGPNVC 495
G VGF P C
Sbjct: 425 GGRVGFIPGRC 435
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 112/384 (29%), Positives = 179/384 (46%), Gaps = 45/384 (11%)
Query: 145 VVSGMDQGSGEYFVRIGVGSP-PRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP----VFD 199
+ SG D G +YFV I +G+P P+ +V D+GSD+ W+ C+ + + +P VF
Sbjct: 108 IHSGADSGQSQYFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFR 167
Query: 200 PADSASFSGVSCSSAVC-----DRLENAGCHA--GRCRYEVSYGDGSYTKGTLALETLTI 252
DS+SF + CSS C D C C ++ Y +G G A ET+T+
Sbjct: 168 ANDSSSFRTIPCSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVTV 227
Query: 253 G-----RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
G + + +V IGC G++GLG SL +L G FSYCLV
Sbjct: 228 GLNDHKKIRLFDVLIGCTESFNETNGFPDGVMGLGYRKHSLALRLAEIFGNKFSYCLVDH 287
Query: 308 GTGSSGS--LVFGREALPVGAAWVPLVRNPRAP----------SFYYVGLSGLGVGGMRI 355
+ S+ L FG +P ++ P+ +FY V +SG+ VGG +
Sbjct: 288 LSSSNHKNFLSFGD---------IPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSML 338
Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDT-- 413
IS D++ +T +G G+++D+GT++T L AY+ DA + + + +
Sbjct: 339 SISSDIWNVTGVG--GMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPELNN 396
Query: 414 -CYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGN 471
C+ GF VP + +F+ G + P +++I V + G C + G SI+GN
Sbjct: 397 FCFEDKGFDRAAVPRLLIHFADGAIFKPPVKSYIIDVAE-GIKCLGIIKADFPGSSILGN 455
Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
+ Q+ +D G +GFGP+ C
Sbjct: 456 VMQQNHLWEYDLGRGKLGFGPSSC 479
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 127/414 (30%), Positives = 192/414 (46%), Gaps = 36/414 (8%)
Query: 107 SFHARMQRDVKRVATLVRRLSG--GGADAAKHEVQD---FGTDVVSGMDQGSGEYFVRIG 161
S AR + D +R A + +L GG EV + SG G+G+YFV++
Sbjct: 37 SVTARARGDRRRHAYISAQLPSRRGGRQRVAAEVASSSAVSLPMSSGAYAGTGQYFVKVL 96
Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP--VFDPADSASFSGVSCSSAVCD-- 217
VG+P + +V D+GS++ WV+C S P VF P S S++ V CSS C
Sbjct: 97 VGTPAQEFTLVADTGSELTWVKC-----AGGASPPGLVFRPEASKSWAPVPCSSDTCKLD 151
Query: 218 ---RLENAGCHAGRCRYEVSYGDGSY-TKGTLALETLTI----GRTV-VKNVAIGCGHKN 268
L N A C Y+ Y +GS G + ++ TI G+ +++V +GC +
Sbjct: 152 VPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGGKVAQLQDVVLGCSSTH 211
Query: 269 QGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--GTGSSGSLVFGREALP-V 324
G F G+L LG +S + + GG+FSYCLV ++G L FG +P
Sbjct: 212 DGQSFKSVDGVLSLGNAKISFASRAAARFGGSFSYCLVDHLAPRNATGYLAFGPGQVPRT 271
Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
A L +P P FY V + + V G + I +++ GV++D+GT +T L
Sbjct: 272 PATQTKLFLDPAMP-FYGVKVDAVHVAGQALDIPAEVW---DPKSGGVILDSGTTLTVLA 327
Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV--SVRVPTVSFYFSGGPVLTLPA 442
TPAY+A A +P+ F+ CYN + + +P ++ F+G L PA
Sbjct: 328 TPAYKAVVAALTKLLAGVPKVD-FPPFEHCYNWTAPRPGAPEIPKLAVQFTGCARLEPPA 386
Query: 443 SNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+++I V G C G+S+IGNI Q+ FD N V F P+ C
Sbjct: 387 KSYVIDVKP-GVKCIGLQEGEWPGVSVIGNIMQQEHLWEFDLKNMEVRFMPSTC 439
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 119/409 (29%), Positives = 185/409 (45%), Gaps = 44/409 (10%)
Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
RD R A L++ GG D VQ + G+ YF R+ +G+PPR + I
Sbjct: 48 RDHLRHARLLQGFVGGVVD---FSVQGSSDPYLVGL------YFTRVKLGTPPREFNVQI 98
Query: 174 DSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVC-DRLENAGC--- 224
D+GSD++WV C CS C + S FD S++ V CS +C +++
Sbjct: 99 DTGSDVLWVTCSSCSNCPQTSGLGIQLNYFDTTSSSTARLVPCSHPICTSQIQTTATQCP 158
Query: 225 -HAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN----VAIGCGHKNQGMFV-- 273
+ +C Y YGDGS T G +T +G +++ N + GC G
Sbjct: 159 PQSNQCSYAFQYGDGSGTSGYYVSDTFYFDAVLGESLIANSSAAIVFGCSTYQSGDLTKT 218
Query: 274 --GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV 329
G+ G G G +S++ QL G T FS+CL +G S G ++ E L G +
Sbjct: 219 DKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCL--KGEDSGGGILVLGEILEPGIVYS 276
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
PLV P P Y + L + V G +PI F + + G ++DTGT + L AY+
Sbjct: 277 PLV--PSQPH-YNLDLQSIAVSGQLLPIDPAAFATS--SNRGTIIDTGTTLAYLVEEAYD 331
Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
F A A L + ++ + CY +S VS P VSF F+GG + L +L+ +
Sbjct: 332 PFVSAITAAVSQLATPT-INKGNQCYLVSNSVSEVFPPVSFNFAGGATMLLKPEEYLMYL 390
Query: 450 DD---AGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ A +C F G++I+G++ + +D A+ +G+ C
Sbjct: 391 TNYAGAALWCIGFQKIQGGITILGDLVLKDKIFVYDLAHQRIGWANYDC 439
>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 397
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 105/348 (30%), Positives = 158/348 (45%), Gaps = 28/348 (8%)
Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
+G+PP++ ID ++VW QC C C+KQ PVF P S++F C + VC +
Sbjct: 60 IGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPT 119
Query: 222 AGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC-GHKNQGMFVGAAGLLG 280
C + C Y+ G G +T G +A +T IG ++ GC + G +G +G
Sbjct: 120 PKCASDVCAYDGVTGLGGHTVGIVATDTFAIGTAAPASLGFGCVVASDIDTMGGPSGFIG 179
Query: 281 LGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-LPVGAAWVPLVR---NPR 336
LG SLV Q+ FSYCL TG + L G A L G AW P V+ N
Sbjct: 180 LGRTPWSLVAQMKLTR---FSYCLAPHDTGKNSRLFLGASAKLAGGGAWTPFVKTSPNDG 236
Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA-VTRLPTPAYEAFRDAF 395
+Y + L + G I + G + V++ T V+ L Y+ F+ A
Sbjct: 237 MSQYYPIELEEIKAGDATITMPR--------GRNTVLVQTAVVRVSLLVDSVYQEFKKAV 288
Query: 396 VAQTGNLPRASGV-SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
+A G P A+ V + F+ C+ +G P + F F G LT+P +N+L V + T
Sbjct: 289 MASVGAAPTATPVGAPFEVCFPKAGVSG--APDLVFTFQAGAALTVPPANYLFDVGN-DT 345
Query: 455 FCFAFA-------PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C + + GL+I+G+ QQE + + FD + F P C
Sbjct: 346 VCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADC 393
>gi|357125298|ref|XP_003564331.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 524
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 127/376 (33%), Positives = 167/376 (44%), Gaps = 63/376 (16%)
Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQS--DPVFDPADSASFSGVSCSSAVCDRLENAG-- 223
+Q M ID+ DI W+QC+PC + +FDP S S + V C S C L N G
Sbjct: 164 AQTMAIDTTIDIPWIQCRPCPPPQCYPQRNALFDPTKSFSAAAVPCGSRACRALGNYGNG 223
Query: 224 -----------------CHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCG 265
G C Y V+Y DG + GT + LTI T N GC
Sbjct: 224 CSNNSRRNKKKNKSKSNNSTGDCNYRVAYSDGRVSSGTYMTDILTISPGTSFLNFRFGCS 283
Query: 266 HKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG------ 318
H +G F G +G + LGGG SL+ Q G AFSYC+ +SG L G
Sbjct: 284 HGVRGSFSGETSGTMSLGGGRQSLLSQTARAYGNAFSYCVPK--PSASGFLSLGGAINDG 341
Query: 319 --REALPVGAAWVPLVRNPRA--PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
P PL+RN R P++Y V L G+ V G R+ + +F G +M
Sbjct: 342 DSDSDSPSSFVTTPLMRNARIVNPTYYVVRLQGIDVAGRRLNVPPVVF------SGGTLM 395
Query: 375 DTGTAVTRLPTPAYEAFRDAFV------------AQTGNLPRASGVSIFDTCYNLSGFVS 422
D+ VT+LP AY A R AF T + P A G I DTCY+ G +
Sbjct: 396 DSSAVVTQLPPTAYRALRLAFRNAMRGYRMNTRNGSTSSTP-AGGEMILDTCYDFEGLDN 454
Query: 423 VRVPTVSFYFSGGPVLTL-PASNFLIPVDDAGTFCFAFAPSPS--GLSIIGNIQQEGIQI 479
V VPTVS F GG V+ L P + ++ C AF P+P+ L IGN+QQ+ ++
Sbjct: 455 VTVPTVSLVFFGGAVVDLDPTTAVMM------EGCLAFVPTPADFDLGFIGNVQQQTHEV 508
Query: 480 SFDGANGFVGFGPNVC 495
+D VGF C
Sbjct: 509 LYDVGARNVGFRRGAC 524
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 101/359 (28%), Positives = 166/359 (46%), Gaps = 29/359 (8%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+G Y R+ +G+PP+ +++D+GS + +V C C QC + DP FDP S+++ + C+
Sbjct: 80 NGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCN 139
Query: 213 -SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHKN 268
+CD +C YE Y + S + G L + ++ G + + GC +
Sbjct: 140 IDCICD------SDGVQCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENME 193
Query: 269 QGMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
G A G++GLG G +SLV QL G +FS C G G++V G + P
Sbjct: 194 TGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIG-GGAMVLGGISPPS 252
Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
+ +P +Y V L + V G ++P+S +F G G V+D+GT LP
Sbjct: 253 DMIFT--YSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFD----GRYGAVLDSGTTYAYLP 306
Query: 385 TPAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSG----FVSVRVPTVSFYFSGGPVL 438
A+ AF+DA + + +L + G + D C++ +G +S + PTV F G L
Sbjct: 307 AEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQKL 366
Query: 439 TLPASNFLIPVDDA-GTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+L N+ G +C F +++G I + +D AN +GF C
Sbjct: 367 SLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRANSKIGFWKTNC 425
>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 85/233 (36%), Positives = 128/233 (54%), Gaps = 19/233 (8%)
Query: 107 SFHARMQRDVKRVATLVRRLSGGGADAAKHEV--------QDFGTDVVSGMDQGSGEYFV 158
SF + D RV TL RL+ K + + + G GSG Y+V
Sbjct: 61 SFSDVLAWDDARVKTLNSRLTRKDTRFPKSVLTKKDIRFPKSVSVPLNPGASIGSGNYYV 120
Query: 159 RIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSCSSAVCD 217
++G GSP R M++D+GS + W+QC+PC C+ Q+DP+FDP+ S ++ +SC+S+ C
Sbjct: 121 KVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSSQCS 180
Query: 218 RLENAGCH-------AGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQ 269
L +A + + C Y SYGD SY+ G L+ + LT+ + + GCG +
Sbjct: 181 SLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQTLPGFVYGCGQDSD 240
Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL 322
G+F AAG+LGLG +S++GQ+ + G AFSYCL +RG G G L G+ +L
Sbjct: 241 GLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGG--GFLSIGKASL 291
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 101/359 (28%), Positives = 166/359 (46%), Gaps = 29/359 (8%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+G Y R+ +G+PP+ +++D+GS + +V C C QC + DP FDP S+++ + C+
Sbjct: 80 NGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCN 139
Query: 213 -SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHKN 268
+CD +C YE Y + S + G L + ++ G + + GC +
Sbjct: 140 IDCICD------SDGVQCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENME 193
Query: 269 QGMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
G A G++GLG G +SLV QL G +FS C G G++V G + P
Sbjct: 194 TGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIG-GGAMVLGGISPPS 252
Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
+ +P +Y V L + V G ++P+S +F G G V+D+GT LP
Sbjct: 253 DMIFT--YSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFD----GRYGAVLDSGTTYAYLP 306
Query: 385 TPAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSG----FVSVRVPTVSFYFSGGPVL 438
A+ AF+DA + + +L + G + D C++ +G +S + PTV F G L
Sbjct: 307 AEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQKL 366
Query: 439 TLPASNFLIPVDDA-GTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+L N+ G +C F +++G I + +D AN +GF C
Sbjct: 367 SLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRANSKIGFWKTNC 425
>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 108/369 (29%), Positives = 179/369 (48%), Gaps = 35/369 (9%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC-YKQS----------DPVFDPA 201
G Y R+ +G+PP +++D+GS + +V C C+ C + Q+ DP F P
Sbjct: 37 KGYYTSRVFIGTPPNEFALIVDTGSTVTYVPCSSCTHCGHHQASFSTHRLFCRDPRFKPE 96
Query: 202 DSASFSGVSCSSAVCDR-LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVV 257
+S+S+ + C S+ C L ++ H +C+YE Y + S +KG L + L G R
Sbjct: 97 NSSSYQKIGCRSSDCITGLCDSNSH--QCKYERMYAEMSTSKGVLGKDLLDFGPASRLQS 154
Query: 258 KNVAIGCGHKNQG-MFVGAA-GLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSG 313
+ ++ GC G +++ A G++GLG G +S+V QL G +FS C G G
Sbjct: 155 QLLSFGCETAESGDLYLQVADGIMGLGRGPLSIVDQLVGNGAIEDSFSLCYGGMDEGG-G 213
Query: 314 SLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVV 373
S+V G A+P + V +PR ++Y + L+ + V G + + ++F G G +
Sbjct: 214 SMVLG--AIPAPSGMVFAKSDPRRSNYYNLELTEIQVQGASLKLDSNVFN----GKFGTI 267
Query: 374 MDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGFVSVRV----PT 427
+D+GT LP A+EAF DA VAQ G+L G + D CY +G + + P
Sbjct: 268 LDSGTTYAYLPDRAFEAFTDAVVAQLGSLQAVDGPDPNYPDICYAGAGTDTKELGKHFPL 327
Query: 428 VSFYFSGGPVLTLPASNFLIP-VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
V F F+ ++L N+L G +C F + +++G I + +++D N
Sbjct: 328 VDFVFAENQKVSLAPENYLFKHTKVPGAYCLGFFKNQDATTLLGGIIVRNMLVTYDRYNH 387
Query: 487 FVGFGPNVC 495
+GF C
Sbjct: 388 QIGFLKTNC 396
>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
Length = 332
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 107/360 (29%), Positives = 168/360 (46%), Gaps = 46/360 (12%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
G Y+ I +GSPP+ +V+D+GSD+ WV+C PCS FD S ++ ++C+
Sbjct: 1 GVYYSTITLGSPPKDFSLVMDTGSDLTWVRCDPCS---PDCSSTFDRLASNTYKALTCAD 57
Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNV------AIGCGHK 267
Y YGDGS+T+G L+++TL + + GCG
Sbjct: 58 ----------------DYSYGYGDGSFTQGDLSVDTLKMAGAASDELEEFPGFVFGCGSL 101
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS---GSLVFGREAL-- 322
+G+ G G+L L GS+S Q+G + G FSYCL+ + +S +VFG A+
Sbjct: 102 LKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFGEAAVEL 161
Query: 323 --PVGAAWVPLVRNPRAPS--FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
P L P S +Y V L G+ VG R+ +S F Q D + D+GT
Sbjct: 162 KEPGSGKLQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSAFLNGQ--DKPTIFDSGT 219
Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSVRVPTVSFYFSGGP 436
+T LP ++ + + + A V+I D C+ + +P ++F+F+GG
Sbjct: 220 TLTMLPPGVCDSIKQSLASMVSG---AEFVAIKGLDACFRVPPSSGQGLPDITFHFNGGA 276
Query: 437 VLTLPASNFLIPVDDAGTF-CFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
SN++I D G+ C F P+ + +SI GN+QQ+ + D N +GF C
Sbjct: 277 DFVTRPSNYVI---DLGSLQCLIFVPT-NEVSIFGNLQQQDFFVLHDMDNRRIGFKETDC 332
>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 439
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 111/358 (31%), Positives = 167/358 (46%), Gaps = 62/358 (17%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
G + V + G+PP++ +++D+GS I W QC+ C+
Sbjct: 126 GNFLVDVAFGTPPQNFTLILDTGSSITWTQCKACT------------------------- 160
Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMF 272
+EN Y ++YGD S + G +T+T+ + V + G G N+G F
Sbjct: 161 -----VEN--------NYNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGRGRNNKGDF 207
Query: 273 -VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA--WV 329
G G+LGLG G +S V Q + FSYCL S GSL+FG +A ++ +
Sbjct: 208 GSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEED--SIGSLLFGEKATSQSSSLKFT 265
Query: 330 PLVRNP---RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
LV P + +Y+V LS + VG R+ I +F G ++D+ T +TRLP
Sbjct: 266 SLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVF-----ASPGTIIDSRTVITRLPQR 320
Query: 387 AYEAFRDAFVAQTGNLPRASGV----SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPA 442
AY A + AF P ++G I DTCYNLSG V +P + +F GG + L
Sbjct: 321 AYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGGADVRLNG 380
Query: 443 SNFLIPVDDAGTFCFAFAPSPSG-----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+N + D++ C AFA + L+IIGN QQ + + +D G +GF N C
Sbjct: 381 TNIVWGSDES-RLCLAFAGNSKSTMNPELTIIGNRQQLSLTVLYDIQGGRIGFRSNGC 437
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 129/414 (31%), Positives = 184/414 (44%), Gaps = 50/414 (12%)
Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGS------GEYFVRIGVGSPPR 167
RD R A R L GGG ++ V DF QGS G YF ++ +GSPP
Sbjct: 62 RDRVRHA---RILLGGGRQSSVGGVVDFPV-------QGSSDPYLVGLYFTKVKLGSPPT 111
Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCDRL--- 219
+ ID+GSDI+WV C CS C S FD S + V+CS +C +
Sbjct: 112 EFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQT 171
Query: 220 ENAGC-HAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN----VAIGCGHKNQG 270
A C +C Y YGDGS T G +T +G ++V N + GC G
Sbjct: 172 TAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSG 231
Query: 271 MFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
G+ G G G +S+V QL G T FS+CL +G GS G + E L
Sbjct: 232 DLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL--KGDGSGGGVFVLGEILVP 289
Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
G + PLV P P Y + L +GV G +P+ +F + G ++DTGT +T L
Sbjct: 290 GMVYSPLV--PSQPH-YNLNLLSIGVNGQMLPLDAAVFEASNT--RGTIVDTGTTLTYLV 344
Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
AY+ F +A L +S + CY +S +S P+VS F+GG + L +
Sbjct: 345 KEAYDLFLNAISNSVSQLVTPI-ISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQD 403
Query: 445 FLIP---VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+L D A +C F +P +I+G++ + +D A +G+ C
Sbjct: 404 YLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDC 457
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 129/414 (31%), Positives = 184/414 (44%), Gaps = 50/414 (12%)
Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGS------GEYFVRIGVGSPPR 167
RD R A R L GGG ++ V DF QGS G YF ++ +GSPP
Sbjct: 62 RDRVRHA---RILLGGGRQSSVGGVVDFPV-------QGSSDPYLVGLYFTKVKLGSPPT 111
Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCDRL--- 219
+ ID+GSDI+WV C CS C S FD S + V+CS +C +
Sbjct: 112 EFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQT 171
Query: 220 ENAGC-HAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN----VAIGCGHKNQG 270
A C +C Y YGDGS T G +T +G ++V N + GC G
Sbjct: 172 TAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSG 231
Query: 271 MFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
G+ G G G +S+V QL G T FS+CL +G GS G + E L
Sbjct: 232 DLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL--KGDGSGGGVFVLGEILVP 289
Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
G + PLV P P Y + L +GV G +P+ +F + G ++DTGT +T L
Sbjct: 290 GMVYSPLV--PSQPH-YNLNLLSIGVNGQMLPLDAAVFEASNT--RGTIVDTGTTLTYLV 344
Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
AY+ F +A L +S + CY +S +S P+VS F+GG + L +
Sbjct: 345 KEAYDLFLNAISNSVSQLVTPI-ISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQD 403
Query: 445 FLI---PVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+L D A +C F +P +I+G++ + +D A +G+ C
Sbjct: 404 YLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDC 457
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 110/345 (31%), Positives = 162/345 (46%), Gaps = 30/345 (8%)
Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPAD-SASFSGVSCSSAVCDRLE 220
+G+PP + +++G++++W P +C++Q+ P F+P S SC S
Sbjct: 1 MGTPPNPVKLKLENGNELIWNHSNPSPECFEQAFPYFEPLTFSRGLPFASCGSP--KFWP 58
Query: 221 NAGCHAGRCRYEVSYGDGSYTKGTLALETLTI--GRTVVKNVAIGCGHKNQGMFV-GAAG 277
N C Y SYGD S T G L ++ T V VA GCG N G+F G
Sbjct: 59 NQ-----TCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGLFNNGVFKSNETG 113
Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCL--VSRGTGSSGSLVFGREALPVGAAWV---PLV 332
+ G G G +SL QL G FS+C ++ S+ L + G V PL+
Sbjct: 114 IAGFGRGPLSLPSQL---KVGNFSHCFTTITGAIPSTVLLDLPADLFSNGQGAVQTTPLI 170
Query: 333 ---RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
+N P+ YY+ L G+ VG R+P+ E F LT G G ++D+GT++T LP Y+
Sbjct: 171 QYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTN-GTGGTIIDSGTSITSLPPQVYQ 229
Query: 390 AFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIP 448
RD F AQ LP G + TC++ VP + +F G + LP N++
Sbjct: 230 VVRDEFAAQI-KLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFEGA-TMDLPRENYVFE 287
Query: 449 V-DDAGT--FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
V DDAG C A +IIGN QQ+ + + +D N + F
Sbjct: 288 VPDDAGNSIICLAINKG-DETTIIGNFQQQNMHVLYDLQNNMLSF 331
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 127/374 (33%), Positives = 179/374 (47%), Gaps = 34/374 (9%)
Query: 144 DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP-CSQCYKQSDPV--FDP 200
DVVS + S EY + + +GSPPRS + D+GSD+VWV+C+ + + P FDP
Sbjct: 89 DVVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDP 148
Query: 201 ADSASFSGVSCSSAVCDRLENAGCHAG-RCRYEVSYGDGSYTKGTLALETLTI-----GR 254
+ S+++ VSC + C+ L A C G C Y +YGDGS T G L+ ET T GR
Sbjct: 149 SRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGSGR 208
Query: 255 TV----VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT--GGAFSYCLVSRG 308
+ V V GC G F A GL+GLGGG++SLV QLGG T G FSYCLV
Sbjct: 209 SPRQVRVGGVKFGCSTATAGSFP-ADGLVGLGGGAVSLVTQLGGATSLGRRFSYCLVPHS 267
Query: 309 TGSSGSLVFG--REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
+S +L FG + GAA PLV ++Y V L + VG +
Sbjct: 268 VNASSALNFGALADVTEPGAASTPLVAG-DVDTYYTVVLDSVKVGNKTV---------AS 317
Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGF---VSV 423
+++D+GT +T L D + P S + CYN++G
Sbjct: 318 AASSRIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGE 377
Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISF 481
+P ++ F GG + L N + V + GT C A + +SI+GN+ Q+ I + +
Sbjct: 378 SIPDLTLEFGGGAAVALKPENAFVAVQE-GTLCLAIVATTEQQPVSILGNLAQQNIHVGY 436
Query: 482 DGANGFVGFGPNVC 495
D G V F C
Sbjct: 437 DLDAGTVTFAGADC 450
>gi|242041951|ref|XP_002468370.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
gi|241922224|gb|EER95368.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
Length = 408
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 107/353 (30%), Positives = 149/353 (42%), Gaps = 27/353 (7%)
Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
Q Y VR G+G+P + + +D+ +D W C PC C S F PA S+S++ +
Sbjct: 74 QTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLP 131
Query: 211 CSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKN---VAIGCGHK 267
C+S C R G+ + L ++ A CG
Sbjct: 132 CASDWCPLF----------RRPAVPGEPGRVGAAADVRLLQAASRTPRSGVLAATRCGWA 181
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGA 326
G MSL+ Q G + G FSYCL S R SGSL G P
Sbjct: 182 RTPS-------PATRSGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQPRNV 234
Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
+ PL+ NP PS YYV ++GL VG + F G V+D+GT +TR P
Sbjct: 235 RYTPLLTNPHRPSLYYVNVTGLSVGRALVKAPAGSFAFDPSTGAGTVIDSGTVITRWTAP 294
Query: 387 AYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
Y A RD F Q + + FDTC+N + P V+ + GG LTLP N L
Sbjct: 295 VYAALRDEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMGGGVDLTLPMENTL 354
Query: 447 IPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
I C A A +P S ++++ N+QQ+ +++ D A VGF C
Sbjct: 355 IHSSATPLACLAMAEAPQNVNSVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 407
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 112/361 (31%), Positives = 175/361 (48%), Gaps = 27/361 (7%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC-YKQS--DPVFDPADSASFSGV 209
G Y R+ +G+P + +++D+GS + +V C C+ C + Q+ DP F P +S+S+ V
Sbjct: 96 KGYYTSRVFIGTPAQEFALIVDTGSTVTYVPCSSCTHCGHHQACFDPRFKPDNSSSYQTV 155
Query: 210 SCSSAVC-DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCG 265
SC+S C ++ +A H +C+YE Y + S +KG L + L G R + GC
Sbjct: 156 SCNSPDCITKMCDARVH--QCKYERVYAEMSSSKGVLGKDLLGFGNGSRLQPHPLLFGCE 213
Query: 266 HKNQG--MFVGAAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREA 321
G A G++GLG G +S+V QL G +FS C G GS+V G A
Sbjct: 214 TAETGDLYLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYGGMDEGG-GSMVLG--A 270
Query: 322 LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
+P A V +P ++Y + LS + V G+ + + ++F G G V+D+GT
Sbjct: 271 IPPPPAMVFAKSDPNRSNYYNLELSEIQVQGVSLNVPSEVFN----GRLGTVLDSGTTYA 326
Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGFVSVRV----PTVSFYFSGG 435
LP A++AF+DA Q G+L G S D C+ +G S + P V F FSG
Sbjct: 327 YLPDKAFDAFKDAITQQLGSLQAVPGPDPSYPDVCFAGAGSDSKALGKHFPPVDFVFSGN 386
Query: 436 PVLTLPASNFLIP-VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNV 494
+ L N+L G +C F + +++G I +++D AN +GF
Sbjct: 387 QKVFLAPENYLFKHTKVPGAYCLGFFKNQDATTLLGGIVVRNTLVTYDRANHQIGFFKTN 446
Query: 495 C 495
C
Sbjct: 447 C 447
>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
Length = 508
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 113/344 (32%), Positives = 158/344 (45%), Gaps = 21/344 (6%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC-----YKQSDPVFDPADSASFS 207
+G Y + VG+PP+ V+D SD VW+QC C+ C S P F S++
Sbjct: 94 TGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIR 153
Query: 208 GVSCSSAVCDRLENAGCHA--GRCRYEVSYGDGS--YTKGTLALETLTIGRTVVKNVAIG 263
V C++ C RL C A C Y YG G+ T G LA++ V G
Sbjct: 154 EVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRADGVIFG 213
Query: 264 CGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLV-FGREAL 322
C +G G++GLG G +SLV QL G FSY L GS + F +A
Sbjct: 214 CAVATEGDI---GGVIGLGRGELSLVSQL---QIGRFSYYLAPDDAVDVGSFILFLDDAK 267
Query: 323 PVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
P + V PLV N + S YYV L+G+ V G + I F L G GVV+ V
Sbjct: 268 PRTSRAVSTPLVANRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSITIPV 327
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
T L AY+ R A ++ G L A G + D CY + +VP+++ F+GG V+
Sbjct: 328 TFLDAGAYKVVRQAMASKIG-LRAADGSELGLDLCYTSESLATAKVPSMALVFAGGAVME 386
Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGL-SIIGNIQQEGIQISFD 482
L N+ G C PSP+G S++G++ Q G + +D
Sbjct: 387 LEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTHMIYD 430
>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
Length = 367
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 104/348 (29%), Positives = 157/348 (45%), Gaps = 28/348 (8%)
Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
+G+PP++ ID ++VW QC C C+KQ PVF P S++F C + VC +
Sbjct: 30 IGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPT 89
Query: 222 AGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC-GHKNQGMFVGAAGLLG 280
C + C ++ G G +T G +A +T IG ++ GC + G +G +G
Sbjct: 90 PKCASDVCAFDGVTGLGGHTVGIVATDTFAIGTAAPASLGFGCVVASDIDTMGGPSGFIG 149
Query: 281 LGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-LPVGAAWVPLVR---NPR 336
LG SLV Q+ FSYCL TG + L G A L G AW P V+ N
Sbjct: 150 LGRTPWSLVAQMKLTR---FSYCLAPHDTGKNSRLFLGASAKLAGGGAWTPFVKTSPNDG 206
Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA-VTRLPTPAYEAFRDAF 395
+Y + L + G I + G + V++ T V+ L Y+ F+ A
Sbjct: 207 MSQYYPIELEEIKAGDATITMPR--------GRNTVLVQTAVVRVSLLVDSVYQEFKKAV 258
Query: 396 VAQTGNLPRASGV-SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
+A G P A+ V F+ C+ +G P + F F G LT+P +N+L V + T
Sbjct: 259 MASVGAAPTATPVGEPFEVCFPKAGVSG--APDLVFTFQAGAALTVPPANYLFDVGN-DT 315
Query: 455 FCFAFA-------PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C + + GL+I+G+ QQE + + FD + F P C
Sbjct: 316 VCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADC 363
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 141 bits (356), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 118/415 (28%), Positives = 188/415 (45%), Gaps = 47/415 (11%)
Query: 113 QRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMV 172
+RD R RRL GG A V+ + G+ YF R+ +G+P + ++
Sbjct: 54 RRDAARHRVSRRRLLGGVAGVVDFPVEGSANPYMVGL------YFTRVKLGNPAKEFFVQ 107
Query: 173 IDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCDR--------L 219
ID+GSDI+WV C PC+ C S F+P S++ S ++CS C
Sbjct: 108 IDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRITCSDDRCTAGFQTGEAIC 167
Query: 220 ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKN---------VAIGCGHKNQG 270
+ + + C Y +YGDGS T G +T+ TV+ N + GC + G
Sbjct: 168 QTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFF-ETVMGNEQTANSSASIVFGCSNSQSG 226
Query: 271 MFVGA----AGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
A G+ G G +S++ QL G + FS+CL +G+ + G ++ E +
Sbjct: 227 DLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL--KGSDNGGGILVLGEIVEP 284
Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
G + PLV P P Y + L + V G ++PI LF T G ++D+GT + L
Sbjct: 285 GLVYTPLV--PSQP-HYNLNLESIAVNGQKLPIDSSLF--TTSNTQGTIVDSGTTLAYLA 339
Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
AY+ F A A R S VS C+ S V PTV+ YF GG +++ N
Sbjct: 340 DGAYDPFVSAIAAAVSPSVR-SLVSKGSQCFITSSSVDSSFPTVTLYFMGGVAMSVKPEN 398
Query: 445 FLI---PVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+L+ VD++ +C + + ++I+G++ + +D AN +G+ C
Sbjct: 399 YLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWADYDC 453
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 141 bits (356), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 118/415 (28%), Positives = 188/415 (45%), Gaps = 47/415 (11%)
Query: 113 QRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMV 172
+RD R RRL GG A V+ + G+ YF R+ +G+P + ++
Sbjct: 52 RRDAARHRVSRRRLLGGVAGVVDFPVEGSANPYMVGL------YFTRVKLGNPAKEFFVQ 105
Query: 173 IDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCDR--------L 219
ID+GSDI+WV C PC+ C S F+P S++ S ++CS C
Sbjct: 106 IDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRITCSDDRCTAGFQTGEAIC 165
Query: 220 ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKN---------VAIGCGHKNQG 270
+ + + C Y +YGDGS T G +T+ TV+ N + GC + G
Sbjct: 166 QTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFF-ETVMGNEQTANSSASIVFGCSNSQSG 224
Query: 271 MFVGA----AGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
A G+ G G +S++ QL G + FS+CL +G+ + G ++ E +
Sbjct: 225 DLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL--KGSDNGGGILVLGEIVEP 282
Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
G + PLV P P Y + L + V G ++PI LF T G ++D+GT + L
Sbjct: 283 GLVYTPLV--PSQP-HYNLNLESIAVNGQKLPIDSSLF--TTSNTQGTIVDSGTTLAYLA 337
Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
AY+ F A A R S VS C+ S V PTV+ YF GG +++ N
Sbjct: 338 DGAYDPFVSAIAAAVSPSVR-SLVSKGSQCFITSSSVDSSFPTVTLYFMGGVAMSVKPEN 396
Query: 445 FLI---PVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+L+ VD++ +C + + ++I+G++ + +D AN +G+ C
Sbjct: 397 YLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWADYDC 451
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 141 bits (355), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 131/417 (31%), Positives = 187/417 (44%), Gaps = 56/417 (13%)
Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGS------GEYFVRIGVGSPPR 167
RD R A R L GGG ++ V DF QGS G YF ++ +GSPP
Sbjct: 62 RDRVRHA---RILLGGGRQSSVGGVVDFPV-------QGSSDPYLVGLYFTKVKLGSPPT 111
Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCDRL--- 219
+ ID+GSDI+WV C CS C S FD S + V+CS +C +
Sbjct: 112 EFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGSVTCSDPICSSVFQT 171
Query: 220 ENAGC-HAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN----VAIGCGHKNQG 270
A C +C Y YGDGS T G +T +G ++V N + GC G
Sbjct: 172 TAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSG 231
Query: 271 MFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
G+ G G G +S+V QL G T FS+CL +G GS G + E L
Sbjct: 232 DLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL--KGDGSGGGVFVLGEILVP 289
Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
G + PL+ P P Y + L +GV G +PI +F + G ++DTGT +T L
Sbjct: 290 GMVYSPLL--PSQPH-YNLNLLSIGVNGQILPIDAAVFEASNT--RGTIVDTGTTLTYLV 344
Query: 385 TPAYEAFRDAF---VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
AY+ F +A V+Q L ++G + CY +S +S P VS F+GG + L
Sbjct: 345 KEAYDPFLNAISNSVSQLVTLIISNG----EQCYLVSTSISDMFPPVSLNFAGGASMMLR 400
Query: 442 ASNFLIP---VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
++L D A +C F +P +I+G++ + +D A +G+ C
Sbjct: 401 PQDYLFHYGFYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWANYDC 457
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 141 bits (355), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 114/375 (30%), Positives = 167/375 (44%), Gaps = 42/375 (11%)
Query: 144 DVVSGMDQ--GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPA 201
D+VS + + I +G PP Q ++ID+GSD+ W+QC PC +CY Q+ P F P+
Sbjct: 74 DIVSHVTPIPNPAAFLANISIGDPPVPQLLLIDTGSDLTWIQCLPC-KCYPQTIPFFHPS 132
Query: 202 DSASFSGVSCSSAV-----CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI---- 252
S+++ SC SA R E G CRY + Y D S T+G LA E LT
Sbjct: 133 RSSTYRNASCESAPHAMPQIFRDEK----TGNCRYHLRYRDFSNTRGILAKEKLTFQTSD 188
Query: 253 -GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--RGT 309
G N+ GCG N G F +G+LGLG G+ S+V + G FSYC S T
Sbjct: 189 EGLISKPNIVFGCGQDNSG-FTQYSGVLGLGPGTFSIVTR---NFGSKFSYCFGSLIDPT 244
Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
L+ G A G + R YY+ L + +G + I +F+ +
Sbjct: 245 YPHNFLILGNGARIEGDPTPLQIFQDR----YYLDLQAISLGEKLLDIEPGIFQRYR-SK 299
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR--ASGVSIFDTCY------NLSGFV 421
G V+DTG + T L AYE + G + R + CY +L GF
Sbjct: 300 GGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLYGF- 358
Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQIS 480
P V+F+F+GG L L + + + +FC A + +S+IG + Q+ +
Sbjct: 359 ----PVVTFHFAGGAELALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVG 414
Query: 481 FDGANGFVGFGPNVC 495
++ V F C
Sbjct: 415 YNLRTMKVYFQRTDC 429
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 111/366 (30%), Positives = 172/366 (46%), Gaps = 37/366 (10%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
G YF R+ +GSPP+ ++ ID+GSDI+WV C PC+ C S F+P S++ S
Sbjct: 89 GLYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSK 148
Query: 209 VSCSSAVCD---RLENAGCHAGR---CRYEVSYGDGSYTKGTLALETL----TIGRTVVK 258
+ CS C + A C C Y +YGDGS T G +T+ +G
Sbjct: 149 IPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQTA 208
Query: 259 N----VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRG 308
N + GC + G G+ G G +S+V QL G + FS+CL +G
Sbjct: 209 NSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL--KG 266
Query: 309 TGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
+ + G ++ E + G + PLV P P Y + L + V G ++PI LF T
Sbjct: 267 SDNGGGILVLGEIVEPGLVYTPLV--PSQP-HYNLNLESIVVNGQKLPIDSSLF--TTSN 321
Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
G ++D+GT + L AY+ F +A A R S VS + C+ S V PTV
Sbjct: 322 TQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVR-SLVSKGNQCFVTSSSVDSSFPTV 380
Query: 429 SFYFSGGPVLTLPASNFLI---PVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGA 484
S YF GG +T+ N+L+ +D+ +C + + ++I+G++ + +D A
Sbjct: 381 SLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLA 440
Query: 485 NGFVGF 490
N +G+
Sbjct: 441 NMRMGW 446
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 111/366 (30%), Positives = 172/366 (46%), Gaps = 37/366 (10%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
G YF R+ +GSPP+ ++ ID+GSDI+WV C PC+ C S F+P S++ S
Sbjct: 89 GLYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSK 148
Query: 209 VSCSSAVCD---RLENAGCHAGR---CRYEVSYGDGSYTKGTLALETL----TIGRTVVK 258
+ CS C + A C C Y +YGDGS T G +T+ +G
Sbjct: 149 IPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTA 208
Query: 259 N----VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRG 308
N + GC + G G+ G G +S+V QL G + FS+CL +G
Sbjct: 209 NSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL--KG 266
Query: 309 TGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
+ + G ++ E + G + PLV P P Y + L + V G ++PI LF T
Sbjct: 267 SDNGGGILVLGEIVEPGLVYTPLV--PSQP-HYNLNLESIVVNGQKLPIDSSLF--TTSN 321
Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
G ++D+GT + L AY+ F +A A R S VS + C+ S V PTV
Sbjct: 322 TQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVR-SLVSKGNQCFVTSSSVDSSFPTV 380
Query: 429 SFYFSGGPVLTLPASNFLI---PVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGA 484
S YF GG +T+ N+L+ +D+ +C + + ++I+G++ + +D A
Sbjct: 381 SLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLA 440
Query: 485 NGFVGF 490
N +G+
Sbjct: 441 NMRMGW 446
>gi|413948408|gb|AFW81057.1| hypothetical protein ZEAMMB73_038743 [Zea mays]
Length = 469
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 126/418 (30%), Positives = 193/418 (46%), Gaps = 35/418 (8%)
Query: 107 SFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPP 166
S R + D +R A + +L+ AA F + SG G+G+YFVR VG+P
Sbjct: 56 SLGERARDDARRHAYIRSQLASRRRRAADVGASAFAMPLSSGAYTGTGQYFVRFRVGTPA 115
Query: 167 RSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV--FDPADSASFSGVSCSSAVCD-----RL 219
+ +V D+GSD+ WV+C+ + P F ++S S++ ++CSS C L
Sbjct: 116 QPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRASESRSWAPLACSSDTCTSYVPFSL 175
Query: 220 ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---------------RTVVKNVAIGC 264
N A C Y+ Y DGS +G + + TI R ++ V +GC
Sbjct: 176 ANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSEDGSGGGGRRAKLQGVVLGC 235
Query: 265 GHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS----RGTGSSGSLVFGR 319
G F + G+L LG ++S + + GG FSYCLV R S + G
Sbjct: 236 TATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNASSYLTFGPGP 295
Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
E AA PLV + R FY V + + V G + I D++ + + G G ++D+GT+
Sbjct: 296 EGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIPADVWDVGRGG--GAILDSGTS 353
Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
+T L TPAY A A + LPR + + F+ CYN + +P + F+G L
Sbjct: 354 LTVLATPAYRAVVAALGGRLAALPRVA-MDPFEYCYNWTAGAP-EIPKLEVSFAGSARLE 411
Query: 440 LPASNFLIPVDDA-GTFCFAFAP-SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
PA +++I D A G C + G+S+IGNI Q+ FD + ++ F C
Sbjct: 412 PPAKSYVI--DAAPGVKCIGVQEGAWPGVSVIGNILQQEHLWEFDLRDRWLRFKHTRC 467
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 122/422 (28%), Positives = 185/422 (43%), Gaps = 47/422 (11%)
Query: 102 HRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIG 161
+ H H RD R A L++ GG D VQ + G+ YF ++
Sbjct: 21 NNHGLELHQLRARDRLRHARLLQGFVGGVVD---FSVQGSSDPYLVGL------YFTKVK 71
Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVC 216
+GSPPR + ID+GSD++WV C C+ C + S FD + S++ V CS +C
Sbjct: 72 LGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQVRCSDPIC 131
Query: 217 DR-----LENAGCHAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN----VAIG 263
+C Y YGDGS T G +TL +G++++ N + G
Sbjct: 132 TSAVQTTATQCSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLIDNSSALIVFG 191
Query: 264 CGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVF 317
C G G+ G G G +S++ QL G T FS+CL +G GS G ++
Sbjct: 192 CSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCL--KGDGSGGGILV 249
Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
E L G + PLV P P Y + L + V G +PI F + G ++D+G
Sbjct: 250 LGEILEPGIVYSPLV--PSQPH-YNLNLLSIAVNGQLLPIDPAAFATSN--SQGTIVDSG 304
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGV-SIFDTCYNLSGFVSVRVPTVSFYFSGGP 436
T + L AY+ F A A P + + S + CY +S VS P SF F+GG
Sbjct: 305 TTLAYLVAEAYDPFVSAVNAIVS--PSVTPITSKGNQCYLVSTSVSQMFPLASFNFAGGA 362
Query: 437 VLTLPASNFLIPVDDAG---TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
+ L ++LIP +G +C F G++I+G++ + +D +G+
Sbjct: 363 SMVLKPEDYLIPFGSSGGSAMWCIGFQ-KVQGVTILGDLVLKDKIFVYDLVRQRIGWANY 421
Query: 494 VC 495
C
Sbjct: 422 DC 423
>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
Length = 293
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 78/198 (39%), Positives = 112/198 (56%), Gaps = 5/198 (2%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
++RD RV ++ +LS AD + + +G+ GS Y V IG+G+P +
Sbjct: 91 LRRDEARVESIHSKLSKNIADEVS-KAKSTKLPAKNGIILGSPNYIVTIGIGTPKHDISL 149
Query: 172 VIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
+ D+GSD+ W QC+PC CY Q +P F+P+ S+S+ VSCSS +C E+ C A C
Sbjct: 150 MFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSSYHNVSCSSPMCGNPES--CSASNCL 207
Query: 231 YEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLV 289
Y + YGDGS T G LA E T+ + V+ ++ GCG N+G+F+G+AG+LGLG G S
Sbjct: 208 YGIGYGDGSVTVGFLAKEKFTLTNSDVLDDIYFGCGENNKGVFIGSAGILGLGPGKFSFP 267
Query: 290 GQLGGQTGGAFSYCLVSR 307
Q FSYC R
Sbjct: 268 LQTTTTYNNIFSYCCGCR 285
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 126/412 (30%), Positives = 184/412 (44%), Gaps = 41/412 (9%)
Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDF----GTDVVSGMDQGSGEYFVRIGVGSPPRSQ 169
RD R A R L GGG ++ V DF +D + + YF ++ +GSPP
Sbjct: 62 RDRVRHA---RILLGGGRQSSVGGVVDFPVQGSSDPYLVGSKMTMLYFTKVKLGSPPTEF 118
Query: 170 YMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCDRL---EN 221
+ ID+GSDI+WV C CS C S FD S + V+CS +C +
Sbjct: 119 NVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQTTA 178
Query: 222 AGC-HAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN----VAIGCGHKNQGMF 272
A C +C Y YGDGS T G +T +G ++V N + GC G
Sbjct: 179 AQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDL 238
Query: 273 V----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA 326
G+ G G G +S+V QL G T FS+CL +G GS G + E L G
Sbjct: 239 TKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL--KGDGSGGGVFVLGEILVPGM 296
Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
+ PLV P P Y + L +GV G +P+ +F + G ++DTGT +T L
Sbjct: 297 VYSPLV--PSQPH-YNLNLLSIGVNGQMLPLDAAVFEASNT--RGTIVDTGTTLTYLVKE 351
Query: 387 AYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
AY+ F +A L +S + CY +S +S P+VS F+GG + L ++L
Sbjct: 352 AYDLFLNAISNSVSQLVTPI-ISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYL 410
Query: 447 I---PVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
D A +C F +P +I+G++ + +D A +G+ C
Sbjct: 411 FHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDC 462
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 114/383 (29%), Positives = 171/383 (44%), Gaps = 51/383 (13%)
Query: 132 DAAKHEV--QDFGTDVVSGMDQGSGE--YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC 187
D H++ Q F D +S + + + +G PP Q V+D+GS + WV C PC
Sbjct: 65 DHYSHKILKQTFSNDYISNLVPSPRYVVFLMNFSIGEPPIPQLAVMDTGSSLTWVMCHPC 124
Query: 188 SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSY-----GDGSYTK 242
S C +QS P+FDP+ S+++S +SCS C++ + G C Y V Y G Y +
Sbjct: 125 SSCSQQSVPIFDPSKSSTYSNLSCSE--CNKCDVVN---GECPYSVEYVGSGSSQGIYAR 179
Query: 243 GTLALETLTIGRTVVKNVAIGCGHK-----NQGMFVGAAGLLGLGGGSMSLVGQLGGQTG 297
L LET+ V ++ GCG K N + G G+ GLG G SL+ G +
Sbjct: 180 EQLTLETIDESIIKVPSLIFGCGRKFSISSNGYPYQGINGVFGLGSGRFSLLPSFGKK-- 237
Query: 298 GAFSYCLVS-RGTGSS-GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
FSYC+ + R T LV G +A G + V N YYV L + +GG ++
Sbjct: 238 --FSYCIGNLRNTNYKFNRLVLGDKANMQGDSTTLNVIN----GLYYVNLEAISIGGRKL 291
Query: 356 PISEDLF-RLTQMGDDGVVMDTGTAVTRLPTPAYEAFR---DAFVAQTGNLPRASGVSIF 411
I LF R + GV++D+G T L +E + + L + + +
Sbjct: 292 DIDPTLFERSITDNNSGVIIDSGADHTWLTKYGFEVLSFEVENLLEGVLVLAQQDKHNPY 351
Query: 412 DTCY------NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP---- 461
CY +LSGF P V+F+F+ G VL L ++ I + FC A P
Sbjct: 352 TLCYSGVVSQDLSGF-----PLVTFHFAEGAVLDLDVTSMFIQTTE-NEFCMAMLPGNYF 405
Query: 462 --SPSGLSIIGNIQQEGIQISFD 482
S IG + Q+ + +D
Sbjct: 406 GDDYESFSSIGMLAQQNYNVGYD 428
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 129/440 (29%), Positives = 196/440 (44%), Gaps = 40/440 (9%)
Query: 83 LVHRDKMSSSSNTTNNMHY---HRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
L R + +SS+T + H + + +R + VA RL A + +
Sbjct: 12 LCFRASLVTSSSTGAGLRMKLTHVDDKAGYTTEERVRRAVAVSRERL----AYTQQQQQL 67
Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC-QPC--SQCYKQSDP 196
DV + + + +Y +G PP+ +ID+GS+++W QC C C KQ P
Sbjct: 68 RASGDVSAPVHLATRQYIAEYLIGDPPQRAAALIDTGSNLIWTQCGTTCGLKACAKQDLP 127
Query: 197 VFDPADSASFSGVSCSSAVCDRLENAGCHA----GRCRYEVSYGDGSYTKGTLALETLTI 252
++ + S++F+ V C+ + N G H G C + SYG GS G+L E T
Sbjct: 128 YYNLSRSSTFAAVPCADSAKLCAAN-GVHLCGLDGSCTFAASYGAGS-VFGSLGTEAFTF 185
Query: 253 GRTVVKNVAIGC---GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--R 307
K + GC +G GA+GL+GLG G +SLV Q G FSYCL R
Sbjct: 186 QSGAAK-LGFGCVSLTRITKGALNGASGLIGLGRGRLSLVSQTGATK---FSYCLTPYLR 241
Query: 308 GTGSSGSLVFGREALPVG----AAWVPLVRNPR---APSFYYVGLSGLGVGGMRIPISED 360
G+S L G A G +P V++P +FYY+ L G+ VG ++PI
Sbjct: 242 NHGASSHLFVGASASLSGGGGAVTSIPFVKSPEDYPYSTFYYLPLVGISVGETKLPIPSA 301
Query: 361 LFRLTQMG----DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG-NLPRASGVSIFDTCY 415
F L ++ GV++DTG+ VT L AY A D Q +L + + D C
Sbjct: 302 AFELRRVAAGYWSGGVIIDTGSPVTSLAEAAYSALSDEVARQLNRSLVQPPADTGLDLCV 361
Query: 416 NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQE 475
V VP + F+F GG + + A ++ PVD + T C ++IGN QQ+
Sbjct: 362 ARQDVDKV-VPVLVFHFGGGADMAVSAGSYWGPVDKS-TACMLIEEGGYE-TVIGNFQQQ 418
Query: 476 GIQISFDGANGFVGFGPNVC 495
+ + +D G + F C
Sbjct: 419 DVHLLYDIGKGELSFQTADC 438
>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
Length = 396
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 107/356 (30%), Positives = 156/356 (43%), Gaps = 34/356 (9%)
Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
+G+PP+ +ID ++VW QC CS+C+KQ P+F P S++F C + C
Sbjct: 49 IGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACKSTPT 108
Query: 222 AGCHAGRCRYEVSYG---DGSYTKGTLALETLTIGRTVVKNVAIGC-GHKNQGMFVGAAG 277
+ C C YE + D T G + ET IG T ++A GC + G +G
Sbjct: 109 SNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIG-TATASLAFGCVVASDIDTMDGTSG 167
Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG---AAWVPLVR- 333
+GLG SLV Q+ FSYCL RGTG S L G A G + P ++
Sbjct: 168 FIGLGRTPRSLVAQMKLT---KFSYCLSPRGTGKSSRLFLGSSAKLAGGESTSTAPFIKT 224
Query: 334 NPRAPS--FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
+P S +Y + L + G I T +VM T + + L AY AF
Sbjct: 225 SPDDDSHHYYLLSLDAIRAGNTTI--------ATAQSGGILVMHTVSPFSLLVDSAYRAF 276
Query: 392 RDAFVAQTG---NLPRASGVSIFDTCY-NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
+ A G P A+ FD C+ +GF P + F F G LT+P + +LI
Sbjct: 277 KKAVTEAVGGAAEQPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALTVPPAKYLI 336
Query: 448 PV-DDAGTFCFAFAPSP-------SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
V ++ T C A G+S++G++QQE + +D + F P C
Sbjct: 337 DVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSFEPADC 392
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 129/451 (28%), Positives = 206/451 (45%), Gaps = 61/451 (13%)
Query: 67 ISSSNT--SSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVK----RVA 120
SS+NT S R +L+H + ++ HY ++ + RM+ D++ R+A
Sbjct: 21 FSSTNTISSGKPQRLVSKLIH-------PGSVHHPHYKPNETA-KDRMELDIQHSAARLA 72
Query: 121 TLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIV 180
+ R+ G ++ + VS G I +G PP Q +V+D+GSDI+
Sbjct: 73 NIQARIEGSLVSNNDYKAR------VSPSLTGR-TIMANISIGQPPIPQLVVMDTGSDIL 125
Query: 181 WVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGD--- 237
WV C PC+ C +FDP+ S++FS + C + CD GC + V+Y D
Sbjct: 126 WVMCTPCTNCDNDLGLLFDPSKSSTFSPL-CKTP-CDF---EGCRCDPIPFTVTYADNST 180
Query: 238 --GSYTKGTLALETLTIGRTVVKNVAIGCGHK-NQGMFVGAAGLLGLGGGSMSLVGQLGG 294
G++ + T+ ET G + + +V GCGH G G+LGL G SLV +LG
Sbjct: 181 ASGTFGRDTVVFETTDEGTSRISDVLFGCGHNIGHDTDPGHNGILGLNNGPDSLVTKLGQ 240
Query: 295 QTGGAFSYCL--VSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGG 352
+ FSYC+ ++ + L+ G A G + V N FYYV + G+ VG
Sbjct: 241 K----FSYCIGNLADPYYNYHQLILGEGADLEGYSTPFEVYN----GFYYVTMEGISVGE 292
Query: 353 MRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV--SI 410
R+ I+ + F + + GV++DTG+ +T L ++ G R + + S
Sbjct: 293 KRLDIAPETFEMKENRAGGVIIDTGSTITFLVDSVHKLLSKEVRNLLGWSFRQATIEKSP 352
Query: 411 FDTCY------NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS-- 462
+ C+ +L GF P V+F+FS G L L + +F ++D FC P
Sbjct: 353 WMQCFYGSISRDLVGF-----PVVTFHFSDGADLALDSGSFFNQLND-NVFCMTVGPVSS 406
Query: 463 ---PSGLSIIGNIQQEGIQISFDGANGFVGF 490
S S+IG + Q+ + +D N FV F
Sbjct: 407 LNIKSKPSLIGLLAQQSYNVGYDLVNQFVYF 437
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 115/401 (28%), Positives = 173/401 (43%), Gaps = 48/401 (11%)
Query: 133 AAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQ 189
A H +++ T V G Y + + G+PP++ V+D+GS VW C C+
Sbjct: 56 ARAHHLKNPQTTPV--FSHSYGGYSISLSFGTPPQTLSFVMDTGSSFVWFPCTLRYLCNN 113
Query: 190 C-YKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR------------YEVSYG 236
C + F P S+S + C + C + C Y + YG
Sbjct: 114 CSFTSRISPFLPKHSSSSKIIGCKNPKCSWIHQTDLRCTDCDNNSRNCSQICPPYLILYG 173
Query: 237 DGSYTKGTLAL-ETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQ 295
G T G +AL ETL + +V N +GC + AG+ G G G SL QLG
Sbjct: 174 SG--TTGGVALSETLHLHGLIVPNFLVGCSVFSSRQ---PAGIAGFGRGPSSLPSQLGLT 228
Query: 296 TGGAFSYCLVSRG---TGSSGSLVFGREA----LPVGAAWVPLVRNPRA---PSF---YY 342
FSYCL+S T S SLV ++ + PLV+NP+ P+F YY
Sbjct: 229 ---KFSYCLLSHKFDDTQESSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYY 285
Query: 343 VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL 402
V L + +GG + I + G+ G ++D+GT T + T A+E + F++Q N
Sbjct: 286 VSLRRISIGGRSVKIPYKYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNY 345
Query: 403 PRA---SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF 459
RA +S C+N+SG + +P + +F GG + LP N+ + CF
Sbjct: 346 ERALMVEALSGLKPCFNVSGAKELELPQLRLHFKGGADVELPLENYFAFLGSREVACFTV 405
Query: 460 ----APSPSGLS-IIGNIQQEGIQISFDGANGFVGFGPNVC 495
A SG I+GN Q + + +D N +GF C
Sbjct: 406 VTDGAEKASGPGMILGNFQMQNFYVEYDLQNERLGFKKESC 446
>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 417
Score = 139 bits (351), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 123/394 (31%), Positives = 176/394 (44%), Gaps = 59/394 (14%)
Query: 155 EYFVRIGVGS-PPRSQYMVIDSGSDIVWVQCQP-----CSQCYKQSDPVF---------- 198
+Y + +GS P +S + +D+GSD+VW C P C + + P+
Sbjct: 18 DYTLSFNLGSHPSQSITLYMDTGSDLVWFPCAPFECILCEGKFNATKPLNITRSHRVSCQ 77
Query: 199 DPADSASFSGVS----CSSAVC--DRLENAGCHAGRCR-YEVSYGDGSYTKGTLALETLT 251
PA S + S VS C+ A C D +E + C + C + +YGDGS+ L +TL+
Sbjct: 78 SPACSTAHSSVSSHDLCAIARCPLDNIETSDCSSATCPPFYYAYGDGSFI-AHLHRDTLS 136
Query: 252 IGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT---GGAFSYCLVSRG 308
+ + +KN GC H G+ G G G +SL QL + G FSYCLVS
Sbjct: 137 MSQLFLKNFTFGCAHT---ALAEPTGVAGFGRGLLSLPAQLATLSPNLGNRFSYCLVSHS 193
Query: 309 -----TGSSGSLVFGR----EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISE 359
L+ G + V + ++RNP+ FY VGL+G+ VG I E
Sbjct: 194 FDKERVRKPSPLILGHYDDYSSERVEFVYTSMLRNPKHSYFYCVGLTGISVGKRTILAPE 253
Query: 360 DLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL-PRASGVSI---FDTCY 415
L R+ + GD GVV+D+GT T LP Y + F + G + RAS V CY
Sbjct: 254 MLRRVDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRVGRVHKRASEVEEKTGLGPCY 313
Query: 416 NLSGFVSVRVPTVSFYFSGGPV-LTLPASNFLIPVDD--------AGTFCFAFAPSPSGL 466
L G V VPTV+++F G + LP N+ D G + L
Sbjct: 314 FLEGL--VEVPTVTWHFLGNNSNVMLPRMNYFYEFLDGEDEARRKVGCLMLMNGGDDTEL 371
Query: 467 S-----IIGNIQQEGIQISFDGANGFVGFGPNVC 495
S I+GN QQ+G ++ +D N VGF C
Sbjct: 372 SGGPGAILGNYQQQGFEVVYDLENQRVGFAKRQC 405
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 113/398 (28%), Positives = 178/398 (44%), Gaps = 52/398 (13%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP-------- 196
+ S G G+YFVR VG+P + +V D+GSD+ WV+C+P ++
Sbjct: 84 LTSAAYTGIGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASS 143
Query: 197 ---VFDPADSASFSGVSCSSAVCDR-----LENAGCHAGRCRYEVSYGDGSYTKGTLALE 248
F P S +++ + C+S C + L C Y+ Y DGS +GT+ E
Sbjct: 144 PRRAFRPEKSKTWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGTE 203
Query: 249 TLTIG-------------RTVVKNVAIGC-GHKNQGMFVGAAGLLGLGGGSMSLVGQLGG 294
+ TI + ++ + +GC G F + G+L LG ++S
Sbjct: 204 SATIALSSSSSSSKNKVKKAKLQGLVLGCTGSYTGPSFEASDGVLSLGYSNVSFASHAAS 263
Query: 295 QTGGAFSYCLVSRGTGSSGS--LVFGRE---------ALPVGAAWVPLVRNPRAPSFYYV 343
+ GG FSYCLV + + + L FG A GA PLV + R FY V
Sbjct: 264 RFGGRFSYCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDV 323
Query: 344 GLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP 403
+ + V G + I D++ + G GV++D+GT++T L PAY A A + P
Sbjct: 324 SIKAISVDGELLKIPRDVWEVD--GGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLARFP 381
Query: 404 RASGVSIFDTCYNLSGFVSV----RVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFA 458
R + + F+ CYN + +P ++ +F+G L P+ +++I D A G C
Sbjct: 382 RVA-MDPFEYCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVI--DAAPGVKCIG 438
Query: 459 FAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
P G+S+IGNI Q+ FD N + F + C
Sbjct: 439 VQEGPWPGISVIGNILQQEHLWEFDLKNRRLRFKRSRC 476
>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 418
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 101/356 (28%), Positives = 158/356 (44%), Gaps = 37/356 (10%)
Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
+G+PP+ +ID ++VW QC CS+C+KQ P+F P S++F C + C +
Sbjct: 73 IGTPPQPASAIIDVAGELVWTQCSMCSRCFKQDLPLFVPNASSTFRPEPCGTDACKSIPT 132
Query: 222 AGCHAGRCRYE--VSYGDGSYTKGTLALETLTIGRTVVKNVAIGC----GHKNQGMFVGA 275
+ C + C YE ++ G +T G +A +T IG T ++ GC G G G
Sbjct: 133 SNCSSNMCTYEGTINSKLGGHTLGIVATDTFAIG-TATASLGFGCVVASGIDTMG---GP 188
Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG---AAWVPLV 332
+GL+GLG SLV Q+ FSYCL +G + L+ G A G + P V
Sbjct: 189 SGLIGLGRAPSSLVSQMNIT---KFSYCLTPHDSGKNSRLLLGSSAKLAGGGNSTTTPFV 245
Query: 333 RNPRA---PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
+ +Y + L G+ G I + + V++ T ++ L AY+
Sbjct: 246 KTSPGDDMSQYYPIQLDGIKAGDAAIALPPS--------GNTVLVQTLAPMSFLVDSAYQ 297
Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYF-SGGPVLTLPASNFLIP 448
A + G P A+ + FD C+ +G + P + F F G LT+P +LI
Sbjct: 298 ALKKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTFQQGAAALTVPPPKYLID 357
Query: 449 V-DDAGTFCFAFAPSP--------SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
V ++ GT C A + L+I+G++QQE D + F P C
Sbjct: 358 VGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEPADC 413
>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
Length = 447
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 79/176 (44%), Positives = 103/176 (58%), Gaps = 14/176 (7%)
Query: 116 VKRVATLVRRLSGGGADAAKHE-----VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
KR + L +RL+ ADAA++ + V SG+ SGEYF +GVG+P
Sbjct: 44 AKRGSLLRQRLA---ADAARYASLVDATGRLHSPVFSGIPFESGEYFALVGVGTPSTKAM 100
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA---- 226
+VID+GSD+VW+QC PC +CY Q VFDP S+++ V CSS C L GC +
Sbjct: 101 LVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQCRALRFPGCDSGGAA 160
Query: 227 -GRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLG 280
G CRY V+YGDGS + G LA + L T V NV +GCG N+G+F AAGLLG
Sbjct: 161 GGGCRYMVAYGDGSSSTGDLATDKLAFANDTYVNNVTLGCGRDNEGLFDSAAGLLG 216
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 70/130 (53%), Gaps = 9/130 (6%)
Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV---SIFDTCYNLSGFVSVRVPTVSFY 431
D+GTA++R AY A RDAF A+ S+FD CY+L G + P + +
Sbjct: 316 DSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLIVLH 375
Query: 432 FSGGPVLTLPASNFLIPVD----DAGTF--CFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
F+GG + LP N+ +PVD A ++ C F + GLS+IGN+QQ+G ++ FD
Sbjct: 376 FAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQQGFRVVFDVEK 435
Query: 486 GFVGFGPNVC 495
+GF P C
Sbjct: 436 ERIGFAPKGC 445
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 110/361 (30%), Positives = 163/361 (45%), Gaps = 40/361 (11%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
+ I +G+PP Q ++ID+GSD+ W+ C PC +CY Q+ P F P+ S+++ SC SA
Sbjct: 78 FLANISIGNPPVPQLLLIDTGSDLTWIHCLPC-KCYPQTIPFFHPSRSSTYRNASCVSAP 136
Query: 216 -----CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIGCG 265
R E G C+Y + Y D S T+G LA E LT G +N+ GCG
Sbjct: 137 HAMPQIFRDEK----TGNCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNIVFGCG 192
Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL--VSRGTGSSGSLVFGREALP 323
N G F +G+LGLG G+ S+V + G FSYC ++ T L+ G A
Sbjct: 193 QDNSG-FTKYSGVLGLGPGTFSIVTR---NFGSKFSYCFGSLTNPTYPHNILILGNGAKI 248
Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
G + R YY+ L + G + I F+ + G V+DTG + T L
Sbjct: 249 EGDPTPLQIFQDR----YYLDLQAISFGEKLLDIEPGTFQRYR-SQGGTVIDTGCSPTIL 303
Query: 384 PTPAYEAFRDAFVAQTGN-LPRASGVSIFDT-CY------NLSGFVSVRVPTVSFYFSGG 435
AYE + G L R + T CY +L GF P V+F+F+GG
Sbjct: 304 AREAYETLSEEIDFLLGEVLRRVKDWDQYTTPCYEGNLKLDLYGF-----PVVTFHFAGG 358
Query: 436 PVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNV 494
L L + + + +FC A + +S+IG + Q+ + ++ V F
Sbjct: 359 AELALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTD 418
Query: 495 C 495
C
Sbjct: 419 C 419
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 120/397 (30%), Positives = 178/397 (44%), Gaps = 39/397 (9%)
Query: 130 GADAAKHEVQD--------FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVW 181
GAD +H + D+ SG+D G+ +YF I VG+P + +V+D+GS++ W
Sbjct: 72 GADQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTW 131
Query: 182 VQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCD-------RLENAGCHAGRCRYEVS 234
V C+ ++ K + VF +S SF V C + C L + C Y+
Sbjct: 132 VNCRYRARG-KDNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYR 190
Query: 235 YGDGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSL 288
Y DGS +G A ET+T+G T + IGC G F GA G+LGL S
Sbjct: 191 YADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSF 250
Query: 289 VGQLGGQTGGAFSYCLVSRGTGS--SGSLVFG--REALPVGAAWVPLVRNPRAPSFYYVG 344
G FSYCLV + S L+FG R PL R P FY +
Sbjct: 251 TSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLT-RIPPFYAIN 309
Query: 345 LSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR 404
+ G+ +G + I ++ T G G ++D+GT++T L AY+ L R
Sbjct: 310 VIGISLGYDMLDIPSQVWDATSGG--GTILDSGTSLTLLADAAYKQVVTGLARYLVELKR 367
Query: 405 AS--GVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAF- 459
GV I + C++ SGF ++P ++F+ GG ++L VD A G C F
Sbjct: 368 VKPEGVPI-EYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYL--VDAAPGVKCLGFV 424
Query: 460 -APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
A +P+ ++IGNI Q+ FD + F P+ C
Sbjct: 425 SAGTPA-TNVIGNIMQQNYLWEFDLMASTLSFAPSAC 460
>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 438
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 125/435 (28%), Positives = 194/435 (44%), Gaps = 56/435 (12%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVR-----RLSGGGADA 133
+++E +HRD + S +H + AR+++ +R ++ R R++ A A
Sbjct: 4 FSVEFIHRDSVKS--------LFHDPTLTPEARLRQAARR--SMARHAHAARINNSAAAA 53
Query: 134 AKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQ 193
D DVVS M + EY + + V +PP + D+GS +VW++C+
Sbjct: 54 GASGSDDSDADVVSPMVPQNFEYLMALDVSTPPVRMLALADTGSSLVWLKCK-------- 105
Query: 194 SDPVFDPADSASFSGVSCSSAVCDRL-ENAGCHA-----GRCRYEVSYGDGSYTKGTLAL 247
P S+S++ + C + C L + A C A C Y ++ DGS T G + +
Sbjct: 106 -LPAAHTPASSSYARLPCDAFACKALGDAASCRATGSGNNICVYRYAFADGSCTAGPVTV 164
Query: 248 ETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA--FSYCLV 305
+ T + GC + +G+ V GL+GL G +SLV QL +T A FSYCLV
Sbjct: 165 DAFTFS----TRLDFGCATRTEGLSVPDDGLVGLANGPISLVSQLSAKTPFAHKFSYCLV 220
Query: 306 --SRGTGSSGSLVFGREAL---PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISED 360
S S SL FG A+ GAA PLV R SFY + L + V G +P+
Sbjct: 221 PYSSSETVSSSLNFGSHAIVSSSPGAATTPLVAG-RNKSFYTIALDSIKVAGKPVPL--- 276
Query: 361 LFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA-SGVSIFDTCYNLSG 419
Q +++D+GT +T LP + A A LPR S +++ CY++
Sbjct: 277 -----QTTTTKLIVDSGTMLTYLPKAVLDPLVAALTAAI-KLPRVKSPETLYAVCYDVRR 330
Query: 420 F----VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQE 475
V +P V+ GG + LP N + + T C A S I+GN+ Q+
Sbjct: 331 RAPEDVGKSIPDVTLVLGGGGEVRLPWGNTFVVENKGTTVCLALVESHLPEFILGNVAQQ 390
Query: 476 GIQISFDGANGFVGF 490
+ + FD V F
Sbjct: 391 NLHVGFDLERRTVSF 405
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 139 bits (349), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 110/364 (30%), Positives = 167/364 (45%), Gaps = 62/364 (17%)
Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG 208
+D +G Y + + +G+PP + ++ D+GS ++W QC PC++C + P F PA S++FS
Sbjct: 83 LDNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSK 142
Query: 209 VSCSSAVCDRLENA--GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH 266
+ C+S++C L + C+A C Y YG G +T G LA ETL +G V GC
Sbjct: 143 LPCASSLCQFLTSPYRTCNATGCVYYYPYGMG-FTAGYLATETLHVGGASFPGVTFGCST 201
Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG- 325
+N G+ ++G++GLG +SLV Q+G FSYCL S ++FG A G
Sbjct: 202 EN-GVGNSSSGIVGLGRSPLSLVSQVG---VARFSYCLRSNADAGDSPILFGSLAKVTGG 257
Query: 326 -AAWVPLVRNPRAP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
PL+ NP P S+YYV L+G+ VG +P++
Sbjct: 258 NVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPMA------------------------ 293
Query: 383 LPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYN---LSGFVSVRVPTVSFYFSGGPVL 438
NL +G FD C++ G V VPT+ F+GG
Sbjct: 294 ----------------MANLTTVNGTRFGFDLCFDATAAGGGGGVPVPTLVLRFAGGAEY 337
Query: 439 TLPASNF--LIPVDD---AGTFCFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFG 491
+ ++ ++ VD A C P+ L SIIGN+ Q + + +D G F
Sbjct: 338 AVRRRSYFGVVEVDSQGRAAVECLLVLPASEKLSISIIGNVMQMDLHVLYDLDGGMFSFA 397
Query: 492 PNVC 495
P C
Sbjct: 398 PADC 401
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 139 bits (349), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 110/364 (30%), Positives = 171/364 (46%), Gaps = 37/364 (10%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVS 210
YF R+ +GSPP+ ++ ID+GSDI+WV C PC+ C S F+P S++ S +
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176
Query: 211 CSSAVCD---RLENAGCHAGR---CRYEVSYGDGSYTKGTLALETL----TIGRTVVKN- 259
CS C + A C C Y +YGDGS T G +T+ +G N
Sbjct: 177 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 236
Query: 260 ---VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTG 310
+ GC + G G+ G G +S+V QL G + FS+CL +G+
Sbjct: 237 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL--KGSD 294
Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
+ G ++ E + G + PLV P P Y + L + V G ++PI LF T
Sbjct: 295 NGGGILVLGEIVEPGLVYTPLV--PSQP-HYNLNLESIVVNGQKLPIDSSLF--TTSNTQ 349
Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSF 430
G ++D+GT + L AY+ F +A A R S VS + C+ S V PTVS
Sbjct: 350 GTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVR-SLVSKGNQCFVTSSSVDSSFPTVSL 408
Query: 431 YFSGGPVLTLPASNFLI---PVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANG 486
YF GG +T+ N+L+ +D+ +C + + ++I+G++ + +D AN
Sbjct: 409 YFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLANM 468
Query: 487 FVGF 490
+G+
Sbjct: 469 RMGW 472
>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
gi|224030351|gb|ACN34251.1| unknown [Zea mays]
Length = 342
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 112/346 (32%), Positives = 158/346 (45%), Gaps = 40/346 (11%)
Query: 182 VQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA---GRCRYEVSYGDG 238
+QCQPC CY+Q DPVF+P S+S++ V C+S C +L+ CH G C+Y Y
Sbjct: 1 MQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRCHEDDDGACQYTYKYSGH 60
Query: 239 SYTKGTLALETLTIGRTVVKNVAIGCGHKNQ-GMFVGAAGLLGLGGGSMSLVGQLGGQTG 297
TKGTLA++ L IG V V GC + G A+GL+GLG G +SLV QL
Sbjct: 61 GVTKGTLAIDKLAIGGDVFHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQLSVHR- 119
Query: 298 GAFSYCLVSRGTGSSGSLVFGREALPV----GAAWVPLVRNPRAPSFYYVGLSGLGVGGM 353
F YCL + +SG LV G A V V + + R PS+YY+ L GL VG
Sbjct: 120 --FMYCLPPPMSRTSGKLVLGAGADAVRNMSDRVTVTMSSSTRYPSYYYLNLDGLAVGDQ 177
Query: 354 RIPISED-------------------LFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
+ + + G+++D + ++ L T Y+ D
Sbjct: 178 TPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYDELADD 237
Query: 395 FVAQTGNLPRAS-GVSI-FDTCYNLS---GFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
+ LPRA+ + + D C+ L G V VPTVS F G L L V
Sbjct: 238 LEEEI-RLPRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSFDGR-WLELDRDRLF--V 293
Query: 450 DDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
D C + SG+SI+GN Q + +++ F+ G + F C
Sbjct: 294 TDGRMMCLMIGRT-SGVSILGNFQLQNMRVLFNLRRGKITFAKASC 338
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 118/396 (29%), Positives = 176/396 (44%), Gaps = 37/396 (9%)
Query: 130 GADAAKHEVQD--------FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVW 181
GAD +H + D+ SG+D G+ +YF I VG+P + +V+D+GS++ W
Sbjct: 50 GADQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTW 109
Query: 182 VQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCD-------RLENAGCHAGRCRYEVS 234
V C+ ++ K + VF +S SF V C + C L + C Y+
Sbjct: 110 VNCRYRARG-KDNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYR 168
Query: 235 YGDGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSL 288
Y DGS +G A ET+T+G T + IGC G F GA G+LGL S
Sbjct: 169 YADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSF 228
Query: 289 VGQLGGQTGGAFSYCLVSRGTGS--SGSLVFG--REALPVGAAWVPLVRNPRAPSFYYVG 344
G FSYCLV + S L+FG R PL R P FY +
Sbjct: 229 TSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLT-RIPPFYAIN 287
Query: 345 LSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR 404
+ G+ +G + I ++ T G G ++D+GT++T L AY+ L R
Sbjct: 288 VIGISLGYDMLDIPSQVWDATSGG--GTILDSGTSLTLLADAAYKQVVTGLARYLVELKR 345
Query: 405 AS--GVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFA 460
GV I + C++ SGF ++P ++F+ GG ++L VD A G C F
Sbjct: 346 VKPEGVPI-EYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYL--VDAAPGVKCLGFV 402
Query: 461 PSPS-GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ + ++IGNI Q+ FD + F P+ C
Sbjct: 403 SAGTPATNVIGNIMQQNYLWEFDLMASTLSFAPSAC 438
>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
Length = 480
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 118/382 (30%), Positives = 180/382 (47%), Gaps = 39/382 (10%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-PVFDPADSAS 205
SG G+G+YFVR VG+P + +V D+GSD+ WV+C + VF A S S
Sbjct: 103 SGAYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCSGAGDGTGDAPRRVFRAAASRS 162
Query: 206 FSGVSCSSAVCD-----RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG------- 253
++ ++CSS C L N A C Y+ Y DGS +G + ++ TI
Sbjct: 163 WAPIACSSDTCTSYVPFSLANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALSGSESR 222
Query: 254 -----RTVVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
R ++ V +GC G F + G+L LG ++S + + GG FSYCLV
Sbjct: 223 DGGGRRAKLQGVVLGCTASYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDH 282
Query: 308 --GTGSSGSLVFGREALPVGAAW----------VPLVRNPRAPSFYYVGLSGLGVGGMRI 355
++ L FG GAA PL+ + R FY V + + V G +
Sbjct: 283 LAPRNATSYLTFGPPGPEGGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDAVHVAGEAL 342
Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCY 415
I D++ + + G G ++D+GT++T L TPAY A A + LPR S + F+ CY
Sbjct: 343 DIPADVWDVARGG--GAILDSGTSLTVLATPAYRAVVAALSERLAGLPRVS-MDPFEYCY 399
Query: 416 NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFAP-SPSGLSIIGNIQ 473
N + ++ +P + F+G L PA +++ VD A G C + G+S+IGNI
Sbjct: 400 NWTA-AALEIPGLEVRFAGSARLQPPAKSYV--VDAAPGVKCIGVQEGAWPGVSVIGNIL 456
Query: 474 QEGIQISFDGANGFVGFGPNVC 495
Q+ FD + ++ F C
Sbjct: 457 QQDHLWEFDLRDRWLRFKHTRC 478
>gi|125553836|gb|EAY99441.1| hypothetical protein OsI_21409 [Oryza sativa Indica Group]
Length = 376
Score = 138 bits (347), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 113/320 (35%), Positives = 154/320 (48%), Gaps = 33/320 (10%)
Query: 65 NNISSSNTSSDEARW-NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLV 123
+ ++ SN +S + W L LV + S T+N S + D RVA +
Sbjct: 48 HKVAPSNEASLNSTWAPLHLVSGPCSPAYSRGTDNSSTDDDVTSIAKMLDADQHRVAYIQ 107
Query: 124 RRLSGG----GADAAKHEVQ--DFGTDVVSGMDQGSGEYFVRIGVGSPPR-----SQYMV 172
+RL+GG G A + Q D GT + + G G IG + P Q ++
Sbjct: 108 KRLAGGDTSNGVAGASWDGQTTDVGT-YLPASNVGVGAKM--IGTTAAPDGTSAVRQTVI 164
Query: 173 IDSGSDIVWVQCQPCSQ--CYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENAGCHAG- 227
IDSGSD+ WVQCQPC C+ Q DP+FDPA S ++S V CSSA C RL GC A
Sbjct: 165 IDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYSAVPCSSAACARLGPYRRGCSANV 224
Query: 228 RCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQG--MFVGAAGLLGLGGG 284
+C++ +Y DG+ GT + + LT+G VV+ GC H ++G +G L LGGG
Sbjct: 225 QCQFGFTYTDGATATGTYSSDDLTLGPYDVVRGFLFGCAHADRGSTFSFDVSGTLALGGG 284
Query: 285 SMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVP-LVRNP------RA 337
+ S V Q Q G FSYC + S G + G P AA VP V P
Sbjct: 285 AQSFVQQTATQYGRVFSYC-IPPSPSSLGFITLGVP--PQRAALVPTFVSTPLLSSSSMP 341
Query: 338 PSFYYVGLSGLGVGGMRIPI 357
P+FY V L + V G +P+
Sbjct: 342 PTFYRVLLRAIIVAGRPLPV 361
>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
Length = 414
Score = 138 bits (347), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 111/330 (33%), Positives = 165/330 (50%), Gaps = 28/330 (8%)
Query: 189 QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA--GCHAGRCRYEVSYGDGSYTKGTLA 246
+C + P F PA S++FS + C+S++C L + C+A C Y YG G +T G LA
Sbjct: 87 ECAARPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATGCVYYYPYGMG-FTAGYLA 145
Query: 247 LETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS 306
ETL +G VA GC +N G+ ++G++GLG +SLV Q+G G FSYCL S
Sbjct: 146 TETLHVGGASFPGVAFGCSTEN-GVGNSSSGIVGLGRSPLSLVSQVGV---GRFSYCLRS 201
Query: 307 RGTGSSGSLVFGREALPVGAAWVP-LVRNPRAPS--FYYVGLSGLGVGGMRIPISEDLFR 363
++FG A G P ++ NP PS +YYV L+G+ VG +P++ F
Sbjct: 202 DADAGDSPILFGSLAKVTGGKSSPAILENPEMPSSSYYYVNLTGITVGATDLPVTSTTFG 261
Query: 364 LTQMGDDGVV----MDTGTAVTRLPTPAYEAFRDAFVAQ--TGNL-PRASGVSI-FDTCY 415
T+ G+V +D+GT +T L Y + AF++Q T NL +G FD C+
Sbjct: 262 FTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCF 321
Query: 416 NLS---GFVSVRVPTVSFYFSGGPVLTLPASNF--LIPVDD---AGTFCFAFAPSPSGL- 466
+ + G V VPT+ F+GG + ++ ++ VD A C P+ L
Sbjct: 322 DANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAVECLLVLPASEKLS 381
Query: 467 -SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
SIIGN+ Q + + +D G F P C
Sbjct: 382 ISIIGNVMQMDLHVLYDLDGGMFSFAPADC 411
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 101/358 (28%), Positives = 166/358 (46%), Gaps = 29/358 (8%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS- 212
G Y RI +G+PP++ +++D+GS + +V C C QC K DP F P S+++ + CS
Sbjct: 90 GYYTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCSM 149
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT---VVKNVAIGCGHKNQ 269
CD C Y+ Y + S + G L + ++ G+ + GC +
Sbjct: 150 ECTCDS------EMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTVFGCENVET 203
Query: 270 GMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
G A G++GLG G +S+V QL G G +FS C G G++V G + P G
Sbjct: 204 GDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVG-GGAMVLGGISPPAG 262
Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
+ +P ++Y + L + + G ++PI+ +F G G ++D+GT LP
Sbjct: 263 MVFTH--SDPARSAYYNIDLKEIHIAGKQLPINPMVFD----GKYGTILDSGTTYAYLPE 316
Query: 386 PAYEAFRDAFVAQTGNLPRASGV--SIFDTCYNLSGF----VSVRVPTVSFYFSGGPVLT 439
PA++AF+DA + + +L G + D C++ G +S P V FS G L+
Sbjct: 317 PAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFSNGNRLS 376
Query: 440 LPASNFLIPVDDA-GTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L N+L A G +C F +++G I + +D + +GF C
Sbjct: 377 LSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTNC 434
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 101/358 (28%), Positives = 166/358 (46%), Gaps = 29/358 (8%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS- 212
G Y RI +G+PP++ +++D+GS + +V C C QC K DP F P S+++ + CS
Sbjct: 90 GYYTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCSM 149
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT---VVKNVAIGCGHKNQ 269
CD C Y+ Y + S + G L + ++ G+ + GC +
Sbjct: 150 ECTCDS------EMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTVFGCENVET 203
Query: 270 GMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
G A G++GLG G +S+V QL G G +FS C G G++V G + P G
Sbjct: 204 GDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVG-GGAMVLGGISPPAG 262
Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
+ +P ++Y + L + + G ++PI+ +F G G ++D+GT LP
Sbjct: 263 MVFTH--SDPARSAYYNIDLKEIHIAGKQLPINPMVFD----GKYGTILDSGTTYAYLPE 316
Query: 386 PAYEAFRDAFVAQTGNLPRASGV--SIFDTCYNLSGF----VSVRVPTVSFYFSGGPVLT 439
PA++AF+DA + + +L G + D C++ G +S P V FS G L+
Sbjct: 317 PAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFSNGNRLS 376
Query: 440 LPASNFLIPVDDA-GTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L N+L A G +C F +++G I + +D + +GF C
Sbjct: 377 LSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTNC 434
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 107/373 (28%), Positives = 171/373 (45%), Gaps = 41/373 (10%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFS 207
+G YF +IG+G+PP+ Y+ +D+GSDI+WV C C +C +SD ++DP S S +
Sbjct: 79 AGLYFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTSAT 138
Query: 208 GVSCSSAVCDRLENA---GCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVK----- 258
+ C C N GC C+Y V YGDGS T G + L R
Sbjct: 139 RIYCDDDFCAATYNGVLQGCTKDLPCQYSVVYGDGSSTAGFFVKDNLQFDRVTGNLQTSS 198
Query: 259 ---NVAIGCGHKNQGMFVGAA----GLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGT 309
+V GCG K G ++ G+LG G + S++ QL G+ F++CL +
Sbjct: 199 ANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCLDNVKG 258
Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
G G G P P+V P P Y V + + VGG + + D+F GD
Sbjct: 259 G--GIFAIGEVVSP-KVNTTPMV--PNQPH-YNVVMKEIEVGGNVLELPTDIF---DTGD 309
Query: 370 -DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
G ++D+GT + LP YE+ V++ L + F TC+ +G V+ P V
Sbjct: 310 RRGTIIDSGTTLAYLPEVVYESMMTKIVSEQPGLKLHTVEEQF-TCFQYTGNVNEGFPVV 368
Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS------PSGLSIIGNIQQEGIQISFD 482
F+F+G LT+ ++L + + +CF + S ++++G++ + +D
Sbjct: 369 KFHFNGSLSLTVNPHDYLFQIHEE-VWCFGWQNSGMQSKDGRDMTLLGDLVLSNKLVLYD 427
Query: 483 GANGFVGFGPNVC 495
N +G+ C
Sbjct: 428 LENQAIGWTDYNC 440
>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 486
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 140/485 (28%), Positives = 208/485 (42%), Gaps = 61/485 (12%)
Query: 49 TDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSF 108
TDH YN + N S ++E +++E +HRD + S + + R +
Sbjct: 14 TDHV----YNLRHKAINLFVSPAVGAEEDGFSVEFIHRDSVKSPFHDPALTPHGRALAAA 69
Query: 109 HARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRS 168
R + L RR SG + G VV+ + EY + I VG+PP
Sbjct: 70 RRSAARAAELHHLLARRSSGAPSPGT-------GAGVVAEVVSRQFEYLMAIEVGTPPVR 122
Query: 169 QYMVIDSGSDIVWVQCQPCSQCYKQSDP---VFDPADSASFSGVSCSSAVCDRLENAGCH 225
+ D+GSD+VWV+C+ + P F P+ S+++ V C + C L +A
Sbjct: 123 VLAIADTGSDLVWVKCKGKDNDNNSTAPPSVYFVPSASSTYGRVGCDTKACRALSSAASC 182
Query: 226 A--GRCRYEVSYGDGSYTKGTLALETLTIG----------------------RTVVKNVA 261
+ G C Y SYGDGS G L+ ET T + + +
Sbjct: 183 SPDGSCEYLYSYGDGSRASGQLSTETFTFSTIADSSKTNSHGNNNNNSSSHGQVEIAKLD 242
Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT--GGAFSYCLVSRG-TGSSGSLVFG 318
GC G F A GL+GLGGG +SL QLG T G FSYCL T +S +L FG
Sbjct: 243 FGCSTTTTGTF-RADGLVGLGGGPVSLASQLGATTSLGRKFSYCLAPYANTNASSALNFG 301
Query: 319 REAL--PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
A+ GAA PL+ ++Y + L + V G + P T +++D+
Sbjct: 302 SRAVVSEPGAASTPLITG-EVETYYTIALDSINVAGTKRP--------TTAAQAHIIVDS 352
Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRA-SGVSIFDTCYNLSGFV---SVRVPTVSFYF 432
GT +T L + + LPRA S I D CY++SG ++ +P V+
Sbjct: 353 GTTLTYLDSALLTPLVKDLTRRI-KLPRAESPEKILDLCYDISGVRGEDALGIPDVTLVL 411
Query: 433 SGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGF 490
GG +TL N + V + G C A + +SI+GNI Q+ + + +D G V F
Sbjct: 412 GGGGEVTLKPDNTFVVVQE-GVLCLALVATSERQSVSILGNIAQQNLHVGYDLEKGTVTF 470
Query: 491 GPNVC 495
C
Sbjct: 471 AAADC 475
>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
Length = 461
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 123/427 (28%), Positives = 188/427 (44%), Gaps = 75/427 (17%)
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC-------- 190
+ F + SG G+G+YFVR VG+P R +V D+GSD+ WV+C+ +
Sbjct: 38 EAFAMPLSSGAYTGTGQYFVRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAP 97
Query: 191 ---YKQSDP-----------------VFDPADSASFSGVSCSSAVCDR---LENAGCHA- 226
Y P VF P S +++ + CSS C A C
Sbjct: 98 GYNYGYGAPASNDSSSVSAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTP 157
Query: 227 -GRCRYEVSYGDGSYTKGTLALETLTIG-----------RTVVKNVAIGCGHKNQGM-FV 273
C YE Y DGS +GT+ ++ TI R ++ V +GC G F+
Sbjct: 158 GSPCAYEYRYKDGSAARGTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFL 217
Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--GTGSSGSLVFGRE----------- 320
+ G+L LG ++S + + GG FSYCLV ++ L FG
Sbjct: 218 ASDGVLSLGYSNVSFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRT 277
Query: 321 -----ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMD 375
A GA PL+ + R FY V ++G+ V G + I ++ + + G G ++D
Sbjct: 278 ACAGSAAAPGARQTPLLLDHRMRPFYAVAVNGVSVDGELLRIPRLVWDVQKGG--GAILD 335
Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS-----VRVPTVSF 430
+GT++T L +PAY A A + LPR + + FD CYN + ++ V VP ++
Sbjct: 336 SGTSLTVLVSPAYRAVVAALGKKLVGLPRVA-MDPFDYCYNWTSPLTGEDLAVAVPALAV 394
Query: 431 YFSGGPVLTLPASNFLIPVDDA-GTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFV 488
+F+G L P +++I D A G C G+S+IGNI Q+ FD N +
Sbjct: 395 HFAGSARLQPPPKSYVI--DAAPGVKCIGLQEGDWPGVSVIGNILQQEHLWEFDLKNRRL 452
Query: 489 GFGPNVC 495
F + C
Sbjct: 453 RFKRSRC 459
>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 486
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 110/344 (31%), Positives = 156/344 (45%), Gaps = 21/344 (6%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC-----YKQSDPVFDPADSASFS 207
+G Y + VG+PP+ V+D SD VW+QC C+ C S P F S++
Sbjct: 94 TGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIR 153
Query: 208 GVSCSSAVCDRLENAGCHA--GRCRYEVSYGDGS--YTKGTLALETLTIGRTVVKNVAIG 263
V C++ C RL C A C Y YG G+ T G LA++ V G
Sbjct: 154 EVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRADGVIFG 213
Query: 264 CGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLV-FGREAL 322
C +G G++GLG G +S V QL G FSY L GS + F +A
Sbjct: 214 CAVATEG---DIGGVIGLGRGELSPVSQL---QIGRFSYYLAPDDAVDVGSFILFLDDAK 267
Query: 323 PVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
P + V PLV + + S YYV L+G+ V G + I F L G GVV+ V
Sbjct: 268 PRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSITIPV 327
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
T L AY+ R A ++ L A G + D CY + +VP+++ F+GG V+
Sbjct: 328 TFLDAGAYKVVRQAMASKI-ELRAADGSELGLDLCYTSESLATAKVPSMALVFAGGAVME 386
Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGL-SIIGNIQQEGIQISFD 482
L N+ G C PSP+G S++G++ Q G + +D
Sbjct: 387 LEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTHMIYD 430
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 111/368 (30%), Positives = 172/368 (46%), Gaps = 42/368 (11%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD---PV--FDPADSASFSG 208
G Y+ R+ +G+PP+ Y+ ID+GSD++WV C C+ C S P+ FDP S + S
Sbjct: 81 GLYYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTASL 140
Query: 209 VSCSSAVCD---RLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV------- 256
VSCS +C + ++ C + +C Y YGDGS T G ++ + + +
Sbjct: 141 VSCSDQICALGVQSSDSACFGQSNQCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVTSN 200
Query: 257 -VKNVAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGT 309
+V GC G G+ G G +S++ QL G FS+CL +G
Sbjct: 201 SSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCL--KGD 258
Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
S G ++ E + + PLV P P Y + L + V G +PIS +F +
Sbjct: 259 DSGGGILVLGEIVEPNVVYTPLV--PSQPH-YNLNLQSISVNGQVLPISPAVFATS--SS 313
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF---DTCYNLSGFVSVRVP 426
G ++D+GT + L AY +AFV N+ S S+ + CY S VS P
Sbjct: 314 QGTIIDSGTTLAYLAEEAY----NAFVVAVTNIVSQSTQSVVLKGNRCYVTSSSVSDIFP 369
Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAG---TFCFAFAPSP-SGLSIIGNIQQEGIQISFD 482
VS F+GG L L A ++LI + G +C F P G++I+G++ + +D
Sbjct: 370 QVSLNFAGGASLVLGAQDYLIQQNSVGGTTVWCIGFQKIPGQGITILGDLVLKDKIFIYD 429
Query: 483 GANGFVGF 490
AN +G+
Sbjct: 430 LANQRIGW 437
>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 119/417 (28%), Positives = 184/417 (44%), Gaps = 56/417 (13%)
Query: 125 RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC 184
R + G+ AA E+ + SG G G+YFVR VG+P + +V D+GSD+ WV+C
Sbjct: 68 RETAAGSSAAAFEMP-----LTSGAYTGIGQYFVRFRVGTPAQPFLLVADTGSDLTWVKC 122
Query: 185 -QPCSQCYKQSDP---VFDPADSASFSGVSCSSAVCDR-----LENAGCHAGRCRYEVSY 235
+P + + F P DS +++ +SC+S C + L C Y+ Y
Sbjct: 123 RRPAANSSESGSGSGRAFRPEDSRTWAPISCASDTCTKSLPFSLATCPTPGSPCAYDYRY 182
Query: 236 GDGSYTKGTLALETLTIG---------RTVVKNVAIGCGHKNQG-MFVGAAGLLGLGGGS 285
DGS +GT+ E+ TI + +K + +GC G F + G+L LG
Sbjct: 183 KDGSAARGTVGTESATIALSGRGREERKAKLKGLVLGCTSSYTGPSFEVSDGVLSLGYSD 242
Query: 286 MSLVGQLGGQTGGAFSYCLVSRGTGSSGS--LVFG----------------------REA 321
+S + G FSYCLV + + + L FG
Sbjct: 243 VSFASHAASRFAGRFSYCLVDHLSPRNATSYLTFGPNPAVASSSSPSSPAPASCTAAAPR 302
Query: 322 LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
A PL+ + R FY V + + V G + I ++ + G GV++D+GT++T
Sbjct: 303 PRPRARQTPLLLDRRMRPFYDVAVKAVSVAGQFLKIPRAVWDVDAGG--GVILDSGTSLT 360
Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV-SVRVPTVSFYFSGGPVLTL 440
L PAY A A LPR + + F+ CYN + V +P ++ +F+G L
Sbjct: 361 VLAKPAYRAVVAALSEGLAGLPRVT-MDPFEYCYNWTSPSGDVTLPKMAVHFAGAARLEP 419
Query: 441 PASNFLIPVDDA-GTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
P +++I D A G C P G+S+IGNI Q+ FD N + F + C
Sbjct: 420 PGKSYVI--DAAPGVKCIGLQEGPWPGISVIGNILQQEHLWEFDIKNRRLKFQRSRC 474
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 108/374 (28%), Positives = 174/374 (46%), Gaps = 41/374 (10%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
G YF R+ +G+P + ++ ID+GSDI+WV C PC+ C S F+P S++ S
Sbjct: 3 GLYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASR 62
Query: 209 VSCSSAVCDR--------LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKN- 259
++CS C + + + C Y +YGDGS T G +T+ TV+ N
Sbjct: 63 ITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFF-ETVMGNE 121
Query: 260 --------VAIGCGHKNQGMFVGA----AGLLGLGGGSMSLVGQLG--GQTGGAFSYCLV 305
+ GC + G A G+ G G +S++ QL G + FS+CL
Sbjct: 122 QTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL- 180
Query: 306 SRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
+G+ + G ++ E + G + PLV P P Y + L + V G ++PI LF T
Sbjct: 181 -KGSDNGGGILVLGEIVEPGLVYTPLV--PSQP-HYNLNLESIAVNGQKLPIDSSLF--T 234
Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV 425
G ++D+GT + L AY+ F A A R S VS C+ S V
Sbjct: 235 TSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVR-SLVSKGSQCFITSSSVDSSF 293
Query: 426 PTVSFYFSGGPVLTLPASNFLI---PVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISF 481
PTV+ YF GG +++ N+L+ VD++ +C + + ++I+G++ + +
Sbjct: 294 PTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVY 353
Query: 482 DGANGFVGFGPNVC 495
D AN +G+ C
Sbjct: 354 DLANMRMGWADYDC 367
>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 507
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 109/408 (26%), Positives = 165/408 (40%), Gaps = 62/408 (15%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC---------------------- 184
+G D GEYF + VGSP + ++ D+GS+ W C
Sbjct: 102 AGRDDALGEYFTEVKVGSPGQRFWLAADTGSEFTWFNCVMRNATTTATTKKTRKNKTKKK 161
Query: 185 -------------QPCSQCYKQSDP---VFDPADSASFSGVSCSSAVCD-------RLEN 221
+ + +S+P VF P S SF V+C+S C L
Sbjct: 162 HHHHSKRNRTRTTRRTKKKKAKSNPCKGVFCPHRSKSFQAVTCASQKCKIDLSQLFSLSL 221
Query: 222 AGCHAGRCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIGCGHKNQG---MFV 273
+ C Y++SY DGS KG +T+T+ + N+ IGC +
Sbjct: 222 CPKPSDPCLYDISYADGSSAKGFFGTDTITVDLKNGKEGKLNNLTIGCTKSMENGVNFNE 281
Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT--GSSGSLVFGREALPVGAAWVPL 331
G+LGLG S + + + G FSYCLV + S L G +
Sbjct: 282 DTGGILGLGFAKDSFIDKAAYEYGAKFSYCLVDHLSHRNVSSYLTIGGHHNAKLLGEIKR 341
Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
P FY V + G+ +GG + I ++ G G ++D+GT +T L PAYE
Sbjct: 342 TELILFPPFYGVNVVGISIGGQMLKIPPQVWDFNSQG--GTLIDSGTTLTALLVPAYEPV 399
Query: 392 RDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
+A + + R +G D C++ GF VP + F+F+GG P +++I V
Sbjct: 400 FEALIKSLTKVKRVTGEDFGALDFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDV 459
Query: 450 DDAGTFCFAFAP--SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C P G S+IGNI Q+ FD + +GF P++C
Sbjct: 460 APL-VKCIGIVPIDGIGGASVIGNIMQQNHLWEFDLSTNTIGFAPSIC 506
>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
Length = 471
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 108/364 (29%), Positives = 170/364 (46%), Gaps = 36/364 (9%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQC-QP-CSQCYKQSDPVFDPADSASFSGVSCSS 213
Y ++ +GSPP Y + D+GS+IVW+QC P C+ CYKQ P+F+P S++++ C
Sbjct: 108 YVMKFNIGSPPVETYAIPDTGSNIVWIQCGSPICTNCYKQKIPLFNPTKSSTYAIRLCGH 167
Query: 214 AVCDRL-----ENAGCHAG--RCRYEVSYGDGSYTKGTLALETLTIGRTVVK------NV 260
C + E GC + CRY +SY D S+++GT++ + +T + + +
Sbjct: 168 RECKQALWGLGEYLGCKSSVQVCRYHISYEDHSFSEGTISTDIITFPEHIAEFGNYSLRM 227
Query: 261 AIGCGHKNQGM------FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL----VSRGTG 310
GCG+ N A G++GLG SLVGQL T G FSYC+ V + G
Sbjct: 228 FFGCGYNNSETPGQDPNSFTAPGVVGLGNEMASLVGQL---TLGQFSYCISTPDVQKPNG 284
Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGD 369
+ + FG A G + L N + + + G+ V ++ E +F+ + G
Sbjct: 285 TI-EIRFGLAASISGHS-TALANNLEG-WYIFQNVDGIYVDDTKVKGYPEWVFQFAEGGI 341
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP--RASGVSIFDTCYNLSGFVSVRVPT 427
G++MD+GT T L A +A Q P + S + CYN + F+ VP
Sbjct: 342 GGLIMDSGTTYTELYFSALDALIGELKEQIELAPDTQDHSNSNYSLCYNAANFLLTYVPA 401
Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAG-TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
+ F+ P + +D+ +C A + SG+SIIG Q I+I +D
Sbjct: 402 IELKFTDNKEAYFPFTLRNAWIDNGNDQYCLAMFGT-SGISIIGIYQHRDIKIGYDLKYN 460
Query: 487 FVGF 490
V F
Sbjct: 461 LVSF 464
>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
Length = 315
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 97/316 (30%), Positives = 142/316 (44%), Gaps = 28/316 (8%)
Query: 203 SASFSGVSCSSAVCDRLENAGCHAG-----RCRYEVSYGDGSYTKGTLALETLTIGR--- 254
S++F V+C +C A +C Y SYGD S T G + +T T
Sbjct: 2 SSTFKAVACPDPICRPSSGVSVSACAMENFQCFYLCSYGDRSITAGHIFKDTFTFMSPNG 61
Query: 255 --TVVKNVAIGCGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
V +A GCG N G+FV +G+ G G G SL QL G FSYCL
Sbjct: 62 VPVAVSELAFGCGDYNTGLFVSNESGIAGFGRGPQSLPSQL---KVGRFSYCLTLVTESK 118
Query: 312 SGSLVFGREALPVGA--------AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
S ++ G P G P++ NP P+FYY+ L G+ VG R+P + +F
Sbjct: 119 SSVVILGTPPDPDGLRAHTTGPFQSTPIIYNPLIPTFYYLSLEGITVGKTRLPFDKSVFA 178
Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF--DTCYNL-SGF 420
L + G G V+D+GT++T LP +E ++ VAQ LPR C+ G
Sbjct: 179 LKKDGSGGTVIDSGTSLTTLPEAVFELLQEELVAQF-PLPRYDNTPEVGDRLCFRRPKGG 237
Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF-APSPSGLSIIGNIQQEGIQI 479
V VP + + +G + LP N+ + D+G C + + +IGN QQ+ + +
Sbjct: 238 KQVPVPKLILHLAGAD-MDLPRDNYFVEEPDSGVMCLQINGAEDTTMVLIGNFQQQNMHV 296
Query: 480 SFDGANGFVGFGPNVC 495
+D N + F P C
Sbjct: 297 VYDVENNKLLFAPAQC 312
>gi|125595855|gb|EAZ35635.1| hypothetical protein OsJ_19925 [Oryza sativa Japonica Group]
Length = 335
Score = 134 bits (338), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 102/278 (36%), Positives = 138/278 (49%), Gaps = 47/278 (16%)
Query: 112 MQRDVKRVATLVRRLSGGGADAA-KHEVQDFG-TDVVSGMDQGSGEYFVRIGVGSPPR-- 167
+ D RVA + +RL+G D A H+ + G T VVS + +G G+G P
Sbjct: 5 LDADQLRVAYIQKRLAGDTGDGADPHKFVEGGDTHVVSSLQVATGA-----GIGQKPHLT 59
Query: 168 --------------------SQYMVIDSGSDIVWVQCQPCSQ--CYKQSDPVFDPADSAS 205
SQ ++IDSGSD+ WVQCQPC C+ Q DP+FDPA S +
Sbjct: 60 TTRLGTTATTNSAPDGTSAVSQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTT 119
Query: 206 FSGVSCSSAVCDRL--ENAGCHAG-RCRYEVSYGDGSYTKGTLALETLTIG-RTVVKNVA 261
++ V CSSA C RL GC A +C++ ++Y +G+ GT + + LT+G VV+
Sbjct: 120 YAAVPCSSAACARLGPYRRGCLANSQCQFGITYANGATATGTYSSDDLTLGPYDVVRGFL 179
Query: 262 IGCGHKNQG--MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR 319
GC H +QG AG L LGGGS S V Q Q FSYC V T S G ++FG
Sbjct: 180 FGCAHADQGSTFSYDVAGTLALGGGSQSFVQQTASQYSRVFSYC-VPPSTSSFGFIMFGV 238
Query: 320 EALPVGAAWVP-LVRNP------RAPSFYYVGLSGLGV 350
P AA VP V P +P+FY + L + +
Sbjct: 239 P--PQRAALVPTFVSTPLLSSSTMSPTFYSITLPSIAL 274
Score = 38.9 bits (89), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 23/78 (29%), Positives = 38/78 (48%), Gaps = 8/78 (10%)
Query: 420 FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL--SIIGNIQQEGI 477
F S+ +P+++ F GG + L A+ L+ C AFAP+ S IGN+QQ +
Sbjct: 264 FYSITLPSIALVFDGGATVNLDAAGILL------QGCLAFAPTASDRMPGFIGNVQQRTL 317
Query: 478 QISFDGANGFVGFGPNVC 495
++ +D + F C
Sbjct: 318 EVVYDVPGKAIRFRSAAC 335
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 134 bits (338), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 107/369 (28%), Positives = 173/369 (46%), Gaps = 39/369 (10%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV----FDPADSASFSGV 209
G YF +IG+G+P R ++ +D+GSDI+WV C C +C ++SD V +D S++ V
Sbjct: 83 GLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDADASSTAKSV 142
Query: 210 SCSSAVCDRL-ENAGCHAGR-CRYEVSYGDGSYTKGTLA-----LETLTIGR---TVVKN 259
SCS C + + + CH+G C+Y + YGDGS T G L L+ +T R +
Sbjct: 143 SCSDNFCSYVNQRSECHSGSTCQYVILYGDGSSTNGYLVRDVVHLDLVTGNRQTGSTNGT 202
Query: 260 VAIGCGHKNQGMF----VGAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSG 313
+ GCG K G G++G G + S + QL G+ +F++CL + G G
Sbjct: 203 IIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGG--G 260
Query: 314 SLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD-GV 372
G P P++ + Y V L+ + VG + +S D F GDD GV
Sbjct: 261 IFAIGEVVSP-KVKTTPMLSK---SAHYSVNLNAIEVGNSVLQLSSDAF---DSGDDKGV 313
Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYF 432
++D+GT + LP Y + +A L + F TC++ + R PTV+F F
Sbjct: 314 IIDSGTTLVYLPDAVYNPLMNQILASHQELNLHTVQDSF-TCFHYIDRLD-RFPTVTFQF 371
Query: 433 SGGPVLTLPASNFLIPVDDAGTFCFAF------APSPSGLSIIGNIQQEGIQISFDGANG 486
L + +L V + T+CF + + L+I+G++ + +D N
Sbjct: 372 DKSVSLAVYPQEYLFQVRE-DTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQ 430
Query: 487 FVGFGPNVC 495
+G+ + C
Sbjct: 431 VIGWTNHNC 439
>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
Length = 396
Score = 134 bits (338), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 114/358 (31%), Positives = 165/358 (46%), Gaps = 34/358 (9%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQC-QPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
Y V + +G+PP+ +ID G ++VW QC Q C +C+KQ P+FD S++F C +A
Sbjct: 51 YVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEPCGAA 110
Query: 215 VCDRLEN---AGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQ-G 270
VC+ + AG G C YE S G T G + + + IG +A GC ++
Sbjct: 111 VCESIPTRSCAGDGGGACGYEASTSFG-RTVGRIGTDAVAIGTAATARLAFGCAVASEMD 169
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV----GA 326
G++G +GLG ++SL Q+ AFSYCL TG S +L G A GA
Sbjct: 170 TMWGSSGSVGLGRTNLSLAAQMNAT---AFSYCLAPPDTGKSSALFLGASAKLAGAGKGA 226
Query: 327 AWVPLVRNPRAP-----SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
P V+ P Y + L + G I + Q G+ +++ T T VT
Sbjct: 227 GTTPFVKTSTPPHSGLSRSYLLRLEAIRAGNATI-------AMPQSGNT-IMVSTATPVT 278
Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
L Y R A G P V +D C+ + S P + F GG +T+P
Sbjct: 279 ALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKAS-ASGGAPDLVLAFQGGAEMTVP 337
Query: 442 ASNFLIPVDDAG--TFCFAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
S++L DAG T C A SP+ G+SI+G++QQ I + FD + F P C
Sbjct: 338 VSSYLF---DAGNDTACVAILGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPADC 392
>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 396
Score = 134 bits (337), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 114/358 (31%), Positives = 165/358 (46%), Gaps = 34/358 (9%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQC-QPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
Y V + +G+PP+ +ID G ++VW QC Q C +C+KQ P+FD S++F C +A
Sbjct: 51 YVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEPCGAA 110
Query: 215 VCDRLEN---AGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQ-G 270
VC+ + AG G C YE S G T G + + + IG +A GC ++
Sbjct: 111 VCESIPTRSCAGDGGGACGYEASTSFG-RTVGRIGTDAVAIGTAATARLAFGCAVASEMD 169
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV----GA 326
G++G +GLG ++SL Q+ AFSYCL TG S +L G A GA
Sbjct: 170 TMWGSSGSVGLGRTNLSLAAQMNAT---AFSYCLAPPDTGKSSALFLGASAKLAGAGKGA 226
Query: 327 AWVPLVRNPRAPS-----FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
P V+ P+ Y + L + G I + Q G+ + + T T VT
Sbjct: 227 GTTPFVKTSTPPNSGLSRSYLLRLEAIRAGNATI-------AMPQSGNT-ITVSTATPVT 278
Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
L Y R A G P V +D C+ + S P + F GG +T+P
Sbjct: 279 ALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKAS-ASGGAPDLVLAFQGGAEMTVP 337
Query: 442 ASNFLIPVDDAG--TFCFAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
S++L DAG T C A SP+ G+SI+G++QQ I + FD + F P C
Sbjct: 338 VSSYLF---DAGNDTACVAILGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPADC 392
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 101/336 (30%), Positives = 163/336 (48%), Gaps = 35/336 (10%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
+G+ +G YF +IG+G+P +S Y+ +D+GSDI+WV C C C ++S ++DP+
Sbjct: 72 NGLPTETGLYFTQIGIGTPAKSYYVQVDTGSDILWVNCVFCDTCPRKSGLGIELTLYDPS 131
Query: 202 DSASFSGVSCSSAVCDRLEN----AGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---- 253
S+S +GV+C C + A C+Y +SYGDGS T G + L
Sbjct: 132 GSSSGTGVTCGQDFCVATHGGVIPSCVPAAPCQYSISYGDGSSTTGFFVTDFLQYNQVSG 191
Query: 254 --RTVVKNVAI--GCGHKNQGMFVGAA----GLLGLGGGSMSLVGQL--GGQTGGAFSYC 303
+T + N +I GCG K G ++ G+LG G + S++ QL G+ F++C
Sbjct: 192 NSQTTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRKVFAHC 251
Query: 304 LVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
L + G G G P + PLV P P Y V L + VGG+++ + ++F
Sbjct: 252 LDTINGG--GIFAIGDVVQP-KVSTTPLV--PGMPH-YNVNLEAIDVGGVKLQLPTNIFD 305
Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
+ + G ++D+GT + LP Y A AQ G++P + C+ SG V
Sbjct: 306 IGE--SKGTIIDSGTTLAYLPGVVYNAIMSKVFAQYGDMPLKNDQDF--QCFRYSGSVDD 361
Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF 459
P ++F+F GG L + ++L + +C F
Sbjct: 362 GFPIITFHFEGGLPLNIHPHDYLF--QNGELYCMGF 395
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 99/362 (27%), Positives = 166/362 (45%), Gaps = 35/362 (9%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+G Y R+ +G+PP+ +++D+GS + +V C C QC + DP F P S+++ V C+
Sbjct: 81 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKCT 140
Query: 213 SAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHK 267
+ C + R C YE Y + S + G L + ++ G + GC +
Sbjct: 141 I-------DCNCDSDRMQCVYERQYAEMSTSSGVLGEDLISFGNQSELAPQRAVFGCENV 193
Query: 268 NQGMFVG--AAGLLGLGGGSMSLVGQLGGQT--GGAFSYCLVSRGTGSSGSLVFGREALP 323
G A G++GLG G +S++ QL + +FS C G G++V G + P
Sbjct: 194 ETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMDVG-GGAMVLGGISPP 252
Query: 324 --VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
+ A+ VR+P +Y + L + V G R+P++ ++F G G V+D+GT
Sbjct: 253 SDMAFAYSDPVRSP----YYNIDLKEIHVAGKRLPLNANVFD----GKHGTVLDSGTTYA 304
Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGF----VSVRVPTVSFYFSGG 435
LP A+ AF+DA V + +L + SG + D C++ +G +S P V F G
Sbjct: 305 YLPEAAFLAFKDAIVKELQSLKKISGPDPNYNDICFSGAGIDVSQLSKSFPVVDMVFENG 364
Query: 436 PVLTLPASNFLIPVDDA-GTFCF-AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
TL N++ G +C F +++G I + +D +GF
Sbjct: 365 QKYTLSPENYMFRHSKVRGAYCLGVFQNGNDQTTLLGGIIVRNTLVVYDREQTKIGFWKT 424
Query: 494 VC 495
C
Sbjct: 425 NC 426
>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 485
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 127/455 (27%), Positives = 187/455 (41%), Gaps = 78/455 (17%)
Query: 109 HARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGS-PPR 167
H+ + L++ S A H + + G D Y + +GS PP+
Sbjct: 31 HSLSKSQFNSTPHLLKFTSARSATRFHHRHRQISLPLSPGSD-----YTLSFNLGSHPPQ 85
Query: 168 SQYMVIDSGSDIVWVQCQP--CSQCYKQSDPV----FDPADSASFSGVSCSSAVCDRLEN 221
+ +D+GSD+VW C P C C + D P + S + VSC S C
Sbjct: 86 PISLYMDTGSDLVWFPCAPFECILCEGKYDTAATGGLSPPNITSSASVSCKSPACSAAHT 145
Query: 222 AG-----CHAGRCRYEV----------------SYGDGSYTKGTLALETLTIGRT---VV 257
+ C RC E+ +YGDGS L ++L++ + V+
Sbjct: 146 SLSSSDLCAMARCPLELIETSDCSSFSCPPFYYAYGDGSLV-ARLYRDSLSMPASSPLVL 204
Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGG---QTGGAFSYCLVSRGTGSS-- 312
N GC H G VG AG G G +SL QL G FSYCLVS +
Sbjct: 205 HNFTFGCAHTALGEPVGVAGF---GRGVLSLPAQLASFSPHLGNQFSYCLVSHSFDADRV 261
Query: 313 ---GSLVFGREALP------VGA-----AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPIS 358
L+ GR +L VG + ++ NP+ P FY VGL G+ VG +IP+
Sbjct: 262 RRPSPLILGRYSLDDEKKKRVGHDRGEFVYTAMLDNPKHPYFYCVGLEGITVGNRKIPVP 321
Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL-PRASGVSI---FDTC 414
E L R+ + G+ G+V+D+GT T LP YE+ F + G + RA+ + C
Sbjct: 322 EILKRVDRRGNGGMVVDSGTTFTMLPAGLYESLVTEFNHRMGRVYKRATQIEERTGLGPC 381
Query: 415 YNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA--------GTFCFAF------A 460
Y S + +VP V+ +F G + LP +N+ D C A
Sbjct: 382 Y-YSDDSAAKVPAVALHFVGNSTVILPRNNYYYEFFDGRDGQKKKRKVGCLMLMNGGDEA 440
Query: 461 PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
S + +GN QQ+G ++ +D VGF C
Sbjct: 441 ESGGPAATLGNYQQQGFEVVYDLEKHRVGFARRKC 475
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 114/369 (30%), Positives = 166/369 (44%), Gaps = 38/369 (10%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
V + VG+PP++ MV+D+GS++ W+ C +D F P SA+F+ V C SA C
Sbjct: 63 VSLAVGTPPQNVTMVLDTGSELSWLLCATGRAAAAAAD-SFRPRASATFAAVPCGSARCS 121
Query: 217 --DRLENAGCHAG--RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC---GHKNQ 269
D C A RCR +SY DGS + G LA + +G A GC + +
Sbjct: 122 SRDLPAPPSCDAASRRCRVSLSYADGSASDGALATDVFAVGDAPPLRSAFGCMSAAYDSS 181
Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP-VGAAW 328
V AGLLG+ G++S V Q + FSYC+ R +G L+ G LP + +
Sbjct: 182 PDAVATAGLLGMNRGALSFVTQASTRR---FSYCISDR--DDAGVLLLGHSDLPFLPLNY 236
Query: 329 VPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
PL + P P F Y V L G+ VGG +PI + G ++D+GT T L
Sbjct: 237 TPLYQPTPPLPYFDRVAYSVQLLGIRVGGKPLPIPPSVLAPDHTGAGQTMVDSGTQFTFL 296
Query: 384 PTPAYEAFRDAFVAQTGNL------PRASGVSIFDTCYNLSG---FVSVRVPTVSFYFSG 434
AY A + F+ QT L P + FDTC+ + S R+P V+ F+G
Sbjct: 297 LGDAYSAVKAEFLKQTKPLLPALEDPSFAFQEAFDTCFRVPKGRPPPSARLPPVTLLFNG 356
Query: 435 GPVLTLPASNFLIPVDDA-----GTFCFAFAPS---PSGLSIIGNIQQEGIQISFDGANG 486
+++ L V G +C F + P +IG+ Q + + +D G
Sbjct: 357 A-QMSVAGDRLLYKVPGERRGADGVWCLTFGNADMVPLTAYVIGHHHQMNLWVEYDLERG 415
Query: 487 FVGFGPNVC 495
VG P C
Sbjct: 416 RVGLAPVKC 424
>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
Length = 570
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 122/368 (33%), Positives = 171/368 (46%), Gaps = 51/368 (13%)
Query: 144 DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP-CSQCYKQSDPV--FDP 200
DVVS + S EY + + +GSPPRS + D+GSD+VWV+C+ + + P FDP
Sbjct: 89 DVVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDP 148
Query: 201 ADSASFSGVSCSSAVCDRLENAGCHAG-RCRYEVSYGDGSYTKGTLALETLTI-----GR 254
+ S+++ VSC + C+ L A C G C Y +YGDGS T G L+ ET T GR
Sbjct: 149 SRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGAGR 208
Query: 255 TV----VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT--GGAFSYCLVSRG 308
+ + V GC G F A GL+GLGGG++SLV QLGG T G FSYCLV
Sbjct: 209 SPRQVRIGGVKFGCSTATAGSF-PADGLVGLGGGAVSLVTQLGGATSLGRRFSYCLVPHS 267
Query: 309 TGSSGSLVFG--REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
+S +L FG + GAA PLV N S
Sbjct: 268 VNASSALNFGALADVTEPGAASTPLVGNKTVAS--------------------------- 300
Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGF---VSV 423
+++D+GT +T L D + P S + CYN++G
Sbjct: 301 AASSRIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGE 360
Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS--GLSIIGNIQQEGIQISF 481
+P ++ F GG + L N + V + GT C A + +SI+GN+ Q+ I + +
Sbjct: 361 SIPDLTLEFGGGAAVALKPENAFVAVQE-GTLCLAIVATTEQQPVSILGNLAQQNIHVGY 419
Query: 482 DGANGFVG 489
D G VG
Sbjct: 420 DLDAGTVG 427
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 111/394 (28%), Positives = 182/394 (46%), Gaps = 42/394 (10%)
Query: 132 DAAKHEVQDFGTDVVSGMD---QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS 188
D +H D+ G D + G YF +IG+G+P R ++ +D+GSDI+WV C C
Sbjct: 58 DVHRHSRLLSAIDIPLGGDSQPESIGLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCI 117
Query: 189 QCYKQSDPV----FDPADSASFSGVSCSSAVCDRL-ENAGCHAGR-CRYEVSYGDGSYTK 242
+C ++SD V +D S++ VSCS C + + + CH+G C+Y + YGDGS T
Sbjct: 118 RCPRKSDLVELTPYDVDASSTAKSVSCSDNFCSYVNQRSECHSGSTCQYVIMYGDGSSTN 177
Query: 243 GTLA-----LETLTIGR---TVVKNVAIGCGHKNQGMF----VGAAGLLGLGGGSMSLVG 290
G L L+ +T R + + GCG K G G++G G + S +
Sbjct: 178 GYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFIS 237
Query: 291 QLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGL 348
QL G+ +F++CL + G G G P P++ + Y V L+ +
Sbjct: 238 QLASQGKVKRSFAHCLDNNNGG--GIFAIGEVVSP-KVKTTPMLSK---SAHYSVNLNAI 291
Query: 349 GVGGMRIPISEDLFRLTQMGDD-GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG 407
VG + +S + F GDD GV++D+GT + LP Y + +A L +
Sbjct: 292 EVGNSVLELSSNAF---DSGDDKGVIIDSGTTLVYLPDAVYNPLLNEILASHPELTLHTV 348
Query: 408 VSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF------AP 461
F TC++ + + R PTV+F F L + +L V + T+CF +
Sbjct: 349 QESF-TCFHYTDKLD-RFPTVTFQFDKSVSLAVYPREYLFQVRE-DTWCFGWQNGGLQTK 405
Query: 462 SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ L+I+G++ + +D N +G+ + C
Sbjct: 406 GGASLTILGDMALSNKLVVYDIENQVIGWTNHNC 439
>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
Length = 375
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 108/373 (28%), Positives = 167/373 (44%), Gaps = 63/373 (16%)
Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ---CYKQSDPVFD---PA 201
G DQG + + +G+ P + +++D+GSD++W QC+ S + P PA
Sbjct: 38 GSDQG---HSLTVGIVQP---RKLIVDTGSDLIWTQCKLSSSTAAAARHGSPPLSRTAPA 91
Query: 202 DSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG--RTVVKN 259
+ +F+ +SA G LA ET T G R V
Sbjct: 92 RTGAFTRTCTASAAA-------------------------VGVLASETFTFGARRAVSLR 126
Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG- 318
+ GCG + G +GA G+LGL S+SL+ QL Q FSYCL + L+FG
Sbjct: 127 LGFGCGALSAGSLIGATGILGLSPESLSLITQLKIQR---FSYCLTPFADKKTSPLLFGA 183
Query: 319 -------REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
+ P+ + V NP +YYV L G+ +G R+ + + G G
Sbjct: 184 MADLSRHKTTRPIQTTAI--VSNPVETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGG 241
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS-GVSIFDTCYNL------SGFVSVR 424
++D+G+ V L A+EA ++A V LP A+ V ++ C+ L + +V+
Sbjct: 242 TIVDSGSTVAYLVEAAFEAVKEA-VMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQ 300
Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFD 482
VP + +F GG + LP N+ AG C A + SG+SIIGN+QQ+ + + FD
Sbjct: 301 VPPLVLHFDGGAAMVLPRDNYFQE-PRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFD 359
Query: 483 GANGFVGFGPNVC 495
+ F P C
Sbjct: 360 VQHHKFSFAPTQC 372
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 157/357 (43%), Gaps = 26/357 (7%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+G Y R+ +G+PP+ +++D+GS + +V C C QC K DP F P S+S+ + C+
Sbjct: 77 NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSSSYKALKCN 136
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHKNQ 269
N C YE Y + S + G L+ + ++ G + + GC +
Sbjct: 137 PDC-----NCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLTPQRAVFGCENVET 191
Query: 270 GMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
G A G++GLG G +S+V QL G FS C G G++V G+ + P G
Sbjct: 192 GDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVG-GGAMVLGKISPPAG 250
Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
+ +P +Y + L + V G + ++ +F G G V+D+GT P
Sbjct: 251 MVFSH--SDPFRSPYYNIDLKQMHVAGKSLKLNPKVFN----GKHGTVLDSGTTYAYFPK 304
Query: 386 PAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGFVSVRV----PTVSFYFSGGPVLT 439
A+ A +DA + + +L R G + D C++ +G + P + F G L
Sbjct: 305 EAFIAIKDAIIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIDMEFGNGQKLI 364
Query: 440 LPASNFLIPVDDA-GTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L N+L G +C P +++G I +++D N +GF C
Sbjct: 365 LSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNC 421
>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 459
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 117/434 (26%), Positives = 193/434 (44%), Gaps = 63/434 (14%)
Query: 82 ELVHRDKMSSSSNTTNNMHYHRHQHSF---HARMQ--RDVKRVATLVRRLSGGGADAAKH 136
+L+HRD + S + N+ R + +AR + + + + V GG AA
Sbjct: 38 KLIHRDSIFSPAYNPNDSIKDRAKRMLKNSNARFDYVQAISKRNSAVVDYDGGDTSAADD 97
Query: 137 EVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP 196
+ ++S + + V +G PP QY V+D+GS + W+QC+PC C++Q P
Sbjct: 98 A---YEASLLSEL----CTFLVNFSIGQPPVPQYAVMDTGSSLTWIQCEPCINCHQQKGP 150
Query: 197 VFDPADSASFSGVSCSSAVCDRLEN--AGCHAGRCRYEVSYGDGSYTKGTLALETLTI-- 252
+++P+ S++ S + DR + H C Y +Y D + T+GT A E L
Sbjct: 151 LYNPSSSST----YVSCSDFDRTDTTFTATHGSDCNYSQTYADKTTTRGTYAREQLLFET 206
Query: 253 ---GRTVVKNVAIGCGHKNQ---GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS 306
G T++ +V GCGH N G A+G+ GLG S++ +L G FSYC+
Sbjct: 207 PDDGITIMHDVIFGCGHNNTQLPGPTGYASGVFGLGDSGSSIISKL----GFGFSYCI-- 260
Query: 307 RGTGSSGSLVFGREALPVGAAW------VPLVRNPRAPSFYYVGLSGLGVGGMRIPISED 360
G+ G ++G L +G PLV PR YY+ L G+ +G R+ I
Sbjct: 261 ---GNIGDPLYGFHRLTLGNKLKIEGYSTPLV--PRG--LYYITLVGISIGQERLDIDPI 313
Query: 361 LFRLTQMG--DDGVVMDTGTAVTRLPTPAYEAFRDAFVA-QTGNLPRASGVSI-FDTCY- 415
+F+ + +V+D+G ++ +P AY RD + +G L R ++ CY
Sbjct: 314 VFQRVDLNGISSRIVIDSGATLSYIPRQAYNVVRDKVSSILSGFLSRYRYIARHLSLCYI 373
Query: 416 -----NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS--I 468
+L GF P +F+ + G L D C A P+ S +
Sbjct: 374 GKLNQDLQGF-----PDATFHLADGADLVFQVEGLFFQYTD-NVLCLALVPTESDEETCL 427
Query: 469 IGNIQQEGIQISFD 482
IG + Q+ +++D
Sbjct: 428 IGLLAQQYYNVAYD 441
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 121/411 (29%), Positives = 189/411 (45%), Gaps = 57/411 (13%)
Query: 101 YHRHQHSFHARMQRDVK----RVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEY 156
Y S R +R VK R+A L ++ G + DF +++ + +
Sbjct: 48 YFNPNASVAERAERIVKTSATRIAYLYAQIKG------DIHMNDFELNLLPSTYEPL--F 99
Query: 157 FVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC 216
V +G P Q ++D+GS+I+WV+C PC +C +Q+ P+ DP+ S++++ + C++ +C
Sbjct: 100 LVNFSMGQPATPQLAIMDTGSNILWVRCAPCKRCTQQNGPLLDPSKSSTYASLPCTNTMC 159
Query: 217 DRLENAGCH-AGRCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIGCGHKNQG 270
+A C+ +C Y +SY G + G LA E L G V +V GC H+N G
Sbjct: 160 HYAPSAYCNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSDEGVNAVPSVVFGCSHEN-G 218
Query: 271 MFVGA--AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSG--SLVFGREALPVGA 326
+ G+ GLG G S V ++G + FSYCL + G LVFG +A G
Sbjct: 219 DYKDRRFTGVFGLGKGITSFVTRMGSK----FSYCLGNIADPHYGYNQLVFGEKANFEGY 274
Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
+ V N YYV L G+ VG R+ I F + + + ++D+GTA+T L
Sbjct: 275 STPLKVVNGH----YYVTLEGISVGEKRLDIDSTAFSM-KGNEKSALIDSGTALTWLAES 329
Query: 387 AYEAFRDAFVAQTGN---LPRASGVSIFDTCYNLSGFVS---VRVPTVSFYFSGGPVLTL 440
A+ A D V Q + +P G CY G VS + P V+F+FSGG L L
Sbjct: 330 AFRAL-DNEVRQLLDGVLMPFWRGSF---ACYK--GTVSQDLIGFPVVTFHFSGGADLDL 383
Query: 441 PASNFL---------IPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
+ I V A A+ S+IG + Q+ +++D
Sbjct: 384 DTESMFYQATPDILCIAVRQAS----AYGNDFKSFSVIGLMAQQYYNMAYD 430
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 110/389 (28%), Positives = 175/389 (44%), Gaps = 43/389 (11%)
Query: 126 LSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ 185
L+ GG +A+ + D D+++ +G Y R+ +G+PP+ +++DSGS + +V C
Sbjct: 66 LAEGGRPSARMRLHD---DLLT-----NGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCA 117
Query: 186 PCSQCYKQSDPVFDPADSASFSGVSCS-SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGT 244
C QC DP F P S+++S V C+ CD +N +C YE Y + S + G
Sbjct: 118 SCEQCGNHQDPRFQPDLSSTYSPVKCNVDCTCDSDKN------QCTYERQYAEMSSSSGV 171
Query: 245 LALETLTIG---RTVVKNVAIGCGHKNQGMFVG--AAGLLGLGGGSMSLVGQL--GGQTG 297
L + ++ G + GC + G A G++GLG G +S++ QL G G
Sbjct: 172 LGEDIVSFGTESELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIG 231
Query: 298 GAFSYCLVSRGTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRI 355
+FS C G G++V G P G + VR+P +Y + L + V G +
Sbjct: 232 DSFSMCYGGMDIG-GGAMVLGAMPAPPGMIYTHSNAVRSP----YYNIELKEMHVAGKAL 286
Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV--SIFDT 413
+ +F G G V+D+GT LP A+ AF+DA +Q L + G + D
Sbjct: 287 RVDPRIFD----GKHGTVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDSNYKDI 342
Query: 414 CY-----NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCF-AFAPSPSGL 466
C+ N+S V P V F G L+L N+L G +C F
Sbjct: 343 CFAGAGRNVSQLSEV-FPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPT 401
Query: 467 SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+++G I +++D N +GF C
Sbjct: 402 TLLGGIVVRNTLVTYDRHNEKIGFWKTNC 430
>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
Length = 397
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 108/357 (30%), Positives = 157/357 (43%), Gaps = 35/357 (9%)
Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
+G+PP+ +ID ++VW QC CS+C+KQ P+F P S++F C + C
Sbjct: 49 IGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACKSTPT 108
Query: 222 AGCHAGRCRYEVSYG---DGSYTKGTLALETLTIGRTVVKNVAIGC-GHKNQGMFVGAAG 277
+ C C YE + D T G + ET IG T ++A GC + G +G
Sbjct: 109 SNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIG-TATASLAFGCVVASDIDTMDGTSG 167
Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG---AAWVPLVR- 333
+GLG SLV Q+ FSYCL RGTG S L G A G + P ++
Sbjct: 168 FIGLGRTPRSLVAQMKLT---KFSYCLSPRGTGKSSRLFLGSSAKLAGGESTSTAPFIKT 224
Query: 334 NPRAPS--FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
+P S +Y + L + G I T +VM T + + L AY AF
Sbjct: 225 SPDDDSHHYYLLSLDAIRAGNTTIA--------TAQSGGILVMHTVSPFSLLVDSAYRAF 276
Query: 392 RDAFVAQTG---NLPRASGVSIFDTCY-NLSGFVSVRVPTVSFYFS-GGPVLTLPASNFL 446
+ A G P A+ FD C+ +GF P + F F GG LT+P + +L
Sbjct: 277 KKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGGGAALTVPPAKYL 336
Query: 447 IPV-DDAGTFCFAFAPSP-------SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
I V ++ T C A G+S++G++QQE + +D + F P C
Sbjct: 337 IDVGEEKDTACAAILSMARLNRTGLEGVSVLGSLQQENVHFLYDLKKETLSFEPADC 393
>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 474
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 115/388 (29%), Positives = 164/388 (42%), Gaps = 53/388 (13%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQC-YKQSD----PVFDPADSAS 205
G Y + + +G+PP++ V+D+GS +VW C CS C + D P F P +S++
Sbjct: 90 GGYSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCNFPNIDTTKIPTFIPKNSST 149
Query: 206 FSGVSCSSAVCDRL--------------ENAGCHAGRCRYEVSYGDGSYTKGTLALETLT 251
+ C + C + E+ C Y + YG GS T G L L+ L
Sbjct: 150 AKLLGCRNPKCGYIFGSDVQFRCPQCKPESQNCSLTCPAYIIQYGLGS-TAGFLLLDNLN 208
Query: 252 IGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR---G 308
V +GC + +G+ G G G SL Q+ + FSYCLVS
Sbjct: 209 FPGKTVPQFLVGCSILS---IRQPSGIAGFGRGQESLPSQMNLK---RFSYCLVSHRFDD 262
Query: 309 TGSSGSLVFGR----EALPVGAAWVPLVRNPRA--PSF---YYVGLSGLGVGGMRIPISE 359
T S LV + G ++ P NP P+F YY+ L + VGG + I
Sbjct: 263 TPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSTNNPAFKEYYYLTLRKVIVGGKDVKIPY 322
Query: 360 DLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNLPRASGVSI---FDTCY 415
G+ G ++D+G+ T + P Y FV Q N RA C+
Sbjct: 323 TFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLEKNYSRAEDAETQSGLSPCF 382
Query: 416 NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCF-------AFAPSPSGLSI 468
N+SG +V P ++F F GG +T P N+ V DA C A P +G +I
Sbjct: 383 NISGVKTVTFPELTFKFKGGAKMTQPLQNYFSLVGDAEVVCLTVVSDGGAGPPKTTGPAI 442
Query: 469 I-GNIQQEGIQISFDGANGFVGFGPNVC 495
I GN QQ+ I +D N GFGP C
Sbjct: 443 ILGNYQQQNFYIEYDLENERFGFGPRSC 470
>gi|259490398|ref|NP_001159203.1| uncharacterized protein LOC100304289 [Zea mays]
gi|223942623|gb|ACN25395.1| unknown [Zea mays]
Length = 378
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 117/378 (30%), Positives = 178/378 (47%), Gaps = 35/378 (9%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV--FDPADSA 204
SG G+G+YFVR VG+P + +V D+GSD+ WV+C+ + P F ++S
Sbjct: 5 SGAYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRASESR 64
Query: 205 SFSGVSCSSAVCD-----RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG------ 253
S++ ++CSS C L N A C Y+ Y DGS +G + + TI
Sbjct: 65 SWAPLACSSDTCTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGS 124
Query: 254 ---------RTVVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC 303
R ++ V +GC G F + G+L LG ++S + + GG FSYC
Sbjct: 125 EDGSGGGGRRAKLQGVVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYC 184
Query: 304 LVS----RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISE 359
LV R S + G E AA PLV + R FY V + + V G + I
Sbjct: 185 LVDHLAPRNASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIPA 244
Query: 360 DLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSG 419
D++ + + G G ++D+GT++T L TPAY A A + LPR + + F+ CYN +
Sbjct: 245 DVWDVGRGG--GAILDSGTSLTVLATPAYRAVVAALGGRLAALPRVA-MDPFEYCYNWTA 301
Query: 420 FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFAP-SPSGLSIIGNIQQEGI 477
+P + F+G L PA +++I D A G C + G+S+IGNI Q+
Sbjct: 302 GAP-EIPKLEVSFAGSARLEPPAKSYVI--DAAPGVKCIGVQEGAWPGVSVIGNILQQEH 358
Query: 478 QISFDGANGFVGFGPNVC 495
FD + ++ F C
Sbjct: 359 LWEFDLRDRWLRFKHTRC 376
>gi|296085499|emb|CBI29231.3| unnamed protein product [Vitis vinifera]
Length = 308
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 114/364 (31%), Positives = 161/364 (44%), Gaps = 87/364 (23%)
Query: 144 DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADS 203
D+ S + G G Y + I +G+PP S + D+GSD++W QC PC CYKQ +P+FDP S
Sbjct: 17 DIQSNVISGGGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKKS 76
Query: 204 ASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVK 258
++ T G L+ ET TIG T
Sbjct: 77 KTYK---------------------------------TLGYLSSETFTIGSTEGDPASFP 103
Query: 259 NVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS--L 315
+A GCGH N G F +GL+GLGGG +SLV QL + GG FSYCLV + S+ S +
Sbjct: 104 GLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLVPLSSDSTASSKI 163
Query: 316 VFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMD 375
FG+ A+ G+ G P + + + +++D
Sbjct: 164 NFGKSAVVSGS-------------------------GTSSPAAAE--------ESNIIID 190
Query: 376 TGTAVTRLPTPAYEAFRDAFVA----QTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
+GT +T LP Y A QT PR + F CY SG + +PT++ +
Sbjct: 191 SGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGT----FSLCY--SGVKKLEIPTITAH 244
Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFG 491
F G V P + F+ +D CF+ PS S L+I GN+ Q + +D N V F
Sbjct: 245 FIGADVQLPPLNTFVQAQEDL--VCFSMIPS-SNLAIFGNLSQMNFLVGYDLKNNKVSFK 301
Query: 492 PNVC 495
P C
Sbjct: 302 PTDC 305
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 101/359 (28%), Positives = 162/359 (45%), Gaps = 30/359 (8%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+G Y R+ +G+PP+ +++D+GS + +V C C C K DP F P +S+++ V C+
Sbjct: 85 NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSDCEHCGKHQDPRFQPDESSTYHPVKCN 144
Query: 213 -SAVCDRLENAGCHAG-RCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHK 267
CD H G C YE Y + S + G L + ++ G V + GC +
Sbjct: 145 MDCNCD-------HDGVNCVYERRYAEMSSSSGVLGEDIISFGNQSEVVPQRAVFGCENV 197
Query: 268 NQGMFVG--AAGLLGLGGGSMSLVGQLGGQT--GGAFSYCLVSRGTGSSGSLVFGREALP 323
G A G++GLG G +S+V QL + +FS C G G++V G +P
Sbjct: 198 ETGDLYSQRADGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHVG-GGAMVLG--GIP 254
Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
V +P +Y + L + V G + +S F G V+D+GT L
Sbjct: 255 PPPDMVFSRSDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKH----GTVLDSGTTYAYL 310
Query: 384 PTPAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGF----VSVRVPTVSFYFSGGPV 437
P A+ AFRDA + ++ NL + G + D C++ +G +S P V FS G
Sbjct: 311 PEEAFVAFRDAIIKKSHNLKQIHGPDPNYNDICFSGAGRDVSQLSKAFPEVDMVFSNGQK 370
Query: 438 LTLPASNFLIPVDDA-GTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L+L N+L G +C + +++G I +++D N +GF C
Sbjct: 371 LSLTPENYLFQHTKVHGAYCLGIFRNGDSTTLLGGIIVRNTLVTYDRENEKIGFWKTNC 429
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 110/389 (28%), Positives = 175/389 (44%), Gaps = 43/389 (11%)
Query: 126 LSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ 185
L+ GG +A+ + D D+++ +G Y R+ +G+PP+ +++DSGS + +V C
Sbjct: 66 LAEGGRPSARMRLHD---DLLT-----NGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCA 117
Query: 186 PCSQCYKQSDPVFDPADSASFSGVSCS-SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGT 244
C QC DP F P S+++S V C+ CD +N +C YE Y + S + G
Sbjct: 118 SCEQCGNHQDPRFQPDLSSTYSPVKCNVDCTCDSDKN------QCTYERQYAEMSSSSGV 171
Query: 245 LALETLTIG---RTVVKNVAIGCGHKNQGMFVG--AAGLLGLGGGSMSLVGQL--GGQTG 297
L + ++ G + GC + G A G++GLG G +S++ QL G G
Sbjct: 172 LGEDIVSFGTESELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIG 231
Query: 298 GAFSYCLVSRGTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRI 355
+FS C G G++V G P G + VR+P +Y + L + V G +
Sbjct: 232 DSFSMCYGGMDIG-GGAMVLGAMPAPPGMIYTHSNAVRSP----YYNIELKEMHVAGKAL 286
Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG--VSIFDT 413
+ +F G G V+D+GT LP A+ AF+DA +Q L + G + D
Sbjct: 287 RVDPRIFD----GKHGTVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDPNYKDI 342
Query: 414 CY-----NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCF-AFAPSPSGL 466
C+ N+S V P V F G L+L N+L G +C F
Sbjct: 343 CFAGAGRNVSQLSEV-FPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPT 401
Query: 467 SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+++G I +++D N +GF C
Sbjct: 402 TLLGGIVVRNTLVTYDRHNEKIGFWKTNC 430
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 111/372 (29%), Positives = 170/372 (45%), Gaps = 47/372 (12%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVS 210
YF ++G+G+P + + +D+GSD++WV C+PCS C ++S ++DP +S++ S VS
Sbjct: 2 YFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVS 61
Query: 211 CSSAVC---DRLENAGCH--AGRCRYEVSYGDGSYTKGTL---ALETLTIGRTVVKN--- 259
CS +C R A C C Y SYGDGS ++G A++ I + N
Sbjct: 62 CSDPLCVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTS 121
Query: 260 -VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQTG--GAFSYCLVSRGTGSS 312
V GC + G G++G G +S+ QL Q FS+CL G
Sbjct: 122 QVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL--EGEKRG 179
Query: 313 GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
G ++ G + PLV + Y V L G+ V R+PI + F T D GV
Sbjct: 180 GGILVIGGIAEPGMTYTPLVPD---SVHYNVVLRGISVNSNRLPIDAEDFSSTN--DTGV 234
Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP-RASGVSIFDTCYNLSGFVSVRVPTVSFY 431
+MD+GT + P+ AY F A T P R G+ C+ +SG +S P V+
Sbjct: 235 IMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDT--QCFLVSGRLSDLFPNVTLN 292
Query: 432 FSGGPVLTLPASNFLI-----PVDDAGTFCFAFAPSPSG--------LSIIGNIQQEGIQ 478
F GG + L N+L+ P +C + S S L+I+G+I +
Sbjct: 293 FEGG-AMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKL 351
Query: 479 ISFDGANGFVGF 490
+ +D N +G+
Sbjct: 352 VVYDLDNSRIGW 363
>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 535
Score = 132 bits (333), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 125/448 (27%), Positives = 188/448 (41%), Gaps = 91/448 (20%)
Query: 132 DAAKH---EVQDFGTDVVSGMDQGS------GEYFVRIGVGSPPRSQYMVIDSGSDIVWV 182
D A+H +QD G ++ QG+ G YF ++ +GSP + Y+ ID+GSDI+W+
Sbjct: 38 DRARHGGRILQDGGGGILDFSVQGTSDPYLVGLYFTKVKMGSPAKEFYVQIDTGSDILWL 97
Query: 183 QCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCD---RLENAGC--HAGRCRYE 232
C C+ C K S FD A S++ + VSCS VC + + C A +C Y
Sbjct: 98 NCNTCNNCPKSSGLGIDLNYFDTASSSTAALVSCSDPVCSYAVQTATSQCSSQANQCSYT 157
Query: 233 VSYGDGSYTKGTLALETL----TIGRTVVKN----VAIGCGHKNQGMFV----GAAGLLG 280
YGDGS T G + + +G++V N V GC G G+ G
Sbjct: 158 FQYGDGSGTSGYYVYDAMYFDVIMGQSVFSNSSSTVVFGCSTYQSGDLARTEKAVDGIFG 217
Query: 281 LGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAP 338
G G++S+V Q+ G FS+CL +G GS G ++ E L + PLV P P
Sbjct: 218 FGPGALSVVSQVSSQGMAPKVFSHCL--KGQGSGGGILVLGEILEPNIVYTPLV--PLQP 273
Query: 339 SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA---- 394
Y + L + V G +PI +D+F + G ++D+GT + L AY+ F +A
Sbjct: 274 H-YNLNLQSIAVNGQILPIDQDVFATGN--NRGTIVDSGTTLAYLVQEAYDPFLNAGSPC 330
Query: 395 -----FVAQTGNLPRASG-------------------------------VSIF------- 411
F T N+ G VS F
Sbjct: 331 HFFTHFNEPTNNIKYEDGNNNHQSRVKRHYYDEVTLRLVLKHSAIITTTVSQFSKPIISK 390
Query: 412 -DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIP---VDDAGTFCFAFAPSPSGLS 467
+ CY + + P VS F GG + L +LI +D A +C F G +
Sbjct: 391 GNQCYLVPTSLGDIFPLVSLNFMGGASMVLKPEQYLIHYGFLDGAAMWCIGFQKVQKGYT 450
Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
I+G++ + +D AN +G+ C
Sbjct: 451 ILGDLVLKDKIFVYDLANQRIGWTDYDC 478
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 158/357 (44%), Gaps = 26/357 (7%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+G Y R+ +G+PP+ +++D+GS + +V C C QC K DP F P S S+ + C+
Sbjct: 73 NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCN 132
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHKNQ 269
N C YE Y + S + G L+ + ++ G + + GC ++
Sbjct: 133 PDC-----NCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCENEET 187
Query: 270 GMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
G A G++GLG G +S+V QL G FS C G G++V G+ + P G
Sbjct: 188 GDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVG-GGAMVLGKISPPPG 246
Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
+ +P +Y + L + V G + ++ +F G G V+D+GT P
Sbjct: 247 MVFS--HSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFN----GKHGTVLDSGTTYAYFPK 300
Query: 386 PAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGFVSVRV----PTVSFYFSGGPVLT 439
A+ A +DA + + +L R G + D C++ +G + P ++ F G L
Sbjct: 301 EAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLI 360
Query: 440 LPASNFLIPVDDA-GTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L N+L G +C P +++G I +++D N +GF C
Sbjct: 361 LSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNC 417
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 158/357 (44%), Gaps = 26/357 (7%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+G Y R+ +G+PP+ +++D+GS + +V C C QC K DP F P S S+ + C+
Sbjct: 73 NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCN 132
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHKNQ 269
N C YE Y + S + G L+ + ++ G + + GC ++
Sbjct: 133 PDC-----NCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCENEET 187
Query: 270 GMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
G A G++GLG G +S+V QL G FS C G G++V G+ + P G
Sbjct: 188 GDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVG-GGAMVLGKISPPPG 246
Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
+ +P +Y + L + V G + ++ +F G G V+D+GT P
Sbjct: 247 MVFS--HSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFN----GKHGTVLDSGTTYAYFPK 300
Query: 386 PAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGFVSVRV----PTVSFYFSGGPVLT 439
A+ A +DA + + +L R G + D C++ +G + P ++ F G L
Sbjct: 301 EAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLI 360
Query: 440 LPASNFLIPVDDA-GTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L N+L G +C P +++G I +++D N +GF C
Sbjct: 361 LSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNC 417
>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
Length = 464
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 105/358 (29%), Positives = 164/358 (45%), Gaps = 38/358 (10%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
G Y+ I +GSPP+ +V+D+GSD+ WV+C PCS FD S ++ ++C+
Sbjct: 122 GVYYSSITLGSPPKDFSLVMDTGSDLTWVRCDPCS---PDCSSTFDRLASNTYKALTCAD 178
Query: 214 ----AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQ 269
V RL H+GR + G+ + LE GCG +
Sbjct: 179 DLRLPVLLRLWRRLFHSGRSLRDTLKMAGAASD---ELEEF-------PGFVFGCGSLLK 228
Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS---GSLVFGREAL---- 322
G+ G G+L L GS+S Q+G + G FSYCL+ + +S +VFG A+
Sbjct: 229 GLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFGEAAVELKE 288
Query: 323 PVGAAWVPLVRNPRAPS--FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
P L P S +Y V L G+ VG R+ +S F Q D + D+GT +
Sbjct: 289 PGSGKPQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSTFLNGQ--DKPTIFDSGTTL 346
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
T LP+ ++ + + + A V+I D C+ + +P ++F+F+GG
Sbjct: 347 TMLPSGVCDSIKQSLASMVSG---AEFVAIKGLDACFRVPPSSGQGLPDITFHFNGGADF 403
Query: 439 TLPASNFLIPVDDAGTF-CFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
SN++I D G+ C F P+ + +SI GN+QQ+ + D N +GF C
Sbjct: 404 VTRPSNYVI---DLGSLQCLIFVPT-NEVSIFGNLQQQDFFVLHDMDNRRIGFKETDC 457
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 112/374 (29%), Positives = 171/374 (45%), Gaps = 47/374 (12%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
G YF ++G+G+P + + +D+GSD++WV C+PCS C ++S ++DP +S++ S
Sbjct: 27 GLYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSL 86
Query: 209 VSCSSAVC---DRLENAGCH--AGRCRYEVSYGDGSYTKGTL---ALETLTIGRTVVKN- 259
VSCS +C R A C C Y SYGDGS ++G A++ I + N
Sbjct: 87 VSCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANT 146
Query: 260 ---VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQTG--GAFSYCLVSRGTG 310
V GC + G G++G G +S+ QL Q FS+CL G
Sbjct: 147 TSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL--EGEK 204
Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
G ++ G + PLV + Y V L G+ V R+PI + F T D
Sbjct: 205 RGGGILVIGGIAEPGMTYTPLVPD---SVHYNVVLRGISVNSNRLPIDAEDFSSTN--DT 259
Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP-RASGVSIFDTCYNLSGFVSVRVPTVS 429
GV+MD+GT + P+ AY F A T P R G+ C+ +SG +S P V+
Sbjct: 260 GVIMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDT--QCFLVSGRLSDLFPNVT 317
Query: 430 FYFSGGPVLTLPASNFLI-----PVDDAGTFCFAFAPSPSG--------LSIIGNIQQEG 476
F GG + L N+L+ P +C + S S L+I+G+I +
Sbjct: 318 LNFEGG-AMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKD 376
Query: 477 IQISFDGANGFVGF 490
+ +D N +G+
Sbjct: 377 KLVVYDLDNSRIGW 390
>gi|356563324|ref|XP_003549914.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 480
Score = 132 bits (332), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 120/406 (29%), Positives = 167/406 (41%), Gaps = 71/406 (17%)
Query: 155 EYFVRIGVGSPPRSQ--YMVIDSGSDIVWVQCQP--CSQCY-KQSDPVFDPADSASFS-G 208
+Y + +G ++Q + +D+GSD+VW C P C C K ++P P + + S
Sbjct: 69 DYTLSFNLGPQAQAQPITLYMDTGSDLVWFPCAPFKCILCEGKPNEPNASPPTNITQSVA 128
Query: 209 VSCSSAVCDRLENAG-----CHAGRCRYE----------------VSYGDGSYTKGTLAL 247
VSC S C N C A RC E +YGDGS L
Sbjct: 129 VSCKSPACSAAHNLAPPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSLI-ARLYR 187
Query: 248 ETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLG---GQTGGAFSYCL 304
+TL++ ++N GC H G+ G G G +SL QL Q G FSYCL
Sbjct: 188 DTLSLSSLFLRNFTFGCAHTT---LAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCL 244
Query: 305 VSRGTGSS-----GSLVFGR----EALPVGA-----AWVPLVRNPRAPSFYYVGLSGLGV 350
VS S L+ GR E +G + ++ NP+ P FY V L G+ V
Sbjct: 245 VSHSFDSERVRKPSPLILGRYEEKEKEKIGGGVAEFVYTSMLENPKHPYFYTVSLIGIAV 304
Query: 351 GGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG-NLPRASGVS 409
G IP E L R+ GD GVV+D+GT T LP Y + D F + G + RA +
Sbjct: 305 GKRTIPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRRVGRDNKRARKIE 364
Query: 410 I---FDTCYNLSGFVSVRVPTVSFYFSGGP--VLTLPASNFLIPVDD----------AGT 454
CY L+ VP ++ F+GG + LP N+ D G
Sbjct: 365 EKTGLAPCYYLNSVAD--VPALTLRFAGGKNSSVVLPRKNYFYEFSDGSDGAKGKRKVGC 422
Query: 455 FCFAFAPSPSGLS-----IIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ LS +GN QQ+G ++ +D VGF C
Sbjct: 423 LMLMNGGDEADLSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQC 468
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 132 bits (332), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 116/427 (27%), Positives = 187/427 (43%), Gaps = 48/427 (11%)
Query: 85 HRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTD 144
HR + +T+N+ HR F + R R+L A + D D
Sbjct: 36 HRPMIIPLHLSTSNISSHRK--PFTSNYHR---------RQLHNSDLPNAHMRLYD---D 81
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
++S +G Y R+ +G+PP+ +++D+GS + +V C C QC K DP F P S+
Sbjct: 82 LLS-----NGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSTCEQCGKHQDPRFQPESSS 136
Query: 205 SFSGVSCS-SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNV 260
++ + C+ S CD +C YE Y + S + G LA + L+ G +
Sbjct: 137 TYKPMQCNPSCNCDD------EGKQCTYERRYAEMSSSSGLLAEDVLSFGNESELTPQRA 190
Query: 261 AIGCGHKNQGMFVG--AAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSSGSLV 316
GC G A G++GLG G +S+V QL + G +FS C G++V
Sbjct: 191 IFGCETVETGELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDV-VGGAMV 249
Query: 317 FGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
G +P V +P ++Y + L L V G R+ ++ +F G G V+D+
Sbjct: 250 LGN--IPPPPDMVFAHSDPYRSAYYNIELKELHVAGKRLKLNPRVFD----GKHGTVLDS 303
Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGF----VSVRVPTVSF 430
GT LP A+ AF+DA + + L + G S D C++ +G +S P V+
Sbjct: 304 GTTYAYLPEEAFVAFKDAIIKEIKFLKQIHGPDPSYNDICFSGAGRDVSQLSKIFPEVNM 363
Query: 431 YFSGGPVLTLPASNFLIP-VDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
F G L+L N+L +G +C F +++G I +++D N +
Sbjct: 364 VFGNGQKLSLSPENYLFRHTKVSGAYCLGIFQNGKDPTTLLGGIVVRNTLVTYDRDNDKI 423
Query: 489 GFGPNVC 495
GF C
Sbjct: 424 GFWKTNC 430
>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 470
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 114/391 (29%), Positives = 169/391 (43%), Gaps = 59/391 (15%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQC-YKQSD----PVFDPADSAS 205
G Y + + +G+PP++ V+D+GS +VW C CS C + D P F P +S++
Sbjct: 86 GGYSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIPTFIPKNSST 145
Query: 206 FSGVSCSSAVCDRL--ENAGCHAGRCR-------------YEVSYGDGSYTKGTLALETL 250
+ C + C L + +C+ Y + YG G+ T G L L+ L
Sbjct: 146 AKLLGCRNPKCGYLFGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGA-TAGFLLLDNL 204
Query: 251 TIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--- 307
V +GC + +G+ G G G SL Q+ + FSYCLVS
Sbjct: 205 NFPGKTVPQFLVGCSILS---IRQPSGIAGFGRGQESLPSQMNLK---RFSYCLVSHRFD 258
Query: 308 GTGSSGSLVFGR----EALPVGAAWVPLVRNPRAPS----FYYVGLSGLGVGGMRIPISE 359
T S LV + G ++ P NP S +YYV L L VGG+ + I
Sbjct: 259 DTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRKLIVGGVDVKIPY 318
Query: 360 DLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG-------NLPRASGVSIFD 412
G+ G ++D+G+ T + P Y F+ Q G N+ SG+S
Sbjct: 319 KFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSREENVEAQSGLS--- 375
Query: 413 TCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCF-------AFAPSPSG 465
C+N+SG ++ P +F F GG ++ P N+ V DA CF A P +G
Sbjct: 376 PCFNISGVKTISFPEFTFQFKGGAKMSQPLLNYFSFVGDAEVLCFTVVSDGGAGQPKTAG 435
Query: 466 LSII-GNIQQEGIQISFDGANGFVGFGPNVC 495
+II GN QQ+ + +D N GFGP C
Sbjct: 436 PAIILGNYQQQNFYVEYDLENERFGFGPRNC 466
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 115/367 (31%), Positives = 165/367 (44%), Gaps = 41/367 (11%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD---PV--FDPADSASFS 207
+G YF ++ +G+PPR+ + +D+GSD++WV C PC C SD P+ +D SAS S
Sbjct: 33 AGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSS 92
Query: 208 GVSCSSAVC---DRLENAGCH-AGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIG 263
V CS C ++ +GC+ +C Y YGDGS T G L + L V G
Sbjct: 93 KVPCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYMVNATATVIFG 152
Query: 264 CGHKNQGMFVGAA----GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVF 317
CG K G + G++G G +S QL G+T F++CL G G LV
Sbjct: 153 CGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCL-DGGERGGGILVL 211
Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
G P + PLV P S Y V L + V + I LF M G + D+G
Sbjct: 212 GNVIEP-DIQYTPLV--PYM-SHYNVVLQSISVNNANLTIDPKLFSNDVM--QGTIFDSG 265
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTC-YNLSGFVSVRVPTVSFYFSGGP 436
T + LP AY+AF A + V+ F C LS F+ P V YF G
Sbjct: 266 TTLAYLPDEAYQAFTQAV---------SLVVAPFLLCDTRLSRFIYKLFPNVVLYFEGAS 316
Query: 437 VLTLPASNFLI---PVDDAGTFCFAF-----APSPSGLSIIGNIQQEGIQISFDGANGFV 488
+TL + +LI +A +C + A S +I G++ + + +D G +
Sbjct: 317 -MTLTPAEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGRI 375
Query: 489 GFGPNVC 495
G+ P C
Sbjct: 376 GWRPFDC 382
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 120/418 (28%), Positives = 190/418 (45%), Gaps = 56/418 (13%)
Query: 101 YHRHQHSFHARMQRDVK----RVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEY 156
+++ + RM+ D++ R A + R+ G +++ + VS G
Sbjct: 49 HYKPNETAKDRMELDIQHSAARFAYIQARIEGSLVSNNEYKAR------VSPSLTGR-TI 101
Query: 157 FVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC 216
I +G PP Q +V+D+GSDI+WV C PC+ C +FDP+ S++FS + C + C
Sbjct: 102 MANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNHLGLLFDPSMSSTFSPL-CKTP-C 159
Query: 217 DRLENAGCHAGRCR---YEVSYGDGSYTKG-----TLALETLTIGRTVVKNVAIGCGHK- 267
D GC RC + V+Y D S G T+ ET G + + +V GCGH
Sbjct: 160 DF---KGC--SRCDPIPFTVTYADNSTASGMFGRDTVVFETTDEGTSRIPDVLFGCGHNI 214
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL--VSRGTGSSGSLVFGREALPVG 325
Q G G+LGL G SL ++G + FSYC+ ++ + L+ G A G
Sbjct: 215 GQDTDPGHNGILGLNNGPDSLATKIGQK----FSYCIGDLADPYYNYHQLILGEGADLEG 270
Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
+ V N FYYV + G+ VG R+ I+ + F + + GV++DTG+ +T L
Sbjct: 271 YSTPFEVHN----GFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGSTITFLVD 326
Query: 386 PAYEAFRDAFVAQTGNLPRASGV--SIFDTCY------NLSGFVSVRVPTVSFYFSGGPV 437
+ G R + + S + C+ +L GF P V+F+F+ G
Sbjct: 327 SVHRLLSKEVRNLLGWSFRQTTIEKSPWMQCFYGSISRDLVGF-----PVVTFHFADGAD 381
Query: 438 LTLPASNFLIPVDDAGTFCFAFAPS-----PSGLSIIGNIQQEGIQISFDGANGFVGF 490
L L + +F ++D FC P S S+IG + Q+ + +D N FV F
Sbjct: 382 LALDSGSFFNQLND-NVFCMTVGPVSSLNLKSKPSLIGLLAQQSYSVGYDLVNQFVYF 438
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 123/404 (30%), Positives = 175/404 (43%), Gaps = 58/404 (14%)
Query: 103 RHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTD------VVSGMDQGSGEY 156
H+ RD R L++ L G V DF D VV G Y
Sbjct: 38 NHEMELSQLKARDKARHGRLLQSLGG---------VIDFPVDGTFDPFVV-------GLY 81
Query: 157 FVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSC 211
+ +I +GSPPR Y+ +D+GSD++WV C C+ C + S FDP S + + VSC
Sbjct: 82 YTKIRLGSPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTATPVSC 141
Query: 212 SSAVCD---RLENAGC--HAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN--- 259
S C + ++GC C Y YGDGS T G + L +G ++V N
Sbjct: 142 SDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTA 201
Query: 260 -VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSS 312
V GC G V G+ G G MS++ QL Q FS+CL G
Sbjct: 202 PVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCLKGE-NGGG 260
Query: 313 GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
G LV G P + PLV P P Y V L + V G +PI+ +F + G
Sbjct: 261 GILVLGEIVEP-NMVFTPLV--PSQPH-YNVNLLSISVNGQALPINPSVFSTSN--GQGT 314
Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYF 432
++DTGT + L AY F +A R VS + CY ++ V+ P VS F
Sbjct: 315 IIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPV-VSKGNQCYVIATSVADIFPPVSLNF 373
Query: 433 SGGPVLTLPASNFLIPVDDAG---TFCFAFAP-SPSGLSIIGNI 472
+GG + L ++LI ++ G +C F G++I+G++
Sbjct: 374 AGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDL 417
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 117/380 (30%), Positives = 172/380 (45%), Gaps = 60/380 (15%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
V + VG PP++ MV+D+GS++ W+ C+ VF+P S+++S V CSS +C
Sbjct: 67 VTLAVGDPPQNISMVLDTGSELSWLHCKKSPNL----GSVFNPVSSSTYSPVPCSSPICR 122
Query: 217 ----DRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK--- 267
D A C C +SY D + +G LA ET IG GC
Sbjct: 123 TRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTLFGCMDSGLS 182
Query: 268 -NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL---- 322
N + GL+G+ GS+S V QLG FSYC+ G+ SSG L+ G +
Sbjct: 183 SNSEEDAKSTGLMGMNRGSLSFVNQLGFS---KFSYCI--SGSDSSGFLLLGDASYSWLG 237
Query: 323 PVGAAWVPLV-RNPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
P+ + PLV ++ P F Y V L G+ VG + + + +F G ++D+G
Sbjct: 238 PI--QYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSG 295
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCY--------NLSGFVSV 423
T T L P Y A ++ F+ QT ++ R F D CY N SG
Sbjct: 296 TQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSG---- 351
Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGT------FCFAFAPSP-SGLS--IIGNIQQ 474
+P VS F G +++ L V+ AG+ +CF F S G+ +IG+ Q
Sbjct: 352 -LPMVSLMFRGAE-MSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQ 409
Query: 475 EGIQISFDGANGFVGFGPNV 494
+ + + FD A VGF NV
Sbjct: 410 QNVWMEFDLAKSRVGFAGNV 429
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 98/341 (28%), Positives = 157/341 (46%), Gaps = 41/341 (12%)
Query: 124 RRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQ 183
RRL G A+ + D D++ +G Y RI +G+PP++ +++D+GS + +V
Sbjct: 66 RRLQGSARPNARMRLYD---DLLL-----NGYYTTRIWIGTPPQTFALIVDTGSTVTYVP 117
Query: 184 CQPCSQCYKQSDPVFDPADSASFSGVSCS-SAVCDRLENAGCHAGRCRYEVSYGDGSYTK 242
C C QC + DP F+P S+++ VSC+ CD +C YE Y + S +
Sbjct: 118 CSTCEQCGRHQDPKFEPELSSTYQPVSCNIDCTCDN------ERKQCVYERQYAEMSSSS 171
Query: 243 GTLALETLTIG---RTVVKNVAIGCGHKNQGMFVG--AAGLLGLGGGSMSLVGQL--GGQ 295
G L + ++ G V + GC ++ G A G++GLG G +S+V QL G
Sbjct: 172 GVLGEDIISFGNQSELVPQRAIFGCENQETGDLYSQRADGIMGLGRGDLSIVDQLVEKGV 231
Query: 296 TGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
+FS C G G+++ G + P G + +P +Y + L + V G ++
Sbjct: 232 ISDSFSLCYGGMDIG-GGAMILGGISPPSGMVFAE--SDPVRSQYYNIDLKAIHVAGKQL 288
Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCY 415
+ +F G G V+D+GT LP A+ AF+DA + + +L + G D Y
Sbjct: 289 HLDPSIFD----GKHGTVLDSGTTYAYLPEAAFTAFKDAMMKELTSLKQIHGP---DPNY 341
Query: 416 NLSGF---------VSVRVPTVSFYFSGGPVLTLPASNFLI 447
N F +S P V FS G L+L N+L
Sbjct: 342 NDICFSGAESDVSQLSNTFPAVEMVFSNGQKLSLSPENYLF 382
>gi|194701538|gb|ACF84853.1| unknown [Zea mays]
gi|194703714|gb|ACF85941.1| unknown [Zea mays]
Length = 208
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 89/221 (40%), Positives = 119/221 (53%), Gaps = 17/221 (7%)
Query: 279 LGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV--PLVRNPR 336
+GLGGG+ SLV Q G G AFSYCL + SSG L G + +V P++R+ +
Sbjct: 1 MGLGGGAQSLVSQTAGTLGRAFSYCLPPTPS-SSGFLTLGAAGGSGTSGFVKTPMLRSSQ 59
Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV 396
P+FY V L + VGG ++ I +F G VMD+GT +TRLP AY A AF
Sbjct: 60 VPTFYGVRLQAIRVGGRQLSIPASVF------SAGTVMDSGTVITRLPPTAYSALSSAFK 113
Query: 397 AQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFC 456
A P A I DTC++ SG SV +P+V+ FSGG V++L AS ++ + C
Sbjct: 114 AGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL------SNC 167
Query: 457 FAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
AFA S L IIGN+QQ ++ +D G VGF C
Sbjct: 168 LAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 208
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 109/397 (27%), Positives = 177/397 (44%), Gaps = 37/397 (9%)
Query: 115 DVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVID 174
+ R+A+ R L GG +A+ + D D+++ +G Y R+ +G+PP+ +++D
Sbjct: 52 NASRLASSRRVLGDGGRPSARMRLHD---DLLT-----NGYYTTRLYIGTPPQEFALIVD 103
Query: 175 SGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS-AVCDRLENAGCHAGRCRYEV 233
SGS + +V C C QC DP F P S+++S V CS+ CD +C YE
Sbjct: 104 SGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCSADCTCD------SDKSQCTYER 157
Query: 234 SYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHKNQGMFVG--AAGLLGLGGGSMSL 288
Y + S + G L + ++ G + GC + G A G++GLG G +S+
Sbjct: 158 QYAEMSSSSGVLGEDIVSFGTESELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSI 217
Query: 289 VGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLS 346
+ QL G G +FS C G G++V G A+P V +P +Y + L
Sbjct: 218 MDQLVDKGVIGDSFSMCYGGMDIG-GGAMVLG--AMPAPPDMVFSRSDPVRSPYYNIELK 274
Query: 347 GLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
+ V G + + +F G V+D+GT LP A+ AF+DA ++ L +
Sbjct: 275 EIHVAGKALRLDPRIFD----SKHGTVLDSGTTYAYLPEQAFVAFKDAVTSKVRPLKKIR 330
Query: 407 G--VSIFDTCYNLSGF----VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCF-A 458
G + D C+ +G +S P V F G L+L N+L G +C
Sbjct: 331 GPDPNYKDICFAGAGRNVSQLSQAFPDVDMVFGDGQKLSLSPENYLFRHSKVEGAYCLGV 390
Query: 459 FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
F +++G I +++D N +GF C
Sbjct: 391 FQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 427
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 112/361 (31%), Positives = 168/361 (46%), Gaps = 53/361 (14%)
Query: 157 FVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC 216
V + +G P Q +V+D+GSDI+W+ C PC+ C +FDP+ S++FS + C +
Sbjct: 102 LVNLSIGQPSIPQLVVMDTGSDILWIMCNPCTNCDNHLGLLFDPSMSSTFSPL-CKTPCG 160
Query: 217 DRLENAGCHAGRCRYEVSYGDGSYTKGT-----LALETLTIGRTVVKNVAIGCGHKNQGM 271
+ GC + +SY D S GT L ET G + + +V IGCGH N G
Sbjct: 161 FK----GCKCDPIPFTISYVDNSSASGTFGRDILVFETTDEGTSQISDVIIGCGH-NIGF 215
Query: 272 FV--GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA-AW 328
G G+LGL G SL Q+G + FSYC+ G+ + L +G A
Sbjct: 216 NSDPGYNGILGLNNGPNSLATQIGRK----FSYCI-----GNLADPYYNYNQLRLGEGAD 266
Query: 329 VPLVRNPRA--PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
+ P FYYV + G+ VG R+ I+ + F + + G GV++D+GT +T L
Sbjct: 267 LEGYSTPFEVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTITYLVDS 326
Query: 387 AYEAFRDAFVAQTGNLPRASGVSI------FDTCYNLSGFVS---VRVPTVSFYFSGGPV 437
A++ + + NL + S + + CY G +S V P V+F+F G
Sbjct: 327 AHKLLYN----EVRNLLKWSFRQVIFENAPWKLCY--YGIISRDLVGFPVVTFHFVDGAD 380
Query: 438 LTLPASNFLIPVDDAGTFCFAFAP--------SPSGLSIIGNIQQEGIQISFDGANGFVG 489
L L +F DD FC +P SP S+IG + Q+ + +D N FV
Sbjct: 381 LALDTGSFFSQRDD--IFCMTVSPASILNTTISP---SVIGLLAQQSYNVGYDLVNQFVY 435
Query: 490 F 490
F
Sbjct: 436 F 436
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 102/364 (28%), Positives = 163/364 (44%), Gaps = 36/364 (9%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
V + +G+PP+ Q MV+D+GS + W+QC + FDP+ S++FS + C+ VC
Sbjct: 99 VDLPIGTPPQVQPMVLDTGSQLSWIQCHKKAPAKPPPTASFDPSLSSTFSTLPCTHPVCK 158
Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTV-VKNVAIGCGHKNQG 270
D C R C Y Y DG+Y +G L E T R++ + +GC ++
Sbjct: 159 PRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSRSLFTPPLILGCATES-- 216
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----GTGSSGSLVFGREALPVGA 326
G+LG+ G +S Q FSYC+ +R G +GS G
Sbjct: 217 --TDPRGILGMNRGRLSFASQ---SKITKFSYCVPTRVTRPGYTPTGSFYLGHNPNSNTF 271
Query: 327 AWVPLV---RNPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
++ ++ R+ R P+ Y V L G+ +GG ++ IS +FR G ++D+G+
Sbjct: 272 RYIEMLTFARSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRADAGGSGQTMLDSGSE 331
Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF----DTCYNLSGF-VSVRVPTVSFYFSG 434
T L AY+ R V G PR ++ D C++ + + + + F F
Sbjct: 332 FTYLVNEAYDKVRAEVVRAVG--PRMKKGYVYGGVADMCFDGNAIEIGRLIGDMVFEFEK 389
Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFAPSP---SGLSIIGNIQQEGIQISFDGANGFVGFG 491
G + +P L V + G C A S + +IIGN Q+ + + FD N +GFG
Sbjct: 390 GVQIVVPKERVLATV-EGGVHCIGIANSDKLGAASNIIGNFHQQNLWVEFDLVNRRMGFG 448
Query: 492 PNVC 495
C
Sbjct: 449 TADC 452
>gi|357117301|ref|XP_003560410.1| PREDICTED: uncharacterized protein LOC100833752 [Brachypodium
distachyon]
Length = 473
Score = 131 bits (330), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 110/371 (29%), Positives = 166/371 (44%), Gaps = 36/371 (9%)
Query: 156 YFVRIGVGSPP--RSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
Y V +GVG+ + + +D + W+QC PC C Q +PVFDPA S +F VS +
Sbjct: 101 YAVAVGVGTEHGYENYELEMDMAAGFSWMQCAPCHPCLPQLNPVFDPAKSPTFRPVSGHN 160
Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-----TVVKNVAIGCGHK- 267
AV R GRC + ++Y +G+ G LA +T + + + GC ++
Sbjct: 161 AVLCRPPYHPLQDGRCGFGIAYRNGASAAGYLARDTFSFPTGDNNFQHLPGIVFGCANRI 220
Query: 268 ----NQGMFVGAAGLLGLGGGSMSLVG---QLGGQTGGAFSYCLVSRGTGSSGSLVFGRE 320
G G G +G+G L G QL GG FSYC + GT + L FG +
Sbjct: 221 ARFDTHGALAGVLG-MGMGAEGKPLTGFMRQLYHNGGGRFSYCPIVPGTTAYSFLRFGND 279
Query: 321 ---ALPVGAAWVPL-VRNPRAPS-FYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVM 374
P G + V P S YYV L+G+ VG +R+P ++ ++F Q G G +
Sbjct: 280 IPSQPPAGVHRQSMAVLAPTTTSEAYYVKLAGISVGALRVPGVTPEMFERDQHGRGGCAI 339
Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI----FDTCYNLSGFVSVRVPTVSF 430
D GT +T + AY A G+L R + C + + + R+P+++
Sbjct: 340 DIGTKMTAIVQTAYAHVEAAV---RGHLQRNRARFVQSPGHHLCVHRTPAIEERLPSMTL 396
Query: 431 YFSGGPVLTL-PASNFLI---PVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
+F GGP L + P FL+ P C P + +++IG +QQ + FD N
Sbjct: 397 HFVGGPWLRVKPQHLFLVVGSPTGGGEYLCLGLVPD-AEMTVIGAMQQIDTRFIFDLHNN 455
Query: 487 --FVGFGPNVC 495
V F P C
Sbjct: 456 IPIVSFNPEDC 466
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 131 bits (330), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 107/376 (28%), Positives = 173/376 (46%), Gaps = 46/376 (12%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD------PVFDPADSASF 206
+G Y+ +I +G+PP Y+ +D+GSD+ W+ C PC+ C ++ +DP+ S++
Sbjct: 34 TGLYYTKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYDPSRSSTD 93
Query: 207 SGVSCSSAVCDRL----ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI----GRTVVK 258
+SC + C E + AG C Y +YGDGS T+G + +T T V
Sbjct: 94 GALSCRDSNCGAALGSNEVSCTSAGYCAYSTTYGDGSSTQGYFIQDVMTFQEIHNNTQVN 153
Query: 259 ---NVAIGCGHKNQGMFVGAA----GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGT 309
+V GCG G + ++ GL+G G ++S+ QL G+ G F++CL
Sbjct: 154 GTASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCLQGDNQ 213
Query: 310 GSSGSLVFGREALPVGAAWVPLV-RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
G G++V G + P ++ P+V RN Y VG+ + V G + F T
Sbjct: 214 G-GGTIVIGSVSEP-NISYTPIVSRN-----HYAVGMQNIAVNGRNVTTPAS-FDTTSTS 265
Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGF-VSVRVPT 427
GV+MD+GT + L PAY F +A + +S S C L+ + PT
Sbjct: 266 AGGVIMDSGTTLAYLVDPAYTQFVNAV-----STFESSMFSSHSQCLQLAWCSLQADFPT 320
Query: 428 VSFYFSGGPVLTLPASNFLI--PVDDA-GTFCFAFAPSPS-----GLSIIGNIQQEGIQI 479
V +F G V+ L N+L P+ + +C + S + SI+G+I + +
Sbjct: 321 VKLFFDAGAVMNLTPRNYLYSQPLQNGQAAYCMGWQKSTTKAGYLSYSILGDIVLKDHLV 380
Query: 480 SFDGANGFVGFGPNVC 495
+D N VG+ C
Sbjct: 381 VYDNDNRVVGWKSFDC 396
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 131 bits (329), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 122/453 (26%), Positives = 191/453 (42%), Gaps = 68/453 (15%)
Query: 81 LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
LEL H D + + RM+R +R + ++GGG +A+
Sbjct: 35 LELTHVDA--------------KQNCTTKERMRRATERTHRRLASMAGGGGEAS------ 74
Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ--CYKQSDPVF 198
+ + +Y +G PP+ +ID+GS+++W QC C C+ Q +
Sbjct: 75 ------APIHWNETQYIAEYLIGDPPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFY 128
Query: 199 DPADSASFSGVSCSSAVCDRLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTI--GR 254
DP+ S + V+C+ C C C +YG G+ G L E T G+
Sbjct: 129 DPSRSRTAKPVACNDTACLLGSETRCARDGKACAVLTAYGAGA-IGGFLGTEVFTFGHGQ 187
Query: 255 TVVKNV--AIGC---GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT 309
+ NV A GC G GA+G++GLG G +SL QLG FSYCL +
Sbjct: 188 SSENNVSLAFGCITASRLTPGSLDGASGIIGLGRGKLSLPSQLGDNK---FSYCLTPYFS 244
Query: 310 GSSGSLVF------GREALPVGAAWVPLVRNPRA---PSFYYVGLSGLGVGGMRIPISED 360
++ + G A VP ++NP SFYY+ L+G+ VG ++ +
Sbjct: 245 DAANTSTLFVGASAGLSGGGAPATSVPFLKNPDDDPFDSFYYLPLTGITVGTAKLDVPAA 304
Query: 361 LFRLTQMGDD---GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN--LPRASGVSIFDTCY 415
F L ++ G ++D+G+ T L AY+A RD V Q G +P +G D C
Sbjct: 305 AFDLREVAPAKWGGTLIDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPAGAEGLDLCV 364
Query: 416 N--LSGFVSVRVPTVSFYFSGGPV----LTLPASNFLIPVDDAGTFCFAFA---PSPS-- 464
G VP + +F G + +P N+ PVDD+ F+ P+ +
Sbjct: 365 GGVAPGDAGKLVPPLVLHFGSGGGGGGDVVVPPENYWGPVDDSTACMVVFSSGGPNSTLP 424
Query: 465 --GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+IIGN Q+ + + +D G + F P C
Sbjct: 425 LNETTIIGNYMQQDMHLLYDLGQGVLSFQPADC 457
>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 131 bits (329), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 121/439 (27%), Positives = 190/439 (43%), Gaps = 48/439 (10%)
Query: 74 SDEARWNLELVHRDKMSS----SSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGG 129
+++ + EL+HRD +S +S TT+ R+ V+R A V R +
Sbjct: 32 AEKLSFTTELIHRDSPNSPLFNASETTD------------IRLANAVERSADRVNRFN-- 77
Query: 130 GADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ 189
D + + + S +D G ++ ++I +G PP + + +GSD+VW+ C
Sbjct: 78 --DLISNSIT--AAEFPSILDNG--DFLMKISIGIPPTELLVNVATGSDLVWIPCLSFKP 131
Query: 190 CYKQSD-PVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVS-YGDGSYTKGTLAL 247
C D FDP +S+++ V C S C A C C Y S G LA+
Sbjct: 132 CTHNCDLRFFDPMESSTYKNVPCDSYRCQITNAATCQFSDCFYSCDPRHQDSCPDGDLAM 191
Query: 248 ETLTIGRT-----VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
+TLT+ T ++ N CG++ G + G G+LGLG GS+SL+ ++ G FS+
Sbjct: 192 DTLTLNSTTGKSFMLPNTGFICGNRIGGDYPGV-GILGLGHGSLSLLNRISHLIDGKFSH 250
Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPR-APSFYYVGLSGLGVGGMRIP---IS 358
C+V + + L FG +A+ G+A + P Y + G+ VG I I
Sbjct: 251 CIVPYSSNQTSKLSFGDKAVVSGSAMFSTRLDMTGGPYSYTLSFYGISVGNKSISAGGIG 310
Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR-DAFVAQTGNLPRASGVSIFDTCYNL 417
D + +G+ MD+GT T P Y D A CY
Sbjct: 311 SDYYM------NGLGMDSGTMFTYFPEYFYSQLEYDVRYAIQQEPLYPDPTRRLRLCYRY 364
Query: 418 SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL-SIIGNIQQEG 476
S S PT++ +F GG V +++F+ +D C AFA S S ++ G QQ
Sbjct: 365 SPDFS--PPTITMHFEGGSVELSSSNSFIRMTED--IVCLAFATSSSEQDAVFGYWQQTN 420
Query: 477 IQISFDGANGFVGFGPNVC 495
+ I +D GF+ F C
Sbjct: 421 LLIGYDLDAGFLSFLKTDC 439
>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
Length = 519
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 121/433 (27%), Positives = 189/433 (43%), Gaps = 81/433 (18%)
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-----PCSQCYKQ 193
+ F + SG G+G+YFVR VG+P R +V D+GSD+ WV+C + Y
Sbjct: 90 EAFAMPLSSGAYTGTGQYFVRFRVGTPARPFLLVADTGSDLTWVKCHRHDHDAPAPGYGY 149
Query: 194 SDP----------------------VFDPADSASFSGVSCSSAVCDR---LENAGCHA-- 226
+ P VF P S +++ + CSS C A C
Sbjct: 150 AAPASNDSSTSSLSAAAASSSSHARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPG 209
Query: 227 GRCRYEVSYGDGSYTKGTLALETLTIG-----------RTVVKNVAIGCGHKNQG-MFVG 274
C Y+ Y DGS +GT+ ++ TI + ++ V +GC G F+
Sbjct: 210 SPCAYDYRYKDGSAARGTVGTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSFLA 269
Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--GTGSSGSLVFG-------------- 318
+ G+L LG ++S + + GG FSYCLV ++ L FG
Sbjct: 270 SDGVLSLGYSNISFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSSPPSKTA 329
Query: 319 ---------REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
P GA PL+ + R FY V ++G+ V G + I ++ + + G
Sbjct: 330 CAGGGSPAAAPPGPGGARQTPLLLDHRMRPFYAVTVNGISVDGELLRIPRLVWDVAKGG- 388
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGF-----VSVR 424
G ++D+GT++T L +PAY A A + LPR + + FD CYN + ++V
Sbjct: 389 -GAILDSGTSLTVLVSPAYRAVVAALNKKLAGLPRVT-MDPFDYCYNWTSPSTGEDLTVA 446
Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFAPSP-SGLSIIGNIQQEGIQISFD 482
+P ++ +F+G L PA +++I D A G C G+S+IGNI Q+ FD
Sbjct: 447 MPELAVHFAGSARLQPPAKSYVI--DAAPGVKCIGLQEGEWPGVSVIGNILQQEHLWEFD 504
Query: 483 GANGFVGFGPNVC 495
N + F + C
Sbjct: 505 LKNRRLRFKRSRC 517
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 114/369 (30%), Positives = 165/369 (44%), Gaps = 45/369 (12%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD---PV--FDPADSASFS 207
+G YF ++ +G+PPR+ + +D+GSD++WV C PC C SD P+ +D SAS S
Sbjct: 33 AGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSS 92
Query: 208 GVSCSSAVC---DRLENAGCH-AGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIG 263
V CS C ++ +GC+ +C Y YGDGS T G L + L V G
Sbjct: 93 KVPCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYMVNATATVIFG 152
Query: 264 CGHKNQGMFVGAA----GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVF 317
CG K G + G++G G +S QL G+T F++CL G G LV
Sbjct: 153 CGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCL-DGGERGGGILVL 211
Query: 318 GREALPVGAAWVPLVRNPRAPSFYY--VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMD 375
G P + PLV P Y+ V L + V + I LF M G + D
Sbjct: 212 GNVIEP-DIQYTPLV-----PYMYHYNVVLQSISVNNANLTIDPKLFSNDVM--QGTIFD 263
Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTC-YNLSGFVSVRVPTVSFYFSG 434
+GT + LP AY+AF A + V+ F C LS F+ P V YF G
Sbjct: 264 SGTTLAYLPDEAYQAFTQAV---------SLVVAPFLLCDTRLSRFIYKLFPNVVLYFEG 314
Query: 435 GPVLTLPASNFLI---PVDDAGTFCFAF-----APSPSGLSIIGNIQQEGIQISFDGANG 486
+TL + +LI +A +C + A S +I G++ + + +D G
Sbjct: 315 AS-MTLTPAEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERG 373
Query: 487 FVGFGPNVC 495
+G+ P C
Sbjct: 374 RIGWRPFDC 382
>gi|54290725|dbj|BAD62395.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 500
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 110/367 (29%), Positives = 160/367 (43%), Gaps = 37/367 (10%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC---SQCYKQSDPVFDPADSASFSGVSC 211
+Y V +G G+P + M D+G I V+C C + C + FDP+ S++F+ V C
Sbjct: 145 DYTVVVGYGTPAQQLAMAFDTGLGISLVRCAACRPGAPCDGLAS--FDPSRSSTFAPVPC 202
Query: 212 SSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV-VKNVAIGCGHKNQG 270
S C +GC +G + G +A + LT+ + V + GC + G
Sbjct: 203 GSPDC----RSGCSSGSTP-SCPLTSFPFLSGAVAQDVLTLTPSASVDDFTFGCVEGSSG 257
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG----- 325
+GAAGLL L S S+ +L GG FSYCL T S G L G +P
Sbjct: 258 EPLGAAGLLDLSRDSRSVASRLAADAGGTFSYCLPLSTTSSHGFLAIGEADVPHNRTARV 317
Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
A PLV +P P+ Y + L+G+ +GG IPI +V+DT T +
Sbjct: 318 TAVAPLVYDPAFPNHYVIDLAGVSLGGRDIPIPPH----AATASAAMVLDTALPYTYMKP 373
Query: 386 PAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV-SVRVPTVSFYF-----SGGPVLT 439
Y RDAF PRA + DTCYN +G V +P V F GG +
Sbjct: 374 SMYAPLRDAFRRAMARYPRAPAMGDLDTCYNFTGVRHEVLIPLVHLTFRGIGGGGGGQVL 433
Query: 440 LPASNFLIPVDDAGTF----CFAFAPSPSG-------LSIIGNIQQEGIQISFDGANGFV 488
++ + + + G F C AFA PS ++G + Q +++ D G +
Sbjct: 434 GLGADQMFYMSEPGNFFSVTCLAFAALPSDGDAEAPLAMVMGTLAQSSMEVVHDVPGGKI 493
Query: 489 GFGPNVC 495
GF P C
Sbjct: 494 GFIPGSC 500
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 113/371 (30%), Positives = 164/371 (44%), Gaps = 40/371 (10%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYK--QSDPVFDPADSASFSGVSCSSAV 215
V + VG+PP++ MV+D+GS++ W+ C P +S F P S +F+ V C SA
Sbjct: 68 VSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCDSAQ 127
Query: 216 C---DRLENAGCH--AGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC---GHK 267
C D C + +CR +SY DGS + G LA E T+G+ A GC
Sbjct: 128 CRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAAFGCMATAFD 187
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP-VGA 326
V AGLLG+ G++S V Q + FSYC+ R +G L+ G LP +
Sbjct: 188 TSPDGVATAGLLGMNRGALSFVSQASTRR---FSYCISDR--DDAGVLLLGHSDLPFLPL 242
Query: 327 AWVPLVRNPRAPSFYY------VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
+ PL + P P Y+ V L G+ VGG +PI + G ++D+GT
Sbjct: 243 NYTPLYQ-PAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDSGTQF 301
Query: 381 TRLPTPAYEAFRDAFVAQTG------NLPRASGVSIFDTCYNLSG--FVSVRVPTVSFYF 432
T L AY A + F QT N P + FDTC+ + R+P V+ F
Sbjct: 302 TFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLPAVTLLF 361
Query: 433 SGGPVLTLPASNFLIPVDDA-----GTFCFAFAPS---PSGLSIIGNIQQEGIQISFDGA 484
+G +T+ L V G +C F + P +IG+ Q + + +D
Sbjct: 362 NGA-QMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYVIGHHHQMNVWVEYDLE 420
Query: 485 NGFVGFGPNVC 495
G VG P C
Sbjct: 421 RGRVGLAPIRC 431
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 107/379 (28%), Positives = 170/379 (44%), Gaps = 39/379 (10%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
+G+ +G YF +IG+GSP + Y+ +D+GSDI+WV C C++C ++SD ++DP
Sbjct: 60 NGLPTVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPK 119
Query: 202 DSASFSGVSCSSAVCDRLENA---GCHAGR-CRYEVSYGDGSYTKGTLALETLTIGR--- 254
S + VSC C GC A C Y +SYGDGS T G + LT R
Sbjct: 120 RSKTSEFVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNG 179
Query: 255 ---TVVKNVAI--GCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQLG--GQTGGAFSY 302
T +N +I GCG G F ++ G++G G + S++ QL G+ FS+
Sbjct: 180 NPHTATQNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSH 239
Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
CL T G + E + PLV N + Y V L + V G + + D F
Sbjct: 240 CL---DTNVGGGIFSIGEVVEPKVKTTPLVPN---MAHYNVILKNIEVDGDILQLPSDTF 293
Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS 422
++ G G V+D+GT + LP Y+ +A+ L + V +C+ +G V
Sbjct: 294 D-SENG-KGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRL-KVYLVEEQYSCFQYTGNVD 350
Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS------GLSIIGNIQQEG 476
P V +F LT+ ++L +C + S S ++++G+
Sbjct: 351 SGFPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSN 410
Query: 477 IQISFDGANGFVGFGPNVC 495
+ +D N +G+ C
Sbjct: 411 KLVVYDLENMTIGWTDYNC 429
>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
Length = 396
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 105/363 (28%), Positives = 159/363 (43%), Gaps = 38/363 (10%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
Y +G+PP+ ++D ++VW QC C +C+KQ PVF P S++F C +AV
Sbjct: 45 YVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTAV 104
Query: 216 CDRLENAGCHAGRCRYEVSYGDGSY----TKGTLALETLTIGRTVVKNVAIGC-GHKNQG 270
C+ + C C Y+ G + T G A +T IG V+ +A GC +
Sbjct: 105 CESIPTRSCSGDVCSYK---GPPTQLRGNTSGFAATDTFAIGTATVR-LAFGCVVASDID 160
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA---A 327
G +G +GLG SLV Q+ FSYCL R TG S L G A G+ +
Sbjct: 161 TMDGPSGFIGLGRTPWSLVAQMKLTR---FSYCLSPRNTGKSSRLFLGSSAKLAGSESTS 217
Query: 328 WVPLVR---NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
P ++ + ++Y + L + G I T +VM T + + L
Sbjct: 218 TAPFIKTSPDDDGSNYYLLSLDAIRAGNTTI--------ATAQSGGILVMHTVSPFSLLV 269
Query: 385 TPAYEAFRDAFVAQTG---NLPRASGVSIFDTCY-NLSGFVSVRVPTVSFYFSGGPVLTL 440
AY+AF+ A G P A+ FD C+ +GF P + F F G LT+
Sbjct: 270 DSAYKAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALTV 329
Query: 441 PASNFLIPV-DDAGTFCFAFAPSP-------SGLSIIGNIQQEGIQISFDGANGFVGFGP 492
P + +LI V ++ T C A G+S++G++QQE + +D + F P
Sbjct: 330 PPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSFEP 389
Query: 493 NVC 495
C
Sbjct: 390 ADC 392
>gi|413937238|gb|AFW71789.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
Length = 598
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 88/246 (35%), Positives = 117/246 (47%), Gaps = 11/246 (4%)
Query: 256 VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGS 314
VV GC G V GL+G G G +S Q G FSYCL S + + S +
Sbjct: 356 VVAAYTFGCLRVVTGGSVPPQGLVGFGCGPLSFPSQNKDVYGFVFSYCLPSYKSSNFSST 415
Query: 315 LVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
L G P PL+ NP PS YYV + G+ VGG + + G ++
Sbjct: 416 LRLGPAGQPKRIKMTPLLSNPHRPSLYYVNMVGIHVGGRPMLVPASALAFDPASGRGTIV 475
Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSG 434
D GT TRL P Y A RD F ++ P + FDTCYN V++ VPTV+F F G
Sbjct: 476 DAGTMFTRLSAPVYAAVRDVFRSRV-RAPVTGPLGGFDTCYN----VTISVPTVTFSFDG 530
Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-----LSIIGNIQQEGIQISFDGANGFVG 489
+TLP N +I G C A A PS L+++ ++QQ+ ++ FD ANG VG
Sbjct: 531 RVSVTLPEENVVIRSSSDGIACLAMAAGPSDGVDAVLNVLASMQQQNHRVLFDVANGRVG 590
Query: 490 FGPNVC 495
F +C
Sbjct: 591 FSRELC 596
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 117/422 (27%), Positives = 185/422 (43%), Gaps = 49/422 (11%)
Query: 104 HQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVG 163
H+ A RD R A ++R ++GG D + D G Y+ ++ +G
Sbjct: 35 HRVEVAALKARDRARHARMLRGVAGGVVDFSVQGTSD---------PNSVGLYYTKVKMG 85
Query: 164 SPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVC-D 217
+PP+ + ID+GSDI+WV C CS C + S FD S++ + + CS +C
Sbjct: 86 TPPKEFNVQIDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAALIPCSDPICTS 145
Query: 218 RLENAGCH----AGRCRYEVSYGDGSYTKGTLALE----TLTIGRTVVKN----VAIGCG 265
R++ A +C Y YGDGS T G + +L +G+ N + GC
Sbjct: 146 RVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFSLIMGQPPAVNSSATIVFGCS 205
Query: 266 HKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGR 319
G G+ G G G +S+V QL G T FS+CL +G G G ++
Sbjct: 206 ISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCL--KGDGDGGGVLVLG 263
Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
E L + PLV P P Y + L + V G +PI+ +F ++ G ++D GT
Sbjct: 264 EILEPSIVYSPLV--PSQP-HYNLNLQSIAVNGQLLPINPAVFSISN-NRGGTIVDCGTT 319
Query: 380 VTRLPTPAYEAFRDAF---VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGP 436
+ L AY+ A V+Q+ + G + CY +S + P+VS F GG
Sbjct: 320 LAYLIQEAYDPLVTAINTAVSQSARQTNSKG----NQCYLVSTSIGDIFPSVSLNFEGGA 375
Query: 437 VLTLPASNFLIP---VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
+ L +L+ +D A +C F G SI+G++ + + +D A +G+
Sbjct: 376 SMVLKPEQYLMHNGYLDGAEMWCIGFQKFQEGASILGDLVLKDKIVVYDIAQQRIGWANY 435
Query: 494 VC 495
C
Sbjct: 436 DC 437
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 98/358 (27%), Positives = 166/358 (46%), Gaps = 27/358 (7%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+G Y R+ +GSPP+ +++D+GS + +V C C QC DP F P S+++ V C
Sbjct: 86 NGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKC- 144
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT---VVKNVAIGCGHKNQ 269
+A C+ EN +C YE Y + S + G LA + ++ G+ V + GC
Sbjct: 145 NADCNCDEN----GVQCTYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAVFGCETMES 200
Query: 270 GMFVG--AAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
G A G++GLG G++S++ QL G+ +FS C G G++V G + P G
Sbjct: 201 GDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVG-GGAMVLGGISSPPG 259
Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
+ +P +Y + L + V G + ++ F G G ++D+GT P
Sbjct: 260 MVFSH--SDPSRSPYYNIELKEIHVAGKPLKLNPRTFD----GKYGAILDSGTTYAYFPE 313
Query: 386 PAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGFVSVRVPT----VSFYFSGGPVLT 439
AY AF+DA + + L + SG + D C++ +G +P V F+ G ++
Sbjct: 314 KAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQKIS 373
Query: 440 LPASNFLIP-VDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L N+L +G +C F +++G I ++++ N +GF C
Sbjct: 374 LSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKTNC 431
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 115/377 (30%), Positives = 173/377 (45%), Gaps = 54/377 (14%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
V + VGSPP++ MV+D+GS++ W+ C+ VF+P S+++S V CSS +C
Sbjct: 63 VTLAVGSPPQNISMVLDTGSELSWLHCKKSPNL----GSVFNPVSSSTYSPVPCSSPICR 118
Query: 217 ----DRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK--- 267
D A C C +SY D + +G LA +T IG GC
Sbjct: 119 TRTRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVIGSVTRPGTLFGCMDSGLS 178
Query: 268 -NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL---- 322
+ + GL+G+ GS+S V QLG FSYC+ G+ SSG L+ G +
Sbjct: 179 SDSEEDAKSTGLMGMNRGSLSFVNQLGFS---KFSYCI--SGSDSSGILLLGDASYSWLG 233
Query: 323 PVGAAWVPLV-RNPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
P+ + PLV + P F Y V L G+ VG + + + +F G ++D+G
Sbjct: 234 PI--QYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSG 291
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCYNLSGFVSVR-----VP 426
T T L P Y A ++ F+AQT ++ R F D CY + S R +P
Sbjct: 292 TQFTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQGTMDLCYRVGS--STRPNFTGLP 349
Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGT------FCFAFAPSP-SGLS--IIGNIQQEGI 477
+S F G +++ L V+ AG+ +CF F S G+ +IG+ Q+ +
Sbjct: 350 VISLMFRGAE-MSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQQNV 408
Query: 478 QISFDGANGFVGFGPNV 494
+ FD A VGF NV
Sbjct: 409 WMEFDLAKSRVGFAGNV 425
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 113/371 (30%), Positives = 164/371 (44%), Gaps = 40/371 (10%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYK--QSDPVFDPADSASFSGVSCSSAV 215
V + VG+PP++ MV+D+GS++ W+ C P +S F P S +F+ V C SA
Sbjct: 67 VSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCGSAQ 126
Query: 216 C---DRLENAGCH--AGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC---GHK 267
C D C + +CR +SY DGS + G LA E T+G+ A GC
Sbjct: 127 CRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAAFGCMATAFD 186
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP-VGA 326
V AGLLG+ G++S V Q + FSYC+ R +G L+ G LP +
Sbjct: 187 TSPDGVATAGLLGMNRGALSFVSQASTRR---FSYCISDR--DDAGVLLLGHSDLPFLPL 241
Query: 327 AWVPLVRNPRAPSFYY------VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
+ PL + P P Y+ V L G+ VGG +PI + G ++D+GT
Sbjct: 242 NYTPLYQ-PAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDSGTQF 300
Query: 381 TRLPTPAYEAFRDAFVAQTG------NLPRASGVSIFDTCYNLSG--FVSVRVPTVSFYF 432
T L AY A + F QT N P + FDTC+ + R+P V+ F
Sbjct: 301 TFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLPAVTLLF 360
Query: 433 SGGPVLTLPASNFLIPVDDA-----GTFCFAFAPS---PSGLSIIGNIQQEGIQISFDGA 484
+G +T+ L V G +C F + P +IG+ Q + + +D
Sbjct: 361 NGA-QMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYVIGHHHQMNVWVEYDLE 419
Query: 485 NGFVGFGPNVC 495
G VG P C
Sbjct: 420 RGRVGLAPIRC 430
>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
Japonica Group]
Length = 377
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 103/339 (30%), Positives = 156/339 (46%), Gaps = 34/339 (10%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
G Y +G+PP+ V+D ++VW QC PC C++Q P+FDP S++F G+ C S
Sbjct: 55 GLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGS 114
Query: 214 AVCDRLENA--GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC---GHKN 268
+C+ + + C + C YE G T G +T IG + + GC K
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGKAGTDTFAIG-AAKETLGFGCVVMTDKR 172
Query: 269 QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA-- 326
G +G++GLG SLV Q+ AFSYCL + SSG+L G A +
Sbjct: 173 LKTIGGPSGIVGLGRTPWSLVTQMNVT---AFSYCLAGK---SSGALFLGATAKQLAGGK 226
Query: 327 -AWVPLVRNPRAPS-------FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
+ P V A S +Y V L+G+ GG + + V++DT +
Sbjct: 227 NSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPL-------QAASSSGSTVLLDTVS 279
Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
+ L AY+A + A A G P AS +D C+ + V+ P + F F GG L
Sbjct: 280 RASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFPKA--VAGDAPELVFTFDGGAAL 337
Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGI 477
T+P +N+L+ + GT C S S L++ G ++ I
Sbjct: 338 TVPPANYLLASGN-GTVCLTIGSSAS-LNLTGELEGASI 374
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 98/358 (27%), Positives = 166/358 (46%), Gaps = 27/358 (7%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+G Y R+ +GSPP+ +++D+GS + +V C C QC DP F P S+++ V C
Sbjct: 86 NGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKC- 144
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT---VVKNVAIGCGHKNQ 269
+A C+ EN +C YE Y + S + G LA + ++ G+ V + GC
Sbjct: 145 NADCNCDEN----GVQCTYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAVFGCETMES 200
Query: 270 GMFVG--AAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
G A G++GLG G++S++ QL G+ +FS C G G++V G + P G
Sbjct: 201 GDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVG-GGAMVLGGISSPPG 259
Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
+ +P +Y + L + V G + ++ F G G ++D+GT P
Sbjct: 260 MVFSH--SDPSRSPYYNIELKEIHVAGKPLKLNPRTFD----GKYGAILDSGTTYAYFPE 313
Query: 386 PAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGFVSVRVPT----VSFYFSGGPVLT 439
AY AF+DA + + L + SG + D C++ +G +P V F+ G ++
Sbjct: 314 KAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQKIS 373
Query: 440 LPASNFLIP-VDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L N+L +G +C F +++G I ++++ N +GF C
Sbjct: 374 LSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKTNC 431
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 98/359 (27%), Positives = 164/359 (45%), Gaps = 29/359 (8%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+G Y R+ +G+PP+ +++D+GS + +V C C QC K DP F P S+++ V C+
Sbjct: 74 NGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSSCEQCGKHQDPRFQPDLSSTYRPVKCN 133
Query: 213 -SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHKN 268
S CD +C YE Y + S + G +A + ++ G + GC +
Sbjct: 134 PSCNCDD------EGKQCTYERRYAEMSSSSGVIAEDVVSFGNESELKPQRAVFGCENVE 187
Query: 269 QGMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
G A G++GLG G +S+V QL G G +FS C G G++V G+ + P
Sbjct: 188 TGDLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDVG-GGAMVLGQISPPP 246
Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
+ NP +Y + L L V G + + +F G V+D+GT P
Sbjct: 247 NMVFS--HSNPYRSPYYNIELKELHVAGKPLKLKPKVFDEKH----GTVLDSGTTYAYFP 300
Query: 385 TPAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSG----FVSVRVPTVSFYFSGGPVL 438
A+ A +DA + + +L + G + D C++ +G +S P V+ F G L
Sbjct: 301 EAAFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAGREVSHLSKVFPEVNMVFGSGQKL 360
Query: 439 TLPASNFLI-PVDDAGTFCFAFAPSPSGL-SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+L N+L +G +C + + L +++G I +++D N +GF C
Sbjct: 361 SLSPENYLFRHTKVSGAYCLGIFQNGNDLTTLLGGIVVRNTLVTYDRENDKIGFWKTNC 419
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 109/373 (29%), Positives = 168/373 (45%), Gaps = 47/373 (12%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFS 207
+G Y+ RI +G+PPR Y+ ID+GSDI+WV C+PC+ C S FDP S++ S
Sbjct: 38 AGLYYTRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTAS 97
Query: 208 GVSCSSAVC---DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLT----IGRTVVKN 259
+SC + C +++ + C R C Y YGDGS T G + + + V N
Sbjct: 98 PLSCIDSKCVSSNQISESVCTTDRYCGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYVTNN 157
Query: 260 ----VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGT 309
+ GC + G G+ G G +S+V QL Q FS+CL
Sbjct: 158 ASAKITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCLEGADP 217
Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
G G LV G P G + P+V P P Y + L G+ V G ++ I +F T
Sbjct: 218 G-GGILVLGEITEP-GMVYTPIV--PSQPH-YNLNLQGIAVNGQQLSIDPQVFATTNT-- 270
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVA---QTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
G ++D GT + L AYE F + +A Q+ G F T +++ P
Sbjct: 271 RGTIIDCGTTLAYLAEEAYEPFVNTIIAAVSQSTQPFMLKGNPCFLTVHSIDEI----FP 326
Query: 427 TVSFYFSGGPVLTLPASNFLI---PVDDAGTFCFAF------APSPSGLSIIGNIQQEGI 477
+V+ YF G P + L ++LI D + +C + A S ++I+G++ +
Sbjct: 327 SVTLYFEGAP-MDLKPKDYLIQQLSPDSSPVWCIGWQKSGQQATDSSKMTILGDLVLKDK 385
Query: 478 QISFDGANGFVGF 490
+D N +G+
Sbjct: 386 VFVYDLENQRIGW 398
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 121/404 (29%), Positives = 174/404 (43%), Gaps = 58/404 (14%)
Query: 103 RHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTD------VVSGMDQGSGEY 156
H+ RD R L++ L G V DF D VV G Y
Sbjct: 38 NHEMELSQLKARDEARHGRLLQSLGG---------VIDFPVDGTFDPFVV-------GLY 81
Query: 157 FVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSC 211
+ ++ +G+PPR Y+ +D+GSD++WV C C+ C + S FDP S + S +SC
Sbjct: 82 YTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISC 141
Query: 212 SSAVCD---RLENAGC--HAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN--- 259
S C + ++GC C Y YGDGS T G + L +G ++V N
Sbjct: 142 SDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTA 201
Query: 260 -VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSS 312
V GC G V G+ G G MS++ QL Q FS+CL G
Sbjct: 202 PVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGE-NGGG 260
Query: 313 GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
G LV G P + PLV P P Y V L + V G +PI+ +F + G
Sbjct: 261 GILVLGEIVEP-NMVFTPLV--PSQPH-YNVNLLSISVNGQALPINPSVFSTSN--GQGT 314
Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYF 432
++DTGT + L AY F +A R VS + CY ++ V P VS F
Sbjct: 315 IIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPV-VSKGNQCYVITTSVGDIFPPVSLNF 373
Query: 433 SGGPVLTLPASNFLIPVDDAG---TFCFAFAP-SPSGLSIIGNI 472
+GG + L ++LI ++ G +C F G++I+G++
Sbjct: 374 AGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDL 417
>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 481
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 122/412 (29%), Positives = 174/412 (42%), Gaps = 75/412 (18%)
Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC----------SQCYKQSDPVFDPA 201
G +Y G+G PP+ V+D+GSD+VW QC C C+ Q+ P ++ +
Sbjct: 74 GKTQYIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFS 133
Query: 202 DSASFSGVSCSS---AVCD-RLENAGCHAG------RCRYEVSYGDGSYTKGTLALETLT 251
S + V C A+C E AGC G C SYG G G L + T
Sbjct: 134 LSRTARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAG-VALGVLGTDAFT 192
Query: 252 IGRTVVKNVAIGCGHKNQ---GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-- 306
+ +A GC + + G GA+G++GLG G++SLV QL FSYCL
Sbjct: 193 FPSSSSVTLAFGCVSQTRISPGALNGASGIIGLGRGALSLVSQLNATE---FSYCLTPYF 249
Query: 307 RGTGSSGSLVFG---------------REALPVGAAWVPLVRNPR-AP--SFYYVGLSGL 348
R T S L G PV VP +NP+ +P +FYY+ L GL
Sbjct: 250 RDTVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTT--VPFAKNPKDSPFSTFYYLPLVGL 307
Query: 349 GVGGMRIPISEDLFRLTQMGDD----GVVMDTGTAVTRLPTPAYEAFRDAFVAQ---TGN 401
G + + F L + G ++D+G+ TRL PA+ A Q +G+
Sbjct: 308 AAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGS 367
Query: 402 L--PRASGVSIFDTCYNL----SGFVSVRVPTVSFYFS----GGPVLTLPASNFLIPVDD 451
L P A + C + VP + F GG L +PA + V +
Sbjct: 368 LVPPPAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARV-E 426
Query: 452 AGTFCFAFAPSPSG--------LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
A T+C A S SG +IIGN Q+ +++ +D ANG + F P C
Sbjct: 427 ASTWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANC 478
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 112/371 (30%), Positives = 166/371 (44%), Gaps = 43/371 (11%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
V + VGSPP++ MV+D+GS++ W+ C+ + VFDP S+S+S + C+S C
Sbjct: 65 VSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS----VFDPLRSSSYSPIPCTSPTCR 120
Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH----K 267
D C + C +SY D S +G LA +T IG + + GC
Sbjct: 121 TRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNSAIPATIFGCMDSGFSS 180
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA 327
N GL+G+ GS+S V Q+G Q FSYC+ G SSG L+FG + A
Sbjct: 181 NSDEDSKTTGLIGMNRGSLSFVTQMGLQ---KFSYCI--SGQDSSGILLFGESSFSWLKA 235
Query: 328 --WVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
+ PLV+ + P F Y V L G+ V + + + ++ G ++D+GT
Sbjct: 236 LKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQF 295
Query: 381 TRLPTPAYEAFRDAFVAQTG------NLPRASGVSIFDTCYN--LSGFVSVRVPTVSFYF 432
T L P Y A ++ FV QT P D CY L+ +PTV+ F
Sbjct: 296 TFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMF 355
Query: 433 SGGPVLTLPASNFLIPVDDA-----GTFCFAFAPSP-SGLS--IIGNIQQEGIQISFDGA 484
G +++ A + V +CF F S G+ IIG+ Q+ + + FD A
Sbjct: 356 RGAE-MSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQQNVWMEFDLA 414
Query: 485 NGFVGFGPNVC 495
VGF C
Sbjct: 415 KSRVGFAEVRC 425
>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 413
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 107/363 (29%), Positives = 158/363 (43%), Gaps = 38/363 (10%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
Y +G+PP+ ++D ++VW QC C +C+KQ PVF P S++F C +AV
Sbjct: 62 YVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTAV 121
Query: 216 CDRLENAGCHAGRCRYEVSYGDGSY----TKGTLALETLTIGRTVVKNVAIGC-GHKNQG 270
C+ + C C Y+ G + T G A +T IG V+ +A GC +
Sbjct: 122 CESIPTRSCSGDVCSYK---GPPTQLRGNTSGFAATDTFAIGTATVR-LAFGCVVASDID 177
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG---AA 327
G +G +GLG SLV Q+ FSYCL R TG S L G A G +
Sbjct: 178 TMDGPSGFIGLGRTPWSLVAQMKLTR---FSYCLSPRNTGKSSRLFLGSSAKLAGGESTS 234
Query: 328 WVPLVR-NPRAPS--FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
P ++ +P S +Y + L + G I T +VM T + + L
Sbjct: 235 TAPFIKTSPDDDSHHYYLLSLDAIRAGNTTI--------ATAQSGGILVMHTVSPFSLLV 286
Query: 385 TPAYEAFRDAFVAQTG---NLPRASGVSIFDTCY-NLSGFVSVRVPTVSFYFSGGPVLTL 440
AY AF+ A G P A+ FD C+ +GF P + F F G LT+
Sbjct: 287 DSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALTV 346
Query: 441 PASNFLIPV-DDAGTFCFAFAPSP-------SGLSIIGNIQQEGIQISFDGANGFVGFGP 492
P + +LI V ++ T C A G+S++G++QQE + +D + F P
Sbjct: 347 PPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSFEP 406
Query: 493 NVC 495
C
Sbjct: 407 ADC 409
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 112/371 (30%), Positives = 166/371 (44%), Gaps = 43/371 (11%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
V + VGSPP++ MV+D+GS++ W+ C+ + VFDP S+S+S + C+S C
Sbjct: 58 VSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS----VFDPLRSSSYSPIPCTSPTCR 113
Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH----K 267
D C + C +SY D S +G LA +T IG + + GC
Sbjct: 114 TRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNSAIPATIFGCMDSGFSS 173
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA 327
N GL+G+ GS+S V Q+G Q FSYC+ G SSG L+FG + A
Sbjct: 174 NSDEDSKTTGLIGMNRGSLSFVTQMGLQ---KFSYCI--SGQDSSGILLFGESSFSWLKA 228
Query: 328 --WVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
+ PLV+ + P F Y V L G+ V + + + ++ G ++D+GT
Sbjct: 229 LKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQF 288
Query: 381 TRLPTPAYEAFRDAFVAQTG------NLPRASGVSIFDTCYN--LSGFVSVRVPTVSFYF 432
T L P Y A ++ FV QT P D CY L+ +PTV+ F
Sbjct: 289 TFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMF 348
Query: 433 SGGPVLTLPASNFLIPVDDA-----GTFCFAFAPSP-SGLS--IIGNIQQEGIQISFDGA 484
G +++ A + V +CF F S G+ IIG+ Q+ + + FD A
Sbjct: 349 RGAE-MSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQQNVWMEFDLA 407
Query: 485 NGFVGFGPNVC 495
VGF C
Sbjct: 408 KSRVGFAEVRC 418
>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
Length = 488
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 102/316 (32%), Positives = 145/316 (45%), Gaps = 31/316 (9%)
Query: 196 PVFDPADSASFSGVSCSSAVCDRLENAGCHAGR------CRYEVSYGDGSYTKGTLALET 249
P FD + S++ SC S +C L A C + C Y Y D S T G L ++
Sbjct: 175 PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLLEVDK 234
Query: 250 LTIGR-TVVKNVAIGCGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
T G V VA GCG N G+F G+ G G G +SL QL G FS+C +
Sbjct: 235 FTFGAGASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQL---KVGNFSHCFTAV 291
Query: 308 GTGSSGSLVF---------GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPIS 358
+++ GR A+ PL++N P+ YY+ L G+ VG R+P+
Sbjct: 292 NGLKQSTVLLDLLADLYKNGRGAV----QSTPLIQNSANPTLYYLSLKGITVGSTRLPVP 347
Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD-TCYNL 417
E F LT G G ++D+GT++T LP Y+ RD F AQ LP G + TC++
Sbjct: 348 ESAFALTN-GTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI-KLPVVPGNATGPYTCFSA 405
Query: 418 SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV-DDAGT--FCFAFAPSPSGLSIIGNIQQ 474
VP + +F G + LP N++ V DDAG C A + IGN QQ
Sbjct: 406 PSQAKPDVPKLVLHFEGA-TMDLPRENYVFEVPDDAGNSMICLAINELGDERATIGNFQQ 464
Query: 475 EGIQISFDGANGFVGF 490
+ + + +D N + F
Sbjct: 465 QNMHVLYDLQNNMLSF 480
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 49/139 (35%), Positives = 69/139 (49%), Gaps = 8/139 (5%)
Query: 344 GLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP 403
G G+ VG R+P+ E F LT G G ++D+GT++T LP Y+ RD F AQ LP
Sbjct: 38 GRPGITVGSTRLPVPESAFALTN-GTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI-KLP 95
Query: 404 RASGVSIFD-TCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV-DDAGT--FCFAF 459
G + TC++ VP + +F G + LP N++ V DDAG C A
Sbjct: 96 VVPGNATGPYTCFSAPSQAKPDVPKLVLHFEGA-TMDLPRENYVFEVPDDAGNSIICLAI 154
Query: 460 APSPSGLSIIGNIQQEGIQ 478
+IIGN QQ+ +
Sbjct: 155 NKG-DETTIIGNFQQQNMH 172
>gi|77555282|gb|ABA98078.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 409
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 94/264 (35%), Positives = 137/264 (51%), Gaps = 14/264 (5%)
Query: 236 GDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQ 295
G + T G LA +T T G T V V GC + G F GA+G++G+G G++SL+ QL
Sbjct: 124 GSAANTSGYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQL--- 180
Query: 296 TGGAFSYCLV---SRGTGSSGSLV-FGREALPVGAAW--VPLVRNPRAPSFYYVGLSGLG 349
G FSY L+ + GS+ S++ FG +A+P PL+ + P FYYV L+G+
Sbjct: 181 QFGKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTGVR 240
Query: 350 VGGMRI-PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV 408
V G R+ I F L G GV++ + T VT L AY+ R A ++ G LP +G
Sbjct: 241 VDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIG-LPAVNGS 299
Query: 409 SI--FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL 466
+ D CYN S V+VP ++ F GG + L A+N+ +D G C PS G
Sbjct: 300 AALELDLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGG- 358
Query: 467 SIIGNIQQEGIQISFDGANGFVGF 490
S++G + Q G + +D G + F
Sbjct: 359 SVLGTLLQTGTNMIYDVDAGRLTF 382
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 119/456 (26%), Positives = 193/456 (42%), Gaps = 65/456 (14%)
Query: 73 SSDEARWNLELVHRDKMSSS--SNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGG 130
S+++ L+L HRD + + S + + + +HS +R R+ GG
Sbjct: 25 STEDTAVRLKLAHRDTLWPNPLSRIEDIIGADQKRHSLISRK-----------RKFKGG- 72
Query: 131 ADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC 190
D+ SG+D G+ +YF + VG+P + +V+D+GS++ WV C+
Sbjct: 73 ----------VKMDLGSGIDYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWVNCR----- 117
Query: 191 YK-------QSDPVFDPADSASFSGVSCSSAVCD-------RLENAGCHAGRCRYEVSYG 236
Y+ ++ VF +S SF V C + C L + C Y+ Y
Sbjct: 118 YRGRGKGKVKNRRVFRAEESKSFKTVGCFTQTCKVDLMNLFSLSTCPTPSTPCSYDYRYA 177
Query: 237 DGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQGMFVGAA-GLLGLGGGSMSLVG 290
DGS +G A ET+T+G T ++ + +GC G A G+LGL S
Sbjct: 178 DGSAAQGVFAKETITVGLTNGRKARLRGLLVGCSSSFSGQSFQGADGVLGLAFSDFSFTS 237
Query: 291 QLGGQTGGAFSYCLVSRGTGS--SGSLVFGREALPVGAAWVPLVRNP----RAPSFYYVG 344
G SYCLV + S L+FG + P P P FY +
Sbjct: 238 TATSLFGAKLSYCLVDHLSNKNISNYLIFGYSSSSTSTKTAPGRTTPLDLTLIPPFYAIN 297
Query: 345 LSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR 404
+ G+ +G + I ++ T G G ++D+GT++T L AY+ L R
Sbjct: 298 IIGISIGDDMLDIPTQVWDATTGG--GTILDSGTSLTLLAEAAYKPVVTGLARYLVELKR 355
Query: 405 ASGVSI-FDTCY-NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAF-- 459
I + C+ + SGF ++P ++F+ GG ++L VD A G C F
Sbjct: 356 VKPEGIPIEYCFSSTSGFNESKLPQLTFHLKGGARFEPHRKSYL--VDAAPGVKCLGFMS 413
Query: 460 APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
A +P+ +++GNI Q+ FD + F P+ C
Sbjct: 414 AGTPA-TNVVGNIMQQNYLWEFDLMASTLSFAPSTC 448
>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
Length = 499
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 116/402 (28%), Positives = 167/402 (41%), Gaps = 66/402 (16%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP--CSQCYKQSDP-VFDPADSASFSGVSC 211
+Y + + S S YM D+GSDIVW C P C C + +P P + + S +SC
Sbjct: 93 DYTLTFSINSQTLSVYM--DTGSDIVWFPCSPFECILCEGKFEPGTLTPLNVSKSSLISC 150
Query: 212 SSAVC--------------------DRLENAGCHAGRC-RYEVSYGDGSYT----KGTLA 246
S C D +E + C C + +YGDGS K L
Sbjct: 151 KSRACSTAHNSPSTSDLCAIAKCPLDEIETSDCSNYHCPSFYYAYGDGSLIAKLHKHNLI 210
Query: 247 LETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT---GGAFSYC 303
+ + + +K+ GC H G +G AG G GS+SL QL + G FSYC
Sbjct: 211 MPSTSNKPFSLKDFTFGCAHSALGEPIGVAGF---GFGSLSLPAQLANLSPDLGNQFSYC 267
Query: 304 LVSRGTGSS-----GSLVFGREALP-----VGAAWVPLVRNPRAPSFYYVGLSGLGVGGM 353
LVS S+ L+ G+ + P++ NP+ P FY V + + VG
Sbjct: 268 LVSHSFDSTKLHHPSPLILGKVKERDFDEITQFVYTPMLDNPKHPYFYSVSMEAISVGSS 327
Query: 354 RIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL-PRASGVSI-- 410
R+ L R+ + G+ GVV+D+GT T LPT Y + + G + RAS
Sbjct: 328 RVRAPNALIRIDRDGNGGVVVDSGTTYTMLPTGFYNSVATELDRRVGRVFKRASETESKT 387
Query: 411 -FDTCYNLSG----FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-------GTFCFA 458
CY L G + + VP ++F+F G + LP N+ D C
Sbjct: 388 GLSPCYYLEGNGVERLGLVVPRLAFHFGGNYSVVLPRRNYFYEFLDGEDEKKGRKVGCLM 447
Query: 459 FA----PSPSGL-SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
S G + +GN QQ+G Q+ +D VGF P C
Sbjct: 448 LMDGGDESEGGPGATLGNYQQQGFQVVYDLEERRVGFAPRKC 489
>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 467
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 112/391 (28%), Positives = 171/391 (43%), Gaps = 61/391 (15%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQC-YKQSDP---VFDPADSASF 206
G Y + + G+PP++ +++D+GSD+VW C C C + S+P +F P S+S
Sbjct: 88 GAYSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSS 147
Query: 207 SGVSCSSAVCDRLENAGCHAGRCR---------------YEVSYGDGSYTKGTLALETLT 251
+ C + C + + + RCR Y V YG G T G + ETL
Sbjct: 148 KVLGCVNPKCGWIHGSKVQS-RCRDCEPTSPNCTQICPPYLVFYGSG-ITGGIMLSETLD 205
Query: 252 IGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR---G 308
+ V N +GC + AG+ G G G SL QLG + FSYCL+SR
Sbjct: 206 LPGKGVPNFIVGCSVLSTSQ---PAGISGFGRGPPSLPSQLGLK---KFSYCLLSRRYDD 259
Query: 309 TGSSGSLVFGREA----LPVGAAWVPLVRNPRAPS------FYYVGLSGLGVGGMRIPIS 358
T S SLV E+ G ++ P V+NP+ +YY+GL + VGG + I
Sbjct: 260 TTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVKIP 319
Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV--AQTGNLPRASGVSIFDTCYN 416
GD G ++D+GT T + +E F Q+ G++ C+N
Sbjct: 320 YKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGITGLRPCFN 379
Query: 417 LSGFVSVRVPTVSFYFSGGPVLTLPASNFL------------IPVDDAGTFCFAFAPSPS 464
+SG + P ++ F GG + LP +N++ I D A F+ P+
Sbjct: 380 ISGLNTPSFPELTLKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGPA-- 437
Query: 465 GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
I+GN QQ+ + +D N +GF C
Sbjct: 438 --IILGNFQQQNFYVEYDLRNERLGFRQQSC 466
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 111/373 (29%), Positives = 163/373 (43%), Gaps = 44/373 (11%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
V + VG+PP++ MV+D+GS++ W+ C P K S F P S++F+ V C+SA C
Sbjct: 87 VSLAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRASSTFAAVPCASAQCR 146
Query: 217 --DRLENAGCH--AGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC---GHKNQ 269
D C + RC +SY DGS + G LA + +G A GC +
Sbjct: 147 SRDLPSPPACDGASSRCSVSLSYADGSSSDGALATDVFAVGSGPPLRAAFGCMSSAFDSS 206
Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV 329
V +AGLLG+ G++S V Q + FSYC+ R +G L+ G LP ++
Sbjct: 207 PDGVASAGLLGMNRGALSFVSQASTRR---FSYCISDR--DDAGVLLLGHSDLPT---FL 258
Query: 330 PLVRNPR------APSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
PL P P F Y V L G+ VGG +PI + G ++D+GT
Sbjct: 259 PLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQTMVDSGTQ 318
Query: 380 VTRLPTPAYEAFRDAFVAQTGNL------PRASGVSIFDTCYNLS---GFVSVRVPTVSF 430
T L AY A + F Q L P + FDTC+ + + R+P V+
Sbjct: 319 FTFLLGDAYSALKAEFTRQARPLLPALDDPSFAFQEAFDTCFRVPQGRSPPTARLPGVTL 378
Query: 431 YFSGGPVLTLPASNFLIPVDDA-----GTFCFAFAPS---PSGLSIIGNIQQEGIQISFD 482
F+G + + L V G +C F + P +IG+ Q + + +D
Sbjct: 379 LFNGA-EMAVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPIMAYVIGHHHQMNVWVEYD 437
Query: 483 GANGFVGFGPNVC 495
G VG P C
Sbjct: 438 LERGRVGLAPVRC 450
>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 110/389 (28%), Positives = 162/389 (41%), Gaps = 55/389 (14%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQCYKQSDPV------FDPADSA 204
G Y V + G+PP++ ++D+GSDIVW C C C S F P +S+
Sbjct: 65 GGYSVSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKESS 124
Query: 205 SFSGVSCSSAVCDRLENAG------CHAGRC------RYEVSYGDGSYTKGTLAL-ETLT 251
S + C + C + ++ C C Y + YG G T G +AL ETL
Sbjct: 125 SSKLLGCKNPKCSWIHHSNINCDQDCSIKSCLNQTCPPYMIFYGSG--TTGGVALSETLH 182
Query: 252 IGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR---- 307
+ N +GC + AG+ G G G SL QLG G FSYCL+S
Sbjct: 183 LHSLSKPNFLVGCSVFSSHQ---PAGIAGFGRGLSSLPSQLGL---GKFSYCLLSHRFDD 236
Query: 308 GTGSSGSLVFGREALPV-----GAAWVPLVRNPRAPS------FYYVGLSGLGVGGMRIP 356
T S SLV E L + P V+NP+ + +YY+GL + VGG +
Sbjct: 237 DTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVGGHHVK 296
Query: 357 ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI---FDT 413
+ + G+ GV++D+GT T + A+E D F+ Q + R +
Sbjct: 297 VPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIEDAIGLRP 356
Query: 414 CYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS------ 467
C+N+S +V P + YF GG + LP N+ V +G
Sbjct: 357 CFNVSDAKTVSFPELRLYFKGGADVALPVENYFAFVGGEVACLTVVTDGVAGPERVGGPG 416
Query: 468 -IIGNIQQEGIQISFDGANGFVGFGPNVC 495
I+GN Q + + +D N +GF C
Sbjct: 417 MILGNFQMQNFYVEYDLRNERLGFKQEKC 445
>gi|242094480|ref|XP_002437730.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
gi|241915953|gb|EER89097.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
Length = 507
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 101/339 (29%), Positives = 153/339 (45%), Gaps = 38/339 (11%)
Query: 169 QYMVIDSGSDIVWVQCQPCSQCYKQSDPV--FDPADSASFSGVSCSSAVCD---RLENAG 223
Q +V+D+ SD+ WVQC P + +DPA S+++ ++C+SA C RL
Sbjct: 124 QTVVLDTASDVPWVQCHPLASSATTDSSSSSYDPARSSTYYALACNSAACTELGRLYRGA 183
Query: 224 CHAGRCRYEVSYGD--------GSYTKGTLALETLTIGRTVVKNVAIGCGH--KNQG--- 270
C +C+Y V G+Y L L T + GC H QG
Sbjct: 184 CVNNQCQYRVPIPSSPASSSSSGTYGSDLLKL-TADPADGASMSFKFGCSHGEAKQGGEG 242
Query: 271 -MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV----G 325
+ AG++ LGGG SLV Q G AFSYC+ + + G V G + G
Sbjct: 243 SIDNATAGIMALGGGPESLVSQNAAMYGSAFSYCIPATESRRPGFFVLGGGVGDLSGAGG 302
Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
A P++R R P+ Y V L + V G ++ ++ +F G V+D+ TA+TRLP
Sbjct: 303 YAVTPMLRYARVPTLYRVRLLAIAVDGQQLNVTPSVFA------SGSVLDSRTAITRLPP 356
Query: 386 PAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNF 445
AY+A R+AF ++ A DTCY+ +G V VP V+ G V+ L
Sbjct: 357 TAYQALREAFRSRMAMYREAPPQGNLDTCYDFAGAFLVMVPRVALLLDGNAVVALDRQGI 416
Query: 446 LIPVDDAGTFCFAFAPSPSGL--SIIGNIQQEGIQISFD 482
L C F + I+GN+QQ+ +++ ++
Sbjct: 417 LF------HDCLVFTSNTDDRMPGILGNVQQQTMEVLYN 449
>gi|413937239|gb|AFW71790.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
Length = 537
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 88/246 (35%), Positives = 117/246 (47%), Gaps = 11/246 (4%)
Query: 256 VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGS 314
VV GC G V GL+G G G +S Q G FSYCL S + + S +
Sbjct: 295 VVAAYTFGCLRVVTGGSVPPQGLVGFGCGPLSFPSQNKDVYGFVFSYCLPSYKSSNFSST 354
Query: 315 LVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
L G P PL+ NP PS YYV + G+ VGG + + G ++
Sbjct: 355 LRLGPAGQPKRIKMTPLLSNPHRPSLYYVNMVGIHVGGRPMLVPASALAFDPASGRGTIV 414
Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSG 434
D GT TRL P Y A RD F ++ P + FDTCYN V++ VPTV+F F G
Sbjct: 415 DAGTMFTRLSAPVYAAVRDVFRSRV-RAPVTGPLGGFDTCYN----VTISVPTVTFSFDG 469
Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-----LSIIGNIQQEGIQISFDGANGFVG 489
+TLP N +I G C A A PS L+++ ++QQ+ ++ FD ANG VG
Sbjct: 470 RVSVTLPEENVVIRSSSDGIACLAMAAGPSDGVDAVLNVLASMQQQNHRVLFDVANGRVG 529
Query: 490 FGPNVC 495
F +C
Sbjct: 530 FSRELC 535
>gi|357476865|ref|XP_003608718.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355509773|gb|AES90915.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 482
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 117/400 (29%), Positives = 165/400 (41%), Gaps = 74/400 (18%)
Query: 160 IGVGSPPRSQYMVIDSGSDIVWVQCQP--CSQCYKQSDPVFDPADSASFS---GVSCSSA 214
+G S P + YM D+GSD+VW C P C C + DP+ + S +SC+S
Sbjct: 81 LGPHSQPITLYM--DTGSDLVWFPCTPFNCILCELKPKLTSDPSPPTNISHSTPISCNSH 138
Query: 215 VC--------------------DRLENAGCHAGRC-RYEVSYGDGSYTKGTLALETLTIG 253
C D +E C + C + +YGDGS +L +TL++
Sbjct: 139 ACSVAHSSTPSSDLCTMAHCPLDSIETKDCGSFHCPPFYYAYGDGSLI-ASLYRDTLSLS 197
Query: 254 RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLG---GQTGGAFSYCLVSRGTG 310
+ N GC H F G+ G G G +SL QL Q G FSYCLVS
Sbjct: 198 TLQLTNFTFGCAHTT---FSEPTGVAGFGRGLLSLPAQLATHSPQLGNRFSYCLVSHSFR 254
Query: 311 SS-----GSLVFGREALP--------VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPI 357
S L+ GR V + ++ NP+ FY VGL G+ VG +P
Sbjct: 255 SERIRKPSPLILGRYNDEKQSNGDEVVEFVYTSMLENPKHSYFYTVGLKGISVGKKTVPA 314
Query: 358 SEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF--VAQTGN--LPRASGVSIFDT 413
+ L R+ + GD GVV+D+GT T LP Y + + F A+ N P +
Sbjct: 315 PKILRRVNKKGDGGVVVDSGTTFTMLPEKFYNSVVEGFDRRARKSNRRAPEIEQKTGLSP 374
Query: 414 CYNLSGFVSVRVPTVSFYFSG-GPVLTLPASNFLIPVDDAG--------TFCFAF----- 459
CY L+ + VP V+ F G + LP N+ D G C F
Sbjct: 375 CYYLN--TAAIVPAVTLRFVGMNSSVVLPRKNYFYEFMDGGDGVRRKERVGCLMFMNGGD 432
Query: 460 ----APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ P G ++GN QQ+G ++ +D VGF C
Sbjct: 433 EAEMSGGPGG--VLGNYQQQGFEVEYDLEKKRVGFARRKC 470
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 111/376 (29%), Positives = 172/376 (45%), Gaps = 44/376 (11%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
+G+ +G YF +IG+G+P + Y+ +D+GSDI+WV C C C ++S ++DP
Sbjct: 80 NGIPTDTGLYFTQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPRKSGLGIDLTLYDPT 139
Query: 202 DSASFSGVSCSSAVCDRLENAG----CHAGR-CRYEVSYGDGSYTKGTLALETLTI---- 252
SAS V+C C N G C A C+Y ++YGDGS T G + L
Sbjct: 140 ASASSKTVTCGQEFCATATNGGVPPSCAANSPCQYSITYGDGSSTTGFFVADFLQYDQVS 199
Query: 253 --GRTVVKN--VAIGCGHKNQGMF----VGAAGLLGLGGGSMSLVGQL--GGQTGGAFSY 302
G+T + N V GCG K G V G+LG G + S++ QL G+ FS+
Sbjct: 200 GDGQTNLANASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSH 259
Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
CL + G G G P PLV P P Y V L + VGG + + ++F
Sbjct: 260 CLDTVNGG--GIFAIGNVVQP-KVKTTPLV--PGMPH-YNVVLKTIDVGGSTLQLPTNIF 313
Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD-TCYNLSGFV 421
+ G G ++D+GT + LP Y+A A + N P + ++ D C+ SG V
Sbjct: 314 DIGG-GSRGTIIDSGTTLAYLPEVVYKAVLSAVFS---NHPDVTLKNVQDFLCFQYSGSV 369
Query: 422 SVRVPTVSFYFSGG-PVLTLPASNFLIPVDDAGTFCFAF------APSPSGLSIIGNIQQ 474
P V+F+F G P++ P +D +C F + + ++G++
Sbjct: 370 DNGFPEVTFHFDGDLPLVVYPHDYLFQNTEDV--YCVGFQSGGVQSKDGKDMVLLGDLAL 427
Query: 475 EGIQISFDGANGFVGF 490
+ +D N +G+
Sbjct: 428 SNKLVVYDLENQVIGW 443
>gi|356513737|ref|XP_003525567.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Glycine
max]
Length = 455
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 118/395 (29%), Positives = 161/395 (40%), Gaps = 72/395 (18%)
Query: 166 PRSQ----YMVIDSGSDIVWVQCQP--CSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL 219
PR+Q + +D+GSD+VW C P C C + + P ++ VSC S C
Sbjct: 56 PRAQAQPITLYMDTGSDLVWFPCAPFKCILCEGKPN-ASPPVNTTRSVAVSCKSPACSAA 114
Query: 220 ENAG-----CHAGRCRYE----------------VSYGDGSYTKGTLALETLTIGRTVVK 258
N C A RC E +YGDGS L +TL++ ++
Sbjct: 115 HNLASPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSLI-ARLYRDTLSLSSLFLR 173
Query: 259 NVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLG---GQTGGAFSYCLVSRGTGSS--- 312
N GC + G+ G G G +SL QL Q G FSYCLVS S
Sbjct: 174 NFTFGCAYTT---LAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCLVSHSFDSERVR 230
Query: 313 --GSLVFGR-----EALPVGA-----AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISED 360
L+ GR E VG + P++ NP+ P FY VGL G+ VG +P E
Sbjct: 231 KPSPLILGRYEEEEEEEKVGGGVAEFVYTPMLENPKHPYFYTVGLIGISVGKRIVPAPEM 290
Query: 361 LFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL-PRASGVSI---FDTCYN 416
L R+ GD GVV+D+GT T LP Y + D F G + RA + CY
Sbjct: 291 LRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRGVGRVNERARKIEEKTGLAPCYY 350
Query: 417 LSGFVSVRVPTVSFYFSGG-PVLTLPASNFLIPVDD----------AGTFCFAFAPSPSG 465
L+ VP ++ F+GG + LP N+ D G +
Sbjct: 351 LNSVAE--VPVLTLRFAGGNSSVVLPRKNYFYEFLDGRDAAKGKRRVGCLMLMNGGDEAE 408
Query: 466 LS-----IIGNIQQEGIQISFDGANGFVGFGPNVC 495
LS +GN QQ+G ++ +D VGF C
Sbjct: 409 LSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQC 443
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 113/405 (27%), Positives = 179/405 (44%), Gaps = 44/405 (10%)
Query: 111 RMQRDVKRVATLVRRLSGGGADA-AKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQ 169
R + R+A RR G GA A+ + D D+++ +G Y R+ +G+PP+
Sbjct: 51 RSYPNASRLAASSRRGLGDGAHPNARMRLHD---DLLT-----NGYYTTRLYIGTPPQEF 102
Query: 170 YMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS-SAVCDRLENAGCHAGR 228
+++DSGS + +V C C QC DP F P S+S+S V C+ CD +
Sbjct: 103 ALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKCNVDCTCD------SDKKQ 156
Query: 229 CRYEVSYGDGSYTKGTLALETLTIGRT---VVKNVAIGCGHKNQGMFVG--AAGLLGLGG 283
C YE Y + S + G L + ++ GR + GC + G A G++GLG
Sbjct: 157 CTYERQYAEMSSSSGVLGEDIVSFGRESELKPQRAVFGCENSETGDLFSQHADGIMGLGR 216
Query: 284 GSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFY 341
G +S++ QL G +FS C G G++V G +P + V +P +Y
Sbjct: 217 GQLSIMDQLVEKGVISDSFSLCYGGMDIG-GGAMVLG--GVPAPSDMVFSHSDPLRSPYY 273
Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
+ L + V G + + +F G V+D+GT LP A+ AF+DA ++ +
Sbjct: 274 NIELKEIHVAGKALRVDSRVFN----SKHGTVLDSGTTYAYLPEQAFVAFKDAVTSKVHS 329
Query: 402 LPRASG--VSIFDTCY-----NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI---PVDD 451
L + G + D C+ N+S V P V F G L+L N+L VD
Sbjct: 330 LKKIRGPDPNYKDICFAGAGRNVSKLHEV-FPDVDMVFGNGQKLSLTPENYLFRHSKVD- 387
Query: 452 AGTFCF-AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
G +C F +++G I +++D N +GF C
Sbjct: 388 -GAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNC 431
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 117/416 (28%), Positives = 179/416 (43%), Gaps = 44/416 (10%)
Query: 104 HQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVG 163
H+ RD R ++R GG D D T G G Y ++ +G
Sbjct: 39 HRVEIDTLRARDRVRHGRILRASVGGVVDFRVQGSSDPST-------LGYGLYTTKVKMG 91
Query: 164 SPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVC-D 217
+PPR + ID+GSDI+W+ C CS C K S FD S++ + V CS +C
Sbjct: 92 TPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIELNFFDTVGSSTAALVPCSDPMCAS 151
Query: 218 RLENAGC----HAGRCRYEVSYGDGSYTKGTLALET----LTIGRTVVKNVA------IG 263
++ A +C Y Y DGS T G + + +G++ NVA G
Sbjct: 152 AIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTPANVASSATIVFG 211
Query: 264 CGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVF 317
C G G+LG G G +S+V QL G T FS+CL +G G+ G ++
Sbjct: 212 CSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCL--KGDGNGGGILV 269
Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
E L + PLV P P Y + L + V G + I+ +F + G ++D+G
Sbjct: 270 LGEILEPSIVYSPLV--PSQPH-YNLNLQSIAVNGQVLSINPAVFATSD--KRGTIIDSG 324
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPV 437
T ++ L AY+ +A S +S CY + + PTVSF F GG
Sbjct: 325 TTLSYLVQEAYDPLVNAVDTAVSQFA-TSFISKGSQCYLVLTSIDDSFPTVSFNFEGGAS 383
Query: 438 LTLPASNFLIP---VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
+ L S +L+ D A +C F G++I+G++ + + +D A +G+
Sbjct: 384 MDLKPSQYLLNRGFQDGAKMWCIGFQKVQEGVTILGDLVLKDKIVVYDLARQQIGW 439
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 124/410 (30%), Positives = 185/410 (45%), Gaps = 45/410 (10%)
Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
RD R L+R + GG D + D G YF ++ +GSPPR + I
Sbjct: 53 RDQARHGRLLRGVVGGVVDFTVYGTSD---------PYLVGLYFTKVKLGSPPREFNVQI 103
Query: 174 DSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCDRLEN---AGC- 224
D+GSDI+WV C C+ C + S FDP+ S++ S VSCS +C L A C
Sbjct: 104 DTGSDILWVTCNSCNDCPRTSGLGIELSFFDPSSSSTTSLVSCSHPICTSLVQTTAAECS 163
Query: 225 -HAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN----VAIGCGHKNQGMFV-- 273
+ +C Y YGDGS T G + L +G +++ N + GC G
Sbjct: 164 PQSNQCSYSFHYGDGSGTTGYYVSDMLYFDTVLGDSLIANSSASIVFGCSTYQSGDLTKV 223
Query: 274 --GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV 329
G+ G G +S+V QL G T FS+CL G G G LV G E L +
Sbjct: 224 DKAIDGIFGFGQQDLSVVSQLSSLGITPKVFSHCLKGEGDG-GGKLVLG-EILEPNIIYS 281
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
PLV + S Y + L + V G +PI +F + + G ++D+GT +T L AY+
Sbjct: 282 PLV---PSQSHYNLNLQSISVNGQLLPIDPAVFATSN--NQGTIVDSGTTLTYLVETAYD 336
Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
F A A + +S + CY +S V P VS F+GG + L +L+ +
Sbjct: 337 PFVSAITATVSS-STTPVLSKGNQCYLVSTSVDEIFPPVSLNFAGGASMVLKPGEYLMHL 395
Query: 450 ---DDAGTFCFAFAP-SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
D A +C F + G++I+G++ + +D A+ +G+ C
Sbjct: 396 GFSDGAAMWCIGFQKVAEPGITILGDLVLKDKIFVYDLAHQRIGWANYDC 445
>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 118/375 (31%), Positives = 167/375 (44%), Gaps = 58/375 (15%)
Query: 165 PPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV--FDPADSASFSGVSCSSAVC-----D 217
PP++ MVID+GS++ W++C S +PV FDP S+S+S + CSS C D
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRSSN----PNPVNNFDPTRSSSYSPIPCSSPTCRTRTRD 137
Query: 218 RLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQG----M 271
L A C + + C +SY D S ++G LA E G T N+ GC G
Sbjct: 138 FLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEE 197
Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP--VGAAWV 329
GLLG+ GS+S + Q+G FSYC +S G L+ G +
Sbjct: 198 DTKTTGLLGMNRGSLSFISQMGFP---KFSYC-ISGTDDFPGFLLLGDSNFTWLTPLNYT 253
Query: 330 PLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
PL+R + P F Y V L+G+ V G +PI + + G ++D+GT T L
Sbjct: 254 PLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLLPDHTGAGQTMVDSGTQFTFLL 313
Query: 385 TPAYEAFRDAFVAQTGNL------PRASGVSIFDTCYNLSGF-----VSVRVPTVSFYF- 432
P Y A R F+ QT + P D CY +S F + R+PTVS F
Sbjct: 314 GPVYTALRSDFLNQTNGILTVYEDPEFVFQGTMDLCYRISPFRIRTGILHRLPTVSLVFE 373
Query: 433 ------SGGPVLTLPASNFLIPVDDAG---TFCFAFAPSP-SGLS--IIGNIQQEGIQIS 480
SG P+L + +P AG +CF F S G+ +IG+ Q+ + I
Sbjct: 374 GAEIAVSGQPLL------YRVPHLTAGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIE 427
Query: 481 FDGANGFVGFGPNVC 495
FD +G P C
Sbjct: 428 FDLQRSRIGLAPVQC 442
>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
Length = 442
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 116/380 (30%), Positives = 171/380 (45%), Gaps = 60/380 (15%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
V + VG PP++ MV+D+GS++ W+ C+ VF+P S+++S V CSS +C
Sbjct: 67 VTLAVGDPPQNISMVLDTGSELSWLHCKKSPNL----GSVFNPVSSSTYSPVPCSSPICR 122
Query: 217 ----DRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK--- 267
D A C C +SY D + +G LA ET IG GC
Sbjct: 123 TRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTLFGCMDSGLS 182
Query: 268 -NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL---- 322
N + GL+G+ GS+S V QLG FSYC+ G+ SS L+ G +
Sbjct: 183 SNSEEDAKSTGLMGMNRGSLSFVNQLGF---SKFSYCI--SGSDSSVFLLLGDASYSWLG 237
Query: 323 PVGAAWVPLV-RNPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
P+ + PLV ++ P F Y V L G+ VG + + + +F G ++D+G
Sbjct: 238 PI--QYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSG 295
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCY--------NLSGFVSV 423
T T L P Y A ++ F+ QT ++ R F D CY N SG
Sbjct: 296 TQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSG---- 351
Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGT------FCFAFAPSP-SGLS--IIGNIQQ 474
+P VS F G +++ L V+ AG+ +CF F S G+ +IG+ Q
Sbjct: 352 -LPMVSLMFRGAE-MSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQ 409
Query: 475 EGIQISFDGANGFVGFGPNV 494
+ + + FD A VGF NV
Sbjct: 410 QNVWMEFDLAKSRVGFAGNV 429
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 120/399 (30%), Positives = 173/399 (43%), Gaps = 48/399 (12%)
Query: 103 RHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGS-GEYFVRIG 161
H+ RD R L++ L G V DF D D G Y+ ++
Sbjct: 38 NHEMELSQLKARDEARHGRLLQSLGG---------VIDFPVD--GTFDPFVVGLYYTKLR 86
Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVC 216
+G+PPR Y+ +D+GSD++WV C C+ C + S FDP S + S +SCS C
Sbjct: 87 LGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRC 146
Query: 217 D---RLENAGC--HAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN----VAIG 263
+ ++GC C Y YGDGS T G + L +G ++V N V G
Sbjct: 147 SWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFG 206
Query: 264 CGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSSGSLVF 317
C G V G+ G G MS++ QL Q FS+CL G G LV
Sbjct: 207 CSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGE-NGGGGILVL 265
Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
G P + PLV P P Y V L + V G +PI+ +F + G ++DTG
Sbjct: 266 GEIVEP-NMVFTPLV--PSQPH-YNVNLLSISVNGQALPINPSVFSTSN--GQGTIIDTG 319
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPV 437
T + L AY F +A R VS + CY ++ V P VS F+GG
Sbjct: 320 TTLAYLSEAAYVPFVEAITNAVSQSVRPV-VSKGNQCYVITTSVGDIFPPVSLNFAGGAS 378
Query: 438 LTLPASNFLIPVDDAG---TFCFAFAP-SPSGLSIIGNI 472
+ L ++LI ++ G +C F G++I+G++
Sbjct: 379 MFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDL 417
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 108/373 (28%), Positives = 171/373 (45%), Gaps = 41/373 (10%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
G YF R+ +G+P + ++ ID+GSDI+WV C PC+ C S F+P S++ S
Sbjct: 87 GLYFTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSR 146
Query: 209 VSCSSAVCDRLENAG---CHAGR-----CRYEVSYGDGSYTKGTLALETLTIGRTVVKN- 259
+ CS C G C + C Y +YGDGS T G +T+ TV+ N
Sbjct: 147 IPCSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYF-DTVMGNE 205
Query: 260 --------VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLV 305
V GC + G + G+ G G +S+V QL G + FS+CL
Sbjct: 206 QTANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHCL- 264
Query: 306 SRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
+G+ + G ++ E + G + PLV P P Y + L + V G ++PI LF +
Sbjct: 265 -KGSDNGGGILVLGEIVEPGLVFTPLV--PSQP-HYNLNLESIAVSGQKLPIDSSLFATS 320
Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV 425
G ++D+GT + L AY+ F +A A R+ C+ + V
Sbjct: 321 NT--QGTIVDSGTTLVYLVDGAYDPFINAIAAAVSPSVRSVVSKGIQ-CFVTTSSVDSSF 377
Query: 426 PTVSFYFSGGPVLTLPASNFLI---PVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
PT + YF GG +T+ N+L+ VD+ +C + S G++I+G++ + +D
Sbjct: 378 PTATLYFKGGVSMTVKPENYLLQQGSVDNNVLWCIGWQRS-QGITILGDLVLKDKIFVYD 436
Query: 483 GANGFVGFGPNVC 495
AN +G+ C
Sbjct: 437 LANMRMGWADYDC 449
>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
Length = 353
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 101/367 (27%), Positives = 163/367 (44%), Gaps = 40/367 (10%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDP---VFDPADSASFSG 208
+YF+ I +G+PP + ID+GS + WVQC+ C +CY Q+ +F+P +S+++S
Sbjct: 3 KNKYFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSK 62
Query: 209 VSCSSAVCDRLE-----NAGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNV 260
V CS+ C+ + GC C Y + YG G Y+ G L + LT+ + N
Sbjct: 63 VGCSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNRSIDNF 122
Query: 261 AIGCGHKNQGMFVGA-AGLLGLGGGSMSLVGQLGGQTG-GAFSYCLVSRGTGSSGSLVFG 318
GCG N ++ G AG++G G S S Q+ QT AFSYC R + GSL G
Sbjct: 123 IFGCGEDN--LYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCF-PRDHENEGSLTIG 179
Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
A + W L+ P+ Y + + V G+R+ I ++ +++M ++D+GT
Sbjct: 180 PYARDINLMWTKLIYYDHKPA-YAIQQLDMMVNGIRLEIDPYIY-ISKM----TIVDSGT 233
Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCY-------NLSGFVSVRVPTVSFY 431
A T + +P ++A A + G C+ N + F +V + +
Sbjct: 234 ADTYILSPVFDALDKAMTKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKLIR-- 291
Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS---GLSIIGNIQQEGIQISFDGANGFV 488
L LP N + C F P + G+ ++GN ++ FD
Sbjct: 292 ----STLKLPVENAFYESSN-NVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMNF 346
Query: 489 GFGPNVC 495
GF C
Sbjct: 347 GFKARAC 353
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 121/432 (28%), Positives = 185/432 (42%), Gaps = 67/432 (15%)
Query: 100 HYHRHQHSFHARMQRDVKRVATLVR--RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYF 157
Y R Q S A + D +R T++ L GG +G G Y+
Sbjct: 38 RYPRLQGSLSALKEHDDRRQLTILAGIDLPLGG----------------TGRPDIPGLYY 81
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCS 212
+IG+G+P +S Y+ +D+GSDI+WV C C QC ++S +++ +S S VSC
Sbjct: 82 AKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCD 141
Query: 213 SAVCDRLEN---AGCHAGR-CRYEVSYGDGSYTKGTLALETLTIG--------RTVVKNV 260
C ++ +GC A C Y YGDGS T G + + +T +V
Sbjct: 142 DDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSV 201
Query: 261 AIGCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSG 313
GCG + G + G+LG G + S++ QL G+ F++CL R G G
Sbjct: 202 IFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGG--G 259
Query: 314 SLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD-DGV 372
GR P PLV P P Y V ++ + VG + I DLF Q GD G
Sbjct: 260 IFAIGRVVQP-KVNMTPLV--PNQPH-YNVNMTAVQVGQEFLNIPADLF---QPGDRKGA 312
Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD---TCYNLSGFVSVRVPTVS 429
++D+GT + LP YE +Q L V I D C+ SG V P V+
Sbjct: 313 IIDSGTTLAYLPEIIYEPLVKKITSQEPALK----VHIVDKDYKCFQYSGRVDEGFPNVT 368
Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP------SGLSIIGNIQQEGIQISFDG 483
F+F L + ++L P + G +C + S ++++G++ + +D
Sbjct: 369 FHFENSVFLRVYPHDYLFPYE--GMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDL 426
Query: 484 ANGFVGFGPNVC 495
N +G+ C
Sbjct: 427 ENQLIGWTEYNC 438
>gi|414871328|tpg|DAA49885.1| TPA: hypothetical protein ZEAMMB73_545054 [Zea mays]
Length = 565
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 86/240 (35%), Positives = 116/240 (48%), Gaps = 11/240 (4%)
Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGRE 320
GC G V + GL+G G +S Q G FSYCL S + + SG+L G
Sbjct: 329 FGCLCVVTGGSVPSQGLVGFNRGPLSFPSQNKNVYGSVFSYCLPSYKSSNFSGTLRLGPA 388
Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
P PL+ NP PS YYV + G+ VGG + + G ++D GT
Sbjct: 389 GQPKRIKTTPLLSNPHRPSLYYVNMVGIRVGGRPVAVPASALAFDPASGHGTIVDAGTMF 448
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
TRL P Y A D F ++ P A + FDTCYN V++ VPTV+F F G +TL
Sbjct: 449 TRLSAPVYAAVCDVFRSRV-RAPVAGPLGGFDTCYN----VTISVPTVTFLFDGRVSVTL 503
Query: 441 PASNFLIPVDDAGTFCFAFAPSPSG-----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
P N +I G C A A PS L+++ ++QQ+ ++ FD ANG VGF +C
Sbjct: 504 PEENVVIRSSLDGIACLAMAAGPSDSVDAVLNVMASMQQQNHRVLFDVANGRVGFSRELC 563
>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 372
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 101/367 (27%), Positives = 163/367 (44%), Gaps = 40/367 (10%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDP---VFDPADSASFSG 208
+YF+ I +G+PP + ID+GS + WVQC+ C +CY Q+ +F+P +S+++S
Sbjct: 22 KNKYFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSK 81
Query: 209 VSCSSAVCDRLE-----NAGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNV 260
V CS+ C+ + GC C Y + YG G Y+ G L + LT+ + N
Sbjct: 82 VGCSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNRSIDNF 141
Query: 261 AIGCGHKNQGMFVGA-AGLLGLGGGSMSLVGQLGGQTG-GAFSYCLVSRGTGSSGSLVFG 318
GCG N ++ G AG++G G S S Q+ QT AFSYC R + GSL G
Sbjct: 142 IFGCGEDN--LYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCF-PRDHENEGSLTIG 198
Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
A + W L+ P+ Y + + V G+R+ I ++ +++M ++D+GT
Sbjct: 199 PYARDINLMWTKLIYYDHKPA-YAIQQLDMMVNGIRLEIDPYIY-ISKM----TIVDSGT 252
Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCY-------NLSGFVSVRVPTVSFY 431
A T + +P ++A A + G C+ N + F +V + +
Sbjct: 253 ADTYILSPVFDALDKAMTKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKLIR-- 310
Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS---GLSIIGNIQQEGIQISFDGANGFV 488
L LP N + C F P + G+ ++GN ++ FD
Sbjct: 311 ----STLKLPVENAFYESSN-NVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMNF 365
Query: 489 GFGPNVC 495
GF C
Sbjct: 366 GFKARAC 372
>gi|110740049|dbj|BAF01928.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
Length = 183
Score = 128 bits (322), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 73/186 (39%), Positives = 103/186 (55%), Gaps = 8/186 (4%)
Query: 312 SGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
+G L FG + + P+ SFY + + + VGG ++PI +F G
Sbjct: 3 TGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFS-----TPG 57
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
++D+GT +TRLP AY A R +F A+ P SGVSI DTC++LSGF +V +P V+F
Sbjct: 58 ALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFS 117
Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVG 489
FSGG V+ L S + V C AFA S +I GN+QQ+ +++ +DGA G VG
Sbjct: 118 FSGGAVVEL-GSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVG 176
Query: 490 FGPNVC 495
F PN C
Sbjct: 177 FAPNGC 182
>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 330
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 102/311 (32%), Positives = 145/311 (46%), Gaps = 32/311 (10%)
Query: 196 PVFDPADSASFSGVSCSSAVCDRLENAGCHAGR------CRYEVSYGDGSYTKGTLALET 249
P FD + S++ SC S +C L A C + C Y Y D S T G + ++
Sbjct: 23 PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLIEVDK 82
Query: 250 LTIGR-TVVKNVAIGCGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
T G V VA GCG N G+F G+ G G G +SL QL G FS+C +
Sbjct: 83 FTFGAGASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQL---KVGNFSHCFTAV 139
Query: 308 GTGSSGSLVF---------GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPIS 358
+++ GR A+ PL++N P+FYY+ L G+ VG R+P+
Sbjct: 140 NGLKQSTVLLDLPADLYKNGRGAV----QSTPLIQNSANPTFYYLSLKGITVGSTRLPVP 195
Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD-TCYNL 417
E F LT G G ++D+GT++T LP Y+ RD F AQ LP G + TC++
Sbjct: 196 ESAFALTN-GTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI-KLPVVPGNATGPYTCFSA 253
Query: 418 SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV-DDAGT--FCFAFAPSPSGLSIIGNIQQ 474
VP + +F G + LP N++ V DDAG C A +IIGN QQ
Sbjct: 254 PSQAKPDVPKLVLHFEGA-TMDLPRENYVFEVPDDAGNSIICLAINKG-DETTIIGNFQQ 311
Query: 475 EGIQISFDGAN 485
+ + + +D N
Sbjct: 312 QNMHVLYDLQN 322
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 128 bits (321), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 95/360 (26%), Positives = 161/360 (44%), Gaps = 31/360 (8%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+G Y R+ +G+PP+ +++D+GS + +V C C QC + DP F P S+++ V C+
Sbjct: 109 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKCT 168
Query: 213 SAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHK 267
+ C R C YE Y + S + G L + ++ G + GC +
Sbjct: 169 I-------DCNCDGDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCENV 221
Query: 268 NQGMFVG--AAGLLGLGGGSMSLVGQLGGQT--GGAFSYCLVSRGTGSSGSLVFGREALP 323
G A G++GLG G +S++ QL + +FS C G G++V G + P
Sbjct: 222 ETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVG-GGAMVLGGISPP 280
Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
+ +P +Y + L + V G R+P++ ++F G G V+D+GT L
Sbjct: 281 SDMTFA--YSDPDRSPYYNIDLKEMHVAGKRLPLNANVFD----GKHGTVLDSGTTYAYL 334
Query: 384 PTPAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGF----VSVRVPTVSFYFSGGPV 437
P A+ AF+DA V + +L + SG + D C++ +G +S P V F G
Sbjct: 335 PEAAFLAFKDAIVKELQSLKQISGPDPNYNDICFSGAGNDVSQLSKSFPVVDMVFGNGHK 394
Query: 438 LTLPASNFLIPVDDA-GTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+L N++ G +C F +++G I + +D +GF C
Sbjct: 395 YSLSPENYMFRHSKVRGAYCLGIFQNGNDQTTLLGGIIVRNTLVMYDREQTKIGFWKTNC 454
>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 128 bits (321), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 124/457 (27%), Positives = 199/457 (43%), Gaps = 84/457 (18%)
Query: 60 LFERHNNISSSN----TSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRD 115
+F H ++ +N +S R +L+HRD + S Y+R + R +R
Sbjct: 14 IFSTHFALTIANNLEFSSIQPTRLVTKLIHRDSIVSP--------YYRSNDTVADRTERT 65
Query: 116 VKRVATLVRRLSGGGADAAKHEVQDFG-TDVVSGMDQGSGE--YFVRIGVGSPPRSQYMV 172
+K A+L R LS A + DF D+ + + E + V +G PP Q +
Sbjct: 66 MK--ASLAR-LSYLYAKIER----DFDINDLWLNLHPSASEPLFLVNFSMGQPPVPQLAI 118
Query: 173 IDSGSDIVWVQCQPCSQCYKQ-SDPVFDPADSASFSGVSCSSAVCDRLENAGCH-AGRCR 230
+D+GS ++W+QC PC C +Q P+FDP+ S+++ +SC + +C + C + +C
Sbjct: 119 MDTGSSLLWIQCAPCKSCSQQIIGPMFDPSISSTYDSLSCKNIICRYAPSGECDSSSQCV 178
Query: 231 YEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIGCGHKNQGMFVGA--AGLLGLGG 283
Y +Y +G + G +A E L GR V NV GC H+N G + G+ GLG
Sbjct: 179 YNQTYVEGLPSVGVIATEQLIFGSSDEGRNAVNNVLFGCSHRN-GNYKDRRFTGVFGLGS 237
Query: 284 GSMSLVGQLGGQTGGAFSYCL--VSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFY 341
G S+V Q+G + FSYC+ ++ S LV E + + PL Y
Sbjct: 238 GITSVVNQMGSK----FSYCIGNIADPDYSYNQLVLS-EGVNMEGYSTPL---DVVDGHY 289
Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF---------- 391
V L G+ VG R+ I F+ T+ V++D+GTA T L Y A
Sbjct: 290 QVILEGISVGETRLVIDPSAFKRTE-KQRRVIIDSGTAPTWLAENEYRALEREVRNLLDR 348
Query: 392 ------RDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNF 445
R++F+ G + + +L GF P V+F+F+ G L
Sbjct: 349 FLTPFMRESFLCYKGKVGQ-----------DLVGF-----PAVTFHFAEGADLV------ 386
Query: 446 LIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
VD + S+IG + Q+ +++D
Sbjct: 387 ---VDTEMRQASVYGKDFKDFSVIGLMAQQYYNVAYD 420
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 128 bits (321), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 113/371 (30%), Positives = 166/371 (44%), Gaps = 48/371 (12%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
V + VGSPP+ MV+D+GS++ W+ C+ VF+P S+S+S + CSS +C
Sbjct: 1002 VSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTS----VFNPLSSSSYSPIPCSSPICR 1057
Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK---- 267
D C + C VSY D S +G LA + IG + + GC
Sbjct: 1058 TRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSSALPGTLFGCMDSGFSS 1117
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV--G 325
N GL+G+ GS+S V QLG FSYC+ G SSG L+FG L
Sbjct: 1118 NSEEDAKTTGLMGMNRGSLSFVTQLGLP---KFSYCI--SGRDSSGVLLFGDLHLSWLGN 1172
Query: 326 AAWVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
+ PLV+ + P F Y V L G+ VG +P+ + +F G ++D+GT
Sbjct: 1173 LTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQF 1232
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCYNL-SGFVSVRVPTVSFYFS 433
T L P Y A R+ F+ QT + G F D CY++ +G +P+VS F
Sbjct: 1233 TFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYSVAAGGKLPTLPSVSLMFR 1292
Query: 434 ------GGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLS--IIGNIQQEGIQISFDGA 484
GG VL + + +C F S G+ +IG+ Q+ + + FD
Sbjct: 1293 GAEMVVGGEVLLYRVPEMM--KGNEWVYCLTFGNSDLLGIEAFVIGHHHQQNVWMEFD-- 1348
Query: 485 NGFVGFGPNVC 495
V F ++C
Sbjct: 1349 --LVAFAADLC 1357
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 125/428 (29%), Positives = 182/428 (42%), Gaps = 50/428 (11%)
Query: 92 SSNTTNNMHYHRHQHSFHARMQRDVKRVATLVR-------RLSGGGADAAKHEVQDFGTD 144
+SNT M + V+R L R R GGG A H
Sbjct: 29 TSNTGIRMKLTHVDAKGNYTAPERVRRAIALSRQINLASTRAEGGGVSAPVH-------- 80
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ--CYKQSDPVFDPAD 202
+ +Y VG PP+ +ID+GS ++W QC C + C +Q P F+ +
Sbjct: 81 ------WATRQYIAEYMVGDPPQRAEALIDTGSSLIWTQCTACLRKVCVRQDLPYFNASS 134
Query: 203 SASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA 261
S SF+ V C C C G C + V+YG G G L + T ++ +A
Sbjct: 135 SGSFAPVPCQDKACAGNYLHFCALDGTCTFRVTYGAGGII-GFLGTDAFTF-QSGGATLA 192
Query: 262 IGCGHKNQ----GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--RGTGSSGSL 315
GC + + GA+GL+GLG G +SL Q G + FSYCL G+S L
Sbjct: 193 FGCVSFTRFAAPDVLHGASGLIGLGRGRLSLASQTGAKR---FSYCLTPYFHNNGASSHL 249
Query: 316 VFGREALPVGA-------AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM- 367
G A G A+V ++ +FYY+ L G+ VG ++ I F L ++
Sbjct: 250 FVGAAASLSGGGGAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKLAIPSTAFDLQEVE 309
Query: 368 ---GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLS-GFVS 422
+ GV++D+G+ T L AYE Q G+L G ++ G +
Sbjct: 310 EGFWEGGVIIDSGSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDGGMALCVARGDLD 369
Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
VPT+ +FSGG + LP N+ P++ + T C A SIIGN QQ+ + I FD
Sbjct: 370 RVVPTLVLHFSGGADMALPPENYWAPLEKS-TACMAIVRG-YLQSIIGNFQQQNMHILFD 427
Query: 483 GANGFVGF 490
G + F
Sbjct: 428 VGGGRLSF 435
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 106/371 (28%), Positives = 168/371 (45%), Gaps = 47/371 (12%)
Query: 160 IGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD--PVFDPADSASFSGVSCSSAVC- 216
+ +G+PP++ MV+D+GS++ W++C+ K+ + +F+P S +++ + CSS C
Sbjct: 71 LTIGTPPQNITMVLDTGSELSWLRCK------KEPNFTSIFNPLASKTYTKIPCSSQTCK 124
Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC----GHK 267
D C + C + +SY D S +G LA ET G GC
Sbjct: 125 TRTSDLTLPVTCDPAKLCHFIISYADASSVEGHLAFETFRFGSLTRPATVFGCMDSGSSS 184
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG--REALPVG 325
N GL+G+ GS+S V Q+G + FSYC+ G S+G L+ G R +
Sbjct: 185 NTEEDAKTTGLMGMNRGSLSFVNQMGFR---KFSYCI--SGLDSTGFLLLGEARYSWLKP 239
Query: 326 AAWVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
+ PLV+ + P F Y V L G+ V +P+ + +F G ++D+GT
Sbjct: 240 LNYTPLVQISTPLPYFDRVAYSVQLEGIKVNNKVLPLPKSVFVPDHTGAGQTMVDSGTQF 299
Query: 381 TRLPTPAYEAFRDAFVAQTG------NLPRASGVSIFDTCYNLSGFVSV--RVPTVSFYF 432
T L P Y A R F+ QT N P+ D CY + S +P V F
Sbjct: 300 TFLLGPVYSALRKEFLLQTAGVLRVLNEPQYVFQGAMDLCYLIDSTSSTLPNLPVVKLMF 359
Query: 433 SGGPVLTLPASNFL--IPVDDAG---TFCFAFAPSPS-GLS--IIGNIQQEGIQISFDGA 484
G +++ L +P + G +CF F S G+S +IG+ QQ+ + + +D
Sbjct: 360 RGAE-MSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDELGISSFLIGHHQQQNVWMEYDLE 418
Query: 485 NGFVGFGPNVC 495
N +GF C
Sbjct: 419 NSRIGFAELRC 429
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 100/370 (27%), Positives = 168/370 (45%), Gaps = 40/370 (10%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
G YF +I +GSPP+ Y+ +D+GSDI+WV C PC +C ++D ++D S++
Sbjct: 75 GLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKASSTSKN 134
Query: 209 VSCSSAVCD-RLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT--------VVK 258
V C A C +++ C A + C Y V YGDGS + G + +T+ + + +
Sbjct: 135 VGCEDAFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFVKDNITLDQVTGNLRTAPLAQ 194
Query: 259 NVAIGCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGS 311
V GCG KNQ +G G++G G + S++ QL GG FS+CL + G
Sbjct: 195 EVVFGCG-KNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDNMNGG- 252
Query: 312 SGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
G G PV PLV N Y V L G+ V G I + L + GD G
Sbjct: 253 -GIFAIGEVESPV-VKTTPLVPNQV---HYNVILKGMDVDGEPIDLPPSL--ASTNGDGG 305
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
++D+GT + LP Y + + A+ + V C++ + P V+ +
Sbjct: 306 TIIDSGTTLAYLPQNLYNSLIEKITAK--QQVKLHMVQETFACFSFTSNTDKAFPVVNLH 363
Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAF------APSPSGLSIIGNIQQEGIQISFDGAN 485
F L++ ++L + + +CF + + + ++G++ + +D N
Sbjct: 364 FEDSLKLSVYPHDYLFSLRE-DMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLEN 422
Query: 486 GFVGFGPNVC 495
+G+ + C
Sbjct: 423 EVIGWADHNC 432
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 121/432 (28%), Positives = 185/432 (42%), Gaps = 67/432 (15%)
Query: 100 HYHRHQHSFHARMQRDVKRVATLVR--RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYF 157
Y R Q S A + D +R T++ L GG +G G Y+
Sbjct: 38 RYPRLQGSLTALKEHDDRRQLTILAGIDLPLGG----------------TGRPDIPGLYY 81
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCS 212
+IG+G+P +S Y+ +D+GSDI+WV C C QC ++S +++ +S S VSC
Sbjct: 82 AKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCD 141
Query: 213 SAVCDRLEN---AGCHAGR-CRYEVSYGDGSYTKGTLALETLTIG--------RTVVKNV 260
C ++ +GC A C Y YGDGS T G + + +T +V
Sbjct: 142 DDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSV 201
Query: 261 AIGCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSG 313
GCG + G + G+LG G + S++ QL G+ F++CL R G G
Sbjct: 202 IFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGG--G 259
Query: 314 SLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD-DGV 372
GR P PLV P P Y V ++ + VG + I DLF Q GD G
Sbjct: 260 IFAIGRVVQP-KVNMTPLV--PNQPH-YNVNMTAVQVGQEFLTIPADLF---QPGDRKGA 312
Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD---TCYNLSGFVSVRVPTVS 429
++D+GT + LP YE +Q L V I D C+ SG V P V+
Sbjct: 313 IIDSGTTLAYLPEIIYEPLVKKITSQEPALK----VHIVDKDYKCFQYSGRVDEGFPNVT 368
Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP------SGLSIIGNIQQEGIQISFDG 483
F+F L + ++L P + G +C + S ++++G++ + +D
Sbjct: 369 FHFENSVFLRVYPHDYLFPHE--GMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDL 426
Query: 484 ANGFVGFGPNVC 495
N +G+ C
Sbjct: 427 ENQLIGWTEYNC 438
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 103/374 (27%), Positives = 171/374 (45%), Gaps = 43/374 (11%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFS 207
+G Y+ RIG+GSPP ++ +D+GSDI+WV C CS C K+SD +++P S++ +
Sbjct: 70 TGLYYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTST 129
Query: 208 GVSCSSAVCDRLENA---GCHAG-RCRYEVSYGDGSYTKGTLALETLTIGRTVVKN---- 259
++C C +A GC C+Y+V YGDGS T G + + + R V +
Sbjct: 130 LITCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSE 189
Query: 260 ----VAIGCGHKNQGMFVGAA----GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGT 309
+ GCG K G ++ G+LG G + S++ QL G+ F++CL S
Sbjct: 190 TNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISG 249
Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAP--SFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
G G G P L P P + Y V L+G+ VG + + LF +
Sbjct: 250 G--GIFAIGEVVEP------KLXNTPVVPNQAHYNVVLNGVKVGDTALDLPLGLFETSY- 300
Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
G ++D+GT + LP Y + + +L + F TC+ V PT
Sbjct: 301 -KRGAIIDSGTTLAYLPESIYLPLMEKILGAQPDLKLRTVDDQF-TCFVFDKNVDDGFPT 358
Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF----APSPSG--LSIIGNIQQEGIQISF 481
V+F F +LT+ +L + D +C + A S G ++++G++ + + +
Sbjct: 359 VTFKFEESLILTIYPHEYLFQIRD-DVWCVGWQNSGAQSKDGNEVTLLGDLVLQNKLVYY 417
Query: 482 DGANGFVGFGPNVC 495
+ N +G+ C
Sbjct: 418 NLENQTIGWTEYNC 431
>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
Length = 370
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 103/373 (27%), Positives = 163/373 (43%), Gaps = 57/373 (15%)
Query: 173 IDSGSDIVWVQCQ---PCSQCYKQS--DPVFDPADSASFSGVSCSSAVCDRLEN------ 221
+D+GSD+VWV C C C + S + VF P S+S V+C+ + C L
Sbjct: 1 MDTGSDLVWVPCTRNYSCINCPEDSASNGVFLPRMSSSLHLVTCADSNCKTLYGNNTELL 60
Query: 222 --------AGCHAGRCRYEVSYGDGSYTKGTLALETLTI------GRTVVKNVAIGCGHK 267
C Y + YG GS T G L ETL + G + + A+GC
Sbjct: 61 CQSCAGSLKNCSETCPPYGIQYGRGS-TAGLLLTETLNLPLENGEGARAITHFAVGCSIV 119
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGG-AFSYCLVSR---GTGSSGSLVFGREALP 323
+ +G+ G G G++S+ QLG G F+YCL S +V G +ALP
Sbjct: 120 SSQQ---PSGIAGFGRGALSMPSQLGEHIGKDRFAYCLQSHRFDEENKKSLMVLGDKALP 176
Query: 324 --VGAAWVPLVRNPRAPS------FYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVM 374
+ + P + N RAP +YY+GL G+ +GG R+ + L R G+ G ++
Sbjct: 177 NNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKLLRFDTKGNGGTII 236
Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQ-----TGNLPRASGVSIFDTCYNLSGFVSVRVPTVS 429
D+GT T ++ F +Q G + +G+ + CY+++G ++ +P +
Sbjct: 237 DSGTTFTVFSDEIFKHIAAGFASQIGYRRAGEVEDKTGMGL---CYDVTGLENIVLPEFA 293
Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS-------IIGNIQQEGIQISFD 482
F+F GG + LP +N+ + C S L I+GN QQ+ + +D
Sbjct: 294 FHFKGGSDMVLPVANYFSYFSSFDSICLTMISSRGLLEVDSGPAVILGNDQQQDFYLLYD 353
Query: 483 GANGFVGFGPNVC 495
+GF C
Sbjct: 354 REKNRLGFTQQTC 366
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 117/385 (30%), Positives = 165/385 (42%), Gaps = 53/385 (13%)
Query: 103 RHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTD-----VVSGMDQGSGEYF 157
H+ RD R L++ L G V DF D V G+ Y+
Sbjct: 38 NHEMELSQLKARDEARHGRLLQSLGG---------VIDFPVDGTFDPFVVGL------YY 82
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCS 212
++ +G+PPR Y+ +D+GSD++WV C C+ C + S FDP S + S +SCS
Sbjct: 83 TKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCS 142
Query: 213 SAVCD---RLENAGC--HAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN---- 259
C + ++GC C Y YGDGS T G + L +G ++V N
Sbjct: 143 DQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAP 202
Query: 260 VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSSG 313
V GC G V G+ G G MS++ QL Q FS+CL G G
Sbjct: 203 VVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGE-NGGGG 261
Query: 314 SLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVV 373
LV G P + PLV P P Y V L + V G +PI+ +F + G +
Sbjct: 262 ILVLGEIVEP-NMVFTPLV--PSQPH-YNVNLLSISVNGQALPINPSVFSTSN--GQGTI 315
Query: 374 MDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFS 433
+DTGT + L AY F +A R VS + CY ++ V P VS F+
Sbjct: 316 IDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPV-VSKGNQCYVITTSVGDIFPPVSLNFA 374
Query: 434 GGPVLTLPASNFLIPVDD-AGTFCF 457
GG + L ++LI ++ A CF
Sbjct: 375 GGASMFLNPQDYLIQQNNVASALCF 399
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 115/371 (30%), Positives = 162/371 (43%), Gaps = 44/371 (11%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
V + VGSPP+ MV+D+GS++ W+ C+ VF+P S+S+S + CSS VC
Sbjct: 42 VSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTS----VFNPLSSSSYSPIPCSSPVCR 97
Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK---- 267
D C + C VSY D S +G LA + IG + + GC
Sbjct: 98 TRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSSALPGTLFGCMDSGFSS 157
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV--G 325
N GL+G+ GS+S V QLG FSYC+ G SSG L+FG L
Sbjct: 158 NSEEDAKTTGLMGMNRGSLSFVTQLGLP---KFSYCI--SGRDSSGVLLFGDSHLSWLGN 212
Query: 326 AAWVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
+ PLV+ + P F Y V L G+ VG +P+ + +F G ++D+GT
Sbjct: 213 LTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQF 272
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCYNL-SGFVSVRVPTVSFYFS 433
T L P Y A R+ F+ QT + G F D CY + +G +P VS F
Sbjct: 273 TFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYRVPAGGKLPELPAVSLMFR 332
Query: 434 ------GGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLS--IIGNIQQEGIQISFDGA 484
GG VL + +C F S G+ +IG+ Q+ + + FD
Sbjct: 333 GAEMVVGGEVLLYKVPGMM--KGKEWVYCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLV 390
Query: 485 NGFVGFGPNVC 495
VGF C
Sbjct: 391 KSRVGFVETRC 401
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 114/371 (30%), Positives = 169/371 (45%), Gaps = 40/371 (10%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
V + VG+PP++ MVID+GS++ W+ C SQ S F+P S+S+S + CSS+ C
Sbjct: 75 VSLTVGTPPQNVTMVIDTGSELSWLHCN-TSQNSSSSSSTFNPVWSSSYSPIPCSSSTCT 133
Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK---- 267
D C + + C +SY D S ++G LA +T IG + + NV GC
Sbjct: 134 DQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGIPNVVFGCMDSIFSS 193
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA 327
N GL+G+ GS+S V Q+G FSYC+ SG L+ G A
Sbjct: 194 NSEEDSKNTGLMGMNRGSLSFVSQMGFP---KFSYCISEYDF--SGLLLLGDANFSWLAP 248
Query: 328 --WVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
+ PL+ + P F Y V L G+ V +PI E +F G ++D+GT
Sbjct: 249 LNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQTMVDSGTQF 308
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCYNLSGFVS--VRVPTVSFYF 432
T L PAY A RD F+ +T R S F D CY + + +P+V+ F
Sbjct: 309 TFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLPPLPSVTLVF 368
Query: 433 SGGPVLTLPASNFL--IPVDDAGT---FCFAFAPSP-SGLS--IIGNIQQEGIQISFDGA 484
G +T+ L +P + G CF F S G+ +IG++ Q+ + + FD
Sbjct: 369 RGAE-MTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNVWMEFDLK 427
Query: 485 NGFVGFGPNVC 495
+G C
Sbjct: 428 KSRIGLAEIRC 438
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 111/405 (27%), Positives = 178/405 (43%), Gaps = 44/405 (10%)
Query: 111 RMQRDVKRVATLVRRLSGGGADA-AKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQ 169
R + R+A +RR G G A+ + D D+++ +G Y R+ +G+PP+
Sbjct: 50 RSYPNASRLAASLRRGLGDGVHPNARMRLHD---DLLT-----NGYYTTRLYIGTPPQEF 101
Query: 170 YMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS-SAVCDRLENAGCHAGR 228
+++DSGS + +V C C QC DP F P S+S+S V C+ CD +
Sbjct: 102 ALIVDSGSTVTYVPCSSCEQCGNHQDPRFQPDLSSSYSPVKCNVDCTCD------SDKKQ 155
Query: 229 CRYEVSYGDGSYTKGTLALETLTIGRT---VVKNVAIGCGHKNQGMFVG--AAGLLGLGG 283
C YE Y + S + G L + ++ GR ++ GC + G A G++GLG
Sbjct: 156 CTYERQYAEMSSSSGVLGEDIVSFGRESELKPQHAIFGCENSETGDLFSQHADGIMGLGR 215
Query: 284 GSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFY 341
G +S++ QL G +FS C G G++V G P + +P +Y
Sbjct: 216 GQLSIMDQLVEKGVISDSFSLCYGGMDIG-GGAMVLGGMLAPPDMIFS--NSDPLRSPYY 272
Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
+ L + V G + + +F G V+D+GT LP A+ AF++A ++ +
Sbjct: 273 NIELKEIHVAGKALRVESRIFN----SKHGTVLDSGTTYAYLPEQAFVAFKEAVTSKVHS 328
Query: 402 LPRASG--VSIFDTCY-----NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI---PVDD 451
L + G S D C+ N+S V P V F G L+L N+L VD
Sbjct: 329 LKKIRGPDPSYKDICFAGAGRNVSKLHEV-FPDVDMVFGNGQKLSLTPENYLFRHSKVD- 386
Query: 452 AGTFCF-AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
G +C F +++G I +++D N +GF C
Sbjct: 387 -GAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNC 430
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 103/372 (27%), Positives = 172/372 (46%), Gaps = 39/372 (10%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFS 207
+G Y+ RIG+GSPP ++ +D+GSDI+WV C CS C K+SD +++P S++ +
Sbjct: 70 TGLYYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTST 129
Query: 208 GVSCSSAVCDRLENA---GCHAG-RCRYEVSYGDGSYTKGTLALETLTIGRTVVKN---- 259
++C C +A GC C+Y+V YGDGS T G + + + R V +
Sbjct: 130 LITCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSE 189
Query: 260 ----VAIGCGHKNQGMFVGAA----GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGT 309
+ GCG K G ++ G+LG G + S++ QL G+ F++CL S
Sbjct: 190 TNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISG 249
Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
G G G P P+V N + Y V L+G+ VG + + LF +
Sbjct: 250 G--GIFAIGEVVEP-KLKTTPVVPN---QAHYNVVLNGVKVGDTALDLPLGLFETSY--K 301
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVS 429
G ++D+GT + LP Y + + +L + F TC+ V PTV+
Sbjct: 302 RGAIIDSGTTLAYLPDSIYLPLMEKILGAQPDLKLRTVDDQF-TCFVFDKNVDDGFPTVT 360
Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAF----APSPSG--LSIIGNIQQEGIQISFDG 483
F F +LT+ +L + D +C + A S G ++++G++ + + ++
Sbjct: 361 FKFEESLILTIYPHEYLFQIRD-DVWCVGWQNSGAQSKDGNEVTLLGDLVLQNKLVYYNL 419
Query: 484 ANGFVGFGPNVC 495
N +G+ C
Sbjct: 420 ENQTIGWTEYNC 431
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 117/400 (29%), Positives = 181/400 (45%), Gaps = 44/400 (11%)
Query: 132 DAAKH--EVQDFGTDVVSGMDQGS------GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQ 183
D +H +Q G VV QG+ G Y+ R+ +G+PPR Y+ ID+GSD++WV
Sbjct: 20 DRVRHGRMLQSSGVGVVDFPVQGTFDPFLVGLYYTRLQLGTPPRDFYVQIDTGSDVLWVS 79
Query: 184 CQPCSQCYKQSD---PV--FDPADSASFSGVSCSSAVCD---RLENAGCHA--GRCRYEV 233
C C+ C S P+ FDP S + S +SCS C + ++ C A C Y
Sbjct: 80 CGSCNGCPVNSGLHIPLNFFDPGSSPTASLISCSDQRCSLGLQSSDSVCSAQNNLCGYNF 139
Query: 234 SYGDGSYTKGTLALETL----TIGRTVVKN----VAIGCGHKNQGMFV----GAAGLLGL 281
YGDGS T G + L +G +V+ N + GC G G+ G
Sbjct: 140 QYGDGSGTSGYYVSDLLHFDTVLGGSVMNNSSAPIVFGCSALQTGDLTKSDRAVDGIFGF 199
Query: 282 GGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPS 339
G MS+V QL Q + AFS+CL +G S G ++ E + + PLV P P
Sbjct: 200 GQQDMSVVSQLASQGISPRAFSHCL--KGDDSGGGILVLGEIVEPNIVYTPLV--PSQPH 255
Query: 340 FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQT 399
Y + + + V G + I +F + G ++D+GT + L AY+ F A +
Sbjct: 256 -YNLNMQSISVNGQTLAIDPSVFGTSS--SQGTIIDSGTTLAYLAEAAYDPFISAITSIV 312
Query: 400 GNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI---PVDDAGTFC 456
R +S + CY +S ++ P VS F+GG + L ++LI + A +C
Sbjct: 313 SPSVRPY-LSKGNHCYLISSSINDIFPQVSLNFAGGASMILIPQDYLIQQSSIGGAALWC 371
Query: 457 FAFAP-SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
F G++I+G++ + +D AN +G+ C
Sbjct: 372 IGFQKIQGQGITILGDLVLKDKIFVYDIANQRIGWANYDC 411
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 99/370 (26%), Positives = 168/370 (45%), Gaps = 40/370 (10%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
G YF +I +GSPP+ Y+ +D+GSDI+WV C PC +C ++D ++D S++
Sbjct: 76 GLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKN 135
Query: 209 VSCSSAVCD-RLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT--------VVK 258
V C C +++ C A + C Y V YGDGS + G + +T+ + + +
Sbjct: 136 VGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQ 195
Query: 259 NVAIGCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGS 311
V GCG KNQ +G G++G G + S++ QL GG T FS+CL + G
Sbjct: 196 EVVFGCG-KNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGG- 253
Query: 312 SGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
G G PV P+V N Y V L G+ V G I + L + GD G
Sbjct: 254 -GIFAVGEVESPV-VKTTPIVPNQV---HYNVILKGMDVDGDPIDLPPSL--ASTNGDGG 306
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
++D+GT + LP Y + + A+ + V C++ + P V+ +
Sbjct: 307 TIIDSGTTLAYLPQNLYNSLIEKITAK--QQVKLHMVQETFACFSFTSNTDKAFPVVNLH 364
Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAF------APSPSGLSIIGNIQQEGIQISFDGAN 485
F L++ ++L + + +CF + + + ++G++ + +D N
Sbjct: 365 FEDSLKLSVYPHDYLFSLRE-DMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLEN 423
Query: 486 GFVGFGPNVC 495
+G+ + C
Sbjct: 424 EVIGWADHNC 433
>gi|359476199|ref|XP_003631804.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 421
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 101/329 (30%), Positives = 153/329 (46%), Gaps = 41/329 (12%)
Query: 102 HRHQHSFHARMQRDVKRVATL---VRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFV 158
H S RD RV+ + + + G H F D G + V
Sbjct: 80 HSQPPSPQEIFGRDESRVSFINSKCNQYTSGNLKNHAHNNNLFDED---------GNFLV 130
Query: 159 RIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
+ G+PP++ +++D+GS I W QC+ C C + S F+ + S+++S SC +
Sbjct: 131 DVAFGTPPQNFMLILDTGSSITWTQCKACVNCLQDSHRYFNWSASSTYSSGSC---IPGT 187
Query: 219 LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMF-VGAA 276
+EN Y ++YGD S + G +T+T+ + V + GCG N+G F G
Sbjct: 188 VEN--------NYNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGCGRNNKGDFGSGVD 239
Query: 277 GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA--WVPLVRN 334
G+LGLG G +S V Q + FSYCL S GSL+FG +A ++ + LV
Sbjct: 240 GMLGLGQGQLSTVSQTASKFNKVFSYCLPEE--DSIGSLLFGEKATSQSSSLKFTSLVNG 297
Query: 335 P---RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
P + +Y+V LS + VG R+ I +F G ++D+ T +TRLP AY A
Sbjct: 298 PGTLQESGYYFVNLSDISVGNERLNIPSSVF-----ASPGTIIDSRTVITRLPQRAYSAL 352
Query: 392 RDAFVAQTGNLPRASGV----SIFDTCYN 416
+ AF P ++G I DTCYN
Sbjct: 353 KAAFKKAMAKYPLSNGRRKKGDILDTCYN 381
>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
Length = 419
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 108/371 (29%), Positives = 159/371 (42%), Gaps = 48/371 (12%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--SQCYKQSDPVFDPADSASFSGVSCSS 213
Y +G+PP++ ++D ++VW QC C S C+KQ PVFDP+ S ++ C S
Sbjct: 62 YVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGS 121
Query: 214 AVCDRLENAGCHA-GRCRYEVS--YGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
+C + C G C YE +GD T G + + + IG + +A GC + G
Sbjct: 122 PLCKSIPTRNCSGDGECGYEAPSMFGD---TFGIASTDAIAIGNAEGR-LAFGCVVASDG 177
Query: 271 MFVGA----AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA 326
GA +G +GLG SLVGQ AFSYCL G G +L G A GA
Sbjct: 178 SIDGAMDGPSGFVGLGRTPWSLVGQ---SNVTAFSYCLAPHGPGKKSALFLGASAKLAGA 234
Query: 327 AWVPLVRNPRAP---------------SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
NP P +Y V L G+ G + + + +
Sbjct: 235 G----KSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAASSGGGAITI---- 286
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
+ ++T ++ LP AY+A A G+ A+ FD C+ + VP + F
Sbjct: 287 LQLETFRPLSYLPDAAYQALEKVVTAALGSPSMANPPEPFDLCFQNAAVSG--VPDLVFT 344
Query: 432 FSGGPVLTLPASNFLIPVDDA-GTFCFAFAPSP------SGLSIIGNIQQEGIQISFDGA 484
F GG LT P S +L+ + GT C + S G+SI+G++ QE + FD
Sbjct: 345 FQGGATLTAPPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFDLE 404
Query: 485 NGFVGFGPNVC 495
+ F P C
Sbjct: 405 KETLSFEPADC 415
>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 113/414 (27%), Positives = 164/414 (39%), Gaps = 52/414 (12%)
Query: 126 LSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ 185
LS A K +F + G Y + + G+PP++ V+D+GS +VW C
Sbjct: 53 LSLSRAHHIKSPKTNFSLIKTPLFPRSYGGYSISLNFGTPPQTTKFVMDTGSSLVWFPCT 112
Query: 186 P---CSQC-----YKQSDPVFDPADSASFSGVSCSSAVCD---------RLENAGCHAGR 228
CS+C K P F P S+S + C + C + + A
Sbjct: 113 SRYLCSECNFPNIKKTGIPTFLPKLSSSSKLIGCKNPRCSMIFGPEIQSKCQECDSTAQN 172
Query: 229 CR-----YEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
C Y + YG GS T G L ETL + + + +GC + G+ G G
Sbjct: 173 CTQTCPPYVIQYGSGS-TAGLLLSETLDFPNKKTIPDFLVGCSIFS---IKQPEGIAGFG 228
Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRG---TGSSGSLVFGREA-----LPVGAAWVPLVRN 334
SL QLG + FSYCLVS T +S LV + G + P ++N
Sbjct: 229 RSPESLPSQLGLK---KFSYCLVSHAFDDTPTSSDLVLDTGSGSGVTKTAGLSHTPFLKN 285
Query: 335 PRAP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
P +YYV L + +G + + G+ G ++D+GT T + P YE
Sbjct: 286 PTTAFRDYYYVLLRNIVIGDTHVKVPYKFLVPGTDGNGGTIVDSGTTFTFMENPVYELVA 345
Query: 393 DAFVAQTGNLPRASGVSIFD---TCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
F Q + A+ + CYN+SG S+ VP + F F GG + LP SN+ +
Sbjct: 346 KEFEKQMAHYTVATEIQNLTGLRPCYNISGEKSLSVPDLIFQFKGGAKMALPLSNYF-SI 404
Query: 450 DDAGTFCFAFAP--------SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
D+G C I+GN QQ + FD N GF C
Sbjct: 405 VDSGVICLTIVSDNVAGPGLGGGPAIILGNYQQRNFYVEFDLENEKFGFKQQSC 458
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 99/370 (26%), Positives = 168/370 (45%), Gaps = 40/370 (10%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
G YF +I +GSPP+ Y+ +D+GSDI+WV C PC +C ++D ++D S++
Sbjct: 72 GLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKN 131
Query: 209 VSCSSAVCD-RLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT--------VVK 258
V C C +++ C A + C Y V YGDGS + G + +T+ + + +
Sbjct: 132 VGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQ 191
Query: 259 NVAIGCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGS 311
V GCG KNQ +G G++G G + S++ QL GG T FS+CL + G
Sbjct: 192 EVVFGCG-KNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGG- 249
Query: 312 SGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
G G PV P+V N Y V L G+ V G I + L + GD G
Sbjct: 250 -GIFAVGEVESPV-VKTTPIVPNQV---HYNVILKGMDVDGDPIDLPPSL--ASTNGDGG 302
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
++D+GT + LP Y + + A+ + V C++ + P V+ +
Sbjct: 303 TIIDSGTTLAYLPQNLYNSLIEKITAK--QQVKLHMVQETFACFSFTSNTDKAFPVVNLH 360
Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAF------APSPSGLSIIGNIQQEGIQISFDGAN 485
F L++ ++L + + +CF + + + ++G++ + +D N
Sbjct: 361 FEDSLKLSVYPHDYLFSLRE-DMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLEN 419
Query: 486 GFVGFGPNVC 495
+G+ + C
Sbjct: 420 EVIGWADHNC 429
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 111/396 (28%), Positives = 162/396 (40%), Gaps = 53/396 (13%)
Query: 146 VSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPAD--- 202
VS + G Y V + G+PP++ + D+GS +VW C +C + S P DPA
Sbjct: 122 VSLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISK 181
Query: 203 -----SASFSGVSCSSAVC---------DRLENAGCHAGRCR-----YEVSYGDGSYTKG 243
S+S V C + C R N + +C Y + YG G+ T G
Sbjct: 182 FVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGA-TAG 240
Query: 244 TLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC 303
L ETL + V + +GC + AG+ G G G SL Q+ + FS+C
Sbjct: 241 ILLSETLDLENKRVPDFLVGCSVMSVHQ---PAGIAGFGRGPESLPSQMRLKR---FSHC 294
Query: 304 LVSRG---TGSSGSLVF-----GREALPVGAAWVPLVRNPRAPS-----FYYVGLSGLGV 350
LVSRG + S LV E+ + P NP + +YY+ L + +
Sbjct: 295 LVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILI 354
Query: 351 GGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV-- 408
GG + G+ G ++D+G+ T L P +EA D Q PRA V
Sbjct: 355 GGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEA 414
Query: 409 -SIFDTCYNLSG-FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL 466
S C+N+ S P V F GG L+L A N+L V D G C + +
Sbjct: 415 QSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMTDEAVV 474
Query: 467 S-------IIGNIQQEGIQISFDGANGFVGFGPNVC 495
I+G QQ+ + + +D A +GF C
Sbjct: 475 GGGGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKC 510
>gi|125532795|gb|EAY79360.1| hypothetical protein OsI_34488 [Oryza sativa Indica Group]
Length = 342
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 107/357 (29%), Positives = 150/357 (42%), Gaps = 63/357 (17%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
Y + +G+PP+ +I + VW QC PC +C+KQ P+F+
Sbjct: 28 YMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLFN---------------- 71
Query: 216 CDRLENAGCHAGRCRYEVS--YGDGSYTKGTLALETLTIGRTVVKNVAIGCG-HKNQGMF 272
RYEV +GD S GT +T IG T ++A GC N
Sbjct: 72 --------------RYEVETMFGDTSGIGGT---DTFAIG-TATASLAFGCAMDSNIKQL 113
Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG-TGSSGSLVFGREALPVG---AAW 328
+GA+G++GLG SLVGQ+ AFSYCL G G +L+ G A G AA
Sbjct: 114 LGASGVVGLGRTPWSLVGQMNAT---AFSYCLAPHGAAGKKSALLLGASAKLAGGKSAAT 170
Query: 329 VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
PLV S Y + L G+ G + I + V++DT V+ L A+
Sbjct: 171 TPLVNTSDDSSDYMIHLEGIKFGDVIIEPPPN--------GSVVLVDTIFGVSFLVDAAF 222
Query: 389 EAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV-----SVRVPTVSFYFSGGPVLTLPAS 443
A + A G P A+ FD C+ + S+ +P V F G LT+P S
Sbjct: 223 HAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQGAAALTVPPS 282
Query: 444 NFLIPVDDAGTFCFAFAPSP-----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
++ + GT C A S + LSI+G + QE I FD + F P C
Sbjct: 283 KYMYDAGN-GTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEPADC 338
>gi|414869114|tpg|DAA47671.1| TPA: hypothetical protein ZEAMMB73_872184 [Zea mays]
Length = 492
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 105/362 (29%), Positives = 161/362 (44%), Gaps = 28/362 (7%)
Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGV 209
D G+ +Y V +G G+P + M +D+ + V C+PC+ DP FD + S +F+ V
Sbjct: 143 DAGALDYTVNVGYGTPEQQFPMFLDTIFGVSLVLCKPCAPGSTSCDPAFDTSQSTTFTHV 202
Query: 210 SCSSAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTV-VKNVAIGCGHK 267
C S C N C AG C + + + +G++++ + LT+ +V V++ C
Sbjct: 203 PCDSPDCPSTAN--CSAGSVCPFNLFFVEGTFSQ-----DVLTVAPSVAVQDFTFVCLDA 255
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG-- 325
+ G L L SL +L G AFSYC+ + S G L G +A G
Sbjct: 256 GASDGMPEVGTLDLSRDRNSLPSRLAGSASAAFSYCM-PQYPDSPGFLSLGDDATVRGDN 314
Query: 326 -AAWVPLVR--NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
A PL+ +P + Y++ + G+ +G + +PI F + +++ GT T
Sbjct: 315 CTAHAPLLSSDDPDLANMYFIDVVGMSLGDVDLPIPSGTFG----NNASTIVEAGTTFTM 370
Query: 383 LPTPAYEAFRDAFVAQTGNLPRA-SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
L AY RDAF R+ G FDTCYN +G + VP V F F G L +
Sbjct: 371 LAPDAYTPLRDAFRQAMAQYNRSVPGFYDFDTCYNFTGLQELTVPLVEFKFGNGDSLLID 430
Query: 442 ASNFL---IPVDDAGTF-CFAFA----PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
L IP + T C AF+ ++IG ++ +D A G VGF P
Sbjct: 431 GDQMLYYDIPSEGPFTVTCLAFSTLDVDDDDVSAVIGAYSLATTEVVYDVAGGTVGFIPE 490
Query: 494 VC 495
C
Sbjct: 491 SC 492
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 111/370 (30%), Positives = 173/370 (46%), Gaps = 41/370 (11%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD---PV--FDPADSASFSG 208
G YF R+ +GSPP+ Y+ ID+GSD++WV C C+ C S P+ FDP S + +
Sbjct: 82 GLYFTRVQLGSPPKDFYVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLTFFDPGSSTTAAL 141
Query: 209 VSCSSAVCD---RLENAGC--HAGRCRYEVSYGDGSYTKG-----TLALETL-------- 250
VSCS C + ++ C +C Y YGDGS T G + L+TL
Sbjct: 142 VSCSDQRCTAGIQSSDSLCSSRTNQCGYTFQYGDGSGTSGYYVADLMHLDTLLLSSGELS 201
Query: 251 TIGRTVVKNVAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCL 304
I +T +V+ C G G+ G G MS++ QL Q T FS+CL
Sbjct: 202 QICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITPRVFSHCL 261
Query: 305 VSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL 364
+G S G ++ E + + PLV P P Y + L + V G + I +F
Sbjct: 262 --KGDDSGGGVLVLGEIVEPNIVYTPLV--PSQPH-YNLYLQSISVAGQTLAIDPSVFGA 316
Query: 365 TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR 424
+ + G ++D+GT + L AY+ F A + +L + +S + CY ++ V+
Sbjct: 317 SS--NQGTIVDSGTTLAYLAEGAYDPFVSA-ITSVVSLNARTYLSKGNQCYLVTSSVNDV 373
Query: 425 VPTVSFYFSGGPVLTLPASNFLI---PVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQIS 480
P VS F+GG L L ++L+ V A +C F +P ++I+G++ +
Sbjct: 374 FPQVSLNFAGGASLILNPQDYLLQQNSVGGAAVWCVGFQKTPGQQITILGDLVLKDKIFV 433
Query: 481 FDGANGFVGF 490
+D AN VG+
Sbjct: 434 YDIANQRVGW 443
>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
Length = 469
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 112/386 (29%), Positives = 162/386 (41%), Gaps = 52/386 (13%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQC-YKQSD----PVFDPADSAS 205
G Y + + G+PP++ V+D+GS +VW C CS+C + + P F P S+S
Sbjct: 90 GGYSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSS 149
Query: 206 FSGVSCSSAVCDRL--------------ENAGCHAGRCRYEVSYGDGSYTKGTLALETLT 251
+ + C + C L C Y + YG GS T G L ETL
Sbjct: 150 SNLIGCKNHKCSWLFGPKVQSKCQECDPTTQNCTQSCPPYVIQYGLGS-TAGLLLSETLD 208
Query: 252 IG-RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG-- 308
+ + +GC + G+ G G SL QLG + FSYCLVS
Sbjct: 209 FPHKKTIPGFLVGCSLFS---IRQPEGIAGFGRSPESLPSQLGLK---KFSYCLVSHAFD 262
Query: 309 -TGSSGSLVF-----GREALPVGAAWVPLVRNPRAP--SFYYVGLSGLGVGGMRIPISED 360
T +S LV + G ++ P +NP A +YYV L + +G + +
Sbjct: 263 DTPASSDLVLDTGSGSDDTKTPGLSYTPFQKNPTAAFRDYYYVLLRNIVIGDTHVKVPYK 322
Query: 361 LFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV---SIFDTCYNL 417
G+ G ++D+GT T + P YE F Q + A+ V + C+N+
Sbjct: 323 FLVPGSDGNGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTVATEVQNQTGLRPCFNI 382
Query: 418 SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP---SPSGLS-----II 469
SG SV VP F+F GG + LP +N+ V D+G C S SG+ I+
Sbjct: 383 SGEKSVSVPEFIFHFKGGAKMALPLANYFSFV-DSGVICLTIVSDNMSGSGIGGGPAIIL 441
Query: 470 GNIQQEGIQISFDGANGFVGFGPNVC 495
GN QQ + FD N GF C
Sbjct: 442 GNYQQRNFHVEFDLKNERFGFKQQNC 467
>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 455
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 113/450 (25%), Positives = 198/450 (44%), Gaps = 58/450 (12%)
Query: 74 SDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSG-GGAD 132
+D+ + EL+H D + N+ ++ + + H R+ + ++R A V RL+ +D
Sbjct: 33 ADKFSFTAELIHID-------SPNSPFFNASETTTH-RLAKALQRSANRVARLNPLSNSD 84
Query: 133 AAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYK 192
H + + G G Y +++ +G+PP + ID+GS+++W+ C C C+
Sbjct: 85 EGVH----------ASIFSGDGNYLMKLLIGTPPTEIHAAIDTGSNVIWIPCINCKDCFN 134
Query: 193 QSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDG-SYTKGTLALETLT 251
QS +F+P S+++ C S C+ ++ C Y + G +A++T+T
Sbjct: 135 QSSSIFNPLASSTYQDAPCDSYQCETTSSSCQSDNVCLYSCDEKHQLNCPNGRIAVDTMT 194
Query: 252 I----GRTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS 306
+ GR + CG+ F G G++GLG G++SL +L + G FSYCL
Sbjct: 195 LTSSDGRPFPLPYSDFVCGNSIYKTFAG-VGVIGLGRGALSLTSKLYHLSDGKFSYCLAD 253
Query: 307 RGTGSSGSLVFGREALPVGAAWVPLVR----NPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
+ + FG ++ + + +V + R YYV L G+ VG R +DL+
Sbjct: 254 YYSKQPSKINFGLQSF-ISDDDLEVVSTTLGHHRHSGNYYVTLEGISVGEKR----QDLY 308
Query: 363 RLTQMGDD-------GVVMDTGTAVTRLPTPAYEAFRD----AFVAQTGNLPRASGVSI- 410
+ DD +++D+GT T LP Y+ A N P S
Sbjct: 309 YV----DDPFAPPVGNMLIDSGTMFTLLPKDFYDYLWSTVSYAIPENPQNHPHNSRFPFS 364
Query: 411 FDTCYNLSG----FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL 466
D LS + ++ P ++ +F+ V ++F+ +D CFAFA + G
Sbjct: 365 MDNTLKLSPCFWYYPELKFPKITIHFTDADVELSDDNSFIRVAEDV--VCFAFAATQPGQ 422
Query: 467 SII-GNIQQEGIQISFDGANGFVGFGPNVC 495
S + G+ QQ + +D G V F C
Sbjct: 423 STVYGSWQQMNFILGYDLKRGTVSFKRTDC 452
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 117/413 (28%), Positives = 177/413 (42%), Gaps = 52/413 (12%)
Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
RD R A L++ GG D + D G YF ++ +GSPPR + I
Sbjct: 33 RDRLRHARLLQGFVGGVVDFSVQGSPD---------PYLVGLYFTKVKLGSPPREFNVQI 83
Query: 174 DSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCDR-----LENAG 223
D+GSD++WV C C+ C + S FD + S++ V CS +C +
Sbjct: 84 DTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGLVHCSDPICTSAVQTTVTQCS 143
Query: 224 CHAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN----VAIGCGHKNQGMFV-- 273
+C Y Y DGS T G +TL +G ++V N + GC G
Sbjct: 144 PQTNQCSYTFQYEDGSGTSGYYVSDTLYFDAILGESLVVNSSALIVFGCSTFQSGDLTMT 203
Query: 274 --GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV 329
G+ G G G +S++ QL G T FS+CL +G G G ++ E L G +
Sbjct: 204 DKAVDGIFGFGQGELSVISQLSTHGITPRVFSHCL--KGEGIGGGILVLGEILEPGMVYS 261
Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
PLV P P Y + L + V G +PI +F + G ++D+GT + L AY
Sbjct: 262 PLV--PSQPH-YNLNLQSIAVNGKLLPIDPSVFATSN--SQGTIVDSGTTLAYLVAEAY- 315
Query: 390 AFRDAFVAQTGNLPRASGVSIF---DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
D FV+ + S I + CY +S VS P SF F+GG + L ++L
Sbjct: 316 ---DPFVSAVNVIVSPSVTPIISKGNQCYLVSTSVSQMFPLASFNFAGGASMVLKPEDYL 372
Query: 447 IPVDDAG----TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
IP + +C F G++I+G++ + +D +G+ C
Sbjct: 373 IPFGPSQGGSVMWCIGFQ-KVQGVTILGDLVLKDKIFVYDLVRQRIGWANYDC 424
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 101/379 (26%), Positives = 166/379 (43%), Gaps = 41/379 (10%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
SG G Y+ +IG+G+PP++ Y+ +D+GSDI+WV C C +C +S+ ++D
Sbjct: 76 SGRPDAVGLYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSNLGMDLTLYDIK 135
Query: 202 DSASFSGVSCSSAVCDRLEN---AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGR--- 254
+S+S V C C + GC A C Y YGDGS T G + + +
Sbjct: 136 ESSSGKFVPCDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSG 195
Query: 255 -----TVVKNVAIGCGHKNQGMFVGA-----AGLLGLGGGSMSLVGQLG--GQTGGAFSY 302
+ ++ GCG + G + G+LG G + S++ QL G+ F++
Sbjct: 196 DLKTDSANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAH 255
Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
CL G G G P PL+ P P Y V ++ + VG + +S D
Sbjct: 256 CL--NGVNGGGIFAIGHVVQP-KVNMTPLL--PDQPH-YSVNMTAVQVGHAFLSLSTD-- 307
Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS 422
TQ G ++D+GT + LP YE ++Q +L + + TC+ S V
Sbjct: 308 TSTQGDRKGTIIDSGTTLAYLPEGIYEPLVYKIISQHPDL-KVRTLHDEYTCFQYSESVD 366
Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS------PSGLSIIGNIQQEG 476
P V+FYF G L + ++L P D +C + S ++++G++
Sbjct: 367 DGFPAVTFYFENGLSLKVYPHDYLFPSGDF--WCIGWQNSGTQSRDSKNMTLLGDLVLSN 424
Query: 477 IQISFDGANGFVGFGPNVC 495
+ +D N +G+ C
Sbjct: 425 KLVFYDLENQVIGWTEYNC 443
>gi|388508518|gb|AFK42325.1| unknown [Lotus japonicus]
Length = 204
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 69/200 (34%), Positives = 105/200 (52%), Gaps = 4/200 (2%)
Query: 298 GAFSYCLVSRGTGSSGSLVFGREALPVGAAW-VPLVRNPRAPSFYYVGLSGLGVGGMRIP 356
FSYCL S + L+ G A A PL+ NP PSFYY+ L G+ VGG ++
Sbjct: 4 AKFSYCLTSMDDSKASVLLLGSLAKATKDAISTPLLTNPSQPSFYYLSLEGIPVGGTQLS 63
Query: 357 ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYN 416
I + +F ++ G GV++D+GT +T L ++ + F++Q+ S + D C++
Sbjct: 64 IEQSIFDVSDDGSGGVIIDSGTTITYLEKSVFDTLKKEFISQSNLQLDKSSSTGLDVCFS 123
Query: 417 L-SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQE 475
L S V VP + F+F GG L LPA +++I G C A S +G+SI GN+QQ+
Sbjct: 124 LPSETTQVEVPKLVFHFKGGD-LELPAESYMIADSKLGVACLAMGAS-NGMSIFGNVQQQ 181
Query: 476 GIQISFDGANGFVGFGPNVC 495
I ++ D + F P C
Sbjct: 182 NILVNHDLEKETISFVPTQC 201
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 110/379 (29%), Positives = 166/379 (43%), Gaps = 45/379 (11%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ------PCSQCY-----KQSDPVFDPAD 202
G Y V +G+PP+ +V+D+GS +VW C C C P++
Sbjct: 72 GGYSVIFSLGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNK 131
Query: 203 SASFSGVSCSSAVCDRL--ENAGCH-AGRCRYE-VSYGDGSYTKGTLALETLTIGR-TVV 257
S++ + C S C+ + + C RC Y + YG GS T G L + L + + +
Sbjct: 132 SSTVQSLPCRSPKCNWVFGSDLNCSTTKRCPYYGLEYGLGS-TTGQLVSDVLGLSKLNRI 190
Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR---GTGSSGS 314
+ GC + G+ G G G S+ QLG FSYCLVS T SG
Sbjct: 191 PDFLFGCSLVSNRQ---PEGIAGFGRGLASIPAQLGLT---KFSYCLVSHRFDDTPQSGD 244
Query: 315 LVFGR-----EALPVGAAWVPLVRNPR-AP--SFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
LV R +A G A+ P ++P +P +YY+ LS + VGG +PI ++
Sbjct: 245 LVLHRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSK 304
Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV---SIFDTCYNLSGFVSV 423
GD G+++D+G+ T + ++ RA + S CYN++G V
Sbjct: 305 EGDGGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLGPCYNITGQSEV 364
Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP------SGLSII-GNIQQEG 476
VP ++F F GG + LP +++ V D G C P +G +II GN QQ+
Sbjct: 365 DVPKLTFSFKGGANMDLPLTDYFSLVTD-GVVCMTVLTDPDEPGSTTGPAIILGNYQQQN 423
Query: 477 IQISFDGANGFVGFGPNVC 495
I +D GF P C
Sbjct: 424 FYIEYDLKKQRFGFKPQQC 442
>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 423
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 115/414 (27%), Positives = 179/414 (43%), Gaps = 59/414 (14%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
++RD+ R+ G + H V+ V G G Y++ + +GSPP+ ++
Sbjct: 9 LERDLSRL---------GKSSVGNHSVRFH----VGGNIYPDGLYYMALLLGSPPKLYFL 55
Query: 172 VIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH----- 225
+D+GSD+ W QC PC C +++P + V C VC +++ G +
Sbjct: 56 DMDTGSDLTWAQCDAPCRNCAIGPHGLYNPKKAKV---VDCHLPVCAQIQQGGSYECNSD 112
Query: 226 AGRCRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVAIGCGHKNQGMFVGAA----G 277
+C YEV Y DGS T G L +TLT+ G + IGCG+ QG + G
Sbjct: 113 VKQCDYEVEYADGSSTMGVLVEDTLTVRLTNGTLIQTKAIIGCGYDQQGTLAKSPASTDG 172
Query: 278 LLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALP-VGAAWVPLVRN 334
++GL ++L QL G +CL + G+ G L FG E +P G W P++
Sbjct: 173 VIGLSSSKVALPAQLAEKGIIKNVLGHCL-ADGSNGGGYLFFGDELVPSWGMTWTPMMGK 231
Query: 335 PRAPSFYYVGLSGLGVGGMRIPIS--EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
P Y L + GG + ++ EDL R T V+ D+GT+ T L AY +
Sbjct: 232 PEMLG-YQARLQSIRYGGDSLVLNNDEDLTRSTS----SVMFDSGTSFTYLVPQAYASVL 286
Query: 393 DAFVAQTGNLPRASGVSIFDTCYN-LSGFVSV-------RVPTVSF----YFSGGPVLTL 440
A Q+G L R + C+ S F S+ + T+ F +F+ L L
Sbjct: 287 SAVTKQSG-LLRVKSDTTLPYCWRGPSPFQSITDVHQYFKTLTLDFGGRNWFATDSTLDL 345
Query: 441 PASNFLIPVDDAGTFCF----AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
+LI V G C A S +IIG++ G + +D +G+
Sbjct: 346 SPQGYLI-VSTQGNVCLGILDASGASLEVTNIIGDVSMRGYLVVYDNVRDRIGW 398
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 122/433 (28%), Positives = 193/433 (44%), Gaps = 46/433 (10%)
Query: 98 NMHYHR-----HQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQG 152
+H R H+ +RD R + +++ GG D VQ + G G
Sbjct: 28 TLHLERGVPASHKLKLSQLKERDRVRHSRMLQSSGGGVVD---FPVQGTFDPFLVGFYFG 84
Query: 153 S--GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD---PV--FDPADSAS 205
S Y+ R+ +GSPPR Y+ ID+GSD++WV C C+ C S P+ FDP S +
Sbjct: 85 SFCRLYYTRLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPT 144
Query: 206 FSGVSCSSAVCD---RLENAGCHA--GRCRYEVSYGDGSYTKGTLALETL----TIGRTV 256
S +SCS C + ++ C A +C Y YGDGS T G + L +G +V
Sbjct: 145 ASLISCSDQRCSLGLQSSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSV 204
Query: 257 VKN----VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVS 306
+KN + GC G G+ G G MS++ QL Q T FS+CL
Sbjct: 205 MKNSSAPIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCL-- 262
Query: 307 RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
+G S G ++ E + + PLV P P Y + L + V G + I +F +
Sbjct: 263 KGDDSGGGILVLGEIVEPNIVYTPLV--PSQPH-YNLNLQSIYVNGQTLAIDPSVFATSS 319
Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
+ G ++D+GT + L AY+ F A + T + + +S + CY S ++ P
Sbjct: 320 --NQGTIIDSGTTLAYLTEAAYDPFISA-ITSTVSPSVSPYLSKGNQCYLTSSSINDVFP 376
Query: 427 TVSFYFSGGPVLTLPASNFLI---PVDDAGTFCFAFAP-SPSGLSIIGNIQQEGIQISFD 482
VS F+GG + L ++LI ++ A +C F ++I+G++ + +D
Sbjct: 377 QVSLNFAGGTSMILIPQDYLIQQSSINGAALWCVGFQKIQGQEITILGDLVLKDKIFVYD 436
Query: 483 GANGFVGFGPNVC 495
A +G+ C
Sbjct: 437 IAGQRIGWANYDC 449
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 124 bits (312), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 99/328 (30%), Positives = 152/328 (46%), Gaps = 33/328 (10%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
+G +G YF +IG+G+P + Y+ +D+GSDI+WV C C +C +SD ++D
Sbjct: 146 NGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMK 205
Query: 202 DSASFSGVSCSSAVCDRLEN--AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGR---- 254
S + V C C + GC G +C Y V YGDGS T G + + R
Sbjct: 206 ASTTSDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGN 265
Query: 255 --TVVKN--VAIGCGHKNQGMFVGAA----GLLGLGGGSMSLVGQLG--GQTGGAFSYCL 304
T N V GCG+K G ++ G+LG G + S++ QL G+ FS+CL
Sbjct: 266 FQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL 325
Query: 305 VSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL 364
+ G G G E + PLV+N + Y V + + VGG + + D F
Sbjct: 326 DNVDGG--GIFAIG-EVVEPKVNITPLVQN---QAHYNVVMKEIEVGGDPLDVPSDAF-- 377
Query: 365 TQMGD-DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
+ GD G ++D+GT + P Y + ++Q +L R V TC++ +G V
Sbjct: 378 -ESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDL-RLHTVEQAFTCFDYTGNVDD 435
Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDD 451
PTV+ +F LT+ +L V +
Sbjct: 436 GFPTVTLHFDKSISLTVYPHEYLFQVKE 463
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 113/372 (30%), Positives = 166/372 (44%), Gaps = 43/372 (11%)
Query: 157 FVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC 216
V + VG+PP++ MVID+GS++ W+ C + Y + FDP S S+ + CSS C
Sbjct: 32 IVSLTVGTPPQNVSMVIDTGSELSWLHCNK-TLSYPTT---FDPTRSTSYQTIPCSSPTC 87
Query: 217 -DRLEN----AGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK--- 267
+R ++ A C + C +SY D S + G LA + IG + + + GC
Sbjct: 88 TNRTQDFPIPASCDSNNLCHATLSYADASSSDGNLASDVFHIGSSDISGLVFGCMDSVFS 147
Query: 268 -NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP--V 324
N + GL+G+ GS+S V QLG FSYC+ GT SG L+ G L V
Sbjct: 148 SNSDEDSKSTGLMGMNRGSLSFVSQLGFP---KFSYCI--SGTDFSGLLLLGESNLTWSV 202
Query: 325 GAAWVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
+ PL++ + P F Y V L G+ V +PI + F G ++D+GT
Sbjct: 203 PLNYTPLIQISTPLPYFDRVAYTVQLEGIKVLDKLLPIPKSTFEPDHTGAGQTMVDSGTQ 262
Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCY--NLSGFVSVRVPTVSFY 431
T L P Y A R AF+ QT ++ R F D CY LS V +PTV+
Sbjct: 263 FTFLLGPVYNALRSAFLNQTSSVLRVLEDPDFVFQGAMDLCYLVPLSQRVLPLLPTVTLV 322
Query: 432 FSGGPVLTLPASNFLIPVD-----DAGTFCFAFAPSP---SGLSIIGNIQQEGIQISFDG 483
F G +T+ L V + C +F S +IG+ Q+ + + FD
Sbjct: 323 FRGAE-MTVSGDRVLYRVPGELRGNDSVHCLSFGNSDLLGVEAYVIGHHHQQNVWMEFDL 381
Query: 484 ANGFVGFGPNVC 495
+G C
Sbjct: 382 EKSRIGLAQVRC 393
>gi|242095592|ref|XP_002438286.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
gi|241916509|gb|EER89653.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
Length = 495
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 113/381 (29%), Positives = 174/381 (45%), Gaps = 44/381 (11%)
Query: 144 DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC------SQCYKQSDPV 197
+++S + G EY V G G+P + + D S + ++C+PC + D
Sbjct: 127 NIISSL-PGVFEYTVLAGYGTPAQQLPLFFDV-SGMSNMRCKPCFSGSSGGETTTTCDVA 184
Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGRTV 256
FDP+ S+SF V C S C C AG C + + + GT+ ++TLT+ +
Sbjct: 185 FDPSMSSSFRSVLCGSPDCG---GHSCSAGGSCTFTLQNSTFVFGNGTIVMDTLTLSPSA 241
Query: 257 V-KNVAIGCGHKNQGMFVG--AAGLLGLGGGSMSLVGQLGGQTG---GAFSYCLVSRGTG 310
+N A+GC + +F A G + L SL ++ + AFSYCL + T
Sbjct: 242 TFENFAVGCMQLDNDLFTDGVAVGNIDLSLSRHSLATRVLNSSPPGMAAFSYCLPAD-TD 300
Query: 311 SSGSLVFGREALP-----VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
+ G L AL G +VPLV NP P+FYYV L + + G +PI LF
Sbjct: 301 THGFLTIA-PALSDYSDHAGVKYVPLVTNPTGPNFYYVDLVAIAINGEDLPIPPALFT-- 357
Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAF---VAQTGNLPRASGVSIFDTCYNLSGFVS 422
+G ++D+ +A T L P Y A RD F + Q +P G+ DTCYN + +
Sbjct: 358 ---GNGTMIDSQSAFTYLNPPIYAALRDEFRKAMLQYQPVPAFGGL---DTCYNFTLAEN 411
Query: 423 VRVPTVSFYFSGGPVLTLPASNFLI----PVDDAGTF-CFAFAPSPS---GLSIIGNIQQ 474
+ +P ++ FS G + L F+ + D F C AFA +P + +G+ Q
Sbjct: 412 IYLPDITLRFSNGETMDLDDRQFMYFFREHLTDGFPFGCLAFAAAPDQNFPWNYLGSQVQ 471
Query: 475 EGIQISFDGANGFVGFGPNVC 495
+I +D G V F P+ C
Sbjct: 472 RTKEIVYDVRGGMVAFVPSRC 492
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 111/369 (30%), Positives = 167/369 (45%), Gaps = 35/369 (9%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
G YF ++ +G+PP + ID+GSDI+WV C C+ C + S FD + S+S S
Sbjct: 77 GLYFTKVKLGTPPMEFTVQIDTGSDILWVNCNSCNGCPRSSGLGIQLNFFDASSSSSSSL 136
Query: 209 VS-----CSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALET----LTIGRTVVKN 259
VS C+SA + +C Y YGDGS T G E+ + +G++++ N
Sbjct: 137 VSCSDPICNSAFQTTATQCLTQSNQCSYTFQYGDGSGTSGYYVSESMYFDMVMGQSMIAN 196
Query: 260 ----VAIGCGHKNQGMFVGA----AGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGT 309
V GC G + G+ G G G +S++ QL G T FS+CL G
Sbjct: 197 SSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCLKGEGN 256
Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
G G LV G E L G + PLV P P Y + L + V G +PI +F + +
Sbjct: 257 G-GGILVLG-EVLEPGIVYSPLV--PSQPH-YNLYLQSISVNGQTLPIDPSVFATSI--N 309
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVS 429
G ++D+GT + L AY F A A +S + CY +S V P VS
Sbjct: 310 RGTIIDSGTTLAYLVEEAYTPFVSAITAAVSQ-SVTPTISKGNQCYLVSTSVGEIFPLVS 368
Query: 430 FYFSGGPVLTLPASNFLIPV---DDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
F+G + L +L+ + D A +C F G++I+G++ + +D A
Sbjct: 369 LNFAGSASMVLKPEEYLMHLGFYDGAALWCIGFQKVQEGVTILGDLVMKDKIFVYDLARQ 428
Query: 487 FVGFGPNVC 495
+G+ C
Sbjct: 429 RIGWASYDC 437
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 104/372 (27%), Positives = 172/372 (46%), Gaps = 40/372 (10%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
G Y+ ++ +G+PP + ID+GSD++WV C CS C + S FDP S++ S
Sbjct: 73 GLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSM 132
Query: 209 VSCSSAVCD---RLENAGCHA--GRCRYEVSYGDGSYTKG-----TLALETLTIGRTVVK 258
++CS C+ + +A C + +C Y YGDGS T G + L T+ G
Sbjct: 133 IACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTN 192
Query: 259 N---VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGT 309
+ V GC ++ G G+ G G MS++ QL Q FS+CL +G
Sbjct: 193 STAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCL--KGD 250
Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
S G ++ E + + LV P P Y + L + V G + I +F +
Sbjct: 251 SSGGGILVLGEIVEPNIVYTSLV--PAQPH-YNLNLQSIAVNGQTLQIDSSVFATSN--S 305
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA--SGVSIFDTCYNLSGFVSVRVPT 427
G ++D+GT + L AY+ F A T ++P++ + VS + CY ++ V+ P
Sbjct: 306 RGTIVDSGTTLAYLAEEAYDPFVSAI---TASIPQSVHTVVSRGNQCYLITSSVTEVFPQ 362
Query: 428 VSFYFSGGPVLTLPASNFLI---PVDDAGTFCFAFAP-SPSGLSIIGNIQQEGIQISFDG 483
VS F+GG + L ++LI + A +C F G++I+G++ + + +D
Sbjct: 363 VSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDL 422
Query: 484 ANGFVGFGPNVC 495
A +G+ C
Sbjct: 423 AGQRIGWANYDC 434
>gi|413936472|gb|AFW71023.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
Length = 289
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/277 (33%), Positives = 140/277 (50%), Gaps = 32/277 (11%)
Query: 228 RCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKN---QGMFVGAAGLLGLGG 283
+C + +SY DG+ T G + + LT+ +V+N GCGH +G+F G+LGLG
Sbjct: 36 QCGFAISYADGTSTVGAYSQDKLTLAPGAIVQNFYFGCGHGKHAVRGLF---DGVLGLG- 91
Query: 284 GSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYV 343
L LG + GG FSYCL S + G L G P G + P+ P P+F V
Sbjct: 92 ---RLRESLGARYGGVFSYCLPSVSS-KPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTV 147
Query: 344 GLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF---VAQTG 400
L+G+ VGG ++ + F G+++D+GT +T L + AY A R AF +
Sbjct: 148 TLAGINVGGKKLDLRPSAF------SGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYR 201
Query: 401 NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA 460
LP DTCYNL+G+ +V VP ++ F+GG + L N ++ V+ C AFA
Sbjct: 202 LLPNGD----LDTCYNLTGYKNVVVPKIALTFTGGATINLDVPNGIL-VNG----CLAFA 252
Query: 461 PS-PSGLS-IIGNIQQEGIQISFDGANGFVGFGPNVC 495
S P G + ++GN+ Q ++ FD + GF C
Sbjct: 253 ESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 289
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 104/379 (27%), Positives = 172/379 (45%), Gaps = 40/379 (10%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
+G+ +G YF ++G+GSPP+ Y+ +D+GSDI+WV C CS+C ++SD ++DP
Sbjct: 61 NGLPTETGLYFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPRKSDLGIDLTLYDPK 120
Query: 202 DSASFSGVSCSSAVCDRLENA---GCHAG-RCRYEVSYGDGSYTKGTLALETLTIG---- 253
S + +SC C + GC + C Y ++YGDGS T G + LT
Sbjct: 121 GSETSELISCDQEFCSATYDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNHVND 180
Query: 254 --RTVVKNVAI--GCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQLG--GQTGGAFSY 302
RT +N +I GCG G ++ G++G G + S++ QL G+ FS+
Sbjct: 181 NLRTAPQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSH 240
Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
CL + G G G P + PLV PR + Y V L + V + + D+F
Sbjct: 241 CLDNIRGG--GIFAIGEVVEP-KVSTTPLV--PRM-AHYNVVLKSIEVDTDILQLPSDIF 294
Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS 422
G ++D+GT + LP Y+ +A+ L F +C+ +G V
Sbjct: 295 --DSGNGKGTIIDSGTTLAYLPAIVYDELIPKVMARQPRLKLYLVEQQF-SCFQYTGNVD 351
Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS------GLSIIGNIQQEG 476
P V +F LT+ ++L D G +C + S + ++++G++
Sbjct: 352 RGFPVVKLHFEDSLSLTVYPHDYLFQFKD-GIWCIGWQKSVAQTKNGKDMTLLGDLVLSN 410
Query: 477 IQISFDGANGFVGFGPNVC 495
+ +D N +G+ C
Sbjct: 411 KLVIYDLENMAIGWTDYNC 429
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 98/360 (27%), Positives = 156/360 (43%), Gaps = 31/360 (8%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+G Y R+ +G+P + +++DSGS + +V C C QC DP F P S+++S V C+
Sbjct: 88 NGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQDPRFQPDLSSTYSPVKCN 147
Query: 213 -SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT---VVKNVAIGCGHKN 268
CD +C YE Y + S + G L + ++ G+ + GC +
Sbjct: 148 VDCTCDN------ERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKPQRAVFGCENTE 201
Query: 269 QGMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
G A G++GLG G +S++ QL G +FS C G G++V G +P
Sbjct: 202 TGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVG-GGTMVLG--GMPA 258
Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
V NP +Y + L + V G + + +F G V+D+GT LP
Sbjct: 259 PPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFN----SKHGTVLDSGTTYAYLP 314
Query: 385 TPAYEAFRDAFVAQTGNLPRASG--VSIFDTCY-----NLSGFVSVRVPTVSFYFSGGPV 437
A+ AF+DA + +L + G + D C+ N+S V P V F G
Sbjct: 315 EQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEV-FPDVDMVFGNGQK 373
Query: 438 LTLPASNFLIPVDDA-GTFCF-AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L+L N+L G +C F +++G I +++D N +GF C
Sbjct: 374 LSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 433
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 102/353 (28%), Positives = 161/353 (45%), Gaps = 38/353 (10%)
Query: 111 RMQRDVKRVATLVRRLSGGGADA-AKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQ 169
R + R+A +RR G GA A+ + D D+++ +G Y R+ +G+PP+
Sbjct: 51 RSYPNASRLAASLRRGLGDGAHPNARMRLHD---DLLT-----NGYYTTRLYIGTPPQEF 102
Query: 170 YMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS-SAVCDRLENAGCHAGR 228
+++DSGS + +V C C QC DP F P S+S+S V C+ CD + +
Sbjct: 103 ALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKCNVDCTCDSDKK------Q 156
Query: 229 CRYEVSYGDGSYTKGTLALETLTIGRT---VVKNVAIGCGHKNQGMFVG--AAGLLGLGG 283
C YE Y + S + G L + ++ GR + GC + G A G++GLG
Sbjct: 157 CTYERQYAEMSSSSGVLGEDIVSFGRESELKAQRAVFGCENSETGDLFSQHADGIMGLGR 216
Query: 284 GSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFY 341
G +S++ QL G +FS C G G++V G +P + V +P +Y
Sbjct: 217 GQLSIMDQLVEKGVINDSFSLCYGGMDIG-GGAMVLG--GVPTPSDMVFSRSDPLRSPYY 273
Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
+ L + V G + + +F G V+D+GT LP A+ AF+DA ++ +
Sbjct: 274 NIELKEIHVAGKALRVDSRIFDSKH----GTVLDSGTTYAYLPEQAFMAFKDAVTSKVHS 329
Query: 402 LPRASG--VSIFDTCY-----NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
L + G S D C+ N+S V P V F G L+L N+L
Sbjct: 330 LKKIRGPDPSYKDICFAGARRNVSKLHEV-FPDVDMVFGNGQKLSLTPENYLF 381
>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
Length = 367
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 71/191 (37%), Positives = 107/191 (56%), Gaps = 12/191 (6%)
Query: 109 HARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVV--SGMDQGSGEYFVRIGVGSPP 166
H ++R ++R RL+G G A+ E VV + + GEY V++G+G+PP
Sbjct: 45 HELLRRAIQRSRY---RLAGIGM--ARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPP 99
Query: 167 RSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC-- 224
ID+ SD++W QCQPC+ CY Q DP+F+P S++++ + CSS CD L+ C
Sbjct: 100 YKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGH 159
Query: 225 -HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG--MFVGAAGLLGL 281
C+Y +Y + T+GTLA++ L IG + VA GC + G A+G++GL
Sbjct: 160 DDDESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRGVAFGCSTSSTGGAPPPQASGVVGL 219
Query: 282 GGGSMSLVGQL 292
G G +SLV QL
Sbjct: 220 GRGPLSLVSQL 230
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 51/195 (26%), Positives = 83/195 (42%), Gaps = 13/195 (6%)
Query: 306 SRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
+ GT + LV G +A A AP G+ GLG G + + + R
Sbjct: 177 TEGTLAVDKLVIGEDAFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRY- 235
Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLS---GFV 421
G+++D + +T L Y+ + + LPR +G S+ D C+ L F
Sbjct: 236 -----GMIIDIASTITFLEASLYDELVNDLEVEI-RLPRGTGSSLGLDLCFILPDGVAFD 289
Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQIS 480
V VP V+ F G L L + ++G C + +G +SI+GN QQ+ +Q+
Sbjct: 290 RVYVPAVALAFDGR-WLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVL 348
Query: 481 FDGANGFVGFGPNVC 495
++ G V F + C
Sbjct: 349 YNLRRGRVTFVQSPC 363
>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
Length = 346
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 99/360 (27%), Positives = 159/360 (44%), Gaps = 40/360 (11%)
Query: 160 IGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDP---VFDPADSASFSGVSCSSAV 215
I +G+PP + ID+GS + WVQC+ C +CY Q+ +F+P +S+++S V CS+
Sbjct: 3 ISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGCSTEA 62
Query: 216 CDRLE-----NAGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHK 267
C+ + GC C Y + YG G Y+ G L + LT+ + N GCG
Sbjct: 63 CNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNRSIDNFIFGCGED 122
Query: 268 NQGMFVGA-AGLLGLGGGSMSLVGQLGGQTG-GAFSYCLVSRGTGSSGSLVFGREALPVG 325
N ++ G AG++G G S S Q+ QT AFSYC R + GSL G A +
Sbjct: 123 N--LYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCF-PRDHENEGSLTIGPYARDIN 179
Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
W L+ P+ Y + + V G+R+ I ++ +++M ++D+GTA T + +
Sbjct: 180 LMWTKLIYYDHKPA-YAIQQLDMMVNGIRLEIDPYIY-ISKM----TIVDSGTADTYILS 233
Query: 386 PAYEAFRDAFVAQTGNLPRASGVSIFDTCY-------NLSGFVSVRVPTVSFYFSGGPVL 438
P ++A A + G C+ N + F +V + + L
Sbjct: 234 PVFDALDKAMTKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKLIR------STL 287
Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPS---GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
LP N + C F P + G+ ++GN ++ FD GF C
Sbjct: 288 KLPVENAFYESSN-NVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMNFGFKARAC 346
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 98/322 (30%), Positives = 149/322 (46%), Gaps = 33/322 (10%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFS 207
+G YF +IG+G+P + Y+ +D+GSDI+WV C C +C +SD ++D S +
Sbjct: 71 AGLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSD 130
Query: 208 GVSCSSAVCDRLEN--AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGR------TVVK 258
V C C + GC G +C Y V YGDGS T G + + R T
Sbjct: 131 AVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPT 190
Query: 259 N--VAIGCGHKNQGMFVGAA----GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTG 310
N V GCG+K G ++ G+LG G + S++ QL G+ FS+CL + G
Sbjct: 191 NGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGG 250
Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD- 369
G G P PLV+N + Y V + + VGG + + D F + GD
Sbjct: 251 --GIFAIGEVVEP-KVNITPLVQN---QAHYNVVMKEIEVGGDPLDVPSDAF---ESGDR 301
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVS 429
G ++D+GT + P Y + ++Q +L R V TC++ +G V PTV+
Sbjct: 302 KGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDL-RLHTVEQAFTCFDYTGNVDDGFPTVT 360
Query: 430 FYFSGGPVLTLPASNFLIPVDD 451
+F LT+ +L V +
Sbjct: 361 LHFDKSISLTVYPHEYLFQVKE 382
>gi|125606590|gb|EAZ45626.1| hypothetical protein OsJ_30294 [Oryza sativa Japonica Group]
Length = 431
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 110/367 (29%), Positives = 169/367 (46%), Gaps = 42/367 (11%)
Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFS 207
G+ + E V +G+G+P + +V D+ SD++W QCQPC C Q+ ++DP + +++
Sbjct: 80 GVQEKHVEPHVFLGIGTPAMNVTLVFDTTSDLLWTQCQPCLSCVAQAGDMYDPNKTETYA 139
Query: 208 GVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK 267
++ SS Y +Y S+T G A ET +G V N+ GCG +
Sbjct: 140 NLTSSS-----------------YNYTYSKQSFTSGYFATETFALGNVTVANITFGCGTR 182
Query: 268 NQGMFVGAA---GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG------ 318
NQG + A G+ G G +SL+ QLG FSYC S G S ++ G
Sbjct: 183 NQGYYDNVAGVFGVGRGGRGGVSLLNQLGIDR---FSYCFSSSGAPGSSAVFLGGSPELA 239
Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
A AA P+V +P S Y+V L G+ VG + ++ + G +V+D+ +
Sbjct: 240 TNATTTPAASTPMVADPVLKSGYFVKLVGVTVGATLVDVAGA--SSAEGGGRALVIDSTS 297
Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRA-----SGVSIFDTCYNLSGFVSVRVP---TVSF 430
VT L Y R A VAQ L A +GV + D C+ L+ + P T++
Sbjct: 298 PVTVLDEATYGPVRRALVAQLAPLKEANANASAGVGL-DLCFELAAGGATPTPPNVTMTL 356
Query: 431 YFSGGPV-LTLPASNFLIPVDDAGTFCFAFAPSPS-GLSIIGNIQQEGIQISFDGANGFV 488
+F GG L LP +++L G C PS S G+ ++G+ + +D A V
Sbjct: 357 HFDGGAADLVLPPASYLAKDSAGGLICLTMTPSSSNGVPVLGSWALLDTLVLYDLAKNVV 416
Query: 489 GFGPNVC 495
F P C
Sbjct: 417 SFQPLDC 423
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 98/324 (30%), Positives = 150/324 (46%), Gaps = 33/324 (10%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
+G +G YF +IG+G+P + Y+ +D+GSDI+WV C C +C +SD ++D
Sbjct: 146 NGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMK 205
Query: 202 DSASFSGVSCSSAVCDRLEN--AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGR---- 254
S + V C C + GC G +C Y V YGDGS T G + + R
Sbjct: 206 ASTTSDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGN 265
Query: 255 --TVVKN--VAIGCGHKNQGMFVGAA----GLLGLGGGSMSLVGQLG--GQTGGAFSYCL 304
T N V GCG+K G ++ G+LG G + S++ QL G+ FS+CL
Sbjct: 266 FQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL 325
Query: 305 VSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL 364
+ G G G E + PLV+N + Y V + + VGG + + D F
Sbjct: 326 DNVDGG--GIFAIG-EVVEPKVNITPLVQN---QAHYNVVMKEIEVGGDPLDVPSDAF-- 377
Query: 365 TQMGD-DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
+ GD G ++D+GT + P Y + ++Q +L R V TC++ +G V
Sbjct: 378 -ESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDL-RLHTVEQAFTCFDYTGNVDD 435
Query: 424 RVPTVSFYFSGGPVLTLPASNFLI 447
PTV+ +F LT+ +L
Sbjct: 436 GFPTVTLHFDKSISLTVYPHEYLF 459
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 108/369 (29%), Positives = 169/369 (45%), Gaps = 35/369 (9%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD---PV--FDPADSASFSG 208
G YF R+ +GSPP+ Y+ ID+GSD++WV C C+ C + S P+ FDP S++ S
Sbjct: 66 GLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASL 125
Query: 209 VSCSSAVCD---RLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLT----IGRTVVK- 258
+SCS C + +AGC +C Y YGDGS T G + L +G +V
Sbjct: 126 ISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNS 185
Query: 259 --NVAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTG 310
++ GC G G+ G G MS++ Q+ Q T FS+CL G G
Sbjct: 186 SASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGG 245
Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
++ E + + PLV P P Y + L + V G + I ++F + +
Sbjct: 246 GGILVL--GEIVEEDIVYSPLV--PSQPH-YNLNLQSISVNGKSLAIDPEVFATST--NR 298
Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSF 430
G ++D+GT + L AY+ F A R +S CY ++ V PTVS
Sbjct: 299 GTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPL-LSKGTQCYLITSSVKGIFPTVSL 357
Query: 431 YFSGGPVLTLPASNFLI---PVDDAGTFCFAFAP-SPSGLSIIGNIQQEGIQISFDGANG 486
F+GG + L ++L+ + DA +C F G++I+G++ + +D A
Sbjct: 358 NFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVYDLAGQ 417
Query: 487 FVGFGPNVC 495
+G+ C
Sbjct: 418 RIGWANYDC 426
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 102/366 (27%), Positives = 164/366 (44%), Gaps = 44/366 (12%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV--FDPADSASFSGVSCSSAV 215
+ + +G+PP++Q MV+D+GS + W+ QC+K+ P FDP+ S++FS + C+ +
Sbjct: 77 INLPIGTPPQTQPMVLDTGSQLSWI------QCHKKQPPTASFDPSLSSTFSILPCTHPL 130
Query: 216 C-----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTV-VKNVAIGCGHKN 268
C D C R C Y Y DG+Y +G L E T R+V + +GC ++
Sbjct: 131 CKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSRSVSTPPLILGCATES 190
Query: 269 QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----GTGSSGSLVFGREALPV 324
G+LG+ G +S Q FSYC+ R G +GS G
Sbjct: 191 ----TDPRGILGMNLGRLSFAKQ---SKITKFSYCVPPRQTRPGFTPTGSFYLGNNPSSK 243
Query: 325 GAAWVPLVRNP--RAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
G +V ++ + R P+F Y + + G+ + G ++ IS +FR G ++D+G+
Sbjct: 244 GFKYVGMMTSSRQRMPNFDPLAYTIPMVGIRIAGKKLNISPAVFRADAGGSGQTMIDSGS 303
Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF----DTCYNLSGFVSV--RVPTVSFYF 432
T L + AY+ R V G PR ++ D C++ V + + + F F
Sbjct: 304 EFTYLVSEAYDKVRAQVVRAVG--PRLKKGYVYGGVADMCFDSVKAVEIGRLIGEMVFEF 361
Query: 433 SGGPVLTLPASNFLIPVDDAGTFCFAFAPSP---SGLSIIGNIQQEGIQISFDGANGFVG 489
G + +P L V G C S + +IIGN Q+ + + FD VG
Sbjct: 362 ERGVEVVIPKERVLADV-GGGVHCVGIGSSDKLGAASNIIGNFHQQNLWVEFDLVRRRVG 420
Query: 490 FGPNVC 495
FG C
Sbjct: 421 FGKADC 426
>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
SURVIVAL 1; Flags: Precursor
gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 453
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 115/375 (30%), Positives = 166/375 (44%), Gaps = 58/375 (15%)
Query: 165 PPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV--FDPADSASFSGVSCSSAVC-----D 217
PP++ MVID+GS++ W++C S +PV FDP S+S+S + CSS C D
Sbjct: 82 PPQNISMVIDTGSELSWLRCNRSSN----PNPVNNFDPTRSSSYSPIPCSSPTCRTRTRD 137
Query: 218 RLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQG----M 271
L A C + + C +SY D S ++G LA E G T N+ GC G
Sbjct: 138 FLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEE 197
Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP--VGAAWV 329
GLLG+ GS+S + Q+G FSYC +S G L+ G +
Sbjct: 198 DTKTTGLLGMNRGSLSFISQMGFP---KFSYC-ISGTDDFPGFLLLGDSNFTWLTPLNYT 253
Query: 330 PLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
PL+R + P F Y V L+G+ V G +PI + + G ++D+GT T L
Sbjct: 254 PLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFLL 313
Query: 385 TPAYEAFRDAFVAQTGNL------PRASGVSIFDTCYNLSGF-----VSVRVPTVSFYF- 432
P Y A R F+ +T + P D CY +S + R+PTVS F
Sbjct: 314 GPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVSLVFE 373
Query: 433 ------SGGPVLTLPASNFLIP---VDDAGTFCFAFAPSP-SGLS--IIGNIQQEGIQIS 480
SG P+L + +P V + +CF F S G+ +IG+ Q+ + I
Sbjct: 374 GAEIAVSGQPLL------YRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIE 427
Query: 481 FDGANGFVGFGPNVC 495
FD +G P C
Sbjct: 428 FDLQRSRIGLAPVEC 442
>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 481
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 108/402 (26%), Positives = 164/402 (40%), Gaps = 67/402 (16%)
Query: 155 EYFVRIGVGS-PPRSQYMVIDSGSDIVWVQCQP--CSQCYKQSDPVFDPADSASFSGVSC 211
+Y + +GS PP+ + +D+GSD+VW C P C C + + VSC
Sbjct: 74 DYTLSFNLGSNPPQLITLYMDTGSDLVWFPCSPFECILCEGKPQTTKPANITKQTHSVSC 133
Query: 212 SSA-------------VC-------DRLENAGCHAGRCR-YEVSYGDGSYTKGTLALETL 250
S +C D +E + C + C + +YGDGS+ L +TL
Sbjct: 134 QSPACSAAHASMSSSNLCAISRCPLDYIETSDCSSFSCPPFYYAYGDGSFV-ANLYQQTL 192
Query: 251 TIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGG---QTGGAFSYCLVSR 307
++ ++N GC H G+ G G G +SL QL G FSYCLVS
Sbjct: 193 SLSSLHLQNFTFGCAHT---ALAEPTGVAGFGRGILSLPAQLSTLSPHLGNRFSYCLVSH 249
Query: 308 G-----TGSSGSLVFGREALPVGAA---------WVPLVRNPRAPSFYYVGLSGLGVGGM 353
L+ GR + A + ++ NP+ P +Y VGL+G+ VG
Sbjct: 250 SFDGDRLRRPSPLILGRHNDTITGAGDGESVEFVYTSMLSNPKHPYYYCVGLAGISVGKR 309
Query: 354 RIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL-PRASGVSI-- 410
+P E L R+ + G+ G+V+D+GT T LP Y A + F + RAS +
Sbjct: 310 TVPAPEILKRVDEKGNGGMVVDSGTTFTMLPESFYNAVVNEFDKRVNRFHKRASEIETKT 369
Query: 411 -FDTCYNLSGFVSVRVPTVSFYFSGGPV-LTLPASNFLIPVDDAG--------TFCFAFA 460
CY L+G ++P + +F G + LP N+ D G C
Sbjct: 370 GLGPCYYLNGL--SQIPVLKLHFVGNNSDVVLPRKNYFYEFMDGGDGIRRKGKVGCMMLM 427
Query: 461 PSPSGLSI-------IGNIQQEGIQISFDGANGFVGFGPNVC 495
+ +GN QQ+G ++ +D VGF C
Sbjct: 428 NGEDETELDGGPGATLGNYQQQGFEVVYDLEKERVGFAKKEC 469
>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
Length = 407
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 101/380 (26%), Positives = 165/380 (43%), Gaps = 60/380 (15%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ----PCSQCYKQSDPVFDPADSASFSG 208
+G ++V + +G P + ++ ID+GS++ W++C PC C K P++ P
Sbjct: 37 TGHFYVTMNIGEPAKPYFLDIDTGSNLTWIKCHATPGPCKTCNKVPHPLYRPKKL----- 91
Query: 209 VSCSSAVCDRLEN-----AGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA 261
V C+ +CD L C +C Y+++Y DG+ + G L L+ ++ +N+A
Sbjct: 92 VPCADPLCDALHKDLGTTKDCREEPDQCHYQINYADGTTSLGVLLLDKFSLPTGSARNIA 151
Query: 262 IGCGH-------KNQGMFVGAAGLLGLGGGSMSLVGQL---GGQTGGAFSYCLVSRGTGS 311
GCG+ K V G+LGLG GS+ LV QL G + +CL S+G
Sbjct: 152 FGCGYDQMQGPKKKAPEKVPVDGILGLGRGSVDLVSQLKHSGAVSKNVIGHCLSSKG--- 208
Query: 312 SGSLVFGREALPVGAAWVPLVRN-PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
G L G E +P + + R P+ Y G + L +G R PI F+
Sbjct: 209 GGYLFIGEENVPSSHLHIIYIYCISREPNHYSPGQATLHLG--RNPIGTKPFK------- 259
Query: 371 GVVMDTGTAVTRLPTPAY----EAFRDAFVAQTGNLPRAS---------GVSIFDTCYNL 417
+ D+G+ T LP + A + + + + L + G F T ++L
Sbjct: 260 -AIFDSGSTYTYLPENLHAQLVSALKASLIKSSLKLVSDTDTRLHLCWKGPKPFKTVHDL 318
Query: 418 SG-FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS-GLSIIGNIQQE 475
F S+ V+ F G +T+P N+LI + G CF P L +IG I +
Sbjct: 319 PKEFKSL----VTLKFDHGVTMTIPPENYLI-ITGHGNACFGILELPGYDLFVIGGISMQ 373
Query: 476 GIQISFDGANGFVGFGPNVC 495
+ D G + + P+ C
Sbjct: 374 EQLVIHDNEKGRLAWMPSPC 393
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 107/379 (28%), Positives = 171/379 (45%), Gaps = 44/379 (11%)
Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPAD 202
G+ +G Y+ IG+G+P + Y+ +D+GSDI+WV C C +C ++S ++DP D
Sbjct: 81 GLPTDTGLYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKD 140
Query: 203 SASFSGVSCSSAVCDRLENA---GCHAGR-CRYEVSYGDGSYTKGTLALETLTI------ 252
S++ S VSC C GC C Y V+YGDGS T G + L
Sbjct: 141 SSTGSKVSCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGD 200
Query: 253 GRTVVKN--VAIGCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQL--GGQTGGAFSYC 303
G+T N V GCG + QG +G++ G++G G + S++ QL G+ F++C
Sbjct: 201 GQTRPANSTVTFGCGSQ-QGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHC 259
Query: 304 LVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
L + G G G P PLV P P Y V L + VGG + + +F
Sbjct: 260 LDTINGG--GIFAIGNVVQP-KVKTTPLV--PNMPH-YNVNLKSIDVGGTALKLPSHMFD 313
Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
+ G ++D+GT +T LP Y+ A A+ ++ + V F C+ G V
Sbjct: 314 TGE--KKGTIIDSGTTLTYLPEIVYKEIMLAVFAKHKDITFHN-VQEF-LCFQYVGRVDD 369
Query: 424 RVPTVSFYFSGG-PVLTLPASNFLIPVDDAGTFCFAF------APSPSGLSIIGNIQQEG 476
P ++F+F P+ P F D+ +C F + G+ ++G++
Sbjct: 370 DFPKITFHFENDLPLNVYPHDYFFENGDNL--YCVGFQNGGLQSKDGKGMVLLGDLVLSN 427
Query: 477 IQISFDGANGFVGFGPNVC 495
+ +D N +G+ C
Sbjct: 428 KLVVYDLENQVIGWTEYNC 446
>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
Length = 450
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 99/382 (25%), Positives = 161/382 (42%), Gaps = 47/382 (12%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ---PCSQC-YKQSD----PVFDPADSAS 205
G + + + G+PP+ ++D+GSD+VW C C+ C + +D P+FDP S+S
Sbjct: 76 GGHSISLSFGTPPQKLSFLVDTGSDVVWAPCTTDYTCTNCSFSAADPKKVPIFDPKLSSS 135
Query: 206 FSGVSCSSAVCDRLENAGCHAG-------------RCRYEVSYGDGSYTKGTLALETLTI 252
+ C + C H G C Y YG G+ + G LE L
Sbjct: 136 SKILDCRNPKCVSTYFPYVHLGCPRCNGNSKHCSYACPYSTQYGTGA-SSGYFLLENLKF 194
Query: 253 GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR---GT 309
R ++N +GC + + + L G G SL Q+G + F+YCL S T
Sbjct: 195 PRKTIRNFLLGCT-TSAARELSSDALAGFGRSMFSLPIQMGVK---KFAYCLNSHDYDDT 250
Query: 310 GSSGSLVFG-REALPVGAAWVPLVRNPRAPSFYY-VGLSGLGVGGMRIPISEDLFRLTQM 367
+SG L+ R+ G ++ P +++P A +FYY +G+ + +G + I
Sbjct: 251 RNSGKLILDYRDGKTKGLSYTPFLKSPPASAFYYHLGVKDIKIGNKLLRIPSKYLAPGSD 310
Query: 368 GDDGVVMDTGT-AVTRLPTPAYEAFRDAFVAQTGNLPR---ASGVSIFDTCYNLSGFVSV 423
G GV++D+G + P ++ + Q R A + CYN +G S+
Sbjct: 311 GRSGVIIDSGYGGAGYMTGPVFKIVTNELKKQMSKYRRSLEAETQTGLTPCYNFTGHKSI 370
Query: 424 RVPTVSFYFSGGPVLTLPASNFL----------IPVDDAGTFCFAFAPSPSGLSIIGNIQ 473
++P + + F GG + +P N+ +D GT P PS I+GN Q
Sbjct: 371 KIPPLIYQFRGGANMVVPGKNYFGISPQESLACFLMDTNGTNALEITPDPS--IILGNSQ 428
Query: 474 QEGIQISFDGANGFVGFGPNVC 495
+ +D N GF C
Sbjct: 429 HVDYYVEYDLKNDRFGFRRQTC 450
>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
Length = 746
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 105/365 (28%), Positives = 167/365 (45%), Gaps = 31/365 (8%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQC-YKQSDPVFDPADSASFSGVSC 211
G ++ + +G+P + +++D+GS + +V C C S C D FDP S++ S +SC
Sbjct: 76 GYFYATLYLGTPAKKFAVIVDTGSTMTYVPCSSCGSGCGPNHQDAAFDPEASSTASRISC 135
Query: 212 SSAVCD-RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV-KNVAIGCGHKNQ 269
+S C GC +C Y SY + S + G L + L + + + GC +
Sbjct: 136 TSPKCSCGSPRCGCSTQQCTYTRSYAEQSSSSGILLEDVLALHDGLPGAPIIFGCETRET 195
Query: 270 GMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCL-VSRGTGSSGSLVFGREALP- 323
G A GL GLG S+V QL G FS C + G G+L+ G +P
Sbjct: 196 GEIFRQRADGLFGLGNSDASVVNQLVKAGVIDDVFSLCFGMVEG---DGALLLGDAEVPG 252
Query: 324 -VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
+ + PL+ + P +Y V + L V G +P+S+ LF G V+D+GT T
Sbjct: 253 SISLQYTPLLTSTTHPFYYNVKMLSLAVEGQLLPVSQSLFDQGY----GTVLDSGTTFTY 308
Query: 383 LPTPAYEAFRDAF--VAQTGNLPRASGV--SIFDTCY-------NLSGFVSVRVPTVSFY 431
+P+P ++AF A A + L R G D C+ +L SV P++
Sbjct: 309 MPSPVFKAFAGAVEKYALSHGLKRVPGPDPQFDDICFGQAPSHDDLEALSSV-FPSMEVQ 367
Query: 432 FSGGPVLTLPASNFL-IPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
F G L L N+L + ++G +C + +++G I + + +D AN VGF
Sbjct: 368 FDQGTSLVLGPLNYLFVHTFNSGKYCLGVFDNGRAGTLLGGITFRNVLVRYDRANQRVGF 427
Query: 491 GPNVC 495
GP +C
Sbjct: 428 GPALC 432
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 107/379 (28%), Positives = 172/379 (45%), Gaps = 40/379 (10%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
+G+ +G YF ++G+GSPPR Y+ +D+GSDI+WV C CS+C ++SD ++DP
Sbjct: 61 NGLPTETGLYFTKLGLGSPPRDYYVQVDTGSDILWVNCVECSRCPRKSDLGIDLTLYDPK 120
Query: 202 DSASFSGVSCSSAVCDRLENA---GCHAG-RCRYEVSYGDGSYTKGTLALETLTIG---- 253
S + VSC C + GC + C Y ++YGDGS T G + LT
Sbjct: 121 GSETSDVVSCDQDFCSATFDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNRING 180
Query: 254 --RTVVKNVAI--GCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQLG--GQTGGAFSY 302
RT +N +I GCG G ++ G++G G + S++ QL G+ FS+
Sbjct: 181 NLRTSPQNSSIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSH 240
Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
CL + G G G P + PLV PR + Y V L + V + + D+F
Sbjct: 241 CLDNVRGG--GIFAIGEVVEP-KVSTTPLV--PRM-AHYNVVLKSIEVDTDILQLPSDIF 294
Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS 422
+ G V+D+GT + LP Y+ +A+ L F C+ +G V
Sbjct: 295 --DSVNGKGTVIDSGTTLAYLPDIVYDELIQKVLARQPGLKLYLVEQQF-RCFLYTGNVD 351
Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS------GLSIIGNIQQEG 476
P V +F LT+ ++L D G +C + S + ++++G++
Sbjct: 352 RGFPVVKLHFKDSLSLTVYPHDYLFQFKD-GIWCIGWQRSVAQTKNGKDMTLLGDLVLSN 410
Query: 477 IQISFDGANGFVGFGPNVC 495
+ +D N +G+ C
Sbjct: 411 KLVIYDLENMVIGWTDYNC 429
>gi|449458942|ref|XP_004147205.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449505000|ref|XP_004162350.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 480
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 119/403 (29%), Positives = 170/403 (42%), Gaps = 68/403 (16%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP-----CSQCYKQSDPVFDPADSASFS- 207
G+Y + +GS + +D+GSD+VW C P C K P+ A++ S S
Sbjct: 74 GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIANNKSVSC 133
Query: 208 ----------GVSCSSAVC-------DRLENAGCHAGRCR-YEVSYGDGSYT----KGTL 245
G +S +C + +E + C + C + +YGDGS + +L
Sbjct: 134 SAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSL 193
Query: 246 ALETLTIGRTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLG---GQTGGAFS 301
+L T + V+N GC H G VG AG G G +S+ QL Q G FS
Sbjct: 194 SLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGF---GRGVLSMPSQLATFSPQLGNRFS 250
Query: 302 YCLVSRGTGSS-----GSLVFGREAL-PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
YCLVS + L+ GR + L+ NP+ P FY VGL+G+ VG +RI
Sbjct: 251 YCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGLAGISVGNIRI 310
Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP----RASGVSIF 411
P E L ++ + G GVV+D+GT T LP YE+ F +TG + R +
Sbjct: 311 PAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRARRIEENTGL 370
Query: 412 DTCYNLSGFVSVRVPTVSFYFSGGPV-LTLPASNFLIPVDDAG---------TFCF---- 457
CY SV VP V +F G + LP N+ D G C
Sbjct: 371 SPCYYYEN--SVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRKVGCLMLMN 428
Query: 458 -----AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
A P + +GN QQ+G ++ +D VGF C
Sbjct: 429 GGDEAELAGGPG--ATLGNYQQQGFEVVYDLEKNRVGFARRQC 469
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 103/387 (26%), Positives = 158/387 (40%), Gaps = 54/387 (13%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQC-YKQSDPV----FDPADSAS 205
G Y + G+P ++ +++ D+GS +VW C CS+C + + DP F P S+S
Sbjct: 79 GAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSS 138
Query: 206 FSGVSCSSAVCDRL--------------ENAGCHAGRCRYEVSYGDGSYTKGTLALETLT 251
V C + C + + C Y V YG GS T G L ETL
Sbjct: 139 SKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGLLLSETLD 197
Query: 252 IGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG--- 308
+ N +GC + +G+ G G GS SL Q+G + F+YCL SR
Sbjct: 198 FPDKXIPNFVVGCSFLSIHQ---PSGIAGFGRGSESLPSQMGLK---KFAYCLASRKFDD 251
Query: 309 TGSSGSLVFGREALP-VGAAWVPLVRNPRA-----PSFYYVGLSGLGVGGMRIPISEDLF 362
+ SG L+ + G + P +NP +YY+ + + VG + +
Sbjct: 252 SPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFL 311
Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD---TCYNLSG 419
G+ G ++D+G+ T + P E F Q N RA+ V C+++S
Sbjct: 312 VPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCFDISK 371
Query: 420 FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA-----------PSPSGLSI 468
SV+ P + F F GG LP +N+ V +G C PS I
Sbjct: 372 EKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGGGGGPS--VI 429
Query: 469 IGNIQQEGIQISFDGANGFVGFGPNVC 495
+G QQ+ + +D N +GF C
Sbjct: 430 LGAFQQQNFYVEYDLVNQRLGFRQQTC 456
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 108/369 (29%), Positives = 169/369 (45%), Gaps = 35/369 (9%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD---PV--FDPADSASFSG 208
G YF R+ +GSPP+ Y+ ID+GSD++WV C C+ C + S P+ FDP S++ S
Sbjct: 81 GLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASL 140
Query: 209 VSCSSAVCD---RLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLT----IGRTVVK- 258
+SCS C + +AGC +C Y YGDGS T G + L +G +V
Sbjct: 141 ISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNS 200
Query: 259 --NVAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTG 310
++ GC G G+ G G MS++ Q+ Q T FS+CL G G
Sbjct: 201 SASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGG 260
Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
++ E + + PLV P P Y + L + V G + I ++F + +
Sbjct: 261 GGILVL--GEIVEEDIVYSPLV--PSQPH-YNLNLQSISVNGKSLAIDPEVFATST--NR 313
Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSF 430
G ++D+GT + L AY+ F A R +S CY ++ V PTVS
Sbjct: 314 GTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPL-LSKGTQCYLITSSVKGIFPTVSL 372
Query: 431 YFSGGPVLTLPASNFLI---PVDDAGTFCFAFAP-SPSGLSIIGNIQQEGIQISFDGANG 486
F+GG + L ++L+ + DA +C F G++I+G++ + +D A
Sbjct: 373 NFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVYDLAGQ 432
Query: 487 FVGFGPNVC 495
+G+ C
Sbjct: 433 RIGWANYDC 441
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 116/385 (30%), Positives = 175/385 (45%), Gaps = 52/385 (13%)
Query: 155 EYFVRIGVGSP-PRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
EY + + +G+P P+ + +D+GSD+VW QC C C+ Q P FD S + V CS
Sbjct: 99 EYLIHLSIGTPRPQRVALTLDTGSDLVWTQCA-CHVCFAQPFPTFDALASQTTLAVPCSD 157
Query: 214 AVCD--RLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTI------------GRTVV 257
+C + +GC + C Y Y D S T G + +T T V
Sbjct: 158 PICTSGKYPLSGCTFNDNTCFYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKAHAGVAV 217
Query: 258 KNVAIGCGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLV 316
NV GCG N+G+F +G+ G G MSL QL FS+C + + +
Sbjct: 218 PNVRFGCGQYNKGIFKSNESGIAGFSRGPMSLPSQL---KVARFSHCFTAIADARTSPVF 274
Query: 317 FGREALP--VGA-AWVPLVRNPRAPS---FYYVGLSGLGVGGMRIPISEDLF--RLTQMG 368
G P +GA A P+ P A S YY+ L G+ VG R+P++ F + T G
Sbjct: 275 LGGAPGPDNLGAHATGPVQSTPFANSNGSLYYLTLKGITVGKTRLPLNALAFAGKGTGSG 334
Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR---- 424
G ++D+GT + LP P Y + R AFVA+ LP A+ S D L F + R
Sbjct: 335 SGGTIIDSGTGIRTLPGPMYRSLRAAFVARV-KLPVANE-SAADAESTLC-FEAARSASL 391
Query: 425 --------VPTVSFYFSGGPVLTLPASNFLIPV--DDAGT---FCFAF-APSPSGLSIIG 470
+P V + +G LP ++++ + D+ G+ C + S L+IIG
Sbjct: 392 PPEAPAPALPKVVLHVAGAD-WDLPRESYVLDLLEDEDGSGSGLCLVMNSAGDSDLTIIG 450
Query: 471 NIQQEGIQISFDGANGFVGFGPNVC 495
N QQ+ + +++D + F P C
Sbjct: 451 NFQQQNMHVAYDLEKNKLVFVPARC 475
>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
Length = 519
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 117/415 (28%), Positives = 166/415 (40%), Gaps = 93/415 (22%)
Query: 155 EYFVRIGVGSP--PRSQYMVIDSGSDIVWVQCQP--CSQCY-------KQSDPVFDPADS 203
+Y + + VG P S + +D+GSD+VW C P C C S P+ P DS
Sbjct: 87 DYTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPIDS 146
Query: 204 ASFSGVSCSSAVCDRLENAG-----CHAGRCRYEV----------------SYGDGSYTK 242
+SC+S +C ++ C A RC + +YGDGS
Sbjct: 147 RR---ISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVA 203
Query: 243 GTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
V+N C H VG AG G G +SL QL G FSY
Sbjct: 204 NLRRGRVGLAASMAVENFTFACAHTALAEPVGVAGF---GRGPLSLPAQLAPSLSGRFSY 260
Query: 303 CLVSRGTGS-----SGSLVFGR--EALPVGAA-----WVPLVRNPRAPSFYYVGLSGLGV 350
CLV+ + S L+ GR +A +GA+ + PL+ NP+ P FY V L + V
Sbjct: 261 CLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSV 320
Query: 351 GGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV-------------- 396
GG RI +L + + G+ G+V+D+GT T LP+ + D F
Sbjct: 321 GGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGA 380
Query: 397 -AQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI---PVDDA 452
AQTG P CY+ S VP V+ +F G + LP N+ + +
Sbjct: 381 EAQTGLAP----------CYHYSP-SDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGR 429
Query: 453 GTFCFAFAP------------SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C P+G +GN QQ+G ++ +D G VGF C
Sbjct: 430 SVGCLMLMNVGGNNDDGEDGGGPAG--TLGNFQQQGFEVVYDVDAGRVGFARRRC 482
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 101/368 (27%), Positives = 170/368 (46%), Gaps = 36/368 (9%)
Query: 148 GMDQG--SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP-VFDPADSA 204
G D+G + Y + +G+G+P ++Q + ID+GS WV C+ C C+ ++P F + S
Sbjct: 72 GWDRGLQTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCH--TNPRTFLQSRST 128
Query: 205 SFSGVSCSSAVC-----DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVK 258
+ + VSC +++C D + C + VSY DGS + G L +TLT +
Sbjct: 129 TCAKVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIP 188
Query: 259 NVAIGCGHKNQGM--FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL----VSRG--TG 310
+ GC + G F GLLG+G G MS++ Q T FSYCL RG +
Sbjct: 189 GFSFGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQ-SSPTFDCFSYCLPLQKSERGFFSK 247
Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
++G G+ A + +V + ++V L+ + V G R+ +S +F
Sbjct: 248 TTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVF-----SRK 302
Query: 371 GVVMDTGTAVTRLPTPAYEAFRD---AFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
GVV D+G+ ++ +P A + + G S CY++ +P
Sbjct: 303 GVVFDSGSELSYIPDRALSVLSQRIRELLLKRGAAEEESE----RNCYDMRSVDEGDMPA 358
Query: 428 VSFYFSGGPVLTLPASNFLIP--VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
+S +F G L + + V + +C AFAP+ S +SIIG++ Q ++ +D
Sbjct: 359 ISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIGSLMQTSKEVVYDLKR 417
Query: 486 GFVGFGPN 493
+G GP+
Sbjct: 418 QLIGIGPS 425
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 102/363 (28%), Positives = 159/363 (43%), Gaps = 32/363 (8%)
Query: 157 FVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC 216
V + +G+PP++Q M++D+GS + W+QC VFDP+ S+SFS + C+ +C
Sbjct: 83 LVSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPRKPPPSSVFDPSLSSSFSVLPCNHPLC 142
Query: 217 -----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQ 269
D C R C Y Y DG+ +G L E +T R+ + +GC ++
Sbjct: 143 KPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITFSRSQSTPPLILGCAEESS 202
Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----GTGSSGSLVFGREALPVG 325
A G+LG+ G +S Q FSYC+ +R G +GS G G
Sbjct: 203 ----DAKGILGMNLGRLSFASQ---AKLTKFSYCVPTRQVRPGFTPTGSFYLGENPNSGG 255
Query: 326 AAWVPLV---RNPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
++ L+ ++ R P+ Y V + G+ +G ++ I FR G ++D+G+
Sbjct: 256 FRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPSGAGQTMIDSGS 315
Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGV--SIFDTCYNLSGFVSVR-VPTVSFYFSGG 435
T L AY R+ V G + V + D C+N + R + + F F G
Sbjct: 316 EFTYLVDEAYNKVREEVVRLVGARLKKGYVYGGVSDMCFNGNAIEIGRLIGNMVFEFDKG 375
Query: 436 PVLTLPASNFLIPVDDAGTFCFAFAPSP---SGLSIIGNIQQEGIQISFDGANGFVGFGP 492
+ + L V G C S + +IIGN Q+ I + FD AN VGFG
Sbjct: 376 VEIVVEKERVLADV-GGGVHCVGIGRSEMLGAASNIIGNFHQQNIWVEFDLANRRVGFGK 434
Query: 493 NVC 495
C
Sbjct: 435 ADC 437
>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
Length = 443
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 74/199 (37%), Positives = 110/199 (55%), Gaps = 15/199 (7%)
Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
+G P Y + D+GS+++W+QC PC+ CY Q+ P+FDPA+S ++ VS S +C+ +
Sbjct: 63 LGVPSTLVYGIADTGSELIWLQCLPCTHCYNQTPPIFDPAESYTYETVSSDSPICNAVRR 122
Query: 222 AGCHAG--RCRYEVSYGDGSYTKGTLALETLTI---GRTVVK--NVAIGCGHKNQGMFVG 274
C G C Y+ +YGDG+ TKGTL+ + RT+V+ + GC H + G
Sbjct: 123 ISCREGDKSCCYQHTYGDGTTTKGTLSTDVFAFEDPTRTIVEVGYLTFGCSHDTKARLKG 182
Query: 275 -AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS-LVFGREALPVGAAWVPLV 332
AG++GL SLV QL + FSYC+V SGS + FG A+ +G PL+
Sbjct: 183 HQAGVVGLNRHPNSLVSQLKVK---KFSYCMVIPDDHGSGSRMYFGSRAVILGGK-TPLL 238
Query: 333 RNPRAPSFYYVGLSGLGVG 351
+ S Y+V L G+ VG
Sbjct: 239 KGDY--SHYFVTLKGISVG 255
Score = 62.8 bits (151), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 60/117 (51%), Gaps = 9/117 (7%)
Query: 182 VQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGS 239
++ Q +QC+ Q+ P+FDP+ S+++S V + C + CH C Y +SYG GS
Sbjct: 326 LEAQEVAQCFNQTPPIFDPSKSSTYSTVPWDAPTCYQAGGYACHIDEEDCCYRISYGSGS 385
Query: 240 Y-TKGTLALETLTI-----GRTVVKNVAIGCGHKNQGMFVG-AAGLLGLGGGSMSLV 289
T+GT++++ V ++ GC G F G G++GL S+SLV
Sbjct: 386 TSTEGTISIDAFAFEDNRQNMVDVXHLVFGCSDYTTGTFKGYEVGIVGLNQDSLSLV 442
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 103/387 (26%), Positives = 158/387 (40%), Gaps = 54/387 (13%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQC-YKQSDPV----FDPADSAS 205
G Y + G+P ++ +++ D+GS +VW C CS+C + + DP F P S+S
Sbjct: 79 GAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSS 138
Query: 206 FSGVSCSSAVCDRL--------------ENAGCHAGRCRYEVSYGDGSYTKGTLALETLT 251
V C + C + + C Y V YG GS T G L ETL
Sbjct: 139 SKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGLLLSETLD 197
Query: 252 IGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG--- 308
+ N +GC + +G+ G G GS SL Q+G + F+YCL SR
Sbjct: 198 FPDKKIPNFVVGCSFLSIHQ---PSGIAGFGRGSESLPSQMGLK---KFAYCLASRKFDD 251
Query: 309 TGSSGSLVFGREALP-VGAAWVPLVRNPRA-----PSFYYVGLSGLGVGGMRIPISEDLF 362
+ SG L+ + G + P +NP +YY+ + + VG + +
Sbjct: 252 SPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFL 311
Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD---TCYNLSG 419
G+ G ++D+G+ T + P E F Q N RA+ V C+++S
Sbjct: 312 VPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCFDISK 371
Query: 420 FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA-----------PSPSGLSI 468
SV+ P + F F GG LP +N+ V +G C PS I
Sbjct: 372 EKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGGGGGPS--VI 429
Query: 469 IGNIQQEGIQISFDGANGFVGFGPNVC 495
+G QQ+ + +D N +GF C
Sbjct: 430 LGAFQQQNFYVEYDLVNQRLGFRQQTC 456
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 122 bits (306), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 95/368 (25%), Positives = 155/368 (42%), Gaps = 47/368 (12%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+G Y R+ +G+PP+ +++D+GS + +V C C C DP F P DS ++ V C+
Sbjct: 90 NGYYTARLWIGTPPQRFALIVDTGSTVTYVPCSTCRHCGSHQDPKFRPEDSETYQPVKCT 149
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV---VKNVAIGCGHKNQ 269
N +C YE Y + S + G L + ++ G + GC +
Sbjct: 150 WQC-----NCDNDRKQCTYERRYAEMSTSSGALGEDVVSFGNQTELSPQRAIFGCENDET 204
Query: 270 GMFVG--AAGLLGLGGGSMSLVGQLGGQT--GGAFSYCLVSR----------GTGSSGSL 315
G A G++GLG G +S++ QL + +FS C G +
Sbjct: 205 GDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGMGVGGGAMVLGGISPPADM 264
Query: 316 VFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMD 375
VF R VR+P +Y + L + V G R+ ++ +F G G V+D
Sbjct: 265 VFTRSD---------PVRSP----YYNIDLKEIHVAGKRLHLNPKVFD----GKHGTVLD 307
Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS--IFDTCYNLSGF----VSVRVPTVS 429
+GT LP A+ AF+ A + +T +L R SG D C++ + +S P V
Sbjct: 308 SGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPRYNDICFSGAEIDVSQISKSFPVVE 367
Query: 430 FYFSGGPVLTLPASNFLIPVDDA-GTFCF-AFAPSPSGLSIIGNIQQEGIQISFDGANGF 487
F G L+L N+L G +C F+ +++G I + +D +
Sbjct: 368 MVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHTK 427
Query: 488 VGFGPNVC 495
+GF C
Sbjct: 428 IGFWKTNC 435
>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 469
Score = 122 bits (306), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 114/394 (28%), Positives = 162/394 (41%), Gaps = 66/394 (16%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQC-YKQSDPV----FDPADSAS 205
G Y V + G+P ++ V D+GS +VW+ C CS C + DP F P +S+S
Sbjct: 88 GGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSS 147
Query: 206 FSGVSCSSAVCDRL------------ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG 253
+ C S C L C G Y + YG GS T G L E L
Sbjct: 148 SKIIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGS-TAGVLITEKLDFP 206
Query: 254 RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR------ 307
V + +GC + AG+ G G G +SL Q+ + FS+CLVSR
Sbjct: 207 DLTVPDFVVGCSIISTRQ---PAGIAGFGRGPVSLPSQMNLK---RFSHCLVSRRFDDTN 260
Query: 308 -------GTGS---SGSLVFGREALPVGAAWVPLVRNPRAPS-----FYYVGLSGLGVGG 352
TGS SGS G + P +NP + +YY+ L + VG
Sbjct: 261 VTTDLDLDTGSGHNSGSKT-------PGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGR 313
Query: 353 MRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-- 410
+ I GD G ++D+G+ T + P +E + F +Q N R +
Sbjct: 314 KHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKET 373
Query: 411 -FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP----SPSG 465
C+N+SG V VP + F F GG L LP SN+ V + T C +PSG
Sbjct: 374 GLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPSG 433
Query: 466 LS----IIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ I+G+ QQ+ + +D N GF C
Sbjct: 434 GTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467
>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 444
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 102/365 (27%), Positives = 162/365 (44%), Gaps = 35/365 (9%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV--FDPADSASFSGVSCSSAV 215
+ + +G+P +SQ +V+D+GS + W+QC P P FDP+ S+SFS + CS +
Sbjct: 83 LSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPL 142
Query: 216 C-----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKN 268
C D C + R C Y Y DG++ +G L E T + + +GC ++
Sbjct: 143 CKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPPLILGCAKES 202
Query: 269 QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----GTGSSGSLVFGREALPV 324
G+LG+ G +S + Q FSYC+ +R G S+GS G
Sbjct: 203 ----TDVKGILGMNLGRLSFISQ---AKISKFSYCIPTRSNRPGLASTGSFYLGENPNSR 255
Query: 325 GAAWVPLVRNPRA-------PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
G +V L+ P++ P Y V L G+ +G R+ I +FR G ++D+G
Sbjct: 256 GFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDAGGSGQTMVDSG 315
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGV--SIFDTCY--NLSGFVSVRVPTVSFYFS 433
+ T L AY+ ++ V G+ + V S D C+ N + + + F F
Sbjct: 316 SEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHQMVIGRLIGDLVFEFG 375
Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSP---SGLSIIGNIQQEGIQISFDGANGFVGF 490
G + + L+ V G C S + +IIGN+ Q+ + + FD AN VGF
Sbjct: 376 RGVEILVEKQRLLVNV-GGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVANRRVGF 434
Query: 491 GPNVC 495
C
Sbjct: 435 SKAEC 439
>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
Length = 419
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 107/371 (28%), Positives = 158/371 (42%), Gaps = 48/371 (12%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--SQCYKQSDPVFDPADSASFSGVSCSS 213
Y +G+PP++ ++D ++VW QC C S C+KQ PVFDP+ S ++ C S
Sbjct: 62 YVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGS 121
Query: 214 AVCDRLENAGCHA-GRCRYEVS--YGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
+C + C G C YE +GD T G + + + IG + +A GC + G
Sbjct: 122 PLCKSIPTRNCSGDGECGYEAPSMFGD---TFGIASTDAIAIGNAEGR-LAFGCVVASDG 177
Query: 271 MFVGA----AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA 326
GA +G +GLG SLVGQ AFSYCL G G +L G A GA
Sbjct: 178 SIDGAMDGPSGFVGLGRTPWSLVGQ---SNVTAFSYCLALHGPGKKSALFLGASAKLAGA 234
Query: 327 AWVPLVRNPRAP---------------SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
NP P +Y V L G+ G + + + +
Sbjct: 235 G----KSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAASSGGGAITV---- 286
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
+ ++T ++ LP AY+A A G+ A+ FD C+ + VP + F
Sbjct: 287 LQLETFRPLSYLPDAAYQALEKVVTAALGSPSMANPPEPFDLCFQNAAVSG--VPDLVFT 344
Query: 432 FSGGPVLTLPASNFLIPVDDA-GTFCFAFAPSP------SGLSIIGNIQQEGIQISFDGA 484
F GG LT S +L+ + GT C + S G+SI+G++ QE + FD
Sbjct: 345 FQGGATLTAQPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFDLE 404
Query: 485 NGFVGFGPNVC 495
+ F P C
Sbjct: 405 KETLSFEPADC 415
>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
Length = 492
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 117/415 (28%), Positives = 166/415 (40%), Gaps = 93/415 (22%)
Query: 155 EYFVRIGVGSP--PRSQYMVIDSGSDIVWVQCQP--CSQCY-------KQSDPVFDPADS 203
+Y + + VG P S + +D+GSD+VW C P C C S P+ P DS
Sbjct: 87 DYTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPIDS 146
Query: 204 ASFSGVSCSSAVCDRLENAG-----CHAGRCRYEV----------------SYGDGSYTK 242
+SC+S +C ++ C A RC + +YGDGS
Sbjct: 147 RR---ISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVA 203
Query: 243 GTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
V+N C H VG AG G G +SL QL G FSY
Sbjct: 204 NLRRGRVGLAASMAVENFTFACAHTALAEPVGVAGF---GRGPLSLPAQLAPSLSGRFSY 260
Query: 303 CLVSRGTGS-----SGSLVFGR--EALPVGAA-----WVPLVRNPRAPSFYYVGLSGLGV 350
CLV+ + S L+ GR +A +GA+ + PL+ NP+ P FY V L + V
Sbjct: 261 CLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSV 320
Query: 351 GGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV-------------- 396
GG RI +L + + G+ G+V+D+GT T LP+ + D F
Sbjct: 321 GGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGA 380
Query: 397 -AQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI---PVDDA 452
AQTG P CY+ S VP V+ +F G + LP N+ + +
Sbjct: 381 EAQTGLAP----------CYHYSP-SDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGR 429
Query: 453 GTFCFAFAP------------SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C P+G +GN QQ+G ++ +D G VGF C
Sbjct: 430 SVGCLMLMNVGGNNDDGEDGGGPAG--TLGNFQQQGFEVVYDVDAGRVGFARRRC 482
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 122 bits (305), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 93/360 (25%), Positives = 159/360 (44%), Gaps = 31/360 (8%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+G Y R+ +G+PP+ +++D+GS + +V C C QC + DP F P S+++ V C+
Sbjct: 78 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPDLSSTYQPVKCT 137
Query: 213 SAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHK 267
+ C R C YE Y + S + G L + ++ G + GC +
Sbjct: 138 L-------DCNCDNDRMQCVYERQYAEMSTSSGVLGEDVVSFGNQSELAPQRAVFGCENV 190
Query: 268 NQGMFVG--AAGLLGLGGGSMSLVGQLGGQT--GGAFSYCLVSRGTGSSGSLVFGREALP 323
G A G++GLG G +S++ QL + +FS C G G++V G + P
Sbjct: 191 ETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVG-GGAMVLGGISPP 249
Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
+ +P +Y + L + V G R+P++ +F G G V+D+GT L
Sbjct: 250 SDMVFA--QSDPVRSPYYNIDLKEIHVAGKRLPLNPSVFD----GKHGSVLDSGTTYAYL 303
Query: 384 PTPAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGF----VSVRVPTVSFYFSGGPV 437
P A+ AF++A V + + + SG + D C++ +G +S P V F G
Sbjct: 304 PEEAFLAFKEAIVKELQSFSQISGPDPNYNDLCFSGAGIDVSQLSKTFPVVDMIFGNGHK 363
Query: 438 LTLPASNFLIPVDDA-GTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+L N++ G +C F +++G I + +D +GF C
Sbjct: 364 YSLSPENYMFRHSKVRGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDREQTKIGFWKTNC 423
>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
Length = 458
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 104/388 (26%), Positives = 167/388 (43%), Gaps = 60/388 (15%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQCY---KQSDPVFDPADSASFS 207
G + + + G+PP+ ++D+GS +VW C C+ C + P+F+P S+S
Sbjct: 85 GAHTIPLSFGTPPQKLSFLMDTGSHVVWAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDK 144
Query: 208 GVSCSSAVCDRLENAGCHAG--RC------------RYEVSYGDGSYTKGTLALETLTIG 253
+ C C + H G RC +Y + YG G+ G LE L
Sbjct: 145 ILGCRDPKCADTSSPBVHLGXPRCNGNSKKCSHACPQYTLQYGTGA-ASGFFLLENLDFP 203
Query: 254 RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG---TG 310
+ +GC + + L G G SL Q+G + F+YCL S T
Sbjct: 204 GKTIHKFLVGCT-TSADREPSSDALAGFGRTMFSLPMQMGVK---KFAYCLNSHDYDDTR 259
Query: 311 SSGSLVFG-REALPVGAAWVPLVRNPR-APSFYYVGLSGLGVGG--MRIPISEDLFRLTQ 366
+SG L+ + G ++ P +NP P +YY+G+ + +G +RIP LT
Sbjct: 260 NSGKLILDYSDGETQGLSYAPFXKNPPDYPIYYYLGVKDMKIGNKVLRIPGK----YLTP 315
Query: 367 MGDD--GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA------SGVSIFDTCYNLS 418
D GVV+D+G A + + P ++ + Q R+ +GV+ CYN +
Sbjct: 316 GSDSRGGVVIDSGFAYSYMTLPVFKIVTNELKKQMSKYRRSLELEAQTGVT---PCYNFT 372
Query: 419 GFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFA-----------FAPSPSGLS 467
G S+++P + + F+GG + +P N+ + +A CF F P PS
Sbjct: 373 GHKSIKIPDLIYQFTGGANMVVPGMNYFLLFSEASLGCFPVTTDSPTSNLEFTPGPS--I 430
Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
I+GN QQ + FD N +GF C
Sbjct: 431 ILGNYQQVDHYVEFDLKNERLGFRQQTC 458
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 98/338 (28%), Positives = 156/338 (46%), Gaps = 39/338 (11%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
G Y+ ++ +G+PP + ID+GSD++WV C CS C + S FDP S++ S
Sbjct: 23 GLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSM 82
Query: 209 VSCSSAVCD---RLENAGCHA--GRCRYEVSYGDGSYTKG-----TLALETLTIGRTVVK 258
++CS C+ + +A C + +C Y YGDGS T G + L T+ G
Sbjct: 83 IACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTN 142
Query: 259 N---VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGT 309
+ V GC ++ G G+ G G MS++ QL Q FS+CL +G
Sbjct: 143 STAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCL--KGD 200
Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
S G ++ E + + LV P P Y + L + V G + I +F +
Sbjct: 201 SSGGGILVLGEIVEPNIVYTSLV--PAQP-HYNLNLQSIAVNGQTLQIDSSVFATSN--S 255
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA--SGVSIFDTCYNLSGFVSVRVPT 427
G ++D+GT + L AY+ F A T ++P++ + VS + CY ++ V+ P
Sbjct: 256 RGTIVDSGTTLAYLAEEAYDPFVSAI---TASIPQSVHTAVSRGNQCYLITSSVTEVFPQ 312
Query: 428 VSFYFSGGPVLTLPASNFLI---PVDDAGTFCFAFAPS 462
VS F+GG + L ++LI + A +C F S
Sbjct: 313 VSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKS 350
>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
gi|194689376|gb|ACF78772.1| unknown [Zea mays]
gi|224031455|gb|ACN34803.1| unknown [Zea mays]
gi|238011528|gb|ACR36799.1| unknown [Zea mays]
gi|238015454|gb|ACR38762.1| unknown [Zea mays]
Length = 304
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 94/309 (30%), Positives = 140/309 (45%), Gaps = 27/309 (8%)
Query: 209 VSCSSAVCDRLENAGCH-AGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKN-------V 260
+ C+ +C + + C C Y +YGDG+ T G A E T + +
Sbjct: 1 MRCAGTLCSDILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPL 60
Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE 320
GCG N G +G++G G +SLV QL + FSYCL S + +L+FG
Sbjct: 61 GFGCGSVNVGSLNNGSGIVGFGRNPLSLVSQLSIRR---FSYCLTSYASRRQSTLLFGSL 117
Query: 321 ALPV------GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
+ V PL+++P+ P+FYYV +GL VG R+ I E F L G GV++
Sbjct: 118 SDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIV 177
Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD-TCYNL-------SGFVSVRVP 426
D+GTA+T LP AF Q LP A+G + D C+ + S + VP
Sbjct: 178 DSGTALTLLPAAVLAEVVRAFRQQL-RLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVP 236
Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
+ +F G L LP N+++ G C A S S IGN+ Q+ +++ +D
Sbjct: 237 RMVLHFQGAD-LDLPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAE 295
Query: 487 FVGFGPNVC 495
+ P C
Sbjct: 296 TLSIAPARC 304
>gi|226495677|ref|NP_001146995.1| pepsin A precursor [Zea mays]
gi|195606284|gb|ACG24972.1| pepsin A [Zea mays]
Length = 504
Score = 121 bits (304), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 116/396 (29%), Positives = 156/396 (39%), Gaps = 79/396 (19%)
Query: 171 MVIDSGSDIVWVQCQP--CSQCYKQ-----SDPVFDPADSASFSGVSCSSAVCDRLENAG 223
+ +D+GSD+VW C P C C + S P+ P DS + C+S +C +
Sbjct: 107 LFLDTGSDLVWFPCAPFTCMLCEGKPTPGRSGPLPPPPDSRR---IPCASPLCSAAHASA 163
Query: 224 -----CHAGRCRYE-----------------VSYGDGSYT----KGTLALETLTIGRTVV 257
C A RC E +YGDGS +G +AL V
Sbjct: 164 PPSDLCAAARCPLEDIETGSCGASHACPPLYYAYGDGSLVAHLRRGRVALGAGARASVAV 223
Query: 258 --KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS---- 311
N C H G VG AG G G +SL GQL Q G FSYCLVS +
Sbjct: 224 AVDNFTFACAHTALGEPVGVAGF---GRGPLSLPGQLSPQLSGRFSYCLVSHSFRADRLI 280
Query: 312 -SGSLVFGREALPV--------GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
L+ GR G + PL+ NP+ P FY V L + VG RI +L
Sbjct: 281 RPSPLILGRSPDDADAAAAETDGFVYTPLLHNPKHPYFYSVALEAVSVGAARIQARPELA 340
Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYE-----AFRDAFVAQTGNLPRASGVSIFDTCYNL 417
R+ + G+ G+V+D+GT T LP Y R A RA + CY
Sbjct: 341 RVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFARAERAEEQTGLTPCYRY 400
Query: 418 SGFVSVR-VPTVSFYFSGGPVLTLPASNFLIPV-----------DDAGTFCFAFAPSPSG 465
+ S R VP ++ +F G + LP N+ + DD G SG
Sbjct: 401 A--ASDRGVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTRKDDVGCLMLMNGGDASG 458
Query: 466 ------LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+GN QQ+G ++ +D G VGF C
Sbjct: 459 EEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 494
>gi|367066697|gb|AEX12632.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066699|gb|AEX12633.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066701|gb|AEX12634.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066703|gb|AEX12635.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066705|gb|AEX12636.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066707|gb|AEX12637.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066709|gb|AEX12638.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066711|gb|AEX12639.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066713|gb|AEX12640.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066715|gb|AEX12641.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066717|gb|AEX12642.1| hypothetical protein 2_5918_01 [Pinus taeda]
Length = 137
Score = 121 bits (304), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 81/127 (63%)
Query: 144 DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADS 203
DV + + G+GE+ +++ +G P + ++D+GSD+ W QC PCS CYKQ P++DP+ S
Sbjct: 9 DVQAPVSAGNGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCMPCSDCYKQPTPIYDPSLS 68
Query: 204 ASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIG 263
+++ VSC S++C L + C + C Y +YGD S T+G L+ ET T+ + ++A G
Sbjct: 69 STYGTVSCKSSLCLALPASACISATCEYLYTYGDYSSTQGILSYETFTLSSQSIPHIAFG 128
Query: 264 CGHKNQG 270
CG N+G
Sbjct: 129 CGQDNEG 135
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 121 bits (304), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 96/364 (26%), Positives = 157/364 (43%), Gaps = 42/364 (11%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFS----GV 209
G Y R+ +G+PP +++D+GS + +V C C+ C DP F PA S+S+ G
Sbjct: 33 GYYTSRVKIGTPPHEFSLIVDTGSTVTYVPCSSCTHCGNHQDPRFSPALSSSYKPLECGS 92
Query: 210 SCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV---KNVAIGCGH 266
CS+ CD G +Y+ Y + S + G L + + + + + GC
Sbjct: 93 ECSTGFCD---------GSRKYQRQYAEKSTSSGVLGKDVIGFSNSSDLGGQRLVFGCET 143
Query: 267 KNQGMFVG--AAGLLGLGGGSMSLVGQLGGQTG--GAFSYCLVSRGTGSSGSLVFGREAL 322
G A G++GLG G +S++ QL + FS C G G+++ G
Sbjct: 144 AETGDLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEG-GGAMILGGFQP 202
Query: 323 PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
P + +P +Y + L G+ VGG + + ++F G G V+D+GT
Sbjct: 203 PKDMVFT--ASDPHRSPYYNLMLKGIRVGGSPLRLKPEVFD----GKYGTVLDSGTTYAY 256
Query: 383 LPTPAYEAFRDAFVAQTGNLPRASG--VSIFDTCY--------NLSGFVSVRVPTVSFYF 432
P A++AF+ A Q G+L G D CY NLS F P+V F F
Sbjct: 257 FPGAAFQAFKSAVKEQVGSLKEVPGPDEKFKDICYAGAGTNVSNLSQF----FPSVDFVF 312
Query: 433 SGGPVLTLPASNFLIP-VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFG 491
G +TL N+L +G +C + +++G I + ++++ +GF
Sbjct: 313 GDGQSVTLSPENYLFRHTKISGAYCLGVFENGDPTTLLGGIIVRNMLVTYNRGKASIGFL 372
Query: 492 PNVC 495
C
Sbjct: 373 KTKC 376
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 101/368 (27%), Positives = 171/368 (46%), Gaps = 36/368 (9%)
Query: 148 GMDQG--SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP-VFDPADSA 204
G D+G + Y + +G+G+P ++Q + ID+GS WV C+ C C+ ++P F + S
Sbjct: 72 GWDRGLQTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCH--TNPRTFLQSRST 128
Query: 205 SFSGVSCSSAVC-----DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVK 258
+ + VSC +++C D + C + VSY DGS + G L +TLT +
Sbjct: 129 TCAKVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIP 188
Query: 259 NVAIGCGHKNQGM--FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL----VSRG--TG 310
+ GC + G F GLLG+G G MS++ Q + G FSYCL RG +
Sbjct: 189 SFTFGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSK 247
Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
++G G+ A + +V + ++V L+ + V G R+ +S +F
Sbjct: 248 TTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRK 302
Query: 371 GVVMDTGTAVTRLPTPAYEAFRD---AFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
GVV D+G+ ++ +P A + + G S CY++ +P
Sbjct: 303 GVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESE----RNCYDMRSVDEGDMPA 358
Query: 428 VSFYFSGGPVLTLPASNFLIP--VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
+S +F G L + + V + +C AFAP+ S +SIIG++ Q ++ +D
Sbjct: 359 ISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIGSLMQTSKEVVYDLKR 417
Query: 486 GFVGFGPN 493
+G GP+
Sbjct: 418 QLIGIGPS 425
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 103/370 (27%), Positives = 167/370 (45%), Gaps = 36/370 (9%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
G Y+ ++ +G+PP + ID+GSD++WV C C+ C + S FDP S++ S
Sbjct: 76 GLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSSM 135
Query: 209 VSCSSAVCD---RLENAGCHA--GRCRYEVSYGDGSYTKG-----TLALETLTIGRTVVK 258
++CS C+ + +A C + +C Y YGDGS T G + L T+ G
Sbjct: 136 IACSDQRCNNGKQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSMTTN 195
Query: 259 N---VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGT 309
+ V GC ++ G G+ G G MS++ QL Q FS+CL +G
Sbjct: 196 STAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCL--KGD 253
Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
S G ++ E + + LV P P Y + L + V G + I +F +
Sbjct: 254 SSGGGILVLGEIVEPNIVYTSLV--PAQP-HYNLNLQSISVNGQTLQIDSSVFATSN--S 308
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVS 429
G ++D+GT + L AY+ F A A R VS + CY ++ V+ P VS
Sbjct: 309 RGTIVDSGTTLAYLAEEAYDPFVSAITAAIPQSVRTV-VSRGNQCYLITSSVTDVFPQVS 367
Query: 430 FYFSGGPVLTLPASNFLI---PVDDAGTFCFAFAP-SPSGLSIIGNIQQEGIQISFDGAN 485
F+GG + L ++LI + A +C F G++I+G++ + + +D A
Sbjct: 368 LNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDLAG 427
Query: 486 GFVGFGPNVC 495
+G+ C
Sbjct: 428 QRIGWANYDC 437
>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
Length = 458
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 103/385 (26%), Positives = 164/385 (42%), Gaps = 54/385 (14%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQCY---KQSDPVFDPADSASFS 207
G + + + G+PP+ ++D+GS +VW C C+ C + P+F+P S+S
Sbjct: 85 GGHTIPLSFGTPPQKLSFLVDTGSHVVWAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDK 144
Query: 208 GVSCSSAVCDRLENAGCHAG--RC------------RYEVSYGDGSYTKGTLALETLTIG 253
+ C C + H G RC +Y + YG G+ G LE L
Sbjct: 145 ILGCRDPKCANTSSPDVHLGCPRCNGNSKKCSHACPQYTLQYGTGA-ASGFFLLENLDFP 203
Query: 254 RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG---TG 310
+ +GC + + L G G SL Q+G + F+YCL S T
Sbjct: 204 GKTIHKFLVGCT-TSADREPSSDALAGFGRTMFSLPMQMGVK---KFAYCLNSHDYDDTR 259
Query: 311 SSGSLVFG-REALPVGAAWVPLVRNPR-APSFYYVGLSGLGVGG--MRIPISEDLFRLTQ 366
+SG L+ + G ++ P ++NP P +YY+G+ + +G +RIP LT
Sbjct: 260 NSGKLILDYSDGETQGLSYAPFLKNPPDYPFYYYLGVKDMKIGNKLLRIPGK----YLTP 315
Query: 367 MGDD--GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR---ASGVSIFDTCYNLSGFV 421
D GV++D+G A + P ++ + Q R A S CYN +G
Sbjct: 316 GSDSRGGVMIDSGFAYGYMTLPVFKIVTNELKKQMSKYRRSLEAETQSGLTPCYNFTGHK 375
Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFA-----------FAPSPSGLSIIG 470
S+++P + + F+GG + +P N+ + +A CF F P PS I+G
Sbjct: 376 SIKIPDLIYQFTGGANMVVPGMNYFLLFSEASLGCFPVTTDSPTNNLEFTPGPS--IILG 433
Query: 471 NIQQEGIQISFDGANGFVGFGPNVC 495
N QQ + FD N +GF C
Sbjct: 434 NYQQVDHYVEFDLKNERLGFRQQTC 458
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 110/373 (29%), Positives = 173/373 (46%), Gaps = 47/373 (12%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
V + VG+PP++ MV+D+GS++ W++C +Q ++ + FDP S+S+S V CSS C
Sbjct: 87 VSLTVGTPPQNVSMVLDTGSELSWLRCNK-TQTFQTT---FDPNRSSSYSPVPCSSLTCT 142
Query: 217 DRLEN----AGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK---- 267
DR + A C + + C +SY D S ++G LA +T IG + + GC
Sbjct: 143 DRTRDFPIPASCDSNQLCHAILSYADASSSEGNLASDTFYIGNSDMPGTIFGCMDSSFST 202
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE----ALP 323
N GL+G+ GS+S V Q+ FSYC+ + SG L+ G +P
Sbjct: 203 NTEEDSKNTGLMGMNRGSLSFVSQMDFP---KFSYCI--SDSDFSGVLLLGDANFSWLMP 257
Query: 324 VGAAWVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
+ + PL++ + P F Y V L G+ V +P+ + +F G ++D+GT
Sbjct: 258 LN--YTPLIQISTPLPYFDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMVDSGT 315
Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCYN--LSGFVSVRVPTVSF 430
T L P Y A R+ F+ QT + R + D CY LS +PTVS
Sbjct: 316 QFTFLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQGGMDLCYRVPLSQTSLPWLPTVSL 375
Query: 431 YFSGGPVLTLPASNFL--IPVDDAGT---FCFAFAPS---PSGLSIIGNIQQEGIQISFD 482
F G + + L +P + G+ +CF F S +IG+ Q+ + + FD
Sbjct: 376 MFRGAE-MKVSGDRLLYRVPGEVRGSDSVYCFTFGNSDLLAVEAYVIGHHHQQNVWMEFD 434
Query: 483 GANGFVGFGPNVC 495
+GF C
Sbjct: 435 LEKSRIGFAQVQC 447
>gi|302141829|emb|CBI19032.3| unnamed protein product [Vitis vinifera]
Length = 382
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 89/246 (36%), Positives = 130/246 (52%), Gaps = 12/246 (4%)
Query: 257 VKNVAIGCGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSL 315
+ + GCG N+ + AGLLGLG G +SLV QLG Q FSYCL S + SL
Sbjct: 139 IPRIGFGCGVNNRATGMDQTAGLLGLGRGVLSLVSQLGTQ---KFSYCLTSIHENKTSSL 195
Query: 316 VFGREAL----PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
+FG A P PL++NP PS+YY+ L G+ VG +PI E F+L + G G
Sbjct: 196 LFGSLAYSNFNPGKIPRTPLIQNPFLPSYYYLALKGITVGYTLLPIPEFAFQLGKDGSGG 255
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNL--SGFVSVRVPTVS 429
+++D+GT +T L A++ ++AF++QT S + D C++L V+VP +
Sbjct: 256 MILDSGTTITYLQEDAFDVLKNAFISQTELQVANSSTTGLDLCFHLPVKNAAEVKVPKLI 315
Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVG 489
F+F G L LP N+++ + G C A + S LSI GNIQQ+ + + D +
Sbjct: 316 FHFKGLD-LALPVENYMVSDPEMGLICLAIDATGS-LSIFGNIQQQNMLVLHDLKKSTLS 373
Query: 490 FGPNVC 495
P C
Sbjct: 374 LVPTQC 379
Score = 41.6 bits (96), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 20/70 (28%), Positives = 38/70 (54%), Gaps = 7/70 (10%)
Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
+QR + R ++R+SG A ++ Q + + G GE+ V + +G+PP
Sbjct: 62 IQRGINRGRQRLQRMSGMATTAERNGFQ-------APVHVGDGEFVVNLMIGTPPVPFPA 114
Query: 172 VIDSGSDIVW 181
++D+GSD++W
Sbjct: 115 IMDTGSDLIW 124
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 105/371 (28%), Positives = 167/371 (45%), Gaps = 44/371 (11%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVS 210
Y+ IG+G+P + Y+ +D+GSDI+WV C C +C ++S ++DP DS++ S VS
Sbjct: 4 YYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVS 63
Query: 211 CSSAVCDRLENA---GCHAGR-CRYEVSYGDGSYTKGTLALETLTI------GRTVVKN- 259
C C GC C Y V+YGDGS T G + L G+T N
Sbjct: 64 CDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPANS 123
Query: 260 -VAIGCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGS 311
V GCG + QG +G++ G++G G + S++ QL G+ F++CL + G
Sbjct: 124 TVTFGCGSQ-QGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTINGG- 181
Query: 312 SGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
G G P PLV P P Y V L + VGG + + +F + G
Sbjct: 182 -GIFAIGNVVQP-KVKTTPLV--PNMPH-YNVNLKSIDVGGTALKLPSHMFDTGE--KKG 234
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
++D+GT +T LP Y+ A A+ ++ + V F C+ G V P ++F+
Sbjct: 235 TIIDSGTTLTYLPEIVYKEIMLAVFAKHKDITFHN-VQEF-LCFQYVGRVDDDFPKITFH 292
Query: 432 FSGG-PVLTLPASNFLIPVDDAGTFCFAF------APSPSGLSIIGNIQQEGIQISFDGA 484
F P+ P F D+ +C F + G+ ++G++ + +D
Sbjct: 293 FENDLPLNVYPHDYFFENGDNL--YCVGFQNGGLQSKDGKGMVLLGDLVLSNKLVVYDLE 350
Query: 485 NGFVGFGPNVC 495
N +G+ C
Sbjct: 351 NQVIGWTEYNC 361
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 110/376 (29%), Positives = 163/376 (43%), Gaps = 45/376 (11%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV------FDPADSASFSGVSC 211
V + VG+PP++ MV+D+GS++ W+ C Q + F P SA+F+ V C
Sbjct: 65 VSLAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAVPC 124
Query: 212 SSAVC---DRLENAGCHAG--RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC-- 264
S C D C +C +SY DGS + G LA + +G A GC
Sbjct: 125 GSTQCSSRDLPAPPSCDGASRQCHVSLSYADGSASDGALATDVFAVGEAPPLRSAFGCMS 184
Query: 265 -GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP 323
+ + V AGLLG+ G++S V Q + FSYC+ R +G L+ G LP
Sbjct: 185 TAYDSSPDGVATAGLLGMNRGTLSFVTQASTRR---FSYCISDRD--DAGVLLLGHSDLP 239
Query: 324 -VGAAWVPLVRNPRAPSFYY------VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
+ + PL + P P Y+ V L G+ VGG +PI + G ++D+
Sbjct: 240 FLPLNYTPLYQ-PTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHTGAGQTMVDS 298
Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCYNLSG---FVSVRVPT 427
GT T L AY A + F+ QT L RA F DTC+ + S R+P
Sbjct: 299 GTQFTFLLGDAYSALKAEFLKQTKPLLRALDDPSFAFQEALDTCFRVPAGRPPPSARLPP 358
Query: 428 VSFYFSGGPVLTLPASNFLIPVDDA-----GTFCFAFAPS---PSGLSIIGNIQQEGIQI 479
V+ F+G +++ L V G +C F + P +IG+ Q + +
Sbjct: 359 VTLLFNGA-EMSVAGDRLLYKVPGEHRGADGVWCLTFGNADMVPLTAYVIGHHHQMNLWV 417
Query: 480 SFDGANGFVGFGPNVC 495
+D G VG P C
Sbjct: 418 EYDLERGRVGLAPVKC 433
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 115/372 (30%), Positives = 164/372 (44%), Gaps = 42/372 (11%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
+ I VG+PP++ MVID+GS++ W+ C + P F+P S+S++ +SCSS C
Sbjct: 68 ISITVGTPPQNMSMVIDTGSELSWLHCN-TNTTATIPYPFFNPNISSSYTPISCSSPTCT 126
Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK---- 267
D A C + C +SY D S ++G LA +T G + + GC +
Sbjct: 127 TRTRDFPIPASCDSNNLCHATLSYADASSSEGNLASDTFGFGSSFNPGIVFGCMNSSYST 186
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA 327
N GL+G+ GS+SLV QL FSYC+ G+ SG L+ G G +
Sbjct: 187 NSESDSNTTGLMGMNLGSLSLVSQLKIP---KFSYCI--SGSDFSGILLLGESNFSWGGS 241
Query: 328 --WVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
+ PLV+ + P F Y V L G+ + + IS +LF G + D GT
Sbjct: 242 LNYTPLVQISTPLPYFDRSAYTVRLEGIKISDKLLNISGNLFVPDHTGAGQTMFDLGTQF 301
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCYNLSGFVS--VRVPTVSFYF 432
+ L P Y A RD F+ QT RA F D CY + S +P+VS F
Sbjct: 302 SYLLGPVYNALRDEFLNQTNGTLRALDDPNFVFQIAMDLCYRVPVNQSELPELPSVSLVF 361
Query: 433 SG------GPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLS--IIGNIQQEGIQISFDG 483
G G L F+ D +CF F S G+ IIG+ Q+ + + FD
Sbjct: 362 EGAEMRVFGDQLLYRVPGFVWGNDSV--YCFTFGNSDLLGVEAFIIGHHHQQSMWMEFDL 419
Query: 484 ANGFVGFGPNVC 495
VG C
Sbjct: 420 VEHRVGLAHARC 431
>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
Length = 334
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 100/312 (32%), Positives = 142/312 (45%), Gaps = 29/312 (9%)
Query: 196 PVFDPADSASFSGVSCSSAVCDRLENAGCH--------AGRCRYEVSYGDGS----YTKG 243
P+ P S+S + V+C C L C +G C Y +YG+ YT+G
Sbjct: 13 PLLYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEG 72
Query: 244 TLALETLTIGR--TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFS 301
L ET T G +A GC +++G F +GL+GLG G +SLV QL + AF
Sbjct: 73 ILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVE---AFG 129
Query: 302 YCLVSRGTGSSGSLVFGREALPVGA-----AWVPLVRNP--RAPSFYYVGLSGLGVGGMR 354
Y L S + S + FG A G PL+ NP + FYYVGL+G+ VGG
Sbjct: 130 YRLSSDLSAPS-PISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKL 188
Query: 355 IPISEDLFRLTQ-MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDT 413
+ I F + G GV+ D+GT +T LP PAY RD ++Q G + D
Sbjct: 189 VQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDL 248
Query: 414 CYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV---DDAGTFCFAFAPSPSGLSIIG 470
G + P++ +F GG + L N+L + + C++ S L+IIG
Sbjct: 249 ICFTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIG 308
Query: 471 NIQQEGIQISFD 482
NI Q + FD
Sbjct: 309 NIMQMDFHVVFD 320
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 100/373 (26%), Positives = 163/373 (43%), Gaps = 43/373 (11%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
G Y+ +IG+G+P + Y+ +D+GSDI+WV C C +C K S +++ +S +
Sbjct: 76 GLYYAKIGIGTPTKDYYVQVDTGSDIMWVNCIQCRECPKTSSLGIDLTLYNINESDTGKL 135
Query: 209 VSCSSAVCDRLENA---GCHAGR-CRYEVSYGDGSYTKGTLALETLTIGR------TVVK 258
V C C + GC A C Y YGDGS T G + + R T
Sbjct: 136 VPCDQEFCYEINGGQLPGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLKTTAA 195
Query: 259 N--VAIGCGHKNQGMF-----VGAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGT 309
N V GCG + G G+LG G + S++ QL G+ F++CL GT
Sbjct: 196 NGSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCL--DGT 253
Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
G V G P PL+ P P Y V ++ + VG + + D+F + GD
Sbjct: 254 NGGGIFVIGHVVQP-KVNMTPLI--PNQPH-YNVNMTAVQVGHEFLSLPTDVF---EAGD 306
Query: 370 -DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
G ++D+GT + LP Y+ ++Q +L + V TC+ S + P V
Sbjct: 307 RKGAIIDSGTTLAYLPEMVYKPLVSKIISQQPDL-KVHTVRDEYTCFQYSDSLDDGFPNV 365
Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS------PSGLSIIGNIQQEGIQISFD 482
+F+F +L + +L P + G +C + S ++++G++ + +D
Sbjct: 366 TFHFENSVILKVYPHEYLFPFE--GLWCIGWQNSGVQSRDRRNMTLLGDLVLSNKLVLYD 423
Query: 483 GANGFVGFGPNVC 495
N +G+ C
Sbjct: 424 LENQAIGWTEYNC 436
>gi|367066719|gb|AEX12643.1| hypothetical protein 2_5918_01 [Pinus radiata]
Length = 137
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 81/127 (63%)
Query: 144 DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADS 203
DV + + G+GE+ +++ +G P + ++D+GSD+ W QC PCS CYKQ P++DP+ S
Sbjct: 9 DVQAPVSAGNGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCIPCSDCYKQPTPIYDPSLS 68
Query: 204 ASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIG 263
+++ VSC S++C L + C + C Y +YGD S T+G L+ ET T+ + ++A G
Sbjct: 69 STYGTVSCKSSLCLALPASACISATCEYLYTYGDYSSTQGILSYETFTLSSQSIPHIAFG 128
Query: 264 CGHKNQG 270
CG N+G
Sbjct: 129 CGQDNEG 135
>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 492
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 113/403 (28%), Positives = 157/403 (38%), Gaps = 98/403 (24%)
Query: 171 MVIDSGSDIVWVQCQP--CSQCYKQ---------SDPVFDPADSASFSGVSCSSAVCDRL 219
+ +D+GSD+VW C P C C + S+P+ P DS + C+S C
Sbjct: 100 LFLDTGSDLVWFPCAPFTCMLCEGKPTPPGNNNSSNPLPPPTDSRR---IPCASPFCSAA 156
Query: 220 ENAG-----CHAGRCRYE-----------------VSYGDGSYTKGTLALETLTIGRTVV 257
++ C A RC + +YGDGS V
Sbjct: 157 HSSAPPADLCAAARCPLDDIETGSCAASHACPPLYYAYGDGSLVARLRRGRVGIAASVAV 216
Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLG-GQTGGAFSYCLVSRGTGS----- 311
+N C H G VG AG G G +SL QL G FSYCLV+ +
Sbjct: 217 ENFTFACAHTALGEPVGVAGF---GRGPLSLPAQLAPAALSGRFSYCLVAHSFRADRPIR 273
Query: 312 SGSLVFGR-----EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
L+ GR A G + PL+ NP+ P FY V L + VGG RIP +L R+ +
Sbjct: 274 PSPLILGRSPGEDPASETGIVYTPLLHNPKHPYFYSVALEAVSVGGTRIPARPELGRVGR 333
Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAF---------------VAQTGNLPRASGVSIF 411
GD G+V+D+GT T LP Y + F QTG P +
Sbjct: 334 AGDGGMVVDSGTTFTMLPNETYARVAEEFGRAMAAARFERAEAAEDQTGLAP----CYYY 389
Query: 412 DTCYNLSGFVSVR-VPTVSFYFSGGPVLTLPASNFLIPV------------------DDA 452
D + + S R VP ++ +F G + LP N+ + DD
Sbjct: 390 DHDASAAEEGSARAVPPLAMHFRGEATVVLPRRNYFMGFRSEERRRVGCLMLMNGGEDDG 449
Query: 453 GTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
G P+G +GN QQ+G ++ +D G VGF C
Sbjct: 450 G--------GPAG--TLGNFQQQGFEVVYDVDAGRVGFARRRC 482
>gi|125555054|gb|EAZ00660.1| hypothetical protein OsI_22681 [Oryza sativa Indica Group]
Length = 337
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 105/353 (29%), Positives = 151/353 (42%), Gaps = 44/353 (12%)
Query: 171 MVIDSGSDIVWVQCQPC---SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG 227
M D+G I +C C + C + FDP+ S++F+ V C S C +GC +G
Sbjct: 1 MAFDTGLGISLARCAACRPGAPCDGLAS--FDPSRSSTFAPVPCGSPDC----RSGCSSG 54
Query: 228 RCRYEVSYGDGSYTKGTLALETLTIGRTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSM 286
+ G +A + LT+ + V + GC + G +GAAGLL L S
Sbjct: 55 STP-SCPLTSFPFLSGAVAQDVLTLTPSASVDDFTFGCVEGSSGEPLGAAGLLDLSRDSR 113
Query: 287 SLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG-----AAWVPLVRNPRAPSFY 341
SL +L GG FSYCL T S G LV G +P A PLV +P P+ Y
Sbjct: 114 SLASRLAAGAGGTFSYCLPLSTTSSHGFLVIGEADVPHNRSARVTAVAPLVYDPAFPNHY 173
Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
+ L+G+ +GG IPI +V+DT T + Y RDAF
Sbjct: 174 VIDLAGVSLGGRDIPIPPHA---------AMVLDTALPYTYMKPSMYAPLRDAFRRAMAR 224
Query: 402 LPRASGVSIFDTCYNLSGFV-SVRVPTVSFYF-------SGGPVLTLPASNFLIPVDDAG 453
PRA + DTCYN +G V +P V F G + ++ ++ + + G
Sbjct: 225 YPRAPAMGDLDTCYNFTGVRHEVLIPLVHLTFRGISGGGGGEGQVLGLGADQMLYMSEPG 284
Query: 454 TF----CFAFAPSPSG-------LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
F C AFA PS ++G + Q +++ D G +GF P C
Sbjct: 285 NFFSVTCLAFAALPSDGDAAAPLAMVMGTLAQSSMEVVHDVQGGKIGFIPGSC 337
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 88/358 (24%), Positives = 157/358 (43%), Gaps = 27/358 (7%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+G Y R+ +G+PP+ +++D+GS + +V C C C + DP F P S ++ V C+
Sbjct: 86 NGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCEHCGRHQDPKFQPDLSETYQPVKCT 145
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHKNQ 269
N +C Y+ Y + S + G L + ++ G + GC +
Sbjct: 146 PDC-----NCDGDTNQCMYDRQYAEMSSSSGVLGEDVVSFGNLSELAPQRAVFGCENDET 200
Query: 270 GMFVG--AAGLLGLGGGSMSLVGQLGGQT--GGAFSYCLVSRGTGSSGSLVFGREALPVG 325
G A G++GLG G +S++ QL + +FS C G G+++ G + P
Sbjct: 201 GDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVG-GGAMILGGISPPED 259
Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
+ +P +Y + L + V G ++ ++ +F G G V+D+GT LP
Sbjct: 260 MVFT--HSDPDRSPYYNINLKEMHVAGKKLQLNPKVFD----GKHGTVLDSGTTYAYLPE 313
Query: 386 PAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGF----VSVRVPTVSFYFSGGPVLT 439
A+ AF+ A + + +L + +G + D C+ +G ++ P V F G L+
Sbjct: 314 TAFLAFKRAIMKERNSLKQINGPDPNYKDICFTGAGIDVSQLAKSFPVVDMVFENGHKLS 373
Query: 440 LPASNFLIPVDDA-GTFCF-AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L N+L G +C F+ +++G I + +D N +GF C
Sbjct: 374 LSPENYLFRHSKVRGAYCLGVFSNGRDPTTLLGGIFVRNTLVMYDRENSKIGFWKTNC 431
>gi|222613193|gb|EEE51325.1| hypothetical protein OsJ_32293 [Oryza sativa Japonica Group]
Length = 371
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 97/328 (29%), Positives = 144/328 (43%), Gaps = 29/328 (8%)
Query: 183 QCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTK 242
C C C+KQ PVF P S++F C + VC + C + C Y+ G G +T
Sbjct: 54 NCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPTPKCASDVCAYDGVTGLGGHTV 113
Query: 243 GTLALETLTIGRTV-VKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAF 300
G +A +T IG + A G + + G +G +GLG SLV Q+ F
Sbjct: 114 GIVATDTFAIGTAAPARPPASGASWRATSTPWAGPSGFIGLGRTPWSLVAQMKLTR---F 170
Query: 301 SYCLVSRGTGSSGSLVFGREA-LPVGAAWVPLVR---NPRAPSFYYVGLSGLGVGGMRIP 356
SYCL TG + L G A L G AW P V+ N +Y + L + G I
Sbjct: 171 SYCLAPHDTGKNSRLFLGASAKLAGGGAWTPFVKTSPNDGMSQYYPIELEEIKAGDATIT 230
Query: 357 ISEDLFRLTQMGDDGVVMDTGTA-VTRLPTPAYEAFRDAFVAQTGNLPRASGV-SIFDTC 414
+ G + V++ T V+ L Y+ F+ A +A G P A+ V + F+ C
Sbjct: 231 MPR--------GRNTVLVQTAVVRVSLLVDSVYQEFKKAVMASVGAAPTATPVGAPFEVC 282
Query: 415 YNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA-------PSPSGLS 467
+ +G P + F F G LT+P +N+L V + T C + + GL+
Sbjct: 283 FPKAGVSG--APDLVFTFQAGAALTVPPANYLFDVGN-DTVCLSVMSIALLNITALDGLN 339
Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
I+G+ QQE + + FD + F P C
Sbjct: 340 ILGSFQQENVHLLFDLDKDMLSFEPADC 367
>gi|125564663|gb|EAZ10043.1| hypothetical protein OsI_32347 [Oryza sativa Indica Group]
Length = 330
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 106/341 (31%), Positives = 160/341 (46%), Gaps = 39/341 (11%)
Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
+V D+ SD++W QCQPC C Q+ ++DP + +++ ++ S+
Sbjct: 5 LVFDTTSDLLWTQCQPCLSCVAQAGDMYDPNKTETYANLTSSN----------------- 47
Query: 231 YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVG 290
Y +Y S+T G A ET +G V N+ GCG +NQG + AG+ G+G G +SL+
Sbjct: 48 YNYTYSKQSFTSGYFATETFALGNVTVANITFGCGTRNQGYYDNVAGVFGVGRGGVSLLN 107
Query: 291 QLGGQTGGAFSYCLVSRGTGSSGSLVFG------REALPVGAAWVPLVRNPRAPSFYYVG 344
QLG FSYC S G S ++ G A AA P+V +P S Y+V
Sbjct: 108 QLGIDR---FSYCFSSSGAPGSSAVFLGGSPELATNATTTPAASTPMVADPVLKSGYFVK 164
Query: 345 LSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR 404
L G+ VG R+ ++ + G +V+D+ + VT L Y R A VAQ L
Sbjct: 165 LVGVTVGATRVDVAGA--SSAEGGGRALVIDSTSPVTVLDEATYGPVRRALVAQLAPLKE 222
Query: 405 A-----SGVSIFDTCYNLSGFVSVRVP---TVSFYFSGGPV-LTLPASNFLIPVDDAGTF 455
A +GV + D C+ L+ + P T++ +F GG L LP +N+L G
Sbjct: 223 ANANASAGVGL-DLCFELAAGGATPTPPNVTMTLHFDGGAADLVLPPANYLAKDSAGGLI 281
Query: 456 CFAFAPSPS-GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C PS S G+ ++G+ + +D A V F P C
Sbjct: 282 CLTMTPSSSNGVPVLGSSALLDTLVLYDLAKNVVSFQPLDC 322
>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 430
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 98/362 (27%), Positives = 162/362 (44%), Gaps = 33/362 (9%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
+ + +G+PP++Q MV+D+GS + W+QC + + FDP+ S+SFS + CS +C
Sbjct: 74 ISLPIGTPPQAQQMVLDTGSQLSWIQCH-RKKLPPKPKTSFDPSLSSSFSTLPCSHPLCK 132
Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQG 270
D C + R C Y Y DG++ +G L E +T T + + +GC ++
Sbjct: 133 PRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEITPPLILGCATESS- 191
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----GTGSSGSLVFGREALPVGA 326
G+LG+ G +S V Q FSYC+ + G +GS G G
Sbjct: 192 ---DDRGILGMNRGRLSFVSQ---AKISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGF 245
Query: 327 AWVPLVRNPRA-------PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
+V L+ P + P Y V + G+ G ++ IS +FR G ++D+G+
Sbjct: 246 KYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGSGQTMVDSGSE 305
Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGV--SIFDTCYNLS-GFVSVRVPTVSFYFSGGP 436
T L AY+ R + + G + V D C++ + + + + F F+ G
Sbjct: 306 FTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDGNVAMIPRLIGDLVFVFTRGV 365
Query: 437 VLTLPASNFLIPVDDAGTFCFAFAPSP---SGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
+ +P L+ V G C S + +IIGN+ Q+ + + FD N VGF
Sbjct: 366 EILVPKERVLVNV-GGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFAKA 424
Query: 494 VC 495
C
Sbjct: 425 DC 426
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 99/380 (26%), Positives = 165/380 (43%), Gaps = 43/380 (11%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
SG G Y+ +IG+G+PP++ Y+ +D+GSDI+WV C C +C +S ++D
Sbjct: 74 SGRPDAVGLYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSSLGMDLTLYDIK 133
Query: 202 DSASFSGVSCSSAVCDRLEN---AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGR--- 254
+S+S V C C + GC A C Y YGDGS T G + + +
Sbjct: 134 ESSSGKLVPCDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSG 193
Query: 255 -----TVVKNVAIGCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQLG--GQTGGAFSY 302
+ ++ GCG + G + G+LG G + S++ QL G+ F++
Sbjct: 194 DLKTDSANGSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAH 253
Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
CL G G G P PL+ P P Y V ++ + VG + +S D
Sbjct: 254 CL--NGVNGGGIFAIGHVVQP-KVNMTPLL--PDQPH-YSVNMTAVQVGHTFLSLSTD-- 305
Query: 363 RLTQMGD-DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV 421
+ GD G ++D+GT + LP YE ++Q +L + + TC+ S V
Sbjct: 306 -TSAQGDRKGTIIDSGTTLAYLPEGIYEPLVYKMISQHPDL-KVQTLHDEYTCFQYSESV 363
Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS------PSGLSIIGNIQQE 475
P V+F+F G L + ++L P +C + S ++++G++
Sbjct: 364 DDGFPAVTFFFENGLSLKVYPHDYLFP--SVNFWCIGWQNSGTQSRDSKNMTLLGDLVLS 421
Query: 476 GIQISFDGANGFVGFGPNVC 495
+ +D N +G+ C
Sbjct: 422 NKLVFYDLENQAIGWAEYNC 441
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 104/390 (26%), Positives = 169/390 (43%), Gaps = 37/390 (9%)
Query: 124 RRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQ 183
R+L + + H D++ +G Y R+ +G+PP+ +++DSGS + +V
Sbjct: 66 RKLHKSDSKSLPHSRMRLYDDLLI-----NGYYTTRLWIGTPPQMFALIVDSGSTVTYVP 120
Query: 184 CQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYT 241
C C QC K DP F P S+++ V C+ + C R C YE Y + S +
Sbjct: 121 CSDCEQCGKHQDPKFQPEMSSTYQPVKCNM-------DCNCDDDREQCVYEREYAEHSSS 173
Query: 242 KGTLALETLTIG---RTVVKNVAIGCGHKNQGMFVG--AAGLLGLGGGSMSLVGQL--GG 294
KG L + ++ G + + GC G A G++GLG G +SLV QL G
Sbjct: 174 KGVLGEDLISFGNESQLTPQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQLVDKG 233
Query: 295 QTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMR 354
+F C G GS++ G P + +P +Y + L+G+ V G +
Sbjct: 234 LISNSFGLCYGGMDVG-GGSMILGGFDYPSDMVFTD--SDPDRSPYYNIDLTGIRVAGKQ 290
Query: 355 IPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG--VSIFD 412
+ + +F G+ G V+D+GT LP A+ AF +A + + L + G + D
Sbjct: 291 LSLHSRVFD----GEHGAVLDSGTTYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPNFKD 346
Query: 413 TCYNL--SGFVSVR---VPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFAPS-PSG 465
TC+ + S +VS P+V F G L N++ G +C P+
Sbjct: 347 TCFQVAASNYVSELSKIFPSVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDH 406
Query: 466 LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+++G I + +D N VGF C
Sbjct: 407 TTLLGGIVVRNTLVVYDRENSKVGFWRTNC 436
>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
Length = 430
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 98/362 (27%), Positives = 162/362 (44%), Gaps = 33/362 (9%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
+ + +G+PP++Q MV+D+GS + W+QC + + FDP+ S+SFS + CS +C
Sbjct: 74 ISLPIGTPPQAQQMVLDTGSQLSWIQCH-RKKLPPKPKTSFDPSLSSSFSTLPCSHPLCK 132
Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQG 270
D C + R C Y Y DG++ +G L E +T T + + +GC ++
Sbjct: 133 PRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEITPPLILGCATESS- 191
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----GTGSSGSLVFGREALPVGA 326
G+LG+ G +S V Q FSYC+ + G +GS G G
Sbjct: 192 ---DDRGILGMNRGRLSFVSQ---AKISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGF 245
Query: 327 AWVPLVRNPRA-------PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
+V L+ P + P Y V + G+ G ++ IS +FR G ++D+G+
Sbjct: 246 KYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGSGQTMVDSGSE 305
Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGV--SIFDTCYNLS-GFVSVRVPTVSFYFSGGP 436
T L AY+ R + + G + V D C++ + + + + F F+ G
Sbjct: 306 FTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDGNVAMIPRLIGDLVFVFTRGV 365
Query: 437 VLTLPASNFLIPVDDAGTFCFAFAPSP---SGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
+ +P L+ V G C S + +IIGN+ Q+ + + FD N VGF
Sbjct: 366 EIFVPKERVLVNV-GGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFAKA 424
Query: 494 VC 495
C
Sbjct: 425 DC 426
>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 160/383 (41%), Gaps = 48/383 (12%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQC-----YKQSDPVFDPADSAS 205
G + + + G+PP+ ++D+GS +VW C C+ C + P+F+P S+S
Sbjct: 85 GGHSIPLSFGTPPQKLSFLVDTGSHVVWAPCTTHYTCTNCSFSDAEPKKVPIFNPKLSSS 144
Query: 206 FSGVSCSSAVCDRLENAGCHAG---------RCR-----YEVSYGDGSYTKGTLALETLT 251
+ C + C + H G C Y + YG G+ + G LE L
Sbjct: 145 SKILGCRNPKCVNTSSPDVHLGCPPCNGNSKNCSHACPPYSLQYGTGA-SSGDFLLENLN 203
Query: 252 IGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG--- 308
+ +GC G V +A L G G SL Q+G + F+YCL S
Sbjct: 204 FPGKTIHEFLVGCTTSAVGE-VTSAALAGFGRSMFSLPMQMGVK---KFAYCLNSHDYDD 259
Query: 309 TGSSGSLVFG-REALPVGAAWVPLVRNPRA-PSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
T +S L+ + G ++ P ++NP P +YY+G+ + +G + I
Sbjct: 260 TRNSSKLILDYSDGETKGLSYAPFLKNPPDFPIYYYLGVKDIKIGNKLLRIPSKYLAPGS 319
Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR---ASGVSIFDTCYNLSGFVSV 423
G G+++D+G A + P ++ + + R A CYN +G S+
Sbjct: 320 DGRGGLMIDSGFAYGYMTGPVFKKVTNELKKRMSKYRRSLEAEAEIGVTPCYNFTGQKSI 379
Query: 424 RVPTVSFYFSGGPVLTLPASNF--LIP---------VDDAGTFCFAFAPSPSGLSIIGNI 472
++P + + F GG + +P N+ LIP DAGT F P PS I+GN
Sbjct: 380 KIPDLIYQFRGGATMVVPGKNYFVLIPEISLACFPLTTDAGTNTLEFTPGPS--IILGNS 437
Query: 473 QQEGIQISFDGANGFVGFGPNVC 495
Q + FD N +GF C
Sbjct: 438 QHVDYYVEFDLKNERLGFRQQTC 460
>gi|21668075|gb|AAM74221.1|AF518565_1 putative chloroplast nucleoid DNA-binding protein [Brassica
oleracea]
Length = 165
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 66/170 (38%), Positives = 97/170 (57%), Gaps = 8/170 (4%)
Query: 328 WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPA 387
+ P+ SFY + + G+ VGG ++ I + +F G ++D+GT ++RLP A
Sbjct: 1 FTPISTITDGTSFYGLDIVGISVGGQKLAIPQTVFS-----TPGALIDSGTVISRLPPKA 55
Query: 388 YEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
Y A R AF A+ S VSI DTC++L+GF +V +PTVSFYF+GG V+ L + L
Sbjct: 56 YAALRGAFKAKMSQYKNTSAVSILDTCFDLTGFKTVTIPTVSFYFNGGAVVELGSKGVLY 115
Query: 448 PVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
C AFA + +I GN+QQ+ +++ +DGA G VGF PN C
Sbjct: 116 AF-KMSQVCLAFAGNSDDNNAAIFGNVQQQTLEVVYDGAAGRVGFAPNGC 164
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 92/369 (24%), Positives = 156/369 (42%), Gaps = 49/369 (13%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+G Y R+ +G+PP+ +++D+GS + +V C C QC + DP F P S+++ V C+
Sbjct: 10 NGYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSSCEQCGRHQDPKFQPDLSSTYQSVKCN 69
Query: 213 -SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR---TVVKNVAIGCGHKN 268
CD +C YE Y + S + G L + ++ G + GC +
Sbjct: 70 IDCNCDD------EKQQCVYERQYAEMSTSSGVLGEDIISFGNLSALAPQRAVFGCENME 123
Query: 269 QGMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYC----------LVSRGTGSSGS 314
G A G++G+G G +S+V L G +FS C +V G +
Sbjct: 124 TGDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGGAMVLGGISPPSN 183
Query: 315 LVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
+VF + +P +Y + L + V G +P++ +F G G ++
Sbjct: 184 MVFSQS-------------DPVRSPYYNIDLKEIHVAGKPLPLNPTVFD----GKHGTIL 226
Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLP--RASGVSIFDTCYNLSGF----VSVRVPTV 428
D+GT LP A+ +F+DA + + +L R + D C++ +G +S P V
Sbjct: 227 DSGTTYAYLPEAAFVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAGSDISQLSSSFPAV 286
Query: 429 SFYFSGGPVLTLPASNFLIPVDDA-GTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANG 486
F G L L N+L G +C F +++G I + +D N
Sbjct: 287 EMVFGNGQKLLLSPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDRENS 346
Query: 487 FVGFGPNVC 495
+GF C
Sbjct: 347 KIGFWKTNC 355
>gi|242076594|ref|XP_002448233.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
gi|241939416|gb|EES12561.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
Length = 508
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 118/394 (29%), Positives = 158/394 (40%), Gaps = 75/394 (19%)
Query: 171 MVIDSGSDIVWVQCQP--CSQCYKQSDPVFDPADSASFSG--------VSCSSAVC---- 216
+ +D+GSD+VW C P C C + P + SA V C+S +C
Sbjct: 111 LFLDTGSDLVWFPCAPFTCMLCEGKPTPSGGHSSSAPLPLPPPPDSRRVPCASPLCSAAH 170
Query: 217 ------DRLENAGC-----HAGRCR--------YEVSYGDGSYTKGTLALETLTIGRTV- 256
D AGC G CR +YGDGS L + +G +V
Sbjct: 171 ASAPPSDLCAAAGCPLEDIETGSCRGASHACPPLYYAYGDGSLV-AHLRRGRVGLGASVA 229
Query: 257 VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS---- 312
V N C H G VG AG G G +SL GQL Q G FSYCLVS +
Sbjct: 230 VDNFTFACAHTALGEPVGVAGF---GRGPLSLPGQLAPQLSGRFSYCLVSHSFRADRLIR 286
Query: 313 -GSLVFGRE----ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
L+ GR A G + PL+ NP+ P FY V L + VG RI +L R+ +
Sbjct: 287 PSPLILGRSPDAAAETGGFVYTPLLHNPKHPYFYSVALEAVSVGATRIQARPELARVDRA 346
Query: 368 GDDGVVMDTGTAVTRLPTPAYE-----AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS 422
G+ G+V+D+GT T LP Y R A RA + CY+ + S
Sbjct: 347 GNGGMVVDSGTTFTMLPNETYARVAEAFARAMAAAGFARAERAEEQTGLTPCYHYA--AS 404
Query: 423 VR-VPTVSFYFSGGPVLTLPASNFLIPV------------DDAGTFCFAFAPSPSG---- 465
R VP ++ +F G + LP N+ + DD G SG
Sbjct: 405 DRGVPPLALHFRGNATVALPRRNYFMGFKSEEEAGGAGRKDDVGCLMLMNGGDVSGEDGG 464
Query: 466 ----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+GN QQ+G ++ +D G VGF C
Sbjct: 465 DDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 498
>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 102/365 (27%), Positives = 163/365 (44%), Gaps = 35/365 (9%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV--FDPADSASFSGVSCSSAV 215
+ + +G+P +SQ +V+D+GS + W+QC P P FDP+ S+SFS + CS +
Sbjct: 82 LSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPL 141
Query: 216 C-----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKN 268
C D C + R C Y Y DG++ +G L E T + + +GC ++
Sbjct: 142 CKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPPLILGCAKES 201
Query: 269 QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----GTGSSGSLVFGREALPV 324
G+LG+ G +S + Q FSYC+ +R G S+GS G
Sbjct: 202 ----TDEKGILGMNLGRLSFISQ---AKISKFSYCIPTRSNRPGLASTGSFYLGDNPNSR 254
Query: 325 GAAWVPLVRNPRA-------PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
G +V L+ P++ P Y V L G+ +G R+ I +FR G ++D+G
Sbjct: 255 GFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTMVDSG 314
Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGV--SIFDTCY--NLSGFVSVRVPTVSFYFS 433
+ T L AY+ ++ V G+ + V S D C+ N S + + + F F
Sbjct: 315 SEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLIGDLVFEFG 374
Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSP---SGLSIIGNIQQEGIQISFDGANGFVGF 490
G + + + L+ V G C S + +IIGN+ Q+ + + FD N VGF
Sbjct: 375 RGVEILVEKQSLLVNV-GGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGF 433
Query: 491 GPNVC 495
C
Sbjct: 434 SKAEC 438
>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 407
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 111/373 (29%), Positives = 165/373 (44%), Gaps = 44/373 (11%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
V + VG+PP++ MVID+GS++ W+ C + F+ S S+ + CSS+ C
Sbjct: 33 VSLTVGTPPQNVSMVIDTGSELSWLYCNKTTTTTSYPT-TFNQTRSISYRPIPCSSSTCT 91
Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK---- 267
D A C + C +SY D S ++G LA +T +G + + + GC
Sbjct: 92 NQTRDFSIPASCDSNSLCHATLSYADASSSEGNLASDTFHMGASDIPGMVFGCMDSVFSS 151
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE----ALP 323
N GL+G+ GS+S V Q+G FSYC+ GT SG L+ G A+P
Sbjct: 152 NSDEDSKNTGLMGMNRGSLSFVSQMGFP---KFSYCI--SGTDFSGMLLLGESNFTWAVP 206
Query: 324 VGAAWVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
+ + PLV+ + P F Y V L G+ V +PI + +F G ++D+GT
Sbjct: 207 LN--YTPLVQISTPLPYFDRIAYTVQLEGIKVSDRLLPIPKSVFEPDHTGAGQTMVDSGT 264
Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCYN--LSGFVSVRVPTVSF 430
T L PAY A R F+ QT R F D CY +S V R+PTVS
Sbjct: 265 QFTFLLGPAYTALRSEFLNQTTGFLRVLEDPDFVFQGAMDLCYRVPISQRVLPRLPTVSL 324
Query: 431 YFSGGPVLTLPASNFLIPVD-----DAGTFCFAFAPSP---SGLSIIGNIQQEGIQISFD 482
F+G +T+ L V + C +F S +IG+ Q+ + + FD
Sbjct: 325 VFNGAE-MTVADERVLYRVPGEIRGNDSVHCLSFGNSDLLGVEAYVIGHHHQQNVWMEFD 383
Query: 483 GANGFVGFGPNVC 495
+G C
Sbjct: 384 LERSRIGLAQVRC 396
>gi|255647724|gb|ACU24323.1| unknown [Glycine max]
Length = 334
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 85/246 (34%), Positives = 123/246 (50%), Gaps = 8/246 (3%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
+ SG G Y VR+ +G+P + +MV+D+ +D +V C C+ C SD F P S
Sbjct: 89 IASGQTFNIGNYVVRVKLGTPGQLLFMVLDTSTDEAFVPCSGCTGC---SDATFSPKAST 145
Query: 205 SFSGVSCSSAVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA 261
S+ + CS C ++ C A G C + SY S++ TL ++L + V+ N +
Sbjct: 146 SYGPLDCSVPQCGQVRGLSCPATGTGACSFNQSYAGSSFS-ATLVQDSLRLATDVIPNYS 204
Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGRE 320
GC + G V A GLLGLG G +SL+ Q G G FSYCL S + SGSL
Sbjct: 205 FGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFSYCLPSFKSYYFSGSLKLRPV 264
Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
P PL+R+P PS YYV +G+ VG + +P + G ++D+GT +
Sbjct: 265 GQPKSIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFPSEYLGFNPNTGSGTIIDSGTVI 324
Query: 381 TRLPTP 386
TR P
Sbjct: 325 TRFVEP 330
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 96/359 (26%), Positives = 156/359 (43%), Gaps = 29/359 (8%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+G Y R+ +G+PP+ +++D+GS + +V C C C DP F P S ++ V C+
Sbjct: 90 NGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCKHCGSHQDPKFRPEASETYQPVKCT 149
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHKNQ 269
N +C YE Y + S + G L + ++ G + GC +
Sbjct: 150 WQC-----NCDDDRKQCTYERRYAEMSTSSGVLGEDVVSFGNQSELSPQRAIFGCENDET 204
Query: 270 GMFVG--AAGLLGLGGGSMSLVGQLGGQT--GGAFSYCLVSRGTGSSGSLVFGREALPVG 325
G A G++GLG G +S++ QL + AFS C G G ++ G +
Sbjct: 205 GDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMGVGGGAMVLGG---ISPP 261
Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
A V +P +Y + L + V G R+ ++ +F G G V+D+GT LP
Sbjct: 262 ADMVFTHSDPVRSPYYNIDLKEIHVAGKRLHLNPKVFD----GKHGTVLDSGTTYAYLPE 317
Query: 386 PAYEAFRDAFVAQTGNLPRASGVSIF--DTCY-----NLSGFVSVRVPTVSFYFSGGPVL 438
A+ AF+ A + +T +L R SG D C+ N+S +S P V F G L
Sbjct: 318 SAFLAFKHAIMKETHSLKRISGPDPHYNDICFSGAEINVSQ-LSKSFPVVEMVFGNGHKL 376
Query: 439 TLPASNFLIPVDDA-GTFCF-AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+L N+L G +C F+ +++G I + +D + +GF C
Sbjct: 377 SLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHSKIGFWKTNC 435
>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
Length = 486
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 104/400 (26%), Positives = 170/400 (42%), Gaps = 67/400 (16%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ----PCSQC--YKQSDPVF----------- 198
Y + + +G+PP+ +++D+GSD+ WV C C +C Y+ + +
Sbjct: 82 YLISLNIGTPPQVIQVLMDTGSDLTWVPCGNLSFDCMECDDYRNNKLMATFSPSYSSSSY 141
Query: 199 ----------------DPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTK 242
+P D+ + +G S S+ V A C + +YG G
Sbjct: 142 RASCASPFCIDIHSSDNPLDTCTVAGCSLSTLV-----KATCSRPCPSFAYTYGAGGVVT 196
Query: 243 GTLALETLTIGRT---VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA 299
G L +TL + + V K + C + G+ G G G++S+V QLG G
Sbjct: 197 GILTRDTLRVNGSSPGVAKEIPKFCFGCVGSAYREPIGIAGFGRGTLSMVSQLGFLQKG- 255
Query: 300 FSYCLV----SRGTGSSGSLVFGREALPVG--AAWVPLVRNPRAPSFYYVGLSGLGVGGM 353
FS+C + + S LV G AL + P++ +P P+FYYVGL + VG +
Sbjct: 256 FSHCFLAFKYANNPNISSPLVVGDIALTSKDDMQFTPMLNSPMYPNFYYVGLEAITVGNV 315
Query: 354 R-IPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-- 410
+ L +G+ G+ +D+GT T LP P Y + + T N PR +G+ +
Sbjct: 316 SATEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQVL-SILQSTINYPRDTGMEMQT 374
Query: 411 -FDTCY------NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG----TFCFAF 459
FD CY N + +P+++F+F L LP N PV G C F
Sbjct: 375 GFDLCYKVPRPNNNTLTSDDLLPSITFHFLNNVSLVLPQGNHFYPVSAPGNPAVVKCLMF 434
Query: 460 APSPSG----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ G + G+ QQ+ +++ +D +GF P C
Sbjct: 435 QSTDDGDDGPAGVFGSFQQQNVEVVYDLEKERIGFQPMDC 474
>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 424
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 116/445 (26%), Positives = 196/445 (44%), Gaps = 49/445 (11%)
Query: 74 SDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRL---SGGG 130
S+E + L+H D S ++ H + AR++ V R + + L +
Sbjct: 3 SNEVGFTARLIHHDSPLSP--------FYNHTMTDTARIEATVHRSRSRLNYLYYINKLS 54
Query: 131 ADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQ 189
+A ++V T V G GEY + +G+P +D+ + ++WVQC C SQ
Sbjct: 55 ENALDNDVSLSPTLVNEG-----GEYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCNSQ 109
Query: 190 CYKQSDPV---FDPADSASFSGVSCSSAVCDRL---ENAGCHAGRCRYEVSYGDGSYTKG 243
C + + F + S ++ C S C+ L + C+Y + YGD T G
Sbjct: 110 CEPEKRGLTTKFLSSKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNKATSG 169
Query: 244 TLALETL----TIGRTV-VKNVAIGCGHKN-QGMFVGAAGLLGLGGGSMSLVGQLGGQTG 297
L+ ++ + G V V + GC G G +GL +SL+ QLG +
Sbjct: 170 ILSSDSFGFDTSDGMLVDVGFLNFGCSEAPLTGDEQSYTGNVGLNQTPLSLISQLGIK-- 227
Query: 298 GAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP 356
FSYCLV GS+ + FG +LPV + + P + + YYV + G+ +G P
Sbjct: 228 -KFSYCLVPFNNLGSTSKMYFG--SLPVTSGGQTPLLYPNSDA-YYVKVLGISIGNDE-P 282
Query: 357 ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVA-----QTGNLPRASGVSIF 411
+ +F + ++ D G ++DTG + L T A+++ F+ Q + P+ F
Sbjct: 283 HFDGVFDVYEVRD-GWIIDTGITYSSLETDAFDSLLAKFLTLKDFPQRKDDPKER----F 337
Query: 412 DTCYNLSGFVSVR-VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIG 470
+ C+ L + P V+ +F G ++ S F + ++D G FC A S S +SI+G
Sbjct: 338 ELCFELQNANDLESFPDVTVHFDGADLILNVESTF-VKIEDDGIFCLALLRSGSPVSILG 396
Query: 471 NIQQEGIQISFDGANGFVGFGPNVC 495
N Q + + +D + F P C
Sbjct: 397 NFQLQNYHVGYDLEAQVISFAPVDC 421
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 102/388 (26%), Positives = 167/388 (43%), Gaps = 33/388 (8%)
Query: 124 RRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQ 183
R+L + + H D++ +G Y R+ +G+PP+ +++DSGS + +V
Sbjct: 67 RKLHKSDSKSLPHSRMRLYDDLLI-----NGYYTTRLWIGTPPQMFALIVDSGSTVTYVP 121
Query: 184 CQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKG 243
C C QC K DP F P S+++ V C+ N +C YE Y + S +KG
Sbjct: 122 CSDCEQCGKHQDPKFQPELSSTYQPVKCNMDC-----NCDDDKEQCVYEREYAEHSSSKG 176
Query: 244 TLALETLTIG---RTVVKNVAIGCGHKNQGMFVG--AAGLLGLGGGSMSLVGQL--GGQT 296
L + ++ G + + GC G A G++GLG G +SLV QL G
Sbjct: 177 VLGEDLISFGNESQLTPQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLI 236
Query: 297 GGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP 356
+F C G GS++ G P + +P +Y + L+G+ V G ++
Sbjct: 237 SNSFGLCYGGMDVG-GGSMILGGFDYPSDMIFTD--SDPDRSPYYNIDLTGIRVAGKKLS 293
Query: 357 ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG--VSIFDTC 414
++ +F G+ G V+D+GT LP A+ AF +A + + L + G + DTC
Sbjct: 294 LNSRVFD----GEHGAVLDSGTTYAYLPDAAFAAFEEAVMREVSPLKQIDGPDPNFKDTC 349
Query: 415 Y-----NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFAPS-PSGLS 467
+ N +S P+V F G L N++ G +C P+ +
Sbjct: 350 FLVAASNDVSELSKIFPSVEMIFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTT 409
Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
++G I + +D N VGF C
Sbjct: 410 LLGGIVVRNTLVVYDRENSKVGFWRTNC 437
>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
Length = 493
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 111/403 (27%), Positives = 161/403 (39%), Gaps = 81/403 (20%)
Query: 164 SPPRSQYMVIDSGSDIVWVQCQP--CSQCYKQSDPVF----DPADSASFSGVSCSSAVC- 216
+PP+ + +D+GSD+VW C+P C C +++ P S++ V C S+ C
Sbjct: 91 NPPQHVSLYLDTGSDLVWFPCKPFECILCEGKAENTTASTPPPRLSSTARSVHCKSSACS 150
Query: 217 -------------------DRLENAGCHAGRC-RYEVSYGDGSYT--------KGTLALE 248
+ +E + CH+ C + +YGDGS K LA
Sbjct: 151 AAHSNLPTSDLCAIADCPLESIETSDCHSFSCPSFYYAYGDGSLVARLYHDSIKLPLATP 210
Query: 249 TLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGG---QTGGAFSYCLV 305
+L++ N GC H VG AG G G +SL QL Q G FSYCLV
Sbjct: 211 SLSL-----HNFTFGCAHTALAEPVGVAGF---GRGVLSLPAQLASFAPQLGNRFSYCLV 262
Query: 306 SRGTGSS-----GSLVFGR--------EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGG 352
S S L+ G V + ++ NP+ P FY VGL G+ +G
Sbjct: 263 SHSFNSDRLRLPSPLILGHSDDKEKRVNKDDVQFVYTSMLDNPKHPYFYCVGLEGISIGK 322
Query: 353 MRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL-PRASGVSI- 410
+IP E L R+ + G GVV+D+GT T LP Y + F + G + RA V
Sbjct: 323 KKIPAPEFLKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVGRVYERAKEVEDK 382
Query: 411 --FDTCYNLSGFVSVRVPTVSFYFSGGP-VLTLPASNFLIPVDDAG--------TFCFAF 459
CY V+ +P++ +F G + LP N+ D G C
Sbjct: 383 TGLGPCYYYDTVVN--IPSLVLHFVGNESSVVLPKKNYFYDFLDGGDGVRRKRRVGCLML 440
Query: 460 APSPSGLSI-------IGNIQQEGIQISFDGANGFVGFGPNVC 495
+ +GN QQ G ++ +D VGF C
Sbjct: 441 MNGGEEAELTGGPGATLGNYQQHGFEVVYDLEQRRVGFARRKC 483
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 106/367 (28%), Positives = 164/367 (44%), Gaps = 40/367 (10%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
+ + +GSPP++ MV+D+GS++ W+ C+ + F+P S+S++ C+S+VC
Sbjct: 61 ISLTIGSPPQNVTMVLDTGSELSWLHCKKLPNL----NSTFNPLLSSSYTPTPCNSSVCM 116
Query: 217 ----DRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC----GH 266
D A C C VSY D S +GTLA ET ++ GC G+
Sbjct: 117 TRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQPGTLFGCMDSAGY 176
Query: 267 KNQ-GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR-EALPV 324
+ GL+G+ GS+SLV Q+ FSYC+ G + G L+ G + P
Sbjct: 177 TSDINEDAKTTGLMGMNRGSLSLVTQM---VLPKFSYCI--SGEDAFGVLLLGDGPSAPS 231
Query: 325 GAAWVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
+ PLV +P F Y V L G+ V + + + +F G ++D+GT
Sbjct: 232 PLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDSGTQ 291
Query: 380 VTRLPTPAYEAFRDAFVAQT-GNLPRASGVSI-----FDTCYNLSGFVSVRVPTVSFYFS 433
T L P Y + +D F+ QT G L R + D CY+ ++ VP V+ FS
Sbjct: 292 FTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPASLAA-VPAVTLVFS 350
Query: 434 GGPVLTLPASNFLIPVDDA--GTFCFAFAPSP-SGLS--IIGNIQQEGIQISFDGANGFV 488
G + + L V +CF F S G+ +IG+ Q+ + + FD V
Sbjct: 351 GAE-MRVSGERLLYRVSKGRDWVYCFTFGNSDLLGIEAYVIGHHHQQNVWMEFDLVKSRV 409
Query: 489 GFGPNVC 495
GF C
Sbjct: 410 GFTETTC 416
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 108/374 (28%), Positives = 166/374 (44%), Gaps = 49/374 (13%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
V + VGSPP++ MV+D+GS++ W+ C+ + + VF+P S ++S V C S C
Sbjct: 71 VSLTVGSPPQNVTMVLDTGSELSWLHCKKT----QFLNSVFNPLSSKTYSKVPCLSPTCK 126
Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH----K 267
D C A + C VSY D + +G LA ET +G GC
Sbjct: 127 TRTRDLTIPVSCDATKLCHVIVSYADATSIEGNLAFETFRLGSLTKPATIFGCMDSGFSS 186
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV--G 325
N GL+G+ GS+S V Q+G FSYC+ G S+G L+ G + P
Sbjct: 187 NSEEDSKTTGLIGMNRGSLSFVNQMGYP---KFSYCI--SGFDSAGVLLLGNASFPWLKP 241
Query: 326 AAWVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
++ PLV+ + P F Y V L G+ V + + + +F G ++D+GT
Sbjct: 242 LSYTPLVQISTPLPYFDRVAYTVQLEGIKVKNKVLSLPKSVFVPDHTGAGQTMVDSGTQF 301
Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCYNLSGFVSVR-----VPTVS 429
T L P Y A ++ F++QT + + F D CY L S R +P VS
Sbjct: 302 TFLLGPVYTALKNEFLSQTRGILKVLNDDNFVFQGAMDLCYLLD---SSRPNLQNLPVVS 358
Query: 430 FYFSGGPVLTLPASNFL--IPVDDAG---TFCFAFAPSP-SGLS--IIGNIQQEGIQISF 481
F G +++ L +P + G +CF F S G+ +IG+ Q+ + + F
Sbjct: 359 LMFQGAE-MSVSGERLLYRVPGEVRGRDSVWCFTFGNSDLLGVEAFVIGHHHQQNVWMEF 417
Query: 482 DGANGFVGFGPNVC 495
D +G C
Sbjct: 418 DLEKSRIGLADVRC 431
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 118 bits (295), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 105/384 (27%), Positives = 168/384 (43%), Gaps = 54/384 (14%)
Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPAD 202
G+ +G Y+ I +G+PP+ ++ +D+GSDI+WV C C++C ++SD ++DP
Sbjct: 75 GLPTDTGLYYTEIEIGTPPKQYHVQVDTGSDILWVNCISCNKCPRKSDLGIDLRLYDPKG 134
Query: 203 SASFSGVSCSSAVCDRLENAGCHAGR---------CRYEVSYGDGSYTKGTLALETLTIG 253
S+S S VSC C A + G+ C Y V YGDGS T G ++L
Sbjct: 135 SSSGSTVSCDQKFC-----AATYGGKLPGCAKNIPCEYSVMYGDGSSTTGYFVSDSLQYN 189
Query: 254 --------RTVVKNVAIGCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQL--GGQTGG 298
R +V GCG + QG +G+ G++G G + S++ QL G+
Sbjct: 190 QVSGDGQTRHANASVIFGCGAQ-QGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKK 248
Query: 299 AFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPIS 358
FS+CL + G G G P + PLV P P Y V L + VGG + +
Sbjct: 249 IFSHCLDTIKGG--GIFAIGDVVQPKVKS-TPLV--PDMPH-YNVNLESINVGGTTLQLP 302
Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD-TCYNL 417
+F + G ++D+GT +T LP Y +D A P + S+ D C
Sbjct: 303 SHMFETGE--KKGTIIDSGTTLTYLPELVY---KDVLAAVFAKHPDTTFHSVQDFLCIQY 357
Query: 418 SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF------APSPSGLSIIGN 471
V P ++F+F L + ++ D +CF F + + ++G+
Sbjct: 358 FQSVDDGFPKITFHFEDDLGLNVYPHDYFFQNGD-NLYCFGFQNGGLQSKDGKDMVLLGD 416
Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
+ + +D N VG+ C
Sbjct: 417 LVLSNKVVVYDLENQVVGWTDYNC 440
>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 469
Score = 118 bits (295), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 113/394 (28%), Positives = 162/394 (41%), Gaps = 66/394 (16%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQC-YKQSDPV----FDPADSAS 205
G Y V + G+P ++ V D+GS +VW C CS C + DP F P +S+S
Sbjct: 88 GGYSVSLSFGTPSQTIPFVFDTGSSLVWFPCTSRYLCSDCNFSGLDPTQIPRFIPKNSSS 147
Query: 206 FSGVSCSSAVCDRLENAGCHAGRCR------------YEVSYGDGSYTKGTLALETLTIG 253
+ C + C L A C Y + YG GS T G L E L
Sbjct: 148 SRVIGCQNPKCQFLFGANVQCRGCDPNTRNCTVPCPPYILQYGLGS-TAGILISEKLDFP 206
Query: 254 RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR------ 307
V + +GC + AG+ G G G SL Q+ + +FS+CLVSR
Sbjct: 207 DLTVPDFVVGCSVISTRT---PAGIAGFGRGPESLPSQMKLK---SFSHCLVSRRFDDTN 260
Query: 308 -------GTGS---SGSLVFGREALPVGAAWVPLVRNPRAPS-----FYYVGLSGLGVGG 352
TGS SGS G ++ P +NP + +YY+ L + VG
Sbjct: 261 VTTDLGLDTGSGHKSGSKT-------PGLSYTPFRKNPNVSNTAFLEYYYLNLRRIYVGS 313
Query: 353 MRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS---GVS 409
+ I G+ G ++D+G+ T + P +E + F Q N R VS
Sbjct: 314 KHVKIPYKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFATQMSNYTREKDLEKVS 373
Query: 410 IFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP----SPSG 465
C+N+SG V VP + F F GG + LP SN+ V +A T C +P G
Sbjct: 374 GIAPCFNISGKGDVTVPELIFEFKGGAKMELPLSNYFSFVGNADTVCLTVVSDNTVNPGG 433
Query: 466 LS----IIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ I+G+ QQ+ + +D N GF C
Sbjct: 434 GTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467
>gi|125524351|gb|EAY72465.1| hypothetical protein OsI_00321 [Oryza sativa Indica Group]
Length = 343
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 53/90 (58%), Positives = 66/90 (73%), Gaps = 2/90 (2%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
VVSG+ GSGEYF R+GVGSP R YMV+D+GSD+ WVQCQPC+ CY+QSDPVFDP+ S
Sbjct: 156 VVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLST 215
Query: 205 SFSGVSCSSAVCDRLENAGCH--AGRCRYE 232
S++ V+C + C L+ A C G C YE
Sbjct: 216 SYASVACDNPRCHDLDAAACRNSTGACLYE 245
>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
Length = 507
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 96/328 (29%), Positives = 150/328 (45%), Gaps = 37/328 (11%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFS 207
+G YF +IG+G+P + Y+ +D+GSDI+WV C C +C +SD ++D S +
Sbjct: 75 AGLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSD 134
Query: 208 GVSCSSAVCDRLEN--AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGR------TVVK 258
V C C + GC G +C Y V YGDGS T G + + R T
Sbjct: 135 AVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPT 194
Query: 259 N--VAIGCGHKNQGMFVGAA----GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTG 310
N V GCG+K G ++ G+LG G + S++ QL G+ FS+CL + G
Sbjct: 195 NGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGG 254
Query: 311 SSGSLVFGREALP------VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL 364
G G P + + + ++ RA Y V + + VGG + + D F
Sbjct: 255 --GIFAIGEVVEPKVRFLLMNSVMIVVLFLSRA--HYNVVMKEIEVGGDPLDVPSDAF-- 308
Query: 365 TQMGD-DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
+ GD G ++D+GT + P Y + ++Q +L R V TC++ +G V
Sbjct: 309 -ESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDL-RLHTVEQAFTCFDYTGNVDD 366
Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDD 451
PTV+ +F LT+ +L V +
Sbjct: 367 GFPTVTLHFDKSISLTVYPHEYLFQVKE 394
>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
thaliana]
gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 491
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 103/396 (26%), Positives = 162/396 (40%), Gaps = 66/396 (16%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ----PCSQCYK------QSDPVFDPADSAS 205
Y + + +G+PP++ + +D+GSD+ WV C C +CY +S VF P S++
Sbjct: 83 YLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHSST 142
Query: 206 FSGVSCSSAVC----------DRLENAGCHAGRC----------RYEVSYGDGSYTKGTL 245
SC+S+ C D AGC + +YG+G G L
Sbjct: 143 SFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGIL 202
Query: 246 ALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
+ L V + GC + G+ G G G +SL QLG G FS+C +
Sbjct: 203 TRDILKARTRDVPRFSFGCV---TSTYREPIGIAGFGRGLLSLPSQLGFLEKG-FSHCFL 258
Query: 306 S----RGTGSSGSLVFGREALPVGAA----WVPLVRNPRAPSFYYVGLSGLGVGGMRIP- 356
S L+ G AL + + P++ P P+ YY+GL + +G P
Sbjct: 259 PFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPMYPNSYYIGLESITIGTNITPT 318
Query: 357 -ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI---FD 412
+ L + G+ G+++D+GT T LP P Y + T PRA+ FD
Sbjct: 319 QVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTT-LQSTITYPRATETESRTGFD 377
Query: 413 TCY-------NLSGF---VSVRVPTVSFYFSGGPVLTLPASNFLI----PVDDAGTFCFA 458
CY NL+ V + P+++F+F L LP N P D + C
Sbjct: 378 LCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLLPQGNSFYAMSAPSDGSVVQCLL 437
Query: 459 FAPSPSG----LSIIGNIQQEGIQISFDGANGFVGF 490
F G + G+ QQ+ +++ +D +GF
Sbjct: 438 FQNMEDGDYGPAGVFGSFQQQNVKVVYDLEKERIGF 473
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 98/370 (26%), Positives = 157/370 (42%), Gaps = 41/370 (11%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC----------YKQSDPVFDPAD 202
+G Y R+ +G+P + +++DSGS + +V C C QC + DP F P
Sbjct: 89 NGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDL 148
Query: 203 SASFSGVSCS-SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT---VVK 258
S+++S V C+ CD +C YE Y + S + G L + ++ G+ +
Sbjct: 149 SSTYSPVKCNVDCTCDN------ERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKPQ 202
Query: 259 NVAIGCGHKNQGMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGS 314
GC + G A G++GLG G +S++ QL G +FS C G G+
Sbjct: 203 RAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVG-GGT 261
Query: 315 LVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
+V G +P V NP +Y + L + V G + + +F G V+
Sbjct: 262 MVLG--GMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFN----SKHGTVL 315
Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG--VSIFDTCY-----NLSGFVSVRVPT 427
D+GT LP A+ AF+DA + +L + G + D C+ N+S V P
Sbjct: 316 DSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEV-FPD 374
Query: 428 VSFYFSGGPVLTLPASNFLIPVDDA-GTFCF-AFAPSPSGLSIIGNIQQEGIQISFDGAN 485
V F G L+L N+L G +C F +++G I +++D N
Sbjct: 375 VDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHN 434
Query: 486 GFVGFGPNVC 495
+GF C
Sbjct: 435 EKIGFWKTNC 444
>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 547
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 103/362 (28%), Positives = 156/362 (43%), Gaps = 40/362 (11%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP--------VFDPADSASFS 207
Y+ + VG+P + +D+GSD+ W+ C C C + ++ P +S++
Sbjct: 130 YYAEVTVGTPGVPYLVALDTGSDLFWLPCD-CVNCITGLNTTQGPVNFNIYSPNNSSTSK 188
Query: 208 GVSCSSAVCDRLENAGCHAGRCRYEVSY-GDGSYTKGTLALETLTI------GRTVVKNV 260
V CSS++C L+ + C Y+VSY D + + G L + L + + V +
Sbjct: 189 EVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVNARI 248
Query: 261 AIGCGHKNQGMFVGAA---GLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSL 315
+GCG G F+ +A GL GLG ++S+ L G +FS C G G +
Sbjct: 249 TLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCF---GPARMGRI 305
Query: 316 VFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMD 375
FG + P G P R P+ Y V ++ +GVGG DL D V+ D
Sbjct: 306 EFGDKGSP-GQNETPFNLGRRHPT-YNVSITQIGVGGHI----SDL-------DVAVIFD 352
Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLS-GFVSVRVPTVSFYFS 433
+GT+ T L PAY F D F + I F+ CY LS + P ++
Sbjct: 353 SGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLTMK 412
Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
GG + LI + FC A A S S ++IIG G I FD +G+ +
Sbjct: 413 GGGHFVINHPIVLISTESKRLFCLAIARSDS-INIIGQNFMTGYHIVFDREKMVLGWKES 471
Query: 494 VC 495
C
Sbjct: 472 NC 473
>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
from this gene [Arabidopsis thaliana]
Length = 388
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 112/379 (29%), Positives = 162/379 (42%), Gaps = 63/379 (16%)
Query: 100 HYHRHQHSFHARMQRDVKRVATLVR--RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYF 157
Y R Q S A + D +R T++ L GG +G G Y+
Sbjct: 38 RYPRLQGSLTALKEHDDRRQLTILAGIDLPLGG----------------TGRPDIPGLYY 81
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCS 212
+IG+G+P +S Y+ +D+GSDI+WV C C QC ++S +++ +S S VSC
Sbjct: 82 AKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCD 141
Query: 213 SAVCDRLEN---AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIG--------RTVVKNV 260
C ++ +GC A C Y YGDGS T G + + +T +V
Sbjct: 142 DDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSV 201
Query: 261 AIGCGHKNQGMFVGA-----AGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSG 313
GCG + G + G+LG G + S++ QL G+ F++CL R G G
Sbjct: 202 IFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGG--G 259
Query: 314 SLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD-DGV 372
GR P PLV P P Y V ++ + VG + I DLF Q GD G
Sbjct: 260 IFAIGRVVQP-KVNMTPLV--PNQPH-YNVNMTAVQVGQEFLTIPADLF---QPGDRKGA 312
Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD---TCYNLSGFVSVRVPTVS 429
++D+GT + LP YE A V I D C+ SG V P V+
Sbjct: 313 IIDSGTTLAYLPEIIYEPLVKK--------EPALKVHIVDKDYKCFQYSGRVDEGFPNVT 364
Query: 430 FYFSGGPVLTLPASNFLIP 448
F+F L + ++L P
Sbjct: 365 FHFENSVFLRVYPHDYLFP 383
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 102/375 (27%), Positives = 163/375 (43%), Gaps = 51/375 (13%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCD 217
V + G+P ++ MV+D+GS++ W+ C+ + +F+P S +++ + CSS C+
Sbjct: 69 VSLTAGTPLQNITMVLDTGSELSWLHCKK----EPNFNSIFNPLASKTYTKIPCSSPTCE 124
Query: 218 RLEN-----AGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK---- 267
C + C + +SY D S +G LA ET +G GC
Sbjct: 125 TRTRDLPLPVSCDPAKLCHFIISYADASSVEGNLAFETFRVGSVTGPATVFGCMDSGFSS 184
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA 327
N GL+G+ GS+S V Q+G + FSYC+ R SSG L+ G + +
Sbjct: 185 NSEEDAKTTGLMGMNRGSLSFVNQMGFR---KFSYCISDRD--SSGVLLLGEASF----S 235
Query: 328 WV-PLVRNPRA------PSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
W+ PL P P F Y V L G+ V + + + +F G ++D+
Sbjct: 236 WLKPLNYTPLVEMSTPLPYFDRVAYSVQLEGIRVSDKVLSLPKSVFVPDHTGAGQTMVDS 295
Query: 377 GTAVTRLPTPAYEAFRDAFVAQTG------NLPRASGVSIFDTCYNLSGFVSV--RVPTV 428
GT T L P Y A + F+ QT N PR D CY + + +P V
Sbjct: 296 GTQFTFLLGPVYSALKQEFLLQTKGVLRVLNEPRYVFQGAMDLCYLIEPTRAALPNLPVV 355
Query: 429 SFYFSGGPVLTLPASNFL--IPVDDAG---TFCFAFAPSPS-GLS--IIGNIQQEGIQIS 480
+ F G +++ L +P + G +CF F S S G+ +IG+ QQ+ + +
Sbjct: 356 NLMFRGAE-MSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDSLGIESFVIGHHQQQNVWME 414
Query: 481 FDGANGFVGFGPNVC 495
+D +GF C
Sbjct: 415 YDLEKSRIGFAEVRC 429
>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
Length = 372
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 100/356 (28%), Positives = 159/356 (44%), Gaps = 33/356 (9%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVS 210
YF +IG+G+P + Y+ +D+GSDI+WV C C +C +SD ++DPA S S + VS
Sbjct: 27 YFAKIGLGNPSKDYYVQVDTGSDILWVNCIGCDKCPTKSDLGIKLTLYDPASSVSATRVS 86
Query: 211 CSSAVCDRLENA---GCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH 266
C C N C C+Y V YGDGS T G + + R V N+ G +
Sbjct: 87 CDDDFCTSTYNGLLPDCKKELPCQYNVVYGDGSSTAGYFVSDAVQFER-VTGNLQTGLSN 145
Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA 326
GA GLG +L G L GAF++CL + G G G P
Sbjct: 146 GTVTFGCGAQQSGGLGTSGEALDGIL-----GAFAHCLDNVNGG--GIFAIGELVSP-KV 197
Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD-DGVVMDTGTAVTRLPT 385
P+V N + Y V + + VGG + + D+F GD G ++D+GT + LP
Sbjct: 198 NTTPMVPN---QAHYNVYMKEIEVGGTVLELPTDVF---DSGDRRGTIIDSGTTLAYLPE 251
Query: 386 PAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNF 445
Y++ + +Q L + F C+ SG V P + F+F LT+ ++
Sbjct: 252 VVYDSMMNEIRSQQPGLSLHTVEEQF-ICFKYSGNVDDGFPDIKFHFKDSLTLTVYPHDY 310
Query: 446 LIPVDDAGTFCFAF------APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
L + + +CF + + ++++G++ + +D N +G+ C
Sbjct: 311 LFQISE-DIWCFGWQNGGMQSKDGRDMTLLGDLVLSNKLVLYDIENQAIGWTEYNC 365
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 116/420 (27%), Positives = 179/420 (42%), Gaps = 55/420 (13%)
Query: 119 VATLVRRLSGGGADAAKHEVQDFG--------TDVV---SGMDQGSGEYFVRIGVGSPPR 167
V +VR+ G + A + D G D+ +G +G Y+ +IG+G P
Sbjct: 29 VFPVVRKFKGPAENLAAIKAHDAGRRGRFLSVVDLALGGNGRPTSTGLYYTKIGLG--PN 86
Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCDRLEN- 221
Y+ +D+GSD +WV C C+ C K+S ++DP S + V C C +
Sbjct: 87 DYYVQVDTGSDTLWVNCVGCTTCPKKSGLGMELTLYDPNSSKTSKVVPCDDEFCTSTYDG 146
Query: 222 --AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIG------RTVVKNVAI--GCGHKNQG 270
+GC C Y ++YGDGS T G+ + LT RTV N ++ GCG K G
Sbjct: 147 PISGCKKDMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTSVIFGCGSKQSG 206
Query: 271 MFVGAA-----GLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALP 323
G++G G + S++ QL G+ FS+CL + G G G P
Sbjct: 207 TLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVFSHCLDTVNGG--GIFAIGEVVQP 264
Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
PLV PR + Y V L + V G I + D+F T G ++D+GT + L
Sbjct: 265 -KVKTTPLV--PRM-AHYNVVLKDIEVAGDPIQLPTDIFDSTS--GRGTIIDSGTTLAYL 318
Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR--VPTVSFYFSGGPVLTLP 441
P Y+ + +AQ + F TC++ S S+ PTV F F G LT
Sbjct: 319 PVSIYDQLLEKTLAQRSGMELYLVEDQF-TCFHYSDEKSLDDAFPTVKFTFEEGLTLTAY 377
Query: 442 ASNFLIPVDDAGTFCFAFAPSPS------GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
++L P + +C + S + L ++G++ +D N +G+ C
Sbjct: 378 PHDYLFPFKE-DMWCIGWQKSTAQTKDGKDLILLGDLVLTNKLFIYDLDNMSIGWTDYNC 436
>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 551
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 101/446 (22%), Positives = 182/446 (40%), Gaps = 51/446 (11%)
Query: 84 VHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKH-EVQ--- 139
+ ++ N T + + + + R R+ + +R+ GG K +V+
Sbjct: 110 MEEEEAQRERNETKSFLFQLYPKAHQGRGLREFGDIKLAAKRVDDGGRKVTKKLDVKGAA 169
Query: 140 DFGTD-----VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQ 193
GT+ + G G+Y+ I VG+PPR ++ +D+GSD+ W+QC PC+ C K
Sbjct: 170 SAGTNSTVLLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKG 229
Query: 194 SDPVFDPADSASFSGVSCSSAVCDRL---ENAGCHAGRCRYEVSYGDGSYTKGTLALETL 250
P++ PA V ++C L +N +C YE+ Y D S + G LA + +
Sbjct: 230 PHPLYKPAKEKI---VPPRDSLCQELQGDQNYCETCKQCDYEIEYADRSSSMGVLAKDDM 286
Query: 251 TI-----GRTVVKNVAIGCGHKNQGMFVGAA----GLLGLGGGSMSLVGQLG--GQTGGA 299
+ GR + + GC + QG + + G+LGL ++SL QL G
Sbjct: 287 HLIATNGGREKL-DFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNV 345
Query: 300 FSYCLVSRGTGSSGSLVFGREALP-VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPIS 358
F +C ++R T G + G + +P G W P+ P + Y+ + G +
Sbjct: 346 FGHC-ITRETNGGGYMFLGDDYVPRWGMTWAPIRGGPD--NLYHTEAQKVNYGDQELHAG 402
Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLS 418
+ V+ D+G++ T LP Y+ DA + + + S + C+
Sbjct: 403 NSV---------QVIFDSGSSYTYLPEEMYKNLIDAIKEDSPSFVQDSSDTTLPLCWKAD 453
Query: 419 GFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD-----DAGTFCFAFAPSPS----GLSII 469
V ++ +F G +P + ++P D D G C I+
Sbjct: 454 FSVRSFFKPLNLHF-GRRWFVVPKTFTIVPDDYLIISDKGNVCLGLLNGTEINHGSTIIV 512
Query: 470 GNIQQEGIQISFDGANGFVGFGPNVC 495
G++ G + +D +G+ + C
Sbjct: 513 GDVSLRGKLVVYDNERRQIGWANSEC 538
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 98/370 (26%), Positives = 157/370 (42%), Gaps = 41/370 (11%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC----------YKQSDPVFDPAD 202
+G Y R+ +G+P + +++DSGS + +V C C QC + DP F P
Sbjct: 88 NGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDL 147
Query: 203 SASFSGVSCS-SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT---VVK 258
S+++S V C+ CD +C YE Y + S + G L + ++ G+ +
Sbjct: 148 SSTYSPVKCNVDCTCDN------ERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKPQ 201
Query: 259 NVAIGCGHKNQGMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGS 314
GC + G A G++GLG G +S++ QL G +FS C G G+
Sbjct: 202 RAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVG-GGT 260
Query: 315 LVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
+V G +P V NP +Y + L + V G + + +F G V+
Sbjct: 261 MVLG--GMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFN----SKHGTVL 314
Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG--VSIFDTCY-----NLSGFVSVRVPT 427
D+GT LP A+ AF+DA + +L + G + D C+ N+S V P
Sbjct: 315 DSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEV-FPD 373
Query: 428 VSFYFSGGPVLTLPASNFLIPVDDA-GTFCF-AFAPSPSGLSIIGNIQQEGIQISFDGAN 485
V F G L+L N+L G +C F +++G I +++D N
Sbjct: 374 VDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHN 433
Query: 486 GFVGFGPNVC 495
+GF C
Sbjct: 434 EKIGFWKTNC 443
>gi|125572774|gb|EAZ14289.1| hypothetical protein OsJ_04213 [Oryza sativa Japonica Group]
Length = 492
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 100/329 (30%), Positives = 144/329 (43%), Gaps = 23/329 (6%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
+G Y + VG+PP+ V+D SD VW+QC C+ C AD+ + +
Sbjct: 94 TGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCG---------ADAPAATSAPPF 144
Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGS--YTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
A + C Y YG G+ T G LA++ V GC +G
Sbjct: 145 YAFLSFHDTRAPTTPPCGYSYVYGGGAANTTAGLLAVDAFAFATVRADGVIFGCAVATEG 204
Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLV-FGREALPVGAAWV 329
G++GLG G +S V QL G FSY L GS + F +A P + V
Sbjct: 205 DI---GGVIGLGRGELSPVSQL---QIGRFSYYLAPDDAVDVGSFILFLDDAKPRTSRAV 258
Query: 330 --PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPA 387
PLV + + S YYV L+G+ V G + I F L G GVV+ VT L A
Sbjct: 259 STPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSITIPVTFLDAGA 318
Query: 388 YEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
Y+ R A ++ L A G + D CY + +VP+++ F+GG V+ L N+
Sbjct: 319 YKVVRQAMASKI-ELRAADGSELGLDLCYTSESLATAKVPSMALVFAGGAVMELEMGNYF 377
Query: 447 IPVDDAGTFCFAFAPSPSGL-SIIGNIQQ 474
G C PSP+G S++G++ Q
Sbjct: 378 YMDSTTGLECLTILPSPAGDGSLLGSLIQ 406
>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
Length = 469
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 113/394 (28%), Positives = 161/394 (40%), Gaps = 66/394 (16%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQC-YKQSDPV----FDPADSAS 205
G Y V + G+P ++ V D+GS +V + C CS C + DP F P +S+S
Sbjct: 88 GGYSVSLSFGTPSQTIPFVFDTGSSLVCLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSS 147
Query: 206 FSGVSCSSAVCDRL------------ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG 253
+ C S C L C G Y + YG GS T G L E L
Sbjct: 148 SKIIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGS-TAGVLITEKLDFP 206
Query: 254 RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR------ 307
V + +GC + AG+ G G G +SL Q+ + FS+CLVSR
Sbjct: 207 DLTVPDFVVGCSIISTRQ---PAGIAGFGRGPVSLPSQMNLK---RFSHCLVSRRFDDTN 260
Query: 308 -------GTGS---SGSLVFGREALPVGAAWVPLVRNPRAPS-----FYYVGLSGLGVGG 352
TGS SGS G + P +NP + +YY+ L + VG
Sbjct: 261 VTTDLDLDTGSGHNSGSKT-------PGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGR 313
Query: 353 MRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-- 410
+ I GD G ++D+G+ T + P +E + F +Q N R +
Sbjct: 314 KHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKET 373
Query: 411 -FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP----SPSG 465
C+N+SG V VP + F F GG L LP SN+ V + T C +PSG
Sbjct: 374 GLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPSG 433
Query: 466 LS----IIGNIQQEGIQISFDGANGFVGFGPNVC 495
+ I+G+ QQ+ + +D N GF C
Sbjct: 434 GTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 85/269 (31%), Positives = 126/269 (46%), Gaps = 32/269 (11%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
G YF R+ +GSPP+ ++ ID+GSDI+WV C PC+ C S F+P S++ S
Sbjct: 89 GLYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSK 148
Query: 209 VSCSSAVCD---RLENAGCHAGR---CRYEVSYGDGSYTKGTLALETL----TIGRTVVK 258
+ CS C + A C C Y +YGDGS T G +T+ +G
Sbjct: 149 IPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTA 208
Query: 259 N----VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRG 308
N + GC + G G+ G G +S+V QL G + FS+CL +G
Sbjct: 209 NSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL--KG 266
Query: 309 TGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
+ + G ++ E + G + PLV P P Y + L + V G ++PI LF T
Sbjct: 267 SDNGGGILVLGEIVEPGLVYTPLV--PSQP-HYNLNLESIVVNGQKLPIDSSLF--TTSN 321
Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVA 397
G ++D+GT + L AY+ F +A A
Sbjct: 322 TQGTIVDSGTTLAYLADGAYDPFVNAITA 350
>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 445
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 99/363 (27%), Positives = 163/363 (44%), Gaps = 45/363 (12%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV--FDPADSASFSGVSCSSAV 215
V + +G+PP+ Q MV+D+GS + W+ QC+ ++ P FDP+ S+SF + C+ +
Sbjct: 90 VTLPIGTPPQPQQMVLDTGSQLSWI------QCHNKTPPTASFDPSLSSSFYVLPCTHPL 143
Query: 216 C-----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKN 268
C D C R C Y Y DG+Y +G L E L + + +GC ++
Sbjct: 144 CKPRVPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLAFSPSQTTPPLILGCSSES 203
Query: 269 QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS-----SGSLVFGREALP 323
+ A G+LG+ G +S Q FSYC+ +R + +GS G
Sbjct: 204 R----DARGILGMNLGRLSFPFQ---AKVTKFSYCVPTRQPANNNNFPTGSFYLGNNPNS 256
Query: 324 VGAAWVPLVRNPRA-------PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
+V ++ P++ P Y V + G+ +GG ++ I +FR G ++D+
Sbjct: 257 ARFRYVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGGRKLNIPPSVFRPNAGGSGQTMVDS 316
Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF----DTCYNLSGFVSVRVP-TVSFY 431
G+ T L AY+ R+ + G PR ++ D C++ + R+ V+F
Sbjct: 317 GSEFTFLVDVAYDRVREEIIRVLG--PRVKKGYVYGGVADMCFDGNAMEIGRLLGDVAFE 374
Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP---SGLSIIGNIQQEGIQISFDGANGFV 488
F G + +P L V G C S + +IIGN Q+ + + FD AN +
Sbjct: 375 FEKGVEIVVPKERVLADV-GGGVHCVGIGRSERLGAASNIIGNFHQQNLWVEFDLANRRI 433
Query: 489 GFG 491
GFG
Sbjct: 434 GFG 436
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 109/373 (29%), Positives = 165/373 (44%), Gaps = 45/373 (12%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
V + VG+PP+S MV+D+GS++ W+ C+ + + VF+P S+S++ + C S +C
Sbjct: 72 VSLTVGTPPQSVTMVLDTGSELSWLHCKK----QQNINSVFNPHLSSSYTPIPCMSPICK 127
Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK---- 267
D L C + C VSY D + +G LA +T I + + G
Sbjct: 128 TRTRDFLIPVSCDSNNLCHVTVSYADFTSLEGNLASDTFAISGSGQPGIIFGSMDSGFSS 187
Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV--G 325
N GL+G+ GS+S V Q+G FSYC+ G +SG L+FG
Sbjct: 188 NANEDSKTTGLMGMNRGSLSFVTQMGFP---KFSYCI--SGKDASGVLLFGDATFKWLGP 242
Query: 326 AAWVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
+ PLV+ N P F Y V L G+ VG + + +++F G ++D+GT
Sbjct: 243 LKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTGAGQTMVDSGTRF 302
Query: 381 TRLPTPAYEAFRDAFVAQTGNL------PRASGVSIFDTCYNL-SGFVSVRVPTVSFYFS 433
T L Y A R+ FVAQT + P D C+ + G V VP V+ F
Sbjct: 303 TFLLGSVYTALRNEFVAQTRGVLTLLEDPNFVFEGAMDLCFRVRRGGVVPAVPAVTMVFE 362
Query: 434 GGPVLTLPASNFLIPVDDAG--------TFCFAFAPSP-SGLS--IIGNIQQEGIQISFD 482
G +++ L V G +C F S G+ +IG+ Q+ + + FD
Sbjct: 363 GAE-MSVSGERLLYRVGGDGDVAKGNGDVYCLTFGNSDLLGIEAYVIGHHHQQNVWMEFD 421
Query: 483 GANGFVGFGPNVC 495
N VGF C
Sbjct: 422 LVNSRVGFADTKC 434
>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
Length = 418
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 103/374 (27%), Positives = 168/374 (44%), Gaps = 46/374 (12%)
Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADS 203
++SG +G Y+V + +G P + ++ +D+GSD+ W+QC PC C K P++ P +
Sbjct: 46 LLSGDVYPTGHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPTKN 105
Query: 204 ASFSGVSCSSAVCDRLE-----NAGCHA-GRCRYEVSYGDGSYTKGTLALETLTI----G 253
V C++++C L N C +C Y++ Y D + + G L ++ ++
Sbjct: 106 KL---VPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTDSFSLPLRNK 162
Query: 254 RTVVKNVAIGCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVS 306
V +++ GCG+ Q GAA GLLGLG GS+SL+ QL Q T +CL +
Sbjct: 163 SNVRPSLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLST 222
Query: 307 RGTGSSGSLVFGREALPVG-AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
G G L FG + +P WVP+VR+ ++Y G + L R +S
Sbjct: 223 SG---GGFLFFGDDMVPTSRVTWVPMVRSTSG-NYYSPGSATLYFD--RRSLSTKPME-- 274
Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFVSVR 424
VV D+G+ T Y+A A + +L + S S+ F SV
Sbjct: 275 ------VVFDSGSTYTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPLCWKGQKAFKSVS 328
Query: 425 -----VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF---APSPSGLSIIGNIQQEG 476
++ F F V+ +P N+LI V G C + + SIIG+I +
Sbjct: 329 DVKKDFKSLQFIFGKNAVMEIPPENYLI-VTKNGNVCLGILDGSAAKLSFSIIGDITMQD 387
Query: 477 IQISFDGANGFVGF 490
+ +D +G+
Sbjct: 388 QMVIYDNEKAQLGW 401
>gi|357444933|ref|XP_003592744.1| hypothetical protein MTR_1g115080, partial [Medicago truncatula]
gi|355481792|gb|AES62995.1| hypothetical protein MTR_1g115080, partial [Medicago truncatula]
Length = 65
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 54/65 (83%), Positives = 59/65 (90%)
Query: 431 YFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
YF GGP+LTLPA NFLIPVD GTFCFAFAPS SGLSIIGNIQQEGI+IS DGANG++GF
Sbjct: 1 YFLGGPILTLPARNFLIPVDSVGTFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGYIGF 60
Query: 491 GPNVC 495
GPN+C
Sbjct: 61 GPNIC 65
>gi|194707292|gb|ACF87730.1| unknown [Zea mays]
Length = 216
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 75/215 (34%), Positives = 103/215 (47%), Gaps = 5/215 (2%)
Query: 286 MSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVG 344
MSL+ Q G + G FSYCL S R SGSL G P + PL+ NP PS YYV
Sbjct: 1 MSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVN 60
Query: 345 LSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR 404
++GL VG + + F G V+D+GT +TR P Y A R+ F Q
Sbjct: 61 VTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSG 120
Query: 405 ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS 464
+ + FDTC+N + P V+ + GG LTLP N LI C A A +P
Sbjct: 121 YTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQ 180
Query: 465 ----GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
++++ N+QQ+ +++ D A VGF C
Sbjct: 181 NVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 215
>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like [Cucumis sativus]
Length = 524
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 103/362 (28%), Positives = 156/362 (43%), Gaps = 40/362 (11%)
Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP--------VFDPADSASFS 207
Y+ + VG+P + +D+GSD+ W+ C C C + ++ P +S++
Sbjct: 107 YYAEVTVGTPGVPYLVALDTGSDLFWLPCD-CVNCITGLNTTQGPVNFNIYSPNNSSTSK 165
Query: 208 GVSCSSAVCDRLENAGCHAGRCRYEVSY-GDGSYTKGTLALETLTI------GRTVVKNV 260
V CSS++C L+ + C Y+VSY D + + G L + L + + V +
Sbjct: 166 EVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVNARI 225
Query: 261 AIGCGHKNQGMFVGAA---GLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSL 315
+GCG G F+ +A GL GLG ++S+ L G +FS C G G +
Sbjct: 226 TLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCF---GPARMGRI 282
Query: 316 VFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMD 375
FG + P G P R P+ Y V ++ +GVGG DL D V+ D
Sbjct: 283 EFGDKGSP-GQNETPFNLGRRHPT-YNVSITQIGVGGHI----SDL-------DVAVIFD 329
Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLS-GFVSVRVPTVSFYFS 433
+GT+ T L PAY F D F + I F+ CY LS + P ++
Sbjct: 330 SGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLTMK 389
Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
GG + LI + FC A A S S ++IIG G I FD +G+ +
Sbjct: 390 GGGHFVINHPIVLISTESKRLFCLAIARSDS-INIIGQNFMTGYHIVFDREKMVLGWKES 448
Query: 494 VC 495
C
Sbjct: 449 NC 450
>gi|222631382|gb|EEE63514.1| hypothetical protein OsJ_18330 [Oryza sativa Japonica Group]
Length = 464
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 116/392 (29%), Positives = 165/392 (42%), Gaps = 75/392 (19%)
Query: 172 VIDSGSDIVWVQCQPC----------SQCYKQSDPVFDPADSASFSGVSCSS---AVCD- 217
V+D+GSD+VW QC C C+ Q+ P ++ + S + V C A+C
Sbjct: 77 VVDTGSDLVWTQCSTCRLPAVAAAGGGGCFPQNLPYYNFSLSRTARAVPCDDDDGALCGV 136
Query: 218 RLENAGCHAG------RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQ-- 269
E AGC G C SYG G G L + T + +A GC + +
Sbjct: 137 APETAGCARGGGSGDDACVVAASYGAG-VALGVLGTDAFTFPSSSSVTLAFGCVSQTRIS 195
Query: 270 -GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--RGTGSSGSLVFG-------- 318
G GA+G++GLG G++SLV QL FSYCL R T S L G
Sbjct: 196 PGALNGASGIIGLGRGALSLVSQLNATE---FSYCLTPYFRDTVSPSHLFVGDGELAGLR 252
Query: 319 -------REALPVGAAWVPLVRNPR-AP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
PV VP +NP+ +P +FYY+ L GL G + + F L +
Sbjct: 253 AAAGGGGGGGAPVTT--VPFAKNPKDSPFSTFYYLPLVGLAAGNATVALPAGAFDLREAA 310
Query: 369 DD----GVVMDTGTAVTRLPTPAYEAFRDAFVAQ---TGNL--PRASGVSIFDTCYNL-- 417
G ++D+G+ TRL PA+ A Q +G+L P A + C
Sbjct: 311 PKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKLGGALELCVEAGD 370
Query: 418 --SGFVSVRVPTVSFYFS----GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG------ 465
+ VP + F GG L +PA + V +A T+C A S SG
Sbjct: 371 DGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARV-EASTWCMAVVSSASGNATLPT 429
Query: 466 --LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+IIGN Q+ +++ +D ANG + F P C
Sbjct: 430 NETTIIGNFMQQDMRVLYDLANGLLSFQPANC 461
>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 414
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 110/379 (29%), Positives = 162/379 (42%), Gaps = 50/379 (13%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSCS 212
G Y++ + +G+P + Y+ +D+GSD+ W+QC PC C ++DP + V C
Sbjct: 29 GLYYMAMRIGNPAKLYYLDMDTGSDLTWLQCDAPCRSCAVGPHGLYDPKRARV---VDCR 85
Query: 213 SAVCDRLENAG---CHAG--RCRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVAIG 263
C +++ G C +C YEV Y DGS T G L +T+T+ G IG
Sbjct: 86 RPTCAQVQRGGQFTCSGDVRQCDYEVDYVDGSSTMGILVEDTITLVLTNGTRFQTRAVIG 145
Query: 264 CGHKNQGMFVGAA----GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVF 317
CG+ QG A G++GL +SL QL G +CL G+ G L F
Sbjct: 146 CGYDQQGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIANNVIGHCLAG-GSNGGGYLFF 204
Query: 318 GREALP-VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD--GVVM 374
G +P +G W P++ P Y L + GG ++ L DD G +
Sbjct: 205 GDTLVPALGMTWTPMIGRPLVEG-YQARLRSIKYGG-------EVLELEGTTDDVGGAMF 256
Query: 375 DTGTAVTRLPTPAYEAFRDAFV--AQTGNLPRASGVSIFDTCYN-LSGFVSV-------R 424
D+GT+ T L AY A A V AQ L R + C+ S F SV +
Sbjct: 257 DSGTSFTYLVPNAYTAVLSAVVRQAQRSGLERIKTDTTLPFCWRGPSPFESVADVSAYFK 316
Query: 425 VPTVSF----YFSGGPVLTLPASNFLIPVDDAGTFCF----AFAPSPSGLSIIGNIQQEG 476
T+ F ++S G +L L +LI V G C A S +I+G+I G
Sbjct: 317 TVTLDFGGSTWWSSGKLLELSPEGYLI-VSTQGNVCLGVLDASVASLEVTNILGDISMRG 375
Query: 477 IQISFDGANGFVGFGPNVC 495
+ +D +G+ C
Sbjct: 376 YLVVYDNMREQIGWVRRNC 394
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 108/367 (29%), Positives = 164/367 (44%), Gaps = 33/367 (8%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
G Y+ ++ +G+PPR + ID+GSD++WV C C+ C K S+ FDP S+S S
Sbjct: 82 GLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASL 141
Query: 209 VSCSSAVC--DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAI--- 262
VSCS C + +GC C Y YGDGS T G + ++ + +AI
Sbjct: 142 VSCSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAINSS 201
Query: 263 -----GCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGS 311
GC + G G+ GLG GS+S++ QL G FS+CL +G
Sbjct: 202 APFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSG- 260
Query: 312 SGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
G +V G+ P + PLV P P Y V L + V G +PI +F + DG
Sbjct: 261 GGIMVLGQIKRP-DTVYTPLV--PSQPH-YNVNLQSIAVNGQILPIDPSVFTIAT--GDG 314
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
++DTGT + LP AY F A R + C+ ++ P VS
Sbjct: 315 TIIDTGTTLAYLPDEAYSPFIQAIANAVSQYGRPITYESYQ-CFEITAGDVDVFPEVSLS 373
Query: 432 FSGGPVLTLPASNFLIPVDDAGT--FCFAFAP-SPSGLSIIGNIQQEGIQISFDGANGFV 488
F+GG + L +L +G+ +C F S ++I+G++ + + +D +
Sbjct: 374 FAGGASMVLRPHAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQRI 433
Query: 489 GFGPNVC 495
G+ C
Sbjct: 434 GWAEYDC 440
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 102/382 (26%), Positives = 162/382 (42%), Gaps = 47/382 (12%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
SG G Y+ ++G+G+P + Y+ +D+GSDI+WV C C +C + S +++
Sbjct: 77 SGRPDTVGLYYAKVGIGTPSKDYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIK 136
Query: 202 DSASFSGVSCSSAVCDRLEN---AGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGR--- 254
DS S V C C + +GC A C Y YGDGS T G + + R
Sbjct: 137 DSVSGKLVPCDEEFCYEVNGGPLSGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSG 196
Query: 255 ---TVVKN--VAIGCGHKNQGMF-----VGAAGLLGLGGGSMSLVGQLGG--QTGGAFSY 302
T N V GCG + G G+LG G + S++ QL + F++
Sbjct: 197 DLQTTSSNGSVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAH 256
Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGG--MRIPISED 360
CL G G G P PL+ P P Y V ++ + VG + +P E
Sbjct: 257 CL--DGINGGGIFAIGHVVQP-KVNMTPLI--PNQPH-YNVNMTAVQVGEDFLHLPTEE- 309
Query: 361 LFRLTQMGD-DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSG 419
+ GD G ++D+GT + LP YE ++Q +L + V TC+ SG
Sbjct: 310 ----FEAGDRKGAIIDSGTTLAYLPEIVYEPLVSKIISQQPDL-KVHIVRDEYTCFQYSG 364
Query: 420 FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS------PSGLSIIGNIQ 473
V P V+F+F L + +L P + G +C + S ++++G++
Sbjct: 365 SVDDGFPNVTFHFENSVFLKVHPHEYLFPFE--GLWCIGWQNSGMQSRDRRNMTLLGDLV 422
Query: 474 QEGIQISFDGANGFVGFGPNVC 495
+ +D N +G+ C
Sbjct: 423 LSNKLVLYDLENQAIGWTEYNC 444
>gi|225440731|ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 469
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 112/389 (28%), Positives = 163/389 (41%), Gaps = 55/389 (14%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
G Y V + G+P ++ V+D+GS +VW C C + S P DPA +F SS
Sbjct: 88 GGYSVSLSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSS 147
Query: 214 AV-----------------------CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETL 250
A CD+ +A C Y + YG G+ T G L LE+L
Sbjct: 148 AKIVGCLNPKCGFVMDSEVRTRCPGCDQ-NSANCTKACPTYAIQYGLGT-TVGLLLLESL 205
Query: 251 TIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--- 307
+ +GC + +G+ G G G SL Q+G + FSYCL+S
Sbjct: 206 VFAERTEPDFVVGCSILSSRQ---PSGIAGFGRGPSSLPKQMGLK---KFSYCLLSHRFD 259
Query: 308 --GTGSSGSLVFG---REALPVGAAWVPLVRNPRA-----PSFYYVGLSGLGVGGMRIPI 357
S +L G ++ G ++ P +NP + +YYV L + VG R+ +
Sbjct: 260 DSPKSSKMTLYVGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKV 319
Query: 358 SEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV---SIFDTC 414
G+ G ++D+G+ T + P +EA F Q N RA+ V S C
Sbjct: 320 PYSFMVAGSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALSGLKPC 379
Query: 415 YNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-------SGLS 467
+NLSG SV +P++ F F GG + LP +N+ V D C + SG S
Sbjct: 380 FNLSGVGSVALPSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAVGSTLSSGPS 439
Query: 468 II-GNIQQEGIQISFDGANGFVGFGPNVC 495
II GN Q + +D N GF C
Sbjct: 440 IILGNYQSQNFYTEYDLENERFGFRRQRC 468
>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 413
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 107/373 (28%), Positives = 163/373 (43%), Gaps = 60/373 (16%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSC 211
+G Y+V + +G P + ++ ID+GSD+ W+QC PC C K P++ P + V C
Sbjct: 49 TGHYYVTMNIGDPAKPYFLDIDTGSDLTWLQCDAPCQSCNKVPHPLYKPTKNKL---VPC 105
Query: 212 SSAVCDRLE-----NAGCHA-GRCRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVA 261
++++C L N C +C Y++ Y D + + G L + T+ +V +
Sbjct: 106 AASICTTLHSAQSPNKKCAVPQQCDYQIKYTDSASSLGVLVTDNFTLPLRNSSSVRPSFT 165
Query: 262 IGCGH-----KNQGMFVGAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGS 314
GCG+ KN + GLLGLG GS+SLV QL G T +CL + G G
Sbjct: 166 FGCGYDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGHCLSTNG---GGF 222
Query: 315 LVFGREALPVG-AAWVPLVR-------NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
L FG +P A WVP+VR +P + + Y+ S LGV M
Sbjct: 223 LFFGDNVVPTSRATWVPMVRSTSGNYYSPGSGTLYFDRRS-LGVKPME------------ 269
Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVA-QTGNLPRASGVSIFDTCYNLSGFVSVR- 424
VV D+G+ T Y+A A A + +L + S S+ F SV
Sbjct: 270 -----VVFDSGSTYTYFAAQPYQATVSALKAGLSKSLQQVSDPSLPLCWKGQKVFKSVSD 324
Query: 425 ----VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF---APSPSGLSIIGNIQQEGI 477
++ F VL +P N+LI V G C + + +IIG+I +
Sbjct: 325 VKNDFKSLFLSFVKNSVLEIPPENYLI-VTKNGNACLGILDGSAAKLTFNIIGDITMQDQ 383
Query: 478 QISFDGANGFVGF 490
I +D G +G+
Sbjct: 384 LIIYDNERGQLGW 396
>gi|357118734|ref|XP_003561105.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 404
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 96/269 (35%), Positives = 124/269 (46%), Gaps = 26/269 (9%)
Query: 236 GDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGG 294
DG T T+A++T TI GC H +G F G +G + LGGG SL Q
Sbjct: 153 ADGDPTSQTMAIDT-TID-VPSSXXRFGCSHSVRGRFSGQTSGTMSLGGGRQSLRSQTAS 210
Query: 295 QTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAW----VPLVRNPRAPSFYYVGLSGLGV 350
G AFSYC+ +SG L G G+ PLV P+FY V L G+ V
Sbjct: 211 AYGDAFSYCVPQ--PSASGFLSLGGAIGSSGSGSGFASTPLVATAN-PTFYVVRLQGIDV 267
Query: 351 GGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR--ASGV 408
G R+ + +F G +MD+ VT+LP AY A R AF R A G
Sbjct: 268 AGRRLNVPPAVF------SAGTLMDSSAVVTQLPPTAYRALRRAFRNAMRRYRRVPAGGK 321
Query: 409 SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP--SGL 466
I DTCY+ G +V VP VS FSGG V+ L ++ C AF P+P S L
Sbjct: 322 QILDTCYDFEGLGNVTVPAVSLVFSGGAVVRLEPMAVMM------EGCLAFVPTPADSDL 375
Query: 467 SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
IGN+QQ+ ++ +D VGF C
Sbjct: 376 GFIGNVQQQTHEVLYDVGARNVGFRRGAC 404
>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
Length = 421
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 107/371 (28%), Positives = 165/371 (44%), Gaps = 55/371 (14%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSCS 212
G Y+V + +G+PPR ++ +D+GSD+ W+QC PC C K P++ P + V C
Sbjct: 56 GLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKL---VPCV 112
Query: 213 SAVCDRLEN--AGCH-----AGRCRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVA 261
+C L G H +C YE+ Y D + G L ++ + V +A
Sbjct: 113 DQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLA 172
Query: 262 IGCGHKNQGMFVGAA-------GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSS 312
GCG+ Q VG++ G+LGLG GS+SL+ QL G T +CL +RG
Sbjct: 173 FGCGYDQQ---VGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRG---G 226
Query: 313 GSLVFGREALPVG-AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
G L FG + +P A W P+ R+ + ++Y G + L GG + + R +
Sbjct: 227 GFLFFGDDIVPYSRATWAPMARS-TSRNYYSPGSANLYFGGRPLGV-----RPME----- 275
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFVSV-----RV 425
VV D+G++ T Y+A DA + NL S+ F SV
Sbjct: 276 VVFDSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEF 335
Query: 426 PTVSFYFSGGP--VLTLPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQI 479
TV FS G ++ +P N+LI V G C L+I+G+I + +
Sbjct: 336 KTVVLSFSNGKKALMEIPPENYLI-VTKYGNACLGILNGSEVGLKDLNIVGDITMQDQMV 394
Query: 480 SFDGANGFVGF 490
+D G +G+
Sbjct: 395 IYDNERGQIGW 405
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 498
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 103/380 (27%), Positives = 163/380 (42%), Gaps = 43/380 (11%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
SG G Y+ +IG+G+P + Y+ +D+GSDIVWV C C +C + S +D
Sbjct: 78 SGRPDAVGLYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLE 137
Query: 202 DSASFSGVSCSSAVCDRLEN---AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGR--- 254
+S + VSC C + +GC C Y YGDGS T G + + R
Sbjct: 138 ESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSG 197
Query: 255 ---TVVKN--VAIGCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQLGG--QTGGAFSY 302
T N + GCG + G + G+LG G + S++ QL + F++
Sbjct: 198 DLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAH 257
Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
CL GT G G P PLV P P Y V ++G+ VG + + IS D+F
Sbjct: 258 CL--DGTNGGGIFAMGHVVQP-KVNMTPLV--PNQPH-YNVNMTGVQVGHIILNISADVF 311
Query: 363 RLTQMGD-DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV 421
+ GD G ++D+GT + LP YE ++Q NL + + C+ S V
Sbjct: 312 ---EAGDRKGTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYK-CFQYSERV 367
Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS------PSGLSIIGNIQQE 475
P V F+F +L + +L ++ +C + S +++ G++
Sbjct: 368 DDGFPPVIFHFENSLLLKVYPHEYLFQYENL--WCIGWQNSGMQSRDRKNVTLFGDLVLS 425
Query: 476 GIQISFDGANGFVGFGPNVC 495
+ +D N +G+ C
Sbjct: 426 NKLVLYDLENQTIGWTEYNC 445
>gi|224074147|ref|XP_002304273.1| predicted protein [Populus trichocarpa]
gi|222841705|gb|EEE79252.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 112/411 (27%), Positives = 163/411 (39%), Gaps = 75/411 (18%)
Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP--CSQCYKQSD-----PVFDPADSASFS 207
+Y + + S P S Y+ D+GSD+VW CQP C C +++ P S + +
Sbjct: 81 DYTLSFTINSQPISLYL--DTGSDLVWFPCQPFECILCEGKAENASLASTPPPKLSKTAT 138
Query: 208 GVSCSSAVC--------------------DRLENAGCHAGRC-RYEVSYGDGSYT----K 242
VSC S+ C + +E + C C ++ +YGDGS +
Sbjct: 139 PVSCKSSACSAVHSNLPSSDLCAISNCPLESIEISDCRKHSCPQFYYAYGDGSLIARLYR 198
Query: 243 GTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGG---QTGGA 299
++ L + N GC H +G AG G G +SL QL Q G
Sbjct: 199 DSIRLPLSNQTNLIFNNFTFGCAHTTLAEPIGVAGF---GRGVLSLPAQLATLSPQLGNQ 255
Query: 300 FSYCLVSRGTGSS-----GSLVFGR-----EALPVGAAWVP------LVRNPRAPSFYYV 343
FSYCLVS S L+ GR + V P ++ NPR P FY V
Sbjct: 256 FSYCLVSHSFDSDRVRRPSPLILGRYDHDEKERRVNGVKKPSFVYTSMLDNPRHPYFYCV 315
Query: 344 GLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP 403
GL G+ +G +IP + L ++ + G GVV+D+GT T LP Y+ F + G +
Sbjct: 316 GLEGISIGRKKIPAPDFLRKVDRKGSGGVVVDSGTTFTMLPASLYDFVVAEFENRVGRVN 375
Query: 404 RASGVSIFDT----CYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDD-------- 451
+ V +T CY V V + G + LP N+ D
Sbjct: 376 ERASVIEENTGLSPCYYFDNNVVNVPRVVLHFVGNGSSVVLPRRNYFYEFLDGGHGKGKK 435
Query: 452 AGTFCFAFAP--SPSGLS-----IIGNIQQEGIQISFDGANGFVGFGPNVC 495
C + LS +GN QQ+G ++ +D N VGF C
Sbjct: 436 RKVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENRRVGFARRQC 486
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 95/374 (25%), Positives = 162/374 (43%), Gaps = 43/374 (11%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
G Y+ +IG+G+P + Y+ +D+G+D++WV C C +C +S+ +++ +S+S
Sbjct: 71 GLYYAKIGIGTPSKDYYLQVDTGTDMMWVNCIQCKECPTRSNLGMDLTLYNIKESSSGKL 130
Query: 209 VSCSSAVCDRLEN---AGCHA---GRCRYEVSYGDGSYTKGTLALETLTIG------RTV 256
V C +C + GC + C Y YGDGS T G + + +T
Sbjct: 131 VPCDQELCKEINGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQVSGDLKTA 190
Query: 257 VKN--VAIGCGHKNQGMFV-----GAAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSR 307
N V GCG + G G+LG G + S++ QL G+ F++CL
Sbjct: 191 SANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCL--N 248
Query: 308 GTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
G G G P PL+ P P Y V ++ + VG + +S D Q
Sbjct: 249 GVNGGGIFAIGHVVQPT-VNTTPLL--PDQP-HYSVNMTAIQVGHTFLNLSTDASE--QR 302
Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
G ++D+GT + LP Y+ ++Q NL + + TC+ SG V P
Sbjct: 303 DSKGTIIDSGTTLAYLPDGIYQPLVYKILSQQPNL-KVQTLHDEYTCFQYSGSVDDGFPN 361
Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS------PSGLSIIGNIQQEGIQISF 481
V+FYF G L + ++L ++ +C + S ++++G++ + +
Sbjct: 362 VTFYFENGLSLKVYPHDYLFLSENL--WCIGWQNSGAQSRDSKNMTLLGDLVLSNKLVFY 419
Query: 482 DGANGFVGFGPNVC 495
D N +G+ C
Sbjct: 420 DLENQVIGWTEYNC 433
>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 466
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 119/449 (26%), Positives = 186/449 (41%), Gaps = 82/449 (18%)
Query: 79 WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
+++EL+HRD + S +H + + H R +
Sbjct: 27 FSVELIHRDSIKSP--------FHDPKLTRH--------------DRFLAAARRSRARAA 64
Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ--------- 189
+DV S + G EY + VG+PP V D+GSD+VW++C
Sbjct: 65 ALLASDVSSDLFYGDFEYLAAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDS 124
Query: 190 ----------CYKQSDPVFDPADSASFSGVSCSSAVCDRLE-NAGCHAGR--CRYEVSYG 236
++ F+P DS+S+S V C C L NA C+ C + SY
Sbjct: 125 GNNSNSSPPPPPPEAVVYFNPFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSYR 184
Query: 237 DGSYTKGTLALETLTIG------RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVG 290
DG+ G LA +T T G T ++ GC G A G++GLG G +SL
Sbjct: 185 DGASATGLLAADTFTFGGNINNDTTSTASIDFGCATGTAGREFQADGMVGLGAGPLSLAS 244
Query: 291 QLGGQTGGAFSYCLVSRGTGSSGSLV-FGREAL--PVGAAWVPLV-RNPRAPSFYYVGLS 346
QLG + FS+CL + + S++ FG A+ GAA PL+ + A ++Y + +
Sbjct: 245 QLGRK----FSFCLTAYDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISID 300
Query: 347 GLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT-----RLPTPAYEAFRDAFVAQTGN 401
L V G +P + + + V++DTGT +T L P E+ A V
Sbjct: 301 SLKVAGQPVPGTTSVSK--------VIVDTGTVLTFLDRAALLAPLTESL--ARVMDGAG 350
Query: 402 LPRASGV-SIFDTCYNLSGFVSVR--VPTVSFYFSGGPV--LTLPASNFLIPVDDAGTFC 456
LPRA + CY++S V +P V+ GG + L + V + G C
Sbjct: 351 LPRAPPPDETLELCYDVSRVKDVDGVIPDVTLVLGGGGGGEVRLTGEGTFVLVKE-GVLC 409
Query: 457 FAF---APSPSGLSIIGNIQQEGIQISFD 482
A +P LS++GN+ + + + D
Sbjct: 410 LAVVTTSPELQPLSVLGNVALQDLHVGID 438
>gi|326515330|dbj|BAK03578.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 445
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 101/363 (27%), Positives = 158/363 (43%), Gaps = 31/363 (8%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCD 217
V IG G + ++V+D+ S + W++C C +Q PVFDP+DS+S+ + +S +C
Sbjct: 78 VTIGTGRGKSTYFLVLDTASSLPWMRCAHCLPVQRQRSPVFDPSDSSSYRPLHPTSPLC- 136
Query: 218 RLENAGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGRTV--VKNVAIGCGHKNQGMFVG 274
R N AG +C + + G + +T+ +G + +VA GC +G
Sbjct: 137 RAPNPVLPAGDKCSFHLP----GEAHGYVGTDTIILGNPTLPIHSVAFGCAQSTEGFDTK 192
Query: 275 A--AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG--TGSSGSLVFGRE----ALPVGA 326
AG LG+G SL+ Q+ + G FSYCL+ G G +G + FG + L V
Sbjct: 193 GTFAGTLGMGKLPTSLIMQIKDRVGSRFSYCLIGLGHSPGRNGFIRFGADIPDPTLLVHH 252
Query: 327 AWVPLVRNPRAP-----SFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMDTGTAV 380
L P P S YYV L G+ + G IP I + +F G G +D GT V
Sbjct: 253 RIKILPTPPHLPHGVADSAYYVKLLGISLNGTPIPGIRQAMFERRSDGSGGCFVDAGTQV 312
Query: 381 TRLPTPAYEAFRDAF--VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPV- 437
T L AY +A + Q R + F C+ + +P ++ F G
Sbjct: 313 THLVPAAYAVVEEAVAHMVQQWGYKRVRDPN-FSLCFREHPGIWSHIPKLTLDFEGPASR 371
Query: 438 ----LTLPASNFLIPVDDAGTFCF-AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGP 492
L + + N + VD+ CF + S +++G +QQ + FD + F
Sbjct: 372 TVAHLEIVSRNLFLKVDNQPLVCFGVYRTSRGSPTVVGAMQQVDTRFIFDLHANTITFHR 431
Query: 493 NVC 495
C
Sbjct: 432 ESC 434
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 104/367 (28%), Positives = 158/367 (43%), Gaps = 40/367 (10%)
Query: 157 FVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC 216
V + +G+PP+SQ M++D+GS + W+QC VFDP+ S+SFS + C+ +C
Sbjct: 78 LVSLPIGTPPQSQQMILDTGSQLSWIQCHKKVPRKPPPSTVFDPSLSSSFSVLPCNHPLC 137
Query: 217 -----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQ 269
D C R C Y Y DG+ +G L E +T + + +GC
Sbjct: 138 KPRIPDFTLPTSCDLNRLCHYSYFYADGTLAEGNLVREKITFSTSQSTPPLILGCAEDAS 197
Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----GTGSSGSLVFGREALPVG 325
G+LG+ G +S Q FSYC+ +R G +GS G G
Sbjct: 198 ----DDKGILGMNLGRLSFASQ---AKITKFSYCVPTRQVRPGFTPTGSFYLGENPNSAG 250
Query: 326 AAWVPLV---RNPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
++ L+ ++ R P+ + V L G+ +G ++ I FR G ++D+G+
Sbjct: 251 FQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGAGQSMIDSGS 310
Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRA------SGVSIFDTCYNLSGFVSVR-VPTVSFY 431
T L AY R+ V G PR SGVS D C++ + R + + F
Sbjct: 311 EFTYLVDVAYNKVREEVVRLAG--PRLKKGYVYSGVS--DMCFDGNAMEIGRLIGNMVFE 366
Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP---SGLSIIGNIQQEGIQISFDGANGFV 488
F G + + L V G C S + +IIGN Q+ + + FD AN V
Sbjct: 367 FDKGVEIVIEKGRVLADV-GGGVHCVGIGRSEMLGAASNIIGNFHQQNLWVEFDIANRRV 425
Query: 489 GFGPNVC 495
GFG C
Sbjct: 426 GFGKADC 432
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 107/380 (28%), Positives = 166/380 (43%), Gaps = 44/380 (11%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
SG+ +G YF RIG+G+P + Y+ +D+GSDI+WV C C C ++S+ ++DP
Sbjct: 81 SGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPR 140
Query: 202 DSASFSGVSCSSAVCDRLENAG------CHAGRCRYEVSYGDGSYTKGTLALETLTI--- 252
S S V+C C + N G C Y +SYGDGS T G + L
Sbjct: 141 GSQSGELVTCDQQFC--VANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQV 198
Query: 253 ---GRTVVKN--VAIGCGHKNQGMF----VGAAGLLGLGGGSMSLVGQL--GGQTGGAFS 301
G+T N V+ GCG K G + G+LG G + S++ QL G+ F+
Sbjct: 199 SGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFA 258
Query: 302 YCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
+CL + G G G P PLV P P Y V L G+ VGG + + ++
Sbjct: 259 HCLDTVNGG--GIFAIGNVVQP-KVKTTPLV--PDMPH-YNVILKGIDVGGTALGLPTNI 312
Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV 421
F G ++D+GT + +P Y+A A V + F +C+ SG V
Sbjct: 313 FD--SGNSKGTIIDSGTTLAYVPEGVYKALF-AMVFDKHQDISVQTLQDF-SCFQYSGSV 368
Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF------APSPSGLSIIGNIQQE 475
P V+F+F G L + ++L + +C F + ++G++
Sbjct: 369 DDGFPEVTFHFEGDVSLIVSPHDYLFQ-NGKNLYCMGFQNGGVQTKDGKDMVLLGDLVLS 427
Query: 476 GIQISFDGANGFVGFGPNVC 495
+ +D N +G+ C
Sbjct: 428 NKLVLYDLENQAIGWADYNC 447
>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
Length = 599
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 105/374 (28%), Positives = 163/374 (43%), Gaps = 41/374 (10%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-C-YKQSDPVFDPADSASFSGVSC 211
G ++ + +G+P R +++D+GS I +V C C + C D FDPA S+S + + C
Sbjct: 60 GYFYATLHLGTPARQFAVIVDTGSTITYVPCASCGRNCGPHHKDAAFDPASSSSSAVIGC 119
Query: 212 SS--AVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKN 268
S +C R GC R C Y+ +Y + S + G L + L + R V GC K
Sbjct: 120 DSDKCICGR-PPCGCSEKRECTYQRTYAEQSSSAGLLVSDQLQL-RDGAVEVVFGCETKE 177
Query: 269 QGMFVG--AAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSSGSLVFGR---EA 321
G A G+LGLG +SLV QL G F+ C S G+L+ G
Sbjct: 178 TGEIYNQEADGILGLGNSEVSLVNQLAGSGVIDDVFALCFGS--VEGDGALMLGDVDAAE 235
Query: 322 LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
V + L+ + P +Y V L L VGG ++P+ + + G V+D+GT T
Sbjct: 236 YDVALQYTALLSSLAHPHYYSVQLEALWVGGQQLPVKPERYEEGY----GTVLDSGTTFT 291
Query: 382 RLPTPAYEAFRDAFVAQT---------GNLPRASGVSIF-DTCY---------NLSGFVS 422
LP+ A++ F++A A G P+ + F D C+ + S
Sbjct: 292 YLPSEAFQLFKEAVSAYALEHGLNSVKGPDPKEKSFAQFHDICFGGAPHAGHADQSKLEK 351
Query: 423 VRVPTVSFYFSGGPVL-TLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISF 481
V P F+ G L T P + + + G +C + + +++G I I + +
Sbjct: 352 V-FPVFELQFADGVRLRTGPLNYLFMHTGEMGAYCLGVFDNGASGTLLGGISFRNILVQY 410
Query: 482 DGANGFVGFGPNVC 495
D N VGFG C
Sbjct: 411 DRRNRRVGFGAASC 424
>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 421
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 107/371 (28%), Positives = 165/371 (44%), Gaps = 55/371 (14%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSCS 212
G Y+V + +G+PPR ++ +D+GSD+ W+QC PC C K P++ P + V C
Sbjct: 56 GLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKL---VPCV 112
Query: 213 SAVCDRLEN--AGCH-----AGRCRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVA 261
+C L G H +C YE+ Y D + G L ++ + V +A
Sbjct: 113 DQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLA 172
Query: 262 IGCGHKNQGMFVGAA-------GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSS 312
GCG+ Q VG++ G+LGLG GS+SL+ QL G T +CL +RG
Sbjct: 173 FGCGYDQQ---VGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRG---G 226
Query: 313 GSLVFGREALPVG-AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
G L FG + +P A W P+ R+ + ++Y G + L GG + + R +
Sbjct: 227 GFLFFGDDIVPYSRATWAPMARS-TSRNYYSPGSANLYFGGRPLGV-----RPME----- 275
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFVSV-----RV 425
VV D+G++ T Y+A DA + NL S+ F SV
Sbjct: 276 VVFDSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEF 335
Query: 426 PTVSFYFSGGP--VLTLPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQI 479
TV FS G ++ +P N+LI V G C L+I+G+I + +
Sbjct: 336 RTVVLSFSNGKKALMEIPPENYLI-VTKYGNACLGILNGSEVGLKDLNIVGDITMQDQMV 394
Query: 480 SFDGANGFVGF 490
+D G +G+
Sbjct: 395 IYDNERGQIGW 405
>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 457
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 98/367 (26%), Positives = 152/367 (41%), Gaps = 30/367 (8%)
Query: 146 VSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC--QPCSQCYKQSDPVFDPADS 203
+S M Y ++ +GSP Y + DSGS +VW+QC C CY+Q P+F+P+ S
Sbjct: 91 ISRMSYTDKAYVMKFSIGSPAVDTYAIPDSGSSLVWLQCGTPYCRNCYRQKIPLFNPSKS 150
Query: 204 ASFSGVSCSSAVC-----DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV-- 256
++ C++A C D C+Y Y D SYT+G ++ + T +
Sbjct: 151 VTYMKRLCNTAECRVALGDEYWRCKKPNQICKYHEDYLDDSYTEGVISTDIFTFPEHISG 210
Query: 257 ----VKNVAIGCGHKNQG-MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL---VSRG 308
+ GCG+ N GL+GL SLVGQ+ FSYC+ +
Sbjct: 211 FGNYTLRIIFGCGYNNSDPQHFYPPGLVGLTNNKASLVGQMDVD---QFSYCVSIDTEQN 267
Query: 309 TGSSGSLVFGREALPVGAAWVPLVRNPRAPSFY-YVGLSGLGVGGMRIP-ISEDLFRLTQ 366
S + FG A G + LV P + +Y + + G+ V + +F+ T+
Sbjct: 268 LKGSMEIRFGLAASISGHS-TQLV--PNSDGWYIFKNVDGIYVNEFEVEGYPAWVFKYTE 324
Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS-GVSIFDTCYNLSGFVSVRV 425
G G+ MDTGT T L + +P S F+ CY F+ +
Sbjct: 325 GGQGGLTMDTGTTYTELHNSVMDPLIKLLEEHITIVPEKDYSNSGFELCYFSDDFLGATL 384
Query: 426 PTVSFYFSGGP--VLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDG 483
P + F+ + N P + C A + +G+SIIG Q I+I +D
Sbjct: 385 PDIELRFTDNKDTYFSFNTRNAWTP-NGRSQMCLAMFRT-NGMSIIGMHQLRDIKIGYDL 442
Query: 484 ANGFVGF 490
+ V F
Sbjct: 443 HHNIVSF 449
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 95/374 (25%), Positives = 167/374 (44%), Gaps = 51/374 (13%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
G YF +I +GSPP+ ++ +D+GSDI+WV C+PC +C +++ +FD S++
Sbjct: 72 GLYFTKIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPSKTNLNFHLSLFDVNASSTSKK 131
Query: 209 VSCSSAVCDRL-ENAGCH-AGRCRYEVSYGDGSYTKGTLALETLTIGRT--------VVK 258
V C C + ++ C A C Y + Y D S ++G + LT+ + + +
Sbjct: 132 VGCDDDFCSFISQSDSCQPAVGCSYHIVYADESTSEGNFIRDKLTLEQVTGDLQTGPLGQ 191
Query: 259 NVAIGCGHKNQGMF----VGAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSS 312
V GCG G G++G G + S++ QL G FS+CL +
Sbjct: 192 EVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDN------ 245
Query: 313 GSLVFGREALPVGAAWVPLVR-NPRAPS--FYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
V G VG P V+ P P+ Y V L G+ V G + + + R +
Sbjct: 246 ---VKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTALDLPPSIMR-----N 297
Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDT--CYNLSGFVSVRVPT 427
G ++D+GT + P Y++ + +A+ P + + DT C++ S V V P
Sbjct: 298 GGTIVDSGTTLAYFPKVLYDSLIETILARQ---PVKLHI-VEDTFQCFSFSENVDVAFPP 353
Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP------SPSGLSIIGNIQQEGIQISF 481
VSF F LT+ ++L ++ +CF + + + ++G++ + +
Sbjct: 354 VSFEFEDSVKLTVYPHDYLFTLEKE-LYCFGWQAGGLTTGERTEVILLGDLVLSNKLVVY 412
Query: 482 DGANGFVGFGPNVC 495
D N +G+ + C
Sbjct: 413 DLENEVIGWADHNC 426
>gi|224030719|gb|ACN34435.1| unknown [Zea mays]
Length = 216
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 75/215 (34%), Positives = 102/215 (47%), Gaps = 5/215 (2%)
Query: 286 MSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVG 344
MSL+ Q G + G FSYCL S R SGSL G P PL+ NP PS YYV
Sbjct: 1 MSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQPRNVRHTPLLTNPHRPSLYYVN 60
Query: 345 LSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR 404
++GL VG + + F G V+D+GT +TR P Y A R+ F Q
Sbjct: 61 VTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSG 120
Query: 405 ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS 464
+ + FDTC+N + P V+ + GG LTLP N LI C A A +P
Sbjct: 121 YTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQ 180
Query: 465 ----GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
++++ N+QQ+ +++ D A VGF C
Sbjct: 181 NVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 215
>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 451
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 107/371 (28%), Positives = 165/371 (44%), Gaps = 55/371 (14%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSCS 212
G Y+V + +G+PPR ++ +D+GSD+ W+QC PC C K P++ P + V C
Sbjct: 56 GLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKL---VPCV 112
Query: 213 SAVCDRLEN--AGCH-----AGRCRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVA 261
+C L G H +C YE+ Y D + G L ++ + V +A
Sbjct: 113 DQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLA 172
Query: 262 IGCGHKNQGMFVGAA-------GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSS 312
GCG+ Q VG++ G+LGLG GS+SL+ QL G T +CL +RG
Sbjct: 173 FGCGYDQQ---VGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRG---G 226
Query: 313 GSLVFGREALPVG-AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
G L FG + +P A W P+ R+ + ++Y G + L GG + + R +
Sbjct: 227 GFLFFGDDIVPYSRATWAPMARS-TSRNYYSPGSANLYFGGRPLGV-----RPME----- 275
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFVSV-----RV 425
VV D+G++ T Y+A DA + NL S+ F SV
Sbjct: 276 VVFDSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEF 335
Query: 426 PTVSFYFSGGP--VLTLPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQI 479
TV FS G ++ +P N+LI V G C L+I+G+I + +
Sbjct: 336 RTVVLSFSNGKKALMEIPPENYLI-VTKYGNACLGILNGSEVGLKDLNIVGDITMQDQMV 394
Query: 480 SFDGANGFVGF 490
+D G +G+
Sbjct: 395 IYDNERGQIGW 405
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 102/338 (30%), Positives = 152/338 (44%), Gaps = 38/338 (11%)
Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
SG+ +G YF RIG+G+P + Y+ +D+GSDI+WV C C C ++S+ ++DP
Sbjct: 81 SGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPR 140
Query: 202 DSASFSGVSCSSAVCDRLENAG------CHAGRCRYEVSYGDGSYTKGTLALETLTI--- 252
S S V+C C + N G C Y +SYGDGS T G + L
Sbjct: 141 GSQSGELVTCDQQFC--VANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQV 198
Query: 253 ---GRTVVKN--VAIGCGHKNQGMF----VGAAGLLGLGGGSMSLVGQL--GGQTGGAFS 301
G+T N V+ GCG K G + G+LG G + S++ QL G+ F+
Sbjct: 199 SGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFA 258
Query: 302 YCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
+CL + G G G P PLV P P Y V L G+ VGG + + ++
Sbjct: 259 HCLDTVNGG--GIFAIGNVVQPK-VKTTPLV--PDMPH-YNVILKGIDVGGTALGLPTNI 312
Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV 421
F G ++D+GT + +P Y+A A V + F +C+ SG V
Sbjct: 313 FD--SGNSKGTIIDSGTTLAYVPEGVYKALF-AMVFDKHQDISVQTLQDF-SCFQYSGSV 368
Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF 459
P V+F+F G L + ++L + +C F
Sbjct: 369 DDGFPEVTFHFEGDVSLIVSPHDYLFQ-NGKNLYCMGF 405
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 108/367 (29%), Positives = 164/367 (44%), Gaps = 33/367 (8%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
G Y+ ++ +G+PPR + ID+GSD++WV C C+ C K S+ FDP S+S S
Sbjct: 82 GLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASL 141
Query: 209 VSCSSAVC--DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAI--- 262
VSCS C + +GC C Y YGDGS T G + ++ + +AI
Sbjct: 142 VSCSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINSS 201
Query: 263 -----GCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGS 311
GC + G G+ GLG GS+S++ QL G FS+CL +G
Sbjct: 202 APFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSG- 260
Query: 312 SGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
G +V G+ P + PLV P P Y V L + V G +PI +F + DG
Sbjct: 261 GGIMVLGQIKRP-DTVYTPLV--PSQPH-YNVNLQSIAVNGQILPIDPSVFTIAT--GDG 314
Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
++DTGT + LP AY F A R + C+ ++ P VS
Sbjct: 315 TIIDTGTTLAYLPDEAYSPFIQAVANAVSQYGRPITYESYQ-CFEITAGDVDVFPQVSLS 373
Query: 432 FSGGPVLTLPASNFLIPVDDAGT--FCFAFAP-SPSGLSIIGNIQQEGIQISFDGANGFV 488
F+GG + L +L +G+ +C F S ++I+G++ + + +D +
Sbjct: 374 FAGGASMVLGPRAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQRI 433
Query: 489 GFGPNVC 495
G+ C
Sbjct: 434 GWAEYDC 440
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 96/308 (31%), Positives = 139/308 (45%), Gaps = 44/308 (14%)
Query: 104 HQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVG 163
H H+ QR ++R+ V G + D+ + G Y+ RI +G
Sbjct: 5 HYHTLRKHDQRRLRRMLPEVVSFPISGDN-----------DIFA-----MGLYYTRISLG 48
Query: 164 SPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-PV----FDPADSASFSGVSCSSAVCDR 218
+PP+ Y+ +D+GS++ WV+C PC+ C D PV FDP S + +SC+ A C
Sbjct: 49 TPPQQFYVDVDTGSNVAWVKCAPCTGCEHSGDVPVPMSTFDPRKSTTKISISCTDAECGV 108
Query: 219 L-ENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVVKN---------VAIGCGH 266
L + C R C Y + YGDGS T G + T + N + GCG
Sbjct: 109 LNKKLQCSPERLSCPYSLLYGDGSSTAGYYLNDVFTFNQVPSDNSTAKSGTARLVFGCGG 168
Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSSGSLVFGREALPV 324
G + GLLG G ++SL QL Q + F++CL +G GSLV G P
Sbjct: 169 TQTGSW-SVDGLLGFGPTTVSLPNQLAQQNISVNIFAHCLQGDVSG-RGSLVIGTIREP- 225
Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
+ P+V Y V L +G+ G + F L G GV++D+GT +T L
Sbjct: 226 DLVYTPMV---FGEDHYNVQLLNIGISGRNVTTPAS-FDLEYTG--GVIIDSGTTLTYLV 279
Query: 385 TPAYEAFR 392
PAY+ FR
Sbjct: 280 QPAYDEFR 287
>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
Length = 473
Score = 114 bits (286), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 108/415 (26%), Positives = 174/415 (41%), Gaps = 48/415 (11%)
Query: 117 KRVATLVRRLSGGGADAAKHEVQDFGTDV---VSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
K V V + GG + V F + V G +G YF I VGSPPR ++ +
Sbjct: 59 KFVDFHVNDMKPGGINKLATSVSAFDSSTIFPVRGDVYPNGLYFTHIFVGSPPRRYFLDM 118
Query: 174 DSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSCSSAVC----DRLENAGCHA-G 227
D+GSD+ W+QC PC+ C K +P++ P + V ++C L+ C
Sbjct: 119 DTGSDLTWIQCDAPCTSCAKGPNPLYKPKKG---NLVPLKDSLCVEVQRNLKTGYCETCE 175
Query: 228 RCRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVAIGCGHKNQGMFVGAA----GLL 279
+C YE+ Y D S + G LA + L + G + GC + QG+ + + G+L
Sbjct: 176 QCDYEIEYADHSSSMGVLASDDLHLMLANGSLTKLGIMFGCAYDQQGLLLNSLAKTDGIL 235
Query: 280 GLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSSGSLVFGREALPV-GAAWVPLVRNPR 336
GL +SL QL Q +CL S TG G + G + +P G AWVP++ N
Sbjct: 236 GLSKAKVSLPSQLASQRIINNVLGHCLTSDATG-GGYMFLGDDFVPYWGMAWVPML-NSH 293
Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF----- 391
+P+ Y+ + + G ++ + R + VV DTG++ T P AY A
Sbjct: 294 SPN-YHSQIMKISHGSRQLSLGRQDGRTER-----VVFDTGSSYTYFPKEAYYALVASLK 347
Query: 392 --RDAFVAQTGNLPRA-----SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
D + Q G+ P + I F + + S ++ +P
Sbjct: 348 DVSDEGLIQDGSDPTLPVCWRAKFPIRSVIDVKQFFQPLTLQFRSKWWIVSTKFRIPPEG 407
Query: 445 FLIPVDDAGTFCFAFAPSPS----GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
+LI + + G C + I+G+I G + +D N +G+ + C
Sbjct: 408 YLI-ISNKGNVCLGILDGSNVHDGSTIILGDISLRGKLVVYDNVNQKIGWAQSTC 461
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 114 bits (286), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 101/375 (26%), Positives = 166/375 (44%), Gaps = 45/375 (12%)
Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD-----PADSASFS 207
SG YF +IG+G+P + Y+ +D+GSDI+WV C C+ C K+SD + P+ S++ +
Sbjct: 71 SGLYFAKIGLGTPVQDYYVQVDTGSDILWVNCAGCTNCPKKSDLGIELSLYSPSSSSTSN 130
Query: 208 GVSCSSAVCDRLENA---GCHAG-RCRYEVSYGDGSYTKGTLALETLTIGR------TVV 257
V+C+ C + GC C Y V+YGDGS T G + + + R T
Sbjct: 131 RVTCNQDFCTSTYDGPIPGCTPELLCEYRVAYGDGSSTAGYFVRDHVVLDRVTGNFQTTS 190
Query: 258 KN--VAIGCGHKNQGMF----VGAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGT 309
N + GCG + G G+LG G + S++ QL G+ F++CL +
Sbjct: 191 TNGSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKVKRVFAHCLDN--- 247
Query: 310 GSSGSLVFGREALPVGAAWVPLVR-NPRAP--SFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
+ G +G P VR P P + Y V + + V + + D+F
Sbjct: 248 ------INGGGIFAIGEVVQPKVRTTPLVPQQAHYNVFMKAIEVDNEVLNLPTDVFDTDL 301
Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
G ++D+GT + P YE A+ L + F TC+ G V P
Sbjct: 302 R--KGTIIDSGTTLAYFPDVIYEPLISKIFARQSTLKLHTVEEQF-TCFEYDGNVDDGFP 358
Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF----APSPSG--LSIIGNIQQEGIQIS 480
TV+F+F LT+ +L + D+ +C + A S G + ++G++ + +
Sbjct: 359 TVTFHFEDSLSLTVYPHEYLFDI-DSNKWCVGWQNSGAQSRDGKDMILLGDLVLQNRLVM 417
Query: 481 FDGANGFVGFGPNVC 495
+D N +G+ C
Sbjct: 418 YDLENQTIGWTEYNC 432
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 114 bits (286), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 108/392 (27%), Positives = 175/392 (44%), Gaps = 54/392 (13%)
Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPAD 202
G+ +G YF I +G+PP+ Y+ +D+GSDI+WV C CS+C ++S +DP
Sbjct: 79 GLPTDTGLYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCSKCPRKSGLGLDLTFYDPKA 138
Query: 203 SASFSGVSCSSAVCDRL---ENAGCHAG-RCRYEVSYGDGSYTKGTLALETLTI------ 252
S+S S VSC C + GC A C Y V YGDGS T G + L
Sbjct: 139 SSSGSTVSCDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFITDALQFDQVTGD 198
Query: 253 GRTVVKNVAI--GCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQL--GGQTGGAFSYC 303
G+T N I GCG + QG +G + G+LG G + S++ QL G+ F++C
Sbjct: 199 GQTQPGNATITFGCGAQ-QGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHC 257
Query: 304 LVS-RGTG--SSGSLV-----------FGREALPVGAAWVPLVRNPRAPSFYYVGLSGLG 349
L + +G G + G++V G +P+ + L+ P Y V L +
Sbjct: 258 LDTIKGGGIFAIGNVVQPKCYFVFFFAHGLLNIPLFLLVMILLSRPH----YNVNLKSID 313
Query: 350 VGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS 409
VGG + + +F + G ++D+GT +T LP ++ D ++ ++ +
Sbjct: 314 VGGTTLQLPAHVFETGE--KKGTIIDSGTTLTYLPELVFKQVMDVVFSKHRDIAFHNLQD 371
Query: 410 IFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA----PSPSG 465
C+ SG V PT++F+F L + + P + +C F S G
Sbjct: 372 FL--CFQYSGSVDDGFPTITFHFEDDLALHVYPHEYFFP-NGNDIYCVGFQNGALQSKDG 428
Query: 466 LSII--GNIQQEGIQISFDGANGFVGFGPNVC 495
I+ G++ + +D N +G+ C
Sbjct: 429 KDIVLMGDLVLSNKLVVYDLENQVIGWTDYNC 460
>gi|297740190|emb|CBI30372.3| unnamed protein product [Vitis vinifera]
Length = 445
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 104/354 (29%), Positives = 155/354 (43%), Gaps = 47/354 (13%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
G Y V + G+P ++ V+D+GS +VW C C + S P DPA +F SS
Sbjct: 104 GGYSVSLSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSS 163
Query: 214 A------------VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA 261
A V D +A C Y + YG G+ T G L LE+L +
Sbjct: 164 AKIVGCLNPKCGFVMDSENSANCTKACPTYAIQYGLGT-TVGLLLLESLVFAERTEPDFV 222
Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR-----GTGSSGSLV 316
+GC + +G+ G G G SL Q+G + FSYCL+S S +L
Sbjct: 223 VGCSILSSRQ---PSGIAGFGRGPSSLPKQMGLK---KFSYCLLSHRFDDSPKSSKMTLY 276
Query: 317 FG---REALPVGAAWVPLVRNPRA-----PSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
G ++ G ++ P +NP + +YYV L + VG R+ + G
Sbjct: 277 VGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVAGSDG 336
Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV---SIFDTCYNLSGFVSVRV 425
+ G ++D+G+ T + P +EA F Q N RA+ V S C+NLSG SV +
Sbjct: 337 NGGTIVDSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALSGLKPCFNLSGVGSVAL 396
Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQI 479
P++ F F GG + LP +N+ V D C L+I+ N E ++I
Sbjct: 397 PSLVFQFKGGAKMELPVANYFSLVGDLSVLC---------LTIVSN---EAVEI 438
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 106/367 (28%), Positives = 162/367 (44%), Gaps = 40/367 (10%)
Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
V + VGSPP++ MV+D+GS++ W+ C+ + F+P S+S++ C+S++C
Sbjct: 62 VSLTVGSPPQNVTMVLDTGSELSWLHCKKLPNL----NSTFNPLLSSSYTPTPCNSSICT 117
Query: 217 ----DRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC----GH 266
D A C C VSY D S +GTLA ET ++ GC G+
Sbjct: 118 TRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQPGTLFGCMDSAGY 177
Query: 267 KNQ-GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR-EALPV 324
+ GL+G+ GS+SLV Q+ + FSYC+ G + G L+ G P
Sbjct: 178 TSDINEDSKTTGLMGMNRGSLSLVTQM---SLPKFSYCI--SGEDALGVLLLGDGTDAPS 232
Query: 325 GAAWVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
+ PLV +P F Y V L G+ V + + + +F G ++D+GT
Sbjct: 233 PLQYTPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDSGTQ 292
Query: 380 VTRLPTPAYEAFRDAFVAQT-GNLPRASGVSI-----FDTCYNLSGFVSVRVPTVSFYFS 433
T L Y + +D F+ QT G L R + D CY+ + VP V+ FS
Sbjct: 293 FTFLLGSVYSSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPASFAA-VPAVTLVFS 351
Query: 434 GGPVLTLPASNFLIPVDDAG--TFCFAFAPSP-SGLS--IIGNIQQEGIQISFDGANGFV 488
G + + L V +CF F S G+ +IG+ Q+ + + FD V
Sbjct: 352 GAE-MRVSGERLLYRVSKGSDWVYCFTFGNSDLLGIEAYVIGHHHQQNVWMEFDLLKSRV 410
Query: 489 GFGPNVC 495
GF C
Sbjct: 411 GFTQTTC 417
>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 430
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 102/365 (27%), Positives = 156/365 (42%), Gaps = 50/365 (13%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSCS 212
G Y+V + +G P + ++ +D+GSD+ W+QC PC C K P + P + V C+
Sbjct: 71 GHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPWYKPTKNKI---VPCA 127
Query: 213 SAVCDRL-ENAGCHAG-RCRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVAIGCGH 266
+++C L N C +C Y++ Y D + + G L + T+ TV N+ GCG+
Sbjct: 128 ASLCTSLTPNKKCAVPQQCDYQIKYTDKASSLGVLIADNFTLSLRNSSTVRANLTFGCGY 187
Query: 267 -----KNQGMFVGAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSSGSLVFGR 319
KN + GLLGLG G++SL+ QL Q T +C + G G L FG
Sbjct: 188 DQQVGKNGAVQAATDGLLGLGKGAVSLLSQLKQQGVTKNVLGHCFSTNG---GGFLFFGD 244
Query: 320 EALPVG-AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF---RLTQMGDDGVVMD 375
+ +P WVP+ R G P S L+ R M VV D
Sbjct: 245 DIVPTSRVTWVPMARTTS--------------GNYYSPGSGTLYFDRRSLGMKPMEVVFD 290
Query: 376 TGTAVTRLPTPAYEAFRDAFVA-QTGNLPRASGVSIFDTCYN----LSGFVSVRVPTVSF 430
+G+ Y+A A A + +L S VS+ C+ V+ S
Sbjct: 291 SGSTYAYFAAEPYQATVSALKAGLSKSLKEVSDVSL-PLCWKGQKVFKSVSEVKNDFKSL 349
Query: 431 YFSGGP--VLTLPASNFLIPVDDAGTFCFAFAPSPSG---LSIIGNIQQEGIQISFDGAN 485
+ S G V+ +P N+LI V G C + +IIG+I + I +D
Sbjct: 350 FLSFGKNSVMEIPPENYLI-VTKYGNVCLGILDGTTAKLKFNIIGDITMQDQMIIYDNEK 408
Query: 486 GFVGF 490
G +G+
Sbjct: 409 GQLGW 413
>gi|242092874|ref|XP_002436927.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
gi|241915150|gb|EER88294.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
Length = 484
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 122/459 (26%), Positives = 182/459 (39%), Gaps = 47/459 (10%)
Query: 63 RHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATL 122
+ SS S R L +VHR +S S + S + RD R +L
Sbjct: 47 KQTPTCSSAHSGTSRRDTLPVVHR--LSPCSPLGAARIQQLEKPSVADILHRDALRFRSL 104
Query: 123 VRRLSGG---------GADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
R + G GAD + G D + + G+ EY V G G+P + +
Sbjct: 105 FRDHNHGSAAPAPTSPGADGGGLSIPSRG-DPIQEL-PGAFEYHVTAGFGTPVQQFTVGF 162
Query: 174 DSGSD-IVWVQCQPCS---QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRC 229
D+ + +QC+PC+ C+ FDP+ S+S + V C S C N GC C
Sbjct: 163 DTTTTGATQLQCKPCAADEPCHH----AFDPSASSSIAHVPCGSPDCPF--NKGCSGHSC 216
Query: 230 RYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSL 288
VS + T + LT+ +V + C + G+L L S SL
Sbjct: 217 TLSVSINNTLLGNATFFTDKLTLTPWNIVDDFRFVCLEAGFRPDDDSTGILDLSRNSHSL 276
Query: 289 VGQLGGQTGGA--FSYCLVSRGT-------GSSGSLVFGREALPVGAAWVPLVRNPRAPS 339
+ + A FSYCL S + G++ + GR+ ++ PL N +
Sbjct: 277 ASRAAPSSPDAVAFSYCLPSYPSDVGFLSLGATKPELLGRKV-----SYTPLRSNRHNGN 331
Query: 340 FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQT 399
Y V L GLG+GG+ +P+ + G +++ T T L Y A RD F
Sbjct: 332 LYVVELVGLGLGGVDLPVPR-----AAIAGGGTILELHTTFTYLKPKVYAALRDEFRKSM 386
Query: 400 GNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF---C 456
P A DTCYN + S VP V+ F GG L + + F C
Sbjct: 387 SQYPVAPPQGSLDTCYNFTALSSYSVPAVTLKFDGGAEFDLWIDEMMYFPEPGSYFSVGC 446
Query: 457 FAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
AF + G ++IG++ Q ++ +D G VGF P C
Sbjct: 447 LAFV-AQDGGAVIGSMAQMSTEVVYDVRGGKVGFVPYRC 484
>gi|147789749|emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]
Length = 609
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 112/389 (28%), Positives = 162/389 (41%), Gaps = 55/389 (14%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
G Y V + G+P ++ V+D+GS +VW C C + S P DPA +F SS
Sbjct: 88 GGYSVSLSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSS 147
Query: 214 AV-----------------------CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETL 250
A CD+ +A C Y + YG G+ T G L LE+L
Sbjct: 148 AKIVGCLNPKCGFVMDSEVRTRCPGCDQ-NSANCTKACPTYAIQYGLGT-TVGLLLLESL 205
Query: 251 TIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--- 307
+ +GC + +G+ G G G SL Q+G + FSYCL+S
Sbjct: 206 VFAERTEPDFVVGCSILSSRQ---PSGIAGFGRGPSSLPKQMGLK---KFSYCLLSHRFD 259
Query: 308 --GTGSSGSLVFG---REALPVGAAWVPLVRNPRA-----PSFYYVGLSGLGVGGMRIPI 357
S +L G ++ G ++ P +NP + +YYV L + VG R+
Sbjct: 260 DSPKSSKMTLYVGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKX 319
Query: 358 SEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV---SIFDTC 414
G+ G ++D+G+ T + P +EA F Q N RA+ V S C
Sbjct: 320 PYSFMVAGSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALSGLKPC 379
Query: 415 YNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-------SGLS 467
+NLSG SV +P++ F F GG + LP +N+ V D C + SG S
Sbjct: 380 FNLSGVGSVALPSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAVGSTLSSGPS 439
Query: 468 II-GNIQQEGIQISFDGANGFVGFGPNVC 495
II GN Q + +D N GF C
Sbjct: 440 IILGNYQSQNFYTEYDLENERFGFRRQRC 468
>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
Length = 538
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 103/385 (26%), Positives = 160/385 (41%), Gaps = 44/385 (11%)
Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYK--------------------Q 193
G Y V + G+P +V+D+ +D+ W+ C+ + K +
Sbjct: 125 GMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAAKEAR 184
Query: 194 SDPVFDPADSASFSGVSCSSAVCDRLENAGCH----AGRCRYEVSYGDGSYTKGTLALET 249
+ PA S+S+ + CS C L C A C Y DG+ T G E
Sbjct: 185 RKNWYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQDGTLTMGIYGKEK 244
Query: 250 LTI----GRTV-VKNVAIGCGHKNQGMFVGAA-GLLGLGGGSMSLVGQLGGQTGGAFSYC 303
T+ GR + + +GC G V A G+L LG G MS + G FS+C
Sbjct: 245 ATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVHAAKRFGQRFSFC 304
Query: 304 LVSRGTGSSGS--LVFGREALPVGAAWVP--LVRNPRAPSFYYVGLSGLGVGGMRIPISE 359
L+S + S L FG +G + +V N Y ++G+ VGG R+ I +
Sbjct: 305 LLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGERLDIPQ 364
Query: 360 DLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSG 419
+++ ++ GV++DT T+VT L AY A A +LPR + F+ CY +
Sbjct: 365 EIWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYELDGFEYCYRWT- 423
Query: 420 FV--------SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS-GLSIIG 470
F +V VP ++ +GG L A + ++P G C AF P G I+G
Sbjct: 424 FAGDGVDLAHNVTVPRLTVEMAGGARLEPEAKSVVMPEVVPGVACLAFRKLPRGGPGILG 483
Query: 471 NIQQEGIQISFDGANGFVGFGPNVC 495
N+ + D G + F + C
Sbjct: 484 NVLMQEYIWEIDHGKGKMRFRKDKC 508
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.135 0.407
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,186,560,293
Number of Sequences: 23463169
Number of extensions: 382816785
Number of successful extensions: 1077599
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1741
Number of HSP's successfully gapped in prelim test: 2717
Number of HSP's that attempted gapping in prelim test: 1067543
Number of HSP's gapped (non-prelim): 5415
length of query: 495
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 348
effective length of database: 8,910,109,524
effective search space: 3100718114352
effective search space used: 3100718114352
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 79 (35.0 bits)
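The footer values above can be cross-checked against the hit lines with the standard Karlin-Altschul relations. The sketch below is an illustration, not part of the BLAST output. It assumes the second "Lambda K H" row (0.267, 0.0410) holds the gapped parameters, and the function names are hypothetical. Because this search used compositional matrix adjustment, per-hit values printed in the report may differ slightly from what these formulas give.

```python
import math

# Values copied from the statistics footer of this report.
LAMBDA_GAPPED = 0.267          # assumed: second "Lambda K H" row is the gapped one
K_GAPPED = 0.0410
QUERY_LENGTH = 495             # "length of query"
EFFECTIVE_HSP_LENGTH = 147     # "effective HSP length"
EFFECTIVE_DB_LENGTH = 8_910_109_524  # "effective length of database"


def effective_search_space(query_len, hsp_len, eff_db_len):
    """Effective query length times effective database length."""
    return (query_len - hsp_len) * eff_db_len


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, lam, k, search_space):
    """Karlin-Altschul expectation: E = K * m * n * exp(-lambda * S)."""
    return k * search_space * math.exp(-lam * raw_score)


space = effective_search_space(
    QUERY_LENGTH, EFFECTIVE_HSP_LENGTH, EFFECTIVE_DB_LENGTH
)
bits = bit_score(287, LAMBDA_GAPPED, K_GAPPED)
e_val = expect_value(287, LAMBDA_GAPPED, K_GAPPED, space)

print(space)           # compare with "effective search space" in the footer
print(round(bits))     # compare with "Score = 115 bits (287)"
print(f"{e_val:.1e}")  # compare with "Expect = 6e-23" / "7e-23" for raw score 287
```

The raw score 287 is taken from several hits above, for example gi|115446115 and gi|224030719. The same functions can be applied to any other raw score in the report.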