BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 016916
(380 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 523 bits (1346), Expect = e-146, Method: Compositional matrix adjust.
Identities = 247/353 (69%), Positives = 291/353 (82%), Gaps = 3/353 (0%)
Query: 27 SSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC 86
S F SS++F VHGNVYP GYYNVT+ IGQP RPY+LDLDTGSDLTWLQCDAPCVRC
Sbjct: 36 SDRFTRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC 95
Query: 87 VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAF 146
+EAPHPLY+PS+DL+PC DP+C +LH + CE P QCDYE+EYADGGSSLGVLV+D F
Sbjct: 96 LEAPHPLYQPSSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVF 155
Query: 147 AFNYTNGQRLNPRLALGCGYNQVPGA-SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVG 205
+ NYT G RL PRLALGCGY+Q+PGA S+HPLDG+LGLG+GK SI+SQLHSQ ++NV+G
Sbjct: 156 SMNYTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIG 215
Query: 206 HCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGV-AELFFGGETTGLKNLPVVF 264
HCLS GGG LFFGDDLYDSSRV WT MS +Y+K+YSP + EL FGG TTGLKNL VF
Sbjct: 216 HCLSSLGGGILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLTVF 275
Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
DSGSSYTY N YQ +T ++K+ELS K LKEA +D TLPLCW+GRRPF ++ +VKK F+
Sbjct: 276 DSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFK 335
Query: 325 TLALSFTDG-KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
LALSF G +++TLFE+ PEAYLIIS KGNVCLGILNG E+GLQ+LN+IG I
Sbjct: 336 PLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDI 388
>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 523 bits (1346), Expect = e-146, Method: Compositional matrix adjust.
Identities = 247/353 (69%), Positives = 291/353 (82%), Gaps = 3/353 (0%)
Query: 27 SSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC 86
S F SS++F VHGNVYP GYYNVT+ IGQP RPY+LDLDTGSDLTWLQCDAPCVRC
Sbjct: 36 SDRFTRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC 95
Query: 87 VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAF 146
+EAPHPLY+PS+DL+PC DP+C +LH + CE P QCDYE+EYADGGSSLGVLV+D F
Sbjct: 96 LEAPHPLYQPSSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVF 155
Query: 147 AFNYTNGQRLNPRLALGCGYNQVPGA-SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVG 205
+ NYT G RL PRLALGCGY+Q+PGA S+HPLDG+LGLG+GK SI+SQLHSQ ++NV+G
Sbjct: 156 SMNYTKGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIG 215
Query: 206 HCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGV-AELFFGGETTGLKNLPVVF 264
HCLS GGG LFFGDDLYDSSRV WT MS +Y+K+YSP + EL FGG TTGLKNL VF
Sbjct: 216 HCLSSLGGGILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLTVF 275
Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
DSGSSYTY N YQ +T ++K+ELS K LKEA +D TLPLCW+GRRPF ++ +VKK F+
Sbjct: 276 DSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFK 335
Query: 325 TLALSFTDG-KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
LALSF G +++TLFE+ PEAYLIIS KGNVCLGILNG E+GLQ+LN+IG I
Sbjct: 336 PLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDI 388
>gi|79495937|ref|NP_567922.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660833|gb|AEE86233.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 401
Score = 522 bits (1345), Expect = e-146, Method: Compositional matrix adjust.
Identities = 247/352 (70%), Positives = 291/352 (82%), Gaps = 3/352 (0%)
Query: 27 SSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC 86
S F SS++F VHGNVYP GYYNVT+ IGQP RPY+LDLDTGSDLTWLQCDAPCVRC
Sbjct: 33 SDRFTRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC 92
Query: 87 VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAF 146
+EAPHPLY+PS+DL+PC DP+C +LH + CE P QCDYE+EYADGGSSLGVLV+D F
Sbjct: 93 LEAPHPLYQPSSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVF 152
Query: 147 AFNYTNGQRLNPRLALGCGYNQVPGA-SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVG 205
+ NYT G RL PRLALGCGY+Q+PGA S+HPLDG+LGLG+GK SI+SQLHSQ ++NV+G
Sbjct: 153 SMNYTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIG 212
Query: 206 HCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGV-AELFFGGETTGLKNLPVVF 264
HCLS GGG LFFGDDLYDSSRV WT MS +Y+K+YSP + EL FGG TTGLKNL VF
Sbjct: 213 HCLSSLGGGILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLTVF 272
Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
DSGSSYTY N YQ +T ++K+ELS K LKEA +D TLPLCW+GRRPF ++ +VKK F+
Sbjct: 273 DSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFK 332
Query: 325 TLALSFTDG-KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGG 375
LALSF G +++TLFE+ PEAYLIIS KGNVCLGILNG E+GLQ+LN+IGG
Sbjct: 333 PLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGG 384
>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
Length = 413
Score = 522 bits (1345), Expect = e-145, Method: Compositional matrix adjust.
Identities = 247/353 (69%), Positives = 291/353 (82%), Gaps = 3/353 (0%)
Query: 27 SSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC 86
S F SS++F VHGNVYP GYYNVT+ IGQP RPY+LDLDTGSDLTWLQCDAPCVRC
Sbjct: 24 SDRFTRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC 83
Query: 87 VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAF 146
+EAPHPLY+PS+DL+PC DP+C +LH + CE P QCDYE+EYADGGSSLGVLV+D F
Sbjct: 84 LEAPHPLYQPSSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVF 143
Query: 147 AFNYTNGQRLNPRLALGCGYNQVPGA-SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVG 205
+ NYT G RL PRLALGCGY+Q+PGA S+HPLDG+LGLG+GK SI+SQLHSQ ++NV+G
Sbjct: 144 SMNYTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIG 203
Query: 206 HCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGV-AELFFGGETTGLKNLPVVF 264
HCLS GGG LFFGDDLYDSSRV WT MS +Y+K+YSP + EL FGG TTGLKNL VF
Sbjct: 204 HCLSSLGGGILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLTVF 263
Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
DSGSSYTY N YQ +T ++K+ELS K LKEA +D TLPLCW+GRRPF ++ +VKK F+
Sbjct: 264 DSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFK 323
Query: 325 TLALSFTDG-KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
LALSF G +++TLFE+ PEAYLIIS KGNVCLGILNG E+GLQ+LN+IG I
Sbjct: 324 PLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDI 376
>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
Length = 424
Score = 522 bits (1344), Expect = e-145, Method: Compositional matrix adjust.
Identities = 246/354 (69%), Positives = 292/354 (82%), Gaps = 3/354 (0%)
Query: 26 SSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR 85
++ F SS++F VHGNVYP GYYNVT+ IGQP RPY+LDLDTGSDLTWLQCDAPCV
Sbjct: 32 AADRFTRAASSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVH 91
Query: 86 CVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDA 145
C+EAPHPLY+PSNDL+PC DP+C +LH G+H CE P QCDYE+EYADGGSSLGVLV+D
Sbjct: 92 CLEAPHPLYQPSNDLIPCNDPLCKALHFNGNHRCETPEQCDYEVEYADGGSSLGVLVRDV 151
Query: 146 FAFNYTNGQRLNPRLALGCGYNQVPGAS-YHPLDGILGLGKGKSSIVSQLHSQKLIRNVV 204
F+ NYT G RL PRLALGCGY+Q+PGAS +HPLDG+LGLG+GK SI+SQLHSQ ++NVV
Sbjct: 152 FSLNYTKGLRLTPRLALGCGYDQIPGASGHHPLDGVLGLGRGKVSILSQLHSQGYVKNVV 211
Query: 205 GHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGV-AELFFGGETTGLKNLPVV 263
GHCLS GGG LFFG+DLYDSSRV WT M+ + +K+YSP + EL FGG TTGLKNL V
Sbjct: 212 GHCLSSLGGGILFFGNDLYDSSRVSWTPMARENSKHYSPAMGGELLFGGRTTGLKNLLTV 271
Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
FDSGSSYTY N YQ +T ++K+ELS K LKEA +D TLPLCW+GRRPF ++ +VKK F
Sbjct: 272 FDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYF 331
Query: 324 RTLALSFTDG-KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
+ LALSF G +++TLFE+ PEAYLIIS KGNVCLGILNG E+GLQ+LN+IG I
Sbjct: 332 KPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDI 385
>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
Length = 426
Score = 516 bits (1328), Expect = e-144, Method: Compositional matrix adjust.
Identities = 248/354 (70%), Positives = 293/354 (82%), Gaps = 2/354 (0%)
Query: 24 SSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC 83
++SSSL N + SS++F ++GNVYP GYY V++ IGQP +PYFLD DTGSDL+WLQCDAPC
Sbjct: 40 AASSSLINIIQSSVVFPLYGNVYPLGYYYVSLSIGQPPKPYFLDPDTGSDLSWLQCDAPC 99
Query: 84 VRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVK 143
VRC +APHPLYRP+N+LV C+DP+CASLH PG+ CE P QCDYE+EYADGGSSLGVLVK
Sbjct: 100 VRCTKAPHPLYRPNNNLVICKDPMCASLHPPGY-KCEHPEQCDYEVEYADGGSSLGVLVK 158
Query: 144 DAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNV 203
D F N+TNG RL PRLALGCGY+Q+PG SYHPLDG+LGLGKGKSSIVSQLHSQ +IRNV
Sbjct: 159 DVFPLNFTNGLRLAPRLALGCGYDQIPGQSYHPLDGVLGLGKGKSSIVSQLHSQGVIRNV 218
Query: 204 VGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVV 263
VGHC+S GGGFLFFGDDLYDSSRVVWT M D +YS G AEL GG+TT KNL V
Sbjct: 219 VGHCVSSRGGGFLFFGDDLYDSSRVVWTPMLRDQHTHYSSGYAELILGGKTTVFKNLLVT 278
Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
FDSGSSYTYLN + YQ L +++KELS K ++EA +D+TLPLCW+G+RPFK+V DVKK F
Sbjct: 279 FDSGSSYTYLNSLAYQALVHLVRKELSEKPVREALDDQTLPLCWRGKRPFKSVRDVKKFF 338
Query: 324 RTLALSFT-DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
+ LALSF G+T+T +++ E+YLIIS KGNVCLGILNG E GLQD N+IG I
Sbjct: 339 KPLALSFPGGGRTKTQYDIPLESYLIISLKGNVCLGILNGTEAGLQDFNLIGDI 392
>gi|4490316|emb|CAB38807.1| nucellin-like protein [Arabidopsis thaliana]
gi|7270297|emb|CAB80066.1| nucellin-like protein [Arabidopsis thaliana]
Length = 420
Score = 510 bits (1314), Expect = e-142, Method: Compositional matrix adjust.
Identities = 245/362 (67%), Positives = 289/362 (79%), Gaps = 20/362 (5%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
SS++F VHGNVYP GYYNVT+ IGQP RPY+LDLDTGSDLTWLQCDAPCVRC+EAPHPLY
Sbjct: 22 SSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHPLY 81
Query: 95 RPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
+PS+DL+PC DP+C +LH + CE P QCDYE+EYADGGSSLGVLV+D F+ NYT G
Sbjct: 82 QPSSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGL 141
Query: 155 RLNPRLALGCGYNQVPGA-SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
RL PRLALGCGY+Q+PGA S+HPLDG+LGLG+GK SI+SQLHSQ ++NV+GHCLS GG
Sbjct: 142 RLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCLSSLGG 201
Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGV-AELFFGGETTGLKNLPVVFDSGSSYTY 272
G LFFGDDLYDSSRV WT MS +Y+K+YSP + EL FGG TTGLKNL VFDSGSSYTY
Sbjct: 202 GILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGSSYTY 261
Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
N YQ +T ++K+ELS K LKEA +D TLPLCW+GRRPF ++ +VKK F+ LALSF
Sbjct: 262 FNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKT 321
Query: 333 G-KTRTLFELTPEAYLIIS-----------------NKGNVCLGILNGAEVGLQDLNVIG 374
G +++TLFE+ PEAYLIIS KGNVCLGILNG E+GLQ+LN+IG
Sbjct: 322 GWRSKTLFEIPPEAYLIISVWFSHTMLKGRFIKMLQMKGNVCLGILNGTEIGLQNLNLIG 381
Query: 375 GI 376
I
Sbjct: 382 DI 383
>gi|255563835|ref|XP_002522918.1| nucellin, putative [Ricinus communis]
gi|223537845|gb|EEF39461.1| nucellin, putative [Ricinus communis]
Length = 433
Score = 508 bits (1308), Expect = e-141, Method: Compositional matrix adjust.
Identities = 253/364 (69%), Positives = 295/364 (81%), Gaps = 3/364 (0%)
Query: 16 RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
R + S +SS + N GSSL+F +HGNVYP GYYNVT+ IGQPA+PYFLD+DTGSDLT
Sbjct: 36 RKAVLSGEITSSMMINRAGSSLVFPLHGNVYPAGYYNVTLSIGQPAKPYFLDVDTGSDLT 95
Query: 76 WLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGG 135
WLQCDAPC +C+EAPHPLYRPSN+LV CEDP+CASL PG HNC+DP QCDYE+EYADGG
Sbjct: 96 WLQCDAPCRQCIEAPHPLYRPSNNLVICEDPLCASLQPPGVHNCQDPDQCDYEVEYADGG 155
Query: 136 SSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLH 195
SSLGVLVKD F N+TNG+RLNP LALGCGY+Q+PG S HPLDGILGLG+G SSI SQL
Sbjct: 156 SSLGVLVKDVFVLNFTNGKRLNPLLALGCGYDQLPGRSNHPLDGILGLGRGISSIPSQLS 215
Query: 196 SQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETT 255
SQ L+ NV+GHCLSG GGGFLFFG+D+YDSS V WT MS D+ K+YSPG AEL F G++T
Sbjct: 216 SQGLVSNVIGHCLSGRGGGFLFFGEDIYDSSGVTWTPMSRDHLKHYSPGFAELIFDGKST 275
Query: 256 GLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
G++NL VVFDSGSSYTYLN YQ L +K+ELS K + EA +D+TLPLCWKG+RPFK+
Sbjct: 276 GIRNLLVVFDSGSSYTYLNAQAYQHLVFSLKRELSRKPISEALDDQTLPLCWKGKRPFKS 335
Query: 316 VHDVKKCFRTLALSFTDGKTR---TLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNV 372
+ DVKK F+ AL F R T FE +PEAYLIIS+KGN CLGILNG EVGL+DLNV
Sbjct: 336 IRDVKKYFKPFALVFKTSSGRSSKTQFEFSPEAYLIISSKGNACLGILNGTEVGLRDLNV 395
Query: 373 IGGI 376
IG +
Sbjct: 396 IGDV 399
>gi|147802609|emb|CAN73001.1| hypothetical protein VITISV_037997 [Vitis vinifera]
Length = 424
Score = 497 bits (1280), Expect = e-138, Method: Compositional matrix adjust.
Identities = 242/354 (68%), Positives = 287/354 (81%), Gaps = 4/354 (1%)
Query: 24 SSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC 83
++SSSL N + SS++F ++GNVYP GYY V++ IGQP PYFLD TGSDL+WLQCDAPC
Sbjct: 40 AASSSLINIIQSSVVFPLYGNVYPLGYYYVSLSIGQPPXPYFLDPXTGSDLSWLQCDAPC 99
Query: 84 VRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVK 143
VRC +A H LYRP+N+LV C+DP+CA LH PG+ CE P QCDYE+EYADGGSSLGVLVK
Sbjct: 100 VRCTKAXHXLYRPNNNLVICKDPMCAXLHPPGY-KCEHPEQCDYEVEYADGGSSLGVLVK 158
Query: 144 DAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNV 203
D F N+TNG RL PRLALGCGY+Q+PG SYHPLDG+LGLGKGKSSIVSQLHSQ +IRNV
Sbjct: 159 DVFPLNFTNGLRLAPRLALGCGYDQIPGXSYHPLDGVLGLGKGKSSIVSQLHSQGVIRNV 218
Query: 204 VGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVV 263
VGHC+S GGGFLFFGDDLYDSSRVVWT M D +YS G AEL GG+TT KNL V
Sbjct: 219 VGHCVSSHGGGFLFFGDDLYDSSRVVWTPMLRDQHTHYSSGYAELILGGKTTVFKNLLVT 278
Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
FDSGSSYTYLN + YQ L +++KELS K ++EA +D+TLPLCW+G+RPFK+V DV+K F
Sbjct: 279 FDSGSSYTYLNSLAYQALVHLVRKELSEKPVREALDDQTLPLCWRGKRPFKSVRDVRKFF 338
Query: 324 RTLALSFT-DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
+ LALSF G+T+T +++ E+YLIIS GNVCLGILNG E GLQD N+IG I
Sbjct: 339 KPLALSFAGGGRTKTQYDIPLESYLIIS--GNVCLGILNGTEAGLQDFNLIGDI 390
>gi|118486628|gb|ABK95151.1| unknown [Populus trichocarpa]
Length = 393
Score = 495 bits (1274), Expect = e-137, Method: Compositional matrix adjust.
Identities = 251/358 (70%), Positives = 292/358 (81%), Gaps = 2/358 (0%)
Query: 20 SSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC 79
S + +SS L N V SS++ +HGNVYP GYYNVT+ IGQP++PYFLD+DTGSDLTWLQC
Sbjct: 3 SGETMASSMLINRVPSSIVLPLHGNVYPNGYYNVTLNIGQPSKPYFLDVDTGSDLTWLQC 62
Query: 80 DAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLG 139
DAPCV+C EAPHP YRP N+LVPC DPIC SLH+ G H CE+P QCDYE+EYADGGSS G
Sbjct: 63 DAPCVQCTEAPHPYYRPRNNLVPCMDPICQSLHSNGDHRCENPGQCDYEVEYADGGSSFG 122
Query: 140 VLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
VLV D F N+T+ +R +P LALGCGY+Q PG S+HP+DG+LGLGKGKSSIVSQL S L
Sbjct: 123 VLVTDTFNLNFTSEKRHSPLLALGCGYDQFPGGSHHPIDGVLGLGKGKSSIVSQLSSLGL 182
Query: 200 IRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN 259
+RNV+GHCLSG GGGFLFFGDDLYDSSRV WT MS D K+YSPG+AEL F G+TTG KN
Sbjct: 183 VRNVIGHCLSGHGGGFLFFGDDLYDSSRVAWTPMSPD-AKHYSPGLAELTFDGKTTGFKN 241
Query: 260 LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
L FDSG+SYTYLN YQ L S++KKELS K L+EA +D+TLPLCWKGR+PFK++ DV
Sbjct: 242 LLTTFDSGASYTYLNSQAYQGLISLLKKELSGKPLREALDDQTLPLCWKGRKPFKSIRDV 301
Query: 320 KKCFRTLALSFT-DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
KK F+T ALSFT + K++T E PEAYLIIS+KGN CLGILNG EVGL DLNVIG I
Sbjct: 302 KKYFKTFALSFTNERKSKTELEFPPEAYLIISSKGNACLGILNGTEVGLNDLNVIGDI 359
>gi|56692305|dbj|BAD80835.1| nucellin-like protein [Daucus carota]
Length = 426
Score = 489 bits (1259), Expect = e-136, Method: Compositional matrix adjust.
Identities = 228/337 (67%), Positives = 278/337 (82%), Gaps = 2/337 (0%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL 100
++GNVYP+GYY+V IGQP +PYFLD DTGSDLTWLQCDAPC++C APHPLY+P+NDL
Sbjct: 57 LYGNVYPSGYYHVQFNIGQPPKPYFLDPDTGSDLTWLQCDAPCIQCTPAPHPLYQPTNDL 116
Query: 101 VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
V C+DPICASLH P ++ C+DP QCDYE+EYADGGSS+GVLV D F N T+G R PRL
Sbjct: 117 VVCKDPICASLH-PDNYRCDDPDQCDYEVEYADGGSSIGVLVNDLFPVNLTSGMRARPRL 175
Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGD 220
+GCGY+Q+PG +YHPLDG+LGLG+G SSIV+QL SQ L+RNVVGHC S GGG+LFFGD
Sbjct: 176 TIGCGYDQLPGIAYHPLDGVLGLGRGSSSIVAQLSSQGLVRNVVGHCFSRRGGGYLFFGD 235
Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQT 280
D+YDSS+V+WT MS DY K+Y+PG AEL G ++GLKNL VVFDSGSSYTY N TYQT
Sbjct: 236 DIYDSSKVIWTPMSRDYLKHYTPGFAELILNGRSSGLKNLLVVFDSGSSYTYFNTQTYQT 295
Query: 281 LTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG-KTRTLF 339
L S +KK+L K LKEA ED+TLP+CW+G++PFK++ D KK F+ LALSF G KT++ F
Sbjct: 296 LLSFIKKDLHGKPLKEAVEDDTLPVCWRGKKPFKSIRDAKKYFKPLALSFGSGWKTKSQF 355
Query: 340 ELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
E+ E+YLIIS+KG+VCLGILNG EVGLQ+ N+IG I
Sbjct: 356 EIQQESYLIISSKGSVCLGILNGTEVGLQNYNIIGDI 392
>gi|224083514|ref|XP_002307058.1| predicted protein [Populus trichocarpa]
gi|222856507|gb|EEE94054.1| predicted protein [Populus trichocarpa]
Length = 376
Score = 483 bits (1242), Expect = e-134, Method: Compositional matrix adjust.
Identities = 246/347 (70%), Positives = 286/347 (82%), Gaps = 3/347 (0%)
Query: 32 HVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH 91
V SS++ +HGNVYP GYYNVT+ IGQP++PYFLD+DTGSDLTWLQCDAPCV+C EAPH
Sbjct: 1 RVPSSIVLPLHGNVYPNGYYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPH 60
Query: 92 PLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
P YRP N+LVPC DPIC SLH+ G H CE+P QCDYE+EYADGGSS GVLV+D F N+T
Sbjct: 61 PYYRPRNNLVPCMDPICQSLHSNGDHRCENPGQCDYEVEYADGGSSFGVLVRDTFNLNFT 120
Query: 152 NGQRLNPRLALG-CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
+ +R +P LALG CGY+Q PG S+HP+DG+LGLGKGKSSIVSQL S L+RNV+GHCLSG
Sbjct: 121 SEKRHSPLLALGLCGYDQFPGGSHHPIDGVLGLGKGKSSIVSQLSSLGLVRNVIGHCLSG 180
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
GGGFLFFGDDLYDSSRV WT MS D K+YSPG+AEL F G+TTG KNL FDSG+SY
Sbjct: 181 HGGGFLFFGDDLYDSSRVAWTPMSPD-AKHYSPGLAELTFDGKTTGFKNLLTTFDSGASY 239
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
TYLN YQ L S++KKELS K L+EA +D+TLPLCWKGR+PFK++ DVKK F+T ALSF
Sbjct: 240 TYLNSQAYQGLISLLKKELSGKPLREALDDQTLPLCWKGRKPFKSIRDVKKYFKTFALSF 299
Query: 331 T-DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
T + K++T E PEAYLIIS+KGN CLGILNG EVGL DLNVIG I
Sbjct: 300 TNERKSKTELEFPPEAYLIISSKGNACLGILNGTEVGLNDLNVIGDI 346
>gi|449459186|ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 418
Score = 482 bits (1240), Expect = e-133, Method: Compositional matrix adjust.
Identities = 221/342 (64%), Positives = 276/342 (80%), Gaps = 2/342 (0%)
Query: 37 LLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP 96
++ + GNVYP G+YNVT+Y+GQP +PYFLD DTGSDLTWLQCDAPC +C E HPLY+P
Sbjct: 43 IVLPLQGNVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQP 102
Query: 97 SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
SNDLVPC+DP+C SLH+ H CE+P QCDYE+EYADGGSSLGVLV+D F N TNG +
Sbjct: 103 SNDLVPCKDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPI 162
Query: 157 NPRLALGCGYNQVPGAS-YHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
PRLALGCGY+Q PG+S YHP+DGILGLG+G SIVSQLH+Q ++RNVVGHC + GGG+
Sbjct: 163 RPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGY 222
Query: 216 LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNR 275
LFFGD +YD R+VWT MS DY K+YSPG EL F G +TGL+NL VVFDSGSSYTY N
Sbjct: 223 LFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNA 282
Query: 276 VTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD-GK 334
YQ LTS++ +EL+ K L+EA +D+TLPLCW+GR+P K++ DV+K F+ LALSF+ G+
Sbjct: 283 QAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPIKSLRDVRKYFKPLALSFSSGGR 342
Query: 335 TRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
++ +FE+ E Y+IIS+ GNVCLGILNG +VGL++ N+IG I
Sbjct: 343 SKAVFEIPTEGYMIISSMGNVCLGILNGTDVGLENSNIIGDI 384
>gi|224096119|ref|XP_002310541.1| predicted protein [Populus trichocarpa]
gi|222853444|gb|EEE90991.1| predicted protein [Populus trichocarpa]
Length = 379
Score = 473 bits (1217), Expect = e-131, Method: Compositional matrix adjust.
Identities = 239/347 (68%), Positives = 284/347 (81%), Gaps = 3/347 (0%)
Query: 32 HVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH 91
V SS++ +HGNVYPTG+YNVT+ IGQP++PYFLD+DTGSDLTWLQCD P +C EAPH
Sbjct: 1 RVPSSIVLPLHGNVYPTGFYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDVPRAQCTEAPH 60
Query: 92 PLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
P Y+PSN+LV C+DPIC SLH G CE+P QCDYE+EYADGGSSLGVLVKDAF N+T
Sbjct: 61 PYYKPSNNLVACKDPICQSLHTGGDQRCENPGQCDYEVEYADGGSSLGVLVKDAFNLNFT 120
Query: 152 NGQRLNPRLALG-CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
+ +R +P LALG CGY+Q+PG +YHP+DG+LGLG+GK SIVSQL L+RNV+GHCLSG
Sbjct: 121 SEKRQSPLLALGLCGYDQLPGGTYHPIDGVLGLGRGKPSIVSQLSGLGLVRNVIGHCLSG 180
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
GGGFLFFGDDLYDSSRV WT MS + K+YSPG AEL F G+TTG KNL V FDSG+SY
Sbjct: 181 RGGGFLFFGDDLYDSSRVAWTPMSPN-AKHYSPGFAELTFDGKTTGFKNLIVAFDSGASY 239
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
TYLN YQ L S++K+ELS K L+EA +D+TLP+CWKGR+PFK+V DVKK F+T ALSF
Sbjct: 240 TYLNSQVYQGLISLIKRELSTKPLREALDDQTLPICWKGRKPFKSVRDVKKYFKTFALSF 299
Query: 331 T-DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
DGK++T E PEAYLI+S+KGN CLG+LNG EVGL DLNVIG I
Sbjct: 300 ANDGKSKTQLEFPPEAYLIVSSKGNACLGVLNGTEVGLNDLNVIGDI 346
>gi|356507437|ref|XP_003522473.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 440
Score = 470 bits (1209), Expect = e-130, Method: Compositional matrix adjust.
Identities = 221/347 (63%), Positives = 277/347 (79%), Gaps = 4/347 (1%)
Query: 33 VGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP 92
GSS++F VHGNVYP G+YNVT+ IGQP RPYFLD+DTGSDLTWLQCDAPC RC + PHP
Sbjct: 61 AGSSVVFPVHGNVYPVGFYNVTLNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHP 120
Query: 93 LYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN 152
LYRPSNDLVPC +CASLH +++CE P QCDYE++YAD SSLGVL+ D + N+TN
Sbjct: 121 LYRPSNDLVPCRHALCASLHLSDNYDCEVPHQCDYEVQYADHYSSLGVLLHDVYTLNFTN 180
Query: 153 GQRLNPRLALGCGYNQV-PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
G +L R+ALGCGY+Q+ P S+HPLDG+LGLG+GK+S+ SQL+SQ L+RNV+GHCLS
Sbjct: 181 GVQLKVRMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLSAQ 240
Query: 212 GGGFLFFGDDLYDSSRVVWTSMSS-DYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
GGG++FFG D+YDS R+ WT MSS DY Y G AEL FGG+ +G+ NL VFD+GSSY
Sbjct: 241 GGGYIFFG-DVYDSFRLTWTPMSSRDYKHYSVAGAAELLFGGKKSGVGNLHAVFDTGSSY 299
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
TY N YQ L S +KKE K LKEA +D+TLPLCW+GRRPF+++++V+K F+ + LSF
Sbjct: 300 TYFNSYAYQVLISWLKKESGGKPLKEAHDDQTLPLCWRGRRPFRSIYEVRKYFKPIVLSF 359
Query: 331 T-DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
T +G+++ FE+ PEAYLI+SN GNVCLGILNG+EVG+ DLN+IG I
Sbjct: 360 TSNGRSKAQFEMLPEAYLIVSNMGNVCLGILNGSEVGMGDLNLIGDI 406
>gi|356518800|ref|XP_003528065.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 438
Score = 467 bits (1201), Expect = e-129, Method: Compositional matrix adjust.
Identities = 220/347 (63%), Positives = 277/347 (79%), Gaps = 4/347 (1%)
Query: 33 VGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP 92
GSS++F VHGNVYP G+YNVT+ IGQP RPYFLD+DTGSDLTWLQCDAPC RC + PHP
Sbjct: 59 AGSSVVFPVHGNVYPVGFYNVTLNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHP 118
Query: 93 LYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN 152
LYRPSND VPC +CASLH +++CE P QCDYE++YAD SSLGVL+ D + N+TN
Sbjct: 119 LYRPSNDFVPCRHSLCASLHHSDNYDCEVPHQCDYEVQYADHYSSLGVLLHDVYTLNFTN 178
Query: 153 GQRLNPRLALGCGYNQV-PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
G +L R+ALGCGY+Q+ P S+HPLDG+LGLG+GK+S+ SQL+SQ L+RNV+GHCLS
Sbjct: 179 GVQLKVRMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLSAQ 238
Query: 212 GGGFLFFGDDLYDSSRVVWTSMSS-DYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
GGG++FFG D+YDSSR+ WT MSS DY Y + G AEL FGG+ +G+ +L VFD+GSSY
Sbjct: 239 GGGYIFFG-DVYDSSRLTWTPMSSRDYKHYSAAGAAELLFGGKKSGIGSLHAVFDTGSSY 297
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
TY N YQ L S + KE K LKEA +D+TLPLCW+GRRPF+++++V+K F+ + LSF
Sbjct: 298 TYFNPYAYQALISWLGKESGGKPLKEAHDDQTLPLCWRGRRPFRSIYEVRKYFKPIVLSF 357
Query: 331 T-DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
T +G+++ FE+ PEAYLIISN GNVCLGILNG+EVG+ DLN+IG I
Sbjct: 358 TSNGRSKAQFEMPPEAYLIISNMGNVCLGILNGSEVGMGDLNLIGDI 404
>gi|357520119|ref|XP_003630348.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355524370|gb|AET04824.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 435
Score = 466 bits (1200), Expect = e-129, Method: Compositional matrix adjust.
Identities = 220/357 (61%), Positives = 285/357 (79%), Gaps = 5/357 (1%)
Query: 24 SSSSSLFNHV-GSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP 82
SS SL NH GSS++F ++GNVYP G+YNVT+ IGQP RPYFLD+DTGS+LTWLQCDAP
Sbjct: 46 SSRPSLMNHAAGSSIVFPIYGNVYPVGFYNVTLNIGQPPRPYFLDVDTGSELTWLQCDAP 105
Query: 83 CVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLV 142
C +C E PHPLY+PSND +PC+DP+CASL + CEDP QCDYE++YAD S+LGVL+
Sbjct: 106 CSQCSETPHPLYKPSNDFIPCKDPLCASLQPTDDYTCEDPNQCDYEIKYADQYSTLGVLL 165
Query: 143 KDAFAFNYTNGQRLNPRLALGCGYNQV-PGASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
D + N+TNG +L R+ALGCGY+Q+ ++YHPLDGILGLG+GK+S++SQL+SQ L+R
Sbjct: 166 NDVYLLNFTNGVQLKVRMALGCGYDQIFSPSTYHPLDGILGLGRGKASLISQLNSQGLVR 225
Query: 202 NVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSS-DYTKYYSPGVAELFFGGETTGLKNL 260
NV+GHCLS GGG++FFG ++YDSSR+ WT +SS D K+YS G AEL FGG TG+ +L
Sbjct: 226 NVMGHCLSSRGGGYIFFG-NVYDSSRMSWTPISSIDSGKHYSAGPAELVFGGRKTGVGSL 284
Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
++FD+GSSYTY N YQ + S++ KEL K +K AP+D+TLP+CW G+RPF+++++VK
Sbjct: 285 NIIFDTGSSYTYFNSQAYQAMISLLNKELHRKPIKAAPDDQTLPMCWHGKRPFRSINEVK 344
Query: 321 KCFRTLALSFTD-GKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
K F+ L LSFT+ G+ + FE+ PEAYLIISN GNVCLGILNG EVGL +LN+IG I
Sbjct: 345 KYFKPLTLSFTNGGRVKPQFEIPPEAYLIISNMGNVCLGILNGPEVGLGELNLIGDI 401
>gi|357464807|ref|XP_003602685.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355491733|gb|AES72936.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 440
Score = 464 bits (1193), Expect = e-128, Method: Compositional matrix adjust.
Identities = 222/345 (64%), Positives = 274/345 (79%), Gaps = 8/345 (2%)
Query: 34 GSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPL 93
GSS++F VHGNVYP G+YNVT+ IG P RPYFLD+DTGSDLTWLQCDAPC RC + PHPL
Sbjct: 68 GSSVVFPVHGNVYPVGFYNVTINIGYPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHPL 127
Query: 94 YRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
YRPSNDLVPC P+CAS+H ++ CE QCDYE+EYAD SSLGVLV D + N+TNG
Sbjct: 128 YRPSNDLVPCRHPLCASVHQTDNYECEVEHQCDYEVEYADHYSSLGVLVNDVYVLNFTNG 187
Query: 154 QRLNPRLALGCGYNQV-PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
+L R+ALGCGY+Q+ P +SYHP+DG+LGLG+GKSS++SQL+ Q L+RNVVGHCLS G
Sbjct: 188 VQLKVRMALGCGYDQIFPDSSYHPVDGMLGLGRGKSSLISQLNGQGLVRNVVGHCLSAQG 247
Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTY 272
GG++FFG D+YDSSR+ WT MSS K+YS G AEL GG+ TG NL VFD+GSSYTY
Sbjct: 248 GGYIFFG-DVYDSSRLAWTPMSSRDYKHYSAGAAELVLGGKRTGFGNLLAVFDAGSSYTY 306
Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
N YQ + KEL+ K +KEAPED+TLPLCW G+RPF++V++VKK F+ +ALSF
Sbjct: 307 FNSNAYQ-----LTKELAGKPIKEAPEDQTLPLCWYGKRPFRSVYEVKKYFKPIALSFPG 361
Query: 333 G-KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
+++ FE+ PEAYLIISN GNVCLGIL+G+EVG++DLN+IG I
Sbjct: 362 SRRSKAQFEIPPEAYLIISNMGNVCLGILDGSEVGVEDLNLIGDI 406
>gi|449508697|ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Cucumis sativus]
Length = 418
Score = 460 bits (1184), Expect = e-127, Method: Compositional matrix adjust.
Identities = 220/342 (64%), Positives = 275/342 (80%), Gaps = 2/342 (0%)
Query: 37 LLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP 96
++ + GNVYP G+YNVT+Y+GQP +PYFLD DTGSDLTWLQCDAPC +C E HPLY+P
Sbjct: 43 IVLPLQGNVYPNGFYNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHPLYQP 102
Query: 97 SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
SNDLVPC+DP+C SLH+ H CE+P QCDYE+EYADGGSSLGVLV+D F N TNG +
Sbjct: 103 SNDLVPCKDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGDPI 162
Query: 157 NPRLALGCGYNQVPGAS-YHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
PRLALGCGY+Q PG+S YHP+DGILGLG+G SIVSQLH+Q ++RNVVGHC + GGG+
Sbjct: 163 RPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGY 222
Query: 216 LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNR 275
FFGD +YD R+VWT MS DY K+YSPG EL F G +TGL+NL VVFDSGSSYTY N
Sbjct: 223 XFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSSYTYFNA 282
Query: 276 VTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD-GK 334
YQ LTS++ +EL+ K L+EA +D+TLPLCW+GR+P K++ DV+K F+ LALSF+ G+
Sbjct: 283 QAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRKPIKSLRDVRKYFKPLALSFSSGGR 342
Query: 335 TRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
++ +FE+ E Y+IIS+ GNVCLGILNG +VGL++ N+IG I
Sbjct: 343 SKAVFEIPTEGYMIISSMGNVCLGILNGTDVGLENSNIIGDI 384
>gi|356527532|ref|XP_003532363.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 429
Score = 454 bits (1168), Expect = e-125, Method: Compositional matrix adjust.
Identities = 227/359 (63%), Positives = 283/359 (78%), Gaps = 3/359 (0%)
Query: 20 SSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC 79
S ++SS S L N GSS++ ++GNVYP G+YNVT+ IGQPARPYFLD+DTGSDLTWLQC
Sbjct: 38 SEATSSRSRLLNPAGSSIVLPLYGNVYPVGFYNVTLNIGQPARPYFLDVDTGSDLTWLQC 97
Query: 80 DAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLG 139
DAPC C E PHPLYRPSND VPC DP+CASL +NCE P QCDYE+ YAD S+ G
Sbjct: 98 DAPCTHCSETPHPLYRPSNDFVPCRDPLCASLQPTEDYNCEHPDQCDYEINYADQYSTFG 157
Query: 140 VLVKDAFAFNYTNGQRLNPRLALGCGYNQV-PGASYHPLDGILGLGKGKSSIVSQLHSQK 198
VL+ D + N+TNG +L R+ALGCGY+QV +SYHPLDG+LGLG+GK+S++SQL+SQ
Sbjct: 158 VLLNDVYLLNFTNGVQLKVRMALGCGYDQVFSPSSYHPLDGLLGLGRGKASLISQLNSQG 217
Query: 199 LIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK 258
L+RNV+GHCLS GGG++FFG + YDS+RV WT +SS +K+YS G AEL FGG TG+
Sbjct: 218 LVRNVIGHCLSAQGGGYIFFG-NAYDSARVTWTPISSVDSKHYSAGPAELVFGGRKTGVG 276
Query: 259 NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
+L VFD+GSSYTY N YQ L S +KKELS K LK AP+D+TLPLCW G+RPF ++ +
Sbjct: 277 SLTAVFDTGSSYTYFNSHAYQALLSWLKKELSGKPLKVAPDDQTLPLCWHGKRPFTSLRE 336
Query: 319 VKKCFRTLALSFTD-GKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
V+K F+ +AL FT+ G+T+ FE+ PEAYLIISN GNVCLGILNG+EVGL++LN+IG I
Sbjct: 337 VRKYFKPVALGFTNGGRTKAQFEILPEAYLIISNLGNVCLGILNGSEVGLEELNLIGDI 395
>gi|356511197|ref|XP_003524315.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 431
Score = 440 bits (1132), Expect = e-121, Method: Compositional matrix adjust.
Identities = 222/355 (62%), Positives = 277/355 (78%), Gaps = 3/355 (0%)
Query: 24 SSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC 83
SS SL N GSS++F ++GNVYP G+YNVT+ IGQPARPYFLD+DTGSDLTWLQCDAPC
Sbjct: 44 SSWPSLLNPAGSSIVFPLYGNVYPVGFYNVTLNIGQPARPYFLDVDTGSDLTWLQCDAPC 103
Query: 84 VRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVK 143
C E PHPL+RPSND VPC DP+CASL +NCE P QCDYE+ YAD S+ GVL+
Sbjct: 104 THCSETPHPLHRPSNDFVPCRDPLCASLQPTEDYNCEHPDQCDYEINYADQYSTYGVLLN 163
Query: 144 DAFAFNYTNGQRLNPRLALGCGYNQV-PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRN 202
D + N +NG +L R+ALGCGY+QV +SYHPLDG+LGLG+GK+S++SQL+SQ L+RN
Sbjct: 164 DVYLLNSSNGVQLKVRMALGCGYDQVFSPSSYHPLDGLLGLGRGKASLISQLNSQGLVRN 223
Query: 203 VVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV 262
V+GHCLS GGG++FFG + YDS+RV WT +SS +K+YS G AEL FGG TG+ +L
Sbjct: 224 VIGHCLSSQGGGYIFFG-NAYDSARVTWTPISSVDSKHYSAGPAELVFGGRKTGVGSLTA 282
Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
VFD+GSSYTY N YQ L S + KELS K LK AP+D+TL LCW G+RPF ++ +V+K
Sbjct: 283 VFDTGSSYTYFNSHAYQALLSWLNKELSGKPLKVAPDDQTLSLCWHGKRPFTSLREVRKY 342
Query: 323 FRTLALSFTD-GKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
F+ +ALSFT+ G+ + FE+ PEAYLIISN GNVCLGILNG EVGL++LN++G I
Sbjct: 343 FKPVALSFTNGGRVKAQFEIPPEAYLIISNLGNVCLGILNGFEVGLEELNLVGDI 397
>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 421
Score = 400 bits (1028), Expect = e-109, Method: Compositional matrix adjust.
Identities = 202/349 (57%), Positives = 251/349 (71%), Gaps = 10/349 (2%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
SS +F ++G+VYP G Y V M IG P RPYFLD+DTGSDLTWLQCDAPCV C + PHPLY
Sbjct: 42 SSAVFPLYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLY 101
Query: 95 RPS-NDLVPCEDPICASLHA--PGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNY 150
RP+ N LVPC D +CA+LH G H C+ P QCDYE++YAD GSSLGVLV D+FA
Sbjct: 102 RPTKNKLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRL 161
Query: 151 TNGQRLNPRLALGCGYNQVPGASYH--PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
N + P LA GCGY+Q G+S DG+LGLG G S++SQL + +NVVGHCL
Sbjct: 162 ANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221
Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK-YYSPGVAELFFGGETTGLKNLPVVFDSG 267
S GGGFLFFGDD+ SR W M+ ++ YYSPG A L+FGG G++ + VVFDSG
Sbjct: 222 STRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
SS+TY + YQ L +K +LS K+LKE P D +LPLCWKG++PFK+V DVKK FRT+
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLS-KNLKEVP-DHSLPLCWKGKKPFKSVLDVKKEFRTVV 339
Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
LSF++GK + L E+ PE YLI++ GN CLGILNG+EVGL+DLN++G I
Sbjct: 340 LSFSNGK-KALMEIPPENYLIVTKYGNACLGILNGSEVGLKDLNIVGDI 387
>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 451
Score = 399 bits (1026), Expect = e-108, Method: Compositional matrix adjust.
Identities = 202/349 (57%), Positives = 251/349 (71%), Gaps = 10/349 (2%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
SS +F ++G+VYP G Y V M IG P RPYFLD+DTGSDLTWLQCDAPCV C + PHPLY
Sbjct: 42 SSAVFPLYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLY 101
Query: 95 RPS-NDLVPCEDPICASLHA--PGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNY 150
RP+ N LVPC D +CA+LH G H C+ P QCDYE++YAD GSSLGVLV D+FA
Sbjct: 102 RPTKNKLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRL 161
Query: 151 TNGQRLNPRLALGCGYNQVPGASYH--PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
N + P LA GCGY+Q G+S DG+LGLG G S++SQL + +NVVGHCL
Sbjct: 162 ANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221
Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK-YYSPGVAELFFGGETTGLKNLPVVFDSG 267
S GGGFLFFGDD+ SR W M+ ++ YYSPG A L+FGG G++ + VVFDSG
Sbjct: 222 STRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
SS+TY + YQ L +K +LS K+LKE P D +LPLCWKG++PFK+V DVKK FRT+
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLS-KNLKEVP-DHSLPLCWKGKKPFKSVLDVKKEFRTVV 339
Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
LSF++GK + L E+ PE YLI++ GN CLGILNG+EVGL+DLN++G I
Sbjct: 340 LSFSNGK-KALMEIPPENYLIVTKYGNACLGILNGSEVGLKDLNIVGDI 387
>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
Length = 421
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 201/349 (57%), Positives = 251/349 (71%), Gaps = 10/349 (2%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
SS +F ++G+VYP G Y V M IG P RPYFLD+DTGSDLTWLQCDAPCV C + PHPLY
Sbjct: 42 SSAVFPLYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLY 101
Query: 95 RPS-NDLVPCEDPICASLHA--PGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNY 150
RP+ N LVPC D +CA+LH G H C+ P QCDYE++YAD GSSLGVLV D+FA
Sbjct: 102 RPTKNKLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRL 161
Query: 151 TNGQRLNPRLALGCGYNQVPGASYH--PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
N + P LA GCGY+Q G+S DG+LGLG G S++SQL + +NVVGHCL
Sbjct: 162 ANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221
Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK-YYSPGVAELFFGGETTGLKNLPVVFDSG 267
S GGGFLFFGDD+ SR W M+ ++ YYSPG A L+FGG G++ + VVFDSG
Sbjct: 222 STRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
SS+TY + YQ L +K +LS K+LKE P D +LPLCWKG++PFK+V DVKK F+T+
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLS-KNLKEVP-DHSLPLCWKGKKPFKSVLDVKKEFKTVV 339
Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
LSF++GK + L E+ PE YLI++ GN CLGILNG+EVGL+DLN++G I
Sbjct: 340 LSFSNGK-KALMEIPPENYLIVTKYGNACLGILNGSEVGLKDLNIVGDI 387
>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 421
Score = 396 bits (1017), Expect = e-107, Method: Compositional matrix adjust.
Identities = 199/349 (57%), Positives = 251/349 (71%), Gaps = 10/349 (2%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
SS +FQ++G+VYP G Y V M IG P RPYFLD+DTGSDLTWLQCDAPCV C + PHPLY
Sbjct: 42 SSAVFQLYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCNKVPHPLY 101
Query: 95 RPS-NDLVPCEDPICASLHA--PGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNY 150
RP+ N +VPC D +C+SLH G H C+ P QCDYE++YAD GSSLGVL+ D+FA
Sbjct: 102 RPTKNKIVPCVDQLCSSLHGGLSGKHKCDSPKQQCDYEIKYADQGSSLGVLLTDSFAVRL 161
Query: 151 TNGQRLNPRLALGCGYNQVPGASYH--PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
N + P LA GCGY+Q G+S P DG+LGLG G S++SQL + +NVVGHCL
Sbjct: 162 ANSSIVRPSLAFGCGYDQQVGSSTEVAPTDGVLGLGSGSISLLSQLKQHGITKNVVGHCL 221
Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSG 267
S GGGFLFFGD+L SR W M S + YYSPG A L+FGG + G++ + VV DSG
Sbjct: 222 SIRGGGFLFFGDNLVPYSRATWVPMVRSAFKNYYSPGTASLYFGGRSLGVRPMEVVLDSG 281
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
SS+TY YQ L + +K +LS K+LKE D +LPLCWKG++PFK+V DVKK F++L
Sbjct: 282 SSFTYFGAQPYQALVTALKSDLS-KTLKEV-FDPSLPLCWKGKKPFKSVLDVKKEFKSLV 339
Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
LSF++GK + L E+ PE YLI++ GN CLGILNG+E+GL+DLN++G I
Sbjct: 340 LSFSNGK-KALMEIPPENYLIVTKFGNACLGILNGSEIGLKDLNIVGDI 387
>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
Length = 395
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 194/338 (57%), Positives = 241/338 (71%), Gaps = 10/338 (2%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
SS +F ++G+VYP G Y V M IG P RPYFLD+DTGSDLTWLQCDAPCV C + PHPLY
Sbjct: 42 SSAVFPLYGDVYPHGLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLY 101
Query: 95 RPS-NDLVPCEDPICASLHA--PGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNY 150
RP+ N LVPC D +CA+LH G H C+ P QCDYE++YAD GSSLGVLV D+FA
Sbjct: 102 RPTKNKLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRL 161
Query: 151 TNGQRLNPRLALGCGYNQVPGASYH--PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
N + P LA GCGY+Q G+S DG+LGLG G S++SQL + +NVVGHCL
Sbjct: 162 ANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221
Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK-YYSPGVAELFFGGETTGLKNLPVVFDSG 267
S GGGFLFFGDD+ SR W M+ ++ YYSPG A L+FGG G++ + VVFDSG
Sbjct: 222 STRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
SS+TY + YQ L +K +LS K+LKE P D +LPLCWKG++PFK+V DVKK FRT+
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLS-KNLKEVP-DHSLPLCWKGKKPFKSVLDVKKEFRTVV 339
Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEV 365
LSF++GK + L E+ PE YLI++ GN CLGILNG+E+
Sbjct: 340 LSFSNGK-KALMEIPPENYLIVTKYGNACLGILNGSEL 376
>gi|356554625|ref|XP_003545645.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 452
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 195/356 (54%), Positives = 249/356 (69%), Gaps = 7/356 (1%)
Query: 26 SSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR 85
SS + + SS +F+V GNVYP G+Y V++ IG P + Y LD+D+GSDLTW+QCDAPC
Sbjct: 39 SSDNHHRLSSSAVFKVQGNVYPLGHYTVSLNIGYPPKLYDLDIDSGSDLTWVQCDAPCKG 98
Query: 86 CVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKD 144
C + LY+P+++LV C D +C+ + + C P QCDYE+EYAD GSSLGVLV+D
Sbjct: 99 CTKPRDQLYKPNHNLVQCVDQLCSEVQLSMEYTCASPDDQCDYEVEYADHGSSLGVLVRD 158
Query: 145 AFAFNYTNGQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRN 202
F +TNG + PR+A GCGY+Q S P G+LGLG G++SI+SQLHS LI N
Sbjct: 159 YIPFQFTNGSVVRPRVAFGCGYDQKYSGSNSPPATSGVLGLGNGRASILSQLHSLGLIHN 218
Query: 203 VVGHCLSGGGGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFGGETTGLKNLP 261
VVGHCLS GGGFLFFGDD SS +VWTSM S K+YS G AEL F G+ T +K L
Sbjct: 219 VVGHCLSARGGGFLFFGDDFIPSSGIVWTSMLPSSSEKHYSSGPAELVFNGKATVVKGLE 278
Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
++FDSGSSYTY N YQ + ++ ++L K LK A +D +LP+CWKG + FK++ DVKK
Sbjct: 279 LIFDSGSSYTYFNSQAYQAVVDLVTQDLKGKQLKRATDDPSLPICWKGAKSFKSLSDVKK 338
Query: 322 CFRTLALSFTDGKTRTL-FELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
F+ LALSFT KT+ L L PEAYLII+ GNVCLGIL+G EVGL++LN+IG I
Sbjct: 339 YFKPLALSFT--KTKILQMHLPPEAYLIITKHGNVCLGILDGTEVGLENLNIIGDI 392
>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 432
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 195/350 (55%), Positives = 244/350 (69%), Gaps = 11/350 (3%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
SS +F ++G+VYP G Y V M IG P +PYFLD+D+GSDLTWLQCDAPC C E PHPLY
Sbjct: 48 SSAVFPLYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLY 107
Query: 95 RPS-NDLVPCEDPICASLHAP---GHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFN 149
RP+ + LVPC +CASLH G H CE P QCDY ++YAD GSS GVLV D+FA
Sbjct: 108 RPTKSKLVPCVHRLCASLHNALTGGKHRCESPHEQCDYVIKYADQGSSTGVLVNDSFALR 167
Query: 150 YTNGQRLNPRLALGCGYNQV--PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
TNG P +A GCGY+Q G P DG+LGLG G S++SQL + + +NVVGHC
Sbjct: 168 LTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHC 227
Query: 208 LSGGGGGFLFFGDDLYDSSRVVWTSMS-SDYTKYYSPGVAELFFGGETTGLKNLPVVFDS 266
LS GGGFLFFGDDL R WT M+ S + YYSPG A L+FG + G++ VVFDS
Sbjct: 228 LSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFDS 287
Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
GSS+TY YQ L + +K LS ++L+E P D +LPLCWKG+ PFK+V DV+K F++L
Sbjct: 288 GSSFTYFAAKPYQALVTALKDGLS-RTLEEEP-DTSLPLCWKGQEPFKSVLDVRKEFKSL 345
Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
L+F GK +TL E+ PE YLI++ GN CLGILNG+E+GL+DL++IG I
Sbjct: 346 VLNFASGK-KTLMEIPPENYLIVTENGNACLGILNGSEIGLKDLSIIGDI 394
>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
Length = 433
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 193/349 (55%), Positives = 244/349 (69%), Gaps = 10/349 (2%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
SS +F ++G+VYP G Y V M IG P +PYFLD+D+GSDLTWLQCDAPC C E PHPLY
Sbjct: 50 SSAVFPLYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLY 109
Query: 95 RPS-NDLVPCEDPICASLH--APGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNY 150
RP+ + LVPC +CASLH G H C+ P QCDY ++YAD GSS GVL+ D+FA
Sbjct: 110 RPTKSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRL 169
Query: 151 TNGQRLNPRLALGCGYNQV--PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
TNG P +A GCGY+Q G P DG+LGLG G S++SQL + + +NVVGHCL
Sbjct: 170 TNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCL 229
Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMS-SDYTKYYSPGVAELFFGGETTGLKNLPVVFDSG 267
S GGGFLFFGDDL R WT M+ S + YYSPG A L+FG + G++ VVFDSG
Sbjct: 230 SLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSG 289
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
SS+TY YQ L + +K LS ++L+E P D +LPLCWKG+ PFK+V DV+K F++L
Sbjct: 290 SSFTYFAAKPYQALVTALKDGLS-RTLEEEP-DTSLPLCWKGQEPFKSVLDVRKEFKSLV 347
Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
L+F GK +TL E+ PE YLI++ GN CLGILNG+E+GL+DL++IG I
Sbjct: 348 LNFASGK-KTLMEIPPENYLIVTENGNACLGILNGSEIGLKDLSIIGDI 395
>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
gi|194692946|gb|ACF80557.1| unknown [Zea mays]
Length = 424
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 191/346 (55%), Positives = 242/346 (69%), Gaps = 10/346 (2%)
Query: 38 LFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS 97
+F ++G+VYP G Y V M IG P +PYFLD+D+GSDLTWLQCDAPC C E PHPLYRP+
Sbjct: 44 VFPLYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSCNEVPHPLYRPT 103
Query: 98 -NDLVPCEDPICASLH--APGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
+ LVPC +CASLH G H C+ P QCDY ++YAD GSS GVL+ D+FA TNG
Sbjct: 104 KSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTNG 163
Query: 154 QRLNPRLALGCGYNQV--PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
P +A GCGY+Q G P DG+LGLG G S++SQL + + +NVVGHCLS
Sbjct: 164 SVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLSLR 223
Query: 212 GGGFLFFGDDLYDSSRVVWTSMS-SDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
GGGFLFFGDDL R WT M+ S + YYSPG A L+FG + G++ VVFDSGSS+
Sbjct: 224 GGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSGSSF 283
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
TY YQ L + +K LS ++L+E P D +LPLCWKG+ PFK+V DV+K F++L L+F
Sbjct: 284 TYFAAKPYQALVTALKDGLS-RTLEEEP-DTSLPLCWKGQEPFKSVLDVRKEFKSLVLNF 341
Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
GK +TL E+ PE YLI++ GN CLGILNG+E+GL+DL++IG I
Sbjct: 342 ASGK-KTLMEIPPENYLIVTENGNACLGILNGSEIGLKDLSIIGDI 386
>gi|15219354|ref|NP_175079.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12320825|gb|AAG50556.1|AC074228_11 nucellin, putative [Arabidopsis thaliana]
gi|332193902|gb|AEE32023.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 405
Score = 374 bits (960), Expect = e-101, Method: Compositional matrix adjust.
Identities = 182/359 (50%), Positives = 244/359 (67%), Gaps = 4/359 (1%)
Query: 21 SSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD 80
S SS + SS++F + GNV+P GYY+V M IG P + + D+DTGSDLTW+QCD
Sbjct: 19 SKSSIFKTFIKSSPSSVVFPLSGNVFPLGYYSVLMQIGSPPKAFQFDIDTGSDLTWVQCD 78
Query: 81 APCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLG 139
APC C P+ Y+P +++PC +PIC +LH P +C +P QCDYE++YAD GSS+G
Sbjct: 79 APCSGCTLPPNLQYKPKGNIIPCSNPICTALHWPNKPHCPNPQEQCDYEVKYADQGSSMG 138
Query: 140 VLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQ 197
LV D F NG + P +A GCGY+Q +++ P G+LGLG+GK +++QL S
Sbjct: 139 ALVTDQFPLKLVNGSFMQPPVAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLVSA 198
Query: 198 KLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL 257
L RNVVGHCLS GGGFLFFGD+L S V WT + S +Y+ G A+L F G+ TGL
Sbjct: 199 GLTRNVVGHCLSSKGGGFLFFGDNLVPSIGVAWTPLLSQ-DNHYTTGPADLLFNGKPTGL 257
Query: 258 KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
K L ++FD+GSSYTY N YQT+ +++ +L LK A ED+TLP+CWKG +PFK+V
Sbjct: 258 KGLKLIFDTGSSYTYFNSKAYQTIINLIGNDLKVSPLKVAKEDKTLPICWKGAKPFKSVL 317
Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
+VK F+T+ ++FT+G+ T L PE YLI+S GNVCLG+LNG+EVGLQ+ NVIG I
Sbjct: 318 EVKNFFKTITINFTNGRRNTQLYLAPELYLIVSKTGNVCLGLLNGSEVGLQNSNVIGDI 376
>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
Length = 429
Score = 374 bits (960), Expect = e-101, Method: Compositional matrix adjust.
Identities = 195/360 (54%), Positives = 245/360 (68%), Gaps = 9/360 (2%)
Query: 23 SSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP 82
+SSS + SS +F ++G+VYP G Y V M IG P +PYFLD+DTGSDLTWLQCDAP
Sbjct: 38 ASSSVAGVETEASSAVFPLYGDVYPHGLYYVAMNIGNPPKPYFLDVDTGSDLTWLQCDAP 97
Query: 83 CVRCVEAPHPLYRPS-NDLVPCEDPICASLH--APGHHNCEDP-AQCDYELEYADGGSSL 138
C C + PHPLYRP+ N LVPC D +CASLH H C+ P QCDY ++YAD GSS
Sbjct: 98 CRSCNKVPHPLYRPTKNKLVPCVDQLCASLHNGLNRKHKCDSPYEQCDYVIKYADQGSST 157
Query: 139 GVLVKDAFAFNYTNGQRLNPRLALGCGYN-QVPGASYHPLDGILGLGKGKSSIVSQLHSQ 197
GVLV D+FA NG + P LA GCGY+ QV P DG+LGLG G S++SQ
Sbjct: 158 GVLVNDSFALRLANGSVVRPSLAFGCGYDQQVSSGEMSPTDGVLGLGTGSVSLLSQFKQH 217
Query: 198 KLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFGGETTG 256
+ +NVVGHCLS GGGFLFFGDDL RV WT M S YYSPG A L+FG ++
Sbjct: 218 GVTKNVVGHCLSLRGGGFLFFGDDLVPYQRVTWTPMVRSPLRNYYSPGSASLYFGDQSLR 277
Query: 257 LKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNV 316
+K VVFDSGSS+TY YQ L + +K +LS ++LKE D +LPLCWKG++PFK+V
Sbjct: 278 VKLTEVVFDSGSSFTYFAAQPYQALVTALKGDLS-RTLKEV-SDPSLPLCWKGKKPFKSV 335
Query: 317 HDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
DVKK F++L L+F +G + E+ P+ YLI++ GN CLGILNG+EVGL+DL+++G I
Sbjct: 336 LDVKKEFKSLVLNFGNG-NKAFMEIPPQNYLIVTKYGNACLGILNGSEVGLKDLSILGDI 394
>gi|255558640|ref|XP_002520345.1| nucellin, putative [Ricinus communis]
gi|223540564|gb|EEF42131.1| nucellin, putative [Ricinus communis]
Length = 424
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 184/345 (53%), Positives = 242/345 (70%), Gaps = 5/345 (1%)
Query: 34 GSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPL 93
GSSL+ V GNVYP GYY+V++YIG P + + LD+DTGSDLTW+QCDAPC C + H L
Sbjct: 50 GSSLVLPVFGNVYPLGYYSVSLYIGNPPKLFELDIDTGSDLTWVQCDAPCTGCTKPLHHL 109
Query: 94 YRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTN 152
Y+P N+L+ C DP+C+++ G + C+ QCDYE++YAD GSSLGVLV D F N
Sbjct: 110 YKPRNNLLSCIDPLCSAVQNSGTYQCQSATDQCDYEIQYADEGSSLGVLVTDYFPLRLMN 169
Query: 153 GQRLNPRLALGCGYNQ-VPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
G L P++ GCGY+Q PG + P G+LGLG GK+SI+SQL + ++ NV+GHCLS
Sbjct: 170 GSFLRPKMTFGCGYDQKSPGPVAPPPTTGVLGLGNGKTSIISQLQALGVMGNVIGHCLSR 229
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSD-YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSS 269
GGGFLFFG D S + W MS KYY+ G AEL +GG+ TG K +FDSGSS
Sbjct: 230 KGGGFLFFGQDPVPSFGISWAPMSQKSLDKYYASGPAELLYGGKPTGTKAEEFIFDSGSS 289
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
YTY N YQ+ ++++KELS K L++APE++ L +CWKG + FK+V++VK F+ ALS
Sbjct: 290 YTYFNAQVYQSTLNLIRKELSGKPLRDAPEEKALAICWKGTKRFKSVNEVKSYFKPFALS 349
Query: 330 FTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
FT K+ L ++ PE YLI++N GNVCLGILNG+EVGL + NVIG
Sbjct: 350 FTKAKSVQL-QIPPEDYLIVTNDGNVCLGILNGSEVGLGNFNVIG 393
>gi|297841447|ref|XP_002888605.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
gi|297334446|gb|EFH64864.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
Length = 410
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 179/345 (51%), Positives = 237/345 (68%), Gaps = 4/345 (1%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
SS++ + GNV+P GYY+V + IG P + + D+DTGSD+TW+QCDAPC C P Y
Sbjct: 38 SSVVLLLSGNVFPLGYYSVLLQIGNPPKAFEFDIDTGSDITWVQCDAPCTGCNLPPKLQY 97
Query: 95 RPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
+P + VPC DPIC +LH P + C +P QCDYE+ YAD GSS+G LV D F F NG
Sbjct: 98 KPKGNTVPCSDPICLALHFPNNPQCPNPKEQCDYEVNYADQGSSMGALVIDQFPFKLLNG 157
Query: 154 QRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
+ PRLA GCGY+Q +++ P G+LGLG+GK +++QL S L RNVVGHCLS
Sbjct: 158 SAMQPRLAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVVGHCLSSK 217
Query: 212 GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYT 271
GGG+LFFGD L S V WT + +Y+ G AEL F G+ TGLK L ++FD+GSSYT
Sbjct: 218 GGGYLFFGDTLIPSLGVAWTPLLPP-DNHYTTGPAELLFNGKPTGLKGLKLIFDTGSSYT 276
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
Y N TYQT+ +++ +L LK A ED+TLP+CWKG +PFK+V +VK F+T+ ++FT
Sbjct: 277 YFNSKTYQTIVNLIGNDLKVSPLKVAKEDKTLPICWKGAKPFKSVLEVKNFFKTITINFT 336
Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
+ + T ++ PE+YLIIS GN CLG+LNG+EVGLQ+ NVIG I
Sbjct: 337 NARRNTQLQIPPESYLIISKTGNACLGLLNGSEVGLQNSNVIGDI 381
>gi|449529533|ref|XP_004171754.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 370 bits (949), Expect = e-100, Method: Compositional matrix adjust.
Identities = 188/347 (54%), Positives = 242/347 (69%), Gaps = 8/347 (2%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
SS++F + GNVYP GYY+V++ IG+ + D+D+GSDLTW+QCDAPC C + LY
Sbjct: 39 SSVVFPLKGNVYPLGYYSVSINIGKGDEAFEFDIDSGSDLTWVQCDAPCTHCTKPREQLY 98
Query: 95 RPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
+P+N+ + C +P+C SLH +H+C+ QC YE+EYAD GSSLGVLV D TNG
Sbjct: 99 KPNNNALNCFEPLCTSLHPITNHHCKSADDQCQYEIEYADHGSSLGVLVNDHVPLKLTNG 158
Query: 154 QRLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
PR+A GCGY+ VP +S P G+LGLG G+ S +SQL S ++RNVVGHCLS
Sbjct: 159 SLAAPRIAFGCGYDHKYSVPDSS-PPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCLSD 217
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYT-KYYSPGVAELFFGGETTGLKNLPVVFDSGSS 269
GG FLFFGD+ SS V WTSMS + YYS G AE++FGG+ TG+K+L +VFDSGSS
Sbjct: 218 EGG-FLFFGDEFVPSSGVTWTSMSHESIGSYYSSGPAEVYFGGKATGIKDLTLVFDSGSS 276
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
YTY N Y ++ +++K L K L++APED++LP+CWKG RPFK++ DVKK F LAL
Sbjct: 277 YTYFNSQAYNSILALVKNNLRGKPLEDAPEDKSLPVCWKGTRPFKSLRDVKKYFNLLALR 336
Query: 330 FTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
FT K + +L PE YLII+ GNVC GILNG EVGL DLN+IG I
Sbjct: 337 FTKTKNAQI-QLPPENYLIITKYGNVCFGILNGTEVGLGDLNIIGDI 382
>gi|356515904|ref|XP_003526637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 369 bits (947), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 185/345 (53%), Positives = 238/345 (68%), Gaps = 11/345 (3%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
FQ+ GNVYP GYY V++ IG P + Y LD+DTGSDLTW+QCDAPC C + LY+P+
Sbjct: 52 FQIKGNVYPLGYYTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCQGCTIPRNRLYKPNG 111
Query: 99 DLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
+LV C DP+C ++ + +H+C P QCDYE+EYAD GSSLGVL++D +TNG
Sbjct: 112 NLVKCGDPLCKAIQSAPNHHCAGPNEQCDYEVEYADQGSSLGVLLRDNIPLKFTNGSLAR 171
Query: 158 PRLALGCGYNQV-----PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
P LA GCGY+Q P AS G+LGLG GK+SI+SQLHS LIRNVVGHCLS G
Sbjct: 172 PILAFGCGYDQKHVGHNPSASTA---GVLGLGNGKTSILSQLHSLGLIRNVVGHCLSERG 228
Query: 213 GGFLFFGDDLYDSSRVVWTS-MSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYT 271
GGFLFFGD L S VVWT + S T++Y G A+LFF + T +K L ++FDSGSSYT
Sbjct: 229 GGFLFFGDQLVPQSGVVWTPLLQSSSTQHYKTGPADLFFDRKPTSVKGLQLIFDSGSSYT 288
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
Y N ++ L +++ +L K L A ED +LP+CW+G +PFK++HDV F+ L LSFT
Sbjct: 289 YFNSKAHKALVNLVTNDLRGKPLSRATEDSSLPICWRGPKPFKSLHDVTSNFKPLLLSFT 348
Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
K +L +L PEAYLI++ GNVCLGIL+G E+GL + N+IG I
Sbjct: 349 KSKN-SLLQLPPEAYLIVTKHGNVCLGILDGTEIGLGNTNIIGDI 392
>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
Length = 410
Score = 369 bits (947), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 187/370 (50%), Positives = 251/370 (67%), Gaps = 7/370 (1%)
Query: 11 CFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDT 70
CF + SS+ + + VGSS+ F+V GNVYPTGYY+V + IG P + + D+DT
Sbjct: 15 CFSAASQTPIKGESSTPA-NDRVGSSVFFRVTGNVYPTGYYSVILNIGNPPKAFDFDIDT 73
Query: 71 GSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDP-AQCDYEL 129
GSDLTW+QCDAPC C + LY+P N+LVPC + +C ++ +++C+ P QCDYE+
Sbjct: 74 GSDLTWVQCDAPCKGCTKPRDKLYKPKNNLVPCSNSLCQAVSTGENYHCDAPDDQCDYEI 133
Query: 130 EYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLD--GILGLGKGK 187
EYAD GSS+GVL+ D+F +NG L P++A GCGY+Q + P D GILGLG+GK
Sbjct: 134 EYADLGSSIGVLLSDSFPLRLSNGTLLQPKMAFGCGYDQKHLGPHPPPDTAGILGLGRGK 193
Query: 188 SSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVA 246
SI+SQL + + +NVVGHC S GGFLFFGD L+ SSR+ WT M S YS G A
Sbjct: 194 VSILSQLRTLGITQNVVGHCFSRARGGFLFFGDHLFPSSRITWTPMLRSSSDTLYSSGPA 253
Query: 247 ELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLC 306
EL FGG+ TG+K L ++FDSGSSYTY N YQ++ ++++K+L+ K LK+APE E L +C
Sbjct: 254 ELLFGGKPTGIKGLQLIFDSGSSYTYFNAQVYQSILNLVRKDLAGKPLKDAPEKE-LAVC 312
Query: 307 WKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVG 366
WK +P K++ D+K F+ L +SF + K L +L PE YLII+ GNVCLGILNG+E
Sbjct: 313 WKTAKPIKSILDIKSYFKPLTISFMNAKNVQL-QLAPEDYLIITKDGNVCLGILNGSEQQ 371
Query: 367 LQDLNVIGGI 376
L + NVIG I
Sbjct: 372 LGNFNVIGDI 381
>gi|356500374|ref|XP_003519007.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Glycine max]
Length = 454
Score = 368 bits (945), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 192/365 (52%), Positives = 251/365 (68%), Gaps = 5/365 (1%)
Query: 16 RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
R + + S + + + SS +F++ GNVYP G+Y V++ IG P + Y LD+D+GSDLT
Sbjct: 29 RNAKKPKTPYSDNNHHRLSSSAVFKLQGNVYPLGHYTVSLNIGYPPKLYDLDIDSGSDLT 88
Query: 76 WLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADG 134
W+QCDAPC C + LY+P+++LV C D +C+ +H +NC P CDYE+EYAD
Sbjct: 89 WVQCDAPCKGCTKPRDQLYKPNHNLVQCVDQLCSEVHLSMAYNCPSPDDPCDYEVEYADH 148
Query: 135 GSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVS 192
GSSLGVLV+D F +TNG + PR+A GCGY+Q S P G+LGLG G++SI+S
Sbjct: 149 GSSLGVLVRDYIPFQFTNGSVVRPRVAFGCGYDQKYSGSNSPPATSGVLGLGNGRASILS 208
Query: 193 QLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFG 251
QLHS LIRNVVGHCLS GGGFLFFGDD SS +VWTSM SS K+YS G AEL F
Sbjct: 209 QLHSLGLIRNVVGHCLSAQGGGFLFFGDDFIPSSGIVWTSMLSSSSEKHYSSGPAELVFN 268
Query: 252 GETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR 311
G+ T +K L ++FDSGSSYTY N YQ + ++ K+L K LK A +D +LP+CWKG +
Sbjct: 269 GKATAVKGLELIFDSGSSYTYFNSQAYQAVVDLVTKDLKGKQLKRATDDPSLPICWKGAK 328
Query: 312 PFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLN 371
F+++ DVKK F+ LALSF + L PE+YLII+ GNVCLGIL+G EVGL++LN
Sbjct: 329 SFESLSDVKKYFKPLALSFKKSXNLQM-HLPPESYLIITKHGNVCLGILDGTEVGLENLN 387
Query: 372 VIGGI 376
+IG I
Sbjct: 388 IIGDI 392
>gi|449464178|ref|XP_004149806.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 368 bits (944), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 187/347 (53%), Positives = 241/347 (69%), Gaps = 8/347 (2%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
SS++F + GNVYP GYY+V++ IG+ + D+D+GSDLTW+QCDAPC C + LY
Sbjct: 39 SSVVFPLKGNVYPLGYYSVSINIGKGDEAFEFDIDSGSDLTWVQCDAPCTHCTKPREQLY 98
Query: 95 RPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
+P+N+ + C +P+C SLH +H+C+ QC YE+EYAD GSSLGVLV D TNG
Sbjct: 99 KPNNNALNCFEPLCTSLHPITNHHCKSADDQCQYEIEYADHGSSLGVLVNDHVPLKLTNG 158
Query: 154 QRLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
PR+A GCGY+ VP +S P G+LGLG G+ S +SQL S ++RNVVGHCLS
Sbjct: 159 SLAAPRIAFGCGYDHKYSVPDSS-PPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCLSD 217
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYT-KYYSPGVAELFFGGETTGLKNLPVVFDSGSS 269
GG FLFFGD+ SS V WTSMS + YYS G AE++F G+ TG+K+L +VFDSGSS
Sbjct: 218 EGG-FLFFGDEFVPSSGVTWTSMSHESIGSYYSSGPAEVYFSGKATGIKDLTLVFDSGSS 276
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
YTY N Y ++ +++K L K L++APED++LP+CWKG RPFK++ DVKK F LAL
Sbjct: 277 YTYFNSQAYNSILALVKNNLRGKPLEDAPEDKSLPVCWKGTRPFKSLRDVKKYFNPLALR 336
Query: 330 FTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
FT K + +L PE YLII+ GNVC GILNG EVGL DLN+IG I
Sbjct: 337 FTKTKNAQI-QLPPENYLIITKYGNVCFGILNGTEVGLGDLNIIGDI 382
>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
Length = 422
Score = 362 bits (930), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 186/370 (50%), Positives = 246/370 (66%), Gaps = 9/370 (2%)
Query: 11 CFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDT 70
CF + S++ + + VGSS+ F+V GNVYPTG+Y+V + IG P + + LD+DT
Sbjct: 29 CFSAASQTPIKGKSTTPA-NDRVGSSVFFRVTGNVYPTGHYSVILNIGNPPKAFDLDIDT 87
Query: 71 GSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYEL 129
GSDLTW+QCDAPC C + LY+P N+ VPC +C ++ ++NC+ P QCDYE+
Sbjct: 88 GSDLTWVQCDAPCKGCTKPLDKLYKPKNNRVPCASSLCQAIQ---NNNCDIPTEQCDYEV 144
Query: 130 EYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLD--GILGLGKGK 187
EYAD GSSLGVL+ D F NG L PR+A GCGY+Q + P D GILGLG+GK
Sbjct: 145 EYADLGSSLGVLLSDYFPLRLNNGSLLQPRIAFGCGYDQKYLGPHSPPDTAGILGLGRGK 204
Query: 188 SSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVA 246
+SI+SQL + + +NVVGHC S GGFLFFGD L S + WT M S YS G A
Sbjct: 205 ASILSQLRTLGITQNVVGHCFSRVTGGFLFFGDHLLPPSGITWTPMLRSSSDTLYSSGPA 264
Query: 247 ELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLC 306
EL FGG+ TG+K L ++FDSGSSYTY N YQ++ ++++K+LS LK+APE++ L +C
Sbjct: 265 ELLFGGKPTGIKGLQLIFDSGSSYTYFNAQVYQSILNLVRKDLSGMPLKDAPEEKALAVC 324
Query: 307 WKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVG 366
WK +P K++ D+K F+ L ++F K L +L PE YLII+ GNVCLGILNG E G
Sbjct: 325 WKTAKPIKSILDIKSFFKPLTINFIKAKNVQL-QLAPEDYLIITKDGNVCLGILNGGEQG 383
Query: 367 LQDLNVIGGI 376
L +LNVIG I
Sbjct: 384 LGNLNVIGDI 393
>gi|359492489|ref|XP_002285867.2| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 453
Score = 362 bits (930), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 197/359 (54%), Positives = 256/359 (71%), Gaps = 7/359 (1%)
Query: 23 SSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP 82
S+S+ + N +G +++F + GNVYP G+Y+V++ IG P +PY LD+D+GSDLTWLQCDAP
Sbjct: 40 SASNQPISNRMGHTVVFPLQGNVYPQGFYSVSLRIGNPPKPYTLDIDSGSDLTWLQCDAP 99
Query: 83 CVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVL 141
CV C +APHP Y+P+ + C DP+C++LH P C+ QCDYE+ YAD GSSLGVL
Sbjct: 100 CVSCTKAPHPPYKPNKGPITCNDPMCSALHWPSKPPCKASHEQCDYEVSYADHGSSLGVL 159
Query: 142 VKDAFAFNYTNGQRLNPRLALGCGYNQ-VPGASYHP-LDGILGLGKGKSSIVSQLHSQKL 199
V D F+ TNG PRLA GCGY+Q PG + P +DG+LGLG GKSSIV+QL S L
Sbjct: 160 VHDIFSLQLTNGTLAAPRLAFGCGYDQSYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGL 219
Query: 200 IRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK-YYSPGVAELFFGGETTGLK 258
IR++VGHCLSG GGGFLF GD L + ++WT MS + Y+ G A+L F G+ +G+K
Sbjct: 220 IRSIVGHCLSGRGGGFLFLGDGLSTTPGIIWTPMSRKSGESAYALGPADLLFNGQNSGVK 279
Query: 259 NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
L +VFDSGSSYTY N Y+T S+++K L+ K LKE DE+LP+CW+G +PFK++ +
Sbjct: 280 GLRLVFDSGSSYTYFNAQAYKTTLSLVRKYLNGK-LKET-ADESLPVCWRGAKPFKSIFE 337
Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIG 377
VK F+ ALSFT K+ L +L PE+YLIIS GN CLGILNG+EVGL D NVIG I
Sbjct: 338 VKNYFKPFALSFTKAKSAQL-QLPPESYLIISKHGNACLGILNGSEVGLGDSNVIGDIA 395
>gi|302141796|emb|CBI18999.3| unnamed protein product [Vitis vinifera]
Length = 390
Score = 361 bits (926), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 197/359 (54%), Positives = 256/359 (71%), Gaps = 7/359 (1%)
Query: 23 SSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP 82
S+S+ + N +G +++F + GNVYP G+Y+V++ IG P +PY LD+D+GSDLTWLQCDAP
Sbjct: 7 SASNQPISNRMGHTVVFPLQGNVYPQGFYSVSLRIGNPPKPYTLDIDSGSDLTWLQCDAP 66
Query: 83 CVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVL 141
CV C +APHP Y+P+ + C DP+C++LH P C+ QCDYE+ YAD GSSLGVL
Sbjct: 67 CVSCTKAPHPPYKPNKGPITCNDPMCSALHWPSKPPCKASHEQCDYEVSYADHGSSLGVL 126
Query: 142 VKDAFAFNYTNGQRLNPRLALGCGYNQ-VPGASYHP-LDGILGLGKGKSSIVSQLHSQKL 199
V D F+ TNG PRLA GCGY+Q PG + P +DG+LGLG GKSSIV+QL S L
Sbjct: 127 VHDIFSLQLTNGTLAAPRLAFGCGYDQSYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGL 186
Query: 200 IRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK-YYSPGVAELFFGGETTGLK 258
IR++VGHCLSG GGGFLF GD L + ++WT MS + Y+ G A+L F G+ +G+K
Sbjct: 187 IRSIVGHCLSGRGGGFLFLGDGLSTTPGIIWTPMSRKSGESAYALGPADLLFNGQNSGVK 246
Query: 259 NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
L +VFDSGSSYTY N Y+T S+++K L+ K LKE DE+LP+CW+G +PFK++ +
Sbjct: 247 GLRLVFDSGSSYTYFNAQAYKTTLSLVRKYLNGK-LKET-ADESLPVCWRGAKPFKSIFE 304
Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIG 377
VK F+ ALSFT K+ L +L PE+YLIIS GN CLGILNG+EVGL D NVIG I
Sbjct: 305 VKNYFKPFALSFTKAKSAQL-QLPPESYLIISKHGNACLGILNGSEVGLGDSNVIGDIA 362
>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 415
Score = 360 bits (923), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 186/349 (53%), Positives = 238/349 (68%), Gaps = 12/349 (3%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
S+ +FQ+ G+VYPTG+Y VTM IG PA+PYFLD+DTGSDLTWLQCDAPC C + PHPLY
Sbjct: 37 STAVFQLQGDVYPTGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLY 96
Query: 95 RPS-NDLVPCEDPICASLHA-PGHHN-CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
RP+ N LVPC + +C +LH+ G +N C P QCDY+++Y D SS GVL+ D+F+
Sbjct: 97 RPTANRLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMR 156
Query: 152 NGQRLNPRLALGCGYNQV---PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
+ + P L GCGY+Q GA +DG+LGLG+G S+VSQL Q + +NVVGHCL
Sbjct: 157 S-SNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCL 215
Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYT-KYYSPGVAELFFGGETTGLKNLPVVFDSG 267
S GGGFLFFGDD+ SSRV W M+ + YYSPG L+F + G+K + VVFDSG
Sbjct: 216 STNGGGFLFFGDDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSG 275
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
S+YTY YQ + S +K LS KSLK+ D TLPLCWKG++ FK+V DVK F+++
Sbjct: 276 STYTYFTAQPYQAVVSALKGGLS-KSLKQV-SDPTLPLCWKGQKAFKSVFDVKNEFKSMF 333
Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
LSF+ K + E+ PE YLI++ GNVCLGIL+G L NVIG I
Sbjct: 334 LSFSSAKNAAM-EIPPENYLIVTKNGNVCLGILDGTAAKL-SFNVIGDI 380
>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
gi|219888509|gb|ACL54629.1| unknown [Zea mays]
gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
Length = 415
Score = 359 bits (922), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 186/349 (53%), Positives = 237/349 (67%), Gaps = 12/349 (3%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
S+ +FQ+ G+VYPTG+Y VTM IG PA+PYFLD+DTGSDLTWLQCDAPC C + PHPLY
Sbjct: 37 STAVFQLQGDVYPTGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLY 96
Query: 95 RPS-NDLVPCEDPICASLHA-PGHHN-CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
RP+ N LVPC + +C +LH+ G +N C P QCDY+++Y D SS GVL+ D+F+
Sbjct: 97 RPTANRLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMR 156
Query: 152 NGQRLNPRLALGCGYNQV---PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
+ + P L GCGY+Q GA +DG+LGLG+G S+VSQL Q + +NVVGHCL
Sbjct: 157 S-SNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCL 215
Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYT-KYYSPGVAELFFGGETTGLKNLPVVFDSG 267
S GGGFLFFGDD+ SSRV W M+ + YYSPG L+F + G+K + VVFDSG
Sbjct: 216 STNGGGFLFFGDDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSG 275
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
S+YTY YQ + S +K LS KSLK+ D TLPLCWKG++ FK+V DVK F+++
Sbjct: 276 STYTYFTAQPYQAVVSALKGGLS-KSLKQV-SDPTLPLCWKGQKAFKSVFDVKNEFKSMF 333
Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
LSF K + E+ PE YLI++ GNVCLGIL+G L NVIG I
Sbjct: 334 LSFASAKNAAM-EIPPENYLIVTKNGNVCLGILDGTAAKL-SFNVIGDI 380
>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
Length = 418
Score = 349 bits (895), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 183/346 (52%), Positives = 233/346 (67%), Gaps = 13/346 (3%)
Query: 38 LFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS 97
+F + G+VYPTG+Y VTM IG PA+PYFLD+DTGSDLTWLQCDAPC C + PHPLYRP+
Sbjct: 44 VFLLSGDVYPTGHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPT 103
Query: 98 -NDLVPCEDPICASLHAPGHHN--CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
N LVPC + IC +LH+ N C QCDY+++Y D SSLGVLV D+F+ N
Sbjct: 104 KNKLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVMDSFSLPLRNKS 163
Query: 155 RLNPRLALGCGYNQVP---GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
+ P L+ GCGY+Q GA+ DG+LGLG+G S++SQL Q + +NV+GHCLS
Sbjct: 164 NVRPSLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLSTS 223
Query: 212 GGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
GGGFLFFGDD+ +SRV W SM S YYSPG A L+F + K + VVFDSGS+Y
Sbjct: 224 GGGFLFFGDDMVPTSRVTWVSMVRSTSGNYYSPGSATLYFDRRSLSTKPMEVVFDSGSTY 283
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
TY + YQ S +K LS KSLK+ D +LPLCWKG++ FK+V DVKK F++L F
Sbjct: 284 TYFSAQPYQATISAIKGSLS-KSLKQV-SDPSLPLCWKGQKAFKSVSDVKKDFKSLQFIF 341
Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
GK + ++ PE YLII+ GNVCLGIL+G+ L ++IG I
Sbjct: 342 --GK-NAVMDIPPENYLIITKNGNVCLGILDGSAAKL-SFSIIGDI 383
>gi|356509399|ref|XP_003523437.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 185/342 (54%), Positives = 235/342 (68%), Gaps = 5/342 (1%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
FQ+ GNVYP GYY V++ IG P + Y LD+DTGSDLTW+QCDAPC C + LY+P
Sbjct: 52 FQIKGNVYPLGYYTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCKGCTLPRNRLYKPHG 111
Query: 99 DLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
DLV C DP+CA++ + +H+C P QCDYE+EYAD GSSLGVL++D +TNG
Sbjct: 112 DLVKCVDPLCAAIQSAPNHHCAGPNEQCDYEVEYADQGSSLGVLLRDNIPLKFTNGSLAR 171
Query: 158 PRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
P LA GCGY+Q P G+LGLG G++SI+SQLHS LIRNVVGHCLSG GGGF
Sbjct: 172 PMLAFGCGYDQTHHGQNPPPSTAGVLGLGNGRTSILSQLHSLGLIRNVVGHCLSGRGGGF 231
Query: 216 LFFGDDLYDSSRVVWTS-MSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLN 274
LFFGD L S VVWT + S ++Y G A+LFF +TT +K L ++FDSGSSYTY N
Sbjct: 232 LFFGDQLIPPSGVVWTPLLQSSSAQHYKTGPADLFFDRKTTSVKGLELIFDSGSSYTYFN 291
Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGK 334
++ L +++ +L K L A D +LP+CWKG +PFK++HDV F+ L LSFT K
Sbjct: 292 SQAHKALVNLIANDLRGKPLSRATGDPSLPICWKGPKPFKSLHDVTSNFKPLLLSFTKSK 351
Query: 335 TRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
L +L PEAYLI++ GNVCLGIL+G E+GL + N+IG I
Sbjct: 352 NSPL-QLPPEAYLIVTKHGNVCLGILDGTEIGLGNTNIIGDI 392
>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
Length = 418
Score = 348 bits (893), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 182/346 (52%), Positives = 232/346 (67%), Gaps = 13/346 (3%)
Query: 38 LFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS 97
+F + G+VYPTG+Y VTM IG PA+PYFLD+DTGSDLTWLQCDAPC C + PHPLYRP+
Sbjct: 44 VFLLSGDVYPTGHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPT 103
Query: 98 -NDLVPCEDPICASLHAPGHHN--CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
N LVPC + IC +LH+ N C QCDY+++Y D SSLGVLV D+F+ N
Sbjct: 104 KNKLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTDSFSLPLRNKS 163
Query: 155 RLNPRLALGCGYNQV---PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
+ P L+ GCGY+Q GA+ DG+LGLG+G S++SQL Q + +NV+GHCLS
Sbjct: 164 NVRPSLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLSTS 223
Query: 212 GGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
GGGFLFFGDD+ +SRV W M S YYSPG A L+F + K + VVFDSGS+Y
Sbjct: 224 GGGFLFFGDDMVPTSRVTWVPMVRSTSGNYYSPGSATLYFDRRSLSTKPMEVVFDSGSTY 283
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
TY + YQ S +K LS KSLK+ D +LPLCWKG++ FK+V DVKK F++L F
Sbjct: 284 TYFSAQPYQATISAIKGSLS-KSLKQV-SDPSLPLCWKGQKAFKSVSDVKKDFKSLQFIF 341
Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
GK + E+ PE YLI++ GNVCLGIL+G+ L ++IG I
Sbjct: 342 --GK-NAVMEIPPENYLIVTKNGNVCLGILDGSAAKL-SFSIIGDI 383
>gi|356509401|ref|XP_003523438.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 407
Score = 347 bits (889), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 187/358 (52%), Positives = 244/358 (68%), Gaps = 8/358 (2%)
Query: 25 SSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV 84
S+ S+ +H SS+ FQ+ GNVYP GYY+V + IG P + Y LD+DTGSDLTW+QCDAPC
Sbjct: 23 SAISVLSH-ASSIAFQIKGNVYPLGYYSVNLAIGNPPKAYELDIDTGSDLTWVQCDAPCK 81
Query: 85 RCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVK 143
C Y+P +LV C DP+CA++ + + C +P QCDYE+EYAD GSSLGVLV+
Sbjct: 82 GCTLPRDRQYKPHGNLVKCVDPLCAAIQSAPNPPCVNPNEQCDYEVEYADQGSSLGVLVR 141
Query: 144 DAFAFNYTNGQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIR 201
D TNG + LA GCGY+Q P G+LGLG G++SI+SQL+S+ LIR
Sbjct: 142 DIIPLKLTNGTLTHSMLAFGCGYDQTHVGHNPPPSAAGVLGLGNGRASILSQLNSKGLIR 201
Query: 202 NVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSM---SSDYTKYYSPGVAELFFGGETTGLK 258
NVVGHCLSG GGGFLFFGD L S VVWT + SS K+Y G A++FF G+ T +K
Sbjct: 202 NVVGHCLSGTGGGFLFFGDQLIPQSGVVWTPILQSSSSLLKHYKTGPADMFFNGKATSVK 261
Query: 259 NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
L + FDSGSSYTY N + ++ L ++ ++ K L A ED +LP+CWKG +PFK++HD
Sbjct: 262 GLELTFDSGSSYTYFNSLAHKALVDLITNDIKGKPLSRATEDPSLPICWKGPKPFKSLHD 321
Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
V F+ L LSFT K +LF++ PEAYLI++ GNVCLGIL+G E+GL + N+IG I
Sbjct: 322 VTSNFKPLVLSFTKSK-NSLFQVPPEAYLIVTKHGNVCLGILDGTEIGLGNTNIIGDI 378
>gi|297842525|ref|XP_002889144.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
gi|297334985|gb|EFH65403.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
Length = 467
Score = 344 bits (883), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 176/370 (47%), Positives = 240/370 (64%), Gaps = 5/370 (1%)
Query: 12 FPTVRMSSSSSSSSSSSLFNH-VGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDT 70
F ++ SS+ L N +GSS++F V GNVYP GYY V + IG P + + LD+DT
Sbjct: 28 FQPSDATTKDSSAQQVKLQNRRLGSSVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDIDT 87
Query: 71 GSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYEL 129
GSDLTW+QCDAPC C + Y+P+++ +PC +C+ L + C+DP QCDYE+
Sbjct: 88 GSDLTWVQCDAPCNGCTKPRAKQYKPNHNTLPCSHLLCSGLDLTQNRPCDDPEDQCDYEI 147
Query: 130 EYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGKGK 187
Y+D SS+G LV D F NG +NP L GCGY+Q P GILGLG+GK
Sbjct: 148 GYSDHASSIGALVTDEFPLKLANGSIMNPHLTFGCGYDQQNPGPHPPPPTAGILGLGRGK 207
Query: 188 SSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSD-YTKYYSPGVA 246
I +QL S + +NV+ HCLS G GFL GD+L SS V WTS++++ +K Y G A
Sbjct: 208 VGISTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVPSSGVTWTSLATNSASKNYMTGPA 267
Query: 247 ELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLC 306
EL F +TTG+K + VVFDSGSSYTY N YQ + +++K+L+ K L + +D++LP+C
Sbjct: 268 ELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVC 327
Query: 307 WKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVG 366
WKG++P K++ +VKK F+T+ L F K LF++ PE+YLII+ KGNVCLGILNG EVG
Sbjct: 328 WKGKKPLKSLDEVKKYFKTITLRFGYQKNGQLFQVPPESYLIITEKGNVCLGILNGTEVG 387
Query: 367 LQDLNVIGGI 376
L N++G I
Sbjct: 388 LDSYNIVGDI 397
>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 413
Score = 343 bits (880), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 178/346 (51%), Positives = 226/346 (65%), Gaps = 13/346 (3%)
Query: 38 LFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS 97
+FQ++G+VYPTG+Y VTM IG PA+PYFLD+DTGSDLTWLQCDAPC C + PHPLY+P+
Sbjct: 39 VFQLNGDVYPTGHYYVTMNIGDPAKPYFLDIDTGSDLTWLQCDAPCQSCNKVPHPLYKPT 98
Query: 98 -NDLVPCEDPICASLHAPGHHN--CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
N LVPC IC +LH+ N C P QCDY+++Y D SSLGVLV D F N
Sbjct: 99 KNKLVPCAASICTTLHSAQSPNKKCAVPQQCDYQIKYTDSASSLGVLVTDNFTLPLRNSS 158
Query: 155 RLNPRLALGCGYNQVPGAS---YHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
+ P GCGY+Q G + DG+LGLGKG S+VSQL + +NV+GHCLS
Sbjct: 159 SVRPSFTFGCGYDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGHCLSTN 218
Query: 212 GGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
GGGFLFFGD++ +SR W M S YYSPG L+F + G+K + VVFDSGS+Y
Sbjct: 219 GGGFLFFGDNVVPTSRATWVPMVRSTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGSTY 278
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
TY YQ S +K LS KSL++ D +LPLCWKG++ FK+V DVK F++L LSF
Sbjct: 279 TYFAAQPYQATVSALKAGLS-KSLQQV-SDPSLPLCWKGQKVFKSVSDVKNDFKSLFLSF 336
Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
++ E+ PE YLI++ GN CLGIL+G+ L N+IG I
Sbjct: 337 VK---NSVLEIPPENYLIVTKNGNACLGILDGSAAKLT-FNIIGDI 378
>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 430
Score = 340 bits (872), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 182/372 (48%), Positives = 237/372 (63%), Gaps = 15/372 (4%)
Query: 10 LCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLD 69
L P S + +++ SL + S+ +FQ+ G VYP G+Y VTM IG PA+PYFLD+D
Sbjct: 34 LLLPPFAPSPARAATPGKSLSS--ASTAVFQLQGAVYPIGHYYVTMNIGDPAKPYFLDVD 91
Query: 70 TGSDLTWLQCDAPCVRCVEAPHPLYRPS-NDLVPCEDPICASLHAPGHHNCEDPAQCDYE 128
TGSDLTWLQCDAPC C + PHP Y+P+ N +VPC +C SL + C P QCDY+
Sbjct: 92 TGSDLTWLQCDAPCQSCNKVPHPWYKPTKNKIVPCAASLCTSLTP--NKKCAVPQQCDYQ 149
Query: 129 LEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVP---GASYHPLDGILGLGK 185
++Y D SSLGVL+ D F + N + L GCGY+Q GA DG+LGLGK
Sbjct: 150 IKYTDKASSLGVLIADNFTLSLRNSSTVRANLTFGCGYDQQVGKNGAVQAATDGLLGLGK 209
Query: 186 GKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYT-KYYSPG 244
G S++SQL Q + +NV+GHC S GGGFLFFGDD+ +SRV W M+ + YYSPG
Sbjct: 210 GAVSLLSQLKQQGVTKNVLGHCFSTNGGGFLFFGDDIVPTSRVTWVPMARTTSGNYYSPG 269
Query: 245 VAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
L+F + G+K + VVFDSGS+Y Y YQ S +K LS KSLKE D +LP
Sbjct: 270 SGTLYFDRRSLGMKPMEVVFDSGSTYAYFAAEPYQATVSALKAGLS-KSLKEV-SDVSLP 327
Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
LCWKG++ FK+V +VK F++L LSF GK ++ E+ PE YLI++ GNVCLGIL+G
Sbjct: 328 LCWKGQKVFKSVSEVKNDFKSLFLSF--GK-NSVMEIPPENYLIVTKYGNVCLGILDGTT 384
Query: 365 VGLQDLNVIGGI 376
L+ N+IG I
Sbjct: 385 AKLK-FNIIGDI 395
>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
Length = 383
Score = 339 bits (869), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 179/336 (53%), Positives = 228/336 (67%), Gaps = 10/336 (2%)
Query: 22 SSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA 81
+SSS ++ SS +F ++G+VYP G Y V M IG P +PYFLD+D+GSDLTWLQCDA
Sbjct: 37 ASSSIAAGAETEPSSAVFPLYGDVYPHGLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDA 96
Query: 82 PCVRCVEAPHPLYRPS-NDLVPCEDPICASLH--APGHHNCEDP-AQCDYELEYADGGSS 137
PC C E PHPLYRP+ + LVPC +CASLH G H C+ P QCDY ++YAD GSS
Sbjct: 97 PCRSCNEVPHPLYRPTKSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSS 156
Query: 138 LGVLVKDAFAFNYTNGQRLNPRLALGCGYNQV--PGASYHPLDGILGLGKGKSSIVSQLH 195
GVL+ D+FA TNG P +A GCGY+Q G P DG+LGLG G S++SQL
Sbjct: 157 TGVLINDSFALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLK 216
Query: 196 SQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMS-SDYTKYYSPGVAELFFGGET 254
+ + +NVVGHCLS GGGFLFFGDDL R WT M+ S + YYSPG A L+FG +
Sbjct: 217 QRGVTKNVVGHCLSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRS 276
Query: 255 TGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFK 314
G++ VVFDSGSS+TY YQ L + +K LS ++L+E P D +LPLCWKG+ PFK
Sbjct: 277 LGVRLAKVVFDSGSSFTYFAAKPYQALVTALKDGLS-RTLEEEP-DTSLPLCWKGQEPFK 334
Query: 315 NVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIIS 350
+V DV+K F++L L+F GK +TL E+ PE YLI++
Sbjct: 335 SVLDVRKEFKSLVLNFASGK-KTLMEIPPENYLIVT 369
>gi|30699261|ref|NP_850981.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17065172|gb|AAL32740.1| nucellin-like protein [Arabidopsis thaliana]
gi|24899795|gb|AAN65112.1| nucellin-like protein [Arabidopsis thaliana]
gi|332197863|gb|AEE35984.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 466
Score = 339 bits (869), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 173/372 (46%), Positives = 239/372 (64%), Gaps = 5/372 (1%)
Query: 10 LCFPTVRMSSSSSSSSSSSLFNH-VGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDL 68
LC ++ SS+ L N + S+++F V GNVYP GYY V + IG P + + LD+
Sbjct: 25 LCARFQTSEATKDSSAQVKLQNRRLSSTVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDI 84
Query: 69 DTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDY 127
DTGSDLTW+QCDAPC C + Y+P+++ +PC +C+ L P C DP QCDY
Sbjct: 85 DTGSDLTWVQCDAPCNGCTKPRAKQYKPNHNTLPCSHILCSGLDLPQDRPCADPEDQCDY 144
Query: 128 ELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGK 185
E+ Y+D SS+G LV D NG +N RL GCGY+Q P GILGLG+
Sbjct: 145 EIGYSDHASSIGALVTDEVPLKLANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGR 204
Query: 186 GKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDY-TKYYSPG 244
GK + +QL S + +NV+ HCLS G GFL GD+L SS V WTS++++ +K Y G
Sbjct: 205 GKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAG 264
Query: 245 VAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
AEL F +TTG+K + VVFDSGSSYTY N YQ + +++K+L+ K L + +D++LP
Sbjct: 265 PAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLP 324
Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
+CWKG++P K++ +VKK F+T+ L F + K LF++ PE+YLII+ KG VCLGILNG E
Sbjct: 325 VCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVPPESYLIITEKGRVCLGILNGTE 384
Query: 365 VGLQDLNVIGGI 376
+GL+ N+IG I
Sbjct: 385 IGLEGYNIIGDI 396
>gi|30699263|ref|NP_177872.3| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332197862|gb|AEE35983.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 432
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 173/372 (46%), Positives = 239/372 (64%), Gaps = 5/372 (1%)
Query: 10 LCFPTVRMSSSSSSSSSSSLFNH-VGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDL 68
LC ++ SS+ L N + S+++F V GNVYP GYY V + IG P + + LD+
Sbjct: 25 LCARFQTSEATKDSSAQVKLQNRRLSSTVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDI 84
Query: 69 DTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDY 127
DTGSDLTW+QCDAPC C + Y+P+++ +PC +C+ L P C DP QCDY
Sbjct: 85 DTGSDLTWVQCDAPCNGCTKPRAKQYKPNHNTLPCSHILCSGLDLPQDRPCADPEDQCDY 144
Query: 128 ELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGK 185
E+ Y+D SS+G LV D NG +N RL GCGY+Q P GILGLG+
Sbjct: 145 EIGYSDHASSIGALVTDEVPLKLANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGR 204
Query: 186 GKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDY-TKYYSPG 244
GK + +QL S + +NV+ HCLS G GFL GD+L SS V WTS++++ +K Y G
Sbjct: 205 GKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAG 264
Query: 245 VAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
AEL F +TTG+K + VVFDSGSSYTY N YQ + +++K+L+ K L + +D++LP
Sbjct: 265 PAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLP 324
Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
+CWKG++P K++ +VKK F+T+ L F + K LF++ PE+YLII+ KG VCLGILNG E
Sbjct: 325 VCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVPPESYLIITEKGRVCLGILNGTE 384
Query: 365 VGLQDLNVIGGI 376
+GL+ N+IG I
Sbjct: 385 IGLEGYNIIGDI 396
>gi|12323376|gb|AAG51657.1|AC010704_1 nucellin-like protein; 27671-25467 [Arabidopsis thaliana]
Length = 427
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 173/372 (46%), Positives = 239/372 (64%), Gaps = 10/372 (2%)
Query: 10 LCFPTVRMSSSSSSSSSSSLFNH-VGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDL 68
LC ++ SS+ L N + S+++F V GNVYP GYY V + IG P + + LD+
Sbjct: 25 LCARFQTSEATKDSSAQVKLQNRRLSSTVVFPVSGNVYPLGYYYVLLNIGNPPKLFDLDI 84
Query: 69 DTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDY 127
DTGSDLTW+QCDAPC C + Y+P+++ +PC +C+ L P C DP QCDY
Sbjct: 85 DTGSDLTWVQCDAPCNGCTK-----YKPNHNTLPCSHILCSGLDLPQDRPCADPEDQCDY 139
Query: 128 ELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGK 185
E+ Y+D SS+G LV D NG +N RL GCGY+Q P GILGLG+
Sbjct: 140 EIGYSDHASSIGALVTDEVPLKLANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGR 199
Query: 186 GKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDY-TKYYSPG 244
GK + +QL S + +NV+ HCLS G GFL GD+L SS V WTS++++ +K Y G
Sbjct: 200 GKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAG 259
Query: 245 VAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
AEL F +TTG+K + VVFDSGSSYTY N YQ + +++K+L+ K L + +D++LP
Sbjct: 260 PAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLP 319
Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
+CWKG++P K++ +VKK F+T+ L F + K LF++ PE+YLII+ KG VCLGILNG E
Sbjct: 320 VCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVPPESYLIITEKGRVCLGILNGTE 379
Query: 365 VGLQDLNVIGGI 376
+GL+ N+IG I
Sbjct: 380 IGLEGYNIIGDI 391
>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
Length = 358
Score = 335 bits (858), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 170/320 (53%), Positives = 215/320 (67%), Gaps = 12/320 (3%)
Query: 38 LFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS 97
+FQ+ GNVYPTG+Y VTM IG PA+PYFLD+DTGSDLTWLQCDAPC C + PHPLYRP+
Sbjct: 41 IFQLQGNVYPTGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPT 100
Query: 98 -NDLVPCEDPICASLHAPGH---HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
N LVPC + +C +LH+ GH + C P QCDY+++Y D SS GVL+ D F+
Sbjct: 101 ANSLVPCANALCTALHS-GHGSNNKCPSPKQCDYQIKYTDSASSQGVLINDNFSLP-MRS 158
Query: 154 QRLNPRLALGCGYNQVPG---ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
+ P L GCGY+Q G A DG+LGLG+G S+VSQL Q + +NV+GHCLS
Sbjct: 159 SNIRPGLTFGCGYDQQVGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLGHCLST 218
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
GGGFLFFGDD+ +SRV W M+ YYSPG L+F + G+K + VVFDSGS+Y
Sbjct: 219 NGGGFLFFGDDIVPTSRVTWVPMAKISGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGSTY 278
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
TY YQ + S +K LS KSLK+ D +LPLCWKG + FK+V DVKK F++L LSF
Sbjct: 279 TYFTAQPYQAVVSALKSGLS-KSLKQV-SDPSLPLCWKGPKAFKSVFDVKKEFKSLFLSF 336
Query: 331 TDGKTRTLFELTPEAYLIIS 350
K + E+ PE YLI++
Sbjct: 337 ASAK-NAVMEIPPENYLIVT 355
>gi|449449906|ref|XP_004142705.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449500739|ref|XP_004161182.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 410
Score = 331 bits (849), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 179/353 (50%), Positives = 239/353 (67%), Gaps = 5/353 (1%)
Query: 27 SSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC 86
SS N SS+L V GNVYP G++ V++ IG P + + LD+DTGSDLTW+QCDAPC C
Sbjct: 31 SSAVNPFDSSILLPVKGNVYPLGHFTVSVTIGNPPKVFELDIDTGSDLTWVQCDAPCTGC 90
Query: 87 VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDA 145
LY+P N++V C +P+C++L + C++P QCDYE+EYAD GSS+GVLVKD
Sbjct: 91 TLPHDRLYKPHNNVVRCGEPLCSALFSASKSPCKNPNDQCDYEVEYADHGSSIGVLVKDP 150
Query: 146 FAFNYTNGQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNV 203
TNG L P L GCGY+Q G S P G+LGLG K+++ +QL + +RNV
Sbjct: 151 VPLRLTNGTILAPNLGFGCGYDQHNGGSQLPPLTAGVLGLGNSKATMATQLSALSHVRNV 210
Query: 204 VGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVV 263
+GHC SG GGGFLFFG DL SS + W + YS G AE++FGG G++ L +
Sbjct: 211 LGHCFSGQGGGFLFFGGDLVPSSGMSWMPILRTPGGKYSAGPAEVYFGGNPVGIRGLILT 270
Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
FDSGSSYTY N Y + ++++ L + L++APED+TLP+CWKG + FK+V DV+ F
Sbjct: 271 FDSGSSYTYFNSQVYGAVLNLLRNGLKGQPLRDAPEDKTLPICWKGSKAFKSVADVRNFF 330
Query: 324 RTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
+ LALSF G ++ F++ PEAYLIISN GNVCLGILNG++VGL ++N+IG I
Sbjct: 331 KPLALSF--GNSKVQFQIPPEAYLIISNLGNVCLGILNGSQVGLGNVNLIGDI 381
>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
Length = 357
Score = 331 bits (848), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 173/327 (52%), Positives = 219/327 (66%), Gaps = 12/327 (3%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS-NDLVPCEDPICASLHA-P 114
IG PA+PYFLD+DTGSDLTWLQCDAPC C + PHPLYRP+ N LVPC + +C +LH+
Sbjct: 1 IGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANRLVPCANALCTALHSGQ 60
Query: 115 GHHN-CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQV---P 170
G +N C P QCDY+++Y D SS GVL+ D+F+ + + P L GCGY+Q
Sbjct: 61 GSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRS-SNIRPGLTFGCGYDQQVGKN 119
Query: 171 GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVW 230
GA +DG+LGLG+G S+VSQL Q + +NVVGHCLS GGGFLFFGDD+ SSRV W
Sbjct: 120 GAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGGGFLFFGDDVVPSSRVTW 179
Query: 231 TSMSSDYT-KYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKEL 289
M+ + YYSPG L+F + G+K + VVFDSGS+YTY YQ + S +K L
Sbjct: 180 VPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGSTYTYFTAQPYQAVVSALKGGL 239
Query: 290 SAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLII 349
S KSLK+ D TLPLCWKG++ FK+V DVK F+++ LSF K + E+ PE YLI+
Sbjct: 240 S-KSLKQV-SDPTLPLCWKGQKAFKSVFDVKNEFKSMFLSFASAKNAAM-EIPPENYLIV 296
Query: 350 SNKGNVCLGILNGAEVGLQDLNVIGGI 376
+ GNVCLGIL+G L NVIG I
Sbjct: 297 TKNGNVCLGILDGTAAKL-SFNVIGDI 322
>gi|449449755|ref|XP_004142630.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449500674|ref|XP_004161165.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 413
Score = 320 bits (820), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 173/346 (50%), Positives = 232/346 (67%), Gaps = 4/346 (1%)
Query: 34 GSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPL 93
GSS+LF V GNVYP G++ V + IG P++ + LD+DTGSDLTW+QCD C+ C L
Sbjct: 36 GSSVLFPVRGNVYPLGHFTVLLNIGNPSKVFELDIDTGSDLTWVQCDVECIGCTLPRDML 95
Query: 94 YRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTN 152
YRP N+ V EDP+CA+L + G ++P QC YE+EYAD GSS+GVLVKD TN
Sbjct: 96 YRPHNNAVSREDPLCAALSSLGKFIFKNPNDQCAYEVEYADHGSSVGVLVKDLVPMRLTN 155
Query: 153 GQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
G+R++P L GCGY+Q G P + G+LGL K++IVSQL + NVVGHCL+G
Sbjct: 156 GKRISPNLGFGCGYDQENGDLQQPPSIAGVLGLSSSKATIVSQLSDLGHVSNVVGHCLTG 215
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
GGGFLFFG D+ SS + WT + + YS G AE++F G G+ L + FDSGSSY
Sbjct: 216 RGGGFLFFGGDVVPSSGMSWTPILRNSEGKYSSGPAEVYFNGRAVGIGGLTLTFDSGSSY 275
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
TY N Y+ + ++K +L LK A +D+TL LCWKG +PF++V DV+ F+ LA+SF
Sbjct: 276 TYFNSQVYRAIEKLLKNDLKGNPLKLASDDKTLELCWKGPKPFESVVDVRNFFKPLAMSF 335
Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
+ K F++ PEAYLIIS GNVCLGIL+G++ G+ ++N+IG I
Sbjct: 336 KNSKN-VQFQIPPEAYLIISEFGNVCLGILDGSKEGMGNVNIIGDI 380
>gi|357469591|ref|XP_003605080.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506135|gb|AES87277.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 425
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 169/376 (44%), Positives = 228/376 (60%), Gaps = 13/376 (3%)
Query: 12 FPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTG 71
FP +++ ++S + + + SSL++ + GNVYP G Y V++ IG P +PY LD+DTG
Sbjct: 23 FPHHFSAANKNNSIPPTSIHSLISSLVYTIKGNVYPDGLYTVSINIGNPPKPYELDIDTG 82
Query: 72 SDLTWLQC---DAPCVRCVEAPHPLYRPS-NDLVPCEDPICA---SLHAPGHHNCEDPAQ 124
SDLTW+QC DAPC C LY+P+ +V C DPIC S H G +
Sbjct: 83 SDLTWVQCDGPDAPCKGCTMPKDKLYKPNGKQVVKCSDPICVATQSTHVLGQICSKQSPP 142
Query: 125 CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQV---PGASYHPLDGIL 181
C Y ++YAD S+LGVLV+D + +P +A GCGY Q P + GIL
Sbjct: 143 CVYNVQYADHASTLGVLVRDYMHIGSPSSSTKDPLVAFGCGYEQKFSGPTPPHSKPAGIL 202
Query: 182 GLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTS-MSSDYTKY 240
GLG GK+SI+SQL S I NV+GHCLS GGG+LF GD SS +VWT + S K+
Sbjct: 203 GLGNGKTSILSQLTSIGFIHNVLGHCLSAEGGGYLFLGDKFVPSSGIVWTPIIQSSLEKH 262
Query: 241 YSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPED 300
Y+ G +LFF G+ T K L ++FDSGSSYTY + Y + +++ +L K L +D
Sbjct: 263 YNTGPVDLFFNGKPTPAKGLQIIFDSGSSYTYFSSPVYTIVANMVNNDLKGKPLSRV-KD 321
Query: 301 ETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGIL 360
+LP+CWKG +PFK++++V F+ L LSFT K F+L P AYLII+ GNVCLGIL
Sbjct: 322 PSLPICWKGVKPFKSLNEVNNYFKPLTLSFTKSKNLQ-FQLPPVAYLIITKYGNVCLGIL 380
Query: 361 NGAEVGLQDLNVIGGI 376
NG E GL + NV+G I
Sbjct: 381 NGNEAGLGNRNVVGDI 396
>gi|115484513|ref|NP_001065918.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|122221757|sp|Q0IU52.1|ASP1_ORYSJ RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
gi|33340111|gb|AAQ14543.1|AF308691_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|33340113|gb|AAQ14544.1|AF308692_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|62954898|gb|AAY23267.1| nucellin-like protein [Oryza sativa Japonica Group]
gi|77548967|gb|ABA91764.1| Aspartic proteinase Asp1 precursor, putative, expressed [Oryza
sativa Japonica Group]
gi|113644622|dbj|BAF27763.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|215766817|dbj|BAG99045.1| unnamed protein product [Oryza sativa Japonica Group]
gi|385717694|gb|AFI71282.1| aspartic proteinase [Oryza sativa Japonica Group]
Length = 410
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 169/360 (46%), Positives = 229/360 (63%), Gaps = 26/360 (7%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
S+++ ++HGNVYP G++ +TM IG PA+ YFLD+DTGS LTWLQCDAPC C PH LY
Sbjct: 22 SAVVLELHGNVYPIGHFFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNIVPHVLY 81
Query: 95 RPS-NDLVPCEDPICASLHAP--GHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
+P+ LV C D +C L+ C QCDY ++Y D SS+GVLV D F+ + +
Sbjct: 82 KPTPKKLVTCADSLCTDLYTDLGKPKRCGSQKQCDYVIQYVD-SSSMGVLVIDRFSLSAS 140
Query: 152 NGQRLNP-RLALGCGYNQ------VPGASYHPLDGILGLGKGKSSIVSQLHSQKLI-RNV 203
NG NP +A GCGY+Q VP P+D ILGL +GK +++SQL SQ +I ++V
Sbjct: 141 NGT--NPTTIAFGCGYDQGKKNRNVP----IPVDSILGLSRGKVTLLSQLKSQGVITKHV 194
Query: 204 VGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-- 261
+GHC+S GGGFLFFGD +S V WT M+ ++ KYYSPG L F + + P
Sbjct: 195 LGHCISSKGGGFLFFGDAQVPTSGVTWTPMNREH-KYYSPGHGTLHFDSNSKAISAAPMA 253
Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSA--KSLKEAPE-DETLPLCWKGRRPFKNVHD 318
V+FDSG++YTY YQ S++K L++ K L E E D L +CWKG+ + +
Sbjct: 254 VIFDSGATYTYFAAQPYQATLSVVKSTLNSECKFLTEVTEKDRALTVCWKGKDKIVTIDE 313
Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAE--VGLQDLNVIGGI 376
VKKCFR+L+L F DG + E+ PE YLIIS +G+VCLGIL+G++ + L N+IGGI
Sbjct: 314 VKKCFRSLSLEFADGDKKATLEIPPEHYLIISQEGHVCLGILDGSKEHLSLAGTNLIGGI 373
>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
Length = 446
Score = 305 bits (780), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 166/376 (44%), Positives = 227/376 (60%), Gaps = 21/376 (5%)
Query: 21 SSSSSSSSLFNHVGSSL------LFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDL 74
S +S S N +G L +F + GNV P G Y VTM +G P++PYFLD+D+GS+L
Sbjct: 43 SKASFVSRDTNRIGRRLQAHQTAIFSLKGNVVPYGLYYVTMLVGNPSKPYFLDVDSGSEL 102
Query: 75 TWLQCDAPCVRCVEAPHPLYR-PSNDLVPCEDPICASLHA-PGH-HNCEDPAQ-CDYELE 130
TW+QCDAPC+ C + PHPLY+ LVP +DP+CA++ A GH HN ++ +Q CDY++
Sbjct: 103 TWIQCDAPCISCAKGPHPLYKLKKGSLVPSKDPLCAAVQAGSGHYHNHKEASQRCDYDVA 162
Query: 131 YADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKS 188
YAD G S G LV+D+ TN L GCGYNQ S DGILGLG G +
Sbjct: 163 YADHGYSEGFLVRDSVRALLTNKTVLTANSVFGCGYNQRESLPVSDARTDGILGLGSGMA 222
Query: 189 SIVSQLHSQKLIRNVVGHCLSGGG--GGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGV 245
S+ SQ Q LI+NV+GHC+ G G GG++FFGDDL +S + W M K+Y G
Sbjct: 223 SLPSQWAKQGLIKNVIGHCIFGAGRDGGYMFFGDDLVSTSAMTWVPMLGRPSIKHYYVGA 282
Query: 246 AELFFGG-----ETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPED 300
A++ FG + G K ++FDSGS+YTY Y S++K+ LS K L++ D
Sbjct: 283 AQMNFGNKPLDKDGDGKKLGGIIFDSGSTYTYFTNQAYGAFLSVVKENLSGKQLEQDSSD 342
Query: 301 ETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGIL 360
L LCW+ + F++V + F+ L L F KT+ + E+ PE YL+++ KGNVCLGIL
Sbjct: 343 SFLSLCWRRKEGFRSVAEAAAYFKPLTLKFRSTKTKQM-EIFPEGYLVVNKKGNVCLGIL 401
Query: 361 NGAEVGLQDLNVIGGI 376
NG +G+ D NV+G I
Sbjct: 402 NGTAIGIVDTNVLGDI 417
>gi|158513711|sp|A2ZC67.2|ASP1_ORYSI RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
Length = 410
Score = 300 bits (768), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 167/358 (46%), Positives = 229/358 (63%), Gaps = 22/358 (6%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
S+++ ++HGNVYP G++ VTM IG PA+PYFLD+DTGS LTWLQCD PC+ C + PH LY
Sbjct: 22 SAVVLELHGNVYPIGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLY 81
Query: 95 RPS-NDLVPCEDPICASLHAPGHH--NCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
+P V C + CA L+A C QC Y ++Y GGSS+GVL+ D+F+ +
Sbjct: 82 KPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV-GGSSIGVLIVDSFSLPAS 140
Query: 152 NGQRLNP-RLALGCGYNQVPGASYH----PLDGILGLGKGKSSIVSQLHSQKLI-RNVVG 205
NG NP +A GCGYNQ G + H P++GILGLG+GK +++SQL SQ +I ++V+G
Sbjct: 141 NGT--NPTSIAFGCGYNQ--GKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLG 196
Query: 206 HCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP--VV 263
HC+S G GFLFFGD +S V W+ M+ ++ K+YSP L F + + P V+
Sbjct: 197 HCISSKGKGFLFFGDAKVPTSGVTWSPMNREH-KHYSPRQGTLQFNSNSKPISAAPMEVI 255
Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSA--KSLKEAPE-DETLPLCWKGRRPFKNVHDVK 320
FDSG++YTY Y S++K LS K L E E D L +CWKG+ + + +VK
Sbjct: 256 FDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEVK 315
Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEV--GLQDLNVIGGI 376
KCFR+L+L F DG + E+ PE YLIIS +G+VCLGIL+G++ L N+IGGI
Sbjct: 316 KCFRSLSLKFADGDKKATLEIPPEHYLIISQEGHVCLGILDGSKEHPSLAGTNLIGGI 373
>gi|37542275|gb|AAK81698.1| aspartyl proteinase [Oryza sativa]
Length = 410
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 166/358 (46%), Positives = 228/358 (63%), Gaps = 22/358 (6%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
S+++ ++HGNVYP G++ VTM I PA+PYFLD+DTGS LTWLQCD PC+ C + PH LY
Sbjct: 22 SAVVLELHGNVYPIGHFFVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLY 81
Query: 95 RPS-NDLVPCEDPICASLHAPGHH--NCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
+P V C + CA L+A C QC Y ++Y GGSS+GVL+ D+F+ +
Sbjct: 82 KPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV-GGSSIGVLIVDSFSLPAS 140
Query: 152 NGQRLNP-RLALGCGYNQVPGASYH----PLDGILGLGKGKSSIVSQLHSQKLI-RNVVG 205
NG NP +A GCGYNQ G + H P++GILGLG+GK +++SQL SQ +I ++V+G
Sbjct: 141 NGT--NPTSIAFGCGYNQ--GKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLG 196
Query: 206 HCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP--VV 263
HC+S G GFLFFGD +S V W+ M+ ++ K+YSP L F + + P V+
Sbjct: 197 HCISSKGKGFLFFGDAKVPTSGVTWSPMNREH-KHYSPRQGTLHFNSNSKPISAAPMEVI 255
Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSA--KSLKEAPE-DETLPLCWKGRRPFKNVHDVK 320
FDSG++YTY Y S++K LS K L E E D L +CWKG+ + + +VK
Sbjct: 256 FDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEVK 315
Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEV--GLQDLNVIGGI 376
KCFR+L+L F DG + E+ PE YLIIS +G+VCLGIL+G++ L N+IGGI
Sbjct: 316 KCFRSLSLKFADGDKKATLEIPPEHYLIISQEGHVCLGILDGSKEHPSLAGTNLIGGI 373
>gi|37542277|gb|AAK81699.1| aspartyl proteinase [Oryza sativa]
Length = 411
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 165/359 (45%), Positives = 226/359 (62%), Gaps = 23/359 (6%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
S+++ ++HGNVYP G++ VTM I PA+PYFLD+DTGS LTWLQCD PC+ C + PH LY
Sbjct: 22 SAVVLELHGNVYPIGHFFVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPHGLY 81
Query: 95 RPS-NDLVPCEDPICASLHAPGHH--NCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
+P V C + CA L+A C QC Y ++Y GGSS+GVL+ D+F+ +
Sbjct: 82 KPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV-GGSSIGVLIVDSFSLPAS 140
Query: 152 NGQRLNP-RLALGCGYNQVPGASYH----PLDGILGLGKGKSSIVSQLHSQKLI-RNVVG 205
NG NP +A GCGYNQ G + H P++GILGLG+GK +++SQL SQ +I ++V+G
Sbjct: 141 NGT--NPTSIAFGCGYNQ--GKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLG 196
Query: 206 HCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGET---TGLKNLPV 262
HC+S G GFLFFGD +S V W+ M+ ++ K+YSP L F + V
Sbjct: 197 HCISSKGKGFLFFGDAKVPTSGVTWSPMNREH-KHYSPRQGTLHFNSNKQSPISAAPMEV 255
Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSA--KSLKEAPE-DETLPLCWKGRRPFKNVHDV 319
+FDSG++YTY Y S++K LS K L E E D L +CWKG+ + + +V
Sbjct: 256 IFDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCWKGKDKIRTIDEV 315
Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEV--GLQDLNVIGGI 376
KKCFR+L+L F DG + E+ PE YLIIS +G+VCLGIL+G++ L N+IGGI
Sbjct: 316 KKCFRSLSLKFADGDKKATLEIPPEHYLIISQEGHVCLGILDGSKEHPSLAGTNLIGGI 374
>gi|357469587|ref|XP_003605078.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506133|gb|AES87275.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 418
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 164/376 (43%), Positives = 220/376 (58%), Gaps = 19/376 (5%)
Query: 12 FPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTG 71
FP +++ ++S + + + SSL++ + GNVYP G Y V++ IG P PY LD+DTG
Sbjct: 23 FPHHFSAANKNNSIPPTSIHSLISSLVYTIKGNVYPDGIYTVSINIGNPPNPYELDIDTG 82
Query: 72 SDLTWLQCD---APCVRCVEAPHPLYRPS-NDLVPCEDPICASLHAPGH---HNCEDP-A 123
SDLTW+QCD APC C LY+P+ N LV C DPICA++ P C P
Sbjct: 83 SDLTWVQCDGPDAPCKGCTLPKDKLYKPNGNQLVKCSDPICAAVQPPFSTFGQKCAKPIP 142
Query: 124 QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGIL 181
C Y++EYAD S G L +D +G + P + GCGY Q G+L
Sbjct: 143 PCVYKVEYADNAESTGALARDYMHIGSPSGSNV-PLVVFGCGYEQKFSGPTPPPSTPGVL 201
Query: 182 GLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSM-SSDYTKY 240
GLG GK SI+SQLHS I NV+GHCLS GGG+LF GD SS + WT + S K+
Sbjct: 202 GLGNGKISILSQLHSMGFIHNVLGHCLSAEGGGYLFLGDKFIPSSGIFWTPIIQSSLEKH 261
Query: 241 YSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPED 300
YS G +LFF G+ T K L ++FDSGSSYTY + Y + +++ +L K L+ +D
Sbjct: 262 YSTGPVDLFFNGKPTPAKGLQIIFDSGSSYTYFSPRVYTIVANMVNNDLKGKPLRRETKD 321
Query: 301 ETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGIL 360
+LP+CWKG +PFK++++V F+ L LSFT K F+L P + GNVCLGIL
Sbjct: 322 PSLPICWKGVKPFKSLNEVNNYFKPLTLSFTKSKNLQ-FQLPPVKF------GNVCLGIL 374
Query: 361 NGAEVGLQDLNVIGGI 376
NG E GL + NV+G I
Sbjct: 375 NGNEAGLGNRNVVGDI 390
>gi|218185383|gb|EEC67810.1| hypothetical protein OsI_35379 [Oryza sativa Indica Group]
Length = 423
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 168/371 (45%), Positives = 230/371 (61%), Gaps = 35/371 (9%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEA----- 89
S+++ ++HGNVYP G++ VTM IG PA+PYFLD+DTGS LTWLQCD PC+ C +A
Sbjct: 22 SAVVLELHGNVYPIGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKAHSLFY 81
Query: 90 --------PHPLYRPS-NDLVPCEDPICASLHAPGHH--NCEDPAQCDYELEYADGGSSL 138
PH LY+P V C + CA L+A C QC Y ++Y GGSS+
Sbjct: 82 PRLIGSFVPHGLYKPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV-GGSSI 140
Query: 139 GVLVKDAFAFNYTNGQRLNP-RLALGCGYNQVPGASYH----PLDGILGLGKGKSSIVSQ 193
GVL+ D+F+ +NG NP +A GCGYNQ G + H P++GILGLG+GK +++SQ
Sbjct: 141 GVLIVDSFSLPASNGT--NPTSIAFGCGYNQ--GKNNHNVPTPVNGILGLGRGKVTLLSQ 196
Query: 194 LHSQKLI-RNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGG 252
L SQ +I ++V+GHC+S G GFLFFGD +S V W+ M+ ++ K+YSP L F
Sbjct: 197 LKSQGVITKHVLGHCISSKGKGFLFFGDAKVPTSGVTWSPMNREH-KHYSPRQGTLQFNS 255
Query: 253 ETTGLKNLP--VVFDSGSSYTYLNRVTYQTLTSIMKKELSA--KSLKEAPE-DETLPLCW 307
+ + P V+FDSG++YTY Y S++K LS K L E E D L +CW
Sbjct: 256 NSKPISAAPMEVIFDSGATYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCW 315
Query: 308 KGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEV-- 365
KG+ + + +VKKCFR+L+L F DG + E+ PE YLIIS +G+VCLGIL+G++
Sbjct: 316 KGKDKIRTIDEVKKCFRSLSLKFADGDKKATLEIPPEHYLIISQEGHVCLGILDGSKEHP 375
Query: 366 GLQDLNVIGGI 376
L N+IGGI
Sbjct: 376 SLAGTNLIGGI 386
>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
Length = 407
Score = 290 bits (742), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 167/356 (46%), Positives = 222/356 (62%), Gaps = 27/356 (7%)
Query: 37 LLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA---PCVRCVEAPHPL 93
++F++ G+V+PTG++ VTM IG+PA+PYFLD+DTGS+LTW++C A PC C + PHPL
Sbjct: 26 MVFKLGGDVHPTGHFYVTMNIGEPAKPYFLDIDTGSNLTWIKCHATPGPCKTCNKVPHPL 85
Query: 94 YRPSNDLVPCEDPICASLHAP--GHHNC-EDPAQCDYELEYADGGSSLGVLVKDAFAFNY 150
YRP LVPC DP+C +LH +C E+P QC Y++ YADG +SLGVL+ D F+
Sbjct: 86 YRPKK-LVPCADPLCDALHKDLGTTKDCREEPDQCHYQINYADGTTSLGVLLLDKFSLPT 144
Query: 151 TNGQRLNPRLALGCGYNQVPGASYH-----PLDGILGLGKGKSSIVSQL-HSQKLIRNVV 204
+ + +A GCGY+Q+ G P+DGILGLG+G +VSQL HS + +NV+
Sbjct: 145 GSAR----NIAFGCGYDQMQGPKKKAPEKVPVDGILGLGRGSVDLVSQLKHSGAVSKNVI 200
Query: 205 GHCLSGGGGGFLFFGDDLYDSS--RVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV 262
GHCLS GGG+LF G++ SS +++ S +YSPG A L G G K
Sbjct: 201 GHCLSSKGGGYLFIGEENVPSSHLHIIYIYCISREPNHYSPGQATLHLGRNPIGTKPFKA 260
Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET-LPLCWKGRRPFKNVHDVKK 321
+FDSGS+YTYL + L S +K L SLK + +T L LCWKG +PFK VHD+ K
Sbjct: 261 IFDSGSTYTYLPENLHAQLVSALKASLIKSSLKLVSDTDTRLHLCWKGPKPFKTVHDLPK 320
Query: 322 CFRTLA-LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
F++L L F G T T + PE YLII+ GN C GIL E+ DL VIGGI
Sbjct: 321 EFKSLVTLKFDHGVTMT---IPPENYLIITGHGNACFGIL---ELPGYDLFVIGGI 370
>gi|222615640|gb|EEE51772.1| hypothetical protein OsJ_33215 [Oryza sativa Japonica Group]
Length = 775
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 171/390 (43%), Positives = 231/390 (59%), Gaps = 38/390 (9%)
Query: 5 HNGENLCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPY 64
H N T R SS + + ++ L VG+ P ++ +TM IG PA+ Y
Sbjct: 369 HETPNRKVGTARQPSSPAPTGAAILCRGVGA-----------PRHFF-ITMNIGDPAKSY 416
Query: 65 FLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS-NDLVPCEDPICASLHAP--GHHNCED 121
FLD+DTGS LTWLQCDAPC C PH LY+P+ LV C D +C L+ C
Sbjct: 417 FLDIDTGSTLTWLQCDAPCTNCNIVPHVLYKPTPKKLVTCADSLCTDLYTDLGKPKRCGS 476
Query: 122 PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALGCGYNQ------VPGASY 174
QCDY ++Y D SS+GVLV D F+ + +NG NP +A GCGY+Q VP
Sbjct: 477 QKQCDYVIQYVDS-SSMGVLVIDRFSLSASNGT--NPTTIAFGCGYDQGKKNRNVP---- 529
Query: 175 HPLDGILGLGKGKSSIVSQLHSQKLI-RNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSM 233
P+D ILGL +GK +++SQL SQ +I ++V+GHC+S GGGFLFFGD +S V WT M
Sbjct: 530 IPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHCISSKGGGFLFFGDAQVPTSGVTWTPM 589
Query: 234 SSDYTKYYSPGVAELFFGGETTGLKNLP--VVFDSGSSYTYLNRVTYQTLTSIMKKELSA 291
+ ++ KYYSPG L F + + P V+FDSG++YTY YQ S++K L++
Sbjct: 590 NREH-KYYSPGHGTLHFDSNSKAISAAPMAVIFDSGATYTYFAAQPYQATLSVVKSTLNS 648
Query: 292 --KSLKEAPE-DETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI 348
K L E E D L +CWKG+ + +VKKCFR+L+L F DG + E+ PE YLI
Sbjct: 649 ECKFLTEVTEKDRALTVCWKGKDKIVTIDEVKKCFRSLSLEFADGDKKATLEIPPEHYLI 708
Query: 349 ISNKGNVCLGILNGAE--VGLQDLNVIGGI 376
IS +G+VCLGIL+G++ + L N+IGGI
Sbjct: 709 ISQEGHVCLGILDGSKEHLSLAGTNLIGGI 738
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 125/284 (44%), Positives = 173/284 (60%), Gaps = 25/284 (8%)
Query: 100 LVPCEDPICASLHAPGHH---NCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
+V +DP+ +LH G N P QCDYE++YADG S++G L+ D F+ +
Sbjct: 1 MVRADDPLYVALHEDGRSGDGNHMSPTQCDYEIKYADGASTIGALIVDQFSLPRIATR-- 58
Query: 157 NPRLALGCGYNQVPGASYH---PLDGILGLGKGKSSIVSQLHSQKLI-RNVVGHCLSGGG 212
P L GCGYNQ G ++ P++GILGL +GK S VSQL +I ++VVGHCLS GG
Sbjct: 59 -PNLPFGCGYNQGIGENFQQTSPVNGILGLDRGKVSFVSQLKMLGIITKHVVGHCLSSGG 117
Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTY 272
GG LF GD D + V+ + YYSPG A L+F + G+ + VVFDSGS+YTY
Sbjct: 118 GGLLFVGDG--DGNLVLL------HANYYSPGSATLYFDRHSLGMNPMDVVFDSGSTYTY 169
Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
YQ +K LS+ SL++ D +LPLCWKG++ F++V DVKK F++L L+F +
Sbjct: 170 FTAQPYQATVYAIKGGLSSTSLEQV-SDPSLPLCWKGQKAFESVFDVKKEFKSLQLNFGN 228
Query: 333 GKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
+ E+ PE YLI++ GNVCLGIL+G + + N+IG I
Sbjct: 229 ---NAVMEIPPENYLIVTEYGNVCLGILHGCRL---NFNIIGDI 266
>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 414
Score = 287 bits (735), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 151/367 (41%), Positives = 211/367 (57%), Gaps = 15/367 (4%)
Query: 25 SSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV 84
S +S+ + + + GN+YP G Y + M IG PA+ Y+LD+DTGSDLTWLQCDAPC
Sbjct: 5 SKASVPETAQRTAAYPIGGNIYPDGLYYMAMRIGNPAKLYYLDMDTGSDLTWLQCDAPCR 64
Query: 85 RCVEAPHPLYRPSN-DLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLV 142
C PH LY P +V C P CA + G C D QCDYE++Y DG S++G+LV
Sbjct: 65 SCAVGPHGLYDPKRARVVDCRRPTCAQVQRGGQFTCSGDVRQCDYEVDYVDGSSTMGILV 124
Query: 143 KDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLI 200
+D TNG R R +GCGY+Q + P DG++GL K S+ SQL ++ +
Sbjct: 125 EDTITLVLTNGTRFQTRAVIGCGYDQQGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIA 184
Query: 201 RNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFGGETTGL 257
NV+GHCL+GG GGG+LFFGD L + + WT M + Y + + +GGE L
Sbjct: 185 NNVIGHCLAGGSNGGGYLFFGDTLVPALGMTWTPMIGRPLVEGYQARLRSIKYGGEVLEL 244
Query: 258 KNLP-----VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRP 312
+ +FDSG+S+TYL Y + S + ++ L+ D TLP CW+G P
Sbjct: 245 EGTTDDVGGAMFDSGTSFTYLVPNAYTAVLSAVVRQAQRSGLERIKTDTTLPFCWRGPSP 304
Query: 313 FKNVHDVKKCFRTLALSF---TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQD 369
F++V DV F+T+ L F T + L EL+PE YLI+S +GNVCLG+L+ + L+
Sbjct: 305 FESVADVSAYFKTVTLDFGGSTWWSSGKLLELSPEGYLIVSTQGNVCLGVLDASVASLEV 364
Query: 370 LNVIGGI 376
N++G I
Sbjct: 365 TNILGDI 371
>gi|168027607|ref|XP_001766321.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682535|gb|EDQ68953.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 381
Score = 284 bits (727), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 150/353 (42%), Positives = 209/353 (59%), Gaps = 16/353 (4%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
+++ Q+ GN+YP G Y + M IG PA+ Y+LD+DTGSDLTWLQCDAPC C PH LY
Sbjct: 7 ATVFSQLRGNIYPDGLYYMAMLIGAPAKLYYLDMDTGSDLTWLQCDAPCRSCASGPHGLY 66
Query: 95 RPSN-DLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTN 152
P LV C P+CA + G + C P QCDY++EYADG S++GVL++D TN
Sbjct: 67 DPKKARLVDCRVPLCALVQQGGSYACGGPVRQCDYDVEYADGSSTMGVLMEDTITLLLTN 126
Query: 153 GQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
G R +GCGY+Q + P DG++GL K S+ SQL + ++RNV+GHCL+G
Sbjct: 127 GTRSKTTAIIGCGYDQQGTLAQTPASTDGVMGLSSAKISLPSQLAKKGIVRNVIGHCLAG 186
Query: 211 G--GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGS 268
G GGG+LFFGD L + + WT + K + + + V+FDSG+
Sbjct: 187 GSNGGGYLFFGDSLVPALGMTWTPIMG---KSITGNIGGKSGDADDKTGDIGGVMFDSGT 243
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
S+TYL Y + S M+ ++ L D TLP CW+G PF++V DV++ F+T+ L
Sbjct: 244 SFTYLVPEAYNAVLSAMEMQVEKSGLVRIKTDNTLPFCWRGPSPFESVADVQRYFKTVTL 303
Query: 329 SFTDGK-----TRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
F GK + EL+PE YLI+S +GNVCLGIL+ + L+ N+IG +
Sbjct: 304 DF--GKRNWYSASRVLELSPEGYLIVSTQGNVCLGILDASGASLEVTNIIGDV 354
>gi|356507650|ref|XP_003522577.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Glycine max]
Length = 326
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 160/330 (48%), Positives = 203/330 (61%), Gaps = 28/330 (8%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLH 112
+++ I + Y LD+DTGSDLTW Q DAPC C L +P LV C D +CA++H
Sbjct: 1 MSITITSSSELYELDIDTGSDLTWFQWDAPCQGCTLPRDKLNKPHCKLVKCGDRLCAAIH 60
Query: 113 APGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPG 171
+ C DP QCDYE+EYAD GSSLGVLV D A +T+G P LA P
Sbjct: 61 S---EPCADPDEQCDYEVEYADQGSSLGVLVLDNIALKFTSGSLARPILA-------APD 110
Query: 172 ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWT 231
+GL GK+SI+SQLHS LIRNVVGHCLS GGGFLFFGD L S VVWT
Sbjct: 111 ---------MGLATGKTSILSQLHSLGLIRNVVGHCLSRRGGGFLFFGDQLIPQSGVVWT 161
Query: 232 SM----SSDYTK-YYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMK 286
+ S YT+ +Y G A++FF G+ T +K L + FDSGSSYT N ++ L ++
Sbjct: 162 PLLQNSSVTYTRPHYKTGPADMFFNGKATSVKGLELTFDSGSSYTXFNSHAHKALVGLIT 221
Query: 287 KELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAY 346
++ KS A ED +LP+CWK + FK++HDV F+ +ALSFT K +L +L PEAY
Sbjct: 222 NDIKGKSFSRATEDPSLPICWKNPKTFKSLHDVTNYFKPIALSFTKSK-NSLLQLPPEAY 280
Query: 347 LIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
LI GNVCLGIL+G E+GL + N+IG I
Sbjct: 281 LI--KYGNVCLGILDGTEIGLGNTNIIGDI 308
>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 423
Score = 283 bits (724), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 153/368 (41%), Positives = 216/368 (58%), Gaps = 24/368 (6%)
Query: 27 SSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC 86
SS+ NH S+ F V GN+YP G Y + + +G P + YFLD+DTGSDLTW QCDAPC C
Sbjct: 19 SSVGNH---SVRFHVGGNIYPDGLYYMALLLGSPPKLYFLDMDTGSDLTWAQCDAPCRNC 75
Query: 87 VEAPHPLYRPSN-DLVPCEDPICASLHAPGHHNC-EDPAQCDYELEYADGGSSLGVLVKD 144
PH LY P +V C P+CA + G + C D QCDYE+EYADG S++GVLV+D
Sbjct: 76 AIGPHGLYNPKKAKVVDCHLPVCAQIQQGGSYECNSDVKQCDYEVEYADGSSTMGVLVED 135
Query: 145 AFAFNYTNGQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRN 202
TNG + + +GCGY+Q + P DG++GL K ++ +QL + +I+N
Sbjct: 136 TLTVRLTNGTLIQTKAIIGCGYDQQGTLAKSPASTDGVIGLSSSKVALPAQLAEKGIIKN 195
Query: 203 VVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKY-YSPGVAELFFGGETTGLKN 259
V+GHCL+ G GGG+LFFGD+L S + WT M Y + + +GG++ L N
Sbjct: 196 VLGHCLADGSNGGGYLFFGDELVPSWGMTWTPMMGKPEMLGYQARLQSIRYGGDSLVLNN 255
Query: 260 --------LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR 311
V+FDSG+S+TYL Y ++ S + K+ L D TLP CW+G
Sbjct: 256 DEDLTRSTSSVMFDSGTSFTYLVPQAYASVLSAVTKQ---SGLLRVKSDTTLPYCWRGPS 312
Query: 312 PFKNVHDVKKCFRTLALSFTDGK---TRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQ 368
PF+++ DV + F+TL L F T + +L+P+ YLI+S +GNVCLGIL+ + L+
Sbjct: 313 PFQSITDVHQYFKTLTLDFGGRNWFATDSTLDLSPQGYLIVSTQGNVCLGILDASGASLE 372
Query: 369 DLNVIGGI 376
N+IG +
Sbjct: 373 VTNIIGDV 380
>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
Length = 557
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 152/357 (42%), Positives = 206/357 (57%), Gaps = 21/357 (5%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
S+ L + GNV+P G Y ++++G P RPYFLD+DTGSDLTW+QCDAPC C + PHPLY
Sbjct: 171 STALLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLY 230
Query: 95 RPSND-LVPCEDPICASLHAPGHHN-CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN 152
+P+ + +VP D +C L G+ N CE QCDYE+EYAD SS+GVL +D TN
Sbjct: 231 KPTKEKIVPPRDLLCQELQ--GNQNYCETCKQCDYEIEYADQSSSMGVLARDDMHLIATN 288
Query: 153 GQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS- 209
G R GC Y+Q P DGILGL S+ SQL S +I N+ GHC++
Sbjct: 289 GGREKLDFVFGCAYDQQGQLLSSPAKTDGILGLSNAAISLPSQLASHGIISNIFGHCITR 348
Query: 210 -GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN-----LPVV 263
GGGG++F GDD + WTS+ S Y + +G + ++ + V+
Sbjct: 349 EQGGGGYMFLGDDYVPRWGITWTSIRSGPDNLYHTEAHHVKYGDQQLRMREQAGNTVQVI 408
Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
FDSGSSYTYL Y+ L + +K ++ + D TLPLCWK P + + DVK+ F
Sbjct: 409 FDSGSSYTYLPDEIYENLVAAIK--YASPGFVQDSSDRTLPLCWKADFPVRYLEDVKQFF 466
Query: 324 RTLALSFTDGKT----RTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
+ L L F GK F ++PE YLIIS+KGNVCLG+LNG E+ ++G +
Sbjct: 467 KPLNLHF--GKKWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDV 521
>gi|255541790|ref|XP_002511959.1| protein with unknown function [Ricinus communis]
gi|223549139|gb|EEF50628.1| protein with unknown function [Ricinus communis]
Length = 583
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 159/380 (41%), Positives = 218/380 (57%), Gaps = 23/380 (6%)
Query: 1 MKSSHNGENLCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQP 60
+ +S N +++ P +SS++++ V SS +F V GNVYP G Y + +G P
Sbjct: 164 LVASVNDDDVIVPNRNYKLASSNAAA------VDSSSVFPVRGNVYPDGLYFTYILVGNP 217
Query: 61 ARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND-LVPCEDPICASLHAPGHHN- 118
RPY+LD+DT SDLTW+QCDAPC C + + LY+P D +V +D +C LH
Sbjct: 218 PRPYYLDIDTASDLTWIQCDAPCTSCAKGANALYKPRRDNIVTPKDSLCVELHRNQKAGY 277
Query: 119 CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPL- 177
CE QCDYE+EYAD SS+GVL +D NG N + GC Y+Q G + L
Sbjct: 278 CETCQQCDYEIEYADHSSSMGVLARDELHLTMANGSSTNLKFNFGCAYDQ-QGLLLNTLV 336
Query: 178 --DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSM 233
DGILGL K K S+ SQL ++ +I NVVGHCL+ GGG++F GDD + W M
Sbjct: 337 KTDGILGLSKAKVSLPSQLANRGIINNVVGHCLANDVVGGGYMFLGDDFVPRWGMSWVPM 396
Query: 234 -SSDYTKYYSPGVAELFFGGETTGL-----KNLPVVFDSGSSYTYLNRVTYQTLTSIMKK 287
S Y + +L +G L + +VFDSGSSYTY + Y L + + K
Sbjct: 397 LDSPSIDSYQTQIMKLNYGSGPLSLGGQERRVRRIVFDSGSSYTYFTKEAYSELVASL-K 455
Query: 288 ELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG--KTRTLFELTPEA 345
++S ++L + D TLP CW+ + P ++V DVK+ F+TL L F T F + PE
Sbjct: 456 QVSGEALIQDTSDPTLPFCWRAKFPIRSVIDVKQYFKTLTLQFGSKWWIISTKFRIPPEG 515
Query: 346 YLIISNKGNVCLGILNGAEV 365
YLIISNKGNVCLGIL+G++V
Sbjct: 516 YLIISNKGNVCLGILDGSDV 535
>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
Length = 557
Score = 278 bits (710), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 157/376 (41%), Positives = 211/376 (56%), Gaps = 27/376 (7%)
Query: 16 RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
RM + ++++ ++ S+ L + GNV+P G Y +++IG P RPYFLD+DTGSDLT
Sbjct: 158 RMEVAKAATARTN------STALLPIKGNVFPDGQYYTSIFIGNPPRPYFLDVDTGSDLT 211
Query: 76 WLQCDAPCVRCVEAPHPLYRPSND-LVPCEDPICASLHAPGHHN-CEDPAQCDYELEYAD 133
W+QCDAPC C + PHPLY+P+ + +VP D +C L G+ N CE QCDYE+EYAD
Sbjct: 212 WIQCDAPCTNCAKGPHPLYKPAKEKIVPPRDLLCQELQ--GNQNYCETCKQCDYEIEYAD 269
Query: 134 GGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIV 191
SS+GVL +D TNG R GC Y+Q P DGILGL S
Sbjct: 270 QSSSMGVLARDDMHMIATNGGREKLDFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFP 329
Query: 192 SQLHSQKLIRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELF 249
SQL S +I NV GHC++ GGGG++F GDD V WTS+ S Y +
Sbjct: 330 SQLASHGIIANVFGHCITREQGGGGYMFLGDDYVPRWGVTWTSIRSGPDNLYHTQAHHVK 389
Query: 250 FGGET-----TGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
+G + + V+FDSGSSYTYL Y+ L + +K ++ + D TLP
Sbjct: 390 YGDQQLRRPEQAGSTVQVIFDSGSSYTYLPNEIYENLVAAIK--YASPGFVQDTSDRTLP 447
Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKT----RTLFELTPEAYLIISNKGNVCLGIL 360
LCWK P + + DVK+ F L L F GK F ++PE YLIIS+KGNVCLG+L
Sbjct: 448 LCWKADFPVRYLEDVKQFFEPLNLHF--GKKWLFMSKTFTISPEDYLIISDKGNVCLGLL 505
Query: 361 NGAEVGLQDLNVIGGI 376
NG E+ ++G +
Sbjct: 506 NGTEINHGSTIIVGDV 521
>gi|388518245|gb|AFK47184.1| unknown [Lotus japonicus]
Length = 245
Score = 277 bits (709), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 134/206 (65%), Positives = 169/206 (82%), Gaps = 2/206 (0%)
Query: 172 ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWT 231
+SYHPLDG+LGLG+GKSS+VSQL+SQ L+RNVVGHCLS GGG++FFGD +YDSSR+ WT
Sbjct: 7 SSYHPLDGMLGLGRGKSSLVSQLNSQGLVRNVVGHCLSAQGGGYIFFGD-VYDSSRLTWT 65
Query: 232 SMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSA 291
MSS K+Y G AEL FGG+ TG+ L VFD+GSSYTY N YQ + S +KKEL+
Sbjct: 66 PMSSRDLKHYVAGAAELIFGGKKTGIGGLLPVFDTGSSYTYFNSNAYQAVISWLKKELAG 125
Query: 292 KSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT-DGKTRTLFELTPEAYLIIS 350
K LKEAP+D+TLPLCW G+RPF++V++V+K F+++ALSFT G+T T FE+ PEAYLI+S
Sbjct: 126 KPLKEAPDDQTLPLCWHGKRPFRSVYEVRKYFKSMALSFTSSGRTNTQFEIPPEAYLIVS 185
Query: 351 NKGNVCLGILNGAEVGLQDLNVIGGI 376
N GNVCLGIL+G+EVG+ DLN+IG I
Sbjct: 186 NMGNVCLGILDGSEVGMGDLNLIGDI 211
>gi|357152725|ref|XP_003576216.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like,
partial [Brachypodium distachyon]
Length = 354
Score = 277 bits (708), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 154/347 (44%), Positives = 204/347 (58%), Gaps = 43/347 (12%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
SS++F++HG+VYPTG+ VTM IG+ +PYFLD+DTGS LTWL+ VR
Sbjct: 20 SSMVFELHGDVYPTGHIYVTMSIGEQEKPYFLDIDTGSTLTWLED----VRF-------- 67
Query: 95 RPSNDLVPCEDPICASLHAPGHHNC-EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
H+C E+P QCDY++ YA G SSLGVL+ D F+ G
Sbjct: 68 ---------------------KHDCKENPNQCDYDVRYAGGESSLGVLIADKFSLP---G 103
Query: 154 QRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLI-RNVVGHCLSGGG 212
+ P L GCGY+Q G + P+DG+LG+G+G + SQL Q I NV+GHCL G
Sbjct: 104 RDARPTLTFGCGYDQEGGKAEMPVDGVLGIGRGTRDLASQLKQQGAIAENVIGHCLRIQG 163
Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGET---TGLKNLPVVFDSGSS 269
GG+LFFG + SS V W M + YYSPG+A L F G + + VV DSGS+
Sbjct: 164 GGYLFFGHEKVPSSVVTWVPMVPN-NHYYSPGLAALHFNGNLGNPISVAPMEVVIDSGST 222
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
YTY+ TY+ L ++ LS SL D LP+CW G+ PFK + DVK F+ L L+
Sbjct: 223 YTYMPTETYRRLVFVVIASLSKSSLTLV-RDPALPVCWAGKEPFKXIGDVKDKFKPLELA 281
Query: 330 FTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
F G ++ + E+ PE YLIIS +GNVC+GIL+G + GL+ LNVIG I
Sbjct: 282 FIQGTSQAIMEIPPENYLIISGEGNVCMGILDGTQAGLRKLNVIGDI 328
>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 686
Score = 277 bits (708), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 155/366 (42%), Positives = 212/366 (57%), Gaps = 17/366 (4%)
Query: 24 SSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC 83
++S S F+ SS +F V G+VYP G Y +++G P R YFLD+DTGSDLTW+QCDAPC
Sbjct: 290 ATSVSAFD---SSTIFPVRGDVYPNGLYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPC 346
Query: 84 VRCVEAPHPLYRPSN-DLVPCEDPICASLHAPGHHN-CEDPAQCDYELEYADGGSSLGVL 141
C + P+PLY+P +LVP +D +C + CE QCDYE+EYAD SS+GVL
Sbjct: 347 TSCAKGPNPLYKPKKGNLVPLKDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVL 406
Query: 142 VKDAFAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
D NG + GC Y+Q + S DGILGL K K S+ SQL SQ++
Sbjct: 407 ASDDLHLMLANGSLTKLGIMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRI 466
Query: 200 IRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL 257
I NV+GHCL+ GGG++F GDD + W M + ++ Y + ++ G L
Sbjct: 467 INNVLGHCLTSDATGGGYMFLGDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGSRQLSL 526
Query: 258 -----KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRP 312
+ VVFD+GSSYTY + Y L + + K++S + L + D TLP+CW+ + P
Sbjct: 527 GRQDGRTERVVFDTGSSYTYFPKEAYYALVASL-KDVSDEGLIQDGSDPTLPVCWRAKFP 585
Query: 313 FKNVHDVKKCFRTLALSFTDG--KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDL 370
++V DVK+ F+ L L F T F + PE YLIISNKGNVCLGIL+G+ V
Sbjct: 586 IRSVIDVKQFFQPLTLQFRSKWWIVSTKFRIPPEGYLIISNKGNVCLGILDGSNVHDGST 645
Query: 371 NVIGGI 376
++G I
Sbjct: 646 IILGDI 651
>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
Length = 473
Score = 276 bits (707), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 155/366 (42%), Positives = 212/366 (57%), Gaps = 17/366 (4%)
Query: 24 SSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC 83
++S S F+ SS +F V G+VYP G Y +++G P R YFLD+DTGSDLTW+QCDAPC
Sbjct: 77 ATSVSAFD---SSTIFPVRGDVYPNGLYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPC 133
Query: 84 VRCVEAPHPLYRPSN-DLVPCEDPICASLHAPGHHN-CEDPAQCDYELEYADGGSSLGVL 141
C + P+PLY+P +LVP +D +C + CE QCDYE+EYAD SS+GVL
Sbjct: 134 TSCAKGPNPLYKPKKGNLVPLKDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVL 193
Query: 142 VKDAFAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
D NG + GC Y+Q + S DGILGL K K S+ SQL SQ++
Sbjct: 194 ASDDLHLMLANGSLTKLGIMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRI 253
Query: 200 IRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL 257
I NV+GHCL+ GGG++F GDD + W M + ++ Y + ++ G L
Sbjct: 254 INNVLGHCLTSDATGGGYMFLGDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGSRQLSL 313
Query: 258 -----KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRP 312
+ VVFD+GSSYTY + Y L + + K++S + L + D TLP+CW+ + P
Sbjct: 314 GRQDGRTERVVFDTGSSYTYFPKEAYYALVASL-KDVSDEGLIQDGSDPTLPVCWRAKFP 372
Query: 313 FKNVHDVKKCFRTLALSFTDGK--TRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDL 370
++V DVK+ F+ L L F T F + PE YLIISNKGNVCLGIL+G+ V
Sbjct: 373 IRSVIDVKQFFQPLTLQFRSKWWIVSTKFRIPPEGYLIISNKGNVCLGILDGSNVHDGST 432
Query: 371 NVIGGI 376
++G I
Sbjct: 433 IILGDI 438
>gi|356529585|ref|XP_003533370.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1388
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 162/363 (44%), Positives = 216/363 (59%), Gaps = 25/363 (6%)
Query: 33 VGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP 92
V SS +F V GNVYP G Y + +G P + YFLD+DTGSDLTW+QCDAPC+ C + H
Sbjct: 174 VDSSSVFPVSGNVYPDGLYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCISCGKGAHV 233
Query: 93 LYRPS-NDLVPCEDPICASL---HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
LY+P+ +++V D +C + GHH+ E QCDYE++YAD SSLGVLV+D
Sbjct: 234 LYKPTRSNVVSSVDALCLDVQKNQKNGHHD-ESLLQCDYEIQYADHSSSLGVLVRDELHL 292
Query: 149 NYTNGQRLNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVG 205
TNG + + GCGY+Q G + L DGI+GL + K S+ QL S+ LI+NVVG
Sbjct: 293 VTTNGSKTKLNVVFGCGYDQA-GLLLNTLGKTDGIMGLSRAKVSLPYQLASKGLIKNVVG 351
Query: 206 HCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTK--YYSP------GVAELFFGGETT 255
HCLS G GGG++F GDD + W M+ T Y + G +L F G++
Sbjct: 352 HCLSNDGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQLRFDGQSK 411
Query: 256 GLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
K +VFDSGSSYTY + Y L + + E+S L + D TLP+CW+ P K+
Sbjct: 412 VGK---MVFDSGSSYTYFPKEAYLDLVASL-NEVSGLGLVQDDSDTTLPICWQANFPIKS 467
Query: 316 VHDVKKCFRTLALSFTDG--KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVI 373
V DVK F+TL L F TLF+++PE YLIISNKG+VCLGIL+G+ V ++
Sbjct: 468 VKDVKDYFKTLTLRFGSKWWILSTLFQISPEGYLIISNKGHVCLGILDGSNVNDGSSIIL 527
Query: 374 GGI 376
G I
Sbjct: 528 GDI 530
>gi|242067693|ref|XP_002449123.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
gi|241934966|gb|EES08111.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
Length = 408
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 163/343 (47%), Positives = 208/343 (60%), Gaps = 26/343 (7%)
Query: 37 LLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC---DAPCVRCVEAPHPL 93
++F++ G+VYP G++ VTM IG+PA PYFLD+DTGS TWL+C D PC C + PHPL
Sbjct: 25 MVFKLDGSVYPVGHFYVTMNIGEPAEPYFLDIDTGSSFTWLECHAKDGPCKTCNKVPHPL 84
Query: 94 YRPS-NDLVPCEDPICASLHAP--GHHNCED--PAQCDYELEYADGGSSLGVLVKDAFAF 148
YR + LVPC DP+C +LH C D QCDY+++Y DG SSLGVL+ D F+
Sbjct: 85 YRLTRKKLVPCADPLCDALHKDLGTTKKCTDVRKNQCDYKVKYQDGLSSLGVLLLDKFSL 144
Query: 149 NYTNGQRLNPRLALGCGYNQVPGASYH-----PLDGILGLGKGKSSIVSQL-HSQKLIRN 202
T G R +A GCGY+Q+ G+ P+DGILGLG+G + SQL HS + +N
Sbjct: 145 P-TGGAR---NIAFGCGYDQMKGSKKKAPEKVPVDGILGLGRGSVDLASQLKHSGAVSKN 200
Query: 203 VVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDY---TKYYSPGVAELFFGGETTGLKN 259
V+GHCLS GGG+LF G++ SS V W M+ +YSPG A L G K
Sbjct: 201 VIGHCLSSKGGGYLFIGEENVPSSHVTWVPMAPTTPGEPNHYSPGQATLHLDSNPIGTKP 260
Query: 260 LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
L +FDSGS+YTYL + L S +K LS SLK+ D LPLCWKG +PFK VHD
Sbjct: 261 LKAIFDSGSTYTYLPENLHAQLVSALKASLSKSSLKQV-SDPALPLCWKGPKPFKTVHDT 319
Query: 320 KKCFRTLA-LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILN 361
K F++L L F G T + PE YLII+ GN C GIL+
Sbjct: 320 PKEFKSLVTLKFDLGVTMI---IPPENYLIITGHGNACFGILD 359
>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
gi|219888491|gb|ACL54620.1| unknown [Zea mays]
Length = 557
Score = 274 bits (701), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 156/376 (41%), Positives = 210/376 (55%), Gaps = 27/376 (7%)
Query: 16 RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
RM + ++++ ++ S+ L + GNV+P G Y +++IG P RPYFLD+DTGSDLT
Sbjct: 158 RMEVAKAATARTN------STALLPIKGNVFPDGQYYTSIFIGNPPRPYFLDVDTGSDLT 211
Query: 76 WLQCDAPCVRCVEAPHPLYRPSND-LVPCEDPICASLHAPGHHN-CEDPAQCDYELEYAD 133
W+QCDAPC + PHPLY+P+ + +VP D +C L G+ N CE QCDYE+EYAD
Sbjct: 212 WIQCDAPCTNFAKGPHPLYKPAKEKIVPPRDLLCQELQ--GNQNYCETCKQCDYEIEYAD 269
Query: 134 GGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIV 191
SS+GVL +D TNG R GC Y+Q P DGILGL S
Sbjct: 270 QSSSMGVLARDDMHMIATNGGREKLDFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFP 329
Query: 192 SQLHSQKLIRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELF 249
SQL S +I NV GHC++ GGGG++F GDD V WTS+ S Y +
Sbjct: 330 SQLASHGIIANVFGHCITREQGGGGYMFLGDDYVPRWGVTWTSIRSGPDNLYHTQAHHVK 389
Query: 250 FGGET-----TGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
+G + + V+FDSGSSYTYL Y+ L + +K ++ + D TLP
Sbjct: 390 YGDQQLRRPEQAGSTVQVIFDSGSSYTYLPNEIYENLVAAIK--YASPGFVQDTSDRTLP 447
Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKT----RTLFELTPEAYLIISNKGNVCLGIL 360
LCWK P + + DVK+ F L L F GK F ++PE YLIIS+KGNVCLG+L
Sbjct: 448 LCWKADFPVRYLEDVKQFFEPLNLHF--GKKWLFMSKTFTISPEDYLIISDKGNVCLGLL 505
Query: 361 NGAEVGLQDLNVIGGI 376
NG E+ ++G +
Sbjct: 506 NGTEINHGSTIIVGDV 521
>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 564
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 148/355 (41%), Positives = 203/355 (57%), Gaps = 17/355 (4%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
S++L + GNV+P G Y ++++G P RPYFLD+DTGSDLTW+QCDAPC C + PHPLY
Sbjct: 178 STVLLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLY 237
Query: 95 RPSND-LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
+P+ + +VP D +C L ++ C QCDYE+EYAD SS+GVL KD TNG
Sbjct: 238 KPAKEKIVPPRDLLCQELQGDQNY-CATCKQCDYEIEYADRSSSMGVLAKDDMHMIATNG 296
Query: 154 QRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG- 210
R GC Y+Q P DGILGL S+ SQL SQ +I NV GHC++
Sbjct: 297 GREKLDFVFGCAYDQQGQLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHCITKE 356
Query: 211 -GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL-----KNLPVVF 264
GGG++F GDD + W + Y ++ +G + + ++ V+F
Sbjct: 357 PNGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQQLRMHGQAGSSIQVIF 416
Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
DSGSSYTYL Y+ L + +K + S + D TLPLCWK + + DVK+ F+
Sbjct: 417 DSGSSYTYLPDEIYKKLVTAIKYDYP--SFVQDTSDTTLPLCWKADFDVRYLEDVKQFFK 474
Query: 325 TLALSFTDG---KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
L L F + RT F + P+ YLIIS+KGNVCLG+LNGAE+ ++G +
Sbjct: 475 PLNLHFGNRWFVIPRT-FTILPDDYLIISDKGNVCLGLLNGAEIDHASTLIVGDV 528
>gi|125554848|gb|EAZ00454.1| hypothetical protein OsI_22475 [Oryza sativa Indica Group]
Length = 538
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 150/358 (41%), Positives = 201/358 (56%), Gaps = 23/358 (6%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
SS L + GNV+P G Y +MYIG P RPYFLD+DTGSDLTW+QCDAPC C + PHPLY
Sbjct: 143 SSALLPIRGNVFPDGQYYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLY 202
Query: 95 RPSN-DLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTN 152
+P ++VP D C L G+ N D + QCDYE+ YAD SS+G+L +D +
Sbjct: 203 KPEKPNVVPPRDSYCQELQ--GNQNYGDTSKQCDYEITYADRSSSMGILARDNMQLITAD 260
Query: 153 GQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
G+R N GCGY+Q P DGILGL S+ +QL SQ +I NV GHC++
Sbjct: 261 GERENLDFVFGCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAA 320
Query: 211 --GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN-----LPVV 263
GG++F GDD + W + + YS V ++ +G + ++ V+
Sbjct: 321 DPSNGGYMFLGDDYVPRWGMTWMPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQVI 380
Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
FDSGSSYTYL Y L + +K + E+ D TLP C K P +++ DVK F
Sbjct: 381 FDSGSSYTYLPHDDYTNLIASLKSLSPSLLQDES--DRTLPFCMKPNFPVRSMDDVKHLF 438
Query: 324 RTLALSFTDGKTRTL-----FELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
+ L+L F K R F + PE YLIIS+K N+CLG+L+G E+G VIG +
Sbjct: 439 KPLSLVF---KKRLFILPRTFVIPPEDYLIISDKNNICLGVLDGTEIGHDSAIVIGDV 493
>gi|115467508|ref|NP_001057353.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|53791766|dbj|BAD53531.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|53793187|dbj|BAD54393.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113595393|dbj|BAF19267.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|125596798|gb|EAZ36578.1| hypothetical protein OsJ_20919 [Oryza sativa Japonica Group]
gi|215767941|dbj|BAH00170.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 538
Score = 265 bits (676), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 150/358 (41%), Positives = 201/358 (56%), Gaps = 23/358 (6%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
SS L + GNV+P G Y +MYIG P RPYFLD+DTGSDLTW+QCDAPC C + PHPLY
Sbjct: 143 SSALLPIRGNVFPDGQYYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLY 202
Query: 95 RPSN-DLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTN 152
+P ++VP D C L G+ N D + QCDYE+ YAD SS+G+L +D +
Sbjct: 203 KPEKPNVVPPRDSYCQELQ--GNQNYGDTSKQCDYEITYADRSSSMGILARDNMQLITAD 260
Query: 153 GQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
G+R N GCGY+Q P DGILGL S+ +QL SQ +I NV GHC++
Sbjct: 261 GERENLDFVFGCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAA 320
Query: 211 --GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN-----LPVV 263
GG++F GDD + W + + YS V ++ +G + ++ V+
Sbjct: 321 DPSNGGYMFLGDDYVPRWGMTWMPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQVI 380
Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
FDSGSSYTYL Y L + +K + E+ D TLP C K P +++ DVK F
Sbjct: 381 FDSGSSYTYLPHDDYTNLIASLKSLSPSLLQDES--DRTLPFCMKPNFPVRSMDDVKHLF 438
Query: 324 RTLALSFTDGKTRTL-----FELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
+ L+L F K R F + PE YLIIS+K N+CLG+L+G E+G VIG +
Sbjct: 439 KPLSLVF---KKRLFILPRTFVIPPEDYLIISDKNNICLGVLDGTEIGHDSAIVIGDV 493
>gi|356522749|ref|XP_003530008.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1336
Score = 264 bits (675), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 158/363 (43%), Positives = 212/363 (58%), Gaps = 25/363 (6%)
Query: 33 VGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP 92
V SS +F V GNVYP G Y + +G P + YFLD+DTGSDLTW+QCDAPC C + H
Sbjct: 176 VDSSSVFPVSGNVYPDGLYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCRSCGKGAHV 235
Query: 93 LYRPS-NDLVPCEDPICASL---HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
Y+P+ +++V D +C + GHH+ E QCDYE++YAD SSLGVLV+D
Sbjct: 236 QYKPTRSNVVSSVDSLCLDVQKNQKNGHHD-ESLLQCDYEIQYADHSSSLGVLVRDELHL 294
Query: 149 NYTNGQRLNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVG 205
TNG + + GCGY+Q G + L DGI+GL + K S+ QL S+ LI+NVVG
Sbjct: 295 VTTNGSKTKLNVVFGCGYDQ-EGLILNTLAKTDGIMGLSRAKVSLPYQLASKGLIKNVVG 353
Query: 206 HCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTK--YYSP------GVAELFFGGETT 255
HCLS G GGG++F GDD + W M+ T Y + G +L F G++
Sbjct: 354 HCLSNDGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQLKFDGQSK 413
Query: 256 GLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
K V FDSGSSYTY + Y L + + E+S L + D TLP+CW+ ++
Sbjct: 414 VGK---VFFDSGSSYTYFPKEAYLDLVASL-NEVSGLGLVQDDSDTTLPICWQANFQIRS 469
Query: 316 VHDVKKCFRTLALSFTDG--KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVI 373
+ DVK F+TL L F TLF++ PE YLIISNKG+VCLGIL+G++V ++
Sbjct: 470 IKDVKDYFKTLTLRFGSKWWILSTLFQIPPEGYLIISNKGHVCLGILDGSKVNDGSSIIL 529
Query: 374 GGI 376
G I
Sbjct: 530 GDI 532
>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 551
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 145/352 (41%), Positives = 198/352 (56%), Gaps = 21/352 (5%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
S++L + GNV+P G Y ++++G P RPYFLD+DTGSDLTW+QCDAPC C + PHPLY
Sbjct: 175 STVLLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLY 234
Query: 95 RPSND-LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
+P+ + +VP D +C L ++ CE QCDYE+EYAD SS+GVL KD TNG
Sbjct: 235 KPAKEKIVPPRDSLCQELQGDQNY-CETCKQCDYEIEYADRSSSMGVLAKDDMHLIATNG 293
Query: 154 QRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS-- 209
R GC Y+Q P DGILGL S+ SQL S+ +I NV GHC++
Sbjct: 294 GREKLDFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGHCITRE 353
Query: 210 GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN-LPVVFDSGS 268
GGG++F GDD + W + Y ++ +G + N + V+FDSGS
Sbjct: 354 TNGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQELHAGNSVQVIFDSGS 413
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
SYTYL Y+ L +K++ + S + D TLPLCWK V+ F+ L L
Sbjct: 414 SYTYLPEEMYKNLIDAIKED--SPSFVQDSSDTTLPLCWKAD------FSVRSFFKPLNL 465
Query: 329 SFTDGK----TRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
F G+ F + P+ YLIIS+KGNVCLG+LNG E+ ++G +
Sbjct: 466 HF--GRRWFVVPKTFTIVPDDYLIISDKGNVCLGLLNGTEINHGSTIIVGDV 515
>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
Length = 573
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 148/356 (41%), Positives = 206/356 (57%), Gaps = 23/356 (6%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
S+ L + GNV+P G Y ++++G P RPYFLD+DTGSDLTW+QCDAPC C + PHPLY
Sbjct: 187 STALLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLY 246
Query: 95 RPSND-LVPCEDPICASLHAPGHHN-CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN 152
+P+ + +VP +D +C L G+ N CE QCDYE+EYAD SS+GVL +D TN
Sbjct: 247 KPAKEKIVPPKDLLCQELQ--GNQNYCETCKQCDYEIEYADRSSSMGVLARDDMHIITTN 304
Query: 153 GQRLNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
G R GC Y+Q AS DGILGL S+ SQL +Q +I NV GHC++
Sbjct: 305 GGREKLDFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITR 364
Query: 211 --GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK-----NLPVV 263
GGG++F GDD + T + S + ++++G + ++ ++ V+
Sbjct: 365 DPNGGGYMFLGDDYVPRWGMTSTPIRSAPDNLFHTEAQKVYYGDQQLSMRGASGNSVQVI 424
Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
FDSGSSYTYL Y+ L + +K + + + D TLPLC P + + DVK+ F
Sbjct: 425 FDSGSSYTYLPDEIYKNLIAAIK--YAYPNFVQDSSDRTLPLCLATDFPVRYLEDVKQLF 482
Query: 324 RTLALSFTDGKT-----RTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
+ L L F GK RT F + P+ YLIIS+KGNVCLG LNG ++ ++G
Sbjct: 483 KPLNLHF--GKRWFVMPRT-FTILPDNYLIISDKGNVCLGFLNGKDIDHGSTVIVG 535
>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
Length = 574
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 148/356 (41%), Positives = 206/356 (57%), Gaps = 23/356 (6%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
S+ L + GNV+P G Y ++++G P RPYFLD+DTGSDLTW+QCDAPC C + PHPLY
Sbjct: 188 STALLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLY 247
Query: 95 RPSND-LVPCEDPICASLHAPGHHN-CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN 152
+P+ + +VP +D +C L G+ N CE QCDYE+EYAD SS+GVL +D TN
Sbjct: 248 KPAKEKIVPPKDLLCQELQ--GNQNYCETCKQCDYEIEYADRSSSMGVLARDDMHIITTN 305
Query: 153 GQRLNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
G R GC Y+Q AS DGILGL S+ SQL +Q +I NV GHC++
Sbjct: 306 GGREKLDFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITR 365
Query: 211 --GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK-----NLPVV 263
GGG++F GDD + T + S + ++++G + ++ ++ V+
Sbjct: 366 DPNGGGYMFLGDDYVPRWGMTSTPIRSAPDNLFHTEAQKVYYGDQQLSMRGASGNSVQVI 425
Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
FDSGSSYTYL Y+ L + +K + + + D TLPLC P + + DVK+ F
Sbjct: 426 FDSGSSYTYLPDEIYKNLIAAIK--YAYPNFVQDSSDRTLPLCLATDFPVRYLEDVKQLF 483
Query: 324 RTLALSFTDGKT-----RTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
+ L L F GK RT F + P+ YLIIS+KGNVCLG LNG ++ ++G
Sbjct: 484 KPLNLHF--GKRWFVMPRT-FTILPDNYLIISDKGNVCLGFLNGKDIDHGSTVIVG 536
>gi|297852200|ref|XP_002893981.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
gi|297339823|gb|EFH70240.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
Length = 354
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 141/346 (40%), Positives = 188/346 (54%), Gaps = 62/346 (17%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
SS++ + GNV+P GYY+V + IG P + + D+DTGSDLTW+QCDAPC C P Y
Sbjct: 38 SSVVLPLSGNVFPLGYYSVLLQIGTPPKAFEFDIDTGSDLTWVQCDAPCTGCTLPPIRQY 97
Query: 95 RPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
+P + VPC DPIC +LH P C +P QCDYE+ YAD GSS+G LV D F NG
Sbjct: 98 KPKGNTVPCLDPICLALHFPNKPQCPNPKEQCDYEVNYADQGSSMGALVIDQFPLKLLNG 157
Query: 154 QRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
+ PRLA GCGY+Q+ ++ P G+LGLG+GK ++ QL + L RNVVGHCLS
Sbjct: 158 SAMQPRLAFGCGYDQILPKAHPPPATAGVLGLGRGKIGVLPQLVAAGLTRNVVGHCLSSK 217
Query: 212 GGGFLFFGDDLYDSSRVVWTS-MSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
GGG+LFFGD L + V WT +S +YT ++
Sbjct: 218 GGGYLFFGDTLIPTLGVAWTPLLSPEYTFFF----------------------------- 248
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
++ R Q + K L K+ FK + ++F
Sbjct: 249 -HICRDRLQRDYTFFKSVLEFKNF------------------FKTI----------TINF 279
Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
T+ + T ++ PE+YLIIS GN CLG+LNG+EVGLQ+ NVIG I
Sbjct: 280 TNARRITQLQIPPESYLIISKTGNACLGLLNGSEVGLQNSNVIGDI 325
>gi|449439393|ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 570
Score = 253 bits (647), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 145/345 (42%), Positives = 199/345 (57%), Gaps = 17/345 (4%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
SS +F V G++YP G Y + +G+P RPYFLD+DTGSDLTW+QCDAPC C + PLY
Sbjct: 183 SSAVFPVRGDIYPDGLYYTYIMVGEPPRPYFLDIDTGSDLTWVQCDAPCSSCGKGRSPLY 242
Query: 95 RPSND-LVPCEDPICASLHAP-GHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN 152
+P + +V +D +C + C QC+YE++YAD SSLGVLVKD F ++N
Sbjct: 243 KPRRENVVSFKDSLCMEVQRNYDGDQCAACQQCNYEVQYADQSSSLGVLVKDEFTLRFSN 302
Query: 153 GQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
G GC Y+Q + + DGILGL + K S+ SQL S+ +I NVVGHCL+G
Sbjct: 303 GSLTKLNAIFGCAYDQQGLLLNTLSKTDGILGLSRAKVSLPSQLASRGIINNVVGHCLTG 362
Query: 211 --GGGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFGG-----ETTGLKNLPV 262
GGG+LF GDD + W +M S +Y V + +G +T G V
Sbjct: 363 DPAGGGYLFLGDDFVPQWGMAWVAMLDSPSIDFYQTKVVRIDYGSIPLSLDTWGSSREQV 422
Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
VFDSGSSYTY + Y L + + +E+SA L +D + +CWK + ++V DVK
Sbjct: 423 VFDSGSSYTYFTKEAYYQLVANL-EEVSAFGL--ILQDSSDTICWKTEQSIRSVKDVKHF 479
Query: 323 FRTLALSFTD--GKTRTLFELTPEAYLIISNKGNVCLGILNGAEV 365
F+ L L F T + PE YL+I+ +GNVCLGIL+G++V
Sbjct: 480 FKPLTLQFGSRFWLVSTKLVILPENYLLINKEGNVCLGILDGSQV 524
>gi|413953655|gb|AFW86304.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 535
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 146/370 (39%), Positives = 204/370 (55%), Gaps = 38/370 (10%)
Query: 25 SSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA-PC 83
+S+LF H + GN++P G Y + +G P RPYFLD+DTGS TW+QCDA PC
Sbjct: 141 QNSTLFPH-------SLAGNLFPEGLYYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPC 193
Query: 84 VRCVEAPHPLYRPSN--DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVL 141
C + HPLYRP+ D +P DP+C E+P QCDYE+ YADG SS+GV
Sbjct: 194 ASCAKGAHPLYRPARTADALPASDPLCEGAQH------ENPNQCDYEISYADGSSSMGVY 247
Query: 142 VKDAFAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
V+D+ F +G+R N + GCGY+Q V + DG+LGL S+ +QL S+ +
Sbjct: 248 VRDSMQFVGEDGERENADIVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLASRGI 307
Query: 200 IRNVVGHCLS---GGGGGFLFFGDDLYDSSRVVWTSMSS--------DYTKYYSPGVAEL 248
I N GHC+S G GG+LF GDD + W + K + G +L
Sbjct: 308 ISNAFGHCMSTDPSGAGGYLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQL 367
Query: 249 FFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
G+ T VVFD+GS+YTY L S +K+ S + +++ D+TLP C K
Sbjct: 368 NAQGKLTQ-----VVFDTGSTYTYFPDEALTRLISSLKEAASPRFVQDD-SDKTLPFCMK 421
Query: 309 GRRPFKNVHDVKKCFRTLALSFTDG--KTRTLFELTPEAYLIISNKGNVCLGILNGAEVG 366
P ++V DVK F+ L+L F +RT F + PE YL+IS+KGNVCLG+LNG +G
Sbjct: 422 SDFPVRSVEDVKHFFKPLSLQFEKRFFFSRT-FNIRPEHYLVISDKGNVCLGVLNGTTIG 480
Query: 367 LQDLNVIGGI 376
+ ++G +
Sbjct: 481 YDSVVIVGDV 490
>gi|326533540|dbj|BAK05301.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 144/353 (40%), Positives = 201/353 (56%), Gaps = 22/353 (6%)
Query: 36 SLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV----EAPH 91
++ F + GNVYP G++ T+ IG+PA+PYFLD+DTGS+LTWL+C P C PH
Sbjct: 23 AIKFPLEGNVYPVGHFYATLNIGEPAKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRPPH 82
Query: 92 PLYRPS--NDLVPCEDPICASLH--APGHHNC--EDPAQCDYELEYADGGSSLGVLVKDA 145
P Y P+ N V C P+C ++ PG C DP +C YE++Y G S G L D
Sbjct: 83 PYYTPADGNLKVVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVTGKSE-GDLATDI 141
Query: 146 FAFNYTNGQRLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-N 202
+ N R R+A GCGY Q A P+DGILGLG GK+ + +QL K+I+ N
Sbjct: 142 ISVN----GRDKKRIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKEN 197
Query: 203 VVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLP 261
V+GHCLS G G L+ GD + V W M YYSPG+AE+F + G
Sbjct: 198 VIGHCLSSKGKGVLYVGDFNPPTRGVTWAPMRESLF-YYSPGLAEVFIDKQPIRGNPTFE 256
Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
VFDSGS+YT++ Y + S ++ LS SL+E + LPLCWKG++PF +V+DVK
Sbjct: 257 AVFDSGSTYTHVPAQIYNEIVSKVRVTLSESSLEEV-KGRALPLCWKGKKPFGSVNDVKN 315
Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGA-EVGLQDLNVI 373
F+ L+L T + + ++ P+ YL + G CL IL+ + + L++LN I
Sbjct: 316 QFKALSLKITHARGTSNLDIPPQNYLFVKEDGETCLAILDASLDPVLKELNFI 368
>gi|2290202|gb|AAB96882.1| nucellin [Hordeum vulgare subsp. vulgare]
gi|2290204|gb|AAB96883.1| nucellin [Hordeum vulgare subsp. vulgare]
gi|45357050|gb|AAS58479.1| nucellin [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 144/353 (40%), Positives = 199/353 (56%), Gaps = 22/353 (6%)
Query: 36 SLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV----EAPH 91
++ F + GNVYP G++ T+ IG+PA+PYFLD+DTGS+LTWL+C P C PH
Sbjct: 23 AIKFPLEGNVYPVGHFYATLNIGEPAKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRPPH 82
Query: 92 PLYRPS--NDLVPCEDPICASLH--APGHHNC--EDPAQCDYELEYADGGSSLGVLVKDA 145
P Y P+ N V C P+C ++ PG C DP +C YE++Y G S G L D
Sbjct: 83 PYYTPADGNLKVVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVTGKSE-GDLATDI 141
Query: 146 FAFNYTNGQRLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-N 202
+ N R R+A GCGY Q A P+DGILGLG GK+ +QL K+I+ N
Sbjct: 142 ISVN----GRDKKRIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGHKMIKEN 197
Query: 203 VVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLP 261
V+GHCLS G G L+ GD + V W M YYSPG+AE+F + G
Sbjct: 198 VIGHCLSSKGKGVLYVGDFNPPTRGVTWAPMRESLF-YYSPGLAEVFIDKQPIRGNPTFE 256
Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
VFDSGS+YT++ Y + S ++ LS SL+E + LPLCWKG++PF +V+DVK
Sbjct: 257 AVFDSGSTYTHVPAQIYNEIVSKVRGTLSESSLEEV-KGRALPLCWKGKKPFGSVNDVKN 315
Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGA-EVGLQDLNVI 373
F+ L+L T + ++ P+ YL + G CL IL+ + + L++LN I
Sbjct: 316 QFKALSLKITHARGTNNLDIPPQNYLFVKEDGETCLAILDASLDPVLKELNFI 368
>gi|326498555|dbj|BAJ98705.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 508
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 144/351 (41%), Positives = 194/351 (55%), Gaps = 29/351 (8%)
Query: 45 VYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND-LVPC 103
V P Y ++ IG PARPYFLD+DTGS LTW+QCDAPC C + PHPLY+P+ + +VP
Sbjct: 123 VLPERQYYTSINIGNPARPYFLDVDTGSALTWIQCDAPCTNCTKGPHPLYKPAKENIVPP 182
Query: 104 EDPICASLHAPGHHN-CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
D C L G+ N C+ QCDYE+ YAD SS GVL +D +G+R N L
Sbjct: 183 RDSHCQELQ--GNQNYCDTCKQCDYEIAYADRSSSAGVLARDNMELITADGERENMDLVF 240
Query: 163 GCGYNQ------VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGG 214
GC ++Q P +S DGILGL G S+ +QL Q +I NV GHC++ G
Sbjct: 241 GCAHDQQGKLLGSPASS----DGILGLSNGAMSLPTQLAKQGIISNVFGHCIATDPSGSA 296
Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN-----LPVVFDSGSS 269
++F GDD + W + + YS V ++ +G + ++ V+FDSGSS
Sbjct: 297 YMFLGDDYVPRWGMTWVPVRNGPEDVYSTVVQKVNYGCQELNVREQAGKLTQVIFDSGSS 356
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
YTY Y +L I E + D+TLP C K P ++V DVK+ + L L
Sbjct: 357 YTYFPHEIYTSL--ITSLEAVSPGFVRDESDQTLPFCMKPNFPVRSVDDVKQLHKPLLLH 414
Query: 330 FTDGKTRTL----FELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
F+ KT + FE++PE YLIIS KGNVCLG+L+G E+G VIG +
Sbjct: 415 FS--KTWLVIPRTFEISPENYLIISGKGNVCLGVLDGTEIGHSSTIVIGDV 463
>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 578
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 149/362 (41%), Positives = 199/362 (54%), Gaps = 23/362 (6%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQP--ARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP 92
S+ +F V GNVYP G Y + +G+P + Y LD+DTGSDLTW+QCDAPC C + +
Sbjct: 182 STTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSDLTWIQCDAPCTSCAKGANQ 241
Query: 93 LYRPSND-LVPCEDPICASLHAPG-HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNY 150
LY+P D LV +P C + +CE QCDYE+EYAD S+GVL KD F
Sbjct: 242 LYKPRKDNLVRSSEPFCVEVQRNQLTEHCESCHQCDYEIEYADHSYSMGVLTKDKFHLKL 301
Query: 151 TNGQRLNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
NG + GCGY+Q G + L DGILGL + K S+ SQL S+ +I NVVGHC
Sbjct: 302 HNGSLAESDIVFGCGYDQ-QGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHC 360
Query: 208 LSG--GGGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFGGETTGLKNL---- 260
L+ G G++F G DL S + W M + + Y V ++ +G L
Sbjct: 361 LASDLNGEGYIFMGSDLVPSHGMTWVPMLHHPHLEVYQMQVTKMSYGNAMLSLDGENGRV 420
Query: 261 -PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR--RPFKNVH 317
V+FD+GSSYTY Y L + + +E+S L DE LP+CW+ + P ++
Sbjct: 421 GKVLFDTGSSYTYFPNQAYSQLVTSL-QEVSDLELTRDDSDEALPICWRAKTNSPISSLS 479
Query: 318 DVKKCFRTLALSFTDG---KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
DVKK FR + L ++ L + PE YLIISNKGNVCLGIL+G+ V +IG
Sbjct: 480 DVKKFFRPITLQIGSKWLIISKKLL-IQPEDYLIISNKGNVCLGILDGSNVHDGSTIIIG 538
Query: 375 GI 376
I
Sbjct: 539 DI 540
>gi|18402471|ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15 [Arabidopsis thaliana]
gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein [Arabidopsis thaliana]
gi|14532748|gb|AAK64075.1| unknown protein [Arabidopsis thaliana]
gi|332194267|gb|AEE32388.1| aspartyl protease [Arabidopsis thaliana]
Length = 583
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 150/362 (41%), Positives = 200/362 (55%), Gaps = 23/362 (6%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQP--ARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP 92
S+ +F V GNVYP G Y + +G+P + Y LD+DTGS+LTW+QCDAPC C + +
Sbjct: 187 STTIFPVGGNVYPDGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQ 246
Query: 93 LYRPSND-LVPCEDPICASLHAPG-HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNY 150
LY+P D LV + C + +CE+ QCDYE+EYAD S+GVL KD F
Sbjct: 247 LYKPRKDNLVRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKL 306
Query: 151 TNGQRLNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
NG + GCGY+Q G + L DGILGL + K S+ SQL S+ +I NVVGHC
Sbjct: 307 HNGSLAESDIVFGCGYDQ-QGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHC 365
Query: 208 LSG--GGGGFLFFGDDLYDSSRVVWTSMSSD-YTKYYSPGVAELFFGGETTGLKNL---- 260
L+ G G++F G DL S + W M D Y V ++ +G L
Sbjct: 366 LASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGENGRV 425
Query: 261 -PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR--PFKNVH 317
V+FD+GSSYTY Y L + + +E+S L DETLP+CW+ + PF ++
Sbjct: 426 GKVLFDTGSSYTYFPNQAYSQLVTSL-QEVSGLELTRDDSDETLPICWRAKTNFPFSSLS 484
Query: 318 DVKKCFRTLALSFTDG---KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
DVKK FR + L +R L + PE YLIISNKGNVCLGIL+G+ V ++G
Sbjct: 485 DVKKFFRPITLQIGSKWLIISRKLL-IQPEDYLIISNKGNVCLGILDGSSVHDGSTIILG 543
Query: 375 GI 376
I
Sbjct: 544 DI 545
>gi|357124567|ref|XP_003563970.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 395
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 136/343 (39%), Positives = 186/343 (54%), Gaps = 17/343 (4%)
Query: 45 VYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN-DLVPC 103
V P Y ++ IG P RPYFLD+DTGSD TW+ CDAPC C + PHP+Y+P+ +V
Sbjct: 10 VVPERQYYTSINIGNPPRPYFLDIDTGSDFTWIHCDAPCTNCTKGPHPVYKPTEGKIVHP 69
Query: 104 EDPICASLHAPGHHN-CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
DP+C L G+ N CE QCDYE+ YAD SS GVL +D +G+ N
Sbjct: 70 RDPLCEELQ--GNQNYCETCKQCDYEITYADRSSSKGVLARDNMQLTTADGEMKNVDFVF 127
Query: 163 GCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFF 218
GC +NQ P DGILGL G S+ +QL + +I NV GHC++ GG++F
Sbjct: 128 GCAHNQQGKLLDSPTSTDGILGLSNGAISLSTQLANSGIISNVFGHCMATDPSSGGYMFL 187
Query: 219 GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN-----LPVVFDSGSSYTYL 273
GDD + W + + YS V ++ +G + L+ V+FDSGSSYTY
Sbjct: 188 GDDYVPRWGMTWVPIRNGPGNVYSTEVPKVNYGAQELNLRGQAGKLTQVIFDSGSSYTYF 247
Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
Y L +++ E ++ D+TLP C K P ++V DV++ F L L
Sbjct: 248 PHEIYTNLIALL--EDASPGFVRDESDQTLPFCMKPNVPVRSVGDVEQLFNPLILQLRKR 305
Query: 334 --KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
T F ++PE YLIIS+KGNVCLG+L+G E+G +IG
Sbjct: 306 WFVIPTTFAISPENYLIISDKGNVCLGVLDGTEIGHSSTIIIG 348
>gi|2570402|gb|AAB97155.1| EEA1 [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 141/353 (39%), Positives = 199/353 (56%), Gaps = 22/353 (6%)
Query: 36 SLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV----EAPH 91
++ F + GNVYP G++ T+ IG+PA+PYFLD+DTGS+LTWL+C P C PH
Sbjct: 23 AINFPLEGNVYPVGHFYATLNIGEPAKPYFLDVDTGSNLTWLECHPPVHGCKGCHPRPPH 82
Query: 92 PLYRPSND--LVPCEDPICASLH--APGHHNC--EDPAQCDYELEYADGGSSLGVLVKDA 145
P Y P++ V C P+C ++ PG C DP +C YE++Y G S G L D
Sbjct: 83 PYYTPADGKLKVVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVTGKSE-GDLATDI 141
Query: 146 FAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIR-N 202
+ N R R+A GCGY Q P + P++GILGLG GK+ +QL K+I+ N
Sbjct: 142 ISVN----GRDKKRIAFGCGYKQEEPPDSPPSPVNGILGLGMGKAGFAAQLKGLKMIKEN 197
Query: 203 VVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLP 261
V+GHCLS G G L+ GD + V W M YYSPG+AE+F + G
Sbjct: 198 VIGHCLSSKGKGVLYVGDFNPPTRGVTWAPMRESLF-YYSPGLAEVFIDKQPIRGNPTFE 256
Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
VFDSGS+YT++ Y + S ++ S SL+E + LPLCWKG++PF +V+DVK
Sbjct: 257 AVFDSGSTYTHVPAQIYNEIVSKVRGTFSESSLEEV-KGRALPLCWKGKKPFGSVNDVKN 315
Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGA-EVGLQDLNVI 373
F+ L+L T + ++ P+ YL + G CL IL+ + + L++LN I
Sbjct: 316 QFKALSLKITHARGTNNLDIPPQNYLFVKEDGETCLAILDASLDPVLKELNFI 368
>gi|224130234|ref|XP_002328687.1| predicted protein [Populus trichocarpa]
gi|222838863|gb|EEE77214.1| predicted protein [Populus trichocarpa]
Length = 603
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 147/407 (36%), Positives = 206/407 (50%), Gaps = 61/407 (14%)
Query: 16 RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
++S +SS++++++ SS +F V GN+YP G P +PY+LD DTGSDLT
Sbjct: 169 KISKLASSNAAAAM----DSSAIFPVRGNLYPDG----------PPQPYYLDFDTGSDLT 214
Query: 76 WLQCDAPCVRCVEAPHPLYRPSN-DLVPCEDPICASLHAPGHHN-CEDPAQCDYELEYAD 133
W+QCDAPC C + + Y+P ++VP +D +C + CE QCDYE+EYAD
Sbjct: 215 WIQCDAPCTSCAKGANAWYKPRRGNIVPPKDLLCMEVQRNQKAGYCETCDQCDYEIEYAD 274
Query: 134 GGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGKGKSSIV 191
SS+GVL D NG GC Y+Q + + DGILGL + K S+
Sbjct: 275 HSSSMGVLATDKLLLMVANGSLTKLNFIFGCAYDQQGLLLKTLVKTDGILGLSRAKVSLP 334
Query: 192 SQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAEL 248
SQL SQ +I NV+GHCL+ GGGG++F GDD + W M S ++Y V +L
Sbjct: 335 SQLASQGIINNVIGHCLTTDLGGGGYMFLGDDFVPRWGMAWVPMLDSPSMEFYHTEVVKL 394
Query: 249 FFGGETTGLKNLP-----VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETL 303
+G L + ++FDSGSSYTY + Y L + + E+S L ++ D TL
Sbjct: 395 NYGSSPLSLGGMESRVKHILFDSGSSYTYFPKEAYSELVASL-NEVSGAGLVQSTSDTTL 453
Query: 304 PLCWKGRRPFKNV--------------------------------HDVKKCFRTLALSFT 331
PLCW+ P + DVKK F+TL F
Sbjct: 454 PLCWRANFPIRKFIYRTELTRPIRRRRRRRRRRRRRRRRRRQHIKGDVKKFFKTLTFQFG 513
Query: 332 DG--KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
T F + PE YL++S+KGNVCLGIL G++V ++G I
Sbjct: 514 TKWLVISTKFRIPPEGYLMMSDKGNVCLGILEGSKVHDGSTIILGDI 560
>gi|145324889|ref|NP_001077691.1| aspartyl protease [Arabidopsis thaliana]
gi|332194268|gb|AEE32389.1| aspartyl protease [Arabidopsis thaliana]
Length = 410
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 143/347 (41%), Positives = 193/347 (55%), Gaps = 25/347 (7%)
Query: 51 YNVTMYIGQP--ARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND-LVPCEDPI 107
Y + +G+P + Y LD+DTGS+LTW+QCDAPC C + + LY+P D LV +
Sbjct: 30 YYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNLVRSSEAF 89
Query: 108 CASLHAPG-HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C + +CE+ QCDYE+EYAD S+GVL KD F NG + GCGY
Sbjct: 90 CVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLAESDIVFGCGY 149
Query: 167 NQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDD 221
+Q G + L DGILGL + K S+ SQL S+ +I NVVGHCL+ G G++F G D
Sbjct: 150 DQ-QGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMGSD 208
Query: 222 LYDSSRVVWTSMSSD-YTKYYSPGVAELFFGGETTGL-----KNLPVVFDSGSSYTYL-N 274
L S + W M D Y V ++ +G L + V+FD+GSSYTY N
Sbjct: 209 LVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGENGRVGKVLFDTGSSYTYFPN 268
Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR--PFKNVHDVKKCFRTLALSFTD 332
+ Q +TS+ +E+S L DETLP+CW+ + PF ++ DVKK FR + L
Sbjct: 269 QAYSQLVTSL--QEVSGLELTRDDSDETLPICWRAKTNFPFSSLSDVKKFFRPITLQIGS 326
Query: 333 G---KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
+R L + PE YLIISNKGNVCLGIL+G+ V ++G I
Sbjct: 327 KWLIISRKLL-IQPEDYLIISNKGNVCLGILDGSSVHDGSTIILGDI 372
>gi|62954897|gb|AAY23266.1| Similar to nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|77548966|gb|ABA91763.1| Aspartic proteinase Asp1 precursor, putative [Oryza sativa Japonica
Group]
Length = 307
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 116/296 (39%), Positives = 162/296 (54%), Gaps = 44/296 (14%)
Query: 100 LVPCEDPICASLHAPGHH---NCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
+V +DP+ +LH G N P QCDYE++YADG S++G L+ D F+ +
Sbjct: 1 MVRADDPLYVALHEDGRSGDGNHMSPTQCDYEIKYADGASTIGALIVDQFSLPRIATR-- 58
Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFL 216
P L GCGYNQ G ++ + LG + ++VVGHCLS GGGG L
Sbjct: 59 -PNLPFGCGYNQGIGENFQQTSPLKMLGI-------------ITKHVVGHCLSSGGGGLL 104
Query: 217 FFGDDLYDSSRV-----------VWTSMSSDYTK-----YYSPGVAELFFGGETTGLKNL 260
F GD D + V + S S Y + YYSPG A L+F + G+ +
Sbjct: 105 FVGDG--DGNLVLLHASLGSLCPIAISTPSSYNEPMLMNYYSPGSATLYFDRHSLGMNPM 162
Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
VVFDSGS+YTY YQ +K LS+ SL++ D +LPLCWKG++ F++V DVK
Sbjct: 163 DVVFDSGSTYTYFTAQPYQATVYAIKGGLSSTSLEQV-SDPSLPLCWKGQKAFESVFDVK 221
Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
K F++L L+F + + E+ PE YLI++ GNVCLGIL+G + + N+IG I
Sbjct: 222 KEFKSLQLNFGN---NAVMEIPPENYLIVTEYGNVCLGILHGCRL---NFNIIGDI 271
>gi|413953656|gb|AFW86305.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 406
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 107/278 (38%), Positives = 145/278 (52%), Gaps = 34/278 (12%)
Query: 12 FPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTG 71
+P +S+LF H + GN++P G Y + +G P RPYFLD+DTG
Sbjct: 128 YPKPPRRGGDDWPQNSTLFPH-------SLAGNLFPEGLYYTAISLGSPPRPYFLDVDTG 180
Query: 72 SDLTWLQCDA-PCVRCVEAPHPLYRPSN--DLVPCEDPICASLHAPGHHNCEDPAQCDYE 128
S TW+QCDA PC C + HPLYRP+ D +P DP+C E+P QCDYE
Sbjct: 181 SHTTWVQCDAPPCASCAKGAHPLYRPARTADALPASDPLCEGAQH------ENPNQCDYE 234
Query: 129 LEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGKG 186
+ YADG SS+GV V+D+ F +G+R N + GCGY+Q V + DG+LGL
Sbjct: 235 ISYADGSSSMGVYVRDSMQFVGEDGERENADIVFGCGYDQQGVLLNALETTDGVLGLTNK 294
Query: 187 KSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFGDDLYDSSRVVWTSMSSD------- 236
S+ +QL S+ +I N GHC+S G GG+LF GDD + W +
Sbjct: 295 ALSLPTQLASRGIISNAFGHCMSTDPSGAGGYLFLGDDYIPRWGMTWVPIRDGPADDVRR 354
Query: 237 -YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYL 273
K + G +L G+ T VVFD+GS+YTY
Sbjct: 355 AQVKQINHGDQQLNAQGKLTQ-----VVFDTGSTYTYF 387
>gi|224097210|ref|XP_002334633.1| predicted protein [Populus trichocarpa]
gi|222873871|gb|EEF11002.1| predicted protein [Populus trichocarpa]
Length = 143
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 74/109 (67%), Positives = 89/109 (81%), Gaps = 1/109 (0%)
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
SYTYLN YQ L S++K+ELS K L+EA +D+TLP+CWKGR+PFK+VHDVKK F+T AL
Sbjct: 1 SYTYLNSQAYQGLISLIKRELSTKPLREALDDQTLPICWKGRKPFKSVHDVKKYFKTFAL 60
Query: 329 SFT-DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
SF DGK++T E PEAYLI+S+KGN CLG+LNG EVGL DLNVIG I
Sbjct: 61 SFANDGKSKTQLEFPPEAYLIVSSKGNACLGVLNGTEVGLNDLNVIGDI 109
>gi|308080924|ref|NP_001183009.1| uncharacterized protein LOC100501329 [Zea mays]
gi|238008766|gb|ACR35418.1| unknown [Zea mays]
Length = 205
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 70/152 (46%), Positives = 95/152 (62%), Gaps = 10/152 (6%)
Query: 16 RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
RM + ++++ ++ S+ L + GNV+P G Y +++IG P RPYFLD+DTGSDLT
Sbjct: 61 RMEVAKAATARTN------STALLPIKGNVFPDGQYYTSIFIGNPPRPYFLDVDTGSDLT 114
Query: 76 WLQCDAPCVRCVEAPHPLYRPSND-LVPCEDPICASLHAPGHHN-CEDPAQCDYELEYAD 133
W+QCDAPC C + PHPLY+P+ + +VP D +C L G+ N CE QCDYE+EYAD
Sbjct: 115 WIQCDAPCTNCAKGPHPLYKPAKEKIVPPRDLLCQELQ--GNQNYCETCKQCDYEIEYAD 172
Query: 134 GGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
SS+GVL +D TNG R GC
Sbjct: 173 QSSSMGVLARDDMHMIATNGGREKLDFVFGCA 204
>gi|226530663|ref|NP_001146528.1| uncharacterized protein LOC100280120 [Zea mays]
gi|219887685|gb|ACL54217.1| unknown [Zea mays]
Length = 292
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 92/254 (36%), Positives = 135/254 (53%), Gaps = 22/254 (8%)
Query: 138 LGVLVKDAFAFNYTNGQRLNPRLALGCGYNQ--VPGASYHPLDGILGLGKGKSSIVSQLH 195
+GV V+D+ F +G+R N + GCGY+Q V + DG+LGL S+ +QL
Sbjct: 1 MGVYVRDSMQFVGEDGERENADIVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLA 60
Query: 196 SQKLIRNVVGHCLS---GGGGGFLFFGDDLYDSSRVVWT--------SMSSDYTKYYSPG 244
S+ +I N GHC+S G GG+LF GDD + W + K + G
Sbjct: 61 SRGIISNAFGHCMSTDPSGAGGYLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINHG 120
Query: 245 VAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
+L G+ T VVFD+GS+YTY L S +K+ S + +++ D+TLP
Sbjct: 121 DQQLNAQGKLT-----QVVFDTGSTYTYFPDEALTRLISSLKEAASPRFVQD-DSDKTLP 174
Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGK--TRTLFELTPEAYLIISNKGNVCLGILNG 362
C K P ++V DVK F+ L+L F +RT F + PE YL+IS+KGNVCLG+LNG
Sbjct: 175 FCMKSDFPVRSVEDVKHFFKPLSLQFEKRFFFSRT-FNIRPEHYLVISDKGNVCLGVLNG 233
Query: 363 AEVGLQDLNVIGGI 376
+G + ++G +
Sbjct: 234 TTIGYDSVVIVGDV 247
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 102/367 (27%), Positives = 160/367 (43%), Gaps = 43/367 (11%)
Query: 9 NLCFPTVRMSSSSS--SSSSSSLFNHVGSSLLFQVHGNVYPT--GYYNVTMYIGQPARPY 64
NL FP R +S + + SS + S++ F + GN PT G Y + +G P++ Y
Sbjct: 23 NLVFPVQRRQASLTGIKAHDSSRRGRILSAVDFNLGGNGLPTVTGLYFTKIGLGSPSKDY 82
Query: 65 FLDLDTGSDLTWLQCDAPCVRCVEAPH-----PLYRP----SNDLVPCEDPICASLHAPG 115
++ +DTGSD+ W+ C C RC LY P +++ V CE C+S +
Sbjct: 83 YVQVDTGSDILWVNC-VECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYEGR 141
Query: 116 HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ----RLNPRLALGCGYNQ--- 168
C+ C Y + Y DG ++ G V+D FN NG N + GCG Q
Sbjct: 142 ILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNPHTATQNSSIIFGCGAAQSGT 201
Query: 169 VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRV 228
+S LDGI+G G+ SS++SQL + ++ + HCL GG +F ++ +
Sbjct: 202 FASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGGGIFSIGEVVEPK-- 259
Query: 229 VWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNLPVVFDSGSSYTYLNRVTYQT 280
V T+ +Y+ + + G+ L V DSG++ YL R+ Y
Sbjct: 260 VKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIVYDQ 319
Query: 281 LTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFE 340
L S K L + P + L + F+ +V F + L F D + T++
Sbjct: 320 LMS--------KVLAKQPRLKVY-LVEEQYSCFQYTGNVDSGFPIVKLHFEDSLSLTVY- 369
Query: 341 LTPEAYL 347
P YL
Sbjct: 370 --PHDYL 374
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 103/353 (29%), Positives = 161/353 (45%), Gaps = 36/353 (10%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
++H ++ GYY ++IG P + + L +DTGS +T++ C C +C P ++P
Sbjct: 69 MRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCST-CEQCGRHQDPKFQP-- 125
Query: 99 DLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
DL P+ +L NC+ D QC YE +YA+ +S GVL +D +F N L
Sbjct: 126 DLSSTYQPVKCTLDC----NCDNDRMQCVYERQYAEMSTSSGVLGEDVVSFG--NQSELA 179
Query: 158 P-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGG 213
P R GC + DGI+GLG+G SI+ QL + ++ + C G GGG
Sbjct: 180 PQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGG 239
Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF--------D 265
+ G + S +V+ + YY+ + E+ G+ L P VF D
Sbjct: 240 AMVLGG--ISPPSDMVFAQSDPVRSPYYNIDLKEIHVAGKRLPLN--PSVFDGKHGSVLD 295
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
SG++Y YL + + KEL + S P+ LC+ G +V + K F
Sbjct: 296 SGTTYAYLPEEAFLAFKEAIVKELQSFSQISGPDPNYNDLCFSGAG--IDVSQLSKTFPV 353
Query: 326 LALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
+ + F +G + L+PE Y+ +K G CLGI G ++GGI
Sbjct: 354 VDMIFGNGHK---YSLSPENYMFRHSKVRGAYCLGIFQN---GKDPTTLLGGI 400
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 102/361 (28%), Positives = 166/361 (45%), Gaps = 36/361 (9%)
Query: 31 NHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP 90
+H ++ ++ ++ P GYY ++IG P + + L +DTGS LT++ C C +C +
Sbjct: 72 SHSTATARMPLYDDLIPYGYYTTRIWIGTPPQTFALIVDTGSTLTYVPCST-CEQCGKHQ 130
Query: 91 HPLYRPSNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFN 149
P ++P D P+ S+ C+ + C Y+ +YA+ SS GVL +D +F
Sbjct: 131 DPNFQP--DWSSTYQPLKCSMEC----TCDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFG 184
Query: 150 YTNGQRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
L P R GC + DGI+GLG+G SIV QL + +I N C
Sbjct: 185 --KQSELKPQRTVFGCENVETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCY 242
Query: 209 SG---GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF- 264
G GGG + G + + +V+T + YY+ + E+ G+ + P+VF
Sbjct: 243 GGMDVGGGAMVLGG--ISPPAGMVFTHSDPARSAYYNIDLKEIHIAGKQLPIN--PMVFD 298
Query: 265 -------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
DSG++Y YL ++ + KEL++ L + P+ +C+ G +V
Sbjct: 299 GKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVG--SDVS 356
Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGG 375
+ K F + L F++G L+PE YL +K G CLGI ++GG
Sbjct: 357 QLSKTFPAVDLVFSNGNR---LSLSPENYLFQHSKAHGAYCLGIFQNEN---DQTTLLGG 410
Query: 376 I 376
I
Sbjct: 411 I 411
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 102/361 (28%), Positives = 166/361 (45%), Gaps = 36/361 (9%)
Query: 31 NHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP 90
+H ++ ++ ++ P GYY ++IG P + + L +DTGS LT++ C C +C +
Sbjct: 72 SHSTATARMPLYDDLIPYGYYTTRIWIGTPPQTFALIVDTGSTLTYVPCST-CEQCGKHQ 130
Query: 91 HPLYRPSNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFN 149
P ++P D P+ S+ C+ + C Y+ +YA+ SS GVL +D +F
Sbjct: 131 DPNFQP--DWSSTYQPLKCSMEC----TCDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFG 184
Query: 150 YTNGQRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
L P R GC + DGI+GLG+G SIV QL + +I N C
Sbjct: 185 --KQSELKPQRTVFGCENVETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCY 242
Query: 209 SG---GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF- 264
G GGG + G + + +V+T + YY+ + E+ G+ + P+VF
Sbjct: 243 GGMDVGGGAMVLGG--ISPPAGMVFTHSDPARSAYYNIDLKEIHIAGKQLPIN--PMVFD 298
Query: 265 -------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
DSG++Y YL ++ + KEL++ L + P+ +C+ G +V
Sbjct: 299 GKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVG--SDVS 356
Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGG 375
+ K F + L F++G L+PE YL +K G CLGI ++GG
Sbjct: 357 QLSKTFPAVDLVFSNGNR---LSLSPENYLFQHSKAHGAYCLGIFQNEN---DQTTLLGG 410
Query: 376 I 376
I
Sbjct: 411 I 411
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 109/391 (27%), Positives = 169/391 (43%), Gaps = 54/391 (13%)
Query: 1 MKSSHNGENLCFPTVRMSSSSSSSSSSSLFNH--VGSSLLFQVHGNVYPT--GYYNVTMY 56
+ S NG NL FP R S S+ + + + S++ + GN PT G Y +
Sbjct: 17 IGSVANG-NLVFPVERRKRSLSAVRAHDVRRRGRILSAVDLNLGGNGLPTETGLYFTKLG 75
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH-----PLYRP----SNDLVPCEDPI 107
+G P R Y++ +DTGSD+ W+ C C RC LY P ++D+V C+
Sbjct: 76 LGSPPRDYYVQVDTGSDILWVNC-VECSRCPRKSDLGIDLTLYDPKGSETSDVVSCDQDF 134
Query: 108 CASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ-RLNPR---LALG 163
C++ C+ C Y + Y DG ++ G V+D +N NG R +P+ + G
Sbjct: 135 CSATFDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNRINGNLRTSPQNSSIIFG 194
Query: 164 CGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGD 220
CG Q + +S LDGI+G G+ SS++SQL + ++ + HCL GG +F
Sbjct: 195 CGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDNVRGGGIFAIG 254
Query: 221 DLYDSS-------------RVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSG 267
++ + VV S+ D P +++F G V DSG
Sbjct: 255 EVVEPKVSTTPLVPRMAHYNVVLKSIEVDTDILQLP--SDIFDSVNGKG-----TVIDSG 307
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
++ YL + Y EL K L P + L L + R F +V + F +
Sbjct: 308 TTLAYLPDIVYD--------ELIQKVLARQPGLK-LYLVEQQFRCFLYTGNVDRGFPVVK 358
Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLG 358
L F D + T++ P YL G C+G
Sbjct: 359 LHFKDSLSLTVY---PHDYLFQFKDGIWCIG 386
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 101/353 (28%), Positives = 161/353 (45%), Gaps = 36/353 (10%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
+++ ++ GYY ++IG P + + L +DTGS +T++ C C C P ++P
Sbjct: 77 MRLYDDLLINGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCST-CEHCGRHQDPKFQP-- 133
Query: 99 DLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
DL P+ + NC+ D QC Y+ +YA+ SS GVL +D +F N L
Sbjct: 134 DLSETYQPVKCTPDC----NCDGDTNQCMYDRQYAEMSSSSGVLGEDVVSFG--NLSELA 187
Query: 158 P-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGG 213
P R GC ++ DGI+GLG+G SI+ QL +K+I + C G GGG
Sbjct: 188 PQRAVFGCENDETGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGG 247
Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF--------D 265
+ G + +V+T D + YY+ + E+ G+ L P VF D
Sbjct: 248 AMILGG--ISPPEDMVFTHSDPDRSPYYNINLKEMHVAGKKLQLN--PKVFDGKHGTVLD 303
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
SG++Y YL + + KE ++ P+ +C+ G +V + K F
Sbjct: 304 SGTTYAYLPETAFLAFKRAIMKERNSLKQINGPDPNYKDICFTGAG--IDVSQLAKSFPV 361
Query: 326 LALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
+ + F +G L+PE YL +K G CLG+ + G ++GGI
Sbjct: 362 VDMVFENGHK---LSLSPENYLFRHSKVRGAYCLGVFSN---GRDPTTLLGGI 408
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 100/356 (28%), Positives = 160/356 (44%), Gaps = 42/356 (11%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
++H ++ GYY ++IG P + + L +DTGS +T++ C C +C P ++P +
Sbjct: 100 MRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCST-CEQCGRHQDPKFQPES 158
Query: 99 DLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
P+ ++ NC+ D QC YE +YA+ +S GVL +D +F N L
Sbjct: 159 S--STYQPVKCTIDC----NCDGDRMQCVYERQYAEMSTSSGVLGEDVISFG--NQSELA 210
Query: 158 P-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGG 213
P R GC + DGI+GLG+G SI+ QL +K+I + C G GGG
Sbjct: 211 PQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGG 270
Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV----------- 262
+ G + S + + D + YY+ + E+ G K LP+
Sbjct: 271 AMVLGG--ISPPSDMTFAYSDPDRSPYYNIDLKEMHVAG-----KRLPLNANVFDGKHGT 323
Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
V DSG++Y YL + + KEL + P+ +C+ G +V + K
Sbjct: 324 VLDSGTTYAYLPEAAFLAFKDAIVKELQSLKQISGPDPNYNDICFSGAG--NDVSQLSKS 381
Query: 323 FRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
F + + F +G + L+PE Y+ +K G CLGI G ++GGI
Sbjct: 382 FPVVDMVFGNGHK---YSLSPENYMFRHSKVRGAYCLGIFQN---GNDQTTLLGGI 431
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 103/346 (29%), Positives = 160/346 (46%), Gaps = 42/346 (12%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SNDLVPCE-D 105
GYY ++IG P + + L +DTGS +T++ C + C +C + P ++P S+ P + +
Sbjct: 75 GYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSS-CEQCGKHQDPRFQPDLSSTYRPVKCN 133
Query: 106 PICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-RLALG 163
P C NC+D QC YE YA+ SS GV+ +D +F N L P R G
Sbjct: 134 PSC---------NCDDEGKQCTYERRYAEMSSSSGVIAEDVVSFG--NESELKPQRAVFG 182
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--GGGFLFFGDD 221
C + DGI+GLG+G+ S+V QL + +I + C G GGG + G
Sbjct: 183 CENVETGDLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDVGGGAMVLG-Q 241
Query: 222 LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF--------DSGSSYTYL 273
+ +V++ + + YY+ + EL G+ LK P VF DSG++Y Y
Sbjct: 242 ISPPPNMVFSHSNPYRSPYYNIELKELHVAGKPLKLK--PKVFDEKHGTVLDSGTTYAYF 299
Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
+ L + KE+ P+ +C+ G + V + K F + + F G
Sbjct: 300 PEAAFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAG--REVSHLSKVFPEVNMVFGSG 357
Query: 334 KTRTLFELTPEAYLIISNK--GNVCLGIL-NGAEVGLQDLNVIGGI 376
+ L+PE YL K G CLGI NG ++ ++GGI
Sbjct: 358 QK---LSLSPENYLFRHTKVSGAYCLGIFQNGNDL----TTLLGGI 396
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 91/338 (26%), Positives = 157/338 (46%), Gaps = 27/338 (7%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
++H ++ GYY +YIG P + + L +D+GS +T++ C A C +C P ++P
Sbjct: 77 MRLHDDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQP-- 133
Query: 99 DLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
DL P+ ++ C+ D QC YE +YA+ SS GVL +D +F + +
Sbjct: 134 DLSSSYSPVKCNVDC----TCDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKAQ 189
Query: 158 PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGGG 214
R GC ++ DGI+GLG+G+ SI+ QL + +I + C G GGG
Sbjct: 190 -RAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVINDSFSLCYGGMDIGGGA 248
Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNL------PVVFDSGS 268
+ G + S +V++ + YY+ + E+ G+ + + V DSG+
Sbjct: 249 MVLGG--VPTPSDMVFSRSDPLRSPYYNIELKEIHVAGKALRVDSRIFDSKHGTVLDSGT 306
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
+Y YL + + ++ + P+ +C+ G R +NV + + F + +
Sbjct: 307 TYAYLPEQAFMAFKDAVTSKVHSLKKIRGPDPSYKDICFAGAR--RNVSKLHEVFPDVDM 364
Query: 329 SFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAE 364
F +G+ LTPE YL +K G CLG+ +
Sbjct: 365 VFGNGQK---LSLTPENYLFRHSKVDGAYCLGVFQNGK 399
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 94/354 (26%), Positives = 159/354 (44%), Gaps = 30/354 (8%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
S+ ++H ++ GYY ++IG P + + L +DTGS +T++ C + CV+C P +
Sbjct: 73 SNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGSTVTYVPC-SNCVQCGNHQDPRF 131
Query: 95 RPSNDLVPCEDPICASLHAPGHHNC-EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
+P +L P+ + NC E+ QC YE YA+ +S GVL +D +F
Sbjct: 132 QP--ELSSTYQPVKCNADC----NCDENGVQCTYERRYAEMSTSSGVLAEDVMSFG-KES 184
Query: 154 QRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--- 210
+ + R GC + DGI+GLG+G S++ QL + ++ N C G
Sbjct: 185 ELVPQRAVFGCETMESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDV 244
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK------NLPVVF 264
GGG + G + +V++ + YY+ + E+ G+ L +
Sbjct: 245 GGGAMVLGG--ISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAIL 302
Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
DSG++Y Y Y + K++S P+ +C+ G ++V ++ K F
Sbjct: 303 DSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAG--RDVTELPKVFP 360
Query: 325 TLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
+ + F +G+ L+PE YL K G CLGI G ++GGI
Sbjct: 361 EVDMVFANGQK---ISLSPENYLFRHTKVSGAYCLGIFKN---GNDQTTLLGGI 408
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 97/340 (28%), Positives = 162/340 (47%), Gaps = 39/340 (11%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP-- 96
+++ ++ GYY ++IG P + + L +DTGS +T++ C C +C + P ++P
Sbjct: 64 MKLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCST-CKQCGKHQDPKFQPEL 122
Query: 97 --SNDLVPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNG 153
S + C +P C NC+D + C YE YA+ SS GVL +D +F N
Sbjct: 123 STSYQALKC-NPDC---------NCDDEGKLCVYERRYAEMSSSSGVLSEDLISFG--NE 170
Query: 154 QRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG- 211
+L+P R GC + DGI+GLG+GK S+V QL + +I +V C G
Sbjct: 171 SQLSPQRAVFGCENEETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGME 230
Query: 212 -GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF------ 264
GGG + G + +V++ + YY+ + ++ G++ LK P VF
Sbjct: 231 VGGGAMVLG-KISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKS--LKLNPKVFNGKHGT 287
Query: 265 --DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
DSG++Y Y + + + + KE+ + P+ +C+ G ++V ++
Sbjct: 288 VLDSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAG--RDVAEIHNF 345
Query: 323 FRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGIL 360
F +A+ F +G+ L+PE YL K G CLGI
Sbjct: 346 FPEIAMEFGNGQK---LILSPENYLFRHTKVRGAYCLGIF 382
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 99/351 (28%), Positives = 166/351 (47%), Gaps = 31/351 (8%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
+++ ++ GYY ++IG P + + L +D+GS +T++ C + C +C + P ++P
Sbjct: 81 MRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKHQDPKFQP-- 137
Query: 99 DLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
++ P+ ++ NC+D QC YE EYA+ SS GVL +D +F N +L
Sbjct: 138 EMSSTYQPVKCNMDC----NCDDDREQCVYEREYAEHSSSKGVLGEDLISFG--NESQLT 191
Query: 158 P-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGG 213
P R GC + DGI+GLG+G S+V QL + LI N G C G GGG
Sbjct: 192 PQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGG 251
Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP------VVFDSG 267
+ G D S +V+T D + YY+ + + G+ L + V DSG
Sbjct: 252 SMILGGFDY--PSDMVFTDSDPDRSPYYNIDLTGIRVAGKQLSLHSRVFDGEHGAVLDSG 309
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
++Y YL + + +E+S + P+ C++ V ++ K F ++
Sbjct: 310 TTYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPNFKDTCFQVAAS-NYVSELSKIFPSVE 368
Query: 328 LSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
+ F G++ + L+PE Y+ +K G CLG+ G ++GGI
Sbjct: 369 MVFKSGQS---WLLSPENYMFRHSKVHGAYCLGVFPN---GKDHTTLLGGI 413
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 97/340 (28%), Positives = 162/340 (47%), Gaps = 39/340 (11%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP-- 96
+++ ++ GYY ++IG P + + L +DTGS +T++ C C +C + P ++P
Sbjct: 64 MKLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCST-CKQCGKHQDPKFQPEL 122
Query: 97 --SNDLVPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNG 153
S + C +P C NC+D + C YE YA+ SS GVL +D +F N
Sbjct: 123 STSYQALKC-NPDC---------NCDDEGKLCVYERRYAEMSSSSGVLSEDLISFG--NE 170
Query: 154 QRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG- 211
+L+P R GC + DGI+GLG+GK S+V QL + +I +V C G
Sbjct: 171 SQLSPQRAVFGCENEETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGME 230
Query: 212 -GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF------ 264
GGG + G + +V++ + YY+ + ++ G++ LK P VF
Sbjct: 231 VGGGAMVLG-KISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKS--LKLNPKVFNGKHGT 287
Query: 265 --DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
DSG++Y Y + + + + KE+ + P+ +C+ G ++V ++
Sbjct: 288 VLDSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAG--RDVAEIHNF 345
Query: 323 FRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGIL 360
F +A+ F +G+ L+PE YL K G CLGI
Sbjct: 346 FPEIAMEFGNGQK---LILSPENYLFRHTKVRGAYCLGIF 382
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 94/354 (26%), Positives = 159/354 (44%), Gaps = 30/354 (8%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
S+ ++H ++ GYY ++IG P + + L +DTGS +T++ C + CV+C P +
Sbjct: 73 SNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGSTVTYVPC-SNCVQCGNHQDPRF 131
Query: 95 RPSNDLVPCEDPICASLHAPGHHNC-EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
+P +L P+ + NC E+ QC YE YA+ +S GVL +D +F
Sbjct: 132 QP--ELSSTYQPVKCNADC----NCDENGVQCTYERRYAEMSTSSGVLAEDVMSFG-KES 184
Query: 154 QRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--- 210
+ + R GC + DGI+GLG+G S++ QL + ++ N C G
Sbjct: 185 ELVPQRAVFGCETMESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDV 244
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK------NLPVVF 264
GGG + G + +V++ + YY+ + E+ G+ L +
Sbjct: 245 GGGAMVLGG--ISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAIL 302
Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
DSG++Y Y Y + K++S P+ +C+ G ++V ++ K F
Sbjct: 303 DSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAG--RDVTELPKVFP 360
Query: 325 TLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
+ + F +G+ L+PE YL K G CLGI G ++GGI
Sbjct: 361 EVDMVFANGQK---ISLSPENYLFRHTKVSGAYCLGIFKN---GNDQTTLLGGI 408
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 99/351 (28%), Positives = 166/351 (47%), Gaps = 31/351 (8%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
+++ ++ GYY ++IG P + + L +D+GS +T++ C + C +C + P ++P
Sbjct: 82 MRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKHQDPKFQP-- 138
Query: 99 DLVPCEDPICASLHAPGHHNCED-PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
+L P+ ++ NC+D QC YE EYA+ SS GVL +D +F N +L
Sbjct: 139 ELSSTYQPVKCNMDC----NCDDDKEQCVYEREYAEHSSSKGVLGEDLISFG--NESQLT 192
Query: 158 P-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGG 213
P R GC + DGI+GLG+G S+V QL + LI N G C G GGG
Sbjct: 193 PQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGG 252
Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP------VVFDSG 267
+ G D S +++T D + YY+ + + G+ L + V DSG
Sbjct: 253 SMILGGFDY--PSDMIFTDSDPDRSPYYNIDLTGIRVAGKKLSLNSRVFDGEHGAVLDSG 310
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
++Y YL + + +E+S + P+ C+ +V ++ K F ++
Sbjct: 311 TTYAYLPDAAFAAFEEAVMREVSPLKQIDGPDPNFKDTCFLVAAS-NDVSELSKIFPSVE 369
Query: 328 LSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
+ F G++ + L+PE Y+ +K G CLG+ G ++GGI
Sbjct: 370 MIFKSGQS---WLLSPENYMFRHSKVHGAYCLGVFPN---GKDHTTLLGGI 414
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 104/386 (26%), Positives = 175/386 (45%), Gaps = 35/386 (9%)
Query: 7 GENLCFPTVRM---SSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARP 63
G L P R +S ++SS L + + ++H ++ GYY +YIG P +
Sbjct: 42 GPPLFLPLTRSYPNASRLAASSRRGLGDGAHPNARMRLHDDLLTNGYYTTRLYIGTPPQE 101
Query: 64 YFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCE-DP 122
+ L +D+GS +T++ C A C +C P ++P DL P+ ++ C+ D
Sbjct: 102 FALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQP--DLSSSYSPVKCNVDC----TCDSDK 154
Query: 123 AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-RLALGCGYNQVPGASYHPLDGIL 181
QC YE +YA+ SS GVL +D +F + L P R GC ++ DGI+
Sbjct: 155 KQCTYERQYAEMSSSSGVLGEDIVSFGRES--ELKPQRAVFGCENSETGDLFSQHADGIM 212
Query: 182 GLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGGGFLFFGDDLYDSSRVVWTSMSSDYT 238
GLG+G+ SI+ QL + +I + C G GGG + G + S +V++ +
Sbjct: 213 GLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGG--VPAPSDMVFSHSDPLRS 270
Query: 239 KYYSPGVAELFFGGETTGLKNLP------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAK 292
YY+ + E+ G+ + + V DSG++Y YL + + ++ +
Sbjct: 271 PYYNIELKEIHVAGKALRVDSRVFNSKHGTVLDSGTTYAYLPEQAFVAFKDAVTSKVHSL 330
Query: 293 SLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK 352
P+ +C+ G +NV + + F + + F +G+ LTPE YL +K
Sbjct: 331 KKIRGPDPNYKDICFAGAG--RNVSKLHEVFPDVDMVFGNGQK---LSLTPENYLFRHSK 385
Query: 353 --GNVCLGILNGAEVGLQDLNVIGGI 376
G CLG+ G ++GGI
Sbjct: 386 VDGAYCLGVFQN---GKDPTTLLGGI 408
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 102/372 (27%), Positives = 169/372 (45%), Gaps = 32/372 (8%)
Query: 18 SSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWL 77
++S +SS L + S ++H ++ GYY +YIG P + + L +D+GS +T++
Sbjct: 52 NASRLASSRRVLGDGGRPSARMRLHDDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYV 111
Query: 78 QCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGS 136
C A C +C P ++P DL P+ S C+ D +QC YE +YA+ S
Sbjct: 112 PC-ASCEQCGNHQDPRFQP--DLSSTYSPVKCSADC----TCDSDKSQCTYERQYAEMSS 164
Query: 137 SLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS 196
S GVL +D +F T + R GC ++ DGI+GLG+G+ SI+ QL
Sbjct: 165 SSGVLGEDIVSFG-TESELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVD 223
Query: 197 QKLIRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGET 254
+ +I + C G GGG + G + +V++ + YY+ + E+ G+
Sbjct: 224 KGVIGDSFSMCYGGMDIGGGAMVLG-AMPAPPDMVFSRSDPVRSPYYNIELKEIHVAGKA 282
Query: 255 TGLKNLPVVF--------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLC 306
L P +F DSG++Y YL + + ++ P+ +C
Sbjct: 283 LRLD--PRIFDSKHGTVLDSGTTYAYLPEQAFVAFKDAVTSKVRPLKKIRGPDPNYKDIC 340
Query: 307 WKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAE 364
+ G +NV + + F + + F DG+ L+PE YL +K G CLG+
Sbjct: 341 FAGAG--RNVSQLSQAFPDVDMVFGDGQK---LSLSPENYLFRHSKVEGAYCLGVFQN-- 393
Query: 365 VGLQDLNVIGGI 376
G ++GGI
Sbjct: 394 -GKDPTTLLGGI 404
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 150/340 (44%), Gaps = 50/340 (14%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
G Y ++M IG P R Y LDTGSDL W QC APC+ CV+ P P + P+ +PC
Sbjct: 87 GEYLMSMGIGTPPRYYSAILDTGSDLIWTQC-APCMLCVDQPTPFFDPAQSPSYAKLPCN 145
Query: 105 DPICASLHAP-GHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
P+C +L+ P + N C Y+ Y D ++ GVL + F F + + PR+A G
Sbjct: 146 SPMCNALYYPLCYRNV-----CVYQYFYGDSANTAGVLSNETFTFGTNDTRVTVPRIAFG 200
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF---LFFGD 220
CG + S G++G G+G S+VSQL S + +CL+ L+FG
Sbjct: 201 CG--NLNAGSLFNGSGMVGFGRGPLSLVSQLGSPRF-----SYCLTSFMSPVPSRLYFGA 253
Query: 221 DLYDSSRVVWTSMSSDYTKYY-SPGVAELFF---GGETTGLKNLP--------------- 261
+S T T + +PG+ +++ G + G + LP
Sbjct: 254 YATLNSTSASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADGTG 313
Query: 262 -VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
V+ DSGS+ TYL R Y + ++ + L C+ P + + +
Sbjct: 314 GVIIDSGSTITYLARAAYDMVHQAFADQVGLPLTNATSLADVLDTCFVWPPPPRKIVTMP 373
Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLIIS-NKGNVCLGI 359
+ LA F EL E Y++I + GN+CL I
Sbjct: 374 E----LAFHFEGAN----MELPLENYMLIDGDTGNLCLAI 405
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 101/386 (26%), Positives = 179/386 (46%), Gaps = 35/386 (9%)
Query: 7 GENLCFPTVRMSSSSSSSSSS---SLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARP 63
G L P R ++S ++S L + V + ++H ++ GYY +YIG P +
Sbjct: 41 GPPLFLPLTRSYPNASRLAASLRRGLGDGVHPNARMRLHDDLLTNGYYTTRLYIGTPPQE 100
Query: 64 YFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCE-DP 122
+ L +D+GS +T++ C + C +C P ++P DL P+ ++ C+ D
Sbjct: 101 FALIVDSGSTVTYVPCSS-CEQCGNHQDPRFQP--DLSSSYSPVKCNVDC----TCDSDK 153
Query: 123 AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL-GCGYNQVPGASYHPLDGIL 181
QC YE +YA+ SS GVL +D +F + L P+ A+ GC ++ DGI+
Sbjct: 154 KQCTYERQYAEMSSSSGVLGEDIVSFGRES--ELKPQHAIFGCENSETGDLFSQHADGIM 211
Query: 182 GLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGGGFLFFGDDLYDSSRVVWTSMSSDYT 238
GLG+G+ SI+ QL + +I + C G GGG + G + +++++ +
Sbjct: 212 GLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGG--MLAPPDMIFSNSDPLRS 269
Query: 239 KYYSPGVAELFFGGETTGLKNLP------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAK 292
YY+ + E+ G+ +++ V DSG++Y YL + + ++ +
Sbjct: 270 PYYNIELKEIHVAGKALRVESRIFNSKHGTVLDSGTTYAYLPEQAFVAFKEAVTSKVHSL 329
Query: 293 SLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK 352
P+ +C+ G +NV + + F + + F +G+ LTPE YL +K
Sbjct: 330 KKIRGPDPSYKDICFAGAG--RNVSKLHEVFPDVDMVFGNGQK---LSLTPENYLFRHSK 384
Query: 353 --GNVCLGILNGAEVGLQDLNVIGGI 376
G CLG+ G ++GGI
Sbjct: 385 VDGAYCLGVFQN---GKDPTTLLGGI 407
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 112/398 (28%), Positives = 177/398 (44%), Gaps = 48/398 (12%)
Query: 4 SHNGENLCFP----TVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVY----PTGYYNVTM 55
+HN + P T +SS +S+ + +S L H +Y GYY +
Sbjct: 33 NHNHRPMIIPLHLSTSNISSHRKPFTSNYHRRQLHNSDLPNAHMRLYDDLLSNGYYTTRL 92
Query: 56 YIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SNDLVPCE-DPICASLH 112
+IG P + + L +DTGS +T++ C C +C + P ++P S+ P + +P C
Sbjct: 93 FIGTPPQEFALIVDTGSTVTYVPCST-CEQCGKHQDPRFQPESSSTYKPMQCNPSC---- 147
Query: 113 APGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL-GCGYNQVP 170
NC+D QC YE YA+ SS G+L +D +F N L P+ A+ GC +
Sbjct: 148 -----NCDDEGKQCTYERRYAEMSSSSGLLAEDVLSFG--NESELTPQRAIFGCETVETG 200
Query: 171 GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG--GGFLFFGDDLYDSSRV 228
DGI+GLG+G S+V QL ++++ N C G GG + G ++ +
Sbjct: 201 ELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVVGGAMVLG-NIPPPPDM 259
Query: 229 VWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF--------DSGSSYTYLNRVTYQT 280
V+ + YY+ + EL G+ LK P VF DSG++Y YL +
Sbjct: 260 VFAHSDPYRSAYYNIELKELHVAGKR--LKLNPRVFDGKHGTVLDSGTTYAYLPEEAFVA 317
Query: 281 LTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFE 340
+ KE+ P+ +C+ G ++V + K F + + F +G+
Sbjct: 318 FKDAIIKEIKFLKQIHGPDPSYNDICFSGAG--RDVSQLSKIFPEVNMVFGNGQK---LS 372
Query: 341 LTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
L+PE YL K G CLGI G ++GGI
Sbjct: 373 LSPENYLFRHTKVSGAYCLGIFQN---GKDPTTLLGGI 407
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 99/383 (25%), Positives = 164/383 (42%), Gaps = 53/383 (13%)
Query: 9 NLCFPTVRMSSS--SSSSSSSSLFNHVGSSLLFQVHGNVYPT--GYYNVTMYIGQPARPY 64
N FP R S + + + + S++ + GN PT G Y + +G P + Y
Sbjct: 24 NFVFPVERRKRSLNAVKAHDARRRGRILSAVDLNLGGNGLPTETGLYFTKLGLGSPPKDY 83
Query: 65 FLDLDTGSDLTWLQCDAPCVRCVEAPH-----PLYRP----SNDLVPCEDPICASLHAPG 115
++ +DTGSD+ W+ C C RC LY P +++L+ C+ C++ +
Sbjct: 84 YVQVDTGSDILWVNC-VKCSRCPRKSDLGIDLTLYDPKGSETSELISCDQEFCSATYDGP 142
Query: 116 HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ-RLNPR---LALGCGYNQ--- 168
C+ C Y + Y DG ++ G V+D +N+ N R P+ + GCG Q
Sbjct: 143 IPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNHVNDNLRTAPQNSSIIFGCGAVQSGT 202
Query: 169 VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSS-- 226
+ +S LDGI+G G+ SS++SQL + ++ + HCL GG +F ++ +
Sbjct: 203 LSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCLDNIRGGGIFAIGEVVEPKVS 262
Query: 227 -----------RVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNR 275
VV S+ D P +++F G G + DSG++ YL
Sbjct: 263 TTPLVPRMAHYNVVLKSIEVDTDILQLP--SDIFDSGNGKG-----TIIDSGTTLAYLPA 315
Query: 276 VTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKT 335
+ Y EL K + P + L L + F+ +V + F + L F D +
Sbjct: 316 IVYD--------ELIPKVMARQPRLK-LYLVEQQFSCFQYTGNVDRGFPVVKLHFEDSLS 366
Query: 336 RTLFELTPEAYLIISNKGNVCLG 358
T++ P YL G C+G
Sbjct: 367 LTVY---PHDYLFQFKDGIWCIG 386
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 99/355 (27%), Positives = 156/355 (43%), Gaps = 54/355 (15%)
Query: 33 VGSSLLFQVHGNVYPT----GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVE 88
VG + F+V G+ P+ G Y + +G P R + + +DTGSD+ W+ C+ C C +
Sbjct: 62 VGGVVDFRVQGSSDPSTLGYGLYTTKVKMGTPPREFTVQIDTGSDILWINCNT-CSNCPK 120
Query: 89 AP---------HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSL 138
+ + + LVPC DP+CAS C QC Y +Y DG +
Sbjct: 121 SSGLGIELNFFDTVGSSTAALVPCSDPMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTS 180
Query: 139 GVLVKDAFAFNYTNGQRLNPRLA------LGCGYNQVPGASY--HPLDGILGLGKGKSSI 190
GV V DA F+ GQ +A GC Q + +DGILG G G+ S+
Sbjct: 181 GVYVSDAMYFDMILGQSTPANVASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSV 240
Query: 191 VSQLHSQKLIRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAEL 248
VSQL S+ + V HCL G GGG L G+ L S +V++ + +Y+ + +
Sbjct: 241 VSQLSSRGITPKVFSHCLKGDGNGGGILVLGEILEPS--IVYSPLVPS-QPHYNLNLQSI 297
Query: 249 FFGGETTGLKNLPVVF----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAP 298
G+ + P VF DSG++ +YL + Y L + + +S +
Sbjct: 298 AVNGQVLSIN--PAVFATSDKRGTIIDSGTTLSYLVQEAYDPLVNAVDTAVSQFATS--- 352
Query: 299 EDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKG 353
KG + + + + F T++ +F G + +L P YL+ N+G
Sbjct: 353 ------FISKGSQCYLVLTSIDDSFPTVSFNFEGGAS---MDLKPSQYLL--NRG 396
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 96/338 (28%), Positives = 140/338 (41%), Gaps = 48/338 (14%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------VEA 89
Q + Y G Y + +G P RP+++ +DTGSD+ W+ C PC C +
Sbjct: 29 LQGTADPYVAGLYYTRIELGTPPRPFYVQIDTGSDILWVNC-KPCNACPLTSGLGVALNF 87
Query: 90 PHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN 149
P + + C D C S + C C Y EY DG +LG V D F +N
Sbjct: 88 FDPRGSSTASPLSCIDSKCVSSNQISESVCTTDRYCGYSFEYGDGSGTLGYYVSDEFDYN 147
Query: 150 -YTNGQRLN---PRLALGCGYNQVPGASYHP---LDGILGLGKGKSSIVSQLHSQKLIRN 202
Y N N ++ GC YNQ G P +DGI G G+ S+VSQL+SQ L
Sbjct: 148 QYVNQYVTNNASAKITFGCSYNQ-SGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPK 206
Query: 203 VVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNL 260
+ HCL G GGG L G+ +V+T + +Y+ + + G+ +
Sbjct: 207 IFSHCLEGADPGGGILVLGE--ITEPGMVYTPIVPS-QPHYNLNLQGIAVNGQQLSID-- 261
Query: 261 PVVF----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
P VF D G++ YL Y+ + + +S T P KG
Sbjct: 262 PQVFATTNTRGTIIDCGTTLAYLAEEAYEPFVNTIIAAVS---------QSTQPFMLKGN 312
Query: 311 RPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI 348
F VH + + F ++ L F +L P+ YLI
Sbjct: 313 PCFLTVHSIDEIFPSVTLYFEGAP----MDLKPKDYLI 346
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 162/357 (45%), Gaps = 39/357 (10%)
Query: 36 SLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYR 95
S +H ++ GYY + IG P + L +DTGS +T++ C + C C P +
Sbjct: 20 SARMDLHDDLLTKGYYTSRVKIGTPPHEFSLIVDTGSTVTYVPCSS-CTHCGNHQDPRFS 78
Query: 96 P--SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN- 152
P S+ P E C S + G C+ + Y+ +YA+ +S GVL KD F+ ++
Sbjct: 79 PALSSSYKPLE---CGSECSTGF--CDGSRK--YQRQYAEKSTSSGVLGKDVIGFSNSSD 131
Query: 153 --GQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
GQRL GC + DGI+GLG+G SI+ QL + + +V C G
Sbjct: 132 LGGQRL----VFGCETAETGDLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGG 187
Query: 211 ---GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK------NLP 261
GGG + G +V+T+ + YY+ + + GG LK
Sbjct: 188 MDEGGGAMILGG--FQPPKDMVFTASDPHRSPYYNLMLKGIRVGGSPLRLKPEVFDGKYG 245
Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
V DSG++Y Y +Q S +K+++ + P+++ +C+ G NV ++ +
Sbjct: 246 TVLDSGTTYAYFPGAAFQAFKSAVKEQVGSLKEVPGPDEKFKDICYAGAG--TNVSNLSQ 303
Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
F ++ F DG++ T L+PE YL K G CLG+ + ++GGI
Sbjct: 304 FFPSVDFVFGDGQSVT---LSPENYLFRHTKISGAYCLGVFENGD----PTTLLGGI 353
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 96/340 (28%), Positives = 161/340 (47%), Gaps = 39/340 (11%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP-- 96
+++ ++ GYY ++IG P + + L +DTGS +T++ C C +C + P ++P
Sbjct: 68 MKLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCST-CKQCGKHQDPKFQPEL 126
Query: 97 --SNDLVPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNG 153
S + C +P C NC+D + C YE YA+ SS GVL +D +F N
Sbjct: 127 SSSYKALKC-NPDC---------NCDDEGKLCVYERRYAEMSSSSGVLSEDLISFG--NE 174
Query: 154 QRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG- 211
+L P R GC + DGI+GLG+GK S+V QL + +I +V C G
Sbjct: 175 SQLTPQRAVFGCENVETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGME 234
Query: 212 -GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF------ 264
GGG + G + + +V++ + YY+ + ++ G++ LK P VF
Sbjct: 235 VGGGAMVLG-KISPPAGMVFSHSDPFRSPYYNIDLKQMHVAGKS--LKLNPKVFNGKHGT 291
Query: 265 --DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
DSG++Y Y + + + + KE+ + P+ +C+ G ++V ++
Sbjct: 292 VLDSGTTYAYFPKEAFIAIKDAIIKEIPSLKRIHGPDPNYDDVCFSGAG--RDVAEIHNF 349
Query: 323 FRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGIL 360
F + + F +G+ L+PE YL K G CLGI
Sbjct: 350 FPEIDMEFGNGQK---LILSPENYLFRHTKVRGAYCLGIF 386
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 95/354 (26%), Positives = 164/354 (46%), Gaps = 32/354 (9%)
Query: 36 SLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYR 95
S ++H ++ GYY ++IG P + + L +D+GS +T++ C A C +C P ++
Sbjct: 73 SARMRLHDDLLTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQ 131
Query: 96 PSNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
P DL P+ ++ C+ D QC YE +YA+ SS GVL +D +F T +
Sbjct: 132 P--DLSSTYSPVKCNVDC----TCDSDKNQCTYERQYAEMSSSSGVLGEDIVSFG-TESE 184
Query: 155 RLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--G 212
R GC ++ DGI+GLG+G+ SI+ QL + +I + C G G
Sbjct: 185 LKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIG 244
Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------- 264
GG + G + +++T ++ + YY+ + E+ G+ L+ P +F
Sbjct: 245 GGAMVLG-AMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKA--LRVDPRIFDGKHGTVL 301
Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
DSG++Y YL + + ++ P+ +C+ G +NV + + F
Sbjct: 302 DSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDSNYKDICFAGAG--RNVSQLSEVFP 359
Query: 325 TLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
+ + F +G+ L+PE YL +K G CLG+ G ++GGI
Sbjct: 360 KVDMVFGNGQK---LSLSPENYLFRHSKVEGAYCLGVFQN---GKDPTTLLGGI 407
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 111/347 (31%), Positives = 145/347 (41%), Gaps = 49/347 (14%)
Query: 46 YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLY----RP 96
Y G Y + +G P R Y L +DTGSDL W+ C PC+ C ++ P Y
Sbjct: 31 YIAGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNCH-PCIGCPAFSDLKIPIVPYDVKASA 89
Query: 97 SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
S+ VPC DP C + C D QC Y +Y DG +LG LV+D +
Sbjct: 90 SSSKVPCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYMVNA---- 145
Query: 157 NPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--G 212
+ GCG+ Q S LDGI+G G S SQL Q NV HCL GG G
Sbjct: 146 TATVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERG 205
Query: 213 GGFLFFGD----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-VVFDSG 267
GG L G+ D+ + V + S + + S A L + + +FDSG
Sbjct: 206 GGILVLGNVIEPDIQYTPLVPYMSHYNVVLQSISVNNANLTIDPKLFSNDVMQGTIFDSG 265
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
++ YL YQ T A SL AP LC F + K F +
Sbjct: 266 TTLAYLPDEAYQAFT-------QAVSLVVAP----FLLCDTRLSRF-----IYKLFPNVV 309
Query: 328 LSFTDGKTRTLFELTPEAYLI----ISNKGNVCLGI--LNGAEVGLQ 368
L F +G + T LTP YLI +N C+G + AE LQ
Sbjct: 310 LYF-EGASMT---LTPAEYLIRQASAANAPIWCMGWQSMGSAESELQ 352
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 95/330 (28%), Positives = 142/330 (43%), Gaps = 33/330 (10%)
Query: 43 GNVYPT--GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLYR 95
GN +P G Y + +G P + Y++ +DTGSD+ W+ C A C +C + LY
Sbjct: 72 GNGHPAEAGLYFAKIGLGNPPKDYYVQVDTGSDILWVNC-ANCDKCPTKSDLGVKLTLYD 130
Query: 96 P----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
P S + C+D CA+ + C C Y + Y DG S+ G VKD F+
Sbjct: 131 PQSSTSATRIYCDDDFCAATYNGVLQGCTKDLPCQYSVVYGDGSSTAGFFVKDNLQFDRV 190
Query: 152 NGQ----RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVG 205
G N + GCG Q G S LDGILG G+ SS++SQL + ++ V
Sbjct: 191 TGNLQTSSANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFA 250
Query: 206 HCLSG-GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL------- 257
HCL GGG G+ + S +V T M + +Y+ + E+ GG L
Sbjct: 251 HCLDNVKGGGIFAIGEVV--SPKVNTTPMVPN-QPHYNVVMKEIEVGGNVLELPTDIFDT 307
Query: 258 -KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNV 316
+ DSG++ YL V Y+++ + + E L E T C++
Sbjct: 308 GDRRGTIIDSGTTLAYLPEVVYESMMTKIVSEQPGLKLHTVEEQFT---CFQYTGNVNEG 364
Query: 317 HDVKKCFRTLALSFTDGKTRTLFELTPEAY 346
V K +LS T LF++ E +
Sbjct: 365 FPVVKFHFNGSLSLTVNPHDYLFQIHEEVW 394
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 95/360 (26%), Positives = 153/360 (42%), Gaps = 48/360 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP--- 96
TG Y + +G P + Y++ +DTGSD+ W+ C + C + PH LY P
Sbjct: 83 TGLYYTEIKLGTPPKHYYVQVDTGSDILWVNC----ITCEQCPHKSGLGLDLTLYDPKAS 138
Query: 97 -SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT---- 151
+ +V C+ CA+ C C+Y + Y DG S++G V DA F+
Sbjct: 139 STGSMVMCDQAFCAATFGGKLPKCGANVPCEYSVTYGDGSSTIGSFVTDALQFDQVTRDG 198
Query: 152 NGQRLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
Q N + GCG Q G+S LDGILG G+ +S++SQL + ++ + HCL
Sbjct: 199 QTQPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCLD 258
Query: 210 G-GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNL 260
GGG GD + +V T + +D +Y+ + + GG T L +
Sbjct: 259 TIKGGGIFSIGDVV--QPKVKTTPLVAD-KPHYNVNLKTIDVGGTTLQLPAHIFEPGEKK 315
Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
+ DSG++ TYL + ++ + + + + + +G F+ V
Sbjct: 316 GTIIDSGTTLTYLPELVFKEVMLAVFNKHQDITFHDV----------QGFLCFQYPGSVD 365
Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDFV 380
F T+ F D ++ P Y + C+G NGA +D I +GD V
Sbjct: 366 DGFPTITFHFEDDLALHVY---PHEYFFANGNDVYCVGFQNGASQS-KDGKDIVLMGDLV 421
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 95/354 (26%), Positives = 164/354 (46%), Gaps = 32/354 (9%)
Query: 36 SLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYR 95
S ++H ++ GYY ++IG P + + L +D+GS +T++ C A C +C P ++
Sbjct: 73 SARMRLHDDLLTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPC-ASCEQCGNHQDPRFQ 131
Query: 96 PSNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
P DL P+ ++ C+ D QC YE +YA+ SS GVL +D +F T +
Sbjct: 132 P--DLSSTYSPVKCNVDC----TCDSDKNQCTYERQYAEMSSSSGVLGEDIVSFG-TESE 184
Query: 155 RLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--G 212
R GC ++ DGI+GLG+G+ SI+ QL + +I + C G G
Sbjct: 185 LKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIG 244
Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------- 264
GG + G + +++T ++ + YY+ + E+ G+ L+ P +F
Sbjct: 245 GGAMVLG-AMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKA--LRVDPRIFDGKHGTVL 301
Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
DSG++Y YL + + ++ P+ +C+ G +NV + + F
Sbjct: 302 DSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAG--RNVSQLSEVFP 359
Query: 325 TLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
+ + F +G+ L+PE YL +K G CLG+ G ++GGI
Sbjct: 360 KVDMVFGNGQK---LSLSPENYLFRHSKVEGAYCLGVFQN---GKDPTTLLGGI 407
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 98/369 (26%), Positives = 172/369 (46%), Gaps = 34/369 (9%)
Query: 22 SSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA 81
+SS+ L + + ++H ++ GYY +YIG P++ + L +D+GS +T++ C A
Sbjct: 62 ASSARRGLGDGHNPNARMRLHDDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPC-A 120
Query: 82 PCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGV 140
C +C P ++P DL P+ ++ C++ +QC YE +YA+ SS GV
Sbjct: 121 TCEQCGNHQDPRFQP--DLSSTYSPVKCNVDC----TCDNERSQCTYERQYAEMSSSSGV 174
Query: 141 LVKDAFAFNYTNGQRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
L +D +F + L P R GC + DGI+GLG+G+ SI+ QL + +
Sbjct: 175 LGEDIMSFGKES--ELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGV 232
Query: 200 IRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL 257
I + C G GGG + G + +V++ + + YY+ + E+ G+ L
Sbjct: 233 ISDSFSLCYGGMDVGGGTMVLG-GMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRL 291
Query: 258 KNLPVVF--------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
P +F DSG++Y YL + + ++++ P+ +C+ G
Sbjct: 292 D--PKIFNSKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAG 349
Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGL 367
+NV + + F + + F +G+ L+PE YL +K G CLG+ G
Sbjct: 350 AG--RNVSQLSEVFPDVDMVFGNGQK---LSLSPENYLFRHSKVEGAYCLGVFQN---GK 401
Query: 368 QDLNVIGGI 376
++GGI
Sbjct: 402 DPTTLLGGI 410
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 110/347 (31%), Positives = 144/347 (41%), Gaps = 49/347 (14%)
Query: 46 YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLY----RP 96
Y G Y + +G P R Y L +DTGSDL W+ C PC+ C ++ P Y
Sbjct: 31 YIAGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNCH-PCIGCPAFSDLKIPIVPYDVKASA 89
Query: 97 SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
S+ VPC DP C + C D QC Y +Y DG +LG LV+D +
Sbjct: 90 SSSKVPCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYMVNA---- 145
Query: 157 NPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--G 212
+ GCG+ Q S LDGI+G G S SQL Q NV HCL GG G
Sbjct: 146 TATVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERG 205
Query: 213 GGFLFFGD----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-VVFDSG 267
GG L G+ D+ + V + + + S A L + + +FDSG
Sbjct: 206 GGILVLGNVIEPDIQYTPLVPYMYHYNVVLQSISVNNANLTIDPKLFSNDVMQGTIFDSG 265
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
++ YL YQ T A SL AP LC F + K F +
Sbjct: 266 TTLAYLPDEAYQAFT-------QAVSLVVAP----FLLCDTRLSRF-----IYKLFPNVV 309
Query: 328 LSFTDGKTRTLFELTPEAYLI----ISNKGNVCLGI--LNGAEVGLQ 368
L F +G + T LTP YLI +N C+G + AE LQ
Sbjct: 310 LYF-EGASMT---LTPAEYLIRQASAANAPIWCMGWQSMGSAESELQ 352
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 100/379 (26%), Positives = 175/379 (46%), Gaps = 44/379 (11%)
Query: 22 SSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA 81
+SS+ L + + ++H ++ GYY +YIG P++ + L +D+GS +T++ C A
Sbjct: 62 ASSARRGLGDGHNPNARMRLHDDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPC-A 120
Query: 82 PCVRC----------VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDP-AQCDYELE 130
C +C +EA P ++P DL P+ ++ C++ +QC YE +
Sbjct: 121 TCEQCGNHQSESPNIIEAHDPRFQP--DLSSTYSPVKCNVDC----TCDNERSQCTYERQ 174
Query: 131 YADGGSSLGVLVKDAFAFNYTNGQRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSS 189
YA+ SS GVL +D +F + L P R GC + DGI+GLG+G+ S
Sbjct: 175 YAEMSSSSGVLGEDIMSFGKES--ELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLS 232
Query: 190 IVSQLHSQKLIRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAE 247
I+ QL + +I + C G GGG + G + +V++ + + YY+ + E
Sbjct: 233 IMDQLVEKGVISDSFSLCYGGMDVGGGTMVLG-GMPAPPDMVFSHSNPVRSPYYNIELKE 291
Query: 248 LFFGGETTGLKNLPVVF--------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPE 299
+ G+ L P +F DSG++Y YL + + ++++ P+
Sbjct: 292 IHVAGKALRLD--PKIFNSKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPD 349
Query: 300 DETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCL 357
+C+ G +NV + + F + + F +G+ L+PE YL +K G CL
Sbjct: 350 PNYKDICFAGAG--RNVSQLSEVFPDVDMVFGNGQK---LSLSPENYLFRHSKVEGAYCL 404
Query: 358 GILNGAEVGLQDLNVIGGI 376
G+ G ++GGI
Sbjct: 405 GVFQN---GKDPTTLLGGI 420
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 92/343 (26%), Positives = 146/343 (42%), Gaps = 47/343 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP--- 96
TG Y + +G P + +++ +DTGSD+ W+ C + C + PH LY P
Sbjct: 85 TGLYYTEVRLGTPPKRFYVQVDTGSDILWVNC----ITCDQCPHKSGLGLDLTLYDPKAS 140
Query: 97 -SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG-- 153
+ V C+ CA C C+Y + Y DG S++G V DA F+ G
Sbjct: 141 STGSTVMCDQGFCADTFGGRLPKCSANVPCEYSVTYGDGSSTVGSFVNDALQFDQVTGDG 200
Query: 154 --QRLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
Q N + GCG Q G+S LDGILG G+ +S++SQL + ++ + HCL
Sbjct: 201 QTQPANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCLD 260
Query: 210 G-GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNL 260
GGG GD + +V T + +D +Y+ + + GG T L +
Sbjct: 261 TIKGGGIFAIGDVV--QPKVKTTPLVAD-KPHYNVNLKTIDVGGTTLELPADIFKPGEKR 317
Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
+ DSG++ TYL + ++ + + + + + + LC F+ V
Sbjct: 318 GTIIDSGTTLTYLPELVFKKVMLAVFNKHQDITFHDVQD----FLC------FEYSGSVD 367
Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGA 363
F TL F D ++ P Y + C+G NGA
Sbjct: 368 DGFPTLTFHFEDDLALHVY---PHEYFFPNGNDVYCVGFQNGA 407
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 97/362 (26%), Positives = 168/362 (46%), Gaps = 44/362 (12%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC----------VE 88
++H ++ GYY +YIG P++ + L +D+GS +T++ C A C +C +E
Sbjct: 80 MRLHDDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPC-ATCEQCGNHQSESPNIIE 138
Query: 89 APHPLYRPSNDLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFA 147
A P ++P DL P+ ++ C++ +QC YE +YA+ SS GVL +D +
Sbjct: 139 AHDPRFQP--DLSSTYSPVKCNVDC----TCDNERSQCTYERQYAEMSSSSGVLGEDIMS 192
Query: 148 FNYTNGQRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGH 206
F + L P R GC + DGI+GLG+G+ SI+ QL + +I +
Sbjct: 193 FGKES--ELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSL 250
Query: 207 CLSGG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF 264
C G GGG + G + +V++ + + YY+ + E+ G+ L P +F
Sbjct: 251 CYGGMDVGGGTMVLG-GMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLD--PKIF 307
Query: 265 --------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNV 316
DSG++Y YL + + ++++ P+ +C+ G +NV
Sbjct: 308 NSKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAG--RNV 365
Query: 317 HDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIG 374
+ + F + + F +G+ L+PE YL +K G CLG+ G ++G
Sbjct: 366 SQLSEVFPDVDMVFGNGQK---LSLSPENYLFRHSKVEGAYCLGVFQN---GKDPTTLLG 419
Query: 375 GI 376
GI
Sbjct: 420 GI 421
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 98/356 (27%), Positives = 159/356 (44%), Gaps = 42/356 (11%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
++H ++ GYY ++IG P + + L +DTGS +T++ C C +C P ++P +
Sbjct: 72 MRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCST-CEQCGRHQDPKFQPES 130
Query: 99 DLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
P+ ++ NC+ D QC YE +YA+ +S GVL +D +F N L
Sbjct: 131 S--STYQPVKCTIDC----NCDSDRMQCVYERQYAEMSTSSGVLGEDLISFG--NQSELA 182
Query: 158 P-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGG 213
P R GC + DGI+GLG+G SI+ QL + +I + C G GGG
Sbjct: 183 PQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMDVGGG 242
Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV----------- 262
+ G + S + + + YY+ + E+ G K LP+
Sbjct: 243 AMVLGG--ISPPSDMAFAYSDPVRSPYYNIDLKEIHVAG-----KRLPLNANVFDGKHGT 295
Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
V DSG++Y YL + + KEL + P+ +C+ G +V + K
Sbjct: 296 VLDSGTTYAYLPEAAFLAFKDAIVKELQSLKKISGPDPNYNDICFSGAG--IDVSQLSKS 353
Query: 323 FRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
F + + F +G+ T L+PE Y+ +K G CLG+ G ++GGI
Sbjct: 354 FPVVDMVFENGQKYT---LSPENYMFRHSKVRGAYCLGVFQN---GNDQTTLLGGI 403
>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 488
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 103/367 (28%), Positives = 160/367 (43%), Gaps = 49/367 (13%)
Query: 39 FQVHGNVYP-TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP------- 90
F V G+ P G Y + +G PAR + + +DTGSD+ W+ C +PC C ++
Sbjct: 71 FSVKGSSNPFVGLYFTKVKLGNPAREFNVQIDTGSDILWVTC-SPCDGCPDSSGLGIELN 129
Query: 91 --HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
S ++PC DPICA++ C Y Y D + G V D+ F
Sbjct: 130 LFDTTKSSSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHF 189
Query: 149 NYTNGQRL----NPRLALGCG---YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
+ G+ + + GC Y + A+ LDGI G G+G+ S++SQL S+ +
Sbjct: 190 DILLGESTIANSSATIVFGCSIYQYGDLTRAT-KALDGIFGFGQGEFSVISQLSSRGITP 248
Query: 202 NVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSM---SSDYT-KYYSPGVA-ELFFGGET 254
V HCL GG GGG L G+ L S +V++ + YT K S ++ +LF
Sbjct: 249 KVFSHCLKGGENGGGILVLGEILEPS--IVYSPLIPSQPHYTLKLQSIALSGQLFPNPTM 306
Query: 255 TGLKNL-PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPF 313
+ N + DSG++ YL Y + S++ +S + P +G + F
Sbjct: 307 FPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSA---------TPTISRGSQCF 357
Query: 314 KNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYL----IISNKGNVCLGILNGAEVGLQD 369
+ V F L +F + +TPE YL I+ C+G AE G
Sbjct: 358 RVSMSVADIFPVLRFNFEGIASMV---VTPEEYLQFDSIVREPALWCIG-FQKAEDG--- 410
Query: 370 LNVIGGI 376
LN++G +
Sbjct: 411 LNILGDL 417
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 90/323 (27%), Positives = 150/323 (46%), Gaps = 31/323 (9%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
+++ ++ GYY ++IG P + + L +DTGS +T++ C + C +C P + P
Sbjct: 78 MRLYDDLLLNGYYTTRIWIGTPPQTFALIVDTGSTVTYVPC-STCEQCGRHQDPKFEP-- 134
Query: 99 DLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
+L P+ ++ C++ QC YE +YA+ SS GVL +D +F N L
Sbjct: 135 ELSSTYQPVSCNIDC----TCDNERKQCVYERQYAEMSSSSGVLGEDIISFG--NQSELV 188
Query: 158 PRLALGCGYNQVPGASY-HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGG 213
P+ A+ NQ G Y DGI+GLG+G SIV QL + +I + C G GGG
Sbjct: 189 PQRAIFGCENQETGDLYSQRADGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMDIGGG 248
Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF--------D 265
+ G + S +V+ ++YY+ + + G+ L P +F D
Sbjct: 249 AMILGG--ISPPSGMVFAESDPVRSQYYNIDLKAIHVAGKQLHLD--PSIFDGKHGTVLD 304
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
SG++Y YL + M KEL++ P+ +C+ G +V + F
Sbjct: 305 SGTTYAYLPEAAFTAFKDAMMKELTSLKQIHGPDPNYNDICFSGAE--SDVSQLSNTFPA 362
Query: 326 LALSFTDGKTRTLFELTPEAYLI 348
+ + F++G+ L+PE YL
Sbjct: 363 VEMVFSNGQK---LSLSPENYLF 382
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 113/368 (30%), Positives = 158/368 (42%), Gaps = 58/368 (15%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND- 99
V G +G Y V +G P + + L +DTGSDL ++QC APC C E PLY+PSN
Sbjct: 24 VSGTTLGSGQYFVDFSLGTPEQKFHLIVDTGSDLAFVQC-APCDLCYEQDGPLYQPSNSS 82
Query: 100 ---LVPCEDPICASLHAPGHHNC-----EDPAQ--CDYELEYADGGSSLGVLVKDAFAFN 149
VPC+ C + AP C E P Q C YE Y D S++GV A+
Sbjct: 83 TFTPVPCDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVF---AYETA 139
Query: 150 YTNGQRLNPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
G R+N +A GCG NQ S+ G+LGLG+G S SQ + N +CL
Sbjct: 140 TVGGIRVN-HVAFGCGNRNQ---GSFVSAGGVLGLGQGALSFTSQ--AGYAFENKFAYCL 193
Query: 209 SG-----GGGGFLFFGDDL----YDSSRVVWTSMSSDYTKYYSPGVAELFFGGET----- 254
+ L FGDD+ +D S + + YY + + FGGET
Sbjct: 194 TSYLSPTSVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYV-QIVRICFGGETLLIPD 252
Query: 255 -----TGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
+ N +FDSG++ TY + Y + + +K S + P + LPLC
Sbjct: 253 SAWKIDSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEK--SVPYPRAPPSPQGLPLC--- 307
Query: 310 RRPFKNVHDVK-KCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQ 368
NV + + + + F G T + Y I + CL +L + G
Sbjct: 308 ----VNVSGIDHPIYPSFTIEFDQGAT---YRPNQGNYFIEVSPNIDCLAMLESSSDG-- 358
Query: 369 DLNVIGGI 376
NVIG I
Sbjct: 359 -FNVIGNI 365
>gi|172034220|gb|ACB69715.1| putative nucellin-like aspartic protease [Hordeum vulgare]
Length = 310
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 80/239 (33%), Positives = 117/239 (48%), Gaps = 22/239 (9%)
Query: 148 FNYTNGQRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVG 205
FN NG R LG ++Q P GILGL S+ SQL S+ +I NV G
Sbjct: 3 FNRYNGGR-KASFVLGVTFDQQGQLLSSPAKTSGILGLSSAAISLPSQLASKGIISNVFG 61
Query: 206 HCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGET--TGLKNLP 261
HC++ GGG++F GDD + W + Y ++ +G + G+ +
Sbjct: 62 HCITRETNGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQELHAGIP-VQ 120
Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
V+ G+SYTYL Y+ L +K++ + S + D TLPLCWK V+
Sbjct: 121 VISRCGTSYTYLPEEMYKNLIDAIKED--SPSFVQDSSDTTLPLCWKAD------FSVRS 172
Query: 322 CFRTLALSFTDGKTRTL----FELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
F+ L L F G+ + F + P+ YLIIS+KGNVCLG+LNG E+ ++G +
Sbjct: 173 FFKPLNLHF--GRRWFVVPKTFTIVPDDYLIISDKGNVCLGLLNGTEINHGSTIIVGDV 229
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 101/345 (29%), Positives = 150/345 (43%), Gaps = 49/345 (14%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC------VE-APHPLYRPSN-DL 100
G Y + +G P+R + + +DTGSD+ W+ C A C+RC VE P+ + S
Sbjct: 83 GLYFAKIGLGTPSRDFHVQVDTGSDILWVNC-AGCIRCPRKSDLVELTPYDVDASSTAKS 141
Query: 101 VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR----L 156
V C D C+ ++ C + C Y + Y DG S+ G LVKD + G R
Sbjct: 142 VSCSDNFCSYVNQ--RSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGST 199
Query: 157 NPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
N + GCG Q G S +DGI+G G+ SS +SQL SQ ++ HCL GG
Sbjct: 200 NGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGG 259
Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK--------NLPVVFDS 266
+F ++ S +V T M S + +YS + + G L + V+ DS
Sbjct: 260 GIFAIGEVV-SPKVKTTPMLSK-SAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDS 317
Query: 267 GSSYTYLNRVTYQ-TLTSIMKK--ELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
G++ YL Y L I+ EL+ +++E+ F H K
Sbjct: 318 GTTLVYLPDAVYNPLLNEILASHPELTLHTVQES---------------FTCFHYTDKLD 362
Query: 324 RTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQ 368
R ++F K+ +L + P YL + C G NG GLQ
Sbjct: 363 RFPTVTFQFDKSVSL-AVYPREYLFQVREDTWCFGWQNG---GLQ 403
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 96/335 (28%), Positives = 145/335 (43%), Gaps = 38/335 (11%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
G T Y ++ +G PA ++LDTGSD +W+QC PC C E L+ PS
Sbjct: 126 GKYLDTTNYFTSLRLGTPATDLLVELDTGSDQSWIQCK-PCPDCYEQHEALFDPSKSSTY 184
Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
+ C C L + HNC +C YE+ YAD ++G L +D + T+ P
Sbjct: 185 SDITCSSRECQELGSSHKHNCSSDKKCPYEITYADDSYTVGNLARDTLTLSPTDAV---P 241
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFL 216
GCG+N S+ +DG+LGLG+GK+S+ SQ+ ++ +CL S G+L
Sbjct: 242 GFVFGCGHNNA--GSFGEIDGLLGLGRGKASLSSQVAAR--YGAGFSYCLPSSPSATGYL 297
Query: 217 -FFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFGGETTGLKNLPVVF--------DS 266
F G + +T M + + +Y + + G +K P VF DS
Sbjct: 298 SFSGAAAAAPTNAQFTEMVAGQHPSFYYLNLTGITVAGR--AIKVPPSVFATAAGTIIDS 355
Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
G++++ L Y L S ++ + K AP C+ H+ + ++
Sbjct: 356 GTAFSCLPPSAYAALRSSVRSAMG--RYKRAPSSTIFDTCYD-----LTGHETVR-IPSV 407
Query: 327 ALSFTDGKTRTLFELTPEAYLII-SNKGNVCLGIL 360
AL F DG T L P L SN CL L
Sbjct: 408 ALVFADGAT---VHLHPSGVLYTWSNVSQTCLAFL 439
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 96/351 (27%), Positives = 161/351 (45%), Gaps = 33/351 (9%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
+++ ++ GYY ++IG P + + L +DTGS +T++ C + C C + P ++P
Sbjct: 76 MRLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPC-SDCEHCGKHQDPRFQP-- 132
Query: 99 DLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
D P+ ++ NC+ D C YE YA+ SS GVL +D +F + +
Sbjct: 133 DESSTYHPVKCNMDC----NCDHDGVNCVYERRYAEMSSSSGVLGEDIISFG-NQSEVVP 187
Query: 158 PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGGG 214
R GC + DGI+GLG+G+ SIV QL + +I + C G GGG
Sbjct: 188 QRAVFGCENVETGDLYSQRADGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHVGGGA 247
Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL------KNLPVVFDSGS 268
+ G + +V++ + YY+ + E+ G+ L + V DSG+
Sbjct: 248 MVLGG--IPPPPDMVFSRSDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKHGTVLDSGT 305
Query: 269 SYTYLNRVTYQTLT-SIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
+Y YL + +I+KK + K + P+ +C+ G ++V + K F +
Sbjct: 306 TYAYLPEEAFVAFRDAIIKKSHNLKQI-HGPDPNYNDICFSGAG--RDVSQLSKAFPEVD 362
Query: 328 LSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
+ F++G+ LTPE YL K G CLGI + ++GGI
Sbjct: 363 MVFSNGQK---LSLTPENYLFQHTKVHGAYCLGIFRNGD----STTLLGGI 406
>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 491
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 94/334 (28%), Positives = 146/334 (43%), Gaps = 41/334 (12%)
Query: 39 FQVHGNVYP-TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP------- 90
F V G+ P G Y + +G PAR + + +DTGSD+ W+ C +PC C ++
Sbjct: 71 FSVKGSSNPFVGLYFTKVKLGNPAREFNVQIDTGSDILWVTC-SPCDGCPDSSGLGIELN 129
Query: 91 --HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
S ++PC DPICA++ C Y Y D + G V D+ F
Sbjct: 130 LFDTTKSSSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHF 189
Query: 149 NYTNGQRL----NPRLALGCG---YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
+ G+ + + GC Y + A+ LDGI G G+G+ S++SQL S+ +
Sbjct: 190 DILLGESTIANSSATIVFGCSIYQYGDLTRAT-KALDGIFGFGQGEFSVISQLSSRGITP 248
Query: 202 NVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSM---SSDYT-KYYSPGVA-ELFFGGET 254
V HCL GG GGG L G+ L S +V++ + YT K S ++ +LF
Sbjct: 249 KVFSHCLKGGENGGGILVLGEILEPS--IVYSPLIPSQPHYTLKLQSIALSGQLFPNPTM 306
Query: 255 TGLKNL-PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPF 313
+ N + DSG++ YL Y + S++ +S + P +G + F
Sbjct: 307 FPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSA---------TPTISRGSQCF 357
Query: 314 KNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYL 347
+ V F L +F + +TPE YL
Sbjct: 358 RVSMSVADIFPVLRFNFEGIASMV---VTPEEYL 388
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 97/359 (27%), Positives = 157/359 (43%), Gaps = 40/359 (11%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
S+ +++ ++ GYY ++IG P + + L +DTGS +T++ C C +C P +
Sbjct: 67 SNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCST-CEQCGRHQDPKF 125
Query: 95 RPSNDLVPCEDPICASLHAPGHHNCE-----DPAQCDYELEYADGGSSLGVLVKDAFAFN 149
DP +S + P N + D QC YE +YA+ +S GVL +D +F
Sbjct: 126 ----------DPESSSTYKPIKCNIDCICDSDGVQCVYERQYAEMSTSSGVLGEDVISFG 175
Query: 150 YTNGQRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
N L P R GC + DGI+GLG G S+V QL + I + C
Sbjct: 176 --NQSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCY 233
Query: 209 SG---GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN------ 259
G GGG + G + S +++T + YY+ + E+ G+ L +
Sbjct: 234 GGMDIGGGAMVLGG--ISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGR 291
Query: 260 LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
V DSG++Y YL + + E+ + + P+ +C+ G + ++
Sbjct: 292 YGAVLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAG--SDAAEL 349
Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
F T+ + F +G+ LTPE Y +K G CLGI E G ++GGI
Sbjct: 350 SNKFPTVDMVFENGQK---LSLTPENYFFRHSKVHGAYCLGIF---ENGNDQTTLLGGI 402
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 97/359 (27%), Positives = 157/359 (43%), Gaps = 40/359 (11%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY 94
S+ +++ ++ GYY ++IG P + + L +DTGS +T++ C C +C P +
Sbjct: 67 SNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCST-CEQCGRHQDPKF 125
Query: 95 RPSNDLVPCEDPICASLHAPGHHNCE-----DPAQCDYELEYADGGSSLGVLVKDAFAFN 149
DP +S + P N + D QC YE +YA+ +S GVL +D +F
Sbjct: 126 ----------DPESSSTYKPIKCNIDCICDSDGVQCVYERQYAEMSTSSGVLGEDVISFG 175
Query: 150 YTNGQRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
N L P R GC + DGI+GLG G S+V QL + I + C
Sbjct: 176 --NQSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCY 233
Query: 209 SG---GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN------ 259
G GGG + G + S +++T + YY+ + E+ G+ L +
Sbjct: 234 GGMDIGGGAMVLGG--ISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGR 291
Query: 260 LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
V DSG++Y YL + + E+ + + P+ +C+ G + ++
Sbjct: 292 YGAVLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAG--SDAAEL 349
Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
F T+ + F +G+ LTPE Y +K G CLGI E G ++GGI
Sbjct: 350 SNKFPTVDMVFENGQK---LSLTPENYFFRHSKVHGAYCLGIF---ENGNDQTTLLGGI 402
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 94/348 (27%), Positives = 146/348 (41%), Gaps = 51/348 (14%)
Query: 33 VGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP 90
VG + F V G P G Y + +G PA+ +++ +DTGSD+ W+ C + C P
Sbjct: 63 VGGVVDFSVQGTSDPYFVGLYFTKVKLGSPAKEFYVQIDTGSDILWINC----ITCSNCP 118
Query: 91 HP------------LYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSS 137
H + LV C DPIC+ C A QC Y +Y DG +
Sbjct: 119 HSSGLGIELDFFDTAGSSTAALVSCGDPICSYAVQTATSECSSQANQCSYTFQYGDGSGT 178
Query: 138 LGVLVKDAFAFNYTN-GQRL----NPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSI 190
G V D F+ GQ + + + GC Q + +DGI G G G S+
Sbjct: 179 TGYYVSDTMYFDTVLLGQSVVANSSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSV 238
Query: 191 VSQLHSQKLIRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAEL 248
+SQL S+ + V HCL GG GGG L G+ L S +V++ + +Y+ + +
Sbjct: 239 ISQLSSRGVTPKVFSHCLKGGENGGGVLVLGEILEPS--IVYSPLVPS-QPHYNLNLQSI 295
Query: 249 FFGGETTGL--------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPED 300
G+ + N + DSG++ YL + Y K++ A
Sbjct: 296 AVNGQLLPIDSNVFATTNNQGTIVDSGTTLAYLVQEAYNPFV---------KAITAAVSQ 346
Query: 301 ETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI 348
+ P+ KG + + + V F ++L+F G + L PE YL+
Sbjct: 347 FSKPIISKGNQCYLVSNSVGDIFPQVSLNFMGGASMV---LNPEHYLM 391
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 97/345 (28%), Positives = 152/345 (44%), Gaps = 49/345 (14%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC------VE-APHPLYRPSN-DL 100
G Y + +G P+R + + +DTGSD+ W+ C A C+RC VE P+ S
Sbjct: 83 GLYFAKIGLGTPSRDFHVQVDTGSDILWVNC-AGCIRCPRKSDLVELTPYDADASSTAKS 141
Query: 101 VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR----L 156
V C D C+ ++ C + C Y + Y DG S+ G LV+D + G R
Sbjct: 142 VSCSDNFCSYVNQ--RSECHSGSTCQYVILYGDGSSTNGYLVRDVVHLDLVTGNRQTGST 199
Query: 157 NPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
N + GCG Q G S +DGI+G G+ SS +SQL SQ ++ HCL GG
Sbjct: 200 NGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGG 259
Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK--------NLPVVFDS 266
+F ++ S +V T M S + +YS + + G L + V+ DS
Sbjct: 260 GIFAIGEVV-SPKVKTTPMLSK-SAHYSVNLNAIEVGNSVLQLSSDAFDSGDDKGVIIDS 317
Query: 267 GSSYTYLNRVTYQTLTSIM---KKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
G++ YL Y L + + +EL+ +++++ F H + +
Sbjct: 318 GTTLVYLPDAVYNPLMNQILASHQELNLHTVQDS---------------FTCFHYIDRLD 362
Query: 324 RTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQ 368
R ++F K+ +L + P+ YL + C G NG GLQ
Sbjct: 363 RFPTVTFQFDKSVSL-AVYPQEYLFQVREDTWCFGWQNG---GLQ 403
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 98/350 (28%), Positives = 140/350 (40%), Gaps = 66/350 (18%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LY----R 95
G Y + IG PAR Y++ +DTGSD+ W+ C ++C E P LY
Sbjct: 95 VGLYYAKIGIGTPARDYYVQVDTGSDIMWVNC----IQCNECPKKSSLGMELTLYDIKES 150
Query: 96 PSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ- 154
+ LV C+ C +++ C C Y YADG SS G V+D ++ +G
Sbjct: 151 LTGKLVSCDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDL 210
Query: 155 ---RLNPRLALGCGYNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
N + GC Q +S LDGILG GK +S++SQL S +R + HCL G
Sbjct: 211 ETTSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDG 270
Query: 211 -GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-------- 261
GGG G + +V T + + T +Y+ + + GG NLP
Sbjct: 271 LNGGGIFAIGHIV--QPKVNTTPLVPNQT-HYNVNMKAVEVGGY---FLNLPTDVFDVGD 324
Query: 262 ---VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
+ DSG++ YL V Y L S + W+ +HD
Sbjct: 325 KKGTIIDSGTTLAYLPEVVYDQLLSKI-------------------FSWQSDLKVHTIHD 365
Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYL-------IISNKGNVCLGILN 361
CF+ + S DG F YL + S G C+G N
Sbjct: 366 QFTCFQ-YSESLDDGFPAVTFHFENSLYLKVHPHEYLFSYDGLWCIGWQN 414
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 90/347 (25%), Positives = 148/347 (42%), Gaps = 42/347 (12%)
Query: 50 YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCED 105
Y+ T+ +G P R + + +DTGS +T++ C C C + + P + C D
Sbjct: 12 YFYTTLKLGTPERTFSVIIDTGSTITYIPCKD-CSHCGKHTAEWFDPDKSTTAKKLACGD 70
Query: 106 PICASLHAPGHHNCEDPA------QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
P+C NC P+ +C Y YA+ SS G +++D F F ++ R
Sbjct: 71 PLC---------NCGTPSCTCNNDRCYYSRTYAERSSSEGWMIEDTFGFPDSDSPV---R 118
Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFG 219
L GC + DGI+G+G ++ SQL +K+I +V C G L G
Sbjct: 119 LVFGCENGETGEIYRQMADGIMGMGNNHNAFQSQLVQRKVIEDVFSLCFGYPKDGILLLG 178
Query: 220 D-DLYDSSRVVWTSMSSD-YTKYYSPGVAELFFGGETTGL------KNLPVVFDSGSSYT 271
D L + + V+T + + + YY+ + + G+T + V DSG+++T
Sbjct: 179 DVTLPEGANTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFDRGYGTVLDSGTTFT 238
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAP--EDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
YL ++ + + + K L+ P + + +CWKG D+ K F
Sbjct: 239 YLPTDAFKAMAKAVGDYVEKKGLQSTPGADPQYNDICWKGAP--DQFKDLDKYFPPAEFV 296
Query: 330 FTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
F G T L P YL +S CLGI + G ++GG+
Sbjct: 297 FGGGAKLT---LPPLRYLFLSKPAEYCLGIFDNGNSGA----LVGGV 336
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 96/363 (26%), Positives = 151/363 (41%), Gaps = 55/363 (15%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH--------PLYR----P 96
G Y + IG PA+ Y++ +DTGSD+ W+ C ++C + P LY
Sbjct: 78 GLYYAKIGIGTPAKSYYVQVDTGSDIMWVNC----IQCKQCPRRSTLGIELTLYNIDESD 133
Query: 97 SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG--- 153
S LV C+D C + C+ C Y Y DG S+ G VKD ++ G
Sbjct: 134 SGKLVSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLK 193
Query: 154 -QRLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
Q N + GCG Q + ++ LDGILG GK SS++SQL S ++ + HCL
Sbjct: 194 TQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLD 253
Query: 210 GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY------------YSPGVAELFFGGETTGL 257
G GG +F + +V T + + Y + A+LF G+ G
Sbjct: 254 GRNGGGIFAIGRVV-QPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKG- 311
Query: 258 KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
+ DSG++ YL + Y+ L + + A + +D + F+
Sbjct: 312 ----AIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKD---------YKCFQYSG 358
Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIG 377
V + F + F + + P YL ++G C+G N A + +D + +G
Sbjct: 359 RVDEGFPNVTFHF---ENSVFLRVYPHDYL-FPHEGMWCIGWQNSA-MQSRDRRNMTLLG 413
Query: 378 DFV 380
D V
Sbjct: 414 DLV 416
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 104/369 (28%), Positives = 166/369 (44%), Gaps = 43/369 (11%)
Query: 35 SSLLFQVHGNVYPT--GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----V 87
S++ + GN +P+ G Y + IG P++ Y++ +DTGSD+ W+ C A C RC +
Sbjct: 56 SAVDLPLGGNGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNC-AGCDRCPTKSDL 114
Query: 88 EAPHPLY----RPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVK 143
LY ++D V C+D C+ P C+ QC Y + Y DG S+ G V+
Sbjct: 115 GVDLTLYDMKASTTSDAVGCDDNFCSLYDGP-LPGCKPGLQCLYSVLYGDGSSTTGYFVQ 173
Query: 144 DAFAFNYTNGQ----RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQ 197
D +N +G N + GCG Q G+S LDGILG G+ SS++SQL S
Sbjct: 174 DFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASS 233
Query: 198 KLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL 257
++ V HCL GG +F ++ + +V T + + +Y+ + E+ GG+ +
Sbjct: 234 GKVKKVFSHCLDNVDGGGIFAIGEVVE-PKVNITPLVQN-QAHYNVVMKEIEVGGDPLDV 291
Query: 258 --------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
+ DSG++ Y + Y L K L + P D L +
Sbjct: 292 PSDAFESGDRKGTIIDSGTTLAYFPQEVYVPLIE--------KILSQQP-DLRLHTVEQA 342
Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILN-GAEVGL- 367
F +V F T+ L F + T++ P YL + C+G N GA+
Sbjct: 343 FTCFDYTGNVDDGFPTVTLHFDKSISLTVY---PHEYLFQVKEFEWCIGWQNSGAQTKDG 399
Query: 368 QDLNVIGGI 376
+DL ++G +
Sbjct: 400 KDLTLLGDL 408
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 103/361 (28%), Positives = 162/361 (44%), Gaps = 43/361 (11%)
Query: 43 GNVYPT--GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLY- 94
GN +P+ G Y + IG P++ Y++ +DTGSD+ W+ C A C RC + LY
Sbjct: 145 GNGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNC-AGCDRCPTKSDLGVDLTLYD 203
Query: 95 ---RPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
++D V C+D C+ P C+ QC Y + Y DG S+ G V+D +N
Sbjct: 204 MKASTTSDAVGCDDNFCSLYDGP-LPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRI 262
Query: 152 NGQ----RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVG 205
+G N + GCG Q G+S LDGILG G+ SS++SQL S ++ V
Sbjct: 263 SGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFS 322
Query: 206 HCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL-------- 257
HCL GG +F ++ + +V T + + +Y+ + E+ GG+ +
Sbjct: 323 HCLDNVDGGGIFAIGEVVE-PKVNITPLVQN-QAHYNVVMKEIEVGGDPLDVPSDAFESG 380
Query: 258 KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
+ DSG++ Y + Y L K L + P D L + F
Sbjct: 381 DRKGTIIDSGTTLAYFPQEVYVPLIE--------KILSQQP-DLRLHTVEQAFTCFDYTG 431
Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILN-GAEVGL-QDLNVIGG 375
+V F T+ L F + T++ P YL + C+G N GA+ +DL ++G
Sbjct: 432 NVDDGFPTVTLHFDKSISLTVY---PHEYLFQVKEFEWCIGWQNSGAQTKDGKDLTLLGD 488
Query: 376 I 376
+
Sbjct: 489 L 489
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 93/349 (26%), Positives = 142/349 (40%), Gaps = 59/349 (16%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP---------HPLYRPSN 98
TG Y + +G P + Y++ +DTGSD+ W+ C + C +C P S
Sbjct: 81 TGLYFTEIKLGTPPKRYYVQVDTGSDILWVNCIS-CEKCPRKSGLGLDLTFYDPKASSSG 139
Query: 99 DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG----Q 154
V C+ CA+ + C C+Y + Y DG S+ G V DA F+ G Q
Sbjct: 140 STVSCDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFVTDALQFDQVTGDGQTQ 199
Query: 155 RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-G 211
N + GCG Q G+S LDGILG G+ +S++SQL + ++ + HCL
Sbjct: 200 PGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCLDTIK 259
Query: 212 GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNLPVV 263
GGG G+ + +V T + +D +Y+ + + GG T L + +
Sbjct: 260 GGGIFAIGNVV--QPKVKTTPLVAD-MPHYNVNLKSIDVGGTTLQLPAHVFETGERKGTI 316
Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD----- 318
DSG++ TYL + + KE+ A + + F NV D
Sbjct: 317 IDSGTTLTYLPELVF--------KEVMAAIFNKHQD-----------IVFHNVQDFMCFQ 357
Query: 319 ----VKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGA 363
V F T+ F D + P Y + C+G NGA
Sbjct: 358 YPGSVDDGFPTITFHFED---DLALHVYPHEYFFPNGNDMYCVGFQNGA 403
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 94/348 (27%), Positives = 145/348 (41%), Gaps = 51/348 (14%)
Query: 33 VGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP 90
VG + F V G P G Y + +G PA+ +++ +DTGSD+ W+ C + C P
Sbjct: 63 VGGVVDFSVQGTSDPYFVGLYFTKVKLGSPAKDFYVQIDTGSDILWINC----ITCSNCP 118
Query: 91 HP------------LYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSS 137
H + LV C DPIC+ C A QC Y +Y DG +
Sbjct: 119 HSSGLGIELDFFDTAGSSTAALVSCADPICSYAVQTATSGCSSQANQCSYTFQYGDGSGT 178
Query: 138 LGVLVKDAFAFNYTN-GQRL----NPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSI 190
G V D F+ GQ + + + GC Q + +DGI G G G S+
Sbjct: 179 TGYYVSDTMYFDTVLLGQSMVANSSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSV 238
Query: 191 VSQLHSQKLIRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAEL 248
+SQL S+ + V HCL GG GGG L G+ L S +V++ + +Y+ + +
Sbjct: 239 ISQLSSRGVTPKVFSHCLKGGENGGGVLVLGEILEPS--IVYSPLVPSL-PHYNLNLQSI 295
Query: 249 FFGGETTGL--------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPED 300
G+ + N + DSG++ YL + Y + +S S
Sbjct: 296 AVNGQLLPIDSNVFATTNNQGTIVDSGTTLAYLVQEAYNPFVDAITAAVSQFS------- 348
Query: 301 ETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI 348
P+ KG + + + V F ++L+F G + L PE YL+
Sbjct: 349 --KPIISKGNQCYLVSNSVGDIFPQVSLNFMGGASMV---LNPEHYLM 391
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 98/350 (28%), Positives = 140/350 (40%), Gaps = 66/350 (18%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LY----R 95
G Y + IG PAR Y++ +DTGSD+ W+ C ++C E P LY
Sbjct: 95 VGLYYAKIGIGTPARDYYVQVDTGSDIMWVNC----IQCNECPKKSSLGMELTLYDIKES 150
Query: 96 PSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ- 154
+ LV C+ C +++ C C Y YADG SS G V+D ++ +G
Sbjct: 151 LTGKLVSCDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDL 210
Query: 155 ---RLNPRLALGCGYNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
N + GC Q +S LDGILG GK +S++SQL S +R + HCL G
Sbjct: 211 ETTSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDG 270
Query: 211 -GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-------- 261
GGG G + +V T + + T +Y+ + + GG NLP
Sbjct: 271 LNGGGIFAIGHIV--QPKVNTTPLVPNQT-HYNVNMKAVEVGGY---FLNLPTDVFDVGD 324
Query: 262 ---VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
+ DSG++ YL V Y L S + W+ +HD
Sbjct: 325 KKGTIIDSGTTLAYLPEVVYDQLLSKI-------------------FSWQSDLKVHTIHD 365
Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYL-------IISNKGNVCLGILN 361
CF+ + S DG F YL + S G C+G N
Sbjct: 366 QFTCFQ-YSESLDDGFPAVTFHFENSLYLKVHPHEYLFSYDGLWCIGWQN 414
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 98/341 (28%), Positives = 153/341 (44%), Gaps = 32/341 (9%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPIC 108
GYY ++IG P + + L +DTGS +T++ C C C P +RP + P+
Sbjct: 91 GYYTTRLWIGTPPQRFALIVDTGSTVTYVPCST-CKHCGSHQDPKFRP--EASETYQPVK 147
Query: 109 ASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL-GCGY 166
+ NC+D QC YE YA+ +S GVL +D +F N L+P+ A+ GC
Sbjct: 148 CTWQC----NCDDDRKQCTYERRYAEMSTSSGVLGEDVVSFG--NQSELSPQRAIFGCEN 201
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFG-DDLYDS 225
++ DGI+GLG+G SI+ QL +K+I + C G G G +
Sbjct: 202 DETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMGVGGGAMVLGGISPP 261
Query: 226 SRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF--------DSGSSYTYLNRVT 277
+ +V+T + YY+ + E+ G+ L P VF DSG++Y YL
Sbjct: 262 ADMVFTHSDPVRSPYYNIDLKEIHVAGKRLHLN--PKVFDGKHGTVLDSGTTYAYLPESA 319
Query: 278 YQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRT 337
+ + KE + P+ +C+ G NV + K F + + F +G
Sbjct: 320 FLAFKHAIMKETHSLKRISGPDPHYNDICFSGAE--INVSQLSKSFPVVEMVFGNGHK-- 375
Query: 338 LFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
L+PE YL +K G CLG+ + G ++GGI
Sbjct: 376 -LSLSPENYLFRHSKVRGAYCLGVFSN---GNDPTTLLGGI 412
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 99/363 (27%), Positives = 163/363 (44%), Gaps = 54/363 (14%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP--- 96
TG Y + IG PA+ Y++ +DTGSD+ W+ C V C P +Y P
Sbjct: 87 TGLYFTRIGIGTPAKRYYVQVDTGSDILWVNC----VSCDGCPRKSNLGIELTMYDPRGS 142
Query: 97 -SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
S +LV C+ C + + +C + C+Y + Y DG S+ G V D +N +G
Sbjct: 143 QSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDG 202
Query: 156 ----LNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL- 208
N ++ GCG G+S LDGILG G+ SS++SQL + +R + HCL
Sbjct: 203 QTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLD 262
Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNL 260
+ GGG G+ + +V T + SD +Y+ + + GG GL +
Sbjct: 263 TVNGGGIFAIGNVV--QPKVKTTPLVSD-MPHYNVILKGIDVGGTALGLPTNIFDSGNSK 319
Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIM---KKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
+ DSG++ Y+ Y+ L +++ +++S ++L++ C F+
Sbjct: 320 GTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS-------C------FQYSG 366
Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIG 377
V F + F +G + ++P YL + K C+G NG V +D + +G
Sbjct: 367 SVDDGFPEVTFHF-EGDVSLI--VSPHDYLFQNGKNLYCMGFQNGG-VQTKDGKDMVLLG 422
Query: 378 DFV 380
D V
Sbjct: 423 DLV 425
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 96/356 (26%), Positives = 147/356 (41%), Gaps = 48/356 (13%)
Query: 35 SSLLFQVHGNVYPT--GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC------ 86
S++ Q+ GN +P+ G Y + +G P + Y++ +DTGSD+ W+ C A C C
Sbjct: 56 SAIDLQLGGNGHPSESGLYFAKIGLGTPVQDYYVQVDTGSDILWVNC-AGCTNCPKKSDL 114
Query: 87 ---VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVK 143
+ P +++ V C C S + C C+Y + Y DG S+ G V+
Sbjct: 115 GIELSLYSPSSSSTSNRVTCNQDFCTSTYDGPIPGCTPELLCEYRVAYGDGSSTAGYFVR 174
Query: 144 DAFAFNYTNGQ----RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQ 197
D + G N + GCG Q GA+ LDGILG G+ SS++SQL S
Sbjct: 175 DHVVLDRVTGNFQTTSTNGSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASS 234
Query: 198 KLIRNVVGHCLSG-GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTG 256
++ V HCL GGG G+ + R T+ +Y+ + + E
Sbjct: 235 GKVKRVFAHCLDNINGGGIFAIGEVVQPKVR---TTPLVPQQAHYNVFMKAIEVDNE--- 288
Query: 257 LKNLP-----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
+ NLP + DSG++ Y V Y+ L S + S L E T
Sbjct: 289 VLNLPTDVFDTDLRKGTIIDSGTTLAYFPDVIYEPLISKIFARQSTLKLHTVEEQFTC-- 346
Query: 306 CWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILN 361
F+ +V F T+ F D + T++ P YL + C+G N
Sbjct: 347 -------FEYDGNVDDGFPTVTFHFEDSLSLTVY---PHEYLFDIDSNKWCVGWQN 392
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 99/357 (27%), Positives = 151/357 (42%), Gaps = 42/357 (11%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH------PLY 94
+H ++ GYY ++IG PA+ + L +DTGS +T++ PC C H P +
Sbjct: 89 LHDDLLTKGYYTSRVFIGTPAQEFALIVDTGSTVTYV----PCSSCTHCGHHQACFDPRF 144
Query: 95 RPSN----DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNY 150
+P N V C P C + + QC YE YA+ SS GVL KD F
Sbjct: 145 KPDNSSSYQTVSCNSPDCITKMCDARVH-----QCKYERVYAEMSSSKGVLGKDLLGFG- 198
Query: 151 TNGQRLNPR-LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
NG RL P L GC + DGI+GLG+G SIV QL + + C
Sbjct: 199 -NGSRLQPHPLLFGCETAETGDLYLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYG 257
Query: 210 G--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN------LP 261
G GGG + G + +V+ + + YY+ ++E+ G + + + L
Sbjct: 258 GMDEGGGSMVLG-AIPPPPAMVFAKSDPNRSNYYNLELSEIQVQGVSLNVPSEVFNGRLG 316
Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
V DSG++Y YL + + ++L + P+ +C+ G + + K
Sbjct: 317 TVLDSGTTYAYLPDKAFDAFKDAITQQLGSLQAVPGPDPSYPDVCFAGAG--SDSKALGK 374
Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
F + F+ G + L PE YL K G CLG + ++GGI
Sbjct: 375 HFPPVDFVFS-GNQKVF--LAPENYLFKHTKVPGAYCLGFFKNQDA----TTLLGGI 424
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 101/372 (27%), Positives = 152/372 (40%), Gaps = 61/372 (16%)
Query: 31 NHVGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-- 86
+ VG + F V G+ P G Y + +G P + + +DTGSD+ W+ C + C C
Sbjct: 78 SSVGGVVDFPVQGSSDPYLVGLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSS-CSNCPH 136
Query: 87 ----------VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS 136
+AP L S V C DPIC+S+ C + QC Y Y DG
Sbjct: 137 SSGLGIDLHFFDAPGSLTAGS---VTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSG 193
Query: 137 SLGVLVKDAFAFNYTNGQRL----NPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSI 190
+ G + D F F+ G+ L + + GC Q S +DGI G GKGK S+
Sbjct: 194 TSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSV 253
Query: 191 VSQLHSQKLIRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY----YSPG 244
VSQL S+ + V HCL G GGG G+ L +V++ + Y S G
Sbjct: 254 VSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPG--MVYSPLVPSQPHYNLNLLSIG 311
Query: 245 V--------AELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
V A +F T G + D+G++ TYL + Y +L ++
Sbjct: 312 VNGQMLPLDAAVFEASNTRG-----TIVDTGTTLTYLVKEAY---------DLFLNAISN 357
Query: 297 APEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYL----IISNK 352
+ P+ G + + + F +++L+F G + L P+ YL I
Sbjct: 358 SVSQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGAS---MMLRPQDYLFHYGIYDGA 414
Query: 353 GNVCLGILNGAE 364
C+G E
Sbjct: 415 SMWCIGFQKAPE 426
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 95/328 (28%), Positives = 137/328 (41%), Gaps = 46/328 (14%)
Query: 46 YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV-EAPHP-----LYRPS-- 97
+ TG Y +Y+G P Y++ +DTGSD+TWL C APC CV E P Y PS
Sbjct: 32 FVTGLYYTKIYLGTPPVGYYVQVDTGSDVTWLNC-APCTSCVTETQLPSIKLTTYDPSRS 90
Query: 98 --NDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT-NGQ 154
+ + C D C + +C C Y Y DG S+ G ++D F N
Sbjct: 91 STDGALSCRDSNCGAALGSNEVSCTSAGYCAYSTTYGDGSSTQGYFIQDVMTFQEIHNNT 150
Query: 155 RLN--PRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
++N + GCG Q S LDG++G G+ SI SQL S + N HCL G
Sbjct: 151 QVNGTASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCLQG 210
Query: 211 G--GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGG---------ETTGLKN 259
GGG + G + +T + S +Y+ G+ + G +TT
Sbjct: 211 DNQGGGTIVIGS--VSEPNISYTPIVS--RNHYAVGMQNIAVNGRNVTTPASFDTTSTSA 266
Query: 260 LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
V+ DSG++ YL Y T + + +S + + L L W +
Sbjct: 267 GGVIMDSGTTLAYLVDPAY---TQFVNAVSTFESSMFSSHSQCLQLAW---------CSL 314
Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYL 347
+ F T+ L F G + LTP YL
Sbjct: 315 QADFPTVKLFFDAGA---VMNLTPRNYL 339
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 101/358 (28%), Positives = 145/358 (40%), Gaps = 60/358 (16%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
T Y V + IG P P LDTGSDL W QCDAPC RC P PLY P+ V C
Sbjct: 89 TATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSC 148
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
P+C +L +P C Y Y DG S+ GVL + F R +A G
Sbjct: 149 RSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVR---GVAFG 205
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFGD 220
CG + S G++G+G+G S+VSQL + +C + LF G
Sbjct: 206 CGTENL--GSTDNSSGLVGMGRGPLSLVSQLGVTRF-----SYCFTPFNATAASPLFLG- 257
Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAE------LFFGGETTGLKNLP------------- 261
S+R+ + ++ + S G L G T G LP
Sbjct: 258 ---SSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGD 314
Query: 262 --VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
V+ DSG+++T L + L + + A L LC+ P +V
Sbjct: 315 GGVIIDSGTTFTALEESAFVALARALASRVRLPLASGA--HLGLSLCFAAASP--EAVEV 370
Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNV-CLGILNGAEVGLQDLNVIGGI 376
+ L L F DG EL E+Y++ V CLG+++ + ++V+G +
Sbjct: 371 PR----LVLHF-DGAD---MELRRESYVVEDRSAGVACLGMVSA-----RGMSVLGSM 415
>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
Length = 507
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 107/376 (28%), Positives = 165/376 (43%), Gaps = 53/376 (14%)
Query: 35 SSLLFQVHGNVYPT--GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----V 87
S++ + GN +P+ G Y + IG P++ Y++ +DTGSD+ W+ C A C RC +
Sbjct: 60 SAVDLPLGGNGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNC-AGCDRCPTKSDL 118
Query: 88 EAPHPLY----RPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVK 143
LY ++D V C+D C+ P C+ QC Y + Y DG S+ G V+
Sbjct: 119 GVDLTLYDMKASTTSDAVGCDDNFCSLYDGP-LPGCKPGLQCLYSVLYGDGSSTTGYFVQ 177
Query: 144 DAFAFNYTNGQ----RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQ 197
D +N +G N + GCG Q G+S LDGILG G+ SS++SQL S
Sbjct: 178 DFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASS 237
Query: 198 KLIRNVVGHCLSG-GGGGFLFFGDD--------LYDSSRVVWTSMSSDYTKYYSPGVAEL 248
++ V HCL GGG G+ L +S +V +S +Y+ + E+
Sbjct: 238 GKVKKVFSHCLDNVDGGGIFAIGEVVEPKVRFLLMNSVMIVVLFLSR---AHYNVVMKEI 294
Query: 249 FFGGETTGL--------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPED 300
GG+ + + DSG++ Y + Y L K L + P D
Sbjct: 295 EVGGDPLDVPSDAFESGDRKGTIIDSGTTLAYFPQEVYVPLIE--------KILSQQP-D 345
Query: 301 ETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGIL 360
L + F +V F T+ L F + T++ P YL + C+G
Sbjct: 346 LRLHTVEQAFTCFDYTGNVDDGFPTVTLHFDKSISLTVY---PHEYLFQVKEFEWCIGWQ 402
Query: 361 N-GAEVGL-QDLNVIG 374
N GA+ +DL ++G
Sbjct: 403 NSGAQTKDGKDLTLLG 418
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 96/363 (26%), Positives = 150/363 (41%), Gaps = 55/363 (15%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH--------PLYR----P 96
G Y + IG PA+ Y++ +DTGSD+ W+ C ++C + P LY
Sbjct: 78 GLYYAKIGIGTPAKSYYVQVDTGSDIMWVNC----IQCKQCPRRSTLGIELTLYNIDESD 133
Query: 97 SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG--- 153
S LV C+D C + C+ C Y Y DG S+ G VKD ++ G
Sbjct: 134 SGKLVSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLK 193
Query: 154 -QRLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
Q N + GCG Q + ++ LDGILG GK SS++SQL S ++ + HCL
Sbjct: 194 TQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLD 253
Query: 210 GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY------------YSPGVAELFFGGETTGL 257
G GG +F + +V T + + Y + A+LF G+ G
Sbjct: 254 GRNGGGIFAIGRVV-QPKVNMTPLVPNQPHYNVNMTAVQVGQEFLNIPADLFQPGDRKG- 311
Query: 258 KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
+ DSG++ YL + Y+ L + + A + +D + F+
Sbjct: 312 ----AIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKD---------YKCFQYSG 358
Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIG 377
V + F + F + + P YL +G C+G N A + +D + +G
Sbjct: 359 RVDEGFPNVTFHF---ENSVFLRVYPHDYL-FPYEGMWCIGWQNSA-MQSRDRRNMTLLG 413
Query: 378 DFV 380
D V
Sbjct: 414 DLV 416
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 98/343 (28%), Positives = 153/343 (44%), Gaps = 49/343 (14%)
Query: 46 YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----V 101
+ G Y + + IG P R + +DTGSDL W QC APC+ CVE P P + P+ +
Sbjct: 80 FSEGEYLMDVGIGSPPRYFSAMIDTGSDLIWTQC-APCLLCVEQPTPYFEPAKSTSYASL 138
Query: 102 PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
PC +C +L++P C A C Y+ Y D SS GVL + F F + + PR++
Sbjct: 139 PCSSAMCNALYSP---LCFQNA-CVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVS 194
Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFF 218
GCG N G ++ G++G G+G S+VSQL S + +CL+ L+F
Sbjct: 195 FGCG-NMNAGTLFNG-SGMVGFGRGALSLVSQLGSPRF-----SYCLTSFMSPATSRLYF 247
Query: 219 GDDLYDSSRVVWTSMSSDYTKYY-SPGVAELFF---GGETTGLKNLP------------- 261
G +S +S T + +P + ++F G + LP
Sbjct: 248 GAYATLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDG 307
Query: 262 ---VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
V+ DSG++ T+L + Y + + P D T C+K P + +
Sbjct: 308 TGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSD-TFDTCFKWPPPPRRMVT 366
Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYLIIS-NKGNVCLGIL 360
+ + + L F DG EL E Y+++ GN+CL +L
Sbjct: 367 LPE----MVLHF-DGAD---MELPLENYMVMDGGTGNLCLAML 401
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 98/343 (28%), Positives = 153/343 (44%), Gaps = 49/343 (14%)
Query: 46 YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----V 101
+ G Y + + IG P R + +DTGSDL W QC APC+ CVE P P + P+ +
Sbjct: 83 FSEGEYLMDVGIGSPPRYFSAMIDTGSDLIWTQC-APCLLCVEQPTPYFEPAKSTSYASL 141
Query: 102 PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
PC +C +L++P C A C Y+ Y D SS GVL + F F + + PR++
Sbjct: 142 PCSSAMCNALYSP---LCFQNA-CVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVS 197
Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFF 218
GCG N G ++ G++G G+G S+VSQL S + +CL+ L+F
Sbjct: 198 FGCG-NMNAGTLFNG-SGMVGFGRGALSLVSQLGSPRF-----SYCLTSFMSPATSRLYF 250
Query: 219 GDDLYDSSRVVWTSMSSDYTKYY-SPGVAELFF---GGETTGLKNLP------------- 261
G +S +S T + +P + ++F G + LP
Sbjct: 251 GAYATLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDG 310
Query: 262 ---VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
V+ DSG++ T+L + Y + + P D T C+K P + +
Sbjct: 311 TGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSD-TFDTCFKWPPPPRRMVT 369
Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYLIIS-NKGNVCLGIL 360
+ + + L F DG EL E Y+++ GN+CL +L
Sbjct: 370 LPE----MVLHF-DGAD---MELPLENYMVMDGGTGNLCLAML 404
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 101/372 (27%), Positives = 152/372 (40%), Gaps = 61/372 (16%)
Query: 31 NHVGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-- 86
+ VG + F V G+ P G Y + +G P + + +DTGSD+ W+ C + C C
Sbjct: 78 SSVGGVVDFPVQGSSDPYLVGLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSS-CSNCPH 136
Query: 87 ----------VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS 136
+AP L S V C DPIC+S+ C + QC Y Y DG
Sbjct: 137 SSGLGIDLHFFDAPGSLTAGS---VTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSG 193
Query: 137 SLGVLVKDAFAFNYTNGQRL----NPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSI 190
+ G + D F F+ G+ L + + GC Q S +DGI G GKGK S+
Sbjct: 194 TSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSV 253
Query: 191 VSQLHSQKLIRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY----YSPG 244
VSQL S+ + V HCL G GGG G+ L +V++ + Y S G
Sbjct: 254 VSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPG--MVYSPLVPSQPHYNLNLLSIG 311
Query: 245 V--------AELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
V A +F T G + D+G++ TYL + Y +L ++
Sbjct: 312 VNGQMLPLDAAVFEASNTRG-----TIVDTGTTLTYLVKEAY---------DLFLNAISN 357
Query: 297 APEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYL----IISNK 352
+ P+ G + + + F +++L+F G + L P+ YL I
Sbjct: 358 SVSQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGAS---MMLRPQDYLFHYGIYDGA 414
Query: 353 GNVCLGILNGAE 364
C+G E
Sbjct: 415 SMWCIGFQKAPE 426
>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
from this gene [Arabidopsis thaliana]
Length = 388
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 82/275 (29%), Positives = 122/275 (44%), Gaps = 44/275 (16%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH--------PLYR----P 96
G Y + IG PA+ Y++ +DTGSD+ W+ C ++C + P LY
Sbjct: 78 GLYYAKIGIGTPAKSYYVQVDTGSDIMWVNC----IQCKQCPRRSTLGIELTLYNIDESD 133
Query: 97 SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG--- 153
S LV C+D C + C+ C Y Y DG S+ G VKD ++ G
Sbjct: 134 SGKLVSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLK 193
Query: 154 -QRLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
Q N + GCG Q + ++ LDGILG GK SS++SQL S ++ + HCL
Sbjct: 194 TQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLD 253
Query: 210 GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY------------YSPGVAELFFGGETTGL 257
G GG +F + +V T + + Y + A+LF G+ G
Sbjct: 254 GRNGGGIFAIGRVV-QPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKG- 311
Query: 258 KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAK 292
+ DSG++ YL + Y+ L +KKE + K
Sbjct: 312 ----AIIDSGTTLAYLPEIIYEPL---VKKEPALK 339
>gi|213998842|gb|ACJ60788.1| nucellin [Hordeum cordobense]
Length = 154
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 65/148 (43%), Positives = 82/148 (55%), Gaps = 5/148 (3%)
Query: 154 QRLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSG 210
QR ++A GCGY Q A P+DGILGLG GK+ +QL QK+I NV+GHCLS
Sbjct: 3 QRDKKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSS 62
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSS 269
G G L+ GD S V W M YYSPG+AEL + G VVFDSGS+
Sbjct: 63 KGKGVLYVGDFNPPSRGVTWVPMKESLF-YYSPGLAELLIDNQPIRGNPTFEVVFDSGST 121
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEA 297
YT++ Y + S ++ LS SL+E
Sbjct: 122 YTHVPAQIYNEIVSKVRGTLSESSLEEV 149
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 101/358 (28%), Positives = 145/358 (40%), Gaps = 60/358 (16%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
T Y V + IG P P LDTGSDL W QCDAPC RC P PLY P+ V C
Sbjct: 89 TATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSC 148
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
P+C +L +P C Y Y DG S+ GVL + F R +A G
Sbjct: 149 RSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVR---GVAFG 205
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFGD 220
CG + S G++G+G+G S+VSQL + +C + LF G
Sbjct: 206 CGTENL--GSTDNSSGLVGMGRGPLSLVSQLGVTRF-----SYCFTPFNATAASPLFLG- 257
Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAE------LFFGGETTGLKNLP------------- 261
S+R+ + ++ + S G L G T G LP
Sbjct: 258 ---SSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGD 314
Query: 262 --VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
V+ DSG+++T L + L + + A L LC+ P +V
Sbjct: 315 GGVIIDSGTTFTALEERAFVALARALASRVRLPLASGA--HLGLSLCFAAASP--EAVEV 370
Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNV-CLGILNGAEVGLQDLNVIGGI 376
+ L L F DG EL E+Y++ V CLG+++ + ++V+G +
Sbjct: 371 PR----LVLHF-DGAD---MELRRESYVVEDRSAGVACLGMVSA-----RGMSVLGSM 415
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 101/359 (28%), Positives = 158/359 (44%), Gaps = 56/359 (15%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
G Y + M IG PAR Y LDTGSDL W QC APC+ CV+ P P + P+N + C
Sbjct: 90 GEYLMEMGIGTPARFYSAILDTGSDLIWTQC-APCLLCVDQPTPYFDPANSSTYRSLGCS 148
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
P C +L+ P + C Y+ Y D S+ GVL + F F + + PR++ GC
Sbjct: 149 APACNALYYPLCYQ----KTCVYQYFYGDSASTAGVLANETFTFGTNDTRVTLPRISFGC 204
Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF---LFFGDD 221
G + S G++G G+G S+VSQL S + +CL+ L+FG
Sbjct: 205 G--NLNAGSLANGSGMVGFGRGSLSLVSQLGSPRF-----SYCLTSFLSPVRSRLYFGAY 257
Query: 222 LYDSSRVVWTSMSSDYTKYYSPGVAELFF---GGETTGLKNLPV---------------- 262
+S T S+ + +P + ++F G + G LP+
Sbjct: 258 ATLNSTNASTVQSTPFI--INPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGT 315
Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKEL-SAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
+ DSG++ TYL Y + L S L + E L C++ P + + +
Sbjct: 316 IIDSGTTITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVTLPQ 375
Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIIS-NKGNVCLGILNGAEVGL------QDLNVI 373
L L F DG +EL + Y+++ + G +CL + ++ + Q+ NV+
Sbjct: 376 ----LVLHF-DGAD---WELPLQNYMLVDPSTGGLCLAMATSSDGSIIGSYQHQNFNVL 426
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 94/336 (27%), Positives = 138/336 (41%), Gaps = 38/336 (11%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPC 103
TG Y V+M +G PAR + DTGSDL+W+QC PC C E PL+ P+ VPC
Sbjct: 143 TGNYVVSMGLGTPARDMTVVFDTGSDLSWVQC-TPCSDCYEQKDPLFDPARSSTYSAVPC 201
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
P C L + +C +C YE+ Y D + G L +D ++ + P G
Sbjct: 202 ASPECQGLDS---RSCSRDKKCRYEVVYGDQSQTDGALARDTLTLTQSD---VLPGFVFG 255
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDD 221
CG + DG++GLG+ K S+ SQ S+ +CL S G+L G
Sbjct: 256 CGEQDT--GLFGRADGLVGLGREKVSLSSQAASK--YGAGFSYCLPSSPSAAGYLSLGGP 311
Query: 222 LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------DSGSSYTYLN 274
++R D +Y + + G T ++ P+VF DSG+ T L
Sbjct: 312 APANARFTAMETRHDSPSFYYVRLVGVKVAGRT--VRVSPIVFSAAGTVIDSGTVITRLP 369
Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGK 334
Y L S + + K AP L C+ F V+ ++AL F G
Sbjct: 370 PRVYAALRSAFARSMGRYGYKRAPALSILDTCYD----FTGHTTVR--IPSVALVFAGGA 423
Query: 335 TRTLFELTPEAYLIISNKGNVCLGIL---NGAEVGL 367
L L ++ CL +GA+ G+
Sbjct: 424 A---VGLDFSGVLYVAKVSQACLAFAPNGDGADAGI 456
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 77/269 (28%), Positives = 122/269 (45%), Gaps = 35/269 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP--- 96
TG Y + IG P + Y + +DTGSD+ W+ C + C + P LY P
Sbjct: 80 TGLYYTEIEIGTPPKQYHVQVDTGSDILWVNC----ISCNKCPRKSDLGIDLRLYDPKGS 135
Query: 97 -SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG-- 153
S V C+ CA+ + C C+Y + Y DG S+ G V D+ +N +G
Sbjct: 136 SSGSTVSCDQKFCAATYGGKLPGCAKNIPCEYSVMYGDGSSTTGYFVSDSLQYNQVSGDG 195
Query: 154 --QRLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
+ N + GCG Q G++ LDGI+G G+ +S++SQL + ++ + HCL
Sbjct: 196 QTRHANASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSHCLD 255
Query: 210 G-GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNL 260
GGG GD + +V T + D +Y+ + + GG T L +
Sbjct: 256 TIKGGGIFAIGDVV--QPKVKSTPLVPD-MPHYNVNLESINVGGTTLQLPSHMFETGEKK 312
Query: 261 PVVFDSGSSYTYLNRVTYQ-TLTSIMKKE 288
+ DSG++ TYL + Y+ L ++ K
Sbjct: 313 GTIIDSGTTLTYLPELVYKDVLAAVFAKH 341
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 96/351 (27%), Positives = 157/351 (44%), Gaps = 32/351 (9%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
+++ ++ GYY ++IG P + + L +DTGS +T++ C C C P +RP +
Sbjct: 81 MRLYDDLLRNGYYTARLWIGTPPQRFALIVDTGSTVTYVPCST-CRHCGSHQDPKFRPED 139
Query: 99 DLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
P+ + NC+ D QC YE YA+ +S G L +D +F N L+
Sbjct: 140 S--ETYQPVKCTWQC----NCDNDRKQCTYERRYAEMSTSSGALGEDVVSFG--NQTELS 191
Query: 158 PRLAL-GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFL 216
P+ A+ GC ++ DGI+GLG+G SI+ QL +K+I + C G G G
Sbjct: 192 PQRAIFGCENDETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGMGVGGG 251
Query: 217 FFG-DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF--------DSG 267
+ + +V+T + YY+ + E+ G+ L P VF DSG
Sbjct: 252 AMVLGGISPPADMVFTRSDPVRSPYYNIDLKEIHVAGKRLHLN--PKVFDGKHGTVLDSG 309
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
++Y YL + + KE + P+ +C+ G +V + K F +
Sbjct: 310 TTYAYLPESAFLAFKHAIMKETHSLKRISGPDPRYNDICFSGAE--IDVSQISKSFPVVE 367
Query: 328 LSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
+ F +G L+PE YL +K G CLG+ + G ++GGI
Sbjct: 368 MVFGNGHK---LSLSPENYLFRHSKVRGAYCLGVFSN---GNDPTTLLGGI 412
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 98/363 (26%), Positives = 162/363 (44%), Gaps = 54/363 (14%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP--- 96
TG Y + IG PA+ Y++ +DTGSD+ W+ C V C P +Y P
Sbjct: 87 TGLYFTRIGIGTPAKRYYVQVDTGSDILWVNC----VSCDGCPRKSNLGIELTMYDPRGS 142
Query: 97 -SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
S +LV C+ C + + +C + C+Y + Y DG S+ G V D +N +G
Sbjct: 143 QSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDG 202
Query: 156 ----LNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL- 208
N ++ GCG G+S LDGILG G+ SS++SQL + +R + HCL
Sbjct: 203 QTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLD 262
Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNL 260
+ GGG G+ + +V T + D +Y+ + + GG GL +
Sbjct: 263 TVNGGGIFAIGNVV--QPKVKTTPLVPD-MPHYNVILKGIDVGGTALGLPTNIFDSGNSK 319
Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIM---KKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
+ DSG++ Y+ Y+ L +++ +++S ++L++ C F+
Sbjct: 320 GTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS-------C------FQYSG 366
Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIG 377
V F + F +G + ++P YL + K C+G NG V +D + +G
Sbjct: 367 SVDDGFPEVTFHF-EGDVSLI--VSPHDYLFQNGKNLYCMGFQNGG-VQTKDGKDMVLLG 422
Query: 378 DFV 380
D V
Sbjct: 423 DLV 425
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 84/282 (29%), Positives = 126/282 (44%), Gaps = 56/282 (19%)
Query: 45 VYPTG--YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SNDL 100
V P+G Y + + IG P +P LDTGSDL W QC APC C+ P PL+ P S+
Sbjct: 95 VRPSGDLEYLIDLAIGTPPQPVSALLDTGSDLIWTQC-APCASCLAQPDPLFAPAASSSY 153
Query: 101 VP--CEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
VP C +C + HH+C+ P C Y Y DG ++LGV + F F ++G++L+
Sbjct: 154 VPMRCSGQLCNDIL---HHSCQRPDTCTYRYNYGDGTTTLGVYATERFTFASSSGEKLSV 210
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS--------- 209
L GCG V S + GI+G G+ S+VSQL ++ +CL+
Sbjct: 211 PLGFGCGTMNV--GSLNNGSGIVGFGRDPLSLVSQLSIRRF-----SYCLTPYTSTRKST 263
Query: 210 ---GGGGGFLFFGDDL----YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP- 261
G +F GDD ++R++ + + + YY P F G T G + L
Sbjct: 264 LMFGSLSDGVFEGDDAATGQVQTTRLLQSRQNPTF--YYVP------FTGVTVGTRRLRI 315
Query: 262 --------------VVFDSGSSYTYLNRVTYQTLTSIMKKEL 289
V+ DSG++ T + + +L
Sbjct: 316 PLSAFALRPDGSGGVIVDSGTALTLFPAAVLTEVLRAFRAQL 357
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 103/361 (28%), Positives = 163/361 (45%), Gaps = 44/361 (12%)
Query: 43 GNVYPT--GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLY- 94
GN +P+ G Y + IG P++ Y++ +DTGSD+ W+ C A C RC + LY
Sbjct: 145 GNGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNC-AGCDRCPTKSDLGVDLTLYD 203
Query: 95 ---RPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
++D V C+D C+ P C+ QC Y + Y DG S+ G V+D +N
Sbjct: 204 MKASTTSDAVGCDDNFCSLYDGP-LPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRI 262
Query: 152 NGQ----RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVG 205
+G N + GCG Q G+S LDGILG G+ SS++SQL S ++ V
Sbjct: 263 SGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFS 322
Query: 206 HCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL-------- 257
HCL GG +F ++ + +V T + + +Y+ + E+ GG+ +
Sbjct: 323 HCLDNVDGGGIFAIGEVVE-PKVNITPLVQN-QAHYNVVMKEIEVGGDPLDVPSDAFESG 380
Query: 258 KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
+ DSG++ Y + Y L K L + P D L + F
Sbjct: 381 DRKGTIIDSGTTLAYFPQEVYVPLIE--------KILSQQP-DLRLHTVEQAFTCFDYTG 431
Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILN-GAEVGL-QDLNVIGG 375
+V F T+ L F + T++ P YL ++ C+G N GA+ +DL ++G
Sbjct: 432 NVDDGFPTVTLHFDKSISLTVY---PHEYL-FQHEFEWCIGWQNSGAQTKDGKDLTLLGD 487
Query: 376 I 376
+
Sbjct: 488 L 488
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 101/380 (26%), Positives = 157/380 (41%), Gaps = 55/380 (14%)
Query: 33 VGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP 90
VG + F V G+ P G Y + +G P R + + +DTGSD+ W+ C + C C +
Sbjct: 61 VGGVVDFSVQGSSDPYLVGLYFTRVKLGTPPREFNVQIDTGSDVLWVTCSS-CSNCPQTS 119
Query: 91 ---------HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGV 140
+ LVPC PIC S C + QC Y +Y DG + G
Sbjct: 120 GLGIQLNYFDTTSSSTARLVPCSHPICTSQIQTTATQCPPQSNQCSYAFQYGDGSGTSGY 179
Query: 141 LVKDAFAFNYTNGQRL----NPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQL 194
V D F F+ G+ L + + GC Q + +DGI G G+G+ S++SQL
Sbjct: 180 YVSDTFYFDAVLGESLIANSSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQL 239
Query: 195 HSQKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGG 252
S + V HCL G GGG L G+ L +V++ + +Y+ + + G
Sbjct: 240 SSHGITPRVFSHCLKGEDSGGGILVLGEIL--EPGIVYSPLVPS-QPHYNLDLQSIAVSG 296
Query: 253 ETTGL--------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
+ + N + D+G++ YL Y S + +S + P
Sbjct: 297 QLLPIDPAAFATSSNRGTIIDTGTTLAYLVEEAYDPFVSAITAAVSQLA---------TP 347
Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGA 363
KG + + + V + F ++ +F G T L PE YL+ ++N L
Sbjct: 348 TINKGNQCYLVSNSVSEVFPPVSFNFAGGATML---LKPEEYLMYLTNYAGAALWC---- 400
Query: 364 EVGLQDLNVIGGI---GDFV 380
+G Q + GGI GD V
Sbjct: 401 -IGFQKIQ--GGITILGDLV 417
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 93/346 (26%), Positives = 154/346 (44%), Gaps = 53/346 (15%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP--- 96
TG Y + IG PA+ Y++ +DTGSD+ W+ C V C P +Y P
Sbjct: 87 TGLYFTRIGIGTPAKRYYVQVDTGSDILWVNC----VSCDGCPRKSNLGIELTMYDPRGS 142
Query: 97 -SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
S +LV C+ C + + +C + C+Y + Y DG S+ G V D +N +G
Sbjct: 143 QSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDG 202
Query: 156 ----LNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL- 208
N ++ GCG G+S LDGILG G+ SS++SQL + +R + HCL
Sbjct: 203 QTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLD 262
Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNL 260
+ GGG G+ + +V T + D +Y+ + + GG GL +
Sbjct: 263 TVNGGGIFAIGNVV--QPKVKTTPLVPD-MPHYNVILKGIDVGGTALGLPTNIFDSGNSK 319
Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIM---KKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
+ DSG++ Y+ Y+ L +++ +++S ++L++ C F+
Sbjct: 320 GTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS-------C------FQYSG 366
Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGA 363
V F + F +G + ++P YL + K C+G NG
Sbjct: 367 SVDDGFPEVTFHF-EGDVSLI--VSPHDYLFQNGKNLYCMGFQNGG 409
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 87/289 (30%), Positives = 133/289 (46%), Gaps = 26/289 (8%)
Query: 16 RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
R+ S + SS +F ++L Q G +G Y VT+ +G P + + L DTGSDLT
Sbjct: 99 RVDSIHARLSSHGVFQEKQATLPVQ-SGASIGSGDYAVTVGLGTPKKEFTLIFDTGSDLT 157
Query: 76 WLQCDAPCVR-CVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQCDYELE 130
W QC+ PC + C + P P+ + C C L G +C P C Y+++
Sbjct: 158 WTQCE-PCAKTCYKQKEPRLDPTKSTSYKNISCSSAFCKLLDTEGGESCSSPT-CLYQVQ 215
Query: 131 YADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSI 190
Y DG S+G + + +N + GCG Q + G+LGLG+ K S+
Sbjct: 216 YGDGSYSIGFFATETLTLSSSN---VFKNFLFGCG--QQNSGLFRGAAGLLGLGRTKLSL 270
Query: 191 VSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMSSDY--TKYYSPGVA 246
SQ +QK + + +CL S G+L FG + S V +T +S D+ T +Y +
Sbjct: 271 PSQT-AQKY-KKLFSYCLPASSSSKGYLSFGGQV--SKTVKFTPLSEDFKSTPFYGLDIT 326
Query: 247 ELFFGG-----ETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELS 290
EL GG + + V DSG+ T L Y L+S +K ++
Sbjct: 327 ELSVGGNKLSIDASIFSTSGTVIDSGTVITRLPSTAYSALSSAFQKLMT 375
>gi|213998804|gb|ACJ60769.1| nucellin [Hordeum muticum]
gi|213998808|gb|ACJ60771.1| nucellin [Hordeum erectifolium]
gi|213998820|gb|ACJ60777.1| nucellin [Hordeum patagonicum subsp. mustersii]
gi|213998822|gb|ACJ60778.1| nucellin [Hordeum patagonicum subsp. santacrucense]
gi|333069937|gb|AEF13570.1| nucellin, partial [Hordeum pubiflorum]
Length = 154
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 64/147 (43%), Positives = 81/147 (55%), Gaps = 5/147 (3%)
Query: 154 QRLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSG 210
QR ++A GCGY Q A P+DGILGLG GK+ +QL QK+I NV+GHCLS
Sbjct: 3 QRDKKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSS 62
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSS 269
G G L+ GD S V W M YYSPG+AEL + G VFDSGS+
Sbjct: 63 KGKGVLYVGDFNPPSRGVTWVPMKESLF-YYSPGLAELLIDNQPIRGNPTFEAVFDSGST 121
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKE 296
YT++ Y + S ++ LS SL+E
Sbjct: 122 YTHVPAQIYNEIVSKVRGTLSESSLEE 148
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 99/360 (27%), Positives = 148/360 (41%), Gaps = 72/360 (20%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
G Y + + IG P Y +DTGSDL W QC APCV C + P P +RP+ LVPC
Sbjct: 90 GEYLMDLAIGTPPLRYTAMVDTGSDLIWTQC-APCVLCADQPTPYFRPARSATYRLVPCR 148
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR-LNPRLALG 163
P+CA+L P C + C Y+ Y D S+ GVL + F F N + + +A G
Sbjct: 149 SPLCAALPYPA---CFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFG 205
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG------------- 210
CG + G++GLG+G S+VSQL + +CL+
Sbjct: 206 CG--NINSGQLANSSGMVGLGRGPLSLVSQLGPSRF-----SYCLTSFLSPEPSRLNFGV 258
Query: 211 ----GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----- 261
G G + + VV ++ S Y + G + G K LP
Sbjct: 259 FATLNGTNASSSGSPVQSTPLVVNAALPSLYF---------MSLKGISLGQKRLPIDPLV 309
Query: 262 ----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET---LPLCWK 308
V DSG+S T+L + Y + ++EL + P ++T L C+
Sbjct: 310 FAINDDGTGGVFIDSGTSLTWLQQDAYDAV----RRELVSVLRPLPPTNDTEIGLETCF- 364
Query: 309 GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK-GNVCLGILNGAEVGL 367
P+ V + L F G T + PE Y++I G +CL ++ + +
Sbjct: 365 ---PWPPPPSVAVTVPDMELHFDGGANMT---VPPENYMLIDGATGFLCLAMIRSGDATI 418
>gi|213998824|gb|ACJ60779.1| nucellin [Hordeum chilense]
Length = 140
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 63/141 (44%), Positives = 80/141 (56%), Gaps = 5/141 (3%)
Query: 160 LALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSGGGGGFL 216
+A GCGY Q A P+DGILGLG GK+ +QL QK+I NV+GHCLS G G L
Sbjct: 1 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 60
Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYTYLNR 275
+FGD S V W M + YYSPG+AEL + G VFDSGS+YT++
Sbjct: 61 YFGDFNPPSRGVTWVPM-KESXXYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPA 119
Query: 276 VTYQTLTSIMKKELSAKSLKE 296
Y + S ++ LS SL+E
Sbjct: 120 QIYNEIVSKVRGTLSESSLEE 140
>gi|213998816|gb|ACJ60775.1| nucellin [Hordeum patagonicum subsp. patagonicum]
Length = 152
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 64/147 (43%), Positives = 81/147 (55%), Gaps = 5/147 (3%)
Query: 154 QRLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSG 210
QR ++A GCGY Q A P+DGILGLG GK+ +QL QK+I NV+GHCLS
Sbjct: 1 QRDKKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKVITGNVIGHCLSS 60
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSS 269
G G L+ GD S V W M YYSPG+AEL + G VFDSGS+
Sbjct: 61 KGKGVLYVGDFNPPSRGVTWVPMKESLF-YYSPGLAELLIDNQPIRGNPTFEAVFDSGST 119
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKE 296
YT++ Y + S ++ LS SL+E
Sbjct: 120 YTHVPAQIYNEIVSKVRGTLSESSLEE 146
>gi|213998818|gb|ACJ60776.1| nucellin [Hordeum patagonicum subsp. setifolium]
Length = 149
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 64/148 (43%), Positives = 81/148 (54%), Gaps = 5/148 (3%)
Query: 154 QRLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSG 210
QR ++A GCGY Q A P+DGILGLG GK+ +QL QK+I NV+GHCLS
Sbjct: 3 QRDKKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSS 62
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSS 269
G G L+ GD S V W M YYSPG+AEL + G VFDSGS+
Sbjct: 63 KGKGVLYVGDFNPPSRGVTWVPMKESLF-YYSPGLAELLIDNQPIRGNPTFEAVFDSGST 121
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEA 297
YT++ Y + S ++ LS SL+E
Sbjct: 122 YTHVPAQIYNEILSKVRGTLSESSLEEV 149
>gi|213998840|gb|ACJ60787.1| nucellin [Hordeum patagonicum subsp. magellanicum]
Length = 154
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 64/147 (43%), Positives = 81/147 (55%), Gaps = 5/147 (3%)
Query: 154 QRLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSG 210
QR ++A GCGY Q A P+DGILGLG GK+ +QL QK+I NV+GHCLS
Sbjct: 3 QRDKKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSS 62
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSS 269
G G L+ GD S V W M YYSPG+AEL + G VFDSGS+
Sbjct: 63 KGKGVLYVGDFNPPSRGVTWVPMKESLF-YYSPGLAELLIDNQPIRGNPTFEAVFDSGST 121
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKE 296
YT++ Y + S ++ LS SL+E
Sbjct: 122 YTHVPAQIYNEILSKVRGTLSESSLEE 148
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 93/310 (30%), Positives = 128/310 (41%), Gaps = 54/310 (17%)
Query: 43 GNVYPT--GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP-------- 92
GN PT G Y + IG PA+ Y++ +DTGSD+ W+ C V C P
Sbjct: 71 GNGLPTETGLYFTQIGIGTPAKSYYVQVDTGSDILWVNC----VFCDTCPRKSGLGIELT 126
Query: 93 LYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
LY PS V C C + H +C A C Y + Y DG S+ G V D +
Sbjct: 127 LYDPSGSSSGTGVTCGQDFCVATHGGVIPSCVPAAPCQYSISYGDGSSTTGFFVTDFLQY 186
Query: 149 NYTNGQR----LNPRLALGCG--YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRN 202
N +G N + GCG G+S LDGILG G+ SS++SQL + +R
Sbjct: 187 NQVSGNSQTTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRK 246
Query: 203 VVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL----- 257
V HCL GG +F D+ V T+ +Y+ + + GG L
Sbjct: 247 VFAHCLDTINGGGIFAIGDVVQPK--VSTTPLVPGMPHYNVNLEAIDVGGVKLQLPTNIF 304
Query: 258 ---KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFK 314
++ + DSG++ YL V Y +IM K + G P K
Sbjct: 305 DIGESKGTIIDSGTTLAYLPGVVYN---AIMSKVFAQ----------------YGDMPLK 345
Query: 315 NVHDVKKCFR 324
N D +CFR
Sbjct: 346 NDQDF-QCFR 354
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/268 (29%), Positives = 116/268 (43%), Gaps = 43/268 (16%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYR---- 95
G Y + IG P++ Y++ +DTGSD+ W+ C ++C E P LY
Sbjct: 83 VGLYYAKVGIGTPSKDYYVQVDTGSDIMWVNC----IQCRECPRTSSLGMELTLYNIKDS 138
Query: 96 PSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ- 154
S LVPC++ C ++ C C Y Y DG S+ G VKD ++ +G
Sbjct: 139 VSGKLVPCDEEFCYEVNGGPLSGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSGDL 198
Query: 155 ---RLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
N + GCG Q + S LDGILG GK SS++SQL + + ++ + HCL
Sbjct: 199 QTTSSNGSVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCL 258
Query: 209 SG-GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVA------------ELFFGGETT 255
G GGG G + +V T + + Y A E F G+
Sbjct: 259 DGINGGGIFAIGHVV--QPKVNMTPLIPNQPHYNVNMTAVQVGEDFLHLPTEEFEAGDRK 316
Query: 256 GLKNLPVVFDSGSSYTYLNRVTYQTLTS 283
G + DSG++ YL + Y+ L S
Sbjct: 317 G-----AIIDSGTTLAYLPEIVYEPLVS 339
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 94/355 (26%), Positives = 151/355 (42%), Gaps = 42/355 (11%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLY----RPSND 99
G Y + +G P + Y++ +DTGSD+ W+ C APC +C + P LY ++
Sbjct: 75 GLYFTKIKLGSPPKEYYVQVDTGSDILWVNC-APCPKCPVKTDLGIPLSLYDSKASSTSK 133
Query: 100 LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR---- 155
V CED C+ + C C Y + Y DG +S G VKD + G
Sbjct: 134 NVGCEDAFCSFIMQS--ETCGAKKPCSYHVVYGDGSTSDGDFVKDNITLDQVTGNLRTAP 191
Query: 156 LNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
L + GCG NQ G + +DGI+G G+ +S++SQL + ++ + HCL G
Sbjct: 192 LAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDNMNG 251
Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK--------NLPVVFD 265
G +F ++ S VV T+ +Y+ + + GE L + + D
Sbjct: 252 GGIFAIGEV--ESPVVKTTPLVPNQVHYNVILKGMDVDGEPIDLPPSLASTNGDGGTIID 309
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
SG++ YL + Y +L ++K + + +K ET F + K F
Sbjct: 310 SGTTLAYLPQNLYNSL---IEKITAKQQVKLHMVQETFAC-------FSFTSNTDKAFPV 359
Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDFV 380
+ L F D +++ P YL + C G +G +VI +GD V
Sbjct: 360 VNLHFEDSLKLSVY---PHDYLFSLREDMYCFGWQSGGMTTQDGADVI-LLGDLV 410
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 94/334 (28%), Positives = 151/334 (45%), Gaps = 32/334 (9%)
Query: 56 YIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPG 115
+IG P + + L +DTGS +T++ C++ C +C P ++P DL P+ + P
Sbjct: 1 WIGTPPQEFALIVDTGSTVTYVPCNS-CDQCGNHQDPKFQP--DLSDTYHPVKCN---PD 54
Query: 116 HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-RLALGCGYNQVPGASY 174
+ QC YE +YA+ SS G+L +D +F N L P R GC +
Sbjct: 55 CTCDTENDQCTYERQYAEMSSSSGILGEDLVSFG--NMSELKPQRAVFGCENAETGDLFS 112
Query: 175 HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTS 232
DGI+GLG+G SIV QL + +I + C G GGG + G + S +V++
Sbjct: 113 QHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLG-QISPPSDMVFSH 171
Query: 233 MSSDYTKYYSPGVAELFFGGETTGLKNLPVVF--------DSGSSYTYLNRVTYQTLTSI 284
D + YY+ + L G+ + P VF DSG++Y YL +
Sbjct: 172 SDPDRSPYYNIELRGLHVAGKKLDIN--PQVFDGKHGTILDSGTTYAYLPEAAFLPFIQA 229
Query: 285 MKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPE 344
+ EL P+ +C+ G + ++ K F ++ + F +G+ + L+PE
Sbjct: 230 ITSELHGLKQIRGPDPNYNDVCFSGAG--SEIPELYKTFPSVDMVFDNGEK---YSLSPE 284
Query: 345 AYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
YL +K G CLG+ G ++GGI
Sbjct: 285 NYLFKHSKVHGAYCLGVFQN---GKDPTTLLGGI 315
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 94/334 (28%), Positives = 151/334 (45%), Gaps = 32/334 (9%)
Query: 56 YIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPG 115
+IG P + + L +DTGS +T++ C++ C +C P ++P DL P+ + P
Sbjct: 1 WIGTPPQEFALIVDTGSTVTYVPCNS-CDQCGNHQDPKFQP--DLSDTYHPVKCN---PD 54
Query: 116 HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-RLALGCGYNQVPGASY 174
+ QC YE +YA+ SS G+L +D +F N L P R GC +
Sbjct: 55 CTCDTENDQCTYERQYAEMSSSSGILGEDLVSFG--NMSELKPQRAVFGCENAETGDLFS 112
Query: 175 HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTS 232
DGI+GLG+G SIV QL + +I + C G GGG + G + S +V++
Sbjct: 113 QHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLG-QISPPSDMVFSH 171
Query: 233 MSSDYTKYYSPGVAELFFGGETTGLKNLPVVF--------DSGSSYTYLNRVTYQTLTSI 284
D + YY+ + L G+ + P VF DSG++Y YL +
Sbjct: 172 SDPDRSPYYNIELRGLHVAGKKLDIN--PQVFDGKHGTILDSGTTYAYLPEAAFLPFIQA 229
Query: 285 MKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPE 344
+ EL P+ +C+ G + ++ K F ++ + F +G+ + L+PE
Sbjct: 230 ITSELHGLKQIRGPDPNYNDVCFSGAG--SEIPELYKTFPSVDMVFDNGEK---YSLSPE 284
Query: 345 AYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
YL +K G CLG+ G ++GGI
Sbjct: 285 NYLFKHSKVHGAYCLGVFQN---GKDPTTLLGGI 315
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 101/377 (26%), Positives = 152/377 (40%), Gaps = 66/377 (17%)
Query: 31 NHVGSSLLFQVHGNVYP-------TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC 83
+ VG + F V G+ P T Y + +G P + + +DTGSD+ W+ C + C
Sbjct: 78 SSVGGVVDFPVQGSSDPYLVGSKMTMLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSS-C 136
Query: 84 VRC------------VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEY 131
C +AP L S V C DPIC+S+ C + QC Y Y
Sbjct: 137 SNCPHSSGLGIDLHFFDAPGSLTAGS---VTCSDPICSSVFQTTAAQCSENNQCGYSFRY 193
Query: 132 ADGGSSLGVLVKDAFAFNYTNGQRL----NPRLALGCGYNQVPG--ASYHPLDGILGLGK 185
DG + G + D F F+ G+ L + + GC Q S +DGI G GK
Sbjct: 194 GDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGK 253
Query: 186 GKSSIVSQLHSQKLIRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY--- 240
GK S+VSQL S+ + V HCL G GGG G+ L +V++ + Y
Sbjct: 254 GKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPG--MVYSPLVPSQPHYNLN 311
Query: 241 -YSPGV--------AELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSA 291
S GV A +F T G + D+G++ TYL + Y +L
Sbjct: 312 LLSIGVNGQMLPLDAAVFEASNTRG-----TIVDTGTTLTYLVKEAY---------DLFL 357
Query: 292 KSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYL---- 347
++ + P+ G + + + F +++L+F G + L P+ YL
Sbjct: 358 NAISNSVSQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGAS---MMLRPQDYLFHYG 414
Query: 348 IISNKGNVCLGILNGAE 364
I C+G E
Sbjct: 415 IYDGASMWCIGFQKAPE 431
>gi|213998848|gb|ACJ60790.1| nucellin [Psathyrostachys stoloniformis]
Length = 154
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/141 (44%), Positives = 78/141 (55%), Gaps = 5/141 (3%)
Query: 160 LALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLI-RNVVGHCLSGGGGGFL 216
+A GCGY Q A P+DGILGLG GK+ +QL QK+I NV+GHCLS G G L
Sbjct: 9 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITENVIGHCLSSKGKGVL 68
Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYTYLNR 275
+ GD + V W M YYSPG+A LF + G VFDSGS+YTY+
Sbjct: 69 YVGDFNPPTRGVTWVPMRESLF-YYSPGLAALFIDKQPIRGNPTFEAVFDSGSTYTYMPA 127
Query: 276 VTYQTLTSIMKKELSAKSLKE 296
Y L S ++ LS SL+E
Sbjct: 128 QIYNELVSKIRGTLSESSLEE 148
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 99/360 (27%), Positives = 147/360 (40%), Gaps = 72/360 (20%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
G Y + + IG P Y +DTGSDL W QC APCV C + P P +RP+ LVPC
Sbjct: 90 GEYLMDLAIGTPPLRYTAMVDTGSDLIWTQC-APCVLCADQPTPYFRPARSATYRLVPCR 148
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR-LNPRLALG 163
P+CA+L P C + C Y+ Y D S+ GVL + F F N + + +A G
Sbjct: 149 SPLCAALPYPA---CFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFG 205
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG------------- 210
CG + G++GLG+G S+VSQL + +CL+
Sbjct: 206 CG--NINSGQLANSSGMVGLGRGPLSLVSQLGPSRF-----SYCLTSFLSPEPSRLNFGV 258
Query: 211 ----GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----- 261
G G + + VV ++ S Y + G + G K LP
Sbjct: 259 FATLNGTNASSSGSPVQSTPLVVNAALPSLYF---------MSLKGISLGQKRLPIDPLV 309
Query: 262 ----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET---LPLCWK 308
V DSG+S T+L + Y + + EL + P ++T L C+
Sbjct: 310 FAINDDGTGGVFIDSGTSLTWLQQDAYDAV----RHELVSVLRPLPPTNDTEIGLETCF- 364
Query: 309 GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK-GNVCLGILNGAEVGL 367
P+ V + L F G T + PE Y++I G +CL ++ + +
Sbjct: 365 ---PWPPPPSVAVTVPDMELHFDGGANMT---VPPENYMLIDGATGFLCLAMIRSGDATI 418
>gi|213998836|gb|ACJ60785.1| nucellin [Hordeum bogdanii]
Length = 154
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 63/147 (42%), Positives = 81/147 (55%), Gaps = 5/147 (3%)
Query: 154 QRLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSG 210
+R ++A GCGY Q A P+DGILGLG GK+ +QL QK+I NV+GHCLS
Sbjct: 3 ERDKKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSS 62
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK-NLPVVFDSGSS 269
G G L+ GD S V W M YYSPG+AEL + G VFDSGS+
Sbjct: 63 KGKGVLYVGDFNPPSRGVTWVPMRESLF-YYSPGLAELLIDNQPIGGNPTFEAVFDSGST 121
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKE 296
YT++ Y + S ++ LS SL+E
Sbjct: 122 YTHVPAQIYNEIVSKVRGTLSESSLEE 148
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 97/387 (25%), Positives = 157/387 (40%), Gaps = 49/387 (12%)
Query: 5 HNGENLCFPTVRMSSSSSSSSSSSLFNHVG------SSLLFQVHGNVYPTGYYNVTMYIG 58
H N+ FP VR + + ++ + G S + + GN PT IG
Sbjct: 23 HANANMVFPVVRKFKGPAENLAAIKAHDAGRRGRFLSVVDLALGGNGRPTSTGLYYTKIG 82
Query: 59 QPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP----SNDLVPCEDP 106
Y++ +DTGSD W+ C V C P LY P ++ +VPC+D
Sbjct: 83 LGPNDYYVQVDTGSDTLWVNC----VGCTTCPKKSGLGMELTLYDPNSSKTSKVVPCDDE 138
Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL----NPRLAL 162
C S + C+ C Y + Y DG ++ G +KD F+ G N +
Sbjct: 139 FCTSTYDGPISGCKKDMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTSVIF 198
Query: 163 GCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFG 219
GCG Q + + LDGI+G G+ SS++SQL + ++ V HCL GG +F
Sbjct: 199 GCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVFSHCLDTVNGGGIFAI 258
Query: 220 DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNLPVVFDSGSSYT 271
++ V T+ +Y+ + ++ G+ L + DSG++
Sbjct: 259 GEVVQPK--VKTTPLVPRMAHYNVVLKDIEVAGDPIQLPTDIFDSTSGRGTIIDSGTTLA 316
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
YL Y L ++K L+ +S E E C+ + + + F T+ +F
Sbjct: 317 YLPVSIYDQL---LEKTLAQRSGMELYLVEDQFTCFH----YSDEKSLDDAFPTVKFTFE 369
Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLG 358
+G T T + P YL + C+G
Sbjct: 370 EGLTLTAY---PHDYLFPFKEDMWCIG 393
>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
Length = 599
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 107/372 (28%), Positives = 165/372 (44%), Gaps = 58/372 (15%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH-------PL 93
+HG V GY+ T+++G PAR + + +DTGS +T++ C A C R PH P
Sbjct: 52 LHGAVKDYGYFYATLHLGTPARQFAVIVDTGSTITYVPC-ASCGRNC-GPHHKDAAFDPA 109
Query: 94 YRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
S+ ++ C+ C P C + +C Y+ YA+ SS G+LV D +G
Sbjct: 110 SSSSSAVIGCDSDKCICGRPP--CGCSEKRECTYQRTYAEQSSSAGLLVSDQLQLR--DG 165
Query: 154 QRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGG 212
+ GC + DGILGLG + S+V+QL +I +V C S G
Sbjct: 166 A---VEVVFGCETKETGEIYNQEADGILGLGNSEVSLVNQLAGSGVIDDVFALCFGSVEG 222
Query: 213 GGFLFFGD---DLYDSSRVVWTSMSS-DYTKYYSPGVAELFFGGETTGLK------NLPV 262
G L GD YD + +SS + YYS + L+ GG+ +K
Sbjct: 223 DGALMLGDVDAAEYDVALQYTALLSSLAHPHYYSVQLEALWVGGQQLPVKPERYEEGYGT 282
Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEA--------PEDETLP----LCWKGR 310
V DSG+++TYL +Q + K+ +SA +L+ P++++ +C+ G
Sbjct: 283 VLDSGTTFTYLPSEAFQ----LFKEAVSAYALEHGLNSVKGPDPKEKSFAQFHDICFGG- 337
Query: 311 RPFKNVHD---VKKCFRTLALSFTDG-KTRTLFELTPEAYLII--SNKGNVCLGILNGAE 364
P D ++K F L F DG + RT P YL + G CLG+ +
Sbjct: 338 APHAGHADQSKLEKVFPVFELQFADGVRLRT----GPLNYLFMHTGEMGAYCLGVFDNGA 393
Query: 365 VGLQDLNVIGGI 376
G ++GGI
Sbjct: 394 SG----TLLGGI 401
>gi|25347778|pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
Length = 473
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 95/344 (27%), Positives = 150/344 (43%), Gaps = 34/344 (9%)
Query: 50 YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP-CVRCVEAPH------PLYRP----SN 98
Y NVT +G P+ + + LDTGSDL WL CD CVR ++AP +Y P ++
Sbjct: 56 YANVT--VGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSLDLNIYSPNASSTS 113
Query: 99 DLVPCEDPICASLHAPGHHNCEDPAQCDYELEY-ADGGSSLGVLVKDAFAF--NYTNGQR 155
VPC +C G + C Y++ Y ++G SS GVLV+D N + +
Sbjct: 114 TKVPCNSTLCTR----GDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKA 169
Query: 156 LNPRLALGCGYNQVPGASYH---PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
+ R+ GCG QV +H +G+ GLG S+ S L + + N C G
Sbjct: 170 IPARVTFGCG--QVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDG 227
Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTY 272
G + FGD R ++ + Y+ V ++ GG T L+ VFDSG+S+TY
Sbjct: 228 AGRISFGDKGSVDQRETPLNIRQPHPT-YNITVTKISVGGNTGDLE-FDAVFDSGTSFTY 285
Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPF--KNVHDVKKCFRTLALSF 330
L Y ++ K + + C+ R P + H K F+ A++
Sbjct: 286 LTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALRLPLYSGHHHPNKDSFQYPAVNL 345
Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
T + P + + + CL I+ ++D+++IG
Sbjct: 346 TMKGGSSYPVYHPLVVIPMKDTDVYCLAIMK-----IEDISIIG 384
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 97/352 (27%), Positives = 159/352 (45%), Gaps = 34/352 (9%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
++H ++ GYY ++IG P + + L +DTGS +T++ C + C +C P ++P
Sbjct: 1 MRLHDDLLINGYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSS-CEQCGRHQDPKFQP-- 57
Query: 99 DLVPCEDPICASLHAPGHHNCED-PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
DL + ++ NC+D QC YE +YA+ +S GVL +D +F N L
Sbjct: 58 DLSSTYQSVKCNIDC----NCDDEKQQCVYERQYAEMSTSSGVLGEDIISFG--NLSALA 111
Query: 158 P-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF- 215
P R GC + DGI+G+G+G SIV L + +I + C G G G
Sbjct: 112 PQRAVFGCENMETGDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGG 171
Query: 216 -LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF--------DS 266
+ G + S +V++ + YY+ + E+ G+ L P VF DS
Sbjct: 172 AMVLG-GISPPSNMVFSQSDPVRSPYYNIDLKEIHVAGKPLPLN--PTVFDGKHGTILDS 228
Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
G++Y YL + + + KEL + P+ +C+ G ++ + F +
Sbjct: 229 GTTYAYLPEAAFVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAG--SDISQLSSSFPAV 286
Query: 327 ALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
+ F +G+ L+PE YL +K G CLGI G ++GGI
Sbjct: 287 EMVFGNGQK---LLLSPENYLFRHSKVHGAYCLGIFQN---GKDPTTLLGGI 332
>gi|213998845|gb|ACJ60789.1| nucellin [Psathyrostachys fragilis subsp. fragilis]
Length = 150
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 63/141 (44%), Positives = 78/141 (55%), Gaps = 5/141 (3%)
Query: 160 LALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLI-RNVVGHCLSGGGGGFL 216
+A GCGY Q A P+DGILGLG GK+ +QL QK+I NV+GHCLS G G L
Sbjct: 7 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITENVIGHCLSSKGKGVL 66
Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYTYLNR 275
+ GD + V W M YYSPG+A LF + G VFDSGS+YTY+
Sbjct: 67 YVGDFNPPTRGVTWVPMRESLF-YYSPGLAALFIDKQPIRGNPTFEAVFDSGSTYTYVPA 125
Query: 276 VTYQTLTSIMKKELSAKSLKE 296
Y L S ++ LS SL+E
Sbjct: 126 QIYNELVSKIRGTLSESSLEE 146
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 93/348 (26%), Positives = 146/348 (41%), Gaps = 41/348 (11%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
G TG Y VT+ +G PA Y + DTGSD TW+QC V C E L+ P+
Sbjct: 172 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTY 231
Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
+ C P C+ L G C C Y ++Y DG S+G D + + +
Sbjct: 232 ANISCAAPACSDLDTRG---CSG-GNCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVK--- 284
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
GCG + G+LGLG+GK+S+ Q + + V HCL G G+L
Sbjct: 285 GFRFGCGERNE--GLFGEAAGLLGLGRGKTSLPVQTYDK--YGGVFAHCLPARSSGTGYL 340
Query: 217 FF--GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP--------VVFDS 266
F G +R+ ++ + +Y G+ + GG+ L ++P + DS
Sbjct: 341 DFGPGSPAAAGARLTTPMLTDNGPTFYYVGMTGIRVGGQ---LLSIPQSVFTTAGTIVDS 397
Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
G+ T L Y +L S ++A+ K+AP L C+ F + V T+
Sbjct: 398 GTVITRLPPAAYSSLRSAFASAMAARGYKKAPAVSLLDTCYD----FTGMSQVA--IPTV 451
Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
+L F G ++ + ++ VCLG + G D+ ++G
Sbjct: 452 SLLFQGGAR---LDVDASGIMYAASVSQVCLGFAANEDGG--DVGIVG 494
>gi|213998798|gb|ACJ60766.1| nucellin [Hordeum brevisubulatum subsp. violaceum]
Length = 141
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 62/142 (43%), Positives = 79/142 (55%), Gaps = 5/142 (3%)
Query: 160 LALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSGGGGGFL 216
+A GCGY Q A P+DGILGLG GK+ +QL QK+I+ NV+GHCLS G G L
Sbjct: 1 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMIKENVIGHCLSSKGKGVL 60
Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYTYLNR 275
+ GD S V W M YYSPG+AEL + G VFDSGS+YT++
Sbjct: 61 YVGDFNPPSRGVTWVPMRESLF-YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPA 119
Query: 276 VTYQTLTSIMKKELSAKSLKEA 297
Y + S ++ LS SL+E
Sbjct: 120 QIYNEIVSKVRGTLSEPSLEEV 141
>gi|213998812|gb|ACJ60773.1| nucellin [Hordeum euclaston]
Length = 154
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 64/151 (42%), Positives = 81/151 (53%), Gaps = 5/151 (3%)
Query: 154 QRLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSG 210
QR ++A GCGY Q A P+DGILGLG GK+ +QL QK+I NV+GHCLS
Sbjct: 3 QRDKKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSS 62
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSS 269
G G L+ GD S V W M YYS G+AEL + G VFDSGS+
Sbjct: 63 KGKGVLYVGDFNPPSRGVTWVPMKESLF-YYSAGLAELLIDNQPIRGNPTFEAVFDSGST 121
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPED 300
YT++ Y + S ++ LS SL+E D
Sbjct: 122 YTHVPAQIYNEIVSKVRGTLSESSLEEVKGD 152
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 95/362 (26%), Positives = 156/362 (43%), Gaps = 52/362 (14%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLYRPSND--- 99
TG Y + IG P + Y++ +DTGSD+ W+ C C RC + LY P +
Sbjct: 86 TGLYYTEIGIGTPTKRYYVQVDTGSDILWVNC-ISCDRCPRKSGLGLELTLYDPKDSSTG 144
Query: 100 -LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG----Q 154
V C+ CA+ + C C+Y + Y DG S+ G V D F+ +G +
Sbjct: 145 SKVSCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTR 204
Query: 155 RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-G 211
N + GCG Q G+S LDGI+G G+ +S++SQL + ++ + HCL
Sbjct: 205 PANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTIN 264
Query: 212 GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNLPVV 263
GGG G+ + +V T + + +Y+ + + GG L + +
Sbjct: 265 GGGIFAIGNVV--QPKVKTTPLVPN-MPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGTI 321
Query: 264 FDSGSSYTYLNRVTYQTLTSIM---KKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
DSG++ TYL + Y+ + + K+++ +++E LC F+ V V
Sbjct: 322 IDSGTTLTYLPEIVYKEIMLAVFAKHKDITFHNVQEF-------LC------FQYVGRVD 368
Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI--GD 378
F + F + ++ P Y + C+G NG GLQ + G + GD
Sbjct: 369 DDFPKITFHFENDLPLNVY---PHDYFFENGDNLYCVGFQNG---GLQSKDGKGMVLLGD 422
Query: 379 FV 380
V
Sbjct: 423 LV 424
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis sativus]
Length = 498
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 77/264 (29%), Positives = 116/264 (43%), Gaps = 35/264 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEA--------PHPLYRPSN- 98
G Y + IG P++ Y++ +DTGSD+ W+ C C C P+ L +
Sbjct: 84 VGLYYAKIGIGTPSKDYYVQVDTGSDIVWVNC-IQCRECPRTSSLGMELTPYDLEESTTG 142
Query: 99 DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ---- 154
LV C++ C ++ C C Y Y DG S+ G VKD +N +G
Sbjct: 143 KLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETT 202
Query: 155 RLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
N + GCG Q + + LDGILG GK SSI+SQL S + ++ + HCL G
Sbjct: 203 AANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGT 262
Query: 212 GGGFLFFGDDLYDSSRVVWTSMSSDYTKY--YSPGV----------AELFFGGETTGLKN 259
GG +F + +V T + + Y GV A++F G+ G
Sbjct: 263 NGGGIFAMGHVVQ-PKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDRKG--- 318
Query: 260 LPVVFDSGSSYTYLNRVTYQTLTS 283
+ DSG++ YL + Y+ L +
Sbjct: 319 --TIIDSGTTLAYLPELIYEPLVA 340
>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
Length = 491
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 101/361 (27%), Positives = 158/361 (43%), Gaps = 50/361 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH--------PLYRP--S 97
TG Y + IG P + Y++ +DTGSD+ W+ C +RC P Y P S
Sbjct: 81 TGLYYTRIEIGSPPKGYYVQVDTGSDILWVNC----IRCDGCPTRSGLGIELTQYDPAGS 136
Query: 98 NDLVPCEDPICASLHAPG-HHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYT--NG 153
V CE C + A G C + C + + Y DG ++ G V D +N NG
Sbjct: 137 GTTVGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNG 196
Query: 154 QRL--NPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL- 208
Q N + GCG G+S LDGILG G+ SS++SQL + + +R + HCL
Sbjct: 197 QTTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLD 256
Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNL 260
+ GGG G+ + +V T + + T +Y+ + + GG T L +
Sbjct: 257 TVRGGGIFAIGNVV--QPKVKTTPLVPNVT-HYNVNLQGISVGGATLQLPTSTFDSGDSK 313
Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL-CWKGRRPFKNVHDV 319
+ DSG++ YL R Y+TL + + + + LPL ++ F+ +
Sbjct: 314 GTIIDSGTTLAYLPREVYRTLLAAVFDKY-----------QDLPLHNYQDFVCFQFSGSI 362
Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDF 379
F + SF T ++ P+ YL + C+G L+G V +D + +GD
Sbjct: 363 DDGFPVITFSFKGDLTLNVY---PDDYLFQNRNDLYCMGFLDGG-VQTKDGKDMLLLGDL 418
Query: 380 V 380
V
Sbjct: 419 V 419
>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
Length = 491
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 101/361 (27%), Positives = 158/361 (43%), Gaps = 50/361 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH--------PLYRP--S 97
TG Y + IG P + Y++ +DTGSD+ W+ C +RC P Y P S
Sbjct: 81 TGLYYTRIEIGSPPKGYYVQVDTGSDILWVNC----IRCDGCPTRSGLGIELTQYDPAGS 136
Query: 98 NDLVPCEDPICASLHAPG-HHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYT--NG 153
V CE C + A G C + C + + Y DG ++ G V D +N NG
Sbjct: 137 GTTVGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNG 196
Query: 154 QRL--NPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL- 208
Q N + GCG G+S LDGILG G+ SS++SQL + + +R + HCL
Sbjct: 197 QTTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLD 256
Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNL 260
+ GGG G+ + +V T + + T +Y+ + + GG T L +
Sbjct: 257 TVRGGGIFAIGNVV--QPKVKTTPLVPNVT-HYNVNLQGISVGGATLQLPTSTFDSGDSK 313
Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL-CWKGRRPFKNVHDV 319
+ DSG++ YL R Y+TL + + + + LPL ++ F+ +
Sbjct: 314 GTIIDSGTTLAYLPREVYRTLLAAVFDKY-----------QDLPLHNYQDFVCFQFSGSI 362
Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDF 379
F + SF T ++ P+ YL + C+G L+G V +D + +GD
Sbjct: 363 DDGFPVITFSFEGDLTLNVY---PDDYLFQNRNDLYCMGFLDGG-VQTKDGKDMLLLGDL 418
Query: 380 V 380
V
Sbjct: 419 V 419
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 94/354 (26%), Positives = 146/354 (41%), Gaps = 48/354 (13%)
Query: 29 LFNHVGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC 86
L +G + F V G P G Y + +G P R +++ +DTGSD+ W+ C A C C
Sbjct: 57 LLQSLGGVIDFPVDGTFDPFVVGLYYTKIRLGSPPRDFYVQVDTGSDVLWVSC-ASCNGC 115
Query: 87 -----VEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGS 136
++ + P + + V C D C+ C C Y +Y DG
Sbjct: 116 PQTSGLQIQLNFFDPGSSVTATPVSCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSG 175
Query: 137 SLGVLVKDAFAFNYTNGQRLNPR----LALGCGYNQVPG--ASYHPLDGILGLGKGKSSI 190
+ G V D F+ G L P + GC +Q S +DGI G G+ S+
Sbjct: 176 TSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSV 235
Query: 191 VSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAEL 248
+SQL SQ L V HCL G GGGG L G+ + +V+T + +Y+ + +
Sbjct: 236 ISQLASQGLAPRVFSHCLKGENGGGGILVLGEIV--EPNMVFTPLVPS-QPHYNVNLLSI 292
Query: 249 FFGGETTGLKNLPVVF----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAP 298
G+ + P VF D+G++ YL+ Y +++ A
Sbjct: 293 SVNGQALPIN--PSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFV---------EAITNAV 341
Query: 299 EDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK 352
P+ KG + + V F ++L+F G ++F L P+ YLI N
Sbjct: 342 SQSVRPVVSKGNQCYVIATSVADIFPPVSLNFAGGA--SMF-LNPQDYLIQQNN 392
>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial [Cucumis sativus]
Length = 420
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 76/267 (28%), Positives = 116/267 (43%), Gaps = 41/267 (15%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP------------LYR 95
G Y + IG P++ Y++ +DTGSD+ W+ C ++C E P
Sbjct: 84 VGLYYAKIGIGTPSKDYYVQVDTGSDIVWVNC----IQCRECPRTSSLGMELTPYDLEES 139
Query: 96 PSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ- 154
+ LV C++ C ++ C C Y Y DG S+ G VKD +N +G
Sbjct: 140 TTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDL 199
Query: 155 ---RLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
N + GCG Q + + LDGILG GK SSI+SQL S + ++ + HCL
Sbjct: 200 ETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCL 259
Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY--YSPGV----------AELFFGGETTG 256
G GG +F + +V T + + Y GV A++F G+ G
Sbjct: 260 DGTNGGGIFAMGHVV-QPKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDRKG 318
Query: 257 LKNLPVVFDSGSSYTYLNRVTYQTLTS 283
+ DSG++ YL + Y+ L +
Sbjct: 319 -----TIIDSGTTLAYLPELIYEPLVA 340
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 60/159 (37%), Positives = 76/159 (47%), Gaps = 9/159 (5%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
T Y V + IG P P LDTGSDL W QCDAPC RC P PLY P+ V C
Sbjct: 89 TATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSC 148
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
P+C +L +P C Y Y DG S+ GVL + F R +A G
Sbjct: 149 RSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVR---GVAFG 205
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRN 202
CG + S G++G+G+G S+VSQL + R+
Sbjct: 206 CGTENL--GSTDNSSGLVGMGRGPLSLVSQLGVTRPRRS 242
>gi|213998838|gb|ACJ60786.1| nucellin [Hordeum vulgare subsp. vulgare]
Length = 154
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 61/146 (41%), Positives = 81/146 (55%), Gaps = 5/146 (3%)
Query: 155 RLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSGG 211
R ++A GCGY Q A P+DGILGLG GK+ +QL K+I+ NV+GHCLS
Sbjct: 4 RDKKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGHKMIKENVIGHCLSSK 63
Query: 212 GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSY 270
G G L+ GD + V W M YYSPG+AE+F + G VFDSGS+Y
Sbjct: 64 GKGVLYVGDFNPPTRGVTWAPMRESLF-YYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTY 122
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKE 296
T++ Y + S ++ LS SL+E
Sbjct: 123 THVPAQIYNEIVSKVRVTLSESSLEE 148
>gi|213998810|gb|ACJ60772.1| nucellin [Hordeum comosum]
Length = 154
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 63/148 (42%), Positives = 80/148 (54%), Gaps = 5/148 (3%)
Query: 154 QRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSG 210
QR ++A GCGY Q A P +DGILGLG GK+ +QL QK+I NV+GHCLS
Sbjct: 3 QRDKKKIAFGCGYKQEEPADSPPSLVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSS 62
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSS 269
G G L+ GD S V W M YYSPG+AEL + G VFDS S+
Sbjct: 63 KGKGVLYVGDFNPPSRGVTWVPMKESLF-YYSPGLAELLIDNQPIRGNPTFEAVFDSDST 121
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEA 297
YT++ Y + S ++ LS SL+E
Sbjct: 122 YTHVPAQIYNEIVSKVRGTLSESSLEEV 149
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 98/361 (27%), Positives = 158/361 (43%), Gaps = 50/361 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP--S 97
TG Y + IG P++ Y++ +DTGSD+ W+ C +RC P Y P S
Sbjct: 82 TGLYYTQIEIGSPSKGYYVQVDTGSDILWVNC----IRCDGCPTTSGLGIELTQYDPAGS 137
Query: 98 NDLVPCEDPICASLHAPGHHNCEDPAQ---CDYELEYADGGSSLGVLVKDAFAFNYT--N 152
V C+ C + ++P P+ C + + Y DG S+ G V D+ +N N
Sbjct: 138 GTTVGCDQEFCVA-NSPNGLPPACPSTSSPCQFRIAYGDGSSTTGFYVSDSVQYNQVSGN 196
Query: 153 GQRL--NPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
GQ N + GCG G+S LDGILG G+ SS++SQL + + +R + HCL
Sbjct: 197 GQTTPSNASITFGCGAQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCL 256
Query: 209 -SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KN 259
+ GGG G+ + +V T + + T +Y+ + + GG T L +
Sbjct: 257 DTVHGGGIFAIGNVV--QPKVKTTPLVQNVT-HYNVNLQGISVGGATLQLPSSTFDSGDS 313
Query: 260 LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
+ DSG++ YL R Y+TL + + + +L ++ F+ +
Sbjct: 314 KGTIIDSGTTLAYLPREVYRTLLTAVFDKYQDLALHN----------YQDFVCFQFSGSI 363
Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDF 379
F + SF T ++ P YL + C+G L+G V +D + +GD
Sbjct: 364 DDGFPVVTFSFEGEITLNVY---PHDYLFQNENDLYCMGFLDGG-VQTKDGKDMVLLGDL 419
Query: 380 V 380
V
Sbjct: 420 V 420
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 85/272 (31%), Positives = 128/272 (47%), Gaps = 31/272 (11%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN-- 98
V G+ +G Y V ++G P + + L +D+GSDL W+QC APC++C PLY PSN
Sbjct: 55 VSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQC-APCLQCYAQDTPLYAPSNSS 113
Query: 99 --DLVPCEDPICASLHAPGHHNCE--DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
+ VPC P C + A C+ P C YE YAD S GV A+ +
Sbjct: 114 TFNPVPCLSPECLLIPATEGFPCDFHYPGACAYEYRYADTSLSKGVF---AYESATVDDV 170
Query: 155 RLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQL---HSQKLIRNVVGHCLSGG 211
R++ ++A GCG + S+ G+LGLG+G S SQ+ + K +V +
Sbjct: 171 RID-KVAFGCGRDN--QGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTS 227
Query: 212 GGGFLFFGDDL----YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTG----------L 257
+L FGD+L +D S S + T YY + ++ GGE+ L
Sbjct: 228 VSSWLIFGDELISTIHDLQFTPIVSNSRNPTLYYV-QIEKVMVGGESLPISHSAWSLDFL 286
Query: 258 KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKEL 289
N +FDSG++ TY Y+ + + K +
Sbjct: 287 GNGGSIFDSGTTVTYWLPPAYRNILAAFDKNV 318
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 94/348 (27%), Positives = 146/348 (41%), Gaps = 41/348 (11%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
G TG Y VT+ +G PA Y + DTGSD TW+QC V C E L+ P+
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARSSTY 230
Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
V C P C L G C C Y ++Y DG S+G D + + +
Sbjct: 231 ANVSCAAPACFDLDTRG---CSG-GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVK--- 283
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
GCG + G+LGLG+GK+S+ Q + + V HCL G G+L
Sbjct: 284 GFRFGCGERNE--GLFGEAAGLLGLGRGKTSLPVQTYDK--YGGVFAHCLPARSSGTGYL 339
Query: 217 FF--GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP--------VVFDS 266
F G +R+ ++ + +Y G+ + GG+ L ++P + DS
Sbjct: 340 DFGPGSPAAAGARLTTPMLTDNGPTFYYVGMTGIRVGGQ---LLSIPQSVFATAGTIVDS 396
Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
G+ T L Y +L S ++A+ K+AP L C+ F + V T+
Sbjct: 397 GTVITRLPPPAYSSLRSAFVSAMAARGYKKAPAVSLLDTCYD----FTGMSQVA--IPTV 450
Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
+L F G + ++ + ++ VCLG + G D+ ++G
Sbjct: 451 SLLFQGG---AILDVDASGIMYAASVSQVCLGFAANEDGG--DVGIVG 493
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis sativus]
Length = 478
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 105/372 (28%), Positives = 160/372 (43%), Gaps = 43/372 (11%)
Query: 35 SSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----V 87
S + ++ GN +P TG Y + IG P + + +DTGSD+ W+ C C C +
Sbjct: 55 SVIDLELGGNGHPAETGLYYARIGIGSPPNDFHVQVDTGSDILWVNC-VGCSNCPKKSDI 113
Query: 88 EAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVK 143
LY P ++ L+ C+ P C++ + C+ C Y++ Y DG ++ G V
Sbjct: 114 GVDLQLYNPKSSSTSTLITCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVN 173
Query: 144 DAFAFNYTNGQ----RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQ 197
D G N + GCG Q G+S LDGILG G+ SS++SQL +
Sbjct: 174 DYIQLQRAVGNHKTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAAT 233
Query: 198 KLIRNVVGHCL-SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVA------ELFF 250
++ + HCL S GGG G+ + + + + GV +L
Sbjct: 234 GKVKKIFAHCLDSISGGGIFAIGEVVEPKLKTTPVVPNQAHYNVVLNGVKVGDTALDLPL 293
Query: 251 GGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAK-SLKEAPEDETLPLCWKG 309
G T K ++ DSG++ YL Y L M+K L A+ LK D+ C+
Sbjct: 294 GLFETSYKRGAII-DSGTTLAYLPDSIYLPL---MEKILGAQPDLKLRTVDDQFT-CFVF 348
Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILN-GAEVGLQ 368
KNV D F T+ F + T++ P YL C+G N GA+ +
Sbjct: 349 D---KNVDD---GFPTVTFKFEESLILTIY---PHEYLFQIRDDVWCVGWQNSGAQS--K 397
Query: 369 DLNVIGGIGDFV 380
D N + +GD V
Sbjct: 398 DGNEVTLLGDLV 409
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 104/371 (28%), Positives = 154/371 (41%), Gaps = 63/371 (16%)
Query: 44 NVYPTG--YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----S 97
+V P+G Y V + IG P +P LDTGSDL W QC APC C+ P PL+ P S
Sbjct: 93 SVRPSGDLEYVVDLAIGTPPQPVSALLDTGSDLIWTQC-APCASCLAQPDPLFAPGESAS 151
Query: 98 NDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL- 156
+ + C +C+ + HH CE P C Y Y DG ++GV + F F + G RL
Sbjct: 152 YEPMRCAGQLCSDIL---HHGCEMPDTCTYRYNYGDGTMTMGVYATERFTFTSSGGDRLM 208
Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG-- 214
L GCG V S + GI+G G+ S+VSQL ++ +CL+ G G
Sbjct: 209 TVPLGFGCGSMNV--GSLNNGSGIVGFGRNPLSLVSQLSIRRF-----SYCLTSYGSGRK 261
Query: 215 -FLFFGD---DLY-DSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-------- 261
L FG +Y D++ V T + +P + G T G + L
Sbjct: 262 STLLFGSLSGGVYGDATGPVQT--TPLLQSLQNPTFYYVHLAGLTVGARRLRIPESAFAL 319
Query: 262 -------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEA-PEDET---LPLCWKGR 310
V+ DSG++ T L + +++L PED +P W+
Sbjct: 320 RPDGSGGVIVDSGTALTLLPGAVLAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRS 379
Query: 311 RPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISN-KGNVCLGILNGAEVG--- 366
V + F F D +L Y++ + KG +CL + + + G
Sbjct: 380 SSTSQVPVPRMVFH-----FQDAD----LDLPRRNYVLDDHRKGRLCLLLADSGDDGSTI 430
Query: 367 ----LQDLNVI 373
QD+ V+
Sbjct: 431 GNLVQQDMRVL 441
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 94/298 (31%), Positives = 132/298 (44%), Gaps = 35/298 (11%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y V + +G PAR Y + +DTGS L+WLQC V C PL+ PS + C
Sbjct: 10 SGNYYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSC 69
Query: 104 EDPICASLHAPGHHN--CEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
C+SL +N CE + C Y Y D S+G L +D Q L P
Sbjct: 70 TSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTL--APSQTL-PGF 126
Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFFG 219
GCG Q + GILGLG+ K S++ Q+ S+ +CL + GGGGFL G
Sbjct: 127 VYGCG--QDSEGLFGRAAGILGLGRNKLSMLGQVSSK--FGYAFSYCLPTRGGGGFLSIG 182
Query: 220 DDLYDSSRVVWTSMSSDYTKYYSPGVAELFF--------GGETTGLK----NLPVVFDSG 267
S +T M++D PG L+F GG G+ +P + DSG
Sbjct: 183 KASLAGSAYKFTPMTTD------PGNPSLYFLRLTAITVGGRALGVAAAQYRVPTIIDSG 236
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR-RPFKNVHDVKKCFR 324
+ T L Y K +S+K AP L C+KG + ++V +V+ F+
Sbjct: 237 TVITRLPMSVYTPFQQAFVKIMSSK-YARAPGFSILDTCFKGNLKDMQSVPEVRLIFQ 293
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 94/363 (25%), Positives = 148/363 (40%), Gaps = 67/363 (18%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP--CVRCVEAPHPLYRPSN 98
V G+ +G Y V + +G PA+ + L +DTGSDLTW+QC+ P P P Y S+
Sbjct: 17 VSGSSIGSGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSS 76
Query: 99 D----LVPCEDPICASLHAPGHHNC--EDPAQCDYELEYADGGSSLGVLVKDAFAFN--Y 150
+PC D C L AP +C + P+ CDY Y+D + G+L + +
Sbjct: 77 SSSYREIPCTDDECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMKSRK 136
Query: 151 TNGQRLN---------PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
+G+R +ALGC V GAS+ G+LGLG+G S+ +Q L
Sbjct: 137 RSGKRAGNHKTRTIRIKNVALGCSRESV-GASFLGASGVLGLGQGPISLATQTRHTAL-G 194
Query: 202 NVVGHCL-----SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTG 256
+ +CL FL G R W ++ +T A+ F+ TG
Sbjct: 195 GIFSYCLVDYLRGSNASSFLVMG-------RTRWRKLA--HTPIVRNPAAQSFYYVNVTG 245
Query: 257 LK--------------------NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
+ N +FDSG++ +YL Y + + + +E
Sbjct: 246 VAVDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQE 305
Query: 297 APEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVC 356
PE LC+ NV ++K L + F G + EL Y+++ + C
Sbjct: 306 IPEG--FELCY-------NVTRMEKGMPKLGVEFQGG---AVMELPWNNYMVLVAENVQC 353
Query: 357 LGI 359
+ +
Sbjct: 354 VAL 356
>gi|213998826|gb|ACJ60780.1| nucellin [Hordeum intercedens]
Length = 148
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 63/147 (42%), Positives = 80/147 (54%), Gaps = 5/147 (3%)
Query: 154 QRLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSG 210
QR ++A GCGY Q A P+DGILGLG GK+ +QL QK+I NV+GHCLS
Sbjct: 3 QRDKKKVAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSS 62
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSS 269
G G L+ GD S V W M YYS G+AEL + G VFDSGS+
Sbjct: 63 KGKGVLYVGDFNPPSRGVTWVPMKESLF-YYSAGLAELLIDNQPIRGNPTFEAVFDSGST 121
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKE 296
YT++ Y + S ++ LS SL+E
Sbjct: 122 YTHVPAQIYNEIVSKVRGTLSESSLEE 148
>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 535
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 79/273 (28%), Positives = 119/273 (43%), Gaps = 32/273 (11%)
Query: 34 GSSLLFQVHG--NVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP- 90
G L F V G + Y G Y + +G PA+ +++ +DTGSD+ WL C+ C C ++
Sbjct: 52 GGILDFSVQGTSDPYLVGLYFTKVKMGSPAKEFYVQIDTGSDILWLNCNT-CNNCPKSSG 110
Query: 91 --------HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVL 141
+ LV C DP+C+ C A QC Y +Y DG + G
Sbjct: 111 LGIDLNYFDTASSSTAALVSCSDPVCSYAVQTATSQCSSQANQCSYTFQYGDGSGTSGYY 170
Query: 142 VKDAFAFNYTNGQRL----NPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLH 195
V DA F+ GQ + + + GC Q + +DGI G G G S+VSQ+
Sbjct: 171 VYDAMYFDVIMGQSVFSNSSSTVVFGCSTYQSGDLARTEKAVDGIFGFGPGALSVVSQVS 230
Query: 196 SQKLIRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE 253
SQ + V HCL G GGG L G+ L +V+T + +Y+ + + G+
Sbjct: 231 SQGMAPKVFSHCLKGQGSGGGILVLGEIL--EPNIVYTPLVP-LQPHYNLNLQSIAVNGQ 287
Query: 254 TTGL--------KNLPVVFDSGSSYTYLNRVTY 278
+ N + DSG++ YL + Y
Sbjct: 288 ILPIDQDVFATGNNRGTIVDSGTTLAYLVQEAY 320
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 81/276 (29%), Positives = 129/276 (46%), Gaps = 43/276 (15%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
+G Y + + +G P + + +DTGSDL W+QC APC RC E P PL+ P S C
Sbjct: 5 SGEYVLQISLGTPPQQFSAIVDTGSDLCWVQC-APCARCFEQPDPLFIPLASSSYSNASC 63
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
D +C +L P C C Y Y DG ++ G AF NG L R+ G
Sbjct: 64 TDSLCDALPRP---TCSMRNTCTYSYSYGDGSNTRGDF---AFETVTLNGSTL-ARIGFG 116
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFG 219
CG+NQ ++ DG++GLG+G S+ SQL+S ++ +CL + G + FG
Sbjct: 117 CGHNQ--EGTFAGADGLIGLGQGPLSLPSQLNSS--FTHIFSYCLVDQSTTGTFSPITFG 172
Query: 220 DDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLP---------------V 262
+ ++SR +T + + D YY GV + + G + +P V
Sbjct: 173 -NAAENSRASFTPLLQNEDNPSYYYVGVESI-----SVGNRRVPTPPSAFRIDANGVGGV 226
Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAP 298
+ DSG++ TY + + + +++++S P
Sbjct: 227 ILDSGTTITYWRLAAFIPILAELRRQISYPEADPTP 262
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 63/189 (33%), Positives = 89/189 (47%), Gaps = 23/189 (12%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH--------PLY----R 95
G Y + IG P + Y+L +DTGSD+ W+ C ++C E P LY
Sbjct: 80 VGLYYAKIGIGTPPKNYYLQVDTGSDIMWVNC----IQCKECPTRSSLGMDLTLYDIKES 135
Query: 96 PSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ- 154
S LVPC+ C ++ C C Y Y DG S+ G VKD ++ +G
Sbjct: 136 SSGKLVPCDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDL 195
Query: 155 ---RLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
N + GCG Q + ++ LDGILG GK SS++SQL S ++ + HCL
Sbjct: 196 KTDSANGSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCL 255
Query: 209 SGGGGGFLF 217
+G GG +F
Sbjct: 256 NGVNGGGIF 264
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 95/344 (27%), Positives = 142/344 (41%), Gaps = 35/344 (10%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
G TG Y VT+ +G PA Y + DTGSD TW+QC V C E L+ P+
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTY 230
Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
V C P C+ L G C C Y ++Y DG S+G D + + +
Sbjct: 231 ANVSCAAPACSDLDTRG---CSG-GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVK--- 283
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
GCG + G+LGLG+GK+S+ Q + + V HCL G G+L
Sbjct: 284 GFRFGCGERNE--GLFGEAAGLLGLGRGKTSLPVQTYDK--YGGVFAHCLPARSTGTGYL 339
Query: 217 FFGDDLYDSSRVVWTSMSSDY-TKYYSPGVAELFFGGE-----TTGLKNLPVVFDSGSSY 270
FG ++R+ T M D +Y G+ + GG + + DSG+
Sbjct: 340 DFGAG-SPAARLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFATAGTIVDSGTVI 398
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
T L Y +L S +SA+ K+AP L C+ F + V T++L F
Sbjct: 399 TRLPPAAYSSLRSAFAAAMSARGYKKAPAVSLLDTCYD----FAGMSQVA--IPTVSLLF 452
Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
G ++ + ++ VCL + G D+ ++G
Sbjct: 453 QGGAR---LDVDASGIMYAASASQVCLAFAANEDGG--DVGIVG 491
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 90/357 (25%), Positives = 144/357 (40%), Gaps = 59/357 (16%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP---------HPLYRPSN 98
TG Y + +G P + Y++ +DTGSD+ W+ C C +C P S
Sbjct: 84 TGLYFTEIKLGTPPKRYYVQVDTGSDILWVNC-ISCSKCPRKSGLGLDLTFYDPKASSSG 142
Query: 99 DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG----Q 154
V C+ CA+ + C C+Y + Y DG S+ G + DA F+ G Q
Sbjct: 143 STVSCDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFITDALQFDQVTGDGQTQ 202
Query: 155 RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS--G 210
N + GCG Q G S LDGILG G+ +S++SQL + + + HCL
Sbjct: 203 PGNATITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCLDTIK 262
Query: 211 GGGGF-------------LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL 257
GGG F FF L + + M +Y+ + + GG T L
Sbjct: 263 GGGIFAIGNVVQPKCYFVFFFAHGLLNIPLFLLV-MILLSRPHYNVNLKSIDVGGTTLQL 321
Query: 258 --------KNLPVVFDSGSSYTYLNRVTYQTLTSIM---KKELSAKSLKEAPEDETLPLC 306
+ + DSG++ TYL + ++ + ++ ++++ +L++ LC
Sbjct: 322 PAHVFETGEKKGTIIDSGTTLTYLPELVFKQVMDVVFSKHRDIAFHNLQDF-------LC 374
Query: 307 WKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGA 363
F+ V F T+ F D ++ P Y + C+G NGA
Sbjct: 375 ------FQYSGSVDDGFPTITFHFEDDLALHVY---PHEYFFPNGNDIYCVGFQNGA 422
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 96/357 (26%), Positives = 142/357 (39%), Gaps = 57/357 (15%)
Query: 39 FQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------V 87
F V G+ P G Y + +G P + YF+ +DTGSD+ W+ C +PC C +
Sbjct: 77 FPVEGSANPFMVGLYFTRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGCPSSSGLNIQL 135
Query: 88 EAPHPLYRPSNDLVPCEDPICASLHAPGHHNCE--DPAQCDYELEYADGGSSLGVLVKDA 145
E +P ++ +PC D C + C+ D + C Y Y DG + G V D
Sbjct: 136 EFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDT 195
Query: 146 FAFNYTNGQRLNPR----LALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKL 199
F+ G + GC +Q + +DGI G G+ + S+VSQL+S +
Sbjct: 196 MYFDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGV 255
Query: 200 IRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKY------------YSPGV 245
V HCL G GGG L G+ + +V+T + Y P
Sbjct: 256 SPKVFSHCLKGSDNGGGILVLGEIV--EPGLVYTPLVPSQPHYNLNLESIVVNGQKLPID 313
Query: 246 AELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
+ LF T G + DSG++ YL Y + + +S L
Sbjct: 314 SSLFTTSNTQG-----TIVDSGTTLAYLADGAYDPFVNAITAAVSP---------SVRSL 359
Query: 306 CWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI----ISNKGNVCLG 358
KG + F V F T++L F G T + PE YL+ I N C+G
Sbjct: 360 VSKGNQCFVTSSSVDSSFPTVSLYFMGGVAMT---VKPENYLLQQASIDNNVLWCIG 413
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 96/365 (26%), Positives = 152/365 (41%), Gaps = 46/365 (12%)
Query: 13 PTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGS 72
P + S+SSS+ S L G TG Y VT+ +G PA Y + DTGS
Sbjct: 135 PGIHPGHSASSSTPS----------LPATSGRAVSTGNYVVTVGLGTPASKYTVVFDTGS 184
Query: 73 DLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQCDYE 128
D TW+QC V+C + PL+ P+ V C D CA L G C C Y
Sbjct: 185 DTTWVQCRPCVVKCYKQKEPLFDPAKSSTYANVSCTDSACADLDTNG---CTG-GHCLYA 240
Query: 129 LEYADGGSSLGVLVKDAF--AFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKG 186
++Y DG ++G +D A + G R GCG + G++GLG+G
Sbjct: 241 VQYGDGSYTVGFFAQDTLTIAHDAIKGFR------FGCGEKN--NGLFGKTAGLMGLGRG 292
Query: 187 KSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPG 244
K+S+ Q +++ +CL G G+L FG ++ + ++ +Y G
Sbjct: 293 KTSLTVQAYNK--YGGAFAYCLPALTTGTGYLDFGPGSAGNNARLTPMLTDKGQTFYYVG 350
Query: 245 VAELFFGGETTGL-----KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPE 299
+ + GG+ + + DSG+ T L Y L+S K + A+ K+AP
Sbjct: 351 MTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPG 410
Query: 300 DETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGI 359
L C+ F + DV+ T++L F G ++ + ++ VCL
Sbjct: 411 YSILDTCYD----FTGLSDVE--LPTVSLVFQGG---ACLDVDVSGIVYAISEAQVCLAF 461
Query: 360 LNGAE 364
+ +
Sbjct: 462 ASNGD 466
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 85/350 (24%), Positives = 144/350 (41%), Gaps = 41/350 (11%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDA----PCVRCVEAP----HPLYRPSNDLVP 102
Y + +G P R +++ +DTGSD+ W+ C + P + P P P+ L+
Sbjct: 90 YYTRLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLIS 149
Query: 103 CEDPICA-SLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL----N 157
C D C+ L + QC Y +Y DG + G V D F+ G + +
Sbjct: 150 CSDQRCSLGLQSSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSVMKNSS 209
Query: 158 PRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGG 213
+ GC Q + +DGI G G+ S++SQL SQ + V HCL G GG
Sbjct: 210 APIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCLKGDDSGG 269
Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNLPVVFD 265
G L G+ + +V+T + +Y+ + ++ G+T + N + D
Sbjct: 270 GILVLGEIV--EPNIVYTPLVPS-QPHYNLNLQSIYVNGQTLAIDPSVFATSSNQGTIID 326
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
SG++ YL Y S + +S P KG + + + F
Sbjct: 327 SGTTLAYLTEAAYDPFISAITSTVSP---------SVSPYLSKGNQCYLTSSSINDVFPQ 377
Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGN-VCLGILNGAEVGLQDLNVIG 374
++L+F G + L P+ YLI + N L + ++ Q++ ++G
Sbjct: 378 VSLNFAGGTSMILI---PQDYLIQQSSINGAALWCVGFQKIQGQEITILG 424
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 93/363 (25%), Positives = 146/363 (40%), Gaps = 67/363 (18%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP--CVRCVEAPHPLYRPSN 98
V G+ +G Y V + +G PA+ + L +DTGSDLTW+QC+ P P P Y S+
Sbjct: 49 VSGSSIGSGQYFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSS 108
Query: 99 D----LVPCEDPICASLHAPGHHNCE--DPAQCDYELEYADGGSSLGVLVKDAFAF---- 148
+PC D C L AP +C P+ CDY Y+D + G+L + +
Sbjct: 109 SSSYREIPCTDDECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRK 168
Query: 149 -------NYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
N+ + +ALGC V GAS+ G+LGLG+G S+ +Q L
Sbjct: 169 RSGKRAGNHKTRRIRIKNVALGCSRESV-GASFLGASGVLGLGQGPISLATQTRHTAL-G 226
Query: 202 NVVGHCL-----SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTG 256
+ +CL FL G R W ++ +T A+ F+ TG
Sbjct: 227 GIFSYCLVDYLRGSNASSFLVMG-------RTHWRKLA--HTPIVRNPAAQSFYYVNVTG 277
Query: 257 LK--------------------NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
+ N +FDSG++ +YL Y + + + +E
Sbjct: 278 VAVDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQE 337
Query: 297 APEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVC 356
PE LC+ NV ++K L + F G + EL Y+++ + C
Sbjct: 338 IPEG--FELCY-------NVTRMEKGMPKLGVEFQGG---AVMELPWNNYMVLVAENVQC 385
Query: 357 LGI 359
+ +
Sbjct: 386 VAL 388
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 96/357 (26%), Positives = 142/357 (39%), Gaps = 57/357 (15%)
Query: 39 FQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------V 87
F V G+ P G Y + +G P + YF+ +DTGSD+ W+ C +PC C +
Sbjct: 77 FPVEGSANPFMVGLYFTRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGCPSSSGLNIQL 135
Query: 88 EAPHPLYRPSNDLVPCEDPICASLHAPGHHNCE--DPAQCDYELEYADGGSSLGVLVKDA 145
E +P ++ +PC D C + C+ D + C Y Y DG + G V D
Sbjct: 136 EFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDT 195
Query: 146 FAFNYTNGQRLNPR----LALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKL 199
F+ G + GC +Q + +DGI G G+ + S+VSQL+S +
Sbjct: 196 MYFDSVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGV 255
Query: 200 IRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKY------------YSPGV 245
V HCL G GGG L G+ + +V+T + Y P
Sbjct: 256 SPKVFSHCLKGSDNGGGILVLGEIV--EPGLVYTPLVPSQPHYNLNLESIVVNGQKLPID 313
Query: 246 AELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
+ LF T G + DSG++ YL Y + + +S L
Sbjct: 314 SSLFTTSNTQG-----TIVDSGTTLAYLADGAYDPFVNAITAAVSP---------SVRSL 359
Query: 306 CWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI----ISNKGNVCLG 358
KG + F V F T++L F G T + PE YL+ I N C+G
Sbjct: 360 VSKGNQCFVTSSSVDSSFPTVSLYFMGGVAMT---VKPENYLLQQASIDNNVLWCIG 413
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 95/347 (27%), Positives = 145/347 (41%), Gaps = 37/347 (10%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G Y + YIG P +DTGS L WLQC +PC C PL+ P + C+
Sbjct: 87 GEYLMRFYIGSPPVERLAMVDTGSSLIWLQC-SPCHNCFPQETPLFEPLKSSTYKYATCD 145
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN--PRLAL 162
C L P +C QC Y + Y D S+G+L + +F T G + P
Sbjct: 146 SQPCTLLQ-PSQRDCGKLGQCIYGIMYGDKSFSVGILGTETLSFGSTGGAQTVSFPNTIF 204
Query: 163 GCGY-NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLF 217
GCG N + + + GI GLG G S+VSQL +Q I + +CL S F
Sbjct: 205 GCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQ--IGHKFSYCLLPYDSTSTSKLKF 262
Query: 218 FGDDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGE--TTGLKNLPVVFDSGSSYTYL 273
+ + ++ VV T + YY + + G + +TG + +V DSG+ TYL
Sbjct: 263 GSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVVSTGQTDGNIVIDSGTPLTYL 322
Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
Y + +++ L K L++ P L C+ R +A FT
Sbjct: 323 ENTFYNNFVASLQETLGVKLLQDLPSP--LKTCFPNR--------ANLAIPDIAFQFTGA 372
Query: 334 KTRTLFELTPEAYLIISNKGNV-CLGILNGAEVGLQDLNVIGGIGDF 379
L P+ LI N+ CL ++ + +G +++ G I +
Sbjct: 373 SV----ALRPKNVLIPLTDSNILCLAVVPSSGIG---ISLFGSIAQY 412
>gi|213998834|gb|ACJ60784.1| nucellin [Hordeum bulbosum]
Length = 154
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 60/146 (41%), Positives = 80/146 (54%), Gaps = 5/146 (3%)
Query: 155 RLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSGG 211
R ++A GCGY Q A P+DGILGLG GK+ +QL K+I+ NV+GHCLS
Sbjct: 4 RDKKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLRGHKMIKENVIGHCLSSK 63
Query: 212 GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSY 270
G G L+ GD + V W M YYSPG+AE+F + G VFDSGS+Y
Sbjct: 64 GKGVLYVGDFNPPTRGVTWVPMRESLF-YYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTY 122
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKE 296
T++ Y + S ++ LS S +E
Sbjct: 123 THVPAQIYSEIVSKVRGTLSESSFEE 148
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 94/355 (26%), Positives = 150/355 (42%), Gaps = 42/355 (11%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLY----RPSND 99
G Y + +G P + Y++ +DTGSD+ W+ C APC +C + P LY ++
Sbjct: 72 GLYFTKIKLGSPPKEYYVQVDTGSDILWVNC-APCPKCPVKTDLGIPLSLYDSKTSSTSK 130
Query: 100 LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ-RLNP 158
V CED C+ + C C Y + Y DG +S G +KD G R P
Sbjct: 131 NVGCEDDFCSFIMQS--ETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAP 188
Query: 159 ---RLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
+ GCG NQ G + +DGI+G G+ +SI+SQL + + + HCL G
Sbjct: 189 LAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNG 248
Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK--------NLPVVFD 265
G +F ++ S VV T+ +Y+ + + G+ L + + D
Sbjct: 249 GGIFAVGEV--ESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIID 306
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
SG++ YL + Y +L ++K + + +K ET F + K F
Sbjct: 307 SGTTLAYLPQNLYNSL---IEKITAKQQVKLHMVQETFAC-------FSFTSNTDKAFPV 356
Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDFV 380
+ L F D +++ P YL + C G +G +VI +GD V
Sbjct: 357 VNLHFEDSLKLSVY---PHDYLFSLREDMYCFGWQSGGMTTQDGADVI-LLGDLV 407
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 92/354 (25%), Positives = 146/354 (41%), Gaps = 48/354 (13%)
Query: 29 LFNHVGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC 86
L +G + F V G P G Y + +G P R +++ +DTGSD+ W+ C A C C
Sbjct: 57 LLQSLGGVIDFPVDGTFDPFVVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSC-ASCNGC 115
Query: 87 -----VEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGS 136
++ + P + + + C D C+ C C Y +Y DG
Sbjct: 116 PQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSG 175
Query: 137 SLGVLVKDAFAFNYTNGQRLNPR----LALGCGYNQVPG--ASYHPLDGILGLGKGKSSI 190
+ G V D F+ G L P + GC +Q S +DGI G G+ S+
Sbjct: 176 TSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSV 235
Query: 191 VSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAEL 248
+SQL SQ + V HCL G GGGG L G+ + +V+T + +Y+ + +
Sbjct: 236 ISQLASQGIAPRVFSHCLKGENGGGGILVLGEIV--EPNMVFTPLVPS-QPHYNVNLLSI 292
Query: 249 FFGGETTGLKNLPVVF----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAP 298
G+ + P VF D+G++ YL+ Y +++ A
Sbjct: 293 SVNGQALPIN--PSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFV---------EAITNAV 341
Query: 299 EDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK 352
P+ KG + + V F ++L+F G ++F L P+ YLI N
Sbjct: 342 SQSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGGA--SMF-LNPQDYLIQQNN 392
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 94/355 (26%), Positives = 150/355 (42%), Gaps = 42/355 (11%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLY----RPSND 99
G Y + +G P + Y++ +DTGSD+ W+ C APC +C + P LY ++
Sbjct: 76 GLYFTKIKLGSPPKEYYVQVDTGSDILWVNC-APCPKCPVKTDLGIPLSLYDSKTSSTSK 134
Query: 100 LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ-RLNP 158
V CED C+ + C C Y + Y DG +S G +KD G R P
Sbjct: 135 NVGCEDDFCSFIMQS--ETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAP 192
Query: 159 ---RLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
+ GCG NQ G + +DGI+G G+ +SI+SQL + + + HCL G
Sbjct: 193 LAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNG 252
Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK--------NLPVVFD 265
G +F ++ S VV T+ +Y+ + + G+ L + + D
Sbjct: 253 GGIFAVGEV--ESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIID 310
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
SG++ YL + Y S+++K + + +K ET F + K F
Sbjct: 311 SGTTLAYLPQNLYN---SLIEKITAKQQVKLHMVQETFAC-------FSFTSNTDKAFPV 360
Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDFV 380
+ L F D +++ P YL + C G +G +VI +GD V
Sbjct: 361 VNLHFEDSLKLSVY---PHDYLFSLREDMYCFGWQSGGMTTQDGADVI-LLGDLV 411
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/190 (33%), Positives = 89/190 (46%), Gaps = 25/190 (13%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH--------PLYR----P 96
G Y + IG P++ Y+L +DTG+D+ W+ C ++C E P LY
Sbjct: 71 GLYYAKIGIGTPSKDYYLQVDTGTDMMWVNC----IQCKECPTRSNLGMDLTLYNIKESS 126
Query: 97 SNDLVPCEDPICASLHAPGHHNCEDPAQ--CDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
S LVPC+ +C ++ C C Y Y DG S+ G VKD F+ +G
Sbjct: 127 SGKLVPCDQELCKEINGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQVSGD 186
Query: 155 ----RLNPRLALGCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
N + GCG Q SY LDGILG GK S++SQL S ++ + HC
Sbjct: 187 LKTASANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHC 246
Query: 208 LSGGGGGFLF 217
L+G GG +F
Sbjct: 247 LNGVNGGGIF 256
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 103/373 (27%), Positives = 161/373 (43%), Gaps = 45/373 (12%)
Query: 35 SSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----V 87
S + ++ GN +P TG Y + IG P + + +DTGSD+ W+ C C C +
Sbjct: 55 SVIDLELGGNGHPAETGLYYARIGIGSPPNDFHVQVDTGSDILWVNC-VGCSNCPKKSDI 113
Query: 88 EAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVK 143
LY P ++ L+ C+ P C++ + C+ C Y++ Y DG ++ G V
Sbjct: 114 GVDLQLYNPKSSSTSTLITCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVN 173
Query: 144 DAFAFNYTNGQ----RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQ 197
D G N + GCG Q G+S LDGILG G+ SS++SQL +
Sbjct: 174 DYIQLQRAVGNHKTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAAT 233
Query: 198 KLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY--------YSPGVAELF 249
++ + HCL GG +F ++ + ++ T + + Y +L
Sbjct: 234 GKVKKIFAHCLDSISGGGIFAIGEVVE-PKLXNTPVVPNQAHYNVVLNGVKVGDTALDLP 292
Query: 250 FGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAK-SLKEAPEDETLPLCWK 308
G T K ++ DSG++ YL Y L M+K L A+ LK D+ C+
Sbjct: 293 LGLFETSYKRGAII-DSGTTLAYLPESIYLPL---MEKILGAQPDLKLRTVDDQFT-CFV 347
Query: 309 GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILN-GAEVGL 367
KNV D F T+ F + T++ P YL C+G N GA+
Sbjct: 348 FD---KNVDD---GFPTVTFKFEESLILTIY---PHEYLFQIRDDVWCVGWQNSGAQS-- 396
Query: 368 QDLNVIGGIGDFV 380
+D N + +GD V
Sbjct: 397 KDGNEVTLLGDLV 409
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 100/361 (27%), Positives = 159/361 (44%), Gaps = 48/361 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTW---LQCDA-PCVRCVEAPHPLYRP--SNDLV 101
TG Y + IG P + Y++ +DTGSD+ W + CD P + Y P S V
Sbjct: 82 TGLYYTRIEIGSPPKGYYVQVDTGSDILWVNGISCDGCPTRSGLGIELTQYDPAGSGTTV 141
Query: 102 PCEDPICASLHA-----PGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT--NGQ 154
CE C + A P + P C + + Y DG S+ G V D +N NGQ
Sbjct: 142 GCEQEFCVANSAASGVPPACPSAASP--CQFRITYGDGSSTTGFYVTDFVQYNQVSGNGQ 199
Query: 155 RL--NPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
N + GCG G+S LDGILG G+ +S++SQL + + +R + HCL
Sbjct: 200 TTPSNVSITFGCGAQLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHCLDT 259
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNLPV 262
GG +F ++ V T + + T +Y+ + + GG T L +
Sbjct: 260 VRGGGIFAIGNVVQPPIVKTTPLVPNAT-HYNVNLQGISVGGATLQLPTSTFDSGDSKGT 318
Query: 263 VFDSGSSYTYLNRVTYQT-LTSIMKK--ELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
+ DSG++ YL R Y+T LT++ K +L+ ++ ++ +C F+ +
Sbjct: 319 IIDSGTTLAYLPREVYRTLLTAVFDKHPDLAVRNYEDF-------IC------FQFSGSL 365
Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDF 379
+ F + SF T ++ P YL + C+G L+G V +D + +GD
Sbjct: 366 DEEFPVITFSFEGDLTLNVY---PHDYLFQNGNDLYCMGFLDGG-VQTKDGKDMVLLGDL 421
Query: 380 V 380
V
Sbjct: 422 V 422
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 96/365 (26%), Positives = 152/365 (41%), Gaps = 46/365 (12%)
Query: 13 PTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGS 72
P + S+SSS+ S L G TG Y VT+ +G PA Y + DTGS
Sbjct: 135 PGIHPGHSASSSTPS----------LPATSGRAVSTGNYVVTVGLGTPASKYTVVFDTGS 184
Query: 73 DLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQCDYE 128
D TW+QC V+C + PL+ P+ V C D CA L G C C Y
Sbjct: 185 DTTWVQCRPCVVKCYKQKGPLFDPAKSSTYANVSCTDSACADLDTNG---CTG-GHCLYA 240
Query: 129 LEYADGGSSLGVLVKDAF--AFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKG 186
++Y DG ++G +D A + G R GCG + G++GLG+G
Sbjct: 241 VQYGDGSYTVGFFAQDTLTIAHDAIKGFR------FGCGEKN--NGLFGKTAGLMGLGRG 292
Query: 187 KSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPG 244
K+S+ Q +++ +CL G G+L FG ++ + ++ +Y G
Sbjct: 293 KTSLTVQAYNK--YGGAFAYCLPALTTGTGYLDFGPGSAGNNARLTPMLTDKGQTFYYVG 350
Query: 245 VAELFFGGETTGL-----KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPE 299
+ + GG+ + + DSG+ T L Y L+S K + A+ K+AP
Sbjct: 351 MTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPG 410
Query: 300 DETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGI 359
L C+ F + DV+ T++L F G ++ + ++ VCL
Sbjct: 411 YSILDTCYD----FTGLSDVE--LPTVSLVFQGG---ACLDVDVSGIVYAISEAQVCLAF 461
Query: 360 LNGAE 364
+ +
Sbjct: 462 ASNGD 466
>gi|213998830|gb|ACJ60782.1| nucellin [Hordeum pusillum]
Length = 147
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/142 (43%), Positives = 78/142 (54%), Gaps = 5/142 (3%)
Query: 159 RLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSGGGGGF 215
++A GCGY Q A P+DGILGLG GK+ +QL QK+I NV+GHCLS G G
Sbjct: 1 KIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGV 60
Query: 216 LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYTYLN 274
L+ GD S V W M YYSPG+AEL + G VFDSGS+YT++
Sbjct: 61 LYVGDFNPPSRGVTWVPMKESLF-YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVP 119
Query: 275 RVTYQTLTSIMKKELSAKSLKE 296
Y + S + LS SL+E
Sbjct: 120 AQIYNEIVSKVIGTLSESSLEE 141
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 97/351 (27%), Positives = 146/351 (41%), Gaps = 57/351 (16%)
Query: 31 NHVGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-- 86
+ VG + F V G+ P G Y + +G P + + +DTGSD+ W+ C + C C
Sbjct: 78 SSVGGVVDFPVQGSSDPYLVGLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSS-CSNCPH 136
Query: 87 ----------VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS 136
+AP S V C DPIC+S+ C + QC Y Y DG
Sbjct: 137 SSGLGIDLHFFDAPGSFTAGS---VTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSG 193
Query: 137 SLGVLVKDAFAFNYTNGQRL----NPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSI 190
+ G + D F F+ G+ L + + GC Q S +DGI G GKGK S+
Sbjct: 194 TSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSV 253
Query: 191 VSQLHSQKLIRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY----YSPG 244
VSQL S+ + V HCL G GGG G+ L +V++ + Y S G
Sbjct: 254 VSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPG--MVYSPLLPSQPHYNLNLLSIG 311
Query: 245 V--------AELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
V A +F T G + D+G++ TYL + Y + + +S
Sbjct: 312 VNGQILPIDAAVFEASNTRG-----TIVDTGTTLTYLVKEAYDPFLNAISNSVS------ 360
Query: 297 APEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYL 347
+ TL + G + + + F ++L+F G + L P+ YL
Sbjct: 361 --QLVTL-IISNGEQCYLVSTSISDMFPPVSLNFAGGASMM---LRPQDYL 405
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 99/331 (29%), Positives = 142/331 (42%), Gaps = 42/331 (12%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC-VRCVEAPHPLYRPSND----LVPC 103
G Y + +G P Y + +D+GS LTWLQC APC V C PLY P VPC
Sbjct: 106 GNYITRLGLGTPTTTYVMVVDSGSSLTWLQC-APCAVSCHPQAGPLYDPRASSTYAAVPC 164
Query: 104 EDPICASLHAP--GHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
P CA L A +C C Y+ Y DG S G L KD + + + P
Sbjct: 165 SAPQCAELQAATLNPSSCSGSGVCQYQASYGDGSFSFGYLSKDTVSLSSSGS---FPGFY 221
Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFF 218
GCG + V + G++GL + K S++SQL + N +CL + G+L F
Sbjct: 222 YGCGQDNV--GLFGRAAGLIGLARNKLSLLSQLAPS--VGNSFAYCLPTSAAASAGYLSF 277
Query: 219 G--DDLYDSSRVVWTSMSS---DYTKYYSPGVAELFFGGETTGLK-----NLPVVFDSGS 268
G D + + +TSM S D + Y+ +A + G + +LP + DSG
Sbjct: 278 GSNSDNKNPGKYSYTSMVSSSLDASLYFV-SLAGMSVAGSPLAVPSSEYGSLPTIIDSG- 335
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
T + R+ T++ K +A + AP L C+KG+ V V +
Sbjct: 336 --TVITRLPTPVYTALSKAVGAALAAPSAPAYSILQTCFKGQVAKLPVPAVN-------M 386
Query: 329 SFTDGKTRTLFELTPEAYLIISNKGNVCLGI 359
+F G T LTP L+ N+ CL
Sbjct: 387 AFAGGAT---LRLTPGNVLVDVNETTTCLAF 414
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 61/193 (31%), Positives = 95/193 (49%), Gaps = 19/193 (9%)
Query: 43 GNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH-----PLYR 95
GN P TG Y + +G PA+ +++ +DTGSD+ W+ C A C C + LY
Sbjct: 62 GNGLPSSTGLYYTKVGLGSPAKEFYVQVDTGSDILWVNC-AGCTACPKKSGLGMDLTLYD 120
Query: 96 P----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
P +++ VPC D C ++ C+ C Y + Y DG ++ G V D+ F+
Sbjct: 121 PNGSKTSNAVPCGDGFCTDTYSGPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTFDEV 180
Query: 152 NG----QRLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVV 204
+G + N + GCG Q + S LDGI+G G+ SS++SQL + ++ +
Sbjct: 181 SGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIF 240
Query: 205 GHCLSGGGGGFLF 217
HCL GG +F
Sbjct: 241 SHCLDSHHGGGIF 253
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/354 (25%), Positives = 146/354 (41%), Gaps = 48/354 (13%)
Query: 29 LFNHVGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC 86
L +G + F V G P G Y + +G P R +++ +DTGSD+ W+ C A C C
Sbjct: 57 LLQSLGGVIDFPVDGTFDPFVVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSC-ASCNGC 115
Query: 87 -----VEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGS 136
++ + P + + + C D C+ C C Y +Y DG
Sbjct: 116 PQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSG 175
Query: 137 SLGVLVKDAFAFNYTNGQRLNPR----LALGCGYNQVPG--ASYHPLDGILGLGKGKSSI 190
+ G V D F+ G L P + GC +Q S +DGI G G+ S+
Sbjct: 176 TSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSV 235
Query: 191 VSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAEL 248
+SQL SQ + V HCL G GGGG L G+ + +V+T + +Y+ + +
Sbjct: 236 ISQLASQGIAPRVFSHCLKGENGGGGILVLGEIV--EPNMVFTPLVPS-QPHYNVNLLSI 292
Query: 249 FFGGETTGLKNLPVVF----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAP 298
G+ + P VF D+G++ YL+ Y +++ A
Sbjct: 293 SVNGQALPIN--PSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFV---------EAITNAV 341
Query: 299 EDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK 352
P+ KG + + V F ++L+F G ++F L P+ YLI N
Sbjct: 342 SQSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGGA--SMF-LNPQDYLIQQNN 392
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 95/377 (25%), Positives = 155/377 (41%), Gaps = 49/377 (12%)
Query: 29 LFNHVGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC 86
L +G + F V G P G Y + +G P R +++ +DTGSD+ W+ C A C C
Sbjct: 57 LLQSLGGVIDFPVDGTFDPFVVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSC-ASCNGC 115
Query: 87 -----VEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGS 136
++ + P + + + C D C+ C C Y +Y DG
Sbjct: 116 PQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSG 175
Query: 137 SLGVLVKDAFAFNYTNGQRLNPR----LALGCGYNQVPG--ASYHPLDGILGLGKGKSSI 190
+ G V D F+ G L P + GC +Q S +DGI G G+ S+
Sbjct: 176 TSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSV 235
Query: 191 VSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAEL 248
+SQL SQ + V HCL G GGGG L G+ + +V+T + +Y+ + +
Sbjct: 236 ISQLASQGIAPRVFSHCLKGENGGGGILVLGEIV--EPNMVFTPLVPS-QPHYNVNLLSI 292
Query: 249 FFGGETTGLKNLPVVF----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAP 298
G+ + P VF D+G++ YL+ Y +++ A
Sbjct: 293 SVNGQALPIN--PSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFV---------EAITNAV 341
Query: 299 EDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK-GNVCL 357
P+ KG + + V F ++L+F G ++F L P+ YLI N G +
Sbjct: 342 SQSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGGA--SMF-LNPQDYLIQQNNVGGTAV 398
Query: 358 GILNGAEVGLQDLNVIG 374
+ + Q + ++G
Sbjct: 399 WCIGFQRIQNQGITILG 415
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 98/384 (25%), Positives = 156/384 (40%), Gaps = 51/384 (13%)
Query: 9 NLCFPTVRMSSSSSSSSSSSLFNHVG------SSLLFQVHGNVYPTGYYNVTMYIGQPAR 62
NL FP VR + ++ + G S + + GN PT IG +
Sbjct: 26 NLVFPVVRKFKGPVENLAAIKAHDAGRRGRFLSVVDVALGGNGRPTSNGLYYTKIGLGPK 85
Query: 63 PYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP----SNDLVPCEDPICAS 110
Y++ +DTGSD W+ C V C P LY P ++ VPC+D C S
Sbjct: 86 DYYVQVDTGSDTLWVNC----VGCTACPKKSGLGMDLTLYDPNLSKTSKAVPCDDEFCTS 141
Query: 111 LHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL----NPRLALGCGY 166
+ C C Y + Y DG ++ G +KD F+ G N + GCG
Sbjct: 142 TYDGQISGCTKGMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTSVIFGCGS 201
Query: 167 NQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFFGDDL 222
Q + + LDGI+G G+ SS++SQL + ++ + HCL S GGG G+ +
Sbjct: 202 KQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCLDSISGGGIFAIGEVV 261
Query: 223 YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNLPVVFDSGSSYTYLN 274
V T+ +Y+ + ++ G+ L + DSG++ YL
Sbjct: 262 QPK---VKTTPLLQGMAHYNVVLKDIEVAGDPIQLPSDILDSSSGRGTIIDSGTTLAYLP 318
Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGK 334
Y L ++K L+ +S + E C+ + + V F T+ +F +G
Sbjct: 319 VSIYDQL---LEKILAQRSGMKLYLVEDQFTCFH----YSDEESVDDLFPTVKFTFEEGL 371
Query: 335 TRTLFELTPEAYLIISNKGNVCLG 358
T T + P YL + + C+G
Sbjct: 372 TLTTY---PRDYLFLFKEDMWCVG 392
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 104/368 (28%), Positives = 150/368 (40%), Gaps = 70/368 (19%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
+G + + + IG PA Y +DTGSDL W QC PC C + P P++ P S V C
Sbjct: 104 SGEFLMELSIGNPAVKYSAIVDTGSDLIWTQC-KPCTECFDQPTPIFDPEKSSSYSKVGC 162
Query: 104 EDPICASLHAPGHHNC-EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
+C +L NC ED C+Y Y D S+ G+L + F F N +
Sbjct: 163 SSGLCNALP---RSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSIS---GIGF 216
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS----GGGGGFLFF 218
GCG G + G++GLG+G S++SQL K +CL+ LF
Sbjct: 217 GCGVEN-EGDGFSQGSGLVGLGRGPLSLISQLKETKF-----SYCLTSIEDSEASSSLFI 270
Query: 219 GDDLYDSSRVVWTSMSSDYTKYYS-------PGVAELFFGGETTGLKNLPV--------- 262
G S+ + TK S P L G T G K L V
Sbjct: 271 GSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAE 330
Query: 263 ------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET----LPLCWKGRRP 312
+ DSG++ TYL ++ ++K+E +++ P D++ L LC+K
Sbjct: 331 DGTGGMIIDSGTTITYLEETAFK----VLKEEFTSR--MSLPVDDSGSTGLDLCFKLPDA 384
Query: 313 FKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLII-SNKGNVCL--GILNG----AEV 365
KN+ K F EL E Y++ S+ G +CL G NG V
Sbjct: 385 AKNIAVPKMIFHFKGAD---------LELPGENYMVADSSTGVLCLAMGSSNGMSIFGNV 435
Query: 366 GLQDLNVI 373
Q+ NV+
Sbjct: 436 QQQNFNVL 443
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 93/359 (25%), Positives = 154/359 (42%), Gaps = 52/359 (14%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLYRPSND----LV 101
Y + IG P + Y++ +DTGSD+ W+ C C RC + LY P + V
Sbjct: 4 YYTEIGIGTPTKRYYVQVDTGSDILWVNC-ISCDRCPRKSGLGLELTLYDPKDSSTGSKV 62
Query: 102 PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG----QRLN 157
C+ CA+ + C C+Y + Y DG S+ G V D F+ +G + N
Sbjct: 63 SCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPAN 122
Query: 158 PRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-GGGG 214
+ GCG Q G+S LDGI+G G+ +S++SQL + ++ + HCL GGG
Sbjct: 123 STVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTINGGG 182
Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNLPVVFDS 266
G+ + +V T + + +Y+ + + GG L + + DS
Sbjct: 183 IFAIGNVV--QPKVKTTPLVPN-MPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGTIIDS 239
Query: 267 GSSYTYLNRVTYQTLTSIM---KKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
G++ TYL + Y+ + + K+++ +++E LC F+ V V F
Sbjct: 240 GTTLTYLPEIVYKEIMLAVFAKHKDITFHNVQEF-------LC------FQYVGRVDDDF 286
Query: 324 RTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI--GDFV 380
+ F + ++ P Y + C+G NG GLQ + G + GD V
Sbjct: 287 PKITFHFENDLPLNVY---PHDYFFENGDNLYCVGFQNG---GLQSKDGKGMVLLGDLV 339
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 93/348 (26%), Positives = 147/348 (42%), Gaps = 41/348 (11%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
G TG Y VT+ +G PA Y + DTGSD TW+QC V C E L+ P+
Sbjct: 172 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTY 231
Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
V C P C+ L+ H C C Y ++Y DG S+G D + + +
Sbjct: 232 ANVSCAAPACSDLNI---HGCSG-GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVK--- 284
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
GCG + G+LGLG+GK+S+ Q + + V HCL G G+L
Sbjct: 285 GFRFGCGERNE--GLFGEAAGLLGLGRGKTSLPVQTYDK--YGGVFAHCLPARSTGTGYL 340
Query: 217 FFGDDLYDSSRVVWTS--MSSDYTKYYSPGVAELFFGGETTGLKNLP--------VVFDS 266
FG ++R T+ ++ + +Y G+ + GG+ L ++P + DS
Sbjct: 341 DFGAGSLAAARARLTTPMLTENGPTFYYVGMTGIRVGGQ---LLSIPQSVFATAGTIVDS 397
Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
G+ T L Y +L ++A+ K+AP L C+ F + V T+
Sbjct: 398 GTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYD----FTGMSQVA--IPTV 451
Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
+L F G ++ + ++ VCL + G D+ ++G
Sbjct: 452 SLLFQGGAR---LDVDASGIMYAASASQVCLAFAANEDGG--DVGIVG 494
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 153/375 (40%), Gaps = 62/375 (16%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH------- 91
F V G P +V MY G + + +DTGSD+ W+ C+ C C ++
Sbjct: 60 FSVQGTSDPN---SVGMY-GXXXXXFNVQIDTGSDILWVNCNT-CSNCPQSSQLGIELNF 114
Query: 92 --PLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAF 148
+ + L+PC D IC S C QC Y +Y DG + G V DA F
Sbjct: 115 FDTVGSSTAALIPCSDLICTSGVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYF 174
Query: 149 NYTNGQ----RLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIRN 202
N GQ + GC +Q + +DGI G G G S+VSQL SQ +
Sbjct: 175 NLIMGQPPAVNSTATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSQGITPK 234
Query: 203 VVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNL 260
V HCL G GGG L G+ L S +V++ + +Y+ + + G+ +
Sbjct: 235 VFSHCLKGDGNGGGILVLGEILEPS--IVYSPLVPS-QPHYNLNLQSIAVNGQPLPIN-- 289
Query: 261 PVVF-----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
P VF D G++ YL + Y L + + +S + + KG
Sbjct: 290 PAVFSISNNRGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNS---------KG 340
Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAE---VG 366
+ + + F ++L+F G + L PE YL+ + G L+GAE VG
Sbjct: 341 NQCYLVSTSIGDIFPLVSLNFEGGASMV---LKPEQYLMHN-------GYLDGAEMWCVG 390
Query: 367 LQDLNVIGGI-GDFV 380
Q L I GD V
Sbjct: 391 FQKLQEGASILGDLV 405
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 75/266 (28%), Positives = 113/266 (42%), Gaps = 41/266 (15%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYR----P 96
G Y + IG P + Y++ +DTGSD+ W+ C ++C E P LY
Sbjct: 76 GLYYAKIGIGTPTKDYYVQVDTGSDIMWVNC----IQCRECPKTSSLGIDLTLYNINESD 131
Query: 97 SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ-- 154
+ LVPC+ C ++ C C Y Y DG S+ G VKD + +G
Sbjct: 132 TGKLVPCDQEFCYEINGGQLPGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLK 191
Query: 155 --RLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
N + GCG Q + ++ LDGILG GK SS++SQL ++ + HCL
Sbjct: 192 TTAANGSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCLD 251
Query: 210 GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVA------------ELFFGGETTGL 257
G GG +F + +V T + + Y A ++F G+ G
Sbjct: 252 GTNGGGIFVIGHVV-QPKVNMTPLIPNQPHYNVNMTAVQVGHEFLSLPTDVFEAGDRKG- 309
Query: 258 KNLPVVFDSGSSYTYLNRVTYQTLTS 283
+ DSG++ YL + Y+ L S
Sbjct: 310 ----AIIDSGTTLAYLPEMVYKPLVS 331
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 92/348 (26%), Positives = 145/348 (41%), Gaps = 41/348 (11%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
G TG Y VT+ +G PA Y + DTGSD TW+QC V C + L+ P+
Sbjct: 174 GRALGTGNYVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSSTY 233
Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
V C P C+ L+ G C C Y ++Y DG S+G D + + +
Sbjct: 234 ANVSCAAPACSDLYTRG---CSG-GHCLYSVQYGDGSYSIGFFAMDTLTLSSYDAVK--- 286
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
GCG + G+LGLG+GK+S+ Q + + V HCL G G+L
Sbjct: 287 GFRFGCGERNE--GLFGEAAGLLGLGRGKTSLPVQTYDK--YGGVFAHCLPARSSGTGYL 342
Query: 217 FF--GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP--------VVFDS 266
F G +R ++ + +Y G+ + GG+ L ++P + DS
Sbjct: 343 DFGPGSPAAVGARQTTPMLTDNGPTFYYVGMTGIRVGGQ---LLSIPQSVFSTAGTIVDS 399
Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
G+ T L Y +L S ++A+ K+AP L C+ F + +V +
Sbjct: 400 GTVITRLPPAAYSSLRSAFASAMAARGYKKAPALSLLDTCYD----FTGMSEVA--IPKV 453
Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
+L F G ++ + ++ VCLG A D+ ++G
Sbjct: 454 SLLFQGG---AYLDVNASGIMYAASLSQVCLGF--AANEDDDDVGIVG 496
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 96/339 (28%), Positives = 142/339 (41%), Gaps = 41/339 (12%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SNDLVP--CE 104
G Y + YIG P DTGSDL W+QC +PC C PL++P S+ +P C
Sbjct: 88 GEYLMRFYIGTPPVERLATADTGSDLIWVQC-SPCASCFPQSTPLFQPLKSSTFMPTTCR 146
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGS-SLGVLVKDAFAFNYTNGQRLN--PRLA 161
C +L P C +C Y +Y D S S G+L + F+ G + P
Sbjct: 147 SQPC-TLLLPEQKGCGKSGECIYTYKYGDQYSFSEGLLSTETLRFDSQGGVQTVAFPNSF 205
Query: 162 LGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFL 216
GCG YN + + L GI+GLG G S+VSQ+ Q I + +CL S
Sbjct: 206 FGCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQ--IGHKFSYCLLPLGSTSTSKLK 263
Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-------VVFDSGSS 269
F + + VV T M K + P L T K +P V+ DSG+
Sbjct: 264 FGNESIITGEGVVSTPM---IIKPWLPTYYFLNLEAVTVAQKTVPTGSTDGNVIIDSGTL 320
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
TYL Y + +++ L+ + +++ LP C+ R F F +A
Sbjct: 321 LTYLGESFYYNFAASLQESLAVELVQDV--LSPLPFCFPYRDNF--------VFPEIAFQ 370
Query: 330 FTDGKTRTLFELTP-EAYLIISNKGNVCLGILNGAEVGL 367
FT + L P +++ ++ VCL I + G+
Sbjct: 371 FTGARV----SLKPANLFVMTEDRNTVCLMIAPSSVSGI 405
>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
Length = 321
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 76/261 (29%), Positives = 120/261 (45%), Gaps = 34/261 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH-----PLYRPSND--- 99
T Y + IG P + Y++ +DTGSD+ W+ C + C RC LY P +
Sbjct: 30 TRLYYTEIGIGTPTKRYYVQVDTGSDILWVNCIS-CDRCPRKSGLGLELTLYDPKDSSTG 88
Query: 100 -LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG----Q 154
V C+ CA+ + C C+Y + Y DG S+ G V D F+ +G +
Sbjct: 89 SKVSCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTR 148
Query: 155 RLNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-G 211
N + GCG Q G+S LDGI+G G+ +S++SQL + ++ + HCL
Sbjct: 149 PANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTIN 208
Query: 212 GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP---------- 261
GGG G+ + +V T + + +Y+ + + GG T LK LP
Sbjct: 209 GGGIFAIGNVV--QPKVKTTPLVPN-MPHYNVNLKSIDVGG--TALK-LPSHMFDTGEKK 262
Query: 262 -VVFDSGSSYTYLNRVTYQTL 281
+ DSG++ TYL + Y+ +
Sbjct: 263 GTIIDSGTTLTYLPEIVYKEI 283
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 90/341 (26%), Positives = 140/341 (41%), Gaps = 49/341 (14%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLYRP----SNDLV 101
Y + IG P +P+ + +DTGSD+ W+ C C +C + LY P S V
Sbjct: 87 YYTKIEIGTPPKPFHVQVDTGSDILWVNC-VSCDKCPTKSGLGIDLALYDPKGSSSGSAV 145
Query: 102 PCEDPICASLHAPGHH--NCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG----QR 155
C++ CA+ + G C C+Y EY DG S+ G V D+ +N +G +
Sbjct: 146 SCDNKFCAATYGSGEKLPGCTAGKPCEYRAEYGDGSSTAGSFVSDSLQYNQLSGNAQTRH 205
Query: 156 LNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
+ GCG Q ++ LDGI+G G+ +S +SQL S ++ + HCL G
Sbjct: 206 AKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCLDTIKG 265
Query: 214 GFLFFGDDLYD---SSRVVWTSMSSDYTKYYSPGVA--------ELFFGGETTGLKNLPV 262
G +F ++ S + +MS S VA +F E G
Sbjct: 266 GGIFAIGEVVQPKVKSTPLLPNMSHYNVNLQSIDVAGNALQLPPHIFETSEKRG-----T 320
Query: 263 VFDSGSSYTYLNRVTYQ-TLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
+ DSG++ TYL + Y+ L ++ +K +D T +G F+ V
Sbjct: 321 IIDSGTTLTYLPELVYKDILAAVFQKH----------QDITFRTI-QGFLCFEYSESVDD 369
Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNG 362
F + F D ++ P Y + CLG NG
Sbjct: 370 GFPKITFHFEDDLGLNVY---PHDYFFQNGDNLYCLGFQNG 407
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 90/360 (25%), Positives = 149/360 (41%), Gaps = 47/360 (13%)
Query: 45 VYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS------- 97
V G Y + +G P + Y + +DTGSD+ W+ C PC +C + +R S
Sbjct: 68 VDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINC-KPCPKCPTKTNLNFRLSLFDMNAS 126
Query: 98 --NDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
+ V C+D C+ + +C+ C Y + YAD +S G ++D G
Sbjct: 127 STSKKVGCDDDFCSFISQS--DSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDL 184
Query: 156 ----LNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
L + GCG +Q G +DG++G G+ +S++SQL + + V HCL
Sbjct: 185 KTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLD 244
Query: 210 GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTG---------LKNL 260
GG + F + DS +V T M + Y + G + G ++N
Sbjct: 245 NVKGGGI-FAVGVVDSPKVKTTPMVPNQMHYNV-----MLMGMDVDGTSLDLPRSIVRNG 298
Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
+ DSG++ Y +V Y +L + L+ + +K +ET + F +V
Sbjct: 299 GTIVDSGTTLAYFPKVLYDSLIETI---LARQPVKLHIVEETF-------QCFSFSTNVD 348
Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDFV 380
+ F ++ F D T++ P YL + C G G + VI +GD V
Sbjct: 349 EAFPPVSFEFEDSVKLTVY---PHDYLFTLEEELYCFGWQAGGLTTDERSEVI-LLGDLV 404
>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 94/365 (25%), Positives = 149/365 (40%), Gaps = 40/365 (10%)
Query: 36 SLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYR 95
S +H ++ GYY ++IG P + L +DTGS +T++ PC C H
Sbjct: 25 SARMTLHDDLLTKGYYTSRVFIGTPPNEFALIVDTGSTVTYV----PCSSCTHCGHHQAS 80
Query: 96 PSNDLVPCEDPICASLHAPGHHN------------CE-DPAQCDYELEYADGGSSLGVLV 142
S + C DP ++ + C+ + QC YE YA+ +S GVL
Sbjct: 81 FSTHRLFCRDPRFKPENSSSYQKIGCRSSDCITGLCDSNSHQCKYERMYAEMSTSKGVLG 140
Query: 143 KDAFAFNYTNGQRLNPR-LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
KD F RL + L+ GC + DGI+GLG+G SIV QL I
Sbjct: 141 KDLLDFG--PASRLQSQLLSFGCETAESGDLYLQVADGIMGLGRGPLSIVDQLVGNGAIE 198
Query: 202 NVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN 259
+ C G GGG + G + S +V+ + YY+ + E+ G + L +
Sbjct: 199 DSFSLCYGGMDEGGGSMVLG-AIPAPSGMVFAKSDPRRSNYYNLELTEIQVQGASLKLDS 257
Query: 260 ------LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPF 313
+ DSG++Y YL ++ T + +L + + P+ +C+ G
Sbjct: 258 NVFNGKFGTILDSGTTYAYLPDRAFEAFTDAVVAQLGSLQAVDGPDPNYPDICYAGAG-- 315
Query: 314 KNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLN 371
+ ++ K F + F + + L PE YL K G CLG +
Sbjct: 316 TDTKELGKHFPLVDFVFAENQK---VSLAPENYLFKHTKVPGAYCLGFFKNQDA----TT 368
Query: 372 VIGGI 376
++GGI
Sbjct: 369 LLGGI 373
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 101/358 (28%), Positives = 149/358 (41%), Gaps = 61/358 (17%)
Query: 34 GSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPL 93
G L VH G + + + IG PA Y +DTGSDL W QC PCV C + P+
Sbjct: 86 GGDLQVPVHAG---NGEFLMDVAIGTPALSYAAIVDTGSDLVWTQCK-PCVDCFKQSTPV 141
Query: 94 YRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN 149
+ PS+ VPC +C+ L C ++C Y Y D S+ GVL + F
Sbjct: 142 FDPSSSSTYATVPCSSALCSDLPT---STCTSASKCGYTYTYGDASSTQGVLASETFTLG 198
Query: 150 YTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
++ P +A GCG + G + G++GLG+G S+VSQL K +CL+
Sbjct: 199 --KEKKKLPGVAFGCG-DTNEGDGFTQGAGLVGLGRGPLSLVSQLGLDKF-----SYCLT 250
Query: 210 ----GGGGGFLFFGDDLYDSSR-----------------------VVWTSMSSDYTKYYS 242
G G L G S V T ++ T+
Sbjct: 251 SLDDGDGKSPLLLGGSAAAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRITL 310
Query: 243 PGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET 302
P A T G V+ DSG+S TYL Y+ L +++ ++ + +
Sbjct: 311 PASAFAIQDDGTGG-----VIVDSGTSITYLELQGYRALKKAFVAQMALPTVDGS--EIG 363
Query: 303 LPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLII-SNKGNVCLGI 359
L LC++G P K V +V+ L L F G +L E Y+++ S G +CL +
Sbjct: 364 LDLCFQG--PAKGVDEVQ--VPKLVLHFDGGAD---LDLPAENYMVLDSASGALCLTV 414
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 95/368 (25%), Positives = 142/368 (38%), Gaps = 55/368 (14%)
Query: 16 RMSSSSSSSSSSSLFNHVGSSLLFQVHG--NVYPTGYYNVTMYIGQPARPYFLDLDTGSD 73
R ++ S L V + F V G N Y G Y + +G PA+ +F+ +DTGSD
Sbjct: 52 RRDAARHRVSRRRLLGGVAGVVDFPVEGSANPYMVGLYFTRVKLGNPAKEFFVQIDTGSD 111
Query: 74 LTWLQCDAPCVRC---------VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCE---- 120
+ W+ C +PC C +E+ +P + + C D C + G C+
Sbjct: 112 ILWVTC-SPCTGCPTSSGLNIQLESFNPDSSSTASRITCSDDRCTAGFQTGEAICQTSNS 170
Query: 121 DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL----NPRLALGCGYNQVPGASY-- 174
+ C Y Y DG + G V D F G + + GC +Q +
Sbjct: 171 QSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKAD 230
Query: 175 HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTS 232
+DGI G G+ + S++SQL+S + V HCL G GGG L G+ + +V+T
Sbjct: 231 RAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEPG--LVYTP 288
Query: 233 MSSDYTKY------------YSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQT 280
+ Y P + LF T G + DSG++ YL Y
Sbjct: 289 LVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQG-----TIVDSGTTLAYLADGAYDP 343
Query: 281 LTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFE 340
S + +S L KG + F V F T+ L F G
Sbjct: 344 FVSAIAAAVSP---------SVRSLVSKGSQCFITSSSVDSSFPTVTLYFMGG---VAMS 391
Query: 341 LTPEAYLI 348
+ PE YL+
Sbjct: 392 VKPENYLL 399
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 84/296 (28%), Positives = 129/296 (43%), Gaps = 44/296 (14%)
Query: 41 VHGNVYPT-GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND 99
+ + P+ G Y + +YIG P P +DTGSDLTW QC PC C + PL+ P N
Sbjct: 81 IQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQC-RPCTHCYKQVVPLFDPKNS 139
Query: 100 LV----PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
C C +L +C +C + YADG + G L + + T G+
Sbjct: 140 STYRDSSCGTSFCLALGK--DRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKP 197
Query: 156 LN-PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
++ P A GCG++ G GI+GLG G+ S++SQL S I + +CL
Sbjct: 198 VSFPGFAFGCGHSS-GGIFDKSSSGIVGLGGGELSLISQLKST--INGLFSYCL------ 248
Query: 215 FLFFGDDLYDSSRVVW--TSMSSDYTKYYSPGVAE-------LFFGGETTGLKNLP---- 261
L D SSR+ + + S Y +P V + L G + G K LP
Sbjct: 249 -LPVSTDSSISSRINFGASGRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGKKRLPYKGY 307
Query: 262 ----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
++ DSG++YT+L + Y L + + K +++ + LC+
Sbjct: 308 SKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDP--NGIFSLCY 361
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 95/368 (25%), Positives = 142/368 (38%), Gaps = 55/368 (14%)
Query: 16 RMSSSSSSSSSSSLFNHVGSSLLFQVHG--NVYPTGYYNVTMYIGQPARPYFLDLDTGSD 73
R ++ S L V + F V G N Y G Y + +G PA+ +F+ +DTGSD
Sbjct: 54 RRDAARHRVSRRRLLGGVAGVVDFPVEGSANPYMVGLYFTRVKLGNPAKEFFVQIDTGSD 113
Query: 74 LTWLQCDAPCVRC---------VEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCE---- 120
+ W+ C +PC C +E+ +P + + C D C + G C+
Sbjct: 114 ILWVTC-SPCTGCPTSSGLNIQLESFNPDSSSTASRITCSDDRCTAGFQTGEAICQTSNS 172
Query: 121 DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL----NPRLALGCGYNQVPGASY-- 174
+ C Y Y DG + G V D F G + + GC +Q +
Sbjct: 173 QSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKAD 232
Query: 175 HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTS 232
+DGI G G+ + S++SQL+S + V HCL G GGG L G+ + +V+T
Sbjct: 233 RAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEPG--LVYTP 290
Query: 233 MSSDYTKY------------YSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQT 280
+ Y P + LF T G + DSG++ YL Y
Sbjct: 291 LVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQG-----TIVDSGTTLAYLADGAYDP 345
Query: 281 LTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFE 340
S + +S L KG + F V F T+ L F G
Sbjct: 346 FVSAIAAAVSP---------SVRSLVSKGSQCFITSSSVDSSFPTVTLYFMGG---VAMS 393
Query: 341 LTPEAYLI 348
+ PE YL+
Sbjct: 394 VKPENYLL 401
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 103/368 (27%), Positives = 150/368 (40%), Gaps = 70/368 (19%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
+G + + + IG PA Y +DTGSDL W QC PC C + P P++ P S V C
Sbjct: 105 SGEFLMELSIGNPAVKYAAIVDTGSDLIWTQC-KPCTECFDQPTPIFDPEKSSSYSKVGC 163
Query: 104 EDPICASLHAPGHHNC-EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
+C +L NC ED C+Y Y D S+ G+L + F F N +
Sbjct: 164 SSGLCNALP---RSNCNEDKDSCEYLYTYGDYSSTRGLLATETFTFEDENSIS---GIGF 217
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS----GGGGGFLFF 218
GCG G + G++GLG+G S++SQL K +CL+ LF
Sbjct: 218 GCGVEN-EGDGFSQGSGLVGLGRGPLSLISQLKETKF-----SYCLTSIEDSEASSSLFI 271
Query: 219 GDDLYDSSRVVWTSMSSDYTKYYS-------PGVAELFFGGETTGLKNLPV--------- 262
G ++ + TK S P L G T G K L V
Sbjct: 272 GSLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSE 331
Query: 263 ------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET----LPLCWKGRRP 312
+ DSG++ TYL ++ ++K+E +++ P D++ L LC+K
Sbjct: 332 DGTGGMIIDSGTTITYLEETAFK----VLKEEFTSR--MSLPVDDSGSTGLDLCFKLPNA 385
Query: 313 FKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLII-SNKGNVCL--GILNG----AEV 365
KN+ K F EL E Y++ S+ G +CL G NG V
Sbjct: 386 AKNIAVPKLIFHFKGAD---------LELPGENYMVADSSTGVLCLAMGSSNGMSIFGNV 436
Query: 366 GLQDLNVI 373
Q+ NV+
Sbjct: 437 QQQNFNVL 444
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 90/361 (24%), Positives = 149/361 (41%), Gaps = 47/361 (13%)
Query: 44 NVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS------ 97
V G Y + +G P + Y + +DTGSD+ W+ C PC +C + +R S
Sbjct: 67 RVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINC-KPCPKCPTKTNLNFRLSLFDMNA 125
Query: 98 ---NDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
+ V C+D C+ + +C+ C Y + YAD +S G ++D G
Sbjct: 126 SSTSKKVGCDDDFCSFISQSD--SCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGD 183
Query: 155 R----LNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
L + GCG +Q G +DG++G G+ +S++SQL + + V HCL
Sbjct: 184 LKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL 243
Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTG---------LKN 259
GG + F + DS +V T M + Y + G + G ++N
Sbjct: 244 DNVKGGGI-FAVGVVDSPKVKTTPMVPNQMHY-----NVMLMGMDVDGTSLDLPRSIVRN 297
Query: 260 LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
+ DSG++ Y +V Y +L + L+ + +K +ET + F +V
Sbjct: 298 GGTIVDSGTTLAYFPKVLYDSLIETI---LARQPVKLHIVEETF-------QCFSFSTNV 347
Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDF 379
+ F ++ F D T++ P YL + C G G + VI +GD
Sbjct: 348 DEAFPPVSFEFEDSVKLTVY---PHDYLFTLEEELYCFGWQAGGLTTDERSEVI-LLGDL 403
Query: 380 V 380
V
Sbjct: 404 V 404
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 94/345 (27%), Positives = 145/345 (42%), Gaps = 45/345 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
TG Y VT+ +G PA Y + DTGSD TW+QC+ V C E L+ P+ + C
Sbjct: 183 TGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDANISC 242
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
P C+ L+ G C Y ++Y DG S+G D + + + G
Sbjct: 243 AAPACSDLYTKGCSG----GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAIK---GFRFG 295
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDD 221
CG + G+LGLG+GK+S+ Q + + V HC G G+L FG
Sbjct: 296 CGERNE--GLFGEAAGLLGLGRGKTSLPVQAYDK--YGGVFAHCFPARSSGTGYLDFGP- 350
Query: 222 LYDSSRVVWTSMSS-----DYTKYYSPGVAELFFGGETTGLKNLPVVF-------DSGSS 269
SS V T +++ + +Y G+ + GG+ L P VF DSG+
Sbjct: 351 --GSSPAVSTKLTTPMLVDNGLTFYYVGLTGIRVGGKL--LSIPPSVFTTAGTIVDSGTV 406
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
T L Y +L S ++A+ K+AP L C+ F + V T++L
Sbjct: 407 ITRLPPAAYSSLRSAFASAIAARGYKKAPALSLLDTCYD----FTGMSQVA--IPTVSLL 460
Query: 330 FTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
F G + ++ + ++ CLG E D+ ++G
Sbjct: 461 FQGGAS---LDVDASGIIYAASVSQACLGFAANEED--DDVGIVG 500
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 90/360 (25%), Positives = 147/360 (40%), Gaps = 47/360 (13%)
Query: 45 VYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS------- 97
V G Y + +G P + Y + +DTGSD+ W+ C PC C + + S
Sbjct: 68 VDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWVNC-KPCPECPSKTNLNFHLSLFDVNAS 126
Query: 98 --NDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
+ V C+D C+ + +C+ C Y + YAD +S G ++D G
Sbjct: 127 STSKKVGCDDDFCSFISQS--DSCQPAVGCSYHIVYADESTSEGNFIRDKLTLEQVTGDL 184
Query: 156 ----LNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
L + GCG +Q G S +DG++G G+ +S++SQL + + V HCL
Sbjct: 185 QTGPLGQEVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLD 244
Query: 210 GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTG---------LKNL 260
GG + F + DS +V T M + Y + G + G ++N
Sbjct: 245 NVKGGGI-FAVGVVDSPKVKTTPMVPNQMHYNV-----MLMGMDVDGTALDLPPSIMRNG 298
Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
+ DSG++ Y +V Y +L + L+ + +K ++T + F +V
Sbjct: 299 GTIVDSGTTLAYFPKVLYDSLIETI---LARQPVKLHIVEDTF-------QCFSFSENVD 348
Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDFV 380
F ++ F D T++ P YL K C G G + VI +GD V
Sbjct: 349 VAFPPVSFEFEDSVKLTVY---PHDYLFTLEKELYCFGWQAGGLTTGERTEVI-LLGDLV 404
>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like [Cucumis sativus]
Length = 524
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 81/253 (32%), Positives = 119/253 (47%), Gaps = 29/253 (11%)
Query: 45 VYPTGY-YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYR 95
+ P G+ Y + +G P PY + LDTGSDL WL CD CV C+ + +Y
Sbjct: 100 ISPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPCD--CVNCITGLNTTQGPVNFNIYS 157
Query: 96 PSND----LVPCEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAF- 148
P+N V C +C+ L C P+ C Y++ Y +D SS G LV+D
Sbjct: 158 PNNSSTSKEVQCSSSLCSHL-----DQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLT 212
Query: 149 -NYTNGQRLNPRLALGCGYNQVPGA--SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVG 205
N + +N R+ LGCG +Q GA S +G+ GLG S+ S L + LI N
Sbjct: 213 TNDVQSKPVNARITLGCGKDQ-SGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFS 271
Query: 206 HCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFD 265
C G + FGD ++ + Y+ + ++ GG + L ++ V+FD
Sbjct: 272 LCFGPARMGRIEFGDKGSPGQNETPFNLGRRHPT-YNVSITQIGVGGHISDL-DVAVIFD 329
Query: 266 SGSSYTYLNRVTY 278
SG+S+TYLN Y
Sbjct: 330 SGTSFTYLNDPAY 342
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 109/408 (26%), Positives = 164/408 (40%), Gaps = 64/408 (15%)
Query: 12 FPTVRMSSSSSSSSSSSLFNHVGSSLLFQVH----GNVYPT--GYYNVTMYIGQPARPYF 65
FP + + ++L H G LL V GN PT G Y + IG P++ Y+
Sbjct: 44 FPRHQGNGPGGEEHLAALRKHDGRRLLTAVDLPLGGNGIPTDTGLYFTQIGIGTPSKGYY 103
Query: 66 LDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP----SNDLVPCEDPICASLHA 113
+ +DTGSD+ W+ C + C P LY P S+ V C CA+
Sbjct: 104 VQVDTGSDILWVNC----ISCDSCPRKSGLGIDLTLYDPTASASSKTVTCGQEFCATATN 159
Query: 114 PG-HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG----QRLNPRLALGCG--Y 166
G +C + C Y + Y DG S+ G V D ++ +G N + GCG
Sbjct: 160 GGVPPSCAANSPCQYSITYGDGSSTTGFFVADFLQYDQVSGDGQTNLANASVTFGCGAKI 219
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSS 226
G+S LDGILG G+ SS++SQL S + + HCL GG +F ++
Sbjct: 220 GGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHCLDTVNGGGIFAIGNVVQPK 279
Query: 227 RVVWTSMSSDYTKYYSPGVAELFFGGET---------TGLKNLPVVFDSGSSYTYLNRVT 277
V T+ +Y+ + + GG T G + + DSG++ YL V
Sbjct: 280 --VKTTPLVPGMPHYNVVLKTIDVGGSTLQLPTNIFDIGGGSRGTIIDSGTTLAYLPEVV 337
Query: 278 YQTLTSIMKKELSAKSLKEA----------------PE-----DETLPL-CWKGRRPFKN 315
Y+ + S + +LK PE D LPL + F+N
Sbjct: 338 YKAVLSAVFSNHPDVTLKNVQDFLCFQYSGSVDNGFPEVTFHFDGDLPLVVYPHDYLFQN 397
Query: 316 VHDVKKC-FRTLALSFTDGKTRTLF-ELTPEAYLIISNKGNVCLGILN 361
DV F++ + DGK L +L L++ + N +G N
Sbjct: 398 TEDVYCVGFQSGGVQSKDGKDMVLLGDLALSNKLVVYDLENQVIGWTN 445
>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 547
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 81/253 (32%), Positives = 119/253 (47%), Gaps = 29/253 (11%)
Query: 45 VYPTGY-YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYR 95
+ P G+ Y + +G P PY + LDTGSDL WL CD CV C+ + +Y
Sbjct: 123 ISPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPCD--CVNCITGLNTTQGPVNFNIYS 180
Query: 96 PSND----LVPCEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAF- 148
P+N V C +C+ L C P+ C Y++ Y +D SS G LV+D
Sbjct: 181 PNNSSTSKEVQCSSSLCSHL-----DQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLT 235
Query: 149 -NYTNGQRLNPRLALGCGYNQVPGA--SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVG 205
N + +N R+ LGCG +Q GA S +G+ GLG S+ S L + LI N
Sbjct: 236 TNDVQSKPVNARITLGCGKDQ-SGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFS 294
Query: 206 HCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFD 265
C G + FGD ++ + Y+ + ++ GG + L ++ V+FD
Sbjct: 295 LCFGPARMGRIEFGDKGSPGQNETPFNLGRRHPT-YNVSITQIGVGGHISDL-DVAVIFD 352
Query: 266 SGSSYTYLNRVTY 278
SG+S+TYLN Y
Sbjct: 353 SGTSFTYLNDPAY 365
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 109/391 (27%), Positives = 163/391 (41%), Gaps = 70/391 (17%)
Query: 17 MSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTW 76
MS + +++ S+ L VH G + + M IG PA Y +DTGSDL W
Sbjct: 87 MSRLVARTATGSVKAAAAPDLQVPVHAG---NGEFLMDMSIGTPALAYAAIVDTGSDLVW 143
Query: 77 LQCDAPCVRCVEAPHPLYRPSN----DLVPCEDPICASLHAPGHHNCEDPAQ-CDYELEY 131
QC PCV C P++ PS+ +PC +C+ L C A+ C Y Y
Sbjct: 144 TQCK-PCVECFNQSTPVFDPSSSSTYSTLPCSSSLCSDLPT---STCTSAAKDCGYTYTY 199
Query: 132 ADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIV 191
D S+ GVL + F T P +A GCG + G + G++GLG+G S+V
Sbjct: 200 GDASSTQGVLAAETFTLAKTK----LPGVAFGCG-DTNEGDGFTQGAGLVGLGRGPLSLV 254
Query: 192 SQLHSQKLIRNVVGHCLSGGG---------GGFLFFGDDLYDSSRVVWTSMSSDYTKYYS 242
SQL K +CL+ G D ++ + T + + ++
Sbjct: 255 SQLGLGKF-----SYCLTSLDDTSKSPLLLGSLAAISTDTASAAAIQTTPLIKNPSQ--- 306
Query: 243 PGVAELFFGGETTGLKNLP---------------VVFDSGSSYTYLNRVTYQTLTSIMKK 287
P + T G +P V+ DSG+S TYL Y+ L KK
Sbjct: 307 PSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTGGVIVDSGTSITYLELQGYRPL----KK 362
Query: 288 ELSAKSLKEAPEDET---LPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPE 344
+A+ +K D + L LC+K P V DV+ L L F G +L E
Sbjct: 363 AFAAQ-MKLPVADGSAVGLDLCFKA--PASGVDDVE--VPKLVLHFDGGAD---LDLPAE 414
Query: 345 AYLII-SNKGNVCLGILNGAEVGLQDLNVIG 374
Y+++ S G +CL ++ G + L++IG
Sbjct: 415 NYMVLDSASGALCLTVM-----GSRGLSIIG 440
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 90/339 (26%), Positives = 133/339 (39%), Gaps = 48/339 (14%)
Query: 39 FQVHGNVYPT--GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPH 91
F + G P G Y + +G P + Y + +DTGSD+ W+ C PC C + P
Sbjct: 15 FSLGGTADPLSGGLYFTQVGLGNPVKHYIVQVDTGSDVLWVNC-RPCSGCPRKSALNIPL 73
Query: 92 PLYRP----SNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAF 146
+Y P + LV C DP+C C C+Y Y DG +S G V+DA
Sbjct: 74 TMYDPRESSTTSLVSCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAM 133
Query: 147 AFNYTNGQRL---NPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
+N + L ++ GC Q S +DGI+G G+ + S+ +QL +Q+ I
Sbjct: 134 QYNVISSNGLANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIP 193
Query: 202 NVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYS------------PGVAELF 249
V HCL G G + +T + D Y P AE F
Sbjct: 194 RVFSHCLEGEKRGGGILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDF 253
Query: 250 FGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
TG V+ DSG++ Y Y +++ SA ++ D L G
Sbjct: 254 SSTNDTG-----VIMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLV-SG 307
Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI 348
R + F + L+F G EL P+ YL+
Sbjct: 308 R--------LSDLFPNVTLNFEGGA----MELQPDNYLM 334
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 94/348 (27%), Positives = 147/348 (42%), Gaps = 41/348 (11%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
G+ TG Y VT+ +G PA Y + DTGSD TW+QC+ V C + L+ P+
Sbjct: 153 GSALGTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQEKLFDPARSSTY 212
Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
+ C P C+ L+ G C C Y ++Y DG S+G D + + +
Sbjct: 213 ANISCAAPACSDLYIKG---CSG-GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAIK--- 265
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
GCG Y G+LGLG+GK+S+ Q + + V HC G G+L
Sbjct: 266 GFRFGCGERNE--GLYGEAAGLLGLGRGKTSLPVQAYDK--YGGVFAHCFPARSSGTGYL 321
Query: 217 FFG-DDLYDSSRVVWTSMSSDY-TKYYSPGVAELFFGGETTGLKNLP--------VVFDS 266
FG L S + T M D +Y G+ + GG+ L ++P + DS
Sbjct: 322 DFGPGSLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGK---LLSIPQSVFTTSGTIVDS 378
Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
G+ T L Y +L S ++ + K+AP L C+ F + +V T+
Sbjct: 379 GTVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYD----FTGMSEVA--IPTV 432
Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
+L F G + ++ + ++ CLG E D+ ++G
Sbjct: 433 SLLFQGGAS---LDVHASGIIYAASVSQACLGFAGNKED--DDVGIVG 475
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 103/363 (28%), Positives = 147/363 (40%), Gaps = 70/363 (19%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPIC 108
+ + IG PA Y +DTGSDL W QC PC C + P P++ P S V C +C
Sbjct: 1 MELSIGNPAVKYSAIVDTGSDLIWTQC-KPCTECFDQPTPIFDPEKSSSYSKVGCSSGLC 59
Query: 109 ASLHAPGHHNC-EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYN 167
+L NC ED C+Y Y D S+ G+L + F F N + GCG
Sbjct: 60 NALP---RSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSIS---GIGFGCGVE 113
Query: 168 QVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS----GGGGGFLFFGDDLY 223
G + G++GLG+G S++SQL K +CL+ LF G
Sbjct: 114 N-EGDGFSQGSGLVGLGRGPLSLISQLKETKF-----SYCLTSIEDSEASSSLFIGSLAS 167
Query: 224 DSSRVVWTSMSSDYTKYYS-------PGVAELFFGGETTGLKNLPV-------------- 262
S+ + TK S P L G T G K L V
Sbjct: 168 GIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGG 227
Query: 263 -VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET----LPLCWKGRRPFKNVH 317
+ DSG++ TYL ++ ++K+E +++ P D++ L LC+K KN+
Sbjct: 228 MIIDSGTTITYLEETAFK----VLKEEFTSR--MSLPVDDSGSTGLDLCFKLPDAAKNIA 281
Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLII-SNKGNVCL--GILNG----AEVGLQDL 370
K F EL E Y++ S+ G +CL G NG V Q+
Sbjct: 282 VPKMIFHFKGAD---------LELPGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNF 332
Query: 371 NVI 373
NV+
Sbjct: 333 NVL 335
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 87/298 (29%), Positives = 134/298 (44%), Gaps = 25/298 (8%)
Query: 22 SSSSSSSLFNHVGSSLLFQVHGNVYP-TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD 80
S S S++L N +S + ++ P +G Y +++ IG P Y DTGSDLTW QC
Sbjct: 62 SLSRSAALLNRAATSGAVGLQSSIGPGSGEYLMSVSIGTPPVDYLGIADTGSDLTWAQC- 120
Query: 81 APCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS 136
PC++C + P++ P S VPC C HA +C CDY Y D
Sbjct: 121 LPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTC---HAVDDGHCGVQGVCDYSYTYGDRTY 177
Query: 137 SLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS 196
S G L F + + +GCG+ G + G++GLG G+ S+VSQ+
Sbjct: 178 SKGDL-----GFEKITIGSSSVKSVIGCGHASSGGFGFA--SGVIGLGGGQLSLVSQMSQ 230
Query: 197 QKLIRNVVGHCLS---GGGGGFLFFGDDLYDSSRVVWTS--MSSDYTKYYSPGVAELFFG 251
I +CL G + FG++ S V ++ +S + YY + + G
Sbjct: 231 TSGISRRFSYCLPTLLSHANGKINFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISIG 290
Query: 252 GE--TTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
E K V+ DSG++ T L + Y + S + K + AK +K+ +L LC+
Sbjct: 291 NERHMAFAKQGNVIIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKD--PHGSLDLCF 346
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 101/377 (26%), Positives = 149/377 (39%), Gaps = 59/377 (15%)
Query: 39 FQVHG--NVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------V 87
F V G N Y G Y + +G PA+ YF+ +DTGSD+ W+ C +PC C +
Sbjct: 75 FPVEGSANPYMVGLYFTRVKLGNPAKEYFVQIDTGSDILWVAC-SPCTGCPTSSGLNIQL 133
Query: 88 EAPHPLYRPSNDLVPCEDPICASLHAPGHHNCED----PAQCDYELEYADGGSSLGVLVK 143
E +P ++ +PC D C + G C+ + C Y Y DG + G V
Sbjct: 134 EFFNPDSSSTSSRIPCSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTYGDGSGTSGFYVS 193
Query: 144 DAFAFNYTNGQRLNPR----LALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQ 197
D F+ G + GC +Q + +DGI G G+ + S+VSQL+S
Sbjct: 194 DTMYFDTVMGNEQTANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSL 253
Query: 198 KLIRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKY------------YSP 243
+ HCL G GGG L G+ + +V+T + Y P
Sbjct: 254 GVSPKTFSHCLKGSDNGGGILVLGEIV--EPGLVFTPLVPSQPHYNLNLESIAVSGQKLP 311
Query: 244 GVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETL 303
+ LF T G + DSG++ YL Y ++ A
Sbjct: 312 IDSSLFATSNTQG-----TIVDSGTTLVYLVDGAYDPFI---------NAIAAAVSPSVR 357
Query: 304 PLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGA 363
+ KG + F V F T L F G + T + PE YL+ +G+V +L
Sbjct: 358 SVVSKGIQCFVTTSSVDSSFPTATLYFKGGVSMT---VKPENYLL--QQGSVDNNVL--W 410
Query: 364 EVGLQDLNVIGGIGDFV 380
+G Q I +GD V
Sbjct: 411 CIGWQRSQGITILGDLV 427
>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
Length = 515
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 98/350 (28%), Positives = 155/350 (44%), Gaps = 48/350 (13%)
Query: 50 YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP--CVRCVEAPH------PLYRP----S 97
Y NVT +G P+ + + LDTGSDL WL CD CVR ++AP +Y P +
Sbjct: 105 YANVT--VGTPSDWFLVALDTGSDLFWLPCDCSTNCVRELKAPGGSSLDLNIYSPNASST 162
Query: 98 NDLVPCEDPICASLHAPGHHNCEDP-AQCDYELEY-ADGGSSLGVLVKDAFAF--NYTNG 153
+ VPC +C + C P + C Y++ Y ++G SS GVLV+D N
Sbjct: 163 SSKVPCNSTLCTRV-----DRCASPLSDCPYQIRYLSNGTSSTGVLVEDVLHLVSMEKNS 217
Query: 154 QRLNPRLALGCGYNQVPGASYH---PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
+ + R+ LGCG Q +H +G+ GLG S+ S L + + N C
Sbjct: 218 KPIRARITLGCGLVQT--GVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGD 275
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
G G + FGD R ++ + Y+ V ++ GG T L+ VFD+G+S+
Sbjct: 276 DGAGRISFGDKGSVDQRETPLNIRQPHPT-YNVTVTQISVGGNTGDLE-FDAVFDTGTSF 333
Query: 271 TYLNRVTYQTLTSIMKKELSAKSL-KEAPEDETLPL--CWKGRRPFKNVHDVKKCFR--T 325
TYL Y +++ + ++ +L K D LP C+ V KK F
Sbjct: 334 TYLTDAPY----TLISESFNSLALDKRYQTDSELPFEYCYA-------VSPNKKSFEYPD 382
Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGG 375
+ L+ G + ++ P + I + CL I+ ++ + N + G
Sbjct: 383 VNLTMKGGSSYPVYH--PLIVVPIEDTVVYCLAIMKSEDISIIGQNFMTG 430
>gi|47497551|dbj|BAD19623.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
gi|47847593|dbj|BAD21980.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
Length = 297
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 62/188 (32%), Positives = 91/188 (48%), Gaps = 22/188 (11%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP--- 96
TG Y + IG PA+ Y++ +DTGSD+ W+ C V C P +Y P
Sbjct: 87 TGLYFTRIGIGTPAKRYYVQVDTGSDILWVNC----VSCDGCPRKSNLGIELTMYDPRGS 142
Query: 97 -SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
S +LV C+ C + + +C + C+Y + Y DG S+ G V D +N +G
Sbjct: 143 QSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDG 202
Query: 156 ----LNPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
N ++ GCG G+S LDGILG G+ SS++SQL + +R + HCL
Sbjct: 203 QTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLD 262
Query: 210 GGGGGFLF 217
GG +F
Sbjct: 263 TVNGGGIF 270
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 61/189 (32%), Positives = 87/189 (46%), Gaps = 23/189 (12%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH--------PLY----R 95
G Y + IG P + Y+L +DTGSD+ W+ C ++C E P LY
Sbjct: 82 VGLYYAKIGIGTPPKNYYLQVDTGSDIMWVNC----IQCKECPTRSNLGMDLTLYDIKES 137
Query: 96 PSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ- 154
S VPC+ C ++ C C Y Y DG S+ G VKD ++ +G
Sbjct: 138 SSGKFVPCDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDL 197
Query: 155 ---RLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
N + GCG Q + ++ L GILG GK SS++SQL S ++ + HCL
Sbjct: 198 KTDSANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCL 257
Query: 209 SGGGGGFLF 217
+G GG +F
Sbjct: 258 NGVNGGGIF 266
>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
Length = 437
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 88/334 (26%), Positives = 153/334 (45%), Gaps = 41/334 (12%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEA-----PHPL 93
F + GN G Y + +G P + + +DTGSD+ W++C +PC C+ P +
Sbjct: 71 FPLKGNYSDLGLYYTEIGLGNPVQKLKVIVDTGSDILWVKC-SPCRSCLSKQDIIPPLSI 129
Query: 94 YR----PSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN 149
Y ++ + C DP+C A + + A C Y + Y D +S+G VKD +
Sbjct: 130 YNLSASSTSSVSSCSDPLCTGEQAVCSRSGSNSA-CAYGISYQDKSTSIGAYVKDDMHYV 188
Query: 150 YTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
G + GC N + G+ P DGI+G G+ ++ +Q+ +Q+ + V HCL
Sbjct: 189 LQGGNATTSHIFFGCAIN-ITGS--WPADGIMGFGQISKTVPNQIATQRNMSRVFSHCLG 245
Query: 210 GG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELF------------FGGETT 255
G GGG L FG++ +++ +V+T + + T +Y+ + + F +
Sbjct: 246 GEKHGGGILEFGEE-PNTTEMVFTPL-LNVTTHYNVDLLSISVNSKVLPIDSKEFSYVSN 303
Query: 256 GLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
V+ DSG+S+ L + L S +K +AK P+ E L + K+
Sbjct: 304 STNETGVIIDSGTSFALLATKANRILFSEIKNLTTAKL---GPKLEGLQCFY-----LKS 355
Query: 316 VHDVKKCFRTLALSFTDGKTRTLFELTPEAYLII 349
V+ F + L+F+ G T +L P+ YL++
Sbjct: 356 GLTVETSFPNVTLTFSGGST---MKLKPDNYLVM 386
>gi|213998806|gb|ACJ60770.1| nucellin [Hordeum flexuosum]
Length = 136
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 59/134 (44%), Positives = 73/134 (54%), Gaps = 5/134 (3%)
Query: 154 QRLNPRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSG 210
QR ++A GCGY Q A P +DGILGLG GK+ +QL QK+I NV+GHCLS
Sbjct: 3 QRDKKKIAFGCGYKQEEPADSPPSLVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSS 62
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSS 269
G G L+ GD S V W M YYSPG+AEL + G VFDSGS+
Sbjct: 63 KGKGVLYVGDFNPPSRGVTWVPMKESLF-YYSPGLAELLIDNQPIRGNPTFEAVFDSGST 121
Query: 270 YTYLNRVTYQTLTS 283
YT++ Y + S
Sbjct: 122 YTHVPAQIYNEIVS 135
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 103/361 (28%), Positives = 155/361 (42%), Gaps = 48/361 (13%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL 100
V G+ +G Y V ++G P + + L +D+GSDL W+QC +PC +C PLY PSN
Sbjct: 54 VSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQC-SPCRQCYAQDSPLYVPSNSS 112
Query: 101 ----VPCEDPICASLHAPGHHNCE--DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
VPC C + A C+ P C YE YAD SS GV A+ +G
Sbjct: 113 TFSPVPCLSSDCLLIPATEGFPCDFRYPGACAYEYLYADTSSSKGVF---AYESATVDGV 169
Query: 155 RLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQL---HSQKLIRNVVGHCLSGG 211
R++ ++A GCG + S+ G+LGLG+G S SQ+ + K +V +
Sbjct: 170 RID-KVAFGCGSDN--QGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTS 226
Query: 212 GGGFLFFGDDLYDSSR-VVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV-------- 262
L FGD+L + + +T + S+ SP + + T G K+LP+
Sbjct: 227 VSSSLIFGDELISTIHDMQYTPIVSNPK---SPTLYYVQIEKVTVGGKSLPISDSAWEID 283
Query: 263 -------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
+FDSG++ TY Y + I+ S A + L LC +
Sbjct: 284 LLGNGGSIFDSGTTLTYWFPSAY---SHILAAFDSGVHYPRAESVQGLDLCVE----LTG 336
Query: 316 VHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGG 375
V + F + + F DG +F+ E Y + CL + G L N IG
Sbjct: 337 VD--QPSFPSFTIEFDDG---AVFQPEAENYFVDVAPNVRCLA-MAGLASPLGGFNTIGN 390
Query: 376 I 376
+
Sbjct: 391 L 391
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 94/351 (26%), Positives = 153/351 (43%), Gaps = 45/351 (12%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV-EAPHPLYRP--S 97
V G +G Y V + IGQP + L DTGSDL W++C A C C +P ++ P S
Sbjct: 73 VSGASSGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSA-CRNCSHHSPATVFFPRHS 131
Query: 98 NDLVP--CEDPICASLHAPGH----HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
+ P C DP+C + PG ++ + C YE YADG + G+ ++ + +
Sbjct: 132 STFSPAHCYDPVCRLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTS 191
Query: 152 NGQRLNPR-LALGCGY----NQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNV 203
+G+ + +A GCG+ V G S++ +G++GLG+G S SQL + K +
Sbjct: 192 SGKEAKLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCL 251
Query: 204 VGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGG--------- 252
+ + LS +L GD S++ +T + ++ +Y + +F G
Sbjct: 252 MDYTLSPPPTSYLIIGDGGDAVSKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSI 311
Query: 253 -ETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP---LCWK 308
E N V DSG++ +L Y+ + + +K+ +K DE P LC
Sbjct: 312 WEIDDSGNGGTVMDSGTTLAFLADPAYRLVIAAVKQR-----IKLPNADELTPGFDLCVN 366
Query: 309 GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGI 359
V +K L F+ G +F P Y I + + CL I
Sbjct: 367 ----VSGVTKPEKILPRLKFEFSGG---AVFVPPPRNYFIETEEQIQCLAI 410
>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 513
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 95/341 (27%), Positives = 151/341 (44%), Gaps = 37/341 (10%)
Query: 50 YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP-CVRCVEAPH------PLYRPSNDLVP 102
Y NVT +G P+ + + LDTGSDL WL CD CVR ++AP +Y P+
Sbjct: 105 YANVT--VGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSLDLNIYSPNASSTS 162
Query: 103 CEDPICASLHAPGHHNCEDPAQCDYELEY-ADGGSSLGVLVKDAFAF--NYTNGQRLNPR 159
+ P ++L G + C Y++ Y ++G SS GVLV+D N + + + R
Sbjct: 163 TKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPAR 222
Query: 160 LALGCGYNQVPGASYH---PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFL 216
+ GCG QV +H +G+ GLG S+ S L + + N C G G +
Sbjct: 223 VTFGCG--QVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRI 280
Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRV 276
FGD R ++ + Y+ V ++ GG T L+ VFDSG+S+TYL
Sbjct: 281 SFGDKGSVDQRETPLNIRQPHPT-YNITVTKISVGGNTGDLE-FDAVFDSGTSFTYLTDA 338
Query: 277 TYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV---KKCFRTLALSFTDG 333
Y TL S L+ + + E PF+ + + K F+ A++ T
Sbjct: 339 AY-TLISESFNSLALDKRYQTTDSEL---------PFEYCYALSPNKDSFQYPAVNLTMK 388
Query: 334 KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
+ P + + + CL I+ ++D+++IG
Sbjct: 389 GGSSYPVYHPLVVIPMKDTDVYCLAIMK-----IEDISIIG 424
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 104/364 (28%), Positives = 153/364 (42%), Gaps = 50/364 (13%)
Query: 47 PTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVP 102
P Y + YIG P F DTGSDL W+QC APC +CV PL+ P VP
Sbjct: 88 PITEYLMRFYIGTPPVERFAIADTGSDLIWVQC-APCEKCVPQNAPLFDPRKSSTFKTVP 146
Query: 103 CEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
C+ C +L P C + QC Y+ Y D G+L ++ F N P+L
Sbjct: 147 CDSQPC-TLLPPSQRACVGKSGQCYYQYIYGDHTLVSGILGFESINFGSKNNAIKFPKLT 205
Query: 162 LGCGY--NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC---LSGGGGGFL 216
GC + N S + G++GLG G S++SQL Q I +C LS +
Sbjct: 206 FGCTFSNNDTVDESKRNM-GLVGLGVGPLSLISQLGYQ--IGRKFSYCFPPLSSNSTSKM 262
Query: 217 FFGDD--LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP---------VVFD 265
FG+D + VV T + K P L G + G K + ++ D
Sbjct: 263 RFGNDAIVKQIKGVVSTPL---IIKSIGPSYYYLNLEGVSIGNKKVKTSESQTDGNILID 319
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
SG+S+T L + Y +++K+ +++K P KG+R K F
Sbjct: 320 SGTSFTILKQSFYNKFVALVKEVYGVEAVKIPPLVYNFCFENKGKR---------KRFPD 370
Query: 326 LALSFTDGKTRT----LFELTPEAYLII-----SNKGNVCLGILNGAEVGLQ-DLNVIGG 375
+ FT K R LFE L + S++ + G N A++G Q + ++ GG
Sbjct: 371 VVFLFTGAKVRVDASNLFEAEDNNLLCMVALPTSDEDDSIFG--NHAQIGYQVEYDLQGG 428
Query: 376 IGDF 379
+ F
Sbjct: 429 MVSF 432
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 78/266 (29%), Positives = 123/266 (46%), Gaps = 25/266 (9%)
Query: 41 VHGNVYPT-GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND 99
+ + P+ G Y + +YIG P P +DTGSDLTW QC PC C + PL+ P N
Sbjct: 81 IQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQC-RPCTHCYKQVVPLFDPKNS 139
Query: 100 LV----PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
C C +L +C +C + YADG + G L + + T G+
Sbjct: 140 STYRDSSCGTSFCLALGK--DRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKP 197
Query: 156 LN-PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
++ P A GCG++ G GI+GLG G+ S++SQL S I + +CL
Sbjct: 198 VSFPGFAFGCGHSS-GGIFDKSSSGIVGLGGGELSLISQLKST--INGLFSYCL------ 248
Query: 215 FLFFGDDLYDSSRVVW--TSMSSDYTKYYSPGVAELFFGG--ETTGLKNLPVVFDSGSSY 270
L D SSR+ + + S Y +P L + G + T ++ ++ DSG++Y
Sbjct: 249 -LPVSTDSSISSRINFGASGRVSGYGTVSTP--LRLPYKGYSKKTEVEEGNIIVDSGTTY 305
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKE 296
T+L + Y L + + K +++
Sbjct: 306 TFLPQEFYSKLEKSVANSIKGKRVRD 331
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 91/343 (26%), Positives = 136/343 (39%), Gaps = 55/343 (16%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------VEAPHPLYRPSNDLV 101
Y + +G P + YF+ +DTGSD+ W+ C +PC C +E +P ++ +
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGCPSSSGLNIQLEFFNPDTSSTSSKI 175
Query: 102 PCEDPICASLHAPGHHNCE--DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
PC D C + C+ D + C Y Y DG + G V D F+ G
Sbjct: 176 PCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTAN 235
Query: 160 ----LALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--G 211
+ GC +Q + +DGI G G+ + S+VSQL+S + V HCL G
Sbjct: 236 SSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDN 295
Query: 212 GGGFLFFGDDLYDSSRVVWTSMSSDYTKY------------YSPGVAELFFGGETTGLKN 259
GGG L G+ + +V+T + Y P + LF T G
Sbjct: 296 GGGILVLGEIV--EPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQG--- 350
Query: 260 LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
+ DSG++ YL Y + + +S L KG + F V
Sbjct: 351 --TIVDSGTTLAYLADGAYDPFVNAITAAVSP---------SVRSLVSKGNQCFVTSSSV 399
Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLI----ISNKGNVCLG 358
F T++L F G T + PE YL+ I N C+G
Sbjct: 400 DSSFPTVSLYFMGGVAMT---VKPENYLLQQASIDNNVLWCIG 439
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 92/300 (30%), Positives = 129/300 (43%), Gaps = 25/300 (8%)
Query: 8 ENLCFPTVRMSSSSSSSSSSSLFNHVGSSL---LFQVHGNVYPTGYYNVTMYIGQPARPY 64
E L R++S S S NHV S L G+ +G Y VT+ +G P
Sbjct: 87 EILRLDQARVNSIHSKLSKKLTTNHVSQSQSTDLPAKDGSTLGSGNYIVTVGLGTPKNDL 146
Query: 65 FLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL----VPCEDPICASL-HAPGHHN 118
L DTGSDLTW QC PCVR C + P++ PS V C C SL A G+
Sbjct: 147 SLIFDTGSDLTWTQCQ-PCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAG 205
Query: 119 CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLD 178
+ C Y ++Y D S+G L KD F ++ + + GCG N + +
Sbjct: 206 SCSASNCIYGIQYGDQSFSVGFLAKDKFTLTSSD---VFDGVYFGCGENN--QGLFTGVA 260
Query: 179 GILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDD-LYDSSRVVWTSMSS 235
G+LGLG+ K S SQ + + +CL S G L FG + S + S +
Sbjct: 261 GLLGLGRDKLSFPSQ--TATAYNKIFSYCLPSSASYTGHLTFGSAGISRSVKFTPISTIT 318
Query: 236 DYTKYYSPGVAELFFGGE-----TTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELS 290
D T +Y + + GG+ +T + DSG+ T L Y L S K ++S
Sbjct: 319 DGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMS 378
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 79/276 (28%), Positives = 116/276 (42%), Gaps = 30/276 (10%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPC 103
+G Y V + IG P +L +D+GSD+ W+QC PC+ C PL+ P+ VPC
Sbjct: 124 SGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCK-PCLECYAQADPLFDPATSATFSAVPC 182
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+C +L G C D CDYE+ Y DG + G L + T + +A+G
Sbjct: 183 GSAVCRTLRTSG---CGDSGGCDYEVSYGDGSYTKGALALETLTLGGTAVE----GVAIG 235
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLY 223
CG+ + G+LGLG G S+V QL +CL+ G G L G
Sbjct: 236 CGHRNR--GLFVGAAGLLGLGWGPMSLVGQLGGAAG--GAFSYCLASRGAGSLVLGRSEA 291
Query: 224 DSSRVVWTSMSSD--YTKYYSPGVAELFFGGETTGLKN----------LPVVFDSGSSYT 271
VW + + +Y G++ + G E L+ VV D+G++ T
Sbjct: 292 VPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVMDTGTAVT 351
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
L + Y L + A L AP L C+
Sbjct: 352 RLPQEAYAALRDAFVAAVGA--LPRAPGVSLLDTCY 385
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 99/337 (29%), Positives = 145/337 (43%), Gaps = 45/337 (13%)
Query: 20 SSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC 79
S +S S+ S+ H+G S+ + Y VT+ +G PA L +DTGSDL+W+QC
Sbjct: 98 SRASKSNVSIPTHLGGSV---------DSLEYVVTVGLGTPAVSQVLLIDTGSDLSWVQC 148
Query: 80 DAPC--VRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGH-HNCED----PAQCDYE 128
APC C PL+ PS +PC C L G+ +C AQC Y
Sbjct: 149 -APCNSTTCYPQKDPLFDPSRSSTYAPIPCNTDACRDLTRDGYGSDCTSGSGGGAQCGYA 207
Query: 129 LEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQV-PGASYHPLDGILGLGKGK 187
+ Y DG + GV + G + GCG++Q P Y DG+LGLG
Sbjct: 208 ITYGDGSQTTGVYSNETLTM--APGVTVK-DFHFGCGHDQDGPNDKY---DGLLGLGGAP 261
Query: 188 SSIVSQLHSQKLIRNVVGHCLSGGG--GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGV 245
S+V Q S + +CL GFL G + D+S V+T M + +Y +
Sbjct: 262 ESLVVQTSS--VYGGAFSYCLPAANDQAGFLALGAPVNDASGFVFTPMVREQQTFYVVNM 319
Query: 246 AELFFGGETTGLKNLP----VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDE 301
+ GGE + ++ DSG+ T L Y L + +K ++A L E +
Sbjct: 320 TGITVGGEPIDVPPSAFSGGMIIDSGTVVTELQHTAYAALQAAFRKAMAAYPLLPNGELD 379
Query: 302 TLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTL 338
T C+ F +V +AL+F+ G T L
Sbjct: 380 T---CYN----FTGHSNVT--VPRVALTFSGGATVDL 407
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 86/345 (24%), Positives = 150/345 (43%), Gaps = 39/345 (11%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP-------HPLYRPSNDLVPC 103
Y + + +G P DTGSDL W+ C + +A P + + C
Sbjct: 103 YLMYVNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPTRSSTYSQLSC 162
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF--NYTNGQRLNPRLA 161
+ C +L +C+ ++C Y+ Y DG ++GVL + F+F GQ PR+
Sbjct: 163 QSNACQALS---QASCDADSECQYQYSYGDGSRTIGVLSTETFSFVDGGGKGQVRVPRVN 219
Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLF 217
GC A DG++GLG G S+VSQL + I + +CL L
Sbjct: 220 FGC---STASAGTFRSDGLVGLGAGAFSLVSQLGATTHIDRKLSYCLIPSYDANSSSTLN 276
Query: 218 FGDDLYDSSRVVWTS--MSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNR 275
FG S ++ + SD YY+ + + GG+ + ++ DSG++ T+L+
Sbjct: 277 FGSRAVVSEPGAASTPLVPSDVDSYYTVALESVAVGGQEVATHDSRIIVDSGTTLTFLDP 336
Query: 276 VTYQTLTSIMKKELSAKSLKEAPEDETLPLCW--KGRRPFKN--VHDVKKCFRTLALSFT 331
L + +++ + + ++ P ++ L LC+ +G+ N + DV L F
Sbjct: 337 ALLGPLVTELERRIKLQRVQ--PPEQLLQLCYDVQGKSETDNFGIPDVT-------LRFG 387
Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
G T L PE + +G +CL ++ +E Q ++++G I
Sbjct: 388 GGAAVT---LRPENTFSLLQEGTLCLVLVPVSES--QPVSILGNI 427
>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
Length = 746
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 94/361 (26%), Positives = 157/361 (43%), Gaps = 44/361 (12%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLYR 95
+HG V GY+ T+Y+G PA+ + + +DTGS +T++ C + C A P
Sbjct: 68 LHGAVKDYGYFYATLYLGTPAKKFAVIVDTGSTMTYVPCSSCGSGCGPNHQDAAFDPEAS 127
Query: 96 PSNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
+ + C P C+ G C QC Y YA+ SS G+L++D A + +G
Sbjct: 128 STASRISCTSPKCSC----GSPRCGCSTQQCTYTRSYAEQSSSSGILLEDVLALH--DGL 181
Query: 155 RLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-GGG 213
P + GC + DG+ GLG +S+V+QL +I +V C G
Sbjct: 182 PGAP-IIFGCETRETGEIFRQRADGLFGLGNSDASVVNQLVKAGVIDDVFSLCFGMVEGD 240
Query: 214 GFLFFGD-DLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLPV-------- 262
G L GD ++ S + +T + S+ + YY+ + L G+ LPV
Sbjct: 241 GALLLGDAEVPGSISLQYTPLLTSTTHPFYYNVKMLSLAVEGQL-----LPVSQSLFDQG 295
Query: 263 ---VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE--APEDETLPLCWKGRRPFKNVH 317
V DSG+++TY+ ++ ++K + LK P+ + +C+ ++
Sbjct: 296 YGTVLDSGTTFTYMPSPVFKAFAGAVEKYALSHGLKRVPGPDPQFDDICFGQAPSHDDLE 355
Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIIS--NKGNVCLGILNGAEVGLQDLNVIGG 375
+ F ++ + F G T L P YL + N G CLG+ + G ++GG
Sbjct: 356 ALSSVFPSMEVQFDQG---TSLVLGPLNYLFVHTFNSGKYCLGVFDNGRAG----TLLGG 408
Query: 376 I 376
I
Sbjct: 409 I 409
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 90/344 (26%), Positives = 142/344 (41%), Gaps = 44/344 (12%)
Query: 33 VGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---- 86
VG + F V G+ P G Y + +G P R + + +DTGSD+ W+ C++ C C
Sbjct: 46 VGGVVDFSVQGSSDPYLVGLYFTKVKLGSPPREFNVQIDTGSDVLWVCCNS-CNNCPRTS 104
Query: 87 -VEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGV 140
+ + S+ V C DPIC S C QC Y +Y DG + G
Sbjct: 105 GLGIQLNFFDSSSSSTAGQVRCSDPICTSAVQTTATQCSSQTDQCSYTFQYGDGSGTSGY 164
Query: 141 LVKDAFAFNYTNGQRL----NPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQL 194
V D F+ GQ L + + GC Q + +DGI G G+G+ S++SQL
Sbjct: 165 YVSDTLYFDAILGQSLIDNSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQL 224
Query: 195 HSQKLIRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGG 252
++ + V HCL G GGG L G+ L +V++ + +Y+ + + G
Sbjct: 225 STRGITPRVFSHCLKGDGSGGGILVLGEIL--EPGIVYSPLVPS-QPHYNLNLLSIAVNG 281
Query: 253 ETTGL--------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
+ + + + DSG++ YL Y S + +S P
Sbjct: 282 QLLPIDPAAFATSNSQGTIVDSGTTLAYLVAEAYDPFVSAVNAIVSP---------SVTP 332
Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI 348
+ KG + + V + F + +F G + L PE YLI
Sbjct: 333 ITSKGNQCYLVSTSVSQMFPLASFNFAGGASMV---LKPEDYLI 373
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 96/349 (27%), Positives = 151/349 (43%), Gaps = 36/349 (10%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL- 100
G+ TG Y VT+ +G P R DTGSDLTW QC+ PC R C P++ PS
Sbjct: 130 GSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCE-PCARYCYHQQEPIFNPSKSTS 188
Query: 101 ---VPCEDPICASLHA-PGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
+ C P C L + G+ + C Y ++Y D S+G +D A T+ +
Sbjct: 189 YTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKLALTSTD---V 245
Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGG 214
GCG N + + G++GLG+ S+VSQ +QK + + +CL + G
Sbjct: 246 FNNFLFGCGQNNR--GLFVGVAGLIGLGRNALSLVSQT-AQKYGK-LFSYCLPSTSSSTG 301
Query: 215 FLFFGDDLYDSSRVVWTS--MSSDYTKYYSPGVAELFFGGE-----TTGLKNLPVVFDSG 267
+L FG S V +T ++S +Y + + GG + + DSG
Sbjct: 302 YLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVFSTAGTIIDSG 361
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
+ + L Y L + ++++S K K AP L C+ + + DV K +
Sbjct: 362 TVISRLPPTAYSDLRASFQQQMS-KYPKAAPA-SILDTCYDFSQ--YDTVDVPK----IN 413
Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
L F+DG +L P I N VCL ++ D+ ++G +
Sbjct: 414 LYFSDGAE---MDLDPSGIFYILNISQVCLAFAGNSDA--TDIAILGNV 457
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 99/370 (26%), Positives = 154/370 (41%), Gaps = 48/370 (12%)
Query: 33 VGSSLLFQVHG--NVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP 90
VG + F V+G + Y G Y + +G P R + + +DTGSD+ W+ C++ C C
Sbjct: 66 VGGVVDFTVYGTSDPYLVGLYFTKVKLGSPPREFNVQIDTGSDILWVTCNS-CNDCPRTS 124
Query: 91 ---------HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGV 140
P + LV C PIC SL C + QC Y Y DG + G
Sbjct: 125 GLGIELSFFDPSSSSTTSLVSCSHPICTSLVQTTAAECSPQSNQCSYSFHYGDGSGTTGY 184
Query: 141 LVKDAFAFNYTNGQRL----NPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQL 194
V D F+ G L + + GC Q + +DGI G G+ S+VSQL
Sbjct: 185 YVSDMLYFDTVLGDSLIANSSASIVFGCSTYQSGDLTKVDKAIDGIFGFGQQDLSVVSQL 244
Query: 195 HSQKLIRNVVGHCLS--GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGG 252
S + V HCL G GGG L G+ L ++++ + + +Y+ + + G
Sbjct: 245 SSLGITPKVFSHCLKGEGDGGGKLVLGEIL--EPNIIYSPLVPSQS-HYNLNLQSISVNG 301
Query: 253 ETTGL--------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
+ + N + DSG++ TYL Y S + +S+ T P
Sbjct: 302 QLLPIDPAVFATSNNQGTIVDSGTTLTYLVETAYDPFVSAITATVSSS---------TTP 352
Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI---ISNKGNV-CLGIL 360
+ KG + + V + F ++L+F G + L P YL+ S+ + C+G
Sbjct: 353 VLSKGNQCYLVSTSVDEIFPPVSLNFAGGASMV---LKPGEYLMHLGFSDGAAMWCIGFQ 409
Query: 361 NGAEVGLQDL 370
AE G+ L
Sbjct: 410 KVAEPGITIL 419
>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 96/341 (28%), Positives = 151/341 (44%), Gaps = 37/341 (10%)
Query: 50 YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP-CVRCVEAPH------PLYRPSNDLVP 102
Y NVT +G P+ + + LDTGSDL WL CD CVR ++AP +Y P+
Sbjct: 105 YANVT--VGTPSDWFLVALDTGSDLFWLPCDCTNCVRELKAPGGSSLDLNIYSPNASSTS 162
Query: 103 CEDPICASLHAPGHHNCEDPAQCDYELEY-ADGGSSLGVLVKDAFAF--NYTNGQRLNPR 159
+ P ++L G + C Y++ Y ++G SS GVLV+D N + + + R
Sbjct: 163 TKVPCNSTLCTRGDRCASPESNCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPAR 222
Query: 160 LALGCGYNQVPGASYH---PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFL 216
+ LGCG QV +H +G+ GLG S+ S L + + N C G G +
Sbjct: 223 VTLGCG--QVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRI 280
Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRV 276
FGD R ++ + Y+ V ++ G T L+ VFDSG+S+TYL
Sbjct: 281 SFGDKGSVDQRETPLNIRQPHPT-YNITVTKISVEGNTGDLE-FDAVFDSGTSFTYLTDA 338
Query: 277 TYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV---KKCFRTLALSFTDG 333
Y TL S L+ + + E PF+ + + K F+ A++ T
Sbjct: 339 AY-TLISESFNSLALDKRYQTTDSEL---------PFEYCYALSPNKDSFQYPAVNLTMK 388
Query: 334 KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
+ P + + + CL IL ++D+++IG
Sbjct: 389 GGSSYPVYHPLVVIPMKDTDVYCLAILK-----IEDISIIG 424
>gi|213998802|gb|ACJ60768.1| nucellin [Hordeum murinum subsp. glaucum]
Length = 142
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 59/138 (42%), Positives = 75/138 (54%), Gaps = 5/138 (3%)
Query: 164 CGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSGGGGGFLFFGD 220
CGY Q A P+DGILGLG GK+ QL QK+I+ N++GHCLS G G L+ GD
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGFAVQLKGQKMIKENIIGHCLSSKGKGVLYVGD 60
Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYTYLNRVTYQ 279
S V W M YYSPG+AEL + G VFDSGS+YT++ Y
Sbjct: 61 FNPPSRGVTWVPMRESLF-YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPAHIYS 119
Query: 280 TLTSIMKKELSAKSLKEA 297
+ S ++ LS SL+E
Sbjct: 120 EIVSKVRGTLSESSLEEV 137
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 85/319 (26%), Positives = 126/319 (39%), Gaps = 46/319 (14%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLYRP----SNDLVPCEDPI 107
+G P + Y + +DTGSD+ W+ C PC C + P +Y P + LV C DP+
Sbjct: 8 LGNPVKHYIVQVDTGSDVLWVNC-RPCSGCPRKSALNIPLTMYDPRESSTTSLVSCSDPL 66
Query: 108 CASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRL---NPRLALG 163
C C C+Y Y DG +S G V+DA +N + L ++ G
Sbjct: 67 CVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTSQVLFG 126
Query: 164 CGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDD 221
C Q S +DGI+G G+ + S+ +QL +Q+ I V HCL G G
Sbjct: 127 CSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRGGGILVIG 186
Query: 222 LYDSSRVVWTSMSSDYTKYYS------------PGVAELFFGGETTGLKNLPVVFDSGSS 269
+ +T + D Y P AE F TG V+ DSG++
Sbjct: 187 GIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTG-----VIMDSGTT 241
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
Y Y +++ SA ++ D L GR + F + L+
Sbjct: 242 LAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLV-SGR--------LSDLFPNVTLN 292
Query: 330 FTDGKTRTLFELTPEAYLI 348
F G EL P+ YL+
Sbjct: 293 FEGGA----MELQPDNYLM 307
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 88/301 (29%), Positives = 134/301 (44%), Gaps = 35/301 (11%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SNDLVP--CEDP 106
Y +T+ +G PA+ + +D+GSD++W+QC PC++C PL+ P S+ P C
Sbjct: 131 YLITVRLGSPAKTQTVLIDSGSDVSWVQCK-PCLQCHSQVDPLFDPSLSSTYSPFSCSSA 189
Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
CA L G + C +QC Y + YADG S+ G D A G GC +
Sbjct: 190 ACAQLGQDG-NGCSSSSQCQYIVRYADGSSTTGTYSSDTLAL----GSNTISNFQFGCSH 244
Query: 167 NQVPGASYHPL-DGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLY 223
+ + ++ L DG++GLG G S+ SQ + +CL + GFL G
Sbjct: 245 VE---SGFNDLTDGLMGLGGGAPSLASQ--TAGTFGTAFSYCLPPTPSSSGFLTLG---A 296
Query: 224 DSSRVVWTSM--SSDYTKYYSPGVAELFFGGET----TGLKNLPVVFDSGSSYTYLNRVT 277
+S V T M SS +Y + + GG T + + +V DSG+ T L R
Sbjct: 297 GTSGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVFSAGMVMDSGTIITRLPRTA 356
Query: 278 YQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRT 337
Y L+S K + K + AP + C+ F V+ ++AL F+ G
Sbjct: 357 YSALSSAFKAGM--KQYRPAPPRSIMDTCFD----FSGQSSVR--LPSVALVFSGGAVVN 408
Query: 338 L 338
L
Sbjct: 409 L 409
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 109/388 (28%), Positives = 169/388 (43%), Gaps = 53/388 (13%)
Query: 17 MSSSSSSSSSSSLFNHVGSSLLFQVHGNV-YPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
+S S+++ S+ + ++ V V +G Y V +Y+G P R + + +DTGSDL
Sbjct: 114 LSGSAAARRDSAPRRALSERVVATVESGVPVGSGEYLVDVYLGTPPRRFRMIMDTGSDLN 173
Query: 76 WLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGH---HNCEDPAQ--CD 126
WLQC APC+ C E P++ P+ + V C D C + P C P C
Sbjct: 174 WLQC-APCLDCFEQSGPIFDPAASISYRNVTCGDDRCRLVSPPAESAPRECRRPRSDPCP 232
Query: 127 YELEYADGGSSLGVLVKDAFAFNYT-NGQRLNPRLALGCGYNQVPGASYHPLDGILGLGK 185
Y Y D ++ G L +AF N T +G R +A GCG+ +H G+LGLG+
Sbjct: 233 YYYWYGDQSNTTGDLALEAFTVNLTQSGTRRVDGVAFGCGHRN--RGLFHGAAGLLGLGR 290
Query: 186 GKSSIVSQLHSQKLIRNVVG-----HCL----SGGGGGFLFFGDD-LYDSSRVVWTSM-- 233
G S SQL R V G +CL S G +F DD L ++ +T+
Sbjct: 291 GPLSFASQL------RGVYGGHAFSYCLVEHGSAAGSKIIFGHDDALLAHPQLNYTAFAP 344
Query: 234 SSDYTKYYSPGVAELFFGGETTGLKNLPV-----VFDSGSSYTYLNRVTYQTLTSIMKKE 288
++D +Y + + GGE + + + + DSG++ +Y YQ +
Sbjct: 345 TTDADTFYYLQLKSILVGGEAVNISSDTLSAGGTIIDSGTTLSYFPEPAYQAIRQAFIDR 404
Query: 289 LSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC-FRTLALSFTDGKTRTLFELTPEAYL 347
+S L L + P NV +K L+L F DG +E E Y
Sbjct: 405 MSPSY--------PLILGFPVLSPCYNVSGAEKVEVPELSLVFADGAA---WEFPAENYF 453
Query: 348 I-ISNKGNVCLGILNGAEVGLQDLNVIG 374
I + +G +CL +L G +++IG
Sbjct: 454 IRLEPEGIMCLAVLGTPRSG---MSIIG 478
>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 527
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 89/341 (26%), Positives = 145/341 (42%), Gaps = 39/341 (11%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH--PLYRPSNDLVPCEDPICASLHAP 114
+G PA Y + LDTGSDL WL C+ C +CV + + ++ ++ + A
Sbjct: 119 VGTPASSYLVALDTGSDLFWLPCN--CTKCVHGIQLSTGQKIAFNIYDNKESSTSKNVAC 176
Query: 115 GHHNCEDPAQCD--------YELEY-ADGGSSLGVLVKDAFAF---NYTNGQRLNPRLAL 162
CE QC Y++EY ++ S+ G LV+D N Q NP +
Sbjct: 177 NSSLCEQKTQCSSSSGGTCPYQVEYLSENTSTTGFLVEDVLHLITDNDDQTQHANPLITF 236
Query: 163 GCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFF 218
GCG Q + GA+ +G+ GLG S+ S L Q L N C + G G + F
Sbjct: 237 GCGQVQTGAFLDGAA---PNGLFGLGMSDVSVPSILAKQGLTSNSFSMCFAADGLGRITF 293
Query: 219 GDD--LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRV 276
GD+ D + + S T Y+ V ++ GG + L+ +FD+G+S+TYLN
Sbjct: 294 GDNNSSLDQGKTPFNIRPSHST--YNITVTQIIVGGNSADLE-FNAIFDTGTSFTYLNNP 350
Query: 277 TYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK--KCFRTLALSFTDGK 334
Y+ +T ++ + + D+ PF+ +D++ + ++ T
Sbjct: 351 AYKQITQSFDSKIKLQRHSFSNSDDL---------PFEYCYDLRTNQTIEVPNINLTMKG 401
Query: 335 TRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGG 375
F + P N G +CL +L V + N + G
Sbjct: 402 GDNYFVMDPIITSGGGNNGVLCLAVLKSNNVNIIGQNFMTG 442
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 89/332 (26%), Positives = 142/332 (42%), Gaps = 38/332 (11%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G Y +T +G P + DTGSD+ WLQC+ PC +C P++ PS +PC
Sbjct: 85 GGYLMTYSVGTPPTKIYGIADTGSDIVWLQCE-PCEQCYNQTTPIFNPSKSSSYKNIPCS 143
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
+C H+ +C D C Y++ Y D S G L D + T+G ++ P++ +G
Sbjct: 144 SKLC---HSVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKIVIG 200
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL------SGGGGGFLF 217
CG + G GI+GLG G S+++QL S I +CL L
Sbjct: 201 CGTDNA-GTFGGASSGIVGLGGGPVSLITQLGSS--IGGKFSYCLVPLLNKESNASSILS 257
Query: 218 FGDDLYDSSRVVWTS--MSSDYTKY------YSPGVAELFFGGETTGLKNL-PVVFDSGS 268
FGD S V ++ + D Y +S G + FGG + G + ++ DSG+
Sbjct: 258 FGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSGT 317
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR-----PFKNVH----DV 319
+ T + Y L S + + + + ++ LC+ + P VH DV
Sbjct: 318 TLTLIPSDVYTNLESAVVDLVKLDRVDDP--NQQFSLCYSLKSNEYDFPIITVHFKGADV 375
Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISN 351
+ + + TDG F+ +P+ I N
Sbjct: 376 ELHSISTFVPITDGIVCFAFQPSPQLGSIFGN 407
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 90/364 (24%), Positives = 152/364 (41%), Gaps = 40/364 (10%)
Query: 39 FQVHG--NVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEA-----PH 91
F V G + Y G Y + +G P + +++ +DTGSD+ W+ C + C C ++ P
Sbjct: 54 FPVEGTYDPYRVGLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGS-CNGCPQSSGLHIPL 112
Query: 92 PLYRP----SNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAF 146
+ P + L+ C D C+ C QC Y +Y DG + G V D
Sbjct: 113 NFFDPGSSSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLL 172
Query: 147 AFNYTNGQRL---NPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
F+ G + + + GC +Q S +DGI G G+ S++SQ+ SQ +
Sbjct: 173 NFDAIVGSSVTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITP 232
Query: 202 NVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL---- 257
V HCL G GGG +V++ + +Y+ + + G++ +
Sbjct: 233 KVFSHCLKGDGGGGGILVLGEIVEEDIVYSPLVPS-QPHYNLNLQSISVNGKSLAIDPEV 291
Query: 258 ----KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPF 313
N + DSG++ YL Y S ++ EA PL KG + +
Sbjct: 292 FATSTNRGTIVDSGTTLAYLAEEAYDPFVS---------AITEAVSQSVRPLLSKGTQCY 342
Query: 314 KNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK-GNVCLGILNGAEVGLQDLNV 372
VK F T++L+F G + L PE YL+ N G+ + + ++ Q + +
Sbjct: 343 LITSSVKGIFPTVSLNFAGGVS---MNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITI 399
Query: 373 IGGI 376
+G +
Sbjct: 400 LGDL 403
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 125/278 (44%), Gaps = 29/278 (10%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
G Y ++ +G P+ F LDTGSD+ WLQC PC +C E P++ S +PC
Sbjct: 87 GEYLISYSVGTPSLQVFGILDTGSDIIWLQCQ-PCKKCYEQTTPIFDSSKSQTYKTLPCP 145
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
C S+ C C Y + Y DG SLG L + TNG + P +G
Sbjct: 146 SNTCQSVQG---TFCSSRKHCLYSIHYVDGSQSLGDLSVETLTLGSTNGSPVQFPGTVIG 202
Query: 164 CG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG---GGGFLFFG 219
CG YN + + GI+GLG+G S+++QL + +CL G L FG
Sbjct: 203 CGRYNAIGIEEKN--SGIVGLGRGPMSLITQLSPSTGGK--FSYCLVPGLSTASSKLNFG 258
Query: 220 DDLYDSSR-VVWTSMSSD--------YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
+ S R V T + S + +S G + FG +G K ++ DSG++
Sbjct: 259 NAAVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGSPGSGGKG-NIIIDSGTTL 317
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
T L Y L + + K + + +++ ++ L LC+K
Sbjct: 318 TALPNGVYSKLEAAVAKTVILQRVRD--PNQVLGLCYK 353
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 113/366 (30%), Positives = 152/366 (41%), Gaps = 71/366 (19%)
Query: 42 HGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS---- 97
+ N PT Y V + IG P +P L LDTGSDL W QC PCV C + P P + S
Sbjct: 26 YDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCK-PCVSCFDQPLPYFDTSRSST 84
Query: 98 NDLVPCE------DP---ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
N L+PCE DP +C L+ + C Y Y D ++G+L D F F
Sbjct: 85 NALLPCESTQCKLDPTVTVCVKLN-------QTVQTCAYYTSYGDNSVTIGLLAADKFTF 137
Query: 149 NYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
G L P + GCG N G GI G G+G S+ SQL HC
Sbjct: 138 --VAGTSL-PGVTFGCGLNNT-GVFNSNETGIAGFGRGPLSLPSQLKVGNF-----SHCF 188
Query: 209 SGGGGG-----FLFFGDDLYDSSR-VVWTSMSSDYTKYYS-PGVAELFFGGETTGLKNLP 261
+ G L DL+ + + V T+ Y K + P + L G T G LP
Sbjct: 189 TTITGAIPSTVLLDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLP 248
Query: 262 V--------------VFDSGSSYTYLNRVTYQTLTSIMKKELSAK-SLKEAPEDET-LPL 305
V + DSG+S T L YQ +++ E +A+ L P + T
Sbjct: 249 VPESAFALTNGTGGTIIDSGTSITSLPPQVYQ----VVRDEFAAQIKLPVVPGNATGHYT 304
Query: 306 CWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYL--IISNKGN--VCLGILN 361
C+ P + DV K L L F +G T +L E Y+ + + GN +CL I
Sbjct: 305 CFSA--PSQAKPDVPK----LVLHF-EGAT---MDLPRENYVFEVPDDAGNSIICLAINK 354
Query: 362 GAEVGL 367
G E +
Sbjct: 355 GDETTI 360
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 90/364 (24%), Positives = 152/364 (41%), Gaps = 40/364 (10%)
Query: 39 FQVHG--NVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEA-----PH 91
F V G + Y G Y + +G P + +++ +DTGSD+ W+ C + C C ++ P
Sbjct: 69 FPVEGTYDPYRVGLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGS-CNGCPQSSGLHIPL 127
Query: 92 PLYRP----SNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAF 146
+ P + L+ C D C+ C QC Y +Y DG + G V D
Sbjct: 128 NFFDPGSSSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLL 187
Query: 147 AFNYTNGQRL---NPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
F+ G + + + GC +Q S +DGI G G+ S++SQ+ SQ +
Sbjct: 188 NFDAIVGSSVTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITP 247
Query: 202 NVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL---- 257
V HCL G GGG +V++ + +Y+ + + G++ +
Sbjct: 248 KVFSHCLKGDGGGGGILVLGEIVEEDIVYSPLVPS-QPHYNLNLQSISVNGKSLAIDPEV 306
Query: 258 ----KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPF 313
N + DSG++ YL Y S ++ EA PL KG + +
Sbjct: 307 FATSTNRGTIVDSGTTLAYLAEEAYDPFVS---------AITEAVSQSVRPLLSKGTQCY 357
Query: 314 KNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK-GNVCLGILNGAEVGLQDLNV 372
VK F T++L+F G + L PE YL+ N G+ + + ++ Q + +
Sbjct: 358 LITSSVKGIFPTVSLNFAGGVS---MNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITI 414
Query: 373 IGGI 376
+G +
Sbjct: 415 LGDL 418
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 88/330 (26%), Positives = 136/330 (41%), Gaps = 56/330 (16%)
Query: 44 NVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEA-----PHPLYRP-- 96
+ + TG Y +Y+G P + +++ +DTGSD+ W+ C PC C A P ++ P
Sbjct: 41 DTFTTGLYYTRIYLGTPPQQFYVHVDTGSDVAWVNC-VPCTNCKRASNVALPISIFDPEK 99
Query: 97 --SNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNY--- 150
S + C D C + + C + C Y Y DG S+ G L+ D +FN
Sbjct: 100 STSKTSISCTDEEC---YLASNSKCSFNSMSCPYSTLYGDGSSTAGYLINDVLSFNQVPS 156
Query: 151 --TNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
+ RL GCG NQ DG++G G+ + S+ SQL Q + N+ HCL
Sbjct: 157 GNSTATSGTARLTFGCGSNQ---TGTWLTDGLVGFGQAEVSLPSQLSKQNVSVNIFAHCL 213
Query: 209 SG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----- 261
G G G L G +V+T + + Y V L G T +
Sbjct: 214 QGDNKGSGTLVIGH--IREPGLVYTPIVPKQSHY---NVELLNIGVSGTNVTTPTAFDLS 268
Query: 262 ----VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
V+ DSG++ TYL + Y + AK +++ LP+ F+
Sbjct: 269 NSGGVIMDSGTTLTYLVQPAYD--------QFQAK-VRDCMRSGVLPVA------FQFFC 313
Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYL 347
++ F + L F G L+P +YL
Sbjct: 314 TIEGYFPNVTLYFAGGAA---MLLSPSSYL 340
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 85/271 (31%), Positives = 123/271 (45%), Gaps = 27/271 (9%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSND----LVP 102
G Y + +YIG P+ DTGSDLTW+QC +PC +C PLY P N L+P
Sbjct: 94 GNYLMRIYIGTPSVERLAIADTGSDLTWVQC-SPCDNTKCFAQNTPLYDPLNSSTFTLLP 152
Query: 103 CEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
C+ C L + C D C Y Y D S G L D+ N ++
Sbjct: 153 CDSQPCTQLPY-SQYVCSDYGDCIYAYTYGDNSYSYGGLSSDSIRLMLLQ-LHYNSKICF 210
Query: 163 GCGY-NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFF 218
GCG+ N+ GI+GLG G S+VSQL + I + +CL S L F
Sbjct: 211 GCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDE--IGHKFSYCLLPFSSNSNSKLKF 268
Query: 219 GD-DLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGET--TGLKNLPVVFDSGSSYTYL 273
G+ + + VV T + D YY + + G +T TG + ++ DSGS+ TYL
Sbjct: 269 GEAAIVQGNGVVSTPLIIKPDLPFYYL-NLEGITVGAKTVKTGQTDGNIIIDSGSTLTYL 327
Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
Y S++K+ ++ + ED+ +P
Sbjct: 328 EESFYNEFVSLVKETVAVE------EDQYIP 352
>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
Length = 431
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 96/356 (26%), Positives = 144/356 (40%), Gaps = 51/356 (14%)
Query: 46 YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP- 96
Y TG Y + IG PA Y++ LDTGS W+ + C + PH Y P
Sbjct: 54 YGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVNG----ISCKQCPHESDILRKLTFYDPR 109
Query: 97 ---SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN--YT 151
S+ V C+D IC S P C +C Y YADGG ++G+L D ++ Y
Sbjct: 110 SSVSSKEVKCDDTICTS-RPP----CNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYG 164
Query: 152 NGQR--LNPRLALGCGYNQVPGA--SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
NGQ + + GCG Q S +DGI+G G + +SQL + + + HC
Sbjct: 165 NGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHC 224
Query: 208 L-SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------K 258
L S GGG G+ + +V T + + Y+ + + G T L K
Sbjct: 225 LDSTNGGGIFAIGEVV--EPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTK 282
Query: 259 NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
DSGS+ YL + Y EL + P D T+ + + F +
Sbjct: 283 TKGTFIDSGSTLVYLPEIIY--------SELILAVFAKHP-DITMGAMYN-FQCFHFLGS 332
Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
V F + F + T ++ P YL+ C G + G +D+ ++G
Sbjct: 333 VDDKFPKITFHFENDLT---LDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILG 385
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 94/340 (27%), Positives = 148/340 (43%), Gaps = 51/340 (15%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G Y + M IG P R Y LDTGSDL W QC APC+ CV+ P P + P+ + C
Sbjct: 88 GEYLMEMGIGTPTRYYSAILDTGSDLIWTQC-APCLLCVDQPTPYFDPARSATYRSLGCA 146
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
P C +L+ P + C Y+ Y D S+ GVL + F F TN R++ P ++ G
Sbjct: 147 SPACNALYYPLCYQ----KVCVYQYFYGDSASTAGVLANETFTFG-TNETRVSLPGISFG 201
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF---LFFGD 220
CG + S G++G G+G S+VSQL S + +CL+ L+FG
Sbjct: 202 CG--NLNAGSLANGSGMVGFGRGSLSLVSQLGSPRF-----SYCLTSFLSPVPSRLYFGV 254
Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFF---GGETTGLKNLPV--------------- 262
+S + +P + ++F G + G LP+
Sbjct: 255 YATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGG 314
Query: 263 -VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
+ DSG++ TYL Y + + +++ L + L C++ P + + +
Sbjct: 315 TIIDSGTTITYLAEPAYDAVRAAFASQITLP-LLNVTDASVLDTCFQWPPPPRQSVTLPQ 373
Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLII--SNKGNVCLGI 359
L L F DG +EL + Y+++ S G +CL +
Sbjct: 374 ----LVLHF-DGAD---WELPLQNYMLVDPSTGGGLCLAM 405
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 96/356 (26%), Positives = 144/356 (40%), Gaps = 51/356 (14%)
Query: 46 YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP- 96
Y TG Y + IG PA Y++ LDTGS W+ + C + PH Y P
Sbjct: 78 YGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVNG----ISCKQCPHESDILRKLTFYDPR 133
Query: 97 ---SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN--YT 151
S+ V C+D IC S P C +C Y YADGG ++G+L D ++ Y
Sbjct: 134 SSVSSKEVKCDDTICTS-RPP----CNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYG 188
Query: 152 NGQR--LNPRLALGCGYNQVPGA--SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
NGQ + + GCG Q S +DGI+G G + +SQL + + + HC
Sbjct: 189 NGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHC 248
Query: 208 L-SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------K 258
L S GGG G+ + +V T + + Y+ + + G T L K
Sbjct: 249 LDSTNGGGIFAIGEVV--EPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTK 306
Query: 259 NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
DSGS+ YL + Y EL + P D T+ + + F +
Sbjct: 307 TKGTFIDSGSTLVYLPEIIY--------SELILAVFAKHP-DITMGAMYN-FQCFHFLGS 356
Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
V F + F + T ++ P YL+ C G + G +D+ ++G
Sbjct: 357 VDDKFPKITFHFENDLT---LDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILG 409
>gi|213998814|gb|ACJ60774.1| nucellin [Hordeum cf. pusillum GP-2003]
Length = 142
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 60/138 (43%), Positives = 74/138 (53%), Gaps = 5/138 (3%)
Query: 164 CGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSGGGGGFLFFGD 220
CGY Q A P+DGILGLG GK+ +QL QK+I NV+GHCLS G G L+ GD
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVLYVGD 60
Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYTYLNRVTYQ 279
S V W M YYSPG+AEL + G VFDSGS+YT++ Y
Sbjct: 61 FNPPSRGVTWVPMKESLF-YYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPAQIYN 119
Query: 280 TLTSIMKKELSAKSLKEA 297
+ S + LS SL+E
Sbjct: 120 EIVSKVIGTLSESSLEEV 137
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 96/332 (28%), Positives = 144/332 (43%), Gaps = 41/332 (12%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
TG Y V++ +G PA+ Y + DTGSDL+W+QC PC C E PL+ PS V C
Sbjct: 146 TGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCK-PCADCYEQQDPLFDPSLSSTYAAVAC 204
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
P C L A G C ++C YE++Y D + G LV+D + ++ P G
Sbjct: 205 GAPECQELDASG---CSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD---TLPGFVFG 258
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDD 221
CG +Q G + +DG+ GLG+ K S+ SQ +CL S G G+L G
Sbjct: 259 CG-DQNAGL-FGQVDGLFGLGREKVSLPSQ--GAPSYGPGFTYCLPSSSSGRGYLSLGG- 313
Query: 222 LYDSSRVVWTSMSSDYT-KYYSPGVAELFFGGETTGL------KNLPVVFDSGSSYTYLN 274
+ +T+++ T +Y + + GG + V DSG+ T L
Sbjct: 314 -APPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVITRLP 372
Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW--KGRRPFKNVHDVKKCFRTLALSFTD 332
Y L + + ++ K+AP L C+ G R + T+ L+F
Sbjct: 373 PRAYAPLRAAFARSMA--QYKKAPALSILDTCYDFTGHRTAQ--------IPTVELAFAG 422
Query: 333 GKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
G T +L + T L +S CL A+
Sbjct: 423 GATVSL-DFT--GVLYVSKVSQACLAFAPNAD 451
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 96/332 (28%), Positives = 144/332 (43%), Gaps = 41/332 (12%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
TG Y V++ +G PA+ Y + DTGSDL+W+QC PC C E PL+ PS V C
Sbjct: 146 TGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCK-PCADCYEQQDPLFDPSLSSTYAAVAC 204
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
P C L A G C ++C YE++Y D + G LV+D + ++ P G
Sbjct: 205 GAPECQELDASG---CSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD---TLPGFVFG 258
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDD 221
CG +Q G + +DG+ GLG+ K S+ SQ +CL S G G+L G
Sbjct: 259 CG-DQNAGL-FGQVDGLFGLGREKVSLPSQ--GAPSYGPGFTYCLPSSSSGRGYLSLGG- 313
Query: 222 LYDSSRVVWTSMSSDYT-KYYSPGVAELFFGGETTGL------KNLPVVFDSGSSYTYLN 274
+ +T+++ T +Y + + GG + V DSG+ T L
Sbjct: 314 -APPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVITRLP 372
Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW--KGRRPFKNVHDVKKCFRTLALSFTD 332
Y L + + ++ K+AP L C+ G R + T+ L+F
Sbjct: 373 PRAYAPLRAAFARSMA--QYKKAPALSILDTCYDFTGHRTAQ--------IPTVELAFAG 422
Query: 333 GKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
G T +L + T L +S CL A+
Sbjct: 423 GATVSL-DFT--GVLYVSKVSQACLAFAPNAD 451
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 96/356 (26%), Positives = 144/356 (40%), Gaps = 51/356 (14%)
Query: 46 YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP- 96
Y TG Y + IG PA Y++ LDTGS W+ + C + PH Y P
Sbjct: 78 YGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVNG----ISCKQCPHESDILRKLTFYDPR 133
Query: 97 ---SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN--YT 151
S+ V C+D IC S P C +C Y YADGG ++G+L D ++ Y
Sbjct: 134 SSVSSKEVKCDDTICTS-RPP----CNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYG 188
Query: 152 NGQR--LNPRLALGCGYNQVPGA--SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
NGQ + + GCG Q S +DGI+G G + +SQL + + + HC
Sbjct: 189 NGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHC 248
Query: 208 L-SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------K 258
L S GGG G+ + +V T + + Y+ + + G T L K
Sbjct: 249 LDSTNGGGIFAIGEVV--EPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTK 306
Query: 259 NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
DSGS+ YL + Y EL + P D T+ + + F +
Sbjct: 307 TKGTFIDSGSTLVYLPEIIY--------SELILAVFAKHP-DITMGAMYN-FQCFHFLGS 356
Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
V F + F + T ++ P YL+ C G + G +D+ ++G
Sbjct: 357 VDDKFPKITFHFENDLT---LDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILG 409
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 96/356 (26%), Positives = 144/356 (40%), Gaps = 51/356 (14%)
Query: 46 YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP- 96
Y TG Y + IG PA Y++ LDTGS W+ + C + PH Y P
Sbjct: 54 YGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVNG----ISCKQCPHESDILRKLTFYDPR 109
Query: 97 ---SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN--YT 151
S+ V C+D IC S P C +C Y YADGG ++G+L D ++ Y
Sbjct: 110 SSVSSKEVKCDDTICTS-RPP----CNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYG 164
Query: 152 NGQR--LNPRLALGCGYNQVPGA--SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
NGQ + + GCG Q S +DGI+G G + +SQL + + + HC
Sbjct: 165 NGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHC 224
Query: 208 L-SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------K 258
L S GGG G+ + +V T + + Y+ + + G T L K
Sbjct: 225 LDSTNGGGIFAIGEVV--EPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTK 282
Query: 259 NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
DSGS+ YL + Y EL + P D T+ + + F +
Sbjct: 283 TKGTFIDSGSTLVYLPEIIY--------SELILAVFAKHP-DITMGAMYN-FQCFHFLGS 332
Query: 319 VKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
V F + F + T ++ P YL+ C G + G +D+ ++G
Sbjct: 333 VDDKFPKITFHFENDLT---LDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILG 385
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 88/333 (26%), Positives = 131/333 (39%), Gaps = 53/333 (15%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------VEAPHPLYRPSND 99
G Y + +G PA+ +F+ +DTGSD+ W+ C +PC C +E+ +P +
Sbjct: 3 GLYFTRVKLGNPAKEFFVQIDTGSDILWVTC-SPCTGCPTSSGLNIQLESFNPDSSSTAS 61
Query: 100 LVPCEDPICASLHAPGHHNCE----DPAQCDYELEYADGGSSLGVLVKDAFAFNYT--NG 153
+ C D C + G C+ + C Y Y DG + G V D F N
Sbjct: 62 RITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNE 121
Query: 154 QRLN--PRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
Q N + GC +Q + +DGI G G+ + S++SQL+S + V HCL
Sbjct: 122 QTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLK 181
Query: 210 GG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKY------------YSPGVAELFFGGETT 255
G GGG L G+ + +V+T + Y P + LF T
Sbjct: 182 GSDNGGGILVLGEIV--EPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQ 239
Query: 256 GLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
G + DSG++ YL Y S + +S L KG + F
Sbjct: 240 G-----TIVDSGTTLAYLADGAYDPFVSAIAAAVSP---------SVRSLVSKGSQCFIT 285
Query: 316 VHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI 348
V F T+ L F G + PE YL+
Sbjct: 286 SSSVDSSFPTVTLYFMGG---VAMSVKPENYLL 315
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 78/283 (27%), Positives = 120/283 (42%), Gaps = 33/283 (11%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCE 104
G + V +Y+G P + + +DTGSDLTW+Q + PC C E P++ PS + + C
Sbjct: 23 GEFLVPIYLGTPPQKAVVIIDTGSDLTWIQSE-PCRACFEQADPIFDPSKSSTYNKIACS 81
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
CA L G C A C Y Y DG + G K+ T G+ + G
Sbjct: 82 SSACADLL--GTQTCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAGEEVK----FGA 135
Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFFG 219
+GILGLG+G S+ SQL S ++ N +CL +G ++FG
Sbjct: 136 SVYNTGTFGDTGGEGILGLGQGPVSMPSQLGS--VLGNKFSYCLVDWLSAGSETSTMYFG 193
Query: 220 DDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLP----------VVFDSG 267
D S V +T + ++D+ YY V + GG + + DSG
Sbjct: 194 DAAVPSGEVQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGGSGGTIIDSG 253
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
++ TYL + + L + ++ + A L LC+ R
Sbjct: 254 TTITYLQQEVFNALVAAYTSQVRYPTTTSA---TGLDLCFNTR 293
>gi|213998800|gb|ACJ60767.1| nucellin [Hordeum marinum subsp. marinum]
Length = 142
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/138 (42%), Positives = 76/138 (55%), Gaps = 5/138 (3%)
Query: 164 CGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSGGGGGFLFFGD 220
CGY Q A P+DGILGLG GK+ +QL QK+I NV+GHCLS G G L+ G+
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVLYVGN 60
Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYTYLNRVTYQ 279
S V W M + + YYSPG+AEL + G VFDSGS+YT + Y
Sbjct: 61 FNPPSRGVTWVPM-RESSFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTLVPSQIYN 119
Query: 280 TLTSIMKKELSAKSLKEA 297
+ S ++ LS SL+E
Sbjct: 120 EIVSKVRGTLSESSLEEV 137
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 107/374 (28%), Positives = 157/374 (41%), Gaps = 49/374 (13%)
Query: 25 SSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV 84
S +++ VGS + V G +G Y V + +G P +L +D+GSD+ W+QC PC
Sbjct: 110 SPTTMTTEVGSEV---VSGISEGSGEYFVRVGVGSPPTEQYLVVDSGSDVIWIQCR-PCA 165
Query: 85 RCVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGV 140
C + PL+ P+ VPC+ +C +L G C D C Y++ Y DG + GV
Sbjct: 166 ECYQQADPLFDPAASASFTAVPCDSGVCRTLPG-GSSGCADSGACRYQVSYGDGSYTQGV 224
Query: 141 LVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLI 200
L + F + + +A+GCG+ + G+LGLG G S+V QL
Sbjct: 225 LAMETLTFGDSTPVQ---GVAIGCGHRNR--GLFVGAAGLLGLGWGPMSLVGQLGGAAG- 278
Query: 201 RNVVGHCLSG----GGGGFLFFGDDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGE- 253
+CL+ G G L FG D VW + ++ +Y G+ L GGE
Sbjct: 279 -GAFSYCLASRGADAGAGSLVFGRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGER 337
Query: 254 ---TTGLKNLP------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
GL +L VV D+G++ T L Y L + L AP L
Sbjct: 338 LPLQDGLFDLTEDGGGGVVMDTGTAVTRLPPDAYAALRDAFASTIGGD-LPRAPGVSLLD 396
Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSF-TDGKTRTLFELTPEAYLIISNKGNV-CLGILNG 362
C+ V+ T+AL F DG TL P L++ G V CL
Sbjct: 397 TCYD----LSGYASVR--VPTVALYFGRDGAALTL----PARNLLVEMGGGVYCLAFAAS 446
Query: 363 AEVGLQDLNVIGGI 376
A L+++G I
Sbjct: 447 AS----GLSILGNI 456
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 93/366 (25%), Positives = 146/366 (39%), Gaps = 47/366 (12%)
Query: 39 FQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA----PCVRCVEAP-- 90
F V G P G Y + +G P R +++ +DTGSD+ W+ C + P + P
Sbjct: 38 FPVQGTFDPFLVGLYYTRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLN 97
Query: 91 --HPLYRPSNDLVPCEDPICA-SLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFA 147
P P+ L+ C D C+ L + C Y +Y DG + G V D
Sbjct: 98 FFDPGSSPTASLISCSDQRCSLGLQSSDSVCSAQNNLCGYNFQYGDGSGTSGYYVSDLLH 157
Query: 148 FNYTNGQRL----NPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
F+ G + + + GC Q S +DGI G G+ S+VSQL SQ +
Sbjct: 158 FDTVLGGSVMNNSSAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISP 217
Query: 202 NVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN 259
HCL G GGG L G+ + +V+T + +Y+ + + G+T +
Sbjct: 218 RAFSHCLKGDDSGGGILVLGEIV--EPNIVYTPLVPS-QPHYNLNMQSISVNGQTLAID- 273
Query: 260 LPVVF----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
P VF DSG++ YL Y S + +S P KG
Sbjct: 274 -PSVFGTSSSQGTIIDSGTTLAYLAEAAYDPFISAITSIVSP---------SVRPYLSKG 323
Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQ 368
+ + F ++L+F G + L P+ YLI S+ G L + ++ Q
Sbjct: 324 NHCYLISSSINDIFPQVSLNFAGGASMILI---PQDYLIQQSSIGGAALWCIGFQKIQGQ 380
Query: 369 DLNVIG 374
+ ++G
Sbjct: 381 GITILG 386
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 100/345 (28%), Positives = 147/345 (42%), Gaps = 55/345 (15%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G Y + + IG P Y LDTGSDL W QC PC RC + P P++ P S V C
Sbjct: 106 GEYLIELAIGTPPVSYPAVLDTGSDLIWTQC-KPCTRCYKQPTPIFDPKKSSSFSKVSCG 164
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
+C++L + C D C+Y Y D + GVL + F F + + + GC
Sbjct: 165 SSLCSALPS---STCSD--GCEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGC 219
Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFGD- 220
G + G + G++GLG+G S+VSQL Q+ +CL+ L G
Sbjct: 220 GEDN-EGDGFEQASGLVGLGRGPLSLVSQLKEQRF-----SYCLTPIDDTKESVLLLGSL 273
Query: 221 -DLYDSSRVVWTSMSSD-------YTKYYSPGVAELFFGGETTGLK-----NLPVVFDSG 267
+ D+ VV T + + Y + V + E + + N V+ DSG
Sbjct: 274 GKVKDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDSG 333
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET----LPLCWKGRRPFKNVHDVKKCF 323
++ TY+ + Y+ L KKE +++ + D+T L LC+ V K F
Sbjct: 334 TTITYVQQKAYEAL----KKEFISQT--KLALDKTSSTGLDLCFSLPSGSTQVEIPKLVF 387
Query: 324 RTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGL 367
F G EL E Y+I SN G CL + GA G+
Sbjct: 388 H-----FKGGD----LELPAENYMIGDSNLGVACLAM--GASSGM 421
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 96/367 (26%), Positives = 154/367 (41%), Gaps = 44/367 (11%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA--PCVRCVEAPHPLYRPSN 98
V G +G Y V + +G P + L DTGSDL W++C A C R L R S
Sbjct: 79 VSGASTGSGQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSAFLARHST 138
Query: 99 DLVP--CEDPICASLHAPGHHNCEDP---AQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
P C D C + P HH C + C YE Y DG + G K+ N ++G
Sbjct: 139 TFSPNHCYDSACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKTSGFFSKETTTLNTSSG 198
Query: 154 QRLNPR-LALGCGYN----QVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVG 205
+ + +A GC + V GAS++ G++GLG+G S+ SQL + K ++
Sbjct: 199 REAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNKFSYCLMD 258
Query: 206 HCLSGGGGGFLFFGDDLYDSS----RVVWTSMSSD--YTKYYSPGVAELFFGG------- 252
H +S +L G D + R+ +T + + +Y G+ + G
Sbjct: 259 HDISPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKLPINP 318
Query: 253 ---ETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
L N + DSG++ T+L Y + +++K+ + S E LC
Sbjct: 319 SVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAE--PTPGFDLCV-- 374
Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQD 369
NV +++ R LSF G ++F P Y + +++ CL + A +
Sbjct: 375 -----NVSEIEHP-RLPKLSFKLGGD-SVFSPPPRNYFVDTDEDVKCLAL--QAVMTPSG 425
Query: 370 LNVIGGI 376
+VIG +
Sbjct: 426 FSVIGNL 432
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 89/353 (25%), Positives = 141/353 (39%), Gaps = 46/353 (13%)
Query: 29 LFNHVGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA----P 82
+ G + F V G P G Y + +G P + +++ +DTGSD+ W+ C++ P
Sbjct: 59 MLQSSGGVIDFSVSGTYDPFLVGLYYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCP 118
Query: 83 CVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSS 137
++ P + P + LV C D ICA C + QC Y +Y DG +
Sbjct: 119 ATSGLQIPLNFFDPGSSTTASLVSCSDQICALGVQSSDSACFGQSNQCAYVFQYGDGSGT 178
Query: 138 LGVLVKDAF----AFNYTNGQRLNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIV 191
G V D + + + + GC +Q S +DGI G G+ S++
Sbjct: 179 SGYYVMDMIHLDVVIDSSVTSNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVI 238
Query: 192 SQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELF 249
SQL S+ + V HCL G GGG L G+ + VV+T + +Y+ + +
Sbjct: 239 SQLSSRGIAPKVFSHCLKGDDSGGGILVLGEIV--EPNVVYTPLVPS-QPHYNLNLQSIS 295
Query: 250 FGGETTGLKNLPVVF----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPE 299
G+ + P VF DSG++ YL Y + +S
Sbjct: 296 VNGQVLPIS--PAVFATSSSQGTIIDSGTTLAYLAEEAYNAFVVAVTNIVS--------- 344
Query: 300 DETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK 352
T + KG R + V F ++L+F G + L + YLI N
Sbjct: 345 QSTQSVVLKGNRCYVTSSSVSDIFPQVSLNFAGGAS---LVLGAQDYLIQQNS 394
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 100/351 (28%), Positives = 148/351 (42%), Gaps = 50/351 (14%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y + +G PAR F+ LDTGSD+ W+QC APC +C P++ P+ +PC
Sbjct: 144 SGEYFTRLGVGTPARYVFMVLDTGSDVVWIQC-APCKKCYSQTDPVFNPTKSRSFANIPC 202
Query: 104 EDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
P+C L +PG C C Y++ Y DG + G + F G R+ R+AL
Sbjct: 203 GSPLCRRLDSPG---CSTKKHICLYQVSYGDGSFTYGEFSTETLTF---RGTRVG-RVAL 255
Query: 163 GCGY-NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDD 221
GCG+ N+ L G+ S + + S+K +V S ++ FGD
Sbjct: 256 GCGHDNEGLFIGAAGLLGLGRGRLSFPSQIGRRFSRKFSYCLVDRSAS-SKPSYMVFGDS 314
Query: 222 LYDSSRVVWTSMSSDY---TKYY------------SPGVAELFFGGETTGLKNLPVVFDS 266
S +T + S+ T YY PG+ F ++TG N V+ DS
Sbjct: 315 AI-SRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTG--NGGVIIDS 371
Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
G+S T L R Y L + + A +LK APE C+ +VK T+
Sbjct: 372 GTSVTRLTRPAYVALRDAFR--VGASNLKRAPEFSLFDTCFD----LSGKTEVK--VPTV 423
Query: 327 ALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
L F L YLI + N G+ C + L+++G I
Sbjct: 424 VLHFRGADV----SLPASNYLIPVDNSGSFCFAFAG----TMSGLSIVGNI 466
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 95/373 (25%), Positives = 164/373 (43%), Gaps = 39/373 (10%)
Query: 10 LCFPTVRMSSSSSSSS--SSSLFNHVGSSL--LFQVHGNVYPTGYYNVTMYIGQPARPYF 65
L F + +SS + + + LF +++ + Q N Y G + + +YIG P
Sbjct: 24 LLFHVLHLSSIEAQNDGFTIKLFRKTSNNIQNIVQAPINAY-IGQHLMEIYIGTPPIKIT 82
Query: 66 LDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCED 121
+DTGSDL W+QC APC+ C + P++ P + + + C+ P+C L C
Sbjct: 83 GLVDTGSDLIWIQC-APCLGCYKQIKPMFDPLKSSTYNNISCDSPLCHKLDT---GVCSP 138
Query: 122 PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALGCGYNQVPGASYHPLDGI 180
+C+Y Y D + GVL +D F G+ ++ R GCG+N G + H + G+
Sbjct: 139 EKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKPVSLSRFLFGCGHNNTGGFNDHEM-GL 197
Query: 181 LGLGKGKSSIVSQL--------HSQKLIRNVVGHCLSGG---GGGFLFFGDDLYDSSRVV 229
+GLG G +S++SQ+ SQ L+ + +S G G G+ + + V
Sbjct: 198 IGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSRMSFGKGSQVLGNGVVTTPLVP 257
Query: 230 WTSMSSDYTKYYSPGVAELFFG-GETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKE 288
+S + V + +F T G N+ V DSG+ L + Y + + ++ +
Sbjct: 258 REKDTSYFVTLLGISVEDTYFPMNSTIGKANMLV--DSGTPPILLPQQLYDKVFAEVRNK 315
Query: 289 LSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI 348
++ K + + P T LC++ + N+ F + + +T TP+
Sbjct: 316 VALKPITDDPSLGT-QLCYRTQ---TNLKGPTLTFHFVGANVLLTPIQTFIPPTPQT--- 368
Query: 349 ISNKGNVCLGILN 361
KG CL I N
Sbjct: 369 ---KGIFCLAIYN 378
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 91/303 (30%), Positives = 129/303 (42%), Gaps = 56/303 (18%)
Query: 45 VYPTG--YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SN 98
V P+G Y V + IG P +P LDTGSDL W QC APC C+ P PL+ P S
Sbjct: 88 VRPSGDLEYVVDLAIGTPPQPVSALLDTGSDLIWTQC-APCASCLSQPDPLFAPGQSASY 146
Query: 99 DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
+ + C +C+ + HH+CE P C Y Y DG ++GV + F F + G L
Sbjct: 147 EPMRCAGTLCSDIL---HHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTT 203
Query: 159 R---LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG-- 213
L GCG V S + GI+G G+ S+VSQL ++ +CL+
Sbjct: 204 TTVPLGFGCGSVNV--GSLNNGSGIVGFGRNPLSLVSQLSIRRF-----SYCLTSYASRR 256
Query: 214 -GFLFFG---DDLYDSS--RVVWTSM---SSDYTKYYSPGVAELFFGGETTGLKNLP--- 261
L FG D +Y + RV T + + T YY + F G T G + L
Sbjct: 257 QSTLLFGSLSDGVYGDATGRVQTTPLLQSPQNPTFYY------VHFTGLTVGARRLRIPE 310
Query: 262 ------------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEA-PEDET---LPL 305
V+ DSG++ T L + +++L PED +P
Sbjct: 311 SAFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAFRQQLRLPFANGGNPEDGVCFLVPA 370
Query: 306 CWK 308
W+
Sbjct: 371 AWR 373
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 97/380 (25%), Positives = 153/380 (40%), Gaps = 56/380 (14%)
Query: 34 GSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH 91
G + F V G P G Y + +G P + + + +DTGSD+ W+ C+ C C ++
Sbjct: 59 GGVVDFSVQGTSDPNSVGLYYTKVKMGTPPKEFNVQIDTGSDILWVNCNT-CSNCPQSSQ 117
Query: 92 ---------PLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVL 141
+ + L+PC DPIC S C QC Y +Y DG + G
Sbjct: 118 LGIELNFFDTVGSSTAALIPCSDPICTSRVQGAAAECSPRVNQCSYTFQYGDGSGTSGYY 177
Query: 142 VKDAFAFNYTNGQ----RLNPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLH 195
V DA F+ GQ + + GC +Q + +DGI G G G S+VSQL
Sbjct: 178 VSDAMYFSLIMGQPPAVNSSATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLS 237
Query: 196 SQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETT 255
S+ + V HCL G G G +V++ + +Y+ + + G+
Sbjct: 238 SRGITPKVFSHCLKGDGDGGGVLVLGEILEPSIVYSPLVPS-QPHYNLNLQSIAVNGQLL 296
Query: 256 GLKNLPVVF-----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
+ P VF D G++ YL + Y L + + +S + +
Sbjct: 297 PIN--PAVFSISNNRGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNS------ 348
Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
KG + + + F +++L+F G + L PE YL+ + G L+GAE
Sbjct: 349 ---KGNQCYLVSTSIGDIFPSVSLNFEGGASMV---LKPEQYLMHN-------GYLDGAE 395
Query: 365 ---VGLQDLNVIGGI-GDFV 380
+G Q I GD V
Sbjct: 396 MWCIGFQKFQEGASILGDLV 415
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 94/347 (27%), Positives = 143/347 (41%), Gaps = 68/347 (19%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY----RPSNDLVPC 103
+G Y V + IG P Y +DTGSDL W QC APC+ C + P P + + +PC
Sbjct: 86 SGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQC-APCLLCADQPTPYFDVKKSATYRALPC 144
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-RLAL 162
CASL +P C Y+ Y D S+ GVL + F F N ++ +A
Sbjct: 145 RSSRCASLSSPSCFK----KMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAF 200
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFG 219
GCG + G++G G+G S+VSQL + +CL+ L+FG
Sbjct: 201 GCG--SLNAGDLANSSGMVGFGRGPLSLVSQLGPSRF-----SYCLTSYLSATPSRLYFG 253
Query: 220 DDLYDSSRVVWTSMSSDYTK----------YYSPGVAELFF---GGETTGLKNLP----- 261
V+ ++SS T +P + ++F + G K LP
Sbjct: 254 ---------VYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLV 304
Query: 262 ----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR 311
V+ DSG+S T+L + Y+ + + + ++ + D L C++
Sbjct: 305 FAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIPLPAMND--TDIGLDTCFQWPP 362
Query: 312 PFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLII-SNKGNVCL 357
P +V L F D TL PE Y++I S G +CL
Sbjct: 363 P----PNVTVTVPDLVFHF-DSANMTLL---PENYMLIASTTGYLCL 401
>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
Length = 437
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 85/339 (25%), Positives = 155/339 (45%), Gaps = 49/339 (14%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEA-----PHPL 93
F + GN G Y + +G P + + +DTGSD+ W++C +PC C+ P +
Sbjct: 71 FPLKGNYSDLGLYYTEIGLGNPVQKLKVIVDTGSDILWVKC-SPCRSCLSKQDIIPPLSI 129
Query: 94 YR----PSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN 149
Y ++ + C DP+C + + A C Y Y D +S+G V+D +
Sbjct: 130 YNLSASSTSSVSSCSDPLCTGEEVVCSRSGNNSA-CAYVSSYQDKSASVGAYVRDDMHYV 188
Query: 150 YTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
G R+ GC N + G+ P+DGI+G G ++ +Q+ +Q+ + V HCL
Sbjct: 189 LHGGNATTSRIFFGCATN-ITGS--WPVDGIMGFGLISKTVPNQIATQRNMSRVFSHCLG 245
Query: 210 GG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL---------- 257
G GGG L FG+ +++ +V+T + + T +Y+ + + + +
Sbjct: 246 GEKHGGGILEFGEAP-NTTEMVFTPL-LNVTTHYNVDLLSISVNSKVLPIDPKEFSYVRN 303
Query: 258 --KNLPVVFDSGSSYTYL----NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR 311
N V+ DSG+++ L NR+ +Q + S+ +L P+ E L +
Sbjct: 304 STNNTGVIIDSGTTFVLLTTKANRMLFQEIKSLTTAKL-------GPKLEGLECFY---- 352
Query: 312 PFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIIS 350
K+ ++ F + L+F+ G T +L P+ YL+++
Sbjct: 353 -LKSGLTMETSFPNVTLTFSGGST---MKLKPDNYLVMA 387
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 82/262 (31%), Positives = 116/262 (44%), Gaps = 22/262 (8%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL- 100
G+ +G Y VT+ +G P L DTGSDLTW QC PCVR C + P++ PS
Sbjct: 96 GSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQ-PCVRTCYDQKEPIFNPSKSTS 154
Query: 101 ---VPCEDPICASL-HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
V C C SL A G+ + C Y ++Y D S+G L K+ F TN
Sbjct: 155 YYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTL--TNSDVF 212
Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGG 214
+ + GCG N + + G+LGLG+ K S SQ + + +CL S G
Sbjct: 213 D-GVYFGCGENN--QGLFTGVAGLLGLGRDKLSFPSQ--TATAYNKIFSYCLPSSASYTG 267
Query: 215 FLFFGDD-LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-----TTGLKNLPVVFDSGS 268
L FG + S + S +D T +Y + + GG+ +T + DSG+
Sbjct: 268 HLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGT 327
Query: 269 SYTYLNRVTYQTLTSIMKKELS 290
T L Y L S K ++S
Sbjct: 328 VITRLPPKAYAALRSSFKAKMS 349
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 122/283 (43%), Gaps = 27/283 (9%)
Query: 46 YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----V 101
Y YY ++ IG P + +DTGSD W QC PC C+ P++ PS +
Sbjct: 85 YAGSYYVMSYSIGTPPFQLYGVVDTGSDGIWFQCK-PCKPCLNQTSPIFNPSKSSTYKNI 143
Query: 102 PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRL 160
C PIC + +C+YE+ Y D S G + KD N +G ++ P++
Sbjct: 144 RCSSPICKR-GEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSNDGSPISFPKI 202
Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS-----GGGGGF 215
+GCG+ + GI+G G+G SIVSQL S I +CL+
Sbjct: 203 VIGCGHKN-SLTTEGLASGIIGFGRGNFSIVSQLGSS--IGGKFSYCLASLFSKANISSK 259
Query: 216 LFFGDDLYDSSRVVWTS--MSSDYTKYYSPGVAELFFGGETTGLKN---LP-----VVFD 265
L+FGD S V ++ + S Y Y + G LK+ +P V D
Sbjct: 260 LYFGDMAVVSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDSSLIPDNEGNAVID 319
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
SGS+ T L Y L + + + K +K+ + L LC+K
Sbjct: 320 SGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQ--LSLCYK 360
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 93/327 (28%), Positives = 142/327 (43%), Gaps = 42/327 (12%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSN----DLVPCED 105
+ VT+ G PA+ Y + DTGSD++W+QC PC C + P++ P+ +VPC
Sbjct: 135 FVVTVGFGTPAQTYTVIFDTGSDVSWIQC-LPCSGHCYKQHDPIFDPTKSATYSVVPCGH 193
Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
P CA+ N C Y++EY DG SS GVL + + T R P A GCG
Sbjct: 194 PQCAAADGSKCSN----GTCLYKVEYGDGSSSAGVLSHETLSLTST---RALPGFAFGCG 246
Query: 166 YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG--GFLFFGDDLY 223
+ + +DG++GLG+G+ S+ SQ + +CL G+L G
Sbjct: 247 QTNL--GDFGDVDGLIGLGRGQLSLSSQAAAS--FGGTFSYCLPSDNTTHGYLTIGPTTP 302
Query: 224 DSS-RVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------DSGSSYTYL 273
S+ V +T+M DY +Y + + GG L P +F DSG+ TYL
Sbjct: 303 ASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYI--LPVPPTLFTDDGTFLDSGTILTYL 360
Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
Y L K + K AP + C+ F + ++ F+DG
Sbjct: 361 PPEAYTALRDRFK--FTMTQYKPAPAYDPFDTCYD----FTGQSAI--FIPAVSFKFSDG 412
Query: 334 KTRTLFELTPEAYLIISNKGNVCLGIL 360
++F+L+ LI + +G L
Sbjct: 413 ---SVFDLSFFGILIFPDDTAPAIGCL 436
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 82/285 (28%), Positives = 123/285 (43%), Gaps = 38/285 (13%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G Y +T +G P + +DTGSD+ WLQC+ PC C P++ PS +PC
Sbjct: 85 GEYLMTYSVGTPPFKLYGIVDTGSDIVWLQCE-PCQECYNQTTPMFNPSKSSSYKNIPCP 143
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
+C S+ +C D C+Y Y D S G L D TNG ++ P + +G
Sbjct: 144 SKLCQSME---DTSCNDKNYCEYSTYYGDNSHSGGDLSVDTLTLESTNGLTVSFPNIVIG 200
Query: 164 CGYNQV---PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---------GG 211
CG N + GAS GI+G G G +S ++QL S + +CL+
Sbjct: 201 CGTNNILSYEGAS----SGIVGFGSGPASFITQLGSSTGGK--FSYCLTPLFSVTNIQSN 254
Query: 212 GGGFLFFGDDLYDSSRVVWTS--MSSDYTKYY-------SPGVAELFFGGETTGLKNLPV 262
L FGD S V T+ + D +Y S G + GG G +
Sbjct: 255 ATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVEIGGVPNGDNEGNI 314
Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
+ DSG++ T L + Y L S + + + + + +TL LC+
Sbjct: 315 IIDSGTTLTSLTKDDYSFLESAVVDLVKLERVDDPT--QTLNLCY 357
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 97/336 (28%), Positives = 139/336 (41%), Gaps = 33/336 (9%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL- 100
G+ +G Y VT+ +G P L DTGSDLTW QC PCVR C + P++ PS
Sbjct: 124 GSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQ-PCVRTCYDQKEPIFNPSKSTS 182
Query: 101 ---VPCEDPICASL-HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
V C C SL A G+ + C Y ++Y D S+G L K+ F TN
Sbjct: 183 YYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTL--TNSDVF 240
Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGG 214
+ + GCG N + + G+LGLG+ K S SQ + + +CL S G
Sbjct: 241 D-GVYFGCGENNQ--GLFTGVAGLLGLGRDKLSFPSQ--TATAYNKIFSYCLPSSASYTG 295
Query: 215 FLFFGD-DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-----TTGLKNLPVVFDSGS 268
L FG + S + S +D T +Y + + GG+ +T + DSG+
Sbjct: 296 HLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGT 355
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
T L Y L S K ++S L C+ FK V K +A
Sbjct: 356 VITRLPPKAYAALRSSFKAKMSKYPTTSGV--SILDTCFD-LSGFKTVTIPK-----VAF 407
Query: 329 SFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
SF+ G + EL + + VCL ++
Sbjct: 408 SFSGG---AVVELGSKGIFYVFKISQVCLAFAGNSD 440
>gi|302774304|ref|XP_002970569.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
gi|300162085|gb|EFJ28699.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
Length = 490
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 98/359 (27%), Positives = 158/359 (44%), Gaps = 45/359 (12%)
Query: 36 SLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYR 95
S +H ++ GYY + IG P + L +D S ++ P +
Sbjct: 20 SARMDLHDDLLTKGYYTSRVKIGTPPHEFSLIVDRSS---FVSPKTMFCSFFFLQDPRFS 76
Query: 96 P--SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN- 152
P S+ P E C + + G C+ + Y+ +YA+ +S GVL KD +F+ ++
Sbjct: 77 PALSSSYKPLE---CGNECSTGF--CDGSRK--YQRQYAEKSTSSGVLGKDVISFSNSSD 129
Query: 153 --GQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
GQRL GC + DGI+GLG+G SI+ QL + + +V C G
Sbjct: 130 LGGQRL----VFGCETAETGDLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGG 185
Query: 211 ---GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK------NLP 261
GGG + G +V+TS + YY+ + + GG LK
Sbjct: 186 MDEGGGAMILGG--FQPPKDMVFTSSDPHRSPYYNLMLKGIRVGGSPLRLKPEVFDGKYG 243
Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEA--PEDETLPLCWKGRRPFKNVHDV 319
V DSG++Y Y +Q S +K+++ SLKE P+++ +C+ G NV ++
Sbjct: 244 TVLDSGTTYAYFPGAAFQAFKSAVKEQVG--SLKEVPGPDEKFKDICYAGAG--TNVSNL 299
Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEVGLQDLNVIGGI 376
+ F ++ F DG++ T L+PE YL K G CLG+ + ++GGI
Sbjct: 300 SQFFPSVDFVFGDGQSVT---LSPENYLFRHTKISGAYCLGVFENGD----PTTLLGGI 351
>gi|218191589|gb|EEC74016.1| hypothetical protein OsI_08957 [Oryza sativa Indica Group]
Length = 520
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 82/317 (25%), Positives = 145/317 (45%), Gaps = 41/317 (12%)
Query: 43 GNVYPTG-----YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------VE 88
G+++P+G Y + +G P + + LDTGSDL W+ CD C++C ++
Sbjct: 89 GSIFPSGNDLGWLYYTWVDVGTPNTSFLVALDTGSDLFWVPCD--CIQCAPLSSYHGSLD 146
Query: 89 APHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLV 142
+Y+PS +PC +C+ C +P Q C Y ++Y ++ +S G+L+
Sbjct: 147 RDLGIYKPSESTTSRHLPCSHELCSPASG-----CTNPKQPCPYNIDYFSENTTSSGLLI 201
Query: 143 KDAFAFNYTNGQR-LNPRLALGCGYNQVPGASYH---PLDGILGLGKGKSSIVSQLHSQK 198
+D + G +N + +GCG Q SY DG+LGLG S+ S L
Sbjct: 202 EDMLHLDSREGHAPVNASVIIGCGKKQ--SGSYLEGIAPDGLLGLGMADISVPSFLARAG 259
Query: 199 LIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK 258
L+RN C G +FFGD + + + + Y+ V + G + T
Sbjct: 260 LVRNSFSMCFKKDDSGRIFFGDQGVPTQQSTPFVPMNGKLQTYAVNVDKYCIGHKCTEGA 319
Query: 259 NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
+ D+G+S+T L Y+++T K+++A + + +D + C+ P + + D
Sbjct: 320 GFQALVDTGTSFTSLPLDAYKSITMEFDKQINAS--RASSDDYSFEYCYS-TGPLE-MPD 375
Query: 319 VKKCFRTLALSFTDGKT 335
V T+ L+F + K+
Sbjct: 376 VP----TITLTFAENKS 388
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 53/153 (34%), Positives = 77/153 (50%), Gaps = 10/153 (6%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDP 106
Y + + +G P +P LDTGSDL W QCD C C+ P PL+ P S + + C
Sbjct: 98 YVLDLAVGTPPQPITALLDTGSDLIWTQCDT-CTACLRQPDPLFSPRMSSSYEPMRCAGQ 156
Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
+C + HH+C P C Y Y DG ++LG + F F ++G+ + L GCG
Sbjct: 157 LCGDIL---HHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGCGT 213
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
V S + GI+G G+ S+VSQL ++
Sbjct: 214 MNV--GSLNNASGIVGFGRDPLSLVSQLSIRRF 244
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 97/334 (29%), Positives = 142/334 (42%), Gaps = 45/334 (13%)
Query: 52 NVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPI 107
N +G A + +DT S+LTW+QC PC C + PL+ PS+ VPC
Sbjct: 119 NYVATVGLGAAEATVVVDTASELTWVQCQ-PCESCHDQQDPLFDPSSSPSYAAVPCNSSS 177
Query: 108 CASLH---APGHHNCED-----PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
C +L A G C D PA C Y L Y DG S GVL +D GQ +
Sbjct: 178 CDALRVAMAAGTSPCADDNEQQPA-CSYALSYRDGSYSRGVLARDKLRL---AGQDIE-G 232
Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFL 216
GCG + GA + G++GLG+ S+VSQ Q V +CL G G L
Sbjct: 233 FVFGCGTSN-QGAPFGGTSGLMGLGRSHVSLVSQTMDQ--FGGVFSYCLPMRESGSSGSL 289
Query: 217 FFGDD---LYDSSRVVWTSMSSD----YTKYYSPGVAELFFGG---ETTGLKNLPVVFDS 266
GDD +S+ +V+T+M SD +Y + + GG E+ V+ DS
Sbjct: 290 VLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQEVESPWFSAGRVIIDS 349
Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
G+ T L Y + + +L+ +AP L C+ N+ +K+ +
Sbjct: 350 GTIITTLVPSVYNAVRAEFLSQLA--EYPQAPAFSILDTCF-------NLTGLKE-VQVP 399
Query: 327 ALSFT-DGKTRTLFELTPEAYLIISNKGNVCLGI 359
+L F +G + Y + S+ VCL +
Sbjct: 400 SLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLAL 433
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 53/153 (34%), Positives = 77/153 (50%), Gaps = 10/153 (6%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDP 106
Y + + +G P +P LDTGSDL W QCD C C+ P PL+ P S + + C
Sbjct: 98 YVLDLAVGTPPQPITALLDTGSDLIWTQCDT-CTACLRQPDPLFSPRMSSSYEPMRCAGQ 156
Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
+C + HH+C P C Y Y DG ++LG + F F ++G+ + L GCG
Sbjct: 157 LCGDIL---HHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGCGT 213
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
V S + GI+G G+ S+VSQL ++
Sbjct: 214 MNV--GSLNNASGIVGFGRDPLSLVSQLSIRRF 244
>gi|115448709|ref|NP_001048134.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|46390211|dbj|BAD15642.1| aspartyl protease-like [Oryza sativa Japonica Group]
gi|113537665|dbj|BAF10048.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|222623681|gb|EEE57813.1| hypothetical protein OsJ_08401 [Oryza sativa Japonica Group]
Length = 520
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 82/317 (25%), Positives = 145/317 (45%), Gaps = 41/317 (12%)
Query: 43 GNVYPTG-----YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------VE 88
G+++P+G Y + +G P + + LDTGSDL W+ CD C++C ++
Sbjct: 89 GSIFPSGNDLGWLYYTWVDVGTPNTSFLVALDTGSDLFWVPCD--CIQCAPLSSYHGSLD 146
Query: 89 APHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLV 142
+Y+PS +PC +C+ C +P Q C Y ++Y ++ +S G+L+
Sbjct: 147 RDLGIYKPSESTTSRHLPCSHELCSPASG-----CTNPKQPCPYNIDYFSENTTSSGLLI 201
Query: 143 KDAFAFNYTNGQR-LNPRLALGCGYNQVPGASYH---PLDGILGLGKGKSSIVSQLHSQK 198
+D + G +N + +GCG Q SY DG+LGLG S+ S L
Sbjct: 202 EDMLHLDSREGHAPVNASVIIGCGKKQ--SGSYLEGIAPDGLLGLGMADISVPSFLARAG 259
Query: 199 LIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK 258
L+RN C G +FFGD + + + + Y+ V + G + T
Sbjct: 260 LVRNSFSMCFKKDDSGRIFFGDQGVPTQQSTPFVPMNGKLQTYAVNVDKYCIGHKCTEGA 319
Query: 259 NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHD 318
+ D+G+S+T L Y+++T K+++A + + +D + C+ P + + D
Sbjct: 320 GFQALVDTGTSFTSLPLDAYKSITMEFDKQINAS--RASSDDYSFEYCYS-TGPLE-MPD 375
Query: 319 VKKCFRTLALSFTDGKT 335
V T+ L+F + K+
Sbjct: 376 VP----TITLTFAENKS 388
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 94/345 (27%), Positives = 143/345 (41%), Gaps = 45/345 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y +T+ G P R + DTGSD+ WLQC VRC PL+ PS V C
Sbjct: 13 SGNYVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDPSLSSTYRNVSC 72
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+P C L G + + C Y + Y DG S++G L D F T Q+ G
Sbjct: 73 TEPACVGLSTRGCSS----STCLYGVFYGDGSSTIGFLAMDTFML--TPAQKFK-NFIFG 125
Query: 164 CGYNQVPGASYHPLDGILGLGKGKS-SIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGD 220
CG N + G++GLG+ + S+ SQ+ + NV +CL + G+L G+
Sbjct: 126 CGQNNT--GLFQGTAGLVGLGRSSTYSLNSQVAPS--LGNVFSYCLPSTSSATGYLNIGN 181
Query: 221 DLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGG-----ETTGLKNLPVVFDSGSSYTYL 273
+T+M +D Y + + GG +T +++ + DSG+ T L
Sbjct: 182 PQNTPG---YTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQSVGTIIDSGTVITRL 238
Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
Y L + ++ ++ +L AP L C+ R V+ V + L F
Sbjct: 239 PPTAYSALKTAVRAAMTQYTL--APAVTILDTCYDFSRTTSVVYPV------IVLHFAGL 290
Query: 334 KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGD 378
R + + N VCL A G D +IG IG+
Sbjct: 291 DVR----IPATGVFFVFNSSQVCL-----AFAGNTDSTMIGIIGN 326
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 87/343 (25%), Positives = 140/343 (40%), Gaps = 46/343 (13%)
Query: 39 FQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP------ 90
F V G P G Y + +G P + + +DTGSD+ W+ C++ C C +
Sbjct: 11 FSVQGTFDPFQVGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNS-CSGCPQTSGLQIQL 69
Query: 91 ---HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAF 146
P ++ ++ C D C + C QC Y +Y DG + G V D
Sbjct: 70 NFFDPGSSSTSSMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMM 129
Query: 147 AFN--YTNGQRLNPR--LALGCGYNQVPG---ASYHPLDGILGLGKGKSSIVSQLHSQKL 199
N + N + GC NQ G S +DGI G G+ + S++SQL SQ +
Sbjct: 130 HLNTIFEGSVTTNSTAPVVFGCS-NQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGI 188
Query: 200 IRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL 257
V HCL G GGG L G+ + +V+TS+ +Y+ + + G+T +
Sbjct: 189 APRVFSHCLKGDSSGGGILVLGEIV--EPNIVYTSL-VPAQPHYNLNLQSIAVNGQTLQI 245
Query: 258 --------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
+ + DSG++ YL Y S + + +S+ A +G
Sbjct: 246 DSSVFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAITASI-PQSVHTAVS--------RG 296
Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK 352
+ + V + F ++L+F G + L P+ YLI N
Sbjct: 297 NQCYLITSSVTEVFPQVSLNFAGGASMI---LRPQDYLIQQNS 336
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 92/338 (27%), Positives = 141/338 (41%), Gaps = 39/338 (11%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDP 106
Y +T+ +G P + + +DTGSD++W+QC PC +C PL+ P + C
Sbjct: 133 YLITVRLGSPGKSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCSSA 191
Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
CA L G + C +QC Y + Y DG S+ G D A +N R + GC
Sbjct: 192 ACAQLGQEG-NGCSS-SQCQYTVTYGDGSSTTGTYSSDTLALG-SNAVR---KFQFGC-- 243
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYD 224
+ V DG++GLG G S+VSQ + +CL + GFL G
Sbjct: 244 SNVESGFNDQTDGLMGLGGGAQSLVSQ--TAGTFGAAFSYCLPATSSSSGFLTLG---AG 298
Query: 225 SSRVVWTSM--SSDYTKYYSPGVAELFFGGET----TGLKNLPVVFDSGSSYTYLNRVTY 278
+S V T M SS +Y + + GG T + + + DSG+ T L Y
Sbjct: 299 TSGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVFSAGTIMDSGTVLTRLPPTAY 358
Query: 279 QTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTL 338
L+S K + K AP L C+ F V T+AL F+ G +
Sbjct: 359 SALSSAFKAGM--KQYPSAPPSGILDTCFD----FSGQSSVS--IPTVALVFSGGA---V 407
Query: 339 FELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
++ + ++ ++ +CL A L +IG +
Sbjct: 408 VDIASDGIMLQTSNSILCLAF--AANSDDSSLGIIGNV 443
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 87/353 (24%), Positives = 153/353 (43%), Gaps = 43/353 (12%)
Query: 48 TGYYNVTMYI--GQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP-----HPLYRPSNDL 100
T + MY+ G P DTGSDL W+ C + + HP + L
Sbjct: 95 TRSFEYLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPSRSTTYSL 154
Query: 101 VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF----NYTNGQRL 156
+ C+ C +L +C+ ++C Y+ Y DG ++GVL + F+F GQ
Sbjct: 155 LSCQSAACQALS---QASCDADSECQYQYAYGDGSRTIGVLSTETFSFAAAGGGGEGQVR 211
Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGG 211
PR++ GC A DG++GLG G S+VSQL + I +CL +
Sbjct: 212 VPRVSFGCSTGS---AGSFRSDGLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYAAAN 268
Query: 212 GGGFLFFGDD--LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-VVFDSGS 268
L FG + D + S+ YY+ + + G+ N ++ DSG+
Sbjct: 269 SSSTLSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAGQDVASANSSRIIVDSGT 328
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW--KGRRPFKN--VHDVKKCFR 324
+ T+L+ + L + +++ + + + P ++ L LC+ +G+ ++ + DV
Sbjct: 329 TLTFLDPALLRPLVAELERRI--RLPRAQPPEQLLQLCYDVQGKSQAEDFGIPDVT---- 382
Query: 325 TLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIG 377
L F G + T L PE + +G +CL ++ +E Q ++++G I
Sbjct: 383 ---LRFGGGASVT---LRPENTFSLLEEGTLCLVLVPVSES--QPVSILGNIA 427
>gi|195619700|gb|ACG31680.1| pepsin A [Zea mays]
Length = 485
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 96/350 (27%), Positives = 156/350 (44%), Gaps = 47/350 (13%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------VEAPHPLYRPSNDL- 100
Y + +G PA + + LDTGSDL W+ CD C++C ++ +YRP+
Sbjct: 66 YYAWVDVGTPATSFLVALDTGSDLFWVPCD--CIQCAPLSGYRGNLDRDLRIYRPAESTT 123
Query: 101 ---VPCEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQ- 154
+PC +C S+ PG C +P Q C Y ++Y ++ +S G+L++D NY
Sbjct: 124 SRHLPCSHELCQSV--PG---CTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHV 178
Query: 155 RLNPRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
+N + +GCG Q + G + DG+LGLG S+ S L L++N C
Sbjct: 179 PVNASVIIGCGQKQSGDYLDGIA---PDGLLGLGMADISVPSFLARAGLVQNSFSMCFKE 235
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
G +FFGD S + + Y+ V + G + + + DSG+S+
Sbjct: 236 DSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGTSF 295
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
T L Y+ T K+++A + ED T C+ P + + DV T+ L+F
Sbjct: 296 TSLPLDVYKAFTMEFDKQMNATRVPY--EDTTWKYCYSA-SPLE-MPDVP----TITLTF 347
Query: 331 TDGKTRTLFELTPEAYLIISNK----GNVCLGILNGAE-VGLQDLNVIGG 375
K +L + P L ++K CL +L E +G+ N + G
Sbjct: 348 AADK--SLQAVNP--ILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVG 393
>gi|226495123|ref|NP_001141522.1| uncharacterized protein LOC100273634 precursor [Zea mays]
gi|194704920|gb|ACF86544.1| unknown [Zea mays]
gi|223949445|gb|ACN28806.1| unknown [Zea mays]
gi|413924531|gb|AFW64463.1| pepsin A [Zea mays]
Length = 515
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 96/350 (27%), Positives = 156/350 (44%), Gaps = 47/350 (13%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------VEAPHPLYRPSNDL- 100
Y + +G PA + + LDTGSDL W+ CD C++C ++ +YRP+
Sbjct: 96 YYAWVDVGTPATSFLVALDTGSDLFWVPCD--CIQCAPLSGYRGNLDRDLRIYRPAESTT 153
Query: 101 ---VPCEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQ- 154
+PC +C S+ PG C +P Q C Y ++Y ++ +S G+L++D NY
Sbjct: 154 SRHLPCSHELCQSV--PG---CTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHV 208
Query: 155 RLNPRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
+N + +GCG Q + G + DG+LGLG S+ S L L++N C
Sbjct: 209 PVNASVIIGCGQKQSGDYLDGIA---PDGLLGLGMADISVPSFLARAGLVQNSFSMCFKE 265
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
G +FFGD S + + Y+ V + G + + + DSG+S+
Sbjct: 266 DSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGTSF 325
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
T L Y+ T K+++A + ED T C+ P + + DV T+ L+F
Sbjct: 326 TSLPFDVYKAFTMEFDKQMNATRVPY--EDTTWKYCYSA-SPLE-MPDVP----TITLTF 377
Query: 331 TDGKTRTLFELTPEAYLIISNK----GNVCLGILNGAE-VGLQDLNVIGG 375
K +L + P L ++K CL +L E +G+ N + G
Sbjct: 378 AADK--SLQAVNP--ILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVG 423
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 88/287 (30%), Positives = 123/287 (42%), Gaps = 43/287 (14%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSND----LVPCE 104
Y VT+ IG PA + +DTGSDL+W+QC PC C PL+ PS +PC
Sbjct: 125 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCK-PCNASDCYPQKDPLFDPSKSSTFATIPCA 183
Query: 105 DPICASLHAPGHHN-CED-----PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
C L G+ N C + P QC Y +EY +G + GV + A + +
Sbjct: 184 SDACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETLALGSSAVVK--- 240
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS--GGGGGFL 216
GCG +Q Y DG+LGLG S+VSQ S + +CL G GFL
Sbjct: 241 SFRFGCGSDQ--HGPYDKFDGLLGLGGAPESLVSQTAS--VYGGAFSYCLPPLNSGAGFL 296
Query: 217 FFG---DDLYDSSRVVWTSMSSDYTKYYSPGVAELF---FGGETTGLKNL---PVVF--- 264
G +S V+T M + +SP +A + G + G K L P VF
Sbjct: 297 TLGAPNSTNNSNSGFVFTPMHA-----FSPKIATFYVVTLTGISVGGKALDIPPAVFAKG 351
Query: 265 ---DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
DSG+ T + Y+ L + + ++ L P D L C+
Sbjct: 352 NIVDSGTVITGIPTTAYKALRTAFRSAMAEYPLLP-PADSALDTCYN 397
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 96/321 (29%), Positives = 141/321 (43%), Gaps = 48/321 (14%)
Query: 68 LDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICASLH-------APGH 116
+DT S+LTW+QC APC C + PL+ PS+ VPC+ P C +L G
Sbjct: 158 VDTASELTWVQC-APCESCHDQQGPLFDPSSSPSYAAVPCDSPSCDALQQQLATGAGAGA 216
Query: 117 HNCE--DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASY 174
C+ PA C Y L Y DG S GVL D + G+ ++ GCG + G +
Sbjct: 217 PPCDAGRPAACSYALSYRDGSYSRGVLAHDRLSL---AGEVIDG-FVFGCGTSN-QGPPF 271
Query: 175 HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDD---LYDSSR 227
G++GLG+ + S+VSQ Q V +CL G L GDD +S+
Sbjct: 272 GGTSGLMGLGRSQLSLVSQTVDQ--FGGVFSYCLPLSRESDASGSLVLGDDPSAYRNSTP 329
Query: 228 VVWTSMSSD-----YTKYYSPGVAELFFGG---ETTGLKNLPVVFDSGSSYTYLNRVTYQ 279
VV+TSM S+ +Y + + GG E+TG +V DSG+ T L Y
Sbjct: 330 VVYTSMVSNSDPLLQGPFYLVNLTGITVGGQEVESTGFSARAIV-DSGTVITSLVPSVYN 388
Query: 280 TLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC-FRTLALSFTDGKTRTL 338
+ + +L+ +AP L C+ N+ +K+ +L L F DG
Sbjct: 389 AVRAEFMSQLA--EYPQAPGFSILDTCF-------NMTGLKEVQVPSLTLVF-DGGAEVE 438
Query: 339 FELTPEAYLIISNKGNVCLGI 359
+ Y + S+ VCL +
Sbjct: 439 VDSGGVLYFVSSDSSQVCLAV 459
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 91/334 (27%), Positives = 137/334 (41%), Gaps = 32/334 (9%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SNDLVP--C 103
+G Y + + +G P + Y + LDTGS L+WLQC V C PL+ P SN P C
Sbjct: 117 SGNYYLKLGLGSPPKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTYRPLYC 176
Query: 104 EDPICASLHAPGHHN--CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
C+ L A ++ C C Y Y D S+G L +D T Q L P
Sbjct: 177 SSSECSLLKAATLNDPLCTASGVCVYTASYGDASYSMGYLSRDLLTL--TPSQTL-PSFT 233
Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFF 218
GCG Q + GI+GL + K S+++QL + +CL + GGGFL
Sbjct: 234 YGCG--QDNEGLFGKAAGIVGLARDKLSMLAQLSPK--YGYAFSYCLPTSTSSGGGFLSI 289
Query: 219 GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK----NLPVVFDSGSSYTYLN 274
G S + +S Y +A + G G+ +P + DSG+ T L
Sbjct: 290 GKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVPTIIDSGTVVTRLP 349
Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR-RPFKNVHDVKKCFRTLALSFTDG 333
Y L K +S + ++AP L C+KG + +++ F+ A
Sbjct: 350 ISIYAALREAFVKIMS-RRYEQAPAYSILDTCFKGSLKSMSGAPEIRMIFQGGA------ 402
Query: 334 KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGL 367
L LI ++KG CL + ++ +
Sbjct: 403 ----DLSLRAPNILIEADKGIACLAFASSNQIAI 432
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 85/318 (26%), Positives = 127/318 (39%), Gaps = 50/318 (15%)
Query: 39 FQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------V 87
F V G+ P G Y + +G P + YF+ +DTGSD+ W+ C +PC C +
Sbjct: 77 FPVEGSANPFMVGLYFTRVKLGSPPKEYFVQIDTGSDILWVAC-SPCTGCPSSSGLNIQL 135
Query: 88 EAPHPLYRPSNDLVPCEDPICASLHAPGHHNCE--DPAQCDYELEYADGGSSLGVLVKDA 145
E +P ++ +PC D C + C+ D + C Y Y DG + G V D
Sbjct: 136 EFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDT 195
Query: 146 FAFNYT--NGQRLN--PRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKL 199
F+ N Q N + GC +Q + +DGI G G+ + S+VSQL+S +
Sbjct: 196 MYFDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGV 255
Query: 200 IRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKY------------YSPGV 245
V HCL G GGG L G+ + +V+T + Y P
Sbjct: 256 SPKVFSHCLKGSDNGGGILVLGEIV--EPGLVYTPLVPSQPHYNLNLESIVVNGQKLPID 313
Query: 246 AELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
+ LF T G + DSG++ YL Y + + +S L
Sbjct: 314 SSLFTTSNTQG-----TIVDSGTTLAYLADGAYDPFVNAITAAVSPS---------VRSL 359
Query: 306 CWKGRRPFKNVHDVKKCF 323
KG + F + CF
Sbjct: 360 VSKGNQCFVTSSRLASCF 377
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 93/340 (27%), Positives = 147/340 (43%), Gaps = 51/340 (15%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G Y + M IG P R Y LDTGSDL W QC APC+ CV+ P P + P+ + C
Sbjct: 88 GEYLMEMGIGTPTRYYSAILDTGSDLIWTQC-APCLLCVDQPTPYFDPARSATYRSLGCA 146
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
P C +L+ P + C Y+ Y D S+ GVL + F F TN R++ P ++ G
Sbjct: 147 SPACNALYYPLCYQ----KVCVYQYFYGDSASTAGVLANETFTFG-TNETRVSLPGISFG 201
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF---LFFGD 220
CG + G++G G+G S+VSQL S + +CL+ L+FG
Sbjct: 202 CG--NLNAGLLANGSGMVGFGRGSLSLVSQLGSPRF-----SYCLTSFLSPVPSRLYFGV 254
Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFF---GGETTGLKNLPV--------------- 262
+S + +P + ++F G + G LP+
Sbjct: 255 YATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGG 314
Query: 263 -VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
+ DSG++ TYL Y + + +++ L + L C++ P + + +
Sbjct: 315 TIIDSGTTITYLAEPAYDAVRAAFASQITLP-LLNVTDASVLDTCFQWPPPPRQSVTLPQ 373
Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLII--SNKGNVCLGI 359
L L F DG +EL + Y+++ S G +CL +
Sbjct: 374 ----LVLHF-DGAD---WELPLQNYMLVDPSTGGGLCLAM 405
>gi|413924530|gb|AFW64462.1| hypothetical protein ZEAMMB73_591827, partial [Zea mays]
Length = 469
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 81/281 (28%), Positives = 128/281 (45%), Gaps = 36/281 (12%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------VEAPHPLYRPSNDL- 100
Y + +G PA + + LDTGSDL W+ CD C++C ++ +YRP+
Sbjct: 96 YYAWVDVGTPATSFLVALDTGSDLFWVPCD--CIQCAPLSGYRGNLDRDLRIYRPAESTT 153
Query: 101 ---VPCEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQ- 154
+PC +C S+ PG C +P Q C Y ++Y ++ +S G+L++D NY
Sbjct: 154 SRHLPCSHELCQSV--PG---CTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHV 208
Query: 155 RLNPRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
+N + +GCG Q + G + DG+LGLG S+ S L L++N C
Sbjct: 209 PVNASVIIGCGQKQSGDYLDGIA---PDGLLGLGMADISVPSFLARAGLVQNSFSMCFKE 265
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKY--YSPGVAELFFGGETTGLKNLPVVFDSGS 268
G +FFGD S + T Y K Y+ V + G + + + DSG+
Sbjct: 266 DSSGRIFFGDQGVPSQQS--TPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGT 323
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
S+T L Y+ T K+++A + ED T C+
Sbjct: 324 SFTSLPFDVYKAFTMEFDKQMNATRVPY--EDTTWKYCYSA 362
>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
Length = 536
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 92/314 (29%), Positives = 131/314 (41%), Gaps = 37/314 (11%)
Query: 29 LFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVE 88
LF GS LF GN +Y + IG P + + LD GSDL W+ CD C++C
Sbjct: 88 LFPSQGSQALF--FGNELDWLHY-TWIDIGTPNVSFLVALDAGSDLLWVPCD--CIQCAP 142
Query: 89 APHPLYRPSNDLVPCEDPICASLHAPGHH------------NCEDPAQ-CDYELEYAD-- 133
Y S D E SL + H NC++P C Y Y D
Sbjct: 143 LSASYYNISLDRDLSE--YSPSLSSTSRHLSCDHQLCEWGSNCKNPKDPCPYIFNYDDFE 200
Query: 134 GGSSLGVLVKDAFAF----NYTNGQRLNPRLALGCGYNQVPGASYH---PLDGILGLGKG 186
+S G LV+D ++T + L + LGCG Q G S+ DG++GLG G
Sbjct: 201 NTTSAGFLVEDKLHLASVGDHTARKMLQASVVLGCGRKQ--GGSFFDGAAPDGVMGLGPG 258
Query: 187 KSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVV-WTSMSSDYTKYYSPGV 245
S+ S L LI+N C G + FGD + S + + + Y Y+ GV
Sbjct: 259 DISVPSLLAKAGLIQNCFSLCFDENDSGRILFGDRGHASQQSTPFLPIQGTYVAYFV-GV 317
Query: 246 AELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
G + DSGSS+TYL Y L S K+++AK + + +D
Sbjct: 318 ESYCVGNSCLKRSGFKALVDSGSSFTYLPSEVYNELVSEFDKQVNAKRI--SFQDGLWDY 375
Query: 306 CWKGRRPFKNVHDV 319
C+ + +HD+
Sbjct: 376 CYNASS--QELHDI 387
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 87/365 (23%), Positives = 148/365 (40%), Gaps = 45/365 (12%)
Query: 39 FQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA----PCVRCVEAPHP 92
F V G P G Y + +G P + + +DTGSD+ W+ C++ P ++
Sbjct: 64 FSVQGTFDPFQVGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLN 123
Query: 93 LYRP----SNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFA 147
+ P ++ ++ C D C + C QC Y +Y DG + G V D
Sbjct: 124 FFDPGSSSTSSMIACSDQRCNNGKQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMH 183
Query: 148 FN--YTNGQRLNPR--LALGCGYNQVPG---ASYHPLDGILGLGKGKSSIVSQLHSQKLI 200
N + N + GC NQ G S +DGI G G+ + S++SQL SQ +
Sbjct: 184 LNTIFEGSMTTNSTAPVVFGCS-NQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIA 242
Query: 201 RNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL- 257
+ HCL G GGG L G+ + +V+TS+ +Y+ + + G+T +
Sbjct: 243 PRIFSHCLKGDSSGGGILVLGEIV--EPNIVYTSLVP-AQPHYNLNLQSISVNGQTLQID 299
Query: 258 -------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
+ + DSG++ YL Y S ++ A + +G
Sbjct: 300 SSVFATSNSRGTIVDSGTTLAYLAEEAYDPFVS---------AITAAIPQSVRTVVSRGN 350
Query: 311 RPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK-GNVCLGILNGAEVGLQD 369
+ + V F ++L+F G + L P+ YLI N G + + ++ Q
Sbjct: 351 QCYLITSSVTDVFPQVSLNFAGGASMI---LRPQDYLIQQNSIGGAAVWCIGFQKIQGQG 407
Query: 370 LNVIG 374
+ ++G
Sbjct: 408 ITILG 412
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 81/298 (27%), Positives = 131/298 (43%), Gaps = 46/298 (15%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
TG Y + M++G P + +L LDTGSDL+W+QCD PC C E Y P + + C
Sbjct: 168 TGEYFLDMFVGTPPKHVWLILDTGSDLSWIQCD-PCYDCFEQNGSHYYPKDSSTYRNISC 226
Query: 104 EDPIC--ASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT--NGQRLNPR 159
DP C S P H + C Y +YADG ++ G + F N T NG+ +
Sbjct: 227 YDPRCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQ 286
Query: 160 LA---LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-----G 211
+ GCG+ ++ G+LGLG+G S SQ+ Q + + +CL+
Sbjct: 287 VVDVMFGCGHWN--KGFFYGASGLLGLGRGPISFPSQI--QSIYGHSFSYCLTDLFSNTS 342
Query: 212 GGGFLFFGDD--LYDSSRVVWTSM-----SSDYTKYYSPGVAELFFGGETTGLKN----- 259
L FG+D L ++ + +T++ + D T YY + + GGE +
Sbjct: 343 VSSKLIFGEDKELLNNHNLNFTTLLAGEETPDETFYYLQ-IKSIMVGGEVLDISEQTWHW 401
Query: 260 ----------LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
+ DSGS+ T+ Y + +K++ + + A +D + C+
Sbjct: 402 SSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQI--AADDFVMSPCY 457
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 76/251 (30%), Positives = 118/251 (47%), Gaps = 21/251 (8%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPI 107
TG Y V++ +G P + L DTGSDLTW +C A E P S V C P+
Sbjct: 131 TGNYIVSIGLGSPKKDLMLIFDTGSDLTWARCSA-----AETFDPTKSTSYANVSCSTPL 185
Query: 108 CAS-LHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C+S + A G+ + + C Y ++Y DG S+G L K+ T+ + GCG
Sbjct: 186 CSSVISATGNPSRCAASTCVYGIQYGDGSYSIGFLGKERLTIGSTD---IFNNFYFGCGQ 242
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFFGDDLYDS 225
+ V G + G+LGLG+ K S+VSQ + + +CL S GFL FG S
Sbjct: 243 D-VDGL-FGKAAGLLGLGRDKLSVVSQTAPK--YNQLFSYCLPSSSSTGFLSFGSSQSKS 298
Query: 226 SRVVWTSMSSDYTKYYSPGVAELFFGGETTGL-----KNLPVVFDSGSSYTYLNRVTYQT 280
++ +T +SS + +Y+ + + GG+ + + DSG+ T L Y
Sbjct: 299 AK--FTPLSSGPSSFYNLDLTGITVGGQKLAIPLSVFSTAGTIIDSGTVVTRLPPAAYSA 356
Query: 281 LTSIMKKELSA 291
L S +K +++
Sbjct: 357 LRSAFRKAMAS 367
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 84/339 (24%), Positives = 139/339 (41%), Gaps = 39/339 (11%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G Y ++ +G P + +DTGS++ WLQC PC C P++ PS +PC
Sbjct: 87 GEYLISYSVGTPPFKVYGFMDTGSNIVWLQCQ-PCNTCFNQTSPIFNPSKSSSYKNIPCT 145
Query: 105 DPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQR-LNPRLAL 162
C + H +C + C+Y + Y S G L D+ + T+G L P + +
Sbjct: 146 SSTCKDTNDT-HISCSNGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTSGSSVLFPNIVI 204
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLF 217
GCG+ V + G++G+G+G S++ Q+ S + + +CL L
Sbjct: 205 GCGHINVLQDNSQS-SGVVGMGRGPMSLIKQVGSSS-VGSKFSYCLIPYNSDSNSSSKLI 262
Query: 218 FGDDLYDSSRVVWTS---MSSDYTKYYSPGVAELFFG------GETTGLKNLPVVFDSGS 268
FG+D+ S +V ++ + YY + G GE + ++ DSG+
Sbjct: 263 FGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRIEYGERSNASTQNILIDSGT 322
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
T L + L S + +E+ ++ P D L LC+ NV D+ F +
Sbjct: 323 PLTMLPNLFLSKLVSYVAQEVKLPRIE--PPDHHLSLCYNTTGKQLNVPDITAHFNGADV 380
Query: 329 SFTDGKTRTLFELTPEAYLIISNKGNVCLGIL--NGAEV 365
T FE G +C G + NG E+
Sbjct: 381 KLNSNGTFFPFE-----------DGIMCFGFISSNGLEI 408
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 80/279 (28%), Positives = 119/279 (42%), Gaps = 30/279 (10%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y V + +G P +L +D+GSD+ W+QC PC +C PL+ P+ V C
Sbjct: 127 SGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR-PCEQCYAQTDPLFDPAASSSFSGVSC 185
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
IC +L G D +CDY + Y DG + G L + T Q +A+G
Sbjct: 186 GSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTAVQ----GVAIG 241
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFGD 220
CG+ + G+LGLG G S+V QL V +CL+ GG G L G
Sbjct: 242 CGHRN--SGLFVGAAGLLGLGWGAMSLVGQLGGAA--GGVFSYCLASRGAGGAGSLVLGR 297
Query: 221 DLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNL----------PVVFDSGS 268
VW + ++ + +Y G+ + GGE L++ VV D+G+
Sbjct: 298 TEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGT 357
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
+ T L R Y L + A L +P L C+
Sbjct: 358 AVTRLPREAYAALRGAFDGAMGA--LPRSPAVSLLDTCY 394
>gi|255586856|ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223525945|gb|EEF28342.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 533
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 95/352 (26%), Positives = 154/352 (43%), Gaps = 50/352 (14%)
Query: 50 YYNVTMYIGQPARPYFLDLDTGSDLTWLQCD---APCVRCVEAPH------PLYRP---- 96
Y NV+ IG P+ Y + LDTGSDL WL CD + CV+ ++ P +YRP
Sbjct: 114 YANVS--IGTPSLSYLVALDTGSDLFWLPCDCTNSGCVQGLQFPSGEQIDFNIYRPNASS 171
Query: 97 SNDLVPCEDPICASLHAPGHHNCEDP-AQCDYELEY-ADGGSSLGVLVKDAFAFNYTNGQ 154
++ +PC + +C+ C + C Y+++Y ++G SS GVLV+D + Q
Sbjct: 172 TSQTIPCNNTLCSR-----QSRCPSAQSTCPYQVQYLSNGTSSTGVLVEDLLHLTTDDAQ 226
Query: 155 R--LNPRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
L+ ++ GCG Q + GA+ +G+ GLG S+ S L + N C
Sbjct: 227 SRALDAKIIFGCGRVQTGSFLDGAA---PNGLFGLGMTNISVPSTLAREGYTSNSFSMCF 283
Query: 209 SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGS 268
G G + FGD ++ + Y+ + ++ GG L+ +FDSG+
Sbjct: 284 GRDGIGRISFGDTGSSGQGETPFNLRQLHPT-YNVSITKINVGGRDADLE-FSAIFDSGT 341
Query: 269 SYTYLNRVTYQTLT---SIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
S+TYLN Y ++ +I KE S+ + P C++ N+ T
Sbjct: 342 SFTYLNDPAYTLISESFNIGAKEKRYSSISDIP----FEYCYEMSSNQTNLE-----IPT 392
Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGN--VCLGILNGAEVGLQDLNVIGG 375
+ L G F +T ++I G CL I+ +V + N + G
Sbjct: 393 VNLVMQGGSQ---FNVTDPIVIVILQGGASIYCLAIVKSGDVNIIGQNFMTG 441
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 91/351 (25%), Positives = 153/351 (43%), Gaps = 45/351 (12%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV-EAPHPLYRP--S 97
V G +G Y V + IGQP + L DTGSDL W++C A C C +P ++ P S
Sbjct: 74 VSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSA-CRNCSHHSPATVFFPRHS 132
Query: 98 NDLVP--CEDPICASL----HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
+ P C DP+C + AP ++ + C YE YADG + G+ ++ + +
Sbjct: 133 STFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTS 192
Query: 152 NGQRLNPR-LALGCGY----NQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNV 203
+G+ + +A GCG+ V G S++ +G++GLG+G S SQL + K +
Sbjct: 193 SGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCL 252
Query: 204 VGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGG--------- 252
+ + LS +L G+ S++ +T + ++ +Y + +F G
Sbjct: 253 MDYTLSPPPTSYLIIGNGGDGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSI 312
Query: 253 -ETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP---LCWK 308
E N V DSG++ +L Y+++ + +++ +K D P LC
Sbjct: 313 WEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRR-----VKLPIADALTPGFDLCVN 367
Query: 309 GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGI 359
V +K L F+ G +F P Y I + + CL I
Sbjct: 368 ----VSGVTKPEKILPRLKFEFSGG---AVFVPPPRNYFIETEEQIQCLAI 411
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 100/352 (28%), Positives = 145/352 (41%), Gaps = 44/352 (12%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL----VP 102
TG Y V + +G P + L DTGSDLTW QC PCV+ C P++ PS +
Sbjct: 151 TGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQ-PCVKSCYAQQQPIFDPSTSKTYSNIS 209
Query: 103 CEDPICASLH-APGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
C C+SL A G+ + C Y ++Y D ++G KD + +
Sbjct: 210 CTSAACSSLKSATGNSPGCSSSNCVYGIQYGDSSFTIGFFAKDKLTLTQND---VFDGFM 266
Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFG 219
GCG N + G++GLG+ SIV Q +QK + +CL S G G L FG
Sbjct: 267 FGCGQNN--KGLFGKTAGLIGLGRDPLSIVQQT-AQKFGK-YFSYCLPTSRGSNGHLTFG 322
Query: 220 D-DLYDSSRVVWTSM------SSDYTKYYSPGVAELFFGGETTGL-----KNLPVVFDSG 267
+ + +S+ V + SS T YY V + GG+ + +N + DSG
Sbjct: 323 NGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQNAGTIIDSG 382
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
+ T L Y +L S K+ +S AP L C+ N + ++
Sbjct: 383 TVITRLPSTAYGSLKSAFKQFMS--KYPTAPALSLLDTCYD----LSNYTSIS--IPKIS 434
Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDF 379
+F EL P LI + VCL A G D + IG G+
Sbjct: 435 FNFNGNAN---VELDPNGILITNGASQVCL-----AFAGNGDDDSIGIFGNI 478
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 73/266 (27%), Positives = 113/266 (42%), Gaps = 47/266 (17%)
Query: 44 NVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLYRPSN 98
+++ G Y + +G P + +++D+DTGS++ W++C APC C V P + P
Sbjct: 34 DIFAMGLYYTRISLGTPPQQFYVDVDTGSNVAWVKC-APCTGCEHSGDVPVPMSTFDPRK 92
Query: 99 DL----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNY---- 150
+ C D C L+ + E C Y L Y DG S+ G + D F FN
Sbjct: 93 STTKISISCTDAECGVLNKKLQCSPER-LSCPYSLLYGDGSSTAGYYLNDVFTFNQVPSD 151
Query: 151 -TNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
+ + RL GCG Q S +DG+LG G S+ +QL Q + N+ HCL
Sbjct: 152 NSTAKSGTARLVFGCGGTQTGSWS---VDGLLGFGPTTVSLPNQLAQQNISVNIFAHCLQ 208
Query: 210 GGGGGF-----------------LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGG 252
G G + FG+D Y+ V ++ +P +L + G
Sbjct: 209 GDVSGRGSLVIGTIREPDLVYTPMVFGEDHYN---VQLLNIGISGRNVTTPASFDLEYTG 265
Query: 253 ETTGLKNLPVVFDSGSSYTYLNRVTY 278
V+ DSG++ TYL + Y
Sbjct: 266 G--------VIIDSGTTLTYLVQPAY 283
>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
Length = 388
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 79/263 (30%), Positives = 112/263 (42%), Gaps = 38/263 (14%)
Query: 46 YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP- 96
Y TG Y + IG PA Y++ LDTGS W+ + C + PH Y P
Sbjct: 78 YGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVNG----ISCKQCPHESDILRKLTFYDPR 133
Query: 97 ---SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN--YT 151
S+ V C+D IC S P C +C Y YADGG ++G+L D ++ Y
Sbjct: 134 SSVSSKEVKCDDTICTS-RPP----CNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYG 188
Query: 152 NGQR--LNPRLALGCGYNQVPGA--SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
NGQ + + GCG Q S +DGI+G G + +SQL + + + HC
Sbjct: 189 NGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHC 248
Query: 208 L-SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------K 258
L S GGG G+ + +V T + + Y+ + + G T L K
Sbjct: 249 LDSTNGGGIFAIGEVV--EPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTK 306
Query: 259 NLPVVFDSGSSYTYLNRVTYQTL 281
DSGS+ YL + Y L
Sbjct: 307 TKGTFIDSGSTLVYLPEIIYSEL 329
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 92/348 (26%), Positives = 145/348 (41%), Gaps = 41/348 (11%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
G TG Y VT+ +G P Y + DTGSD TW+QC V C E L+ P+
Sbjct: 172 GRALGTGNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTY 231
Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
V C P C+ L+ H C C Y ++Y DG S+G D + + +
Sbjct: 232 ANVSCAAPACSDLNI---HGCSG-GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVK--- 284
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
GCG + G+LGLG+GK+S+ Q + + V HCL G G+L
Sbjct: 285 GFRFGCGERNE--GLFGEAAGLLGLGRGKTSLPVQTYDK--YGGVFAHCLPARSTGTGYL 340
Query: 217 FF--GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP--------VVFDS 266
F G S+R+ ++ + +Y G+ + GG+ L ++P + DS
Sbjct: 341 DFGAGSLAAASARLTTPMLTDNGPTFYYVGMTGIRVGGQ---LLSIPQSVFATAGTIVDS 397
Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
G+ T L Y +L ++A+ K+AP L C+ F + V T+
Sbjct: 398 GTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYD----FTGMSQVA--IPTV 451
Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
+L F G ++ + ++ VCL + G D+ ++G
Sbjct: 452 SLLFQGGAR---LDVDASGIMYAASASQVCLAFAANEDGG--DVGIVG 494
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 79/294 (26%), Positives = 126/294 (42%), Gaps = 42/294 (14%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND--- 99
G + TG Y + +G P R +L +DTGSD+TWLQC APC C + L+ PS+
Sbjct: 8 GLAFGTGEYFAVVGVGTPRRDMYLVVDTGSDITWLQC-APCTNCYKQKDALFNPSSSSSF 66
Query: 100 -LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN--YTNGQRL 156
++ C +C +L G + +C Y+ +Y DG ++G LV D + + GQ +
Sbjct: 67 KVLDCSSSLCLNLDVMGCLS----NKCLYQADYGDGSFTMGELVTDNVVLDDAFGPGQVV 122
Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-----G 211
+ LGCG++ ++ GILGLG+G S + L + RN+ +CL
Sbjct: 123 LTNIPLGCGHDN--EGTFGTAAGILGLGRGPLSFPNNLDAST--RNIFSYCLPDRESDPN 178
Query: 212 GGGFLFFGDDLY-----DSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV---- 262
L FGD S + + + YY + + GG L N+P
Sbjct: 179 HKSTLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNL--LTNIPASVFQ 236
Query: 263 ---------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
+FDSG++ T L Y + + + L A + + C+
Sbjct: 237 LDSHGNGGTIFDSGTTITRLEARAYTAVRDAFRA--ATMHLTSAADFKIFDTCY 288
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 98/374 (26%), Positives = 154/374 (41%), Gaps = 55/374 (14%)
Query: 39 FQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP------ 90
F V G+ P G Y + +G P + + +DTGSD+ W+ C++ C C +
Sbjct: 65 FSVEGSSDPLLVGLYFTKVKLGTPPMEFTVQIDTGSDILWVNCNS-CNGCPRSSGLGIQL 123
Query: 91 ---HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAF 146
S+ LV C DPIC S C + QC Y +Y DG + G V ++
Sbjct: 124 NFFDASSSSSSSLVSCSDPICNSAFQTTATQCLTQSNQCSYTFQYGDGSGTSGYYVSESM 183
Query: 147 AFNYTNGQRL----NPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLI 200
F+ GQ + + + GC Q S H +DGI G G G S++SQL ++ +
Sbjct: 184 YFDMVMGQSMIANSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGIT 243
Query: 201 RNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK 258
V HCL G GGG L G+ L +V++ + +Y+ + + G+T +
Sbjct: 244 PKVFSHCLKGEGNGGGILVLGEVL--EPGIVYSPLVPS-QPHYNLYLQSISVNGQTLPID 300
Query: 259 --------NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
N + DSG++ YL Y S ++ A P KG
Sbjct: 301 PSVFATSINRGTIIDSGTTLAYLVEEAYTPFVS---------AITAAVSQSVTPTISKGN 351
Query: 311 RPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAE---VGL 367
+ + V + F ++L+F + L PE YL+ LG +GA +G
Sbjct: 352 QCYLVSTSVGEIFPLVSLNFAGSASMV---LKPEEYLM-------HLGFYDGAALWCIGF 401
Query: 368 QDLNV-IGGIGDFV 380
Q + + +GD V
Sbjct: 402 QKVQEGVTILGDLV 415
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 81/279 (29%), Positives = 119/279 (42%), Gaps = 30/279 (10%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y V + +G P +L +D+GSD+ W+QC PC +C PL+ P+ V C
Sbjct: 127 SGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR-PCEQCYAQTDPLFDPAASSSFSGVSC 185
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
IC +L G D +CDY + Y DG + G L + T Q +A+G
Sbjct: 186 GSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTAVQ----GVAIG 241
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFGD 220
CG+ + G+LGLG G S++ QL V +CL+ GG G L G
Sbjct: 242 CGHRN--SGLFVGAAGLLGLGWGAMSLIGQLGGAA--GGVFSYCLASRGAGGAGSLVLGR 297
Query: 221 DLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGE----TTGLKNLP------VVFDSGS 268
VW + ++ + +Y G+ + GGE GL L VV D+G+
Sbjct: 298 TEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGAGGVVMDTGT 357
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
+ T L R Y L + A L +P L C+
Sbjct: 358 AVTRLPREAYAALRGAFDGAMGA--LPRSPAVSLLDTCY 394
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 89/366 (24%), Positives = 149/366 (40%), Gaps = 47/366 (12%)
Query: 39 FQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP------ 90
F V G P G Y + +G P + + +DTGSD+ W+ C++ C C +
Sbjct: 61 FSVQGTFDPFQVGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNS-CSGCPQTSGLQIQL 119
Query: 91 ---HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAF 146
P ++ ++ C D C + C QC Y +Y DG + G V D
Sbjct: 120 NFFDPGSSSTSSMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMM 179
Query: 147 AFN--YTNGQRLNPR--LALGCGYNQVPG---ASYHPLDGILGLGKGKSSIVSQLHSQKL 199
N + N + GC NQ G S +DGI G G+ + S++SQL SQ +
Sbjct: 180 HLNTIFEGSVTTNSTAPVVFGCS-NQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGI 238
Query: 200 IRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL 257
V HCL G GGG L G+ + +V+TS+ +Y+ + + G+T +
Sbjct: 239 APRVFSHCLKGDSSGGGILVLGEIV--EPNIVYTSLVP-AQPHYNLNLQSIAVNGQTLQI 295
Query: 258 --------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
+ + DSG++ YL Y S + + P+ + +G
Sbjct: 296 DSSVFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAITASI--------PQ-SVHTVVSRG 346
Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK-GNVCLGILNGAEVGLQ 368
+ + V + F ++L+F G + L P+ YLI N G + + ++ Q
Sbjct: 347 NQCYLITSSVTEVFPQVSLNFAGGASMI---LRPQDYLIQQNSIGGAAVWCIGFQKIQGQ 403
Query: 369 DLNVIG 374
+ ++G
Sbjct: 404 GITILG 409
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 93/348 (26%), Positives = 145/348 (41%), Gaps = 41/348 (11%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
G TG Y VT+ +G PA Y + DTGSD TW+QC V C E L+ P
Sbjct: 170 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPVRSSTY 229
Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
V C P C+ L+ H C C Y ++Y DG S+G D + + +
Sbjct: 230 ANVSCAAPACSDLNI---HGCSG-GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVK--- 282
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
GCG + G+LGLG+GK+S+ Q + + V HCL G G+L
Sbjct: 283 GFRFGCGERNE--GLFGEAAGLLGLGRGKTSLPVQTYDK--YGGVFAHCLPARSTGTGYL 338
Query: 217 FF--GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP--------VVFDS 266
F G S+R+ ++ + +Y G+ + GG+ L ++P + DS
Sbjct: 339 DFGAGSPAAASARLTTPMLTDNGPTFYYIGMTGIRVGGQ---LLSIPQSVFATAGTIVDS 395
Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
G+ T L Y +L ++A+ K+AP L C+ F + V T+
Sbjct: 396 GTVITRLPPPAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYD----FTGMSQVA--IPTV 449
Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
+L F G ++ + ++ VCL + G D+ ++G
Sbjct: 450 SLLFQGGAR---LDVDASGIMYAASASQVCLAFAANEDGG--DVGIVG 492
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 90/352 (25%), Positives = 146/352 (41%), Gaps = 47/352 (13%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G Y + + +G P P DTGSD+ W QC+ PC C + P++ PS V C
Sbjct: 83 GEYLMKLSVGTPPFPIIAVADTGSDIIWTQCE-PCTNCYQQDLPMFNPSKSTTYRKVSCS 141
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
P+C+ ++C C Y + Y D S G D T+G+ + PR A+G
Sbjct: 142 SPVCS--FTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIG 199
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS-----GGGGGFLFF 218
CG++ G+ + GI+GLG G +S++ Q+ S + +CL+ GG L F
Sbjct: 200 CGHDNA-GSFDANVSGIVGLGLGPASLIKQMGSA--VGGKFSYCLTPIGNDDGGSNKLNF 256
Query: 219 GDDLYDS-SRVVWTS--MSSDYTKYYSPGVAELFFGGETT----------GLKNLPVVFD 265
G + S S V T +S + +YS + + G T G N ++ D
Sbjct: 257 GSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKAN--IIID 314
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
SG++ T L Y + ++ + + ++ L C++ D K F
Sbjct: 315 SGTTLTLLPVDLYHNFAKAISNSINLQRTDD--PNQFLEYCFE-----TTTDDYKVPF-- 365
Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIG 377
+A+ F R L E LI + +CL + D+++ G I
Sbjct: 366 IAMHFEGANLR----LQRENVLIRVSDNVICLAFAGAQD---NDISIYGNIA 410
>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
gi|194693730|gb|ACF80949.1| unknown [Zea mays]
gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
Length = 519
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 75/275 (27%), Positives = 122/275 (44%), Gaps = 24/275 (8%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------VEAPHPLYRPSNDLV 101
Y + +G P + + LDTGSDL W+ CD C++C ++ +Y+P+
Sbjct: 100 YYAWVDVGTPTTSFLVALDTGSDLFWVPCD--CIQCAPLSSYRGNLDRDLGIYKPAESTT 157
Query: 102 PCEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQR-LNP 158
P L PG C +P Q C Y ++Y ++ +S G+L++D+ N G +N
Sbjct: 158 SRHLPCSHELCQPGS-GCTNPKQPCTYNIDYFSENTTSSGLLIEDSLHLNSREGHAPVNA 216
Query: 159 RLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
+ +GCG Q + G + DG+LGLG S+ S L L+RN C G
Sbjct: 217 SVIIGCGRKQSGDYLDGIA---PDGLLGLGMADISVPSFLARAGLVRNSFSMCFKEDSSG 273
Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLN 274
+FFGD S + + Y+ V + G + + + DSG+S+T L
Sbjct: 274 RIFFGDQGVSSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGSSFQALVDSGTSFTSLP 333
Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
Y+ T+ K+++A + ED T C+
Sbjct: 334 PDVYKAFTTEFDKQINASRVPY--EDSTWKYCYSA 366
>gi|168021169|ref|XP_001763114.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685597|gb|EDQ71991.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 641
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 72/228 (31%), Positives = 102/228 (44%), Gaps = 43/228 (18%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC--VEAPHPLYRPSNDL-VPCEDPI 107
Y V M +G+ + + +DTGS +WL C P + V P+ +Y P ++ V C P
Sbjct: 126 YYVKMRVGKSKKLFHFLIDTGSQPSWLHCKWPAIEKHPVAGPNGMYVPEKEVQVDCRSPE 185
Query: 108 CASLHA-PGHHN-------CEDP--AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
C SL P + N C +P +C Y++ Y D G V+D + G++L+
Sbjct: 186 CLSLQRIPSNFNNIRNLFPCNEPNDWRCTYDITYLDRSHLRGFYVQDVVSLATLEGEQLD 245
Query: 158 PRLALGCGYNQVPGA-----SYH--------------PL--DGILGLGKGKSSIVSQLHS 196
++ LG A S+H PL DG+LGL KG S VSQL
Sbjct: 246 AKITLGYATPNHRAAPFGFCSWHASSDRYGEEELERSPLTTDGLLGLNKGTESFVSQLKR 305
Query: 197 QKLI-RNVVGHCLSG-------GGGGFLFFGD-DLYDSSRVVWTSMSS 235
Q I +VVGHC GF+FFG L DS + W+ M+S
Sbjct: 306 QGAISSHVVGHCFRSLDTTDFETNSGFMFFGKSKLLDSLPITWSPMAS 353
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 85/346 (24%), Positives = 139/346 (40%), Gaps = 47/346 (13%)
Query: 39 FQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA----PCVRCVEAPHP 92
F V G P G Y + +G P + +++ +DTGSD+ W+ C + P ++ P
Sbjct: 70 FPVQGTFNPFLVGLYFTRVQLGSPPKDFYVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLT 129
Query: 93 LYRPSND----LVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFA 147
+ P + LV C D C + C QC Y +Y DG + G V D
Sbjct: 130 FFDPGSSTTAALVSCSDQRCTAGIQSSDSLCSSRTNQCGYTFQYGDGSGTSGYYVADLMH 189
Query: 148 FN---YTNG------QRLNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHS 196
+ ++G Q + ++ C Q S +DGI G G+ + S++SQL S
Sbjct: 190 LDTLLLSSGELSQICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLAS 249
Query: 197 QKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGET 254
Q + V HCL G GGG L G+ + +V+T + +Y+ + + G+T
Sbjct: 250 QGITPRVFSHCLKGDDSGGGVLVLGEIV--EPNIVYTPLVPS-QPHYNLYLQSISVAGQT 306
Query: 255 TGL--------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLC 306
+ N + DSG++ YL Y S + +S +
Sbjct: 307 LAIDPSVFGASSNQGTIVDSGTTLAYLAEGAYDPFVSAITSVVSLNARTYLS-------- 358
Query: 307 WKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK 352
KG + + V F ++L+F G + L P+ YL+ N
Sbjct: 359 -KGNQCYLVTSSVNDVFPQVSLNFAGGAS---LILNPQDYLLQQNS 400
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 73/239 (30%), Positives = 108/239 (45%), Gaps = 38/239 (15%)
Query: 45 VYPTG--YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SN 98
V P+G Y V + +G P +P LDTGSDL W QC APC C+ P P++ P S
Sbjct: 96 VRPSGDLEYLVDLAVGTPPQPVSALLDTGSDLIWTQC-APCASCLPQPDPIFSPGASSSY 154
Query: 99 DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF----NYTNGQ 154
+ + C +C + HH+C+ P C Y Y DG ++ GV + F F +
Sbjct: 155 EPMRCAGELCNDIL---HHSCQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETT 211
Query: 155 RLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GG 211
+L+ L GCG + S + GI+G G+ S+VSQL ++ +CL+ G
Sbjct: 212 KLSAPLGFGCG--TMNKGSLNNGSGIVGFGRAPLSLVSQLAIRRF-----SYCLTPYASG 264
Query: 212 GGGFLFFGD---DLYDSSRVVWTSM-----SSDYTKYYSPGVAELFFGGETTGLKNLPV 262
L FG +YD++ + + T YY P F G T G + L +
Sbjct: 265 RKSTLLFGSLRGGVYDAATATVQTTRLLRSRQNPTFYYVP------FTGVTVGARRLRI 317
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 93/371 (25%), Positives = 158/371 (42%), Gaps = 67/371 (18%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G Y T+ +G PA+ + + DTGSDL W+QC PC C P++ P S + C
Sbjct: 38 GDYVTTISLGTPAKVFSVIADTGSDLIWIQC-KPCQACFNQKDPIFDPEGSSSYTTMSCG 96
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALG 163
D +C SL +C CDY Y DG + G L + T G++L + +A G
Sbjct: 97 DTLCDSLP---RKSCS--PDCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFG 151
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFF 218
CG+ + S++ G++GLG+G S VSQL L + +CL + +FF
Sbjct: 152 CGH--LNRGSFNDASGLVGLGRGNLSFVSQLG--DLFGHKFSYCLVPWRDAPSKTSPMFF 207
Query: 219 GDDLYDSSRVVWTSMSSDYTKY-YSPGVAELFFGGETTGLKNLPV--------------- 262
GD+ SS + +T ++P + ++ LK++ +
Sbjct: 208 GDE--SSSHSSGKKLHYAFTPMIHNPAMESFYY----VKLKDISIAGRALRIPAGSFDIK 261
Query: 263 -------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
+FDSG++ T L YQ + ++ ++S + + L LC+ +
Sbjct: 262 PDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGS--SAGLDLCY-------D 312
Query: 316 VHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGN--VCLGILNG-AEVGL----- 367
V K ++ + ++L E Y I +N VCL +++ ++G+
Sbjct: 313 VSGSKASYKMKIPAMVFHFEGADYQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMM 372
Query: 368 -QDLNVIGGIG 377
Q+ V+ IG
Sbjct: 373 QQNFRVMYDIG 383
>gi|219887985|gb|ACL54367.1| unknown [Zea mays]
Length = 515
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 95/350 (27%), Positives = 155/350 (44%), Gaps = 47/350 (13%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---------VEAPHPLYRPSNDL- 100
Y + +G PA + + LDTGSDL W+ CD C++C ++ +YRP+
Sbjct: 96 YYAWVDVGTPATSFLVALDTGSDLFWVPCD--CIQCAPLSGYRGNLDRDLRIYRPAESTT 153
Query: 101 ---VPCEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQ- 154
+PC +C S+ PG C +P Q C Y ++Y ++ +S G+L++D NY
Sbjct: 154 SRHLPCSHELCQSV--PG---CTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHV 208
Query: 155 RLNPRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
+N + +GCG Q + G + DG+L LG S+ S L L++N C
Sbjct: 209 PVNASVIIGCGQKQSGDYLDGIA---PDGLLALGMADISVPSFLARAGLVQNSFSMCFKE 265
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
G +FFGD S + + Y+ V + G + + + DSG+S+
Sbjct: 266 DSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGTSF 325
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
T L Y+ T K+++A + ED T C+ P + + DV T+ L+F
Sbjct: 326 TSLPFDVYKAFTMEFDKQMNATRVPY--EDTTWKYCYSA-SPLE-MPDVP----TITLTF 377
Query: 331 TDGKTRTLFELTPEAYLIISNK----GNVCLGILNGAE-VGLQDLNVIGG 375
K +L + P L ++K CL +L E +G+ N + G
Sbjct: 378 AADK--SLQAVNP--ILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVG 423
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 90/352 (25%), Positives = 145/352 (41%), Gaps = 47/352 (13%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G Y + + +G P P DTGSD+ W QC PC C + P++ PS V C
Sbjct: 83 GEYLMKLSVGTPPFPIIAVADTGSDIIWTQC-VPCTNCYQQDLPMFNPSKSTTYRKVSCS 141
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
P+C+ ++C C Y + Y D S G D T+G+ + PR A+G
Sbjct: 142 SPVCS--FTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIG 199
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS-----GGGGGFLFF 218
CG++ G+ + GI+GLG G +S++ Q+ S + +CL+ GG L F
Sbjct: 200 CGHDNA-GSFDANVSGIVGLGLGPASLIKQMGSA--VGGKFSYCLTPIGNDDGGSNKLNF 256
Query: 219 GDDLYDS-SRVVWTS--MSSDYTKYYSPGVAELFFGGETT----------GLKNLPVVFD 265
G + S S V T +S + +YS + + G T G N ++ D
Sbjct: 257 GSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKAN--IIID 314
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
SG++ T L Y + ++ + + ++ L C++ D K F
Sbjct: 315 SGTTLTLLPVDLYHNFAKAISNSINLQRTDD--PNQFLEYCFE-----TTTDDYKVPF-- 365
Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIG 377
+A+ F R L E LI + +CL + D+++ G I
Sbjct: 366 IAMHFEGANLR----LQRENVLIRVSDNVICLAFAGAQD---NDISIYGNIA 410
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 87/332 (26%), Positives = 140/332 (42%), Gaps = 38/332 (11%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G Y +T +G P + DTGSD+ WLQC+ PC +C P++ PS +PC
Sbjct: 85 GGYLMTYSVGTPPTKIYGIADTGSDIVWLQCE-PCEQCYNQTTPIFNPSKSSSYKNIPCL 143
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
+C H+ +C D C Y++ Y D S G L D + T+G ++ P+ +G
Sbjct: 144 SKLC---HSVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKTVIG 200
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL------SGGGGGFLF 217
CG + G GI+GLG G S+++QL S I +CL L
Sbjct: 201 CGTDNA-GTFGGASSGIVGLGGGPVSLITQLGSS--IGGKFSYCLVPLLNKESNASSILS 257
Query: 218 FGDDLYDSSRVVWTS--MSSDYTKY------YSPGVAELFFGGETTGLKNL-PVVFDSGS 268
FGD S V ++ + D Y +S G + FGG + G + ++ DSG+
Sbjct: 258 FGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSGT 317
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR-----PFKNVH----DV 319
+ T + Y L S + + + + ++ LC+ + P H D+
Sbjct: 318 TLTLIPSDVYTNLESAVVDLVKLDRVDDP--NQQFSLCYSLKSNEYDFPIITAHFKGADI 375
Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISN 351
+ + + TDG F+ +P+ I N
Sbjct: 376 ELHSISTFVPITDGIVCFAFQPSPQLGSIFGN 407
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 90/340 (26%), Positives = 142/340 (41%), Gaps = 50/340 (14%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY----RPSNDLVPC 103
+G Y V + IG P Y +DTGSDL W QC APC+ C P P + + +PC
Sbjct: 86 SGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQC-APCLLCAAQPTPYFDVKRSATYRALPC 144
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLAL 162
CA+L +P C Y+ Y D S+ GVL + F F + ++ ++
Sbjct: 145 RSSRCAALSSPSCFK----KMCVYQYYYGDTASTAGVLANETFTFGAASSTKVRAANISF 200
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFG 219
GCG + G++G G+G S+VSQL + +CL+ L+FG
Sbjct: 201 GCG--SLNAGELANSSGMVGFGRGPLSLVSQLGPSRF-----SYCLTSYLSPTPSRLYFG 253
Query: 220 DDLYDSSRVVWTSMSSDYTKY-YSPGVAELFF---GGETTGLKNLP-------------- 261
+S + T + +P + ++F G + G K LP
Sbjct: 254 VFANLNSTNTSSGSPVQSTPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAINDDGTG 313
Query: 262 -VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
V+ DSG+S T+L + Y+ + + + ++ + D L C++ P +V
Sbjct: 314 GVIIDSGTSITWLQQDAYEAVRRGLASTIPLPAMND--TDIGLDTCFQWPPP----PNVT 367
Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLII-SNKGNVCLGI 359
F DG T L PE Y++I S G +CL +
Sbjct: 368 VTVPDFVFHF-DGANMT---LPPENYMLIASTTGYLCLAM 403
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 84/303 (27%), Positives = 128/303 (42%), Gaps = 32/303 (10%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV--RCVEAPHPLYRPSND----LVPCE 104
Y VT+ +G P L++DTGSDL+W+QC PC C PL+ P+ VPC
Sbjct: 140 YVVTVSLGTPGVAQTLEVDTGSDLSWVQCT-PCAAPACYSQKDPLFDPAQSSSYAAVPCG 198
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
P+C L + + AQC Y + Y DG + GV D + + R GC
Sbjct: 199 GPVCGGLGI--YASSCSAAQCGYVVSYGDGSKTTGVYSSDTLTLSPNDAVR---GFFFGC 253
Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDL 222
G+ Q + + DG+LGLG+ ++S+V Q + V +CL G+L G
Sbjct: 254 GHAQ---SGFTGNDGLLGLGREEASLVEQ--TAGTYGGVFSYCLPTRPSTTGYLTLGGPS 308
Query: 223 YDSSRVVWTSM---SSDYTKYYSPGVAELFFGGETTGLKNL----PVVFDSGSSYTYLNR 275
+ T+ S + YY + + GG+ + + V D+G+ T L
Sbjct: 309 GAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVFAGGTVVDTGTVITRLPP 368
Query: 276 VTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKT 335
Y L S + +++ AP L C+ F V +AL+F+ G T
Sbjct: 369 TAYAALRSAFRSGMASYGYPSAPATGILDTCYN----FSGYGTVT--LPNVALTFSGGAT 422
Query: 336 RTL 338
TL
Sbjct: 423 VTL 425
>gi|326505434|dbj|BAJ95388.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 529
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 80/267 (29%), Positives = 119/267 (44%), Gaps = 31/267 (11%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRP----SNDLVPCE 104
+G P + + LDTGSDL W+ CD C++C P +Y P ++ VPC
Sbjct: 114 LGTPNVTFLVALDTGSDLFWVPCD--CLKCAPLSSPDYGNLKFDVYSPRKSSTSRKVPCS 171
Query: 105 DPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
+C C + C Y++EY +D SS GVLV+D +G + +
Sbjct: 172 SNMCDL-----QTECSAASNSCPYKIEYLSDNTSSKGVLVEDVMYLATESGHSKITQAPI 226
Query: 163 GCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFG 219
G QV S+ +G+LGLG S+ S L SQ + N C G G + FG
Sbjct: 227 TFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASQGVAANSFSMCFGEDGHGRINFG 286
Query: 220 DDLYDSSRVVWTSMSS-DYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTY 278
D S+ + T ++ + YY+ + GG+T K V DSG+S+T L+ Y
Sbjct: 287 DT--GSADQLETPLNIYKHNPYYNISIVGAMAGGKTFSTK-FSAVVDSGTSFTALSDPMY 343
Query: 279 QTLTSIMKKELSAKSLKEAPEDETLPL 305
+TS K++ K P D +LP
Sbjct: 344 TEITSAFDKQVKE---KRNPADSSLPF 367
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 94/345 (27%), Positives = 140/345 (40%), Gaps = 38/345 (11%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
G TG Y VT+ +G PA Y + DTGSD TW+QC V C E L+ P++
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 230
Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
V C P C+ L G C C Y ++Y DG S+G D + + +
Sbjct: 231 ANVSCAAPACSDLDVSG---CSG-GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVK--- 283
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
GCG + G+LGLG+GK+S+ Q + + V HCL G G+L
Sbjct: 284 GFRFGCGERN--DGLFGEAAGLLGLGRGKTSLPVQTYGK--YGGVFAHCLPARSTGTGYL 339
Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------DSGSS 269
FG ++ + T YY G+ + GG L P VF DSG+
Sbjct: 340 DFGAGSPPATTTTPMLTGNGPTFYYV-GMTGIRVGGRL--LPIAPSVFAAAGTIVDSGTV 396
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
T L Y +L S ++A+ ++A L C+ F + V T++L
Sbjct: 397 ITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYD----FTGMSQVA--IPTVSLL 450
Query: 330 FTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
F G ++ + + VCL + G D+ ++G
Sbjct: 451 FQGGAA---LDVDASGIMYTVSASQVCLAFAGNEDGG--DVGIVG 490
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 98/315 (31%), Positives = 137/315 (43%), Gaps = 36/315 (11%)
Query: 17 MSSSSSSSSSSSLFNHVGSSLLFQVH-GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
+SS+ +S+ +GSS FQV T + V +GQP P +DTGS L
Sbjct: 62 ISSARFKYLQNSIDKELGSSN-FQVDVEQAIKTSLFLVNFSVGQPPVPQLTIMDTGSSLL 120
Query: 76 WLQCDAPCVRCV--EAPHPLYRP--SNDLVP--CEDPICASLHAPGHHNCEDPAQCDYEL 129
W+QC PC C HP++ P S+ V C+D C +AP H C +C YE
Sbjct: 121 WIQCQ-PCKHCSSDHMIHPVFNPALSSTFVECSCDDRFCR--YAPNGH-CGSSNKCVYEQ 176
Query: 130 EYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALGCGYNQVPGASYHPLDGILGLGKGKS 188
Y G S GVL K+ F NG + + +A GCGY H GILGLG +
Sbjct: 177 VYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGYENGEQLESH-FTGILGLGAKPT 235
Query: 189 SIVSQLHSQKLIRNVVGHC---LSGGGGGF--LFFGDD---LYDSSRVVW-TSMSSDYTK 239
S+ QL S+ +C L+ G+ L G+D L D + + + T S Y
Sbjct: 236 SLAVQLGSK------FSYCIGDLANKNYGYNQLVLGEDADILGDPTPIEFETENSIYYMN 289
Query: 240 YYSPGVAELFFGGETTGLK----NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLK 295
V + E K V+ DSG+ YT+L + Y+ L + +K L K +
Sbjct: 290 LEGISVGDTQLNIEPVVFKRRGPRTGVILDSGTLYTWLADIAYRELYNEIKSILDPKLER 349
Query: 296 EAPEDETLPLCWKGR 310
D LC+ GR
Sbjct: 350 FWFRDF---LCYHGR 361
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 89/315 (28%), Positives = 134/315 (42%), Gaps = 42/315 (13%)
Query: 68 LDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICASLHAP---GHHNCE 120
+DT S+LTW+QC+ PC C + PL+ PS+ VPC C +L C+
Sbjct: 128 VDTASELTWVQCE-PCDACHDQQEPLFDPSSSPSYAAVPCNSSSCDALRVATGMSGQACD 186
Query: 121 D-PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY-NQVPGASYHPLD 178
D PA C Y L Y DG S GVL D + + Q GCG NQ P +
Sbjct: 187 DQPAACSYTLSYRDGSYSRGVLAHDRLSLAGEDIQ----GFVFGCGTSNQGP---FGGTS 239
Query: 179 GILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFGDDL---YDSSRVVWTS 232
G++GLG+ + S++SQ Q V +CL G G L GDD +S+ +V+T+
Sbjct: 240 GLMGLGRSQLSLISQTMDQ--FGGVFSYCLPPKESGSSGSLVLGDDASVYRNSTPIVYTA 297
Query: 233 MSSDYTK--YYSPGVAELFFGGETTGLKNL------PVVFDSGSSYTYLNRVTYQTLTSI 284
M SD + +Y + + GGE + DSG+ T L Y + +
Sbjct: 298 MVSDPLQGPFYLANLTGITVGGEDVQSPGFSAGGGGKAIVDSGTIITSLVPSVYAAVRAE 357
Query: 285 MKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPE 344
+L+ +A L C+ + +V+ +L L F DG +
Sbjct: 358 FVSQLA--EYPQAAPFSILDTCFD----LTGLREVQ--VPSLKLVF-DGGAEVEVDSKGV 408
Query: 345 AYLIISNKGNVCLGI 359
Y++ + VCL +
Sbjct: 409 LYVVTGDASQVCLAL 423
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 94/345 (27%), Positives = 140/345 (40%), Gaps = 38/345 (11%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
G TG Y VT+ +G PA Y + DTGSD TW+QC V C E L+ P++
Sbjct: 172 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 231
Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
V C P C+ L G C C Y ++Y DG S+G D + + +
Sbjct: 232 ANVSCAAPACSDLDVSG---CSG-GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVK--- 284
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFL 216
GCG + G+LGLG+GK+S+ Q + + V HCL G G+L
Sbjct: 285 GFRFGCGERN--DGLFGEAAGLLGLGRGKTSLPVQTYGK--YGGVFAHCLPPRSTGTGYL 340
Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------DSGSS 269
FG ++ + T YY G+ + GG L P VF DSG+
Sbjct: 341 DFGAGSPPATTTTPMLTGNGPTFYYV-GMTGIRVGGRL--LPIAPSVFAAAGTIVDSGTV 397
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
T L Y +L S ++A+ ++A L C+ F + V T++L
Sbjct: 398 ITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYD----FTGMSQVA--IPTVSLL 451
Query: 330 FTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
F G ++ + + VCL + G D+ ++G
Sbjct: 452 FQGGAA---LDVDASGIMYTVSASQVCLAFAGNEDGG--DVGIVG 491
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 81/277 (29%), Positives = 123/277 (44%), Gaps = 28/277 (10%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSNDL----VPCED 105
Y VT+ IG PAR + + DTGSDLTW+QC PC C + PL+ PS VPC
Sbjct: 126 YVVTIGIGTPARNFTVLFDTGSDLTWVQCK-PCTDSCYQQQEPLFDPSKSSTYVDVPCGT 184
Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
P C G C+Y ++Y D + G L ++AF + + + GC
Sbjct: 185 PQCKI--GGGQDLTCGGTTCEYSVKYGDQSVTRGNLAQEAFTLSPSAPPAAG--VVFGCS 240
Query: 166 Y---NQVPGASYH-PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFG 219
+ + V GA + G+LGLG+G SSI+SQ +V +CL G G+L G
Sbjct: 241 HEYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNS-GDVFSYCLPPRGSSAGYLTIG 299
Query: 220 DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV---------VFDSGSSY 270
S + +T + +D ++ S V L G + LP+ V DSG+
Sbjct: 300 AAAPPQSNLSFTPLVTDNSQLSSVYVVNLV--GISVSGAALPIDASAFYIGTVIDSGTVI 357
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
T++ Y L ++ + ++ E+L C+
Sbjct: 358 THMPAAAYYVLRDEFRRHMGGYTMLPEGHVESLDTCY 394
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 98/366 (26%), Positives = 159/366 (43%), Gaps = 50/366 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP-----LYRPSNDLVP 102
+G Y V++ +G P + L DTGSDLTW++C A C + HP L R S P
Sbjct: 80 SGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNC--SIHPPGSTFLARHSTTFSP 137
Query: 103 --CEDPICASLHAPGHHNCEDP---AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
C +C + P + C + C YE Y+DG + G K+ N ++G+ +
Sbjct: 138 THCFSSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGREMK 197
Query: 158 PR-LALGCGYN----QVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRN----VVGHCL 208
+ +A GCG++ + G+S++ G++GLG+G S SQL ++ R+ ++ + L
Sbjct: 198 LKSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQL-GRRFGRSFSYCLLDYTL 256
Query: 209 SGGGGGFLFFGDDLY----DSSRVVWTSM--SSDYTKYYSPGVAELFFGG---------- 252
S +L GD + + S + +T + + + +Y + +F G
Sbjct: 257 SPPPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKLHIDPSVW 316
Query: 253 ETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE--APEDETLPLCWKGR 310
L N V DSG++ T+L Y+ + S K+E+ S A LC
Sbjct: 317 SLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKLPSPTPGGASTRSGFDLCV--- 373
Query: 311 RPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDL 370
NV V + R LS G +L+ P Y I ++G CL I E
Sbjct: 374 ----NVTGVSRP-RFPRLSLELGG-ESLYSPPPRNYFIDISEGIKCLAI-QPVEAESGRF 426
Query: 371 NVIGGI 376
+VIG +
Sbjct: 427 SVIGNL 432
>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
Length = 372
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 89/336 (26%), Positives = 134/336 (39%), Gaps = 58/336 (17%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLYRPSNDL----V 101
Y + +G P++ Y++ +DTGSD+ W+ C C +C + LY P++ + V
Sbjct: 27 YFAKIGLGNPSKDYYVQVDTGSDILWVNC-IGCDKCPTKSDLGIKLTLYDPASSVSATRV 85
Query: 102 PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL----N 157
C+D C S + +C+ C Y + Y DG S+ G V DA F G N
Sbjct: 86 SCDDDFCTSTYNGLLPDCKKELPCQYNVVYGDGSSTAGYFVSDAVQFERVTGNLQTGLSN 145
Query: 158 PRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
+ GCG Q G S LDGILG HCL GG
Sbjct: 146 GTVTFGCGAQQSGGLGTSGEALDGILG--------------------AFAHCLDNVNGGG 185
Query: 216 LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--------KNLPVVFDSG 267
+F +L S +V T M + +Y+ + E+ GG L + DSG
Sbjct: 186 IFAIGELV-SPKVNTTPMVPN-QAHYNVYMKEIEVGGTVLELPTDVFDSGDRRGTIIDSG 243
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
++ YL V Y ++ + ++ + SL E +C FK +V F +
Sbjct: 244 TTLAYLPEVVYDSMMNEIRSQQPGLSLHTVEEQF---IC------FKYSGNVDDGFPDIK 294
Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGA 363
F D T T++ P YL ++ C G NG
Sbjct: 295 FHFKDSLTLTVY---PHDYLFQISEDIWCFGWQNGG 327
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 94/345 (27%), Positives = 140/345 (40%), Gaps = 38/345 (11%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-- 100
G TG Y VT+ +G PA Y + DTGSD TW+QC V C E L+ P++
Sbjct: 175 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 234
Query: 101 --VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
V C P C+ L G C C Y ++Y DG S+G D + + +
Sbjct: 235 ANVSCAAPACSDLDVSG---CSG-GHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVK--- 287
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
GCG + G+LGLG+GK+S+ Q + + V HCL G G+L
Sbjct: 288 GFRFGCGERN--DGLFGEAAGLLGLGRGKTSLPVQTYGK--YGGVFAHCLPARSTGTGYL 343
Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------DSGSS 269
FG ++ + T YY G+ + GG L P VF DSG+
Sbjct: 344 DFGAGSPPATTTTPMLTGNGPTFYYV-GMTGIRVGGRL--LPIAPSVFAAAGTIVDSGTV 400
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
T L Y +L S ++A+ ++A L C+ F + V T++L
Sbjct: 401 ITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYD----FTGMSQVA--IPTVSLL 454
Query: 330 FTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
F G ++ + + VCL + G D+ ++G
Sbjct: 455 FQGGAA---LDVDASGIMYTVSASQVCLAFAGNEDGG--DVGIVG 494
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 85/336 (25%), Positives = 141/336 (41%), Gaps = 36/336 (10%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G+Y + + IG P + DTGSDLTW C PC C + +P++ P + C+
Sbjct: 70 GHYLMELSIGTPPFKIYGIADTGSDLTWTSC-VPCNNCYKQRNPMFDPQKSTTYRNISCD 128
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALG 163
+C L C +C+Y YA + GVL ++ + T G+ + + + G
Sbjct: 129 SKLCHKLDT---GVCSPQKRCNYTYAYASAAITRGVLAQETITLSSTKGKSVPLKGIVFG 185
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS----QKLIRNVVGHCLSGGGGGFLFFG 219
CG+N G + H + GI+GLG G S++SQ+ S ++ + +V + FG
Sbjct: 186 CGHNNTGGFNDHEM-GIIGLGGGPVSLISQMGSSFGGKRFSQCLVPFHTDVSVSSKMSFG 244
Query: 220 DDLYDSSR-VVWTSM--SSDYTKYY------SPGVAELFFGGETTGLKNLPVVFDSGSSY 270
S + VV T + D T Y+ S L F G + ++ + DSG+
Sbjct: 245 KGSKVSGKGVVSTPLVAKQDKTPYFVTLLGISVENTYLHFNGSSQNVEKGNMFLDSGTPP 304
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
T L Y + + ++ E++ K + + P D LC++ + + L F
Sbjct: 305 TILPTQLYDQVVAQVRSEVAMKPVTDDP-DLGPQLCYRTKNNLRG--------PVLTAHF 355
Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVG 366
+ L+P I G CLG N + G
Sbjct: 356 EGADVK----LSPTQTFISPKDGVFCLGFTNTSSDG 387
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 93/329 (28%), Positives = 139/329 (42%), Gaps = 45/329 (13%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSN----DLVPCED 105
+ VT+ G PA+ Y L DTGSD++W+QC PC C + P++ P+ VPC
Sbjct: 120 FVVTVGFGTPAQTYTLMFDTGSDVSWIQC-LPCSGHCYKQHDPIFDPTKSATYSAVPCGH 178
Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
P CA+ C C Y+++Y DG S+ GVL + + R P A GCG
Sbjct: 179 PQCAAAGG----KCSSNGTCLYKVQYGDGSSTAGVLSHETLSL---TSARALPGFAFGCG 231
Query: 166 YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLY 223
+ + +DG++GLG+G+ S+ SQ + +CL G+L G
Sbjct: 232 ETNL--GDFGDVDGLIGLGRGQLSLSSQAAASFGAAFS--YCLPSYNTSHGYLTIGTTTP 287
Query: 224 DSSR--VVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------DSGSSYTY 272
S V +T+M DY +Y + + GG L P++F DSG+ TY
Sbjct: 288 ASGSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFV--LPVPPILFTRDGTLLDSGTVLTY 345
Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL-ALSFT 331
L Y L K + K AP + C+ + F L + F+
Sbjct: 346 LPPEAYTALRDRFK--FTMTQYKPAPAYDPFDTCY-------DFAGQNAIFMPLVSFKFS 396
Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGIL 360
DG + F+L+P LI + G L
Sbjct: 397 DGSS---FDLSPFGVLIFPDDTAPATGCL 422
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 96/349 (27%), Positives = 151/349 (43%), Gaps = 46/349 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSNDL----VP 102
+G Y VT+ +G P R DTGSDLTW QC+ PCV C + ++ PS L V
Sbjct: 144 SGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCE-PCVGYCYQQREHIFDPSTSLSYSNVS 202
Query: 103 CEDPICASLH-APGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
C+ P C L A G+ + C Y + Y DG S+G ++ + T+ +
Sbjct: 203 CDSPSCEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTD---VFNNFQ 259
Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFG 219
GCG N + G+LGL + S+VSQ +QK + V +CL S G+L FG
Sbjct: 260 FGCGQNN--RGLFGGTAGLLGLARNPLSLVSQT-AQKYGK-VFSYCLPSSSSSTGYLSFG 315
Query: 220 DDLYDSSRVVWT--SMSSDYTKYYSPGVAELFFGGETTGLKNLPV----------VFDSG 267
DS V +T ++SDY +Y L G + G + LP+ + DSG
Sbjct: 316 SGDGDSKAVKFTPSEVNSDYPSFYF-----LDMVGISVGERKLPIPKSVFSTAGTIIDSG 370
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
+ + L Y ++ + ++ +S L C+ + +K V K +
Sbjct: 371 TVISRLPPTVYSSVQKVFRELMS--DYPRVKGVSILDTCYDLSK-YKTVKVPK-----II 422
Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
L F+ G +L PE + + VCL ++ ++ +IG +
Sbjct: 423 LYFSGGAE---MDLAPEGIIYVLKVSQVCLAFAGNSDD--DEVAIIGNV 466
>gi|213998828|gb|ACJ60781.1| nucellin [Hordeum brachyantherum subsp. californicum]
Length = 133
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/124 (43%), Positives = 68/124 (54%), Gaps = 3/124 (2%)
Query: 176 PLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMS 234
P+DGILGLG GK+ QL QK+I NV+GHCLS G G L+ GD S V W M
Sbjct: 8 PVDGILGLGMGKAGFAVQLKGQKMITGNVIGHCLSSQGKGVLYVGDFNPPSRGVTWVPMK 67
Query: 235 SDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKS 293
YYSPG+AE + G VFDSGS+YT++ Y + S ++ LS S
Sbjct: 68 ESLF-YYSPGLAEPLIDNQPIRGNPTFEAVFDSGSTYTHVPAQVYNEIVSKVRGTLSESS 126
Query: 294 LKEA 297
L+E
Sbjct: 127 LEEV 130
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 85/314 (27%), Positives = 130/314 (41%), Gaps = 60/314 (19%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY----RPSNDLVPC 103
+G Y V + IG P Y +DTGSDL W QC APC+ C + P P + + +PC
Sbjct: 86 SGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQC-APCLLCADQPTPYFDVKKSATYRALPC 144
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-RLAL 162
CASL +P C Y+ Y D S+ GVL + F F N ++ +A
Sbjct: 145 RSSRCASLSSPSCFK----KMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAF 200
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFG 219
GCG + G++G G+G S+VSQL + +CL+ L+FG
Sbjct: 201 GCG--SLNAGDLANSSGMVGFGRGPLSLVSQLGPSRF-----SYCLTSYLSATPSRLYFG 253
Query: 220 DDLYDSSRVVWTSMSSDYTK----------YYSPGVAELFF---GGETTGLKNLP----- 261
V+ ++SS T +P + ++F + G K LP
Sbjct: 254 ---------VYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLV 304
Query: 262 ----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR 311
V+ DSG+S T+L + Y+ + + + ++ + D L C++
Sbjct: 305 FAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIPLTAMND--TDIGLDTCFQWPP 362
Query: 312 PFKNVHDVKKCFRT 325
P NV FRT
Sbjct: 363 P-PNVTVTVPDFRT 375
>gi|42565828|ref|NP_190704.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645262|gb|AEE78783.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 488
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 77/272 (28%), Positives = 118/272 (43%), Gaps = 24/272 (8%)
Query: 50 YYNVTMYIGQPARPYFLDLDTGSDLTWL--QCDAPCVRCVEAPH------PLYRPSNDL- 100
Y NVT IG PA+ + + LDTGSDL WL C++ CVR +E +Y PS
Sbjct: 90 YANVT--IGTPAQWFLVALDTGSDLFWLPCNCNSTCVRSMETDQGERIKLNIYNPSKSKS 147
Query: 101 ---VPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGS-SLGVLVKDAFAFNYTNGQR 155
V C +CA + C P + C Y + Y GS S GVLV+D + G+
Sbjct: 148 SSKVTCNSTLCAL-----RNRCISPVSDCPYRIRYLSPGSKSTGVLVEDVIHMSTEEGEA 202
Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
+ R+ GC +Q+ ++GI+GL ++ + L + + C G G
Sbjct: 203 RDARITFGCSESQLGLFKEVAVNGIMGLAIADIAVPNMLVKAGVASDSFSMCFGPNGKGT 262
Query: 216 LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNR 275
+ FGD SS + T +S + + F G+ T FDSG++ T+L
Sbjct: 263 ISFGDK--GSSDQLETPLSGTISPMFYDVSITKFKVGKVTVDTEFTATFDSGTAVTWLIE 320
Query: 276 VTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
Y LT+ + + L ++ D C+
Sbjct: 321 PYYTALTTNFHLSVPDRRLSKS-VDSPFEFCY 351
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/306 (29%), Positives = 131/306 (42%), Gaps = 43/306 (14%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV--RCVEAPHPLYRPSN----DLVPCE 104
Y V + G PA P + +DTGSD++WLQC PC +C PLY PS+ VPC
Sbjct: 79 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCK-PCSSGQCFPQKDPLYDPSHSSTYSAVPCA 137
Query: 105 DPICASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+C L A + C QC + + YADG S++G +D + G
Sbjct: 138 SDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTL---APGAIVQNFYFG 194
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG--GFLFFGDD 221
CG+ + A DG+LGLG+ + S+ ++ V +CL GFL G
Sbjct: 195 CGHGK--HAVRGLFDGVLGLGRLRESLGARYG------GVFSYCLPSVSSKPGFLALGAG 246
Query: 222 LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP---------VVFDSGSSYTY 272
+ S V+T M T P + + G G K L ++ DSG+ T
Sbjct: 247 -KNPSGFVFTPMG---TVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIVDSGTVITG 302
Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
L Y+ L S +K + A L + +T C+ +KNV K +AL+FT
Sbjct: 303 LQSTAYRALRSAFRKAMEAYRLLPNGDLDT---CYN-LTGYKNVVVPK-----IALTFTG 353
Query: 333 GKTRTL 338
G T L
Sbjct: 354 GATINL 359
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 84/284 (29%), Positives = 119/284 (41%), Gaps = 33/284 (11%)
Query: 51 YNVTMYIGQP-ARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY--RPSNDL--VPCED 105
Y + + IG P ++P L LDTGSD+ W QC+ PC C P P + SN + V C D
Sbjct: 92 YLIHLSIGAPRSQPVVLTLDTGSDVVWTQCE-PCAECFTQPLPRFDTAASNTVRSVACSD 150
Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN--YTNGQRLNPRLALG 163
P+C +A H C C Y Y DG S G ++D+F F+ G+ P + G
Sbjct: 151 PLC---NAHSEHGCFLHG-CTYVSGYGDGSLSFGHFLRDSFTFDDGKGGGKVTVPDIGFG 206
Query: 164 CG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL 222
CG YN G GI G G+G S+ SQL ++ + FL DL
Sbjct: 207 CGMYNA--GRFLQTETGIAGFGRGPLSLPSQLKVRQFSYCFTTRFEAKSSPVFLGGAGDL 264
Query: 223 YDSSRVVWTSMSSDYTKYYSPGVAE----LFFGGETTGLKNLPV-----------VFDSG 267
+ +S+ + + PG L F G T G LPV DSG
Sbjct: 265 --KAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVGKTRLPVPEIKADGSGATFIDSG 322
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR 311
+ T ++ L S + + K A ED+ + W G++
Sbjct: 323 TDITTFPDAVFRQLKSAFIAQAALPVNKTADEDD-ICFSWDGKK 365
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 104/361 (28%), Positives = 155/361 (42%), Gaps = 56/361 (15%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL 100
+ G +G Y + +G PAR ++ LDTGSD+ W+QC APC++C P++ P+
Sbjct: 135 ISGLAQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWIQC-APCIKCYSQTDPVFDPTKSR 193
Query: 101 ----VPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
+PC P+C L PG C Q C Y++ Y DG ++G + F G R
Sbjct: 194 SFANIPCGSPLCRRLDYPG---CSTKKQICLYQVSYGDGSFTVGEFSTETLTF---RGTR 247
Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG-- 213
+ R+ LGCG++ + G+LGLG+G+ S SQ+ + + +CL
Sbjct: 248 VG-RVVLGCGHDN--EGLFVGAAGLLGLGRGRLSFPSQIG--RRFNSKFSYCLGDRSASS 302
Query: 214 --GFLFFGDDLYDSSRVVWTSMSSD---YTKYYSP------------GVAELFFGGETTG 256
+ FGD S +T + S+ T YY G++ F ++TG
Sbjct: 303 RPSSIVFGDSAI-SRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTG 361
Query: 257 LKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNV 316
N V+ DSG+S T L R Y L + A +LK APE C+
Sbjct: 362 --NGGVIIDSGTSVTRLTRAAYVALRDAFL--VGASNLKRAPEFSLFDTCFD----LSGK 413
Query: 317 HDVKKCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGG 375
+VK T+ L F L YLI + N G+ C A L++IG
Sbjct: 414 TEVK--VPTVVLHFRGADV----PLPASNYLIPVDNSGSFCFAFAGTAS----GLSIIGN 463
Query: 376 I 376
I
Sbjct: 464 I 464
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/306 (29%), Positives = 131/306 (42%), Gaps = 43/306 (14%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV--RCVEAPHPLYRPSN----DLVPCE 104
Y V + G PA P + +DTGSD++WLQC PC +C PLY PS+ VPC
Sbjct: 113 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCK-PCSSGQCFPQKDPLYDPSHSSTYSAVPCA 171
Query: 105 DPICASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+C L A + C QC + + YADG S++G +D + G
Sbjct: 172 SDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTL---APGAIVQNFYFG 228
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG--GFLFFGDD 221
CG+ + A DG+LGLG+ + S+ ++ V +CL GFL G
Sbjct: 229 CGHGK--HAVRGLFDGVLGLGRLRESLGARYG------GVFSYCLPSVSSKPGFLALGAG 280
Query: 222 LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP---------VVFDSGSSYTY 272
+ S V+T M T P + + G G K L ++ DSG+ T
Sbjct: 281 -KNPSGFVFTPMG---TVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIVDSGTVITG 336
Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
L Y+ L S +K + A L + +T C+ +KNV K +AL+FT
Sbjct: 337 LQSTAYRALRSAFRKAMEAYRLLPNGDLDT---CYN-LTGYKNVVVPK-----IALTFTG 387
Query: 333 GKTRTL 338
G T L
Sbjct: 388 GATINL 393
>gi|42565826|ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645261|gb|AEE78782.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 528
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 97/351 (27%), Positives = 152/351 (43%), Gaps = 52/351 (14%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCD--APCVRCVE-------APHPLYRP----S 97
Y + +G P + + LDTGSDL WL C+ C+R +E P LY P +
Sbjct: 102 YYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDLEDIGVPQSVPLNLYTPNASTT 161
Query: 98 NDLVPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
+ + C D C G C P+ C Y++ Y++ + G L++D T + L
Sbjct: 162 SSSIRCSDKRCF-----GSKKCSSPSSICPYQISYSNSTGTKGTLLQDVLHL-ATEDENL 215
Query: 157 NP---RLALGCGYNQVP-GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-- 210
P + LGCG Q + ++G+LGLG S+ S L + N C
Sbjct: 216 TPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFSMCFGRVI 275
Query: 211 GGGGFLFFGDDLY-DSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSS 269
G G + FGD Y D + S++ + Y ++ + G+ ++ L FD+GSS
Sbjct: 276 GNVGRISFGDRGYTDQEETPFISVAP--STAYGVNISGVSVAGDPVDIR-LFAKFDTGSS 332
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
+T+L Y LT KS E ED P+ PF+ +D+ T+
Sbjct: 333 FTHLREPAYGVLT---------KSFDELVEDRRRPV--DPELPFEFCYDLSPNATTIQFP 381
Query: 330 FTD----GKTRTLFELTPEAYLIISNKGNV--CLGILNGAEVGLQDLNVIG 374
+ G ++ + L + + +GNV CLG+L VGL+ +NVIG
Sbjct: 382 LVEMTFIGGSKII--LNNPFFTARTQEGNVMYCLGVLK--SVGLK-INVIG 427
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 106/390 (27%), Positives = 162/390 (41%), Gaps = 70/390 (17%)
Query: 27 SSLFNHVGSSLL-FQVH-----------GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDL 74
+SLF+H S++ Q H G T Y VT+ IG + L +DTGSDL
Sbjct: 109 NSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIG--GQNSTLIVDTGSDL 166
Query: 75 TWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICASLH----APGHHNCEDPAQCD 126
TW+QC PC C PL+ PSN +PC P C +L + G + ++ CD
Sbjct: 167 TWVQC-LPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCD 225
Query: 127 YELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKG 186
Y+++Y DG S G L + T G+ GCG N + G++GL +
Sbjct: 226 YQIDYGDGSYSRGELGFEKL----TLGKTEIDNFIFGCGRNN--KGLFGGASGLMGLARS 279
Query: 187 KSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYY- 241
+ S+VSQ S L +V +CL G G G D + + S YT+
Sbjct: 280 ELSLVSQTSS--LFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPIS----YTRMIQ 333
Query: 242 SPGVAELFF---GGETTGLKNLPV-----------VFDSGSSYTYLNRVTYQTLTSIMKK 287
+P ++ +F G + G NL V + DSG+ T L+ Y+ + +K
Sbjct: 334 NPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEK 393
Query: 288 ELSAKSLKEAPEDETLPLCWK--GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEA 345
+ S + P L C+ G N+ VK F +G + ++
Sbjct: 394 QFSG--YRTTPGFSILNTCFNLTGYEEV-NIPTVKFIF--------EGNAEMIVDVEGVF 442
Query: 346 YLIISNKGNVCLGILNGAEVGLQDLNVIGG 375
Y + S+ +CL A +G +D +I G
Sbjct: 443 YFVKSDASQICLAF---ASLGYEDQTMIIG 469
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 106/390 (27%), Positives = 162/390 (41%), Gaps = 70/390 (17%)
Query: 27 SSLFNHVGSSLL-FQVH-----------GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDL 74
+SLF+H S++ Q H G T Y VT+ IG + L +DTGSDL
Sbjct: 30 NSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIG--GQNSTLIVDTGSDL 87
Query: 75 TWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICASLH----APGHHNCEDPAQCD 126
TW+QC PC C PL+ PSN +PC P C +L + G + ++ CD
Sbjct: 88 TWVQC-LPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCD 146
Query: 127 YELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKG 186
Y+++Y DG S G L + T G+ GCG N + G++GL +
Sbjct: 147 YQIDYGDGSYSRGELGFEKL----TLGKTEIDNFIFGCGRNN--KGLFGGASGLMGLARS 200
Query: 187 KSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYY- 241
+ S+VSQ S L +V +CL G G G D + + S YT+
Sbjct: 201 ELSLVSQTSS--LFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPIS----YTRMIQ 254
Query: 242 SPGVAELFF---GGETTGLKNLPV-----------VFDSGSSYTYLNRVTYQTLTSIMKK 287
+P ++ +F G + G NL V + DSG+ T L+ Y+ + +K
Sbjct: 255 NPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEK 314
Query: 288 ELSAKSLKEAPEDETLPLCWK--GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEA 345
+ S + P L C+ G N+ VK F +G + ++
Sbjct: 315 QFSG--YRTTPGFSILNTCFNLTGYEEV-NIPTVKFIF--------EGNAEMIVDVEGVF 363
Query: 346 YLIISNKGNVCLGILNGAEVGLQDLNVIGG 375
Y + S+ +CL A +G +D +I G
Sbjct: 364 YFVKSDASQICLAF---ASLGYEDQTMIIG 390
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 107/350 (30%), Positives = 142/350 (40%), Gaps = 51/350 (14%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD---AP---CVRCVEAPHPLYRPSN---- 98
G Y V+M G P + L DTGSDL WLQC AP C + + P + S
Sbjct: 52 GQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATL 111
Query: 99 DLVPCEDPICASLHAPGHH--NCE--DPAQCDYELEYADGGSSLGVLVKD-AFAFNYTNG 153
+VPC C + AP H +C P C Y +YADG S+ G L +D A N T+G
Sbjct: 112 SVVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSG 171
Query: 154 QRLNPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC---LS 209
+A GCG NQ G S+ G++GLG+G+ S +Q S L +C L
Sbjct: 172 GAAVRGVAFGCGTRNQ--GGSFSGTGGVIGLGQGQLSFPAQ--SGSLFAQTFSYCLLDLE 227
Query: 210 GGGGG----FLFFGDDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGETTG------- 256
GG G FLF G ++ +T + S+ +Y GV + G
Sbjct: 228 GGRRGRSSSFLFLGRPERRAA-FAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWA 286
Query: 257 ---LKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET----LPLCWKG 309
L N V DSGS+ TYL Y L S + L P T L LC+
Sbjct: 287 IDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASV---HLPRIPSSATFFQGLELCYNV 343
Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGI 359
++ F L + F G + EL YL+ CL I
Sbjct: 344 SSS-SSLAPANGGFPRLTIDFAQGLS---LELPTGNYLVDVADDVKCLAI 389
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 80/281 (28%), Positives = 125/281 (44%), Gaps = 44/281 (15%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G Y V + +G P DTGSD+ W QC PC C + P++ PS V C
Sbjct: 81 GEYLVEISVGTPPFSIVAVADTGSDVIWTQCK-PCSNCYQQNAPMFDPSKSTTYKNVACS 139
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
P+C+ ++ +C D ++C Y + Y D S G L D T+G+ + PR +G
Sbjct: 140 SPVCS--YSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQSTSGRPVAFPRTVIG 197
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF------LF 217
CG++ G + GI+GLG+G +S+V+QL + +CL G G L
Sbjct: 198 CGHDNA-GTFNANVSGIVGLGRGPASLVTQLGPATGGK--FSYCLIPIGTGSTNDSTKLN 254
Query: 218 FGDDLYDS-SRVVWTSM--SSDYTKYYS----------------PGVAELFFGGETTGLK 258
FG + S S V T + S+ Y +YS G ++L GGE+
Sbjct: 255 FGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKL--GGESN--- 309
Query: 259 NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPE 299
++ DSG++ TYL + S + + +S ++ E
Sbjct: 310 ---IIIDSGTTLTYLPSALLNSFGSAISQSMSLPHAQDPSE 347
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 101/371 (27%), Positives = 156/371 (42%), Gaps = 64/371 (17%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP---LYRPSNDLVP-- 102
+G Y V + +G P + L DTGSDL W++C A C C P L R S+ P
Sbjct: 85 SGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSA-CRNCSHHPPSSAFLPRHSSSFSPFH 143
Query: 103 CEDPICASL-HAPGHHNCEDP---AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
C DP C L HAP HH C + C + YADG S G K+ +G ++
Sbjct: 144 CFDPHCRLLPHAP-HHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSGSEIHL 202
Query: 159 R-LALGCGYN----QVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGHCLSG 210
+ L+ GCG+ V GA ++ G++GLG+G S SQL + K ++ + LS
Sbjct: 203 KGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYCLMDYTLSP 262
Query: 211 GGGGFLFFGDDLY-----DSSRVVWTSMSSD---YTKYY---------------SPGVAE 247
FL G L+ +++++ +T + + T YY +P V E
Sbjct: 263 PPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLPINPAVWE 322
Query: 248 LFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPE--DETLPL 305
+ G N V DSG++ TYL + Y+ + +++ + + E D +
Sbjct: 323 IDEQG------NGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGFDLCVNA 376
Query: 306 CWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEV 365
+ RRP L G +F P Y + + +G +CL I E
Sbjct: 377 SGESRRP---------SLPRLRFRLGGG---AVFAPPPRNYFLETEEGVMCLAI-RAVES 423
Query: 366 GLQDLNVIGGI 376
G +VIG +
Sbjct: 424 G-NGFSVIGNL 433
>gi|413936885|gb|AFW71436.1| hypothetical protein ZEAMMB73_738128, partial [Zea mays]
Length = 320
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 63/188 (33%), Positives = 88/188 (46%), Gaps = 22/188 (11%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH--------PLYRP--S 97
TG Y + IG P + Y++ +DTGSD+ W+ C +RC P Y P S
Sbjct: 81 TGLYYTRIEIGSPPKGYYVQVDTGSDILWVNC----IRCDGCPTRSGLGIELTQYDPAGS 136
Query: 98 NDLVPCEDPICASLHAPG-HHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYT--NG 153
V CE C + A G C + C + + Y DG ++ G V D +N NG
Sbjct: 137 GTTVGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNG 196
Query: 154 QRL--NPRLALGCGYNQVP--GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
Q N + GCG G+S LDGILG G+ SS++SQL + + +R + HCL
Sbjct: 197 QTTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLD 256
Query: 210 GGGGGFLF 217
GG +F
Sbjct: 257 TVRGGGIF 264
>gi|213998832|gb|ACJ60783.1| nucellin [Hordeum vulgare subsp. spontaneum]
Length = 127
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 53/127 (41%), Positives = 71/127 (55%), Gaps = 5/127 (3%)
Query: 164 CGYNQVPGASY--HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSGGGGGFLFFGD 220
CGY Q A P+DGILGLG GK+ + +QL K+I+ NV+GHCLS G G L+ GD
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKENVIGHCLSSKGKGVLYVGD 60
Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYTYLNRVTYQ 279
+ V W M YYSPG+AE+F + G VFDSGS+YT++ Y
Sbjct: 61 FNPPTRGVTWVPMRESLF-YYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTYTHVPAQIYN 119
Query: 280 TLTSIMK 286
+ S ++
Sbjct: 120 EIVSKVR 126
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 96/370 (25%), Positives = 139/370 (37%), Gaps = 67/370 (18%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND- 99
+ G + +G Y ++ +G P P L +DTGSD+ WLQC PCV C PLY P
Sbjct: 89 ISGLPFASGEYFASVGVGTPPTPALLVIDTGSDVVWLQCK-PCVHCYRQLSPLYDPRGSS 147
Query: 100 ---LVPCEDPICASLHAPGHHNCEDPAQCD-------YELEYADGGSSLGVLVKDAFAFN 149
PC P C +P CD Y + Y D S+ G L D F
Sbjct: 148 TYAQTPCSPP-----------QCRNPQTCDGTTGGCGYRIVYGDASSTSGNLATDRLVF- 195
Query: 150 YTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL- 208
+N + + LGCG++ + G+LG+ +G +S +Q+ +CL
Sbjct: 196 -SNDTSVG-NVTLGCGHDNE--GLFGSAAGLLGVARGNNSFATQVADS--YGRYFAYCLG 249
Query: 209 ----SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK---YYSPGVAELFFGGETTGLKNLP 261
SG +L FG + V+T + S+ + YY V G TG N
Sbjct: 250 DRTRSGSSSSYLVFGRTAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNAS 309
Query: 262 -----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
VV DSG+S T R Y L + +++ +G
Sbjct: 310 LSLDPATGRGGVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKV---------GRGI 360
Query: 311 RPFKNVHDVKKCFRT----LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVG 366
F +D++ + L F G L PE YL+ G L A G
Sbjct: 361 SVFDACYDLRGVAVADAPGVVLHFAGGAD---VALPPENYLVPEESGRYHCFALEAA--G 415
Query: 367 LQDLNVIGGI 376
L+VIG +
Sbjct: 416 HDGLSVIGNV 425
>gi|110738505|dbj|BAF01178.1| hypothetical protein [Arabidopsis thaliana]
Length = 284
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 66/203 (32%), Positives = 103/203 (50%), Gaps = 16/203 (7%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
+++ ++ GYY ++IG P + + L +D+GS +T++ C + C +C + P ++P
Sbjct: 81 MRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKHQDPKFQP-- 137
Query: 99 DLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
++ P+ ++ NC+D QC YE EYA+ SS GVL +D +F N +L
Sbjct: 138 EMSSTYQPVKCNMDC----NCDDDREQCVYEREYAEHSSSKGVLGEDLISFG--NESQLT 191
Query: 158 P-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGG 213
P R GC + DGI+GLG+G S+V QL + LI N G C G GGG
Sbjct: 192 PQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGG 251
Query: 214 GFLFFGDDLYDSSRVVWTSMSSD 236
+ G D S +V+T D
Sbjct: 252 SMILGGFDY--PSDMVFTDSDPD 272
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 99/347 (28%), Positives = 149/347 (42%), Gaps = 54/347 (15%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y V +Y+G P R + + +DTGSDL WLQC APC+ C E P++ P+ L V C
Sbjct: 149 SGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPATSLSYRNVTC 207
Query: 104 EDPICASLHAP-GHHNCEDPAQ--CDYELEYADGGSSLGVLVKDAFAFNYT--NGQRLNP 158
DP C + P C P C Y Y D ++ G L +AF N T R
Sbjct: 208 GDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVD 267
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGH----CLSGGG-- 212
+ GCG++ +H G+LGLG+G S SQL R V GH CL G
Sbjct: 268 DVVFGCGHSN--RGLFHGAAGLLGLGRGALSFASQL------RAVYGHAFSYCLVDHGSS 319
Query: 213 -GGFLFFGDD--LYDSSRVVWTSMSSDYT----KYYSPGVAELFFGGETTGLKNLP---- 261
G + FGDD L R+ +T+ + +Y + + GGE +
Sbjct: 320 VGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVG 379
Query: 262 ------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
+ DSG++ +Y Y+ ++++ + K P P+ P N
Sbjct: 380 KDGSGGTIIDSGTTLSYFAEPAYE----VIRRAFVERMDKAYPLVADFPVL----SPCYN 431
Query: 316 VHDVKKC-FRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGIL 360
V V++ +L F DG +++ E Y + + G +CL +L
Sbjct: 432 VSGVERVEVPEFSLLFADG---AVWDFPAENYFVRLDPDGIMCLAVL 475
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 100/367 (27%), Positives = 151/367 (41%), Gaps = 57/367 (15%)
Query: 34 GSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPL 93
G L VH G + + + IG PA Y +DTGSDL W QC PCV C + P+
Sbjct: 81 GGDLQVPVHAG---NGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPV 136
Query: 94 YRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN 149
+ PS+ VPC C+ L C ++C Y Y D S+ GVL + F
Sbjct: 137 FDPSSSSTYATVPCSSASCSDLPT---SKCTSASKCGYTYTYGDSSSTQGVLATETF--- 190
Query: 150 YTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
T + P + GCG + G + G++GLG+G S+VSQL K +CL+
Sbjct: 191 -TLAKSKLPGVVFGCG-DTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKF-----SYCLT 243
Query: 210 GGG---------GGFLFFGDDLYDSSRVVWTSMSSDYTK--YYSPGVAELFFGGETTGLK 258
G + +S V T + + ++ +Y + + G L
Sbjct: 244 SLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLP 303
Query: 259 NLP----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
+ V+ DSG+S TYL Y+ L KK +A+ A + + L
Sbjct: 304 SSAFAVQDDGTGGVIVDSGTSITYLEVQGYRAL----KKAFAAQMALPAADGSGVGLDLC 359
Query: 309 GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIIS-NKGNVCLGILNGAEVGL 367
R P K V V+ L F G +L E Y+++ G +CL ++ G
Sbjct: 360 FRAPAKGVDQVE--VPRLVFHFDGGAD---LDLPAENYMVLDGGSGALCLTVM-----GS 409
Query: 368 QDLNVIG 374
+ L++IG
Sbjct: 410 RGLSIIG 416
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 100/367 (27%), Positives = 151/367 (41%), Gaps = 57/367 (15%)
Query: 34 GSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPL 93
G L VH G + + + IG PA Y +DTGSDL W QC PCV C + P+
Sbjct: 91 GGDLQVPVHAG---NGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPV 146
Query: 94 YRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN 149
+ PS+ VPC C+ L C ++C Y Y D S+ GVL + F
Sbjct: 147 FDPSSSSTYATVPCSSASCSDLPT---SKCTSASKCGYTYTYGDSSSTQGVLATETF--- 200
Query: 150 YTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
T + P + GCG + G + G++GLG+G S+VSQL K +CL+
Sbjct: 201 -TLAKSKLPGVVFGCG-DTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKF-----SYCLT 253
Query: 210 GGG---------GGFLFFGDDLYDSSRVVWTSMSSDYTK--YYSPGVAELFFGGETTGLK 258
G + +S V T + + ++ +Y + + G L
Sbjct: 254 SLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLP 313
Query: 259 NLP----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
+ V+ DSG+S TYL Y+ L KK +A+ A + + L
Sbjct: 314 SSAFAVQDDGTGGVIVDSGTSITYLEVQGYRAL----KKAFAAQMALPAADGSGVGLDLC 369
Query: 309 GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIIS-NKGNVCLGILNGAEVGL 367
R P K V V+ L F G +L E Y+++ G +CL ++ G
Sbjct: 370 FRAPAKGVDQVE--VPRLVFHFDGGAD---LDLPAENYMVLDGGSGALCLTVM-----GS 419
Query: 368 QDLNVIG 374
+ L++IG
Sbjct: 420 RGLSIIG 426
>gi|357489329|ref|XP_003614952.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355516287|gb|AES97910.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 530
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 75/261 (28%), Positives = 114/261 (43%), Gaps = 33/261 (12%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP-SNDL-------------VP 102
IG P + + LDTGSD+ W+ CD C+ C Y DL +P
Sbjct: 108 IGTPNVSFLVALDTGSDMFWVPCD--CIECAPLSAAFYNALDRDLNQYSPSLSSSSRHLP 165
Query: 103 CEDPICASLHAPGHHNCED-PAQCDYELEY-ADGGSSLGVLVKDAFAFNYTNGQR--LNP 158
C +C + NC+ +C Y EY +D SS G L++D N + +
Sbjct: 166 CGHQLCNQ-----NSNCKGFKDRCPYIKEYTSDNTSSSGFLIEDKLHLASNNATKNSIQA 220
Query: 159 RLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
+ LGCG Q + GA+ +G+LGLG G S+ + L LIRN + CL+ G G
Sbjct: 221 SVILGCGRKQSGYFLEGAA---PNGMLGLGPGSISVPALLAKAGLIRNSISICLNEKGSG 277
Query: 215 FLFFGDDLYDSSRVVWTSMSSD-YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYL 273
+ FGD + + R + D Y GV G D+G+S+TYL
Sbjct: 278 RILFGDQGHATQRRSTPFLLDDGELLNYFVGVERFCVGSFCYKETEFKAFIDTGTSFTYL 337
Query: 274 NRVTYQTLTSIMKKELSAKSL 294
+ Y+T+ + +K++ A +
Sbjct: 338 PKGVYETVVAEFEKQVHATRI 358
>gi|213998796|gb|ACJ60765.1| nucellin [Hordeum marinum subsp. gussoneanum]
Length = 133
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 53/124 (42%), Positives = 69/124 (55%), Gaps = 3/124 (2%)
Query: 176 PLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMS 234
P+DGILGLG GK+ +QL QK+I NV+GHCLS G G L+ G+ S V W M
Sbjct: 8 PVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVLYVGNFNPPSRGVTWVPM- 66
Query: 235 SDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKS 293
+ + YYSPG+AEL + G VFDSGS+YT + Y + ++ LS S
Sbjct: 67 RESSFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTLVPSQIYNEIVPKVRGTLSESS 126
Query: 294 LKEA 297
L E
Sbjct: 127 LAEV 130
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 100/365 (27%), Positives = 151/365 (41%), Gaps = 57/365 (15%)
Query: 36 SLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYR 95
S L VH G + + + IG PA Y +DTGSDL W QC PCV C + P++
Sbjct: 62 SRLVPVHAG---NGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPVFD 117
Query: 96 PSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
PS+ VPC C+ L C ++C Y Y D S+ GVL + F T
Sbjct: 118 PSSSSTYATVPCSSASCSDLPT---SKCTSASKCGYTYTYGDSSSTQGVLATETF----T 170
Query: 152 NGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
+ P + GCG + G + G++GLG+G S+VSQL K +CL+
Sbjct: 171 LAKSKLPGVVFGCG-DTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKF-----SYCLTSL 224
Query: 212 G---------GGFLFFGDDLYDSSRVVWTSMSSDYTK--YYSPGVAELFFGGETTGLKNL 260
G + +S V T + + ++ +Y + + G L +
Sbjct: 225 DDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSS 284
Query: 261 P----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
V+ DSG+S TYL Y+ L KK +A+ A + + L R
Sbjct: 285 AFAVQDDGTGGVIVDSGTSITYLEVQGYRAL----KKAFAAQMALPAADGSGVGLDLCFR 340
Query: 311 RPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIIS-NKGNVCLGILNGAEVGLQD 369
P K V V+ L F G +L E Y+++ G +CL ++ G +
Sbjct: 341 APAKGVDQVE--VPRLVFHFDGGAD---LDLPAENYMVLDGGSGALCLTVM-----GSRG 390
Query: 370 LNVIG 374
L++IG
Sbjct: 391 LSIIG 395
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 93/371 (25%), Positives = 157/371 (42%), Gaps = 67/371 (18%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G Y T+ +G PA+ + + DTGSDL W+QC PC C P++ P S + C
Sbjct: 38 GDYVTTISLGTPAKVFSVIADTGSDLIWIQC-KPCQACFNQKDPIFDPEGSSSYTTMSCG 96
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALG 163
D +C SL +C CDY Y DG + G L + T G++L + +A G
Sbjct: 97 DTLCDSLP---RKSCS--PNCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFG 151
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFF 218
CG+ + S++ G++GLG+G S VSQL L + +CL + +FF
Sbjct: 152 CGH--LNRGSFNDASGLVGLGRGNLSFVSQLG--DLFGHKFSYCLVPWRDAPSKTSPMFF 207
Query: 219 GDDLYDSSRVVWTSMSSDYTKY-YSPGVAELFFGGETTGLKNLPV--------------- 262
GD+ SS + +T ++P + ++ LK++ +
Sbjct: 208 GDE--SSSHSSGKKLHYAFTPMIHNPAMESFYY----VKLKDISIAGRALRIPAGSFDIK 261
Query: 263 -------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
+FDSG++ T L YQ + ++ ++S + + L LC+ +
Sbjct: 262 PDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPEIDGS--SAGLDLCY-------D 312
Query: 316 VHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGN--VCLGILNG-AEVGL----- 367
V K ++ + +L E Y I +N VCL +++ ++G+
Sbjct: 313 VSGSKASYKKKIPAMVFHFEGADHQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMM 372
Query: 368 -QDLNVIGGIG 377
Q+ V+ IG
Sbjct: 373 QQNFRVMYDIG 383
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 93/348 (26%), Positives = 141/348 (40%), Gaps = 38/348 (10%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
TG Y V + +G P + + L DTGSDLTW++C P ++RP +PC
Sbjct: 113 TGQYFVKLRVGTPVQEFTLVADTGSDLTWVKCAG-----ASPPGRVFRPKTSRSWAPIPC 167
Query: 104 EDPICASLHAP-GHHNCEDPAQ-CDYELEYADGGS-SLGVLVKDAFAFNYTNGQRLNPR- 159
C L P NC PA C Y+ Y +G + + G++ ++ G+ +
Sbjct: 168 SSDTC-KLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGGKVAQLKD 226
Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGHCLSGGGGGFL 216
+ LGC + G S+ DG+L LG K S +Q ++ +V H G+L
Sbjct: 227 VVLGCSSSH-DGQSFRSADGVLSLGNAKISFATQAAARFGGSFSYCLVDHLAPRNATGYL 285
Query: 217 FFGDDLYDSSRVVWTSMSSD-YTKYYSPGVAELFFGG-------ETTGLKNLPVVFDSGS 268
FG + T + D +Y V + G E K+ V+ DSG+
Sbjct: 286 AFGPGQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAEVWDAKSGGVILDSGN 345
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
+ T L Y+ + + + K L P E W RRP + LA+
Sbjct: 346 TLTVLAAPAYKAVVAALSKHLDGVPKVSFPPFEHC-YNWTARRP-----GAPEIIPKLAV 399
Query: 329 SFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
F G R E ++Y+I G C+G+ G G L+VIG I
Sbjct: 400 QFA-GSAR--LEPPAKSYVIDVKPGVKCIGVQEGEWPG---LSVIGNI 441
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 97/345 (28%), Positives = 146/345 (42%), Gaps = 55/345 (15%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G Y + + IG P Y LDTGSDL W QC PC +C + P P++ P S V C
Sbjct: 106 GEYLMELAIGTPPVSYPAVLDTGSDLIWTQC-KPCTQCYKQPTPIFDPKKSSSFSKVSCG 164
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
+C+++ + C D C+Y Y D + GVL + F F + + + GC
Sbjct: 165 SSLCSAVPS---STCSD--GCEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGC 219
Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFGD- 220
G + G + G++GLG+G S+VSQL + +CL+ L G
Sbjct: 220 GEDN-EGDGFEQASGLVGLGRGPLSLVSQLKEPRF-----SYCLTPMDDTKESILLLGSL 273
Query: 221 -DLYDSSRVVWTSMSSD-------YTKYYSPGVAELFFGGETTGLK-----NLPVVFDSG 267
+ D+ VV T + + Y V + E + + N V+ DSG
Sbjct: 274 GKVKDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVIIDSG 333
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET----LPLCWKGRRPFKNVHDVKKCF 323
++ TY+ + ++ L KKE +++ + P D+T L LC+ V K F
Sbjct: 334 TTITYIEQKAFEAL----KKEFISQT--KLPLDKTSSTGLDLCFSLPSGSTQVEIPKIVF 387
Query: 324 RTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGL 367
F G EL E Y+I SN G CL + GA G+
Sbjct: 388 H-----FKGGD----LELPAENYMIGDSNLGVACLAM--GASSGM 421
>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 466
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 97/365 (26%), Positives = 142/365 (38%), Gaps = 55/365 (15%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP----------------------CVRCVE 88
Y + +G P + DTGSDL WL+C+ V
Sbjct: 82 YLAAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPPPPPPEAVV 141
Query: 89 APHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
+P S V C+ P C +L N D CD+ Y DG S+ G+L D F F
Sbjct: 142 YFNPFDSSSYSRVGCDGPSCLALATNASCN-GDSHACDFRYSYRDGASATGLLAADTFTF 200
Query: 149 --NYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGH 206
N N + GC G + DG++GLG G S+ SQL +
Sbjct: 201 GGNINNDTTSTASIDFGCATGTA-GREFQA-DGMVGLGAGPLSLASQLGRK------FSF 252
Query: 207 CLSG----GGGGFLFFGDDLYDSSRVVWTS----MSSDYTKYYSPGVAELFFGGE----T 254
CL+ L FG S T+ SS+ YY+ + L G+ T
Sbjct: 253 CLTAYDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISIDSLKVAGQPVPGT 312
Query: 255 TGLKNLPVVFDSGSSYTYLNRVTYQT-LTSIMKKELSAKSLKEA-PEDETLPLCWKGRRP 312
T + V+ D+G+ T+L+R LT + + + L A P DETL LC+ R
Sbjct: 313 TSVSK--VIVDTGTVLTFLDRAALLAPLTESLARVMDGAGLPRAPPPDETLELCYDVSR- 369
Query: 313 FKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNV 372
V DV + L G + LT E ++ +G +CL ++ + LQ L+V
Sbjct: 370 ---VKDVDGVIPDVTLVLGGGGGGEV-RLTGEGTFVLVKEGVLCLAVVTTSP-ELQPLSV 424
Query: 373 IGGIG 377
+G +
Sbjct: 425 LGNVA 429
>gi|297805186|ref|XP_002870477.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316313|gb|EFH46736.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 72/228 (31%), Positives = 104/228 (45%), Gaps = 26/228 (11%)
Query: 29 LFNHVGSSLLFQVHGNVYPTG-YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV 87
L +HV + F V P Y T+ IG P R + + +DTGSD+ W+ C + CV C
Sbjct: 59 LQSHVHGAFSFPVERGTNPISRIYYTTLQIGTPPREFNVVIDTGSDVLWVSCIS-CVGCP 117
Query: 88 EAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVK 143
+ P S + C D C S H + +Y++EY+DG + G +
Sbjct: 118 LQNVTFFDPGASSSAVKLACSDKRCFS----DLHKKSGCSPLEYKVEYSDGSFTSGYYIS 173
Query: 144 DAFAFNYTNGQRLNPR----LALGC-----GYNQVPGASYHPLDGILGLGKGKSSIVSQL 194
D +F L + GC G +P S H GI+GLGKG+ +VSQL
Sbjct: 174 DLISFETVMSSNLTVKSSAPFVFGCSNLHAGLISLPETSIH---GIVGLGKGRLLVVSQL 230
Query: 195 HSQKLIRNVVGHCLSGG--GGGFLFFGDDLYDSSRVVWTSMSSDYTKY 240
SQ+L V CLSGG GGG + G++ ++ V+T + T Y
Sbjct: 231 SSQRLAPEVFSLCLSGGQEGGGVIILGENRLPNT--VYTPLVRSQTHY 276
>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
Length = 632
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 78/280 (27%), Positives = 122/280 (43%), Gaps = 39/280 (13%)
Query: 44 NVYPTGYYNVTMY----IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYR---- 95
+ P Y+ Y IG P+ + + LD+GSDL W+ C+ CV+C Y
Sbjct: 86 TISPGNYFGWLHYTWIDIGTPSVSFLVALDSGSDLLWIPCN--CVQCAPLSSAYYSSLAT 143
Query: 96 -------PS----NDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYA-DGGSSLGVLV 142
PS + + PC +C S A CE P QC Y + YA + SS G+LV
Sbjct: 144 KDLNEFDPSASTTSKVFPCSHKLCESAPA-----CESPKEQCPYTVTYASENTSSSGLLV 198
Query: 143 KDAF--AFNYTNGQRLNPRLALGCGYNQVPGASYHPL--DGILGLGKGKSSIVSQLHSQK 198
+D A++ + R+ +GCG Q G + DG++GLG G+ S+ S L
Sbjct: 199 EDVLHLAYSANASSSVKARVVVGCGEKQ-SGEFLKGIAPDGVMGLGPGEISVPSFLAKAG 257
Query: 199 LIRNVVGHCLSGGGGGFLFFGD---DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETT 255
L+RN C G ++FGD S+R + +++ Y+ GV G
Sbjct: 258 LMRNSFSMCFDEEDSGRIYFGDVGPSTQQSTRFL--PYKNEFVAYFV-GVEVCCVGNSCL 314
Query: 256 GLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLK 295
+ + DSG S+T+L Y+ + + ++A K
Sbjct: 315 KQSSFTTLIDSGQSFTFLPEEIYREVALEIDSHINATVKK 354
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 99/347 (28%), Positives = 149/347 (42%), Gaps = 54/347 (15%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y V +Y+G P R + + +DTGSDL WLQC APC+ C E P++ P+ L V C
Sbjct: 149 SGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPAASLSYRNVTC 207
Query: 104 EDPICASLHAP-GHHNCEDPAQ--CDYELEYADGGSSLGVLVKDAFAFNYT--NGQRLNP 158
DP C + P C P C Y Y D ++ G L +AF N T R
Sbjct: 208 GDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVD 267
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGH----CLSGGG-- 212
+ GCG++ +H G+LGLG+G S SQL R V GH CL G
Sbjct: 268 DVVFGCGHSN--RGLFHGAAGLLGLGRGALSFASQL------RAVYGHAFSYCLVDHGSS 319
Query: 213 -GGFLFFGDD--LYDSSRVVWTSMSSDYT----KYYSPGVAELFFGGETTGLKNLP---- 261
G + FGDD L R+ +T+ + +Y + + GGE +
Sbjct: 320 VGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVG 379
Query: 262 ------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
+ DSG++ +Y Y+ ++++ + K P P+ P N
Sbjct: 380 KDGSGGTIIDSGTTLSYFAEPAYE----VIRRAFVERMDKAYPLVADFPVL----SPCYN 431
Query: 316 VHDVKKC-FRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGIL 360
V V++ +L F DG +++ E Y + + G +CL +L
Sbjct: 432 VSGVERVEVPEFSLLFADG---AVWDFPAENYFVRLDPDGIMCLAVL 475
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 82/285 (28%), Positives = 118/285 (41%), Gaps = 39/285 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPC 103
+G Y V + IG P +L +D+GSD+ W+QC PC+ C PL+ P++ V C
Sbjct: 122 SGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCK-PCLECYAQADPLFDPASSATFSAVSC 180
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
IC +L G C D C+YE+ Y DG + G L + T + +A+G
Sbjct: 181 GSAICRTLRTSG---CGDSGGCEYEVSYGDGSYTKGTLALETLTLGGTAVE----GVAIG 233
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG---------G 214
CG+ + G+LGLG G S+V QL +CL+ GG G
Sbjct: 234 CGHRNR--GLFVGAAGLLGLGWGPMSLVGQLGGAA--GGAFSYCLASRGGSGSGAADAAG 289
Query: 215 FLFFGDDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGE----TTGLKNLP------V 262
L G VW + + +Y GV+ + G E GL L V
Sbjct: 290 SLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLTEDGGGGV 349
Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
V D+G++ T L + Y L + A L AP L C+
Sbjct: 350 VMDTGTAVTRLPQEAYAALRDAFVGAVGA--LPRAPGVSLLDTCY 392
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 88/292 (30%), Positives = 124/292 (42%), Gaps = 43/292 (14%)
Query: 27 SSLFNHVGSSLLFQVHGNVYPT--GY------YNVTMYIGQPARPYFLDLDTGSDLTWLQ 78
SS +N+V L Q PT GY Y +T+ IG PA + +DTGSD++W+Q
Sbjct: 99 SSRYNNVAKEL--QQSAVTIPTSSGYSLGTTEYVITVTIGTPAVTQVMSIDTGSDVSWVQ 156
Query: 79 CDAPCV--RCVEAPHPLYRPSNDLV----PCEDPICASLHAPGHHNCEDPAQCDYELEYA 132
C APC C L+ P+ C CA L G+ + +QC Y ++Y
Sbjct: 157 C-APCAAQSCSSQKDKLFDPAMSATYSAFSCGSAQCAQLGDEGNGCLK--SQCQYIVKYG 213
Query: 133 DGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVS 192
DG ++ G D + ++ + GC + LDG++GLG S+VS
Sbjct: 214 DGSNTAGTYGSDTLSLTSSDAVK---SFQFGCSHRAA--GFVGELDGLMGLGGDTESLVS 268
Query: 193 QLHSQKLIRNVVGHCL---SGGGGGFLFFG-DDLYDSSRVVWTSMSSDYTKYYSPGVAEL 248
Q + +CL S GGGFL G SSR T M ++ P +
Sbjct: 269 Q--TAATYGKAFSYCLPPPSSSGGGFLTLGAAGGASSSRYSHTPM----VRFSVPTFYGV 322
Query: 249 FFGGETTG--LKNLPV-------VFDSGSSYTYLNRVTYQTLTSIMKKELSA 291
F G T + N+P V DSG+ T L YQ L + KKE+ A
Sbjct: 323 FLQGITVAGTMLNVPASVFSGASVVDSGTVITQLPPTAYQALRTAFKKEMKA 374
>gi|297819836|ref|XP_002877801.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323639|gb|EFH54060.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 78/278 (28%), Positives = 117/278 (42%), Gaps = 30/278 (10%)
Query: 50 YYNVTMYIGQPARPYFLDLDTGSDLTWL--QCDAPCVRCVEAPH------------PLYR 95
Y NVT IG PA+ + + LDTGSDL WL C++ CVR +E +Y
Sbjct: 112 YANVT--IGTPAQWFLVALDTGSDLFWLPCNCNSTCVRSMETDQGETHMNAQRIRLNIYN 169
Query: 96 PS----NDLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGS-SLGVLVKDAFAFN 149
PS + V C +CA + C P + C Y + Y GS S GVLV+D +
Sbjct: 170 PSISTSSSKVTCNSTLCAL-----RNRCISPLSDCPYRIRYLSPGSKSTGVLVEDVIHMS 224
Query: 150 YTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
G+ + R+ GC Q+ ++GI+GL ++ + L + + C
Sbjct: 225 TEEGEARDARITFGCSETQLGLFQEVAVNGIMGLAMADIAVPNMLVKAGVASDSFSMCFG 284
Query: 210 GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSS 269
G G + FGD SS T + + + F G+ T +FDSG++
Sbjct: 285 PNGKGTISFGDK--GSSDQHETPLGGTISPLFYDVSITKFKVGKVTVETKFSAIFDSGTA 342
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
T+L Y LT+ + + L A D T C+
Sbjct: 343 VTWLLDPYYTALTTNFHLSVPDRRLP-ANVDSTFEFCY 379
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 95/367 (25%), Positives = 161/367 (43%), Gaps = 40/367 (10%)
Query: 14 TVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSD 73
TV++ SS SS+++ + V + + N Y G Y + +YIG P +DTGSD
Sbjct: 34 TVKLIRKSSHLSSNNIQDIVQAPI------NAY-IGQYLMELYIGTPPIKISGTVDTGSD 86
Query: 74 LTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYEL 129
L W+QC PC+ C +P++ P + + C+ P+C + P C +CDY
Sbjct: 87 LIWVQC-VPCLGCYNQINPMFDPLKSSTYTNISCDSPLC---YKPYIGECSPEKRCDYTY 142
Query: 130 EYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALGCGYNQVPGASYHPLDGILGLGKGKS 188
YAD + GVL ++ G+ ++ + + GCG+N + H + G++GLG G +
Sbjct: 143 GYADSSLTKGVLAQETVTLTSNTGKPISLQGILFGCGHNNTGNFNDHEM-GLIGLGGGPT 201
Query: 189 SIVSQL--------HSQKLIRNVVGHCLSGG---GGGFLFFGDDLYDSSRVVWTS-MSSD 236
S+VSQ+ SQ L+ + +S G G G+ + + V M+S
Sbjct: 202 SLVSQIGPLFGGKKFSQCLVPFLTDITISSQMSFGKGSEVLGEGVVTTPLVQREQDMTSY 261
Query: 237 YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
Y V + + +T ++ ++ DSG+ L + Y + +K ++ + + +
Sbjct: 262 YVTLLGISVEDTYLPMNST-IEKGNMLVDSGTPPNILPQQLYDRVYVEVKNKVPLEPITD 320
Query: 297 APEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVC 356
P LC++ + K + F L T +T TPE KG C
Sbjct: 321 DPSLGPQ-LCYRTQTNLKG-PTLTYHFEGANLLLT--PIQTFIPPTPET------KGVFC 370
Query: 357 LGILNGA 363
L I N A
Sbjct: 371 LAITNCA 377
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 76/251 (30%), Positives = 109/251 (43%), Gaps = 22/251 (8%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPICASLH 112
IG P Y DTGSDLTW QC PC++C + P++ P S VPC C H
Sbjct: 86 IGTPPVDYLGIADTGSDLTWAQC-LPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTC---H 141
Query: 113 APGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGA 172
A +C CDY Y D S G L F + + +GCG+ G
Sbjct: 142 AVDDGHCGVQGVCDYSYTYGDRTYSKGDL-----GFEKITIGSSSVKSVIGCGHASSGGF 196
Query: 173 SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC----LSGGGGGFLFFGDDLYDSSRV 228
+ G++GLG G+ S+VSQ+ I +C LS G F + + V
Sbjct: 197 GFA--SGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPGV 254
Query: 229 VWTSMSSDYT-KYYSPGVAELFFGGE--TTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIM 285
V T + S T YY + + G E K V+ DSG++ ++L + Y + S +
Sbjct: 255 VSTPLISKNTVTYYYITLEAISIGNERHMAFAKQGNVIIDSGTTLSFLPKELYDGVVSSL 314
Query: 286 KKELSAKSLKE 296
K + AK +K+
Sbjct: 315 LKVVKAKRVKD 325
>gi|357463449|ref|XP_003602006.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355491054|gb|AES72257.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 529
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 76/254 (29%), Positives = 108/254 (42%), Gaps = 23/254 (9%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY----RPSNDLVPCEDPICASLH 112
IG P+ + + LD GSDL W+ CD C+ C Y R N+ P +S H
Sbjct: 106 IGTPSTSFLVALDAGSDLLWVPCD--CIHCAPLSASFYSNLDRDLNEYSPSRS--LSSKH 161
Query: 113 APGHH-------NCE--DPAQCDYELEY-ADGGSSLGVLVKDAFAFNYTNGQRLNPRL-- 160
H NC+ QC Y + Y +D SS G+LV+D F +G N +
Sbjct: 162 LSCSHRLCDMGSNCKTSKQQQCPYTINYLSDNTSSSGLLVEDIFHLQSGDGSTSNSSVQA 221
Query: 161 --ALGCGYNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLF 217
+GCG Q G DG++GLG G+SS+ S L LIR+ C + G LF
Sbjct: 222 PVVVGCGMKQSGGYLDGTAPDGLIGLGPGESSVPSFLAKSGLIRDSFSLCFNEDDSGRLF 281
Query: 218 FGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVT 277
FGD + + Y GV G + + FDSG+S+T+L
Sbjct: 282 FGDQGSTVQQSTPFLLVDGMFSTYIVGVETCCIGNSCPKVTSFNAQFDSGTSFTFLPGHA 341
Query: 278 YQTLTSIMKKELSA 291
Y + K+++A
Sbjct: 342 YGAIAEEFDKQVNA 355
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 101/368 (27%), Positives = 151/368 (41%), Gaps = 59/368 (16%)
Query: 33 VGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP 92
V +L VH G + + M IG PA Y +DTGSDL W QC PCV C P
Sbjct: 87 VAPALQVPVHAG---NGEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCK-PCVECFNQSTP 142
Query: 93 LYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
++ PS+ +PC +C+ L + C A+C Y Y D S+ GVL + F
Sbjct: 143 VFDPSSSSTYAALPCSSTLCSDLPS---SKCTS-AKCGYTYTYGDSSSTQGVLAAETFTL 198
Query: 149 NYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
T P +A GCG + G + G++GLG+G S+VSQL K +CL
Sbjct: 199 AKTK----LPDVAFGCG-DTNEGDGFTQGAGLVGLGRGPLSLVSQLGLNKF-----SYCL 248
Query: 209 SGGG---------GGFLFFGDDLYDSSRVVWTSMSSDYTK--YYSPGVAELFFGGETTGL 257
+ G + +S V T + + ++ +Y + L G L
Sbjct: 249 TSLDDTSKSPLLLGSLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITL 308
Query: 258 KNLP----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
+ V+ DSG+S TYL Y+ L KK +A+ A + + L
Sbjct: 309 PSSAFAVQDDGTGGVIVDSGTSITYLELQGYRAL----KKAFAAQMKLPAADGSGIGLDT 364
Query: 308 KGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLII-SNKGNVCLGILNGAEVG 366
P V V+ L D +L E Y+++ S G +CL ++ G
Sbjct: 365 CFEAPASGVDQVEVPKLVFHLDGAD------LDLPAENYMVLDSGSGALCLTVM-----G 413
Query: 367 LQDLNVIG 374
+ L++IG
Sbjct: 414 SRGLSIIG 421
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 93/320 (29%), Positives = 133/320 (41%), Gaps = 38/320 (11%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSNDL----VPCE 104
Y VT+ IG PA + +DTGSDL+W+QC PC C PL+ PS+ VPC+
Sbjct: 118 YVVTLGIGTPAVQQIVLIDTGSDLSWVQCK-PCGAGECYAQKDPLFDPSSSSSYASVPCD 176
Query: 105 DPICASLHAPGH-HNCEDPAQ--CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
C L A + H C A C+Y +EY + ++ GV + +
Sbjct: 177 SDACRKLAAGAYGHGCTSGAAALCEYGIEYGNRATTTGVYSTETLTLKP---GVVVADFG 233
Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFG 219
GCG +Q Y DG+LGLG S+VSQ SQ +CL + GG GFL G
Sbjct: 234 FGCGDHQ--HGPYEKFDGLLGLGGAPESLVSQTSSQ--FGGPFSYCLPPTSGGAGFLALG 289
Query: 220 -----DDLYDSSRVVWTSMSS--DYTKYYSPGVAELFFGGETTGLK----NLPVVFDSGS 268
++ ++T M +Y + + GG + + +V DSG+
Sbjct: 290 APNSSSSSTAAAGFLFTPMRRIPSVPTFYVVTLTGISVGGAPLAVPPSAFSSGMVIDSGT 349
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
T L Y L S + +S L L C+ F +V T+AL
Sbjct: 350 VITGLPATAYAALRSAFRSAMSEYRLLPPSNGAVLDTCYD----FTGHTNVT--VPTIAL 403
Query: 329 SFTDGKTRTLFELTPEAYLI 348
+F+ G T L TP L+
Sbjct: 404 TFSGGATIDL--ATPAGVLV 421
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 82/251 (32%), Positives = 110/251 (43%), Gaps = 33/251 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y V + G PAR Y + +DTGS L+WLQC V C PL+ PS + C
Sbjct: 115 SGNYYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSC 174
Query: 104 EDPICASLHAPGHHN--CEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
C+SL +N CE + C Y Y D S+G L +D Q L P
Sbjct: 175 TSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTL--APSQTL-PGF 231
Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFFG 219
GCG Q + GILGLG+ K S++ Q+ S+ +CL + GGGGFL G
Sbjct: 232 VYGCG--QDSDGLFGRAAGILGLGRNKLSMLGQVSSK--FGYAFSYCLPTRGGGGFLSIG 287
Query: 220 DDLYDSSRVVWTSMSSDYTKYYSPGVAELFF--------GGETTGLK----NLPVVFDSG 267
S +T M++D PG L+F GG G+ +P + DSG
Sbjct: 288 KASLAGSAYKFTPMTTD------PGNPSLYFLRLTAITVGGRALGVAAAQYRVPTIIDSG 341
Query: 268 SSYTYLNRVTY 278
+ T L Y
Sbjct: 342 TVITRLPMSVY 352
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 55/155 (35%), Positives = 79/155 (50%), Gaps = 15/155 (9%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G + + + IG PA Y +DTGSDL W QC PC C + P P++ P S +PC
Sbjct: 95 GEFLMKLAIGTPAETYSAIMDTGSDLIWTQCK-PCKDCFDQPTPIFDPKKSSSFSKLPCS 153
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
+CA+L +C D C+Y Y D S+ GVL + FAF G ++ GC
Sbjct: 154 SDLCAALPI---SSCSD--GCEYLYSYGDYSSTQGVLATETFAF----GDASVSKIGFGC 204
Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
G + G+ + G++GLG+G S++SQL K
Sbjct: 205 GEDN-DGSGFSQGAGLVGLGRGPLSLISQLGEPKF 238
>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 535
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 84/290 (28%), Positives = 121/290 (41%), Gaps = 35/290 (12%)
Query: 29 LFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVE 88
LF +GS F +GN +Y + IG P + + LD GSDL+W+ CD C++C
Sbjct: 83 LFPSLGSHTFF--YGNDLDWLHY-TWIDIGTPNVSFLVALDAGSDLSWVPCD--CIQCAP 137
Query: 89 APHPLYRP-SNDL-------------VPCEDPICASLHAPGHH--NCEDPAQCDYELEYA 132
LY+P DL + C +C G H N +DP C Y +YA
Sbjct: 138 LSASLYKPLDRDLSEYRPSLSTTSRHLSCNHQLCEL----GSHCKNLKDP--CPYIADYA 191
Query: 133 D-GGSSLGVLVKDAFAF------NYTNGQRLNPRLALGCGYNQVPG-ASYHPLDGILGLG 184
D SS G LV+D + + +R+ + LGCG Q G DG++GLG
Sbjct: 192 DPNTSSSGFLVEDILHLASVSDDSNSTQKRVQASVILGCGRKQTGGYLDGAAPDGVMGLG 251
Query: 185 KGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPG 244
G S+ S L LIR C G G + FGD + S + + Y
Sbjct: 252 PGSISVPSLLAKAGLIRKSFSLCFDVNGSGTILFGDQGHTSQKSTPLLPTQGNYDAYLIE 311
Query: 245 VAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSL 294
V G + DSG+S+TYL Y + K+++A+ +
Sbjct: 312 VESYCVGNSCLKQSGFKALVDSGASFTYLPIDVYNKIVLEFDKQVNAQRI 361
>gi|218185382|gb|EEC67809.1| hypothetical protein OsI_35378 [Oryza sativa Indica Group]
Length = 344
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 51/140 (36%), Positives = 79/140 (56%), Gaps = 29/140 (20%)
Query: 237 YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
+ YYSPG A L+F + G+ + V+ K LS+ SL++
Sbjct: 63 FGNYYSPGSATLYFDRHSLGMNPMDVI----------------------KGGLSSTSLEQ 100
Query: 297 APEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVC 356
D +LPLCWKG++ F++V DVKK F++L L+F + + E+ PE +LI++ GNVC
Sbjct: 101 V-SDPSLPLCWKGQKAFESVSDVKKEFKSLQLNFGNN---AVMEIPPENFLIVTEYGNVC 156
Query: 357 LGILNGAEVGLQDLNVIGGI 376
LGIL+G+ + + N+IG I
Sbjct: 157 LGILHGSRL---NFNIIGDI 173
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/52 (42%), Positives = 32/52 (61%), Gaps = 3/52 (5%)
Query: 100 LVPCEDPICASLHAPGHH---NCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
+V +DP+ +LH G N P QCDYE++YADG S++G L+ D F+
Sbjct: 1 MVRADDPLFVALHEDGRSGDGNHMSPTQCDYEIKYADGASTIGALIVDQFSL 52
>gi|242094534|ref|XP_002437757.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
gi|241915980|gb|EER89124.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
Length = 575
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 98/342 (28%), Positives = 149/342 (43%), Gaps = 44/342 (12%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS----NDLVPCEDPICASLH 112
+G P+ + + LDTGSDL WL C+ C C + +Y PS + VPC P+C
Sbjct: 127 VGTPSSKFLVALDTGSDLFWLPCE--CKLCAKNGSTMYSPSLSSTSKTVPCGHPLCERPD 184
Query: 113 APGHHNCEDPAQCDYELEY--ADGGSSLGVLVKDAFAF----NYTNGQRLNPRLALGCGY 166
A + + C YE++Y A+ GSS GVLV+D G+ + + GCG
Sbjct: 185 ACATAG-KSSSSCPYEVKYVSANTGSS-GVLVEDVLHLVDGGGGGGGKAVQAPIVFGCGQ 242
Query: 167 NQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLI-RNVVGHCLSGGGGGFLFFGD- 220
Q + GA+ G++GLG K S+ S L S L+ + C S G G + FGD
Sbjct: 243 VQTGAFLRGAA---AGGLMGLGLDKVSVPSALASSGLVASDSFSMCFSRDGVGRINFGDA 299
Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQT 280
D + + S YY+ V + + ++ VV DSG+S+TYL+ Y
Sbjct: 300 GSPDQAETPLIAAGSLQPSYYNISVGAITVDSKAMAVEFTAVV-DSGTSFTYLDDPAYTF 358
Query: 281 LTSIMKKELSAKSLKEAPEDETLPLCWK---GRRPFKNVHDVKKCFRTLALSFTDGKTRT 337
LT+ +S S E C++ G+ K R A+S T K
Sbjct: 359 LTTNFNSRVSEASETYGSGYEKFEFCYRLSPGQTSMK---------RLPAMSLTT-KGGA 408
Query: 338 LFELT-PEAYLIISNKG------NVCLGILNGAEVGLQDLNV 372
+F +T P ++ S G CLGI+ + + +D +
Sbjct: 409 VFPITWPIIPVLASTNGGPYHPIGYCLGIIKTSILSTEDATI 450
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 101/358 (28%), Positives = 148/358 (41%), Gaps = 60/358 (16%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
+G Y + ++IG P + Y L LDTGSDL W+QC PC+ C E P Y P S + + C
Sbjct: 189 SGEYFMDVFIGTPPKHYSLILDTGSDLNWIQC-VPCIACFEQSGPYYDPKESSSFENITC 247
Query: 104 EDPICASLHAPGHHN-CEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYT--NG---QRL 156
DP C + +P C+D Q C Y Y D ++ G + F N T NG Q+
Sbjct: 248 HDPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKH 307
Query: 157 NPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS------ 209
+ GCG +N+ +H G+LGLG+G S SQL S + GH S
Sbjct: 308 VENVMFGCGHWNR---GLFHGAAGLLGLGRGPLSFASQLQS------IYGHSFSYCLVDR 358
Query: 210 ---GGGGGFLFFGDD--LYDSSRVVWTSM----SSDYTKYYSPGVAELFFGGETTGLKNL 260
L FG+D L + +TS + +Y G+ + GE +
Sbjct: 359 NSDTSVSSKLIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKIPEE 418
Query: 261 P----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
+ DSG++ TY Y+ + K++ L E PL
Sbjct: 419 TWHLSKEGGGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVEG----FPPL----- 469
Query: 311 RPFKNVHDVKKC-FRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGL 367
+P NV ++K + F+DG +++ E Y I VCL IL + L
Sbjct: 470 KPCYNVSGIEKMELPDFGILFSDG---AMWDFPVENYFIQIEPDLVCLAILGTPKSAL 524
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 76/273 (27%), Positives = 114/273 (41%), Gaps = 26/273 (9%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL----VP 102
TG Y V + +G PA + + DTGSD TW+QC PCV C + PL+ P+ +
Sbjct: 162 TGNYVVPIRLGTPAARFTVVFDTGSDTTWVQCQ-PCVAYCYQQKEPLFTPTKSATYANIS 220
Query: 103 CEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
C C+ L G C Y ++Y DG ++G +D Y +
Sbjct: 221 CTSSYCSDLDTRGCSG----GHCLYAVQYGDGSYTVGFYAQDTLTLGYDTVKDFR----F 272
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGD 220
GCG + G++GLG+GK+S+ Q + + V +C+ + G GFL FG
Sbjct: 273 GCGEKNR--GLFGKAAGLMGLGRGKTSVPVQAYDK--YSGVFAYCIPATSSGTGFLDFGP 328
Query: 221 DLYDSSRVVWTSMSSDY-TKYYSPGVAELFFGGE-----TTGLKNLPVVFDSGSSYTYLN 274
++ T M D +Y G+ + GG T + + DSG+ T L
Sbjct: 329 GAPAAANARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVFSDAGALVDSGTVITRLP 388
Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
Y+ L S K + K AP L C+
Sbjct: 389 PSAYEPLRSAFAKGMEGLGYKTAPAFSILDTCY 421
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 90/333 (27%), Positives = 136/333 (40%), Gaps = 36/333 (10%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
T Y V++ +G P R + DTGSDL+W+QC PC C + PL+ PS VPC
Sbjct: 135 TANYIVSVGLGTPKRDLLVVFDTGSDLSWVQCK-PCDGCYQQHDPLFDPSQSTTYSAVPC 193
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL--- 160
C L + +C +C YE+ Y D + G L +D ++ + +L
Sbjct: 194 GAQECRRLDS---GSCSS-GKCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSDQLQEF 249
Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFF 218
GCG + + DG+ GLG+ + S+ SQ ++ +CL S G+L
Sbjct: 250 VFGCGDDDT--GLFGKADGLFGLGRDRVSLASQAAAK--YGAGFSYCLPSSSTAEGYLSL 305
Query: 219 GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------DSGSSYT 271
G ++R SD +Y + + G T ++ P VF DSG+ T
Sbjct: 306 GSAAPPNARFTAMVTRSDTPSFYYLNLVGIKVAGRT--VRVSPAVFRTPGTVIDSGTVIT 363
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
L Y L S + S K AP L C+ F + V+ ++AL F
Sbjct: 364 RLPSRAYAALRSSFAGLMRRYSYKRAPALSILDTCYD----FTGRNKVQ--IPSVALLFD 417
Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
G T L L ++NK CL + +
Sbjct: 418 GGAT---LNLGFGEVLYVANKSQACLAFASNGD 447
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 93/361 (25%), Positives = 150/361 (41%), Gaps = 51/361 (14%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC-----DAPCVRCVEAPHPLYRPSNDL-- 100
TG Y V +G PA+P+ L DTGSDLTW++C +P + +P ++RP+N
Sbjct: 107 TGQYFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDASPLASPR-VFRPANSKSW 165
Query: 101 --VPCEDPICASLHAPGHHNCE----DPAQCDYELEYADGGSSLGVLVKDAFAFNYT-NG 153
+PC C S NC PA C Y+ Y D S+ GV+ DA + +G
Sbjct: 166 APIPCSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSARGVVGTDAATIALSGSG 225
Query: 154 QRLNPRL---ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGHC 207
+L LGC G S+ DG+L LG S S+ ++ + +V H
Sbjct: 226 SDRKAKLQEVVLGC-TTSYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHL 284
Query: 208 LSGGGGGFLFFGD--DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL-------- 257
+L FG + SR + + +Y+ V + G+ +
Sbjct: 285 APRNATSYLTFGPVGAAHSPSRTPLL-LDAQVAPFYAVTVDAVSVAGKALNIPAEVWDVK 343
Query: 258 KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
KN + DSG+S T L Y+ + + + K+L+ + D PF+ +
Sbjct: 344 KNGGAILDSGTSLTILATPAYKAVVAALSKQLA--RVPRVTMD-----------PFEYCY 390
Query: 318 DVKKCFRTLALSFTDGKTRTLFELTP--EAYLIISNKGNVCLGILNGAEVGLQDLNVIGG 375
+ R A+ + + L P ++Y+I + G C+G+ G G ++VIG
Sbjct: 391 NWTATRRPPAVPRLEVRFAGSARLRPPTKSYVIDAAPGVKCIGLQEGVWPG---VSVIGN 447
Query: 376 I 376
I
Sbjct: 448 I 448
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 100/353 (28%), Positives = 143/353 (40%), Gaps = 51/353 (14%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSNDL----VPCE 104
Y VT+ IG PA + +DTGSDL+W+QC PC C PL+ PS+ VPC+
Sbjct: 91 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCK-PCGAGECYAQKDPLFDPSSSSSYASVPCD 149
Query: 105 DPICASLHAPGH-HNCED-----PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
C L A + H C A C+Y +EY + ++ GV + +
Sbjct: 150 SDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLTLKP---GVVVA 206
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFL 216
GCG +Q Y DG+LGLG S+VSQ SQ +CL + GG GFL
Sbjct: 207 DFGFGCGDHQH--GPYEKFDGLLGLGGAPESLVSQTSSQ--FGGPFSYCLPPTSGGAGFL 262
Query: 217 FFGDDLYDSSRVVWTSMS-------SDYTKYYSPGVAELFFGGETTGLK----NLPVVFD 265
G SS + +S +Y + + GG + + +V D
Sbjct: 263 TLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFSSGMVID 322
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
SG+ T L Y L S + +S L L C+ F +V T
Sbjct: 323 SGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYD----FTGHANVT--VPT 376
Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGD 378
++L+F+ G T +L A +++ CL A G N IG IG+
Sbjct: 377 ISLTFSGGAT---IDLAAPAGVLVDG----CL-----AFAGAGTDNAIGIIGN 417
>gi|449445106|ref|XP_004140314.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449479851|ref|XP_004155727.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 523
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 79/296 (26%), Positives = 128/296 (43%), Gaps = 37/296 (12%)
Query: 29 LFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVE 88
LF GS ++F GN + +Y + +G P+ P+ + LD GSDL W+ CD C++C
Sbjct: 84 LFPSEGSQVIF--FGNEFNWLHY-TWIDLGTPSVPFLVALDVGSDLLWVPCD--CIQCAP 138
Query: 89 APHPLYRPSNDLVPCEDPICASLHAP---GHHNC---------EDPAQCDYELEY-ADGG 135
Y + + +P +S GH C DP C Y+ +Y +D
Sbjct: 139 LSANYYSVLDRDLSEYNPALSSTSKHLFCGHQLCAWSTTCKSANDP--CTYKRDYYSDNT 196
Query: 136 SSLGVLVKDAFAF----NYTNGQRLNPRLALGCGYNQ----VPGASYHPLDGILGLGKGK 187
S+ G +++D + L + GCG Q + GA+ DG++GLG G
Sbjct: 197 STSGFMIEDKLQLTSFSKHGTHSLLQASVVFGCGRKQSGSYLDGAA---PDGVMGLGPGN 253
Query: 188 SSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVV-WTSMSSDYTKYYSPGVA 246
S+ + L + L+RN C G G + FGDD + + + + ++ Y+ GV
Sbjct: 254 ISVPTLLAQEGLVRNTFSLCFDNNGSGRILFGDDGPATQQTTQFLPLFGEFAAYFI-GVE 312
Query: 247 ELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKS----LKEAP 298
G + DSGSS+TYL Y+ + K++ + L+E P
Sbjct: 313 SFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKVNATRIVLRELP 368
>gi|449533544|ref|XP_004173734.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like, partial [Cucumis sativus]
Length = 408
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 79/269 (29%), Positives = 122/269 (45%), Gaps = 37/269 (13%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRC----------VEAPHPLYRPSNDL----VP 102
IG P+ + + LD GSDL W+ C+ C++C ++ YRPS+ +
Sbjct: 109 IGTPSVSFLVALDAGSDLLWVPCN--CIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHIS 166
Query: 103 CEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAF-----NYTNGQR 155
C +C S +C+ P Q C Y ++Y + SS G+L++D N +N
Sbjct: 167 CSHNLCDS-----GQSCQSPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCENSSNCTI 221
Query: 156 LNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
P + LGCG Q G + P DG+ GLG G+ S++S L ++L++N C + G
Sbjct: 222 QAP-VILGCGMKQSGGYLSGVAP-DGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGS 279
Query: 214 GFLFFGDDLYDSSRVV-WTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTY 272
G +FFGD+ S + + + Y Y GV + + DSG+S+TY
Sbjct: 280 GRIFFGDEGPASQQTTSFVPLDGKYETYIV-GVEACCIENSCLKQTSFKALIDSGTSFTY 338
Query: 273 LNRVTYQTLTSIMKKEL---SAKSLKEAP 298
L Y+ + K L SA S K P
Sbjct: 339 LPEEAYENIVIEFDKRLNTTSAVSFKGYP 367
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 95/299 (31%), Positives = 125/299 (41%), Gaps = 47/299 (15%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD---AP---CVRCVEAPHPLYRPSN---- 98
G Y V+M G P + L DTGSDL WLQC AP C + + P + S
Sbjct: 51 GQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATL 110
Query: 99 DLVPCEDPICASLHAPGHH----NCEDPAQCDYELEYADGGSSLGVLVKD-AFAFNYTNG 153
+VPC C + AP H + P C Y +YADG S+ G L +D A N T+G
Sbjct: 111 SVVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSG 170
Query: 154 QRLNPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC---LS 209
+A GCG NQ G S+ G++GLG+G+ S +Q S L +C L
Sbjct: 171 GAAVRGVAFGCGTRNQ--GGSFSGTGGVIGLGQGQLSFPAQ--SGSLFAQTFSYCLLDLE 226
Query: 210 GGGGG----FLFFGDDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGETTG------- 256
GG G FLF G ++ +T + S+ +Y GV + G
Sbjct: 227 GGRRGRSSSFLFLGRPERRAA-FAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWA 285
Query: 257 ---LKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET----LPLCWK 308
L N V DSGS+ TYL Y L S + L P T L LC+
Sbjct: 286 IDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASV---HLPRIPSSATFFQGLELCYN 341
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 109/370 (29%), Positives = 147/370 (39%), Gaps = 66/370 (17%)
Query: 42 HGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN--- 98
+ N P Y V + IG P +P L LDTGSDL W QC PC C PSN
Sbjct: 406 YANGVPDTEYLVHLAIGTPPQPVQLILDTGSDLVWTQCR-PCPVCFSRALGPLDPSNSST 464
Query: 99 -DLVPCEDPICASL--HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN--G 153
D++PC P+C +L + G HN + C Y YADG + G L + F F + G
Sbjct: 465 FDVLPCSSPVCDNLTWSSCGKHNWGN-QTCVYVYAYADGSITTGHLDAETFTFAAADGTG 523
Query: 154 QRLNPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
Q P LA GCG +N G GI G G+G S+ SQL HC +
Sbjct: 524 QATVPDLAFGCGLFNN--GIFTSNETGIAGFGRGALSLPSQLKVDNF-----SHCFTAIT 576
Query: 213 GG-----FLFFGDDLYDSS--RVVWTSMSSDYTK---YYSPGVAELFFGGETTGLKNLPV 262
G L +LY + V T + +++ YY L G T G LP+
Sbjct: 577 GSEPSSVLLGLPANLYSDADGAVQSTPLVQNFSSLRAYY------LSLKGITVGSTRLPI 630
Query: 263 ---------------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
+ DSG+ T L + Y+ + ++ + A LC+
Sbjct: 631 PESTFALKQDGTGGTIIDSGTGMTTLPQDAYKLVHDAFTAQVRLP-VDNATSSSLSRLCF 689
Query: 308 KGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGN--VCLGILNGAE 364
P + DV K L L F +G T +L E Y+ + G CL I G
Sbjct: 690 SFSVPRRAKPDVPK----LVLHF-EGAT---LDLPRENYMFEFEDAGGSVTCLAINAG-- 739
Query: 365 VGLQDLNVIG 374
DL +IG
Sbjct: 740 ---DDLTIIG 746
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 121/283 (42%), Gaps = 47/283 (16%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G + + + IG PA Y +DTGSDL W QC PC C + P P++ P S +PC
Sbjct: 95 GEFLMNLAIGTPAETYSAIMDTGSDLIWTQCK-PCKVCFDQPTPIFDPEKSSSFSKLPCS 153
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
+C +L +C D C+Y Y D S+ GVL + F F G ++ GC
Sbjct: 154 SDLCVALPI---SSCSD--GCEYRYSYGDHSSTQGVLATETFTF----GDASVSKIGFGC 204
Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS----GGGGGFLFFGD 220
G + G +Y G++GLG+G S++SQL K +CL+ G L G
Sbjct: 205 GEDN-RGRAYSQGAGLVGLGRGPLSLISQLGVPKF-----SYCLTSIDDSKGISTLLVGS 258
Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV---------------VFD 265
+ S + T + + ++ P L G + G LP+ + D
Sbjct: 259 EATVKS-AIPTPLIQNPSR---PSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIID 314
Query: 266 SGSSYTYLNRVTYQTL----TSIMKKELSAKSLKEAPEDETLP 304
SG++ TYL + L S MK ++ A E TLP
Sbjct: 315 SGTTITYLKDSAFAALKKEFISQMKLDVDASGSTELELCFTLP 357
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 79/277 (28%), Positives = 115/277 (41%), Gaps = 35/277 (12%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y V + +G P +L +D+GSD+ W+QC PC +C PL+ P+ V C
Sbjct: 127 SGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR-PCEQCYAQTDPLFDPAASSSFSGVSC 185
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
IC +L G D +CDY + Y DG + G L + T Q +A+G
Sbjct: 186 GSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTAVQ----GVAIG 241
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFGD 220
CG+ + G+LGLG G S+V QL V +CL+ GG G L G
Sbjct: 242 CGHRN--SGLFVGAAGLLGLGWGAMSLVGQLGGAA--GGVFSYCLASRGAGGAGSLVLG- 296
Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNL----------PVVFDSGSSY 270
R + +Y G+ + GGE L++ VV D+G++
Sbjct: 297 ------RTEAVPRGRRASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAV 350
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
T L R Y L + A L +P L C+
Sbjct: 351 TRLPREAYAALRGAFDGAMGA--LPRSPAVSLLDTCY 385
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 87/351 (24%), Positives = 141/351 (40%), Gaps = 55/351 (15%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G Y + + IG PA+P+ +DTGSDL W QC PC +C P++ P S +PC
Sbjct: 93 GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ-PCTQCFNQSTPIFNPQGSSSFSTLPCS 151
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
+C +L +P N C Y Y DG + G + + F G P + GC
Sbjct: 152 SQLCQALSSPTCSN----NFCQYTYGYGDGSETQGSMGTETLTF----GSVSIPNITFGC 203
Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYD 224
G N G G++G+G+G S+ SQL K +C++ G + L
Sbjct: 204 GENN-QGFGQGNGAGLVGMGRGPLSLPSQLDVTKF-----SYCMTPIGSSTP--SNLLLG 255
Query: 225 SSRVVWTSMSSDYTKYYSPGVAELFF---GGETTGLKNLP----------------VVFD 265
S T+ S + T S + ++ G + G LP ++ D
Sbjct: 256 SLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIID 315
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
SG++ TY YQ++ +++ + + LC++ N+ T
Sbjct: 316 SGTTLTYFVNNAYQSVRQEFISQINLPVVNGS--SSGFDLCFQTPSDPSNLQ-----IPT 368
Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
+ F G EL E Y I + G +CL + + + Q +++ G I
Sbjct: 369 FVMHFDGGD----LELPSENYFISPSNGLICLAMGSSS----QGMSIFGNI 411
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 94/352 (26%), Positives = 146/352 (41%), Gaps = 44/352 (12%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
G + + + +G PA PY +DTGSDL W QC PCV C P++ P+ +PC
Sbjct: 114 GEFLMDLSVGTPALPYAAIVDTGSDLVWTQCK-PCVECFNQTTPVFDPAASSTYAALPCS 172
Query: 105 DPICASLHAPGHHNCEDPAQCD----YELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
+CA L + + Y Y D S+ GVL + F T ++ P +
Sbjct: 173 SALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETF----TLARQKVPGV 228
Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGD 220
A GCG + G + G++GLG+G S+VSQL + + + G L
Sbjct: 229 AFGCG-DTNEGDGFTQGAGLVGLGRGPLSLVSQLGIDRFSYCLTSLDDAAGRSPLLLGSA 287
Query: 221 DLYDSSRVVWTSMSSDYTKYYS-PGVAELFFGGETTGLKNLP---------------VVF 264
+S + ++ K S P + G T G L V+
Sbjct: 288 AGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSAFAIQDDGTGGVIV 347
Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNV-HDVKKCF 323
DSG+S TYL Y+ L +S ++ + + L LC++G P V DV+
Sbjct: 348 DSGTSITYLELRAYRALRKAFVAHMSLPTVDAS--EIGLDLCFQG--PAGAVDQDVQVQV 403
Query: 324 RTLALSFTDGKTRTLFELTPEAYLII-SNKGNVCLGILNGAEVGLQDLNVIG 374
L L F G +L E Y+++ S G +CL ++ + L++IG
Sbjct: 404 PKLVLHFDGGAD---LDLPAENYMVLDSASGALCLTVMAS-----RGLSIIG 447
>gi|449451627|ref|XP_004143563.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 532
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 79/269 (29%), Positives = 122/269 (45%), Gaps = 37/269 (13%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRC----------VEAPHPLYRPSNDL----VP 102
IG P+ + + LD GSDL W+ C+ C++C ++ YRPS+ +
Sbjct: 109 IGTPSVSFLVALDAGSDLLWVPCN--CIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHIS 166
Query: 103 CEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAF-----NYTNGQR 155
C +C S +C+ P Q C Y ++Y + SS G+L++D N +N
Sbjct: 167 CSHNLCDS-----GQSCQSPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCENSSNCTI 221
Query: 156 LNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
P + LGCG Q G + P DG+ GLG G+ S++S L ++L++N C + G
Sbjct: 222 QAP-VILGCGMKQSGGYLSGVAP-DGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGS 279
Query: 214 GFLFFGDDLYDSSRVV-WTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTY 272
G +FFGD+ S + + + Y Y GV + + DSG+S+TY
Sbjct: 280 GRIFFGDEGPASQQTTSFVPLDGKYETYIV-GVEACCIENSCLKQTSFKALIDSGTSFTY 338
Query: 273 LNRVTYQTLTSIMKKEL---SAKSLKEAP 298
L Y+ + K L SA S K P
Sbjct: 339 LPEEAYENIVIEFDKRLNTTSAVSFKGYP 367
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 121/283 (42%), Gaps = 47/283 (16%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G + + + IG PA Y +DTGSDL W QC PC C + P P++ P S +PC
Sbjct: 95 GEFLMNLAIGTPAETYSAIMDTGSDLIWTQCK-PCKVCFDQPTPIFDPEKSSSFSKLPCS 153
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
+C +L +C D C+Y Y D S+ GVL + F F G ++ GC
Sbjct: 154 SDLCVALPI---SSCSD--GCEYRYSYGDHSSTQGVLATETFTF----GDASVSKIGFGC 204
Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS----GGGGGFLFFGD 220
G + G +Y G++GLG+G S++SQL K +CL+ G L G
Sbjct: 205 GEDNR-GRAYSQGAGLVGLGRGPLSLISQLGVPKF-----SYCLTSIDDSKGISTLLVGS 258
Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV---------------VFD 265
+ S + T + + ++ P L G + G LP+ + D
Sbjct: 259 EATVKS-AIPTPLIQNPSR---PSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIID 314
Query: 266 SGSSYTYLNRVTYQTL----TSIMKKELSAKSLKEAPEDETLP 304
SG++ TYL + L S MK ++ A E TLP
Sbjct: 315 SGTTITYLKDNAFAALKKEFISQMKLDVDASGSTELELCFTLP 357
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/328 (28%), Positives = 132/328 (40%), Gaps = 37/328 (11%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC-VRCVEAPHPLYRP----SNDLVPC 103
G Y M +G PA+PY + +DTGS LTWLQC +PC V C P++ P S V C
Sbjct: 115 GNYVTRMGLGTPAKPYIMVVDTGSSLTWLQC-SPCRVSCHRQSGPVFDPKTSSSYAAVSC 173
Query: 104 EDPICASLHAPGHHN--CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
P C L + C C Y+ Y D S+G L KD +F G P
Sbjct: 174 SSPQCDGLSTATLNPAVCSPSNVCIYQASYGDSSFSVGYLSKDTVSF----GANSVPNFY 229
Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFFGD 220
GCG + + G++GL + K S++ QL + +CL S G+L G
Sbjct: 230 YGCGQDN--EGLFGRSAGLMGLARNKLSLLYQL--APTLGYSFSYCLPSTSSSGYLSIGS 285
Query: 221 DLYDSSRVVWTSMSSD-------YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYL 273
Y+ +T M S+ + VA ++ +LP + DSG+ T L
Sbjct: 286 --YNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSLPTIIDSGTVITRL 343
Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR-RPFKNVHDVKKCFR---TLALS 329
Y L+ + + S K A L C++G+ + V V F TL LS
Sbjct: 344 PTSVYTALSKAVAAAMKG-STKRAAAYSILDTCFEGQASKLRAVPAVSMAFSGGATLKLS 402
Query: 330 F------TDGKTRTLFELTPEAYLIISN 351
DG T L + II N
Sbjct: 403 AGNLLVDVDGATTCLAFAPARSAAIIGN 430
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/162 (34%), Positives = 83/162 (51%), Gaps = 13/162 (8%)
Query: 44 NVYPTGY---YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND- 99
N+ P+ Y + V +GQPA P +DTGS++ W++C APC RC + PL PS
Sbjct: 89 NLLPSTYEPLFLVNFSMGQPATPQLAIMDTGSNILWVRC-APCKRCTQQNGPLLDPSKSS 147
Query: 100 ---LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN-GQR 155
+PC + +C +AP + C QC Y L YA G SS GVL + F+ ++ G
Sbjct: 148 TYASLPCTNTMCH--YAPSAY-CNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSDEGVN 204
Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ 197
P + GC + G+ GLGKG +S V+++ S+
Sbjct: 205 AVPSVVFGCSHENGDYKD-RRFTGVFGLGKGITSFVTRMGSK 245
>gi|297819834|ref|XP_002877800.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323638|gb|EFH54059.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 531
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 104/390 (26%), Positives = 169/390 (43%), Gaps = 69/390 (17%)
Query: 20 SSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC 79
+S++ + F+ ++ ++ G++Y Y NV+ +G P + + LDTGSDL WL C
Sbjct: 76 ASNNEDTPVTFDGGNLTVSIKLLGSLY---YANVS--VGTPPSSFLVALDTGSDLFWLPC 130
Query: 80 D--APCVRCVE-------APHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQ-C 125
+ C+R +E P LY P ++ + C D C G C P C
Sbjct: 131 NCGTTCIRDLEDIGVPQSVPLNLYTPNASTTSSSIRCSDKRCF-----GSKKCSSPKSIC 185
Query: 126 DYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP---RLALGCGYNQVP-GASYHPLDGIL 181
Y++ Y++ + G L++D T + L P + LGCG Q + ++G+L
Sbjct: 186 PYQISYSNSTGTTGTLLQDVLHL-ATEDENLTPVKTNVTLGCGQKQTGLFQRNNSVNGVL 244
Query: 182 GLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLY-DSSRVVWTSMSSDYT 238
GLG S+ S L + + C G G + FGD Y D + S++ +
Sbjct: 245 GLGIKGYSVPSLLAKANITADSFSMCFGRVIGNVGRISFGDKGYTDQEETPFISVAP--S 302
Query: 239 KYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAP 298
Y V + GG+ G + L FD+GSS+T+L Y LT KS +
Sbjct: 303 TAYGLNVTGVSVGGDPVGTR-LFAKFDTGSSFTHLMEPAYGVLT---------KSFDDLV 352
Query: 299 EDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISN------- 351
ED+ P+ PF+ +D+ ++ F + + +I++N
Sbjct: 353 EDKRRPV--DPELPFEFCYDLSPNATSIEFPFVE------MTFVGGSKIILNNPFFTART 404
Query: 352 -----KGNV--CLGILNGAEVGLQDLNVIG 374
+GNV CLG+L VGL+ +NVIG
Sbjct: 405 QARHGEGNVMYCLGVLK--SVGLK-INVIG 431
>gi|224133616|ref|XP_002327639.1| predicted protein [Populus trichocarpa]
gi|222836724|gb|EEE75117.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 81/318 (25%), Positives = 134/318 (42%), Gaps = 37/318 (11%)
Query: 50 YYNVTMYIGQPARPYFLDLDTGSDLTWLQCD-APCVRCVEAPH-----PLYRP----SND 99
Y NV+ +G P+ + + LDTGS+L WL CD + CV + +P +Y P +++
Sbjct: 63 YANVS--VGTPSVSFLVALDTGSNLLWLPCDCSSCVHSLRSPSGTVDLNIYSPNTSSTSE 120
Query: 100 LVPCEDPICASLHAPGHHNC-EDPAQCDYELEY-ADGGSSLGVLVKDAFAFNYTNGQR-- 155
VPC +C+ C D + C Y++ Y ++G S+ G +V+D + Q
Sbjct: 121 KVPCNSTLCSQTQ---RDRCPSDQSNCPYQVVYLSNGTSTTGYIVQDLLHLISDDSQSKA 177
Query: 156 LNPRLALGCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
++ ++ GCG +V S+ +G+ GLG S+ S L C S G
Sbjct: 178 VDAKITFGCG--KVQTGSFLTGGAPNGLFGLGMSNISVPSTLAHNGYTSGSFSMCFSPNG 235
Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTY 272
G + FGD + + Y+ + + GG+ + L +FDSG+S+TY
Sbjct: 236 IGRISFGDKGSTGQGETSFNQGQPRSSLYNISITQTSIGGQASDLV-YSAIFDSGTSFTY 294
Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
LN Y + E K +KE T + PF +D++ L F+
Sbjct: 295 LNDPAYTLI-----AESFNKLVKETRRSST-------QVPFDYCYDIRSFISAQILPFSC 342
Query: 333 GKTRTLFELTPEAYLIIS 350
P L++S
Sbjct: 343 AYANQTEPTIPAVTLVMS 360
>gi|15238055|ref|NP_196570.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
gi|75180764|sp|Q9LX20.1|ASPL1_ARATH RecName: Full=Aspartic proteinase-like protein 1; Flags: Precursor
gi|7960727|emb|CAB92049.1| putative protein [Arabidopsis thaliana]
gi|332004108|gb|AED91491.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
Length = 528
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 75/262 (28%), Positives = 112/262 (42%), Gaps = 31/262 (11%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY-----RPSNDLVPCEDP----- 106
IG P+ + + LDTGS+L W+ C+ CV+C Y + N+ P
Sbjct: 106 IGTPSVSFLVALDTGSNLLWIPCN--CVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVF 163
Query: 107 ICASLHAPGHHNCEDPA-QCDYELEYADGG-SSLGVLVKDAFAFNYTNGQRL-------N 157
+C+ +CE P QC Y + Y G SS G+LV+D Y RL
Sbjct: 164 LCSHKLCDSASDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVK 223
Query: 158 PRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
R+ +GCG Q + G + DG++GLG + S+ S L L+RN C
Sbjct: 224 ARVVIGCGKKQSGDYLDGVA---PDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDS 280
Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKY--YSPGVAELFFGGETTGLKNLPVVFDSGSSYT 271
G ++FG D+ S + + D KY Y GV G + DSG S+T
Sbjct: 281 GRIYFG-DMGPSIQQSTPFLQLDNNKYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFT 339
Query: 272 YLNRVTYQTLTSIMKKELSAKS 293
YL Y+ + + + ++A S
Sbjct: 340 YLPEEIYRKVALEIDRHINATS 361
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 89/325 (27%), Positives = 132/325 (40%), Gaps = 38/325 (11%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
G Y + +G PA Y + +DTGS LTWLQC V C PLY P VPC
Sbjct: 132 GNYVTELGLGTPATSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLYDPRASSTYATVPCS 191
Query: 105 DPICASLHAP--GHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
C L A C C Y+ Y D S+G L +D +F G P
Sbjct: 192 ASQCDELQAATLNPSACSVRNVCIYQASYGDSSFSVGYLSRDTVSF----GSGSYPNFYY 247
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFFGDD 221
GCG + + G++GL + K S++ QL + +CL + G+L G
Sbjct: 248 GCGQDN--EGLFGRSAGLIGLARNKLSLLYQLAPS--LGYSFSYCLPTPASTGYLSIGP- 302
Query: 222 LYDSSRVVWTSMSS---DYTKYYSPGVAELFFGGETTGL-----KNLPVVFDSGSSYTYL 273
Y S +T M+S D + Y+ ++ + GG + +LP + DSG+ T L
Sbjct: 303 -YTSGHYSYTPMASSSLDASLYFV-TLSGMSVGGSPLAVSPAEYSSLPTIIDSGTVITRL 360
Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
Y L+ + + ++ AP L C++G+ V V A++F G
Sbjct: 361 PTAVYTALSKAVAAAM--VGVQSAPAFSILDTCFQGQASQLRVPAV-------AMAFAGG 411
Query: 334 KTRTLFELTPEAYLIISNKGNVCLG 358
T +L + LI + CL
Sbjct: 412 AT---LKLATQNVLIDVDDSTTCLA 433
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 90/347 (25%), Positives = 131/347 (37%), Gaps = 40/347 (11%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR--CVEAPHPLYRPSN----DLV 101
TG Y V++ +G PAR + DTGSDL+W+QC PC C PL+ PS+ V
Sbjct: 82 TGNYVVSVGLGTPARDLTVVFDTGSDLSWVQC-GPCSSGGCYHQQDPLFAPSSSSTFSAV 140
Query: 102 PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT-------NGQ 154
C +P C + D +C YE+ Y D ++G L D T N
Sbjct: 141 RCGEPECPRARQSCSSSPGDD-RCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNASENNS 199
Query: 155 RLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGG 211
P GCG N + DG+ GLG+GK S+ SQ + +CL S
Sbjct: 200 NKLPGFVFGCGENNT--GLFGKADGLFGLGRGKVSLSSQAAGK--YGEGFSYCLPSSSSN 255
Query: 212 GGGFLFFGDDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLP------VV 263
G+L G + +T M S+ +Y + + G + + P ++
Sbjct: 256 AHGYLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALWPAGLI 315
Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
DSG+ T L Y L + + K AP L C+ F +
Sbjct: 316 VDSGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSILDTCYD----FTAHANATVSI 371
Query: 324 RTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGIL---NGAEVGL 367
+AL F G T + L ++ CL NG G+
Sbjct: 372 PAVALVFAGGAT---ISVDFSGVLYVAKVAQACLAFAPNGNGRSAGI 415
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 100/353 (28%), Positives = 143/353 (40%), Gaps = 51/353 (14%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSNDL----VPCE 104
Y VT+ IG PA + +DTGSDL+W+QC PC C PL+ PS+ VPC+
Sbjct: 171 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCK-PCGAGECYAQKDPLFDPSSSSSYASVPCD 229
Query: 105 DPICASLHAPGH-HNCED-----PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
C L A + H C A C+Y +EY + ++ GV + +
Sbjct: 230 SDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLTLKP---GVVVA 286
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFL 216
GCG +Q Y DG+LGLG S+VSQ SQ +CL + GG GFL
Sbjct: 287 DFGFGCGDHQH--GPYEKFDGLLGLGGAPESLVSQTSSQ--FGGPFSYCLPPTSGGAGFL 342
Query: 217 FFGDDLYDSSRVVWTSMS-------SDYTKYYSPGVAELFFGGETTGLK----NLPVVFD 265
G SS + +S +Y + + GG + + +V D
Sbjct: 343 TLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFSSGMVID 402
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
SG+ T L Y L S + +S L L C+ F +V T
Sbjct: 403 SGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYD----FTGHANVT--VPT 456
Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGD 378
++L+F+ G T +L A +++ CL A G N IG IG+
Sbjct: 457 ISLTFSGGAT---IDLAAPAGVLVDG----CL-----AFAGAGTDNAIGIIGN 497
>gi|217426809|gb|ACK44517.1| AT5G10080-like protein [Arabidopsis arenosa]
Length = 506
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 74/263 (28%), Positives = 111/263 (42%), Gaps = 35/263 (13%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY-----RPSNDLVPCEDP----- 106
IG P+ + + LDTGSDL W+ C+ CV+C Y + N+ P
Sbjct: 106 IGTPSVSFLVALDTGSDLLWIPCN--CVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVF 163
Query: 107 ICASLHAPGHHNCEDPA-QCDYELEYADGG-SSLGVLVKDAFAFNYTNGQRL-------N 157
+C+ +CE P QC Y + Y G SS G+LV+D Y RL
Sbjct: 164 LCSHKLCDSASDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVK 223
Query: 158 PRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
R+ +GCG Q + G + DG++GLG + S+ S L L+RN C
Sbjct: 224 ARVVIGCGKKQSGDYLDGVA---PDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDS 280
Query: 214 GFLFFGD---DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
G ++FGD + S+ + +S Y GV G + DSG S+
Sbjct: 281 GRIYFGDMGPSIQQSTPFLQLENNSGYIV----GVEACCIGNSCLKQTSFTTFIDSGQSF 336
Query: 271 TYLNRVTYQTLTSIMKKELSAKS 293
TYL Y+ + + + ++A S
Sbjct: 337 TYLPEEIYRKVALEIDRHINATS 359
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 83/344 (24%), Positives = 145/344 (42%), Gaps = 56/344 (16%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCE 104
G Y + + IG P + +DTGSDL W QC+ PC +C P P++ P + +PCE
Sbjct: 94 GEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCE-PCTQCFSQPTPIFNPQDSSSFSTLPCE 152
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
C L + +N E C Y Y DG ++ G + + F F ++ P +A GC
Sbjct: 153 SQYCQDLPSETCNNNE----CQYTYGYGDGSTTQGYMATETFTFETSS----VPNIAFGC 204
Query: 165 -----GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG---GFL 216
G+ Q GA G++G+G G S+ SQL + +C++ G L
Sbjct: 205 GEDNQGFGQGNGA------GLIGMGWGPLSLPSQLGVGQF-----SYCMTSYGSSSPSTL 253
Query: 217 FFG---DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----------VV 263
G + + S SS YY + + GG+ G+ + ++
Sbjct: 254 ALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMI 313
Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
DSG++ TYL + Y + +++ ++ E+ L C++ V
Sbjct: 314 IDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDES--SSGLSTCFQQPSDGSTVQ-----V 366
Query: 324 RTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGL 367
+++ F G + L + LI +G +CL + + +++G+
Sbjct: 367 PEISMQFDGG----VLNLGEQNILISPAEGVICLAMGSSSQLGI 406
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 73/277 (26%), Positives = 121/277 (43%), Gaps = 27/277 (9%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G Y +T +G P + +DTGSD+ WLQC PC +C + P++ PS +PC
Sbjct: 85 GEYLMTYSVGTPPFNVYGVVDTGSDIVWLQC-KPCEQCYKQTTPIFNPSKSSSYKNIPCS 143
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
+C S+ + +C C+Y + ++D S G L + + T G ++ P+ +G
Sbjct: 144 SNLCQSVR---YTSCNKQNSCEYTINFSDQSYSQGELSVETLTLDSTTGHSVSFPKTVIG 200
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFF 218
CG+N G GI+GLG G S+ +QL S I +CL L F
Sbjct: 201 CGHNN-RGMFQGETSGIVGLGIGPVSLTTQLKSS--IGGKFSYCLLPLLVDSNKTSKLNF 257
Query: 219 GDDLYDSSRVVWTS--MSSDYTKYYSPGVAELFFGGETTGLKNLP------VVFDSGSSY 270
GD S V ++ + D +Y + G + + L ++ DSG++
Sbjct: 258 GDAAVVSGDGVVSTPFVKKDPQAFYYLTLEAFSVGNKRIEFEVLDDSEEGNIILDSGTTL 317
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
T L Y L S + + + + + ++ L LC+
Sbjct: 318 TLLPSHVYTNLESAVAQLVKLDRVDDP--NQLLNLCY 352
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 97/357 (27%), Positives = 140/357 (39%), Gaps = 62/357 (17%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
+G Y + +G PAR ++ LDTGSD+ WLQC APC RC P++ P +PC
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQC-APCRRCYSQSDPIFDPRKSKTYATIPC 197
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF--NYTNGQRLNPRLA 161
P C L + G + C Y++ Y DG ++G + F N G +A
Sbjct: 198 SSPHCRRLDSAGCNTRRK--TCLYQVSYGDGSFTVGDFSTETLTFRRNRVKG------VA 249
Query: 162 LGCGYNQ-------------------VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRN 202
LGCG++ PG + H + K +V + S K
Sbjct: 250 LGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFN-----QKFSYCLVDRSASSKPSSV 304
Query: 203 VVGHCLSGGGGGF--LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNL 260
V G+ F L L V +S T+ PGVA F + G N
Sbjct: 305 VFGNAAVSRIARFTPLLSNPKLDTFYYVELLGISVGGTRV--PGVAASLFKLDQIG--NG 360
Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
V+ DSG+S T L R Y + + + AK+LK AP+ C+ N+++VK
Sbjct: 361 GVIIDSGTSVTRLIRPAYIAMRDAFR--VGAKALKRAPDFSLFDTCFD----LSNMNEVK 414
Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
T+ L F L YLI + G C + L++IG I
Sbjct: 415 --VPTVVLHFRGADV----SLPATNYLIPVDTNGKFCFAFAG----TMGGLSIIGNI 461
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 96/324 (29%), Positives = 142/324 (43%), Gaps = 55/324 (16%)
Query: 17 MSSSSSSSSSSSLFNHVGSS-LLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
+SS+ +S+ +GSS VH + T + V +GQP P F +DTGS L
Sbjct: 34 ISSARFKYLQNSIVKELGSSDFQVDVHQAI-KTSLFFVNFSVGQPPVPQFTIMDTGSSLL 92
Query: 76 WLQCDAPCVRCV--EAPHPLYRP--SNDLVP--CEDPICASLHAPGHHNCEDPAQCDYEL 129
W+QC PC C HP++ P S+ V C+D C +AP H + +C YE
Sbjct: 93 WIQCH-PCKHCSSNHMIHPVFNPALSSTFVECSCDDRFCR--YAPNGHCSSN--KCVYEQ 147
Query: 130 EYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALGCGYNQVPGASYHPLDGILGLGKGKS 188
Y G S GVL K+ F NG + + +A GCG+ GILGLG +
Sbjct: 148 VYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGHENGEQLE-SEFTGILGLGAKPT 206
Query: 189 SIVSQLHSQKLIRNVVGHC---LSGGGGGF--LFFGDD---LYDSSRVVWTSMSSDYTKY 240
S+ QL S+ +C L+ G+ L G+D L D + + + + +
Sbjct: 207 SLAVQLGSK------FSYCIGDLANKNYGYNQLVLGEDADILGDPTPIEFETEN------ 254
Query: 241 YSPGVAELFFGGETTGLKNL---PVVF-----------DSGSSYTYLNRVTYQTLTSIMK 286
G+ + G + G K L PVVF D+G+ YT+L + Y+ L + +K
Sbjct: 255 ---GIYYMNLEGISVGDKQLNIEPVVFKRRGSRTGVILDTGTLYTWLADIAYRELYNEIK 311
Query: 287 KELSAKSLKEAPEDETLPLCWKGR 310
L K + D LC+ GR
Sbjct: 312 SILDPKLERFWFRDF---LCYHGR 332
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 78/278 (28%), Positives = 121/278 (43%), Gaps = 27/278 (9%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCEDP 106
Y ++ IG P + +DT +D W QC+ PC C P++ PS +PC P
Sbjct: 89 YIISFLIGTPPFQLYGVMDTANDNIWFQCN-PCKPCFNTTSPMFDPSKSSTYKTIPCSSP 147
Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALGCG 165
C ++ H + +D C+Y Y S G L D N N ++ + + +GCG
Sbjct: 148 KCKNVEN-THCSSDDKKVCEYSFTYGGEAYSQGDLSIDTLTLNSNNDTPISFKNIVIGCG 206
Query: 166 Y-NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFFG 219
+ N+ P Y + G +GLG+G S +SQL+S I +CL + G G L FG
Sbjct: 207 HRNKGPLEGY--VSGNIGLGRGPLSFISQLNSS--IGGKFSYCLVPLFSNEGISGKLHFG 262
Query: 220 D-DLYDSSRVVWTSMSSDYTKY------YSPGVAELFFGGETTGLKNL-PVVFDSGSSYT 271
D + V T +++ Y S G + F T+ NL + DSG++ T
Sbjct: 263 DKSVVSGVGTVSTPITAGEIGYSTTLNALSVGDHIIKFENSTSKNDNLGNTIIDSGTTLT 322
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
L Y L SI+ + + K ++ LC+K
Sbjct: 323 ILPENVYSRLESIVTSMVKLERAKSP--NQQFKLCYKA 358
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 95/348 (27%), Positives = 148/348 (42%), Gaps = 38/348 (10%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
TG Y V + +G PA+ + L DTGS+LTW++C P ++RP S VPC
Sbjct: 88 TGQYFVKVLVGTPAQEFTLVADTGSELTWVKCAG----GASPPGLVFRPEASKSWAPVPC 143
Query: 104 EDPICASLHAP-GHHNCEDPAQ-CDYELEYADGGS-SLGVLVKDAFAFNYTNGQRLNPR- 159
C L P NC A C Y+ Y +G + +LGV+ D+ G+ +
Sbjct: 144 SSDTC-KLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGGKVAQLQD 202
Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGHCLSGGGGGFL 216
+ LGC G S+ +DG+L LG K S S+ ++ +V H G+L
Sbjct: 203 VVLGCSSTH-DGQSFKSVDGVLSLGNAKISFASRAAARFGGSFSYCLVDHLAPRNATGYL 261
Query: 217 FFGDDLYDSSRVVWTSMSSD-YTKYYSPGVAELFFGGETTGL-------KNLPVVFDSGS 268
FG + T + D +Y V + G+ + K+ V+ DSG+
Sbjct: 262 AFGPGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVWDPKSGGVILDSGT 321
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
+ T L Y+ + + + K L+ + P E C+ P ++ K LA+
Sbjct: 322 TLTVLATPAYKAVVAALTKLLAGVPKVDFPPFEH---CYNWTAPRPGAPEIPK----LAV 374
Query: 329 SFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
FT G R E ++Y+I G C+G+ G G ++VIG I
Sbjct: 375 QFT-GCAR--LEPPAKSYVIDVKPGVKCIGLQEGEWPG---VSVIGNI 416
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 96/346 (27%), Positives = 144/346 (41%), Gaps = 58/346 (16%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICASLH 112
IG PA Y +DTGSDL W QC PCV C + P++ PS+ VPC C+ L
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCK-PCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLP 231
Query: 113 APGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGA 172
C ++C Y Y D S+ GVL + F + P + GCG + G
Sbjct: 232 T---SKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK----LPGVVFGCG-DTNEGD 283
Query: 173 SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG---------GGFLFFGDDLY 223
+ G++GLG+G S+VSQL K +CL+ G +
Sbjct: 284 GFSQGAGLVGLGRGPLSLVSQLGLDKF-----SYCLTSLDDTNNSPLLLGSLAGISEASA 338
Query: 224 DSSRVVWTSMSSDYTK--YYSPGVAELFFGGETTGLKNLP----------VVFDSGSSYT 271
+S V T + + ++ +Y + + G L + V+ DSG+S T
Sbjct: 339 AASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSIT 398
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDE--TLPLCWKGRRPFKNVHDVKKCFRTLALS 329
YL Y+ L KK +A+ A + L LC+ R P K V V+ L
Sbjct: 399 YLEVQGYRAL----KKAFAAQMALPAADGSGVGLDLCF--RAPAKGVDQVE--VPRLVFH 450
Query: 330 FTDGKTRTLFELTPEAYLIIS-NKGNVCLGILNGAEVGLQDLNVIG 374
F G +L E Y+++ G +CL ++ G + L++IG
Sbjct: 451 FDGGAD---LDLPAENYMVLDGGSGALCLTVM-----GSRGLSIIG 488
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 104/363 (28%), Positives = 151/363 (41%), Gaps = 60/363 (16%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL 100
+ G +G Y + +G P R ++ LDTGSD+ W+QC APC RC P++ P
Sbjct: 116 ISGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQC-APCKRCYAQSDPVFDPRKSR 174
Query: 101 ----VPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
+ C P+C L +PG C Q C Y++ Y DG + G + F T
Sbjct: 175 SFASIACRSPLCHRLDSPG---CNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRTR--- 228
Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGG 211
R+ALGCG++ + G+LGLG+G+ S SQ + + + +CL +
Sbjct: 229 -VARVALGCGHDN--EGLFVGAAGLLGLGRGRLSFPSQ--TGRRFNHKFSYCLVDRSASS 283
Query: 212 GGGFLFFGDDLYDSSRVVWTSMSSD---YTKYY------------SPGVAELFFGGETTG 256
+ FGD S +T + S+ T YY PG+ F + TG
Sbjct: 284 KPSSMVFGDSAV-SRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTG 342
Query: 257 LKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW--KGRRPFK 314
N V+ DSG+S T L R Y + A +LK AP+ C+ G+ K
Sbjct: 343 --NGGVIIDSGTSVTRLTRPAYIAFRDAFRA--GASNLKRAPQFSLFDTCFDLSGKTEVK 398
Query: 315 NVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVI 373
V V FR +S L YLI + GN CL + L++I
Sbjct: 399 -VPTVVLHFRGADVS-----------LPASNYLIPVDTSGNFCLAFAG----TMGGLSII 442
Query: 374 GGI 376
G I
Sbjct: 443 GNI 445
>gi|62954896|gb|AAY23265.1| Similar to probable aspartic proteinase (EC 3.4.23.-) - barley
[Oryza sativa Japonica Group]
gi|77548965|gb|ABA91762.1| Aspartic proteinase Asp1 precursor, putative [Oryza sativa
Japonica Group]
gi|125576451|gb|EAZ17673.1| hypothetical protein OsJ_33214 [Oryza sativa Japonica Group]
Length = 96
Score = 85.9 bits (211), Expect = 3e-14, Method: Composition-based stats.
Identities = 34/53 (64%), Positives = 44/53 (83%)
Query: 37 LLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEA 89
++F +HGNVYP+G + VTM IG P +PYFLD+DTGSDLTW++CDAPC C +A
Sbjct: 30 MVFPLHGNVYPSGRFFVTMNIGVPEKPYFLDIDTGSDLTWVECDAPCQSCHQA 82
>gi|125546587|gb|EAY92726.1| hypothetical protein OsI_14476 [Oryza sativa Indica Group]
Length = 530
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 88/338 (26%), Positives = 139/338 (41%), Gaps = 38/338 (11%)
Query: 57 IGQPARPYFLDLDTGSDLTWL--QCD--APCVRCVEAPHPLYRPS----NDLVPCEDPIC 108
+G P + + + LDTGSDL WL QCD P Y PS + VPC C
Sbjct: 122 VGTPGQTFMVALDTGSDLFWLPCQCDGCTPPASAASGSASFYIPSMSSTSQAVPCNSQFC 181
Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTNG--QRLNPRLALGCG 165
C +QC Y++ Y SS G LV+D + + Q L ++ GCG
Sbjct: 182 EL-----RKECSTTSQCPYKMVYVSADTSSSGFLVEDVLYLSTEDAIPQILKAQILFGCG 236
Query: 166 YNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL 222
QV S+ +G+ GLG SI S L + L N C S G G + FGD
Sbjct: 237 --QVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFSRDGIGRISFGDQG 294
Query: 223 YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLT 282
++ + Y+ ++E+ G T L+ +FD+G+S+TYL Y +T
Sbjct: 295 SSDQEETPLDVNPQHPT-YTISISEMTVGNSLTDLE-FSTIFDTGTSFTYLADPAYTYIT 352
Query: 283 SIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTR--TLFE 340
++ A + A + R PF+ +D+ + +T ++F
Sbjct: 353 QSFHAQVHAN--RHAAD---------SRIPFEYCYDLSSSEDRIQTPSISLRTVGGSVFP 401
Query: 341 LTPEAYLIISNKGN--VCLGILNGAEVGLQDLNVIGGI 376
+ E +I + CL I+ A++ + N + G+
Sbjct: 402 VIDEGQVISIQQHEYVYCLAIVKSAKLNIIGQNFMTGL 439
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 86/353 (24%), Positives = 139/353 (39%), Gaps = 59/353 (16%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G Y + + IG PA+P+ +DTGSDL W QC PC +C P++ P S +PC
Sbjct: 93 GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ-PCTQCFNQSTPIFNPQGSSSFSTLPCS 151
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
+C +L +P N C Y Y DG + G + + F G P + GC
Sbjct: 152 SQLCQALQSPTCSN----NSCQYTYGYGDGSETQGSMGTETLTF----GSVSIPNITFGC 203
Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS--GGGGGFLFFGDDL 222
G N G G++G+G+G S+ SQL K +C++ G L
Sbjct: 204 GENN-QGFGQGNGAGLVGMGRGPLSLPSQLDVTKF-----SYCMTPIGSSNSSTLLLGSL 257
Query: 223 YDSSRVVWTSMSSDYTKYYSPGVAELFF---GGETTGLKNLP----------------VV 263
+S T+ S + T S + ++ G + G LP ++
Sbjct: 258 ANS----VTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGII 313
Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
DSG++ TY YQ + +++ + + LC++ N+
Sbjct: 314 IDSGTTLTYFVDNAYQAVRQAFISQMNLSVVNGS--SSGFDLCFQMPSDQSNLQ-----I 366
Query: 324 RTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
T + F G L E Y I + G +CL + + + Q +++ G I
Sbjct: 367 PTFVMHFDGGD----LVLPSENYFISPSNGLICLAMGSSS----QGMSIFGNI 411
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 79/279 (28%), Positives = 115/279 (41%), Gaps = 33/279 (11%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCD-APCVRCVEAPHPLYRPSN----DLVPCED 105
Y VT+ +G PA L++DTGSD++W+QC P C PL+ P+ VPC
Sbjct: 142 YVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAA 201
Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
C+ L + N QC Y + Y DG ++ GV D +N + GCG
Sbjct: 202 ASCSQLAL--YSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGSNALK---GFLFGCG 256
Query: 166 YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---------KLIRNVVGHCLSGGGGGFL 216
+ Q + +DG+LGLG+ S+VSQ S +N VG+ GG
Sbjct: 257 HAQQ--GLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQNSVGYISLGGPSSTA 314
Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK----NLPVVFDSGSSYTY 272
F S + S+D T YY +A + GG+ + V D+G+ T
Sbjct: 315 GF-------STTPLLTASNDPT-YYIVMLAGISVGGQPLSIDASVFASGAVVDTGTVVTR 366
Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR 311
L Y L S + ++ AP L C+ R
Sbjct: 367 LPPTAYSALRSAFRAAMAPYGYPSAPATGILDTCYDFTR 405
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 76/165 (46%), Gaps = 18/165 (10%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
T Y V IG P LDTGSDL W QCDAPC RC P PLY P+ + V C
Sbjct: 97 TATYLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVTYANVSC 156
Query: 104 EDPICASLHA---------PGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
+C +L + + C Y Y DG S+ GVL + F F G
Sbjct: 157 GSRLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTFG--AGT 214
Query: 155 RLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
++ LA GCG + + G G++G+G+G S+VSQL K
Sbjct: 215 TVH-DLAFGCGTDNLGGTDNS--SGLVGMGRGPLSLVSQLGVTKF 256
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 96/354 (27%), Positives = 138/354 (38%), Gaps = 45/354 (12%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPI 107
+G Y + +G P L LDT SDLTWLQC PC RC P++ P + E
Sbjct: 135 SGEYIAKIAVGTPGVEALLALDTASDLTWLQCQ-PCRRCYPQSGPVFDPRHSTSYREMSF 193
Query: 108 -CASLHAPGHHNCEDPAQ--CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
A A G D + C Y + Y DG +++G +++ F G RL PR+++GC
Sbjct: 194 NAADCQALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFIEETLTF--AGGVRL-PRISIGC 250
Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG--GGFLFFGDDL 222
G++ G P GILGLG+G S +Q+ + LSG G L FG
Sbjct: 251 GHDN-KGLFGAPAAGILGLGRGLMSFPNQIDHNGTFSYCLVDFLSGPGSLSSTLTFGAGA 309
Query: 223 YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-----------------VVFD 265
D+S V S + P + G + G +P V+ D
Sbjct: 310 VDTSPPV--SFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDLQLDPYTGRGGVIVD 367
Query: 266 SGSSYTYLNRVTYQTLTSIMKK-ELSAKSLKEAPEDETLPLCWK-GRRPFKNVHDVKKCF 323
SG++ T L R Y + + + C+ G R K V V F
Sbjct: 368 SGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDTCYTVGGRGMKKVPTVSMHF 427
Query: 324 RTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
+L P+ YLI + + G VC A G +++IG I
Sbjct: 428 ----------AGSVEVKLQPKNYLIPVDSMGTVCFAF---AATGDHSVSIIGNI 468
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 92/349 (26%), Positives = 139/349 (39%), Gaps = 46/349 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y + +G PAR ++ LDTGSD+ WLQC APC +C P++ P+ +PC
Sbjct: 126 SGEYFTRIGVGTPARYVYMVLDTGSDVVWLQC-APCRKCYTQADPVFDPTKSRTYAGIPC 184
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
P+C L +PG +N C Y++ Y DG + G + F T R+ALG
Sbjct: 185 GAPLCRRLDSPGCNNKNK--VCQYQVSYGDGSFTFGDFSTETLTFRRTRVT----RVALG 238
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVS--QLHSQKLIRNVVGHCLSGGGGGFLFFGDD 221
CG++ G + S V + +QK +V S +F
Sbjct: 239 CGHDN-EGLFIGAAGLLGLGRGRLSFPVQTGRRFNQKFSYCLVDRSASAKPSSVVFGDSA 297
Query: 222 LYDSSRVVWTSMSSDYTKYY-----------SP--GVAELFFGGETTGLKNLPVVFDSGS 268
+ ++R + +Y SP G++ F + G N V+ DSG+
Sbjct: 298 VSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAG--NGGVIIDSGT 355
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
S T L R Y L + + A LK A E C+ + +VK T+ L
Sbjct: 356 SVTRLTRPAYIALRDAFR--VGASHLKRAAEFSLFDTCFD----LSGLTEVK--VPTVVL 407
Query: 329 SFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
F L YLI + N G+ C + L++IG I
Sbjct: 408 HFRGADV----SLPATNYLIPVDNSGSFCFAFAG----TMSGLSIIGNI 448
>gi|115457374|ref|NP_001052287.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|38346027|emb|CAE01958.2| OSJNBb0071D01.4 [Oryza sativa Japonica Group]
gi|113563858|dbj|BAF14201.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|215740420|dbj|BAG97076.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222626225|gb|EEE60357.1| hypothetical protein OsJ_13479 [Oryza sativa Japonica Group]
Length = 530
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 88/338 (26%), Positives = 139/338 (41%), Gaps = 38/338 (11%)
Query: 57 IGQPARPYFLDLDTGSDLTWL--QCD--APCVRCVEAPHPLYRPS----NDLVPCEDPIC 108
+G P + + + LDTGSDL WL QCD P Y PS + VPC C
Sbjct: 122 VGTPGQTFMVALDTGSDLFWLPCQCDGCTPPASAASGSASFYIPSMSSTSQAVPCNSQFC 181
Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTNG--QRLNPRLALGCG 165
C +QC Y++ Y SS G LV+D + + Q L ++ GCG
Sbjct: 182 EL-----RKECSTTSQCPYKMVYVSADTSSSGFLVEDVLYLSTEDAIPQILKAQILFGCG 236
Query: 166 YNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL 222
QV S+ +G+ GLG SI S L + L N C S G G + FGD
Sbjct: 237 --QVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFSRDGIGRISFGDQG 294
Query: 223 YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLT 282
++ + Y+ ++E+ G T L+ +FD+G+S+TYL Y +T
Sbjct: 295 SSDQEETPLDVNPQHPT-YTISISEITVGNSLTDLE-FSTIFDTGTSFTYLADPAYTYIT 352
Query: 283 SIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTR--TLFE 340
++ A + A + R PF+ +D+ + +T ++F
Sbjct: 353 QSFHAQVHAN--RHAAD---------SRIPFEYCYDLSSSEDRIQTPSISLRTVGGSVFP 401
Query: 341 LTPEAYLIISNKGN--VCLGILNGAEVGLQDLNVIGGI 376
+ E +I + CL I+ A++ + N + G+
Sbjct: 402 VIDEGQVISIQQHEYVYCLAIVKSAKLNIIGQNFMTGL 439
>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 508
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 96/356 (26%), Positives = 146/356 (41%), Gaps = 53/356 (14%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND--- 99
GN+Y Y NV+ IG P + + LDTGSDL WL C+ C +C P Y D
Sbjct: 101 GNLY---YANVS--IGTPGLYFLVALDTGSDLFWLPCE--CTKC-----PTYLTKRDNGK 148
Query: 100 ---------------LVPCEDPICASLHAPGHHNCEDPAQCDYELEY-ADGGSSLGVLVK 143
VPC +C + + + C Y+ Y ++ SS G LV+
Sbjct: 149 FWLNHYSSNASSTSIRVPCSSSLCEL----ANQCSSNKSSCPYQTHYLSENSSSAGYLVQ 204
Query: 144 DAFAFNYTNGQRLNP---RLALGCGYNQVPGAS-YHPLDGILGLGKGKSSIVSQLHSQKL 199
D T+ +L P ++ LGCG Q S +G++GLG GK S+ S L SQ L
Sbjct: 205 DILHMA-TDDSQLKPVDVKVTLGCGKVQTGKFSNVTAPNGLIGLGMGKVSVPSFLASQGL 263
Query: 200 IRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN 259
+ C G G + FGD R + +S Y+ + ++ T + +
Sbjct: 264 TTDSFSMCFGYYGYGRIDFGDIGPVGQRETPFNPAS---LSYNVTILQIIVTNRPTNV-H 319
Query: 260 LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
L + DSG+S+TYL Y +T M +A L+ D P + R +
Sbjct: 320 LTAIIDSGASFTYLTDPFYSIITENMD---AAMELERIKSDSDFPFEYCYRLSLATI--- 373
Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGG 375
F+ L+FT R +T + + +CL I+ ++ + N GG
Sbjct: 374 ---FQQPNLNFTMEGGRKFDVITSYVSVDTDDGPALCLAIVKSTDINVIGHNFFGG 426
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 86/353 (24%), Positives = 139/353 (39%), Gaps = 59/353 (16%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G Y + + IG PA+P+ +DTGSDL W QC PC +C P++ P S +PC
Sbjct: 93 GEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ-PCTQCFNQSTPIFNPQGSSSFSTLPCS 151
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
+C +L +P N C Y Y DG + G + + F G P + GC
Sbjct: 152 SQLCQALQSPTCSN----NSCQYTYGYGDGSETQGSMGTETLTF----GSVSIPNITFGC 203
Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS--GGGGGFLFFGDDL 222
G N G G++G+G+G S+ SQL K +C++ G L
Sbjct: 204 GENN-QGFGQGNGAGLVGMGRGPLSLPSQLDVTKF-----SYCMTPIGSSTSSTLLLGSL 257
Query: 223 YDSSRVVWTSMSSDYTKYYSPGVAELFF---GGETTGLKNLP----------------VV 263
+S T+ S + T S + ++ G + G LP ++
Sbjct: 258 ANS----VTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGII 313
Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
DSG++ TY YQ + +++ + + LC++ N+
Sbjct: 314 IDSGTTLTYFADNAYQAVRQAFISQMNLSVVNGS--SSGFDLCFQMPSDQSNLQ-----I 366
Query: 324 RTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
T + F G L E Y I + G +CL + + + Q +++ G I
Sbjct: 367 PTFVMHFDGGD----LVLPSENYFISPSNGLICLAMGSSS----QGMSIFGNI 411
>gi|116308959|emb|CAH66084.1| H0209A05.1 [Oryza sativa Indica Group]
Length = 530
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 88/338 (26%), Positives = 139/338 (41%), Gaps = 38/338 (11%)
Query: 57 IGQPARPYFLDLDTGSDLTWL--QCD--APCVRCVEAPHPLYRPS----NDLVPCEDPIC 108
+G P + + + LDTGSDL WL QCD P Y PS + VPC C
Sbjct: 122 VGTPGQTFMVALDTGSDLFWLPCQCDGCTPPASAASGSASFYIPSMSSTSQAVPCNSQFC 181
Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTNG--QRLNPRLALGCG 165
C +QC Y++ Y SS G LV+D + + Q L ++ GCG
Sbjct: 182 EL-----RKECSTTSQCPYKMVYVSADTSSSGFLVEDVLYLSTEDAIPQILKAQILFGCG 236
Query: 166 YNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL 222
QV S+ +G+ GLG SI S L + L N C S G G + FGD
Sbjct: 237 --QVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFSRDGIGRISFGDQG 294
Query: 223 YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLT 282
++ + Y+ ++E+ G T L+ +FD+G+S+TYL Y +T
Sbjct: 295 SSDQEETPLDVNPQHPT-YTISISEITVGNSLTDLE-FSTIFDTGTSFTYLADPAYTYIT 352
Query: 283 SIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTR--TLFE 340
++ A + A + R PF+ +D+ + +T ++F
Sbjct: 353 QSFHAQVHAN--RHAAD---------SRIPFEYCYDLSSSEDRIQTPSISLRTVGGSVFP 401
Query: 341 LTPEAYLIISNKGN--VCLGILNGAEVGLQDLNVIGGI 376
+ E +I + CL I+ A++ + N + G+
Sbjct: 402 VIDEGQVISIQQHEYVYCLAIVKSAKLNIIGQNFMTGL 439
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 83/273 (30%), Positives = 116/273 (42%), Gaps = 38/273 (13%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSNDL----VPCE 104
Y VT+ G P+ P L +DTGSD++W+QC PC +C PL+ PS + C
Sbjct: 131 YVVTLGFGTPSVPQVLLMDTGSDVSWVQC-TPCNSTKCYPQKDPLFDPSKSSTYAPIACN 189
Query: 105 DPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL- 162
C L H+ C QC Y +EYADG S GV + L P + +
Sbjct: 190 TDACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGVYSNETLT--------LAPGITVE 241
Query: 163 ----GCGYNQV-PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGF 215
GCG +Q P Y DG+LGLG S+V Q S + +CL GF
Sbjct: 242 DFHFGCGRDQRGPSDKY---DGLLGLGGAPVSLVVQTSS--VYGGAFSYCLPALNSEAGF 296
Query: 216 LFFGDDLY-DSSRVVWTSMS--SDYTKYYSPGVAELFFGGETTGLKNLP----VVFDSGS 268
L G + S V+T M Y +Y + + GG+ + ++ DSG+
Sbjct: 297 LVLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQSAFRGGMIIDSGT 356
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDE 301
T L Y L + ++K L A L P D+
Sbjct: 357 VDTELPETAYNALEAALRKALKAYPL--VPSDD 387
>gi|145356007|ref|XP_001422234.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582474|gb|ABP00551.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 87/340 (25%), Positives = 144/340 (42%), Gaps = 28/340 (8%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL 100
+ G V TG V Y + Y L +DTGS T++ C C RC E H Y +
Sbjct: 29 LRGGVLGTGTL-VAEYALADGQTYDLIVDTGSARTYVPCKG-CARCGEHAHGYYDYDRSM 86
Query: 101 ----VPCEDPICASL-HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
+ C + A+L C+ +C Y + YA+G SS G +V+D
Sbjct: 87 EFERLDCGEASDATLCEETMKGTCQSDGRCSYVVSYAEGSSSRGYVVRDRVRLGEGT--- 143
Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG--G 213
L+ LA GC + DG+ G G+G +++ +QL S LI NV C+ G G G
Sbjct: 144 LSAMLAFGCEEAETNAIYEQKADGLFGFGRGTATVHAQLASAGLIENVFSFCVEGFGANG 203
Query: 214 GFLFFG--DDLYDSSRVVWTSMSSDYTK--YYSPGVAELFFGGE-TTGLKNLPVVFDSGS 268
G L G D D+ + T + +D +++ + G L + DSG+
Sbjct: 204 GVLTLGRFDFGADAPALARTPLVADPANPAFHNVRTSSWKLGDSLIEHLNSYTTTLDSGT 263
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLK--EAPEDETLPLCWKGRRPFKNV----HDVKKC 322
++T++ R + + + + + + L+ P+ + +C+ N+ V +
Sbjct: 264 TFTFVPRSVWVSFKTRLDTQATQAGLEIVAGPDPQYDDVCYGVSAAAMNMTLSQSTVSEW 323
Query: 323 FRTLALSFTDGKTRTLFELTPEAYLII--SNKGNVCLGIL 360
F L +++ G + T L PE YL +N C+GI
Sbjct: 324 FPPLTIAYEGGVSLT---LGPENYLFAHETNSAAFCVGIF 360
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 98/379 (25%), Positives = 151/379 (39%), Gaps = 53/379 (13%)
Query: 33 VGSSLLFQVHG--NVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC---- 86
VG + F V G + Y G Y + +G P R + + +DTGSD+ W+ C++ C C
Sbjct: 46 VGGVVDFSVQGSPDPYLVGLYFTKVKLGSPPREFNVQIDTGSDVLWVCCNS-CNNCPRTS 104
Query: 87 -VEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGV 140
+ + S+ LV C DPIC S C QC Y +Y DG + G
Sbjct: 105 GLGIQLNFFDSSSSSTAGLVHCSDPICTSAVQTTVTQCSPQTNQCSYTFQYEDGSGTSGY 164
Query: 141 LVKDAFAFNYTNGQRL----NPRLALGCGYNQVPGASY--HPLDGILGLGKGKSSIVSQL 194
V D F+ G+ L + + GC Q + +DGI G G+G+ S++SQL
Sbjct: 165 YVSDTLYFDAILGESLVVNSSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQL 224
Query: 195 HSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGET 254
+ + V HCL G G G +V++ + +Y+ + + G+
Sbjct: 225 STHGITPRVFSHCLKGEGIGGGILVLGEILEPGMVYSPLVPS-QPHYNLNLQSIAVNGKL 283
Query: 255 TGLKNLPVVF----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
+ P VF DSG++ YL Y S + +S P
Sbjct: 284 LPID--PSVFATSNSQGTIVDSGTTLAYLVAEAYDPFVSAVNVIVSPS---------VTP 332
Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI---ISNKGNVCLGILN 361
+ KG + + V + F + +F G + L PE YLI S G+V I
Sbjct: 333 IISKGNQCYLVSTSVSQMFPLASFNFAGGASMV---LKPEDYLIPFGPSQGGSVMWCI-- 387
Query: 362 GAEVGLQDLNVIGGIGDFV 380
G Q + + +GD V
Sbjct: 388 ----GFQKVQGVTILGDLV 402
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 86/318 (27%), Positives = 132/318 (41%), Gaps = 50/318 (15%)
Query: 68 LDTGSDLTWLQCDAPCVRCVEAPHPLY----RPSNDLVPCEDPICASLHAPGHHNCEDPA 123
+DTGSDL W QC APC+ C + P P + + +PC CASL +P
Sbjct: 1 MDTGSDLIWTQC-APCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFK----K 55
Query: 124 QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-RLALGCGYNQVPGASYHPLDGILG 182
C Y+ Y D S+ GVL + F F N ++ +A GCG + G++G
Sbjct: 56 MCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCG--SLNAGDLANSSGMVG 113
Query: 183 LGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFGDDLYDSSRVVWTSMSSDYTK 239
G+G S+VSQL + +CL+ L+FG SS + T
Sbjct: 114 FGRGPLSLVSQLGPSRF-----SYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTP 168
Query: 240 Y-YSPGVAELFF---GGETTGLKNLP---------------VVFDSGSSYTYLNRVTYQT 280
+ +P + ++F + G K LP V+ DSG+S T+L + Y+
Sbjct: 169 FVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEA 228
Query: 281 LTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFE 340
+ + + ++ + D L C++ P +V L F D TL
Sbjct: 229 VRRGLVSAIPLPAMND--TDIGLDTCFQWPPP----PNVTVTVPDLVFHF-DSANMTLL- 280
Query: 341 LTPEAYLII-SNKGNVCL 357
PE Y++I S G +CL
Sbjct: 281 --PENYMLIASTTGYLCL 296
>gi|226499286|ref|NP_001147826.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|195613980|gb|ACG28820.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 545
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 83/291 (28%), Positives = 125/291 (42%), Gaps = 44/291 (15%)
Query: 37 LLFQVHGNVYPTG-YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH---- 91
L F + Y +G Y + +G P + + LDTGSDL W+ CD C +C P
Sbjct: 95 LTFAAGNDTYQSGTLYYAEVELGTPNATFLVALDTGSDLFWVPCD--CRQCATIPSANAT 152
Query: 92 ----PLYRP-------SNDLVPCEDPICASLHAPGHHNCEDPA---QCDYELEYADGG-S 136
P RP +++ V C++P+C G N A C YE++Y S
Sbjct: 153 GPDAPPLRPYSPRRSSTSEQVACDNPLC------GRRNGCSAATNGSCPYEVQYVSANTS 206
Query: 137 SLGVLVKDAFAFNYTN------GQRLNPRLALGCGYNQVPGASYH----PLDGILGLGKG 186
S GVLV+D G+ L + GCG Q GA +DG++GLG G
Sbjct: 207 SSGVLVQDVLHLTRERPGPGAAGEALQAPVVFGCGQVQT-GAFLDDGGGAVDGLMGLGMG 265
Query: 187 KSSIVSQLHSQKLI-RNVVGHCLSGGGGGFLFFGD-DLYDSSRVVWTSMSSDYTKYYSPG 244
K S+ S L + L+ + C G G + FGD + +T S + T Y+
Sbjct: 266 KVSVPSALAASGLVASDSFSMCFGDDGVGRVNFGDAGSRGQAETPFTVRSLNPT--YNVS 323
Query: 245 VAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLK 295
+ G E+ + V DSG+S+TYL+ Y L + ++S + +
Sbjct: 324 FTSIGIGSESVAAE-FAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVN 373
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 85/311 (27%), Positives = 134/311 (43%), Gaps = 39/311 (12%)
Query: 68 LDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNC---- 119
+DT S+LTW+QC APC C + PL+ P++ ++PC C +L
Sbjct: 142 VDTASELTWVQC-APCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACG 200
Query: 120 --EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY-NQVPGASYHP 176
E P+ C Y L Y DG S GVL D + G+ ++ GCG NQ P +
Sbjct: 201 GGEQPS-CSYTLSYRDGSYSQGVLAHDKLSL---AGEVIDG-FVFGCGTSNQGP---FGG 252
Query: 177 LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFGDDL---YDSSRVVW 230
G++GLG+ + S++SQ Q V +CL G L GDD +S+ +V+
Sbjct: 253 TSGLMGLGRSQLSLISQTMDQ--FGGVFSYCLPLKESESSGSLVLGDDTSVYRNSTPIVY 310
Query: 231 TSMSSDYTK--YYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKE 288
T+M SD + +Y + + GG+ V+ DSG+ T L Y + + +
Sbjct: 311 TTMVSDPVQGPFYFVNLTGITIGGQEVESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQ 370
Query: 289 LSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI 348
+ +AP L C+ F+ V +L F +G + + Y +
Sbjct: 371 FA--EYPQAPGFSILDTCFN-LTGFREVQ-----IPSLKFVF-EGNVEVEVDSSGVLYFV 421
Query: 349 ISNKGNVCLGI 359
S+ VCL +
Sbjct: 422 SSDSSQVCLAL 432
>gi|224033419|gb|ACN35785.1| unknown [Zea mays]
gi|413934980|gb|AFW69531.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 543
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 83/291 (28%), Positives = 124/291 (42%), Gaps = 44/291 (15%)
Query: 37 LLFQVHGNVYPTG-YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH---- 91
L F + Y +G Y + +G P + + LDTGSDL W+ CD C +C P
Sbjct: 93 LTFAAGNDTYQSGTLYYAEVELGTPNATFLVALDTGSDLFWVPCD--CRQCATIPSANGT 150
Query: 92 ----PLYRP-------SNDLVPCEDPICASLHAPGHHNCEDPA---QCDYELEYADGG-S 136
P RP ++ V C++P+C G N A C YE++Y S
Sbjct: 151 GQDAPSLRPYSPRRSSTSKQVACDNPLC------GQRNGCSAATNGSCPYEVQYVSANTS 204
Query: 137 SLGVLVKDAFAFNYTN------GQRLNPRLALGCGYNQVPGASYH----PLDGILGLGKG 186
S GVLV+D G+ L + GCG Q GA +DG++GLG G
Sbjct: 205 SSGVLVQDVLHLTRERPGPGAAGEALQAPVVFGCGQVQT-GAFLDGGGGAVDGLMGLGMG 263
Query: 187 KSSIVSQLHSQKLI-RNVVGHCLSGGGGGFLFFGD-DLYDSSRVVWTSMSSDYTKYYSPG 244
K S+ S L + L+ + C G G + FGD + +T S + T Y+
Sbjct: 264 KVSVPSALAASGLVASDSFSMCFGDDGVGRVNFGDAGSRGQAETPFTVRSLNPT--YNVS 321
Query: 245 VAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLK 295
+ G E+ + V DSG+S+TYL+ Y L + ++S + +
Sbjct: 322 FTSIGVGSESVAAE-FAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVN 371
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 92/313 (29%), Positives = 136/313 (43%), Gaps = 38/313 (12%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRP---- 96
G+ Y + Y T+ +G PA P L LDTGS LTW+QC PC +C PL+ P
Sbjct: 121 GSSYDSQEYVATVGLGTPAVPQTLILDTGSSLTWVQCK-PCNSSQCYPQRLPLFDPNTSS 179
Query: 97 SNDLVPCEDPICASLHAP-GHHNCEDPAQ--CDYELEYADGGSSLGVLVKDAFAFNYTNG 153
S VPC+ C +L A C C YE+ Y G + G DA
Sbjct: 180 SYSPVPCDSQECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALTL---GP 236
Query: 154 QRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGG 211
+ R GCG++Q G + DG+LGLG+ S+ Q +++ V HCL +G
Sbjct: 237 GAIVKRFHFGCGHHQQRG-KFDMADGVLGLGRLPQSLAWQASARR-GGGVFSHCLPPTGV 294
Query: 212 GGGFLFFGDDLYDSSRVVWTSMSS--DYTKYYSPGVAELFFGGETTGLKNLP-------V 262
GFL G +D+S V+T + + D +Y + G+ L ++P V
Sbjct: 295 STGFLALGAP-HDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQ---LLDIPPAVFREGV 350
Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
+ DSG+ + L Y L + + ++ L AP L C+ F +V
Sbjct: 351 ITDSGTVLSALQETAYTALRTAFRSAMAEYPL--APPVGHLDTCFN----FTGYDNVT-- 402
Query: 323 FRTLALSFTDGKT 335
T++L+F G T
Sbjct: 403 VPTVSLTFRGGAT 415
>gi|226501154|ref|NP_001146408.1| uncharacterized protein LOC100279988 [Zea mays]
gi|219887047|gb|ACL53898.1| unknown [Zea mays]
gi|414587777|tpg|DAA38348.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 416
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 70/258 (27%), Positives = 108/258 (41%), Gaps = 19/258 (7%)
Query: 47 PTGYYNVTMYIGQPARPYFLDLDTGSDLTWL--QCD--APCVRCVEAPHPLYRP----SN 98
P+ + + +G P + + + LDTGSDL WL QCD P Y P ++
Sbjct: 3 PSSLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAASGSATFYIPGMSSTS 62
Query: 99 DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTNG--QR 155
VPC C C QC Y++ Y G SS G LV+D + N Q
Sbjct: 63 KAVPCNSNFCDL-----QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQI 117
Query: 156 LNPRLALGCGYNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
L ++ LGCG Q +G+ GLG + S+ S L + L N C G G
Sbjct: 118 LKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIG 177
Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLN 274
+ FGD ++ + Y+ ++ + G + T + + +FD+G+S+TYL
Sbjct: 178 RISFGDQESSDQEETPLDINRQHPT-YAITISGITVGNKPTDM-DFITIFDTGTSFTYLA 235
Query: 275 RVTYQTLTSIMKKELSAK 292
Y +T ++ A
Sbjct: 236 DPAYTYITQSFHAQVQAN 253
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 79/279 (28%), Positives = 115/279 (41%), Gaps = 33/279 (11%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCD-APCVRCVEAPHPLYRPSN----DLVPCED 105
Y VT+ +G PA L++DTGSD++W+QC P C PL+ P+ VPC
Sbjct: 131 YVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAA 190
Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
C+ L + N QC Y + Y DG ++ GV D +N + GCG
Sbjct: 191 ASCSQLAL--YSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGSNALK---GFLFGCG 245
Query: 166 YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---------KLIRNVVGHCLSGGGGGFL 216
+ Q + +DG+LGLG+ S+VSQ S +N VG+ GG
Sbjct: 246 HAQQ--GLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQNSVGYISLGGPSSTA 303
Query: 217 FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK----NLPVVFDSGSSYTY 272
F S + S+D T YY +A + GG+ + V D+G+ T
Sbjct: 304 GF-------STTPLLTASNDPT-YYIVMLAGISVGGQPLSIDASVFASGAVVDTGTVVTR 355
Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR 311
L Y L S + ++ AP L C+ R
Sbjct: 356 LPPTAYSALRSAFRAAMAPYGYPSAPATGILDTCYDFTR 394
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 89/307 (28%), Positives = 128/307 (41%), Gaps = 35/307 (11%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV--RCVEAPHPLYRPSND----LV 101
T Y VT +G P L++DTGSDL+W+QC PC C PL+ P+ V
Sbjct: 134 TSNYVVTASLGTPGMAQTLEVDTGSDLSWVQCK-PCAAPSCYRQKDPLFDPAQSSSYAAV 192
Query: 102 PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKD--AFAFNYTNGQRLNPR 159
PC CA L + + AQC Y + Y DG ++ GV D A N T L
Sbjct: 193 PCGRSACAGLGI--YASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLAANATVQGFL--- 247
Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLF 217
GCG+ Q G + +DG+LG G+ + S+V Q + V +CL G+L
Sbjct: 248 --FGCGHAQ-SGGLFTGIDGLLGFGREQPSLVQQ--TAGAYGGVFSYCLPTKSSTTGYLT 302
Query: 218 FGDDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLP----VVFDSGSSYT 271
G + T + S + YY + + GG+ + V D+G+ T
Sbjct: 303 LGGPSGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPASAFAAGTVVDTGTVIT 362
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
L Y L S + ++ S AP L C+ F V ++AL+F+
Sbjct: 363 RLPPAAYAALRSAFRSGMA--SYPSAPPIGILDTCYS----FAGYGTVN--LTSVALTFS 414
Query: 332 DGKTRTL 338
G T TL
Sbjct: 415 SGATMTL 421
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 85/311 (27%), Positives = 134/311 (43%), Gaps = 39/311 (12%)
Query: 68 LDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNC---- 119
+DT S+LTW+QC APC C + PL+ P++ ++PC C +L
Sbjct: 141 VDTASELTWVQC-APCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACG 199
Query: 120 --EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY-NQVPGASYHP 176
E P+ C Y L Y DG S GVL D + G+ ++ GCG NQ P +
Sbjct: 200 GGEQPS-CSYTLSYRDGSYSQGVLAHDKLSL---AGEVIDG-FVFGCGTSNQGP---FGG 251
Query: 177 LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFGDDL---YDSSRVVW 230
G++GLG+ + S++SQ Q V +CL G L GDD +S+ +V+
Sbjct: 252 TSGLMGLGRSQLSLISQTMDQ--FGGVFSYCLPLKESESSGSLVLGDDTSVYRNSTPIVY 309
Query: 231 TSMSSDYTK--YYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKE 288
T+M SD + +Y + + GG+ V+ DSG+ T L Y + + +
Sbjct: 310 TTMVSDPVQGPFYFVNLTGITIGGQEVESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQ 369
Query: 289 LSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI 348
+ +AP L C+ F+ V +L F +G + + Y +
Sbjct: 370 FA--EYPQAPGFSILDTCFN-LTGFREVQ-----IPSLKFVF-EGNVEVEVDSSGVLYFV 420
Query: 349 ISNKGNVCLGI 359
S+ VCL +
Sbjct: 421 SSDSSQVCLAL 431
>gi|356546446|ref|XP_003541637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 160
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 38/75 (50%), Positives = 54/75 (72%), Gaps = 1/75 (1%)
Query: 302 TLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILN 361
+LP+CWK + FK++HDV F+ +AL FT K +L +L PE+YLI++ G VCLGIL+
Sbjct: 58 SLPICWKDTKTFKSLHDVTSNFKPIALRFTKSK-NSLLQLQPESYLIVTKHGKVCLGILD 116
Query: 362 GAEVGLQDLNVIGGI 376
G E+GL + N+IG I
Sbjct: 117 GTEIGLGNTNIIGDI 131
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 85/314 (27%), Positives = 130/314 (41%), Gaps = 33/314 (10%)
Query: 15 VRMSSSSSSSSSSSLFNHVGSSLLFQ---VHGNVYPTGYYNVTMYIGQPARPYFLDLDTG 71
+R+ SS ++S + G + F + G +G Y + +G P + ++ LDTG
Sbjct: 3 IRVKKLSSLGATSRNLSKPGGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTG 62
Query: 72 SDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDY 127
SD+ WLQC APC C P++ P S V C P+C L +PG C C Y
Sbjct: 63 SDIVWLQC-APCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLESPG---CNQRQTCLY 118
Query: 128 ELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY-NQVPGASYHPLDGILGLGKG 186
++ Y DG + G V + F T + ++ALGCG+ N+ L G+ G
Sbjct: 119 QVSYGDGSYTTGEFVTETLTFRRTKVE----QVALGCGHDNEGLFVGAAGLLGLGRGGLS 174
Query: 187 KSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYY----- 241
S + +QK +V S +F + ++R + +Y
Sbjct: 175 FPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYVELL 234
Query: 242 ------SP--GVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKS 293
+P G+ F + TG N V+ D G+S T LN+ Y L + A S
Sbjct: 235 GISVGGTPVSGITASHFKLDRTG--NGGVIIDCGTSVTRLNKPAYIALRDAFRA--GASS 290
Query: 294 LKEAPEDETLPLCW 307
LK APE C+
Sbjct: 291 LKSAPEFSLFDTCY 304
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 55/160 (34%), Positives = 76/160 (47%), Gaps = 11/160 (6%)
Query: 45 VYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----L 100
V G + + + IG P R + +DTGSDL W QC PC +C + P++ P
Sbjct: 105 VAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQC-KPCQQCFDQSTPIFDPKQSSSFYK 163
Query: 101 VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF-NYTNGQRLNPR 159
+ C +C +L C C+Y Y D S+ GVL + F F + T Q P
Sbjct: 164 ISCSSELCGALPT---STCSSDG-CEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPG 219
Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
L GCG N G + G++GLG+G S+VSQL QK
Sbjct: 220 LGFGCG-NDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQKF 258
>gi|356540838|ref|XP_003538891.1| PREDICTED: peroxidase [Glycine max]
Length = 829
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 96/376 (25%), Positives = 154/376 (40%), Gaps = 66/376 (17%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMY----IGQPARPYFLDLDTGSDLTWLQCD-APCVRCVEA 89
S L F Y G + + +G P + + LDTGSDL WL C+ CVR VE+
Sbjct: 82 SPLTFVPANETYQIGAFGFLHFANVSVGTPPLSFLVALDTGSDLFWLPCNCTKCVRGVES 141
Query: 90 -----PHPLY----RPSNDLVPCEDPICASLHAPGHHNC-EDPAQCDYELEY-ADGGSSL 138
+Y ++ V C +C C + C YE+ Y ++G S+
Sbjct: 142 NGEKIAFNIYDLKGSSTSQTVLCNSNLCEL-----QRQCPSSDSICPYEVNYLSNGTSTT 196
Query: 139 GVLVKDAFAF--NYTNGQRLNPRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVS 192
G LV+D + + + R+ GCG Q + GA+ +G+ GLG G S+ S
Sbjct: 197 GFLVEDVLHLITDDDETKDADTRITFGCGQVQTGAFLDGAAP---NGLFGLGMGNESVPS 253
Query: 193 QLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY--------YSPG 244
L + L N C G G + FGD+ +S+ T + Y+
Sbjct: 254 ILAKEGLTSNSFSMCFGSDGLGRITFGDN---------SSLVQGKTPFNLRALHPTYNIT 304
Query: 245 VAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
V ++ GG L+ +FDSG+S+T+LN Y+ +T+ + + + DE
Sbjct: 305 VTQIIVGGNAADLE-FHAIFDSGTSFTHLNDPAYKQITNSFNSAIKLQRYSSSSSDEL-- 361
Query: 305 LCWKGRRPFKNVHDV---KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGN--VCLGI 359
PF+ +D+ K + L+ G L + + IS +G +CLG+
Sbjct: 362 -------PFEYCYDLSSNKTVELPINLTMKGGDNY----LVTDPIVTISGEGVNLLCLGV 410
Query: 360 LNGAEVGLQDLNVIGG 375
L V + N + G
Sbjct: 411 LKSNNVNIIGQNFMTG 426
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 96/352 (27%), Positives = 143/352 (40%), Gaps = 44/352 (12%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL----VP 102
TG Y V + +G P + L DTGSDLTW QC PCV+ C P++ PS +
Sbjct: 151 TGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQ-PCVKSCYAQQQPIFDPSASKTYSNIS 209
Query: 103 CEDPICASLH-APGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
C C+ L A G+ + C Y ++Y D ++G KD + +
Sbjct: 210 CTSTACSGLKSATGNSPGCSSSNCVYGIQYGDSSFTVGFFAKDTLTLTQND---VFDGFM 266
Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFG 219
GCG N + G++GLG+ SIV Q +QK + +CL S G G L FG
Sbjct: 267 FGCGQNNR--GLFGKTAGLIGLGRDPLSIVQQ-TAQKFGK-YFSYCLPTSRGSNGHLTFG 322
Query: 220 D-DLYDSSRVVWTSM------SSDYTKYYSPGVAELFFGGETTGL-----KNLPVVFDSG 267
+ + +S+ V + SS +Y V + GG+ + +N + DSG
Sbjct: 323 NGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQNAGTIIDSG 382
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
+ T L Y +L S K+ +S AP L C+ N + ++
Sbjct: 383 TVITRLPSTVYGSLKSTFKQFMS--KYPTAPALSLLDTCYD----LSNYTSIS--IPKIS 434
Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDF 379
+F +L P LI + VCL A G D + IG G+
Sbjct: 435 FNFNGNAN---VDLEPNGILITNGASQVCL-----AFAGNGDDDTIGIFGNI 478
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 95/346 (27%), Positives = 142/346 (41%), Gaps = 57/346 (16%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC-VRCVEAPHPLYRPSN----DLVPC 103
G Y +T+ IG P PY DTGSDL W QC APC +C E P PLY P++ ++PC
Sbjct: 110 GEYLMTLAIGTPPLPYAAVADTGSDLIWTQC-APCGTQCFEQPAPLYNPASSTTFSVLPC 168
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG-QRLNPRLAL 162
+ A C Y Y G ++ GV + F F + Q P +A
Sbjct: 169 NSSLSMCAGALAGAAPPPGCACMYNQTYGTGWTA-GVQGSETFTFGSSAADQARVPGVAF 227
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL 222
GC + + ++ G++GLG+G S+VSQL + + +CL+ F D
Sbjct: 228 GC--SNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRF-----SYCLTP-------FQDTN 273
Query: 223 YDSSRVVWTSMSSDYTKYYS-PGVAE-----------LFFGGETTGLKNLPV-------- 262
S+ ++ S + + T S P VA L G + G K LP+
Sbjct: 274 STSTLLLGPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLK 333
Query: 263 -------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
+ DSG++ T L YQ + + +K ++ + + L LC+ P
Sbjct: 334 PDGTGGLIIDSGTTITSLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLDLCFALPAPTSA 393
Query: 316 VHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILN 361
V ++ L F DG L P +IS G CL + N
Sbjct: 394 PPAV---LPSMTLHF-DGADMVL----PADSYMISGSGVWCLAMRN 431
>gi|357159746|ref|XP_003578546.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 530
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 71/246 (28%), Positives = 108/246 (43%), Gaps = 18/246 (7%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRPSNDLVPCEDPIC 108
+G P + + LDTGSDL W+ CD C++C P +Y P + P
Sbjct: 105 LGTPNVTFLVALDTGSDLFWVPCD--CIKCAPLASPDYGDLKFDMYSPRKSSTSRKVPCS 162
Query: 109 ASLHAPGHHNCEDPAQCDYELEY-ADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYN 167
+SL P C Y ++Y ++ SS GVLV+D +GQ + + G
Sbjct: 163 SSLCDPQADCSAASNSCPYSIQYLSENTSSKGVLVEDVLYLTTESGQSKITQAPITFGCG 222
Query: 168 QVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYD 224
QV S+ +G+LGLG S+ S L S+ + N C G G + FGD
Sbjct: 223 QVQSGSFLGSAAPNGLLGLGMDSKSVPSLLASKGIAANSFSMCFGEDGHGRINFGDT--G 280
Query: 225 SSRVVWTSMSS-DYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTS 283
SS + T ++ YY+ + GG++ K V DSG+S+T L+ Y +TS
Sbjct: 281 SSDQLETPLNIYKQNPYYNISITGAMVGGKSFDTK-FSAVVDSGTSFTALSDPMYTEITS 339
Query: 284 IMKKEL 289
++
Sbjct: 340 TFNAQV 345
>gi|297807039|ref|XP_002871403.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317240|gb|EFH47662.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 529
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 75/268 (27%), Positives = 113/268 (42%), Gaps = 45/268 (16%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY-----RPSNDLVP--------- 102
IG P+ + + LDTGSDL W+ C+ CV+C Y + N+ P
Sbjct: 106 IGTPSVSFLVALDTGSDLLWIPCN--CVQCAPLTSTYYSSLATKDLNEYNPSSSSSSKVF 163
Query: 103 -CEDPICASLHAPGHHNCEDPA-QCDYELEYADGG-SSLGVLVKDAFAFNYTNGQRL--- 156
C +C S +C+ P QC Y ++Y G SS G+LV+D Y RL
Sbjct: 164 LCSHKLCGS-----ASDCDSPKEQCTYTVKYLSGNTSSSGLLVEDILHLTYNTNNRLMNG 218
Query: 157 ----NPRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
R+ +GCG Q + G + DG++GLG + S+ S L L+RN C
Sbjct: 219 SSSVKARVVVGCGKKQSGDYLDGVA---PDGLMGLGPAEISVPSFLSKAGLMRNSFSLCF 275
Query: 209 SGGGGGFLFFGD---DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFD 265
G ++FGD + S+ + +S Y GV G + D
Sbjct: 276 DEEDSGRIYFGDMGPSIQQSAPFLQLENNSGYIV----GVEACCIGNSCLKQTSFTTFID 331
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKS 293
SG S+TYL Y+ + + + ++A S
Sbjct: 332 SGQSFTYLPEEIYRKVALEIDRHINATS 359
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 77/287 (26%), Positives = 123/287 (42%), Gaps = 23/287 (8%)
Query: 22 SSSSSSSLFNHVGSSLLFQVHGNVYP-TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD 80
S S S++L N ++ + + P +G Y +++ IG P Y DTGSDL W QC
Sbjct: 62 SLSRSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQC- 120
Query: 81 APCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS 136
PC++C + P++ P S VPC C ++ +C CDY Y D
Sbjct: 121 LPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNCKAID---DSHCGAQGVCDYSYTYGD--- 174
Query: 137 SLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS 196
K F + + +GCG+ + G++GLG G+ S+VSQ+
Sbjct: 175 --QTYTKGDLGFEKITIGSSSVKSVIGCGHESG--GGFGFASGVIGLGGGQLSLVSQMSQ 230
Query: 197 QKLIRNVVGHC----LSGGGGGFLFFGDDLYDSSRVVWTSM-SSDYTKYYSPGVAELFFG 251
I +C LS G F + + VV T + S + YY + + G
Sbjct: 231 TSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIG 290
Query: 252 GE--TTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
E K V+ DSG++ ++L + Y + S + K + AK +K+
Sbjct: 291 NERHMASAKQGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKD 337
>gi|195647908|gb|ACG43422.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|414587776|tpg|DAA38347.1| TPA: aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 498
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 84/334 (25%), Positives = 136/334 (40%), Gaps = 32/334 (9%)
Query: 57 IGQPARPYFLDLDTGSDLTWL--QCD--APCVRCVEAPHPLYRP----SNDLVPCEDPIC 108
+G P + + + LDTGSDL WL QCD P Y P ++ VPC C
Sbjct: 115 VGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAASGSATFYIPGMSSTSKAVPCNSNFC 174
Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTNG--QRLNPRLALGCG 165
C QC Y++ Y G SS G LV+D + N Q L ++ LGCG
Sbjct: 175 DL-----QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQILKAQIMLGCG 229
Query: 166 YNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYD 224
Q +G+ GLG + S+ S L + L N C G G + FGD
Sbjct: 230 QTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRISFGDQESS 289
Query: 225 SSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSI 284
++ + Y+ ++ + G + T + + +FD+G+S+TYL Y +T
Sbjct: 290 DQEETPLDINRQHPT-YAITISGITVGNKPTDM-DFITIFDTGTSFTYLADPAYTYITQS 347
Query: 285 MKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFE-LTP 343
++ A + A + R PF+ +D+ + + T ++F + P
Sbjct: 348 FHAQVQAN--RHAADS---------RIPFEYCYDLSEARFPIPDIILRTVTGSMFPVIDP 396
Query: 344 EAYLIISNKGNV-CLGILNGAEVGLQDLNVIGGI 376
+ I V CL I+ ++ + N + G+
Sbjct: 397 GQVISIQEHEYVYCLAIVKSMKLNIIGQNFMTGL 430
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 94/352 (26%), Positives = 151/352 (42%), Gaps = 52/352 (14%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y + +G P + ++ LDTGSD+ WLQC PC +C ++ PS +PC
Sbjct: 127 SGEYFTRLGVGTPPKYLYMVLDTGSDVVWLQCK-PCTKCYSQTDQIFDPSKSKSFAGIPC 185
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
P+C L +PG + C Y++ Y DG + G + F + PR+A+G
Sbjct: 186 YSPLCRRLDSPGCSLKNN--LCQYQVSYGDGSFTFGDFSTETLTFR----RAAVPRVAIG 239
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS----GGGGGFLFFG 219
CG++ + G+LGLG+G S +Q ++ N +CL+ + FG
Sbjct: 240 CGHDN--EGLFVGAAGLLGLGRGGLSFPTQTGTR--FNNKFSYCLTDRTASAKPSSIVFG 295
Query: 220 DD-LYDSSRVVWTSMSSDYTKYY-----------SP--GVAELFFGGETTGLKNLPVVFD 265
D + ++R + +Y +P G++ FF ++TG N V+ D
Sbjct: 296 DSAVSRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTG--NGGVIID 353
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
SG+S T L R Y +L + + A LK APE C+ + +VK T
Sbjct: 354 SGTSVTRLTRPAYVSLRDAFR--VGASHLKRAPEFSLFDTCYD----LSGLSEVK--VPT 405
Query: 326 LALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
+ L F L YL+ + N G+ C + L++IG I
Sbjct: 406 VVLHFRGADV----SLPAANYLVPVDNSGSFCFAFAG----TMSGLSIIGNI 449
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 97/350 (27%), Positives = 154/350 (44%), Gaps = 44/350 (12%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G Y +T+YIG P DTGSDL W+QC +PC C PL+ P + C+
Sbjct: 90 GEYLMTLYIGTPPVERLAIADTGSDLIWVQC-SPCQNCFPQDTPLFEPLKSSTFKAATCD 148
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT-NGQRLN-PRLAL 162
C S+ P C QC Y Y D ++GV+ + +F T + Q ++ P
Sbjct: 149 SQPCTSV-PPSQRQCGKVGQCIYSYSYGDKSFTVGVVGTETLSFGSTGDAQTVSFPSSIF 207
Query: 163 GCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFF 218
GCG YN + + G++GLG G S+VSQL Q I +CL S L F
Sbjct: 208 GCGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQ--IGYKFSYCLLPFSSNSTSKLKF 265
Query: 219 GDD-LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-------VVFDSGSSY 270
G + + ++ VV T + K P L T G K +P ++ DSG+
Sbjct: 266 GSEAIVTTNGVVSTPL---IIKPLFPSFYFLNLEAVTIGQKVVPTGRTDGNIIIDSGTVL 322
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
TYL + Y + +++ LS +S ++ LP +K P++++ +A F
Sbjct: 323 TYLEQTFYNNFVASLQEVLSVESAQD------LPFPFKFCFPYRDM-----TIPVIAFQF 371
Query: 331 TDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGIGDF 379
T L P+ LI + ++ +CL ++ + L +++ G + F
Sbjct: 372 TGASV----ALQPKNLLIKLQDRNMLCLAVVPSS---LSGISIFGNVAQF 414
>gi|242072510|ref|XP_002446191.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
gi|241937374|gb|EES10519.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
Length = 499
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 70/247 (28%), Positives = 105/247 (42%), Gaps = 19/247 (7%)
Query: 57 IGQPARPYFLDLDTGSDLTWL--QCD--APCVRCVEAPHPLYRP----SNDLVPCEDPIC 108
+G P + + + LDTGSDL WL QCD P Y P ++ VPC C
Sbjct: 114 VGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAASGSATFYIPGMSSTSKAVPCNSNFC 173
Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTNG--QRLNPRLALGCG 165
C QC Y++ Y G SS G LV+D + N Q L ++ LGCG
Sbjct: 174 DL-----QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQILKAQIMLGCG 228
Query: 166 YNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYD 224
Q +G+ GLG + S+ S L + L N C G G + FGD
Sbjct: 229 QTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRISFGDQGSS 288
Query: 225 SSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSI 284
+++ + Y+ ++ + G + T L + +FD+G+S+TYL Y +T
Sbjct: 289 DQEETPLNINQQHPT-YAITISGITIGNKPTDL-DFITIFDTGTSFTYLADPAYTYITQS 346
Query: 285 MKKELSA 291
++ A
Sbjct: 347 FHAQVQA 353
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 93/328 (28%), Positives = 130/328 (39%), Gaps = 42/328 (12%)
Query: 58 GQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLV---------PCEDPIC 108
G PA + +DTGSDLTW+QC PC C PL+ P+ C D +
Sbjct: 155 GSPAANLTVIVDTGSDLTWVQCK-PCSACYAQRDPLFDPAGSATYAAVRCNASACADSLR 213
Query: 109 ASLHAPGH--HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
A+ PG +C Y L Y DG S GVL D A G L GCG
Sbjct: 214 AATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVAL---GGASLGG-FVFGCGL 269
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFF--GD 220
+ + G++GLG+ + S+VSQ S+ V +CL SG G L GD
Sbjct: 270 SNR--GLFGGTAGLMGLGRTELSLVSQTASR--YGGVFSYCLPAATSGDASGSLSLGGGD 325
Query: 221 DLYDSSR----VVWTSMSSDYTK--YYSPGVAELFFGG---ETTGLKNLPVVFDSGSSYT 271
D S R V +T M +D + +Y V GG GL V+ DSG+ T
Sbjct: 326 DAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNVLIDSGTVIT 385
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
L Y+ + + ++ A AP L C+ +VK TL L
Sbjct: 386 RLAPSVYRAVRAEFMRQFGAAGYPAAPGFSILDTCYD----LTGHDEVKVPLLTLRL--- 438
Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGI 359
+G + +++ + VCL +
Sbjct: 439 EGGADVTVDAAGMLFVVRKDGSQVCLAM 466
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 85/314 (27%), Positives = 130/314 (41%), Gaps = 33/314 (10%)
Query: 15 VRMSSSSSSSSSSSLFNHVGSSLLFQ---VHGNVYPTGYYNVTMYIGQPARPYFLDLDTG 71
+R+ SS ++S + G + F + G +G Y + +G P + ++ LDTG
Sbjct: 90 IRVKKLSSLGATSRNLSKPGGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTG 149
Query: 72 SDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDY 127
SD+ WLQC APC C P++ P S V C P+C L +PG C C Y
Sbjct: 150 SDIVWLQC-APCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLESPG---CNQRQTCLY 205
Query: 128 ELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY-NQVPGASYHPLDGILGLGKG 186
++ Y DG + G V + F T + ++ALGCG+ N+ L G+ G
Sbjct: 206 QVSYGDGSYTTGEFVTETLTFRRTKVE----QVALGCGHDNEGLFVGAAGLLGLGRGGLS 261
Query: 187 KSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYY----- 241
S + +QK +V S +F + ++R + +Y
Sbjct: 262 FPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYVELL 321
Query: 242 ------SP--GVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKS 293
+P G+ F + TG N V+ D G+S T LN+ Y L + A S
Sbjct: 322 GISVGGTPVSGITASHFKLDRTG--NGGVIIDCGTSVTRLNKPAYIALRDAFRA--GASS 377
Query: 294 LKEAPEDETLPLCW 307
LK APE C+
Sbjct: 378 LKSAPEFSLFDTCY 391
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 95/350 (27%), Positives = 137/350 (39%), Gaps = 55/350 (15%)
Query: 46 YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR--CVEAPHPLYRPSND---- 99
+ T Y +G P + +DTGS L W QC A C+R CV P + S+
Sbjct: 81 WATRQYIAEYMVGDPPQRAEALIDTGSSLIWTQCTA-CLRKVCVRQDLPYFNASSSGSFA 139
Query: 100 LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
VPC+D CA + H C C + + Y GG +G L DAF F Q
Sbjct: 140 PVPCQDKACAGNYL---HFCALDGTCTFRVTYGAGGI-IGFLGTDAFTF-----QSGGAT 190
Query: 160 LALGC-GYNQVPGASY-HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLF 217
LA GC + + H G++GLG+G+ S+ SQ +++ + + + G LF
Sbjct: 191 LAFGCVSFTRFAAPDVLHGASGLIGLGRGRLSLASQTGAKRFSYCLTPYFHNNGASSHLF 250
Query: 218 FGDDLYDSS------RVVWTSMSSDY---TKYYSPGVAELFFGGETT------------- 255
G S + + DY T YY P V GET
Sbjct: 251 VGAAASLSGGGGAVMSMAFVESPKDYPYSTFYYLPLVGITV--GETKLAIPSTAFDLQEV 308
Query: 256 --GLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDE-TLPLCWKGRRP 312
G V+ DSGS +T L Y+ L + ++L+ + ED+ + LC
Sbjct: 309 EEGFWEGGVIIDSGSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDGGMALCVA---- 364
Query: 313 FKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNG 362
D+ + TL L F+ G L PE Y K C+ I+ G
Sbjct: 365 ---RGDLDRVVPTLVLHFSGGAD---MALPPENYWAPLEKSTACMAIVRG 408
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 93/350 (26%), Positives = 151/350 (43%), Gaps = 57/350 (16%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPIC 108
V +G+P P + +DTGSDL W+QC PC C P++ PS + + PIC
Sbjct: 93 VNFSVGRPPVPQLVGIDTGSDLLWVQC-RPCADCFRQSTPIFDPSKSSTYVDLSYDSPIC 151
Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN-GQRLNPRLALGCGYN 167
+ +++ QC Y YADG +S G L + F ++ G + GCG++
Sbjct: 152 PNSPQKKYNHLN---QCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHS 208
Query: 168 QVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSR 227
G GILGL G SIVS+L S+ +C+ G LF D Y ++
Sbjct: 209 N-RGRFDGQQSGILGLSAGDQSIVSRLGSR------FSYCI-----GDLF--DPHYTHNQ 254
Query: 228 VVW---TSMSSDYTKYYS-PGVAELFFGGETTGLKNLP---------------VVFDSGS 268
+V M T +++ G + G + G L VV DSG+
Sbjct: 255 LVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGT 314
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP--LCWKGRRPFKNVHDVKKCFRTL 326
+ T+L + + L++ +++ + + T+P LC+KGR V++ + F L
Sbjct: 315 TATFLAKDGFDPLSNEIQRLVRGHFQQVIY--RTIPGWLCYKGR-----VNEDLRGFPEL 367
Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
A F +G L + + N+ CL +L E L+++ + GI
Sbjct: 368 AFHFAEGAD---LVLDANSLFVQKNQDVFCLAVL---ESNLKNIGSVIGI 411
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 93/350 (26%), Positives = 151/350 (43%), Gaps = 57/350 (16%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPIC 108
V +G+P P + +DTGSDL W+QC PC C P++ PS + + PIC
Sbjct: 61 VNFSVGRPPVPQLVGIDTGSDLLWVQC-RPCADCFRQSTPIFDPSKSSTYVDLSYDSPIC 119
Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN-GQRLNPRLALGCGYN 167
+ +++ QC Y YADG +S G L + F ++ G + GCG++
Sbjct: 120 PNSPQKKYNHLN---QCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHS 176
Query: 168 QVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSR 227
G GILGL G SIVS+L S+ +C+ G LF D Y ++
Sbjct: 177 N-RGRFDGQQSGILGLSAGDQSIVSRLGSR------FSYCI-----GDLF--DPHYTHNQ 222
Query: 228 VVW---TSMSSDYTKYYS-PGVAELFFGGETTGLKNLP---------------VVFDSGS 268
+V M T +++ G + G + G L VV DSG+
Sbjct: 223 LVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGT 282
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP--LCWKGRRPFKNVHDVKKCFRTL 326
+ T+L + + L++ +++ + + T+P LC+KGR V++ + F L
Sbjct: 283 TATFLAKDGFDPLSNEIQRLVRGHFQQVIY--RTIPGWLCYKGR-----VNEDLRGFPEL 335
Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
A F +G L + + N+ CL +L E L+++ + GI
Sbjct: 336 AFHFAEGAD---LVLDANSLFVQKNQDVFCLAVL---ESNLKNIGSVIGI 379
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 97/348 (27%), Positives = 148/348 (42%), Gaps = 53/348 (15%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y + +++G P + + L LDTGSDL W+QC PC C E P Y P + + C
Sbjct: 192 SGEYFMDVFVGTPPKHFSLILDTGSDLNWIQC-VPCYACFEQNGPYYDPKDSSSFKNITC 250
Query: 104 EDPICASLHAPG-HHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQ-----RL 156
DP C + +P C+ Q C Y Y D ++ G + F N T + ++
Sbjct: 251 HDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGKPELKI 310
Query: 157 NPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SG 210
+ GCG +N+ +H G+LGLG+G S +QL Q L + +CL +
Sbjct: 311 VENVMFGCGHWNR---GLFHGAAGLLGLGRGPLSFATQL--QSLYGHSFSYCLVDRNSNS 365
Query: 211 GGGGFLFFGDD--LYDSSRVVWTSM----SSDYTKYYSPGVAELFFGGETTGLKNLP--- 261
L FG+D L + +TS + +Y + + GGE +
Sbjct: 366 SVSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKIPEETWHL 425
Query: 262 -------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFK 314
+ DSG++ TY Y+ + KE + +K P ET P +P
Sbjct: 426 SAQGGGGTIIDSGTTLTYFAEPAYEII-----KEAFMRKIKGFPLVETFPPL----KPCY 476
Query: 315 NVHDVKKC-FRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGIL 360
NV V+K A+ F DG +++ E Y I I + VCL IL
Sbjct: 477 NVSGVEKMELPEFAILFADG---AMWDFPVENYFIQIEPEDVVCLAIL 521
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 93/350 (26%), Positives = 151/350 (43%), Gaps = 57/350 (16%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPIC 108
V +G+P P + +DTGSDL W+QC PC C P++ PS + + PIC
Sbjct: 61 VNFSVGRPPVPQLVGIDTGSDLLWVQC-RPCADCFRQSTPIFDPSKSSTYVDLSYDSPIC 119
Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN-GQRLNPRLALGCGYN 167
+ +++ QC Y YADG +S G L + F ++ G + GCG++
Sbjct: 120 PNSPQKKYNHLN---QCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHS 176
Query: 168 QVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSR 227
G GILGL G SIVS+L S+ +C+ G LF D Y ++
Sbjct: 177 N-RGRFDGQQSGILGLSAGDQSIVSRLGSR------FSYCI-----GDLF--DPHYTHNQ 222
Query: 228 VVW---TSMSSDYTKYYS-PGVAELFFGGETTGLKNLP---------------VVFDSGS 268
+V M T +++ G + G + G L VV DSG+
Sbjct: 223 LVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGT 282
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP--LCWKGRRPFKNVHDVKKCFRTL 326
+ T+L + + L++ +++ + + T+P LC+KGR V++ + F L
Sbjct: 283 TATFLAKDGFDPLSNEIQRLVRGHFQQVIY--RTIPGWLCYKGR-----VNEDLRGFPEL 335
Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
A F +G L + + N+ CL +L E L+++ + GI
Sbjct: 336 AFHFAEGAD---LVLDANSLFVQKNQDVFCLAVL---ESNLKNIGSVIGI 379
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 97/357 (27%), Positives = 139/357 (38%), Gaps = 62/357 (17%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
+G Y + +G PAR ++ LDTGSD+ WLQC APC RC P++ P +PC
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQC-APCRRCYSQSDPIFDPRKSKTYATIPC 197
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF--NYTNGQRLNPRLA 161
P C L + G + C Y++ Y DG ++G + F N G +A
Sbjct: 198 SSPHCRRLDSAGCNTRRK--TCLYQVSYGDGSFTVGDFSTETLTFRRNRVKG------VA 249
Query: 162 LGCGYNQ-------------------VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRN 202
LGCG++ PG + H + K +V + S K
Sbjct: 250 LGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFN-----QKFSYCLVDRSASSKPSSV 304
Query: 203 VVGHCLSGGGGGF--LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNL 260
V G+ F L L V +S T+ PGV F + G N
Sbjct: 305 VFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRV--PGVTASLFKLDQIG--NG 360
Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
V+ DSG+S T L R Y + + + AK+LK AP C+ N+++VK
Sbjct: 361 GVIIDSGTSVTRLIRPAYIAMRDAFR--VGAKTLKRAPNFSLFDTCFD----LSNMNEVK 414
Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
T+ L F R L YLI + G C + L++IG I
Sbjct: 415 --VPTVVLHF----RRADVSLPATNYLIPVDTNGKFCFAFAG----TMGGLSIIGNI 461
>gi|296082464|emb|CBI21469.3| unnamed protein product [Vitis vinifera]
Length = 530
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 70/257 (27%), Positives = 110/257 (42%), Gaps = 30/257 (11%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY----RPSNDLVP---------- 102
IG P + + LD GSDL W+ CD C++C Y R N P
Sbjct: 106 IGTPNISFLVALDAGSDLLWIPCD--CIQCAPLSASYYGSLDRDLNQYSPSGSSTSKHLS 163
Query: 103 CEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAF----AFNYTNGQRL 156
C +C S NC+ P Q C Y + Y ++ SS G+L++D + + +
Sbjct: 164 CSHQLCES-----SPNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDDASNSSV 218
Query: 157 NPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
+ +GCG Q G P DG++GLG G+ S+ S L L++N C + G
Sbjct: 219 RAPVIIGCGMRQTGGYLDGVAP-DGLMGLGLGEISVPSFLSKAGLVKNSFSLCFNDDDSG 277
Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLN 274
+FFGD + + S + Y GV G + + DSG+S+T+L
Sbjct: 278 RIFFGDQGLATQQTTLFLPSDGKYETYIVGVEACCIGSSCIKQTSFRALVDSGASFTFLP 337
Query: 275 RVTYQTLTSIMKKELSA 291
+Y+ + K+++A
Sbjct: 338 DESYRNVVDEFDKQVNA 354
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 88/333 (26%), Positives = 133/333 (39%), Gaps = 38/333 (11%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR--CVEAPHPLYRPSN----DLV 101
TG Y V++ +G PAR + DTGSDL+W+QC PC C + PL+ PS+ V
Sbjct: 151 TGNYVVSVGLGTPARDLTVVFDTGSDLSWVQC-GPCSSGGCYKQQDPLFAPSDSSTFSAV 209
Query: 102 PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF--------NYTNG 153
C C + + G +D +C YE+ Y D + G L D + N
Sbjct: 210 RCGARECRARQSCGGSPGDD--RCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAEND 267
Query: 154 QRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SG 210
+L P GCG N + DG+ GLG+GK S+ SQ + +CL S
Sbjct: 268 NKL-PGFVFGCGENNT--GLFGQADGLFGLGRGKVSLSSQAAGK--FGEGFSYCLPSSSS 322
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYT--KYYSPGVAELFFGGETTGLKN----LPVVF 264
G+L G + + +T M + T +Y + + G + + LP++
Sbjct: 323 SAPGYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALPLIV 382
Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
DSG+ T L Y+ L + + K AP L C+ F +
Sbjct: 383 DSGTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSILDTCYD----FTAHANATVSIP 438
Query: 325 TLALSFTDGKTRTLFELTPEAYLIISNKGNVCL 357
+AL F G T + L ++ CL
Sbjct: 439 AVALVFAGGAT---ISVDFSGVLYVAKVAQACL 468
>gi|225438629|ref|XP_002281243.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 511
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 70/257 (27%), Positives = 110/257 (42%), Gaps = 30/257 (11%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY----RPSNDLVP---------- 102
IG P + + LD GSDL W+ CD C++C Y R N P
Sbjct: 87 IGTPNISFLVALDAGSDLLWIPCD--CIQCAPLSASYYGSLDRDLNQYSPSGSSTSKHLS 144
Query: 103 CEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAF----AFNYTNGQRL 156
C +C S NC+ P Q C Y + Y ++ SS G+L++D + + +
Sbjct: 145 CSHQLCES-----SPNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDDASNSSV 199
Query: 157 NPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
+ +GCG Q G P DG++GLG G+ S+ S L L++N C + G
Sbjct: 200 RAPVIIGCGMRQTGGYLDGVAP-DGLMGLGLGEISVPSFLSKAGLVKNSFSLCFNDDDSG 258
Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLN 274
+FFGD + + S + Y GV G + + DSG+S+T+L
Sbjct: 259 RIFFGDQGLATQQTTLFLPSDGKYETYIVGVEACCIGSSCIKQTSFRALVDSGASFTFLP 318
Query: 275 RVTYQTLTSIMKKELSA 291
+Y+ + K+++A
Sbjct: 319 DESYRNVVDEFDKQVNA 335
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 86/294 (29%), Positives = 126/294 (42%), Gaps = 39/294 (13%)
Query: 15 VRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDL 74
+R ++ ++ +SL + G G + +G Y + +G P+ L +DTGSDL
Sbjct: 50 LRQRLAADAARYASLVDATGRLHSPVFSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDL 109
Query: 75 TWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCEDPAQ----CD 126
WLQC +PC RC ++ P VPC P C +L PG C+ C
Sbjct: 110 VWLQC-SPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQCRALRFPG---CDSGGAAGGGCR 165
Query: 127 YELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKG 186
Y + Y DG SS G L D AF N +N + LGCG + + G+LG+G+G
Sbjct: 166 YMVAYGDGSSSTGDLATDKLAF--ANDTYVN-NVTLGCGRDNE--GLFDSAAGLLGVGRG 220
Query: 187 KSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK-- 239
K SI +Q+ +V +CL +L FG S +T++ S+ +
Sbjct: 221 KISISTQV--APAYGSVFEYCLGDRTSRSTRSSYLVFGRTPEPPS-TAFTALLSNPRRPS 277
Query: 240 YYSPGVAELFFGGE-TTGLKNLP-----------VVFDSGSSYTYLNRVTYQTL 281
Y +A GGE TG N VV DSG++ + R Y L
Sbjct: 278 LYYVDMAGFSVGGERVTGFSNASLALDTATGRGGVVVDSGTAISRFARDAYAAL 331
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 74/278 (26%), Positives = 112/278 (40%), Gaps = 25/278 (8%)
Query: 42 HGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL 100
+G TG Y V + +G PA + + DTGSD TW+QC PCV C PL+ P+
Sbjct: 87 YGVALGTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQ-PCVAYCYRQKEPLFDPTKSA 145
Query: 101 ----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
+ C C+ L+ G C C Y ++Y DG ++G +D Y +
Sbjct: 146 TYANISCSSSYCSDLYVSG---CSG-GHCLYGIQYGDGSYTIGFYAQDTLTLAYDTIKNF 201
Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGG 214
GCG + G+LGLG+GK+S+ Q + + V +CL + G G
Sbjct: 202 R----FGCGEKNR--GLFGRAAGLLGLGRGKTSLPVQAYDK--YGGVFAYCLPATSAGTG 253
Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGET-----TGLKNLPVVFDSGSS 269
FL G ++ + + +Y G+ + GG + + DSG+
Sbjct: 254 FLDLGPGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTV 313
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
T L Y L S K + AP L C+
Sbjct: 314 ITRLPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCY 351
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 82/284 (28%), Positives = 118/284 (41%), Gaps = 33/284 (11%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G Y T+ +G P R + + +DTGSDLTW+QC +PC +C L+ P+ + C
Sbjct: 11 GEYLATVRLGTPERVFSVIVDTGSDLTWVQC-SPCGKCYSQNDALFLPNTSTSFTKLACG 69
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
+C L P + C Y Y DG + G V D + NGQ+ P A G
Sbjct: 70 SALCNGLPFPMCNQ----TTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQVPNFAFG 125
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGHCLSGGGGGFLFFGD 220
CG++ S+ DGILGLG+G S SQL S K +V L FGD
Sbjct: 126 CGHDN--EGSFAGADGILGLGQGPLSFHSQLKSVYNGKFSYCLVDWLAPPTQTSPLLFGD 183
Query: 221 D----LYDSSRVVWTSMSSDYTKYYSP-----------GVAELFFGGETTGLKNLPVVFD 265
L D + + T YY ++ F ++ G +FD
Sbjct: 184 AAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAG--TIFD 241
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
SG++ T L Y+ + + M A S ++ + L LC G
Sbjct: 242 SGTTVTQLAEAAYKEVLAAMNASTMAYS-RKIDDISRLDLCLSG 284
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 73/278 (26%), Positives = 111/278 (39%), Gaps = 25/278 (8%)
Query: 42 HGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL 100
+G TG Y V + +G PA + + DTGSD TW+QC PCV C PL+ P+
Sbjct: 152 YGVALGTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQ-PCVAYCYRQKEPLFDPTKSA 210
Query: 101 ----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
+ C C+ L+ G C Y ++Y DG ++G +D Y +
Sbjct: 211 TYANISCSSSYCSDLYVSGCSG----GHCLYGIQYGDGSYTIGFYAQDTLTLAYDTIKNF 266
Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGG 214
GCG + G+LGLG+GK+S+ Q + + V +CL + G G
Sbjct: 267 R----FGCGEKNR--GLFGRAAGLLGLGRGKTSLPVQAYDK--YGGVFAYCLPATSAGTG 318
Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGET-----TGLKNLPVVFDSGSS 269
FL G ++ + + +Y G+ + GG + + DSG+
Sbjct: 319 FLDLGPGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTV 378
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
T L Y L S K + AP L C+
Sbjct: 379 ITRLPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCY 416
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 56/181 (30%), Positives = 87/181 (48%), Gaps = 20/181 (11%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y V + +G P R ++ +D+GSD+ W+QC+ PC +C P++ P++ V C
Sbjct: 131 SGEYFVRIGVGSPPRNQYVVIDSGSDIIWVQCE-PCTQCYHQSDPVFNPADSSSYAGVSC 189
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+C+ + G H +C YE+ Y DG + G L + F G+ L +A+G
Sbjct: 190 ASTVCSHVDNAGCHE----GRCRYEVSYGDGSYTKGTLALETLTF----GRTLIRNVAIG 241
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG---GGFLFFGD 220
CG++ + G+LGLG G S V QL Q +CL G G L FG
Sbjct: 242 CGHHN--QGMFVGAAGLLGLGSGPMSFVGQLGGQA--GGTFSYCLVSRGIQSSGLLQFGR 297
Query: 221 D 221
+
Sbjct: 298 E 298
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 56/167 (33%), Positives = 79/167 (47%), Gaps = 16/167 (9%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
G + + + IG P R + +DTGSDL W QC PC +C + P++ P + C
Sbjct: 364 GEFLMKLAIGSPPRSFSAIMDTGSDLIWTQC-KPCQQCFDQSTPIFDPKQSSSFYKISCS 422
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF-NYTNGQRLNPRLALG 163
+C +L C C+Y Y D S+ GVL + F F + T Q P L G
Sbjct: 423 SELCGALPTS---TCSSDG-CEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFG 478
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
CG N G + G++GLG+G S+VSQL QK +CL+
Sbjct: 479 CG-NDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQKF-----AYCLTA 519
>gi|255576176|ref|XP_002528982.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531572|gb|EEF33401.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 542
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 76/255 (29%), Positives = 113/255 (44%), Gaps = 26/255 (10%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY----RPSNDLVPCEDPICASLH 112
IG P + + LD GSDL W+ CD C++C Y R N+ P S H
Sbjct: 119 IGTPHVSFLVALDAGSDLLWVPCD--CLQCAPLSASYYSSLDRDLNEYSPSHS--STSKH 174
Query: 113 APGHH-------NCEDPAQ-CDYELEY-ADGGSSLGVLVKDAF--AFNYTNGQRLNPR-- 159
H NC P Q C Y ++Y + SS G+LV+D A N N + R
Sbjct: 175 LSCSHQLCELGPNCNSPKQPCPYSMDYYTENTSSSGLLVEDILHLASNGDNALSYSVRAP 234
Query: 160 LALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLF 217
+ +GCG Q G P DG++GLG + S+ S L LIRN C G +F
Sbjct: 235 VVIGCGMKQSGGYLDGVAP-DGLMGLGLAEISVPSFLAKAGLIRNSFSMCFDEDDSGRIF 293
Query: 218 FGDDLYDSSRVV-WTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRV 276
FGD + + + ++ +YT Y GV G + + D+G+S+T+L
Sbjct: 294 FGDQGPTTQQSTPFLTLDGNYTTYVV-GVEGFCVGSSCLKQTSFRALVDTGTSFTFLPNG 352
Query: 277 TYQTLTSIMKKELSA 291
Y+ +T ++++A
Sbjct: 353 VYERITEEFDRQVNA 367
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 71/133 (53%), Gaps = 15/133 (11%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN---- 98
G +Y +G Y V + +G PAR F+ +DTGSDL WLQC PC C + P++ P N
Sbjct: 121 GLLYGSGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQ-PCKSCYKQADPIFDPRNSSSF 179
Query: 99 DLVPCEDPICASLHAPGHHNCE----DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
+PC P+C +L H+C ++C Y++ Y DG S+G D F T +
Sbjct: 180 QRIPCLSPLCKALEI---HSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLG-TGSK 235
Query: 155 RLNPRLALGCGYN 167
++ +A GCG++
Sbjct: 236 AMS--VAFGCGFD 246
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 96/357 (26%), Positives = 139/357 (38%), Gaps = 62/357 (17%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
+G Y + +G PAR ++ LDTGSD+ WLQC APC RC P++ P +PC
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQC-APCRRCYSQSDPIFDPRKSKTYATIPC 197
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF--NYTNGQRLNPRLA 161
P C L + G + C Y++ Y DG ++G + F N G +A
Sbjct: 198 SSPHCRRLDSAGCNTRRK--TCLYQVSYGDGSFTVGDFSTETLTFRRNRVKG------VA 249
Query: 162 LGCGYNQ-------------------VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRN 202
LGCG++ PG + H + K +V + S K
Sbjct: 250 LGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFN-----QKFSYCLVDRSASSKPSSV 304
Query: 203 VVGHCLSGGGGGF--LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNL 260
V G+ F L L V +S T+ PGV F + G N
Sbjct: 305 VFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRV--PGVTASLFKLDQIG--NG 360
Query: 261 PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
V+ DSG+S T L R Y + + + AK+LK AP+ C+ N+++VK
Sbjct: 361 GVIIDSGTSVTRLIRPAYIAMRDAFR--VGAKTLKRAPDFSLFDTCFD----LSNMNEVK 414
Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
T+ L F L YLI + G C + L++IG I
Sbjct: 415 --VPTVVLHFRGADV----SLPATNYLIPVDTNGKFCFAFAG----TMGGLSIIGNI 461
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 77/259 (29%), Positives = 106/259 (40%), Gaps = 22/259 (8%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLV- 101
G T Y +T+ IG PA + +DTGSD++W+QC PC +C L+ PS
Sbjct: 123 GTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCK-PCSQCHSEVDSLFDPSASSTY 181
Query: 102 ---PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
C C L N +QC Y + Y DG S+ G D T G
Sbjct: 182 SPFSCSSAACVQLSQSQQGNGCSSSQCQYIVSYVDGSSTTGTYSSDTL----TLGSNAIK 237
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFL 216
GC ++ G S DG++GLG S+VSQ + +CL + G GFL
Sbjct: 238 GFQFGCSQSESGGFSDQ-TDGLMGLGGDAQSLVSQ--TAGTFGKAFSYCLPPTPGSSGFL 294
Query: 217 FFGDDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGET----TGLKNLPVVFDSGSSY 270
G S V T M S+ YY + + GG+ T + + V DSG+
Sbjct: 295 TLG--AASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVFSAGSVMDSGTVI 352
Query: 271 TYLNRVTYQTLTSIMKKEL 289
T L Y L+S K +
Sbjct: 353 TRLPPTAYSALSSAFKAGM 371
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 102/357 (28%), Positives = 136/357 (38%), Gaps = 47/357 (13%)
Query: 48 TGYYNVTMYIGQPARP--YFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LV 101
+G Y + IG P RP L +DTGSDL W QC PC C + P PL+ PS V
Sbjct: 84 SGEYLIHFNIGTP-RPQRVALTMDTGSDLVWTQC-TPCPVCFDQPFPLFDPSVSSTFRAV 141
Query: 102 PCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR- 159
C DPIC C +C Y Y D + G + KD F F NG+ P
Sbjct: 142 ACPDPICRPSSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFTFMSPNGEGAPPVA 201
Query: 160 ---LALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
LA GCG YN AS GI G G+G S+ SQL + + H +
Sbjct: 202 VSGLAFGCGDYNTGVFASNE--SGIAGFGRGPLSLPSQLRVGRFSYCLTSHDETESNKTS 259
Query: 216 LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFF---GGETTGLKNLPV---------- 262
F + R + +SP ++ G T G LPV
Sbjct: 260 AVFLGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKTRLPVDSSVFALKKD 319
Query: 263 -----VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
V DSG+ T ++ L + +L E L LC++ + K V
Sbjct: 320 GSGGTVIDSGTGVTTFPAAVFEQLKNEFVAQLPLPRYDNTSEVGNL-LCFQRPKGGKQVP 378
Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
K F L+ D +L E Y+ V ++NGAEV D+ +IG
Sbjct: 379 VPKLIFH---LASAD------MDLPRENYIPEDTDSGVMCLMINGAEV---DMVLIG 423
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 71/133 (53%), Gaps = 15/133 (11%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN---- 98
G +Y +G Y V + +G PAR F+ +DTGSDL WLQC PC C + P++ P N
Sbjct: 46 GLLYGSGEYFVRLGLGTPARSLFMVVDTGSDLPWLQCQ-PCKSCYKQADPIFDPRNSSSF 104
Query: 99 DLVPCEDPICASLHAPGHHNCE----DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
+PC P+C +L H+C ++C Y++ Y DG S+G D F T +
Sbjct: 105 QRIPCLSPLCKALEV---HSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLG-TGSK 160
Query: 155 RLNPRLALGCGYN 167
++ +A GCG++
Sbjct: 161 AMS--VAFGCGFD 171
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 95/328 (28%), Positives = 141/328 (42%), Gaps = 41/328 (12%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSN----DLVPCED 105
+ VT+ G PA+ Y L +DTGSD++W+QC PC C + P++ P+ VPC
Sbjct: 161 FVVTVGFGSPAQNYTLSIDTGSDVSWIQC-LPCSGHCYKQHDPVFDPTKSATYSAVPCGH 219
Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
P CA+ C + C Y++ Y DG S+ GVL + + + T R P A GCG
Sbjct: 220 PQCAAAGG----KCSNSGTCLYKVTYGDGSSTAGVLSHETLSLSST---RDLPGFAFGCG 272
Query: 166 YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDL- 222
+ + +DG++GLG+G S+ SQ + +CL G+L G
Sbjct: 273 QTNL--GEFGGVDGLVGLGRGALSLPSQ--AAATFGATFSYCLPSYDTTHGYLTMGSTTP 328
Query: 223 ---YDSSRVVWTSM--SSDYTKYYSPGVAELFFGG-----ETTGLKNLPVVFDSGSSYTY 272
D V +T+M DY Y V + GG T +FDSG+ TY
Sbjct: 329 AASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFTRDGTLFDSGTILTY 388
Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
L Y +L K + K AP + C+ F + + +A F+D
Sbjct: 389 LPPEAYASLRDRFK--FTMTQYKPAPAYDPFDTCYD----FTGHNAIF--MPAVAFKFSD 440
Query: 333 GKTRTLFELTPEAYLIISNKGNVCLGIL 360
G +F+L+P A LI + G L
Sbjct: 441 GA---VFDLSPVAILIYPDDTAPATGCL 465
>gi|255588450|ref|XP_002534607.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223524923|gb|EEF27776.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 260
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/184 (29%), Positives = 92/184 (50%), Gaps = 18/184 (9%)
Query: 30 FNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEA 89
+NH+ + ++G++ GYY +YIG P + + L +DTGS++T++ C C +
Sbjct: 29 YNHLHPNARMPLYGDILSYGYYATKLYIGTPPQEFTLVVDTGSNMTFVPCCGSEEYCGKH 88
Query: 90 PHPLYRPSNDLVPCEDPICASLHAP--GHHNCEDP---AQCDYELEYADGGSSLGVLVKD 144
P ++ + +S + P H +C+ +QC Y++ Y DG S GVL +D
Sbjct: 89 EDPAFQTES----------SSTYQPVNCHPSCDCDYLRSQCSYKMHYGDGSYSRGVLAED 138
Query: 145 AFAFNYTNGQRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNV 203
+F N P RL GC + + DGI+GLG+G+S+IV QL + +I +
Sbjct: 139 IISFG--NESEFAPQRLVFGCELDAIGSLYSLRADGIIGLGRGRSTIVDQLVDKGVISDS 196
Query: 204 VGHC 207
C
Sbjct: 197 FSLC 200
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 85/345 (24%), Positives = 143/345 (41%), Gaps = 57/345 (16%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPC 103
+G Y + + IG PA +DTGSDL W QC+ PC +C P P++ P + +PC
Sbjct: 93 SGEYLMNVAIGTPASSLSAIMDTGSDLIWTQCE-PCTQCFSQPTPIFNPQDSSSFSTLPC 151
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
E C L + +N C Y Y DG S+ G + + F F ++ P +A G
Sbjct: 152 ESQYCQDLPSESCYN-----DCQYTYGYGDGSSTQGYMATETFTFETSS----VPNIAFG 202
Query: 164 C-----GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF--- 215
C G+ Q GA G++G+G G S+ SQL + +C++ G
Sbjct: 203 CGEDNQGFGQGNGA------GLIGMGWGPLSLPSQLGVGQF-----SYCMTSSGSSSPST 251
Query: 216 LFFG---DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----------V 262
L G + + S SS YY + + GG+ G+ + +
Sbjct: 252 LALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGM 311
Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
+ DSG++ TYL + Y + +++ + E+ L C++ V
Sbjct: 312 IIDSGTTLTYLPQDAYNAVAQAFTDQINLSPVDES--SSGLSTCFQLPSDGSTVQ----- 364
Query: 323 FRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGL 367
+++ F G + L E LI +G +CL + + ++ G+
Sbjct: 365 VPEISMQFDGG----VLNLGEENVLISPAEGVICLAMGSSSQQGI 405
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 90/379 (23%), Positives = 147/379 (38%), Gaps = 79/379 (20%)
Query: 46 YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC------------VRCVEAPHPL 93
Y G Y+V +G P++ + L DTGSDLTW+ C C +R H
Sbjct: 7 YGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHAN 66
Query: 94 YRPSNDLVPCEDPICAS--LHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNY 150
S +PC +C + NC P C Y+ Y+DG ++LG +
Sbjct: 67 LSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVEL 126
Query: 151 TNGQRLN-PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGH 206
G+++ + +GC G S+ DG++GLG K S + + K +V H
Sbjct: 127 KEGRKMKLHNVLIGCS-ESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDH 185
Query: 207 CLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK--------YYSPGVAELFFGGETTGLK 258
+L FG S + +M+ YT+ +Y+ + + GG +
Sbjct: 186 LSHKNVSNYLTFGSS--RSKEALLNNMT--YTELVLGMVNSFYAVNMMGISIGG---AML 238
Query: 259 NLP-----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
+P + DSGSS T+L YQ + + ++ L
Sbjct: 239 KIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSL-----------------L 281
Query: 308 KGRRPFKNVHDVKKCFRT----------LALSFTDGKTRTLFELTPEAYLIISNKGNVCL 357
K R+ ++ ++ CF + L F DG FE ++Y+I + G CL
Sbjct: 282 KFRKVEMDIGPLEYCFNSTGFEESLVPRLVFHFADGAE---FEPPVKSYVISAADGVRCL 338
Query: 358 GILNGAEVGLQDLNVIGGI 376
G ++ A G +V+G I
Sbjct: 339 GFVSVAWPG---TSVVGNI 354
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 87/330 (26%), Positives = 130/330 (39%), Gaps = 36/330 (10%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC-VRCVEAPHPLYRP----S 97
G Y G Y M +G PA+PY + +DTGS LTWLQC +PC V C P++ P S
Sbjct: 129 GTSYGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQC-SPCRVSCHRQSGPVFDPKTSSS 187
Query: 98 NDLVPCEDPICASLHAPGHH--NCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
V C P C L + C C Y+ Y D S+G L KD +F G
Sbjct: 188 YAAVSCSTPQCNDLSTATLNPAACSSSDVCIYQASYGDSSFSVGYLSKDTVSF----GSN 243
Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
P GCG + + G++GL + K S++ QL + +CL
Sbjct: 244 SVPNFYYGCGQDN--EGLFGRSAGLMGLARNKLSLLYQL--APTLGYSFSYCLPSSSSSG 299
Query: 216 LFFGDDLYDSSRVVWTSMSSD-------YTKYYSPGVAELFFGGETTGLKNLPVVFDSGS 268
Y+ + +T M S + K VA ++ +LP + DSG+
Sbjct: 300 YLSI-GSYNPGQYSYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSSEYSSLPTIIDSGT 358
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
T L Y L+ + + K K A L C+ G+ V V ++
Sbjct: 359 VITRLPTTVYDALSKAVAGAM--KGTKRADAYSILDTCFVGQASSLRVPAV-------SM 409
Query: 329 SFTDGKTRTLFELTPEAYLIISNKGNVCLG 358
+F+ G +L+ + L+ + CL
Sbjct: 410 AFSGGAA---LKLSAQNLLVDVDSSTTCLA 436
>gi|194700652|gb|ACF84410.1| unknown [Zea mays]
gi|414587775|tpg|DAA38346.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 500
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 69/247 (27%), Positives = 104/247 (42%), Gaps = 19/247 (7%)
Query: 57 IGQPARPYFLDLDTGSDLTWL--QCD--APCVRCVEAPHPLYRP----SNDLVPCEDPIC 108
+G P + + + LDTGSDL WL QCD P Y P ++ VPC C
Sbjct: 115 VGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAASGSATFYIPGMSSTSKAVPCNSNFC 174
Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTNG--QRLNPRLALGCG 165
C QC Y++ Y G SS G LV+D + N Q L ++ LGCG
Sbjct: 175 DL-----QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQILKAQIMLGCG 229
Query: 166 YNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYD 224
Q +G+ GLG + S+ S L + L N C G G + FGD
Sbjct: 230 QTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRISFGDQESS 289
Query: 225 SSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSI 284
++ + Y+ ++ + G + T + + +FD+G+S+TYL Y +T
Sbjct: 290 DQEETPLDINRQHPT-YAITISGITVGNKPTDM-DFITIFDTGTSFTYLADPAYTYITQS 347
Query: 285 MKKELSA 291
++ A
Sbjct: 348 FHAQVQA 354
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 101/350 (28%), Positives = 153/350 (43%), Gaps = 62/350 (17%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
+G Y V +Y+G P R + + +DTGSDL WLQC APC+ C + P++ P S V C
Sbjct: 147 SGEYLVEVYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFDQRGPVFDPMASTSYRNVTC 205
Query: 104 EDPICASLHAPGH-HNCE----DPAQCDYELEYADGGSSLGVLVKDAFAFNYT-NGQRLN 157
D C + P C DP C Y Y D ++ G L +AF N T + R
Sbjct: 206 GDTRCGLVSPPAAPRTCRSSRSDP--CPYYYWYGDQSNTTGDLALEAFTVNLTASSSRRV 263
Query: 158 PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGH----CLSGGG- 212
+ LGCG+ +H G+LGLG+G S SQL R V GH CL G
Sbjct: 264 DGVVLGCGHRN--RGLFHGAAGLLGLGRGPLSFASQL------RAVYGHAFSYCLVDHGS 315
Query: 213 --GGFLFFGDD--LYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLP----- 261
G + FGDD L ++ +T+ S+ +Y + + GGE + ++P
Sbjct: 316 AVGSKIVFGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGE---MLDIPSNTWG 372
Query: 262 ---------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRP 312
+ DSG++ +Y Y+ + ++ + K P P+ P
Sbjct: 373 VSKEDGSGGTIIDSGTTLSYFPEPAYKAI----RQAFVDRMDKAYPLIADFPVL----SP 424
Query: 313 FKNVHDVKKC-FRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGIL 360
NV V++ +L F DG +++ E Y I + +G +CL +L
Sbjct: 425 CYNVSGVERVEVPEFSLLFADG---AVWDFPAENYFIRLDTEGIMCLAVL 471
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 90/379 (23%), Positives = 147/379 (38%), Gaps = 79/379 (20%)
Query: 46 YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC------------VRCVEAPHPL 93
Y G Y+V +G P++ + L DTGSDLTW+ C C +R H
Sbjct: 78 YGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHAN 137
Query: 94 YRPSNDLVPCEDPICAS--LHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNY 150
S +PC +C + NC P C Y+ Y+DG ++LG +
Sbjct: 138 LSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVEL 197
Query: 151 TNGQRLN-PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGH 206
G+++ + +GC G S+ DG++GLG K S + + K +V H
Sbjct: 198 KEGRKMKLHNVLIGCS-ESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDH 256
Query: 207 CLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK--------YYSPGVAELFFGGETTGLK 258
+L FG S + +M+ YT+ +Y+ + + GG +
Sbjct: 257 LSHKNVSNYLTFGSS--RSKEALLNNMT--YTELVLGMVNSFYAVNMMGISIGG---AML 309
Query: 259 NLP-----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
+P + DSGSS T+L YQ + + ++ L
Sbjct: 310 KIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSL-----------------L 352
Query: 308 KGRRPFKNVHDVKKCFRT----------LALSFTDGKTRTLFELTPEAYLIISNKGNVCL 357
K R+ ++ ++ CF + L F DG FE ++Y+I + G CL
Sbjct: 353 KFRKVEMDIGPLEYCFNSTGFEESLVPRLVFHFADGAE---FEPPVKSYVISAADGVRCL 409
Query: 358 GILNGAEVGLQDLNVIGGI 376
G ++ A G +V+G I
Sbjct: 410 GFVSVAWPG---TSVVGNI 425
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 98/347 (28%), Positives = 142/347 (40%), Gaps = 58/347 (16%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC-VRCVEAPHPLYRPSN----DLVPC 103
G Y +T+ IG P PY DTGSDL W QC APC +C E P PLY P++ ++PC
Sbjct: 112 GEYLMTLAIGTPPLPYAAVADTGSDLIWTQC-APCGTQCFEQPAPLYNPASSTTFSVLPC 170
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG-QRLNPRLAL 162
+ A C Y Y G ++ GV + F F + Q P +A
Sbjct: 171 NSSLSMCAGALAGAAPPPGCACMYYQTYGTGWTA-GVQGSETFTFGSSAADQARVPGVAF 229
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL 222
GC + + ++ G++GLG+G S+VSQL + + +CL+ F D
Sbjct: 230 GC--SNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRF-----SYCLTP-------FQDTN 275
Query: 223 YDSSRVVWTSMSSDYTKYYS-PGVAE-----------LFFGGETTGLKNLPV-------- 262
S+ ++ S + + T S P VA L G + G K LP+
Sbjct: 276 STSTLLLGPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLK 335
Query: 263 -------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET-LPLCWKGRRPFK 314
+ DSG++ T L YQ + + +K +L D T L LC+ P
Sbjct: 336 PDGTGGLIIDSGTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCFALPAPTS 395
Query: 315 NVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILN 361
V ++ L F DG L P +IS G CL + N
Sbjct: 396 APPAV---LPSMTLHF-DGADMVL----PADSYMISGSGVWCLAMRN 434
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 89/336 (26%), Positives = 133/336 (39%), Gaps = 43/336 (12%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
+G Y V + +G PA+ + + +DTGS L+WLQC + C P++ PS +PC
Sbjct: 110 SGNYYVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSTSKTYKALPC 169
Query: 104 -----EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
++L+APG N C Y+ Y D S+G L +D T + +
Sbjct: 170 SSSQCSSLKSSTLNAPGCSNAT--GACVYKASYGDTSFSIGYLSQDVLTL--TPSEAPSS 225
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG------ 212
GCG + + GI+GL K S++ QL K N +CL
Sbjct: 226 GFVYGCGQDN--QGLFGRSSGIIGLANDKISMLGQL--SKKYGNAFSYCLPSSFSAPNSS 281
Query: 213 --GGFLFFGDDLYDSSRVVWTSMSSDYT--KYYSPGVAELFFGGETTGLK----NLPVVF 264
GFL G SS +T + + Y + + G+ G+ N+P +
Sbjct: 282 SLSGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVPTII 341
Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR-RPFKNVHDVKKCF 323
DSG+ T L Y L +S K +AP L C+KG + V +++ F
Sbjct: 342 DSGTVITRLPVAVYNALKKSFVLIMS-KKYAQAPGFSILDTCFKGSVKEMSTVPEIQIIF 400
Query: 324 RTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGI 359
R A EL L+ KG CL I
Sbjct: 401 RGGA----------GLELKAHNSLVEIEKGTTCLAI 426
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 83/293 (28%), Positives = 130/293 (44%), Gaps = 41/293 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
TG Y + M++G P + +L LDTGSDL+W+QCD PC C E P Y P+ + C
Sbjct: 167 TGEYFIDMFVGTPPKHVWLILDTGSDLSWIQCD-PCYDCFEQNGPHYNPNESSSYRNISC 225
Query: 104 EDPICASLHAPG--HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT--NGQRLNPR 159
DP C + +P H + C Y +YADG ++ G + F N T NG+
Sbjct: 226 YDPRCQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKH 285
Query: 160 LA---LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-----G 211
+ GCG+ +H G+LGLG+G S SQL Q + + +CL+
Sbjct: 286 VVDVMFGCGHWN--KGFFHGAGGLLGLGRGPLSFPSQL--QSIYGHSFSYCLTDLFSNTS 341
Query: 212 GGGFLFFGDD--LYDSSRVVWTSM-----SSDYTKYYSPGVAELFFGGETTGLKNLP--- 261
L FG+D L + + +T + + D T YY + + GGE +
Sbjct: 342 VSSKLIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYL-QIKSIVVGGEVLDIPEKTWHW 400
Query: 262 -------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
+ DSGS+ T+ Y + +K++ + + A +D + C+
Sbjct: 401 SSEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQI--AADDFIMSPCY 451
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 76/274 (27%), Positives = 112/274 (40%), Gaps = 42/274 (15%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y V + +G P +L +D+GSD+ W+QC PC +C PL+ P+ V C
Sbjct: 127 SGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCR-PCEQCYAQTDPLFDPAASSSFSGVSC 185
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
IC +L G D +CDY + Y DG + G L + T Q +A+G
Sbjct: 186 GSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTAVQ----GVAIG 241
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLY 223
CG+ + G+LGLG G S+V QL V +CL+ G G
Sbjct: 242 CGHRN--SGLFVGAAGLLGLGWGAMSLVGQLGGAA--GGVFSYCLASRGAG--------- 288
Query: 224 DSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNL----------PVVFDSGSSYTYL 273
S + +Y G+ + GGE L++ VV D+G++ T L
Sbjct: 289 --------GAGSLASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRL 340
Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
R Y L + A L +P L C+
Sbjct: 341 PREAYAALRGAFDGAMGA--LPRSPAVSLLDTCY 372
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 85/294 (28%), Positives = 125/294 (42%), Gaps = 39/294 (13%)
Query: 15 VRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDL 74
+R ++ ++ +SL + G G + +G Y + +G P+ L +DTGSDL
Sbjct: 50 LRQRLAADAARYASLVDATGRLHSPVFSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDL 109
Query: 75 TWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCEDPAQ----CD 126
WLQC +PC RC ++ P VPC P C +L PG C+ C
Sbjct: 110 VWLQC-SPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQCRALRFPG---CDSGGAAGGGCR 165
Query: 127 YELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKG 186
Y + Y DG SS G L D AF N +N + LGCG + + G+LG+ +G
Sbjct: 166 YMVAYGDGSSSTGELATDKLAF--ANDTYVN-NVTLGCGRDNE--GLFDSAAGLLGVARG 220
Query: 187 KSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK-- 239
K SI +Q+ +V +CL +L FG S +T++ S+ +
Sbjct: 221 KISISTQV--APAYGSVFEYCLGDRTSRSTRSSYLVFGRTPEPPS-TAFTALLSNPRRPS 277
Query: 240 YYSPGVAELFFGGE-TTGLKNLP-----------VVFDSGSSYTYLNRVTYQTL 281
Y +A GGE TG N VV DSG++ + R Y L
Sbjct: 278 LYYVDMAGFSVGGERVTGFSNASLALDTATGRGGVVVDSGTAISRFARDAYAAL 331
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 84/291 (28%), Positives = 126/291 (43%), Gaps = 47/291 (16%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL- 100
G +G Y V++ +G P + L DTGSDLTW QC PC R C P++ PS
Sbjct: 123 GATIGSGNYIVSVGLGTPKKYLSLIFDTGSDLTWTQCQ-PCARYCYNQKDPVFVPSQSTT 181
Query: 101 ---VPCEDPICASLHA-----PGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN 152
+ C P C+ L + PG C C Y ++Y D S+G K+ T+
Sbjct: 182 YSNISCSSPDCSQLESGTGNQPG---CSAARACIYGIQYGDQSFSVGYFAKETLTLTSTD 238
Query: 153 GQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SG 210
+ GCG N + G++GLG+ K SIV Q +QK V +CL +
Sbjct: 239 ---VIENFLFGCGQNNR--GLFGSAAGLIGLGQDKISIVKQT-AQKY-GQVFSYCLPKTS 291
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK----NLPV---- 262
G+L FG + + +T ++ + GVA F+G + G+K +P+
Sbjct: 292 SSTGYLTFGGGGGGGA-LKYTPITKAH------GVAN-FYGVDIVGMKVGGTQIPISSSV 343
Query: 263 ------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
+ DSG+ T L Y L S +K ++ +APE L C+
Sbjct: 344 FSTSGAIIDSGTVITRLPPDAYSALKSAFEKGMA--KYPKAPELSILDTCY 392
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 67/131 (51%), Gaps = 12/131 (9%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND- 99
+ G +G Y + +G P R ++ LDTGSD+ W+QC PC +C PL+ P+
Sbjct: 143 ISGLAQGSGEYFTRLGVGTPPRYTYMVLDTGSDIMWIQC-LPCAKCYGQTDPLFNPAASS 201
Query: 100 ---LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
VPC P+C L G C + C+Y++ Y DG ++G + F GQ +
Sbjct: 202 TYRKVPCATPLCKKLDISG---CRNKRYCEYQVSYGDGSFTVGDFSTETLTF---RGQVI 255
Query: 157 NPRLALGCGYN 167
R+ALGCG++
Sbjct: 256 R-RVALGCGHD 265
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 90/349 (25%), Positives = 140/349 (40%), Gaps = 61/349 (17%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G+Y + + IG P + DTGSDLTW C PC +C + +P++ P + C+
Sbjct: 23 GHYLMEVSIGTPPFKIYGIADTGSDLTWTSC-VPCNKCYKQRNPIFDPQKSTSYRNISCD 81
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALG 163
+C L C C+Y YA + GVL ++ + T G+ + + + G
Sbjct: 82 SKLCHKLDT---GVCSPQKHCNYTYAYASAAITQGVLAQETITLSSTKGESVPLKGIVFG 138
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF----LFFG 219
CG+N G + + GI+GLG G S +SQ+ S S GG F + F
Sbjct: 139 CGHNNTGGFNDREM-GIIGLGGGPVSFISQIGS------------SFGGKRFSQCLVPFH 185
Query: 220 DDLYDSSR-------------VVWTSM--SSDYTKYY------SPGVAELFFGGETT-GL 257
D+ SS+ VV T + D T Y+ S G L F G ++ +
Sbjct: 186 TDVSVSSKMSLGKGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQSV 245
Query: 258 KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
+ V DSG+ T L Y L + ++ E++ K + D LC++ + +
Sbjct: 246 EKGNVFLDSGTPPTILPTQLYDRLVAQVRSEVAMKPVTND-LDLGPQLCYRTKNNLRG-- 302
Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVG 366
L F G + L P + G CLG N + G
Sbjct: 303 ------PVLTAHFEGGDVK----LLPTQTFVSPKDGVFCLGFTNTSSDG 341
>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 47/142 (33%), Positives = 73/142 (51%), Gaps = 14/142 (9%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
+G Y + +G P + ++ LDTGSD+ W+QC APC +C P++ P S + C
Sbjct: 171 SGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQC-APCRKCYSQTDPVFDPKKSGSFSSISC 229
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
P+C L +PG C C Y++ Y DG + G + F G R+ P++ALG
Sbjct: 230 RSPLCLRLDSPG---CNSRQSCLYQVAYGDGSFTFGEFSTETLTF---RGTRV-PKVALG 282
Query: 164 CGYNQVPGASYHPLDGILGLGK 185
CG++ + G+LGLG+
Sbjct: 283 CGHDNE--GLFVGAAGLLGLGR 302
>gi|6562285|emb|CAB62655.1| putative protein [Arabidopsis thaliana]
Length = 519
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 96/350 (27%), Positives = 145/350 (41%), Gaps = 59/350 (16%)
Query: 50 YYNVTMYIGQPARPYFLDLDTGSDLTWLQCD--APCVRCVEA-------PHPLYRPS--- 97
Y NV+ +G PA + + LDTGSDL WL C+ + C+R ++ P LY P+
Sbjct: 103 YANVS--VGTPATWFLVALDTGSDLFWLPCNCGSTCIRDLKEVGLSQSRPLNLYSPNTSS 160
Query: 98 -NDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS-SLGVLVKDAFAFNYTNGQR 155
+ + C D C + C Y+++Y + + G L +D T +
Sbjct: 161 TSSSIRCSDDRCFGSSRCSSPA----SSCPYQIQYLSKDTFTTGTLFEDVLHL-VTEDEG 215
Query: 156 LNP---RLALGCGYNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG- 210
L P + LGCG NQ S ++G+LGLG S+ S L K+ N C
Sbjct: 216 LEPVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYSVPSILAKAKITANSFSMCFGNI 275
Query: 211 -GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSS 269
G + FGD Y T P V E+ GG+ G++ L +FD+G+S
Sbjct: 276 IDVVGRISFGDKGY-------TDQMETPLLPTEPSVTEVSVGGDAVGVQ-LLALFDTGTS 327
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK-----CFR 324
+T+L Y +T ++ K PE PF+ +D+ F
Sbjct: 328 FTHLLEPEYGLITKAFDDHVTDKRRPIDPE-----------LPFEFCYDLSPNKTTILFP 376
Query: 325 TLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
+A++F G +F P L I N CLGIL + +N+IG
Sbjct: 377 RVAMTFEGGS--QMFLRNP---LFIDNSAMYCLGILKSVDF---KINIIG 418
>gi|399218365|emb|CCF75252.1| unnamed protein product [Babesia microti strain RI]
Length = 535
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 70/313 (22%), Positives = 132/313 (42%), Gaps = 32/313 (10%)
Query: 34 GSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPL 93
G ++G ++ YY + ++IG P ++ LDTGS L + C C++C +P
Sbjct: 163 GKKFKIPIYGTLHDFAYYFIKIFIGTPPSVQWVVLDTGSSLLGITC-GNCIQCGNHQNPN 221
Query: 94 YRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN- 152
Y P + C ++ C+ +C + Y++G G D +F+ ++
Sbjct: 222 YEPYESATAIK---CTDVNQCKLKGCD---ECRFMQHYSEGSFISGDYYTDVISFDKSSP 275
Query: 153 GQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
G + N LGC + +GI G+ SI+SQL + I N+ CLS G
Sbjct: 276 GYKFN---NLGCVLYENKLIYNQRANGIFGMSPNDDSIISQLFKRPEIDNIFSICLSDEG 332
Query: 213 GGFLFFGDD-----LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSG 267
G + G + + ++S + WT +++D Y + + + + + N DSG
Sbjct: 333 GELIIGGIEPELFNIKNNSEMAWTRLNTDNNYYIH--INSMSYLSDHVEITNTKFSIDSG 390
Query: 268 SSYTYLNRVTYQTLTS------IMKKELSAKSL-------KEAPEDETLPLCWKGRRPFK 314
++ T L Y+++ + M +E+ L ++ P+D + + K
Sbjct: 391 TTNTVLMEKMYKSIVNGVMNICFMDREIEGYDLDIGVTVIQKKPDDIVDLMIEREENVTK 450
Query: 315 -NVHDVKKCFRTL 326
+HD + C R +
Sbjct: 451 CEIHDDEICSRNI 463
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 55/153 (35%), Positives = 74/153 (48%), Gaps = 12/153 (7%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G Y T+ +G P R + + +DTGSDLTW+QC +PC C L+ P+ + C
Sbjct: 1 GEYLATVRLGTPERVFSVIVDTGSDLTWVQC-SPCGTCYSQNDSLFIPNTSTSFTKLACG 59
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
+C L P + C Y Y DG S G V D + NGQ+ P A G
Sbjct: 60 TELCNGLPYPMCNQ----TTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFG 115
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS 196
CG++ S+ DGILGLG+G S SQL +
Sbjct: 116 CGHDNE--GSFAGADGILGLGQGPLSFPSQLKT 146
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 93/344 (27%), Positives = 138/344 (40%), Gaps = 43/344 (12%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G Y + +G P+ DTGSDL+WLQC PC C PL+ P+ VPCE
Sbjct: 86 GEYLMRFSLGTPSVERLAIFDTGSDLSWLQC-TPCKTCYPQEAPLFDPTQSSTYVDVPCE 144
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT---NGQRLNPRLA 161
C +L C QC Y +Y ++G L D +F+ T G P+
Sbjct: 145 SQPC-TLFPQNQRECGSSKQCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKSV 203
Query: 162 LGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLF 217
GC Y+ +G +GLG G S+ SQL Q I + +C+ S G L
Sbjct: 204 FGCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQ--IGHKFSYCMVPFSSTSTGKLK 261
Query: 218 FGDDLYDSSRVVWTS--MSSDYTKYYSPGVAELFFGGET--TGLKNLPVVFDSGSSYTYL 273
FG + ++ VV T ++ Y YY + + G + TG ++ DS T+L
Sbjct: 262 FG-SMAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKVLTGQIGGNIIIDSVPILTHL 320
Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
+ Y S +K+ ++ E ED P + R P N++ F FT
Sbjct: 321 EQGIYTDFISSVKEAINV----EVAEDAPTPFEYCVRNP-TNLN-----FPEFVFHFTGA 370
Query: 334 KTRTLFELTPEAYLIISNKGNVCLGIL---------NGAEVGLQ 368
L P+ I + VC+ ++ N A+V Q
Sbjct: 371 DVV----LGPKNMFIALDNNLVCMTVVPSKGISIFGNWAQVNFQ 410
>gi|357517935|ref|XP_003629256.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523278|gb|AET03732.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 544
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 77/272 (28%), Positives = 113/272 (41%), Gaps = 38/272 (13%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL-------------VPC 103
+G P + + LDTGSDL WL C+ C CV DL VPC
Sbjct: 119 VGTPPLWFLVALDTGSDLFWLPCN--CTSCVRGLKTQNGKVIDLNIYELDKSSTRKNVPC 176
Query: 104 EDPICASL--HAPGHHNCEDPAQCDYELEY-ADGGSSLGVLVKDAFAFNYTNGQR--LNP 158
+C H+ G + C YE+EY ++ SS G LV+D N Q ++
Sbjct: 177 NSNMCKQTQCHSSG-------SSCRYEVEYLSNDTSSSGFLVEDVLHLITDNDQTKDIDT 229
Query: 159 RLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
++ +GCG Q + GA+ +G+ GLG S+ S L + LI + C G G
Sbjct: 230 QITIGCGQVQTGVFLNGAA---PNGLFGLGMENVSVPSILAQKGLISDSFSMCFGSDGSG 286
Query: 215 FLFFGDD-LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYL 273
+ FGD D + + S T Y+ + ++ GG +FDSG+S+TYL
Sbjct: 287 RITFGDTGSSDQGKTPFNLRESHPT--YNVTITQIIVGGYAAD-HEFHAIFDSGTSFTYL 343
Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
N Y ++ + A D LP
Sbjct: 344 NDPAYTLISEKFNSLVKANRHSPLSPDSDLPF 375
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 78/280 (27%), Positives = 115/280 (41%), Gaps = 34/280 (12%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
+G Y + +G P + ++ LDTGSD+ W+QC APC +C P++ P S + C
Sbjct: 144 SGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQC-APCRKCYSQTDPVFDPKKSGSFSSISC 202
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
P+C L +PG C C Y++ Y DG + G + F G R+ P++ALG
Sbjct: 203 RSPLCLRLDSPG---CNSRQSCLYQVAYGDGSFTFGEFSTETLTF---RGTRV-PKVALG 255
Query: 164 CGYNQ-------------VPGASYHPLDGILGLGKGKS-SIVSQLHSQKLIRNVVGHCLS 209
CG++ G P L G+ S +V + S K V G
Sbjct: 256 CGHDNEGLFVGAAGLLGLGRGRLSFPTQTGLRFGRKFSYCLVDRSASSKPSSVVFGQSAV 315
Query: 210 GGGGGF--LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSG 267
F L L + T +S + G+ F +T G N V+ DSG
Sbjct: 316 SRTAVFTPLITNPKLDTFYYLELTGISVGGARV--AGITASLFKLDTAG--NGGVIIDSG 371
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
+S T L R Y +L + A LK AP+ C+
Sbjct: 372 TSVTRLTRRAYVSLRDAFRA--GAADLKRAPDYSLFDTCF 409
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 93/323 (28%), Positives = 134/323 (41%), Gaps = 38/323 (11%)
Query: 58 GQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPIC-ASLH 112
G PA + +DTGSDLTW+QC PC C PL+ P+ V C C ASL
Sbjct: 197 GSPAANLTVIVDTGSDLTWVQCK-PCSACYAQRDPLFDPAGSATYAAVRCNASACAASLK 255
Query: 113 A----PGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQ 168
A PG + +C Y L Y DG S GVL D A G L+ GCG +
Sbjct: 256 AATGTPGSCGGGNE-RCYYALAYGDGSFSRGVLATDTVAL---GGASLDG-FVFGCGLSN 310
Query: 169 VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDDL-- 222
+ G++GLG+ + S+VSQ + V +CL SG G L G D
Sbjct: 311 R--GLFGGTAGLMGLGRTELSLVSQ--TALRYGGVFSYCLPATTSGDASGSLSLGGDASS 366
Query: 223 -YDSSRVVWTSMSSDYTK--YYSPGVAELFFGG---ETTGLKNLPVVFDSGSSYTYLNRV 276
+++ V +T M +D + +Y V GG GL V+ DSG+ T L
Sbjct: 367 YRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNVLIDSGTVITRLAPS 426
Query: 277 TYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTR 336
Y+ + + ++ +A AP L C+ +VK TL L +G
Sbjct: 427 VYRGVRAEFTRQFAAAGYPTAPGFSILDTCYD----LTGHDEVKVPLLTLRL---EGGAE 479
Query: 337 TLFELTPEAYLIISNKGNVCLGI 359
+ +++ + VCL +
Sbjct: 480 VTVDAAGMLFVVRKDGSQVCLAM 502
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 90/343 (26%), Positives = 138/343 (40%), Gaps = 68/343 (19%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPIC 108
G + + + IG P Y +DTGSDL W QC PC +C + P P++ P +
Sbjct: 98 GEFLMNLAIGTPPETYSAIMDTGSDLIWTQCK-PCTQCFDQPSPIFDPKKSSSFSKLSCS 156
Query: 109 ASL-HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYN 167
+ L A +C D C+Y Y D S+ G + + F F G+ P + GCG +
Sbjct: 157 SQLCKALPQSSCSD--SCEYLYTYGDYSSTQGTMATETFTF----GKVSIPNVGFGCGED 210
Query: 168 QVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSR 227
G + G++GLG+G S+VSQL K +CL+ DD S+
Sbjct: 211 N-EGDGFTQGSGLVGLGRGPLSLVSQLKEAKF-----SYCLTS--------IDDTKTSTL 256
Query: 228 VVWTSMSSDYTKY-----------YSPGVAELFFGGETTGLKNLPV-------------- 262
++ + S + T P L G + G LP+
Sbjct: 257 LMGSLASVNGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKESTFQLQDDGTGG 316
Query: 263 -VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET----LPLCWKGRRPFKNVH 317
+ DSG++ TYL + ++KKE +++ P D + L LC+ +
Sbjct: 317 LIIDSGTTITYLEESAFD----LVKKEFTSQ--MGLPVDNSGATGLELCYNLPSDTSELE 370
Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLII-SNKGNVCLGI 359
K L L FT EL E Y+I S+ G +CL +
Sbjct: 371 VPK-----LVLHFTGAD----LELPGENYMIADSSMGVICLAM 404
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 86/329 (26%), Positives = 143/329 (43%), Gaps = 40/329 (12%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDP 106
Y VTM +G ++ + +DTGSDLTW+QC+ PC+ C P+++P S V C
Sbjct: 65 YIVTMGLG--SKNMTVIIDTGSDLTWVQCE-PCMSCYNQQGPIFKPSTSSSYQSVSCNSS 121
Query: 107 ICASLH----APGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
C SL G +P+ C+Y + Y DG + G L +A +F G
Sbjct: 122 TCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSF----GGVSVSDFVF 177
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFG 219
GCG N + + G++GLG+ S+VSQ ++ V +CL G G L G
Sbjct: 178 GCGRNN--KGLFGGVSGLMGLGRSYLSLVSQTNAT--FGGVFSYCLPTTEAGSSGSLVMG 233
Query: 220 DD---LYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGET----TGLKNLPVVFDSGSSY 270
++ +++ + +T M S+ + +Y + + GG N ++ DSG+
Sbjct: 234 NESSVFKNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALKAPLSFGNGGILIDSGTVI 293
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
T L Y+ L + K+ + AP L C+ +V T++L F
Sbjct: 294 TRLPSSVYKALKAEFLKKFTG--FPSAPGFSILDTCFN----LTGYDEVS--IPTISLRF 345
Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGI 359
+G + + T Y++ + VCL +
Sbjct: 346 -EGNAQLNVDATGTFYVVKEDASQVCLAL 373
>gi|18409320|ref|NP_566948.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|27754243|gb|AAO22575.1| unknown protein [Arabidopsis thaliana]
gi|332645259|gb|AEE78780.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 529
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 95/353 (26%), Positives = 154/353 (43%), Gaps = 55/353 (15%)
Query: 50 YYNVTMYIGQPARPYFLDLDTGSDLTWLQCD--APCVRCVEA-------PHPLYRPS--- 97
Y NV+ +G PA + + LDTGSDL WL C+ + C+R ++ P LY P+
Sbjct: 103 YANVS--VGTPATWFLVALDTGSDLFWLPCNCGSTCIRDLKEVGLSQSRPLNLYSPNTSS 160
Query: 98 -NDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS-SLGVLVKDAFAFNYTNGQR 155
+ + C D C + C Y+++Y + + G L +D T +
Sbjct: 161 TSSSIRCSDDRCFGSSRCSSPA----SSCPYQIQYLSKDTFTTGTLFEDVLHL-VTEDEG 215
Query: 156 LNP---RLALGCGYNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG- 210
L P + LGCG NQ S ++G+LGLG S+ S L K+ N C
Sbjct: 216 LEPVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYSVPSILAKAKITANSFSMCFGNI 275
Query: 211 -GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSS 269
G + FGD Y + ++ + ++ + Y+ V E+ GG+ G++ L +FD+G+S
Sbjct: 276 IDVVGRISFGDKGY-TDQMETPLLPTEPSPTYAVSVTEVSVGGDAVGVQ-LLALFDTGTS 333
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK-----CFR 324
+T+L Y +T ++ K PE PF+ +D+ F
Sbjct: 334 FTHLLEPEYGLITKAFDDHVTDKRRPIDPE-----------LPFEFCYDLSPNKTTILFP 382
Query: 325 TLALSFTDGKTRTLFELTPEAYLIISNKGN---VCLGILNGAEVGLQDLNVIG 374
+A++F G +F P I+ N+ N CLGIL + +N+IG
Sbjct: 383 RVAMTFEGGS--QMFLRNP--LFIVWNEDNSAMYCLGILKSVDF---KINIIG 428
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 102/399 (25%), Positives = 160/399 (40%), Gaps = 63/399 (15%)
Query: 5 HNGENLCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPY 64
+ E + + R+S + SS S + V L G++ +G Y V + +G P R
Sbjct: 102 QDKERVKYINSRISKNLGQDSSVSELDSV---TLPAKSGSLIGSGNYFVVVGLGTPKRDL 158
Query: 65 FLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL----VPCEDPICASLH-APGHH- 117
L DTGSDLTW QC+ PC R C + ++ PS + C +C L A G+
Sbjct: 159 SLIFDTGSDLTWTQCE-PCARSCYKQQDAIFDPSKSTSYSNITCTSTLCTQLSTATGNEP 217
Query: 118 NCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHP 176
C + C Y ++Y D S+G ++ + T+ + GCG N +
Sbjct: 218 GCSASTKACIYGIQYGDSSFSVGYFSRERLSVTATD---IVDNFLFGCGQNN--QGLFGG 272
Query: 177 LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMS 234
G++GLG+ S V Q + + R + +CL + G L FG T+
Sbjct: 273 SAGLIGLGRHPISFVQQ--TAAVYRKIFSYCLPATSSSTGRLSFG---------TTTTSY 321
Query: 235 SDYTKYYSPGVAELFFGGETTGLK----NLPV----------VFDSGSSYTYLNRVTYQT 280
YT + + F+G + TG+ LPV + DSG+ T L Y
Sbjct: 322 VKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTFSTGGAIIDSGTVITRLPPTAYTA 381
Query: 281 LTSIMKKELSAKSLKEAPEDETLPLCW--KGRRPFKNVHDVKKCFRTLALSFTDGKTRTL 338
L S ++ +S A E L C+ G F + SF G T
Sbjct: 382 LRSAFRQGMS--KYPSAGELSILDTCYDLSGYEVFS--------IPKIDFSFAGGVT--- 428
Query: 339 FELTPEAYLIISNKGNVCLGI-LNGAEVGLQDLNVIGGI 376
+L P+ L +++ VCL NG + D+ + G +
Sbjct: 429 VQLPPQGILYVASAKQVCLAFAANGDD---SDVTIYGNV 464
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 96/343 (27%), Positives = 139/343 (40%), Gaps = 35/343 (10%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS----NDLVPC 103
+G Y V + +G P + Y + LDTGS L+WLQC V C PLY PS + C
Sbjct: 122 SGNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSC 181
Query: 104 EDPICASLHAPGHHN--CE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
C+ L A ++ CE D C Y Y D S+G L +D T+ Q L P+
Sbjct: 182 ASVECSRLKAATLNDPLCETDSNACLYTASYGDTSFSIGYLSQDLLTL--TSSQTL-PQF 238
Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGD 220
GCG Q + GI+GL + K S+++QL ++ + +CL G G
Sbjct: 239 TYGCG--QDNQGLFGRAAGIIGLARDKLSMLAQLSTK--YGHAFSYCLPTANSGSSGGGF 294
Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETT---------GLKNLPVVFDSGSSYT 271
S + T +P + L T + +P + DSG+ T
Sbjct: 295 LSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVIT 354
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
L Y L K +S K K AP L C+KG K++ V + + + F
Sbjct: 355 RLPMSMYAALRQAFVKIMSTKYAK-APAYSILDTCFKGS--LKSISAVPE----IKMIFQ 407
Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
G T L + LI ++KG CL G + +IG
Sbjct: 408 GGADLT---LRAPSILIEADKGITCLAF--AGSSGTNQIAIIG 445
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 85/328 (25%), Positives = 137/328 (41%), Gaps = 43/328 (13%)
Query: 45 VYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCE 104
V+ Y + + +G P +DTGS++TW QC PCV C E P++ PS E
Sbjct: 59 VFDNSVYLMKLQVGTPPFEIQAIIDTGSEITWTQC-LPCVHCYEQNAPIFDPSKSSTFKE 117
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR-LNPRLALG 163
GH C YE++Y D ++G L + + T+G+ + P +G
Sbjct: 118 K------RCDGH-------SCPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETIIG 164
Query: 164 CGYNQVPGASYHP-LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFG-DD 221
CG+N + + P G++GL G SS+++Q+ + ++ +C SG G + FG +
Sbjct: 165 CGHNN---SWFKPSFSGMVGLNWGPSSLITQMGGEY--PGLMSYCFSGQGTSKINFGANA 219
Query: 222 LYDSSRVVWTSMSSDYTK---YY------SPGVAELFFGGETTGLKNLPVVFDSGSSYTY 272
+ VV T+M K YY S G + G T +V DSG++ TY
Sbjct: 220 IVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTFHALEGNIVIDSGTTLTY 279
Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
V+Y L + + P + LC+ D F + + F+
Sbjct: 280 F-PVSYCNLVRQAVEHVVTAVRAADPTGNDM-LCYNS--------DTIDIFPVITMHFSG 329
Query: 333 GKTRTLFELTPEAYLIISNKGNVCLGIL 360
G L + Y+ +N G CL I+
Sbjct: 330 GVDLVLDKY--NMYMESNNGGVFCLAII 355
>gi|255079464|ref|XP_002503312.1| predicted protein [Micromonas sp. RCC299]
gi|226518578|gb|ACO64570.1| predicted protein [Micromonas sp. RCC299]
Length = 649
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 98/371 (26%), Positives = 162/371 (43%), Gaps = 40/371 (10%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPA-RPYFLDLDTGSDLTWLQCDAPCVRC-VEAPHPLYRP 96
F +HG+V GYY + +G P+ R + + +DTGS LT++ C A C +C + P
Sbjct: 100 FPLHGSVKEHGYYYANIALGDPSPRTFQVIVDTGSTLTYVPC-ATCAKCGTHTGGTRFDP 158
Query: 97 SNDLVPCEDPICASLHAPG---HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
+ + C++ C + PG +C Y YA+G G LV+D F
Sbjct: 159 TGKWLTCQEKQCKAAGGPGICAGGRGAAANRCTYSRTYAEGSGVSGDLVRDKMHFGGDIA 218
Query: 154 QRLNPRLALGCGYNQVPGASYH--PLDGILGLGKGK-SSIVSQLHSQKLIRNVVGHCL-S 209
N L + G + H DG++GLG + +SI +QL + V C S
Sbjct: 219 PATNGTLDVVFGCTNAESGTIHDQEADGLIGLGNNQFASIPNQLADTHGLPRVFSLCFGS 278
Query: 210 GGGGGFLFFGD--DLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGETTGL-KNLPV-- 262
GGG L FG + +V+T M + + YY A + G +L V
Sbjct: 279 FEGGGALSFGRLPATPHTPPLVYTDMRVNEAHPAYYVVSTAAMKIGDVAVATPSDLAVGY 338
Query: 263 --VFDSGSSYTYLNRVTYQTLTSIMKKELSA-----KSLKEAP-EDETLP--LCWKGR-- 310
V DSG+++TY+ + + + ++ K L + P D + P +C++
Sbjct: 339 GTVMDSGTTFTYVPTKVFHATAAALDAAVTTNAKPEKKLAKVPGPDPSYPDDVCFQREGA 398
Query: 311 ---RPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAEV 365
P + ++ + + L ++F DG+ +L L P YL + K G CLG+++ +
Sbjct: 399 TEIEPIVTMANLGEYYPPLTIAF-DGEGASLV-LPPSNYLFVHGKKPGAFCLGVMDNKQQ 456
Query: 366 GLQDLNVIGGI 376
G +IGGI
Sbjct: 457 G----TLIGGI 463
>gi|449434468|ref|XP_004135018.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 568
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 97/358 (27%), Positives = 146/358 (40%), Gaps = 57/358 (15%)
Query: 50 YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-------------VEAPHPLYRP 96
Y NV+ +G P+ + + LDTGSDL WL C+ C C + P
Sbjct: 105 YANVS--VGTPSLDFLVALDTGSDLFWLPCE--CSSCFTYLNTSNGGKFMLNHYSPNDST 160
Query: 97 SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTNGQR 155
++ VPC +C + + C YE+ Y SS+G LV+D T+
Sbjct: 161 TSSTVPCTSSLC-------NRCTSNQNVCPYEMRYLSANTSSIGYLVEDVLHLA-TDDSL 212
Query: 156 LNP---RLALGCGYNQV-PGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
L P ++ GCG Q A+ +G++GLG K S+ S L Q L N C
Sbjct: 213 LKPVEAKITFGCGTVQTGIFATTAAPNGLIGLGMEKISVPSFLADQGLTSNSFSMCFGAD 272
Query: 212 GGGFLFFGDD-LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
G G + FGD D + + +M + Y+ + GGE + +FDSG+S+
Sbjct: 273 GYGRIDFGDTGPADQKQTPFNTMLE--YQSYNVTFNVINVGGEPNDVP-FTAIFDSGTSF 329
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV---KKCFRTLA 327
TYL Y T+T M + K + PF+ +++ K F+ L
Sbjct: 330 TYLTEPAYSTITKQMDAGMKLKRYS----------LFGPNFPFEYCYEIPPGAKEFQYLT 379
Query: 328 LSFT----DGKTRT-LFELTP----EAYLIISNKGNV-CLGILNGAEVGLQDLNVIGG 375
L+FT D T T +F P +I +V CL I ++ L N + G
Sbjct: 380 LNFTMKGGDEFTPTDIFVFLPVDVSTMNIIFEETTHVACLAIAKSTDIDLIGQNFMTG 437
>gi|357461293|ref|XP_003600928.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355489976|gb|AES71179.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 295
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 72/251 (28%), Positives = 107/251 (42%), Gaps = 57/251 (22%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPI 107
G Y V++ IG P + + + +DTGSDLTW + LY+ N+ V +
Sbjct: 15 VGGYTVSLKIGYPGQSFDVFIDTGSDLTW------------DKYKLYKLHNNFVYVRIKL 62
Query: 108 CASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYN 167
Y DG + G LV+D ++ P+
Sbjct: 63 AI---------------------YVDGLQTKGFLVQDNIPLESSDRTLQRPKCT---NIL 98
Query: 168 QVPGASYHPL-DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLYD 224
+V P+ GILGLG G++SI+SQL S+ LI+NVVGHC SG G GG
Sbjct: 99 KVTDKKPKPISKGILGLGHGETSILSQLKSKGLIKNVVGHCFSGKEGQGG---------- 148
Query: 225 SSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSI 284
+ D Y A L F + T +K+L ++FDSG++ + N ++ L
Sbjct: 149 -------NTKIDLEGRYFSEPANLIFDEKLTFIKDLQLIFDSGTTLSAFNSKDHKVLVD- 200
Query: 285 MKKELSAKSLK 295
+ E+S LK
Sbjct: 201 PENEVSKDYLK 211
>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 457
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 58/195 (29%), Positives = 85/195 (43%), Gaps = 21/195 (10%)
Query: 14 TVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSD 73
++R S + S S + ++ SS+ + + Y Y + IG PA + D+GS
Sbjct: 64 SIRTSGARGDSIRSIMSGNITSSMKYPISRMSYTDKAYVMKFSIGSPAVDTYAIPDSGSS 123
Query: 74 LTWLQCDAP-CVRCVEAPHPLYRPSNDLV----PCEDPICASLHAPGHHNCEDPAQ-CDY 127
L WLQC P C C PL+ PS + C C + C+ P Q C Y
Sbjct: 124 LVWLQCGTPYCRNCYRQKIPLFNPSKSVTYMKRLCNTAECRVALGDEYWRCKKPNQICKY 183
Query: 128 ELEYADGGSSLGVLVKDAFAF--------NYTNGQRLNPRLALGCGYNQVPGASYHPLDG 179
+Y D + GV+ D F F NYT R+ GCGYN ++P G
Sbjct: 184 HEDYLDDSYTEGVISTDIFTFPEHISGFGNYT------LRIIFGCGYNNSDPQHFYP-PG 236
Query: 180 ILGLGKGKSSIVSQL 194
++GL K+S+V Q+
Sbjct: 237 LVGLTNNKASLVGQM 251
>gi|308813706|ref|XP_003084159.1| Aspartyl protease (ISS) [Ostreococcus tauri]
gi|116056042|emb|CAL58575.1| Aspartyl protease (ISS) [Ostreococcus tauri]
Length = 478
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 87/336 (25%), Positives = 135/336 (40%), Gaps = 26/336 (7%)
Query: 40 QVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD--APCVRCVEAPHPLYRPS 97
+V+G V TG + + A+ + L +DTGS T+L C A C + Y S
Sbjct: 24 EVYGEVLETGVLVASFELA-GAQTFELIVDTGSSRTYLPCKGCASCGAHEAGRYYDYDAS 82
Query: 98 NDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
D E CA + C C Y++ Y +G S G LV+D + + G N
Sbjct: 83 ADFSRVECSACAGIGG----KCGTSGVCRYDVHYLEGSGSEGYLVRDVVSLGGSVG---N 135
Query: 158 PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG------- 210
+ GC ++ DG+ G G+ ++ +QL S +I ++ C+ G
Sbjct: 136 ATVVFGCEERELGSIKQQSADGLFGFGRQAYALRAQLASASVIDDLFSMCVEGYEKLSGE 195
Query: 211 GGGGFLFFG--DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGS 268
GG L G D D+ +V+T M S Y + G + + + DSG+
Sbjct: 196 HVGGLLTLGNFDFGADAPALVYTPMVSSAMYYQVTTTSWTLGNSVVEGSRGVLTIIDSGT 255
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSL-KEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
SYTY+ + + + L K AP ++ LC+ G V + F L
Sbjct: 256 SYTYVPGNMHARFLQLAEDAARESGLEKVAPPEDYPDLCF-GNSGGLGWSTVSEYFPALK 314
Query: 328 LSFTDGKTRTLFELTPEAYLII--SNKGNVCLGILN 361
+ + G R L+PE YL N C+GIL
Sbjct: 315 IEY-HGSAR--LTLSPETYLYWHQKNASAFCVGILE 347
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 99/358 (27%), Positives = 135/358 (37%), Gaps = 63/358 (17%)
Query: 40 QVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV--RCVEAPHPLYRPS 97
QVH T Y + IG P + +DTGSDL W QC C+ C + P Y S
Sbjct: 78 QVH---RATRQYIASYLIGSPPQRTEALIDTGSDLIWTQCATTCLPKSCAKQGLPYYNLS 134
Query: 98 NDL----VPCEDP--ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT 151
VPC D CA A G H C C + Y G +G L ++FAF
Sbjct: 135 QSSTFVPVPCADKAGFCA---ANGVHLCGLDGSCTFIASYG-AGRVIGSLGTESFAF--- 187
Query: 152 NGQRLNPRLALGC-GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
+ LA GC ++ + + G++GLG+G+ S+VSQ+ + + + + S
Sbjct: 188 --ESGTTSLAFGCVSLTRITSGALNDASGLIGLGRGRLSLVSQIGATRFSYCLTPYFHSS 245
Query: 211 GGGGFLFFGDDLYDSSR---VVWTSMSSDY---TKYYSPGVAELFFGGETTGLKNLP--- 261
G LF G + + DY T YY P G T G LP
Sbjct: 246 GASSHLFVGASASLGGGGASMPFVKSPKDYPYSTFYYLP------LEGITVGKTRLPAVN 299
Query: 262 -----------------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
V+ D+GS T L Y+ L + +L SL APED L
Sbjct: 300 STTFQLRQLFKGYWAGGVIIDTGSPLTQLASHAYEALKEEVAAQLGNGSLVPAPEDSGLE 359
Query: 305 LCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNG 362
LC R F+ V L F G + +Y +K C+ IL G
Sbjct: 360 LC-VAREGFQKV------VPALVFHFGGGAD---MAVPAASYWAPVDKAAACMMILEG 407
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 90/379 (23%), Positives = 146/379 (38%), Gaps = 79/379 (20%)
Query: 46 YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC------------VRCVEAPHPL 93
Y G Y V +G P++ + L DTGSDLTW+ C C +R H
Sbjct: 78 YGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHAN 137
Query: 94 YRPSNDLVPCEDPICAS--LHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNY 150
S +PC +C + NC P C Y+ Y+DG ++LG +
Sbjct: 138 LSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVEL 197
Query: 151 TNGQRLN-PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGH 206
G+++ + +GC G S+ DG++GLG K S + + K +V H
Sbjct: 198 KEGRKMKLHNVLIGCS-ESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDH 256
Query: 207 CLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK--------YYSPGVAELFFGGETTGLK 258
+L FG S + +M+ YT+ +Y+ + + GG +
Sbjct: 257 LSHKNVSNYLTFGSS--RSKEALLNNMT--YTELVLGMVNSFYAVNMMGISIGG---AML 309
Query: 259 NLP-----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
+P + DSGSS T+L YQ + + ++ L
Sbjct: 310 KIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSL-----------------L 352
Query: 308 KGRRPFKNVHDVKKCFRT----------LALSFTDGKTRTLFELTPEAYLIISNKGNVCL 357
K R+ ++ ++ CF + L F DG FE ++Y+I + G CL
Sbjct: 353 KFRKVEMDIGPLEYCFNSTGFEESLVPRLVFHFADGAE---FEPPVKSYVISAADGVRCL 409
Query: 358 GILNGAEVGLQDLNVIGGI 376
G ++ A G +V+G I
Sbjct: 410 GFVSVAWPG---TSVVGNI 425
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 98/351 (27%), Positives = 151/351 (43%), Gaps = 50/351 (14%)
Query: 41 VHGNVYP-TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN- 98
V VY G + + M IG P+ + LDTGSDLTW QC PC C P P+Y PS
Sbjct: 104 VEAPVYAGNGEFLMKMAIGTPSLSFSAILDTGSDLTWTQCK-PCTDCYPQPTPIYDPSQS 162
Query: 99 ---DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
VPC +C +L ++C A C+Y Y D S+ G+L ++F Q
Sbjct: 163 STYSKVPCSSSMCQALPM---YSCSG-ANCEYLYSYGDQSSTQGILSYESFTL---TSQS 215
Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SG 210
L P +A GCG + G + G++G G+G S++SQL + + N +CL S
Sbjct: 216 L-PHIAFGCG-QENEGGGFSQGGGLVGFGRGPLSLISQLG--QSLGNKFSYCLVSITDSP 271
Query: 211 GGGGFLFFGDDLYDSSRVVWTS---MSSDYTKYYSPGVAELFFGGETTGLKNLP------ 261
LF G +++ V ++ S +Y + + GG+ + +
Sbjct: 272 SKTSPLFIGKTASLNAKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTFDLQLD 331
Query: 262 ----VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET-LPLCWKGRRPFKNV 316
V+ DSG++ TYL + Y + K +S+ +L + L LC++ +
Sbjct: 332 GTGGVIIDSGTTVTYLEQSGYDV---VKKAVISSINLPQVDGSNIGLDLCFEPQSGSSTS 388
Query: 317 HDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGIL--NGAEV 365
H F T+ F F L E Y+ + G CL +L NG +
Sbjct: 389 H-----FPTITFHFEGAD----FNLPKENYIYTDSSGIACLAMLPSNGMSI 430
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 81/274 (29%), Positives = 126/274 (45%), Gaps = 24/274 (8%)
Query: 55 MYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY-RPSND---LVPCEDPICAS 110
+ IG P ++ LDTGSDL W+QC+ PC C + P+Y R +D + C +P C S
Sbjct: 97 LSIGNPPTNVYVVLDTGSDLFWIQCE-PCDVCYKQKDPIYNRTKSDSYTEMLCNEPPCVS 155
Query: 111 LHAPGHHNCEDPAQCDYELEYADGGSSLGVLV--KDAFAFNYTNGQRLNPRLALGCGYNQ 168
L G C D C Y+ YADG + G+L K AF +Y++ + ++ GCG
Sbjct: 156 LGREGQ--CSDSGSCLYQTAYADGARTSGLLSYEKVAFTSHYSDEDK-TAQVGFGCGLQN 212
Query: 169 VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG----GGGGFLFFGDDLY- 223
+ + + G+LGLG G S+VSQL + + +C GGFL FGD Y
Sbjct: 213 LNFITSNRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNISNPNAGGFLVFGDATYL 272
Query: 224 --DSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-----VVFDSGSSYTYLNRV 276
D + +V GV E ++ + P V+ DSGS+ +
Sbjct: 273 NGDMTPMVIAEFYYVNLLGIGLGVGEPRLDINSSSFERKPDGSGGVIIDSGSTLSVFPPE 332
Query: 277 TYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
Y+ + + + +L K +P + P C++G+
Sbjct: 333 VYEVVRNAVVDKLK-KGYNISPLTSS-PDCFEGK 364
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 71/131 (54%), Gaps = 12/131 (9%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND- 99
+ G +G Y + +GQPA+P+++ LDTGSD+ WLQC PC C + P++ P +
Sbjct: 145 ISGTSQGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQ-PCTDCYQQTDPIFDPRSSS 203
Query: 100 ---LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
+PCE C +L G C ++C Y++ Y DG ++G V + F N +
Sbjct: 204 SFASLPCESQQCQALETSG---CR-ASKCLYQVSYGDGSFTVGEFVTETLTFG--NSGMI 257
Query: 157 NPRLALGCGYN 167
N +A+GCG++
Sbjct: 258 N-DVAVGCGHD 267
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 103/388 (26%), Positives = 158/388 (40%), Gaps = 54/388 (13%)
Query: 15 VRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDL 74
+R+ + +SS++ S V + + G T Y VT+ +G + L +DTGSDL
Sbjct: 106 LRIKAMTSSTTEQS----VSETQIPLTSGIKLETLNYIVTVELG--GKNMSLIVDTGSDL 159
Query: 75 TWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAP-------GHHNCEDPA 123
TW+QC PC C PLY P S V C C L A G N
Sbjct: 160 TWVQCQ-PCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATGNSGPCGGFNGVVKT 218
Query: 124 QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGL 183
C+Y + Y DG + G L ++ T + L GCG N + G++GL
Sbjct: 219 TCEYVVSYGDGSYTRGDLASESIVLGDTKLE----NLVFGCGRNN--KGLFGGASGLMGL 272
Query: 184 GKGKSSIVSQLHSQKLIRNVVGHC---LSGGGGGFLFFGDDL---YDSSRVVWTSMSSD- 236
G+ S+VSQ + K V +C L G G L FG+D +S+ V +T + +
Sbjct: 273 GRSSVSLVSQ--TLKTFNGVFSYCLPSLEDGASGTLSFGNDFSVYKNSTSVFYTPLVQNP 330
Query: 237 -YTKYYSPGVAELFFGGETTGLKNLP----VVFDSGSSYTYLNRVTYQTLTSIMKKELSA 291
+Y + GG LK L ++ DSG+ T L Y+ + + K+ S
Sbjct: 331 QLRSFYILNLTGASIGG--VELKTLSFGRGILIDSGTVITRLPPSIYKAVKTEFLKQFSG 388
Query: 292 KSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISN 351
AP L C+ + D+ T+ + F +G ++T Y + +
Sbjct: 389 --FPSAPGYSILDTCFN----LTSYEDIS--IPTIKMIF-EGNAELEVDVTGVFYFVKPD 439
Query: 352 KGNVCLGILNGAEVGLQDLNVIGGIGDF 379
VCL + L N +G IG++
Sbjct: 440 ASLVCLAL-----ASLSYENEVGIIGNY 462
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 96/360 (26%), Positives = 154/360 (42%), Gaps = 56/360 (15%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y + +Y+G P R + + +DTGSDL WLQC APC+ C E P++ P+ V C
Sbjct: 146 SGEYLIDVYVGTPPRRFRMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPAASSSYRNVTC 204
Query: 104 EDPICASLHAP-GHHNCEDPAQ--CDYELEYADGGSSLGVLVKDAFAFNYT--NGQRLNP 158
D C + P C PA+ C Y Y D ++ G L ++F N T R
Sbjct: 205 GDQRCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVD 264
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS-------GG 211
+ GCG+ +H G+LGLG+G S SQL R V GH S
Sbjct: 265 GVVFGCGHRN--RGLFHGAAGLLGLGRGPLSFASQL------RAVYGHTFSYCLVEHGSD 316
Query: 212 GGGFLFFGDD--LYDSSRVVWTSM---SSDYTKYYSPGVAELFFGGETTGLKNLP----- 261
G + FG+D + ++ +T+ SS +Y + + GG+ + +
Sbjct: 317 AGSKVVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWDVGK 376
Query: 262 -----VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNV 316
+ DSG++ +Y YQ + +L ++ P+ L C+ NV
Sbjct: 377 DGSGGTIIDSGTTLSYFVEPAYQVIRQAF-VDLMSRLYPLIPDFPVLNPCY-------NV 428
Query: 317 HDVKKC-FRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIG 374
V++ L+L F DG +++ E Y + + G +CL + G +++IG
Sbjct: 429 SGVERPEVPELSLLFADG---AVWDFPAENYFVRLDPDGIMCLAVRGTPRTG---MSIIG 482
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 74/281 (26%), Positives = 123/281 (43%), Gaps = 36/281 (12%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC--VEAPHPLYRPSNDLVPCED 105
+G Y + M IG PA +DTGSDL W +C+ PC C P + V C+
Sbjct: 39 SGEYLIQMAIGTPALSLSAIMDTGSDLVWTKCN-PCTDCSTSSIYDPSSSSTYSKVLCQS 97
Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
+C P +C + C+Y Y D S+ G+L + F+ + Q L P + GCG
Sbjct: 98 SLC---QPPSIFSCNNDGDCEYVYPYGDRSSTSGILSDETFSI---SSQSL-PNITFGCG 150
Query: 166 YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDD 221
++ + + G++G G+G S+VSQL + N +CL LF G+
Sbjct: 151 HDN---QGFDKVGGLVGFGRGSLSLVSQLGPS--MGNKFSYCLVSRTDSSKTSPLFIGNT 205
Query: 222 LYDSSRVVWTS--MSSDYTKYYSPGVAELFFGGET----TGLKNLP------VVFDSGSS 269
+ V ++ + S T +Y + + GG++ TG ++ ++ DSG++
Sbjct: 206 ASLEATTVGSTPLVQSSSTNHYYLSLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGTT 265
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
T+L + Y + KE S+ D L LC+ +
Sbjct: 266 LTFLQQTAYDAV-----KEAMVSSINLPQADGQLDLCFNQQ 301
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 94/366 (25%), Positives = 149/366 (40%), Gaps = 51/366 (13%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPL 93
S+ L G++ + Y V + +G P R L DTGSDLTW QC+ PC C + +
Sbjct: 30 STTLPAESGSLIGSANYVVVVGLGTPKRDLSLVFDTGSDLTWTQCE-PCAGSCYKQQDAI 88
Query: 94 YRPSNDL----VPCEDPICASLHAPG-HHNCEDP--AQCDYELEYADGGSSLGVLVKDAF 146
+ PS + C +C L + G C A C Y+ +Y D +S+G L ++
Sbjct: 89 FDPSKSSSYTNITCTSSLCTQLTSDGIKSECSSSTDASCIYDAKYGDNSTSVGFLSQERL 148
Query: 147 AFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGH 206
T+ + GCG Q ++ G++GLG+ SIV Q S + +
Sbjct: 149 TITATD---IVDDFLFGCG--QDNEGLFNGSAGLMGLGRHPISIVQQTSSN--YNKIFSY 201
Query: 207 CL--SGGGGGFLFFGDDLYDSSRVVWTSMS--SDYTKYYSPGVAELFFGGETTGLKNLPV 262
CL + G L FG ++ +++T +S S +Y + + GG LP
Sbjct: 202 CLPATSSSLGHLTFGASAATNASLIYTPLSTISGDNSFYGLDIVSISVGG-----TKLPA 256
Query: 263 V-----------FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR 311
V DSG+ T L Y L S ++ + + A E L C+
Sbjct: 257 VSSSTFSAGGSIIDSGTVITRLAPTVYAALRSAFRRXMEKYPV--ANEAGLLDTCYD-LS 313
Query: 312 PFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGI-LNGAEVGLQDL 370
+K + + F F+ G T EL L + ++ VCL NG++ D+
Sbjct: 314 GYKEISVPRIDFE-----FSGGVT---VELXHRGILXVESEQQVCLAFAANGSD---NDI 362
Query: 371 NVIGGI 376
V G +
Sbjct: 363 TVFGNV 368
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 92/366 (25%), Positives = 144/366 (39%), Gaps = 60/366 (16%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC----DAPCVRCVEAPHPLYRPSNDL--- 100
TG Y V + +G PA+P+ L DTGSDLTW++C + P ++RP+
Sbjct: 101 TGQYFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQRVFRPAGSKSWS 160
Query: 101 -VPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYT--NGQRL 156
+PC+ C S NC P C Y+ Y D S+ GV+ D+ + + +G R
Sbjct: 161 PLPCDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVGLDSATVSLSGNDGTRK 220
Query: 157 NP--RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGHCLSGG 211
+ LGC G S+ DG+L LG S S+ S+ + +V H
Sbjct: 221 AKLQEVVLGCT-TSYDGQSFKSSDGVLSLGNSNISFASRAASRFGGRFSYCLVDHLAPRN 279
Query: 212 GGGFLFFGDDLYDSS------RVVWTSMSSDYTK-YYSPGVAELFFGGETTGL------- 257
FL FG+ R + T+ +Y V + GE +
Sbjct: 280 ATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERLEILPDVWDF 339
Query: 258 -KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNV 316
KN + DSG+S T L Y + + K+ + P N+
Sbjct: 340 RKNGGAILDSGTSLTILATPAYDAVVKAISKQFAGV-------------------PRVNM 380
Query: 317 HDVKKCFRTLALSFTDGKTRTLFE----LTP--EAYLIISNKGNVCLGILNGAEVGLQDL 370
+ C+ +S + F L P ++Y+I + G C+G++ GA G +
Sbjct: 381 DPFEYCYNWTGVSAEIPRMELRFAGAATLAPPGKSYVIDTAPGVKCIGVVEGAWPG---V 437
Query: 371 NVIGGI 376
+VIG I
Sbjct: 438 SVIGNI 443
>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 564
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 94/352 (26%), Positives = 146/352 (41%), Gaps = 52/352 (14%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---------- 100
Y + +G P + + LDTGSDL W+ CD C+ C AP YR + D
Sbjct: 143 YYTWVDVGTPNTSFMVALDTGSDLFWVPCD--CIEC--APLAGYRETLDRDLGIYKPAES 198
Query: 101 -----VPCEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNG 153
+PC +C P C P Q C Y +Y + +S G+L++D +
Sbjct: 199 TTSRHLPCSHELC-----PPGSGCSSPKQPCPYSTDYLQENTTSSGLLIEDILHLDSRES 253
Query: 154 QR-LNPRLALGCGYNQVPGASYH---PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
+ + +GCG Q SY DG+LGLG S+ S L L+RN C
Sbjct: 254 HAPVKASVVIGCGRKQ--SGSYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFK 311
Query: 210 GGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY--YSPGVAELFFGGETTGLKNLPVVFDSG 267
G +FFGD + T Y KY Y+ V + G + + + DSG
Sbjct: 312 EDSGR-IFFGDQGVSIQQS--TPFVPLYGKYQTYAVNVDKSCVGHKCFEATSFEALVDSG 368
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
+S+T L Y+ + K++ A + + ED + C+ P K + DV T+
Sbjct: 369 TSFTALPLNVYKAVAVEFDKQVHAPRITQ--EDASFEYCYSA-SPLK-MPDVP----TVT 420
Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNV---CLGILNGAE-VGLQDLNVIGG 375
L+F K+ F+ ++ +G+V CL + E +G+ N + G
Sbjct: 421 LTFAANKS---FQAVNPTIVLKDGEGSVAGFCLALQKSPEPIGIIGQNFLTG 469
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 88/351 (25%), Positives = 132/351 (37%), Gaps = 60/351 (17%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SNDLVPCED 105
+G Y + + +G P + + +DTGSDL W+QC PC +C P+Y P S+
Sbjct: 1 SGAYTMEIELGSPPKKFNAIVDTGSDLVWIQCK-PCSQCYSQSDPIYDPSASSTFAKTSC 59
Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG-QRLNPRLALGC 164
+ P C Y +Y D S+ G + + G + P GC
Sbjct: 60 STSSCQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGC 119
Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFFG 219
G ++ S+ GI+GLG+GK S+ +QL S I N +CL L FG
Sbjct: 120 G--RLNSGSFGGAAGIVGLGQGKISLSTQLGSA--INNKFSYCLVDFDDDSSKTSPLIFG 175
Query: 220 DDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGL-------------KNLPV-- 262
S + T + +S + YY G+ + GG+ L K L V
Sbjct: 176 SSASTGSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRA 235
Query: 263 --------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFK 314
+FDSG++ T L+ Y + S +S LP F
Sbjct: 236 LEVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSVS------------LPTVDASSSGFD 283
Query: 315 NVHDVKKC----FRTLALSFTDGKTRTLFELTPEAYLIISNKGN--VCLGI 359
+DV K F L L+F K F + Y +I + CL +
Sbjct: 284 LCYDVSKSKNFKFPALTLAFKGTK----FSPPQKNYFVIVDTAETVACLAM 330
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 71/131 (54%), Gaps = 12/131 (9%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND- 99
+ G +G Y + +GQPA+P+++ LDTGSD+ WLQC PC C + P++ P +
Sbjct: 145 ISGTSQGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQ-PCTDCYQQTDPIFDPRSSS 203
Query: 100 ---LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
+PCE C +L G C ++C Y++ Y DG ++G V + F N +
Sbjct: 204 SFASLPCESQQCQALETSG---CR-ASKCLYQVSYGDGSFTVGEFVIETLTFG--NSGMI 257
Query: 157 NPRLALGCGYN 167
N +A+GCG++
Sbjct: 258 N-NVAVGCGHD 267
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 92/336 (27%), Positives = 130/336 (38%), Gaps = 82/336 (24%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
T Y V + +G P RP L LDTGSDL W QC APC C + PL P+ +PC
Sbjct: 83 TNEYLVHLAVGTPPRPVALTLDTGSDLVWTQC-APCRDCFDQGIPLLDPAASSTYAALPC 141
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR-------L 156
P C +L +C C Y Y D ++G + D F F NG+R
Sbjct: 142 GAPRCRALP---FTSCGG-RSCVYVYHYGDKSVTVGKIATDRFTFG-DNGRRNGDGSLPA 196
Query: 157 NPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
RL GCG +N+ G GI G G+G+ S+ SQL++ F
Sbjct: 197 TRRLTFGCGHFNK--GVFQSNETGIAGFGRGRWSLPSQLNATS----------------F 238
Query: 216 LFFGDDLYDSSRVVWT---SMSSDYTKYYS-----------PGVAELFF---GGETTGLK 258
+ ++DS + T + ++ Y+ +S P L+F G + G
Sbjct: 239 SYCFTSMFDSKSSIVTLGGAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKT 298
Query: 259 NLPV--------VFDSGSSYTYLNRVTYQTLTSIMKKE------------------LSAK 292
LPV + DSG+S T L Y+ + + + L
Sbjct: 299 RLPVPETKFRSTIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDVCFALPVS 358
Query: 293 SLKEAPEDETLPLC-WKGRRPFKNVHDVKKCFRTLA 327
+L P +L C W R P + H C RT A
Sbjct: 359 ALWRRPAVPSLTRCTW--RAPTGSSHAATTCSRTSA 392
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 109/368 (29%), Positives = 152/368 (41%), Gaps = 62/368 (16%)
Query: 48 TGYYNVTMYIGQP-----ARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYR----PSN 98
+G Y + +G P + L D GSD+TWLQC PC RC P P+Y S
Sbjct: 122 SGEYIAKITVGTPYENDSSFEALLSPDMGSDVTWLQC-MPCFRCYHQPGPVYNRLKSSSA 180
Query: 99 DLVPCEDPICASLHAPGHHNC-EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
V C P C +L + G C + +C Y++EY DG SS G + F G R+
Sbjct: 181 SDVGCYAPACRALGSSG--GCVQFLNECQYKVEYGDGSSSAGDFGVETLTF--PPGVRV- 235
Query: 158 PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG--- 214
P +A+GCG + G P GILGLG+G S SQ+ + +CL+G G G
Sbjct: 236 PGVAIGCGSDN-QGLFPAPAAGILGLGRGSLSFPSQIAGR--YGRSFSYCLAGQGTGGRS 292
Query: 215 -FLFFGDDL-------YDSSRVVWTSMSSDYTKYYSPGVAELFFGG------ETTGLKNL 260
L FG S + S YT YY G+ + GG + L+
Sbjct: 293 STLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYV-GLVGISVGGVRVRGVTESDLRLD 351
Query: 261 P------VVFDSGSSYTYLNRVTYQTLTSIMK----KELSAKSLKEAPEDETLPLCWKGR 310
P V+ DSG++ T L+ Y + KEL S C+
Sbjct: 352 PSTGHGGVIVDSGTAVTRLSGPAYAAFRDAFRVAAVKELGWPS--PGGPFAFFDTCYSSV 409
Query: 311 RPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLII--SNKGNVCLGILNGAEVGLQ 368
R V K +++ F G +L P+ YLI SNKG +C A G +
Sbjct: 410 R-----GRVMKKVPAVSMHFAGG---VEVKLPPQNYLIPVDSNKGTMCFAF---AGSGDR 458
Query: 369 DLNVIGGI 376
+++IG I
Sbjct: 459 GVSIIGNI 466
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 86/315 (27%), Positives = 132/315 (41%), Gaps = 49/315 (15%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSNDL----VPCE 104
Y VT+ +G P+ L +DTGSDL+W+QC PC C PL+ PS +PC
Sbjct: 124 YVVTVGLGTPSVSQVLLIDTGSDLSWVQCQ-PCNSTTCYPQKDPLFDPSKSSTYAPIPCN 182
Query: 105 DPICASLHAPGH----HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
C L G+ + + AQC + + Y DG + GV + A L P +
Sbjct: 183 TDACRDLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNETLA--------LAPGV 234
Query: 161 AL-----GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG----- 210
A+ GCG++Q + DG+LGLG S+V Q S + +CL
Sbjct: 235 AVKDFRFGCGHDQ--DGANDKYDGLLGLGGAPESLVVQTAS--VYGGAFSYCLPALNNQV 290
Query: 211 ---GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----VV 263
GG + ++S V+T M + +Y + + GGE + ++
Sbjct: 291 GFLALGGGGAPSGGVVNTSGFVFTPMIREEETFYVVNMTGITVGGEPIDVPPSAFSGGMI 350
Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
DSG+ T L Y L + +K ++A L E +T C+ F +V
Sbjct: 351 IDSGTVVTELQHTAYNALQAAFRKAMAAYPLVRNGELDT---CYD----FSGYSNVT--L 401
Query: 324 RTLALSFTDGKTRTL 338
+AL+F+ G T L
Sbjct: 402 PKVALTFSGGATIDL 416
>gi|356548395|ref|XP_003542587.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 525
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 76/262 (29%), Positives = 107/262 (40%), Gaps = 38/262 (14%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRC----------VEAPHPLYRPS----NDLVP 102
IG P + + LD GSD+ W+ CD C+ C ++ YRPS + +P
Sbjct: 111 IGTPNVSFLVALDAGSDMLWVPCD--CIECASLSAGNYNVLDRDLNQYRPSLSNTSRHLP 168
Query: 103 CEDPICASLHAPGHHNCE---DPAQCDYELEYADGG-SSLGVLVKDAFAF----NYTNGQ 154
C +C H C+ DP C YE++YA SS G + +D +
Sbjct: 169 CGHKLCDV-----HSFCKGSKDP--CPYEVQYASANTSSSGYVFEDKLHLTSDGKHAEQN 221
Query: 155 RLNPRLALGCGYNQVPGASYHPL--DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
+ + LGCG Q G H DG+LGLG G S+ S L LI+N CL
Sbjct: 222 SVQASIILGCGRKQT-GDYLHGAGPDGVLGLGPGNISVPSLLAKAGLIQNSFSICLDENE 280
Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTY 272
G + FGD V S Y GV G + DSGSS+T+
Sbjct: 281 SGRIIFGDQ----GHVTQHSTPFLPIIAYMVGVESFCVGSLCLKETRFQALIDSGSSFTF 336
Query: 273 LNRVTYQTLTSIMKKELSAKSL 294
L YQ + + K+++A +
Sbjct: 337 LPNEVYQKVVTEFDKQVNASRI 358
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 88/321 (27%), Positives = 134/321 (41%), Gaps = 46/321 (14%)
Query: 68 LDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICASLH------APGHH 117
+DT S+LTW+QC APC C + PL+ PS+ VPC C +L + G
Sbjct: 168 VDTASELTWVQC-APCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALQLATGGTSGGAA 226
Query: 118 NCE----DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGAS 173
C+ A C Y L Y DG S GVL D + G+ ++ GCG + G
Sbjct: 227 ACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSL---AGEVIDG-FVFGCGTSN-QGPP 281
Query: 174 YHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFGDD---LYDSSR 227
+ G++GLG+ + S+VSQ Q V +CL G L GDD +S+
Sbjct: 282 FGGTSGLMGLGRSQLSLVSQTMDQ--FGGVFSYCLPLKESDSSGSLVIGDDSSVYRNSTP 339
Query: 228 VVWTSMSSDYTK--YYSPGVAELFFGGETT-------GLKNLPVVFDSGSSYTYLNRVTY 278
+V+ SM SD + +Y + + GG+ G + DSG+ T L Y
Sbjct: 340 IVYASMVSDPLQGPFYFVNLTGITVGGQEVESSGFSSGGGGGKAIIDSGTVITSLVPSIY 399
Query: 279 QTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTL 338
+ + + + +AP L C+ + +V+ +L L F DG
Sbjct: 400 NAVKAEFLSQFA--EYPQAPGFSILDTCFN----MTGLREVQ--VPSLKLVF-DGGVEVE 450
Query: 339 FELTPEAYLIISNKGNVCLGI 359
+ Y + S+ VCL +
Sbjct: 451 VDSGGVLYFVSSDSSQVCLAM 471
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 47/126 (37%), Positives = 65/126 (51%), Gaps = 8/126 (6%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
+G Y + IG PAR ++ LDTGSD+TWLQC APC C PL+ P S VPC
Sbjct: 193 SGEYFSRIGIGSPARQLYMVLDTGSDVTWLQC-APCADCYAQSDPLFDPALSSSYATVPC 251
Query: 104 EDPICASLHAPGHHN--CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
+ P C +L A HN + C YE+ Y DG ++G + +G +A
Sbjct: 252 DSPHCRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTLG-GDGSAAVHDVA 310
Query: 162 LGCGYN 167
+GCG++
Sbjct: 311 IGCGHD 316
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 71/276 (25%), Positives = 117/276 (42%), Gaps = 43/276 (15%)
Query: 40 QVHGNVYP-TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
++ V P G + + + IG P Y +DTGSDL W QC PC +C + P P++ P
Sbjct: 85 EIDAPVLPGNGEFLMKLAIGTPPETYSAIMDTGSDLIWTQCK-PCTQCFDQPTPIFDPKK 143
Query: 99 DLVPCEDPICASL-HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
+ + L A C D C+Y Y D S+ G+L + F G+
Sbjct: 144 SSSFSKLSCSSKLCEALPQSTCSD--GCEYLYGYGDYSSTQGMLASETLTF----GKVSV 197
Query: 158 PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL------IRNVVGHCLSGG 211
P +A GCG + G+ + G++GLG+G S+VSQL K + + L G
Sbjct: 198 PEVAFGCGEDN-EGSGFSQGSGLVGLGRGPLSLVSQLKEPKFSYCLTSVDDTKASTLLMG 256
Query: 212 GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV--------- 262
+ D ++ ++ S + YY L G + G +LP+
Sbjct: 257 SLASVKASDSEIKTTPLIQNSAQPSF--YY------LSLEGISVGDTSLPIKKSTFSLQE 308
Query: 263 ------VFDSGSSYTYLNRVTYQTLTSIMKKELSAK 292
+ DSG++ TYL + + ++ KE +++
Sbjct: 309 DGSGGLIIDSGTTITYLEQSAFD----LVAKEFTSQ 340
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 90/293 (30%), Positives = 124/293 (42%), Gaps = 51/293 (17%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV--EAPHPLYRPSN----DLVP 102
G YN+ + +G P + + +DTGS+L W QC APC RC P P+ +P+ +P
Sbjct: 89 GAYNMNISLGTPPLDFPVIVDTGSNLIWAQC-APCTRCFPRPTPAPVLQPARSSTFSRLP 147
Query: 103 CEDPICASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
C C L C A C Y Y G ++ G L + T G P++A
Sbjct: 148 CNGSFCQYLPTSSRPRTCNATAACAYNYTYGSGYTA-GYLATETL----TVGDGTFPKVA 202
Query: 162 LGCGY-NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFL 216
GC N V +S GI+GLG+G S+VSQL + +CL + GG +
Sbjct: 203 FGCSTENGVDNSS-----GIVGLGRGPLSLVSQLAVGRF-----SYCLRSDMADGGASPI 252
Query: 217 FFGDDLYDSSRVVWTS---MSSDY----TKYYS--PGVA----ELFFGGETTGLKNLPV- 262
FG + R V S + + Y T YY G+A EL G T G +
Sbjct: 253 LFGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLG 312
Query: 263 ---VFDSGSSYTYLNRVTY----QTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
+ DSG++ TYL + Y Q S M AP D L LC+K
Sbjct: 313 GGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYD--LDLCYK 363
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 73/252 (28%), Positives = 105/252 (41%), Gaps = 23/252 (9%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDP 106
Y +T+ +G PA + +DTGSD++W+QC PC +C PL+ P + C
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCGSA 186
Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
CA L G + C +QC Y + Y DG S+ G D A G GC
Sbjct: 187 ACAQLGQEG-NGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVKSFQFGC-- 239
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFL-FFGDDLY 223
+ V DG++GLG G S+VSQ + + +CL + GFL
Sbjct: 240 SNVESGFNDQTDGLMGLGGGAQSLVSQ--TAGTLGRAFSYCLPPTPSSSGFLTLGAAGGS 297
Query: 224 DSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLK----NLPVVFDSGSSYTYLNRVT 277
+S V T M SS +Y + + GG + + V DSG+ T L
Sbjct: 298 GTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVMDSGTVITRLPPTA 357
Query: 278 YQTLTSIMKKEL 289
Y L+S K +
Sbjct: 358 YSALSSAFKAGM 369
>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 81/274 (29%), Positives = 125/274 (45%), Gaps = 24/274 (8%)
Query: 55 MYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY-RPSND---LVPCEDPICAS 110
+ IG P ++ LDTGSDL W+QC+ PC C + P+Y R +D + C +P C S
Sbjct: 110 LSIGNPPTNVYVVLDTGSDLFWIQCE-PCDVCYKQKDPIYNRTKSDSYTEMLCNEPPCLS 168
Query: 111 LHAPGHHNCEDPAQCDYELEYADGGSSLGVLV--KDAFAFNYTNGQRLNPRLALGCGYNQ 168
L G C D C Y+ YADG + G+L K AF +Y++ + ++ GCG
Sbjct: 169 LGREGQ--CSDSGSCLYQTSYADGSRTSGLLSYEKVAFTSHYSDEDK-TAQVGFGCGLQN 225
Query: 169 VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG----GGGGFLFFGDDLY- 223
+ + G+LGLG G S+VSQL + + +C GGFL FGD Y
Sbjct: 226 LNFVTSSRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNLSNPNAGGFLVFGDATYL 285
Query: 224 --DSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-----VVFDSGSSYTYLNRV 276
D + +V GV E ++ + P V+ DSGS+ +
Sbjct: 286 NGDMTPMVIAEFYYVNLLGIGLGVEEPRLDINSSSFERKPDGSGGVIIDSGSTLSIFPPE 345
Query: 277 TYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
Y+ + + + +L K +P + P C++G+
Sbjct: 346 VYEVVRNAVVDKLK-KGYNISPLTSS-PDCFEGK 377
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 96/350 (27%), Positives = 152/350 (43%), Gaps = 40/350 (11%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL- 100
G++ +G Y VT+ +G P + + L DTGSDLTW QC+ PCV+ C ++ PS
Sbjct: 145 GSIIGSGNYFVTVGLGTPKKDFSLIFDTGSDLTWTQCE-PCVKSCYNQKEAIFNPSQSTS 203
Query: 101 ---VPCEDPICASL-HAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
+ C +C SL A G+ NC + C Y ++Y D S+G K+ + T+
Sbjct: 204 YANISCGSTLCDSLASATGNIFNCAS-STCVYGIQYGDSSFSIGFFGKEKLSLTATD--- 259
Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGG 213
+ GCG N + G+LGLG+ K S+VSQ + + + +CL S
Sbjct: 260 VFNDFYFGCGQNN--KGLFGGAAGLLGLGRDKLSLVSQ--TAQRYNKIFSYCLPSSSSST 315
Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------DS 266
GFL FG S+ + S + +Y + + GG + P VF DS
Sbjct: 316 GFLTFGGSTSKSASFTPLATISGGSSFYGLDLTGISVGGRKLAIS--PSVFSTAGTIIDS 373
Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTL 326
G+ T L Y L+S +K +S AP L C+ F N HD + +
Sbjct: 374 GTVITRLPPAAYSALSSTFRKLMS--QYPAAPALSILDTCFD----FSN-HDTISVPK-I 425
Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
L F+ G + ++ +++ VCL ++ D+ + G +
Sbjct: 426 GLFFSGG---VVVDIDKTGIFYVNDLTQVCLAFAGNSDA--SDVAIFGNV 470
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 89/307 (28%), Positives = 131/307 (42%), Gaps = 37/307 (12%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSNDL----VPCE 104
Y VT+ G P+ P L +DTGSD++W+QC APC C PL+ PS + C
Sbjct: 125 YMVTLGFGTPSVPQVLLMDTGSDVSWVQC-APCNSTECYPQKDPLFDPSKSSTYAPIACG 183
Query: 105 DPICASLHAPGHHNCED-PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
C L + C QC Y +EY DG S+ GV + F G + G
Sbjct: 184 ADACNKLGDHYRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETITF--APGITVK-DFHFG 240
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFG-- 219
CG++Q G S DG+LGLG S+V Q S + +CL GFL G
Sbjct: 241 CGHDQR-GPS-DKFDGLLGLGGAPESLVVQTAS--VYGGAFSYCLPALNSEAGFLALGVR 296
Query: 220 -DDLYDSSRVVWTSM---SSDYTKYYSPGVAELFFGGETTGLKNLP----VVFDSGSSYT 271
++S V+T M D T Y + + GG+ + ++ DSG+ T
Sbjct: 297 PSAATNTSAFVFTPMWHLPMDATSYMV-NMTGISVGGKPLDIPRSAFRGGMLIDSGTIVT 355
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
L Y L + ++K +A + + + +T C+ F +V +AL+F+
Sbjct: 356 ELPETAYNALNAALRKAFAAYPMVASEDFDT---CYN----FTGYSNVT--VPRVALTFS 406
Query: 332 DGKTRTL 338
G T L
Sbjct: 407 GGATIDL 413
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 99/399 (24%), Positives = 158/399 (39%), Gaps = 62/399 (15%)
Query: 5 HNGENLCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPY 64
+ E + + R+S + SS + S+ L G++ +G Y V + +G P R
Sbjct: 103 QDKERVKYINSRLSKNLGQDSS---VEELDSATLPAKSGSLIGSGNYFVVVGLGTPKRDL 159
Query: 65 FLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHN- 118
L DTGSDLTW QC+ PC R C + ++ PS + C +C L ++
Sbjct: 160 SLIFDTGSDLTWTQCE-PCARSCYKQQDVIFDPSKSTSYSNITCTSALCTQLSTATGNDP 218
Query: 119 -CEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHP 176
C + C Y ++Y D S+G ++ T+ + GCG N +
Sbjct: 219 GCSASTKACIYGIQYGDSSFSVGYFSRERLTVTATD---VVDNFLFGCGQNN--QGLFGG 273
Query: 177 LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMS 234
G++GLG+ S V Q ++ R + +CL + G L FG T
Sbjct: 274 SAGLIGLGRHPISFVQQTAAK--YRKIFSYCLPSTSSSTGHLSFGP--------AATGRY 323
Query: 235 SDYTKYYSPGVAELFFGGETTGLK----NLPV----------VFDSGSSYTYLNRVTYQT 280
YT + + F+G + T + LPV + DSG+ T L Y
Sbjct: 324 LKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFSTGGAIIDSGTVITRLPPTAYGA 383
Query: 281 LTSIMKKELSAKSLKEAPEDETLPLCW--KGRRPFKNVHDVKKCFRTLALSFTDGKTRTL 338
L S ++ +S A E L C+ G + F T+ SF G T
Sbjct: 384 LRSAFRQGMS--KYPSAGELSILDTCYDLSGYKVFS--------IPTIEFSFAGGVT--- 430
Query: 339 FELTPEAYLIISNKGNVCLGI-LNGAEVGLQDLNVIGGI 376
+L P+ L +++ VCL NG + D+ + G +
Sbjct: 431 VKLPPQGILFVASTKQVCLAFAANGDD---SDVTIYGNV 466
>gi|357117138|ref|XP_003560331.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Brachypodium distachyon]
Length = 509
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 77/288 (26%), Positives = 120/288 (41%), Gaps = 38/288 (13%)
Query: 34 GSSLLFQVHGNV---YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP 90
G SLL GN + + + +G P + + LDTGSDL W+ CD C RC
Sbjct: 63 GESLLSFADGNSTTRHAGSLHYAKVALGTPNATFVVALDTGSDLFWVPCD--CKRCAPIA 120
Query: 91 H---------PLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGG-SSLGV 140
+ P ++ V C +C +A G+ N C Y ++Y SS GV
Sbjct: 121 NTSELLKPYSPRQSSTSKPVTCSHSLCDRPNACGNGN----GSCPYTVKYVSANTSSSGV 176
Query: 141 LVKDAFAFNYTN-----------GQRLNPRLALGCGYNQ----VPGASYHPLDGILGLGK 185
LV+D + G+ + R+ GCG Q + GA+ ++G+LGLG
Sbjct: 177 LVEDVLYMTRQSSSSRSGNGGNVGEAVGARVVFGCGQEQTGAFLDGAA---MEGLLGLGM 233
Query: 186 GKSSIVSQLHSQKLI-RNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPG 244
+ S+ S L + L+ + C S G G + FG+ ++ + S Y+
Sbjct: 234 DRVSVPSLLAAAGLVGSDSFSMCFSPDGNGRINFGEPSDAGAQNETPFIVSKTRPTYNIS 293
Query: 245 VAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAK 292
V + G+ V DSG+S+TYLN Y L + ++ K
Sbjct: 294 VTAVNVKGKGAMAAEFAAVVDSGTSFTYLNDPAYSLLATSFNSQVREK 341
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 70/287 (24%), Positives = 127/287 (44%), Gaps = 26/287 (9%)
Query: 21 SSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD 80
S +SS++ +FN + + + G G Y VT+ +G P + + L DTGSDLTW QC+
Sbjct: 107 SMNSSTTGVFNEMKTRVPTTHFG-----GGYAVTVGLGTPKKDFSLLFDTGSDLTWTQCE 161
Query: 81 APCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS 136
C + P+ + C C S+ C C Y ++Y G
Sbjct: 162 PCSGGCFPQNDEKFDPTKSTSYKNLSCSSEPCKSIGKESAQGCSSSNSCLYGVKYGT-GY 220
Query: 137 SLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS 196
++G L + ++ + +GCG G + G+LGLG+ ++ SQ S
Sbjct: 221 TVGFLATETLTITPSD---VFENFVIGCGERN--GGRFSGTAGLLGLGRSPVALPSQTSS 275
Query: 197 QKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGG-- 252
+N+ +CL S G L FG + +++ +T ++S + Y V+ + GG
Sbjct: 276 T--YKNLFSYCLPASSSSTGHLSFGGGVSQAAK--FTPITSKIPELYGLDVSGISVGGRK 331
Query: 253 ---ETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
+ + + + DSG++ TYL + L+S ++ ++ +L +
Sbjct: 332 LPIDPSVFRTAGTIIDSGTTLTYLPSTAHSALSSAFQEMMTNYTLTK 378
>gi|297820902|ref|XP_002878334.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
lyrata]
gi|297324172|gb|EFH54593.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
lyrata]
Length = 362
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 66/212 (31%), Positives = 97/212 (45%), Gaps = 32/212 (15%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
+++ ++ GYY ++IG P + + L +D+GS +T++ C + C +C + L P +
Sbjct: 80 MRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPC-SDCEQCGKHQVMLSSPKD 138
Query: 99 D---LVPCE-----------------DPICASLHAPGHHNCE-----DPAQCDYELEYAD 133
LV C+ P +S + P N + D QC YE EYA+
Sbjct: 139 QILCLVSCKVQIFKISYGLFDEDPKFQPELSSTYQPVKCNMDCNCDDDKEQCVYEREYAE 198
Query: 134 GGSSLGVLVKDAFAFNYTNGQRLNP-RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVS 192
SS GVL +D +F N L P R GC + DGI+GLG+G S+V
Sbjct: 199 HSSSKGVLGEDLISFG--NESHLTPQRAVFGCKTVETGDLYSQRADGIIGLGQGDLSLVG 256
Query: 193 QLHSQKLIRNVVGHCLSG---GGGGFLFFGDD 221
QL + LI N G C G GGG + G D
Sbjct: 257 QLVDKGLISNSFGLCYGGLDVGGGSMIVGGFD 288
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 90/293 (30%), Positives = 124/293 (42%), Gaps = 51/293 (17%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV--EAPHPLYRPSN----DLVP 102
G YN+ + +G P + + +DTGS+L W QC APC RC P P+ +P+ +P
Sbjct: 89 GAYNMNISLGTPPLDFPVIVDTGSNLIWAQC-APCTRCFPRPTPAPVLQPARSSTFSRLP 147
Query: 103 CEDPICASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
C C L C A C Y Y G ++ G L + T G P++A
Sbjct: 148 CNGSFCQYLPTSSRPRTCNATAACAYNYTYGSGYTA-GYLATETL----TVGDGTFPKVA 202
Query: 162 LGCGY-NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFL 216
GC N V +S GI+GLG+G S+VSQL + +CL + GG +
Sbjct: 203 FGCSTENGVDNSS-----GIVGLGRGPLSLVSQLAVGRF-----SYCLRSDMADGGASPI 252
Query: 217 FFGD--DLYDSSRVVWTSMSSD-----YTKYYS--PGVA----ELFFGGETTGLKNLPV- 262
FG L + S V T + + T YY G+A EL G T G +
Sbjct: 253 LFGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLG 312
Query: 263 ---VFDSGSSYTYLNRVTY----QTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
+ DSG++ TYL + Y Q S M AP D L LC+K
Sbjct: 313 GGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYD--LDLCYK 363
>gi|168000300|ref|XP_001752854.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696017|gb|EDQ82358.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 525
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 76/275 (27%), Positives = 115/275 (41%), Gaps = 36/275 (13%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCV----EAPHPLYRPSNDLVP---------- 102
IG P + + LDTGSDL W+ C+ C C E+ P N P
Sbjct: 117 IGTPNVQFLVVLDTGSDLLWIPCE--CESCAPLSAESKDPRTSQLNPYTPSLSSTAKPVL 174
Query: 103 CEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSL-GVLVKDAFAF-NYTNGQRLNPR 159
C DP+C C P QC YE+ Y +S G L +D F + G +
Sbjct: 175 CSDPLCEM-----SSTCMAPTDQCPYEINYVSANTSTSGALYEDYMYFMRESGGNPVKLP 229
Query: 160 LALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
+ LGCG Q + GA+ + G++GLG S+ ++L S + + C+S GG G
Sbjct: 230 VYLGCGKVQTGSLLKGAAPN---GLMGLGTTDISVPNKLASTGQLADSFSLCISPGGSGT 286
Query: 216 LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAEL--FFGGETTGLKNLPVVFDSGSSYTYL 273
L FGD+ + R T + + E+ G T L +FD+G+S+TYL
Sbjct: 287 LTFGDEGPAAQRT--TPIIPKSVSMLDTYIVEIDSITVGNTNLLMASHALFDTGTSFTYL 344
Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
++ Y ++S + P LC++
Sbjct: 345 SKTVYPQFVQAYDAQMSLPKWND-PRFSKWDLCYQ 378
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 81/311 (26%), Positives = 129/311 (41%), Gaps = 51/311 (16%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G Y ++ IG P F +DTGSDL WLQC+ PC +C P++ P S +PC
Sbjct: 86 GEYLMSYSIGTPPFKVFGFVDTGSDLVWLQCE-PCKQCYPQITPIFDPSLSSSYQNIPCL 144
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
C S+ CD G L + + T G ++ P+ +G
Sbjct: 145 SDTCHSMRT---------TSCDVR----------GYLSVETLTLDSTTGYSVSFPKTMIG 185
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS----------GGGG 213
CGY G + P GI+GLG G S+ SQL + I +CL G
Sbjct: 186 CGYRNT-GTFHGPSSGIVGLGSGPMSLPSQLGTS--IGGKFSYCLGPWLPNSTSKLNFGD 242
Query: 214 GFLFFGDDLYDSSRVVWTSMSSDY--TKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYT 271
+ +GD + V + S Y + +S G + FGG T G ++ DSG+++T
Sbjct: 243 AAIVYGDGAMTTPIVKKDAQSGYYLTLEAFSVGNKLIEFGGPTYGGNEGNILIDSGTTFT 302
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK-----GRRPFKNVH----DVKKC 322
+L Y S + + ++ + +++ + T LC+ P H D+K
Sbjct: 303 FLPYDVYYRFESAVAEYINLEHVEDP--NGTFKLCYNVAYHGFEAPLITAHFKGADIKLY 360
Query: 323 FRTLALSFTDG 333
+ + + +DG
Sbjct: 361 YISTFIKVSDG 371
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/173 (29%), Positives = 85/173 (49%), Gaps = 20/173 (11%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
T + V + +G P + +++ D +D TWLQC PC++C + P ++ PS L+ C
Sbjct: 184 TSNFLVQIGVGGPPQKFYMIFDLQTDFTWLQCQ-PCIKCYDQPDSIFDPSQSSSYTLLSC 242
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
E C L + +C D C Y + Y DG ++ GVL+ + +F + R++LG
Sbjct: 243 ETKHCNLL---PNSSCSDDGYCRYNITYKDGTNTEGVLINETVSFESSGWVD---RVSLG 296
Query: 164 C-GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
C NQ P + DG GLG+G S S++++ + +CL G+
Sbjct: 297 CSNKNQGP---FVGSDGTFGLGRGSLSFPSRINASSM-----SYCLVESKDGY 341
>gi|414587774|tpg|DAA38345.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 520
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 69/251 (27%), Positives = 104/251 (41%), Gaps = 23/251 (9%)
Query: 57 IGQPARPYFLDLDTGSDLTWL--QCDAPCVRCVEAPH---------PLYRPSNDLVPCED 105
+G P + + + LDTGSDL WL QCD C A P ++ VPC
Sbjct: 115 VGTPGQTFMVALDTGSDLFWLPCQCDG-CTPPATAASGSFQATFYIPGMSSTSKAVPCNS 173
Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTNG--QRLNPRLAL 162
C C QC Y++ Y G SS G LV+D + N Q L ++ L
Sbjct: 174 NFCDL-----QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQILKAQIML 228
Query: 163 GCGYNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDD 221
GCG Q +G+ GLG + S+ S L + L N C G G + FGD
Sbjct: 229 GCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRISFGDQ 288
Query: 222 LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTL 281
++ + Y+ ++ + G + T + + +FD+G+S+TYL Y +
Sbjct: 289 ESSDQEETPLDINRQHPT-YAITISGITVGNKPTDM-DFITIFDTGTSFTYLADPAYTYI 346
Query: 282 TSIMKKELSAK 292
T ++ A
Sbjct: 347 TQSFHAQVQAN 357
>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
Length = 506
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 95/351 (27%), Positives = 145/351 (41%), Gaps = 62/351 (17%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC------------ 86
++G+ Y + +G P + +DTGSD+ W +C C C
Sbjct: 76 LMLNGSSTSDATYYAQIGVGHPVQFLNAIVDTGSDILWFKCKL-CQGCSSKKNVIVCSSI 134
Query: 87 -VEAPHPLYRPSNDLVP----CEDPICASLHA-PGHHNCEDPAQCDYELEYADGGSSLGV 140
++ P LY P + C DP+C+ + G++N C Y++ Y D SS G+
Sbjct: 135 IMQGPITLYDPELSITASPATCSDPLCSEGGSCRGNNN-----SCAYDISYEDTSSSTGI 189
Query: 141 LVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLI 200
+D + LN + LGC + G P+DGI+G G+ K S+ +QL +Q
Sbjct: 190 YFRDVVHLGHK--ASLNTTMFLGCA-TSISG--LWPVDGIMGFGRSKVSVPNQLAAQAGS 244
Query: 201 RNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYS------------PGVA 246
N+ HCLSG GGG L G + + +V+T M ++ Y P A
Sbjct: 245 YNIFYHCLSGEKEGGGILVLGKN-DEFPEMVYTPMLANDIVYNVKLVSLSVNSKALPIEA 303
Query: 247 ELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLC 306
F T G N + DSG+S T+ + + + +K P T PL
Sbjct: 304 SEFEYNATVG--NGGTIIDSGTS-----SATFPSKALALFVKAVSKFTTAIP---TAPLE 353
Query: 307 WKGRRPFKNVHD---VKKCFRTLALSFTDGKTRTLFELTPEAYL--IISNK 352
G F ++ D V+ F + L F G T ELT YL ++S K
Sbjct: 354 SSGSPCFISISDRNSVEVDFPNVTLKFDGGAT---MELTAHNYLEAVVSRK 401
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 99/367 (26%), Positives = 145/367 (39%), Gaps = 55/367 (14%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SN 98
G + +G Y + +G P + +DTGSDL WLQC PC C PLY P ++
Sbjct: 80 GVPFDSGEYFAVINVGDPPTRALVVIDTGSDLIWLQC-VPCRHCYRQVTPLYDPRSSSTH 138
Query: 99 DLVPCEDPICAS-LHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
+PC P C L PG C+ C Y + Y DG +S G L D F
Sbjct: 139 RRIPCASPRCRDVLRYPG---CDARTGGCVYMVVYGDGSASSGDLATDRLVFPDDTHVH- 194
Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL------SG 210
+ LGCG++ V G+LG+G+G+ S +QL +V +CL +
Sbjct: 195 --NVTLGCGHDNV--GLLESAAGLLGVGRGQLSFPTQL--APAYGHVFSYCLGDRLSRAQ 248
Query: 211 GGGGFLFFGDDLYDSSRVVWTSMSSDYTK---YYSPGVAELFFGGETTGLKNLP------ 261
G +L FG S +T + ++ + YY V G TG N
Sbjct: 249 NGSSYLVFGRTPEPPS-TAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPA 307
Query: 262 -----VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSL--KEAPEDETLPLCWKGRRPFK 314
+V DSG++ + R Y + +A K A + C+ R
Sbjct: 308 TGRGGIVVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGA 367
Query: 315 NVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGN-----VCLGILNGAEVGLQD 369
V+ ++ L F G L P+A +I +G CLG L A+ G
Sbjct: 368 PAAAVR--VPSIVLHFAGGADMAL----PQANYLIPVQGGDRRTYFCLG-LQAADDG--- 417
Query: 370 LNVIGGI 376
LNV+G +
Sbjct: 418 LNVLGNV 424
>gi|302853254|ref|XP_002958143.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
nagariensis]
gi|300256504|gb|EFJ40768.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
nagariensis]
Length = 475
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 71/264 (26%), Positives = 116/264 (43%), Gaps = 24/264 (9%)
Query: 124 QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPL-DGILG 182
+C Y YA+ SS G +V+DAF F + R+ GC N G Y L DGI+G
Sbjct: 6 KCYYSRTYAERSSSEGWMVEDAFGFP---DDQPPVRMVFGC-ENGETGEIYRQLADGIMG 61
Query: 183 LGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGD-DLYDSSRVVWTSMSSD-YTKY 240
+G ++ SQL ++ +I +V C G L GD + + V+T + ++ + Y
Sbjct: 62 MGNNHNAFQSQLVARGVIEDVFSLCFGYPKDGILLLGDVPMPKGANTVYTPLLNNLHLHY 121
Query: 241 YSPGVAELFFGGETTGL------KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSL 294
Y+ + + G L + VV DSG+++TYL + + + + + L
Sbjct: 122 YNVRMDGIAVNGVELSLNARIFTRGYGVVLDSGTTFTYLPTEAFNAMAAAIGSYALSHGL 181
Query: 295 KEAP--EDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK 352
+ P + + +CWKG N ++ F + F D L P YL +S
Sbjct: 182 QSTPGADPQYNDICWKGAP--DNFQGLENHFPSAEFVFGDNAR---LSLPPLRYLFVSRP 236
Query: 353 GNVCLGILNGAEVGLQDLNVIGGI 376
G CLG+ + G +IGG+
Sbjct: 237 GEYCLGVFDNGGSG----TLIGGV 256
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 91/356 (25%), Positives = 139/356 (39%), Gaps = 60/356 (16%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y + +G PAR ++ LDTGSD+ WLQC APC +C ++ P+ +PC
Sbjct: 115 SGEYFTRIGVGTPARYVYMVLDTGSDVVWLQC-APCRKCYTQTDHVFDPTKSRTYAGIPC 173
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
P+C L +PG N C Y++ Y DG + G + F + R+ALG
Sbjct: 174 GAPLCRRLDSPGCSN--KNKVCQYQVSYGDGSFTFGDFSTETLTFR----RNRVTRVALG 227
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQ-----LHSQKLIRNVVGHCL----SGGGGG 214
CG++ +G+ G + + + + + +CL +
Sbjct: 228 CGHDN---------EGLFTGAAGLLGLGRGRLSFPVQTGRRFNHKFSYCLVDRSASAKPS 278
Query: 215 FLFFGDDLYDSSRVVWTSMSSD---YTKYYSPGVAELFFGGETTGLK----------NLP 261
+ FGD S +T + + T YY + G GL N
Sbjct: 279 SVIFGDSAV-SRTAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAAGNGG 337
Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
V+ DSG+S T L R Y L + + A LK APE C+ + +VK
Sbjct: 338 VIIDSGTSVTRLTRPAYIALRDAFR--IGASHLKRAPEFSLFDTCFD----LSGLTEVK- 390
Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
T+ L F L YLI + N G+ C + L++IG I
Sbjct: 391 -VPTVVLHFRGADV----SLPATNYLIPVDNSGSFCFAFAG----TMSGLSIIGNI 437
>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
Length = 446
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 88/312 (28%), Positives = 121/312 (38%), Gaps = 49/312 (15%)
Query: 46 YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR--CVEAPHPLYRPSNDL--- 100
+ T Y IG P + +DTGSDL W QC C+R C P Y S
Sbjct: 85 WATLQYVAEYLIGDPPQRAEALIDTGSDLVWTQCST-CLRKVCARQALPYYNSSASSTFA 143
Query: 101 -VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
VPC ICA+ + H C+ A C Y G G L +AFAF Q
Sbjct: 144 PVPCAARICAA-NDDIIHFCDLAAGCSVIAGYG-AGVVAGTLGTEAFAF-----QSGTAE 196
Query: 160 LALGC-GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFF 218
LA GC + ++ + H G++GLG+G+ S+VSQ + K + + + G G LF
Sbjct: 197 LAFGCVTFTRIVQGALHGASGLIGLGRGRLSLVSQTGATKFSYCLTPYFHNNGATGHLFV 256
Query: 219 GDDLYDSSRVVWTSMSSDYTK-------YYSPGVAELFFGGETTGLKNLP---------- 261
G M++ + K YY P + G T G LP
Sbjct: 257 GASASLGGH--GDVMTTQFVKGPKGSPFYYLPLI------GLTVGETRLPIPATVFDLRE 308
Query: 262 ---------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRP 312
V+ DSGS +T L Y L S + L+ + P+ + LC R
Sbjct: 309 VAPGLFSGGVIIDSGSPFTSLVHDAYDALASELAARLNGSLVAPPPDADDGALCVARRDV 368
Query: 313 FKNVHDVKKCFR 324
+ V V FR
Sbjct: 369 GRVVPAVVFHFR 380
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 93/347 (26%), Positives = 134/347 (38%), Gaps = 57/347 (16%)
Query: 46 YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD-APCVRCVEAPHPLYRPSNDL---- 100
+P Y V + G P + L LDTGSD+TW QC P C PL+ PS
Sbjct: 83 FPFTEYLVHLAAGTPPQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFAS 142
Query: 101 VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN--- 157
+PC P C + G N C+Y + Y DG S G + ++ F F G+ +
Sbjct: 143 LPCSSPACETTPPCGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAV 202
Query: 158 PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLF 217
P L GCG+ G GI G G+G S+ SQL HC +
Sbjct: 203 PGLVFGCGHANR-GVFTSNETGIAGFGRGSLSLPSQLKVGNF-----SHCFT-------- 248
Query: 218 FGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFG--GETTG---LKNLPVVFDSGSSYTY 272
T + PGVA G G ++ P +SG+S T
Sbjct: 249 -----------TITGSKTSAVLLGLPGVAPPSASPLGRRRGSYRCRSTPRSSNSGTSITS 297
Query: 273 LNRVTYQTLTSIMKKELSAK-SLKEAPEDETLPL-CWKG--RRPFKNVHDVKKCFRTLAL 328
L TY+ + ++E +A+ L P + T P C+ R P +V T+AL
Sbjct: 298 LPPRTYRAV----REEFAAQVKLPVVPGNATDPFTCFSAPLRGPKPDVP-------TMAL 346
Query: 329 SFTDGKTRTLFELTPEAYLIISNKGN----VCLGILNGAEVGLQDLN 371
F R E + + GN +CL ++ G E+ L ++
Sbjct: 347 HFEGATMRLPQENYVFEVVDDDDAGNSSRIICLAVIEGGEIILGNIQ 393
>gi|52076082|dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa Japonica
Group]
Length = 476
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 72/256 (28%), Positives = 112/256 (43%), Gaps = 38/256 (14%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRPSNDL----VPCE 104
+G P + + LDTGSDL W+ CD C++C P +Y P+ VPC
Sbjct: 68 LGTPNVTFLVALDTGSDLFWVPCD--CLKCAPFQSPNYGSLKFDVYSPAQSTTSRKVPCS 125
Query: 105 DPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQR--LNPRL 160
+C +A C + C Y ++Y +D SS GVLV+D + Q + +
Sbjct: 126 SNLCDLQNA-----CRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPI 180
Query: 161 ALGCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLF 217
GCG QV S+ +G+LGLG S+ S L S+ L N C G G +
Sbjct: 181 MFGCG--QVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRIN 238
Query: 218 FGD----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYL 273
FGD D ++ V+ YY+ + + G ++ + + DSG+S+T L
Sbjct: 239 FGDTGSSDQKETPLNVYKQ-----NPYYNITITGITVGSKSISTE-FSAIVDSGTSFTAL 292
Query: 274 NRVTYQTLTSIMKKEL 289
+ Y +TS ++
Sbjct: 293 SDPMYTQITSSFDAQI 308
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 106/374 (28%), Positives = 158/374 (42%), Gaps = 68/374 (18%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP---- 96
V G +G Y + +G PA + LDTGSD+ W+QC APC RC E P++ P
Sbjct: 119 VSGLAQGSGEYFTKIGVGTPATQALMVLDTGSDVVWVQC-APCRRCYEQSGPVFDPRRSS 177
Query: 97 SNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
S V C +C L + G C+ C Y++ Y DG + G V + F G R
Sbjct: 178 SYGAVGCGAALCRRLDSGG---CDLRRGACMYQVAYGDGSVTAGDFVTETLTF--AGGAR 232
Query: 156 LNPRLALGCGY-NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
+ R+ALGCG+ N+ + L G+ G + +S+ + + +V SG G
Sbjct: 233 V-ARVALGCGHDNEGLFVAAAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAA 291
Query: 215 -------FLFFGDDLYDSSRVVWTSMSSD---YTKYY------------SPGVAELFFGG 252
+ FG +S +T M + T YY PGVAE
Sbjct: 292 PGSHRSSTVSFGAGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAE----- 346
Query: 253 ETTGLKNLP------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETL-PL 305
+ L+ P V+ DSG+S T L R +Y L + +A L+ +P +L
Sbjct: 347 --SDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAFRAA-AAGGLRLSPGGFSLFDT 403
Query: 306 CWK--GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNG 362
C+ GRR K T+++ F G L PE YLI + ++G C G
Sbjct: 404 CYDLGGRRVVK--------VPTVSMHFAGGAEAA---LPPENYLIPVDSRGTFCFA-FAG 451
Query: 363 AEVGLQDLNVIGGI 376
+ G +++IG I
Sbjct: 452 TDGG---VSIIGNI 462
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 70/222 (31%), Positives = 97/222 (43%), Gaps = 28/222 (12%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G Y V + G P + +DT SDL W+QC PCV C P++ P S +VPC
Sbjct: 90 GEYLVKLGTGTPQHFFSAAIDTASDLVWMQCQ-PCVSCYRQLDPVFNPKLSSSYAVVPCT 148
Query: 105 DPICASLHAPGHHNC--EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
CA L H C +D C Y +Y+ G + G L D A G + +
Sbjct: 149 SDTCAQLDG---HRCHEDDDGACQYTYKYSGHGVTKGTLAIDKLAI----GGDVFHAVVF 201
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFG 219
GC + V G + G++GLG+G S+VSQL + + +CL G L G
Sbjct: 202 GCSDSSVGGPAAQA-SGLVGLGRGPLSLVSQLSVHRFM-----YCLPPPMSRTSGKLVLG 255
Query: 220 ---DDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGETTG 256
D + + S V +MSS Y YY + L G +T G
Sbjct: 256 AGADAVRNMSDRVTVTMSSSTRYPSYYYLNLDGLAVGDQTPG 297
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 84/301 (27%), Positives = 121/301 (40%), Gaps = 31/301 (10%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDP 106
Y +T+ +G PA + +DTGSD++W+QC PC +C PL+ P + C
Sbjct: 52 YLITVGLGSPATSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCGSA 110
Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
CA L G + C +QC Y + Y DG S+ G D A G GC
Sbjct: 111 DCAQLGQEG-NGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVRSFQFGC-- 163
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFL-FFGDDLY 223
+ V DG++GLG G S+VSQ + + +CL + GFL
Sbjct: 164 SNVESGFNDQTDGLMGLGGGAQSLVSQ--TAGTLGRAFSYCLPPTPSSSGFLTLGAAGGS 221
Query: 224 DSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLK----NLPVVFDSGSSYTYLNRVT 277
+S V T M SS +Y + + GG + + V DSG+ T L
Sbjct: 222 GTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVMDSGTVITRLPPTA 281
Query: 278 YQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRT 337
Y L+S K + K A L C+ F V ++AL F+ G +
Sbjct: 282 YSALSSAFKAGM--KQYPPAQPSGILDTCFD----FSGQSSVS--IPSVALVFSGGAVVS 333
Query: 338 L 338
L
Sbjct: 334 L 334
>gi|224096686|ref|XP_002310698.1| predicted protein [Populus trichocarpa]
gi|222853601|gb|EEE91148.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 90/348 (25%), Positives = 145/348 (41%), Gaps = 51/348 (14%)
Query: 54 TMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV---------EAPHPLYRP----SNDL 100
T+ +G P + + LDTGSDL W+ CD C RC + +Y P ++
Sbjct: 100 TVELGTPGVKFMVALDTGSDLFWVPCD--CSRCAPTHGASYASDFELSIYNPRESSTSKK 157
Query: 101 VPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSL-GVLVKDAFAFNYTNGQR--L 156
V C + +CA + C + C Y + Y +S G+LVKD +G R +
Sbjct: 158 VTCNNDMCAQ-----RNRCLGTFSSCPYIVSYVSAQTSTSGILVKDVLHLTTEDGGREFV 212
Query: 157 NPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
+ GCG QV S+ + +G+ GLG K S+ S L + LI + C G
Sbjct: 213 EAYVTFGCG--QVQSGSFLDIAAPNGLFGLGMEKISVPSVLSREGLIADSFSMCFGHDGI 270
Query: 214 GFLFFGD----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSS 269
G + FGD D ++ V + + V + E T L FDSG+S
Sbjct: 271 GRISFGDKGSPDQEETPFNVNPAHPTYNVTVTQARVGTMLIDVEFTAL------FDSGTS 324
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL--CWKGRRPFKNVHDVKKCFRTLA 327
+TY+ Y + + +K S K P D +P C+ P N V +++
Sbjct: 325 FTYMVDPAY---SRVSEKFHSLARDKRRPPDPRIPFEYCYD-MSPDANASLVP----SMS 376
Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGG 375
L+ G+ T+++ P + N+ CL ++ E+ + N + G
Sbjct: 377 LTMKGGRHFTVYD--PIIVISTQNEIVYCLAVVKSTELNIIGQNFMTG 422
>gi|218202547|gb|EEC84974.1| hypothetical protein OsI_32231 [Oryza sativa Indica Group]
Length = 513
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 72/256 (28%), Positives = 112/256 (43%), Gaps = 38/256 (14%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRPSNDL----VPCE 104
+G P + + LDTGSDL W+ CD C++C P +Y P+ VPC
Sbjct: 105 LGTPNVTFLVALDTGSDLFWVPCD--CLKCAPLQSPNYGSLKFDVYSPAQSTTSRKVPCS 162
Query: 105 DPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQR--LNPRL 160
+C +A C + C Y ++Y +D SS GVLV+D + Q + +
Sbjct: 163 SNLCDLQNA-----CRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPI 217
Query: 161 ALGCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLF 217
GCG QV S+ +G+LGLG S+ S L S+ L N C G G +
Sbjct: 218 MFGCG--QVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRIN 275
Query: 218 FGD----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYL 273
FGD D ++ V+ YY+ + + G ++ + + DSG+S+T L
Sbjct: 276 FGDTGSSDQKETPLNVYKQ-----NPYYNITITGITVGSKSISTE-FSAIVDSGTSFTAL 329
Query: 274 NRVTYQTLTSIMKKEL 289
+ Y +TS ++
Sbjct: 330 SDPMYTQITSSFDAQI 345
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 106/388 (27%), Positives = 152/388 (39%), Gaps = 60/388 (15%)
Query: 16 RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
RM+ S + + L + + + + + P Y + + IG P +P L LDTGSDL
Sbjct: 56 RMALRSKARAPRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGSDLV 115
Query: 76 WLQCDAPCVRCVEAPHPLY---RPSNDLVPCEDPICASLHAPGHHNC--EDPAQCDYELE 130
W QC PC C P Y R S +P D L P C + C +
Sbjct: 116 WTQCQ-PCAVCFNQSLPYYDASRSSTFALPSCDSTQCKLD-PSVTMCVNQTVQTCAFSYS 173
Query: 131 YADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSI 190
Y D +++G L D ++ G + P + GCG N G GI G G+G S+
Sbjct: 174 YGDKSATIGFL--DVETVSFVAGASV-PGVVFGCGLNNT-GIFRSNETGIAGFGRGPLSL 229
Query: 191 VSQLHSQKLIRNVVGHCLSGGGG----GFLF-FGDDLYDSSRVVWTSMSSDYTKYYS-PG 244
SQL HC + G LF DLY + R T ++ K + P
Sbjct: 230 PSQLKVGNF-----SHCFTAVSGRKPSTVLFDLPADLYKNGR--GTVQTTPLIKNPAHPT 282
Query: 245 VAELFFGGETTGLKNLPV--------------VFDSGSSYTYLNRVTYQTLTSIMKKELS 290
L G T G LPV + DSG+++T L Y+ ++ E +
Sbjct: 283 FYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYR----LVHDEFA 338
Query: 291 AK-SLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLII 349
A L P +ET PL P V K L L F +G T L E Y+
Sbjct: 339 AHVKLPVVPSNETGPLLCFSAPPLGKAPHVPK----LVLHF-EGAT---MHLPRENYVFE 390
Query: 350 SNKG---NVCLGILNGAEVGLQDLNVIG 374
+ G ++CL I+ G ++ +IG
Sbjct: 391 AKDGGNCSICLAIIEG------EMTIIG 412
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 140/294 (47%), Gaps = 46/294 (15%)
Query: 22 SSSSSSSLFNHVGSSLLFQVHGNVYPTG-YYNVTMYIGQPARPYFLDLDTGSDLTWLQCD 80
S + SS +F + ++ + ++ PTG Y VT+ +G P + + L DTGSDLTW QC+
Sbjct: 114 SMNPSSGVFKEMQTT----IPASIVPTGGAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCE 169
Query: 81 APCV-RCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCED--PAQCDYELEYAD 133
PC+ C P + P+ V C C L A G++ +D C Y ++Y
Sbjct: 170 -PCLGGCFPQNQPKFDPTTSTSYKNVSCSSEFC-KLIAEGNYPAQDCISNTCLYGIQYGS 227
Query: 134 GGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQ 193
G ++G L + A ++ + GC ++ +++ G+LGLG+ ++ SQ
Sbjct: 228 -GYTIGFLATETLAIASSDVFK---NFLFGC--SEESRGTFNGTTGLLGLGRSPIALPSQ 281
Query: 194 LHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFG 251
++ +N+ +CL S G L FG ++ +++ + SP + +L +G
Sbjct: 282 TTNK--YKNLFSYCLPASPSSTGHLSFGVEVSQAAK----------STPISPKLKQL-YG 328
Query: 252 GETTGL----KNLPV-------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSL 294
T G+ + LP+ + DSG+++T+L TY L S ++ ++ +L
Sbjct: 329 LNTVGISVRGRELPINGSISRTIIDSGTTFTFLPSPTYSALGSAFREMMANYTL 382
>gi|42567433|ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|190576481|gb|ACE79041.1| At4g35880 [Arabidopsis thaliana]
gi|222423134|dbj|BAH19546.1| AT4G35880 [Arabidopsis thaliana]
gi|332661184|gb|AEE86584.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 524
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 88/349 (25%), Positives = 151/349 (43%), Gaps = 60/349 (17%)
Query: 54 TMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV---------EAPHPLYRP----SNDL 100
T+ +G P + + LDTGSDL W+ CD C +C E +Y P +N
Sbjct: 110 TVKLGTPGMRFMVALDTGSDLFWVPCD--CGKCAPTEGATYASEFELSIYNPKVSTTNKK 167
Query: 101 VPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSL-GVLVKDAFAFNY--TNGQRL 156
V C + +CA + C + C Y + Y +S G+L++D N +R+
Sbjct: 168 VTCNNSLCAQ-----RNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERV 222
Query: 157 NPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
+ GCG QV S+ + +G+ GLG K S+ S L + L+ + C G
Sbjct: 223 EAYVTFGCG--QVQSGSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGV 280
Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN-LPVVFDSGSSYTY 272
G + FGD +++ + Y+ V + G TT + + +FD+G+S+TY
Sbjct: 281 GRISFGDKGSSDQEETPFNLNPSHPN-YNITVTRVRVG--TTLIDDEFTALFDTGTSFTY 337
Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK-----CFRTLA 327
L Y T++ + A+ + +P+ R PF+ +D+ +L+
Sbjct: 338 LVDPMYTTVSESFHSQ--AQDKRHSPD---------SRIPFEYCYDMSNDANASLIPSLS 386
Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNV--CLGILNGAEVGLQDLNVIG 374
L+ T+ + ++IS +G + CL I+ +E LN+IG
Sbjct: 387 LTMKGNSHFTI----NDPIIVISTEGELVYCLAIVKSSE-----LNIIG 426
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 73/252 (28%), Positives = 105/252 (41%), Gaps = 23/252 (9%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDP 106
Y +T+ +G PA + +DTGSD++W+QC PC +C PL+ P + C
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCGSA 186
Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
CA L G + C +QC Y + Y DG S+ G D A G GC
Sbjct: 187 DCAQLGQEG-NGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVRSFQFGC-- 239
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFL-FFGDDLY 223
+ V DG++GLG G S+VSQ + + +CL + GFL
Sbjct: 240 SNVESGFNDQTDGLMGLGGGAQSLVSQ--TAGTLGRAFSYCLPPTPSSSGFLTLGAAGGS 297
Query: 224 DSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLK----NLPVVFDSGSSYTYLNRVT 277
+S V T M SS +Y + + GG + + V DSG+ T L
Sbjct: 298 GTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVMDSGTVITRLPPTA 357
Query: 278 YQTLTSIMKKEL 289
Y L+S K +
Sbjct: 358 YSALSSAFKAGM 369
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 73/252 (28%), Positives = 105/252 (41%), Gaps = 23/252 (9%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDP 106
Y +T+ +G PA + +DTGSD++W+QC PC +C PL+ P + C
Sbjct: 198 YLITVGLGSPATSQTMLIDTGSDVSWVQCK-PCSQCHSQADPLFDPSSSSTYSPFSCGSA 256
Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
CA L G + C +QC Y + Y DG S+ G D A G GC
Sbjct: 257 DCAQLGQEG-NGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL----GSSAVRSFQFGC-- 309
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFL-FFGDDLY 223
+ V DG++GLG G S+VSQ + + +CL + GFL
Sbjct: 310 SNVESGFNDQTDGLMGLGGGAQSLVSQ--TAGTLGRAFSYCLPPTPSSSGFLTLGAAGGS 367
Query: 224 DSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLK----NLPVVFDSGSSYTYLNRVT 277
+S V T M SS +Y + + GG + + V DSG+ T L
Sbjct: 368 GTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVMDSGTVITRLPPTA 427
Query: 278 YQTLTSIMKKEL 289
Y L+S K +
Sbjct: 428 YSALSSAFKAGM 439
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 78/286 (27%), Positives = 123/286 (43%), Gaps = 47/286 (16%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDP 106
Y V + +G PA L +DTGSD++W+QC PC CV A P + P + +PC
Sbjct: 139 YYVPLQVGTPAVEVVLIMDTGSDVSWIQC-VPCKDCVPALRPPFNPRHSSSFFKLPCASS 197
Query: 107 ICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP----RLA 161
C +++ C + C + ++Y DG S G+L + A N N P +
Sbjct: 198 TCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSNIT 257
Query: 162 LGCG---YNQVP-GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-----GG 212
LGC +P GAS G+LG+ + S SQL S+ + HC
Sbjct: 258 LGCADIDREGLPTGAS-----GLLGMDRRPISFPSQLSSRYARK--FSHCFPDKIAHLNS 310
Query: 213 GGFLFFGDDLYDSSRVVWT------SMSSDYTKYYSPGVAELFFGGETTGL--KNLPV-- 262
G +FFG+ S + +T ++ S YY G+ + L KN +
Sbjct: 311 SGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDK 370
Query: 263 -------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDE 301
+ DSG+++TYL + +Q M++E A++ A D+
Sbjct: 371 VTGSGGTIIDSGTAFTYLKKPAFQA----MRREFLARTSHLAKVDD 412
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 54/164 (32%), Positives = 80/164 (48%), Gaps = 12/164 (7%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDP 106
Y + + IG P + DTGSDL WLQC PC C + +P++ + + C
Sbjct: 59 YLMELSIGTPPVKIYAQADTGSDLIWLQC-IPCTNCYKQLNPMFDSQSSSTFSNIACGSE 117
Query: 107 ICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALGC 164
C+ L++ +C D C Y Y DG + GVL ++ T G+ + + + GC
Sbjct: 118 SCSKLYS---TSCSPDQINCKYNYSYVDGSETQGVLAQETLTLTSTTGEPVAFKGVIFGC 174
Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
G+N GA GI+GLG+G S+VSQ+ S L N+ CL
Sbjct: 175 GHNN-NGAFNDKEMGIIGLGRGPLSLVSQIGS-SLGGNMFSQCL 216
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 83/281 (29%), Positives = 126/281 (44%), Gaps = 38/281 (13%)
Query: 21 SSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD 80
SS S++S GS + V G +G Y V + +G P R ++ +D+GSD+ W+QC
Sbjct: 16 SSGSTASYGVEDFGSEV---VSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCK 72
Query: 81 APCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS 136
PC +C PL+ P++ V C +C + G ++ +C YE+ Y DG S
Sbjct: 73 -PCTQCYHQTDPLFDPADSASFMGVSCSSAVCDQVDNAGCNS----GRCRYEVSYGDGSS 127
Query: 137 SLGVLVKDAFAFNYTNGQRLNPRLALGCGY-NQVPGASYHPLDGILGLGKGKSSIVSQLH 195
+ G L + T G+ + +A+GCG+ NQ + G+LGLG G S V QL
Sbjct: 128 TKGTLALETL----TLGRTVVQNVAIGCGHMNQ---GMFVGAAGLLGLGGGSMSFVGQLS 180
Query: 196 SQKLIRNVVGHCLSG---GGGGFLFFGDDLYDSSRVVWTSMSSD--YTKYYSPGVAELFF 250
++ N +CL GFL FG + W + + YY G++ L
Sbjct: 181 RER--GNAFSYCLVSRVTNSNGFLEFGSEAMPVG-AAWIPLIRNPHSPSYYYIGLSGLGV 237
Query: 251 GG----------ETTGLKNLPVVFDSGSSYTYLNRVTYQTL 281
G E T L N VV D+G++ T V Y+
Sbjct: 238 GDMKVPISEDIFELTELGNGGVVMDTGTAVTRFPTVAYEAF 278
>gi|297802338|ref|XP_002869053.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314889|gb|EFH45312.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 522
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 88/350 (25%), Positives = 151/350 (43%), Gaps = 60/350 (17%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV---------EAPHPLYRP----SND 99
T+ +G P + + LDTGSDL W+ CD C +C E +Y P +N
Sbjct: 107 TTVKLGTPGMRFMVALDTGSDLFWVPCD--CGKCAPTEGATYASEFELSIYNPKISTTNK 164
Query: 100 LVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSL-GVLVKDAFAFNY--TNGQR 155
V C + +CA + C + C Y + Y +S G+L++D N +R
Sbjct: 165 KVTCNNSLCAQ-----RNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPER 219
Query: 156 LNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
+ + GCG QV S+ + +G+ GLG K S+ S L + L+ + C G
Sbjct: 220 VEAYVTFGCG--QVQSGSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDG 277
Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN-LPVVFDSGSSYT 271
G + FGD +++ + Y+ V + G TT + + +FD+G+S+T
Sbjct: 278 VGRISFGDKGSSDQEETPFNLNPSHPN-YNITVTRVRVG--TTLIDDEFTALFDTGTSFT 334
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK-----CFRTL 326
YL Y T++ + A+ + +P+ R PF+ +D+ +L
Sbjct: 335 YLVDPMYTTVSESFHSQ--AQDKRHSPD---------SRIPFEYCYDMSNDANASLIPSL 383
Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNV--CLGILNGAEVGLQDLNVIG 374
+L+ T+ + ++IS +G + CL I+ +E LN+IG
Sbjct: 384 SLTMKGNSHFTI----NDPIIVISTEGELVYCLAIVKSSE-----LNIIG 424
>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
Length = 453
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 71/227 (31%), Positives = 97/227 (42%), Gaps = 26/227 (11%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G Y V + IG P + +DT SDL WLQC PCV C P++ P S +VPC
Sbjct: 86 GEYLVKLGIGTPQHYFSAAIDTASDLVWLQCQ-PCVSCYRQLDPIFNPRLSSSYAVVPCS 144
Query: 105 DPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
C+ L GH ED Q C Y +Y+ + G L D A G + + LG
Sbjct: 145 SDTCSQLD--GHRCDEDDDQACRYNYKYSGNAVTNGTLAIDKLAV----GGNVFHAVVLG 198
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL-------IRNVVGHCLSGGGGGFL 216
C + V G G++GL +G S++SQL ++ + G + G G G
Sbjct: 199 CSDSSVGGPPPQ-ASGLVGLARGPLSLLSQLSVRRFMYCLPPPMSRTPGKLVLGAGAG-- 255
Query: 217 FFGDDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGETTGLKNLP 261
D + + S V +MSS Y YY L G +T G P
Sbjct: 256 --ADAVRNVSDRVTVTMSSSTRYPSYYYLNFDGLAVGDQTPGTIRRP 300
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 103/368 (27%), Positives = 161/368 (43%), Gaps = 74/368 (20%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
+G Y + +G PA P + LDTGSD+ WLQC APC RC E ++ P S + V C
Sbjct: 137 SGEYFTKIGVGTPATPALMVLDTGSDVVWLQC-APCRRCYEQSGQVFDPRRSRSYNAVGC 195
Query: 104 EDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
P+C L + G C+ + C Y++ Y DG + G + F G R+ R+AL
Sbjct: 196 AAPLCRRLDSGG---CDLRRSACLYQVAYGDGSVTAGDFATETLTF--AGGARV-ARVAL 249
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--------SGGGGG 214
GCG++ + G+LGLG+G S +Q+ S++ R+ +CL +
Sbjct: 250 GCGHDNE--GLFVAAAGLLGLGRGSLSFPTQI-SRRYGRS-FSYCLVDRTSSANTASRSS 305
Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFF----------GGETTGLKNLP--- 261
+ FG S V ++++S +T E F+ G G+ N
Sbjct: 306 TVTFG------SGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRL 359
Query: 262 --------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPF 313
V+ DSG+S T L R Y L + +A L+ +P +L F
Sbjct: 360 DPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRG--AAAGLRLSPGGFSL---------F 408
Query: 314 KNVHDV--KKCFR--TLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQ 368
+D+ +K + T+++ F G L PE YLI + +KG C G + G
Sbjct: 409 DTCYDLSGRKVVKVPTVSMHFAGGAEAA---LPPENYLIPVDSKGTFCFA-FAGTDGG-- 462
Query: 369 DLNVIGGI 376
+++IG I
Sbjct: 463 -VSIIGNI 469
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 97/354 (27%), Positives = 145/354 (40%), Gaps = 52/354 (14%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y + +++G P + + L LDTGSDL W+QC PC+ C E P Y P + + C
Sbjct: 192 SGEYFMDVFVGTPPKHFSLILDTGSDLNWIQC-VPCIACFEQSGPYYDPKDSSSFRNISC 250
Query: 104 EDPICASLHAPGHHN-CEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYT--NGQ---RL 156
DP C + +P N C+ Q C Y Y DG ++ G + F N T NG+ +
Sbjct: 251 HDPRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKH 310
Query: 157 NPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SG 210
+ GCG +N+ +H G+LGLGKG S SQ+ Q L +CL +
Sbjct: 311 VENVMFGCGHWNR---GLFHGAAGLLGLGKGPLSFASQM--QSLYGQSFSYCLVDRNSNA 365
Query: 211 GGGGFLFFGDD--LYDSSRVVWTSM----SSDYTKYYSPGVAELFFGGETTGLKNLP--- 261
L FG+D L + +TS +Y + + E +
Sbjct: 366 SVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEETWHL 425
Query: 262 -------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFK 314
+ DSG++ TY Y+ + KE + +K E LP +P
Sbjct: 426 SSEGAGGTIIDSGTTLTYFAEPAYEII-----KEAFVRKIKGYELVEGLPPL----KPCY 476
Query: 315 NVHDVKKC-FRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGL 367
NV ++K + F DG ++ E Y I + VCL IL L
Sbjct: 477 NVSGIEKMELPDFGILFADG---AVWNFPVENYFIQIDPDVVCLAILGNPRSAL 527
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 95/352 (26%), Positives = 138/352 (39%), Gaps = 55/352 (15%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV--RCVEAPHPLYRPSNDLV----PCE 104
Y +T+ +G PA + +DTGSD++W+QC APC C L+ P+ C
Sbjct: 130 YVITVSLGTPAVTQVMSIDTGSDVSWVQC-APCAAQSCSSQKDKLFDPAKSATYSAFSCS 188
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
CA L G + C + + C Y ++Y D ++ G D ++ + GC
Sbjct: 189 SAQCAQLGGEG-NGCLN-SHCQYIVKYVDHSNTTGTYGSDTLGLTTSDAVK---NFQFGC 243
Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFGDD 221
+ LDG++GLG S+VSQ + +CL S GGFL G
Sbjct: 244 SHRA--NGFVGQLDGLMGLGGDTESLVSQ--TAATYGKAFSYCLPPSSSSAGGFLTLGAA 299
Query: 222 L--YDSSRVVWTSMSSDYTKYYSPGVAELFFGGET-TGLK-NLPV-------VFDSGSSY 270
SSR T + ++ P +F T G K N+P V DSG+
Sbjct: 300 AGGTSSSRYSRTPL----VRFNVPTFYGVFLQAITVAGTKLNVPASVFSGASVVDSGTVI 355
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW------KGRRPFKNVH------- 317
T L YQ L + KKE+ K+ A L C+ R P +
Sbjct: 356 TQLPPTAYQALRTAFKKEM--KAYPSAAPVGILDTCFDFSGIKTVRVPVVTLTFSRGAVM 413
Query: 318 --DVKKCFRTLALSFT----DGKTRTLFELTPEAYLIISNKGNVCLGILNGA 363
DV F L+FT DG T L + + ++ + G LG GA
Sbjct: 414 DLDVSGIFYAGCLAFTATAQDGDTGILGNVQQRTFEMLFDVGGSTLGFRPGA 465
>gi|115480451|ref|NP_001063819.1| Os09g0542100 [Oryza sativa Japonica Group]
gi|113632052|dbj|BAF25733.1| Os09g0542100, partial [Oryza sativa Japonica Group]
Length = 490
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 72/256 (28%), Positives = 112/256 (43%), Gaps = 38/256 (14%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRPSNDL----VPCE 104
+G P + + LDTGSDL W+ CD C++C P +Y P+ VPC
Sbjct: 82 LGTPNVTFLVALDTGSDLFWVPCD--CLKCAPFQSPNYGSLKFDVYSPAQSTTSRKVPCS 139
Query: 105 DPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQR--LNPRL 160
+C +A C + C Y ++Y +D SS GVLV+D + Q + +
Sbjct: 140 SNLCDLQNA-----CRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPI 194
Query: 161 ALGCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLF 217
GCG QV S+ +G+LGLG S+ S L S+ L N C G G +
Sbjct: 195 MFGCG--QVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRIN 252
Query: 218 FGD----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYL 273
FGD D ++ V+ YY+ + + G ++ + + DSG+S+T L
Sbjct: 253 FGDTGSSDQKETPLNVYKQ-----NPYYNITITGITVGSKSISTE-FSAIVDSGTSFTAL 306
Query: 274 NRVTYQTLTSIMKKEL 289
+ Y +TS ++
Sbjct: 307 SDPMYTQITSSFDAQI 322
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 94/352 (26%), Positives = 133/352 (37%), Gaps = 63/352 (17%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSND----LV 101
T Y IG P + +DTGS+L W QC C C + P Y S V
Sbjct: 81 TRQYIAEYLIGDPPQRAAALIDTGSNLIWTQCGTTCGLKACAKQDLPYYNLSRSSTFAAV 140
Query: 102 PCED--PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
PC D +CA A G H C C + Y GS G L +AF F Q +
Sbjct: 141 PCADSAKLCA---ANGVHLCGLDGSCTFAASYG-AGSVFGSLGTEAFTF-----QSGAAK 191
Query: 160 LALGC-GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFF 218
L GC ++ + + G++GLG+G+ S+VSQ + K + + + G LF
Sbjct: 192 LGFGCVSLTRITKGALNGASGLIGLGRGRLSLVSQTGATKFSYCLTPYLRNHGASSHLFV 251
Query: 219 GDDLYDS------SRVVWTSMSSDY---TKYYSPGVAELFFGGETTGLKNLP-------- 261
G S + + + DY T YY P V G + G LP
Sbjct: 252 GASASLSGGGGAVTSIPFVKSPEDYPYSTFYYLPLV------GISVGETKLPIPSAAFEL 305
Query: 262 -----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
V+ D+GS T L Y L+ + ++L+ +SL + P D L LC
Sbjct: 306 RRVAAGYWSGGVIIDTGSPVTSLAEAAYSALSDEVARQLN-RSLVQPPADTGLDLCVA-- 362
Query: 311 RPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNG 362
DV K L F G ++ +Y +K C+ I G
Sbjct: 363 -----RQDVDKVVPVLVFHFGGGAD---MAVSAGSYWGPVDKSTACMLIEEG 406
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 87/303 (28%), Positives = 123/303 (40%), Gaps = 54/303 (17%)
Query: 16 RMSSSSSSSSSSSLFNHVGSSLLFQ-VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDL 74
RM++ S + S+ L S+ + + + P Y V M IG P +P L LDTGSDL
Sbjct: 75 RMAARSKARSARLLSGRAASARMDPGSYTDGVPDTEYLVHMAIGTPPQPVQLILDTGSDL 134
Query: 75 TWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQ----CD 126
TW QC APCV C P + PS + +PC+ IC L +C + + C
Sbjct: 135 TWTQC-APCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLT---WSSCGEQSWGNGICV 190
Query: 127 YELEYADGGSSLGVLVKDAFAF---NYTNGQRLNPRLALGCG-YNQVPGASYHPLDGILG 182
Y YAD + G L D F+F ++ G P L GCG +N G GI G
Sbjct: 191 YAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNN--GIFVSNETGIAG 248
Query: 183 LGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG-----FLFFGDDLYDSSR-----VVWTS 232
+G S+ +QL +C + G FL +LY + VV
Sbjct: 249 FSRGALSMPAQLKVDNF-----SYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVV--- 300
Query: 233 MSSDYTKYYSPGVAELFFG--GETTGLKNLPV---------------VFDSGSSYTYLNR 275
S+ +Y+S + + G T G LP+ + DSG+ T L
Sbjct: 301 QSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPE 360
Query: 276 VTY 278
Y
Sbjct: 361 AVY 363
>gi|168025647|ref|XP_001765345.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683398|gb|EDQ69808.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 879
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 84/292 (28%), Positives = 134/292 (45%), Gaps = 37/292 (12%)
Query: 22 SSSSSSSLFNHVGSSLLFQVHGNV-YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC- 79
S+ SS FN + + +F + V + ++V M +G P + + +DTGS TW+ C
Sbjct: 197 STRGSSLPFNFLYYTCVFGIGPRVLMESEEFHVEMKLGVPPKKFHFHMDTGSRDTWVYCQ 256
Query: 80 -----DAPCVRCVEAPHPLYRPSND--LVPC---EDPICASLHAPGHH-NCEDPAQCDYE 128
D P + P+ + P ++ + C +C+ H N D C +
Sbjct: 257 VSRNLDEPPIEL--GPNGKFEPRDESSYIQCIGHTASLCSEYQYEPHLCNSVDKYHCVND 314
Query: 129 LEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPL---DGILGLGK 185
L YAD + GVLV ++ + + ++ C + AS HP DGI+GLG
Sbjct: 315 LNYADDSTYSGVLVNESLMVSTIDNSDMDAMGLFWC----INEAS-HPFTGTDGIIGLGN 369
Query: 186 GKSSIVSQLHSQKLI-RNVVGHCLSGGGG--GFLFFGDDL---YDSSRVVW---TSMSSD 236
K ++ Q + K+I +NV+G CL+ G G G++ G + ++ S VW T MSS
Sbjct: 370 CKKTLGDQWTTNKVISQNVLGVCLAKGPGPVGYISLGVNFKKKFEESTSVWSKLTPMSSA 429
Query: 237 YTKYYSPGVAELFFGGET---TGLKNLPVVFDSGSSYTYLNRVTYQTLTSIM 285
YS +A + F +T T NL FD+GS YL V Y+ L ++
Sbjct: 430 GECAYSSPLASISFHDKTFVFTSETNLG--FDTGSDMMYLEAVIYEPLLDML 479
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 87/303 (28%), Positives = 123/303 (40%), Gaps = 54/303 (17%)
Query: 16 RMSSSSSSSSSSSLFNHVGSSLLFQ-VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDL 74
RM++ S + S+ L S+ + + + P Y V M IG P +P L LDTGSDL
Sbjct: 49 RMAARSKARSARLLSGRAASARMDPGSYTDGVPDTEYLVHMAIGTPPQPVQLILDTGSDL 108
Query: 75 TWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQ----CD 126
TW QC APCV C P + PS + +PC+ IC L +C + + C
Sbjct: 109 TWTQC-APCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLT---WSSCGEQSWGNGICV 164
Query: 127 YELEYADGGSSLGVLVKDAFAF---NYTNGQRLNPRLALGCG-YNQVPGASYHPLDGILG 182
Y YAD + G L D F+F ++ G P L GCG +N G GI G
Sbjct: 165 YAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNN--GIFVSNETGIAG 222
Query: 183 LGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG-----FLFFGDDLYDSSR-----VVWTS 232
+G S+ +QL +C + G FL +LY + VV
Sbjct: 223 FSRGALSMPAQLKVDNF-----SYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVV--- 274
Query: 233 MSSDYTKYYSPGVAELFFG--GETTGLKNLPV---------------VFDSGSSYTYLNR 275
S+ +Y+S + + G T G LP+ + DSG+ T L
Sbjct: 275 QSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPE 334
Query: 276 VTY 278
Y
Sbjct: 335 AVY 337
>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 627
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 100/372 (26%), Positives = 154/372 (41%), Gaps = 59/372 (15%)
Query: 37 LLFQVHGNVYPTG-----YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH 91
L F G + PTG Y + +G P + + LDTGSDL W+ CD C+ C AP
Sbjct: 189 LSFSKDGGIIPTGNDFGWLYYTWVDVGTPNTSFMVALDTGSDLFWIPCD--CIEC--APL 244
Query: 92 PLYRPSND-----LVPCEDPICASLHAPGHH-------NCEDPAQ-CDYELEY-ADGGSS 137
Y S D P E S H P H +C + Q C Y +Y + +S
Sbjct: 245 SGYHGSLDRDLGIYKPAES--TTSRHLPCSHELCLLGSDCTNQKQPCPYNTKYLQENTTS 302
Query: 138 LGVLVKDAFAFNYTNGQR-LNPRLALGCGYNQVPGASYH---PLDGILGLGKGKSSIVSQ 193
G+LV+D + + + +GCG Q SY DG+LGLG S+ S
Sbjct: 303 SGLLVEDILHLDSRESHAPVKASVIIGCGRKQ--SGSYLDGIAPDGLLGLGMADISVPSF 360
Query: 194 LHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYT------KYYSPGVAE 247
L L+RN C + G +FFGD + V T S+ + + Y+ V +
Sbjct: 361 LARAGLVRNSFSMCFTKDSGR-IFFGD------QGVSTQQSTPFVPLYGKLQTYTVNVDK 413
Query: 248 LFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
G + + + DSG+S+T L Y+ + K+++A L + E + C+
Sbjct: 414 SCVGHKCFESTSFQAIVDSGTSFTALPLDIYKAVAIEFDKQVNASRLPQ--EATSFDYCY 471
Query: 308 KGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNV---CLGILNGAE 364
P V T+ L+F K+ F+ +L+ +G V CL ++ E
Sbjct: 472 SA-SPL-----VMPDVPTVTLTFAGNKS---FQPVNPTFLLHDEEGAVAGFCLAVVQSPE 522
Query: 365 -VGLQDLNVIGG 375
+G+ N + G
Sbjct: 523 PIGIIAQNFLLG 534
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 84/327 (25%), Positives = 139/327 (42%), Gaps = 38/327 (11%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDP 106
Y VTM +G +D TGSDLTW+QC+ PC+ C P+++P S V C
Sbjct: 65 YIVTMGLGSTNMTVIID--TGSDLTWVQCE-PCMSCYNQQGPIFKPSTSSSYQSVSCNSS 121
Query: 107 ICASLH-APGHHNC--EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
C SL A G+ +P+ C+Y + Y DG + G L + +F G G
Sbjct: 122 TCQSLQFATGNTGACGSNPSTCNYVVNYGDGSYTNGELGVEQLSF----GGVSVSDFVFG 177
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFGD 220
CG N + + G++GLG+ S+VSQ ++ V +CL G G L G+
Sbjct: 178 CGRNN--KGLFGGVSGLMGLGRSYLSLVSQTNAT--FGGVFSYCLPTTESGASGSLVMGN 233
Query: 221 D---LYDSSRVVWTSM--SSDYTKYYSPGVAELFFGG---ETTGLKNLPVVFDSGSSYTY 272
+ + + + +T M + + +Y + + G + N V+ DSG+ T
Sbjct: 234 ESSVFKNVTPITYTRMLPNPQLSNFYILNLTGIDVDGVALQVPSFGNGGVLIDSGTVITR 293
Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
L Y+ L ++ K+ + AP L C+ +V T+++ F +
Sbjct: 294 LPSSVYKALKALFLKQFTG--FPSAPGFSILDTCFN----LTGYDEVS--IPTISMHF-E 344
Query: 333 GKTRTLFELTPEAYLIISNKGNVCLGI 359
G + T Y++ + VCL +
Sbjct: 345 GNAELKVDATGTFYVVKEDASQVCLAL 371
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 78/286 (27%), Positives = 123/286 (43%), Gaps = 47/286 (16%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDP 106
Y V + +G PA L +DTGSD++W+QC PC CV A P + P + +PC
Sbjct: 138 YYVPLQLGTPAVEVVLIMDTGSDVSWIQC-VPCKDCVPALRPPFNPRHSSSFFKLPCASS 196
Query: 107 ICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP----RLA 161
C +++ C + C + ++Y DG S G+L + A N N P +
Sbjct: 197 TCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSNIT 256
Query: 162 LGCG---YNQVP-GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-----GG 212
LGC +P GAS G+LG+ + S SQL S+ + HC
Sbjct: 257 LGCADIDREGLPTGAS-----GLLGMDRRPISFPSQLSSRYARK--FSHCFPDKIAHLNS 309
Query: 213 GGFLFFGDDLYDSSRVVWT------SMSSDYTKYYSPGVAELFFGGETTGL--KNLPV-- 262
G +FFG+ S + +T ++ S YY G+ + L KN +
Sbjct: 310 SGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDK 369
Query: 263 -------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDE 301
+ DSG+++TYL + +Q M++E A++ A D+
Sbjct: 370 VTGSGGTIIDSGTAFTYLKKPAFQA----MRREFLARTSHLAKVDD 411
>gi|224083757|ref|XP_002307112.1| predicted protein [Populus trichocarpa]
gi|222856561|gb|EEE94108.1| predicted protein [Populus trichocarpa]
Length = 492
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 77/271 (28%), Positives = 121/271 (44%), Gaps = 26/271 (9%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRC--VEAPH--PLYRPSNDLVPCEDPICASLH 112
IG P + + LD+GSDL W+ CD CV+C + A H L R ++ P + L
Sbjct: 104 IGTPHVSFMVALDSGSDLFWVPCD--CVQCAPLSASHYSSLDRDLSEYSPSQSSTSKQLS 161
Query: 113 APGHH------NCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQRLNPRLA--- 161
H NC++P Q C Y + Y + SS G+LV+D LN +
Sbjct: 162 C-SHRLCDMGPNCKNPKQSCPYSINYYTESTSSSGLLVEDIIHLASGGDDTLNTSVKAPV 220
Query: 162 -LGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFF 218
+GCG Q G P DG+LGLG + S+ S L LI+N C + G +FF
Sbjct: 221 IIGCGMKQSGGYLDGVAP-DGLLGLGLQEISVPSFLAKAGLIQNSFSMCFNEDDSGRIFF 279
Query: 219 GDDLYDSSRVV-WTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVT 277
GD + + + ++ +YT Y GV G + + DSG+S+T+L
Sbjct: 280 GDQGPATQQSAPFLKLNGNYTTYIV-GVEVCCVGTSCLKQSSFSALVDSGTSFTFLPDDV 338
Query: 278 YQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
++ + +++A + + E + C+K
Sbjct: 339 FEMIAEEFDTQVNAS--RSSFEGYSWKYCYK 367
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 106/387 (27%), Positives = 147/387 (37%), Gaps = 68/387 (17%)
Query: 16 RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
RM+ S + ++ L + + + + N PT Y V + IG P +P L LDTGSDL
Sbjct: 47 RMALRSKARAARRLSSSASAPVSPGTYDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLI 106
Query: 76 WLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPA-----QCD 126
W QC PC C + P + P + L C+ +C L +C P C
Sbjct: 107 WTQCQ-PCPACFDQALPYFDPSTSSTLSLTSCDSTLCQGLPV---ASCGSPKFWPNQTCV 162
Query: 127 YELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG-YNQVPGASYHPLDGILGLGK 185
Y Y D + G L D F F P +A GCG +N G GI G G+
Sbjct: 163 YTYSYGDKSVTTGFLEVDKFTFVGAGASV--PGVAFGCGLFNN--GVFKSNETGIAGFGR 218
Query: 186 GKSSIVSQLHSQKLIRNVVGHCLSGGGG-----GFLFFGDDLYDSSRVVWTSM-----SS 235
G S+ SQL HC + G L DLY S R S +
Sbjct: 219 GPLSLPSQLKVGNF-----SHCFTAVNGLKPSTVLLDLPADLYKSGRGAVQSTPLIQNPA 273
Query: 236 DYTKYYSPGVAELFFGGETTGLKNLPV--------------VFDSGSSYTYLNRVTYQTL 281
+ T YY L G T G LPV + DSG++ T L Y+ +
Sbjct: 274 NPTFYY------LSLKGITVGSTRLPVPESEFTLKNGTGGTIIDSGTAMTSLPTRVYRLV 327
Query: 282 TSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFEL 341
++ + D L P + V K L L F +G T +L
Sbjct: 328 RDAFAAQVKLPVVSGNTTDPYFCL----SAPLRAKPYVPK----LVLHF-EGAT---MDL 375
Query: 342 TPEAYLI-ISNKGN--VCLGILNGAEV 365
E Y+ + + G+ +CL I+ G EV
Sbjct: 376 PRENYVFEVEDAGSSILCLAIIEGGEV 402
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 90/212 (42%), Gaps = 33/212 (15%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G Y V + +G P + +DT SDL W QC PCV+C + P++ P S +VPC
Sbjct: 86 GEYLVKLGLGTPQHCFTAAIDTASDLIWTQCQ-PCVKCYKQLDPVFNPVASTSYAVVPCN 144
Query: 105 DPICASLHAPGHHNC------EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
C L H C +D C Y Y ++ G+L D A G +
Sbjct: 145 SDTCDELDT---HRCARDGDSDDEDACQYTYSYGGNATTRGILAVDRLAI----GDDVFR 197
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGF 215
+ GC + V G + G++GLG+G S+VSQL ++ + +CL G
Sbjct: 198 GVVFGCSSSSVGGPPPQ-VSGVVGLGRGALSLVSQLSVRRFM-----YCLPPPVSRSAGR 251
Query: 216 LFFGDDLYDSSR------VVWTSMSSDYTKYY 241
L G D + R VV S S Y YY
Sbjct: 252 LVLGADAAATVRNASERVVVPMSTGSRYPSYY 283
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 101/388 (26%), Positives = 156/388 (40%), Gaps = 57/388 (14%)
Query: 4 SHNGENLCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQ--VHGNVYPTGYYNVTMYIGQPA 61
+ + E + F R+++ S+S+S++ G SL+ G +G Y V + +G PA
Sbjct: 58 TKDEERVRFLHSRLTNKESASNSATTDKLGGPSLVSTPLKSGLSIGSGNYYVKIGVGTPA 117
Query: 62 RPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS---------NDLVPCEDPICASLH 112
+ + + +DTGS L+WLQC + C P++ PS C ++L+
Sbjct: 118 KYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSVSKTYKALSCSSSQCSSLKSSTLN 177
Query: 113 APGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGA 172
APG N C Y+ Y D S+G L +D T + GCG +
Sbjct: 178 APGCSNAT--GACVYKASYGDTSFSIGYLSQDVLTL--TPSAAPSSGFVYGCGQDN--QG 231
Query: 173 SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--------SGGGGGFLFFGDDLYD 224
+ GI+GL K S++ QL ++ N +CL + GFL G
Sbjct: 232 LFGRSAGIIGLANDKLSMLGQLSNK--YGNAFSYCLPSSFSAQPNSSVSGFLSIGASSLS 289
Query: 225 SSRVVWTSMSSDYTKYYSPGVAELFFGGETT--------GLK----NLPVVFDSGSSYTY 272
SS +T + + P + L+F G TT G+ N+P + DSG+ T
Sbjct: 290 SSPYKFTPLVKN------PKIPSLYFLGLTTITVAGKPLGVSASSYNVPTIIDSGTVITR 343
Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR-RPFKNVHDVKKCFRTLALSFT 331
L Y L +S K +AP L C+KG + V +++ FR A
Sbjct: 344 LPVAIYNALKKSFVMIMS-KKYAQAPGFSILDTCFKGSVKEMSTVPEIRIIFRGGA---- 398
Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGI 359
EL L+ KG CL I
Sbjct: 399 ------GLELKVHNSLVEIEKGTTCLAI 420
>gi|326504502|dbj|BAJ91083.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 537
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 79/288 (27%), Positives = 119/288 (41%), Gaps = 37/288 (12%)
Query: 29 LFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVE 88
L +L F++ G+++ Y V +G P + + LDTGSDL W+ CD C +C
Sbjct: 90 LLTFASGNLTFRLEGSLH---YAEVA--VGTPNATFLVALDTGSDLFWVPCD--CKQCAP 142
Query: 89 APH-------PLYRP-------SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADG 134
+ P RP ++ V CE +C +A C Y + Y
Sbjct: 143 IANASDLRGGPDLRPYSPGKSSTSKAVTCEHALCERPNACAAAG-NSSTSCPYTVRYVSA 201
Query: 135 G-SSLGVLVKDAFAFNYTNG----QRLNPRLALGCGYNQ----VPGASYHPLDGILGLGK 185
SS GVLV+D + + + LGCG Q + GA+ +DG+LGLG
Sbjct: 202 NTSSSGVLVEDVLHLSREAAGGASTAVTAPVVLGCGQVQTGAFLDGAA---VDGLLGLGM 258
Query: 186 GKSSIVSQLHSQKLI-RNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPG 244
K S+ S LH+ L+ + C S G G + FGD ++ + + Y
Sbjct: 259 DKVSVPSVLHAAGLVASDSFSMCFSPDGFGRINFGDSGRRGQAETPFTVRNTHPTYNISV 318
Query: 245 VAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAK 292
A G E + DSG+S+TYLN Y L + E+ +
Sbjct: 319 TAMSVSGKEVAA--EFAAIVDSGTSFTYLNDPAYTELATGFNSEVRER 364
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 95/340 (27%), Positives = 140/340 (41%), Gaps = 49/340 (14%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G Y + IG+P + ++DTGSDL W++C +PC C P PLY P S+ +PC
Sbjct: 85 GKYIMQFSIGEPPLLIWAEVDTGSDLMWVKC-SPCNGCNPPPSPLYDPARSRSSGKLPCS 143
Query: 105 DPICASL---HAPGHHNCEDPAQCDYELEYADGG--SSLGVLVKDAFAFNYTNGQRLNPR 159
+C +L +DP C Y Y G S+ GVL + F F +G N
Sbjct: 144 SQLCQALGRGRIISDQCSDDPPLCGYHYAYGHSGDHSTQGVLGTETFTFG--DGYVAN-N 200
Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIR------NVVGHCLSGGGG 213
++ G + + G+ + G++GLG+G S+VSQL + + NV L G
Sbjct: 201 VSFGRS-DTIDGSQFGGTAGLVGLGRGHLSLVSQLGAGRFAYCLAADPNVYSTILFGSLA 259
Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----------VV 263
D+ SS + T+ D +Y + + GG +K+ V
Sbjct: 260 ALDTSAGDV--SSTPLVTNPKPDRDTHYYVNLQGISVGGSRLPIKDGTFAINSDGSGGVF 317
Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
FDSG+ T L YQ + + E+ + L D+T C+ N V +
Sbjct: 318 FDSGAIDTSLKDAAYQVVRQAITSEI--QRLGYDAGDDT---CFVA----ANQQAVAQ-M 367
Query: 324 RTLALSFTDGKTRTLFELTPEAYLIISNKGN----VCLGI 359
L L F DG L YL S KG VC+ I
Sbjct: 368 PPLVLHFDDGAD---MSLNGRNYLKTSTKGPSEVLVCMAI 404
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 106/387 (27%), Positives = 147/387 (37%), Gaps = 68/387 (17%)
Query: 16 RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
RM+ S + ++ L + + + + N PT Y V + IG P +P L LDTGSDL
Sbjct: 47 RMALRSKARAARRLSSSASAPVSPGTYDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLI 106
Query: 76 WLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPA-----QCD 126
W QC PC C + P + P + L C+ +C L +C P C
Sbjct: 107 WTQCQ-PCPACFDQALPYFDPSTSSTLSLTSCDSTLCQGLPV---ASCGSPKFWPNQTCV 162
Query: 127 YELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG-YNQVPGASYHPLDGILGLGK 185
Y Y D + G L D F F P +A GCG +N G GI G G+
Sbjct: 163 YTYSYGDKSVTTGFLEVDKFTFVGAGASV--PGVAFGCGLFNN--GVFKSNETGIAGFGR 218
Query: 186 GKSSIVSQLHSQKLIRNVVGHCLSGGGG-----GFLFFGDDLYDSSRVVWTSM-----SS 235
G S+ SQL HC + G L DLY S R S +
Sbjct: 219 GPLSLPSQLKVGNF-----SHCFTAVNGLKPSTVLLDLPADLYKSGRGAVQSTPLIQNPA 273
Query: 236 DYTKYYSPGVAELFFGGETTGLKNLPV--------------VFDSGSSYTYLNRVTYQTL 281
+ T YY L G T G LPV + DSG++ T L Y+ +
Sbjct: 274 NPTFYY------LSLKGITVGSTRLPVPESEFALKNGTGGTIIDSGTAMTSLPTRVYRLV 327
Query: 282 TSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFEL 341
++ + D L P + V K L L F +G T +L
Sbjct: 328 RDAFAAQVKLPVVSGNTTDPYFCL----SAPLRAKPYVPK----LVLHF-EGAT---MDL 375
Query: 342 TPEAYLI-ISNKGN--VCLGILNGAEV 365
E Y+ + + G+ +CL I+ G EV
Sbjct: 376 PRENYVFEVEDAGSSILCLAIIEGGEV 402
>gi|356567798|ref|XP_003552102.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 520
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 100/369 (27%), Positives = 155/369 (42%), Gaps = 61/369 (16%)
Query: 37 LLFQVHGNVYPT-----GYYNVTMY-IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP 90
LLF HG+ + G+ + T IG P+ + + LD GSDL W+ CD CV+C
Sbjct: 76 LLFPSHGSKTMSLGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWIPCD--CVQCAPLS 133
Query: 91 HPLY----RPSNDLVPCEDPICASLHAPGHH-------NCEDPAQ-CDYELEY-ADGGSS 137
Y R N+ P +S H H NC+ Q C Y + Y ++ SS
Sbjct: 134 SSYYSNLDRDLNEYSPSRS--LSSKHLSCSHQLCDKGSNCKSSQQQCPYMVSYLSENTSS 191
Query: 138 LGVLVKDAFAF----NYTNGQRLNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIV 191
G+LV+D + +N P + LGCG Q G P DG+LGLG G+SS+
Sbjct: 192 SGLLVEDILHLQSGGSLSNSSVQAP-VVLGCGMKQSGGYLDGVAP-DGLLGLGPGESSVP 249
Query: 192 SQLHSQKLIRNVVGHCLSGGGGGFLFFGDD---LYDSSRVVWTSMSSDYTKYYSPGVAEL 248
S L LI + C + G +FFGD + S+ + + Y+ Y GV
Sbjct: 250 SFLAKSGLIHDSFSLCFNEDDSGRIFFGDQGPTIQQSTSFL--PLDGLYSTYII-GVESC 306
Query: 249 FFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAK---------------S 293
G + + V DSG+S+T+L Y + ++++ S
Sbjct: 307 CVGNSCLKMTSFKVQVDSGTSFTFLPGHVYGAIAEEFDQQVNGSRSSFEGSPWEYCYVPS 366
Query: 294 LKEAPEDETLPLCWKGRRPFKNVHDVKKCFR--------TLALSFTDGKTRTLFELTPEA 345
+E P+ +L L ++ F V+D F LA+ T+G T+ +
Sbjct: 367 SQELPKVPSLTLTFQQNNSFV-VYDPVFVFYGNEGVIGFCLAIQPTEGDMGTIGQNFMTG 425
Query: 346 YLIISNKGN 354
Y ++ ++GN
Sbjct: 426 YRLVFDRGN 434
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 73/253 (28%), Positives = 111/253 (43%), Gaps = 33/253 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y V + +G P R ++ +D+GSD+ W+QC PC +C P++ P++ V C
Sbjct: 137 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCTQCYHQSDPVFDPADSASFTGVSC 195
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+C L G H +C YE+ Y DG + G L + F G+ + +A+G
Sbjct: 196 SSSVCDRLENAGCH----AGRCRYEVSYGDGSYTKGTLALETLTF----GRTMVRSVAIG 247
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG---GGFLFFGD 220
CG+ + G+LGLG G S V QL Q +CL G G L FG
Sbjct: 248 CGHRNR--GMFVGAAGLLGLGGGSMSFVGQLGGQT--GGAFSYCLVSRGTDSSGSLVFGR 303
Query: 221 DLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGG----------ETTGLKNLPVVFDSGS 268
+ + W + + +Y G+A L GG T L + VV D+G+
Sbjct: 304 EALPAG-AAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGT 362
Query: 269 SYTYLNRVTYQTL 281
+ T L + YQ
Sbjct: 363 AVTRLPTLAYQAF 375
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 106/393 (26%), Positives = 163/393 (41%), Gaps = 58/393 (14%)
Query: 20 SSSSSSSSSLFNHVGSSLLFQVHGNV-YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQ 78
SS + S S ++ L+ + V +G Y + ++IG P + + L LDTGSDL W+Q
Sbjct: 164 SSPAESPESYADYFSGQLMATLESGVSLGSGEYFIDVFIGSPPKHFSLILDTGSDLNWIQ 223
Query: 79 CDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPG-HHNCEDPAQ-CDYELEYA 132
C PC C E P Y P + + + C DP C + +P C+ Q C Y Y
Sbjct: 224 C-VPCFDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYG 282
Query: 133 DGGSSLGVLVKDAFAFNYTNGQ------RLNPRLALGCG-YNQVPGASYHPLDGILGLGK 185
D ++ G + F N T+ R + GCG +N+ +H G+LGLG+
Sbjct: 283 DSSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNR---GLFHGAAGLLGLGR 339
Query: 186 GKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFFGD--DLYDSSRVVWTSM----S 234
G S SQL Q L + +CL L FG+ DL + +TS+
Sbjct: 340 GPLSFSSQL--QSLYGHSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKE 397
Query: 235 SDYTKYYSPGVAELFFGGETTGLK----NLP------VVFDSGSSYTYLNRVTYQTLTSI 284
+ +Y + +F GGE + NL + DSG++ +Y + Y+ +
Sbjct: 398 NPVDTFYYLQIKSIFVGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRII--- 454
Query: 285 MKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC-FRTLALSFTDGKTRTLFELTP 343
KE + +K E P+ P NV + F + F DG ++
Sbjct: 455 --KEAFLRKVKGYKLVEDFPIL----HPCYNVSGTDELNFPEFLIQFADG---AVWNFPV 505
Query: 344 EAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGG 375
E Y I I VCL +L + L++IG
Sbjct: 506 ENYFIRIQQLDIVCLAMLGTPKSA---LSIIGN 535
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 91/353 (25%), Positives = 144/353 (40%), Gaps = 69/353 (19%)
Query: 40 QVHGNVYP-TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
++ V P G + + + IG P Y LDTGSDL W QC PC +C P++ P
Sbjct: 85 EIEAPVLPGNGEFLMKLAIGTPPETYSAILDTGSDLIWTQCK-PCTQCFHQSTPIFDPKK 143
Query: 99 DLVPCEDPICASL-HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
+ + L A +C + C+Y Y D S+ G+L + F G+
Sbjct: 144 SSSFSKLSCSSQLCEALPQSSCNN--GCEYLYSYGDYSSTQGILASETLTF----GKASV 197
Query: 158 PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLF 217
P +A GCG + G+ + G++GLG+G S+VSQL K +CL+
Sbjct: 198 PNVAFGCGADN-EGSGFSQGAGLVGLGRGPLSLVSQLKEPKF-----SYCLTT------- 244
Query: 218 FGDDLYDSSRVVWTSMSSDYTK--------YYSPGVAELFF---GGETTGLKNLPV---- 262
DD S+ ++ + S + + +SP ++ G + G LP+
Sbjct: 245 -VDDTKTSTLLMGSLASVNASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKST 303
Query: 263 -----------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET----LPLCW 307
+ DSG++ TYL + +++ KE +AK P D + L +C+
Sbjct: 304 FSLQDDGSGGLIIDSGTTITYLEESAF----NLVAKEFTAK--INLPVDSSGSTGLDVCF 357
Query: 308 KGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGI 359
N+ K F DG EL E Y+I S+ G CL +
Sbjct: 358 TLPSGSTNIEVPKLVFH------FDGAD---LELPAENYMIGDSSMGVACLAM 401
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 99/348 (28%), Positives = 148/348 (42%), Gaps = 45/348 (12%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSNDL----VPCE 104
Y VT+ G PA P L +DTGSDL+W+QC PC C P++ PS VPC
Sbjct: 122 YVVTLGFGTPAVPQVLLIDTGSDLSWVQCQ-PCNSSTCYPQKDPVFDPSASSTYAPVPCG 180
Query: 105 DPICASLHAPGHHN-CEDPAQ----CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
C L + N C + + C Y ++Y +G +++GV + + +N
Sbjct: 181 SEACRDLDPDSYANGCTNSSSGASLCQYGIQYGNGDTTVGVYSTETLTLSPEAATVVN-N 239
Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG--GGFLF 217
+ GCG Q + DG+LGLG S+VSQ + +CL G GFL
Sbjct: 240 FSFGCGLVQ--KGVFDLFDGLLGLGGAPESLVSQ--TTGTYGGAFSYCLPAGNSTAGFLA 295
Query: 218 FGDDLY---DSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF------DSGS 268
G +++ +T + T +Y + + GG+ ++ P VF DSG+
Sbjct: 296 LGAPATGGNNTAGFQFTPLQVVETTFYLVKLTGISVGGKQLDIE--PTVFAGGMIIDSGT 353
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
T L Y L + + +SA L +DE L C+ F +V T+AL
Sbjct: 354 IVTGLPETAYSALRTAFRSAMSAYPLLPPNDDEDLDTCYD----FTGNTNVT--VPTVAL 407
Query: 329 SFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
+F G T L P L+ + CL + GA G D +IG +
Sbjct: 408 TFEGGVTIDLD--VPSGVLL-----DGCLAFVAGASDG--DTGIIGNV 446
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 106/393 (26%), Positives = 163/393 (41%), Gaps = 58/393 (14%)
Query: 20 SSSSSSSSSLFNHVGSSLLFQVHGNV-YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQ 78
SS + S S ++ L+ + V +G Y + ++IG P + + L LDTGSDL W+Q
Sbjct: 164 SSPAESPESYADYFSGQLMATLESGVSLGSGEYFIDVFIGSPPKHFSLILDTGSDLNWIQ 223
Query: 79 CDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPG-HHNCEDPAQ-CDYELEYA 132
C PC C E P Y P + + + C DP C + +P C+ Q C Y Y
Sbjct: 224 C-VPCFDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYG 282
Query: 133 DGGSSLGVLVKDAFAFNYTNGQ------RLNPRLALGCG-YNQVPGASYHPLDGILGLGK 185
D ++ G + F N T+ R + GCG +N+ +H G+LGLG+
Sbjct: 283 DSSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNR---GLFHGAAGLLGLGR 339
Query: 186 GKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFFGD--DLYDSSRVVWTSM----S 234
G S SQL Q L + +CL L FG+ DL + +TS+
Sbjct: 340 GPLSFSSQL--QSLYGHSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKE 397
Query: 235 SDYTKYYSPGVAELFFGGETTGLK----NLP------VVFDSGSSYTYLNRVTYQTLTSI 284
+ +Y + +F GGE + NL + DSG++ +Y + Y+ +
Sbjct: 398 NPVDTFYYLQIKSIFVGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRII--- 454
Query: 285 MKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC-FRTLALSFTDGKTRTLFELTP 343
KE + +K E P+ P NV + F + F DG ++
Sbjct: 455 --KEAFLRKVKGYKLVEDFPIL----HPCYNVSGTDELNFPEFLIQFADG---AVWNFPV 505
Query: 344 EAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGG 375
E Y I I VCL +L + L++IG
Sbjct: 506 ENYFIRIQQLDIVCLAMLGTPKSA---LSIIGN 535
>gi|222642011|gb|EEE70143.1| hypothetical protein OsJ_30189 [Oryza sativa Japonica Group]
Length = 671
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 72/256 (28%), Positives = 112/256 (43%), Gaps = 38/256 (14%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRPSNDL----VPCE 104
+G P + + LDTGSDL W+ CD C++C P +Y P+ VPC
Sbjct: 41 LGTPNVTFLVALDTGSDLFWVPCD--CLKCAPFQSPNYGSLKFDVYSPAQSTTSRKVPCS 98
Query: 105 DPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQR--LNPRL 160
+C +A C + C Y ++Y +D SS GVLV+D + Q + +
Sbjct: 99 SNLCDLQNA-----CRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPI 153
Query: 161 ALGCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLF 217
GCG QV S+ +G+LGLG S+ S L S+ L N C G G +
Sbjct: 154 MFGCG--QVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRIN 211
Query: 218 FGD----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYL 273
FGD D ++ V+ YY+ + + G ++ + + DSG+S+T L
Sbjct: 212 FGDTGSSDQKETPLNVYKQ-----NPYYNITITGITVGSKSISTE-FSAIVDSGTSFTAL 265
Query: 274 NRVTYQTLTSIMKKEL 289
+ Y +TS ++
Sbjct: 266 SDPMYTQITSSFDAQI 281
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 71/260 (27%), Positives = 110/260 (42%), Gaps = 36/260 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y V + +G P +L +D+GSD+ W+QC PC+ C PL+ P+ V C
Sbjct: 168 SGEYLVRVSVGSPPTEQYLVVDSGSDVMWVQCK-PCLECYVQADPLFDPATSATFSGVSC 226
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
IC L + E C+YE+ YADG + G L + T + + +G
Sbjct: 227 GSAICRILPTSACGDGE-LGGCEYEVSYADGSYTKGALALETLTLGGTAVE----GVVIG 281
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG---------- 213
CG+ + G++GLG G S+V QL + + +CL+ GG
Sbjct: 282 CGHRNR--GLFVGAAGLMGLGWGPMSLVGQLGGE--VGGAFSYCLASRGGYGSGAADDDA 337
Query: 214 GFLFFGDDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGE----TTGLKNLP------ 261
G+L G VW + + +Y G++ + G E GL L
Sbjct: 338 GWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLFQLTEDGAGD 397
Query: 262 VVFDSGSSYTYLNRVTYQTL 281
VV D+G++ T L + Y L
Sbjct: 398 VVMDTGTTVTRLPQEAYAAL 417
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 102/363 (28%), Positives = 156/363 (42%), Gaps = 63/363 (17%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDP 106
Y + +Y+G P R + + +DTGSDL WLQC APC+ C E P++ P+ + C DP
Sbjct: 146 YLMDVYVGTPPRRFQMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPAASSSYRNLTCGDP 204
Query: 107 ICASL---HAPGHHNCEDPAQ--CDYELEYADGGSSLGVLVKDAFAFNYT---NGQRLNP 158
C + AP C P + C Y Y D +S G L ++F N T R++
Sbjct: 205 RCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVNLTAPGASSRVD- 263
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVG-----HCLSGGGG 213
+ GCG+ +H G+LGLG+G S SQL R V G +CL G
Sbjct: 264 GVVFGCGHRNR--GLFHGAAGLLGLGRGPLSFASQL------RAVYGGHTFSYCLVDHGS 315
Query: 214 GF---LFFGDD----LYDSSRVVWTSM---SSDYTKYYSPGVAELFFGGETTGLKNLP-- 261
+ FG+D L R+ +T+ SS +Y + + GGE + +
Sbjct: 316 DVASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRLTGVLVGGELLNISSDTWD 375
Query: 262 --------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPF 313
+ DSG++ +Y YQ + +S S P+ L C+
Sbjct: 376 ASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSG-SYPPVPDFPVLSPCY------ 428
Query: 314 KNVHDVKKC-FRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLN 371
NV V++ L+L F DG +++ E Y I + G +CL +L G ++
Sbjct: 429 -NVSGVERPEVPELSLLFADG---AVWDFPAENYFIRLDPDGIMCLAVLGTPRTG---MS 481
Query: 372 VIG 374
+IG
Sbjct: 482 IIG 484
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 89/306 (29%), Positives = 123/306 (40%), Gaps = 60/306 (19%)
Query: 16 RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVY----PTGYYNVTMYIGQPARPYFLDLDTG 71
RM++ S + S+ L S+ +V Y P Y V M IG P +P L LDTG
Sbjct: 75 RMAARSKARSARLLSGRAASA---RVDPGSYTDGVPDTEYLVHMAIGTPPQPVQLILDTG 131
Query: 72 SDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQ--- 124
SDLTW QC APCV C P + PS + +PC+ IC L +C + +
Sbjct: 132 SDLTWTQC-APCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLT---WSSCGEQSWGNG 187
Query: 125 -CDYELEYADGGSSLGVLVKDAFAF---NYTNGQRLNPRLALGCG-YNQVPGASYHPLDG 179
C Y YAD + G L D F+F ++ G P L GCG +N G G
Sbjct: 188 ICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNN--GIFVSNETG 245
Query: 180 ILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG-----FLFFGDDLYDSSR-----VV 229
I G +G S+ +QL +C + G FL +LY + VV
Sbjct: 246 IAGFSRGALSMPAQLKVDNF-----SYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVV 300
Query: 230 WTSMSSDYTKYYSPGVAELFFG--GETTGLKNLPV---------------VFDSGSSYTY 272
S+ +Y+S + + G T G LP+ + DSG+ T
Sbjct: 301 ---QSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTM 357
Query: 273 LNRVTY 278
L Y
Sbjct: 358 LPEAVY 363
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 77/279 (27%), Positives = 112/279 (40%), Gaps = 28/279 (10%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SN 98
G+ TG Y VT G PA+ L +DTGSD+TW+QC PC C P++ P S
Sbjct: 130 GSKVGTGNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCK-PCSDCYSQVDPIFEPQQSSSY 188
Query: 99 DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
+ C C L H C YE+ Y DG S G ++ T G P
Sbjct: 189 KHLSCLSSACTELTTMNHCRL---GGCVYEINYGDGSRSQGDFSQETL----TLGSDSFP 241
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGG 213
A GCG+ + G+LGLG+ S SQ S+ +CL S G
Sbjct: 242 SFAFGCGHTNT--GLFKGSAGLLGLGRTALSFPSQTKSK--YGGQFSYCLPDFVSSTSTG 297
Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV-----VFDSGS 268
F + ++ V +S+Y +Y G+ + GGE + + + DSG+
Sbjct: 298 SFSVGQGSIPATATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVLGRGGTIVDSGT 357
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
T L Y L + + + ++L A L C+
Sbjct: 358 VITRLVPQAYDALKTSFRSK--TRNLPSAKPFSILDTCY 394
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 86/297 (28%), Positives = 129/297 (43%), Gaps = 32/297 (10%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
G Y + + IG P P DTGSDL W QC+ PC C + PL+ P V C
Sbjct: 84 GEYLMNISIGTPPVPILAIADTGSDLIWTQCN-PCEDCYQQTSPLFDPKESSTYRKVSCS 142
Query: 105 DPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LAL 162
C +L +C D C Y + Y D + G + D + + ++ R + +
Sbjct: 143 SSQCRALE---DASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMII 199
Query: 163 GCGYNQVPGASYHPL-DGILGLGKGKSSIVSQLHSQKLIRNVVGHCL------SGGGGGF 215
GCG+ ++ P GI+GLG G +S+VSQL +K I +CL +G
Sbjct: 200 GCGHENT--GTFDPAGSGIIGLGGGSTSLVSQL--RKSINGKFSYCLVPFTSETGLTSKI 255
Query: 216 LFFGDDLYDSSRVVWTSM-SSDYTKYY-------SPGVAELFFGGETTGLKNLPVVFDSG 267
F + + VV TSM D YY S G ++ F G +V DSG
Sbjct: 256 NFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGEGNIVIDSG 315
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
++ T L Y L S++ + A+ +++ D L LC++ FK V D+ F+
Sbjct: 316 TTLTLLPSNFYYELESVVASTIKAERVQDP--DGILSLCYRDSSSFK-VPDITVHFK 369
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 57/169 (33%), Positives = 82/169 (48%), Gaps = 14/169 (8%)
Query: 34 GSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPL 93
G L+ V +G Y + +G PA L LDT SDLTWLQC PC RC P+
Sbjct: 117 GRGLVAPVVSRAPTSGEYMAKIAVGTPAVQALLALDTASDLTWLQCQ-PCRRCYPQSGPV 175
Query: 94 YRPSNDL----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADG----GSSLGVLVKDA 145
+ P + + + P C +L G + + C Y ++Y DG +S+G LV++
Sbjct: 176 FDPRHSTSYGEMNYDAPDCQALGRSGGGDAKR-GTCIYTVQYGDGHGSTSTSVGDLVEET 234
Query: 146 FAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQL 194
F G L++GCG++ G P GILGLG+G+ SI Q+
Sbjct: 235 LTF---AGGVRQAYLSIGCGHDN-KGLFGAPAAGILGLGRGQISIPHQI 279
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 80/289 (27%), Positives = 126/289 (43%), Gaps = 24/289 (8%)
Query: 16 RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
R+ S + SS +F ++ L G G Y VT+ +G P + + L DTGSD+T
Sbjct: 96 RVDSIHARLSSRGMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDIT 155
Query: 76 WLQCDAPCVR-CVEAPHPLYRPSNDL----VPCEDPICASLHAPGH---HNCEDPAQCDY 127
W QC+ PCV+ C + P PS + C +C L A G +C + C Y
Sbjct: 156 WTQCE-PCVKTCYKQKEPRLNPSTSTSYKNISCSSALC-KLVASGKKFSQSCSS-STCLY 212
Query: 128 ELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGK 187
+++Y DG S+G + + +N + GCG Q + G+LGLG+ K
Sbjct: 213 QVQYGDGSYSIGFFATETLTLSSSN---VFKNFLFGCG--QQNNGLFGGAAGLLGLGRTK 267
Query: 188 SSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGV 245
++ SQ + K + + +CL S G+L G + S + S D T +Y +
Sbjct: 268 LALPSQ--TAKTYKKLFSYCLPASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDI 325
Query: 246 AELFFGGETTGLKNLP----VVFDSGSSYTYLNRVTYQTLTSIMKKELS 290
L GG + V DSG+ T L+ Y L+S + ++
Sbjct: 326 TGLSVGGRKLSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMT 374
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 58/186 (31%), Positives = 88/186 (47%), Gaps = 21/186 (11%)
Query: 46 YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLV 101
+ T Y VTM +G + + +DTGSDLTW+QC+ PC+ C P+++PS +
Sbjct: 140 FQTLNYIVTMELG--GQDMTVIIDTGSDLTWVQCE-PCMSCYNQQGPVFKPSTSSSYQSI 196
Query: 102 PCEDPICASLHAPGHH--NCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
PC C SL + CE +P+ C Y + Y DG + G L + +F G
Sbjct: 197 PCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSF----GGISVS 252
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGF 215
GCG N + + G++GLG+ S++SQ +S V +CL G G
Sbjct: 253 NFVFGCGKNN--KGLFGGVSGLMGLGRSNLSLISQTNST--FGGVFSYCLPPTDAGASGS 308
Query: 216 LFFGDD 221
L G++
Sbjct: 309 LAMGNE 314
>gi|403222804|dbj|BAM40935.1| aspartyl(acid) protease [Theileria orientalis strain Shintoku]
Length = 509
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 83/353 (23%), Positives = 143/353 (40%), Gaps = 44/353 (12%)
Query: 40 QVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYR---- 95
+V GN++ YY V + IG P L +DTGS L + C C C P Y
Sbjct: 69 KVFGNLHKFAYYYVYVGIGNPKTKQMLIIDTGSQLINVAC-GKCKECGNHLLPNYELGAS 127
Query: 96 PSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
++ L+ C+ C ++ C C + Y++G + G +V D +F+
Sbjct: 128 VTHKLIDCDSEFCKAVEGK----CGLDESCLFNESYSEGSNVEGKVVGDLISFDIKKDSS 183
Query: 156 LNPRL--ALGCGYNQVPGASYHPLDGILGLGKG-KSSIVSQ--LHSQKLI---------- 200
+GC N+ +GILGL K K +++S +Q I
Sbjct: 184 YLSTFFNYIGCVTNESQLIKSQITNGILGLAKSDKPTLISHEYFETQSFIEKYLTDHFRP 243
Query: 201 -RNVVGHCLSGGGGGFLFFGDD------LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE 253
+ + CLS GG G D + ++++++W + +++Y V + F
Sbjct: 244 MKKIFSLCLSENGGVMTLGGVDDQLNLKIKNTTQLIWAPLVK--SEFYIIKVLDASFQEN 301
Query: 254 TTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPF 313
KN V D+G++ + L + + + I + L K + E +T C ++
Sbjct: 302 KIEFKNKNFVLDTGTTISTLEKEVFNKIHKIFEG-LCEDITKLSNEKKTSSKCTVDKKTG 360
Query: 314 KNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNV------CLGI 359
K ++ L+F +G FE T ++Y+I +NK V CLGI
Sbjct: 361 KMCFSDISKLPSIVLTFENGSN---FEWTSDSYMINRTNKRTVNDYSWWCLGI 410
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 79/288 (27%), Positives = 125/288 (43%), Gaps = 22/288 (7%)
Query: 16 RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
R+ S + SS +F ++ L G G Y VT+ +G P + + L DTGSD+T
Sbjct: 84 RVDSIHARLSSRGMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDIT 143
Query: 76 WLQCDAPCVR-CVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCE--DPAQCDYE 128
W QC+ PCV+ C + P PS + C +C L A G + + C Y+
Sbjct: 144 WTQCE-PCVKTCYKQKEPRLNPSTSTSYKNISCSSALC-KLVASGKKFSQSCSSSTCLYQ 201
Query: 129 LEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKS 188
++Y DG S+G + + +N + GCG Q + G+LGLG+ K
Sbjct: 202 VQYGDGSYSIGFFATETLTLSSSN---VFKNFLFGCG--QQNNGLFGGAAGLLGLGRTKL 256
Query: 189 SIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVA 246
++ SQ + K + + +CL S G+L G + S + S D T +Y +
Sbjct: 257 ALPSQ--TAKTYKKLFSYCLPASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDIT 314
Query: 247 ELFFGGETTGLKNLP----VVFDSGSSYTYLNRVTYQTLTSIMKKELS 290
L GG + V DSG+ T L+ Y L+S + ++
Sbjct: 315 GLSVGGRKLSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMT 362
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 99/385 (25%), Positives = 161/385 (41%), Gaps = 55/385 (14%)
Query: 16 RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
+ +SSS + SS + S + FQ T Y VTM +G ++ + +DTGSDLT
Sbjct: 94 KRTSSSQIADSSETQVPLTSGIKFQ-------TLNYIVTMGLG--SQNMSVIVDTGSDLT 144
Query: 76 WLQCDAPCVRCVEAPHPLYRPSN----DLVPCEDPICASLHAPGHHNCEDP---AQCDYE 128
W+QC+ PC C PL++PS + C C SL + DP A CDY
Sbjct: 145 WVQCE-PCRSCYNQNGPLFKPSTSPSYQPILCNSTTCQSLELGACGS--DPSTSATCDYV 201
Query: 129 LEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKS 188
+ Y DG + G L + F G GCG N + G++GLG+ +
Sbjct: 202 VNYGDGSYTSGELGIEKLGF----GGISVSNFVFGCGRNN--KGLFGGASGLMGLGRSEL 255
Query: 189 SIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDD---LYDSSRVVWTSM--SSDYTK 239
S++SQ ++ V +CL G G L G+ + + + +T M + +
Sbjct: 256 SMISQTNAT--FGGVFSYCLPSTDQAGASGSLVMGNQSGVFKNVTPIAYTRMLPNLQLSN 313
Query: 240 YYSPGVAELFFGG-----ETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSL 294
+Y + + GG + + N V+ DSG+ + L Y+ L + ++ S
Sbjct: 314 FYILNLTGIDVGGVSLHVQASSFGNGGVILDSGTVISRLAPSVYKALKAKFLEQFSG--F 371
Query: 295 KEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGN 354
AP L C+ + V+ T+++ F +G + T YL+ +
Sbjct: 372 PSAPGFSILDTCFN-LTGYDQVN-----IPTISMYF-EGNAELNVDATGIFYLVKEDASR 424
Query: 355 VCLGILNGAEVGLQDLNVIGGIGDF 379
VCL + L D +G IG++
Sbjct: 425 VCLAL-----ASLSDEYEMGIIGNY 444
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 82/280 (29%), Positives = 112/280 (40%), Gaps = 49/280 (17%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
T Y V + +G P RP L LDTGSDL W QC APC C PL P+ +PC
Sbjct: 89 TNEYLVHLAVGTPPRPVALTLDTGSDLVWTQC-APCRDCFHQGLPLLDPAASSTYAALPC 147
Query: 104 EDPICASL----------HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
P C +L + G+ N C Y Y D ++G + D F F NG
Sbjct: 148 GAPRCRALPFTSCGGGGRSSWGNGN----RSCAYIYHYGDKSVTVGEIATDRFTFGGDNG 203
Query: 154 ---QRL-NPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGH-- 206
RL RL GCG +N+ G GI G G+G+ S+ SQL+
Sbjct: 204 DGDSRLPTRRLTFGCGHFNK--GVFQSNETGIAGFGRGRWSLPSQLNVTTFSYCFTSMFE 261
Query: 207 ------CLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNL 260
L G L + + S V T + + ++ P + L G + G L
Sbjct: 262 SKSSLVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQ---PSLYFLSLKGISVGKTRL 318
Query: 261 PV--------VFDSGSSYTYLNRVTYQTLTSIMKKELSAK 292
V + DSG+S T L Y+ +K E +A+
Sbjct: 319 AVPEAKLRSTIIDSGASITTLPEAVYEA----VKAEFAAQ 354
>gi|32526671|dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
sativa Japonica Group]
Length = 732
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 72/258 (27%), Positives = 113/258 (43%), Gaps = 38/258 (14%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRPSNDL----VPCE 104
+G P + + LDTGSDL W+ CD C++C P +Y P+ VPC
Sbjct: 105 LGTPNVTFLVALDTGSDLFWVPCD--CLKCAPFQSPNYGSLKFDVYSPAQSTTSRKVPCS 162
Query: 105 DPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAFAFNYTNGQR--LNPRL 160
+C +A C + C Y ++Y +D SS GVLV+D + Q + +
Sbjct: 163 SNLCDLQNA-----CRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKIVTAPI 217
Query: 161 ALGCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLF 217
GCG QV S+ +G+LGLG S+ S L S+ L N C G G +
Sbjct: 218 MFGCG--QVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDDGHGRIN 275
Query: 218 FGD----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYL 273
FGD D ++ V+ YY+ + + G ++ + + DSG+S+T L
Sbjct: 276 FGDTGSSDQKETPLNVYKQ-----NPYYNITITGITVGSKSISTE-FSAIVDSGTSFTAL 329
Query: 274 NRVTYQTLTSIMKKELSA 291
+ Y +TS ++ +
Sbjct: 330 SDPMYTQITSSFDAQIRS 347
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 118/283 (41%), Gaps = 37/283 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV-----EAPHPLYRPSNDLVP 102
+G Y V++ IG P + L DTGSDL W++C +PC C A + + +
Sbjct: 83 SGQYFVSLRIGTPPQTLLLVADTGSDLIWVKC-SPCRNCSHRSPGSAFFARHSTTYSAIH 141
Query: 103 CEDPICASLHAPGHHNCEDP---AQCDYELEYADGGSSLGVLVKDAFAFNYTNG--QRLN 157
C P C + P + C + C Y+ YAD ++ G K+A N + G ++LN
Sbjct: 142 CYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTSTGKVKKLN 201
Query: 158 PRLALGCGYN----QVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGHCLSG 210
L+ GCG+ + GAS+ G++GLG+ S SQL + K ++ + LS
Sbjct: 202 -GLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSKFSYCLMDYTLSP 260
Query: 211 GGGGFLFFG--DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV------ 262
FL G ++ S + + S + SP + G LP+
Sbjct: 261 PPTSFLTIGGAQNVAVSKKGIM-SFTPLLINPLSPTFYYIAIKGVYVNGVKLPINPSVWS 319
Query: 263 ---------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
+ DSG++ T++ Y + KK + S E
Sbjct: 320 IDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAE 362
>gi|357152658|ref|XP_003576193.1| PREDICTED: F-box/FBD/LRR-repeat protein At5g22660-like
[Brachypodium distachyon]
Length = 594
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 44/101 (43%), Positives = 57/101 (56%), Gaps = 8/101 (7%)
Query: 89 APHPLYRPS--NDLVPCEDPICASLHAP--GHHNCE-DPAQCDYELEYADGGSSLGVLVK 143
PH LY+P N L+ C D C +H +C DP QCDYE+EY +G +S+GVL+
Sbjct: 382 VPHDLYKPRRMNKLL-CGDERCVKVHKDLDIEQDCTLDPNQCDYEIEYTNGENSMGVLLA 440
Query: 144 DAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLG 184
D F+ T RLN LA GCGY G P+DG+L +G
Sbjct: 441 DTFSLPTTTNDRLN--LAFGCGYGHQGGQEVTPVDGVLRIG 479
>gi|147859621|emb|CAN83119.1| hypothetical protein VITISV_043393 [Vitis vinifera]
Length = 431
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 86/320 (26%), Positives = 120/320 (37%), Gaps = 61/320 (19%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPI 107
G Y + IG PAR Y++ ++ LT + LV C+
Sbjct: 95 VGLYYAKIGIGTPARDYYVQME----LTLYDIKESL-------------TGKLVSCDQDF 137
Query: 108 CASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVK---DAFAFNYTNGQRLNPRLA--L 162
C +++ C C Y YADG SS G VK A +N NP L L
Sbjct: 138 CYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVKGYCTASKYNSIPHLNNNPLLEVPL 197
Query: 163 GCGYNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-GGGGFLFFGD 220
C Q +S LDGILG GK +S++SQL S +R + HCL G GGG G
Sbjct: 198 RCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIFAIGH 257
Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-----------VVFDSGSS 269
+ +V T + + T +Y+ + + GG NLP + DSG++
Sbjct: 258 IV--QPKVNTTPLVPNQT-HYNVNMKAVEVGGY---FLNLPTDVFDVGDKKGTIIDSGTT 311
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS 329
YL V Y L S + W+ +HD CF+ + S
Sbjct: 312 LAYLPEVVYDQLLSKI-------------------FSWQSDLKVHTIHDQFTCFQ-YSES 351
Query: 330 FTDGKTRTLFELTPEAYLII 349
DG F YL +
Sbjct: 352 LDDGFPAVTFHFENSLYLKV 371
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 67/133 (50%), Gaps = 13/133 (9%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL 100
V G +G Y + IG P R ++ LDTGSD+ W+QC+ PC C P++ PS+ +
Sbjct: 144 VSGMEQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCE-PCRECYSQADPIFNPSSSV 202
Query: 101 ----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
V C+ +C+ L A H C YE+ Y DG ++G + F T+ Q
Sbjct: 203 SFSTVGCDSAVCSQLDANDCHG----GGCLYEVSYGDGSYTVGSYATETLTFGTTSIQ-- 256
Query: 157 NPRLALGCGYNQV 169
+A+GCG++ V
Sbjct: 257 --NVAIGCGHDNV 267
>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 519
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 83/350 (23%), Positives = 140/350 (40%), Gaps = 55/350 (15%)
Query: 54 TMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL------------- 100
T+ IG P + + LDTGSDL W+ CD C RC + + DL
Sbjct: 103 TVQIGTPGVKFMVALDTGSDLFWVPCD--CTRCAASDSTAFASDFDLNVYNPNGSSTSKK 160
Query: 101 VPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSL-GVLVKDAFAFNYTNGQR--L 156
V C + +C C + C Y + Y +S G+LV+D + +
Sbjct: 161 VTCNNSLCTH-----RSQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQEDNHHDLV 215
Query: 157 NPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG 213
+ GCG Q+ S+ + +G+ GLG K S+ S L + + C G
Sbjct: 216 EANVIFGCG--QIQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGI 273
Query: 214 GFLFFGDD-LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTY 272
G + FGD +D + S T Y+ V ++ G ++ +FDSG+S+TY
Sbjct: 274 GRISFGDKGSFDQDETPFNLNPSHPT--YNITVTQVRVGTTVIDVE-FTALFDSGTSFTY 330
Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK-----KCFRTLA 327
L TY LT ++ + + R PF+ +D+ +++
Sbjct: 331 LVDPTYTRLTESFHSQVQDRRHRS-----------DSRIPFEYCYDMSPDANTSLIPSVS 379
Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNV--CLGILNGAEVGLQDLNVIGG 375
L+ G ++ + +IIS + + CL ++ AE+ + N + G
Sbjct: 380 LTMGGGSHFAVY----DPIIIISTQSELVYCLAVVKSAELNIIGQNFMTG 425
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 75/270 (27%), Positives = 111/270 (41%), Gaps = 28/270 (10%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLV----PCE 104
G Y + + IG P P +DTGSDLTW QC PC C + P + P N C
Sbjct: 90 GEYIMNLSIGTPPVPVIAIVDTGSDLTWTQC-RPCTHCYKQVVPFFDPKNSSTYRDSSCG 148
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
C +L +C + +C + YADG + G L + T G+ ++ P A G
Sbjct: 149 TSFCLALG--NDRSCRNGKKCTFMYSYADGSFTGGNLAVETLTVASTAGKPVSFPGFAFG 206
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL------SGGGGGFLF 217
C + H GI+GLG + S++SQL S I +CL S F
Sbjct: 207 CVHRSGGIFDEHS-SGIVGLGVAELSMISQLKST--INGRFSYCLLPVFTDSSMSSRINF 263
Query: 218 FGDDLYDSSRVVWTS--MSSDYTKYY-------SPGVAELFFGG--ETTGLKNLPVVFDS 266
+ + V T M T YY S G L + G + ++ ++ DS
Sbjct: 264 GRSGIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYKGFSKKAEVEEGNIIVDS 323
Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
G++YTYL Y L + + K +++
Sbjct: 324 GTTYTYLPLEFYVKLEESVAHSIKGKRVRD 353
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 107/363 (29%), Positives = 145/363 (39%), Gaps = 64/363 (17%)
Query: 42 HGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----S 97
+ N PT Y V + IG P +P L LDTGSDL W QC PC C + P + P +
Sbjct: 26 YDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQ-PCPACFDQALPYFDPSTSST 84
Query: 98 NDLVPCEDPICASLHAPGHHNCEDPA-----QCDYELEYADGGSSLGVLVKDAFAFNYTN 152
L C+ +C L +C P C Y Y D + G L D F F
Sbjct: 85 LSLTSCDSTLCQGLPV---ASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAG 141
Query: 153 GQRLNPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
P +A GCG +N G GI G G+G S+ SQL HC +
Sbjct: 142 ASV--PGVAFGCGLFNN--GVFKSNETGIAGFGRGPLSLPSQLKVGNF-----SHCFTTI 192
Query: 212 GGG-----FLFFGDDLYDSSR-VVWTSMSSDYTKYYS-PGVAELFFGGETTGLKNLPV-- 262
G L DL+ + + V T+ Y K + P + L G T G LPV
Sbjct: 193 TGAIPSTVLLDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPE 252
Query: 263 ------------VFDSGSSYTYLNRVTYQTLTSIMKKELSAK-SLKEAPEDET-LPLCWK 308
+ DSG+S T L YQ +++ E +A+ L P + T C+
Sbjct: 253 SAFALTNGTGGTIIDSGTSITSLPPQVYQ----VVRDEFAAQIKLPVVPGNATGHYTCFS 308
Query: 309 GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYL--IISNKGN--VCLGILNGAE 364
P + DV K L L F +G T +L E Y+ + + GN +CL I G E
Sbjct: 309 A--PSQAKPDVPK----LVLHF-EGAT---MDLPRENYVFEVPDDAGNSIICLAINKGDE 358
Query: 365 VGL 367
+
Sbjct: 359 TTI 361
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 53/167 (31%), Positives = 83/167 (49%), Gaps = 11/167 (6%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y + +G PA L +DTGSD+TWLQC PC RC P++ P + +
Sbjct: 131 SGEYMAKIAVGTPAVEALLAMDTGSDITWLQCQ-PCRRCYPQSGPVFDPRHSTSYREMGY 189
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGS-SLGVLVKDAFAFNYTNGQRLNPRLAL 162
+ P C +L G + + C Y + Y D GS ++G +++ F G P +++
Sbjct: 190 DAPDCQALGRSGGGDAKR-MTCVYAVGYGDDGSTTVGDFIEETLTF---AGGVQVPHMSI 245
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
GCG++ G P GILGLG+G+ S SQ+ + +CL+
Sbjct: 246 GCGHDN-KGLFAAPAAGILGLGRGQISCPSQIAALGYNVTSFSYCLA 291
>gi|424513106|emb|CCO66690.1| predicted protein [Bathycoccus prasinos]
Length = 802
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 96/385 (24%), Positives = 158/385 (41%), Gaps = 60/385 (15%)
Query: 35 SSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVE----AP 90
SS +++G TGY+ T+ IG P + + +DTGS T++ C PC C + AP
Sbjct: 122 SSAGLELNGKARDTGYFYATVLIGTPGHQFEVIVDTGSTYTFVTC-YPCASCGQHGSNAP 180
Query: 91 HPLYRPSN-DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFN 149
+ + S+ + VPC C C+Y+ ++++ G +V D
Sbjct: 181 YDAAKSSSYERVPCGSGCIFGA-------CRASGLCEYDEKFSEDSQVGGHVVSDVID-- 231
Query: 150 YTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL----IRNVVG 205
G PR+ GC + +G++ LG+ ++ + QL + G
Sbjct: 232 -VGGSLGTPRIHFGCNSLETNMLKTQKANGMIALGRAEAGLHRQLKKKAYPPGSYDGTFG 290
Query: 206 HCL-SGGGGGFLFFG---DDLYDS--SRVVWTS----MSSDYTKYYSPGVAELFFGGETT 255
CL S GGG L G + Y + +R TS + ++YY+ V +F T
Sbjct: 291 LCLGSFEGGGVLSLGKLPEQHYANFVTRKTHTSTVKLVKGSKSQYYNVEVHRMFV--RNT 348
Query: 256 GLKN-------------LPVVFDSGSSYTYLNRVTYQTLTSIMKKEL----SAKSLKEAP 298
LK V DSG++YTYL+ + S ++ ++ A +
Sbjct: 349 ELKKPSGAELMEAFRAGYGTVLDSGTTYTYLHEDVFIPFISEIEDKVVNDHGANFFRVRG 408
Query: 299 EDETLP--LCWKGRRPFKNVHD--VKKCFRTLALSFTDGKTRTL-FELTPEAYLIIS-NK 352
D P +CW+ K + + V F T L+F L E PE YL + N+
Sbjct: 409 GDPNYPNDVCWRSLNENKQLSESNVNYLFPTFNLTFIGVNEEELPIEFLPENYLFVHPNE 468
Query: 353 GNV-CLGILNGAEVGLQDLNVIGGI 376
N C+G+ + + G ++IGGI
Sbjct: 469 PNAFCVGVFDNGQQG----SIIGGI 489
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 80/289 (27%), Positives = 127/289 (43%), Gaps = 24/289 (8%)
Query: 16 RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
R+ S + SS +F ++ L G G Y VT+ +G P + + L DTGSD+T
Sbjct: 36 RVDSIHARLSSRGMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDIT 95
Query: 76 WLQCDAPCVR-CVEAPHPLYRPSNDL----VPCEDPICASLHAPGH---HNCEDPAQCDY 127
W QC+ PCV+ C + P PS + C +C L A G +C + C Y
Sbjct: 96 WTQCE-PCVKTCYKQKEPRLNPSTSTSYKNISCSSALC-KLVASGKKFSQSCSS-STCLY 152
Query: 128 ELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGK 187
+++Y DG S+G + + +N + GCG Q + G+LGLG+ K
Sbjct: 153 QVQYGDGSYSIGFFATETLTLSSSN---VFKNFLFGCG--QQNNGLFGGAAGLLGLGRTK 207
Query: 188 SSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGV 245
++ SQ + K + + +CL S G+L G + S + S D T +Y +
Sbjct: 208 LALPSQ--TAKTYKKLFSYCLPASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDI 265
Query: 246 AELFFGGETTGLK----NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELS 290
L GG + + V DSG+ T L+ Y L+S + ++
Sbjct: 266 TGLSVGGRQLSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMT 314
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 74/276 (26%), Positives = 121/276 (43%), Gaps = 30/276 (10%)
Query: 52 NVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY----RPSNDLVPCEDPI 107
N + IG + + +DTGSDLTW+QCD PC+ C P++ S + + C
Sbjct: 132 NYIVTIGLGNQNMTVIIDTGSDLTWVQCD-PCMSCYSQQGPVFNPSNSSSYNSLLCNSST 190
Query: 108 CASLH--APGHHNCE--DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
C +L CE +P+ C++ + Y DG + G L + +F G G
Sbjct: 191 CQNLQFTTGNTEACESNNPSSCNHTVSYGDGSFTDGELGVEHLSF----GGISVSNFVFG 246
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFGD 220
CG N + + GI+GLG+ S++SQ ++ V +CL G G L G+
Sbjct: 247 CGRNN--KGLFGGVSGIMGLGRSNLSMISQTNTT--FGGVFSYCLPTTDSGASGSLVIGN 302
Query: 221 D---LYDSSRVVWTSMSSD--YTKYYSPGVAELFFGG---ETTGLKNLPVVFDSGSSYTY 272
+ + + + +TSM S+ + +Y + + GG + T N ++ DSG+ T
Sbjct: 303 ESSLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGGVAIQDTSFGNGGILIDSGTVITR 362
Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
L Y L + K+ S + AP L C+
Sbjct: 363 LAPSLYNALKAEFLKQFSGYPI--APALSILDTCFN 396
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 65/126 (51%), Gaps = 13/126 (10%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y + IG P R ++ LDTGSD+ W+QC+ PC C P++ PS+ + V C
Sbjct: 5 SGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCE-PCRECYSQADPIFNPSSSVSFSTVGC 63
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+ +C+ L A H C YE+ Y DG ++G + F T+ Q +A+G
Sbjct: 64 DSAVCSQLDANDCHG----GGCLYEVSYGDGSYTVGSYATETLTFGTTSIQ----NVAIG 115
Query: 164 CGYNQV 169
CG++ V
Sbjct: 116 CGHDNV 121
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 97/363 (26%), Positives = 150/363 (41%), Gaps = 58/363 (15%)
Query: 46 YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-----RCVEAPHPLYRPSNDL 100
Y T Y + +G PA+ + + +DTGS+LTW+ C R A S
Sbjct: 79 YGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADES---KSFKT 135
Query: 101 VPCEDPICAS--LHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
V C C ++ C P+ C Y+ YADG ++ GV K+ TNG+
Sbjct: 136 VGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMAR 195
Query: 158 -PRLALGCGYNQVPGASYHPLDGILGLGKGK---SSIVSQLHSQKLIRNVVGHCLSGGGG 213
P +GC + G S+ DG+LGL +S + L+ K +V H +
Sbjct: 196 LPGHLIGCS-SSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVS 254
Query: 214 GFLFFGDDLYDSSRVVWTSMSS----DYTK---YYSPGVAELFFGGETTGLKNLP----- 261
+L FG SSR T+ D T+ +Y+ V + G + + ++P
Sbjct: 255 NYLIFG-----SSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYD---MLDIPSQVWD 306
Query: 262 ------VVFDSGSSYTYLNRVTY-QTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFK 314
+ DSG+S T L Y Q +T + + + K +K PE + C+ F
Sbjct: 307 ATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVK--PEGVPIEYCFSFTSGF- 363
Query: 315 NVHDVKKCFRTLALSF-TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVI 373
NV + + L+F G R FE ++YL+ + G CLG ++ G NVI
Sbjct: 364 NVSKLPQ------LTFHLKGGAR--FEPHRKSYLVDAAPGVKCLGFVSA---GTPATNVI 412
Query: 374 GGI 376
G I
Sbjct: 413 GNI 415
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 83/286 (29%), Positives = 120/286 (41%), Gaps = 35/286 (12%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
+G Y V M +G P + Y + +DTGS +WLQC + C P++ PS VPC
Sbjct: 100 SGNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPC 159
Query: 104 -----EDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
A+L+ P C + C Y+ Y D SLG L +D T Q L+
Sbjct: 160 SSSQCSSLKSATLNEP---TCSKQSNACVYKASYGDSSFSLGYLSQDVLTL--TPSQTLS 214
Query: 158 PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-------SG 210
GCG Q + DGI+GL + S++SQL + N +CL +
Sbjct: 215 -SFVYGCG--QDNQGLFGRTDGIIGLANNELSMLSQLSGK--YGNAFSYCLPTSFSTPNS 269
Query: 211 GGGGFLFFG-DDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLK----NLPVV 263
GFL G L SS +T + + + Y + + G G+ +P +
Sbjct: 270 PKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPTI 329
Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
DSG+ T L Y TL + LS K ++AP L C+KG
Sbjct: 330 IDSGTVITRLPTPVYTTLKNAYVTILS-KKYQQAPGISLLDTCFKG 374
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 102/357 (28%), Positives = 139/357 (38%), Gaps = 60/357 (16%)
Query: 47 PTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY---RPSNDLVPC 103
P Y + + IG P +P L LDTGS L W QC PC C P Y R S +P
Sbjct: 31 PMTEYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQ-PCAVCFNQSLPYYDASRSSTFALPS 89
Query: 104 EDPICASLHAPGHHNC--EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
D L P C + C Y Y D +++G L D ++ G + P +
Sbjct: 90 CDSTQCKLD-PSVTMCVNQTVQTCAYSYSYGDKSATIGFL--DVETVSFVAGASV-PGVV 145
Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG----GFLF 217
GCG N G GI G G+G S+ SQL HC + G LF
Sbjct: 146 FGCGLNNT-GIFRSNETGIAGFGRGPLSLPSQLKVGNF-----SHCFTAVSGRKPSTVLF 199
Query: 218 -FGDDLYDSSRVVWTSMSSDYTKYYS-PGVAELFFGGETTGLKNLPV------------- 262
DLY + R T ++ K + P L G T G LPV
Sbjct: 200 DLPADLYKNGR--GTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGG 257
Query: 263 -VFDSGSSYTYLNRVTYQTLTSIMKKELSAK-SLKEAPEDETLPLCWKGRRPFKNVHDVK 320
+ DSG+++T L Y+ ++ E +A L P +ET PL P V
Sbjct: 258 TIIDSGTAFTSLPPRVYR----LVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVP 313
Query: 321 KCFRTLALSFTDGKTRTLFELTPEAYLIISNKG---NVCLGILNGAEVGLQDLNVIG 374
K L L F +G T L E Y+ + G ++CL I+ G ++ +IG
Sbjct: 314 K----LVLHF-EGAT---MHLPRENYVFEAKDGGNCSICLAIIEG------EMTIIG 356
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 71/248 (28%), Positives = 106/248 (42%), Gaps = 24/248 (9%)
Query: 59 QPARPYFLDLDTGSDLTWLQC-DAPCVRCVEAPHPLYRPS----NDLVPCEDPICASL-- 111
+P + LDT SD+ W+QC P +C LY PS ++ C P C L
Sbjct: 177 RPGVRQLMLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQLGP 236
Query: 112 HAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVP 170
+A G + + A QC Y + Y DG ++ G LV D + + T+ P+ GC +
Sbjct: 237 YANGCSSSSNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTS---QVPKFEFGCSHAARG 293
Query: 171 GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRV 228
S GI+ LG+G S+VSQ ++ V +C + GF G SSR
Sbjct: 294 SFSRSKTAGIMALGRGVQSLVSQTSTK--YGQVFSYCFPPTASHKGFFVLGVPRRSSSRY 351
Query: 229 VWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGS---SYTYLNRV---TYQTLT 282
T M Y + + G+ L P VF +G+ S T + R+ YQ L
Sbjct: 352 AVTPMLKT-PMLYQVRLEAIAVAGQR--LDVPPTVFAAGAALDSRTVITRLPPTAYQALR 408
Query: 283 SIMKKELS 290
S + ++S
Sbjct: 409 SAFRDKMS 416
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 106/388 (27%), Positives = 151/388 (38%), Gaps = 60/388 (15%)
Query: 16 RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
RM+ S + + L + + + + + P Y + + IG P +P L LDTGS L
Sbjct: 56 RMALRSKARAPRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGSVLV 115
Query: 76 WLQCDAPCVRCVEAPHPLY---RPSNDLVPCEDPICASLHAPGHHNC--EDPAQCDYELE 130
W QC PC C P Y R S +P D L P C + C Y
Sbjct: 116 WTQCQ-PCAVCFNQSLPYYDASRSSTFALPSCDSTQCKLD-PSVTMCVNQTVQTCAYSYS 173
Query: 131 YADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSI 190
Y D +++G L D ++ G + P + GCG N G GI G G+G S+
Sbjct: 174 YGDKSATIGFL--DVETVSFVAGASV-PGVVFGCGLNNT-GIFRSNETGIAGFGRGPLSL 229
Query: 191 VSQLHSQKLIRNVVGHCLSGGGG----GFLF-FGDDLYDSSRVVWTSMSSDYTKYYS-PG 244
SQL HC + G LF DLY + R T ++ K + P
Sbjct: 230 PSQLKVGNF-----SHCFTAVSGRKPSTVLFDLPADLYKNGR--GTVQTTPLIKNPAHPT 282
Query: 245 VAELFFGGETTGLKNLPV--------------VFDSGSSYTYLNRVTYQTLTSIMKKELS 290
L G T G LPV + DSG+++T L Y+ ++ E +
Sbjct: 283 FYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYR----LVHDEFA 338
Query: 291 AK-SLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLII 349
A L P +ET PL P V K L L F +G T L E Y+
Sbjct: 339 AHVKLPVVPSNETGPLLCFSAPPLGKAPHVPK----LVLHF-EGAT---MHLPRENYVFE 390
Query: 350 SNKG---NVCLGILNGAEVGLQDLNVIG 374
+ G ++CL I+ G ++ +IG
Sbjct: 391 AKDGGNCSICLAIIEG------EMTIIG 412
>gi|356538031|ref|XP_003537508.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 521
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 84/291 (28%), Positives = 125/291 (42%), Gaps = 33/291 (11%)
Query: 37 LLFQVHGNVYPT-----GYYNVTMY-IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP 90
LLF HG+ + G+ + T IG P+ + + LD GSDL W+ CD CV+C
Sbjct: 77 LLFPSHGSKTMSLGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWIPCD--CVQCAPLS 134
Query: 91 HPLY----RPSNDLVPCEDPICASLHAPGHH-------NCEDPAQ-CDYELEY-ADGGSS 137
Y R N+ P +S H H NC+ Q C Y + Y ++ SS
Sbjct: 135 SSYYSNLDRDLNEYSPSRS--LSSKHLSCSHRLCDKGSNCKSSQQQCPYMVSYLSENTSS 192
Query: 138 LGVLVKDAFAFN---YTNGQRLNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVS 192
G+LV+D + + + LGCG Q G P DG+LGLG G+SS+ S
Sbjct: 193 SGLLVEDILHLQSGGTLSNSSVQAPVVLGCGMKQSGGYLDGVAP-DGLLGLGPGESSVPS 251
Query: 193 QLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSR-VVWTSMSSDYTKYYSPGVAELFFG 251
L LI C + G +FFGD S + + + Y+ Y GV G
Sbjct: 252 FLAKSGLIHYSFSLCFNEDDSGRMFFGDQGPTSQQSTSFLPLDGLYSTYII-GVESCCIG 310
Query: 252 GETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKEL--SAKSLKEAPED 300
+ + DSG+S+T+L Y +T +++ S S + +P +
Sbjct: 311 NSCLKMTSFKAQVDSGTSFTFLPGHVYGAITEEFDQQVNGSRSSFEGSPWE 361
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 103/391 (26%), Positives = 160/391 (40%), Gaps = 63/391 (16%)
Query: 16 RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
R +S SSS +L + +++ G +G Y + +Y+G P R + + +DTGSDL
Sbjct: 119 RTPASPSSSPRRALSERMVATV---ESGVAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLN 175
Query: 76 WLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPG-HHNCEDPAQ--CDYE 128
WLQC APC+ C + P++ P+ V C D C + P C P + C Y
Sbjct: 176 WLQC-APCLDCFDQVGPVFDPAASSSYRNVTCGDQRCGLVAPPEPPRACRRPGEDSCPYY 234
Query: 129 LEYADGGSSLGVLVKDAFAFNYT--NGQRLNPRLALGCG-YNQVPGASYHPLDGILGLGK 185
Y D ++ G L ++F N T R + GCG +N+ +H G+LGLG+
Sbjct: 235 YWYGDQSNTTGDLALESFTVNLTAPGASRRVDDVVFGCGHWNR---GLFHGAAGLLGLGR 291
Query: 186 GKSSIVSQLHSQKLIRNVVGH----CLSGGGGGF---LFFGDDLYDSSR--------VVW 230
G S SQL R V GH CL G + FG+D + +
Sbjct: 292 GPLSFASQL------RAVYGHTFSYCLVDHGSDVASKVVFGEDDALALAAAHPQLNYTAF 345
Query: 231 TSMSSDYTKYYSPGVAELFFGGETTGLKN------------LPVVFDSGSSYTYLNRVTY 278
SS +Y + + GGE + + + DSG++ +Y Y
Sbjct: 346 APASSPADTFYYVKLKGVLVGGELLNISSDTWGVGEGEGGSGGTIIDSGTTLSYFVEPAY 405
Query: 279 QTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC-FRTLALSFTDGKTRT 337
Q + + +S P+ L C+ NV V + L+L F DG
Sbjct: 406 QVIRQAFIDRM-GRSYPLIPDFPVLSPCY-------NVSGVDRPEVPELSLLFADG---A 454
Query: 338 LFELTPEAYLI-ISNKGNVCLGILNGAEVGL 367
+++ E Y I + G +CL +L G+
Sbjct: 455 VWDFPAENYFIRLDPDGIMCLAVLGTPRTGM 485
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 83/286 (29%), Positives = 120/286 (41%), Gaps = 35/286 (12%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
+G Y V M +G P + Y + +DTGS +WLQC + C P++ PS VPC
Sbjct: 100 SGNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPC 159
Query: 104 -----EDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
A+L+ P C + C Y+ Y D SLG L +D T Q L+
Sbjct: 160 SSSQCSSLKSATLNEP---TCSKQSNACVYKASYGDSSFSLGYLSQDVLTL--TPSQTLS 214
Query: 158 PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-------SG 210
GCG Q + DGI+GL + S++SQL + N +CL +
Sbjct: 215 -SFVYGCG--QDNQGLFGRTDGIIGLANNELSMLSQLSGK--YGNAFSYCLPTSFSTPNS 269
Query: 211 GGGGFLFFG-DDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLK----NLPVV 263
GFL G L SS +T + + + Y + + G G+ +P +
Sbjct: 270 PKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPTI 329
Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
DSG+ T L Y TL + LS K ++AP L C+KG
Sbjct: 330 IDSGTVITRLPTPVYTTLKNAYVTILS-KKYQQAPGISLLDTCFKG 374
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 97/363 (26%), Positives = 150/363 (41%), Gaps = 58/363 (15%)
Query: 46 YPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-----RCVEAPHPLYRPSNDL 100
Y T Y + +G PA+ + + +DTGS+LTW+ C R A S
Sbjct: 101 YGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADES---KSFKT 157
Query: 101 VPCEDPICAS--LHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
V C C ++ C P+ C Y+ YADG ++ GV K+ TNG+
Sbjct: 158 VGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRMAR 217
Query: 158 -PRLALGCGYNQVPGASYHPLDGILGLGKGK---SSIVSQLHSQKLIRNVVGHCLSGGGG 213
P +GC + G S+ DG+LGL +S + L+ K +V H +
Sbjct: 218 LPGHLIGCS-SSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVS 276
Query: 214 GFLFFGDDLYDSSRVVWTSMSS----DYTK---YYSPGVAELFFGGETTGLKNLP----- 261
+L FG SSR T+ D T+ +Y+ V + G + + ++P
Sbjct: 277 NYLIFG-----SSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYD---MLDIPSQVWD 328
Query: 262 ------VVFDSGSSYTYLNRVTY-QTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFK 314
+ DSG+S T L Y Q +T + + + K +K PE + C+ F
Sbjct: 329 ATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVK--PEGVPIEYCFSFTSGF- 385
Query: 315 NVHDVKKCFRTLALSF-TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVI 373
NV + + L+F G R FE ++YL+ + G CLG ++ G NVI
Sbjct: 386 NVSKLPQ------LTFHLKGGAR--FEPHRKSYLVDAAPGVKCLGFVSA---GTPATNVI 434
Query: 374 GGI 376
G I
Sbjct: 435 GNI 437
>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 65/192 (33%), Positives = 91/192 (47%), Gaps = 16/192 (8%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSNDL----VP 102
+G Y VT+ +G P R DTGSDLTW QC+ PCV C + ++ PS L V
Sbjct: 86 SGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCE-PCVGYCYQQREHIFDPSTSLSYSNVS 144
Query: 103 CEDPICASLH-APGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
C+ P C L A G+ + C Y + Y DG S+G ++ + T+ +
Sbjct: 145 CDSPSCEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTD---VFNNFQ 201
Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFG 219
GCG N + G+LGL + S+VSQ +QK + V +CL S G+L FG
Sbjct: 202 FGCGQNNR--GLFGGTAGLLGLARNPLSLVSQT-AQKYGK-VFSYCLPSSSSSTGYLSFG 257
Query: 220 DDLYDSSRVVWT 231
DS V +T
Sbjct: 258 SGDGDSKAVKFT 269
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 110/403 (27%), Positives = 164/403 (40%), Gaps = 65/403 (16%)
Query: 15 VRMSSSSSSSSSSSLFNHVGSSLLFQVH--------------GNVYPTGYYNVTMYIGQP 60
V+ + S SS++ SLF + S+ +FQ H G + G Y ++ +G P
Sbjct: 54 VKANPSPSSAAQKSLFPY--SAHIFQQHTKNPAALRSSTTTLGRKF--GEYYTSIKLGSP 109
Query: 61 ARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCED-PICASLHAPG 115
+ L +DTGS+LTWLQC PC C + +Y + V C + +C++
Sbjct: 110 GQEAILIVDTGSELTWLQC-LPCKVCAPSVDTIYDAARSASYRPVTCNNSQLCSNSSQGT 168
Query: 116 HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR--LNPRLALGCGYNQ---VP 170
+ C +QC + Y DG S G L D G + A GC VP
Sbjct: 169 YAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAFGCAQGDLELVP 228
Query: 171 -GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-----GGGGFLFFGDDLYD 224
GAS GILGL GK ++ QL + + HC G +FFG+
Sbjct: 229 TGAS-----GILGLNAGKMALPMQLGQRFGWK--FSHCFPDRSSHLNSTGVVFFGNAELP 281
Query: 225 SSRVVWTSM----SSDYTKYYSPGVAELFFGGETTGLKNLP----VVFDSGSSYTYLNRV 276
+V +TS+ S K+Y + + L LP V+ DSGSS++ R
Sbjct: 282 HEQVQYTSVALTNSELQRKFYHVALKGVSINSHE--LVFLPRGSVVILDSGSSFSSFVRP 339
Query: 277 TYQTLTSIMKKELSAKSLKEAPEDE--TLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGK 334
+ L K SLK D L C+K ++ ++ + +L+L F DG
Sbjct: 340 FHSQLREAFLKH-RPPSLKHLEGDSFGDLGTCFKVSN--DDIDELHRTLPSLSLVFEDGV 396
Query: 335 T---RTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
T ++ L P A N +C +G G +NVIG
Sbjct: 397 TIGIPSIGVLLPVARF--QNHVKMCFAFEDG---GPNPVNVIG 434
>gi|242050026|ref|XP_002462757.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
gi|241926134|gb|EER99278.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
Length = 523
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 75/270 (27%), Positives = 113/270 (41%), Gaps = 37/270 (13%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYR------------PSNDLVPCE 104
+G P + + LDTGSDL W+ CD C+ C P YR ++ VPC
Sbjct: 110 LGTPNVTFLVALDTGSDLFWVPCD--CINCAPLVSPNYRDLKFDTYSPQKSSTSRKVPCS 167
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEY-ADGGSSLGVLVKDAFAF--NYTNGQRLNPRLA 161
+C A + P Y +EY +D SS GVLV+D Y + + +
Sbjct: 168 SNLCDLQSACRSASSSCP----YSIEYLSDNTSSTGVLVEDVLYLITEYGQPKIVTAPIT 223
Query: 162 LGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFG 219
GCG Q S P +G+LGLG S+ S L S+ + N C G G + FG
Sbjct: 224 FGCGRIQTGSFLGSAAP-NGLLGLGMDSISVPSLLASEGVAANSFSMCFGDDGRGRINFG 282
Query: 220 D----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNR 275
D D ++ ++ YY+ + G ++ N + DSG+S+T L+
Sbjct: 283 DTGSSDQQETPLNIYKQ-----NPYYNISITGAMVGSKSFN-TNFNAIVDSGTSFTALSD 336
Query: 276 VTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
Y +TS ++ K + D +LP
Sbjct: 337 PMYSEITSSFNSQVQDKPTQ---LDSSLPF 363
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 76/262 (29%), Positives = 110/262 (41%), Gaps = 36/262 (13%)
Query: 41 VHGNVYPT---GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRP 96
V V PT G + +T+ IG P P+ DTGSDL W QC APC R C + P PLY P
Sbjct: 72 VSAPVSPTTVPGEFLMTLAIGTPPLPFLAIADTGSDLIWTQC-APCSRQCFQQPTPLYNP 130
Query: 97 SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN--GQ 154
S+ P +SL C C Y + Y G + + + F F + Q
Sbjct: 131 SSSTTFSALPCNSSLGL-----CAPACACMYNMTYGSGWTYV-FQGTETFTFGSSTPADQ 184
Query: 155 RLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
P +A GC N G + G++GLG+G S+VSQL + K + + +
Sbjct: 185 VRVPGIAFGCS-NASSGFNASSASGLVGLGRGSLSLVSQLGAPKFSYCLTPYQDTNSTST 243
Query: 215 FLFFGDDLYDSSRVVWTS--MSSDYTKYYSPGVAELFFGGETTGLKNLPV---------- 262
L + + VV ++ ++S + YY L G + G LP+
Sbjct: 244 LLLGPSASLNDTGVVSSTPFVASPSSIYY-----YLNLTGISLGTTALPIPPNAFSLKAD 298
Query: 263 -----VFDSGSSYTYLNRVTYQ 279
+ DSG++ T L YQ
Sbjct: 299 GTGGLIIDSGTTITMLGNTAYQ 320
>gi|225431324|ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
gi|297739017|emb|CBI28369.3| unnamed protein product [Vitis vinifera]
Length = 518
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 85/348 (24%), Positives = 149/348 (42%), Gaps = 49/348 (14%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL------------ 100
T+ +G P + + + LDTGSDL W+ CD C RC Y +L
Sbjct: 105 TTVSLGTPGKKFLVALDTGSDLFWVPCD--CSRCAPTEGTTYASDFELSIYNPKGSSTSR 162
Query: 101 -VPCEDPICASLHAPGHHN-CEDP-AQCDYELEYADGGSSL-GVLVKDAFAFNYTNGQR- 155
V C++ +CA H N C + C Y + Y +S G+LV+D + ++
Sbjct: 163 KVTCDNSLCA------HRNRCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTTEDNRQE 216
Query: 156 -LNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
+ + GCG QV S+ + +G+ GLG K S+ S L + + C
Sbjct: 217 FVEAYVTFGCG--QVQTGSFLDIAAPNGLFGLGLEKISVPSILSKEGFTADSFSMCFGPD 274
Query: 212 GGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYT 271
G G + FGD ++++ + Y+ V ++ G L + +FDSG+S+T
Sbjct: 275 GIGRISFGDKGSPDQEETPFNLNALHPT-YNITVTQVRVGTTLIDL-DFTALFDSGTSFT 332
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP--LCWKGRRPFKNVHDVKKCFRTLALS 329
YL Y T+++K S P D +P C+ P +N + +++L+
Sbjct: 333 YLVDPIY---TNVLKSFHSQAQDSRRPPDSRIPFEFCYD-MSPGENTSLIP----SMSLT 384
Query: 330 FTDGKTRTLFELTPEAYLIISNKGNV--CLGILNGAEVGLQDLNVIGG 375
G ++ + +IIS++ + C+ ++ AE+ + N + G
Sbjct: 385 MKGGSQFPVY----DPIIIISSQSELIYCMAVVRSAELNIIGQNFMTG 428
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 66/131 (50%), Gaps = 13/131 (9%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN-- 98
+ G +G Y + IG+PAR ++ LDTGSD+ WLQC PC C P++ PS+
Sbjct: 138 ISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQC-TPCADCYHQTEPIFEPSSSS 196
Query: 99 --DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
+ + C+ P C +L N A C YE+ Y DG ++G + T G L
Sbjct: 197 SYEPLSCDTPQCNALEVSECRN----ATCLYEVSYGDGSYTVGDFATETL----TIGSTL 248
Query: 157 NPRLALGCGYN 167
+A+GCG++
Sbjct: 249 VQNVAVGCGHS 259
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 81/291 (27%), Positives = 126/291 (43%), Gaps = 27/291 (9%)
Query: 27 SSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC 86
+S H+ SS+ F + + Y V + IG P + L DTGS L W QC PC C
Sbjct: 109 TSSVEHMKSSVPFYGLSKITASDYI-VNVGIGTPKKEMPLIFDTGSGLIWTQCK-PCKAC 166
Query: 87 VEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLV 142
P++ P+ +PC +C S+ C P +C Y Y D SS G L
Sbjct: 167 YPK-VPVFDPTKSASFKGLPCSSKLCQSI----RQGCSSP-KCTYLTAYVDNSSSTGTLA 220
Query: 143 KDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRN 202
+ +F++ N + +GC +QV G S GI+GL + S+ SQ + +
Sbjct: 221 TETISFSHLKYDFKN--ILIGCS-DQVSGESLGE-SGIMGLNRSPISLASQ--TANIYDK 274
Query: 203 VVGHCL--SGGGGGFLFFGDDLYDSSR---VVWTSMSSDY-TKYYSPGVAELFFGGETTG 256
+ +C+ + G G L FG + + R V T+ SSDY K V + +
Sbjct: 275 LFSYCIPSTPGSTGHLTFGGKVPNDVRFSPVSKTAPSSDYDIKMTGISVGGRKLLIDASA 334
Query: 257 LKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
K + DSG+ T L Y L S+ ++ + L + +D+ L C+
Sbjct: 335 FK-IASTIDSGAVLTRLPPKAYSALRSVFREMMKGYPLLD--QDDFLDTCY 382
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 83/345 (24%), Positives = 139/345 (40%), Gaps = 62/345 (17%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDP 106
Y + + IG P + + DTGSDL W QC PC +C + +P++ P S + C
Sbjct: 60 YLMELSIGTPPIKIYAEADTGSDLVWFQC-IPCTKCYKQQNPMFDPRSSSSYTNITCGTE 118
Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALGCG 165
C L + D C+Y YAD + GVL ++ T G+ + + + GCG
Sbjct: 119 SCNKLDS--SLCSTDQKTCNYTYSYADNSITQGVLAQETLTLTSTTGEPVAFQGIIFGCG 176
Query: 166 YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF---------- 215
+N G + + G++GLG+G S++SQ +G L GG F
Sbjct: 177 HNN-SGFNDREM-GLIGLGRGPLSLISQ----------IGSSLGAGGNMFSQCLVPFNTD 224
Query: 216 ------LFF--GDDLYDSSRVVWTSMSSDYTKYYSP----GVAEL---FFGGETTG-LKN 259
+ F G ++ + V +S D T Y++ V ++ F G + G +
Sbjct: 225 PSITSQMNFGKGSEVLGNGTVSTPLISKDGTGYFATLLGISVEDINLPFSNGSSLGTITK 284
Query: 260 LPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
++ DSG++ TYL Y L ++ +++ + + + LC++
Sbjct: 285 GNILIDSGTTITYLPEEFYHRLIEQVRNKVALEPFRI----DGYELCYQTPTNLNG---- 336
Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
TL + F G LTP I N C + + E
Sbjct: 337 ----PTLTIHFEGGDVL----LTPAQMFIPVQDDNFCFAVFDTNE 373
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 127/280 (45%), Gaps = 28/280 (10%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G Y ++ +G P + +DTGSD+ WLQC+ PC +C P + PS + C
Sbjct: 85 GDYIMSYSVGTPPIKSYGIVDTGSDIVWLQCE-PCEQCYNQTTPKFNPSKSSSYKNISCS 143
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
+C S+ +C D C+Y + Y + S G L + T G+ ++ P+ +G
Sbjct: 144 SKLCQSVR---DTSCNDKKNCEYSINYGNQSHSQGDLSLETLTLESTTGRPVSFPKTVIG 200
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQL-------HSQKLIRNVVGHCLSGGGGGFL 216
CG N + G+ G++GLG G +S+++QL S L+R + G L
Sbjct: 201 CGTNNI-GSFKRVSSGVVGLGGGPASLITQLGPSIGGKFSYCLVRMSITLKNMSMGSSKL 259
Query: 217 FFGDDLYDSSRVVWTS--MSSDYTKYY-------SPGVAELFFGGETTGLKNLPVVFDSG 267
FGD S V ++ + D++ +Y S G + F G + G++ ++ DS
Sbjct: 260 NFGDVAIVSGHNVLSTPIVKKDHSFFYYLTIEAFSVGDKRVEFAGSSKGVEEGNIIIDSS 319
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
+ T++ Y L S + ++ + + + ++ LC+
Sbjct: 320 TIVTFVPSDVYTKLNSAIVDLVTLERVDDP--NQQFSLCY 357
>gi|115460260|ref|NP_001053730.1| Os04g0595000 [Oryza sativa Japonica Group]
gi|113565301|dbj|BAF15644.1| Os04g0595000, partial [Oryza sativa Japonica Group]
Length = 471
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 59/174 (33%), Positives = 78/174 (44%), Gaps = 13/174 (7%)
Query: 37 LLFQVHGNVYPTGYYNVTMYIGQPA---RPYFLDLDTGSDLTWLQCDAPCVRCVE-APHP 92
LL ++G Y V + IG P P ++ DTGSDL+W QC+ PC C P+P
Sbjct: 90 LLVPLYGRPQGGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCE-PCTNCSSFTPYP 148
Query: 93 LYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
+ PS + C DP+C L A C + Y DGG+ G LV D F F
Sbjct: 149 PHDPSKSRTFRRLSCFDPMC-ELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHF 207
Query: 149 NYT---NGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
G +L +A GC + + A GIL LG GK S V+QL +
Sbjct: 208 GAAGDGGGYQLERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVDRF 261
>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
Length = 488
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 59/169 (34%), Positives = 77/169 (45%), Gaps = 13/169 (7%)
Query: 37 LLFQVHGNVYPTGYYNVTMYIGQPA---RPYFLDLDTGSDLTWLQCDAPCVRCVE-APHP 92
LL ++G Y V + IG P P ++ DTGSDL+W QC+ PC C P+P
Sbjct: 109 LLVPLYGRPQGGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCE-PCTNCSSFTPYP 167
Query: 93 LYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
+ PS + C DP+C L A C + Y DGG+ G LV D F F
Sbjct: 168 PHDPSKSRTFRRLSCFDPMC-ELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHF 226
Query: 149 NYT---NGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQL 194
G +L +A GC + + A GIL LG GK S V+QL
Sbjct: 227 GAAGDGGGYQLERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQL 275
>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
Length = 489
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 59/169 (34%), Positives = 77/169 (45%), Gaps = 13/169 (7%)
Query: 37 LLFQVHGNVYPTGYYNVTMYIGQPA---RPYFLDLDTGSDLTWLQCDAPCVRCVE-APHP 92
LL ++G Y V + IG P P ++ DTGSDL+W QC+ PC C P+P
Sbjct: 108 LLVPLYGRPQGGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCE-PCTNCSSFTPYP 166
Query: 93 LYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
+ PS + C DP+C L A C + Y DGG+ G LV D F F
Sbjct: 167 PHDPSKSRTFRRLSCFDPMC-ELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHF 225
Query: 149 NYT---NGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQL 194
G +L +A GC + + A GIL LG GK S V+QL
Sbjct: 226 GAAGDGGGYQLERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQL 274
>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
Length = 467
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 59/174 (33%), Positives = 78/174 (44%), Gaps = 13/174 (7%)
Query: 37 LLFQVHGNVYPTGYYNVTMYIGQPA---RPYFLDLDTGSDLTWLQCDAPCVRCVE-APHP 92
LL ++G Y V + IG P P ++ DTGSDL+W QC+ PC C P+P
Sbjct: 88 LLVPLYGRPQGGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCE-PCTNCSSFTPYP 146
Query: 93 LYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
+ PS + C DP+C L A C + Y DGG+ G LV D F F
Sbjct: 147 PHDPSKSRTFRRLSCFDPMC-ELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHF 205
Query: 149 NYT---NGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
G +L +A GC + + A GIL LG GK S V+QL +
Sbjct: 206 GAAGDGGGYQLERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVDRF 259
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 90/364 (24%), Positives = 143/364 (39%), Gaps = 55/364 (15%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD---APCVRCVEAPHPLYR--PSNDLVP 102
TG Y V +G PA+P+ L DTGSDLTW++C A +P ++R S P
Sbjct: 98 TGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGAGSPARVFRTAASKSWAP 157
Query: 103 --CEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
C C S NC PA C Y+ Y DG ++ GV+ D+ ++G
Sbjct: 158 IACSSDTCTSYVPFSLANCSSPASPCAYDYRYRDGSAARGVVGTDSATIALSSGSGRGGG 217
Query: 160 ------------LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVV 204
+ LGC G S+ DG+L LG S S+ ++ + +V
Sbjct: 218 DSSGGRRAKLQGVVLGCAAT-YDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLV 276
Query: 205 GHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL------- 257
H +L FG + + T +Y+ V ++ GE +
Sbjct: 277 DHLAPRNATSYLTFGPGATAPAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDIPADVWDV 336
Query: 258 -KNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNV 316
+N + DSG+S T L Y+ + + + K L+ L D PF+
Sbjct: 337 DRNGGAILDSGTSLTILATPAYRAVVTALSKHLAG--LPRVTMD-----------PFEYC 383
Query: 317 HDVKKC----FRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNV 372
++ + + F G R E ++Y+I + G C+G+ G+ G ++V
Sbjct: 384 YNWTDAGALEIPKMEVHFA-GSAR--LEPPAKSYVIDAAPGVKCIGVQEGSWPG---VSV 437
Query: 373 IGGI 376
IG I
Sbjct: 438 IGNI 441
>gi|222629462|gb|EEE61594.1| hypothetical protein OsJ_16002 [Oryza sativa Japonica Group]
Length = 468
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 59/174 (33%), Positives = 78/174 (44%), Gaps = 13/174 (7%)
Query: 37 LLFQVHGNVYPTGYYNVTMYIGQPA---RPYFLDLDTGSDLTWLQCDAPCVRCVE-APHP 92
LL ++G Y V + IG P P ++ DTGSDL+W QC+ PC C P+P
Sbjct: 87 LLVPLYGRPQGGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCE-PCTNCSSFTPYP 145
Query: 93 LYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
+ PS + C DP+C L A C + Y DGG+ G LV D F F
Sbjct: 146 PHDPSKSRTFRRLSCFDPMC-ELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHF 204
Query: 149 NYT---NGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
G +L +A GC + + A GIL LG GK S V+QL +
Sbjct: 205 GAAGDGGGYQLERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVDRF 258
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 97/364 (26%), Positives = 141/364 (38%), Gaps = 61/364 (16%)
Query: 21 SSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD 80
SS+S SL H G L T Y V++ +G P R + DTGSDL+W+QC
Sbjct: 167 SSASKGVSLPAHRGLRL---------GTANYIVSVGLGTPRRDLLVVFDTGSDLSWVQCK 217
Query: 81 APCVRCVEAPHPLYRPSN----DLVPCEDPICASLHAPGHHNCED-----PAQCDYELEY 131
PC C + PL+ PS VPC G C D +C YE+ Y
Sbjct: 218 -PCNNCYKQHDPLFDPSQSTTYSAVPC-----------GAQECLDSGTCSSGKCRYEVVY 265
Query: 132 ADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIV 191
D + G L +D ++ Q GCG + + DG+ GLG+ + S+
Sbjct: 266 GDMSQTDGNLARDTLTLGPSSDQLQG--FVFGCGDDDT--GLFGRADGLFGLGRDRVSLA 321
Query: 192 SQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSM--SSDYTKYYSPGVAE 247
SQ ++ +CL S G+L G +T+M SD +Y +
Sbjct: 322 SQAAAR--YGAGFSYCLPSSWRAEGYLSLG-SAAAPPHAQFTAMVTRSDTPSFYYLDLVG 378
Query: 248 LFFGGETTGLKNLPVVF-------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPED 300
+ G T ++ P VF DSG+ T L Y L S + + K AP
Sbjct: 379 IKVAGRT--VRVAPAVFKAPGTVIDSGTVITRLPSRAYSALRSSFAGFM--RRYKRAPAL 434
Query: 301 ETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGIL 360
L C+ F V+ ++AL F G T L L ++N+ CL
Sbjct: 435 SILDTCYD----FTGRTKVQ--IPSVALLFDGGAT---LNLGFGGVLYVANRSQACLAFA 485
Query: 361 NGAE 364
+ +
Sbjct: 486 SNGD 489
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 95/360 (26%), Positives = 138/360 (38%), Gaps = 59/360 (16%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH---PLYRPSND----LVPC 103
+++T+ IG P +P L +DTGSDL W QC V A H P+Y P +PC
Sbjct: 91 HSLTVGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESSTFAFLPC 150
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
D +C NC +C YE Y +++GVL + F F L RL G
Sbjct: 151 SDRLCQEGQF-SFKNCTSKNRCVYEDVYGS-AAAVGVLASETFTFGARRAVSL--RLGFG 206
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFGD 220
CG + S GILGL S+++QL Q+ +CL+ L FG
Sbjct: 207 CG--ALSAGSLIGATGILGLSPESLSLITQLKIQRF-----SYCLTPFADKKTSPLLFG- 258
Query: 221 DLYDSSR------VVWTSMSSDYTK---YYSPGVAELFFGGETTGLKNLPV--------- 262
+ D SR + T++ S+ K YY P V G + G K L V
Sbjct: 259 AMADLSRHKTTRPIQTTAIVSNPVKTVYYYVPLV------GISLGHKRLAVPAASLAMRP 312
Query: 263 ------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNV 316
+ DSGS+ YL ++ + + + ED L R +
Sbjct: 313 DGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVEDYELCFVLPRRTAAAAM 372
Query: 317 HDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
V+ L L F G L + Y G +CL + G +++IG +
Sbjct: 373 EAVQ--VPPLVLHFDGGAAMVLPR---DNYFQEPRAGLMCLAV--GKTTDGSGVSIIGNV 425
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 89/357 (24%), Positives = 143/357 (40%), Gaps = 53/357 (14%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSND----LVPC 103
G Y + + IG P PY DTGSDL W QC APC +C P PLY PS+ ++PC
Sbjct: 90 GEYLMALAIGTPPLPYQAIADTGSDLIWTQC-APCTSQCFRQPTPLYNPSSSTTFAVLPC 148
Query: 104 ED--PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT-NGQRLNPRL 160
+CA+ A C Y + Y G +S+ + F F T G P +
Sbjct: 149 NSSLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSV-FQGSETFTFGSTPAGHARVPGI 207
Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS----GGGGGFL 216
A GC G + G++GLG+G+ S+VSQL K +CL+ L
Sbjct: 208 AFGCS-TASSGFNASSASGLVGLGRGRLSLVSQLGVPKF-----SYCLTPYQDTNSTSTL 261
Query: 217 FFGDDL-------YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-------- 261
G S+ V + ++ +Y + + G TT L P
Sbjct: 262 LLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLG--TTALSIPPDAFSLNAD 319
Query: 262 ----VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
++ DSG++ T L YQ + + + ++ + + D L LC+ +
Sbjct: 320 GTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPT-TDGSADTGLDLCFM----LPSST 374
Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
++ L F L ++Y++ + G CL + N + ++N++G
Sbjct: 375 SAPPAMPSMTLHFNGAD----MVLPADSYMMSDDSGLWCLAMQNQTD---GEVNILG 424
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 92/355 (25%), Positives = 148/355 (41%), Gaps = 48/355 (13%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND- 99
V G +G Y + +G PA+ +L LDTGSD+ W+QC+ PC C + P++ P++
Sbjct: 152 VSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCE-PCADCYQQSDPVFNPTSSS 210
Query: 100 ---LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
+ C P C+ L + +C Y++ Y DG ++G L D F N ++
Sbjct: 211 TYKSLTCSAPQCSLLETSACRS----NKCLYQVSYGDGSFTVGELATDTVTFG--NSGKI 264
Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGG 212
N +ALGCG++ + G+LGLG G SI +Q+ + +CL SG
Sbjct: 265 N-NVALGCGHDN--EGLFTGAAGLLGLGGGVLSITNQMKATSF-----SYCLVDRDSGKS 316
Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----------V 262
F L + +Y G++ GGE L + V
Sbjct: 317 SSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGV 376
Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
+ D G++ T L Y +L K L+ K + C+ F ++ VK
Sbjct: 377 ILDCGTAVTRLQTQAYNSLRDAFLK-LTVNLKKGSSSISLFDTCYD----FSSLSTVK-- 429
Query: 323 FRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
T+A FT GK+ +L + YLI + + G C + L++IG +
Sbjct: 430 VPTVAFHFTGGKS---LDLPAKNYLIPVDDSGTFCFAFAPTSS----SLSIIGNV 477
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 88/361 (24%), Positives = 143/361 (39%), Gaps = 61/361 (16%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G Y + + IG P P DTGSDLTWLQ PC +C P++ PSN +PC
Sbjct: 78 GEYMMNLSIGTPPFPILAIADTGSDLTWLQ-SKPCDQCYPQKGPIFDPSNSTTFHKLPCT 136
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
C +L +C DP C Y Y D + G L D + Q N +A GC
Sbjct: 137 TAPCNALDESA-RSCTDPTTCGYTYSYGDHSYTTGYLASDTVTVGNASVQIRN--VAFGC 193
Query: 165 GYNQVPGASYHPLDGILGLGKGKS-SIVSQLHSQKLIRNVVGHCL------------SGG 211
G G ++ + G + S VSQL I +CL
Sbjct: 194 GTRN--GGNFDEQGSGIVGLGGGNLSFVSQLGDT--IGKKFSYCLLPLENEISSQPSDSP 249
Query: 212 GGGFLFFGDD-LYDSSR---VVWTS---MSSDYTKYYSPGVAELFFG------------- 251
+ FGD+ ++ SS VV+ + ++ + + YY + + G
Sbjct: 250 ATSRIVFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRKKLLYSSSSSKT 309
Query: 252 -----GETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLC 306
G + ++ ++ DSG++ T+L Y L + + +E+ + + + ++ LC
Sbjct: 310 ASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMERVNDV-KNSMFSLC 368
Query: 307 WKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVG 366
+K + + +K FR A EL P + + +G VC +L +VG
Sbjct: 369 FKSGKEEVELPLMKVHFRGGA----------DVELKPVNTFVRAEEGLVCFTMLPTNDVG 418
Query: 367 L 367
+
Sbjct: 419 I 419
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 76/286 (26%), Positives = 122/286 (42%), Gaps = 42/286 (14%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G Y ++ +G P P + +DT SD+ W+QC C C P++ PS +PC
Sbjct: 86 GDYLMSYSLGTPPFPVYGIVDTASDIIWVQCQL-CETCYNDTSPMFDPSYSKTYKNLPCS 144
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
C S+ + ++ C++ + Y DG S G L+ + N ++ PR +G
Sbjct: 145 STTCKSVQGTSCSS-DERKICEHTVNYKDGSHSQGDLIVETVTLGSYNDPFVHFPRTVIG 203
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS--GGGGGFLFFGD- 220
C N S+ + GI+GLG G S+V QL S I +CL+ L FGD
Sbjct: 204 CIRNT--NVSFDSI-GIVGLGGGPVSLVPQLSSS--ISKKFSYCLAPISDRSSKLKFGDA 258
Query: 221 -----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP--------VVFDSG 267
D S+R+V+ D+ K+Y + G ++ ++ DSG
Sbjct: 259 AMVSGDGTVSTRIVF----KDWKKFYYLTLEAFSVGNNRIEFRSSSSRSSGKGNIIIDSG 314
Query: 268 SSYTYLNRVTYQTLTS----IMKKELSAKSLKEAPEDETLPLCWKG 309
+++T L Y L S ++K E + LK+ LC+K
Sbjct: 315 TTFTVLPDDVYSKLESAVADVVKLERAEDPLKQ------FSLCYKS 354
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 84/327 (25%), Positives = 131/327 (40%), Gaps = 39/327 (11%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
G Y + +G P+ Y + +DTGS LTWLQC V C PL+ P V C
Sbjct: 132 GNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYASVRCS 191
Query: 105 DPICASLHAP--GHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
C L A C C Y+ Y D S+G L D +F G P
Sbjct: 192 ASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGSLSTDTVSF----GSTRYPSFYY 247
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFFGDD 221
GCG + + G++GL + K S++ QL + +CL + G+L G
Sbjct: 248 GCGQDN--EGLFGRSAGLIGLARNKLSLLYQLAPS--LGYSFSYCLPTAASTGYLSIGP- 302
Query: 222 LYDS----SRVVWTSMSSDYTKYYSPGVAELFFGGETTGL-----KNLPVVFDSGSSYTY 272
Y++ S S S D + Y+ ++ + GG + +LP + DSG+ T
Sbjct: 303 -YNTGHYYSYTPMASSSLDASLYFI-TLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITR 360
Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
L + L+ + + ++ + AP L C++G+ V T+A++F
Sbjct: 361 LPTAVHTALSKAVAQAMAGA--QRAPAFSILDTCFEGQASQLRV-------PTVAMAFAG 411
Query: 333 GKTRTLFELTPEAYLIISNKGNVCLGI 359
G + +LT LI + CL
Sbjct: 412 GAS---MKLTTRNVLIDVDDSTTCLAF 435
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 88/326 (26%), Positives = 128/326 (39%), Gaps = 41/326 (12%)
Query: 23 SSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP 82
S SSS+ +GSSL T Y +++ +G PA + +DTGSD++W+QC+ P
Sbjct: 108 SKVSSSVPTKLGSSL---------DTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCN-P 157
Query: 83 CVR--CVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS 136
C C L+ P+ V C CA L G+ +C Y ++Y DG +
Sbjct: 158 CPNPPCYAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYGDGST 217
Query: 137 SLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS 196
+ G +D + GC + V DG++GLG G S+VSQ +
Sbjct: 218 TNGTYSRDTLTL--SGASDAVKGFQFGCSH--VESGFSDQTDGLMGLGGGAQSLVSQ--T 271
Query: 197 QKLIRNVVGHCL---SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE 253
N +CL SG G G S +Y + ++ GG+
Sbjct: 272 AAAYGNSFSYCLPPTSGSSGFLTLGGGGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGK 331
Query: 254 TTGLKNLPVVF------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
GL P VF DSG+ T L Y L+S K + K + AP L C
Sbjct: 332 QLGLS--PSVFAAGSVVDSGTIITRLPPTAYSALSSAFKAGM--KQYRSAPARSILDTC- 386
Query: 308 KGRRPFKNVHDVKKCFRTLALSFTDG 333
F + T+AL F+ G
Sbjct: 387 -----FDFAGQTQISIPTVALVFSGG 407
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 87/352 (24%), Positives = 142/352 (40%), Gaps = 43/352 (12%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSND----LVPC 103
G Y + + IG P PY DTGSDL W QC APC +C P PLY PS+ ++PC
Sbjct: 30 GEYLMALAIGTPPLPYQAIADTGSDLIWTQC-APCTSQCFRQPTPLYNPSSSTTFAVLPC 88
Query: 104 ED--PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT-NGQRLNPRL 160
+CA+ A C Y + Y G +S+ + F F T G P +
Sbjct: 89 NSSLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSV-FQGSETFTFGSTPAGHARVPGI 147
Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFL---- 216
A GC G + G++GLG+G+ S+VSQL K + + + L
Sbjct: 148 AFGCS-TASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCLTPYQDTNSTSTLLLGPS 206
Query: 217 --FFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP------------V 262
G S+ V + ++ +Y + + G TT L P +
Sbjct: 207 ASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLG--TTALSIPPDAFSLNADGTGGL 264
Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
+ DSG++ T L YQ + + + ++ + + D L LC+ +
Sbjct: 265 IIDSGTTITLLGNTAYQQVRAAVVSLVTLPT-TDGSADTGLDLCFM----LPSSTSAPPA 319
Query: 323 FRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
++ L F L ++Y++ + G CL + N + ++N++G
Sbjct: 320 MPSMTLHFNGAD----MVLPADSYMMSDDSGLWCLAMQNQTD---GEVNILG 364
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 92/353 (26%), Positives = 153/353 (43%), Gaps = 43/353 (12%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SN 98
G + +G Y V + IG P + +L +DTGSD+ W+QC +PC C + ++ P S
Sbjct: 6 GLAFGSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQC-SPCKSCYKQNDAVFDPRASSSF 64
Query: 99 DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
+ C P C L + ++ +C Y++ Y DG ++G L D+F+ + R +P
Sbjct: 65 RRLSCSTPQCKLLDVKACASTDN--RCLYQVSYGDGSFTVGDLASDSFS---VSRGRTSP 119
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFF 218
+ GCG++ + G+LGLG GK S SQL S+K +V L F
Sbjct: 120 -VVFGCGHDN--EGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRDNGVRASSALLF 176
Query: 219 GDD-LYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLP-----------VVF 264
GD L S+ +T + + +Y G++ + GG + + V+
Sbjct: 177 GDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVII 236
Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
DSG+S T L Y + + + + L A + C+ F + V
Sbjct: 237 DSGTSVTRLPTYAYTVMRDAFRS--ATQKLPRAADFSLFDTCYD----FSALTSVT--IP 288
Query: 325 TLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
T++ F G + +L P YL+ + G C ++ L DL++IG I
Sbjct: 289 TVSFHFEGGAS---VQLPPSNYLVPVDTSGTFCFAF---SKTSL-DLSIIGNI 334
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 86/298 (28%), Positives = 114/298 (38%), Gaps = 53/298 (17%)
Query: 16 RMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLT 75
RM+ S + ++ L + + + + N PT Y V + IG P +P L LDTGSDL
Sbjct: 47 RMALRSKARAARRLSSSASAPVSPGTYDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLI 106
Query: 76 WLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPA-----QCD 126
W QC PC C + P + P + L C+ +C L +C P C
Sbjct: 107 WTQCQ-PCPACFDQALPYFDPSTSSTLSLTSCDSTLCQGLPV---ASCGSPKFWPNQTCV 162
Query: 127 YELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG-YNQVPGASYHPLDGILGLGK 185
Y Y D + G L D F F P +A GCG +N G GI G G+
Sbjct: 163 YTYSYGDKSVTTGFLEVDKFTFVGAGASV--PGVAFGCGLFNN--GVFKSNETGIAGFGR 218
Query: 186 GKSSIVSQLHSQKLIRNVVGHCLSGGGG-----GFLFFGDDLYDSSRVVWTSM-----SS 235
G S+ SQL HC + G L DLY S R S +
Sbjct: 219 GPLSLPSQLKVGNF-----SHCFTAVNGLKPSTVLLDLPADLYKSGRGAVQSTPLIQNPA 273
Query: 236 DYTKYYSPGVAELFFGGETTGLKNLPV--------------VFDSGSSYTYLNRVTYQ 279
+ T YY L G T G LPV + DSG++ T L Y+
Sbjct: 274 NPTFYY------LSLKGITVGSTRLPVPESEFALKNGTGGTIIDSGTAMTSLPTRVYR 325
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 57/159 (35%), Positives = 76/159 (47%), Gaps = 11/159 (6%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSND----LVPC 103
G Y + + IG P PY DTGSDL W QC APC +C P PLY PS+ ++PC
Sbjct: 88 GEYLMALAIGTPPLPYQAIADTGSDLIWTQC-APCTSQCFRQPTPLYNPSSSTTFAVLPC 146
Query: 104 ED--PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT-NGQRLNPRL 160
+CA+ A C Y + Y G +S+ + F F T GQ P +
Sbjct: 147 NSSLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSV-FQGSETFTFGSTPAGQSRVPGI 205
Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
A GC G + G++GLG+G+ S+VSQL K
Sbjct: 206 AFGCS-TASSGFNASSASGLVGLGRGRLSLVSQLGVPKF 243
>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 515
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 84/351 (23%), Positives = 141/351 (40%), Gaps = 55/351 (15%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL------------ 100
T+ IG P + + LDTGSDL W+ CD C RC + DL
Sbjct: 98 TTVQIGTPGVKFMVALDTGSDLFWVPCD--CTRCAATDSSAFASDFDLNVYNPNGSSTSK 155
Query: 101 -VPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSL-GVLVKDAFAFNYTNGQR-- 155
V C + +C +H C + C Y + Y +S G+LV+D +
Sbjct: 156 KVTCNNSLC--MH---RSQCLGTLSNCPYMVSYVSAETSTSGILVEDVLHLTQEDNHHDL 210
Query: 156 LNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
+ + GCG Q+ S+ + +G+ GLG K S+ S L + + C G
Sbjct: 211 VEANVIFGCG--QIQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDG 268
Query: 213 GGFLFFGDD-LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYT 271
G + FGD +D + S T Y+ V ++ G ++ +FDSG+S+T
Sbjct: 269 IGRISFGDKGSFDQDETPFNLNPSHPT--YNITVTQVRVGTTLIDVE-FTALFDSGTSFT 325
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK-----KCFRTL 326
YL TY LT ++ + + R PF+ +D+ ++
Sbjct: 326 YLVDPTYTRLTESFHSQVQDRRHRS-----------DSRIPFEYCYDMSPDANTSLIPSV 374
Query: 327 ALSFTDGKTRTLFELTPEAYLIISNKGNV--CLGILNGAEVGLQDLNVIGG 375
+L+ G ++ + +IIS + + CL ++ AE+ + N + G
Sbjct: 375 SLTMGGGSHFAVY----DPIIIISTQSELVYCLAVVKTAELNIIGQNFMTG 421
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 99/376 (26%), Positives = 167/376 (44%), Gaps = 50/376 (13%)
Query: 17 MSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGY-YNVTMYIGQPARPYFLDLDTGSDLT 75
M++ ++SSS SS+ V ++P G Y + + +G P + + DTGSDL
Sbjct: 26 MAARANSSSWSSMAGTT------DVESPLHPDGGGYVMDISVGTPGKRFRAIADTGSDLV 79
Query: 76 WLQCDAPCVRCVEAP--HPLYRPSNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYA 132
W+Q + PC C P + + C +CA L PG +CE + C Y EY
Sbjct: 80 WVQSE-PCTGCSGGTIFDPRQSSTFREMDCSSQLCAEL--PG--SCEPGSSTCSYSYEYG 134
Query: 133 DGGSSLGVLVKDAFAFNYT-NGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIV 191
G + G +D + T +G + P A+GCG + + + +DG++GLG+G S+
Sbjct: 135 S-GETEGEFARDTISLGTTSDGSQKFPSFAVGCG---MVNSGFDGVDGLVGLGQGPVSLT 190
Query: 192 SQLHSQKLIRNVVGHCL----SGGGGGFLFFGDDL------YDSSRVVWTSMSSDYTKYY 241
SQL + I + +CL S L FG S+++ T S Y YY
Sbjct: 191 SQLSAA--IDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKI--TPPSDTYPTYY 246
Query: 242 SPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDE 301
V + G+T G ++ DSG++ TY+ Y + S M+ ++ + +
Sbjct: 247 LLTVNGIAVAGQTMGSPGTTII-DSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGS--SM 303
Query: 302 TLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGN-VCLGIL 360
L LC+ R +N F L + T+ + +L++ + G+ VCL +
Sbjct: 304 GLDLCYD-RSSNRNYK-----FPALTIRLAGA---TMTPPSSNYFLVVDDSGDTVCLAM- 353
Query: 361 NGAEVGLQDLNVIGGI 376
G+ GL +++IG +
Sbjct: 354 -GSASGLP-VSIIGNV 367
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 91/332 (27%), Positives = 135/332 (40%), Gaps = 43/332 (12%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G Y + YIG P DT SDL W+QC +PC C PL+ P + C+
Sbjct: 88 GEYLMRFYIGTPPVERLAIADTASDLIWVQC-SPCETCFPQDTPLFEPHKSSTFANLSCD 146
Query: 105 DPICAS---LHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRL 160
C S + P N C Y Y DG S+ GVL ++ F Q + P+
Sbjct: 147 SQPCTSSNIYYCPLVGNL-----CLYTNTYGDGSSTKGVLCTESIHF---GSQTVTFPKT 198
Query: 161 ALGCGYNQ-VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFL 216
GCG N + + GI+GLG G S+VSQL Q I + +CL + L
Sbjct: 199 IFGCGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQ--IGHKFSYCLLPFTSTSTIKL 256
Query: 217 FFGDDLYDSSR-VVWTSMSSD--YTKYYSPGVAELFFGGE-----TTGLKNLPVVFDSGS 268
FG+D + VV T + D Y YY + + G + TT N ++ D G+
Sbjct: 257 KFGNDTTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQVRTTDHTNGNIIIDLGT 316
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLAL 328
TYL Y +++++ L + E +D P + F N ++ F +
Sbjct: 317 VLTYLEVNFYHNFVTLLREAL---GISETKDDIPYPFDFC----FPNQANIT--FPKIVF 367
Query: 329 SFTDGKTRTLFELTPEAYLIISNKGNVCLGIL 360
FT K +F + + +CL +L
Sbjct: 368 QFTGAK---VFLSPKNLFFRFDDLNMICLAVL 396
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 92/355 (25%), Positives = 148/355 (41%), Gaps = 48/355 (13%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND- 99
V G +G Y + +G PA+ +L LDTGSD+ W+QC+ PC C + P++ P++
Sbjct: 152 VSGASQGSGEYFSRIGVGTPAKDMYLVLDTGSDVNWIQCE-PCADCYQQSDPVFNPTSSS 210
Query: 100 ---LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
+ C P C+ L + +C Y++ Y DG ++G L D F N ++
Sbjct: 211 TYKSLTCSAPQCSLLETSACRS----NKCLYQVSYGDGSFTVGELATDTVTFG--NSGKI 264
Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGG 212
N +ALGCG++ + G+LGLG G SI +Q+ + +CL SG
Sbjct: 265 N-NVALGCGHDN--EGLFTGAAGLLGLGGGVLSITNQMKATSF-----SYCLVDRDSGKS 316
Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----------V 262
F L + +Y G++ GGE L + V
Sbjct: 317 SSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGV 376
Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC 322
+ D G++ T L Y +L K L+ K + C+ F ++ VK
Sbjct: 377 ILDCGTAVTRLQTQAYNSLRDAFLK-LTVNLKKGSSSISLFDTCYD----FSSLSTVK-- 429
Query: 323 FRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
T+A FT GK+ +L + YLI + + G C + L++IG +
Sbjct: 430 VPTVAFHFTGGKS---LDLPAKNYLIPVDDSGTFCFAFAPTSS----SLSIIGNV 477
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 107/401 (26%), Positives = 164/401 (40%), Gaps = 61/401 (15%)
Query: 15 VRMSSSSSSSSSSSLFNHVGSSLLFQVH--------------GNVYPTGYYNVTMYIGQP 60
V+ + S SS++ SLF + S+ +FQ H G + G Y ++ +G P
Sbjct: 54 VKANPSPSSAAQKSLFPY--SAHIFQQHTKNPAALRSSTTTLGRKF--GEYYTSIKLGSP 109
Query: 61 ARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCED-PICASLHAPG 115
+ L +DTGS+LTWL+C PC C + +Y + + V C + +C++
Sbjct: 110 GQEAILIVDTGSELTWLKC-LPCKVCAPSVDTIYDAARSVSYKPVTCNNSQLCSNSSQGT 168
Query: 116 HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR--LNPRLALGCGYNQ---VP 170
+ C +QC + Y DG S G L D G + A GC VP
Sbjct: 169 YAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAFGCAQGDLELVP 228
Query: 171 -GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-----GGGGFLFFGDDLYD 224
GAS GILGL GK ++ QL + + HC G +FFG+
Sbjct: 229 TGAS-----GILGLNAGKMALPMQLGQRFGWK--FSHCFPDRSSHLNSTGVVFFGNAELP 281
Query: 225 SSRVVWTSM----SSDYTKYYSPGVAELFFGGETTGL--KNLPVVFDSGSSYTYLNRVTY 278
+V +TS+ S K+Y + + L + V+ DSGSS++ R +
Sbjct: 282 HEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVLLPRGSVVILDSGSSFSSFVRPFH 341
Query: 279 QTLTSIMKKELSAKSLKEAPEDE--TLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKT- 335
L K SLK D L C+K ++ ++ + +L+L F DG T
Sbjct: 342 SQLREAFLKH-RPPSLKHLEGDSFGDLGTCFKVSN--DDIDELHRTLPSLSLVFEDGVTI 398
Query: 336 --RTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
++ L P A N +C +G G +NVIG
Sbjct: 399 GIPSIGVLLPVARY--QNHVKMCFAFEDG---GPNPVNVIG 434
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 82/305 (26%), Positives = 125/305 (40%), Gaps = 37/305 (12%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV--RCVEAPHPLYRPSN----DLVPCE 104
Y VT+ +G P +++DTGSD++W+QC PC C L+ P+ VPC
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCK-PCSAPACNSQRDQLFDPAKSSTYSAVPCG 201
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
C+ L C +QC Y + Y DG ++ GV D A G + L GC
Sbjct: 202 ADACSELRIY-EAGCSG-SQCGYVVSYGDGSNTTGVYGSDTLAL--APGNTVGTFL-FGC 256
Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDL 222
G+ Q + +DG+L LG+ S+ SQ + V +CL G+L G
Sbjct: 257 GHAQA--GMFAGIDGLLALGRQSMSLKSQ--AAGAYGGVFSYCLPSKQSAAGYLTLGGPT 312
Query: 223 YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV---------VFDSGSSYTYL 273
+S T + T + +P + G + G + + V V D+G+ T L
Sbjct: 313 -SASGFATTGL---LTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVDTGTVITRL 368
Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
Y L S + ++ AP + L C+ F V T+AL+F+ G
Sbjct: 369 PPTAYAALRSAFRGAIAPYGYPSAPANGILDTCYD----FSRYGVVT--LPTVALTFSGG 422
Query: 334 KTRTL 338
T L
Sbjct: 423 ATLAL 427
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 74/282 (26%), Positives = 121/282 (42%), Gaps = 30/282 (10%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCE 104
T Y +++ +G PA+ +++DTGS +W+ C+ C C P + + V C
Sbjct: 79 TSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCG 136
Query: 105 DPICASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+C + H + E+ C + + Y DG +S G+L +D F ++ Q++ P + G
Sbjct: 137 TSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PGFSFG 193
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL- 222
C + + +DG+LG+G G S++ Q + +CL FF
Sbjct: 194 CNMDSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDCFSYCLPLQKSERGFFSKTTG 250
Query: 223 YDSSRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGS 268
Y S V T YTK + ELFF GE GL VVFDSGS
Sbjct: 251 YFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVFDSGS 310
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
+Y+ L+ +++ L + A E+E+ C+ R
Sbjct: 311 ELSYIPDRALSVLSQRIRELLLKRG---AAEEESERNCYDMR 349
>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
Length = 376
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 50/157 (31%), Positives = 74/157 (47%), Gaps = 12/157 (7%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR-CVEAPHPLYRPSNDL- 100
G+ TG Y VT+ +G P R DTGSDLTW QC+ PC R C P++ PS
Sbjct: 130 GSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCE-PCARYCYHQQEPIFNPSKSTS 188
Query: 101 ---VPCEDPICASLHA-PGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
+ C P C L + G+ + C Y ++Y D S+G +D A T+ +
Sbjct: 189 YTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKLALTSTD---V 245
Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQ 193
GCG N + + G++GLG+ S++S+
Sbjct: 246 FNNFLFGCGQNNR--GLFVGVAGLIGLGRNALSLMSK 280
>gi|356496606|ref|XP_003517157.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 508
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 86/350 (24%), Positives = 145/350 (41%), Gaps = 61/350 (17%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCV---------EAPHPLY----RPSNDLVPC 103
+G P + + LDTGSDL WL C+ C +CV + +Y ++ V C
Sbjct: 107 VGTPPLSFLVALDTGSDLFWLPCN--CTKCVHGIGLSNGEKIAFNIYDLKGSSTSQPVLC 164
Query: 104 EDPICASLHAPGHHNC-EDPAQCDYELEY-ADGGSSLGVLVKDAFAF--NYTNGQRLNPR 159
+C C C YE+ Y ++G S+ G LV+D + + + R
Sbjct: 165 NSSLCEL-----QRQCPSSDTICPYEVNYLSNGTSTTGFLVEDVLHLITDDDKTKDADTR 219
Query: 160 LALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
+ GCG Q + GA+ +G+ GLG S+ S L + L N C G G
Sbjct: 220 ITFGCGQVQTGAFLDGAAP---NGLFGLGMSNESVPSILAKEGLTSNSFSMCFGSDGLGR 276
Query: 216 LFFGDDLYDSSRVVWTSMSSDYTKY--------YSPGVAELFFGGETTGLKNLPVVFDSG 267
+ FGD+ +S+ T + Y+ V ++ G + L+ +FDSG
Sbjct: 277 ITFGDN---------SSLVQGKTPFNLRALHPTYNITVTQIIVGEKVDDLE-FHAIFDSG 326
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
+S+TYLN Y+ +T+ E+ + + +E PF+ +++ +T+
Sbjct: 327 TSFTYLNDPAYKQITNSFNSEIKLQRHSTSSSNEL---------PFEYCYELSPN-QTVE 376
Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGN--VCLGILNGAEVGLQDLNVIGG 375
LS L + + +S +G +CLG+L V + N + G
Sbjct: 377 LSINLTMKGGDNYLVTDPIVTVSGEGINLLCLGVLKSNNVNIIGQNFMTG 426
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 90/332 (27%), Positives = 132/332 (39%), Gaps = 36/332 (10%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
T Y V + +G P + + DTGSD TW+QC V C + L+ P+ V C
Sbjct: 160 TANYVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSSTYANVSC 219
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
DP CA L A G C + C Y ++Y DG ++G KD A Q G
Sbjct: 220 ADPACADLDASG---C-NAGHCLYGIQYGDGSYTVGFFAKDTLAV----AQDAIKGFKFG 271
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFF--G 219
CG + G+LGLG+G +SI Q + + +CL S G+L F
Sbjct: 272 CGEKNR--GLFGQTAGLLGLGRGPTSITVQAYEK--YGGSFSYCLPASSAATGYLEFGPL 327
Query: 220 DDLYDSSRVVWTSMSSDY-TKYYSPGVAELFFGGETTG------LKNLPVVFDSGSSYTY 272
S T M +D +Y G+ + GG+ G N + DSG+ T
Sbjct: 328 SPSSSGSNAKTTPMLTDKGPTFYYVGLTGIRVGGKQLGAIPESVFSNSGTLVDSGTVITR 387
Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
L Y L+S ++A K+A L C+ F + V T++L F
Sbjct: 388 LPDTAYAALSSAFAAAMAASGYKKAAAYSILDTCYD----FTGLSQVS--LPTVSLVFQG 441
Query: 333 GKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
G +L + ++ VCLG + +
Sbjct: 442 G---ACLDLDASGIVYAISQSQVCLGFASNGD 470
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 91/319 (28%), Positives = 124/319 (38%), Gaps = 68/319 (21%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV--RCVEAPHPLYRPSN----DLVPCE 104
Y T+ G PA P + +DTGSDLTWLQC PC +C PL+ PS+ VPC
Sbjct: 112 YVATVSFGTPAVPQVVVIDTGSDLTWLQCK-PCSSGQCSPQKDPLFDPSHSSTYSAVPCA 170
Query: 105 DPICASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
C L A + C + C + + Y DG S++GV KD +L L
Sbjct: 171 SGECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGVYGKD--------------KLTLA 216
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIV-------------SQLHSQKLIRNVVGHCLSG 210
PGA D G G KSS+ L +Q +CL
Sbjct: 217 ------PGAIVK--DFYFGCGHSKSSLPGLFDGLLGLGRLSESLGAQYGGGGGFSYCLPA 268
Query: 211 GGG--GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP------- 261
GFL FG + S V+T M + P + + G T G K L
Sbjct: 269 VNSKPGFLAFGAG-RNPSGFVFTPMGRVPGQ---PTFSTVTLAGITVGGKKLDLRPSAFS 324
Query: 262 --VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
++ DSG+ T L Y+ L + ++ + A L D L +KNV
Sbjct: 325 GGMIVDSGTVVTVLQSTVYRALRAAFREAMKAYRLVHGDLDTCYDLTG-----YKNVVVP 379
Query: 320 KKCFRTLALSFTDGKTRTL 338
K +AL+F+ G T L
Sbjct: 380 K-----IALTFSGGATINL 393
>gi|3805854|emb|CAA21474.1| putative protein [Arabidopsis thaliana]
gi|7270540|emb|CAB81497.1| putative protein [Arabidopsis thaliana]
Length = 455
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 75/290 (25%), Positives = 127/290 (43%), Gaps = 48/290 (16%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV---------EAPHPLYRP----SND 99
T+ +G P + + LDTGSDL W+ CD C +C E +Y P +N
Sbjct: 109 TTVKLGTPGMRFMVALDTGSDLFWVPCD--CGKCAPTEGATYASEFELSIYNPKVSTTNK 166
Query: 100 LVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSL-GVLVKDAFAFNY--TNGQR 155
V C + +CA + C + C Y + Y +S G+L++D N +R
Sbjct: 167 KVTCNNSLCAQ-----RNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPER 221
Query: 156 LNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
+ + GCG QV S+ + +G+ GLG K S+ S L + L+ + C G
Sbjct: 222 VEAYVTFGCG--QVQSGSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDG 279
Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKN-LPVVFDSGSSYT 271
G + FGD +++ + Y+ V + G TT + + +FD+G+S+T
Sbjct: 280 VGRISFGDKGSSDQEETPFNLNPSHPN-YNITVTRVRVG--TTLIDDEFTALFDTGTSFT 336
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
YL Y T++ SA+ + +P+ R PF+ +D+++
Sbjct: 337 YLVDPMYTTVSE------SAQDKRHSPD---------SRIPFEYCYDMRE 371
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 94/354 (26%), Positives = 141/354 (39%), Gaps = 52/354 (14%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y + +++G P + + L LDTGSDL W+QC PC+ C E P Y P + + C
Sbjct: 194 SGEYFMDVFVGTPPKHFSLILDTGSDLNWIQC-VPCIACFEQSGPYYDPKDSSSFRNISC 252
Query: 104 EDPICASLHAPGHHN-CEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYT--NGQ---RL 156
DP C + AP C+ Q C Y Y DG ++ G + F N T NG +
Sbjct: 253 HDPRCQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELKH 312
Query: 157 NPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SG 210
+ GCG +N+ +H G+LGLGKG S SQ+ Q L +CL +
Sbjct: 313 VENVMFGCGHWNR---GLFHGAAGLLGLGKGPLSFASQM--QSLYGQSFSYCLVDRNSNA 367
Query: 211 GGGGFLFFGDD--LYDSSRVVWTSM----SSDYTKYYSPGVAELFFGGETTGLKNLP--- 261
L FG+D L + +TS +Y + + E +
Sbjct: 368 SVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEETWHL 427
Query: 262 -------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFK 314
+ DSG++ TY Y+ + +++ L E LP +P
Sbjct: 428 SSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEG-----LP----PLKPCY 478
Query: 315 NVHDVKKC-FRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGL 367
NV ++K + F D ++ E Y I + VCL IL L
Sbjct: 479 NVSGIEKMELPDFGILFAD---EAVWNFPVENYFIWIDPEVVCLAILGNPRSAL 529
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 99/368 (26%), Positives = 148/368 (40%), Gaps = 70/368 (19%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y + +Y+G P R + + +DTGSDL WLQC APC+ C E P++ P+ V C
Sbjct: 148 SGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQC-APCLDCFEQRGPVFDPAASSSYRNVTC 206
Query: 104 EDPICASL------HAPGHHNCEDPAQ--CDYELEYADGGSSLGVLVKDAFAFNYT--NG 153
D C + A C P + C Y Y D ++ G L ++F N T
Sbjct: 207 GDHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFTVNLTAPGA 266
Query: 154 QRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGH----CLS 209
R + GCG+ +H G+LGLG+G S SQL R V GH CL
Sbjct: 267 SRRVDGVVFGCGHRN--RGLFHGAAGLLGLGRGPLSFASQL------RAVYGHTFSYCLV 318
Query: 210 GGG---GGFLFFGDDLYDSSRVVWTSMSSDYTK-------------YYSPGVAELFFGGE 253
G G + FG+D D + + YT +Y + + GGE
Sbjct: 319 DHGSDVGSKVVFGED--DDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVGGE 376
Query: 254 TTGLKNLP----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETL 303
+ + + DSG++ +Y YQ + +S +S PE L
Sbjct: 377 LLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMS-RSYPLVPEFPVL 435
Query: 304 PLCWKGRRPFKNVHDVKKC-FRTLALSFTDGKTRTLFELTPEAYLIISNKGN---VCLGI 359
C+ NV V++ L+L F DG +++ E Y I + +CL +
Sbjct: 436 SPCY-------NVSGVERPEVPELSLLFADG---AVWDFPAENYFIRLDPDGGSIMCLAV 485
Query: 360 LNGAEVGL 367
L G+
Sbjct: 486 LGTPRTGM 493
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 81/305 (26%), Positives = 124/305 (40%), Gaps = 37/305 (12%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV--RCVEAPHPLYRPSN----DLVPCE 104
Y VT+ +G P +++DTGSD++W+QC PC C L+ P+ VPC
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCK-PCSAPACNSQRDQLFDPAKSSTYSAVPCG 201
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
C+ L C +QC Y + Y DG ++ GV D A G + L GC
Sbjct: 202 ADACSELRIY-EAGCSG-SQCGYVVSYGDGSNTTGVYGSDTLAL--APGNTVGTFL-FGC 256
Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDL 222
G+ Q + +DG+L LG+ S+ SQ + V +CL G+L G
Sbjct: 257 GHAQA--GMFAGIDGLLALGRQSMSLKSQ--AAGAYGGVFSYCLPSKQSAAGYLTLGGP- 311
Query: 223 YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV---------VFDSGSSYTYL 273
S + + T + +P + G + G + + V V D+G+ T L
Sbjct: 312 ---SSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVDTGTVITRL 368
Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
Y L S + ++ AP + L C+ F V T+AL+F+ G
Sbjct: 369 PPTAYAALRSAFRGAIAPCGYPSAPANGILDTCYD----FSRYGVVT--LPTVALTFSGG 422
Query: 334 KTRTL 338
T L
Sbjct: 423 ATLAL 427
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 82/305 (26%), Positives = 128/305 (41%), Gaps = 32/305 (10%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV---RCVEAPHPLYRPSND----LVPC 103
Y VT +G P +++DTGSDL+W+QC PC C PL+ P+ VPC
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCK-PCSAAPSCYSQKDPLFDPAQSSSYAAVPC 198
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
P+CA L C AQC Y + Y DG ++ GV D + ++ + G
Sbjct: 199 GGPVCAGLGIYAASACSA-AQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQ---GFFFG 254
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDD 221
CG+ Q ++ +DG+LGLG+ + S+V Q + V +CL G+L G
Sbjct: 255 CGHAQ--SGLFNGVDGLLGLGREQPSLVEQ--TAGTYGGVFSYCLPTKPSTAGYLTLGLG 310
Query: 222 LYDSSRVVWTSM----SSDYTKYYSPGVAELFFGGETTGLKNLP----VVFDSGSSYTYL 273
+ +++ S + YY + + GG+ + V D+G+ T L
Sbjct: 311 GPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVITRL 370
Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
Y L S + +++ AP + L C+ F V +AL+F G
Sbjct: 371 PPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYN----FAGYGTVT--LPNVALTFGSG 424
Query: 334 KTRTL 338
T L
Sbjct: 425 ATVML 429
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 92/353 (26%), Positives = 152/353 (43%), Gaps = 43/353 (12%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SN 98
G + +G Y V + IG P + +L +DTGSD+ W+QC +PC C + ++ P S
Sbjct: 6 GLAFGSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQC-SPCKSCYKQNDAVFDPRASSSF 64
Query: 99 DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
+ C P C L + ++ +C Y++ Y DG ++G L D+F + R +P
Sbjct: 65 RRLSCSTPQCKLLDVKACASTDN--RCLYQVSYGDGSFTVGDLASDSF---LVSRGRTSP 119
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFF 218
+ GCG++ + G+LGLG GK S SQL S+K +V L F
Sbjct: 120 -VVFGCGHDN--EGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRDNGVRASSALLF 176
Query: 219 GDD-LYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLP-----------VVF 264
GD L S+ +T + + +Y G++ + GG + + V+
Sbjct: 177 GDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVII 236
Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
DSG+S T L Y + + + + L A + C+ F + V
Sbjct: 237 DSGTSVTRLPTYAYTVMRDAFRS--ATQKLPRAADFSLFDTCYD----FSALTSVT--IP 288
Query: 325 TLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
T++ F G + +L P YL+ + G C ++ L DL++IG I
Sbjct: 289 TVSFHFEGGAS---VQLPPSNYLVPVDTSGTFCFAF---SKTSL-DLSIIGNI 334
>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 418
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 95/360 (26%), Positives = 143/360 (39%), Gaps = 69/360 (19%)
Query: 51 YNVTMY-IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLV----PCED 105
YNV + IG P +P +D +L W QC C RC + PL+ P+ PC
Sbjct: 66 YNVANFTIGTPPQPASAIIDVAGELVWTQCSM-CSRCFKQDLPLFVPNASSTFRPEPCGT 124
Query: 106 PICASLHAPGHHNCEDPAQCDYE--LEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
C S+ NC C YE + GG +LG++ D FA L G
Sbjct: 125 DACKSIPT---SNCSS-NMCTYEGTINSKLGGHTLGIVATDTFAIGTATAS-----LGFG 175
Query: 164 C----GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFG 219
C G + + G S G++GLG+ SS+VSQ++ K + H G L G
Sbjct: 176 CVVASGIDTMGGPS-----GLIGLGRAPSSLVSQMNITKFSYCLTPH--DSGKNSRLLLG 228
Query: 220 DDLY-------DSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL--KNLPVVFDSGSSY 270
++ V TS D ++YY + + G L V+ + +
Sbjct: 229 SSAKLAGGGNSTTTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIALPPSGNTVLVQTLAPM 288
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALS- 329
++L YQ L KKE++ K++ AP L +PF CF LS
Sbjct: 289 SFLVDSAYQAL----KKEVT-KAVGAAPTATPL-------QPF------DLCFPKAGLSN 330
Query: 330 -------FTDGKTRTLFELTPEAYLII--SNKGNVCLGILNGAEVGL----QDLNVIGGI 376
FT + + P YLI KG VC+ IL+ + + ++LN++G +
Sbjct: 331 ASAPDLVFTFQQGAAALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNILGSL 390
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 90/384 (23%), Positives = 152/384 (39%), Gaps = 64/384 (16%)
Query: 18 SSSSSSSSSSSLFNHVGSSLLFQVHGNVY-----PTGYYNVTMY---IGQPARPYFLDLD 69
SS S +S ++++H +L Q N Y P+ Y V + IG+P P +D
Sbjct: 52 SSLSPYNSKDTIWDHYSHKILKQTFSNDYISNLVPSPRYVVFLMNFSIGEPPIPQLAVMD 111
Query: 70 TGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDP-AQCDYE 128
TGS LTW+ C PC C + P++ PS + ++L + C+ +C Y
Sbjct: 112 TGSSLTWVMCH-PCSSCSQQSVPIFDPS------KSSTYSNLSCSECNKCDVVNGECPYS 164
Query: 129 LEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALGCGYN---QVPGASYHPLDGILGLG 184
+EY GSS G+ ++ + + P L GCG G Y ++G+ GLG
Sbjct: 165 VEYVGSGSSQGIYAREQLTLETIDESIIKVPSLIFGCGRKFSISSNGYPYQGINGVFGLG 224
Query: 185 KGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVW---TSMSSDYTK-- 239
G+ S++ + +C+ + Y +R+V +M D T
Sbjct: 225 SGRFSLLPSFGKK------FSYCIGN-------LRNTNYKFNRLVLGDKANMQGDSTTLN 271
Query: 240 ----YYSPGVAELFFGGETTGL-----------KNLPVVFDSGSSYTYLNRVTYQTLTSI 284
Y + + GG + N V+ DSG+ +T+L + ++ L S
Sbjct: 272 VINGLYYVNLEAISIGGRKLDIDPTLFERSITDNNSGVIIDSGADHTWLTKYGFEVL-SF 330
Query: 285 MKKELSAKSLKEAPEDETLP--LCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELT 342
+ L L A +D+ P LC+ G V F + F +G + +L
Sbjct: 331 EVENLLEGVLVLAQQDKHNPYTLCYSGV-----VSQDLSGFPLVTFHFAEG---AVLDLD 382
Query: 343 PEAYLIISNKGNVCLGILNGAEVG 366
+ I + + C+ +L G G
Sbjct: 383 VTSMFIQTTENEFCMAMLPGNYFG 406
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 101/393 (25%), Positives = 150/393 (38%), Gaps = 81/393 (20%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC---------DAPCVRCVEAPHPL----- 93
TG Y V +G PA+P+ L DTGSDLTW++C + AP P
Sbjct: 84 TGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRT 143
Query: 94 YRPSNDL----VPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDA--F 146
+RP +PC C C PA C Y+ Y DG ++ G + D+
Sbjct: 144 FRPDKSRTWAPIPCSSATCRESLPFSLAACATPANPCAYDYRYKDGSAARGTVGVDSATI 203
Query: 147 AFNYTNGQRLNPR-LALGC--GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLI 200
A + ++ R + LGC YN G S+ DG+L LG S S+ S+ +
Sbjct: 204 ALSGRAARKAKLRGVVLGCTTSYN---GQSFLASDGVLSLGYSNISFASRAASRFGGRFS 260
Query: 201 RNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSS------------------------- 235
+V H +L FG + SSR ++S
Sbjct: 261 YCLVDHLAPRNATSYLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVLD 320
Query: 236 -DYTKYYSPGVAELFFGGETTGLKNLP-----------VVFDSGSSYTYLNRVTYQTLTS 283
+Y+ V + GE L +P + DSG+S T L + Y+ + +
Sbjct: 321 HRTRPFYAVTVKGVSVAGE---LLKIPRAVWDVEQGGGAILDSGTSLTMLAKPAYRAVVA 377
Query: 284 IMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTP 343
+ K L+ L D C+ P + DV LA+ F G R E
Sbjct: 378 ALSKRLAG--LPRVTMDP-FDYCYNWTSP--SGSDVAAPLPMLAVHFA-GSAR--LEPPA 429
Query: 344 EAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
++Y+I + G C+G+ G G L+VIG I
Sbjct: 430 KSYVIDAAPGVKCIGLQEGPWPG---LSVIGNI 459
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 75/268 (27%), Positives = 109/268 (40%), Gaps = 34/268 (12%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
T Y V + +G P RP L LDTGSDL W QC APC C + P+ P+ +PC
Sbjct: 81 TNEYLVRLAVGTPRRPVALTLDTGSDLVWTQC-APCRDCFDQDLPVLDPAASSTYAALPC 139
Query: 104 EDPICASL--HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYT--NGQRLNP- 158
C +L + G + C Y Y D ++G + D F F + +G+ L+
Sbjct: 140 GAARCRALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTR 199
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG---GGGGF 215
RL GCG+ G GI G G+G+ S+ SQL+ +C +
Sbjct: 200 RLTFGCGHLN-KGVFQSNETGIAGFGRGRWSLPSQLNVTSF-----SYCFTSMFESKSSL 253
Query: 216 LFFGDD---LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV--------VF 264
+ G LY + + P + L G + G LPV +
Sbjct: 254 VTLGGSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFRSTII 313
Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAK 292
DSG+S T L Y+ +K E +A+
Sbjct: 314 DSGASITTLPEEVYEA----VKAEFAAQ 337
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 65/131 (49%), Gaps = 13/131 (9%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN-- 98
+ G +G Y + IG PAR ++ LDTGSD+ WLQC PC C P++ PS+
Sbjct: 141 ISGTTQGSGEYFTRVGIGNPAREVYMVLDTGSDVNWLQC-TPCADCYHQTEPIFEPSSSS 199
Query: 99 --DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
+ + C+ P C +L N A C YE+ Y DG ++G + T G L
Sbjct: 200 SYEPLSCDTPQCNALEVSECRN----ATCLYEVSYGDGSYTVGDFATETL----TIGSTL 251
Query: 157 NPRLALGCGYN 167
+A+GCG++
Sbjct: 252 VQNVAVGCGHS 262
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 72/268 (26%), Positives = 110/268 (41%), Gaps = 32/268 (11%)
Query: 39 FQVHGNVYPT--GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP------ 90
F V G P+ G Y + +G P R ++ +DTGSD+ W+ C + C C +
Sbjct: 63 FPVKGTFDPSQVGLYYTKVKLGTPPRELYVQIDTGSDVLWVSCGS-CNGCPQTSGLQIQL 121
Query: 91 ---HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKD-- 144
P ++ L+ C D C S +C QC Y +Y DG + G V D
Sbjct: 122 NYFDPGSSSTSSLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLM 181
Query: 145 --AFAFNYTNGQRLNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLI 200
A F T + + GC Q S +DGI G G+ S++SQL SQ +
Sbjct: 182 HFASIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIA 241
Query: 201 RNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL- 257
V HCL G GGG L G+ + +V++ + +Y+ + + G+ +
Sbjct: 242 PRVFSHCLKGDNSGGGVLVLGEIV--EPNIVYSPLVPS-QPHYNLNLQSISVNGQIVRIA 298
Query: 258 -------KNLPVVFDSGSSYTYLNRVTY 278
N + DSG++ YL Y
Sbjct: 299 PSVFATSNNRGTIVDSGTTLAYLAEEAY 326
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 54/178 (30%), Positives = 83/178 (46%), Gaps = 10/178 (5%)
Query: 21 SSSSSSSSLFNHVGSSLLFQVHGNVYPT---GYYNVTMYIGQPARPYFLDLDTGSDLTWL 77
S S+ + S +++ ++ + + +V P + + IG P P L +DTGSDLTW+
Sbjct: 55 SKSTPAPSRLDNLWTTEIADIVSHVTPIPNPAAFLANISIGDPPVPQLLLIDTGSDLTWI 114
Query: 78 QCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHA-PGHHNCEDPAQCDYELEYADGGS 136
QC PC +C P + PS ++ HA P E C Y L Y D +
Sbjct: 115 QC-LPC-KCYPQTIPFFHPSRSSTYRNASCESAPHAMPQIFRDEKTGNCRYHLRYRDFSN 172
Query: 137 SLGVLVKDAFAFNYTN-GQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQ 193
+ G+L K+ F ++ G P + GCG + Y G+LGLG G SIV++
Sbjct: 173 TRGILAKEKLTFQTSDEGLISKPNIVFGCGQDNSGFTQY---SGVLGLGPGTFSIVTR 227
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 88/326 (26%), Positives = 130/326 (39%), Gaps = 41/326 (12%)
Query: 23 SSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP 82
S SSS+ +GSSL T Y +++ +G PA + +DTGSD++W+QC+ P
Sbjct: 108 SKVSSSVPTKLGSSL---------DTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCN-P 157
Query: 83 CVR--CVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS 136
C C L+ P+ V C CA L G+ +C Y ++Y DG +
Sbjct: 158 CPNPPCHAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYGDGST 217
Query: 137 SLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS 196
+ G +D + GC + + DG++GLG G S+VSQ +
Sbjct: 218 TNGTYSRDTLTL--SGASDAVKGFQFGCSH--LESGFSDQTDGLMGLGGGAQSLVSQ--T 271
Query: 197 QKLIRNVVGHCL-SGGGGGFLFFGDDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGE 253
N +CL G +S V T M S +Y + ++ GG+
Sbjct: 272 AAAYGNSFSYCLPPTSGSSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGK 331
Query: 254 TTGLKNLPVVF------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
GL P VF DSG+ T L Y L+S K + K + AP L C
Sbjct: 332 QLGLS--PSVFAAGSVVDSGTIITRLPPTAYSALSSAFKAGM--KQYRSAPARSILDTC- 386
Query: 308 KGRRPFKNVHDVKKCFRTLALSFTDG 333
F + T+AL F+ G
Sbjct: 387 -----FDFAGQTQISIPTVALVFSGG 407
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 58/152 (38%), Positives = 76/152 (50%), Gaps = 16/152 (10%)
Query: 51 YNVTMYIGQPARP--YFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS-NDLVP---CE 104
Y + IG P RP L++DTGSD+ W QC PC C P P + S +D V C
Sbjct: 92 YLIHFGIGTP-RPQQVALEVDTGSDVVWTQCR-PCFDCFTQPLPRFDTSASDTVHGVLCT 149
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
DPIC +L H C C Y++ Y D ++G L KD+F F+ G ++ P L G
Sbjct: 150 DPICRALRP---HACFL-GGCTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVFG 205
Query: 164 CG-YNQVPGASYHPLDGILGLGKGKSSIVSQL 194
CG YN G + GI G G+G S+ QL
Sbjct: 206 CGQYNT--GNFHSNETGIAGFGRGPLSLPRQL 235
>gi|449529194|ref|XP_004171586.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Cucumis sativus]
Length = 417
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 88/356 (24%), Positives = 146/356 (41%), Gaps = 72/356 (20%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVE---APHP------LYRP----SND 99
T+ +G P + + LDTGSDL W+ CD C RC +P+ +Y P ++
Sbjct: 6 TTVQLGTPGTKFMVALDTGSDLFWVPCD--CSRCAPTEGSPYASDFELSVYSPKKSSTSK 63
Query: 100 LVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADG-GSSLGVLVKDAFAFNYTN--GQR 155
VPC + +CA C + C Y + Y S+ G+L++D N +
Sbjct: 64 TVPCNNSLCAQ-----RDQCTEAFGNCPYVVSYVSAETSTTGILIEDLLHLKTENKHSEP 118
Query: 156 LNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
+ + GCG QV S+ + +G+ GLG + S+ S L + L+ N C S G
Sbjct: 119 IQAYITFGCG--QVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGLMANSFSMCFSDDG 176
Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKY--------YSPGVAELFFGGETTGLKNLPVVF 264
G + FGD S+ + T + Y+ V + G T ++ +F
Sbjct: 177 VGRINFGDK---------GSLEQEETPFNLNQLHPNYNITVTSIRV-GTTLIDADITALF 226
Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
DSG+S++Y Y L++ + P R PF+ +++
Sbjct: 227 DSGTSFSYFTDPIYSKLSASFHAQTRDGRHPPNP-----------RIPFEYCYNMSP--- 272
Query: 325 TLALSFTDGKTRTLFELTP----EAYLIISNKGNV--CLGILNGAEVGLQDLNVIG 374
S T G + T+ P + ++IS + + CL ++ AE LN+IG
Sbjct: 273 DANASLTPGISLTMKGGGPFPVYDPIIVISTQNELIYCLAVVKSAE-----LNIIG 323
>gi|356551638|ref|XP_003544181.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 880
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 74/266 (27%), Positives = 112/266 (42%), Gaps = 42/266 (15%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRC----------VEAPHPLYRPS----NDLVP 102
IG P + + LD GSD+ W+ CD C+ C ++ YRPS + +P
Sbjct: 111 IGTPNVSFLVALDAGSDMLWVPCD--CIECASLSAGNYNVLDRDLNQYRPSLSNTSRHLP 168
Query: 103 CEDPICASLHAPGHHNCE---DPAQCDYELEYADGG-SSLGVLVKDAFAFNYTNGQR--- 155
C +C H C+ DP C Y ++Y+ SS G + +D +NG+
Sbjct: 169 CGHKLCDV-----HSVCKGSKDP--CPYAVQYSSANTSSSGYVFEDKLHLT-SNGKHAEQ 220
Query: 156 --LNPRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
+ + LGCG Q + GA DG+LGLG G S+ S L LI+N C
Sbjct: 221 NSVQASIILGCGRKQTGEYLRGAGP---DGVLGLGPGNISVPSLLAKAGLIQNSFSICFE 277
Query: 210 GGGGGFLFFGDDLYDSSRVV-WTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGS 268
G + FGD + + + + + Y GV G + DSGS
Sbjct: 278 ENESGRIIFGDQGHVTQHSTPFLPIDGKFNAYIV-GVESFCVGSLCLKETRFQALIDSGS 336
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSL 294
S+T+L YQ + K+++A S+
Sbjct: 337 SFTFLPNEVYQKVVIEFDKQVNATSI 362
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 70/255 (27%), Positives = 113/255 (44%), Gaps = 32/255 (12%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G Y +T+ +G P + + + +DTGSDL W+QC PC C + P P + PS C
Sbjct: 37 GEYLMTLTLGSPPQSFDVIVDTGSDLNWVQC-LPCRVCYQQPGPKFDPSKSRSFRKAACT 95
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
D +C ++ A C C Y+ Y D ++ G L + + N G + P A GC
Sbjct: 96 DNLC-NVSALPLKACAANV-CQYQYTYGDQSNTNGDLAFETISLNNGAGTQSVPNFAFGC 153
Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC---LSGGGGGFLFFGDD 221
G + ++ G++GLG+G S+ SQL N +C L+ L FG
Sbjct: 154 GTQNL--GTFAGAAGLVGLGQGPLSLNSQLS--HTFANKFSYCLVSLNSLSASPLTFG-S 208
Query: 222 LYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------------DS 266
+ ++ + +TS+ ++ + YY + + GG+ L P VF DS
Sbjct: 209 IAAAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLA--PSVFAIDQSTGRGGTIIDS 266
Query: 267 GSSYTYLNRVTYQTL 281
G++ T L Y +
Sbjct: 267 GTTITMLTLPAYSAV 281
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 86/325 (26%), Positives = 128/325 (39%), Gaps = 36/325 (11%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
G Y + +G PA Y + +DTGS LTWLQC V C P++ P V C
Sbjct: 129 GNYVTRLGLGTPATSYVMVVDTGSSLTWLQCSPCSVSCHRQAGPVFDPRASGTYAAVQCS 188
Query: 105 DPICASLHAP--GHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
C L A C C Y+ Y D S+G L KD +F G P
Sbjct: 189 SSECGELQAATLNPSACSVSNVCIYQASYGDSSYSVGYLSKDTVSF----GSGSFPGFYY 244
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGD 220
GCG + + G++GL K K S++ QL + +CL S G+L G
Sbjct: 245 GCGQDNE--GLFGRSAGLIGLAKNKLSLLYQLAPS--LGYAFSYCLPTSSAAAGYLSIGS 300
Query: 221 DLYDSSRVVWTSMSS---DYTKYYSP----GVAELFFGGETTGLKNLPVVFDSGSSYTYL 273
Y+ + +T M+S D + Y+ VA + ++LP + DSG+ T L
Sbjct: 301 --YNPGQYSYTPMASSSLDASLYFVTLSGISVAGAPLAVPPSEYRSLPTIIDSGTVITRL 358
Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
Y L+ + +++ + + L C++G V V ++F G
Sbjct: 359 PPNVYTALSRAVAAAMASAAPRAP-TYSILDTCFRGSAAGLRVPRVD-------MAFAGG 410
Query: 334 KTRTLFELTPEAYLIISNKGNVCLG 358
T L+P LI + CL
Sbjct: 411 AT---LALSPGNVLIDVDDSTTCLA 432
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 101/343 (29%), Positives = 148/343 (43%), Gaps = 44/343 (12%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC---VRCVEAPHPLYRPSND----LVPC 103
+ V + +G PA+P L DTGSDL+W+QC PC C PL+ PS V C
Sbjct: 149 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQ-PCGSSGHCHPQQDPLFDPSKSSTYAAVHC 207
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+P CA A G ED C Y + Y DG S+ GVL +D A + P G
Sbjct: 208 GEPQCA---AAGGLCSEDNTTCLYLVHYGDGSSTTGVLSRDTLALTSSRALAGFP---FG 261
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFG-D 220
CG + + +DG+LGLG+G+ S+ SQ + V +CL S G+L G
Sbjct: 262 CGTRNL--GDFGRVDGLLGLGRGELSLPSQAAAS--FGAVFSYCLPSSNSTTGYLTIGAT 317
Query: 221 DLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------DSGSSYT 271
D+ +T+M + +Y + + GG L P VF DSG+ T
Sbjct: 318 PATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYI--LPVPPAVFTRGGTLLDSGTVLT 375
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
YL Y+ L + L+ + AP ++ L C+ F +V ++ F
Sbjct: 376 YLPAQAYELLRD--RFRLTMERYTPAPPNDVLDACYD----FAGESEV--IVPAVSFRFG 427
Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
DG +FEL +I ++ CL + G L++IG
Sbjct: 428 DGA---VFELDFFGVMIFLDENVGCLA-FAAMDAGGLPLSIIG 466
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 104/376 (27%), Positives = 160/376 (42%), Gaps = 75/376 (19%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP---- 96
V G +G Y + +G P P + LDTGSD+ WLQC APC RC + ++ P
Sbjct: 137 VSGLAQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQC-APCRRCYDQSGQMFDPRASH 195
Query: 97 SNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
S V C P+C L + G C+ C Y++ Y DG + G + F +G R
Sbjct: 196 SYGAVDCAAPLCRRLDSGG---CDLRRKACLYQVAYGDGSVTAGDFATETLTF--ASGAR 250
Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL------- 208
+ PR+ALGCG++ + G+LGLG+G S SQ+ S++ R+ +CL
Sbjct: 251 V-PRVALGCGHDNE--GLFVAAAGLLGLGRGSLSFPSQI-SRRFGRS-FSYCLVDRTSSS 305
Query: 209 --SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFF----GGETTGLKNLP- 261
+ + FG S V S ++ +T E F+ G + G +P
Sbjct: 306 ASATSRSSTVTFG------SGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPG 359
Query: 262 ----------------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
V+ DSG+S T L R Y L + +A L+ +P +L
Sbjct: 360 VAVSDLRLDPSTGRGGVIVDSGTSVTRLARPAYAALRDAFRA--AAAGLRLSPGGFSL-- 415
Query: 306 CWKGRRPFKNVHDVK--KCFR--TLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGIL 360
F +D+ K + T+++ F G L PE YLI + ++G C
Sbjct: 416 -------FDTCYDLSGLKVVKVPTVSMHFAGGAEAA---LPPENYLIPVDSRGTFCFA-F 464
Query: 361 NGAEVGLQDLNVIGGI 376
G + G +++IG I
Sbjct: 465 AGTDGG---VSIIGNI 477
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 74/282 (26%), Positives = 121/282 (42%), Gaps = 30/282 (10%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCE 104
T Y +++ +G PA+ +++DTGS +W+ C+ C C P + + V C
Sbjct: 79 TSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCG 136
Query: 105 DPICASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+C + H + E+ C + + Y DG +S G+L +D F ++ Q++ P G
Sbjct: 137 TSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFG 193
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL- 222
C + + +DG+LG+G G S++ Q + + +CL FF
Sbjct: 194 CNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRF---DGFSYCLPLQKSERGFFSKTTG 250
Query: 223 YDSSRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGS 268
Y S V T YTK + ELFF GE GL VVFDSGS
Sbjct: 251 YFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 310
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
+Y+ L+ +++ L + A E+E+ C+ R
Sbjct: 311 ELSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 349
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 56/157 (35%), Positives = 79/157 (50%), Gaps = 12/157 (7%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVE---AP--HPLYRPSNDLVP 102
T Y + + +G P RP L LDTGSDL W QC APC+ C E AP P ++ +P
Sbjct: 87 TNEYLMHVSVGTPPRPVALTLDTGSDLVWTQC-APCLDCFEQGAAPVLDPAASSTHAALP 145
Query: 103 CEDPICASL--HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF--NYTNGQRLNP 158
C+ P+C +L + G + D + C Y Y D ++G L D+F F + G
Sbjct: 146 CDAPLCRALPFTSCGGRSWGDRS-CVYVYHYGDRSLTVGQLATDSFTFGGDDNAGGLAAR 204
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLH 195
R+ GCG+ G GI G G+G+ S+ SQL+
Sbjct: 205 RVTFGCGHIN-KGIFQANETGIAGFGRGRWSLPSQLN 240
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 69/153 (45%), Gaps = 10/153 (6%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDP 106
Y + + IG+P P+ DTGSDLTW QC PC C P+Y PS +PC
Sbjct: 71 YLMELAIGKPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPVYDPSASSTFSPLPCSSA 129
Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C + + NC + C Y Y DG S G+L + ++ +A GCG
Sbjct: 130 TCLPIWS---RNCTPSSLCRYRYAYGDGAYSAGILGTETLTLGPSSAPVSVGGVAFGCGT 186
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
+ G G +GLG+G S+++QL K
Sbjct: 187 DN--GGDSLNSTGTVGLGRGTLSLLAQLGVGKF 217
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 144/339 (42%), Gaps = 63/339 (18%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G Y + + G P + + +DTGSDL W QC PC C A ++ P + D V C
Sbjct: 78 GEYLIDISFGSPPQKASVIVDTGSDLIWTQC-LPCETCNAAASVIFDPVKSSTYDTVSCA 136
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
C+SL +C C Y+ Y DG S+ G L + T G P +A GC
Sbjct: 137 SNFCSSLP---FQSCT--TSCKYDYMYGDGSSTSGALSTET----VTVGTGTIPNVAFGC 187
Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG---GFLFFGDD 221
G+ + S+ GI+GLG+G S++SQ S + +CL G + GD
Sbjct: 188 GHTNL--GSFAGAAGIVGLGQGPLSLISQASS--ITSKKFSYCLVPLGSTKTSPMLIGDS 243
Query: 222 LYDSSRVVWTSM---SSDYTKYYS-------PGVAELF----FGGETTGLKNLPVVFDSG 267
+ V +T++ +++ T YY+ G A + F + +G + DSG
Sbjct: 244 A-AAGGVAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGF--ILDSG 300
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
++ TYL + L + +K E+ PE + +++ + CF T
Sbjct: 301 TTLTYLETGAFNALVAALKAEV------PFPEAD------------GSLYGLDYCFSTAG 342
Query: 328 LSFTDGKTRTL------FELTPE-AYLIISNKGNVCLGI 359
++ T T +EL PE ++ + G++CL +
Sbjct: 343 VANPTYPTMTFHFKGADYELPPENVFVALDTGGSICLAM 381
>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 64/180 (35%), Positives = 84/180 (46%), Gaps = 15/180 (8%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y V + G PAR Y + +DTGS L+WLQC V C PL+ PS + C
Sbjct: 115 SGNYYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSC 174
Query: 104 EDPICASLHAPGHHN--CEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
C+SL +N CE + C Y Y D S+G L +D Q L P
Sbjct: 175 TSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTL--APSQTL-PGF 231
Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFFG 219
GCG Q + GILGLG+ K S++ Q+ S+ +CL + GGGGFL G
Sbjct: 232 VYGCG--QDSDGLFGRAAGILGLGRNKLSMLGQVSSK--FGYAFSYCLPTRGGGGFLSIG 287
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 73/282 (25%), Positives = 112/282 (39%), Gaps = 37/282 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y + +G P R ++ LDTGSD+ WLQC +PC +C P++ P +PC
Sbjct: 107 SGEYFTRLGVGTPPRYLYMVLDTGSDVVWLQC-SPCRKCYSQSDPIFNPYKSKSFAGIPC 165
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
P+C L + G C Y++ Y DG + G + F G ++ ++ALG
Sbjct: 166 SSPLCRRLDSSGCSTRRH--TCLYQVSYGDGSFTTGDFATETLTF---RGNKI-AKVALG 219
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIR--NVVGHCL----SGGGGGFLF 217
CG++ L G SQ IR + +CL + +
Sbjct: 220 CGHHN------EGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMV 273
Query: 218 FGDDLYDS-SRVVWTSMSSDYTKYYSPGVAELFFGG-ETTGLK----------NLPVVFD 265
FGD +R + +Y G+ + GG G+ N V+ D
Sbjct: 274 FGDAAISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGNGGVIID 333
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
SG+S T L R Y L + + A+ LK PE C+
Sbjct: 334 SGTSVTRLTRPAYTALRDAFR--VGARHLKRGPEFSLFDTCY 373
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 93/362 (25%), Positives = 153/362 (42%), Gaps = 62/362 (17%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND- 99
V G +G Y + +G PA+ +L LDTGSD+ W+QC+ PC C + P++ P++
Sbjct: 152 VSGVSQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCE-PCSDCYQQSDPVFNPTSSS 210
Query: 100 ---LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
+ C P C+ L + +C Y++ Y DG ++G L D F N ++
Sbjct: 211 TYKSLTCSAPQCSLLETSACRS----NKCLYQVSYGDGSFTVGELATDTVTFG--NSGKI 264
Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGG 212
N +ALGCG++ + G+LGLG G SI +Q+ + +CL SG
Sbjct: 265 ND-VALGCGHDN--EGLFTGAAGLLGLGGGALSITNQMKATSF-----SYCLVDRDSGKS 316
Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----------V 262
F L + +Y G++ GG+ + + V
Sbjct: 317 SSLDFNSVQLGSGDATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDASGSGGV 376
Query: 263 VFDSGSSYTYLNRVTYQT-------LTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKN 315
+ D G++ T L Y + LT+ +KK S+ SL + C+ F +
Sbjct: 377 ILDCGTAVTRLQTQAYNSLRDAFLKLTTNLKKGTSSISLFDT--------CYD----FSS 424
Query: 316 VHDVKKCFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIG 374
+ VK T+A FT GK+ +L + YLI + + G C + L++IG
Sbjct: 425 LSSVK--VPTVAFHFTGGKS---LDLPAKNYLIPVDDNGTFCFAFAPTSS----SLSIIG 475
Query: 375 GI 376
+
Sbjct: 476 NV 477
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 94/358 (26%), Positives = 143/358 (39%), Gaps = 51/358 (14%)
Query: 33 VGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA----PCVRC 86
VG + F V G P G Y + +G P R + + +DTGSD+ W+ C + P
Sbjct: 64 VGGVVNFPVDGASDPFLVGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSE 123
Query: 87 VEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLV 142
++ + P S LV C D C S + C C Y +Y DG + G +
Sbjct: 124 LQIQLSFFDPGVSSSASLVSCSDRRCYS-NFQTESGCSPNNLCSYSFKYGDGSGTSGFYI 182
Query: 143 KDAFAFNYTNGQRL----NPRLALGCGYNQVPGASYHP---LDGILGLGKGKSSIVSQLH 195
D +F+ L + GC N G P +DGI GLG+G S++SQL
Sbjct: 183 SDFMSFDTVITSTLAINSSAPFVFGCS-NLQTGDLQRPRRAVDGIFGLGQGSLSVISQLA 241
Query: 196 SQKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE 253
Q L V HCL G GGG + G V+T + +Y+ + + G+
Sbjct: 242 VQGLAPRVFSHCLKGDKSGGGIMVLGQ--IKRPDTVYTPLVPS-QPHYNVNLQSIAVNGQ 298
Query: 254 TTGLKNLPVVF----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETL 303
+ P VF D+G++ YL Y +++ A
Sbjct: 299 ILPID--PSVFTIATGDGTIIDTGTTLAYLPDEAYSPFI---------QAIANAVSQYGR 347
Query: 304 PLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYL-IISNKGNV--CLG 358
P+ ++ + F+ F ++LSF G + L P AYL I S+ G+ C+G
Sbjct: 348 PITYESYQCFEITAGDVDVFPEVSLSFAGGASMV---LRPHAYLQIFSSSGSSIWCIG 402
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 65/131 (49%), Gaps = 13/131 (9%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL 100
V G +G Y + IG+P P ++ LDTGSD++W+QC APC C E P++ P++
Sbjct: 141 VSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQC-APCAECYEQTDPIFEPTSSA 199
Query: 101 ----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
+ CE C SL N C YE+ Y DG ++G V + T+
Sbjct: 200 SFTSLSCETEQCKSLDVSECRN----GTCLYEVSYGDGSYTVGDFVTETVTLGSTSLG-- 253
Query: 157 NPRLALGCGYN 167
+A+GCG+N
Sbjct: 254 --NIAIGCGHN 262
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 69/268 (25%), Positives = 114/268 (42%), Gaps = 28/268 (10%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLV----PCE 104
G Y +++ +G P DTGSDL W QC PC RC + PL+ P + C+
Sbjct: 93 GEYLMSLSLGTPPFKIMGIADTGSDLIWTQCK-PCERCYKQVDPLFDPKSSKTYRDFSCD 151
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
C+ L C C Y+ Y D ++G + D + T G ++ P+ +G
Sbjct: 152 ARQCSLLD---QSTCSGNI-CQYQYSYGDRSYTMGNVASDTITLDSTTGSPVSFPKTVIG 207
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFF 218
CG+ G GI+GLG G S++SQ+ S + +CL G L F
Sbjct: 208 CGHEN-DGTFSDKGSGIVGLGAGPLSLISQMGSS--VGGKFSYCLVPLSSRAGNSSKLNF 264
Query: 219 GDDLYDSSRVVWT-------SMSSDY---TKYYSPGVAELFFGGETTGLKNLPVVFDSGS 268
G + S V + +MSS Y + S G + FG + G ++ DSG+
Sbjct: 265 GSNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTGEGNIIIDSGT 324
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKE 296
+ T + + L++ + ++ + ++
Sbjct: 325 TLTIVPDDFFSNLSTAVGNQVEGRRAED 352
>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 389
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 69/265 (26%), Positives = 112/265 (42%), Gaps = 21/265 (7%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-VEAPHPLYRPSNDLVPCEDPICA 109
+ ++ G P + FL +DTGS LTW QC PC C + +P YRP+ + D +C
Sbjct: 58 FMAEIHFGSPQKKQFLHMDTGSSLTWTQC-FPCSDCYAQKIYPKYRPAASIT-YRDAMCE 115
Query: 110 SLHAPGH-HNCEDPAQ--CDYELEYADGGSSLGVLVKDAFAFNYTNG--QRLNPRLALGC 164
H + H DP C Y+ Y D + G L ++ + +G +R++ + GC
Sbjct: 116 DSHPKSNPHFAFDPLTRICTYQQHYLDETNIKGTLAQEMITVDTHDGGFKRVH-GVYFGC 174
Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYD 224
N + SY GILGLG GK SI+ + S+ +G L GD
Sbjct: 175 --NTLSDGSYFTGTGILGLGVGKYSIIGEFGSK--FSFCLGEISEPKASHNLILGDGANV 230
Query: 225 SSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSI 284
+++ +T + + + G E T + V D+GS+ ++L+ Y
Sbjct: 231 QGHPTVINITEGHTIF---QLESIIVGEEITLDDPVQVFVDTGSTLSHLSTNLYYKFVDA 287
Query: 285 MKKELSAKSLKEAPEDETLPLCWKG 309
+ ++ L P LC+K
Sbjct: 288 FDDLIGSRPLSYEPT-----LCYKA 307
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 98/354 (27%), Positives = 151/354 (42%), Gaps = 51/354 (14%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
+G Y + + +G P R +L +DTGSD+ WLQC APCV C ++ P + + C
Sbjct: 34 SGEYFIRVSVGTPPRGMYLVMDTGSDILWLQC-APCVSCYHQCDEVFDPYKSSTYSTLGC 92
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN--GQRLNPRLA 161
C +L G +C Y+++Y DG S G DA + N T+ GQ + ++
Sbjct: 93 NSRQCLNLDVGGCVG----NKCLYQVDYGDGSFSTGEFATDAVSLNSTSGGGQVVLNKIP 148
Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG-----GFL 216
LGCG++ + G+LGLGKG S +Q++S+ R +CL+G L
Sbjct: 149 LGCGHDNE--GYFVGAAGLLGLGKGPLSFPNQINSENGGR--FSYCLTGRDTDSTERSSL 204
Query: 217 FFGDDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGG----------ETTGLKNLPVVF 264
FGD + V +T +S+ + +Y + + GG + L N V+
Sbjct: 205 IFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQLDSLGNGGVII 264
Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC-F 323
DSG+S T L Y +L + S L E C+ N+ D+
Sbjct: 265 DSGTSVTRLQNAAYASLREAFRAGTS--DLVLTTEFSLFDTCY-------NLSDLSSVDV 315
Query: 324 RTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
T+ L F G +L YL+ + N CL A G ++IG I
Sbjct: 316 PTVTLHFQGGAD---LKLPASNYLVPVDNSSTFCL-----AFAGTTGPSIIGNI 361
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 74/275 (26%), Positives = 118/275 (42%), Gaps = 38/275 (13%)
Query: 14 TVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSD 73
T+ + S++SSS +FN + L V+ T Y + + IG P LDTGS+
Sbjct: 31 TIDLIHRRSNASSSRVFN---TQLGSPYADTVFDTYEYLMKLQIGTPPFEIEAVLDTGSE 87
Query: 74 LTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYAD 133
W QC PCV C P++ PS E + H + C YEL Y
Sbjct: 88 HIWTQC-LPCVHCYNQTAPIFDPSKSSTFKE------IRCDTHDH-----SCPYELVYGG 135
Query: 134 GGSSLGVLVKDAFAFNYTNGQR-LNPRLALGCGYNQVPGASYHP-LDGILGLGKGKSSIV 191
+ G LV + + T+GQ + P +GCG N + + P G++GL +G S++
Sbjct: 136 KSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRNN---SGFKPGFAGVVGLDRGPKSLI 192
Query: 192 SQLHSQKLIRNVVGHCLSGGGGGFLFFG-DDLYDSSRVVWTSMSSDYTKYYSPGVAELFF 250
+Q+ + ++ +C +G G + FG + + VV T++ + K PG L
Sbjct: 193 TQMGGEY--PGLMSYCFAGKGTSKINFGANAIVAGDGVVSTTV---FVKTAKPGFYYLNL 247
Query: 251 GGETTGLKNLP------------VVFDSGSSYTYL 273
+ G + +V DSGS+ TY
Sbjct: 248 DAVSVGNTRIETVGTPFHALKGNIVIDSGSTLTYF 282
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 83/327 (25%), Positives = 132/327 (40%), Gaps = 39/327 (11%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G Y + +G P+ Y + +DTGS LTWLQC V C PL+ P + V C
Sbjct: 132 GNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYTSVRCS 191
Query: 105 DPICASLHAP--GHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
C L A C C Y+ Y D S+G L D +F T+ P
Sbjct: 192 ASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGYLSTDTVSFGSTS----YPSFYY 247
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFFGDD 221
GCG + + G++GL + K S++ QL + +CL + G+L G
Sbjct: 248 GCGQDN--EGLFGRSAGLIGLARNKLSLLYQLAPS--LGYSFSYCLPTAASTGYLSIGP- 302
Query: 222 LYDS----SRVVWTSMSSDYTKYYSPGVAELFFGGETTGL-----KNLPVVFDSGSSYTY 272
Y++ S S S D + Y+ ++ + GG + +LP + DSG+ T
Sbjct: 303 -YNTGHYYSYTPMASSSLDASLYFI-TLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITR 360
Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
L + L+ + + ++ + AP L C++G+ V T+ ++F
Sbjct: 361 LPTAVHTALSKAVAQAMAGA--QRAPAFSILDTCFEGQASQLRV-------PTVVMAFAG 411
Query: 333 GKTRTLFELTPEAYLIISNKGNVCLGI 359
G + +LT LI + CL
Sbjct: 412 GAS---MKLTTRNVLIDVDDSTTCLAF 435
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 55/154 (35%), Positives = 71/154 (46%), Gaps = 11/154 (7%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY----RPSNDLVPCEDP 106
Y + + IG P P+ DTGSDLTW QC PC C P+Y S VPC
Sbjct: 93 YLMELAIGTPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPIYDTAVSSSFSPVPCASA 151
Query: 107 ICASLHAPGHHNC-EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
C + + NC + C Y Y DG S GVL + F G + +A GCG
Sbjct: 152 TCLPIWS--SRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPGAPGVSVG-GIAFGCG 208
Query: 166 YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
+ G SY+ G +GLG+G S+V+QL K
Sbjct: 209 VDN-GGLSYNS-TGTVGLGRGSLSLVAQLGVGKF 240
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 67/132 (50%), Gaps = 12/132 (9%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP---- 96
V G +G Y + +G P+ P + LDTGSD+ WLQC APC RC + P++ P
Sbjct: 130 VSGLAQGSGEYFTKIGVGTPSTPALMVLDTGSDVVWLQC-APCRRCYDQSGPVFDPRRSS 188
Query: 97 SNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
S V C P+C L + G C+ C Y++ Y DG + G + F G R
Sbjct: 189 SYGAVDCAAPLCRRLDSGG---CDLRRRACLYQVAYGDGSVTAGDFATETLTF--AGGAR 243
Query: 156 LNPRLALGCGYN 167
+ R+ALGCG++
Sbjct: 244 VA-RVALGCGHD 254
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 101/343 (29%), Positives = 147/343 (42%), Gaps = 44/343 (12%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC---VRCVEAPHPLYRPSND----LVPC 103
+ V + +G PA+P L DTGSDL+W+QC PC C PL+ PS V C
Sbjct: 144 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQ-PCGSSGHCHPQQDPLFDPSKSSTYAAVHC 202
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+P CA A G ED C Y + Y DG S+ GVL +D A + P G
Sbjct: 203 GEPQCA---AAGDLCSEDNTTCLYLVRYGDGSSTTGVLSRDTLALTSSRALTGFP---FG 256
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFG-D 220
CG + + +DG+LGLG+G+ S+ SQ + V +CL S G+L G
Sbjct: 257 CGTRNL--GDFGRVDGLLGLGRGELSLPSQAAAS--FGAVFSYCLPSSNSTTGYLTIGAT 312
Query: 221 DLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------DSGSSYT 271
D+ +T+M + +Y + + GG L P VF DSG+ T
Sbjct: 313 PATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYV--LPVPPAVFTRGGTLLDSGTVLT 370
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
YL Y L + L+ + AP ++ L C+ F +V ++ F
Sbjct: 371 YLPAQAYALLRD--RFRLTMERYTPAPPNDVLDACYD----FAGESEV--VVPAVSFRFG 422
Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIG 374
DG +FEL +I ++ CL + G L++IG
Sbjct: 423 DGA---VFELDFFGVMIFLDENVGCLA-FAAMDTGGLPLSIIG 461
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 73/275 (26%), Positives = 118/275 (42%), Gaps = 38/275 (13%)
Query: 14 TVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSD 73
T+ + S++SSS +FN + L V+ T Y + + IG P LDTGS+
Sbjct: 25 TIDLIHRRSNASSSRVFN---TQLGSPYADTVFDTYEYLMKLQIGTPPFEIEAVLDTGSE 81
Query: 74 LTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYAD 133
W QC PCV C P++ PS + + H + C YEL Y
Sbjct: 82 HIWTQC-LPCVHCYNQTAPIFDPS------KSSTFKEIRCDTHDH-----SCPYELVYGG 129
Query: 134 GGSSLGVLVKDAFAFNYTNGQR-LNPRLALGCGYNQVPGASYHP-LDGILGLGKGKSSIV 191
+ G LV + + T+GQ + P +GCG N + + P G++GL +G S++
Sbjct: 130 KSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRNN---SGFKPGFAGVVGLDRGPKSLI 186
Query: 192 SQLHSQKLIRNVVGHCLSGGGGGFLFFG-DDLYDSSRVVWTSMSSDYTKYYSPGVAELFF 250
+Q+ + ++ +C +G G + FG + + VV T++ + K PG L
Sbjct: 187 TQMGGEY--PGLMSYCFAGKGTSKINFGANAIVAGDGVVSTTV---FVKTAKPGFYYLNL 241
Query: 251 GGETTGLKNLP------------VVFDSGSSYTYL 273
+ G + +V DSGS+ TY
Sbjct: 242 DAVSVGNTRIETVGTPFHALKGNIVIDSGSTLTYF 276
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 99/351 (28%), Positives = 149/351 (42%), Gaps = 59/351 (16%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y + ++IG P + Y L LDTGSDL W+QC PC C E P Y P + C
Sbjct: 87 SGEYFMDVFIGTPPKHYSLILDTGSDLNWIQC-VPCHDCFEQNGPYYDPKESSSFRNIGC 145
Query: 104 EDPICASLHAPGHH-NCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTN--GQRLNPR 159
DP C + +P C+ Q C Y Y D ++ G + F N T+ G+ R
Sbjct: 146 HDPRCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKR 205
Query: 160 LA---LGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
+ GCG +N+ +H G+LGLG+G S SQL Q L + +CL
Sbjct: 206 VENVMFGCGHWNR---GLFHGASGLLGLGRGPLSFSSQL--QSLYGHSFSYCLVDRNSDT 260
Query: 216 -----LFFGD--DLYDSSRVVWTSM----SSDYTKYYSPGVAELFFGGETTGLKNLP--- 261
L FG+ DL + + +T++ + +Y + + GGE + N+P
Sbjct: 261 NVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGE---VLNIPEST 317
Query: 262 ----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRR 311
+ DSG++ +Y YQ + K+ K +K P + P+
Sbjct: 318 WNMTSDGVGGTIVDSGTTLSYFTEPAYQII-----KDAFVKKVKGYPIVQDFPIL----D 368
Query: 312 PFKNVHDVKKC-FRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGIL 360
P NV V+K + F DG ++ E Y I + + VCL IL
Sbjct: 369 PCYNVSGVEKIDLPDFGILFADG---AVWNFPVENYFIRLDPEEVVCLAIL 416
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 90/346 (26%), Positives = 138/346 (39%), Gaps = 48/346 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
T Y +T+ G P + + DTGS++ W+QC V C PL+ P+ + C
Sbjct: 13 TANYVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFDPTLSSTYRNISC 72
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
C L + G C + C Y + Y DG S++G L + F G N G
Sbjct: 73 TSAACTGLSSRG---CSG-STCVYGVTYGDGSSTVGFLATETFTL--AAGNVFN-NFIFG 125
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDD 221
CG N + G++GLG+ S+ SQL + + N+ +CL + G+L G+
Sbjct: 126 CGQNN--QGLFTGAAGLIGLGRSPYSLNSQLATS--LGNIFSYCLPSTSSATGYLNIGNP 181
Query: 222 LYDSSRVVWTSMSSDYTKYY------SPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNR 275
L + S T Y+ S G L +T +++ + DSG+ T L
Sbjct: 182 LRTPGYTAMLTNSRAPTLYFIDLIGISVGGTRLAL--SSTVFQSVGTIIDSGTVITRLPP 239
Query: 276 VTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKT 335
Y L + + ++ + A L C+ R F T+ L +T
Sbjct: 240 TAYGALRTAFRAAMTQYT--RAAAASILDTCYDFSR------TTTVTFPTIKLHYTG--- 288
Query: 336 RTLFELTPEA---YLIISNKGNVCLGILNGAEVGLQDLNVIGGIGD 378
L P A Y+I S++ VCL A G D IG IG+
Sbjct: 289 --LDVTIPGAGVFYVISSSQ--VCL-----AFAGNSDSTQIGIIGN 325
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 84/327 (25%), Positives = 137/327 (41%), Gaps = 38/327 (11%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCEDP 106
Y VT+ IG R + +DTGSDLTW+QC PC C PL+ PS + C
Sbjct: 67 YIVTVEIG--GRNMTVIVDTGSDLTWVQCQ-PCRLCYNQQDPLFNPSGSPSYQTILCNSS 123
Query: 107 ICASL-HAPGHHNC--EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
C SL +A G+ + C+Y + Y DG + G L + T+ G
Sbjct: 124 TCQSLQYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLNLGTTHVS----NFIFG 179
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFGD 220
CG N + G++GLGK S+VSQ + + V +CL + G L G
Sbjct: 180 CGRNN--KGLFGGASGLMGLGKSDLSLVSQ--TSAIFEGVFSYCLPTTAADASGSLILGG 235
Query: 221 D---LYDSSRVVWTSMSSD--YTKYYSPGVAELFFGG---ETTGLKNLPVVFDSGSSYTY 272
+ +++ + +T M ++ +Y + + GG + + ++ DSG+ T
Sbjct: 236 NSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQAPNYRQSGILIDSGTVITR 295
Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTD 332
L Y+ L + K+ S AP L C+ N +D + T+ + F +
Sbjct: 296 LPPPVYRDLKAEFLKQFSG--FPSAPPFSILDTCFN-----LNGYD-EVDIPTIRMQF-E 346
Query: 333 GKTRTLFELTPEAYLIISNKGNVCLGI 359
G ++T Y + ++ VCL +
Sbjct: 347 GNAELTVDVTGIFYFVKTDASQVCLAL 373
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 128/329 (38%), Gaps = 42/329 (12%)
Query: 45 VYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCE 104
V+ Y + + +G P ++DTGSDL W QC PC C P++ PS E
Sbjct: 55 VFDYSIYLMRLQLGTPPFEIVAEIDTGSDLIWTQC-MPCPNCYTQFAPIFDPSKSSTFKE 113
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR-LNPRLALG 163
H N C YE+ YAD S G+L + T+G+ + ++G
Sbjct: 114 KRC--------HGN-----SCPYEIIYADESYSTGILATETVTIQSTSGEPFVMAETSIG 160
Query: 164 CGYN----QVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFG 219
CG N PG + GI+GL G SS++SQ+ I ++ +C S G + FG
Sbjct: 161 CGLNNSNLMTPGYAASS-SGIVGLNMGPSSLISQMDLP--IPGLISYCFSSQGTSKINFG 217
Query: 220 DDLY---DSSRVVWTSMSSDYTKYY------SPGVAELFFGGETTGLKNLPVVFDSGSSY 270
+ D + + D YY S G + G ++ + DSG++Y
Sbjct: 218 TNAVVAGDGTVAADMFIKKDQPFYYLNLDAVSVGDKRIETLGTPFHAQDGNIFIDSGTTY 277
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
TYL + + + A + P E L LC+ D + F + L F
Sbjct: 278 TYLPTSYCNLVREAVAASVVAANQVPDPSSENL-LCYN--------WDTMEIFPVITLHF 328
Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGI 359
G L + Y+ G CL I
Sbjct: 329 AGGADLVLDKY--NMYVETITGGTFCLAI 355
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 101/357 (28%), Positives = 148/357 (41%), Gaps = 71/357 (19%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y + ++IG P R + L LDTGSDL W+QC PC C P Y P + C
Sbjct: 189 SGEYFMDVFIGTPPRHFSLILDTGSDLNWIQC-VPCYDCFVQNGPYYDPKESSSFKNIGC 247
Query: 104 EDPICASLHAPGHHNCEDPAQ--------CDYELEYADGGSSLGVLVKDAFAFNYTN--G 153
DP C + +P DP Q C Y Y D ++ G + F N T+ G
Sbjct: 248 HDPRCHLVSSP------DPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAG 301
Query: 154 QRLNPRLA---LGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS 209
+ R+ GCG +N+ +H G+LGLG+G S SQL Q L + +CL
Sbjct: 302 KSEFKRVENVMFGCGHWNR---GLFHGAAGLLGLGRGPLSFSSQL--QSLYGHSFSYCLV 356
Query: 210 GGG-----GGFLFFGD--DLYDSSRVVWTSM----SSDYTKYYSPGVAELFFGGETTGLK 258
L FG+ DL + V +TS+ + +Y + + GGE +
Sbjct: 357 DRNSDTNVSSKLIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVLKIP 416
Query: 259 NLP----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
+ DSG++ +Y +Y+ + K+ K +K P + P+
Sbjct: 417 EETWHLSPEGAGGTIVDSGTTLSYFAEPSYEII-----KDAFVKKVKGYPVIKDFPIL-- 469
Query: 309 GRRPFKNVHDVKKC----FRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGIL 360
P NV V+K FR L F DG ++ E Y I + + VCL IL
Sbjct: 470 --DPCYNVSGVEKMELPEFRIL---FEDG---AVWNFPVENYFIKLEPEEIVCLAIL 518
>gi|255637574|gb|ACU19113.1| unknown [Glycine max]
Length = 290
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 62/202 (30%), Positives = 86/202 (42%), Gaps = 21/202 (10%)
Query: 39 FQVHGNVYPT--GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP------ 90
F V G P+ G Y + +G P R ++ +DTGSD+ W+ C + C C +
Sbjct: 63 FPVKGTFDPSQVGLYYTKVKLGTPPRELYVQIDTGSDVLWVSCGS-CNGCPQTSGLQIQL 121
Query: 91 ---HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKD-- 144
P ++ L+ C D C S +C QC Y +Y DG + G V D
Sbjct: 122 NYFDPGSSSTSSLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLM 181
Query: 145 --AFAFNYTNGQRLNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLI 200
A F T + + GC Q S +DGI G G+ S++SQL SQ +
Sbjct: 182 HFASIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIA 241
Query: 201 RNVVGHCLSG--GGGGFLFFGD 220
V HCL G GGG L G+
Sbjct: 242 PRVFSHCLKGDNSGGGVLVLGE 263
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 84/342 (24%), Positives = 143/342 (41%), Gaps = 43/342 (12%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G Y + M IG P + DTGSDLTW+QC PC C PL+ PS + C
Sbjct: 92 GEYFMKMSIGTPLVEVIVIADTGSDLTWVQC-LPCDPCYRQKSPLFDPSRSSSYRHMLCG 150
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ--RLNPRLAL 162
C +L D C+Y Y D + G L + F T+ + L+P +
Sbjct: 151 SRFCNALDVSEQACTMDTNICEYHYSYGDKSYTNGNLATEKFTIGSTSSRPVHLSP-IVF 209
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKS-SIVSQLHSQKLIRNVVGHCL------SGGGGGF 215
GCG G ++ L + G + S+VSQL S +I+ +CL S
Sbjct: 210 GCGTGN--GGTFDELGSGIVGLGGGALSLVSQLSS--IIKGKFSYCLVPLSEQSNVTSKI 265
Query: 216 LFFGDDLYDSSRVVWTSMSSDY-TKYYSPGVAELFFGGE----TTGLKN-----LPVVFD 265
F D + +VV T + S YY + + G + T GL N V+ D
Sbjct: 266 KFGTDSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGNKRLPYTNGLLNGNVEKGNVIID 325
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
SG++ T+L+ + L ++++ + A+ + + +C F++ D+
Sbjct: 326 SGTTLTFLDSEFFTELERVLEETVKAERVSDP--RGLFSVC------FRSAGDID--LPV 375
Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGL 367
+A+ F D + L P + +++ +C +++ ++G+
Sbjct: 376 IAVHFNDADVK----LQPLNTFVKADEDLLCFTMISSNQIGI 413
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 94/358 (26%), Positives = 143/358 (39%), Gaps = 51/358 (14%)
Query: 33 VGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA----PCVRC 86
VG + F V G P G Y + +G P R + + +DTGSD+ W+ C + P
Sbjct: 64 VGGVVNFPVDGASDPFLVGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSE 123
Query: 87 VEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLV 142
++ + P S LV C D C S + C C Y +Y DG + G +
Sbjct: 124 LQIQLSFFDPGVSSSASLVSCSDRRCYS-NFQTESGCSPNNLCSYSFKYGDGSGTSGYYI 182
Query: 143 KDAFAFNYTNGQRL----NPRLALGCGYNQVPGASYHP---LDGILGLGKGKSSIVSQLH 195
D +F+ L + GC N G P +DGI GLG+G S++SQL
Sbjct: 183 SDFMSFDTVITSTLAINSSAPFVFGCS-NLQSGDLQRPRRAVDGIFGLGQGSLSVISQLA 241
Query: 196 SQKLIRNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE 253
Q L V HCL G GGG + G V+T + +Y+ + + G+
Sbjct: 242 VQGLAPRVFSHCLKGDKSGGGIMVLGQ--IKRPDTVYTPLVPS-QPHYNVNLQSIAVNGQ 298
Query: 254 TTGLKNLPVVF----------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETL 303
+ P VF D+G++ YL Y +++ A
Sbjct: 299 ILPID--PSVFTIATGDGTIIDTGTTLAYLPDEAYSPFI---------QAVANAVSQYGR 347
Query: 304 PLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYL-IISNKGNV--CLG 358
P+ ++ + F+ F ++LSF G + L P AYL I S+ G+ C+G
Sbjct: 348 PITYESYQCFEITAGDVDVFPQVSLSFAGGASMV---LGPRAYLQIFSSSGSSIWCIG 402
>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 74/279 (26%), Positives = 119/279 (42%), Gaps = 30/279 (10%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
Y +++ +G PA+ +++DTGS TW+ C+ C C P + + V C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTTWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C + H + E+ C + + Y DG +S G+L +D F ++ Q++ P GC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
+ + +DG+LG+G G S++ Q + +CL FF Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
V T YTK + ELFF GE GL VVFDSGS +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
Y+ L+ +++ L + A E+E+ C+ R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268
>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 73/279 (26%), Positives = 120/279 (43%), Gaps = 30/279 (10%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
Y +++ +G PA+ +++DTGS +W+ C+ C C P + + V C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C + H + E+ C + + Y DG +S G+L +D F ++ Q++ P + GC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PGFSFGCNM 115
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
+ + +DG+LG+G G S++ Q + +CL FF Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDCFSYCLPLQKSERGFFSKTTGYFS 172
Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
V T YTK + ELFF GE GL VVFDSGS +
Sbjct: 173 LGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVFDSGSELS 232
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
Y+ L+ +++ L + A E+E+ C+ R
Sbjct: 233 YIPDRALSVLSQRIRELLLKRG---AAEEESERNCYDMR 268
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 54/156 (34%), Positives = 74/156 (47%), Gaps = 12/156 (7%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDP 106
Y + + IG P P+ DTGSDLTW QC PC C P+Y PS VPC
Sbjct: 66 YLMELAIGTPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPVYDPSASSTFSPVPCSSA 124
Query: 107 ICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAF-NYTNGQRLNP-RLALG 163
C L NC +P+ C Y Y+DG S+G+L + + GQ ++ +A G
Sbjct: 125 TC--LPTWRSRNCSNPSSPCRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGSVAFG 182
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
CG + G G +GLG+G S+++QL K
Sbjct: 183 CGTDN--GGDSLNSTGTVGLGRGTLSLLAQLGVGKF 216
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 93/352 (26%), Positives = 142/352 (40%), Gaps = 53/352 (15%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDP 106
Y VT+ +G R + +DTGSDL+W+QC PC RC P++ PS V C P
Sbjct: 135 YIVTVELG--GRKMTVIVDTGSDLSWVQCQ-PCKRCYNQQDPVFNPSTSPSYRTVLCSSP 191
Query: 107 ICASLH-APGHHNC--EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
C SL A G+ +P C+Y + Y DG + G L + + N +N G
Sbjct: 192 TCQSLQSATGNLGVCGSNPPSCNYVVNYGDGSYTRGELGTE--HLDLGNSTAVN-NFIFG 248
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFGD 220
CG N + G++GLG+ S++SQ + + V +CL G L G
Sbjct: 249 CGRNN--QGLFGGASGLVGLGRSSLSLISQ--TSAMFGGVFSYCLPITETEASGSLVMGG 304
Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFF---GGETTGLKNLP--------VVFDSGSS 269
+ S V + YT+ +F G T G + ++ DSG+
Sbjct: 305 N----SSVYKNTTPISYTRMIPNPQLPFYFLNLTGITVGSVAVQAPSFGKDGMMIDSGTV 360
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK--GRRPFKNVHDVKKCFRTLA 327
T L YQ L K+ S AP L C+ G + + + ++K F
Sbjct: 361 ITRLPPSIYQALKDEFVKQFSG--FPSAPAFMILDTCFNLSGYQEVE-IPNIKMHF---- 413
Query: 328 LSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDF 379
+G ++T Y + ++ VCL I L N +G IG++
Sbjct: 414 ----EGNAELNVDVTGVFYFVKTDASQVCLAI-----ASLSYENEVGIIGNY 456
>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 506
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 67/244 (27%), Positives = 101/244 (41%), Gaps = 25/244 (10%)
Query: 68 LDTGSDLTWLQCDAPCVR--CVEAPHPLYRPSNDLV----PCEDPICASL--HAPGHHNC 119
+DT SD+ W+QC APC + C LY P+ ++ PC P C SL +A G
Sbjct: 178 VDTASDVPWVQC-APCPQPQCYAQSDVLYDPTKSILSAPFPCSSPQCRSLGRYANGCTGA 236
Query: 120 EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQV-PGASYHPLD 178
+ C Y + Y DG + G V D N ++ + GC + + PG+ +
Sbjct: 237 GNTGTCQYRVLYPDGSGTSGTYVSDLLTLNADPKGAVS-KFQFGCSHALLRPGSFNNKTA 295
Query: 179 GILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMSSD 236
G + LG+G S+ SQ NV +CL +G GFL G + +SR T M
Sbjct: 296 GFMALGRGAQSLSSQTKGTFSKGNVFSYCLPPTGSHKGFLSLGVPQHAASRYAVTPM--- 352
Query: 237 YTKYYSPGVAELFFGGETTGLKNLPV---------VFDSGSSYTYLNRVTYQTLTSIMKK 287
+P + + G + LPV DS + T L Y L + +
Sbjct: 353 LKSKMAPMIYMVRLIGIDVAGQRLPVPPAVFAANAAMDSRTIITRLPPTAYMALRAAFRA 412
Query: 288 ELSA 291
++ A
Sbjct: 413 QMRA 416
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 62/124 (50%), Gaps = 13/124 (10%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y + + +G P R ++ +D+GSD+ W+QC PC +C P++ P++ VPC
Sbjct: 139 SGEYFIRIGVGSPPREQYVVIDSGSDIVWVQCQ-PCTQCYHQTDPVFDPADSASFMGVPC 197
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+C + G H C YE+ Y DG + G L + F G+ + +A+G
Sbjct: 198 SSSVCERIENAGCH----AGGCRYEVMYGDGSYTKGTLALETLTF----GRTVVRNVAIG 249
Query: 164 CGYN 167
CG+
Sbjct: 250 CGHR 253
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 88/346 (25%), Positives = 144/346 (41%), Gaps = 39/346 (11%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSND----LVPCE 104
Y +T+ +G P R DTGSDL W++C AP + PS V C+
Sbjct: 101 YLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRSSTYGRVSCQ 160
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR----- 159
C +L G C+D + C Y Y DG ++ GVL + F F+ G +PR
Sbjct: 161 TDACEAL---GRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFD-DGGSGRSPRQVRVG 216
Query: 160 -LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGF 215
+ GC A P DG++GLG G S+V+QL + +CL S
Sbjct: 217 GVKFGC---STATAGSFPADGLVGLGGGAVSLVTQLGGATSLGRRFSYCLVPHSVNASSA 273
Query: 216 LFFG--DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTG-LKNLPVVFDSGSSYTY 272
L FG D+ + ++ D YY+ + + G +T + ++ DSG++ T+
Sbjct: 274 LNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKTVASAASSRIIVDSGTTLTF 333
Query: 273 LNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK--GRRPFKNVHDVKKCFRTLALSF 330
L+ + + + ++ ++ D L LC+ GR + + L L F
Sbjct: 334 LDPSLLGPIVDELSRRITLPPVQS--PDGLLQLCYNVAGRE-----VEAGESIPDLTLEF 386
Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
G L PE + +G +CL I+ E Q ++++G +
Sbjct: 387 GGGAA---VALKPENAFVAVQEGTLCLAIVATTE--QQPVSILGNL 427
>gi|88174593|gb|ABD39371.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 73/279 (26%), Positives = 120/279 (43%), Gaps = 30/279 (10%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
Y +++ +G PA+ +++DTGS +W+ C+ C C P + + V C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C + H + E+ C + + Y DG +S G+L +D F ++ Q++ P + GC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PGFSFGCNM 115
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
+ + +DG+LG+G G S++ Q + +CL FF Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDCFSYCLPLQKSERGFFSKTTGYFS 172
Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
V T YTK + ELFF GE GL VVFDSGS +
Sbjct: 173 LGKVATRTDVRYTKMVARKKNTELFFVDLIAISVDGERLGLSPSVFSRKGVVFDSGSELS 232
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
Y+ L+ +++ L + A E+E+ C+ R
Sbjct: 233 YIPDRALSVLSQRIRELLLKRG---AAEEESERNCYDMR 268
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 77/289 (26%), Positives = 122/289 (42%), Gaps = 54/289 (18%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCE 104
G YN+ + +G P + + DTGSDL W QC APC +C + P P ++P++ +PC
Sbjct: 84 GGYNMNISVGTPLLTFSVVADTGSDLIWTQC-APCTKCFQQPAPPFQPASSSTFSKLPCT 142
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
C L P + C Y +Y G ++ G L + G P +A GC
Sbjct: 143 SSFCQFL--PNSIRTCNATGCVYNYKYGSGYTA-GYLATETLKV----GDASFPSVAFGC 195
Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFGD- 220
G S GI GLG+G S++ QL + +CL S G + FG
Sbjct: 196 STENGVGNS---TSGIAGLGRGALSLIPQLGVGRF-----SYCLRSGSAAGASPILFGSL 247
Query: 221 -DLYDSS--RVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV--------------- 262
+L D + + + + + YY + G T G +LPV
Sbjct: 248 ANLTDGNVQSTPFVNNPAVHPSYYYVNLT-----GITVGETDLPVTTSTFGFTQNGLGGG 302
Query: 263 -VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET--LPLCWK 308
+ DSG++ TYL + Y+ ++K+ +++ + T L LC+K
Sbjct: 303 TIVDSGTTLTYLAKDGYE----MVKQAFLSQTADVTTVNGTRGLDLCFK 347
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 66/132 (50%), Gaps = 12/132 (9%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP---- 96
V G +G Y + +G PA P + LDTGSD+ WLQC APC RC + ++ P
Sbjct: 132 VSGLAQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQC-APCRRCYDQSGQVFDPRRSR 190
Query: 97 SNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
S V C P+C L + G C+ C Y++ Y DG + G + F G R
Sbjct: 191 SYGAVGCSAPLCRRLDSGG---CDLRRKACLYQVAYGDGSVTAGDFATETLTF--AGGAR 245
Query: 156 LNPRLALGCGYN 167
+ R+ALGCG++
Sbjct: 246 VA-RIALGCGHD 256
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 74/264 (28%), Positives = 106/264 (40%), Gaps = 38/264 (14%)
Query: 68 LDTGSDLTWLQCDAPCVRCVEAPH------PLYRPSND----LVPCEDPICASLHAPGHH 117
+DT SD+ W+QC APC APH LY PS PC P C +L P +
Sbjct: 160 IDTASDVPWVQC-APC----PAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRNL-GPYAN 213
Query: 118 NCEDPA--QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQV-PGASY 174
C PA QC Y ++Y DG +S G + D N GC + + PG+
Sbjct: 214 GCT-PAGDQCQYRVQYPDGSASAGTYISDVLTLNPAKPASAISEFRFGCSHALLQPGSFS 272
Query: 175 HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG--GGFLFFGDDLYDSSRVVWTS 232
+ GI+ LG+G S+ +Q ++ +V +CL GF G +SR T
Sbjct: 273 NKTSGIMALGRGAQSLPTQ--TKATYGDVFSYCLPPTPVHSGFFILGVPRVAASRYAVTP 330
Query: 233 MSSDYTKYYSPGVAELFFGGETTGLKNLPV---------VFDSGSSYTYLNRVTYQTLTS 283
M +P + + K LPV V DS + T L Y L +
Sbjct: 331 M---LRSKAAPMLYLVRLIAIEVAGKRLPVPPAVFAAGAVMDSRTIVTRLPPTAYMALRA 387
Query: 284 IMKKELSAKSLKEAPEDETLPLCW 307
E+ ++ + A E L C+
Sbjct: 388 AFVAEM--RAYRAAAPKEHLDTCY 409
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 58/181 (32%), Positives = 86/181 (47%), Gaps = 20/181 (11%)
Query: 24 SSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC 83
+SS LFN + + VH Y + + IG P + +DTGSDL WLQC PC
Sbjct: 37 NSSQVLFNRITAQTPVSVHHYDYL-----MELSIGTPPVKTYAQVDTGSDLIWLQC-IPC 90
Query: 84 VRCVEAPHPLYRP------SNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGS 136
C + +P++ P SN E C+ L++ +C D C+Y Y D
Sbjct: 91 TNCYKQLNPMFDPQSSSTYSNIAYGSES--CSKLYS---TSCSPDQNNCNYTYSYEDDSI 145
Query: 137 SLGVLVKDAFAFNYTNGQRLNPR-LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLH 195
+ GVL ++ T G+ + + + GCG+N G GI+GLG+G S+VSQ+
Sbjct: 146 TEGVLAQETLTLTSTTGKPVALKGVIFGCGHNN-NGVFNDKEMGIIGLGRGPLSLVSQIG 204
Query: 196 S 196
S
Sbjct: 205 S 205
>gi|449434466|ref|XP_004135017.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 525
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 85/357 (23%), Positives = 147/357 (41%), Gaps = 67/357 (18%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVE---APHP------LYRP----SND 99
T+ +G P + + LDTGSDL W+ CD C RC +P+ +Y P ++
Sbjct: 114 TTVQLGTPGTKFMVALDTGSDLFWVPCD--CSRCAPTEGSPYASDFELSVYSPKKSSTSK 171
Query: 100 LVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADG-GSSLGVLVKDAFAFN--YTNGQR 155
VPC + +CA C + C Y + Y S+ G+L++D + + +
Sbjct: 172 TVPCNNNLCAQ-----RDQCTEAFGNCPYVVSYVSAETSTTGILIEDLLHLKTEHKHSEP 226
Query: 156 LNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
+ + GCG QV S+ + +G+ GLG + S+ S L + L+ N C S G
Sbjct: 227 IQAYITFGCG--QVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGLMANSFSMCFSDDG 284
Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKY--------YSPGVAELFFGGETTGLKNLPVVF 264
G + FGD S+ + T + Y+ V + G T ++ +F
Sbjct: 285 VGRINFGDK---------GSLEQEETPFNLNQLHPNYNITVTSIRV-GTTLIDADITALF 334
Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFR 324
DSG+S++Y Y L++ + P R PF+ +++
Sbjct: 335 DSGTSFSYFTDPIYSKLSASFHAQTRDGRHPPNP-----------RIPFEYCYNMSP--- 380
Query: 325 TLALSFTDGKTRTLFELTP----EAYLIISNKGNV--CLGILNGAEVGLQDLNVIGG 375
S T G + T+ P + ++IS + + CL ++ AE+ + N + G
Sbjct: 381 DANASLTPGISLTMKGGGPFPVYDPIIVISTQNELIYCLAVVKSAELNIIGQNFMTG 437
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 94/360 (26%), Positives = 144/360 (40%), Gaps = 48/360 (13%)
Query: 34 GSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPL 93
G SLL G T Y ++ +G PA ++LDTGSD +W+QC PC C E P+
Sbjct: 123 GVSLLAN-WGKSLSTTNYVASLRLGTPATELVVELDTGSDQSWVQCK-PCADCYEQRDPV 180
Query: 94 YRPSN----DLVPCEDPICASLHAPGHHNCEDPA---QCDYELEYADGGSSLGVLVKDAF 146
+ P+ VPC C L + C YE+ Y D ++G L +D
Sbjct: 181 FDPTASSTYSAVPCGARECQELASSSSSRNCSSDNNKNCPYEVSYDDDSHTVGDLARDTL 240
Query: 147 AFNYTNGQRLN---PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNV 203
+ + P GCG++ ++ +DG+LGLG GK+S+ SQ+ ++
Sbjct: 241 TLSPSPSPSPADTVPGFVFGCGHSN--AGTFGEVDGLLGLGLGKASLPSQVAAR--YGAA 296
Query: 204 VGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMSS--DYTKYYSPGVAELFFGGETTGLKN 259
+CL S G+L FG ++ +T M + D T YY L G +
Sbjct: 297 FSYCLPSSPSAAGYLSFGGAAARAN-AQFTEMVTGQDPTSYY------LNLTGIVVAGRA 349
Query: 260 LPV-----------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
+ V + DSG++++ L Y L S + + K AP C+
Sbjct: 350 IKVPASAFATAAGTIIDSGTAFSRLPPSAYAALRSSFRSAMGRYRYKRAPSSPIFDTCYD 409
Query: 309 GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK-GNVCLGILNGAEVGL 367
F V+ + L F DG T L P L N CL + ++G+
Sbjct: 410 ----FTGHETVR--IPAVELVFADGAT---VHLHPSGVLYTWNDVAQTCLAFVPNHDLGI 460
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 73/272 (26%), Positives = 115/272 (42%), Gaps = 38/272 (13%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP-------LYRPSND---- 99
Y +++ +G PA + +DTGSD++W+QC+ PC AP P L+ P+
Sbjct: 135 YVISVGLGSPAMTQRVVIDTGSDVSWVQCE-PC----PAPSPCHAHAGALFDPAASSTYA 189
Query: 100 LVPCEDPICASLHAPGHHN-CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
C CA L G N C+ ++C Y ++Y DG ++ G D + ++ R
Sbjct: 190 AFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTLSGSDVVR--- 246
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
GC + ++ DG++GLG S+VSQ ++ +CL GFL
Sbjct: 247 GFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAAR--YGKSFSYCLPATPASSGFL 304
Query: 217 FF----GDDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLPVVF------ 264
+SR T M S YY + ++ GG+ GL P VF
Sbjct: 305 TLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLS--PSVFAAGSLV 362
Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
DSG+ T L Y L+S + ++ + E
Sbjct: 363 DSGTVITRLPPAAYAALSSAFRAGMTRYARAE 394
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 62/124 (50%), Gaps = 13/124 (10%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y V + +G P R ++ +D+GSD+ W+QC PC +C P++ P++ V C
Sbjct: 198 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCTQCYHQSDPVFDPADSASFTGVSC 256
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+C L G H +C YE+ Y DG + G L + F G+ + +A+G
Sbjct: 257 SSSVCDRLENAGCH----AGRCRYEVSYGDGSYTKGTLALETLTF----GRTMVRSVAIG 308
Query: 164 CGYN 167
CG+
Sbjct: 309 CGHR 312
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 62/124 (50%), Gaps = 12/124 (9%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
+G Y + +G PA+ Y++ LDTGSD+ W+QC PC C + P++ P S + C
Sbjct: 156 SGEYFTRVGVGNPAKSYYMVLDTGSDINWIQCQ-PCSDCYQQSDPIFTPAASSSYSPLTC 214
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+ C SL N QC Y++ Y DG + G V + +F G +ALG
Sbjct: 215 DSQQCNSLQMSSCRN----GQCRYQVNYGDGSFTFGDFVTETMSF---GGSGTVNSIALG 267
Query: 164 CGYN 167
CG++
Sbjct: 268 CGHD 271
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 65/124 (52%), Gaps = 12/124 (9%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y + +G PAR +++ LDTGSD+ WLQC PC C + P++ P+ V C
Sbjct: 17 SGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQ-PCTDCYQQTDPIFDPTASSTYAPVTC 75
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+ C+SL +C QC Y++ Y DG + G ++ +F + + +ALG
Sbjct: 76 QSQQCSSLE---MSSCRS-GQCLYQVNYGDGSYTFGDFATESVSFGNSGSVK---NVALG 128
Query: 164 CGYN 167
CG++
Sbjct: 129 CGHD 132
>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 459
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 77/292 (26%), Positives = 118/292 (40%), Gaps = 62/292 (21%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPC-------ED 105
V IGQP P + +DTGS LTW+QC+ PC+ C + PLY PS+ D
Sbjct: 112 VNFSIGQPPVPQYAVMDTGSSLTWIQCE-PCINCHQQKGPLYNPSSSSTYVSCSDFDRTD 170
Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNY-TNGQRLNPRLALGC 164
+ H + C+Y YAD ++ G ++ F +G + + GC
Sbjct: 171 TTFTATHG---------SDCNYSQTYADKTTTRGTYAREQLLFETPDDGITIMHDVIFGC 221
Query: 165 GYN--QVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLF----F 218
G+N Q+PG + + G+ GLG SSI+S+L G GF +
Sbjct: 222 GHNNTQLPGPTGYA-SGVFGLGDSGSSIISKL-----------------GFGFSYCIGNI 263
Query: 219 GDDLYDSSRVVW---TSMSSDYTKYYSPGVAELFFGGETTGLKNL---PVVF-------- 264
GD LY R+ + T G+ + G + G + L P+VF
Sbjct: 264 GDPLYGFHRLTLGNKLKIEGYSTPLVPRGLYYITLVGISIGQERLDIDPIVFQRVDLNGI 323
Query: 265 ------DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
DSG++ +Y+ R Y + + LS + L LC+ G+
Sbjct: 324 SSRIVIDSGATLSYIPRQAYNVVRDKVSSILSGFLSRYRYIARHLSLCYIGK 375
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 77/289 (26%), Positives = 122/289 (42%), Gaps = 54/289 (18%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCE 104
G YN+ + +G P + + DTGSDL W QC APC +C + P P ++P++ +PC
Sbjct: 84 GGYNMNISVGTPLLTFPVVADTGSDLIWTQC-APCTKCFQQPAPPFQPASSSTFSKLPCT 142
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
C L P + C Y +Y G ++ G L + G P +A GC
Sbjct: 143 SSFCQFL--PNSIRTCNATGCVYNYKYGSGYTA-GYLATETLKV----GDASFPSVAFGC 195
Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFGD- 220
G S GI GLG+G S++ QL + +CL S G + FG
Sbjct: 196 STENGVGNS---TSGIAGLGRGALSLIPQLGVGRF-----SYCLRSGSAAGASPILFGSL 247
Query: 221 -DLYDSS--RVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV--------------- 262
+L D + + + + + YY + G T G +LPV
Sbjct: 248 ANLTDGNVQSTPFVNNPAVHPSYY-----YVNLTGITVGETDLPVTTSTFGFTQNGLGGG 302
Query: 263 -VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDET--LPLCWK 308
+ DSG++ TYL + Y+ ++K+ +++ + T L LC+K
Sbjct: 303 TIVDSGTTLTYLAKDGYE----MVKQAFLSQTANVTTVNGTRGLDLCFK 347
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 72/290 (24%), Positives = 110/290 (37%), Gaps = 27/290 (9%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--------SNDL 100
G Y M +G PA+ Y + +DTGS LTWLQC V C P++ P +
Sbjct: 119 GNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPRSSSSYASVSCS 178
Query: 101 VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
P D + + P C C Y+ Y D S+G L KD +F T+ P
Sbjct: 179 APQCDALTTATLNPS--TCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS----VPNF 232
Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGD 220
GCG + + G++GL + K S++ QL + +CL +
Sbjct: 233 YYGCGQDN--EGLFGQSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSGYLSI 288
Query: 221 DLYDSSRVVWTSMSSD-------YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYL 273
Y+ + +T M+ + K VA + +LP + DSG+ T L
Sbjct: 289 GSYNPGQYSYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSASAYSSLPTIIDSGTVITRL 348
Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
Y L+ + + K A L C++G+ V V F
Sbjct: 349 PTDVYSALSKAVAGAM--KGTPRASAFSILDTCFQGQASRLRVPQVSMAF 396
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 42/131 (32%), Positives = 68/131 (51%), Gaps = 12/131 (9%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND- 99
V G +G Y + +G PA+ ++ LDTGSD+ W+QC PC C + P++ P++
Sbjct: 154 VSGTSQGSGEYFSRIGVGTPAKEMYVVLDTGSDVNWIQC-LPCSECYQQSDPIFDPTSSS 212
Query: 100 ---LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
+ C DP CASL + +C Y++ Y DG ++G D F + ++
Sbjct: 213 TFKSLTCSDPKCASLDVSACRS----NKCLYQVSYGDGSFTVGNYATDTVTFGESG--KV 266
Query: 157 NPRLALGCGYN 167
N +ALGCG++
Sbjct: 267 N-DVALGCGHD 276
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 64/131 (48%), Gaps = 13/131 (9%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL 100
V G +G Y + IG+P P ++ LDTGSD++W+QC APC C E P + P++
Sbjct: 141 VSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQC-APCAECYEQTDPXFEPTSSA 199
Query: 101 ----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
+ CE C SL N C YE+ Y DG ++G V + T+
Sbjct: 200 SFTSLSCETEQCKSLDVSECRN----GTCLYEVSYGDGSYTVGDFVTETVTLGSTSLG-- 253
Query: 157 NPRLALGCGYN 167
+A+GCG+N
Sbjct: 254 --NIAIGCGHN 262
>gi|6562286|emb|CAB62656.1| putative protein [Arabidopsis thaliana]
Length = 518
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 93/356 (26%), Positives = 138/356 (38%), Gaps = 49/356 (13%)
Query: 50 YYNVTMYIGQPARPYFLDLDTGSDLTWLQCD--APCVRCVE-------APHPLYRP---- 96
Y NV++ G PA + + LDTGSDL WL C+ C+ ++ P LY P
Sbjct: 92 YANVSL--GTPATWFLVALDTGSDLFWLPCNCGTTCIHDLKDARFSESVPLNLYTPNAST 149
Query: 97 SNDLVPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
++ + C D C G C P C Y++ + + G L++D T +
Sbjct: 150 TSSSIRCSDKRCF-----GSGKCSSPESICPYQIALSSNTVTTGTLLQDVLHL-VTEDED 203
Query: 156 LNP---RLALGCGYNQVPGASYH-PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG- 210
L P + LGCG NQ ++G+LGL + S+ S L + N C
Sbjct: 204 LKPVNANVTLGCGQNQTGAFQTDIAVNGVLGLSMKEYSVPSLLAKANITANSFSMCFGRI 263
Query: 211 -GGGGFLFFGDDLY-DSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGS 268
G + FGD Y D S+ + + Y V + GG + L +FD+GS
Sbjct: 264 ISVVGRISFGDKGYTDQEETPLVSLET--STAYGVNVTGVSVGGVPVDVP-LFALFDTGS 320
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP--LCWKGRRPFKNV-----HDVKK 321
S+T L Y T + K P D P C+ R N H K
Sbjct: 321 SFTLLLESAYGVFTKAFDDLMED---KRRPVDPDFPFEFCYDLREEHLNSDARPRHMQSK 377
Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIISNKGN--VCLGILNGAEVGLQDLNVIGG 375
C+ F R + + + SN+G CLGIL + + N++ G
Sbjct: 378 CYNPCRDDF-----RWRIQNDSQESVSYSNEGTKMYCLGILKSINLNIIGQNLMSG 428
>gi|186510920|ref|NP_190702.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645260|gb|AEE78781.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 530
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 93/356 (26%), Positives = 138/356 (38%), Gaps = 49/356 (13%)
Query: 50 YYNVTMYIGQPARPYFLDLDTGSDLTWLQCD--APCVRCVE-------APHPLYRP---- 96
Y NV++ G PA + + LDTGSDL WL C+ C+ ++ P LY P
Sbjct: 104 YANVSL--GTPATWFLVALDTGSDLFWLPCNCGTTCIHDLKDARFSESVPLNLYTPNAST 161
Query: 97 SNDLVPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
++ + C D C G C P C Y++ + + G L++D T +
Sbjct: 162 TSSSIRCSDKRCF-----GSGKCSSPESICPYQIALSSNTVTTGTLLQDVLHL-VTEDED 215
Query: 156 LNP---RLALGCGYNQVPGASYH-PLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG- 210
L P + LGCG NQ ++G+LGL + S+ S L + N C
Sbjct: 216 LKPVNANVTLGCGQNQTGAFQTDIAVNGVLGLSMKEYSVPSLLAKANITANSFSMCFGRI 275
Query: 211 -GGGGFLFFGDDLY-DSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGS 268
G + FGD Y D S+ + + Y V + GG + L +FD+GS
Sbjct: 276 ISVVGRISFGDKGYTDQEETPLVSLET--STAYGVNVTGVSVGGVPVDVP-LFALFDTGS 332
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP--LCWKGRRPFKNV-----HDVKK 321
S+T L Y T + K P D P C+ R N H K
Sbjct: 333 SFTLLLESAYGVFTKAFDDLMED---KRRPVDPDFPFEFCYDLREEHLNSDARPRHMQSK 389
Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIISNKGN--VCLGILNGAEVGLQDLNVIGG 375
C+ F R + + + SN+G CLGIL + + N++ G
Sbjct: 390 CYNPCRDDF-----RWRIQNDSQESVSYSNEGTKMYCLGILKSINLNIIGQNLMSG 440
>gi|356559246|ref|XP_003547911.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 516
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 70/271 (25%), Positives = 114/271 (42%), Gaps = 33/271 (12%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEA-------------PHPLYRPS-NDLVP 102
+G P + + LDTGSDL WL CD C+ CV + L + S ++ V
Sbjct: 111 VGTPPLWFLVALDTGSDLFWLPCD--CISCVHGGLRTRTGKILKFNTYDLDKSSTSNEVS 168
Query: 103 CEDPICASLHAPGHHNCEDP-AQCDYELEY-ADGGSSLGVLVKDAFAFNYTNGQR--LNP 158
C + S C + C Y+++Y ++ SS G +V+D + Q +
Sbjct: 169 CNN----STFCRQRQQCPSAGSTCRYQVDYLSNDTSSRGFVVEDVLHLITDDDQTKDADT 224
Query: 159 RLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
R+A GCG Q + GA+ +G+ GLG S+ S L + LI N C G
Sbjct: 225 RIAFGCGQVQTGVFLNGAA---PNGLFGLGMDNISVPSILAREGLISNSFSMCFGSDSAG 281
Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLN 274
+ FGD R ++ + Y+ + ++ L+ +FDSG+S+TY+N
Sbjct: 282 RITFGDTGSPDQRKTPFNVRKLHPT-YNITITKIIVEDSVADLE-FHAIFDSGTSFTYIN 339
Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
Y + + ++ AK D +P
Sbjct: 340 DPAYTRIGEMYNSKVKAKRHSSQSPDSNIPF 370
>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
Length = 501
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 67/132 (50%), Gaps = 12/132 (9%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP---- 96
V G +G Y + +G P P + LDTGSD+ WLQC APC RC + ++ P
Sbjct: 137 VSGLAQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQC-APCRRCYDQSGQMFDPRASH 195
Query: 97 SNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
S V C P+C L + G C+ C Y++ Y DG + G + F +G R
Sbjct: 196 SYGAVDCAAPLCRRLDSGG---CDLRRKACLYQVAYGDGSVTAGDFATETLTF--ASGAR 250
Query: 156 LNPRLALGCGYN 167
+ PR+ALGCG++
Sbjct: 251 V-PRVALGCGHD 261
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 65/124 (52%), Gaps = 12/124 (9%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y + +G PAR +++ LDTGSD+ WLQC PC C + P++ P+ V C
Sbjct: 158 SGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQ-PCTDCYQQTDPIFDPTASSTYAPVTC 216
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+ C+SL +C QC Y++ Y DG + G ++ +F + + +ALG
Sbjct: 217 QSQQCSSLEM---SSCRS-GQCLYQVNYGDGSYTFGDFATESVSFGNSGSVK---NVALG 269
Query: 164 CGYN 167
CG++
Sbjct: 270 CGHD 273
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 99/380 (26%), Positives = 149/380 (39%), Gaps = 56/380 (14%)
Query: 34 GSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPL 93
G L+ V +G Y + +G PA L LDT SDLTWLQC PC RC P+
Sbjct: 124 GRGLVAPVVSRAPTSGDYIAKIAVGTPAVEALLALDTASDLTWLQCQ-PCRRCYPQSGPV 182
Query: 94 YRPSNDLVPCE----DPICASLHAPGHHNCEDPAQCDYELEYADG------GSSLGVLVK 143
+ P + E P C +L G + + C Y + Y DG +S+G LV+
Sbjct: 183 FDPRHSTSYGEMNYDAPDCQALGRSGGGDAKR-GTCIYTVLYGDGDGHGSTSTSVGDLVE 241
Query: 144 DAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLH-------- 195
+ F G L++GCG++ G P GILGL +G+ SI Q+
Sbjct: 242 ETLTF---AGGVRQAYLSIGCGHDN-KGLFGAPAAGILGLSRGQISIPHQIAFLGYNASF 297
Query: 196 SQKLIRNVVGHCLSGGGGGFLFFGDDLYDSS---RVVWTSMSSDYTKYYSPGVAELFFGG 252
S L+ + G G L FG D+S T ++ + +Y + + GG
Sbjct: 298 SYCLVDFISG---PGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGG 354
Query: 253 -ETTGLKNLP-----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSA-KSLKEAPE 299
G+ V+ DSG++ T L R Y + + +
Sbjct: 355 VRVPGVTERDLQLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGP 414
Query: 300 DETLPLCWKGRRPFKNVHDVKKCFRTLALS--FTDGKTRTLFELTPEAYLI-ISNKGNVC 356
C+ ++ C + A+S F G L P+ YLI + ++G VC
Sbjct: 415 SGLFDTCYT----VGGRAGLRHCVKVPAVSMHFAGG---VELSLQPKNYLITVDSRGTVC 467
Query: 357 LGILNGAEVGLQDLNVIGGI 376
A G + ++VIG I
Sbjct: 468 FAF---AGTGDRSVSVIGNI 484
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/124 (37%), Positives = 63/124 (50%), Gaps = 10/124 (8%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
+G Y M IG P R Y+L+LDTGSD+TW+QC APC C P+Y PSN V C
Sbjct: 42 SGEYFARMGIGSPQRSYYLELDTGSDVTWIQC-APCSSCYSQVDPIYDPSNSSSYRRVYC 100
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+C +L + C+ C Y + Y D +S G L ++F N +A G
Sbjct: 101 GSALCQALD---YSACQG-MGCSYRVVYGDSSASSGDLGIESFYLG-PNSSTAMRNIAFG 155
Query: 164 CGYN 167
CG++
Sbjct: 156 CGHS 159
>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 73/279 (26%), Positives = 120/279 (43%), Gaps = 30/279 (10%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
Y +++ +G PA+ +++DTGS +W+ C+ C C P + + V C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C + H + E+ C + + Y DG +S G+L +D F ++ Q++ P GC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
+ + +DG+LG+G G S++ Q + + +CL FF Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQSSPRF---DGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
V T YTK + ELFF GE GL VVFDSGS +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
Y+ L+ +++ L + A E+E+ C+ R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268
>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 73/279 (26%), Positives = 120/279 (43%), Gaps = 30/279 (10%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
Y +++ +G PA+ +++DTGS +W+ C+ C C P + + V C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C + H + E+ C + + Y DG +S G+L +D F ++ Q++ P GC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
+ + +DG+LG+G G S++ Q + + +CL FF Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQSSPRF---DGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
V T YTK + ELFF GE GL VVFDSGS +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
Y+ L+ +++ L + A E+E+ C+ R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268
>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 90/355 (25%), Positives = 142/355 (40%), Gaps = 50/355 (14%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS---------NDLVPC 103
V++ IG P +P L LDTGS L+W+QC ++ P P + + L+PC
Sbjct: 68 VSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKIKKRLPPLPKPKTTSFDPSLSSSFSLLPC 127
Query: 104 EDPICASLHAPGH---HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
PIC P +C+ C Y YADG + G LV++ F F+ + P +
Sbjct: 128 NHPICKP-RIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLS---TPPV 183
Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGD 220
LGC GILG+ +G+ S +SQ K V S G LF+
Sbjct: 184 ILGCAQASTEN------RGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTG--LFYLG 235
Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK------NLP------------- 261
D +SS+ + +M + SP + L + +K N+P
Sbjct: 236 DNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNVPPAAFKPDAGGSGQ 295
Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
+ DSGS TYL Y+ + + + + A K + +C+ +V +
Sbjct: 296 TMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDA----GVTAEVGR 351
Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
++ F +G +F E L KG C+GI +G+ N+IG +
Sbjct: 352 RIGGISFEFDNGV--EIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGS-NIIGTV 403
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 64/124 (51%), Gaps = 10/124 (8%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
+G Y M IG P R Y+L+LDTGSD+TW+QC APC C P+Y PSN V C
Sbjct: 9 SGEYFARMGIGNPQRSYYLELDTGSDVTWIQC-APCSSCYSQVDPIYDPSNSSSYRRVYC 67
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+C +L + C+ C Y + Y D +S G L ++F + + +A G
Sbjct: 68 GSALCQALD---YSACQG-MGCSYRVVYGDSSASSGDLGIESFYLGPNSSTAMR-NIAFG 122
Query: 164 CGYN 167
CG++
Sbjct: 123 CGHS 126
>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 73/279 (26%), Positives = 120/279 (43%), Gaps = 30/279 (10%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
Y +++ +G PA+ +++DTGS +W+ C+ C C P + + V C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C + H + E+ C + + Y DG +S G+L +D F ++ Q++ P GC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
+ + +DG+LG+G G S++ Q + + +CL FF Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQSSPRF---DGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
V T YTK + ELFF GE GL VVFDSGS +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
Y+ L+ +++ L + A E+E+ C+ R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 98/376 (26%), Positives = 166/376 (44%), Gaps = 50/376 (13%)
Query: 17 MSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGY-YNVTMYIGQPARPYFLDLDTGSDLT 75
M++ ++SSS SS+ V ++P G Y + + +G P + + DTGSDL
Sbjct: 26 MAARANSSSWSSMAGTT------DVESPLHPDGGGYVMDISVGTPGKRFRAIADTGSDLV 79
Query: 76 WLQCDAPCVRCVEAP--HPLYRPSNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYA 132
W+Q + PC C P + + C +C L PG +CE + C Y EY
Sbjct: 80 WVQSE-PCTGCSGGTIFDPRQSSTFREMDCSSQLCTEL--PG--SCEPGSSACSYSYEYG 134
Query: 133 DGGSSLGVLVKDAFAFNYTN-GQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIV 191
G + G +D + T+ G + P A+GCG + + + +DG++GLG+G S+
Sbjct: 135 S-GETEGEFARDTISLGTTSGGSQKFPSFAVGCG---MVNSGFDGVDGLVGLGQGPVSLT 190
Query: 192 SQLHSQKLIRNVVGHCL----SGGGGGFLFFGDDL------YDSSRVVWTSMSSDYTKYY 241
SQL + I + +CL S L FG S+++ T S Y YY
Sbjct: 191 SQLSAA--IDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKI--TPPSDTYPTYY 246
Query: 242 SPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDE 301
V + G+T G ++ DSG++ TY+ Y + S M+ ++ + +
Sbjct: 247 LLTVNGIAVAGQTMGSPGTTII-DSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGS--SM 303
Query: 302 TLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGN-VCLGIL 360
L LC+ R +N F L + T+ + +L++ + G+ VCL +
Sbjct: 304 GLDLCYD-RSSNRNYK-----FPALTIRLAGA---TMTPPSSNYFLVVDDSGDTVCLAM- 353
Query: 361 NGAEVGLQDLNVIGGI 376
G+ GL +++IG +
Sbjct: 354 -GSAGGLP-VSIIGNV 367
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 88/376 (23%), Positives = 146/376 (38%), Gaps = 87/376 (23%)
Query: 44 NVYPTGY--YNVTMYIGQPARPYFLDLDTGSDLTWLQCDA--PCVRC---------VEAP 90
+++P Y Y+V++ G P + DTGS L W C A C RC +
Sbjct: 123 SLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKF 182
Query: 91 HPLYRPSNDLVPCEDPICASLHAPGH----HNCEDPA-QCD-----YELEYADGGSSLGV 140
P S +V C +P CA + P NC + +C Y L+Y G ++ G+
Sbjct: 183 VPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGATA-GI 241
Query: 141 LVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLI 200
L+ + + P +GC V H GI G G+G S+ SQ+ ++
Sbjct: 242 LLSETLDLE----NKRVPDFLVGCSVMSV-----HQPAGIAGFGRGPESLPSQMRLKRF- 291
Query: 201 RNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTK--------------------- 239
HCL G F D S V+ + SD +K
Sbjct: 292 ----SHCLVSRG-----FDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFR 342
Query: 240 -YYSPGVAELFFGGETTGLK----------NLPVVFDSGSSYTYLNRVTYQTLTSIMKKE 288
YY + + GG+ N + DSGS++T+L++ ++ + ++K+
Sbjct: 343 EYYYLSLRRILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQ 402
Query: 289 LSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC--FRTLALSFTDGKTRTLFELTPEAY 346
L + E ++ G RP N+ ++ F + L F G L E Y
Sbjct: 403 LVKYPRAKDVEAQS------GLRPCFNIPKEEESAEFPDVVLKFKGGGK---LSLAAENY 453
Query: 347 L-IISNKGNVCLGILN 361
L +++++G VCL ++
Sbjct: 454 LAMVTDEGVVCLTMMT 469
>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
Length = 538
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 80/312 (25%), Positives = 123/312 (39%), Gaps = 58/312 (18%)
Query: 25 SSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC----- 79
S++S+F S L N+ G Y V++ G PA PY L LDT +DLTW+ C
Sbjct: 106 SATSMFELPMRSAL-----NIAHVGMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRR 160
Query: 80 ---------------DAPCVRCVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCE 120
D + + YRP+ + C CA L ++ C+
Sbjct: 161 KGKHYGRTMSVGAGDDGAAAKEARRKN-WYRPAKSSSWRRIRCSQKECALLP---YNTCQ 216
Query: 121 DPAQ---CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALGCGYNQVPGASYHP 176
P++ C Y + DG ++G+ K+ ++G+ P L LGC + G S
Sbjct: 217 SPSKAESCSYYQQMQDGTLTMGIYGKEKATVTVSDGRMAKLPGLILGCSVLEA-GGSVDA 275
Query: 177 LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFFGDD---LYDSSRV 228
DG+L LG G+ S +H+ K CL S +L FG + + +
Sbjct: 276 HDGVLSLGNGEMSFA--VHAAKRFGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTME 333
Query: 229 VWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----------VVFDSGSSYTYLNRVTY 278
+ D Y P V +F GGE + V+ D+ +S T L Y
Sbjct: 334 TDIVYNVDVKPAYGPLVTGIFVGGERLDIPQEIWDAEKVVGGGVILDTSTSVTSLVPEAY 393
Query: 279 QTLTSIMKKELS 290
+TS + + LS
Sbjct: 394 AAVTSALDRHLS 405
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 79/298 (26%), Positives = 126/298 (42%), Gaps = 38/298 (12%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
G Y + + IG P + DTGSDL W+QC PC C + P++ P V CE
Sbjct: 92 GEYFMRISIGTPPIEVLVIADTGSDLIWVQCQ-PCQECYKQKSPIFNPKQSSTYRRVLCE 150
Query: 105 DPICASLHAPGHHNCEDPA---QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
C +L++ C C Y Y D ++G L + F TN LA
Sbjct: 151 TRYCNALNS-DMRACSAHGFFKACGYSYSYGDHSFTMGYLATERFIIGSTNNSI--QELA 207
Query: 162 LGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL------SGGGGGF 215
GCG N G GI+GLG G S++SQL ++ I N +CL S G
Sbjct: 208 FGCG-NSNGGNFDEVGSGIVGLGGGSLSLISQLGTK--IDNKFSYCLVPILEKSNFSLGK 264
Query: 216 LFFGDDLYDSSRVVWTS---MSSDYTKYYSPGVAELFFGGETTGLKNLP---------VV 263
+ FGD+ + S + S +S + +Y + + G E +N ++
Sbjct: 265 IVFGDNSFISGSDTYVSTPLVSKEPETFYYLTLEAISVGNERLAYENSRNDGNVEKGNII 324
Query: 264 FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR----RPFKNVH 317
DSG++ T+L+ Y L +++K + + + + + +C++ + P VH
Sbjct: 325 IDSGTTLTFLDSKLYNKLELVLEKAVEGERVSDP--NGIFSICFRDKIGIELPIITVH 380
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 76/281 (27%), Positives = 120/281 (42%), Gaps = 39/281 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SNDLVP--C 103
+G Y + +G PAR ++ DTGSD++WLQC +PC +C P++ P S+ P C
Sbjct: 78 SGDYFARIGVGTPARSVYMVADTGSDVSWLQC-SPCRKCYRQQDPIFNPSLSSSFKPLAC 136
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
IC L G C +C Y++ Y DG ++G + +F G+ +A+G
Sbjct: 137 ASSICGKLKIKG---CSRKNECMYQVSYGDGSFTVGDFSTETLSF----GEHAVRSVAMG 189
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFG 219
CG N +H G+LGLG+G S SQ + +V +CL S +F
Sbjct: 190 CGRNN--QGLFHGAAGLLGLGRGPLSFPSQTGTS--YASVFSYCLPRRESAIAASLVFGP 245
Query: 220 DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-------------VVFDS 266
+ + +R + YY G+A + G N+P V+ DS
Sbjct: 246 SAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPV---NIPPDAFAMGSRGTGGVIVDS 302
Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
G T ++R+T T++ S + AP C+
Sbjct: 303 G---TAISRLTTPAYTALRDAFRSLVTFPSAPGISLFDTCY 340
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 67/129 (51%), Gaps = 13/129 (10%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SN 98
G +G Y + +GQP++P+++ LDTGSD+ WLQC PC C + P++ P S
Sbjct: 149 GTAQGSGEYFSRVGVGQPSKPFYMVLDTGSDVNWLQCK-PCSDCYQQSDPIFDPTASSSY 207
Query: 99 DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
+ + C+ C L N +C Y++ Y DG ++G V + +F G
Sbjct: 208 NPLTCDAQQCQDLEMSACRN----GKCLYQVSYGDGSFTVGEYVTETVSF----GAGSVN 259
Query: 159 RLALGCGYN 167
R+A+GCG++
Sbjct: 260 RVAIGCGHD 268
>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 321
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 73/279 (26%), Positives = 120/279 (43%), Gaps = 30/279 (10%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
Y +++ +G P++ L++DTGS +W+ C+ C C P + + V C +
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C + H + E+ C + + Y DG +S G+L +D F ++ Q++ P + GC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFSFGCNM 115
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
+ + +DG+LG+G G S++ Q + +CL FF Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSYCLPLQMSERGFFSKTTGYFS 172
Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
V T YTK + ELFF GE GL VVFDSGS +
Sbjct: 173 LGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGSELS 232
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
Y+ L+ +++ L + A E+E+ C+ R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 64/208 (30%), Positives = 93/208 (44%), Gaps = 26/208 (12%)
Query: 28 SLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCV 87
+L +H ++ LF GN + V + G P + + L LDTGS +TW QC PCVRC+
Sbjct: 145 NLKDHTPNNKLFDEDGN------FLVDVAFGTPPQKFTLILDTGSSITWTQCK-PCVRCL 197
Query: 88 EAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFA 147
+A + PS ASL Y + Y D +S+G D
Sbjct: 198 KASRRHFDPS-----------ASLTYSLGSCIPSTVGNTYNMTYGDKSTSVGNYGCDTMT 246
Query: 148 FNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
+++ + P+ GCG N G DG+LGLG+G+ S VSQ S+ + V +C
Sbjct: 247 LEHSD---VFPKFQFGCGRNN-EGDFGSGADGMLGLGQGQLSTVSQTASK--FKKVFSYC 300
Query: 208 L--SGGGGGFLFFGDDLYDSSRVVWTSM 233
L G LF SS + +TS+
Sbjct: 301 LPEEDSIGSLLFGEKATSQSSSLKFTSL 328
>gi|357168101|ref|XP_003581483.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 510
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 81/336 (24%), Positives = 131/336 (38%), Gaps = 34/336 (10%)
Query: 57 IGQPARPYFLDLDTGSDLTWL--QCDA--PCVRCVEAPHPLYRPS----NDLVPCEDPIC 108
+G P + + LDTGSDL WL QCD P Y PS + VPC C
Sbjct: 108 VGTPGHTFMVALDTGSDLFWLPCQCDGCPPPASGASGSASFYIPSMSSTSQAVPCNSDFC 167
Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTNG--QRLNPRLALGCG 165
+C + C Y++ Y SS G LV+D + + Q L ++ GCG
Sbjct: 168 DH-----RKDCSTTSSCPYKMVYVSADTSSSGFLVEDVLYLSTEDNHPQILKAQIMFGCG 222
Query: 166 YNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL 222
QV S+ +G+ GLG S+ S L + L + C G G + FGD
Sbjct: 223 --QVQTGSFLDAAAPNGLFGLGIDMISVPSILAHKGLTSDSFSMCFGRDGIGRISFGDQG 280
Query: 223 YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLT 282
++ + Y+ + + G E L+ +FD+G+++TYL Y +T
Sbjct: 281 SSDQEETPLDINQKHPT-YAITITGITVGTEPMDLE-FSTIFDTGTTFTYLADPAYTYIT 338
Query: 283 SIMKKELSAKSLKEAPEDETLPL--CWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFE 340
++ A D +P C+ + FRT+ G + +
Sbjct: 339 QSFHTQVRA---NRHAADTRIPFEYCYDLSSSEARIQTPGVSFRTVG-----GSLFPVID 390
Query: 341 LTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
L + I ++ CL I+ ++ + N + G+
Sbjct: 391 LG-QVISIQQHEYVYCLAIVKSTKLNIIGQNFMTGV 425
>gi|226492334|ref|NP_001147965.1| pepsin A precursor [Zea mays]
gi|195614874|gb|ACG29267.1| pepsin A [Zea mays]
Length = 538
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 80/312 (25%), Positives = 123/312 (39%), Gaps = 58/312 (18%)
Query: 25 SSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC----- 79
S++S+F S L N+ G Y V++ G PA PY L LDT +DLTW+ C
Sbjct: 106 SATSMFELPMRSAL-----NIAHVGMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRR 160
Query: 80 ---------------DAPCVRCVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCE 120
D + + YRP+ + C CA L ++ C+
Sbjct: 161 KGKHYGRTMSVGAGDDGAAAKEARRKN-WYRPAKSSSWRRIRCSQKECALLP---YNTCQ 216
Query: 121 DPAQ---CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALGCGYNQVPGASYHP 176
P++ C Y + DG ++G+ K+ ++G+ P L LGC + G S
Sbjct: 217 SPSKAESCSYYQQMQDGTLTMGIYGKEKATVTVSDGRMAKLPGLILGCSVLEA-GGSVDA 275
Query: 177 LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFFGDD---LYDSSRV 228
DG+L LG G+ S +H+ K CL S +L FG + + +
Sbjct: 276 HDGVLSLGNGEMSFA--VHAAKRFGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTME 333
Query: 229 VWTSMSSDYTKYYSPGVAELFFGGETTGLKNL----------PVVFDSGSSYTYLNRVTY 278
+ D Y P V +F GGE + V+ D+ +S T L Y
Sbjct: 334 TDIVYNVDVKPAYGPLVTGIFVGGERLDIPQEIWDAEKVVGGGVILDTSTSVTSLVPEAY 393
Query: 279 QTLTSIMKKELS 290
+TS + + LS
Sbjct: 394 AAVTSALDRHLS 405
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 99/371 (26%), Positives = 150/371 (40%), Gaps = 71/371 (19%)
Query: 36 SLLFQVHGNVYPTGY--YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP-----CVRCV- 87
+L +V YP Y Y+V +G P + L LDTGS L W C P C C
Sbjct: 57 TLTGKVTLPAYPRSYGGYSVIFSLGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTF 116
Query: 88 ----EAPHPLY-RPSNDLV---PCEDPICASLHAPGHHNCEDPAQCDYE-LEYADGGSSL 138
P+Y R + V PC P C + NC +C Y LEY GS+
Sbjct: 117 SGVDPTKIPIYARNKSSTVQSLPCRSPKCNWVFG-SDLNCSTTKRCPYYGLEYGL-GSTT 174
Query: 139 GVLVKDAFAFNYTNGQRLNPRLALGCGY--NQVPGASYHPLDGILGLGKGKSSIVSQLHS 196
G LV D + N R+ P GC N+ P +GI G G+G +SI +QL
Sbjct: 175 GQLVSDVLGLSKLN--RI-PDFLFGCSLVSNRQP-------EGIAGFGRGLASIPAQLGL 224
Query: 197 QKLIRNVVGHCLSG---GGGGFLFFGDDLYDSSR--VVWTSMS-----SDYTKYYSPGVA 246
K +V H G L G D++ V + + S Y++YY ++
Sbjct: 225 TKFSYCLVSHRFDDTPQSGDLVLHRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLS 284
Query: 247 ELFFGGETTGLKNLPV---------------VFDSGSSYTYLNRVTYQTLTSIMKKELSA 291
++ GG K++P+ + DSGS++T++ R+ + + ++K ++
Sbjct: 285 KILVGG-----KDVPIPPRYLVPSKEGDGGMIVDSGSTFTFMERIIFDPVARELEKHMTK 339
Query: 292 -KSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIIS 350
K KE + L C+ ++ DV K L SF G +L Y +
Sbjct: 340 YKRAKEIEDSSGLGPCYNITG--QSEVDVPK----LTFSFKGGAN---MDLPLTDYFSLV 390
Query: 351 NKGNVCLGILN 361
G VC+ +L
Sbjct: 391 TDGVVCMTVLT 401
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 97/346 (28%), Positives = 148/346 (42%), Gaps = 43/346 (12%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSNDL----VPCE 104
Y VT+ IG PA + +DTGSDL+W+QC PC C PLY P+ VPC+
Sbjct: 127 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCK-PCNSSSCYPQKDPLYDPTASSTYAPVPCD 185
Query: 105 DPICASLHAPGH-HNCEDPAQ---CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
C L + H C + + C Y +EY + +++GV + + Q
Sbjct: 186 SKACKDLVPDAYDHGCTNSSGTSLCQYGIEYGNRDTTVGVYSTETLTL---SPQVSVKDF 242
Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG--GFLFF 218
GCG Q ++ DG+LGLG S+VSQ + + +CL G GFL
Sbjct: 243 GFGCGLVQ--QGTFDLFDGLLGLGGAPESLVSQ--TAETYGGAFSYCLPPGNSTTGFLAL 298
Query: 219 G--DDLYDSSRVVWTSMSS--DYTKYYSPGVAELFFGGETTGLKNLP----VVFDSGSSY 270
G + D++ ++T + S + +Y + + GG+ + ++ DSG+
Sbjct: 299 GAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIPPTVLSGGMIIDSGTII 358
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSF 330
T L Y L + + +SA L D+ L C+ F + +V T+AL+F
Sbjct: 359 TGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCYN----FTGIANVT--VPTVALTF 412
Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
G T L P LI CL GA G D+ +IG +
Sbjct: 413 DGGATIDLD--VPSGVLI-----QDCLAFAGGASDG--DVGIIGNV 449
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 53/151 (35%), Positives = 71/151 (47%), Gaps = 12/151 (7%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDP 106
Y + + IG P P+ DTGSDLTW QC PC C P+Y PS VPC
Sbjct: 77 YLMELAIGTPPVPFVALADTGSDLTWTQCQ-PCKLCFPQDTPVYDPSASSTFSPVPCSSA 135
Query: 107 ICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAF-NYTNGQRLN-PRLALG 163
C L NC P+ C Y Y+DG S G+L + + GQ ++ +A G
Sbjct: 136 TC--LPVLRSRNCSTPSSLCRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVSVSDVAFG 193
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQL 194
CG + G G +GLG+G S+++QL
Sbjct: 194 CGTDN--GGDSLNSTGTVGLGRGTLSLLAQL 222
>gi|357168204|ref|XP_003581534.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Brachypodium distachyon]
Length = 436
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 61/195 (31%), Positives = 87/195 (44%), Gaps = 31/195 (15%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP--------LYRPSN-- 98
G Y +T+ +G P+R Y+L TGSD+ W+ PC C + P P LY P N
Sbjct: 74 GLYCITVKLGNPSRHYYLAFHTGSDVMWV----PCSSCTDCPTPDDIGFSLDLYDPKNSS 129
Query: 99 --DLVPCEDPICASLHAPGHHNCEDP----AQCDYELEYADGG-SSLGVLVKDAFAFNYT 151
+ C D CA GH C QC Y YADG ++ G V D F+
Sbjct: 130 TSSEISCSDDRCADALKTGHAICHTSHSSGDQCGYNQIYADGVLATTGYYVSDDIHFDIF 189
Query: 152 NGQR----LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
G + + GC ++ + + DG++G GK S++SQL+SQ + + C
Sbjct: 190 MGNESFASSSASVIFGCSKSR---SGHLQADGVIGFGKDAPSLISQLNSQG-VSHAFSRC 245
Query: 208 L--SGGGGGFLFFGD 220
L S GGG L +
Sbjct: 246 LDDSDDGGGVLILDE 260
>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 72/279 (25%), Positives = 120/279 (43%), Gaps = 30/279 (10%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
Y +++ +G P++ +++DTGS +W+ C+ C C P + + V C +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C + H + E+ C + + Y DG +S G+L +D F ++ Q++ P + GC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PGFSFGCNM 115
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
+ + +DG+LG+G G S++ Q + +CL FF Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGAMSVLKQ---SSPTFDCFSYCLPLQKSERGFFSKTTGYFS 172
Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
V T YTK + ELFF GE GL VVFDSGS +
Sbjct: 173 LGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGSELS 232
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
Y+ L+ +++ L + A E+E+ C+ R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 64/223 (28%), Positives = 90/223 (40%), Gaps = 27/223 (12%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G Y V + IG P + +DT SDL W QC PC C P++ P + +PC
Sbjct: 87 GEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PCTGCYHQVDPMFNPRVSSTYAALPCS 145
Query: 105 DPICASL--HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
C L H GH +D C Y Y+ ++ G L D G+ +A
Sbjct: 146 SDTCDELDVHRCGH---DDDESCQYTYTYSGNATTEGTLAVDKLVI----GEDAFRGVAF 198
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG---GFLFFG 219
GC + GA G++GLG+G S+VSQL ++ +CL G L G
Sbjct: 199 GCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRF-----AYCLPPPASRIPGKLVLG 253
Query: 220 ---DDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGETTGL 257
D +++ + M D Y YY + L G T L
Sbjct: 254 ADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRTMSL 296
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 76/281 (27%), Positives = 120/281 (42%), Gaps = 39/281 (13%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SNDLVP--C 103
+G Y + +G PAR ++ DTGSD++WLQC +PC +C P++ P S+ P C
Sbjct: 11 SGDYFARIGVGTPARSVYMVADTGSDVSWLQC-SPCRKCYRQQDPIFNPSLSSSFKPLAC 69
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
IC L G C +C Y++ Y DG ++G + +F G+ +A+G
Sbjct: 70 ASSICGKLKIKG---CSRKNKCMYQVSYGDGSFTVGDFSTETLSF----GEHAVRSVAMG 122
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFG 219
CG N +H G+LGLG+G S SQ + +V +CL S +F
Sbjct: 123 CGRNN--QGLFHGAAGLLGLGRGPLSFPSQTGTS--YASVFSYCLPRRESAIAASLVFGP 178
Query: 220 DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-------------VVFDS 266
+ + +R + YY G+A + G N+P V+ DS
Sbjct: 179 SAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPV---NIPPDAFAMGSRGTGGVIVDS 235
Query: 267 GSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
G T ++R+T T++ S + AP C+
Sbjct: 236 G---TAISRLTTPAYTALRDAFRSLVTFPSAPGISLFDTCY 273
>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 72/279 (25%), Positives = 120/279 (43%), Gaps = 30/279 (10%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
Y +++ +G P++ +++DTGS +W+ C+ C C P + + V C +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C + H + E+ C + + Y DG +S G+L +D F ++ Q++ P + GC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PGFSFGCNM 115
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
+ + +DG+LG+G G S++ Q + +CL FF Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGAMSVLKQ---SSPTFDCFSYCLPLQKSERGFFSKTTGYFS 172
Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
V T YTK + ELFF GE GL VVFDSGS +
Sbjct: 173 LGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGSELS 232
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
Y+ L+ +++ L + A E+E+ C+ R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268
>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
gi|223975971|gb|ACN32173.1| unknown [Zea mays]
gi|224034191|gb|ACN36171.1| unknown [Zea mays]
gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
Length = 465
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 74/292 (25%), Positives = 115/292 (39%), Gaps = 30/292 (10%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLV------- 101
G Y M +G PA+ Y + +DTGS LTWLQC V C P++ P
Sbjct: 125 GNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCS 184
Query: 102 --PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
C D A+L+ +C C Y+ Y D S+G L KD +F T+ P
Sbjct: 185 AQQCSDLTTATLNP---ASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS----VPN 237
Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFF 218
GCG + + G++GL + K S++ QL + +CL + +
Sbjct: 238 FYYGCGQDN--EGLFGQSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSSGYL 293
Query: 219 GDDLYDSSRVVWTSMSSD-------YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYT 271
Y+ + +T M+S + K VA ++ +LP + DSG+ T
Sbjct: 294 SIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVIT 353
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
L Y L+ + + K A L C++G+ V +V F
Sbjct: 354 RLPTGVYSALSKAVAGAM--KGTPRASAFSILDTCFQGQAARLRVPEVTMAF 403
>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
gi|238015146|gb|ACR38608.1| unknown [Zea mays]
gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
Length = 467
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 74/292 (25%), Positives = 115/292 (39%), Gaps = 30/292 (10%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLV------- 101
G Y M +G PA+ Y + +DTGS LTWLQC V C P++ P
Sbjct: 127 GNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCS 186
Query: 102 --PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
C D A+L+ +C C Y+ Y D S+G L KD +F T+ P
Sbjct: 187 AQQCSDLTTATLNP---ASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS----VPN 239
Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFF 218
GCG + + G++GL + K S++ QL + +CL + +
Sbjct: 240 FYYGCGQDN--EGLFGQSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSSGYL 295
Query: 219 GDDLYDSSRVVWTSMSSD-------YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYT 271
Y+ + +T M+S + K VA ++ +LP + DSG+ T
Sbjct: 296 SIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVIT 355
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
L Y L+ + + K A L C++G+ V +V F
Sbjct: 356 RLPTGVYSALSKAVAGAM--KGTPRASAFSILDTCFQGQAARLRVPEVTMAF 405
>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 531
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 79/294 (26%), Positives = 120/294 (40%), Gaps = 35/294 (11%)
Query: 29 LFNHVGSSLLFQVHGNVYPTGYYNVTMY-IGQPARPYFLDLDTGSDLTWLQCDAPCVRCV 87
LF GS LF GN + G+ + T IG P + + LD GSDL W+ CD C++C
Sbjct: 84 LFPSEGSDALFL--GNEF--GWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCD--CMQCA 137
Query: 88 EAPHPLY----RPSNDLVP----------CEDPICASLHAPGHHNCEDPAQCDYELEY-A 132
Y R N+ P C D +C + +DP C Y Y +
Sbjct: 138 PLSASYYDRLGRDLNEYSPSLSSTSKPLSCNDQLCE--LGSDCKSSKDP--CPYLASYYS 193
Query: 133 DGGSSLGVLVKDAFAF----NYTNGQRLNPRLALGCGYNQVPGASYHPL-DGILGLGKGK 187
+ SS G+L++D + + + + +GCG Q S DG++GLG G
Sbjct: 194 ENTSSSGLLIEDRLHLAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGD 253
Query: 188 SSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDD-LYDSSRVVWTSMSSDYTKYYSPGVA 246
S+ S L L+RN C G + FGD L + + + Y V
Sbjct: 254 LSVPSLLAKAGLVRNTFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVTYLIE-VE 312
Query: 247 ELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSA--KSLKEAP 298
G + + DSG+S+T+L Y+ + K+++A S K +P
Sbjct: 313 GYLVGSSSLKTAGFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSP 366
>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
Length = 534
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 92/363 (25%), Positives = 134/363 (36%), Gaps = 78/363 (21%)
Query: 1 MKSSHNGENLCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQP 60
M S NG + S S++S+F S L N+ G Y V++ IG P
Sbjct: 80 MGSDRNGSSRRRRAKESSKLPEVMSATSMFELPMRSAL-----NIAHVGMYLVSVRIGTP 134
Query: 61 ARPYFLDLDTGSDLTWLQCDAPCVR-------------------CVEAPHPLYRPSND-- 99
A PY L LDT +DLTW+ C + EA YRP+
Sbjct: 135 ALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSTGQTMSMGGEGAKEASKNWYRPAKSSS 194
Query: 100 --LVPCEDPICASLHAPGHHNCEDPAQ---CDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
+ C CA L ++ C+ P++ C Y + DG ++G+ K+ ++G+
Sbjct: 195 WRRIRCSQKECAVLP---YNTCQSPSKAESCSYFQKTQDGTVTIGIYGKEKATVTVSDGR 251
Query: 155 RLN-PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----- 208
P L LGC + G S DG+L LG G S +H+ K CL
Sbjct: 252 MAKLPGLILGCSVLEA-GGSVDAHDGVLSLGNGDMSFA--VHAAKRFGQRFSFCLLSANS 308
Query: 209 SGGGGGFLFFG-------------DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETT 255
S +L FG D LY+ D Y V + GGE
Sbjct: 309 SRDASSYLTFGPNPAVMGPGTMETDILYN----------VDVKPAYGAQVTGVLVGGERL 358
Query: 256 GLKNLP----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
+ + V+ D+ +S T L Y +T+ + + LS L E E
Sbjct: 359 DIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLS--HLPRVYELEGFEY 416
Query: 306 CWK 308
C+K
Sbjct: 417 CYK 419
>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 73/279 (26%), Positives = 119/279 (42%), Gaps = 30/279 (10%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
Y +++ +G PA+ +++DTGS +W+ C+ C C P + + V C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C + H + E+ C + + Y DG +S G+L +D F ++ Q++ P GC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
+ + +DG+LG+G G S++ Q + +CL FF Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
V T YTK + ELFF GE GL VVFDSGS +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
Y+ L+ +++ L + A E+E+ C+ R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 75/289 (25%), Positives = 116/289 (40%), Gaps = 40/289 (13%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC-VRCVEAPHPLYRPSN----DLVPC 103
G Y++ + +G P + +DTGSDLTW QC APC C P PLY P+ +PC
Sbjct: 94 GAYHMILSVGTPPLAFPAIIDTGSDLTWTQC-APCTTACFAQPTPLYDPARSSTFSKLPC 152
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR---- 159
P+C +L P + C Y+ YA G ++ G L D A +G
Sbjct: 153 ASPLCQAL--PSAFRACNATGCVYDYRYAVGFTA-GYLAADTLAIGDGDGDGDASSSFAG 209
Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGF 215
+A GC + G GI+GLG+ S++SQ+ + +CL G
Sbjct: 210 VAFGC--STANGGDMDGASGIVGLGRSALSLLSQIGVGRF-----SYCLRSDADAGASPI 262
Query: 216 LF------FGDDLYDSSRVVWTSMSSDYTKYY-------SPGVAELFFGGETTGLKNL-- 260
LF GD + ++ + + YY + G +L T G
Sbjct: 263 LFGALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGA 322
Query: 261 -PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
V+ DSG+++TYL Y L + + + + LC++
Sbjct: 323 GGVIVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDLCFE 371
>gi|255586860|ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223525947|gb|EEF28344.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 518
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 85/350 (24%), Positives = 139/350 (39%), Gaps = 53/350 (15%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL------------ 100
T+ +G P + + LDTGSDL W+ CD C +C Y +L
Sbjct: 103 TTVELGTPGMKFMVALDTGSDLFWVPCD--CSKCAPTQGVAYASDFELSIYDPKQSSTSK 160
Query: 101 -VPCEDPICASLHAPGHHN-CEDP-AQCDYELEYADGGSSL-GVLVKDAFAFNY--TNGQ 154
V C + +CA H N C + C Y + Y +S G+LV+D +N +
Sbjct: 161 KVTCNNNLCA------HRNRCLGTFSSCPYMVSYVSAQTSTSGILVEDVLHLTSEDSNQE 214
Query: 155 RLNPRLALGCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG 211
+ + GCG QV S+ +G+ GLG + S+ S L + L + C
Sbjct: 215 SIKAYVTFGCG--QVQSGSFLNTAAPNGLFGLGMDQISVPSILSREGLTADSFSMCFGHD 272
Query: 212 GGGFLFFGDD-LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSY 270
G G + FGD D + S S + Y+ V ++ G + + +FDSG+S+
Sbjct: 273 GVGRISFGDKGSPDQEETPFNSNPSHPS--YNISVTQVRVGTTLVDV-DFTALFDSGTSF 329
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV-----KKCFRT 325
TYL Y ++ + K P R PF+ +D+ +
Sbjct: 330 TYLINPIYAMVSENFHAQAQDKRRPPDP-----------RIPFEYCYDMSPGANSSLIPS 378
Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGG 375
++L+ T+F+ P + N+ CL I+ E+ + N + G
Sbjct: 379 MSLTMKGRGHFTVFD--PIIVITTQNELVYCLAIVKSTELNIIGQNFMTG 426
>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 73/279 (26%), Positives = 119/279 (42%), Gaps = 30/279 (10%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
Y ++ +G PA+ +++DTGS ++W+ C+ C C P + + V C +
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSISWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C + H + E+ C + + Y DG +S G+L +D F ++ Q++ P GC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
+ + +DG+LG+G G S++ Q + +CL FF Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
V T YTK + ELFF GE GL VVFDSGS +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
Y+ L+ +++ L + A E+E+ C+ R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268
>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 73/279 (26%), Positives = 119/279 (42%), Gaps = 30/279 (10%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
Y +++ +G PA+ +++DTGS +W+ C+ C C P + + V C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSASWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C + H + E+ C + + Y DG +S G+L +D F ++ Q++ P GC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
+ + +DG+LG+G G S++ Q + +CL FF Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
V T YTK + ELFF GE GL VVFDSGS +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
Y+ L+ +++ L + A E+E+ C+ R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268
>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 79/294 (26%), Positives = 120/294 (40%), Gaps = 35/294 (11%)
Query: 29 LFNHVGSSLLFQVHGNVYPTGYYNVTMY-IGQPARPYFLDLDTGSDLTWLQCDAPCVRCV 87
LF GS LF GN + G+ + T IG P + + LD GSDL W+ CD C++C
Sbjct: 74 LFPSEGSDALFL--GNEF--GWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCD--CMQCA 127
Query: 88 EAPHPLY----RPSNDLVP----------CEDPICASLHAPGHHNCEDPAQCDYELEY-A 132
Y R N+ P C D +C + +DP C Y Y +
Sbjct: 128 PLSASYYDRLGRDLNEYSPSLSSTSKPLSCNDQLCE--LGSDCKSSKDP--CPYLASYYS 183
Query: 133 DGGSSLGVLVKDAFAF----NYTNGQRLNPRLALGCGYNQVPGASYHPL-DGILGLGKGK 187
+ SS G+L++D + + + + +GCG Q S DG++GLG G
Sbjct: 184 ENTSSSGLLIEDRLHLAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGD 243
Query: 188 SSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDD-LYDSSRVVWTSMSSDYTKYYSPGVA 246
S+ S L L+RN C G + FGD L + + + Y V
Sbjct: 244 LSVPSLLAKAGLVRNTFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVTYLIE-VE 302
Query: 247 ELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSA--KSLKEAP 298
G + + DSG+S+T+L Y+ + K+++A S K +P
Sbjct: 303 GYLVGSSSLKTAGFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSP 356
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 80/308 (25%), Positives = 123/308 (39%), Gaps = 42/308 (13%)
Query: 14 TVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSD 73
T ++ S S+++++ L S G + +G Y + +G P + +DTGSD
Sbjct: 61 TAQLESLHSATAAADLLRSPVMS------GVPFDSGEYFAVIGVGDPPTHALVVIDTGSD 114
Query: 74 LTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPIC-ASLHAPGHHNCE-DPAQCDY 127
L WLQC PC RC PLY P N +PC P C L PG C+ C Y
Sbjct: 115 LIWLQC-LPCRRCYRQVTPLYDPRNSKTHRRIPCASPQCRGVLRYPG---CDARTGGCVY 170
Query: 128 ELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGK 187
+ Y DG +S G L D + R++ + LGCG++ G+LG G+G+
Sbjct: 171 MVVYGDGSASSGDLATDTLVL--PDDTRVH-NVTLGCGHDNE--GLLASAAGLLGAGRGQ 225
Query: 188 SSIVSQLHSQKLIRNVVGHCL------SGGGGGFLFFGD--DLYDSSRVVWTSMSSDYTK 239
S +QL +V +CL + +L FG +L ++ + +
Sbjct: 226 LSFPTQL--APAYGHVFSYCLGDRMSRARNSSSYLVFGRTPELPSTAFTPLRTNPRRPSL 283
Query: 240 YYSPGVAELFFGGETTGLKNLP-----------VVFDSGSSYTYLNRVTYQTLTSIMKKE 288
YY V G G N VV DSG++ + R Y +
Sbjct: 284 YYVDMVGFSVGGERVAGFSNASLALNPATGRGGVVVDSGTAISRFTRDAYAAVRDAFVSH 343
Query: 289 LSAKSLKE 296
+A ++
Sbjct: 344 AAAAGMRR 351
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 98/357 (27%), Positives = 147/357 (41%), Gaps = 71/357 (19%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
+G Y + +++G P + + L LDTGSDL W+QC PC C E P Y P S + C
Sbjct: 178 SGEYFIDVFVGTPPKHFSLILDTGSDLNWIQC-VPCYECFEQNGPHYDPGQSSSYRNIGC 236
Query: 104 EDPICASLHAPGHHNCEDPAQ--------CDYELEYADGGSSLGVLVKDAFAFNYTNGQ- 154
D C + +P DP Q C Y Y D ++ G + F N T
Sbjct: 237 HDSRCHLVSSP------DPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSG 290
Query: 155 ----RLNPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL- 208
R + GCG +N+ +H G+LGLG+G S SQL Q L + +CL
Sbjct: 291 KPELRRVENVMFGCGHWNR---GLFHGAAGLLGLGRGPLSFSSQL--QSLYGHSFSYCLV 345
Query: 209 ----SGGGGGFLFFGD--DLYDSSRVVWTSM----SSDYTKYYSPGVAELFFGGETTGLK 258
L FG+ DL + +T++ + +Y + + GGE
Sbjct: 346 DRNSDANVSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVV--- 402
Query: 259 NLP-------------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
N+P + DSG++ +Y YQ ++K+ AK +K P + P+
Sbjct: 403 NIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQ----VIKEAFMAK-VKGYPVVKDFPV 457
Query: 306 CWKGRRPFKNVHDVKKC-FRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGIL 360
P NV V++ + F+DG ++ E Y I I + VCL IL
Sbjct: 458 L----EPCYNVTGVEQPDLPDFGIVFSDG---AVWNFPVENYFIEIEPREVVCLAIL 507
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 56/160 (35%), Positives = 73/160 (45%), Gaps = 16/160 (10%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-----VEAPHPLYRPSNDLVP 102
T Y V + +G P RP L LDTGSDL W QC APC+ C + P ++ V
Sbjct: 91 TNEYLVHLSVGTPPRPVALTLDTGSDLVWTQC-APCLNCFDQGAIPVLDPAASSTHAAVR 149
Query: 103 CEDPICASL--HAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAF----NYTNGQR 155
C+ P+C +L + G C Y Y D ++G L D F F N G
Sbjct: 150 CDAPVCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGDNADGGGV 209
Query: 156 LNPRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQL 194
RL GCG +N+ G GI G G+G+ S+ SQL
Sbjct: 210 SERRLTFGCGHFNK--GIFQANETGIAGFGRGRWSLPSQL 247
>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 465
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 74/292 (25%), Positives = 115/292 (39%), Gaps = 30/292 (10%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLV------- 101
G Y M +G PA+ Y + +DTGS LTWLQC V C P++ P
Sbjct: 125 GNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCS 184
Query: 102 --PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
C D A+L+ +C C Y+ Y D S+G L KD +F T+ P
Sbjct: 185 AQQCSDLTTATLNP---ASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS----VPN 237
Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFF 218
GCG + + G++GL + K S++ QL + +CL + +
Sbjct: 238 FYYGCGQDN--EGLFGQSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSSGYL 293
Query: 219 GDDLYDSSRVVWTSMSSD-------YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYT 271
Y+ + +T M+S + K VA ++ +LP + DSG+ T
Sbjct: 294 SIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVIT 353
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
L Y L+ + + K A L C++G+ V +V F
Sbjct: 354 RLPTGVYSALSKAVAGAM--KGTPRASAFSILDTCFQGQAARLRVPEVTMAF 403
>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
Length = 417
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 64/223 (28%), Positives = 90/223 (40%), Gaps = 27/223 (12%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G Y V + IG P + +DT SDL W QC PC C P++ P + +PC
Sbjct: 87 GEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PCTGCYHQVDPMFNPRVSSTYAALPCS 145
Query: 105 DPICASL--HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
C L H GH +D C Y Y+ ++ G L D G+ +A
Sbjct: 146 SDTCDELDVHRCGH---DDDESCQYTYTYSGNATTEGTLAVDKLVI----GEDAFRGVAF 198
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG---GFLFFG 219
GC + GA G++GLG+G S+VSQL ++ +CL G L G
Sbjct: 199 GCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRF-----AYCLPPPASRIPGKLVLG 253
Query: 220 ---DDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGETTGL 257
D +++ + M D Y YY + L G T L
Sbjct: 254 ADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRTMSL 296
>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
Length = 537
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 92/367 (25%), Positives = 134/367 (36%), Gaps = 82/367 (22%)
Query: 1 MKSSHNGENLCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQP 60
M S NG + S S++S+F S L N+ G Y V++ IG P
Sbjct: 79 MGSDRNGSSRRRRAKESSKLPEVMSATSMFELPMRSAL-----NIAHVGMYLVSVRIGTP 133
Query: 61 ARPYFLDLDTGSDLTWLQC-----------------------DAPCVRCVEAPHPLYRPS 97
A PY L LDT +DLTW+ C + EA YRP+
Sbjct: 134 ALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSMGQTMSVGGEGATAAKKEASKNWYRPA 193
Query: 98 ND----LVPCEDPICASLHAPGHHNCEDPAQ---CDYELEYADGGSSLGVLVKDAFAFNY 150
+ C CA L ++ C+ P++ C Y + DG ++G+ K+
Sbjct: 194 KSSSWRRIRCSQKECAVLP---YNTCQSPSKAESCSYFQKTQDGTVTIGIYGKEKATVTV 250
Query: 151 TNGQRLN-PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL- 208
++G+ P L LGC + G S DG+L LG G S +H+ K CL
Sbjct: 251 SDGRMAKLPGLILGCSVLEA-GGSVDAHDGVLSLGNGDMSFA--VHAAKRFGQRFSFCLL 307
Query: 209 ----SGGGGGFLFFG-------------DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFG 251
S +L FG D LY+ D Y V + G
Sbjct: 308 SANSSRDASSYLTFGPNPAVMGPGTMETDILYN----------VDVKPAYGAKVTGVLVG 357
Query: 252 GETTGLKNLP----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDE 301
GE + + V+ D+ +S T L Y +T+ + + LS L E E
Sbjct: 358 GERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLS--HLPRVYELE 415
Query: 302 TLPLCWK 308
C+K
Sbjct: 416 GFEYCYK 422
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 41/130 (31%), Positives = 65/130 (50%), Gaps = 13/130 (10%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL 100
V G +G Y V + +G P R ++ +D+GSD+ W+QC PC C + P++ P+
Sbjct: 127 VSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQ-PCSECYQQSDPVFDPAGSA 185
Query: 101 ----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
+ C+ +C L G C D +C YE+ Y DG + G L + F G+ L
Sbjct: 186 TYAGISCDSSVCDRLDNAG---CND-GRCRYEVSYGDGSYTRGTLALETLTF----GRVL 237
Query: 157 NPRLALGCGY 166
+A+GCG+
Sbjct: 238 IRNIAIGCGH 247
>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
gi|194706308|gb|ACF87238.1| unknown [Zea mays]
Length = 467
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 74/292 (25%), Positives = 114/292 (39%), Gaps = 30/292 (10%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLV------- 101
G Y M +G PA+ Y + +DTGS LTWLQC V C P++ P
Sbjct: 127 GNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCS 186
Query: 102 --PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR 159
C D A+L +C C Y+ Y D S+G L KD +F T+ P
Sbjct: 187 AQQCSDLTTATLSP---ASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS----VPN 239
Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-SGGGGGFLFF 218
GCG + + G++GL + K S++ QL + +CL + +
Sbjct: 240 FYYGCGQDN--EGLFGQSAGLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSSSSSGYL 295
Query: 219 GDDLYDSSRVVWTSMSSD-------YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYT 271
Y+ + +T M+S + K VA ++ +LP + DSG+ T
Sbjct: 296 SIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVIT 355
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCF 323
L Y L+ + + K A L C++G+ V +V F
Sbjct: 356 RLPTGVYSALSKAVAGAM--KGTPRASAFSILDTCFQGQAARLRVPEVTMAF 405
>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 406
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 66/136 (48%), Gaps = 11/136 (8%)
Query: 93 LYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF 148
LY P +++ VPC D C ++ C+ C Y + Y DG ++ G V D+ F
Sbjct: 48 LYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTF 107
Query: 149 NYTNG----QRLNPRLALGCGYNQ---VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
+ +G + N + GCG Q + S LDGI+G G+ SS++SQL + ++
Sbjct: 108 DEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVK 167
Query: 202 NVVGHCLSGGGGGFLF 217
+ HCL GG +F
Sbjct: 168 RIFSHCLDSHHGGGIF 183
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 71/268 (26%), Positives = 110/268 (41%), Gaps = 32/268 (11%)
Query: 39 FQVHGNVYPT--GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP------ 90
F V G P+ G Y + +G P R +++ +DTGSD+ W+ C + C C +
Sbjct: 63 FPVKGTFDPSQVGLYYTKVKLGTPPREFYVQIDTGSDVLWVSCGS-CNGCPQTSGLQIQL 121
Query: 91 ---HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKD-- 144
P ++ L+ C D C S +C QC Y +Y DG + G V D
Sbjct: 122 NYFDPRSSSTSSLISCSDRRCRSGVQTSDASCSSQNNQCTYTFQYGDGSGTSGYYVSDLM 181
Query: 145 --AFAFNYTNGQRLNPRLALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKLI 200
A F T + + GC Q S +DGI G G+ S++SQL Q +
Sbjct: 182 HFAGIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIA 241
Query: 201 RNVVGHCLSG--GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL- 257
V HCL G GGG L G+ + +V++ + +Y+ + + G+ +
Sbjct: 242 PRVFSHCLKGDNSGGGVLVLGEIV--EPNIVYSPLVQS-QPHYNLNLQSISVNGQIVPIA 298
Query: 258 -------KNLPVVFDSGSSYTYLNRVTY 278
N + DSG++ YL Y
Sbjct: 299 PAVFATSNNRGTIVDSGTTLAYLAEEAY 326
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 90/323 (27%), Positives = 129/323 (39%), Gaps = 35/323 (10%)
Query: 68 LDTGSDLTWLQCDAPCVRCVEAPHPLYRPS----NDLVPCEDPICASLHAPGHHN--CE- 120
LDTGS L+WLQC V C PLY PS + C C+ L A ++ CE
Sbjct: 3 LDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLCET 62
Query: 121 DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGI 180
D C Y Y D S+G L +D T+ Q L P+ GCG Q + GI
Sbjct: 63 DSNACLYTASYGDTSFSIGYLSQDLLTL--TSSQTL-PQFTYGCG--QDNQGLFGRAAGI 117
Query: 181 LGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY 240
+GL + K S+++QL ++ + +CL G G S + T
Sbjct: 118 IGLARDKLSMLAQLSTK--YGHAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDS 175
Query: 241 YSPGVAELFFGGETT---------GLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSA 291
+P + L T + +P + DSG+ T L Y L K +S
Sbjct: 176 KNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVITRLPMSMYAALRQAFVKIMST 235
Query: 292 KSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISN 351
K K AP L C+KG K++ V + + + F G T L + LI ++
Sbjct: 236 KYAK-APAYSILDTCFKGS--LKSISAVPE----IKMIFQGGADLT---LRAPSILIEAD 285
Query: 352 KGNVCLGILNGAEVGLQDLNVIG 374
KG CL + G + +IG
Sbjct: 286 KGITCLAFAGSS--GTNQIAIIG 306
>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 91/355 (25%), Positives = 141/355 (39%), Gaps = 50/355 (14%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPS---------NDLVPC 103
V++ IG P +P L LDTGS L+W+QC V+ P P + + L+PC
Sbjct: 68 VSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLSSSFSLLPC 127
Query: 104 EDPICASLHAPGH---HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
PIC P +C+ C Y YADG + G LV++ F F+ + P +
Sbjct: 128 NHPICKP-RIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLS---TPPV 183
Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGD 220
LGC GILG+ G+ S +SQ K V S G LF+
Sbjct: 184 ILGCAQASTEN------RGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTG--LFYLG 235
Query: 221 DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK------NLP------------- 261
D +SS+ + +M + SP + L + +K N+P
Sbjct: 236 DNPNSSKFKYVTMLTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQ 295
Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
+ DSGS TYL Y+ + + + + A K + +C+ +V +
Sbjct: 296 TMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDA----GVTAEVGR 351
Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
++ F +G +F E L KG C+GI +G+ N+IG +
Sbjct: 352 RIGGISFEFDNG--VEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGS-NIIGTV 403
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 47/155 (30%), Positives = 73/155 (47%), Gaps = 10/155 (6%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCE 104
G Y ++ +G P + +DTGSD+ WLQC PC +C ++ PS ++P
Sbjct: 84 GEYLISYSVGIPPFQLYGIIDTGSDMIWLQCK-PCEKCYNQTTRIFDPSKSNTYKILPFS 142
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
C S+ + ++ C+Y + Y DG S G L + TNG + R +G
Sbjct: 143 STTCQSVEDTSCSS-DNRKMCEYTIYYGDGSYSQGDLSVETLTLGSTNGSSVKFRRTVIG 201
Query: 164 CGYNQVPGASYH-PLDGILGLGKGKSSIVSQLHSQ 197
CG N S+ GI+GLG G S+++QL +
Sbjct: 202 CGRNNT--VSFEGKSSGIVGLGNGPVSLINQLRRR 234
>gi|6579210|gb|AAF18253.1|AC011438_15 T23G18.7 [Arabidopsis thaliana]
Length = 566
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 62/200 (31%), Positives = 84/200 (42%), Gaps = 28/200 (14%)
Query: 33 VGSSLLFQVHGNVYP--TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA----PCVRC 86
VG + F V G P G Y + +G P R + + +DTGSD+ W+ C + P
Sbjct: 112 VGGVVNFPVDGASDPFLVGLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSE 171
Query: 87 VEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLV 142
++ + P S LV C D C S + C C Y +Y DG + G +
Sbjct: 172 LQIQLSFFDPGVSSSASLVSCSDRRCYS-NFQTESGCSPNNLCSYSFKYGDGSGTSGYYI 230
Query: 143 KDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRN 202
D N +G PR A+ DGI GLG+G S++SQL Q L
Sbjct: 231 SDFMCSNLQSGDLQRPRRAV---------------DGIFGLGQGSLSVISQLAVQGLAPR 275
Query: 203 VVGHCLSG--GGGGFLFFGD 220
V HCL G GGG + G
Sbjct: 276 VFSHCLKGDKSGGGIMVLGQ 295
>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 430
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 88/358 (24%), Positives = 143/358 (39%), Gaps = 62/358 (17%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPIC 108
+++ IG P + + LDTGS L+W+QC + P + P S +PC P+C
Sbjct: 74 ISLPIGTPPQAQQMVLDTGSQLSWIQCHR--KKLPPKPKTSFDPSLSSSFSTLPCSHPLC 131
Query: 109 ASLHAPGH---HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
P +C+ C Y YADG + G LVK+ F+ T + P L LGC
Sbjct: 132 KP-RIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTE---ITPPLILGCA 187
Query: 166 YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG-------GFLFF 218
GILG+ +G+ S VSQ K +C+ G +
Sbjct: 188 TESSDDR------GILGMNRGRLSFVSQAKISKF-----SYCIPPKSNRPGFTPTGSFYL 236
Query: 219 GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFG----GETTGLKNLPV------------ 262
GD+ +S + S+ + P + L + G GLK L +
Sbjct: 237 GDNP-NSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGS 295
Query: 263 ---VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
+ DSGS +T+L Y + + + + + K T +C+ G NV +
Sbjct: 296 GQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDG-----NVAMI 350
Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNV-CLGILNGAEVGLQDLNVIGGI 376
+ L FT G + L P+ ++++ G + C+GI + +G N+IG +
Sbjct: 351 PRLIGDLVFVFTRG----VEILVPKERVLVNVGGGIHCVGIGRSSMLGAAS-NIIGNV 403
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 93/346 (26%), Positives = 134/346 (38%), Gaps = 61/346 (17%)
Query: 68 LDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPIC-ASLHA----PGH-- 116
+DTGSDLTW+QC PC C PL+ PS VPC C ASL A PG
Sbjct: 181 VDTGSDLTWVQCK-PCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCA 239
Query: 117 -----HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPG 171
+C Y L Y DG S GVL D A G ++ GCG +
Sbjct: 240 TVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVAL---GGASVDG-FVFGCGLSNR-- 293
Query: 172 ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDDL---YD 224
+ G++GLG+ + S+VSQ + V +CL SG G L G D +
Sbjct: 294 GLFGGTAGLMGLGRTELSLVSQTAPR--FGGVFSYCLPAATSGDAAGSLSLGGDTSSYRN 351
Query: 225 SSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-----------VVFDSGSSYTYL 273
++ V +T M +D P +F T V+ DSG+ T L
Sbjct: 352 ATPVSYTRMIAD------PAQPPFYFMNVTGASVGGAAVAAAGLGAANVLLDSGTVITRL 405
Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
Y+ + + ++ A+ AP L C+ +VK TL L +G
Sbjct: 406 APSVYRAVRAEFARQFGAERYPAAPPFSLLDACYN----LTGHDEVKVPLLTLRL---EG 458
Query: 334 KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDF 379
+ ++ + VCL + A + +D I IG++
Sbjct: 459 GADMTVDAAGMLFMARKDGSQVCLAM---ASLSFEDQTPI--IGNY 499
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 56/164 (34%), Positives = 76/164 (46%), Gaps = 11/164 (6%)
Query: 44 NVYPT-GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC-VRCVEAPHPLYRPSND-- 99
+ PT G Y +T+ IG P Y DTGSDL W QC APC +C + P PLY PS+
Sbjct: 78 QISPTAGEYLMTLAIGTPPVSYQAIADTGSDLIWTQC-APCSSQCFQQPTPLYNPSSSTT 136
Query: 100 --LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN--GQR 155
++PC + A C Y + Y G +S+ + F F + Q
Sbjct: 137 FAVLPCNSSLSMCAAALAGTTPPPGCTCMYNMTYGSGWTSV-YQGSETFTFGSSTPANQT 195
Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
P +A GC N G + G++GLG+G S+VSQL K
Sbjct: 196 GVPGIAFGCS-NASGGFNTSSASGLVGLGRGSLSLVSQLGVPKF 238
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 93/346 (26%), Positives = 134/346 (38%), Gaps = 61/346 (17%)
Query: 68 LDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPIC-ASLHA----PGH-- 116
+DTGSDLTW+QC PC C PL+ PS VPC C ASL A PG
Sbjct: 180 VDTGSDLTWVQCK-PCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCA 238
Query: 117 -----HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPG 171
+C Y L Y DG S GVL D A G ++ GCG +
Sbjct: 239 TVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVAL---GGASVDG-FVFGCGLSNR-- 292
Query: 172 ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDDL---YD 224
+ G++GLG+ + S+VSQ + V +CL SG G L G D +
Sbjct: 293 GLFGGTAGLMGLGRTELSLVSQTAPR--FGGVFSYCLPAATSGDAAGSLSLGGDTSSYRN 350
Query: 225 SSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-----------VVFDSGSSYTYL 273
++ V +T M +D P +F T V+ DSG+ T L
Sbjct: 351 ATPVSYTRMIAD------PAQPPFYFMNVTGASVGGAAVAAAGLGAANVLLDSGTVITRL 404
Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
Y+ + + ++ A+ AP L C+ +VK TL L +G
Sbjct: 405 APSVYRAVRAEFARQFGAERYPAAPPFSLLDACYN----LTGHDEVKVPLLTLRL---EG 457
Query: 334 KTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIGDF 379
+ ++ + VCL + A + +D I IG++
Sbjct: 458 GADMTVDAAGMLFMARKDGSQVCLAM---ASLSFEDQTPI--IGNY 498
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 74/287 (25%), Positives = 116/287 (40%), Gaps = 37/287 (12%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSNDL- 100
G++ + Y V + +G P R L DTGSDLTW QC+ PC C + ++ PS
Sbjct: 128 GSLIGSANYFVVVGLGTPKRDLSLVFDTGSDLTWTQCE-PCAGSCYKQQDAIFDPSKSSS 186
Query: 101 ---VPCEDPICASLHAPG-HHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
+ C +C L + G C C Y ++Y D +S+G L ++ T+
Sbjct: 187 YINITCTSSLCTQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGFLSQERLTITATD--- 243
Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGG 213
+ GCG Q + G++GLG+ S V Q S + + +CL +
Sbjct: 244 IVDDFLFGCG--QDNEGLFSGSAGLIGLGRHPISFVQQTSS--IYNKIFSYCLPSTSSSL 299
Query: 214 GFLFFGDDLYDSSRVVWTSMS--SDYTKYYSPGVAELFFGGETTGLKNLPVV-------- 263
G L FG ++ + +T +S S +Y + + GG LP V
Sbjct: 300 GHLTFGASAATNANLKYTPLSTISGDNTFYGLDIVGISVGG-----TKLPAVSSSTFSAG 354
Query: 264 ---FDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
DSG+ T L Y L S ++ + + A ED C+
Sbjct: 355 GSIIDSGTVITRLAPTAYAALRSAFRQGMEKYPV--ANEDGLFDTCY 399
>gi|217073142|gb|ACJ84930.1| unknown [Medicago truncatula]
Length = 191
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 73/162 (45%), Gaps = 14/162 (8%)
Query: 6 NGENLCFPTVRMSSSSS--SSSSSSLFNHVGSSLLFQVHGNVYPT--GYYNVTMYIGQPA 61
NG+NL F R ++ S SS+ F + GN PT G Y + +G P
Sbjct: 21 NGDNLVFQVERRKTTLSGIKHHDHHRRGRFLSSVDFNLGGNGLPTRTGLYFTKLGLGSPK 80
Query: 62 RPYFLDLDTGSDLTWLQCDAPCVRCVEAPH-----PLYRP----SNDLVPCEDPICASLH 112
+ Y++ +DTGSD+ W+ C C RC LY P +++L+ C+ C+S +
Sbjct: 81 KDYYVQVDTGSDILWVNC-VECSRCPTKSQIGMDLTLYDPKGSHTSELISCDHEFCSSTY 139
Query: 113 APGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
C C Y + Y DG ++ G V+D F+ NG
Sbjct: 140 DGPIPGCRAETPCPYSITYGDGSATTGYYVRDYLTFDRINGN 181
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 66/131 (50%), Gaps = 13/131 (9%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SN 98
V G +G Y + + IG+P ++ LDTGSD++W+QC APC C + P++ P SN
Sbjct: 139 VSGTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQC-APCSECYQQSDPIFDPISSN 197
Query: 99 DLVP--CEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
P C++P C SL N C YE+ Y DG ++G + T G
Sbjct: 198 SYSPIRCDEPQCKSLDLSECRN----GTCLYEVSYGDGSYTVGEFATETV----TLGSAA 249
Query: 157 NPRLALGCGYN 167
+A+GCG+N
Sbjct: 250 VENVAIGCGHN 260
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 94/353 (26%), Positives = 142/353 (40%), Gaps = 66/353 (18%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCE 104
G Y + +++G P R + L +DTGSDLTWLQC PC C + P++ PS ++PC
Sbjct: 169 GEYFMDVFVGNPPRHFLLIIDTGSDLTWLQC-KPCKACFDQSGPVFDPSQSTSFKIIPCN 227
Query: 105 DPICASLHAPGHHNCED------PAQCDYELEYADGGSSLGVLVKDAFAFNYTNG-QRLN 157
C + H C D P C Y Y D + G L ++ + + ++ L
Sbjct: 228 AAACDLV---VHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLE 284
Query: 158 PR-LALGCGYNQVPGASYHPLDGILGL------GKGKSSIVSQLHSQKLIRNV----VGH 206
R + +GCG++ LG + +SS + Q S L+ V
Sbjct: 285 IRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLVDRTNNLSVSS 344
Query: 207 CLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV---- 262
+S G G L D + V T+ S + T YY L G + LP+
Sbjct: 345 AISFGAGFALSRHFDQMRFTPFVRTNNSVE-TFYY------LGIQGIKIDQELLPIPAER 397
Query: 263 -----------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK--G 309
+ DSG++ TYLNR Y+ + S L+ S A + L +C+ G
Sbjct: 398 FAIAPNGSGGTIIDSGTTLTYLNRDAYRAVESAF---LARISYPRADPFDILGICYNATG 454
Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISN--KGNVCLGIL 360
R F TL++ F +G +L E Y I + + CL IL
Sbjct: 455 RTAVP--------FPTLSIVFQNGAE---LDLPQENYFIQPDPQEAKHCLAIL 496
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 86/304 (28%), Positives = 119/304 (39%), Gaps = 59/304 (19%)
Query: 51 YNVTMYIGQPARP--YFLDLDTGSDLTWLQCDAPCVRCVEAPHP----LYRPSNDLVPCE 104
Y + + IG P RP L LDTGSDL W QC C C P P L + VPC
Sbjct: 100 YLIHLSIGTP-RPQRVALTLDTGSDLVWTQC--ACHVCFAQPFPTFDALASQTTLAVPCS 156
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNY---TNGQRLN---- 157
DPIC S P + C Y +YAD + G +V+D F F NG + +
Sbjct: 157 DPICTSGKYPLSGCTFNDNTCFYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKAHAGVA 216
Query: 158 -PRLALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
P + GCG YN+ G GI G +G S+ SQL + HC +
Sbjct: 217 VPNVRFGCGQYNK--GIFKSNESGIAGFSRGPMSLPSQLKVARF-----SHCFTAIADAR 269
Query: 216 ---LFFG-----DDL--YDSSRVVWTSMS-SDYTKYYSPGVAELFFGGETTGLKNLPV-- 262
+F G D+L + + V T + S+ + YY L G T G LP+
Sbjct: 270 TSPVFLGGAPGPDNLGAHATGPVQSTPFANSNGSLYY------LTLKGITVGKTRLPLNA 323
Query: 263 ---------------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
+ DSG+ L Y++L + + E+ D LC+
Sbjct: 324 LAFAGKGTGSGSGGTIIDSGTGIRTLPGPMYRSLRAAFVARVKLPVANESAADAESTLCF 383
Query: 308 KGRR 311
+ R
Sbjct: 384 EAAR 387
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 93/331 (28%), Positives = 139/331 (41%), Gaps = 47/331 (14%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y V + +G P R ++ +D+GSD+ W+QC PC RC + P++ P++ V C
Sbjct: 140 SGEYFVRIGVGSPPRNQYMVIDSGSDIVWVQCK-PCSRCYQQSDPVFDPADSSSFAGVSC 198
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+C L G C + +C YE+ Y DG + G L + T GQ + +A+G
Sbjct: 199 GSDVCDRLENTG---C-NAGRCRYEVSYGDGSYTKGTLALETL----TVGQVMIRDVAIG 250
Query: 164 CGY-NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFG 219
CG+ NQ + G+LGLG G S + QL Q +CL G G L FG
Sbjct: 251 CGHTNQ---GMFIGAAGLLGLGGGSMSFIGQLGGQT--GGAFSYCLVSRGTGSTGALEFG 305
Query: 220 DDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGG-------ETTGLKNL---PVVFDSG 267
W S+ + +Y G+A + GG ET L VV D+G
Sbjct: 306 RGALPVG-ATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTNGVVMDTG 364
Query: 268 SSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLA 327
++ T Y + S +L AP C+ F++V T++
Sbjct: 365 TAVTRFPTAAYVAFRDSFTAQTS--NLPRAPGVSIFDTCYD-LNGFESVR-----VPTVS 416
Query: 328 LSFTDGKTRTLFELTPEAYLI-ISNKGNVCL 357
F+DG T L +LI + G CL
Sbjct: 417 FYFSDGPVLT---LPARNFLIPVDGGGTFCL 444
>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 72/279 (25%), Positives = 120/279 (43%), Gaps = 30/279 (10%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
Y +++ +G P++ +++DTGS +W+ C+ C C P + + V C +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C + H + E+ C + + Y DG +S G+L +D F ++ Q++ P GC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
+ + +DG+LG+G G S++ Q + + +CL FF Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQSSPRF---DGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
V T YTK + ELFF GE GL VVFDSGS +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
Y+ L+ +++ L + A E+E+ C+ R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 43/120 (35%), Positives = 63/120 (52%), Gaps = 15/120 (12%)
Query: 55 MYIGQPARPYFLDLDTGSDLTWLQCDAPCV---RCVEAPHPLYRP----SNDLVPCEDPI 107
M +GQP +P F LDTGSD+TWLQC PC C E P++ P S + V C+
Sbjct: 1 MRVGQPQQPSFFVLDTGSDVTWLQC-LPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQ 59
Query: 108 CASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYN 167
C L G + C Y++EY DG ++G L + F ++N P +++GCG++
Sbjct: 60 CQLLDEAGCNV----NSCIYKVEYGDGSFTIGELATETLTFVHSNSI---PNISIGCGHD 112
>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
Length = 321
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 74/280 (26%), Positives = 120/280 (42%), Gaps = 32/280 (11%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
Y +++ +G PA+ +++DTGS +W+ C+ C C P + + V C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C + H + E+ C + + Y DG +S G+L +D F ++ Q++ P GC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PGFTFGCNL 115
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
+ + +DG+LG+G G S++ Q + +CL FF Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSYCLPLQMSERGFFSKTTGYFS 172
Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
V T YTK + ELFF GE GL VVFDSGS +
Sbjct: 173 LGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVFDSGSELS 232
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKE-APEDETLPLCWKGR 310
Y+ S++++ + LK A E+E+ C+ R
Sbjct: 233 YIP----DRALSVLRQRIRELLLKRGAAEEESERNCYDMR 268
>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 50/163 (30%), Positives = 81/163 (49%), Gaps = 14/163 (8%)
Query: 44 NVYPTG---YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH-PLYRPS-- 97
N++P+ + V +GQP P +DTGS L W+QC APC C + P++ PS
Sbjct: 92 NLHPSASEPLFLVNFSMGQPPVPQLAIMDTGSSLLWIQC-APCKSCSQQIIGPMFDPSIS 150
Query: 98 --NDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN-GQ 154
D + C++ IC +AP C+ +QC Y Y +G S+GV+ + F ++ G+
Sbjct: 151 STYDSLSCKNIICR--YAPSGE-CDSSSQCVYNQTYVEGLPSVGVIATEQLIFGSSDEGR 207
Query: 155 RLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ 197
+ GC + G+ GLG G +S+V+Q+ S+
Sbjct: 208 NAVNNVLFGCSHRN-GNYKDRRFTGVFGLGSGITSVVNQMGSK 249
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 64/124 (51%), Gaps = 13/124 (10%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y V + +G P R ++ +D+GSD+ W+QC+ PC +C P++ P++ V C
Sbjct: 133 SGEYFVRIGVGSPPRNQYVVMDSGSDIIWVQCE-PCTQCYHQSDPVFNPADSSSFSGVSC 191
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+C+ + H +C YE+ Y DG + G L + F G+ L +A+G
Sbjct: 192 ASTVCSHVDNAACHE----GRCRYEVSYGDGSYTKGTLALETITF----GRTLIRNVAIG 243
Query: 164 CGYN 167
CG++
Sbjct: 244 CGHH 247
>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
Length = 367
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 69/156 (44%), Gaps = 14/156 (8%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G Y V + IG P + +DT SDL W QC PC C P++ P + +PC
Sbjct: 87 GEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PCTGCYHQVDPMFNPRVSSTYAALPCS 145
Query: 105 DPICASL--HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
C L H GH +D C Y Y+ ++ G L D G+ +A
Sbjct: 146 SDTCDELDVHRCGH---DDDESCQYTYTYSGNATTEGTLAVDKLVI----GEDAFRGVAF 198
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQK 198
GC + GA G++GLG+G S+VSQL ++
Sbjct: 199 GCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRR 234
>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 73/279 (26%), Positives = 118/279 (42%), Gaps = 30/279 (10%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
Y ++ +G PA+ +++DTGS +W+ C+ C C P + + V C +
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C + H + E+ C + + Y DG +S G+L +D F ++ Q++ P GC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
+ + +DG+LG+G G S++ Q + +CL FF Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
V T YTK + ELFF GE GL VVFDSGS +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
Y+ L+ +++ L + A E+E+ C+ R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268
>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 72/279 (25%), Positives = 119/279 (42%), Gaps = 30/279 (10%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
Y +++ +G P++ +++DTGS +W+ C+ C C P + + V C +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSASWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C + H + E+ C + + Y DG +S G+L +D F ++ Q++ P GC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
+ + +DG+LG+G G S++ Q + +CL FF Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
V T YTK + ELFF GE GL VVFDSGS +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
Y+ L+ +++ L + A E+E+ C+ R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268
>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 73/279 (26%), Positives = 118/279 (42%), Gaps = 30/279 (10%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
Y ++ +G PA+ +++DTGS +W+ C+ C C P + + V C +
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C + H + E+ C + + Y DG +S G+L +D F ++ Q++ P GC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
+ + +DG+LG+G G S++ Q + +CL FF Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
V T YTK + ELFF GE GL VVFDSGS +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
Y+ L+ +++ L + A E+E+ C+ R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 87/320 (27%), Positives = 128/320 (40%), Gaps = 67/320 (20%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVR---CVEAPHPLYR 95
+ H NV T V++ +G P + + LDTGS+L+WL C R + P
Sbjct: 77 LRFHHNVSLT----VSLAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRAS 132
Query: 96 PSNDLVPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQ 154
+ VPC C S P C+ ++C L YADG SS G L D FA +G
Sbjct: 133 STFAAVPCASAQCRSRDLPSPPACDGASSRCSVSLSYADGSSSDGALATDVFAVG--SGP 190
Query: 155 RLNPRLALGC---GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG- 210
L R A GC ++ P G+LG+ +G S VSQ +++ +C+S
Sbjct: 191 PL--RAAFGCMSSAFDSSPDGVAS--AGLLGMNRGALSFVSQASTRRF-----SYCISDR 241
Query: 211 GGGGFLFFG-DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFG---------GETTGLKNL 260
G L G DL T + +YT Y P + +F G G K+L
Sbjct: 242 DDAGVLLLGHSDLP-------TFLPLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHL 294
Query: 261 PV---------------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPL 305
P+ + DSG+ +T+L Y L + ++ A+ L A +D +
Sbjct: 295 PIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFTRQ--ARPLLPALDDPSF-- 350
Query: 306 CWKGRRPFKNVHDVKKCFRT 325
F+ D CFR
Sbjct: 351 ------AFQEAFDT--CFRV 362
>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 68/281 (24%), Positives = 121/281 (43%), Gaps = 32/281 (11%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
Y +++ +G PA+ +++DTGS +W+ C+ C C P + + V C +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C + H + E+ C + + Y DG +S G+L +D F ++ Q++ P GC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PGFTFGCNM 115
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----------SGGGGGFL 216
+ + +DG+LG+G G+ S++ Q + +CL S G F
Sbjct: 116 DSFGANEFGNVDGLLGMGAGQMSVLKQ---SSPTFDGFSYCLPLQMSERGFFSKTTGYFS 172
Query: 217 FFGDDLYDSSRVVWTSMSSDY--TKYYSPGVAELFFGGETTGL-----KNLPVVFDSGSS 269
G + V +T M + T+ + + + GE GL VVFDSGS
Sbjct: 173 LGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGSE 232
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
+Y+ L+ +++ L + A E+E+ C+ R
Sbjct: 233 LSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 270
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 92/379 (24%), Positives = 143/379 (37%), Gaps = 69/379 (18%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD----------APCVRCVEAPHPLYRPSN 98
G Y V +G PA+P+ L DTGSDLTW++C + +P +RP
Sbjct: 93 GQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEK 152
Query: 99 DL----VPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAF----- 148
+PC C+ C P + C Y+ Y DG ++ G + ++
Sbjct: 153 SKTWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSSS 212
Query: 149 -----NYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLI 200
N +L L LGC G S+ DG+L LG S S S+ +
Sbjct: 213 SSSSKNKVKKAKLQ-GLVLGC-TGSYTGPSFEASDGVLSLGYSNVSFASHAASRFGGRFS 270
Query: 201 RNVVGHCLSGGGGGFLFFGDDLYDS----------SRVVWTSMSSDYTKYYSPGVAELFF 250
+V H +L FG + S +R + S +Y + +
Sbjct: 271 YCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDVSIKAISV 330
Query: 251 GGETTGLKNLP-----------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPE 299
GE L +P V+ DSG+S T L + Y+ + + + K+L+ P
Sbjct: 331 DGE---LLKIPRDVWEVDGGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLA-----RFPR 382
Query: 300 DETLPL--CWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCL 357
P C+ P + D LA+ F G R E ++Y+I + G C+
Sbjct: 383 VAMDPFEYCYNWTSPSRK--DEGDDLPKLAVHFA-GSAR--LEPPSKSYVIDAAPGVKCI 437
Query: 358 GILNGAEVGLQDLNVIGGI 376
G+ G G ++VIG I
Sbjct: 438 GVQEGPWPG---ISVIGNI 453
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 62/133 (46%), Gaps = 13/133 (9%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN-- 98
V G +G Y + +G P R ++ LDTGSD+ W+QC+ PC C P++ PS
Sbjct: 147 VSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCE-PCRECYSQADPIFNPSYSA 205
Query: 99 --DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
V C+ +C+ L A H+ C YE Y DG S G + F T+
Sbjct: 206 SFSTVGCDSAVCSQLDAYDCHS----GGCLYEASYGDGSYSTGSFATETLTFGTTS---- 257
Query: 157 NPRLALGCGYNQV 169
+A+GCG+ V
Sbjct: 258 VANVAIGCGHKNV 270
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 82/277 (29%), Positives = 119/277 (42%), Gaps = 38/277 (13%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSNDL----VPCE 104
Y VT+ IG P R + + DTGSDLTW+QC PC C PL+ PS VPC
Sbjct: 122 YVVTIGIGTPPRNFTVLFDTGSDLTWVQC-LPCPDSSCYPQQEPLFDPSKSSTYVDVPCS 180
Query: 105 DPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR---L 160
P C H G A C+Y ++Y D + G L ++ F + + L P +
Sbjct: 181 APEC---HIGGVQQTRCGATSCEYSVKYGDESETHGSLAEETFTLSPPS--PLAPAATGV 235
Query: 161 ALGCG--YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRN---VVGHCL--SGGGG 213
GC Y V + + G+LGLG+G SSI+SQ +++ I + V +CL G
Sbjct: 236 VFGCSHEYISVFNDTGMGVAGLLGLGRGDSSILSQ--TRRSINSGGGVFSYCLPPRGSST 293
Query: 214 GFLFFGDDL----YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP-------V 262
G+L G S + +T + + ++ S V L ++P
Sbjct: 294 GYLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSLGA 353
Query: 263 VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPE 299
V DSG+ T++ Y L + L S K PE
Sbjct: 354 VIDSGTVVTHMPAAAYYPLRDEFR--LHMGSYKMLPE 388
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 84/191 (43%), Gaps = 24/191 (12%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP-- 96
+ H NV T +++ +G P + + +DTGS+L+WL C+ + P+P + P
Sbjct: 58 LRFHHNVSLT----ISITVGTPPQNMSMVIDTGSELSWLHCNTNTTATI--PYPFFNPNI 111
Query: 97 --SNDLVPCEDPICASLHA--PGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTN 152
S + C P C + P +C+ C L YAD SS G L D F F
Sbjct: 112 SSSYTPISCSSPTCTTRTRDFPIPASCDSNNLCHATLSYADASSSEGNLASDTFGF---- 167
Query: 153 GQRLNPRLALGCGYNQVPGASYHPLD--GILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
G NP + GC + S + G++G+ G S+VSQL K +C+SG
Sbjct: 168 GSSFNPGIVFGCMNSSYSTNSESDSNTTGLMGMNLGSLSLVSQLKIPKF-----SYCISG 222
Query: 211 GG-GGFLFFGD 220
G L G+
Sbjct: 223 SDFSGILLLGE 233
>gi|326500240|dbj|BAK06209.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 84/341 (24%), Positives = 134/341 (39%), Gaps = 43/341 (12%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCV-------EAPHPLYRPS----NDLVPCED 105
+G P + + LDTGSDL WL C C C AP Y PS + VPC
Sbjct: 104 VGTPGHTFMVALDTGSDLFWLPCQ--CDGCTPPPSSAASAPASFYIPSLSSTSQAVPCNS 161
Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNY--TNGQRLNPRLAL 162
C C + C Y++ Y SS G LV+D + T+ Q L ++
Sbjct: 162 DFCGL-----RKECSKTSSCPYKMVYVSADTSSSGFLVEDVLYLSTEDTHPQFLKAQIMF 216
Query: 163 GCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFG 219
GCG +V S+ +G+ GLG S+ S L + L N C G G + FG
Sbjct: 217 GCG--EVQTGSFLDAAAPNGLFGLGVDMISVPSILAQKGLTSNSFSMCFGRDGIGRISFG 274
Query: 220 DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQ 279
D ++ + Y+ + + G L+ + +FD+G+S+TYL Y
Sbjct: 275 DQGSSDQEETPLDINQKHPT-YAITITGIAVGNNLMDLE-VSTIFDTGTSFTYLADPAYT 332
Query: 280 TLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC---FRTLALSFTDGKTR 336
+T ++ A + A + R PF+ +D+ +T ++S
Sbjct: 333 YITDGFHSQVQAN--RHAADS---------RIPFEYCYDLSSSEARIQTPSISLRTVGGS 381
Query: 337 TLFELTPEAYLIISNKGNV-CLGILNGAEVGLQDLNVIGGI 376
+ P + I V CL I+ ++ + N + G+
Sbjct: 382 LFPAIDPGQVISIQQHEYVYCLAIVKSTKLNIIGQNFMTGV 422
>gi|326499199|dbj|BAK06090.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 84/341 (24%), Positives = 134/341 (39%), Gaps = 43/341 (12%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCV-------EAPHPLYRPS----NDLVPCED 105
+G P + + LDTGSDL WL C C C AP Y PS + VPC
Sbjct: 104 VGTPGHTFMVALDTGSDLFWLPCQ--CDGCTPPPSSAASAPASFYIPSLSSTSQAVPCNS 161
Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNY--TNGQRLNPRLAL 162
C C + C Y++ Y SS G LV+D + T+ Q L ++
Sbjct: 162 DFCGL-----RKECSKTSSCPYKMVYVSADTSSSGFLVEDVLYLSTEDTHPQFLKAQIMF 216
Query: 163 GCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFG 219
GCG +V S+ +G+ GLG S+ S L + L N C G G + FG
Sbjct: 217 GCG--EVQTGSFLDAAAPNGLFGLGVDMISVPSILAQKGLTSNSFSMCFGRDGIGRISFG 274
Query: 220 DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQ 279
D ++ + Y+ + + G L+ + +FD+G+S+TYL Y
Sbjct: 275 DQGSSDQEETPLDINQKHPT-YAITITGIAVGNNLMDLE-VSTIFDTGTSFTYLADPAYT 332
Query: 280 TLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC---FRTLALSFTDGKTR 336
+T ++ A + A + R PF+ +D+ +T ++S
Sbjct: 333 YITDGFHSQVQAN--RHAADS---------RIPFEYCYDLSSSEARIQTPSISLRTVGGS 381
Query: 337 TLFELTPEAYLIISNKGNV-CLGILNGAEVGLQDLNVIGGI 376
+ P + I V CL I+ ++ + N + G+
Sbjct: 382 LFPAIDPGQVISIQQHEYVYCLAIVKSTKLNIIGQNFMTGV 422
>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 445
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 86/350 (24%), Positives = 139/350 (39%), Gaps = 52/350 (14%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQC--DAPCVRCVEAPHPLYRPSNDLVPCEDPICAS 110
VT+ IG P +P + LDTGS L+W+QC P + P S ++PC P+C
Sbjct: 90 VTLPIGTPPQPQQMVLDTGSQLSWIQCHNKTPPTASFD---PSLSSSFYVLPCTHPLCKP 146
Query: 111 LHAPGH---HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYN 167
P C+ C Y YADG + G LV++ AF+ + + P L LGC
Sbjct: 147 -RVPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLAFSPS---QTTPPLILGC--- 199
Query: 168 QVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG---GFLFFGDDLYD 224
+ GILG+ G+ S Q K V + G + G++ +
Sbjct: 200 ---SSESRDARGILGMNLGRLSFPFQAKVTKFSYCVPTRQPANNNNFPTGSFYLGNN-PN 255
Query: 225 SSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK------NLP-------------VVFD 265
S+R + SM + P + L + G++ N+P + D
Sbjct: 256 SARFRYVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGGRKLNIPPSVFRPNAGGSGQTMVD 315
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
SGS +T+L V Y + + + L + K +C+ G N ++ +
Sbjct: 316 SGSEFTFLVDVAYDRVREEIIRVLGPRVKKGYVYGGVADMCFDG-----NAMEIGRLLGD 370
Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNV-CLGILNGAEVGLQDLNVIG 374
+A F G + + P+ ++ G V C+GI +G N+IG
Sbjct: 371 VAFEFEKG----VEIVVPKERVLADVGGGVHCVGIGRSERLGAAS-NIIG 415
>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
Length = 409
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 77/291 (26%), Positives = 113/291 (38%), Gaps = 37/291 (12%)
Query: 68 LDTGSDLTWLQCDAPCVRCVEAPH--PLYRPSNDL----VPCEDPICASLHAPGHHNCED 121
+D+GSD+ W+QC PC V P PL+ P+ VPC CA L P C
Sbjct: 85 IDSGSDVPWVQCQ-PCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARL-GPYRRGCLA 142
Query: 122 PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGIL 181
+QC + + YA+G ++ G D + R GC + + + G L
Sbjct: 143 NSQCQFGITYANGATATGTYSSDDLTLGPYDVVR---GFLFGCAHADQGSTFSYDVAGTL 199
Query: 182 GLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRV---VWTSMSSD 236
LG G S V Q SQ V +C+ S GF+ FG ++ V V T + S
Sbjct: 200 ALGGGSQSFVQQTASQ--YSRVFSYCVPPSTSSFGFIMFGVPPQRAALVPTFVSTPLLSS 257
Query: 237 YTKYYSPGVAELFFGGETTGLKNLPV---------VFDSGSSYTYLNRVTYQTLTSIMKK 287
T SP + + LPV V DS + + + YQ L + +
Sbjct: 258 ST--MSPTFYRVLLRSIIVAGRPLPVPPTVFSASSVIDSATVISRIPPTAYQALRAAFRS 315
Query: 288 ELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTL 338
++ + AP L C+ F V + ++AL F G T L
Sbjct: 316 AMTM--YRPAPPVSILDTCYD----FSGVRSIT--LPSIALVFDGGATVNL 358
>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
Length = 366
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 62/133 (46%), Gaps = 13/133 (9%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN-- 98
V G +G Y + +G P R ++ LDTGSD+ W+QC+ PC C P++ PS
Sbjct: 147 VSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCE-PCRECYSQADPIFNPSYSA 205
Query: 99 --DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
V C+ +C+ L A H+ C YE Y DG S G + F T+
Sbjct: 206 SFSTVGCDSAVCSQLDAYDCHS----GGCLYEASYGDGSYSTGSFATETLTFGTTS---- 257
Query: 157 NPRLALGCGYNQV 169
+A+GCG+ V
Sbjct: 258 VANVAIGCGHKNV 270
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 75/280 (26%), Positives = 118/280 (42%), Gaps = 27/280 (9%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G Y + + IG P + DTGSDL W QC PC+ C + +P++ PS V CE
Sbjct: 89 GEYLMKISIGTPPFDVYGIYDTGSDLMWTQC-LPCLSCYKQKNPMFDPSKSTSFKEVSCE 147
Query: 105 DPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLAL 162
C L +C P + CD+ Y DG + GV+ + N +GQ + +
Sbjct: 148 SQQCRLLDT---VSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPXSIXNIVF 204
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS-----QKLIRNVVGHCLSGGGGGFLF 217
GCG+N + + + G+ G G S+ SQ+ S +K + +V +
Sbjct: 205 GCGHNNSGTFNENEM-GLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKII 263
Query: 218 FGDDLYDS-SRVVWTSM-SSDYTKYY-------SPGVAELFFGGETTGLKNLPVVFDSGS 268
FG + S S VV T + + D YY S G F + V D+G+
Sbjct: 264 FGPEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDAGT 323
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
T L R Y L +K+ + + +++ D LC++
Sbjct: 324 PPTLLPRDFYNRLVQGVKEAIPMEPVQDP--DLQPQLCYR 361
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 56/181 (30%), Positives = 88/181 (48%), Gaps = 20/181 (11%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y V + +G P R ++ +D+GSD+ W+QC PC +C PL+ P++ V C
Sbjct: 40 SGEYFVRIGLGSPPRSQYMVIDSGSDIVWVQCK-PCTQCYHQTDPLFDPADSASFMGVSC 98
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+C + G ++ +C YE+ Y DG + G L + F G+ + +A+G
Sbjct: 99 SSAVCDRVENAGCNS----GRCRYEVSYGDGSYTKGTLALETLTF----GRTVVRNVAIG 150
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG---GGFLFFGD 220
CG++ + G+LGLG G S + QL Q N +CL G GFL FG
Sbjct: 151 CGHSNR--GMFVGAAGLLGLGGGSMSFMGQLSGQT--GNAFSYCLVSRGTNTNGFLEFGS 206
Query: 221 D 221
+
Sbjct: 207 E 207
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 90/370 (24%), Positives = 140/370 (37%), Gaps = 54/370 (14%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--------VRCVEAPHPLYRPSNDL 100
G Y V +G PA+P+ L DTGSDLTW++C P P +RP +
Sbjct: 95 GQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSR 154
Query: 101 ----VPCEDPICASLHAPGHHNCEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
+ C C C P + C Y+ Y DG ++ G + ++ + +
Sbjct: 155 TWAPISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGREE 214
Query: 156 LNPR---LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGHCLS 209
+ L LGC + G S+ DG+L LG S S S+ + +V H
Sbjct: 215 RKAKLKGLVLGCSSSYT-GPSFEASDGVLSLGYSGISFASHAASRFGGRFSYCLVDHLSP 273
Query: 210 GGGGGFLFFGDDLYDSS-------------RVVWTSMSSD--YTKYYSPGVAELFFGGET 254
+L FG + SS R T + D +Y + + GE
Sbjct: 274 RNATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAGEF 333
Query: 255 TGLKNLP--------VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLC 306
+ V+ DSG+S T L + Y+ + + + K L+ L D C
Sbjct: 334 LKIPRAVWDVEAGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAG--LPRVTMDP-FEYC 390
Query: 307 WKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVG 366
+ P DV +A+ F G R E ++Y+I + G C+G+ G G
Sbjct: 391 YNWTSPSGKDADV--AVPKMAVHFA-GAAR--LEPPGKSYVIDAAPGVKCIGLQEGPWPG 445
Query: 367 LQDLNVIGGI 376
++VIG I
Sbjct: 446 ---ISVIGNI 452
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 63/223 (28%), Positives = 89/223 (39%), Gaps = 27/223 (12%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G Y V + IG P + +DT SDL W QC PC C P++ P + +PC
Sbjct: 87 GEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQ-PCTGCYHQVDPMFNPRVSSTYAALPCS 145
Query: 105 DPICASL--HAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
C L H GH +D C Y Y+ ++ G L D G+ +A
Sbjct: 146 SDTCDELDVHRCGH---DDDESCQYTYTYSGNATTEGTLAVDKLVI----GEDAFRGVAF 198
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG---GFLFFG 219
GC + GA G++GLG+G S+VSQL ++ +CL G L G
Sbjct: 199 GCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRF-----AYCLPPPASRIPGKLVLG 253
Query: 220 ---DDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGETTGL 257
D +++ + M D Y YY + L G L
Sbjct: 254 ADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRAMSL 296
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 73/272 (26%), Positives = 114/272 (41%), Gaps = 38/272 (13%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHP-------LYRPSND---- 99
Y +++ +G PA + +DTGSD++W+QC+ PC AP P L+ P+
Sbjct: 108 YVISVGLGSPAVTQRVVIDTGSDVSWVQCE-PC----PAPSPCHAHAGALFDPAASSTYA 162
Query: 100 LVPCEDPICASLHAPGHHN-CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
C CA L G N C+ ++C Y ++Y DG ++ G D + ++ R
Sbjct: 163 AFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTLSGSDVVR--- 219
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFL 216
GC + ++ DG++GLG S VSQ ++ +CL GFL
Sbjct: 220 GFQFGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAAR--YGKSFFYCLPATPASSGFL 277
Query: 217 FF----GDDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLPVVF------ 264
+SR T M S YY + ++ GG+ GL P VF
Sbjct: 278 TLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLS--PSVFAAGSLV 335
Query: 265 DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKE 296
DSG+ T L Y L+S + ++ + E
Sbjct: 336 DSGTVITRLPPAAYAALSSAFRAGMTRYARAE 367
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 53/162 (32%), Positives = 69/162 (42%), Gaps = 16/162 (9%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SN 98
G TG Y VT G PA+ L +DTGSDLTW+QC PC C ++ P S
Sbjct: 129 GTTVGTGNYIVTAGFGTPAKNSLLIIDTGSDLTWIQCK-PCADCYSQVDAIFEPKQSSSY 187
Query: 99 DLVPCEDPICASLHAPGHHNCEDP---AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
+PC C L + P C YE+ Y DG SS G ++ + Q
Sbjct: 188 KTLPCLSATCTELIT--SESNPTPCLLGGCVYEINYGDGSSSQGDFSQETLTLGSDSFQ- 244
Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ 197
A GCG+ + G+LGLG+ S SQ S+
Sbjct: 245 ---NFAFGCGHTNT--GLFKGSSGLLGLGQNSLSFPSQSKSK 281
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 72/264 (27%), Positives = 110/264 (41%), Gaps = 33/264 (12%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL 100
+ G +G Y + IG+P+ P ++ LDTGSD+ W+QC APC C P++ P++
Sbjct: 134 ISGTSQGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQC-APCADCYHQADPIFEPASST 192
Query: 101 ----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
+ C+ C SL N C YE+ Y DG ++G V + T G
Sbjct: 193 SYSPLSCDTKQCQSLDVSECRN----NTCLYEVSYGDGSYTVGDFVTETI----TLGSAS 244
Query: 157 NPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGG 213
+A+GCG+N + G+LGLG GK S SQ+++ +CL
Sbjct: 245 VDNVAIGCGHNN--EGLFIGAAGLLGLGGGKLSFPSQINASSF-----SYCLVDRDSDSA 297
Query: 214 GFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK----------NLPVV 263
L F L + + + +Y G+ L GGE + N ++
Sbjct: 298 STLEFNSALLPHAITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGII 357
Query: 264 FDSGSSYTYLNRVTYQTLTSIMKK 287
DSG++ T L Y L K
Sbjct: 358 IDSGTAVTRLQTAAYNALRDAFVK 381
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 82/312 (26%), Positives = 126/312 (40%), Gaps = 41/312 (13%)
Query: 68 LDTGSDLTWLQC----DAPCVRCVEAPH-PLYRPSNDLVPCEDPICASLHAPGHHNCEDP 122
LD+ SD+ W+QC PC V++ + P PS+ C P C +L P + C +
Sbjct: 163 LDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTAL-GPYANGCAN- 220
Query: 123 AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILG 182
QC Y + Y DG S+ G + D + N GC + + G+ GI+
Sbjct: 221 NQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVS---GFKFGCSHAEQ-GSFDARAAGIMA 276
Query: 183 LGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMS--SDYT 238
LG G S++SQ S+ N +C+ + GF G SSR V T M
Sbjct: 277 LGGGPESLLSQTASR--YGNAFSYCIPATASDSGFFTLGVPRRASSRYVVTPMVRFRQAA 334
Query: 239 KYYSPGVAELFFGGETTGLKNLPVVFDSGS------SYTYLNRVTYQTLTSIMKKELSAK 292
+Y + + GG+ G+ P VF +GS + T L YQ L S + ++
Sbjct: 335 TFYGVLLRTITVGGQRLGVA--PAVFAAGSVLDSRTAITRLPPTAYQALRSAFRSSMTM- 391
Query: 293 SLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK 352
+ AP L C+ F V +++ ++L F + L P L
Sbjct: 392 -YRSAPPKGYLDTCYD----FTGVVNIR--LPKISLVF---DRNAVLPLDPSGILF---- 437
Query: 353 GNVCLGILNGAE 364
N CL + A+
Sbjct: 438 -NDCLAFTSNAD 448
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 84/329 (25%), Positives = 137/329 (41%), Gaps = 38/329 (11%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLH 112
V + IG P L +DT SDL WLQC PC+ C P++ PS + S +
Sbjct: 87 VNISIGSPPVTQLLHMDTASDLLWLQC-RPCINCYAQSLPIFDPSRSYTHRNESCRTSQY 145
Query: 113 A-PGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL---ALGCGYNQ 168
+ P C+Y + Y DG S G+L K+ FN + + L GCG++
Sbjct: 146 SMPSLRFNAKTRSCEYSMRYMDGTGSKGILAKEMLMFNTIYDESSSAALHDVVFGCGHDN 205
Query: 169 VPGASYHPL--DGILGLGKGKSSIVSQLHSQ------KLIRNVVGH-CLSGGGGGFLFFG 219
PL GILGLG G+ S+V + ++ L H L G G G
Sbjct: 206 YG----EPLVGTGILGLGYGEFSLVHRFGTKFSYCFGSLDDPSYPHNVLVLGDDGANILG 261
Query: 220 D----DLYDS-SRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLN 274
D ++Y+ V ++S D P +F TGL + D+G+S T L
Sbjct: 262 DTTPLEIYNGFYYVTIEAISVD--GIILPIDPWVFNRNHQTGLGG--TIIDTGNSLTSLV 317
Query: 275 RVTYQTLTSIMKKELSAK-SLKEAPEDETLPL-CWKGRRPFKNVHDVKKCFRTLALSFTD 332
Y+ L + ++ + + + +D+ + C+ G +++ V+ F + F+D
Sbjct: 318 EEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVECYNGNLE-RDL--VESGFPIVTFHFSD 374
Query: 333 GKTRTL------FELTPEAYLIISNKGNV 355
G +L +L+P + + GN+
Sbjct: 375 GAELSLDVKSVFMKLSPNVFCLAVTPGNM 403
>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
Length = 430
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 87/358 (24%), Positives = 142/358 (39%), Gaps = 62/358 (17%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPIC 108
+++ IG P + + LDTGS L+W+QC + P + P S +PC P+C
Sbjct: 74 ISLPIGTPPQAQQMVLDTGSQLSWIQCHR--KKLPPKPKTSFDPSLSSSFSTLPCSHPLC 131
Query: 109 ASLHAPGH---HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
P +C+ C Y YADG + G LVK+ F+ T + P L LGC
Sbjct: 132 KP-RIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTE---ITPPLILGCA 187
Query: 166 YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG-------GFLFF 218
GILG+ +G+ S VSQ K +C+ G +
Sbjct: 188 TESSDDR------GILGMNRGRLSFVSQAKISKF-----SYCIPPKSNRPGFTPTGSFYL 236
Query: 219 GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFG----GETTGLKNLPV------------ 262
GD+ +S + S+ + P + L + G GLK L +
Sbjct: 237 GDNP-NSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGS 295
Query: 263 ---VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
+ DSGS +T+L Y + + + + + K T +C+ G NV +
Sbjct: 296 GQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDG-----NVAMI 350
Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNV-CLGILNGAEVGLQDLNVIGGI 376
+ L FT G + P+ ++++ G + C+GI + +G N+IG +
Sbjct: 351 PRLIGDLVFVFTRG----VEIFVPKERVLVNVGGGIHCVGIGRSSMLGAAS-NIIGNV 403
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 78/325 (24%), Positives = 130/325 (40%), Gaps = 49/325 (15%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICAS 110
Y + + +G P ++DTGSDL W QC PC C P++ PSN E
Sbjct: 61 YLMKLQVGTPPFEIEAEIDTGSDLIWTQC-MPCTNCYSQYAPIFDPSNSSTFKE------ 113
Query: 111 LHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR-LNPRLALGCGYNQV 169
C + C Y++ YAD S G L + + T+G+ + P +GCG+N
Sbjct: 114 ------KRCNGNS-CHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNS- 165
Query: 170 PGASYHP-LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDD-LYDSSR 227
+ + P G++GL G SS+++Q+ + ++ +C + G + FG + +
Sbjct: 166 --SWFKPTFSGMVGLSWGPSSLITQMGGEY--PGLMSYCFASQGTSKINFGTNAIVAGDG 221
Query: 228 VVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP------------VVFDSGSSYTYLNR 275
VV T+M K PG+ L + G ++ ++ DSG++ TY
Sbjct: 222 VVSTTMFLTTAK---PGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYF-P 277
Query: 276 VTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKT 335
V+Y L P + LC+ D F + + F+ G
Sbjct: 278 VSYCNLVREAVDHYVTAVRTADPTGNDM-LCYYT--------DTIDIFPVITMHFSGGAD 328
Query: 336 RTLFELTPEAYLIISNKGNVCLGIL 360
L + Y+ +G CL I+
Sbjct: 329 LVLDKY--NMYIETITRGTFCLAII 351
>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 488
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 73/310 (23%), Positives = 125/310 (40%), Gaps = 56/310 (18%)
Query: 41 VHGNVYPTGY--YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP--CVRCVEAP------ 90
V +YP Y Y ++ +G P +P + LDTGS L+W+ C + C C +P
Sbjct: 79 VRTALYPHSYGGYAFSVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSSPSAMSAM 138
Query: 91 ---HPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQ------CDYELEYADGGSSLGVL 141
HP S+ LV C +P C +H+ C C L GS+ G+L
Sbjct: 139 AVFHPKNSSSSRLVGCRNPACRWIHSKSPSTCGSTGNNGNGDVCPPYLVVYGSGSTSGLL 198
Query: 142 VKDAFAFNYTNGQRLNP---RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQK 198
+ D + ++ A+GC V + P G+ G G+G S+ SQL K
Sbjct: 199 ISDTLRLSPSSSSSAPAPFRNFAIGCSIVSV----HQPPSGLAGFGRGAPSVPSQLKVPK 254
Query: 199 LIRNVVGHCL-------SGGGGGFLFFGDDLYDSSRVVWT----------SMSSDYTKYY 241
+CL + G L GD + + + T + Y+ YY
Sbjct: 255 F-----SYCLLSRRFDDNSAVSGELVLGDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYY 309
Query: 242 SPGVAELFFGGETTGLKN---LP-----VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKS 293
+ + GG+ L + +P + DSG+++TYL+ ++ + + M+ + +
Sbjct: 310 YLALTGISVGGKPVNLPSRAFVPSSGGGAIIDSGTTFTYLDPTVFKPVAAAMESAVGGRY 369
Query: 294 LKEAPEDETL 303
+ P ++ L
Sbjct: 370 NRSRPVEDAL 379
>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
Length = 480
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 55/188 (29%), Positives = 82/188 (43%), Gaps = 17/188 (9%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
TG Y V +G PA+P+ L DTGSDLTW++C +AP ++R + + C
Sbjct: 109 TGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCSGAGDGTGDAPRRVFRAAASRSWAPIAC 168
Query: 104 EDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYT-------NGQR 155
C S NC PA C Y+ Y DG ++ GV+ D+ + G+R
Sbjct: 169 SSDTCTSYVPFSLANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALSGSESRDGGGRR 228
Query: 156 LNPR-LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQ---KLIRNVVGHCLSGG 211
+ + LGC G S+ DG+L LG S S+ ++ + +V H
Sbjct: 229 AKLQGVVLGC-TASYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRN 287
Query: 212 GGGFLFFG 219
+L FG
Sbjct: 288 ATSYLTFG 295
>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 252
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 60/198 (30%), Positives = 90/198 (45%), Gaps = 22/198 (11%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDP 106
Y VTM +G ++ + +DT SDLTW+QC+ PC+ C P+++P S V C
Sbjct: 65 YIVTMGLG--SKNMTVIIDTRSDLTWVQCE-PCMSCYNQQGPIFKPSTSSSYQSVSCNSS 121
Query: 107 ICASLH----APGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
C SL G +P+ C+Y + Y DG + G L +A +F G
Sbjct: 122 TCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGDLGVEALSF----GGVSVSDFVF 177
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLFFG 219
GCG N + + G++GLG+ S+VSQ ++ V +CL G G L G
Sbjct: 178 GCGRNNK--GLFGGVSGLMGLGRSYLSLVSQTNA--TFGGVFSYCLPTTEAGSSGSLVMG 233
Query: 220 DDLYDSSRVVWTSMSSDY 237
++ S+ S S Y
Sbjct: 234 NEFSQISQKKKNSYGSRY 251
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 60/124 (48%), Gaps = 10/124 (8%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
+G Y + +G+PAR ++ LDTGSD+TWLQC PC C P+Y PS V C
Sbjct: 160 SGEYFSRVGVGRPARQLYMVLDTGSDVTWLQCQ-PCADCYAQSDPVYDPSVSTSYATVGC 218
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+ P C L A N C YE+ Y DG ++G + + +A+G
Sbjct: 219 DSPRCRDLDAAACRNST--GSCLYEVAYGDGSYTVGDFATETLTLGDSAPVS---NVAIG 273
Query: 164 CGYN 167
CG++
Sbjct: 274 CGHD 277
>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
Length = 328
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 85/200 (42%), Gaps = 30/200 (15%)
Query: 58 GQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLV---------PCEDPIC 108
G PA + +DTGSDLTW+QC PC C PL+ P+ C D +
Sbjct: 103 GSPAANLTVIVDTGSDLTWVQCK-PCSACYAQRDPLFDPAGSATYAAVRCNASACADSLR 161
Query: 109 ASLHAPGHHNCEDPA--QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
A+ PG +C Y L Y DG S GVL D A G L GCG
Sbjct: 162 AATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVAL---GGASLG-GFVFGCGL 217
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFF--GD 220
+ + G++GLG+ + S+VSQ S+ V +CL SG G L GD
Sbjct: 218 SNR--GLFGGTAGLMGLGRTELSLVSQTASR--YGGVFSYCLPAATSGDASGSLSLGGGD 273
Query: 221 DLYDSSR----VVWTSMSSD 236
D S R V +T M +D
Sbjct: 274 DAASSYRNTTPVAYTRMIAD 293
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 62/124 (50%), Gaps = 9/124 (7%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
+G Y + + +G PA ++ LDTGSD+ WLQC +PC C P++ P+ VPC
Sbjct: 133 SGEYFMRLGVGTPATNMYMVLDTGSDVVWLQC-SPCKVCYNQSDPVFNPAKSKTFATVPC 191
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+C L C Y++ Y DG ++G + F +G R++ +ALG
Sbjct: 192 GSRLCRRLDDSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLTF---HGARVD-HVALG 247
Query: 164 CGYN 167
CG++
Sbjct: 248 CGHD 251
>gi|414887401|tpg|DAA63415.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 242
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 61/240 (25%), Positives = 106/240 (44%), Gaps = 19/240 (7%)
Query: 136 SSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLH 195
SS GVL +D +F + + R GC ++ DGI+GLG+G+ SI+ QL
Sbjct: 3 SSSGVLGEDIVSFGRESELKAQ-RAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLV 61
Query: 196 SQKLIRNVVGHCLSG---GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGG 252
+ +I + C G GGG + G + S +V++ + YY+ + E+ G
Sbjct: 62 EKGVINDSFSLCYGGMDIGGGAMVLGG--VPTPSDMVFSRSDPLRSPYYNIELKEIHVAG 119
Query: 253 ETTGLKNL------PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLC 306
+ + + V DSG++Y YL + + ++ + P+ +C
Sbjct: 120 KALRVDSRIFDSKHGTVLDSGTTYAYLPEQAFMAFKDAVTSKVHSLKKIRGPDPSYKDIC 179
Query: 307 WKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNK--GNVCLGILNGAE 364
+ G R +NV + + F + + F +G+ LTPE YL +K G CLG+ +
Sbjct: 180 FAGAR--RNVSKLHEVFPDVDMVFGNGQK---LSLTPENYLFRHSKVDGAYCLGVFQNGK 234
>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
Length = 447
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 52/161 (32%), Positives = 74/161 (45%), Gaps = 15/161 (9%)
Query: 15 VRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDL 74
+R ++ ++ +SL + G G + +G Y + +G P+ L +DTGSDL
Sbjct: 50 LRQRLAADAARYASLVDATGRLHSPVFSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDL 109
Query: 75 TWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCEDPAQ----CD 126
WLQC +PC RC ++ P VPC P C +L PG C+ C
Sbjct: 110 VWLQC-SPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQCRALRFPG---CDSGGAAGGGCR 165
Query: 127 YELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYN 167
Y + Y DG SS G L D AF N +N + LGCG +
Sbjct: 166 YMVAYGDGSSSTGDLATDKLAF--ANDTYVN-NVTLGCGRD 203
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 78/325 (24%), Positives = 130/325 (40%), Gaps = 49/325 (15%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICAS 110
Y + + +G P ++DTGSDL W QC PC C P++ PSN E
Sbjct: 61 YLMKLQVGTPPFEIEAEIDTGSDLIWTQC-MPCTNCYSQYAPIFDPSNSSTFKE------ 113
Query: 111 LHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR-LNPRLALGCGYNQV 169
C + C Y++ YAD S G L + + T+G+ + P +GCG+N
Sbjct: 114 ------KRCNGNS-CHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNS- 165
Query: 170 PGASYHP-LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDD-LYDSSR 227
+ + P G++GL G SS+++Q+ + ++ +C + G + FG + +
Sbjct: 166 --SWFKPTFSGMVGLSWGPSSLITQMGGEY--PGLMSYCFASQGTSKINFGTNAIVAGDG 221
Query: 228 VVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP------------VVFDSGSSYTYLNR 275
VV T+M K PG+ L + G ++ ++ DSG++ TY
Sbjct: 222 VVSTTMFLTTAK---PGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYF-P 277
Query: 276 VTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKT 335
V+Y L P + LC+ D F + + F+ G
Sbjct: 278 VSYCNLVREAVDHYVTAVRTADPTGNDM-LCYYT--------DTIDIFPVITMHFSGGAD 328
Query: 336 RTLFELTPEAYLIISNKGNVCLGIL 360
L + Y+ +G CL I+
Sbjct: 329 LVLDKY--NMYIETITRGTFCLAII 351
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 93/353 (26%), Positives = 141/353 (39%), Gaps = 66/353 (18%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCE 104
G Y + +++G P R + L +DTGSDLTWLQC PC C + P++ PS ++PC
Sbjct: 85 GEYFMDVFVGNPPRHFLLIIDTGSDLTWLQC-KPCKACFDQSGPVFDPSQSTSFKIIPCN 143
Query: 105 DPICASLHAPGHHNCED------PAQCDYELEYADGGSSLGVLVKDAFAFNYTNG-QRLN 157
C + H C D P C Y Y D + G L ++ + + ++ L
Sbjct: 144 AAACDLV---VHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLE 200
Query: 158 PR-LALGCGYNQVPGASYHPLDGILGL------GKGKSSIVSQLHSQKLIRNV----VGH 206
R + +GCG++ LG + +SS + Q S L+ V
Sbjct: 201 IRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLVDRTNNLSVSS 260
Query: 207 CLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPV---- 262
+S G G L D + V T+ S + T YY L G + LP+
Sbjct: 261 AISFGAGFALSRHFDQMKFTPFVRTNNSVE-TFYY------LGIQGIKIDQELLPIPAER 313
Query: 263 -----------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK--G 309
+ DSG++ TYLNR Y+ + S L+ S A + L +C+ G
Sbjct: 314 FAIATNGSGGTIIDSGTTLTYLNRDAYRAVESAF---LARISYPRADPFDILGICYNATG 370
Query: 310 RRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISN--KGNVCLGIL 360
R F L++ F +G +L E Y I + + CL IL
Sbjct: 371 RAAVP--------FPALSIVFQNGAE---LDLPQENYFIQPDPQEAKHCLAIL 412
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 61/129 (47%), Gaps = 9/129 (6%)
Query: 45 VYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DL 100
V G Y + +G P +DTGSD+ WLQC+ PC C + P++ PS
Sbjct: 85 VASQGEYLMRYSVGSPPFQVLGIVDTGSDILWLQCE-PCEDCYKQTTPIFDPSKSKTYKT 143
Query: 101 VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PR 159
+PC C SL + C C+Y ++Y DG S G L + T+G ++ P+
Sbjct: 144 LPCSSNTCESLR---NTACSSDNVCEYSIDYGDGSHSDGDLSVETLTLGSTDGSSVHFPK 200
Query: 160 LALGCGYNQ 168
+GCG+N
Sbjct: 201 TVIGCGHNN 209
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 75/280 (26%), Positives = 118/280 (42%), Gaps = 27/280 (9%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G Y + + IG P + DTGSDL W QC PC+ C + +P++ PS V CE
Sbjct: 89 GEYLMKISIGTPPFDVYGIYDTGSDLMWTQC-LPCLSCYKQKNPMFDPSKSTSFKEVSCE 147
Query: 105 DPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-RLAL 162
C L +C P + CD+ Y DG + GV+ + N +GQ + +
Sbjct: 148 SQQCRLLDT---VSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPTSILNIVF 204
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS-----QKLIRNVVGHCLSGGGGGFLF 217
GCG+N + + + G+ G G S+ SQ+ S +K + +V +
Sbjct: 205 GCGHNNSGTFNENEM-GLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKII 263
Query: 218 FGDDLYDS-SRVVWTSM-SSDYTKYY-------SPGVAELFFGGETTGLKNLPVVFDSGS 268
FG + S S VV T + + D YY S G F + V D+G+
Sbjct: 264 FGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDAGT 323
Query: 269 SYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
T L R Y L +K+ + + +++ D LC++
Sbjct: 324 PPTLLPRDFYNRLVQGVKEAIPMEPVQDP--DLQPQLCYR 361
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 80/312 (25%), Positives = 127/312 (40%), Gaps = 42/312 (13%)
Query: 65 FLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCE 120
FL +DTGSD+TW+QCD PC +C + L++P+ +PC +C L + H+C
Sbjct: 2 FLLIDTGSDITWIQCD-PCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQS-FSHSCL 59
Query: 121 DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALGCGYNQVPGASYHPLDG 179
+ + C+Y + Y D ++ G + + ++ P A GCG+ A+ +G
Sbjct: 60 N-SSCNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGH-----ANKGLFNG 113
Query: 180 ILGL-GKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDDL---YDSSRVVWT 231
GL G GKSSI + V +CL S G L FG+ YD
Sbjct: 114 AAGLMGLGKSSIGFPAQTSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDYDVRFTPLV 173
Query: 232 SMSSDYTKYYSPGVAELFFGGETTGLKNLP----VVFDSGSSYTYLNRVTYQTLTSIMKK 287
SS ++Y+ + G G + LP V+ DSG+ + + Y+ L +
Sbjct: 174 DSSSGPSQYF------VSMTGINVGDELLPISATVMVDSGTVISRFEQSAYERLRDAFTQ 227
Query: 288 ELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYL 347
L L+ A C++ V D+ + L F D L+P L
Sbjct: 228 ILPG--LQTAVSVAPFDTCFR----VSTVDDIN--IPLITLHFRDDAE---LRLSPVHIL 276
Query: 348 IISNKGNVCLGI 359
+ G +C
Sbjct: 277 YPVDDGVMCFAF 288
>gi|20466302|gb|AAM20468.1| putative aspartyl protease [Arabidopsis thaliana]
gi|23198124|gb|AAN15589.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 320
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 74/255 (29%), Positives = 107/255 (41%), Gaps = 38/255 (14%)
Query: 131 YADGGSSLGVLVKDAFAFNYTNGQR----LNPRLALGCGYNQVP--GASYHPLDGILGLG 184
Y DG S+ G LVKD + G R N + GCG Q G S +DGI+G G
Sbjct: 2 YGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFG 61
Query: 185 KGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPG 244
+ SS +SQL SQ ++ HCL GG +F ++ S +V T M S + +YS
Sbjct: 62 QSNSSFISQLASQGKVKRSFAHCLDNNNGGGIFAIGEVV-SPKVKTTPMLSK-SAHYSVN 119
Query: 245 VAELFFGGETTGLK--------NLPVVFDSGSSYTYLNRVTYQ-TLTSIMKK--ELSAKS 293
+ + G L + V+ DSG++ YL Y L I+ EL+ +
Sbjct: 120 LNAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAVYNPLLNEILASHPELTLHT 179
Query: 294 LKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKG 353
++E+ F H K R ++F K+ +L + P YL +
Sbjct: 180 VQES---------------FTCFHYTDKLDRFPTVTFQFDKSVSL-AVYPREYLFQVRED 223
Query: 354 NVCLGILNGAEVGLQ 368
C G NG GLQ
Sbjct: 224 TWCFGWQNG---GLQ 235
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 87/343 (25%), Positives = 152/343 (44%), Gaps = 47/343 (13%)
Query: 51 YNVTMYIGQPARP--YFLDLDTGSDLTWLQCDAPCVRCVEA-PHP--LYRPSND----LV 101
Y V++ IG P RP + L DTGSDLTW+ C+ C C + PHP ++R ++ +
Sbjct: 119 YFVSIRIGTP-RPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFRANDSSSFRTI 177
Query: 102 PCEDPICASLHAPGHHN---CEDP-AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
PC C + + + C +P A C ++ Y +G ++GV + + +++
Sbjct: 178 PCSSDDC-KIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVTVGLNDHKKIR 236
Query: 158 P-RLALGC--GYNQVPGASYHPLDGILGLGKGKSSI---VSQLHSQKLIRNVVGHCLSGG 211
+ +GC +N+ G DG++GLG K S+ ++++ K +V H S
Sbjct: 237 LFDVLIGCTESFNETNGFP----DGVMGLGYRKHSLALRLAEIFGNKFSYCLVDHLSSSN 292
Query: 212 GGGFLFFGD-DLYDSSRVVWTSMSSDYTKYYSP-GVAELFFGGE----TTGLKNLP---- 261
FL FGD ++ T + Y + P V+ + GG ++ + N+
Sbjct: 293 HKNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSMLSISSDIWNVTGVGG 352
Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPED--ETLPLCWKGRRPFKNVHDV 319
++ DSG+S T L Y + + K + K K P + E C F++
Sbjct: 353 MIVDSGTSLTMLAGEAYDKVVDAL-KPIFDKHKKVVPIELPELNNFC------FEDKGFD 405
Query: 320 KKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNG 362
+ L + F DG +F+ ++Y+I +G CLGI+
Sbjct: 406 RAAVPRLLIHFADG---AIFKPPVKSYIIDVAEGIKCLGIIKA 445
>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 323
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 68/281 (24%), Positives = 121/281 (43%), Gaps = 32/281 (11%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
Y +++ +G P++ L++DTGS +W+ C+ C C P + + V C +
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C + H + E+ C + + Y DG +S G+L +D F ++ Q++ P + GC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PGFSFGCNM 115
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----------SGGGGGFL 216
+ + +DG+LG+G G S++ Q + +CL S G F
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSYCLPLQMSERGFFSKTTGYFS 172
Query: 217 FFGDDLYDSSRVVWTSMSSDY--TKYYSPGVAELFFGGETTGL-----KNLPVVFDSGSS 269
G + V +T M + T+ + + + GE GL VVFDSGS
Sbjct: 173 LGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGSE 232
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
+Y+ L+ +++ L + A E+E+ C+ R
Sbjct: 233 LSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 270
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 94/354 (26%), Positives = 144/354 (40%), Gaps = 74/354 (20%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV--RCVEAPHPLYRPSND----LVP 102
G Y +T+ IG P Y DTGSDL W QC APC +C P PLY P++ ++P
Sbjct: 90 GEYLMTLSIGTPPLSYPAIADTGSDLIWTQC-APCSGDQCFAQPAPLYNPASSTTFGVLP 148
Query: 103 CEDPI--CASLHA-----PGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNG-Q 154
C + CA + A PG C Y Y G ++ GV + F F Q
Sbjct: 149 CNSSLSMCAGVLAGKAPPPG-------CACMYNQTYGTGWTA-GVQGSETFTFGSAAADQ 200
Query: 155 RLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG 214
P +A GC + + ++ G++GLG+G S+VSQL + + +CL+
Sbjct: 201 ARVPGIAFGC--SNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRF-----SYCLTP---- 249
Query: 215 FLFFGDDLYDSSRVVWTSMSSDYTKYYS-PGVAE-----------LFFGGETTGLKNLPV 262
F D S+ ++ S + + T S P VA L G + G K L +
Sbjct: 250 ---FQDTNSTSTLLLGPSAALNGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSI 306
Query: 263 ---------------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
+ DSG++ T L YQ + + ++ ++ ++ + + L LC+
Sbjct: 307 SPDAFSLKADGTGGLIIDSGTTITSLVNAAYQQVRAAVQSLVTLPAI-DGSDSTGLDLCY 365
Query: 308 KGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILN 361
P ++ L F DG L P +IS G CL + N
Sbjct: 366 ALPTP----TSAPPAMPSMTLHF-DGADMVL----PADSYMISGSGVWCLAMRN 410
>gi|223946655|gb|ACN27411.1| unknown [Zea mays]
Length = 378
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 80/299 (26%), Positives = 131/299 (43%), Gaps = 36/299 (12%)
Query: 93 LYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQ-CDYELEY-ADGGSSLGVLVKDAF 146
+YRP+ +PC +C S+ PG C +P Q C Y ++Y ++ +S G+L++D
Sbjct: 8 IYRPAESTTSRHLPCSHELCQSV--PG---CTNPKQPCPYNIDYFSENTTSSGLLIEDTL 62
Query: 147 AFNYTNGQ-RLNPRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIR 201
NY +N + +GCG Q + G + DG+LGLG S+ S L L++
Sbjct: 63 HLNYREDHVPVNASVIIGCGQKQSGDYLDGIA---PDGLLGLGMADISVPSFLARAGLVQ 119
Query: 202 NVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP 261
N C G +FFGD S + + Y+ V + G + +
Sbjct: 120 NSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFK 179
Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
+ DSG+S+T L Y+ T K+++A + ED T C+ P + + DV
Sbjct: 180 ALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPY--EDTTWKYCYSA-SPLE-MPDVP- 234
Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLIISNK----GNVCLGILNGAE-VGLQDLNVIGG 375
T+ L+F K +L + P L ++K CL +L E +G+ N + G
Sbjct: 235 ---TITLTFAADK--SLQAVNP--ILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVG 286
>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 72/279 (25%), Positives = 118/279 (42%), Gaps = 30/279 (10%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
Y ++ +G P++ +++DTGS +W+ C+ C C P + + V C +
Sbjct: 1 YVTSVGLGTPSKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C + H + E+ C + + Y DG +S G+L +D F ++ Q++ P GC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PSFTFGCNL 115
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDL-YDS 225
+ + +DG+LG+G G S++ Q + +CL FF Y S
Sbjct: 116 DSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSYCLPLQKSERGFFSKTTGYFS 172
Query: 226 SRVVWTSMSSDYTKYYSPGV-AELFF--------GGETTGL-----KNLPVVFDSGSSYT 271
V T YTK + ELFF GE GL VVFDSGS +
Sbjct: 173 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 232
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
Y+ L+ +++ L + A E+E+ C+ R
Sbjct: 233 YIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 268
>gi|414888271|tpg|DAA64285.1| TPA: hypothetical protein ZEAMMB73_923514, partial [Zea mays]
Length = 335
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 70/246 (28%), Positives = 105/246 (42%), Gaps = 28/246 (11%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYR------------PSNDLVPCE 104
+G P + + LDTGSDL W+ CD C+ C P YR ++ VPC
Sbjct: 94 LGTPNVTFLVALDTGSDLFWVPCD--CINCAPLVSPNYRDLKFDTYSPQKSSTSRKVPCS 151
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEY-ADGGSSLGVLVKDAFAFNYTNGQR---LNPRL 160
+C A + P Y ++Y +D SS GVLV+D G++ + +
Sbjct: 152 SNLCDEQSACRSASSSCP----YSIQYLSDNTSSTGVLVEDVLYLVTEYGRQPKIVTAPI 207
Query: 161 ALGCGYNQVPG--ASYHPLDGILGLGKGKSSIVSQLHSQKL-IRNVVGHCLSGGGGGFLF 217
GCG Q + P +G+LGLG S+ S L SQ + N C + G G +
Sbjct: 208 TFGCGRTQTGSFLGTAAP-NGLLGLGMDTISVPSLLASQGVAAANSFSMCFAQDGHGRIN 266
Query: 218 FGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVT 277
FGD + +M YY+ + G ++ K + DSG+S+T L+
Sbjct: 267 FGDTGSSDQQETPLNMYKQ-NPYYNISITGATVGSKSIHTK-FNAIVDSGTSFTALSDPM 324
Query: 278 YQTLTS 283
Y +TS
Sbjct: 325 YTQITS 330
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 90/346 (26%), Positives = 136/346 (39%), Gaps = 52/346 (15%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP--HPLYRPSNDLVPCEDPIC 108
Y V IG PA+P + LDT +D W+ C CV C + P S+ + CE P C
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCSG-CVGCSSSVLFDPSKSSSSRTLQCEAPQC 146
Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGGSSL-GVLVKDAFAFNYTNGQRLNPRLALGCGYN 167
P +C C + + Y GGS++ L +D + P GC N
Sbjct: 147 KQAPNP---SCTVSKSCGFNMTY--GGSTIEAYLTQDTLTL----ASDVIPNYTFGC-IN 196
Query: 168 QVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDDLY 223
+ G S P G++GLG+G S++SQ SQ L ++ +CL S G L G
Sbjct: 197 KASGTSL-PAQGLMGLGRGPLSLISQ--SQNLYQSTFSYCLPNSKSSNFSGSLRLGPK-N 252
Query: 224 DSSRVVWTSMSSD-------YTKYYSPGVAELFFGGETTGLKNLPV-----VFDSGSSYT 271
R+ T + + Y V T+ L P +FDSG+ YT
Sbjct: 253 QPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYT 312
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
L Y + + ++ + + +T C+ G F +V F ++ T
Sbjct: 313 RLVEPAYVAVRNEFRRRVKNANATSLGGFDT---CYSGSVVFPSVT-----FMFAGMNVT 364
Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQD-LNVIGGI 376
L P+ LI S+ GN+ + A V + LNVI +
Sbjct: 365 ---------LPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASM 401
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 72/264 (27%), Positives = 100/264 (37%), Gaps = 28/264 (10%)
Query: 68 LDTGSDLTWLQCDA-PCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDP 122
LDT SD+TW+QC P C LY P S+ + C P C L P + C +
Sbjct: 173 LDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQL-GPYANGCTNN 231
Query: 123 AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASY-HPLDGIL 181
QC Y + Y DG S+ G + D R GC + S+ GI+
Sbjct: 232 NQCQYRVRYPDGTSTAGTYISDLLTITPATAVR---SFQFGCSHGVQGSFSFGSSAAGIM 288
Query: 182 GLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG-GGFLFFGDDLYDSSRVVWTSMSSDYT-- 238
LG G S+VSQ + V HC GF G + R V T M +
Sbjct: 289 ALGGGPESLVSQ--TAATYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKNPAIP 346
Query: 239 -KYYSPGVAELFFGGETTGLKNLPVVF------DSGSSYTYLNRVTYQTLTSIMKKELSA 291
+Y + + G+ + P VF DS ++ T L YQ L + ++
Sbjct: 347 PTFYMVRLEAIAVAGQRIAVP--PTVFAAGAALDSRTAITRLPPTAYQALRQAFRDRMAM 404
Query: 292 KSLKEAPEDETLPLCW--KGRRPF 313
+ AP L C+ G R F
Sbjct: 405 --YQPAPPKGPLDTCYDMAGVRSF 426
>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 67/281 (23%), Positives = 121/281 (43%), Gaps = 32/281 (11%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL---VPCEDPI 107
Y +++ +G P++ +++DTGS +W+ C+ C C P + + V C +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE--CDGCHTNPRTFLQSRSTTCAKVSCGTSM 58
Query: 108 CASLHAPGH-HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
C + H + E+ C + + Y DG +S G+L +D F ++ Q++ P GC
Sbjct: 59 CLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF--SDVQKI-PGFTFGCNM 115
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----------SGGGGGFL 216
+ + +DG+LG+G G+ S++ Q + +CL S G F
Sbjct: 116 DSFGANEFGNVDGLLGMGAGQMSVLKQ---SSPTFDGFSYCLPLQMSERGFFSKTTGYFS 172
Query: 217 FFGDDLYDSSRVVWTSMSSDY--TKYYSPGVAELFFGGETTGL-----KNLPVVFDSGSS 269
G + V +T M + T+ + + + GE GL VVFDSGS
Sbjct: 173 LGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGSE 232
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
+Y+ L+ +++ L + A E+E+ C+ R
Sbjct: 233 LSYIPDRALSVLSQRIRELLLRRG---AAEEESERNCYDMR 270
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/143 (31%), Positives = 66/143 (46%), Gaps = 15/143 (10%)
Query: 34 GSSLLFQVHGNVYP-----TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVE 88
G+SL + G V +G Y + IG PAR ++ LDTGSD+TW+QC PC C +
Sbjct: 147 GASLAAAIQGPVVSGVGQGSGEYFSRVGIGSPARELYMVLDTGSDVTWVQCQ-PCADCYQ 205
Query: 89 APHPLYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKD 144
P++ PS V C+ P C L N C YE+ Y DG ++G +
Sbjct: 206 QSDPVFDPSLSASYAAVSCDSPRCRDLDTAACRNAT--GACLYEVAYGDGSYTVGDFATE 263
Query: 145 AFAFNYTNGQRLNPRLALGCGYN 167
+ +A+GCG++
Sbjct: 264 TLTLGDSTPVT---NVAIGCGHD 283
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 90/346 (26%), Positives = 136/346 (39%), Gaps = 52/346 (15%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP--HPLYRPSNDLVPCEDPIC 108
Y V IG PA+P + LDT +D W+ C CV C + P S+ + CE P C
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCSG-CVGCSSSVLFDPSKSSSSRTLQCEAPQC 146
Query: 109 ASLHAPGHHNCEDPAQCDYELEYADGGSSL-GVLVKDAFAFNYTNGQRLNPRLALGCGYN 167
P +C C + + Y GGS++ L +D + P GC N
Sbjct: 147 KQAPNP---SCTVSKSCGFNMTY--GGSTIEAYLTQDTLTL----ASDVIPNYTFGC-IN 196
Query: 168 QVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDDLY 223
+ G S P G++GLG+G S++SQ SQ L ++ +CL S G L G
Sbjct: 197 KASGTSL-PAQGLMGLGRGPLSLISQ--SQNLYQSTFSYCLPNSKSSNFSGSLRLGPK-N 252
Query: 224 DSSRVVWTSMSSD-------YTKYYSPGVAELFFGGETTGLKNLP-----VVFDSGSSYT 271
R+ T + + Y V T+ L P +FDSG+ YT
Sbjct: 253 QPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYT 312
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFT 331
L Y + + ++ + + +T C+ G F +V F ++ T
Sbjct: 313 RLVEPAYVAVRNEFRRRVKNANATSLGGFDT---CYSGSVVFPSVT-----FMFAGMNVT 364
Query: 332 DGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQD-LNVIGGI 376
L P+ LI S+ GN+ + A V + LNVI +
Sbjct: 365 ---------LPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASM 401
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 54/165 (32%), Positives = 80/165 (48%), Gaps = 16/165 (9%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV---RCVEAPHPLYRPSND----LVPC 103
Y VT +G P +++DTGSDL+W+QC PC C PL+ P+ VPC
Sbjct: 48 YVVTASLGTPGVAQTMEVDTGSDLSWVQCK-PCAAAPSCYSQKDPLFDPAQSSSYAAVPC 106
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
P+CA L C AQC Y + Y DG ++ GV D + ++ + G
Sbjct: 107 GGPVCAGLGIYAASACSA-AQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQ---GFFFG 162
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL 208
CG+ Q ++ +DG+LGLG+ + S+V Q + V +CL
Sbjct: 163 CGHAQ--SGLFNGVDGLLGLGREQPSLVEQ--TAGTYGGVFSYCL 203
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 65/131 (49%), Gaps = 13/131 (9%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SN 98
V G +G Y + + IG+P ++ LDTGSD++W+QC APC C + P++ P SN
Sbjct: 139 VSGTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQC-APCSECYQQSDPIFDPVSSN 197
Query: 99 DLVP--CEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
P C+ P C SL N C YE+ Y DG ++G + T G
Sbjct: 198 SYSPIRCDAPQCKSLDLSECRN----GTCLYEVSYGDGSYTVGEFATETV----TLGTAA 249
Query: 157 NPRLALGCGYN 167
+A+GCG+N
Sbjct: 250 VENVAIGCGHN 260
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 72/264 (27%), Positives = 100/264 (37%), Gaps = 28/264 (10%)
Query: 68 LDTGSDLTWLQCDA-PCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDP 122
LDT SD+TW+QC P C LY P S+ + C P C L P + C +
Sbjct: 148 LDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQL-GPYANGCTNN 206
Query: 123 AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASY-HPLDGIL 181
QC Y + Y DG S+ G + D R GC + S+ GI+
Sbjct: 207 NQCQYRVRYPDGTSTAGTYISDLLTITPATAVR---SFQFGCSHGVQGSFSFGSSAAGIM 263
Query: 182 GLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG-GGFLFFGDDLYDSSRVVWTSMSSDYT-- 238
LG G S+VSQ + V HC GF G + R V T M +
Sbjct: 264 ALGGGPESLVSQ--TAATYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKNPAIP 321
Query: 239 -KYYSPGVAELFFGGETTGLKNLPVVF------DSGSSYTYLNRVTYQTLTSIMKKELSA 291
+Y + + G+ + P VF DS ++ T L YQ L + ++
Sbjct: 322 PTFYMVRLEAIAVAGQRIAVP--PTVFAAGAALDSRTAITRLPPTAYQALRQAFRDRMAM 379
Query: 292 KSLKEAPEDETLPLCW--KGRRPF 313
+ AP L C+ G R F
Sbjct: 380 --YQPAPPKGPLDTCYDMAGVRSF 401
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 94/328 (28%), Positives = 138/328 (42%), Gaps = 64/328 (19%)
Query: 68 LDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCE-DP 122
LDTGSD+ W+QC APC RC E P++ P S V C +C L + G C+
Sbjct: 3 LDTGSDVVWVQC-APCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGG---CDLRR 58
Query: 123 AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY-NQVPGASYHPLDGIL 181
C Y++ Y DG + G V + F G R+ R+ALGCG+ N+ + L G+
Sbjct: 59 GACMYQVAYGDGSVTAGDFVTETLTF--AGGARV-ARVALGCGHDNEGLFVAAAGLLGLG 115
Query: 182 GLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGG-------FLFFGDDLYDSSRVVWTSMS 234
G + +S+ + + +V SG G + FG +S +T M
Sbjct: 116 RGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSASFTPMV 175
Query: 235 SD---YTKYY------------SPGVAELFFGGETTGLKNLP------VVFDSGSSYTYL 273
+ T YY PGVAE + L+ P V+ DSG+S T L
Sbjct: 176 RNPRMETFYYVQLVGISVGGARVPGVAE-------SDLRLDPSTGRGGVIVDSGTSVTRL 228
Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETL-PLCWK--GRRPFKNVHDVKKCFRTLALSF 330
R +Y L + +A L+ +P +L C+ GRR K T+++ F
Sbjct: 229 ARASYSALRDAFRAA-AAGGLRLSPGGFSLFDTCYDLGGRRVVK--------VPTVSMHF 279
Query: 331 TDGKTRTLFELTPEAYLI-ISNKGNVCL 357
G L PE YLI + ++G C
Sbjct: 280 AGGAEAA---LPPENYLIPVDSRGTFCF 304
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 91/348 (26%), Positives = 138/348 (39%), Gaps = 56/348 (16%)
Query: 50 YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCED 105
++ +T+ IG P +P L LDTGSDL W QC R PLY P+ PC+
Sbjct: 88 HHTLTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTR-QHREKPLYDPAKSSSFAAAPCDG 146
Query: 106 PICASLHAPGHHNCEDPA--QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+C + G N ++ + +C Y Y ++ G L + F F +R++ L G
Sbjct: 147 RLCET----GSFNTKNCSRNKCIYTYNYGS-ATTKGELASETFTFG--EHRRVSVSLDFG 199
Query: 164 CGY---NQVPGASYHPLDGILGLGKGKSSIVSQLHSQK--------LIRNVVGHCLSGGG 212
CG +PGAS GILG+ + S+VSQL + L RN H G
Sbjct: 200 CGKLTSGSLPGAS-----GILGISPDRLSLVSQLQIPRFSYCLTPFLDRNTTSHIFFGAM 254
Query: 213 GGF-LFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK--NLPV------- 262
+ ++ +V S+Y YY P + G + G K N+PV
Sbjct: 255 ADLSKYRTTGPIQTTSLVTNPDGSNY-YYYVPLI------GISVGTKRLNVPVSSFAIGR 307
Query: 263 ------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNV 316
DSG + L V + L M + + + LC++ R
Sbjct: 308 DGSGGTFVDSGDTTGMLPSVVMEALKEAMVEAVKLPVVNATDHGYEYELCFQLPRNGGGA 367
Query: 317 HDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAE 364
+ L F DG L L ++Y++ + G +CL I +GA
Sbjct: 368 VETAVQVPPLVYHF-DGGAAML--LRRDSYMVEVSAGRMCLVISSGAR 412
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 87/309 (28%), Positives = 133/309 (43%), Gaps = 32/309 (10%)
Query: 22 SSSSSSSLFNHVGSSLLFQVHGNVYP-TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCD 80
S S S++L H+ S + + P +G + ++++IG P DTGSDLTW QC
Sbjct: 60 SFSRSATLLTHLTSVSTACIRSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQC- 118
Query: 81 APCVRCVEAPHPLYRP----SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS 136
PC C P++ P S V C C SL + +H D C Y Y D
Sbjct: 119 LPCRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLES--YHCGPDLQSCSYGYSYGDRSF 176
Query: 137 SLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLD-GILGLGKGKSSIVSQLH 195
+ G L D T G P+ +GCG+ G ++ + GI+GLG G S+VSQ+
Sbjct: 177 TYGDLASD----QITIGSFKLPKTVIGCGHQN--GGTFGGVTSGIIGLGGGSLSLVSQMR 230
Query: 196 SQKLIRNVVGHCL-----SGGGGGFLFFGDDLYDSSR-VVWTSM--SSDYTKYY------ 241
+ ++ +CL + G + FG S R VV T + S T Y+
Sbjct: 231 TIAGVKPRFSYCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAI 290
Query: 242 SPGVAELFFGGETTGLKNL-PVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPED 300
S G + + N ++ DSG++ T L R Y + S + + + AK + +
Sbjct: 291 SVGKKRFKAANGISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDD--PS 348
Query: 301 ETLPLCWKG 309
L LC+
Sbjct: 349 GILELCYSA 357
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 55/160 (34%), Positives = 71/160 (44%), Gaps = 15/160 (9%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLY----RPSNDLVPCEDP 106
Y + + IG P P+ DTGSDLTW QC PC C P+Y S VPC
Sbjct: 95 YLMELAIGTPPVPFVALADTGSDLTWTQCK-PCKLCFPQDTPIYDTAASASFSPVPCASA 153
Query: 107 ICASLHAPGHHNC--EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-----R 159
C + NC + C Y Y DG S GVL + F ++ P
Sbjct: 154 TCLPIWR-SSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPGVSVGG 212
Query: 160 LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
+A GCG + G SY+ G +GLG+G S+V+QL K
Sbjct: 213 VAFGCGVDN-GGLSYNS-TGTVGLGRGSLSLVAQLGVGKF 250
>gi|242094226|ref|XP_002437603.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
gi|241915826|gb|EER88970.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
Length = 541
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 75/261 (28%), Positives = 110/261 (42%), Gaps = 41/261 (15%)
Query: 50 YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVP------- 102
YY V + +G P + + LDTGSDL W+ CD C +C + +P+ L P
Sbjct: 111 YYAV-VEVGTPNATFLVALDTGSDLFWVPCD--CKQCASIANVTGQPATALRPYSPRESS 167
Query: 103 ------CEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSL-GVLVKDAFAFNYTN--- 152
C++ +C P + C YE++Y +S GVLV+D
Sbjct: 168 TSKQVTCDNALC---DRPNGCSAATNGSCPYEVQYLSANTSTSGVLVQDVLHLTRERPGA 224
Query: 153 ----GQRLNPRLALGCGYNQ----VPGASYHPLDGILGLGKGKSSIVSQLHSQKLI-RNV 203
G+ L + GCG Q + GA++ DG++GLG+ S+ S L S L+ +
Sbjct: 225 AAEAGEALQAPVVFGCGQVQTGTFLDGAAF---DGLMGLGRENVSVPSVLASSGLVASDS 281
Query: 204 VGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGL-KNLPV 262
C G G + FGD SS T + T Y V+ ET +
Sbjct: 282 FSMCFGDDGVGRINFGDS--GSSGQGETPFTGRRTLY---NVSFTAVNVETKSVAAEFAA 336
Query: 263 VFDSGSSYTYLNRVTYQTLTS 283
V DSG+S+TYL Y L +
Sbjct: 337 VIDSGTSFTYLADPEYTELAT 357
>gi|297819828|ref|XP_002877797.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323635|gb|EFH54056.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 530
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 94/353 (26%), Positives = 153/353 (43%), Gaps = 55/353 (15%)
Query: 50 YYNVTMYIGQPARPYFLDLDTGSDLTWLQCD--APCVRCVE-------APHPLYRPS--- 97
Y NV+ +G PA + + LDTGS+L WL C+ + C+R ++ P LY P+
Sbjct: 104 YANVS--VGTPATWFLVALDTGSNLFWLPCNCGSTCIRDLKDIGLSQSRPLNLYSPNTSS 161
Query: 98 -NDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGS-SLGVLVKDAFAFNYTNGQR 155
+ + C D C + C Y+++Y + + G L +D T
Sbjct: 162 TSSSIRCNDDRCFGSSQCSSPA----SSCPYQIQYLSKDTFTTGTLFEDVLHL-VTEDVD 216
Query: 156 LNP---RLALGCGYNQVPG-ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG- 210
L P + LGCG NQ S ++G+LGLG S+ S L K+ N C
Sbjct: 217 LKPVKANITLGCGRNQTGFLQSSAAINGLLGLGMKDYSVPSILAKAKITANSFSMCFGNI 276
Query: 211 -GGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSS 269
G + FGD Y + ++ + ++ + Y+ V E+ GG+ G++ L +FD+G+S
Sbjct: 277 IDVIGRISFGDKGY-TDQMETPLLPTEPSPTYAVNVTEVSVGGDVVGVQ-LLALFDTGTS 334
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK-----CFR 324
+T+L Y +T ++ K PE PF+ +D+ F
Sbjct: 335 FTHLLEPEYGLITKAFDDHVTDKRRPIDPE-----------IPFEFCYDLSPNSTTILFP 383
Query: 325 TLALSFTDGKTRTLFELTPEAYLIISNKGNV---CLGILNGAEVGLQDLNVIG 374
+A++F G +F P I+ N+ N CLGIL + +N+IG
Sbjct: 384 RVAMTFEGGS--LMFLRNP--LFIVWNEDNTAMYCLGILKSVDF---KINIIG 429
>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
Length = 419
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 83/347 (23%), Positives = 129/347 (37%), Gaps = 73/347 (21%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAP-------------HPLYRPS 97
Y +T+ IG P + + +DTGSDLTW+ C C++ PL+ S
Sbjct: 11 YLITLNIGTPPQAVQVYMDTGSDLTWVPCGNLSFDCIDCNDLKSNNLKSSSIFSPLHSSS 70
Query: 98 NDLVPCEDPICASLHAPGHHNCEDP---AQC---------------DYELEYADGGSSLG 139
+ C CA +H+ N DP A C + Y +GG G
Sbjct: 71 SFRASCASSFCAEIHS--SDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTYGEGGLVSG 128
Query: 140 VLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
+L +D R PR + GC ++YH GI G G+G S+ SQL
Sbjct: 129 ILTRDILKAR----TRDVPRFSFGCV-----TSTYHEPIGIAGFGRGLLSLPSQL---GF 176
Query: 200 IRNVVGHCL-------SGGGGGFLFFGD-----DLYDSSRVVWTSMSSDYTKYYSPGVAE 247
+ HC + L G +L DS + + Y Y G+
Sbjct: 177 LEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPVYPNSYYIGLES 236
Query: 248 LFFGGETTGLK------------NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLK 295
+ G T + N ++ DSG++YT+L Y L +I++ ++
Sbjct: 237 ITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQLLTILQSTITYPRAT 296
Query: 296 EAPEDETLPLCWKGRRPFKNV----HDVKKCFRTLALSFTDGKTRTL 338
E LC+K P N+ +DV F ++ +F + T L
Sbjct: 297 ETESRTGFDLCYKVPCPNNNLTSLENDVMMVFPSITFNFLNNATLLL 343
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 42/131 (32%), Positives = 61/131 (46%), Gaps = 12/131 (9%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP---- 96
V G +G Y + +G P R + LDTGSD+TW+QC+ PC C + P+Y P
Sbjct: 135 VSGMDQGSGEYFSRIGVGAPRRDQLMVLDTGSDVTWIQCE-PCSDCYQQSDPIYNPALSS 193
Query: 97 SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
S LV C+ +C L G C C Y++ Y DG + G + Q
Sbjct: 194 SYKLVGCQANLCQQLDVSG---CSRNGSCLYQVSYGDGSYTQGNFATETLTLGGAPLQ-- 248
Query: 157 NPRLALGCGYN 167
+A+GCG++
Sbjct: 249 --NVAIGCGHD 257
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 58/125 (46%), Gaps = 8/125 (6%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCE 104
G Y + + +G P DTGSDL W QC PC C E P++ P+ ++ CE
Sbjct: 93 GEYLMNISLGTPPVSMHGIADTGSDLLWRQC-KPCDSCYEQIEPIFDPAKSKTYQILSCE 151
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
C++L G C D C Y Y DG + G L D T G+ ++ P++ G
Sbjct: 152 GKSCSNLG--GQGGCSDDNTCIYSYSYGDGSHTSGDLAVDTLTIGSTTGRPVSVPKVVFG 209
Query: 164 CGYNQ 168
CG+N
Sbjct: 210 CGHNN 214
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 81/302 (26%), Positives = 119/302 (39%), Gaps = 69/302 (22%)
Query: 37 LLFQVHGNVYPTG-----------YYNVTM----YIGQPARPYFLDLDTGSDLTWLQCDA 81
LLF++ P G ++NV++ +G P + + LDTGS+L+WL C
Sbjct: 37 LLFELRARQVPAGALPRPASKLRFHHNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAP 96
Query: 82 PCVRCVEAPHPL-YRPSNDL----VPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGG 135
L +RP L VPC+ C S P C+ + QC L YADG
Sbjct: 97 GGGGGGGGRSALSFRPRASLTFASVPCDSAQCRSRDLPSPPACDGASKQCRVSLSYADGS 156
Query: 136 SSLGVLVKDAFAFNYTNGQRLNPRLALGC---GYNQVPGASYHPLDGILGLGKGKSSIVS 192
SS G L + F T GQ R A GC ++ P G+LG+ +G S VS
Sbjct: 157 SSDGALATEVF----TVGQGPPLRAAFGCMATAFDTSPDGVAT--AGLLGMNRGALSFVS 210
Query: 193 QLHSQKLIRNVVGHCLSG-GGGGFLFFG-DDLYDSSRVVWTSMSSDYTKYYSPGVAELFF 250
Q +++ +C+S G L G DL + +YT Y P + +F
Sbjct: 211 QASTRRF-----SYCISDRDDAGVLLLGHSDL--------PFLPLNYTPLYQPAMPLPYF 257
Query: 251 G---------GETTGLKNLPV---------------VFDSGSSYTYLNRVTYQTLTSIMK 286
G G K LP+ + DSG+ +T+L Y L +
Sbjct: 258 DRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFS 317
Query: 287 KE 288
++
Sbjct: 318 RQ 319
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 72/268 (26%), Positives = 114/268 (42%), Gaps = 31/268 (11%)
Query: 68 LDTGSDLTWLQC----DAPCVRCVEAPH-PLYRPSNDLVPCEDPICASLHAPGHHNCEDP 122
LD+ SD+ W+QC PC V++ + P P++ C P C +L P + C +
Sbjct: 33 LDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTAL-GPYANGCAN- 90
Query: 123 AQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILG 182
QC Y + Y DG S+ G + D + N GC + + G+ GI+
Sbjct: 91 NQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVS---GFKFGCSHAE-QGSFDARAAGIMA 146
Query: 183 LGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMS--SDYT 238
LG G S++SQ S+ N +C+ + GF G SSR V T M
Sbjct: 147 LGGGPESLLSQTASR--YGNAFSYCIPATASDSGFFTLGVPRRASSRYVVTPMVRFRQAA 204
Query: 239 KYYSPGVAELFFGGETTGLKNLPVVFDSGS---SYTYLNRV---TYQTLTSIMKKELSAK 292
+Y + + GG+ G+ P VF +GS S T + R+ YQ L + + ++
Sbjct: 205 TFYGVLLRTITVGGQRLGVA--PAVFAAGSVLDSRTAITRLPPTAYQALRAAFRSSMTM- 261
Query: 293 SLKEAPEDETLPLCWKGRRPFKNVHDVK 320
+ AP L C+ F V +++
Sbjct: 262 -YRSAPPKGYLDTCYD----FTGVVNIR 284
>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
Length = 293
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 56/162 (34%), Positives = 74/162 (45%), Gaps = 19/162 (11%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSNDL----VPCED 105
Y VT+ IG P L DTGSDLTW QC+ PC+ C P + PS+ V C
Sbjct: 134 YIVTIGIGTPKHDISLMFDTGSDLTWTQCE-PCLGSCYSQKEPKFNPSSSSSYHNVSCSS 192
Query: 106 PICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCG 165
P+C + + NC Y + Y DG ++G L K+ F TN L+ + GCG
Sbjct: 193 PMCGNPESCSASNCL------YGIGYGDGSVTVGFLAKEKFTL--TNSDVLDD-IYFGCG 243
Query: 166 YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
N + GILGLG GK S L + N+ +C
Sbjct: 244 ENN--KGVFIGSAGILGLGPGKFSF--PLQTTTTYNNIFSYC 281
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 67/139 (48%), Gaps = 7/139 (5%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHA-PG 115
IG P P L +DTGSDLTW+ C PC +C P + PS ++ HA P
Sbjct: 84 IGNPPVPQLLLIDTGSDLTWIHC-LPC-KCYPQTIPFFHPSRSSTYRNASCVSAPHAMPQ 141
Query: 116 HHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALGCGYNQVPGASY 174
E C Y L Y D ++ G+L ++ F ++ ++ + + GCG + + +
Sbjct: 142 IFRDEKTGNCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNIVFGCGQDN---SGF 198
Query: 175 HPLDGILGLGKGKSSIVSQ 193
G+LGLG G SIV++
Sbjct: 199 TKYSGVLGLGPGTFSIVTR 217
>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 438
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 89/329 (27%), Positives = 130/329 (39%), Gaps = 46/329 (13%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPIC-- 108
Y + + + P DTGS L WL+C P A H S +PC+ C
Sbjct: 76 YLMALDVSTPPVRMLALADTGSSLVWLKCKLP------AAHTPASSSYARLPCDAFACKA 129
Query: 109 ----ASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
AS A G N C Y +ADG + G + DAF F+ RL GC
Sbjct: 130 LGDAASCRATGSGN----NICVYRYAFADGSCTAGPVTVDAFTFST--------RLDFGC 177
Query: 165 GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFFG 219
+ G S P DG++GL G S+VSQL ++ + +CL S L FG
Sbjct: 178 A-TRTEGLSV-PDDGLVGLANGPISLVSQLSAKTPFAHKFSYCLVPYSSSETVSSSLNFG 235
Query: 220 DDLYDSSR--VVWTSMSSDYTK-YYSPGVAELFFGGETTGLK--NLPVVFDSGSSYTYLN 274
SS T + + K +Y+ + + G+ L+ ++ DSG+ TYL
Sbjct: 236 SHAIVSSSPGAATTPLVAGRNKSFYTIALDSIKVAGKPVPLQTTTTKLIVDSGTMLTYLP 295
Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETL-PLCWKGRRPFKNVHDVKKCFRTLALSFTDG 333
+ L + + +A L ETL +C+ RR + DV K + L G
Sbjct: 296 KAVLDPLVAALT---AAIKLPRVKSPETLYAVCYDVRR--RAPEDVGKSIPDVTLVLGGG 350
Query: 334 KTRTLFELTPEAYLIISNKG-NVCLGILN 361
L ++ NKG VCL ++
Sbjct: 351 GE---VRLPWGNTFVVENKGTTVCLALVE 376
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 51/150 (34%), Positives = 75/150 (50%), Gaps = 14/150 (9%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV---RCVEAPHPLYRPSND----LVPC 103
Y VT +G P +++DTGSDL+W+QC PC C PL+ P+ VPC
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCK-PCAAAPSCYSQKDPLFDPAQSSSYAAVPC 198
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
P+CA L C AQC Y + Y DG ++ GV D + ++ + G
Sbjct: 199 GGPVCAGLGIYAASACSA-AQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQ---GFFFG 254
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQ 193
CG+ Q ++ +DG+LGLG+ + S+V Q
Sbjct: 255 CGHAQ--SGLFNGVDGLLGLGREQPSLVEQ 282
>gi|367066719|gb|AEX12643.1| hypothetical protein 2_5918_01 [Pinus radiata]
Length = 137
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 62/123 (50%), Gaps = 13/123 (10%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
G + + + IG+P+ Y LDTGSDLTW QC PC C + P P+Y PS V C+
Sbjct: 19 GEFLMQLAIGKPSLAYSAILDTGSDLTWTQC-IPCSDCYKQPTPIYDPSLSSTYGTVSCK 77
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
+C +L A + A C+Y Y D S+ G+L + F + + P +A GC
Sbjct: 78 SSLCLALPASACIS----ATCEYLYTYGDYSSTQGILSYETFTLS----SQSIPHIAFGC 129
Query: 165 GYN 167
G +
Sbjct: 130 GQD 132
>gi|367066697|gb|AEX12632.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066699|gb|AEX12633.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066701|gb|AEX12634.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066703|gb|AEX12635.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066705|gb|AEX12636.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066707|gb|AEX12637.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066709|gb|AEX12638.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066711|gb|AEX12639.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066713|gb|AEX12640.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066715|gb|AEX12641.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066717|gb|AEX12642.1| hypothetical protein 2_5918_01 [Pinus taeda]
Length = 137
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 62/123 (50%), Gaps = 13/123 (10%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
G + + + IG+P+ Y LDTGSDLTW QC PC C + P P+Y PS V C+
Sbjct: 19 GEFLMQLAIGKPSLAYSAILDTGSDLTWTQC-MPCSDCYKQPTPIYDPSLSSTYGTVSCK 77
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
+C +L A + A C+Y Y D S+ G+L + F + + P +A GC
Sbjct: 78 SSLCLALPASACIS----ATCEYLYTYGDYSSTQGILSYETFTLS----SQSIPHIAFGC 129
Query: 165 GYN 167
G +
Sbjct: 130 GQD 132
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 72/241 (29%), Positives = 103/241 (42%), Gaps = 25/241 (10%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G Y++T IG P + DTGSDL W +C A C RCV P Y P S +PC
Sbjct: 80 GAYDMTFSIGTPPQELSALADTGSDLIWAKCGA-CTRCVPQGSPSYYPNKSSSFSKLPCS 138
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGS----SLGVLVKDAFAFNYTNGQRLNPRL 160
+C+ L P A+CDY+ Y + G L + F T G P +
Sbjct: 139 GSLCSDL--PSSQCSAGGAECDYKYSYGLASDPHHYTQGYLGSETF----TLGSDAVPGI 192
Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF--LFF 218
GC + Y G++GLG+G S+VSQL+ +CL+ L F
Sbjct: 193 GFGC--TTMSEGGYGSGSGLVGLGRGPLSLVSQLN-----VGAFSYCLTSDAAKTSPLLF 245
Query: 219 GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETT-GLKNLPVVFDSGSSYTYLNRVT 277
G + V T + T YY+ + + G TT G + ++FDSG++ +L
Sbjct: 246 GSGALTGAGVQSTPLLRTSTYYYTVNLESISIGAATTAGTGSSGIIFDSGTTVAFLAEPA 305
Query: 278 Y 278
Y
Sbjct: 306 Y 306
>gi|156099262|ref|XP_001615633.1| aspartic protease PM5 [Plasmodium vivax Sal-1]
gi|148804507|gb|EDL45906.1| aspartic protease PM5 [Plasmodium vivax]
Length = 536
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 93/412 (22%), Positives = 157/412 (38%), Gaps = 84/412 (20%)
Query: 10 LCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLD 69
LC +V+ S S+ S L ++++G++ YY + + IG P + L LD
Sbjct: 27 LCALSVQGRSESTEGHSKDLLYK------YKLYGDIDEYAYYFLDIDIGTPEQRISLILD 80
Query: 70 TGSDLTWLQCDAPCVRC---VEAPHPLYR-PSNDLVPCEDPICASLHAPGHHNCEDPAQC 125
TGS C A C C +E P L ++ ++ CE+ C P NC +C
Sbjct: 81 TGSSSLSFPC-AGCKNCGVHMENPFNLNNSKTSSILYCENEEC-----PFKLNCVK-GKC 133
Query: 126 DYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLG- 184
+Y Y +G G D + N +R+ R +GC ++ Y G+LG+
Sbjct: 134 EYMQSYCEGSQISGFYFSDVVSVVSYNNERVTFRKLMGCHMHEESLFLYQQATGVLGMSL 193
Query: 185 ---KGKSSIVSQLHSQK-LIRNVVGHCLSGGGGGFLFFGDD------------------- 221
+G + V+ L ++ V C+S GG + G D
Sbjct: 194 SKPQGIPTFVNLLFDNAPQLKQVFTICISENGGELIAGGYDPAYIVRRGGSKSVSGQGSG 253
Query: 222 ----------------LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFD 265
L ++ +VVW +++ Y Y ++F + K L ++ D
Sbjct: 254 PVSESLSESGEDPQVALREAEKVVWENVTRKYYYYIKVRGLDMFGTNMMSSSKGLEMLVD 313
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
SGS++T++ Y L LC + N +DV K +
Sbjct: 314 SGSTFTHIPEDLYNKLNYFFD-----------------ILCIQD---MNNAYDVNKRLKM 353
Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEV-----GLQDLNV 372
SF + + F+ ++ I K N+C+ I++G + GL DL V
Sbjct: 354 TNESFNNPLVQ--FDDFRKSLKSIIAKENMCVKIVDGVQCWKYLEGLPDLFV 403
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 96/356 (26%), Positives = 144/356 (40%), Gaps = 54/356 (15%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPC 103
+G Y + + +G PA ++ LDTGSD+ WLQC +PC C ++ P VPC
Sbjct: 135 SGEYFMRLGVGTPATNVYMVLDTGSDVVWLQC-SPCKACYNQSDVIFDPKKSKTFATVPC 193
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+C L C Y++ Y DG + G + F +G R++ + LG
Sbjct: 194 GSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTF---HGARVD-HVPLG 249
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--------SGGGGGF 215
CG++ + G+LGLG+G S SQ S+ +CL S
Sbjct: 250 CGHDN--EGLFVGAAGLLGLGRGGLSFPSQTKSR--YNGKFSYCLVDRTSSGSSSKPPST 305
Query: 216 LFFGDDLYDSSRVVWTSMSSDY--TKYY------------SPGVAELFFGGETTGLKNLP 261
+ FG+D + V +++ T YY PGV+E F + TG N
Sbjct: 306 IVFGNDAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATG--NGG 363
Query: 262 VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
V+ DSG+S T L + Y L + L A LK AP C+ + VK
Sbjct: 364 VIIDSGTSVTRLTQSAYVALRDAFR--LGATKLKRAPSYSLFDTCFD----LSGMTTVK- 416
Query: 322 CFRTLALSFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
T+ F G+ L YLI ++ +G C + L++IG I
Sbjct: 417 -VPTVVFHFGGGEV----SLPASNYLIPVNTEGRFCFAFAG----TMGSLSIIGNI 463
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 88/350 (25%), Positives = 141/350 (40%), Gaps = 49/350 (14%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQC--DAPCVRCVEAPH-PLYRPSNDLVPCEDPICA 109
V + IG P + + LDTGS L+W+QC AP A P + +PC P+C
Sbjct: 99 VDLPIGTPPQVQPMVLDTGSQLSWIQCHKKAPAKPPPTASFDPSLSSTFSTLPCTHPVCK 158
Query: 110 SLHAPGH---HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
P +C+ C Y YADG + G LV++ F F+ + P L LGC
Sbjct: 159 P-RIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSRS---LFTPPLILGCAT 214
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--GGGFLFFGDDLYD 224
S P GILG+ +G+ S SQ K V G G + G + +
Sbjct: 215 E-----STDP-RGILGMNRGRLSFASQSKITKFSYCVPTRVTRPGYTPTGSFYLGHNP-N 267
Query: 225 SSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLK------NL-PVVF------------D 265
S+ + M + P + L + G++ N+ P VF D
Sbjct: 268 SNTFRYIEMLTFARSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRADAGGSGQTMLD 327
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
SGS +TYL Y + + + + + + K +C+ G N ++ +
Sbjct: 328 SGSEFTYLVNEAYDKVRAEVVRAVGPRMKKGYVYGGVADMCFDG-----NAIEIGRLIGD 382
Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNV-CLGILNGAEVGLQDLNVIG 374
+ F G + + P+ ++ + +G V C+GI N ++G N+IG
Sbjct: 383 MVFEFEKG----VQIVVPKERVLATVEGGVHCIGIANSDKLGAAS-NIIG 427
>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
Length = 459
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 94/378 (24%), Positives = 142/378 (37%), Gaps = 88/378 (23%)
Query: 45 VYPTGY--YNVTMYIGQPARPYFLDLDTGSDLTWL---------QCDAPCVRCVEAPHPL 93
+YP Y Y T +G P +P + LDTGS LTW+ C +P V HP
Sbjct: 59 LYPHSYGGYAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPK 118
Query: 94 YRPSNDLVPCEDPICASLH--------------APGHHNCEDPAQ--C-DYELEYADGGS 136
S+ LV C +P C +H +PG NC A C Y + Y GS
Sbjct: 119 NSSSSRLVGCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGS-GS 177
Query: 137 SLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS 196
+ G+L+ D R P LGC V + P G+ G G+G S+ +QL
Sbjct: 178 TAGLLIADTL----RAPGRAVPGFVLGCSLVSV----HQPPSGLAGFGRGAPSVPAQLGL 229
Query: 197 QKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAE--------- 247
K +CL F D+ S +V Y P V
Sbjct: 230 PKF-----SYCLLS-----RRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYG 279
Query: 248 ----LFFGGETTGLK--NLP-------------VVFDSGSSYTYLNRVTYQTLTSIMKKE 288
L G T G K LP + DSG+++TYL+ +Q + +
Sbjct: 280 VYYYLALRGVTVGGKAVRLPARAFAANAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAA 339
Query: 289 LSA--KSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAY 346
+ K K+A ++ L C+ + +++ L+ F G + +L E Y
Sbjct: 340 VGGRYKRSKDAEDELGLHPCFALPQGARSM-----ALPELSFHFEGG---AVMQLPVENY 391
Query: 347 LIISNKGNV---CLGILN 361
+++ +G V CL ++
Sbjct: 392 FVVAGRGAVEAICLAVVT 409
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 74/281 (26%), Positives = 125/281 (44%), Gaps = 28/281 (9%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPCE 104
G Y + + +G P + +DTGSDL W QC PC C P++ P + +PCE
Sbjct: 80 GDYLMKLTLGSPPVDIYGLVDTGSDLVWAQC-TPCGGCYRQKSPMFEPLRSKTYSPIPCE 138
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-RLALG 163
C+ ++C C Y YAD + GVL ++A F+ T+G + + G
Sbjct: 139 SEQCSFFG----YSCSPQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPVVVGDIIFG 194
Query: 164 CGYNQVPGASYHPLDGILGLGKGKS-SIVSQL----HSQKLIRNVVGHCLSGGGGGFLFF 218
CG++ +++ D + G S+VSQ+ S++ + +V G + F
Sbjct: 195 CGHSN--SGTFNENDMGIIGMGGGPLSLVSQIGTLYGSKRFSQCLVPFHTDAHTSGTINF 252
Query: 219 GDDLYDSSR-VVWTSMSSD--YTKY------YSPGVAELFFGGETTGLKNLPVVFDSGSS 269
G++ S VV T ++S+ T Y S G + F T L ++ DSG+
Sbjct: 253 GEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFNSSET-LSKGNIMIDSGTP 311
Query: 270 YTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGR 310
TY+ + Y+ L +K + S +++ P D LC++
Sbjct: 312 ATYIPQEFYERLVEELKVQSSLLPIEDDP-DLGTQLCYRSE 351
>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
Japonica Group]
Length = 377
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 96/389 (24%), Positives = 139/389 (35%), Gaps = 65/389 (16%)
Query: 10 LCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPT-----GYYNVTMYIGQPARPY 64
LCF +V S S ++ L V ++ P G Y IG P +P
Sbjct: 11 LCFISVTACSLSEQATRGRLLAGVDATPPAAGGAVAVPIYLSSQGLYVANFTIGTPPQPV 70
Query: 65 FLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCE 120
+D +L W QC PC C E PL+ P+ +PC +C S+ +
Sbjct: 71 SAVVDLTGELVWTQCT-PCQPCFEQDLPLFDPTKSSTFRGLPCGSHLCESIPESSRNCTS 129
Query: 121 DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC------GYNQVPGASY 174
D C YE G + G D FA L GC + G S
Sbjct: 130 D--VCIYEAP-TKAGDTGGKAGTDTFAIGAA-----KETLGFGCVVMTDKRLKTIGGPS- 180
Query: 175 HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYD--------SS 226
GI+GLG+ S+V+Q++ +CL+G G LF G +
Sbjct: 181 ----GIVGLGRTPWSLVTQMNVTAF-----SYCLAGKSSGALFLGATAKQLAGGKNSSTP 231
Query: 227 RVVWTSMSSD---YTKYYSPGVAELFFGG---ETTGLKNLPVVFDSGSSYTYLNRVTYQT 280
V+ TS S YY +A + GG + V+ D+ S +YL Y+
Sbjct: 232 FVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQAASSSGSTVLLDTVSRASYLADGAYKA 291
Query: 281 LTSIMKKELSAKSLKEAPE--DETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTL 338
L + + + + P+ D P G P L +F G T
Sbjct: 292 LKKALTAAVGVQPVASPPKPYDLCFPKAVAGDAP------------ELVFTFDGGAALT- 338
Query: 339 FELTPEAYLIISNKGNVCLGILNGAEVGL 367
+ P YL+ S G VCL I + A + L
Sbjct: 339 --VPPANYLLASGNGTVCLTIGSSASLNL 365
>gi|168002493|ref|XP_001753948.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694924|gb|EDQ81270.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 602
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 88/421 (20%), Positives = 147/421 (34%), Gaps = 104/421 (24%)
Query: 50 YYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND-LVPC--EDP 106
+ V + +G+ + Y++ +DTGS ++W+ C E PH L++P D V C ++
Sbjct: 155 FVKVPIGLGKERQEYYMHIDTGSGISWVNCKGRGPITTEGPHGLFKPKADSYVNCKKQEE 214
Query: 107 ICASLHAPGHHNCEDPA--QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC 164
C H C+ +C ++ +Y DG G +V F+ ++G +A GC
Sbjct: 215 FCKGFQDGEEHRCDKKHHFRCIFDTQYGDGLIIEGYIVMIDLIFDLSDGSESQADVAFGC 274
Query: 165 GYN----QVPGASYH------------------------------PLDGILGLGKGKSSI 190
QV + H DG++GLG S
Sbjct: 275 ASTCPKFQVVKNTPHLSVKIASSFSIMCADKVNDEETKKLGQNTALTDGLIGLGPHPGSW 334
Query: 191 VSQLHSQKLIRN-VVGHCLSGGGG---------------GFLFFGDDL-YDSSRVVWTSM 233
+ QL+ I V+ C G GFL FG+ + +WT+
Sbjct: 335 LHQLNMLGYISEYVIAICFEPDLGKSRHAAIGPELPEPAGFLSFGNPYSAQAESTIWTAN 394
Query: 234 SSDYTKYYSPGVAE----------LFFGGETTGLKNLPVV-------------------- 263
+Y +P E + G ++ +V
Sbjct: 395 IPSPEEYANPHPHEANSTNLQYYDAMYTGRLVSIRYRDIVIQLRGNEKKRKRDHPEGVQM 454
Query: 264 -FDSGSSYTYLNRVTYQTLTSIMKKELS------AKSLKEAPEDETLPLCWK----GRRP 312
FD+GS TYL R T+ +I+ +E + E +DE CW+ G P
Sbjct: 455 GFDTGSDLTYLTRKTFDAFVTILDEEAKHLGYEITRDADEFVKDEQRK-CWRKKSGGEEP 513
Query: 313 FKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKG---NVCLGILNGAEVGLQD 369
+V D A +F + T++ + P+ Y+ G C +L E +
Sbjct: 514 --SVEDFGDMILEFA-TFAEDDTKSELVINPKYYITSEGSGRQHRTCFNMLKETEFDFGN 570
Query: 370 L 370
L
Sbjct: 571 L 571
>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
Length = 405
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 96/389 (24%), Positives = 139/389 (35%), Gaps = 65/389 (16%)
Query: 10 LCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPT-----GYYNVTMYIGQPARPY 64
LCF +V S S ++ L V ++ P G Y IG P +P
Sbjct: 11 LCFISVTACSLSEQATRGRLLAGVDATPPAAGGAVAVPIYLSSQGLYVANFTIGTPPQPV 70
Query: 65 FLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCE 120
+D +L W QC PC C E PL+ P+ +PC +C S+ +
Sbjct: 71 SAVVDLTGELVWTQC-TPCQPCFEQDLPLFDPTKSSTFRGLPCGSHLCESIPESSRNCTS 129
Query: 121 DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC------GYNQVPGASY 174
D C YE G + G D FA L GC + G S
Sbjct: 130 D--VCIYEAP-TKAGDTGGKAGTDTFAIGAA-----KETLGFGCVVMTDKRLKTIGGPS- 180
Query: 175 HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYD--------SS 226
GI+GLG+ S+V+Q++ +CL+G G LF G +
Sbjct: 181 ----GIVGLGRTPWSLVTQMN-----VTAFSYCLAGKSSGALFLGATAKQLAGGKNSSTP 231
Query: 227 RVVWTSMSSD---YTKYYSPGVAELFFGG---ETTGLKNLPVVFDSGSSYTYLNRVTYQT 280
V+ TS S YY +A + GG + V+ D+ S +YL Y+
Sbjct: 232 FVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQAASSSGSTVLLDTVSRASYLADGAYKA 291
Query: 281 LTSIMKKELSAKSLKEAPE--DETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTL 338
L + + + + P+ D P G P L +F G T
Sbjct: 292 LKKALTAAVGVQPVASPPKPYDLCFPKAVAGDAP------------ELVFTFDGGAALT- 338
Query: 339 FELTPEAYLIISNKGNVCLGILNGAEVGL 367
+ P YL+ S G VCL I + A + L
Sbjct: 339 --VPPANYLLASGNGTVCLTIGSSASLNL 365
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 61/124 (49%), Gaps = 10/124 (8%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y + +G PAR ++ LDTGSD+TW+QC PC C + P++ PS V C
Sbjct: 164 SGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQ-PCADCYQQSDPVFDPSLSTSYASVAC 222
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
++P C L A N C YE+ Y DG ++G + + +A+G
Sbjct: 223 DNPRCHDLDAAACRNST--GACLYEVAYGDGSYTVGDFATETLTLGDSAPVS---SVAIG 277
Query: 164 CGYN 167
CG++
Sbjct: 278 CGHD 281
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 61/124 (49%), Gaps = 10/124 (8%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y + +G PAR ++ LDTGSD+TW+QC PC C + P++ PS V C
Sbjct: 160 SGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQ-PCADCYQQSDPVFDPSLSTSYASVAC 218
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
++P C L A N C YE+ Y DG ++G + + +A+G
Sbjct: 219 DNPRCHDLDAAACRNST--GACLYEVAYGDGSYTVGDFATETLTLGDSAPVS---SVAIG 273
Query: 164 CGYN 167
CG++
Sbjct: 274 CGHD 277
>gi|3036792|emb|CAA18482.1| putative protein (fragment) [Arabidopsis thaliana]
Length = 335
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 72/275 (26%), Positives = 120/275 (43%), Gaps = 48/275 (17%)
Query: 68 LDTGSDLTWLQCDAPCVRCV---------EAPHPLYRP----SNDLVPCEDPICASLHAP 114
LDTGSDL W+ CD C +C E +Y P +N V C + +CA
Sbjct: 4 LDTGSDLFWVPCD--CGKCAPTEGATYASEFELSIYNPKVSTTNKKVTCNNSLCAQ---- 57
Query: 115 GHHNCEDP-AQCDYELEYADGGSSL-GVLVKDAFAFNY--TNGQRLNPRLALGCGYNQVP 170
+ C + C Y + Y +S G+L++D N +R+ + GCG QV
Sbjct: 58 -RNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCG--QVQ 114
Query: 171 GASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSR 227
S+ + +G+ GLG K S+ S L + L+ + C G G + FGD
Sbjct: 115 SGSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQE 174
Query: 228 VVWTSMSSDYTKYYSPGVAELFFGGETTGLKN-LPVVFDSGSSYTYLNRVTYQTLTSIMK 286
+++ + Y+ V + G TT + + +FD+G+S+TYL Y T++
Sbjct: 175 ETPFNLNPSHPN-YNITVTRVRVG--TTLIDDEFTALFDTGTSFTYLVDPMYTTVSE--- 228
Query: 287 KELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKK 321
SA+ + +P+ R PF+ +D+++
Sbjct: 229 ---SAQDKRHSPD---------SRIPFEYCYDMRE 251
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 66/222 (29%), Positives = 95/222 (42%), Gaps = 23/222 (10%)
Query: 5 HNGENLCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPY 64
H ++LCF S ++ L + + L F H NV T V++ +G P +
Sbjct: 960 HLFKSLCFSATPTSMVLPLNTQMGLISQPSNKLSF--HHNVTLT----VSLTVGSPPQQV 1013
Query: 65 FLDLDTGSDLTWLQC-DAPCVRCVEAPHPLYRPSNDLVPCEDPIC--ASLHAPGHHNCED 121
+ LDTGS+L+WL C +P + V +PL S +PC PIC + P C+
Sbjct: 1014 TMVLDTGSELSWLHCKKSPNLTSVF--NPLSSSSYSPIPCSSPICRTRTRDLPNPVTCDP 1071
Query: 122 PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYH--PLDG 179
C + YAD S G L D N+ G P GC + S G
Sbjct: 1072 KKLCHAIVSYADASSLEGNLASD----NFRIGSSALPGTLFGCMDSGFSSNSEEDAKTTG 1127
Query: 180 ILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-GGGGFLFFGD 220
++G+ +G S V+QL K +C+SG G L FGD
Sbjct: 1128 LMGMNRGSLSFVTQLGLPKF-----SYCISGRDSSGVLLFGD 1164
>gi|46488413|gb|AAS99528.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488415|gb|AAS99529.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488417|gb|AAS99530.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488419|gb|AAS99531.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488421|gb|AAS99532.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488423|gb|AAS99533.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488425|gb|AAS99534.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488427|gb|AAS99535.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488429|gb|AAS99536.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488431|gb|AAS99537.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488433|gb|AAS99538.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488435|gb|AAS99539.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488437|gb|AAS99540.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488439|gb|AAS99541.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488441|gb|AAS99542.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488443|gb|AAS99543.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488445|gb|AAS99544.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488447|gb|AAS99545.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488449|gb|AAS99546.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488455|gb|AAS99549.1| aspartic protease PM5 [Plasmodium vivax]
Length = 536
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 92/412 (22%), Positives = 157/412 (38%), Gaps = 84/412 (20%)
Query: 10 LCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLD 69
LC +V+ S S+ S L ++++G++ YY + + IG P + L LD
Sbjct: 27 LCALSVQGRSESTEGHSKDLLYK------YKLYGDIDEYAYYFLDIDIGTPEQRISLILD 80
Query: 70 TGSDLTWLQCDAPCVRC---VEAPHPLYR-PSNDLVPCEDPICASLHAPGHHNCEDPAQC 125
TGS C A C C +E P L ++ ++ CE+ C P NC +C
Sbjct: 81 TGSSSLSFPC-AGCKNCGVHMENPFNLNNSKTSSILYCENEEC-----PFKLNCVK-GKC 133
Query: 126 DYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLG- 184
+Y Y +G G D + N +R+ R +GC ++ Y G+LG+
Sbjct: 134 EYMQSYCEGSQISGFYFSDVVSVVSYNNERVTFRKLMGCHMHEESLFLYQQATGVLGMSL 193
Query: 185 ---KGKSSIVSQLHSQK-LIRNVVGHCLSGGGGGFLFFGDD------------------- 221
+G + V+ L ++ V C+S GG + G D
Sbjct: 194 SKPQGIPTFVNLLFDNAPQLKQVFTICISENGGELIAGGYDPAYIVRRGGSKSVSGQGSG 253
Query: 222 ----------------LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFD 265
L ++ ++VW +++ Y Y ++F + K L ++ D
Sbjct: 254 PVSESLSESGEDPQVALREAEKIVWENVTRKYYYYIKVRGLDMFGTNMMSSSKGLEMLVD 313
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
SGS++T++ Y L LC + N +DV K +
Sbjct: 314 SGSTFTHIPEDLYNKLNYFFD-----------------ILCIQD---MNNAYDVNKRLKM 353
Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEV-----GLQDLNV 372
SF + + F+ ++ I K N+C+ I++G + GL DL V
Sbjct: 354 TNESFNNPLVQ--FDDFRKSLKSIIAKENMCVKIVDGVQCWKYLEGLPDLFV 403
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 66/131 (50%), Gaps = 13/131 (9%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP---- 96
V G +G Y + +G P R ++ LDTGSD+ W+QC+ PC +C P++ P
Sbjct: 187 VSGMAQGSGEYFTRIGVGTPMREQYMVLDTGSDVVWIQCE-PCSKCYSQVDPIFNPSLSA 245
Query: 97 SNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
S + C +C+ L A +NC C Y++ Y DG ++G + F T+ +
Sbjct: 246 SFSTLGCNSAVCSYLDA---YNCHG-GGCLYKVSYGDGSYTIGSFATEMLTFGTTSVR-- 299
Query: 157 NPRLALGCGYN 167
+A+GCG++
Sbjct: 300 --NVAIGCGHD 308
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 104/244 (42%), Gaps = 37/244 (15%)
Query: 45 VYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCE 104
V+ Y + + +G P +DTGS++TW QC PCV C + P++ PS E
Sbjct: 374 VFDNSVYLMKLQVGTPPFEIEAVIDTGSEITWTQC-LPCVHCYKQNAPIFDPSKSSTFKE 432
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR-LNPRLALG 163
C D + C YE++Y D + G L D + T+G+ + +G
Sbjct: 433 ------------KRCHDHS-CPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAETIIG 479
Query: 164 CGYNQVPGASYHP-LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDD- 221
CG N + + P +G +GL G S+++Q+ + ++ +C +G G + FG +
Sbjct: 480 CGRNN---SWFRPSFEGFVGLNWGPLSLITQMGGE--YPGLMSYCFAGNGTSKINFGTNA 534
Query: 222 LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP------------VVFDSGSS 269
+ VV T+M + PG L + G + +V DSG++
Sbjct: 535 IVGGGGVVSTTM---FVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALEGNIVIDSGTT 591
Query: 270 YTYL 273
TY
Sbjct: 592 LTYF 595
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 83/352 (23%), Positives = 141/352 (40%), Gaps = 52/352 (14%)
Query: 14 TVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSD 73
T+ + S++SSS + N S V+ T Y + + IG P LDTGS+
Sbjct: 31 TIDLIHRRSNASSSRVSNTQAGS---PYADTVFDTYEYLMKLQIGTPPFEVEAVLDTGSE 87
Query: 74 LTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNCEDPAQCDYELEYAD 133
L W QC PC+ C + P++ PS E + P H C Y+L Y D
Sbjct: 88 LIWTQC-LPCLHCYDQKAPIFDPSKSSTFKE----TRCNTPDH-------SCPYKLVYDD 135
Query: 134 GGSSLGVLVKDAFAFNYTNG-QRLNPRLALGCGYNQVPGASYHP-LDGILGLGKGKSSIV 191
+ G L + + T+G + P +GC N G+ + P GI+GL +G S++
Sbjct: 136 KSYTQGTLATETVTIHSTSGVPFVMPETIIGCSRNN-SGSGFRPSSSGIVGLSRGSLSLI 194
Query: 192 SQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDY---TKYYSPGVAEL 248
SQ+ GG + GD + ++ T+ Y S G +
Sbjct: 195 SQM-----------------GGAYP--GDGVVSTTMFAKTAKRGQYYLNLDAVSVGDTRI 235
Query: 249 FFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
G N +V DSG+ TY + +++ ++A + + ++ LC+
Sbjct: 236 ETVGTPFHALNGNIVIDSGTPLTYFPVSYCNLVRKAVERVVTADRVVDPSRNDM--LCY- 292
Query: 309 GRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGIL 360
+ N ++ F + + F+ G L + Y+ ++ G CL I+
Sbjct: 293 ----YSNTIEI---FPVITVHFSGGADLVLDKY--NMYMELNRGGVFCLAII 335
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 96/371 (25%), Positives = 149/371 (40%), Gaps = 52/371 (14%)
Query: 23 SSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP 82
SSS+S L N+ ++ ++ G G Y++ IG P + DTGSDL W +CDA
Sbjct: 75 SSSASQLSNNDTDTVPLRMDGG---GGAYDMEFSIGTPPQKLTALADTGSDLIWTKCDAG 131
Query: 83 CVRCVEAP---HPLYRPSNDLVPCEDPICASLHAPGHHNCED-PAQCDYELEYADGGS-- 136
HP + +PC D +CA+L + C A+CDY+ Y G
Sbjct: 132 GGAAWGGSSSYHPNASSTFTRLPCSDRLCAALRSYSLARCAAGGAECDYKYAYGLGDDPD 191
Query: 137 -SLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLH 195
+ G L + F T G P + GC Y G++GLG+G S+VSQL
Sbjct: 192 FTQGFLGSETF----TLGGDAVPGVGFGC--TTALEGDYGEGAGLVGLGRGPLSLVSQLD 245
Query: 196 SQKLIRNVVGHCLSGGGGGF--LFFG--DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFG 251
+ + +CL+ L FG + + V ++ T +Y+ + + G
Sbjct: 246 AGTFM-----YCLTADASKASPLLFGALATMTGAGAGVQSTGLLASTTFYAVNLRSITIG 300
Query: 252 GETTG--LKNLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
TT VVFDSG++ TYL Y + + ++ + E G
Sbjct: 301 SATTAGVGGPGGVVFDSGTTLTYLAEPAYTEAKAAFLSQTTSLTPVE------------G 348
Query: 310 RRPFKNVH---DVKKCFRTLALSFTDGKTRTLFELTPEA-YLIISNKGNVCLGILNGAEV 365
R F+ + D + + L F G L P A Y++ + G VC +
Sbjct: 349 RYGFEACYEKPDSARLIPAMVLHFDGGADMAL----PVANYVVEVDDGVVCWVVQRSPS- 403
Query: 366 GLQDLNVIGGI 376
L++IG I
Sbjct: 404 ----LSIIGNI 410
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 97/349 (27%), Positives = 142/349 (40%), Gaps = 54/349 (15%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP---- 96
V G +G Y ++ +G P P L LDTGSD+ WLQC APC +C ++ P
Sbjct: 132 VSGLAQGSGEYFASVGVGTPPTPALLVLDTGSDVVWLQC-APCRQCYAQSGRVFDPRRSR 190
Query: 97 SNDLVPCEDPIC-ASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
S V C P C G C Y++ Y DG + G L + F G R
Sbjct: 191 SYAAVRCGAPPCRGLDAGGGGGCDRRRGTCLYQVAYGDGSVTAGDLATETLWF--ARGAR 248
Query: 156 LNPRLALGCGYNQVPGASYHPLDGIL---GLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
+ PR+A+GCG++ +G+ G L +Q R G S
Sbjct: 249 V-PRVAVGCGHDN---------EGLFVAAAGLLGLGRGRLSLPTQTARR--YGRRFS--- 293
Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGE-TTGLKNLPVVFDSGSSYT 271
+ F G DL R + ++ GV E + +TG V+ DSG+S T
Sbjct: 294 --YCFQGSDL--DHRTIIRTVHQHVGGARVRGVGERSLRLDPSTGRGG--VILDSGTSVT 347
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETL-PLCW--KGRRPFKNVHDVKKCFRTLAL 328
L R Y + + +A L+ AP +L C+ +GRR K T+++
Sbjct: 348 RLARPVYVAVREAFRA--AAGGLRLAPGGFSLFDTCYDLRGRRVVK--------VPTVSV 397
Query: 329 SFTDGKTRTLFELTPEAYLI-ISNKGNVCLGILNGAEVGLQDLNVIGGI 376
G L PE YLI + +G CL L G + G ++++G I
Sbjct: 398 HLAGGAE---VALPPENYLIPVDTRGTFCLA-LAGTDGG---VSIVGNI 439
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 87/331 (26%), Positives = 137/331 (41%), Gaps = 47/331 (14%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDP 106
Y VT+ +G R + +DTGSDL+W+QC PC RC P++ PS V C
Sbjct: 66 YIVTVELG--GRKMTVIVDTGSDLSWVQCQ-PCNRCYNQQDPVFNPSKSPSYRTVLCNSL 122
Query: 107 ICASLH-APGHHNC--EDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
C SL A G+ +P C+Y + Y DG + G + + G G
Sbjct: 123 TCRSLQLATGNSGVCGSNPPTCNYVVNYGDGSYTSGEVGMEHLNL----GNTTVNNFIFG 178
Query: 164 CGY-NQ--VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL---SGGGGGFLF 217
CG NQ GAS G++GLG+ S++SQ+ + V +CL G L
Sbjct: 179 CGRKNQGLFGGAS-----GLVGLGRTDLSLISQIS--PMFGGVFSYCLPTTEAEASGSLV 231
Query: 218 FGDD---LYDSSRVVWTSMSSD-YTKYYSPGVAELFFGG---ETTGLKNLPVVFDSGSSY 270
G + +++ + +T M + +Y + + GG + ++ DSG+
Sbjct: 232 MGGNSSVYKNTTPISYTRMIHNPLLPFYFLNLTGITVGGVEVQAPSFGKDRMIIDSGTVI 291
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK--GRRPFKNVHDVKKCFRTLAL 328
+ L YQ L + K+ S AP L C+ G + K + D+K F
Sbjct: 292 SRLPPSIYQALKAEFVKQFSG--YPSAPSFMILDSCFNLSGYQEVK-IPDIKMYF----- 343
Query: 329 SFTDGKTRTLFELTPEAYLIISNKGNVCLGI 359
+G ++T Y + ++ VCL I
Sbjct: 344 ---EGSAELNVDVTGVFYSVKTDASQVCLAI 371
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 89/391 (22%), Positives = 153/391 (39%), Gaps = 76/391 (19%)
Query: 20 SSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC 79
+SSS + + S+ +F+ + + G Y+ + G P + L DTGS L W C
Sbjct: 50 ASSSQTRAHQIKTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPC 109
Query: 80 DAPCVRCVEAPHPLYRP------------SNDLVPCEDPICASLHA-----------PGH 116
+ + C E P P S+ LV C++P C+ + P
Sbjct: 110 TSRYL-CSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKT 168
Query: 117 HNCED--PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASY 174
NC PA Y ++Y GS+ G+L+ + F + P +GC + S
Sbjct: 169 ENCTQTCPA---YVVQYGS-GSTAGLLLSETLDF----PDKXIPNFVVGCSF-----LSI 215
Query: 175 HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG------GGFLFFGDDLYDSSRV 228
H GI G G+G S+ SQ+ +K +CL+ G L SS +
Sbjct: 216 HQPSGIAGFGRGSESLPSQMGLKKF-----AYCLASRKFDDSPHSGQLILDSTGVKSSGL 270
Query: 229 VWTSMSSD-------YTKYYSPGVAELFFGGETTGLK----------NLPVVFDSGSSYT 271
+T + Y +YY + ++ G + + N + DSGS++T
Sbjct: 271 TYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFT 330
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC-FRTLALSF 330
++++ + + +K+L+ + A + ETL G RP ++ K F L F
Sbjct: 331 FMDKPVLEVVAREFEKQLA--NWTRATDVETL----TGLRPCFDISKEKSVKFPELIFQF 384
Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILN 361
G L + ++S+ G CL ++
Sbjct: 385 KGGAKWAL--PLNNYFALVSSSGVACLTVVT 413
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 51/150 (34%), Positives = 75/150 (50%), Gaps = 14/150 (9%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV---RCVEAPHPLYRPSND----LVPC 103
Y VT +G P +++DTGSDL+W+QC PC C PL+ P+ VPC
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCK-PCSAAPSCYSQKDPLFDPAQSSSYAAVPC 198
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
P+CA L C AQC Y + Y DG ++ GV D + ++ + G
Sbjct: 199 GGPVCAGLGIYAASACSA-AQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQ---GFFFG 254
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQ 193
CG+ Q ++ +DG+LGLG+ + S+V Q
Sbjct: 255 CGHAQ--SGLFNGVDGLLGLGREQPSLVEQ 282
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 71/277 (25%), Positives = 116/277 (41%), Gaps = 26/277 (9%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLV- 101
G+ T Y +T+ IG PA + +DTGSD++W++C++ L+ PS
Sbjct: 121 GSALDTMEYVITVGIGSPAVTQTMMIDTGSDVSWVRCNS------TDGLTLFDPSKSTTY 174
Query: 102 ---PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
C CA L G C + + C Y ++Y DG ++ G D A + ++
Sbjct: 175 APFSCSSAACAQLGNNG-DGCSN-SGCQYRVQYGDGSNTTGTYSSDTLALSASD---TVT 229
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFL 216
GC +++ +DG++GLG S+VSQ + +CL + GFL
Sbjct: 230 DFHFGCSHHE-EDFDGEKIDGLMGLGGDAQSLVSQ--TAATYGKSFSYCLPPTNRTSGFL 286
Query: 217 FFGDDLYDSSRVVWTSMSS--DYTKYYSPGVAELFFGGETTGLKNLPV----VFDSGSSY 270
FG S V T M Y + ++ GG G++ + V DSG+
Sbjct: 287 TFGAPNGTSGGFVTTPMLRWPKAPTLYGVLLQDISVGGTPLGIQPSVLSNGSVMDSGTVI 346
Query: 271 TYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
T+L R Y L+S + ++ + A L C+
Sbjct: 347 TWLPRRAYSALSSAFRSSMTRLRHQRAAPLGILDTCY 383
>gi|125556778|gb|EAZ02384.1| hypothetical protein OsI_24487 [Oryza sativa Indica Group]
Length = 551
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 89/342 (26%), Positives = 139/342 (40%), Gaps = 62/342 (18%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH---------PLYRP-------SNDL 100
+G P + + LDTGSDL W+ CD C +C + P R ++
Sbjct: 111 VGTPNTTFLVALDTGSDLFWVPCD--CKQCAPLGNLTAVDGGGGPELRQYSPSKSSTSKT 168
Query: 101 VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTN------- 152
V C +C +A + C Y + YA SS G LV+D
Sbjct: 169 VTCASNLCDQPNACATAT----SSCPYAVRYAMANTSSSGELVEDVLYLTREKGAAAAAA 224
Query: 153 GQRLNPRLALGCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCL 208
G + + GCG QV S+ DG++GLG K S+ S L S +++ N C
Sbjct: 225 GAAVRTPVVFGCG--QVQTGSFLDGAAADGLMGLGMEKVSVPSILASTGVVKSNSFSMCF 282
Query: 209 SGGGGGFLFFGD----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF 264
S G G + FGD D ++ +V ++ S YY+ + + + G KNLP+ F
Sbjct: 283 SKDGLGRINFGDTGSADQSETPFIVKSTHS-----YYNISITSM-----SVGDKNLPLGF 332
Query: 265 ----DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
DSG+S+TYLN Y T+ ++S + + + P PF+ + +
Sbjct: 333 YAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPF------PFEYCYSLS 386
Query: 321 KCFRTLALSFTDGKTR--TLFELTPEAYLIISNKGNVCLGIL 360
T+ L T +F +T Y I + N + I+
Sbjct: 387 PDQTTVELPIVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRII 428
>gi|242089103|ref|XP_002440384.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
gi|241945669|gb|EES18814.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
Length = 555
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 77/298 (25%), Positives = 117/298 (39%), Gaps = 57/298 (19%)
Query: 44 NVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC----------------------DA 81
N G Y V++ G PA PY L LDT +DLTW+ C D
Sbjct: 133 NTAHVGMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRQSSKTMSVGGDD 192
Query: 82 PCVRCV---EAPHPLYRPSND----LVPCEDPICASLHAPGHHNCEDPAQ---CDYELEY 131
V + EA YRP+ + C + CA H P ++ C+ P++ C Y +
Sbjct: 193 DVVAALAKKEARKNWYRPAKSSSWRRIRCSEQQCA--HLP-YNTCQSPSKLESCSYYQKT 249
Query: 132 ADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALGCGYNQVPGASYHPLDGILGLGKGKSSI 190
DG ++G+ + ++G+ P L LGC + GAS DG+L LG G S
Sbjct: 250 QDGTVTIGIYGNEKATVTVSDGRMAKLPGLVLGCSVLEA-GASVDAHDGVLSLGNGHMSF 308
Query: 191 VSQLHSQKLIRNVVGHCL-----SGGGGGFLFFGDD---LYDSSRVVWTSMSSDYTKYYS 242
+H+ CL S +L FG + + + + D Y
Sbjct: 309 A--IHAVLRFGGRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETEILYNVDVKAAYG 366
Query: 243 PGVAELFFGGETTGL--------KNL--PVVFDSGSSYTYLNRVTYQTLTSIMKKELS 290
P V + GGE + K L V+ D+ +S T L Y+ L + + + L+
Sbjct: 367 PRVTAVLVGGERLDIPDDVWNIDKGLGSGVILDTSTSVTSLVPEAYEPLVAALDRHLA 424
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 43/126 (34%), Positives = 63/126 (50%), Gaps = 15/126 (11%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC---VRCVEAPHPLYRPS----NDLV 101
G Y + +GQP + YF DTGSD++WLQC PC C + P++ P +
Sbjct: 182 GEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQ-PCDGENGCYKQIGPIFDPKSSSSYSPL 240
Query: 102 PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
C+ C H C D C YE+EY DG ++G L + F+F ++N P L
Sbjct: 241 SCDSEQC---HLLDEAAC-DANSCIYEVEYGDGSFTVGELATETFSFRHSNSI---PNLP 293
Query: 162 LGCGYN 167
+GCG++
Sbjct: 294 IGCGHD 299
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 81/302 (26%), Positives = 118/302 (39%), Gaps = 69/302 (22%)
Query: 37 LLFQVHGNVYPTG-----------YYNVTM----YIGQPARPYFLDLDTGSDLTWLQCDA 81
LLF++ P G ++NV++ +G P + + LDTGS+L+WL C
Sbjct: 36 LLFELRARQVPAGALPRPASKLRFHHNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAP 95
Query: 82 PCVRCVEAPHPL-YRPSNDL----VPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGG 135
L +RP L VPC C S P C+ + QC L YADG
Sbjct: 96 GGGGGGGGRSALSFRPRASLTFASVPCGSAQCRSRDLPSPPACDGASKQCRVSLSYADGS 155
Query: 136 SSLGVLVKDAFAFNYTNGQRLNPRLALGC---GYNQVPGASYHPLDGILGLGKGKSSIVS 192
SS G L + F T GQ R A GC ++ P G+LG+ +G S VS
Sbjct: 156 SSDGALATEVF----TVGQGPPLRAAFGCMATAFDTSPDGVAT--AGLLGMNRGALSFVS 209
Query: 193 QLHSQKLIRNVVGHCLSG-GGGGFLFFG-DDLYDSSRVVWTSMSSDYTKYYSPGVAELFF 250
Q +++ +C+S G L G DL + +YT Y P + +F
Sbjct: 210 QASTRRF-----SYCISDRDDAGVLLLGHSDL--------PFLPLNYTPLYQPAMPLPYF 256
Query: 251 G---------GETTGLKNLPV---------------VFDSGSSYTYLNRVTYQTLTSIMK 286
G G K LP+ + DSG+ +T+L Y L +
Sbjct: 257 DRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFS 316
Query: 287 KE 288
++
Sbjct: 317 RQ 318
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 89/391 (22%), Positives = 153/391 (39%), Gaps = 76/391 (19%)
Query: 20 SSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQC 79
+SSS + + S+ +F+ + + G Y+ + G P + L DTGS L W C
Sbjct: 50 ASSSQTRAHQIKTPKSNSVFKSPLSPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPC 109
Query: 80 DAPCVRCVEAPHPLYRP------------SNDLVPCEDPICASLHA-----------PGH 116
+ + C E P P S+ LV C++P C+ + P
Sbjct: 110 TSRYL-CSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKT 168
Query: 117 HNCED--PAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASY 174
NC PA Y ++Y GS+ G+L+ + F + P +GC + S
Sbjct: 169 ENCTQTCPA---YVVQYGS-GSTAGLLLSETLDF----PDKKIPNFVVGCSF-----LSI 215
Query: 175 HPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG------GGFLFFGDDLYDSSRV 228
H GI G G+G S+ SQ+ +K +CL+ G L SS +
Sbjct: 216 HQPSGIAGFGRGSESLPSQMGLKKF-----AYCLASRKFDDSPHSGQLILDSTGVKSSGL 270
Query: 229 VWTSMSSD-------YTKYYSPGVAELFFGGETTGLK----------NLPVVFDSGSSYT 271
+T + Y +YY + ++ G + + N + DSGS++T
Sbjct: 271 TYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFT 330
Query: 272 YLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKC-FRTLALSF 330
++++ + + +K+L+ + A + ETL G RP ++ K F L F
Sbjct: 331 FMDKPVLEVVAREFEKQLA--NWTRATDVETL----TGLRPCFDISKEKSVKFPELIFQF 384
Query: 331 TDGKTRTLFELTPEAYLIISNKGNVCLGILN 361
G L + ++S+ G CL ++
Sbjct: 385 KGGAKWAL--PLNNYFALVSSSGVACLTVVT 413
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 43/126 (34%), Positives = 63/126 (50%), Gaps = 15/126 (11%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC---VRCVEAPHPLYRPS----NDLV 101
G Y + +GQP + YF DTGSD++WLQC PC C + P++ P +
Sbjct: 182 GEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQ-PCDGENGCYKQIGPIFDPKSSSSYSPL 240
Query: 102 PCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLA 161
C+ C H C D C YE+EY DG ++G L + F+F ++N P L
Sbjct: 241 SCDSEQC---HLLDEAAC-DANSCIYEVEYGDGSFTVGELATETFSFRHSNSI---PNLP 293
Query: 162 LGCGYN 167
+GCG++
Sbjct: 294 IGCGHD 299
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 59/124 (47%), Gaps = 7/124 (5%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCE 104
G Y ++ +G P +DTGSD+ WLQC PC C P++ PS +PC
Sbjct: 92 GEYLMSYSVGTPPFQILGIVDTGSDIIWLQCQ-PCEDCYNQTTPIFDPSQSKTYKTLPCS 150
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
IC S+ + + + +C+Y + Y D S G L + T+G + P+ +G
Sbjct: 151 SNICQSVQSAASCSSNND-ECEYTITYGDNSHSQGDLSVETLTLGSTDGSSVQFPKTVIG 209
Query: 164 CGYN 167
CG+N
Sbjct: 210 CGHN 213
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 69/265 (26%), Positives = 106/265 (40%), Gaps = 34/265 (12%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCV-RCVEAPHPLYRPSNDL- 100
G + + Y V + +G P R L DTGS LTW QC+ PC C + P++ PS
Sbjct: 132 GRLIGSADYYVVVGLGTPKRDLSLIFDTGSYLTWTQCE-PCAGSCYKQQDPIFDPSKSSS 190
Query: 101 ---VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
+ C +C + G + D A C Y+++Y D S G L ++ T+ +
Sbjct: 191 YTNIKCTSSLCTQFRSAGCSSSTD-ASCIYDVKYGDNSISRGFLSQERLTITATD---IV 246
Query: 158 PRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGF 215
GCG Q + G++GL + S V Q S + + +CL + G
Sbjct: 247 HDFLFGCG--QDNEGLFRGTAGLMGLSRHPISFVQQTSS--IYNKIFSYCLPSTPSSLGH 302
Query: 216 LFFGDDLYDSSRVVWTSMS--SDYTKYYSPGVAELFFGGETTGLKNLPVV---------- 263
L FG ++ + +T S S +Y + + GG LP V
Sbjct: 303 LTFGASAATNANLKYTPFSTISGENSFYGLDIVGISVGG-----TKLPAVSSSTFSAGGS 357
Query: 264 -FDSGSSYTYLNRVTYQTLTSIMKK 287
DSG+ T L Y L S ++
Sbjct: 358 IIDSGTVITRLPPTAYAALRSAFRQ 382
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 81/322 (25%), Positives = 126/322 (39%), Gaps = 32/322 (9%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
G Y M +G PA Y + +DTGS LTWLQC V C P++ P + V C
Sbjct: 120 GNYVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCS 179
Query: 105 DPICASLHAPGHH--NCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
C+ L + + C C Y+ Y D S+G L KD +F T+ P
Sbjct: 180 AQQCSDLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS----LPNFYY 235
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFF 218
GCG + + G++GL + K S++ QL + +CL S G +
Sbjct: 236 GCGQDN--EGLFGRSAGLIGLARNKLSLLYQLAPS--LGYSFTYCLPSSSSSGYLSLGSY 291
Query: 219 GDDLYDSSRVVWTSMSSD--YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRV 276
Y + +V +S+ + K VA ++ +LP + DSG+ T L
Sbjct: 292 NPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRLPTS 351
Query: 277 TYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTR 336
Y L+ + + K A L C+KG+ + V +SF G
Sbjct: 352 VYSALSKAVAAAM--KGTSRASAYSILDTCFKGQASRVSAPAVT-------MSFAGGAA- 401
Query: 337 TLFELTPEAYLIISNKGNVCLG 358
+L+ + L+ + CL
Sbjct: 402 --LKLSAQNLLVDVDDSTTCLA 421
>gi|18855042|gb|AAL79734.1|AC091774_25 putative chloroplast nucleoid DNA-binding protein [Oryza sativa
Japonica Group]
gi|54291046|dbj|BAD61723.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
Group]
gi|125598520|gb|EAZ38300.1| hypothetical protein OsJ_22678 [Oryza sativa Japonica Group]
Length = 551
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 89/342 (26%), Positives = 139/342 (40%), Gaps = 62/342 (18%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH---------PLYRP-------SNDL 100
+G P + + LDTGSDL W+ CD C +C + P R ++
Sbjct: 111 VGTPNTTFLVALDTGSDLFWVPCD--CKQCAPLGNLTAVDGGGGPELRQYSPSKSSTSKT 168
Query: 101 VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGG-SSLGVLVKDAFAFNYTN------- 152
V C +C +A + C Y + YA SS G LV+D
Sbjct: 169 VTCASNLCDQPNACATAT----SSCPYAVRYAMANTSSSGELVEDVLYLTREKGAAAAAA 224
Query: 153 GQRLNPRLALGCGYNQVPGASY---HPLDGILGLGKGKSSIVSQLHSQKLIR-NVVGHCL 208
G + + GCG QV S+ DG++GLG K S+ S L S +++ N C
Sbjct: 225 GAAVRTPVVFGCG--QVQTGSFLDGAAADGLMGLGMEKVSVPSILASTGVVKSNSFSMCF 282
Query: 209 SGGGGGFLFFGD----DLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVF 264
S G G + FGD D ++ +V ++ S YY+ + + + G KNLP+ F
Sbjct: 283 SKDGLGRINFGDTGSADQSETPFIVKSTHS-----YYNISITSM-----SVGDKNLPLGF 332
Query: 265 ----DSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVK 320
DSG+S+TYLN Y T+ ++S + + + P PF+ + +
Sbjct: 333 YAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPF------PFEYCYSLS 386
Query: 321 KCFRTLALSFTDGKTR--TLFELTPEAYLIISNKGNVCLGIL 360
T+ L T +F +T Y I + N + I+
Sbjct: 387 PDQTTVELPVVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRII 428
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 136/344 (39%), Gaps = 43/344 (12%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y + + +G P + + L LDTGSDL WLQC PC C Y P + C
Sbjct: 157 SGEYFMDVLVGTPPKHFSLILDTGSDLNWLQC-LPCYDCFHQNGMFYDPKTSASFKNITC 215
Query: 104 EDPICASLHAPGHH-NCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-- 159
DP C+ + +P CE D C Y Y D ++ G + F N T + +
Sbjct: 216 NDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYK 275
Query: 160 ---LALGCG-YNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
+ GCG +N+ + L G+ SS + L+ +V +
Sbjct: 276 VGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSNTNVSSK 335
Query: 216 LFFGD--DLYDSSRVVWTSM----SSDYTKYYSPGVAELFFGGETTGLKNLP-------- 261
L FG+ DL + + + +TS + +Y + + GG+ +
Sbjct: 336 LIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNISSDGD 395
Query: 262 --VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDV 319
+ DSG++ +Y Y+ I+K + + K + P P+ P NV +
Sbjct: 396 GGTIIDSGTTLSYFAEPAYE----IIKNKFAEKMKENYPIFRDFPVL----DPCFNVSGI 447
Query: 320 KKC---FRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGIL 360
++ L ++F DG T++ E I ++ VCL IL
Sbjct: 448 EENNIHLPELGIAFVDG---TVWNFPAENSFIWLSEDLVCLAIL 488
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 60/135 (44%), Gaps = 10/135 (7%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCE 104
G Y + + +G P P DTGSDL W QC PC C E PL+ P + C+
Sbjct: 92 GAYLMNISLGTPPVPMLGIADTGSDLIWRQC-LPCPNCYEQVEPLFDPKESETYKTLDCD 150
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
+ C L G +C+D C Y Y D + G L D T G + P +A G
Sbjct: 151 NEFCQDLGQQG--SCDDDNTCTYSYSYGDRSYTRGDLSSDTLTIGSTEGDPASFPGIAFG 208
Query: 164 CGYNQVPGASYHPLD 178
CG++ G +++ D
Sbjct: 209 CGHDN--GGTFNEKD 221
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 72/273 (26%), Positives = 113/273 (41%), Gaps = 48/273 (17%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCEDP 106
Y ++ IG P + +DTG+D W QC PC C+ P++ PS +PC P
Sbjct: 90 YVMSYSIGTPPFQLYSLIDTGNDNIWFQCK-PCKPCLNQTSPMFHPSKSSTYKTIPCTSP 148
Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR-LALGCG 165
IC +A GH+ LGV D N NG ++ + + +GCG
Sbjct: 149 ICK--NADGHY--------------------LGV---DTLTLNSNNGTPISFKNIVIGCG 183
Query: 166 Y-NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL-----SGGGGGFLFFG 219
+ NQ P Y + G +GL +G S +SQL+S I +CL L FG
Sbjct: 184 HRNQGPLEGY--VSGNIGLARGPLSFISQLNSS--IGGKFSYCLVPLFSKENVSSKLHFG 239
Query: 220 DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP----VVFDSGSSYTYLNR 275
D S ++ + Y+ + G L+N + DSG++ T L +
Sbjct: 240 DKSTVSGLGTVSTPIKEENGYFV-SLEAFSVGDHIIKLENSDNRGNSIIDSGTTMTILPK 298
Query: 276 VTYQTLTSIMKKELSAKSLKEAPEDETLPLCWK 308
Y L S++ + K +K+ + LC++
Sbjct: 299 DVYSRLESVVLDMVKLKRVKDP--SQQFNLCYQ 329
>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
Length = 405
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 95/389 (24%), Positives = 143/389 (36%), Gaps = 65/389 (16%)
Query: 10 LCFPTVRMSSSSSSSSSSSLFNHV-------GSSLLFQVHGNVYPTGYYNVTMYIGQPAR 62
LCF +V S S ++ L V G ++ ++ + G Y IG P +
Sbjct: 11 LCFISVTACSLSEQATRGRLLAGVDATPPAAGGAVAVPIY--LSSQGLYVANFTIGTPPQ 68
Query: 63 PYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHN 118
P +D +L W QC PC C E PL+ P+ +PC +C S+ +
Sbjct: 69 PVSAVVDLTGELVWTQC-TPCQPCFEQDLPLFDPTKSSTFRGLPCGSHLCESIPESSRNC 127
Query: 119 CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGC------GYNQVPGA 172
D C YE G + G+ D FA L GC + G
Sbjct: 128 TSD--VCIYEAP-TKAGDTGGMAGTDTFAIGAA-----KETLGFGCVVMTDKRLKTIGGP 179
Query: 173 SYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFFGDDLYD-------- 224
S GI+GLG+ S+V+Q++ +CL+G G LF G
Sbjct: 180 S-----GIVGLGRTPWSLVTQMN-----VTAFSYCLAGKSSGALFLGATAKQLAGGKNSS 229
Query: 225 SSRVVWTSMSSD---YTKYYSPGVAELFFGG---ETTGLKNLPVVFDSGSSYTYLNRVTY 278
+ V+ TS S YY +A + GG + V+ D+ S +YL Y
Sbjct: 230 TPFVIKTSAGSSDNGSNPYYMVKLAGIKAGGAPLQAASSSGSTVLLDTVSRASYLADGAY 289
Query: 279 QTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTL 338
+ L + + + + P+ LC+ V L +F G T
Sbjct: 290 KALKKALTAAVGVQPVASPPKPYD--LCFS--------KAVAGDAPELVFTFDGGAALT- 338
Query: 339 FELTPEAYLIISNKGNVCLGILNGAEVGL 367
+ P YL+ S G VCL I + A + L
Sbjct: 339 --VPPANYLLASGNGTVCLTIGSSASLNL 365
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 70/159 (44%), Gaps = 14/159 (8%)
Query: 51 YNVTMYIGQPARP--YFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCE 104
Y + + IG P RP L LDTGSDL W QC C C + P P++R S VPC
Sbjct: 94 YLIHLGIGTP-RPQRVVLHLDTGSDLVWTQC--ACTVCFDQPVPVFRASVSHTFSRVPCS 150
Query: 105 DPICA-SLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAF---NYTNGQRLNPRL 160
DP+C +++ P C Y Y D + G + +D F F + + P +
Sbjct: 151 DPLCGHAVYLPLSGCAARDRSCFYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPNI 210
Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKL 199
GCG G GI G G G S+ SQL ++
Sbjct: 211 RFGCGMMNY-GLFTPNQSGIAGFGTGPLSLPSQLKVRRF 248
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 60/124 (48%), Gaps = 10/124 (8%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
+G Y + +G PA + LDTGSD+ WLQC APC C ++ P S V C
Sbjct: 125 SGEYFAQVGVGTPATTALMVLDTGSDVVWLQC-APCRHCYAQSGRVFDPRRSRSYAAVDC 183
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
PIC L + G + C Y++ Y DG + G + F G R+ R+A+G
Sbjct: 184 VAPICRRLDSAGCDRRRN--SCLYQVAYGDGSVTAGDFASETLTF--ARGARVQ-RVAIG 238
Query: 164 CGYN 167
CG++
Sbjct: 239 CGHD 242
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 60/124 (48%), Gaps = 10/124 (8%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
+G Y + +G PA + LDTGSD+ WLQC APC C ++ P S V C
Sbjct: 119 SGEYFAQVGVGTPATTALMVLDTGSDVVWLQC-APCRHCYAQSGRVFDPRRSRSYAAVDC 177
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
PIC L + G + C Y++ Y DG + G + F G R+ R+A+G
Sbjct: 178 VAPICRRLDSAGCDRRRN--SCLYQVAYGDGSVTAGDFASETLTF--ARGARVQ-RVAIG 232
Query: 164 CGYN 167
CG++
Sbjct: 233 CGHD 236
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 60/124 (48%), Gaps = 10/124 (8%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SNDLVPC 103
+G Y + +G PA + LDTGSD+ WLQC APC C ++ P S V C
Sbjct: 119 SGEYFAQVGVGTPATTALMVLDTGSDVVWLQC-APCRHCYAQSGRVFDPRRSRSYAAVDC 177
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
PIC L + G + C Y++ Y DG + G + F G R+ R+A+G
Sbjct: 178 VAPICRRLDSAGCDRRRN--SCLYQVAYGDGSVTAGDFASETLTF--ARGARVQ-RVAIG 232
Query: 164 CGYN 167
CG++
Sbjct: 233 CGHD 236
>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
Length = 570
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 87/343 (25%), Positives = 142/343 (41%), Gaps = 48/343 (13%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPC--VRCVEAPHPLYRPSND----LVPCE 104
Y +T+ +G P R DTGSDL W++C AP + PS V C+
Sbjct: 101 YLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRSSTYGRVSCQ 160
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPR----- 159
C +L G C+D + C Y Y DG ++ GVL + F F+ R +PR
Sbjct: 161 TDACEAL---GRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGAGR-SPRQVRIG 216
Query: 160 -LALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFF 218
+ GC A P DG++GLG G S+V+QL + +CL
Sbjct: 217 GVKFGC---STATAGSFPADGLVGLGGGAVSLVTQLGGATSLGRRFSYCL---------V 264
Query: 219 GDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETT--GLKNLPVVFDSGSSYTYLNRV 276
+ SS + + ++ +D T+ PG A G T + ++ DSG++ T+L+
Sbjct: 265 PHSVNASSALNFGAL-ADVTE---PGAASTPLVGNKTVASAASSRIIVDSGTTLTFLDPS 320
Query: 277 TYQTLTSIMKKELSAKSLKEAPEDETLPLCWK--GRRPFKNVHDVKKCFRTLALSFTDGK 334
+ + + ++ ++ D L LC+ GR + + L L F G
Sbjct: 321 LLGPIVDELSRRITLPPVQS--PDGLLQLCYNVAGRE-----VEAGESIPDLTLEFGGGA 373
Query: 335 TRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGIG 377
L PE + +G +CL I+ E Q ++++G +
Sbjct: 374 A---VALKPENAFVAVQEGTLCLAIVATTE--QQPVSILGNLA 411
>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 491
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 94/378 (24%), Positives = 141/378 (37%), Gaps = 88/378 (23%)
Query: 45 VYPTGY--YNVTMYIGQPARPYFLDLDTGSDLTWL---------QCDAPCVRCVEAPHPL 93
+YP Y Y T +G P +P + LDTGS LTW+ C +P V HP
Sbjct: 91 LYPHSYGGYAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPK 150
Query: 94 YRPSNDLVPCEDPICASLH--------------APGHHNCEDPAQ--C-DYELEYADGGS 136
S+ LV C +P C +H +PG NC A C Y + Y GS
Sbjct: 151 NSSSSRLVGCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGS-GS 209
Query: 137 SLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHS 196
+ G+L+ D R P LGC V + P G+ G G+G S+ +QL
Sbjct: 210 TAGLLIADTL----RAPGRAVPGFVLGCSLVSV----HQPPSGLAGFGRGAPSVPAQLGL 261
Query: 197 QKLIRNVVGHCLSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAE--------- 247
K +CL F D+ S +V Y P V
Sbjct: 262 PKF-----SYCLLS-----RRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYG 311
Query: 248 ----LFFGGETTGLK--NLPV-------------VFDSGSSYTYLNRVTYQTLTSIMKKE 288
L G T G K LP + DSG+++TYL+ +Q + +
Sbjct: 312 VYYYLALRGVTVGGKAVRLPARAFAGNAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAA 371
Query: 289 LSA--KSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAY 346
+ K K+A + L C+ + +++ L+ F G + +L E Y
Sbjct: 372 VGGRYKRSKDAEDGLGLHPCFALPQGARSM-----ALPELSFHFEGG---AVMQLPVENY 423
Query: 347 LIISNKGNV---CLGILN 361
+++ +G V CL ++
Sbjct: 424 FVVAGRGAVEAICLAVVT 441
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 100/376 (26%), Positives = 145/376 (38%), Gaps = 63/376 (16%)
Query: 23 SSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP 82
S +S++ + S+ L G +G Y VT+ IG P L DTGSDLTW QC+ P
Sbjct: 104 SKNSANEVSEAKSTELPAKSGITLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCE-P 162
Query: 83 CV-RCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSS 137
C+ C P + PS+ V C P+C + NC Y + Y D +
Sbjct: 163 CLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCEDAESCSASNCV------YSIGYGDKSFT 216
Query: 138 LGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQ---- 193
G L K+ F TN L + GCG N G+ G +
Sbjct: 217 QGFLAKEKFTL--TNSDVLE-DVYFGCGENN---------QGLFDGVAGLLGLGPGKLSL 264
Query: 194 -LHSQKLIRNVVGHCL---SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELF 249
+ N+ +CL + G L FG S V +T +SS + ++ G+ +
Sbjct: 265 PAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGI-SESVKFTPISS-FPSAFNYGIDII- 321
Query: 250 FGGETTGLKNLPV----------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPE 299
G + G K L + + DSG+ +T L Y L S+ K+++S S K
Sbjct: 322 --GISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTKVYAELRSVFKEKMS--SYKSTSG 377
Query: 300 DETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGI 359
C+ F + V + T+A SF G T+ EL + VCL
Sbjct: 378 YGLFDTCYD----FTGLDTVT--YPTIAFSFAGG---TVVELDGSGISLPIKISQVCL-- 426
Query: 360 LNGAEVGLQDLNVIGG 375
A G DL I G
Sbjct: 427 ---AFAGNDDLPAIFG 439
>gi|449533387|ref|XP_004173657.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 254
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 58/197 (29%), Positives = 85/197 (43%), Gaps = 32/197 (16%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN---------DLVPC 103
V++ IG P +P L LDTGS L+W+QC V+ P P + + L+PC
Sbjct: 69 VSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTATFDPSLSSSFSLLPC 128
Query: 104 EDPICASLHAPGH---HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL 160
PIC P +C+ C Y YADG + G LV++ F F + P +
Sbjct: 129 NHPICKP-RIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTF---SNSLSTPPV 184
Query: 161 ALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG----GFL 216
LGC GILG+ G+ S +SQ K +C+ G G
Sbjct: 185 ILGCAQGSTENR------GILGMNHGRLSFISQAKISKF-----SYCVPSRTGPNPTGLF 233
Query: 217 FFGDDLYDSSRVVWTSM 233
+ GD+ +SS+ + +M
Sbjct: 234 YLGDNP-NSSKFKYVTM 249
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 73/276 (26%), Positives = 120/276 (43%), Gaps = 38/276 (13%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLH 112
V + IGQP+ P + +DTGSD+ W+ C+ PC C L+ PS + P+C +
Sbjct: 103 VNLSIGQPSIPQLVVMDTGSDILWIMCN-PCTNCDNHLGLLFDPS--MSSTFSPLCKT-- 157
Query: 113 APGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTN-GQRLNPRLALGCGYNQVP 170
G C+ DP + + Y D S+ G +D F T+ G + +GCG+N
Sbjct: 158 PCGFKGCKCDPIP--FTISYVDNSSASGTFGRDILVFETTDEGTSQISDVIIGCGHNI-- 213
Query: 171 GASYHP-LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGFLFF-------GDDL 222
G + P +GILGL G +S+ +Q+ + +C+ + + G DL
Sbjct: 214 GFNSDPGYNGILGLNNGPNSLATQIGRK------FSYCIGNLADPYYNYNQLRLGEGADL 267
Query: 223 --YDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLP---VVFDSGSSYTYL---- 273
Y + V+ + S G L ET +K V+ DSG++ TYL
Sbjct: 268 EGYSTPFEVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTITYLVDSA 327
Query: 274 NRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKG 309
+++ Y + +++K + AP LC+ G
Sbjct: 328 HKLLYNEVRNLLKWSFRQVIFENAP----WKLCYYG 359
>gi|46488451|gb|AAS99547.1| aspartic protease PM5 [Plasmodium vivax]
gi|46488453|gb|AAS99548.1| aspartic protease PM5 [Plasmodium vivax]
Length = 536
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 91/412 (22%), Positives = 156/412 (37%), Gaps = 84/412 (20%)
Query: 10 LCFPTVRMSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLD 69
LC +V+ S S+ S L ++++G++ YY + + IG P + L LD
Sbjct: 27 LCALSVQGRSESTEGHSKDLLYK------YKLYGDIDEYAYYFLDIDIGTPEQRISLILD 80
Query: 70 TGSDLTWLQCDAPCVRC---VEAPHPLYR-PSNDLVPCEDPICASLHAPGHHNCEDPAQC 125
TGS C A C C +E P L ++ ++ CE+ C P NC +C
Sbjct: 81 TGSSSLSFPC-AGCKNCGVHMENPFNLNNSKTSSILYCENEEC-----PFKLNCVK-GKC 133
Query: 126 DYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLG- 184
+Y Y +G G D + N +R+ R +GC ++ Y G+LG+
Sbjct: 134 EYMQSYCEGSQISGFYFSDVVSVVSYNNERVTFRKLMGCHMHEESLFLYQQATGVLGMSL 193
Query: 185 ---KGKSSIVSQLHSQK-LIRNVVGHCLSGGGGGFLFFGDD------------------- 221
+G + V+ L ++ V C+S GG + G D
Sbjct: 194 SKPQGIPTFVNLLFDNAPQLKQVFTICISENGGELIAGGYDPAYIVRRRGSKSVSGQGSG 253
Query: 222 ----------------LYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFD 265
L ++ ++VW +++ Y Y ++F + K L ++ D
Sbjct: 254 PVSESLSESGEDPQVALREAEKIVWENVTRKYYYYIKVRGLDMFGTNMMSSSKGLEMLVD 313
Query: 266 SGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRT 325
SGS++T++ Y L LC + N +D K +
Sbjct: 314 SGSTFTHIPEDLYNKLNYFFD-----------------ILCIQD---MNNAYDANKRLKM 353
Query: 326 LALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEV-----GLQDLNV 372
SF + + F+ ++ I K N+C+ I++G + GL DL V
Sbjct: 354 TNESFNNPLVQ--FDDFRKSLKSIIAKENMCVKIVDGVQCWKYLEGLPDLFV 403
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 70/151 (46%), Gaps = 16/151 (10%)
Query: 22 SSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDA 81
+SS S N GS + V G +G Y V + +G P R ++ +D+GSD+ W+QC
Sbjct: 106 ASSDSRYEVNDFGSDV---VSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQ- 161
Query: 82 PCVRCVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSS 137
PC C + P++ P+ V C +C + G H+ C YE+ Y DG +
Sbjct: 162 PCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHS----GGCRYEVMYGDGSYT 217
Query: 138 LGVLVKDAFAFNYTNGQRLNPRLALGCGYNQ 168
G L + F T + +A+GCG+
Sbjct: 218 KGTLALETLTFAKT----VVRNVAMGCGHRN 244
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 92/359 (25%), Positives = 135/359 (37%), Gaps = 60/359 (16%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPH---PLYRPSND----LVPC 103
+++T+ I QP + L +DTGSDL W QC A H P+Y P +PC
Sbjct: 16 HSLTVGIVQPRK---LIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGESSTFAFLPC 72
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
D +C NC +C YE Y +++GVL + F F L RL G
Sbjct: 73 SDRLCQEGQF-SFKNCTSKNRCVYEDVYGS-AAAVGVLASETFTFGARRAVSL--RLGFG 128
Query: 164 CGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS---GGGGGFLFFG- 219
CG + S GILGL S+++QL Q+ +CL+ L FG
Sbjct: 129 CG--ALSAGSLIGATGILGLSPESLSLITQLKIQRF-----SYCLTPFADKKTSPLLFGA 181
Query: 220 -DDL--YDSSRVVWT----SMSSDYTKYYSPGVAELFFGGETTGLKNLPV---------- 262
DL + ++R + T S + YY P V G + G K L V
Sbjct: 182 MADLSRHKTTRPIQTTAIVSNPVETVYYYVPLV------GISLGHKRLAVPAASLAMRPD 235
Query: 263 -----VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVH 317
+ DSGS+ YL ++ + + + ED L R +
Sbjct: 236 GGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVEDYELCFVLPRRTAAAAME 295
Query: 318 DVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNVCLGILNGAEVGLQDLNVIGGI 376
V+ L L F G L + Y G +CL + G +++IG +
Sbjct: 296 AVQ--VPPLVLHFDGGAAMVLPR---DNYFQEPRAGLMCLAV--GKTTDGSGVSIIGNV 347
>gi|357517921|ref|XP_003629249.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523271|gb|AET03725.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 553
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 93/378 (24%), Positives = 145/378 (38%), Gaps = 79/378 (20%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRC-------------VEAPHPLYRP--- 96
T+ +G P + + LDTGSDL W+ CD C RC + +Y P
Sbjct: 103 TTIELGTPGVKFMVALDTGSDLFWVPCD--CTRCSATRSSAFASALASDFDLSVYNPNGS 160
Query: 97 -SNDLVPCEDPICASLHAPGHHN-CEDP-AQCDYELEYADGGSSL-GVLVKDAFAFNY-- 150
++ V C + +C H N C + C Y + Y +S G+LV+D
Sbjct: 161 STSKKVTCNNSLCT------HRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPD 214
Query: 151 TNGQRLNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHC 207
N + + GCG QV S+ + +G+ GLG K S+ S L + + C
Sbjct: 215 DNHDLVEANVIFGCG--QVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMC 272
Query: 208 LSGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKY--------YSPGVAELFFGGETTGLKN 259
G G + FGD S+ D T + Y+ + ++ G ++
Sbjct: 273 FGRDGIGRISFGDK---------GSLDQDETPFNVNPSHPTYNITINQVRVGTTLIDVE- 322
Query: 260 LPVVFDSGSSYTYLNRVTYQTLT-SIMKK----------------ELSAKSLKEAPEDET 302
+FDSG+S+TYL TY L+ S+ K E+ ED
Sbjct: 323 FTALFDSGTSFTYLVDPTYSRLSESVSDKICFHLARCYLKIKVTIEVFMLQFHSQVEDRR 382
Query: 303 LPLCWKGRRPFKNVHDVKKCFRTL---ALSFTDGKTRTLFELTPEAYLIISNKGNV--CL 357
P R PF +D+ T ++S T G P +IIS + + CL
Sbjct: 383 RPP--DSRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDP--IIIISTQSELVYCL 438
Query: 358 GILNGAEVGLQDLNVIGG 375
++ AE+ + N + G
Sbjct: 439 AVVKSAELNIIGQNFMTG 456
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 69/150 (46%), Gaps = 16/150 (10%)
Query: 23 SSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP 82
SS S N GS + V G +G Y V + +G P R ++ +D+GSD+ W+QC P
Sbjct: 106 SSDSRYEVNDFGSDI---VSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQ-P 161
Query: 83 CVRCVEAPHPLYRPSND----LVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSL 138
C C + P++ P+ V C +C + G H+ C YE+ Y DG +
Sbjct: 162 CKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHS----GGCRYEVMYGDGSYTK 217
Query: 139 GVLVKDAFAFNYTNGQRLNPRLALGCGYNQ 168
G L + F T + +A+GCG+
Sbjct: 218 GTLALETLTFAKT----VVRNVAMGCGHRN 243
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 68/124 (54%), Gaps = 12/124 (9%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPC 103
+G Y + + IG+P++ +++ +DTGSD+ WLQC PC C + P++ P++ + C
Sbjct: 157 SGEYFLRVGIGRPSKTFYMVIDTGSDVNWLQC-KPCDDCYQQVDPIFDPASSSSFSRLGC 215
Query: 104 EDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALG 163
+ P C +L N C Y++ Y DG ++G + +F N ++ ++A+G
Sbjct: 216 QTPQCRNLDVFACRN----DSCLYQVSYGDGSYTVGDFATETVSFG--NSGSVD-KVAIG 268
Query: 164 CGYN 167
CG++
Sbjct: 269 CGHD 272
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 75/280 (26%), Positives = 122/280 (43%), Gaps = 46/280 (16%)
Query: 17 MSSSSSSSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTW 76
++S + +S +L NH ++ LF GN + V + G P + + L LDTGS +TW
Sbjct: 99 INSKCNQYTSGNLKNHAHNNNLFDEDGN------FLVDVAFGTPPQKFKLILDTGSSITW 152
Query: 77 LQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLHAPGHHNC-EDPAQCDYELEYADGG 135
QC A CV C++ H + D + +S ++ G +C Y + Y D
Sbjct: 153 TQCKA-CVHCLKDSHRHF----------DSLASSTYSFG--SCIPSTVGNTYNMTYGDKS 199
Query: 136 SSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLH 195
+S+G D ++ + + GCG N G DG+LGLG+G+ S VSQ
Sbjct: 200 TSVGNYGCDTMTLEPSD---VFQKFQFGCGRNN-EGDFGSGADGMLGLGQGQLSTVSQTA 255
Query: 196 SQKLIRNVVGHCL--SGGGGGFLFFGDDLYDSSRVVWTSMSS-------DYTKYYSPGVA 246
S+ + V +CL G LF SS + +TS+ + + + YY +
Sbjct: 256 SK--FKKVFSYCLPEENSIGSLLFGEKATSQSSSLKFTSLVNGPGTSGLEESGYYFVKLL 313
Query: 247 ELFFGGETTGLKNLP--------VVFDSGSSYTYLNRVTY 278
++ G + N+P + DSG+ T L + Y
Sbjct: 314 DISVGNKRL---NIPSSVFASPGTIIDSGTVITRLPQRAY 350
>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
Length = 497
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 87/358 (24%), Positives = 140/358 (39%), Gaps = 69/358 (19%)
Query: 45 VYPTGY--YNVTMYIGQPARPYFLDLDTGSDLTWL---------QCDAPCVRCVEAPHPL 93
+YP Y Y T +G P +P + LDTGS LTW+ C +P V HP
Sbjct: 95 LYPHSYGGYAFTASLGTPPQPLPVLLDTGSQLTWVPCTSNYDCRNCSSPFAAAVPVFHPK 154
Query: 94 YRPSNDLVPCEDPICASLHAPGH-HNCEDP----AQCD--------YELEYADGGSSLGV 140
S+ LV C +P C +H+ H C P A C Y + Y GS+ G+
Sbjct: 155 NSSSSRLVGCRNPSCLWVHSAEHVAKCRAPCSRGANCTPASNVCPPYAVVYGS-GSTAGL 213
Query: 141 LVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLI 200
L+ D R LGC V + P G+ G G+G S+ +QL K
Sbjct: 214 LIADTL----RAPGRAVSGFVLGCSLVSV----HQPPSGLAGFGRGAPSVPAQLGLSKF- 264
Query: 201 RNVVGHCL--------SGGGGGFLFFGDDLYDSSRVVWTSMSSD---YTKYYSPGVAELF 249
+CL + G + GD+ + S + D Y YY ++ +
Sbjct: 265 ----SYCLLSRRFDDNAAVSGSLVLGGDNDGMQYVPLVKSAAGDKQPYAVYYYLALSGVT 320
Query: 250 FGGETTGLK----------NLPVVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPE 299
GG+ L + + DSG+++TYL+ +Q + + + + +
Sbjct: 321 VGGKAVRLPARAFAANAAGSGGAIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDV 380
Query: 300 DETLPL--CWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGNV 355
+E L L C+ + K++ L+L F G + +L E Y +++ + V
Sbjct: 381 EEGLGLHPCFALPQGAKSM-----ALPELSLHFKGG---AVMQLPLENYFVVAGRAPV 430
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 48/154 (31%), Positives = 70/154 (45%), Gaps = 11/154 (7%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPCE 104
G Y +++ +G P DTGSDL W QC PC +C + PL+ P + + C+
Sbjct: 91 GEYLMSLSLGTPPFEILAIADTGSDLIWTQC-TPCDKCYKQIAPLFDPKSSKTYRDLSCD 149
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN-PRLALG 163
C +L +C C Y Y D + G L D TNG + P+ +G
Sbjct: 150 TRQCQNLGE--SSSCSSEQLCQYSYYYGDRSFTNGNLAVDTVTLPSTNGGPVYFPKTVIG 207
Query: 164 CGYNQVPGASYHPLD-GILGLGKGKSSIVSQLHS 196
CG ++ D GI+GLG G S++SQ+ S
Sbjct: 208 CGRRN--NGTFDKKDSGIIGLGGGPMSLISQMGS 239
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 79/322 (24%), Positives = 128/322 (39%), Gaps = 36/322 (11%)
Query: 68 LDTGSDLTWLQC-DAPCVRCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDP 122
+DT SD+ W+QC P +C PLY P+ +PC P C L + + C
Sbjct: 173 VDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNGCSPT 232
Query: 123 A-QCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGIL 181
+C Y + Y DG ++ G V D + T + GC + V G+ + GIL
Sbjct: 233 TDECKYIVNYGDGKATTGTYVTDTLTMSPTIVVK---DFRFGCSH-AVRGSFSNQNAGIL 288
Query: 182 GLGKGKSSIVSQLHSQKLIRNVVGHCLSG-GGGGFLFFGDDLYDSSRVVWTSM--SSDYT 238
LG G+ S++ Q + N +C+ GFL G + S + +T + +
Sbjct: 289 ALGGGRGSLLEQ--TADAYGNAFSYCIPKPSSAGFLSLGGPVEASLKFSYTPLIKNKHAP 346
Query: 239 KYYSPGVAELFFGGETTGLKNLP----VVFDSGSSYTYLNRVTYQTLTSIMKKELSAKSL 294
+Y + + G+ + V DSG+ T L Y L + + ++A
Sbjct: 347 TFYIVHLEAIIVAGKQLAVPPTAFATGAVMDSGAVVTQLPPQVYAALRAAFRSAMAAYGP 406
Query: 295 KEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELTPEAYLIISNKGN 354
AP L C+ F DVK ++L F G T +L P + ++ +
Sbjct: 407 LAAPV-RNLDTCYD----FTRFPDVK--VPKVSLVFAGGAT---LDLEPASIIL-----D 451
Query: 355 VCLGILNGAEVGLQDLNVIGGI 376
CL A G + + IG +
Sbjct: 452 GCLAF--AATPGEESVGFIGNV 471
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 61/124 (49%), Gaps = 9/124 (7%)
Query: 49 GYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SNDL--VPCE 104
G Y + + +G P + +DTGSDL W QC PC C P++ P SN +PC+
Sbjct: 48 GDYLMKLTLGTPPVDVYGLVDTGSDLVWAQC-TPCQGCYRQKSPMFEPLRSNTYTPIPCD 106
Query: 105 DPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-RLALG 163
C SL H+C C Y YAD + GVL ++ F+ T+G+ + + G
Sbjct: 107 SEECNSLFG---HSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPVVVGDIVFG 163
Query: 164 CGYN 167
CG++
Sbjct: 164 CGHS 167
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 78/292 (26%), Positives = 118/292 (40%), Gaps = 47/292 (16%)
Query: 23 SSSSSSLFNHVGSSLLFQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAP 82
S +S++ + S+ L G +G Y VT+ IG P L DTGSDLTW QC+ P
Sbjct: 104 SKNSANEVSEAKSTELPAKSGITLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCE-P 162
Query: 83 CV-RCVEAPHPLYRPSNDL----VPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSS 137
C+ C P + PS+ V C P+C + NC Y + Y D +
Sbjct: 163 CLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCEDAESCSASNCV------YSIVYGDKSFT 216
Query: 138 LGVLVKDAFAFNYTNGQRLNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQ---- 193
G L K+ F TN L + GCG N G+ G +
Sbjct: 217 QGFLAKEKFTL--TNSDVLE-DVYFGCGENN---------QGLFDGVAGLLGLGPGKLSL 264
Query: 194 -LHSQKLIRNVVGHCL---SGGGGGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELF 249
+ N+ +CL + G L FG S V +T +SS + ++ G+ +
Sbjct: 265 PAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGI-SESVKFTPISS-FPSAFNYGIDII- 321
Query: 250 FGGETTGLKNLPV----------VFDSGSSYTYLNRVTYQTLTSIMKKELSA 291
G + G K L + + DSG+ +T L Y L S+ K+++S+
Sbjct: 322 --GISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTKVYAELRSVFKEKMSS 371
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 85/273 (31%), Positives = 120/273 (43%), Gaps = 32/273 (11%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDP 106
+ V + G PA+ + LDTGSDL+W+QC C P + P+ VPC P
Sbjct: 137 FVVVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDPDFDPAKSSSYAAVPCGTP 196
Query: 107 ICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGY 166
+CA+ A G N C Y ++Y DG S+ GVL +D FN ++ GCG
Sbjct: 197 VCAA--AGGMCNG---TTCLYGVQYGDGSSTTGVLSRDTLTFNSSSKFT---GFTFGCGE 248
Query: 167 NQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG--GGGGFLFFGDDLYD 224
+ + +DG+LGLG+GK S+ SQ + V +CL G+L G
Sbjct: 249 KNI--GDFGEVDGLLGLGRGKLSLPSQ--AAPSFGGVFSYCLPSYNTTPGYLNIGATKPT 304
Query: 225 SS-RVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLPVVF-------DSGSSYTYLN 274
S+ V +T+M Y +Y + + GG L P VF DSG+ TYL
Sbjct: 305 STVPVQYTAMIKKPQYPSFYFIELVSINIGGYI--LPVPPSVFTKTGTLLDSGTILTYLP 362
Query: 275 RVTYQTLTSIMKKELSAKSLKEAPEDETLPLCW 307
Y +L K + K AP E L C+
Sbjct: 363 PPAYTSLRDRFKFTMQGN--KPAPPYEPLDTCY 393
>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 434
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 66/143 (46%), Gaps = 13/143 (9%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLH 112
V++ IG P + + LDTGS L+W+QC P A PL S ++PC +C
Sbjct: 80 VSLPIGTPPQTQQMVLDTGSQLSWIQCKVPPKTPPTAFDPLLSSSFSVLPCNHSLCKP-R 138
Query: 113 APGH---HNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQV 169
P + +C+ C Y YADG + G LV++ F F + + P L LGC +
Sbjct: 139 VPDYTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTF---SSSQTTPPLILGCATDS- 194
Query: 170 PGASYHPLDGILGLGKGKSSIVS 192
GILG+ G+ S S
Sbjct: 195 -----SDTQGILGMNLGRLSFSS 212
>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
Length = 443
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/172 (29%), Positives = 77/172 (44%), Gaps = 18/172 (10%)
Query: 57 IGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN----DLVPCEDPICASLH 112
+G P+ + DTGS+L WLQC PC C P++ P+ + V + PIC ++
Sbjct: 63 LGVPSTLVYGIADTGSELIWLQC-LPCTHCYNQTPPIFDPAESYTYETVSSDSPICNAVR 121
Query: 113 APGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP-RLALGCGYNQVPG 171
E C Y+ Y DG ++ G L D FAF + L GC ++
Sbjct: 122 RISCR--EGDKSCCYQHTYGDGTTTKGTLSTDVFAFEDPTRTIVEVGYLTFGCSHDTKAR 179
Query: 172 ASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFG 219
H G++GL + +S+VSQL +K +C+ G G ++FG
Sbjct: 180 LKGHQA-GVVGLNRHPNSLVSQLKVKKF-----SYCMVIPDDHGSGSRMYFG 225
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 81/335 (24%), Positives = 137/335 (40%), Gaps = 50/335 (14%)
Query: 53 VTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDLVPCEDPICASLH 112
V + IG P L +DT SDL W+QC PC+ C P++ PS + S +
Sbjct: 87 VNISIGSPPITQLLHMDTASDLLWIQC-LPCINCYAQSLPIFDPSRSYTHRNETCRTSQY 145
Query: 113 A-PGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRL---ALGCGYNQ 168
+ P + C+Y + Y D S G+L ++ FN + + L GCG++
Sbjct: 146 SMPSLKFNANTRSCEYSMRYVDDTGSKGILAREMLLFNTIYDESSSAALHDVVFGCGHDN 205
Query: 169 VPGASYHPL--DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGG-----GFLFFGDD 221
PL GILGLG G+ S+V + + +C L GDD
Sbjct: 206 YG----EPLVGTGILGLGYGEFSLVHRFGKK------FSYCFGSLDDPSYPHNVLVLGDD 255
Query: 222 ----LYDSS---------RVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGS 268
L D++ V ++S D P +F TGL + D+G+
Sbjct: 256 GANILGDTTPLEIHNGFYYVTIEAISVD--GIILPIDPRVFNRNHQTGLGG--TIIDTGN 311
Query: 269 SYTYLNRVTYQTLTSIMKKELSAK-SLKEAPEDETLPL-CWKGRRPFKNVHDVKKCFRTL 326
S T L Y+ L + ++ + + + +D+ + + C+ G F+ V+ F +
Sbjct: 312 SLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECYNGN--FER-DLVESGFPIV 368
Query: 327 ALSFTDGK-----TRTLF-ELTPEAYLIISNKGNV 355
F++G ++LF +L+P + + GN+
Sbjct: 369 TFHFSEGAELSLDVKSLFMKLSPNVFCLAVTPGNL 403
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 76/262 (29%), Positives = 106/262 (40%), Gaps = 29/262 (11%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP----SN 98
G T Y +T+ IG PA + +DTGSD++W+QC PC +C L+ P +
Sbjct: 114 GTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCK-PCSQCHSEVDSLFDPSSSSTY 172
Query: 99 DLVPCEDPICASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNP 158
C CA L N +QC Y + Y D S+ G D T G
Sbjct: 173 SPFSCSSAPCAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTGTYSSDTL----TLGSSAMT 228
Query: 159 RLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL--SGGGGGFL 216
GC ++ G + DG++GLG G S+ SQ + +CL + G GFL
Sbjct: 229 DFQFGCSQSESGGFNDQ-TDGLMGLGGGAQSLASQ--TAGTFGTAFSYCLPPTSGSSGFL 285
Query: 217 FFGDDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLPV-------VFDSG 267
G SS V T M S+ YY + + G + NLP + DSG
Sbjct: 286 TLGT---GSSGFVKTPMLRSTQIPTYYVVLLESIKVGSQQL---NLPTSVFSAGSLMDSG 339
Query: 268 SSYTYLNRVTYQTLTSIMKKEL 289
+ T L Y L+S K +
Sbjct: 340 TIITRLPPTAYSALSSAFKAGM 361
>gi|159463556|ref|XP_001690008.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158283996|gb|EDP09746.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 547
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/185 (28%), Positives = 79/185 (42%), Gaps = 13/185 (7%)
Query: 41 VHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP---- 96
V+GNV GYY + IG P + LDTGS L C C RC + +++P
Sbjct: 71 VYGNVPELGYYYTYLTIGTPGQTVSGILDTGSTLPAFPCSG-CTRCGPSKTGMFKPELSS 129
Query: 97 SNDLVPCEDPICASLHAPGHHNCE-DPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQR 155
++ C D C G ++C + QC Y + Y +G S+ G L +D A G
Sbjct: 130 TSSTFGCSDARCFC----GANSCSCNNEQCGYSIRYLEGSSTSGFLAEDMLAVG-DGGPA 184
Query: 156 LNPRLALGCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGGGGF 215
N GC ++ DG+ G+G+ +S+ QL Q +I + C G
Sbjct: 185 AN--FVFGCAQSESGLLYSQIADGVFGMGRTPASLYGQLVQQGVIDDAFSMCFGAPREGV 242
Query: 216 LFFGD 220
L G+
Sbjct: 243 LLLGN 247
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 74/257 (28%), Positives = 107/257 (41%), Gaps = 31/257 (12%)
Query: 48 TGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL----VPC 103
+G Y V + IG P L DTGSD+ W+QC +PC C PL+ P+N VPC
Sbjct: 120 SGEYLVRVGIGSPPLEQHLVADTGSDVIWVQC-SPCSDCYAQGDPLFDPANSASFSPVPC 178
Query: 104 EDPIC-ASLHAPGHHNCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLAL 162
+C A+ +C+Y++ Y D + GVL + +G +A+
Sbjct: 179 NSGVCRAAARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTL---DGGTEVQGVAM 235
Query: 163 GCGYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLS------GGGGGFL 216
GCG+ + G+LGLG G S+V QL +CL+ G G G L
Sbjct: 236 GCGHENR--GLFAEAAGLLGLGWGPMSLVGQLGGAAG--GAFSYCLAGYYSGEGSGSGSL 291
Query: 217 FFGDDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLK----------NLPVVF 264
G + + VW + + D +Y GV L GE L+ VV
Sbjct: 292 VLGREDAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVAGERLQLQDGLFDLGDDGGGGVVM 351
Query: 265 DSGSSYTYLNRVTYQTL 281
D+G++ T L Y L
Sbjct: 352 DTGTAVTRLPAEAYAAL 368
>gi|147839328|emb|CAN63378.1| hypothetical protein VITISV_015700 [Vitis vinifera]
Length = 585
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 68/254 (26%), Positives = 110/254 (43%), Gaps = 36/254 (14%)
Query: 54 TMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSNDL------------- 100
T+ +G P + + + LDTGSDL W+ CD C RC Y +L
Sbjct: 106 TVSLGTPGKKFLVALDTGSDLFWVPCD--CSRCAPTEGTTYASDFELSIYNPKGSSTSRK 163
Query: 101 VPCEDPICASLHAPGHHN-CEDP-AQCDYELEYADGGSSL-GVLVKDAFAFNYTNGQR-- 155
V C + +CA H N C + C Y + Y +S G+LV+D + ++
Sbjct: 164 VTCNNSLCA------HRNRCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTTEDNRQEF 217
Query: 156 LNPRLALGCGYNQVPGASYHPL---DGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGGG 212
+ + GCG QV S+ + +G+ GLG K S+ S L + + C G
Sbjct: 218 VEAYVTFGCG--QVQTGSFLDIAAPNGLFGLGLEKISVPSILSKEGFTADSFSMCFGPDG 275
Query: 213 GGFLFFGDDLYDSSRVVWTSMSSDYTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTY 272
G + FGD ++++ + Y+ V ++ G L + +FDSG+S+TY
Sbjct: 276 IGRISFGDKGGPDQEETPFNLNALHPT-YNITVTQVRVGTTLIDL-DFTALFDSGTSFTY 333
Query: 273 LNRVTYQTLTSIMK 286
L Y T+++K
Sbjct: 334 LVDPIY---TNVLK 344
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 74/264 (28%), Positives = 105/264 (39%), Gaps = 30/264 (11%)
Query: 43 GNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRP--SNDL 100
G+ T Y +T+ IG PA + +DTGSD++W+ C A R + P S+
Sbjct: 117 GSALDTLAYVITVSIGTPAMTQAVMIDTGSDVSWVHCHA---RAGAGSSLFFDPGKSSTY 173
Query: 101 VP--CEDPICASLHAPGHHN-CEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLN 157
P C C L G N C + C Y + Y DG ++ G D A N T
Sbjct: 174 TPFSCSSAACTRLE--GRDNGCSLNSTCQYTVRYGDGSNTTGTYGSDTLALNSTEKVE-- 229
Query: 158 PRLALGCGYNQVPGASYHP--LDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSGG--GG 213
GC PG DG++GLG G S+VSQ + + +CL
Sbjct: 230 -NFQFGCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQ--TAATYGSAFSYCLPATTRSS 286
Query: 214 GFLFFGDDLYDSSRVVWTSM--SSDYTKYYSPGVAELFFGGETTGLKNLPVVF------D 265
GFL G +S V T M S +Y + + GG+ + P VF D
Sbjct: 287 GFLTLGAST-GTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAIS--PTVFAAGSIMD 343
Query: 266 SGSSYTYLNRVTYQTLTSIMKKEL 289
SG+ T L Y L++ + +
Sbjct: 344 SGTIITRLPPRAYSALSAAFRAGM 367
>gi|6580159|emb|CAB62657.2| putative protein [Arabidopsis thaliana]
Length = 475
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 56/193 (29%), Positives = 83/193 (43%), Gaps = 26/193 (13%)
Query: 51 YNVTMYIGQPARPYFLDLDTGSDLTWLQCD--APCVRCVE-------APHPLYRP----S 97
Y + +G P + + LDTGSDL WL C+ C+R +E P LY P +
Sbjct: 102 YYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDLEDIGVPQSVPLNLYTPNASTT 161
Query: 98 NDLVPCEDPICASLHAPGHHNCEDPAQ-CDYELEYADGGSSLGVLVKDAFAFNYTNGQRL 156
+ + C D C G C P+ C Y++ Y++ + G L++D T + L
Sbjct: 162 SSSIRCSDKRCF-----GSKKCSSPSSICPYQISYSNSTGTKGTLLQDVLHL-ATEDENL 215
Query: 157 NP---RLALGCGYNQVP-GASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG-- 210
P + LGCG Q + ++G+LGLG S+ S L + N C
Sbjct: 216 TPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFSMCFGRVI 275
Query: 211 GGGGFLFFGDDLY 223
G G + FGD Y
Sbjct: 276 GNVGRISFGDRGY 288
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 79/317 (24%), Positives = 124/317 (39%), Gaps = 32/317 (10%)
Query: 55 MYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSND----LVPCEDPICAS 110
M +G PA Y + +DTGS LTWLQC V C P++ P + V C C+
Sbjct: 1 MGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSD 60
Query: 111 LHAPGHH--NCEDPAQCDYELEYADGGSSLGVLVKDAFAFNYTNGQRLNPRLALGCGYNQ 168
L + + C C Y+ Y D S+G L KD +F T+ P GCG +
Sbjct: 61 LPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS----LPNFYYGCGQDN 116
Query: 169 VPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCL----SGGGGGFLFFGDDLYD 224
+ G++GL + K S++ QL + +CL S G + Y
Sbjct: 117 --EGLFGRSAGLIGLARNKLSLLYQLAPS--LGYSFTYCLPSSSSSGYLSLGSYNPGQYS 172
Query: 225 SSRVVWTSMSSD--YTKYYSPGVAELFFGGETTGLKNLPVVFDSGSSYTYLNRVTYQTLT 282
+ +V +S+ + K VA ++ +LP + DSG+ T L Y L+
Sbjct: 173 YTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRLPTSVYSALS 232
Query: 283 SIMKKELSAKSLKEAPEDETLPLCWKGRRPFKNVHDVKKCFRTLALSFTDGKTRTLFELT 342
+ + K A L C+KG+ + V +SF G +L+
Sbjct: 233 KAVAAAM--KGTSRASAYSILDTCFKGQASRVSAPAVT-------MSFAGGAA---LKLS 280
Query: 343 PEAYLIISNKGNVCLGI 359
+ L+ + CL
Sbjct: 281 AQNLLVDVDDSTTCLAF 297
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 87/321 (27%), Positives = 125/321 (38%), Gaps = 71/321 (22%)
Query: 39 FQVHGNVYPTGYYNVTMYIGQPARPYFLDLDTGSDLTWLQCDAPCVRCVEAPHPLYRPSN 98
+ H NV V++ +G P + + LDTGS+L+WL C R A +RP
Sbjct: 53 LRFHHNVS----LTVSLAVGTPPQNVTMVLDTGSELSWLLCAT--GRAAAAAADSFRPRA 106
Query: 99 D----LVPCEDPICASLHAPGHHNCEDPA-QCDYELEYADGGSSLGVLVKDAFAFNYTNG 153
VPC C+S P +C+ + +C L YADG +S G L D FA G
Sbjct: 107 SATFAAVPCGSARCSSRDLPAPPSCDAASRRCRVSLSYADGSASDGALATDVFAV----G 162
Query: 154 QRLNPRLALGC---GYNQVPGASYHPLDGILGLGKGKSSIVSQLHSQKLIRNVVGHCLSG 210
R A GC Y+ P A G+LG+ +G S V+Q +++ +C+S
Sbjct: 163 DAPPLRSAFGCMSAAYDSSPDAVAT--AGLLGMNRGALSFVTQASTRRF-----SYCISD 215
Query: 211 -GGGGFLFFG-DDLYDSSRVVWTSMSSDYTKYYSPGVAELFFG---------GETTGLKN 259
G L G DL + +YT Y P +F G G K
Sbjct: 216 RDDAGVLLLGHSDL--------PFLPLNYTPLYQPTPPLPYFDRVAYSVQLLGIRVGGKP 267
Query: 260 LPV---------------VFDSGSSYTYLNRVTYQTLTSIMKKELSAKSLKEAPEDETLP 304
LP+ + DSG+ +T+L Y + + K+ K L A ED +
Sbjct: 268 LPIPPSVLAPDHTGAGQTMVDSGTQFTFLLGDAYSAVKAEFLKQ--TKPLLPALEDPSF- 324
Query: 305 LCWKGRRPFKNVHDVKKCFRT 325
F+ D CFR
Sbjct: 325 -------AFQEAFDT--CFRV 336
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.137 0.426
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,704,231,719
Number of Sequences: 23463169
Number of extensions: 311035630
Number of successful extensions: 782344
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 577
Number of HSP's successfully gapped in prelim test: 1262
Number of HSP's that attempted gapping in prelim test: 778031
Number of HSP's gapped (non-prelim): 2095
length of query: 380
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 236
effective length of database: 8,980,499,031
effective search space: 2119397771316
effective search space used: 2119397771316
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)