BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 014597
(422 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
Length = 536
Score = 469 bits (1206), Expect = e-129, Method: Compositional matrix adjust.
Identities = 224/364 (61%), Positives = 286/364 (78%), Gaps = 2/364 (0%)
Query: 35 SIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST-EDTSSSGYL 93
+I DR+LSEY PS SS+S+++SC H LC+ S+CK+ KDPCPYI +Y E+T+S+G+L
Sbjct: 149 NISLDRDLSEYSPSLSSTSRHLSCDHQLCEWGSNCKNPKDPCPYIFNYDDFENTTSAGFL 208
Query: 94 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
V+D LHLAS H + +Q+SV++GCGRKQ GS+ DGAAPDGVMGLG GD+SVPSLLAK
Sbjct: 209 VEDKLHLASVGDHTARKMLQASVVLGCGRKQGGSFFDGAAPDGVMGLGPGDISVPSLLAK 268
Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
AGLIQN FS+CFDENDSG + FGD+G A+QQST FLPI Y AYFVGVESYC+GNSCL
Sbjct: 269 AGLIQNCFSLCFDENDSGRILFGDRGHASQQSTPFLPIQGTYVAYFVGVESYCVGNSCLK 328
Query: 214 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 273
+SGF+ALVDSG+SFT+LP+E+Y E+V +FDK V++KRIS Q W YCYNASS+E+ +P
Sbjct: 329 RSGFKALVDSGSSFTYLPSEVYNELVSEFDKQVNAKRISFQDGLWDYCYNASSQELHDIP 388
Query: 274 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDREN 333
++L F +NQ+FVV N +S P ++GFT+FCL++ TDG YGIIGQNFM+G+R+VFD EN
Sbjct: 389 AIQLKFPRNQNFVVHNPTYSIPHHQGFTMFCLSLQPTDGSYGIIGQNFMIGYRMVFDIEN 448
Query: 334 LKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKS 393
LKL WS+S C++ D + VHL PPP +SPNPLPT EQQS + AP +T+ S+S
Sbjct: 449 LKLGWSNSSCQDTSDSADVHLAPPPDNKSPNPLPTNEQQSIPRTPSVAPAVAGRTS-SES 507
Query: 394 IAAS 397
AAS
Sbjct: 508 SAAS 511
>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 535
Score = 462 bits (1189), Expect = e-127, Method: Compositional matrix adjust.
Identities = 218/360 (60%), Positives = 277/360 (76%), Gaps = 2/360 (0%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
DR+LSEY PS S++S+++SC+H LC+ S CK+LKDPCPYIADY+ +TSSSG+LV+DIL
Sbjct: 147 DRDLSEYRPSLSTTSRHLSCNHQLCELGSHCKNLKDPCPYIADYADPNTSSSGFLVEDIL 206
Query: 99 HLASFSK--HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
HLAS S ++ Q VQ+SVI+GCGRKQTG YLDGAAPDGVMGLG G +SVPSLLAKAGL
Sbjct: 207 HLASVSDDSNSTQKRVQASVILGCGRKQTGGYLDGAAPDGVMGLGPGSISVPSLLAKAGL 266
Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
I+ SFS+CFD N SG++ FGDQG +Q+ST LP YDAY + VESYC+GNSCL QSG
Sbjct: 267 IRKSFSLCFDVNGSGTILFGDQGHTSQKSTPLLPTQGNYDAYLIEVESYCVGNSCLKQSG 326
Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 276
F+ALVDSGASFT+LP ++Y ++V++FDK V+++RIS QG W YCYN SS+++ VP MR
Sbjct: 327 FKALVDSGASFTYLPIDVYNKIVLEFDKQVNAQRISSQGGPWNYCYNTSSKQLDNVPAMR 386
Query: 277 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 336
L F NQS ++ N + P+N+ F VFCLT+ TD +YGIIGQN+M G+R+VFD ENLKL
Sbjct: 387 LSFLMNQSLLIHNSTYYVPQNQEFAVFCLTLQPTDLNYGIIGQNYMTGYRVVFDMENLKL 446
Query: 337 AWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAA 396
WS S C+++ D++ V L P P QSPNPLPT EQQS N Q AP +T+ S+A+
Sbjct: 447 GWSSSNCKDISDETEVTLAPSPNDQSPNPLPTNEQQSVPNKQGVAPAVAGRTSSKHSVAS 506
>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 531
Score = 445 bits (1145), Expect = e-122, Method: Compositional matrix adjust.
Identities = 212/359 (59%), Positives = 275/359 (76%), Gaps = 2/359 (0%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
R+L+EY PS SS+SK +SC+ LC+ S CKS KDPCPY+A Y +E+TSSSG L++D LH
Sbjct: 149 RDLNEYSPSLSSTSKPLSCNDQLCELGSDCKSSKDPCPYLASYYSENTSSSGLLIEDRLH 208
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
LA FS+HA +SSV +SVIIGCGRKQ+G++ DGAAPDG+MGLG GD+SVPSLLAKAGL++N
Sbjct: 209 LAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPSLLAKAGLVRN 268
Query: 160 SFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQA 219
+FSICFD+N SG++ FGDQG TQ+STSF+P+ K+ Y + VE Y +G+S L +GFQA
Sbjct: 269 TFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVGSSSLKTAGFQA 328
Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 279
LVDSG SFTFLP EIY ++VV+FDK V++ R S +G+ WKYCYN+SS+E+L +P + L+F
Sbjct: 329 LVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSPWKYCYNSSSQELLNIPTVTLVF 388
Query: 280 SKNQSFVVRNHIFSF-PENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
+ NQSF+V N + ENE F VFCL + ++GIIGQNFM G+R+VFDRENLKL W
Sbjct: 389 AMNQSFIVHNPVIKLISENEEFNVFCLPIQPIHEEFGIIGQNFMWGYRMVFDRENLKLGW 448
Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAAS 397
S S C+++ D +HL PPP +SPNPLPT +QQ T + A AP +T P+KS A S
Sbjct: 449 STSNCQDITDGKIMHLTPPPNDRSPNPLPTNQQQMTPSRHAVAPAVAGRT-PAKSAAVS 506
>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 445 bits (1145), Expect = e-122, Method: Compositional matrix adjust.
Identities = 212/359 (59%), Positives = 275/359 (76%), Gaps = 2/359 (0%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
R+L+EY PS SS+SK +SC+ LC+ S CKS KDPCPY+A Y +E+TSSSG L++D LH
Sbjct: 139 RDLNEYSPSLSSTSKPLSCNDQLCELGSDCKSSKDPCPYLASYYSENTSSSGLLIEDRLH 198
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
LA FS+HA +SSV +SVIIGCGRKQ+G++ DGAAPDG+MGLG GD+SVPSLLAKAGL++N
Sbjct: 199 LAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPSLLAKAGLVRN 258
Query: 160 SFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQA 219
+FSICFD+N SG++ FGDQG TQ+STSF+P+ K+ Y + VE Y +G+S L +GFQA
Sbjct: 259 TFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVGSSSLKTAGFQA 318
Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 279
LVDSG SFTFLP EIY ++VV+FDK V++ R S +G+ WKYCYN+SS+E+L +P + L+F
Sbjct: 319 LVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSPWKYCYNSSSQELLNIPTVTLVF 378
Query: 280 SKNQSFVVRNHIFSF-PENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
+ NQSF+V N + ENE F VFCL + ++GIIGQNFM G+R+VFDRENLKL W
Sbjct: 379 AMNQSFIVHNPVIKLISENEEFNVFCLPIQPIHEEFGIIGQNFMWGYRMVFDRENLKLGW 438
Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAAS 397
S S C+++ D +HL PPP +SPNPLPT +QQ T + A AP +T P+KS A S
Sbjct: 439 STSNCQDITDGKIMHLTPPPNDRSPNPLPTNQQQMTPSRHAVAPAVAGRT-PAKSAAVS 496
>gi|356551638|ref|XP_003544181.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 880
Score = 427 bits (1097), Expect = e-117, Method: Compositional matrix adjust.
Identities = 206/384 (53%), Positives = 275/384 (71%), Gaps = 6/384 (1%)
Query: 12 NAYNALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKS 71
+A + +L +P + L G V DR+L++Y PS S++S+++ C H LC S CK
Sbjct: 123 DAGSDMLWVPCDCIECASLSAGNYNVLDRDLNQYRPSLSNTSRHLPCGHKLCDVHSVCKG 182
Query: 72 LKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG 131
KDPCPY YS+ +TSSSGY+ +D LHL S KHA Q+SVQ+S+I+GCGRKQTG YL G
Sbjct: 183 SKDPCPYAVQYSSANTSSSGYVFEDKLHLTSNGKHAEQNSVQASIILGCGRKQTGEYLRG 242
Query: 132 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPI 191
A PDGV+GLG G++SVPSLLAKAGLIQNSFSICF+EN+SG + FGDQG TQ ST FLPI
Sbjct: 243 AGPDGVLGLGPGNISVPSLLAKAGLIQNSFSICFEENESGRIIFGDQGHVTQHSTPFLPI 302
Query: 192 GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
K++AY VGVES+C+G+ CL ++ FQAL+DSG+SFTFLP E+Y +VV++FDK V++ I
Sbjct: 303 DGKFNAYIVGVESFCVGSLCLKETRFQALIDSGSSFTFLPNEVYQKVVIEFDKQVNATSI 362
Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD 311
LQ NSW+YCYNASS+E++ +P + L FS+NQ+++++N IF P ++ +T+FCL V +D
Sbjct: 363 VLQ-NSWEYCYNASSQELISIPPLNLAFSRNQTYLIQNPIFIDPASQEYTIFCLPVSPSD 421
Query: 312 GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQ 371
DY IGQNF+MG+R+VFDRENL+ +WS C++ S P + SPNPLP +Q
Sbjct: 422 DDYAAIGQNFLMGYRMVFDRENLRFSWSRWNCQDRASFS-----SPYSVGSPNPLPVDQQ 476
Query: 372 QSTSNGQAAAPPSTAKTAPSKSIA 395
QS N P T+P S A
Sbjct: 477 QSFPNAHGIPPAIAGHTSPKPSAA 500
>gi|296082464|emb|CBI21469.3| unnamed protein product [Vitis vinifera]
Length = 530
Score = 423 bits (1088), Expect = e-116, Method: Compositional matrix adjust.
Identities = 205/361 (56%), Positives = 263/361 (72%), Gaps = 1/361 (0%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
DR+L++Y PS SS+SK++SCSH LC+S +C S K CPY +Y +E+TSSSG L++DIL
Sbjct: 145 DRDLNQYSPSGSSTSKHLSCSHQLCESSPNCDSPKQLCPYTINYYSENTSSSGLLIEDIL 204
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
HL S A SSV++ VIIGCG +QTG YLDG APDG+MGLGLG++SVPS L+KAGL++
Sbjct: 205 HLTSGIDDASNSSVRAPVIIGCGMRQTGGYLDGVAPDGLMGLGLGEISVPSFLSKAGLVK 264
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
NSFS+CF+++DSG +FFGDQG ATQQ+T FLP KY+ Y VGVE+ CIG+SC+ Q+ F+
Sbjct: 265 NSFSLCFNDDDSGRIFFGDQGLATQQTTLFLPSDGKYETYIVGVEACCIGSSCIKQTSFR 324
Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
ALVDSGASFTFLP E Y VV +FDK V++ R S +G W+YCY +SS+E+LK P + L
Sbjct: 325 ALVDSGASFTFLPDESYRNVVDEFDKQVNATRFSFEGYPWEYCYKSSSKELLKNPSVILK 384
Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
F+ N SFVV N +F +G FCL + DGD GI+GQNFM G+R+VFDRENLKL W
Sbjct: 385 FALNNSFVVHNPVFVVHGYQGVVGFCLAIQPADGDIGILGQNFMTGYRMVFDRENLKLGW 444
Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAASA 398
S S C+++ D + L P P + PNPLP EQQ+T +G P+ A APS AAS
Sbjct: 445 SRSNCQDLTDGERMPLTPSPNDRPPNPLPANEQQNTHSGHTIT-PAVAGRAPSNPSAAST 503
Query: 399 Q 399
Q
Sbjct: 504 Q 504
>gi|225438629|ref|XP_002281243.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 511
Score = 423 bits (1087), Expect = e-115, Method: Compositional matrix adjust.
Identities = 205/361 (56%), Positives = 263/361 (72%), Gaps = 1/361 (0%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
DR+L++Y PS SS+SK++SCSH LC+S +C S K CPY +Y +E+TSSSG L++DIL
Sbjct: 126 DRDLNQYSPSGSSTSKHLSCSHQLCESSPNCDSPKQLCPYTINYYSENTSSSGLLIEDIL 185
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
HL S A SSV++ VIIGCG +QTG YLDG APDG+MGLGLG++SVPS L+KAGL++
Sbjct: 186 HLTSGIDDASNSSVRAPVIIGCGMRQTGGYLDGVAPDGLMGLGLGEISVPSFLSKAGLVK 245
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
NSFS+CF+++DSG +FFGDQG ATQQ+T FLP KY+ Y VGVE+ CIG+SC+ Q+ F+
Sbjct: 246 NSFSLCFNDDDSGRIFFGDQGLATQQTTLFLPSDGKYETYIVGVEACCIGSSCIKQTSFR 305
Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
ALVDSGASFTFLP E Y VV +FDK V++ R S +G W+YCY +SS+E+LK P + L
Sbjct: 306 ALVDSGASFTFLPDESYRNVVDEFDKQVNATRFSFEGYPWEYCYKSSSKELLKNPSVILK 365
Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
F+ N SFVV N +F +G FCL + DGD GI+GQNFM G+R+VFDRENLKL W
Sbjct: 366 FALNNSFVVHNPVFVVHGYQGVVGFCLAIQPADGDIGILGQNFMTGYRMVFDRENLKLGW 425
Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAASA 398
S S C+++ D + L P P + PNPLP EQQ+T +G P+ A APS AAS
Sbjct: 426 SRSNCQDLTDGERMPLTPSPNDRPPNPLPANEQQNTHSGHTIT-PAVAGRAPSNPSAAST 484
Query: 399 Q 399
Q
Sbjct: 485 Q 485
>gi|255576176|ref|XP_002528982.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531572|gb|EEF33401.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 542
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 207/365 (56%), Positives = 260/365 (71%), Gaps = 2/365 (0%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
DR+L+EY PS SS+SK++SCSH LC+ +C S K PCPY DY TE+TSSSG LV+DIL
Sbjct: 158 DRDLNEYSPSHSSTSKHLSCSHQLCELGPNCNSPKQPCPYSMDYYTENTSSSGLLVEDIL 217
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
HLAS +A SV++ V+IGCG KQ+G YLDG APDG+MGLGL ++SVPS LAKAGLI+
Sbjct: 218 HLASNGDNALSYSVRAPVVIGCGMKQSGGYLDGVAPDGLMGLGLAEISVPSFLAKAGLIR 277
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
NSFS+CFDE+DSG +FFGDQGP TQQST FL + Y Y VGVE +C+G+SCL Q+ F+
Sbjct: 278 NSFSMCFDEDDSGRIFFGDQGPTTQQSTPFLTLDGNYTTYVVGVEGFCVGSSCLKQTSFR 337
Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
ALVD+G SFTFLP +Y + +FD+ V++ S G WKYCY +SS + KVP ++LI
Sbjct: 338 ALVDTGTSFTFLPNGVYERITEEFDRQVNATISSFNGYPWKYCYKSSSNHLTKVPSVKLI 397
Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
F N SFV+ N +F +G T FCL + T+GD G IGQNFM G+R+VFDREN+KL W
Sbjct: 398 FPLNNSFVIHNPVFMIYGIQGITGFCLAIQPTEGDIGTIGQNFMAGYRVVFDRENMKLGW 457
Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAASA 398
SHS CE+ + + L P G NPLPT EQQS+ G A + P+ A APSK AA+
Sbjct: 458 SHSSCEDRSNDKRMPLT-SPNGTLVNPLPTNEQQSSPGGHAVS-PAVAGRAPSKPSAAAV 515
Query: 399 QQLDS 403
Q L S
Sbjct: 516 QLLPS 520
>gi|356548395|ref|XP_003542587.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 525
Score = 417 bits (1071), Expect = e-114, Method: Compositional matrix adjust.
Identities = 208/404 (51%), Positives = 278/404 (68%), Gaps = 14/404 (3%)
Query: 17 LLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPC 76
+L +P + L G V DR+L++Y PS S++S+++ C H LC S CK KDPC
Sbjct: 128 MLWVPCDCIECASLSAGNYNVLDRDLNQYRPSLSNTSRHLPCGHKLCDVHSFCKGSKDPC 187
Query: 77 PYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDG 136
PY Y++ +TSSSGY+ +D LHL S KHA Q+SVQ+S+I+GCGRKQTG YL GA PDG
Sbjct: 188 PYEVQYASANTSSSGYVFEDKLHLTSDGKHAEQNSVQASIILGCGRKQTGDYLHGAGPDG 247
Query: 137 VMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYD 196
V+GLG G++SVPSLLAKAGLIQNSFSIC DEN+SG + FGDQG TQ ST FLPI
Sbjct: 248 VLGLGPGNISVPSLLAKAGLIQNSFSICLDENESGRIIFGDQGHVTQHSTPFLPI----I 303
Query: 197 AYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN 256
AY VGVES+C+G+ CL ++ FQAL+DSG+SFTFLP E+Y +VV +FDK V++ RI LQ +
Sbjct: 304 AYMVGVESFCVGSLCLKETRFQALIDSGSSFTFLPNEVYQKVVTEFDKQVNASRIVLQ-S 362
Query: 257 SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFP--ENEGFTVFCLTVMSTDGDY 314
SW+YCYNASS+E++ +P ++L FS+NQ+F+++N IF P + + +T+FCL V + DY
Sbjct: 363 SWEYCYNASSQELVNIPPLKLAFSRNQTFLIQNPIFYDPASQEQEYTIFCLPVSPSADDY 422
Query: 315 GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQST 374
IGQNF+MG+R+VFDRENL+ WS C++ P G SPNPLP +QQ+
Sbjct: 423 AAIGQNFLMGYRLVFDRENLRFGWSRWNCQD-----RASFTSPSNGGSPNPLPANQQQTV 477
Query: 375 SNGQAAAPPSTAKTAPSKSIAASAQQLDSVLRVACSLLVLMCLL 418
N + P T+P S A L + R + + L+L+C L
Sbjct: 478 PNARGVPPAIAGHTSPKPSAATPG--LVTTSRHSLASLLLICHL 519
>gi|224083757|ref|XP_002307112.1| predicted protein [Populus trichocarpa]
gi|222856561|gb|EEE94108.1| predicted protein [Populus trichocarpa]
Length = 492
Score = 407 bits (1046), Expect = e-111, Method: Compositional matrix adjust.
Identities = 202/353 (57%), Positives = 248/353 (70%), Gaps = 3/353 (0%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
DR+LSEY PS SS+SK +SCSH LC +CK+ K CPY +Y TE TSSSG LV+DI+
Sbjct: 143 DRDLSEYSPSQSSTSKQLSCSHRLCDMGPNCKNPKQSCPYSINYYTESTSSSGLLVEDII 202
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
HLAS +SV++ VIIGCG KQ+G YLDG APDG++GLGL ++SVPS LAKAGLIQ
Sbjct: 203 HLASGGDDTLNTSVKAPVIIGCGMKQSGGYLDGVAPDGLLGLGLQEISVPSFLAKAGLIQ 262
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
NSFS+CF+E+DSG +FFGDQGPATQQS FL + Y Y VGVE C+G SCL QS F
Sbjct: 263 NSFSMCFNEDDSGRIFFGDQGPATQQSAPFLKLNGNYTTYIVGVEVCCVGTSCLKQSSFS 322
Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
ALVDSG SFTFLP +++ + +FD V++ R S +G SWKYCY SS+++ K+P +RLI
Sbjct: 323 ALVDSGTSFTFLPDDVFEMIAEEFDTQVNASRSSFEGYSWKYCYKTSSQDLPKIPSLRLI 382
Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
F +N SF+V+N +F +G FCL + DGD G IGQNFMMG+R+VFDRENLKL W
Sbjct: 383 FPQNNSFMVQNPVFMIYGIQGVIGFCLAIQPADGDIGTIGQNFMMGYRVVFDRENLKLGW 442
Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPS 391
S S CE L P+G NPLPT EQQST G A + P+ A APS
Sbjct: 443 SRSNCE--FSGISYTLPLTPSGTPQNPLPTNEQQSTPGGHAVS-PAVAVNAPS 492
>gi|449445106|ref|XP_004140314.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449479851|ref|XP_004155727.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 523
Score = 404 bits (1037), Expect = e-110, Method: Compositional matrix adjust.
Identities = 191/357 (53%), Positives = 260/357 (72%), Gaps = 13/357 (3%)
Query: 37 VQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
V DR+LSEY+P+ SS+SK++ C H LC ++CKS DPC Y DY +++TS+SG++++D
Sbjct: 146 VLDRDLSEYNPALSSTSKHLFCGHQLCAWSTTCKSANDPCTYKRDYYSDNTSTSGFMIED 205
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
L L SFSKH S +Q+SV+ GCGRKQ+GSYLDGAAPDGVMGLG G++SVP+LLA+ GL
Sbjct: 206 KLQLTSFSKHGTHSLLQASVVFGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGL 265
Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
++N+FS+CFD N SG + FGD GPATQQ+T FLP+ ++ AYF+GVES+C+G+SCL +SG
Sbjct: 266 VRNTFSLCFDNNGSGRILFGDDGPATQQTTQFLPLFGEFAAYFIGVESFCVGSSCLQRSG 325
Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKL--VSSKRISLQGNSWKYCYNASSEEMLKVPD 274
FQALVDSG+SFT+LP E+Y ++V +FDK V++ RI L+ W YCYN S+ +P
Sbjct: 326 FQALVDSGSSFTYLPAEVYKKIVFEFDKQVKVNATRIVLRELPWNYCYNISTLVSFNIPS 385
Query: 275 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 334
M+L+F NQ F + + ++ P N+G+ VFCLT+ TD DYG+IGQN M+G+R+VFDRENL
Sbjct: 386 MQLVFPLNQIF-IHDPVYVLPANQGYKVFCLTLEETDEDYGVIGQNLMVGYRMVFDRENL 444
Query: 335 KLAWSHSKCEEVIDKSHVHLVPPP---AGQSPNPLPTTEQQSTSNGQAAAPPSTAKT 388
KL WS SKC ++ + H PP +SP LP T +Q A P+ A+T
Sbjct: 445 KLGWSKSKCLDINSSTTEHAKPPSNNGNAKSPIALPPTNRQ-------AIAPTAART 494
>gi|356538031|ref|XP_003537508.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 521
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 201/365 (55%), Positives = 257/365 (70%), Gaps = 5/365 (1%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
DR+L+EY PS S SSK++SCSH LC S+CKS + CPY+ Y +E+TSSSG LV+DIL
Sbjct: 142 DRDLNEYSPSRSLSSKHLSCSHRLCDKGSNCKSSQQQCPYMVSYLSENTSSSGLLVEDIL 201
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
HL S + SSVQ+ V++GCG KQ+G YLDG APDG++GLG G+ SVPS LAK+GLI
Sbjct: 202 HLQSGGTLS-NSSVQAPVVLGCGMKQSGGYLDGVAPDGLLGLGPGESSVPSFLAKSGLIH 260
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
SFS+CF+E+DSG +FFGDQGP +QQSTSFLP+ Y Y +GVES CIGNSCL + F+
Sbjct: 261 YSFSLCFNEDDSGRMFFGDQGPTSQQSTSFLPLDGLYSTYIIGVESCCIGNSCLKMTSFK 320
Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
A VDSG SFTFLP +Y + +FD+ V+ R S +G+ W+YCY SS+++ KVP L+
Sbjct: 321 AQVDSGTSFTFLPGHVYGAITEEFDQQVNGSRSSFEGSPWEYCYVPSSQDLPKVPSFTLM 380
Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
F +N SFVV + +F F NEG FCL ++ T+GD G IGQNFM G+R+VFDR N KLAW
Sbjct: 381 FQRNNSFVVYDPVFVFYGNEGVIGFCLAILPTEGDMGTIGQNFMTGYRLVFDRGNKKLAW 440
Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAASA 398
S S C+++ + L P S NPLPT EQQ T NG A A P+ A AP K AAS+
Sbjct: 441 SRSNCQDLSLGKRMPLSPNET--SSNPLPTDEQQRT-NGHAVA-PAVAGRAPHKPSAASS 496
Query: 399 QQLDS 403
+ + S
Sbjct: 497 RMISS 501
>gi|449451627|ref|XP_004143563.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 532
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 192/360 (53%), Positives = 262/360 (72%), Gaps = 3/360 (0%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
D++L+EY PSSSS+SK++SCSH LC S SC+S K CPY+ DY TE+TSSSG L+ D+L
Sbjct: 148 DKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYITENTSSSGLLIQDVL 207
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
HL+S +++ ++Q+ VI+GCG KQ+G YL G APDG+ GLGLG++SV S LAK L+Q
Sbjct: 208 HLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQ 267
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
NSFS+CF+E+ SG +FFGD+GPA+QQ+TSF+P+ KY+ Y VGVE+ CI NSCL Q+ F+
Sbjct: 268 NSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENSCLKQTSFK 327
Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 277
AL+DSG SFT+LP E Y +V++FDK L ++ +S +G WKYCY S++ M KVP + L
Sbjct: 328 ALIDSGTSFTYLPEEAYENIVIEFDKRLNTTSAVSFKGYPWKYCYKISADAMPKVPSVTL 387
Query: 278 IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
+F N SFVV + +F ++G FC ++ DGD GI+GQN+M G+R+VFDR+NLKL
Sbjct: 388 LFPLNNSFVVHDPVFPIYGDQGLAGFCFAILPADGDIGILGQNYMTGYRMVFDRDNLKLG 447
Query: 338 WSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAAS 397
WSH+ C+++ ++ + L P PNPLP EQQS S G A A P+ A APSK AA+
Sbjct: 448 WSHANCQDLSNEKKMPLTPAKE-TPPNPLPADEQQSASGGHAVA-PAVAGRAPSKPSAAT 505
>gi|356567798|ref|XP_003552102.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 520
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 199/365 (54%), Positives = 254/365 (69%), Gaps = 5/365 (1%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
DR+L+EY PS S SSK++SCSH LC S+CKS + CPY+ Y +E+TSSSG LV+DIL
Sbjct: 141 DRDLNEYSPSRSLSSKHLSCSHQLCDKGSNCKSSQQQCPYMVSYLSENTSSSGLLVEDIL 200
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
HL S + SSVQ+ V++GCG KQ+G YLDG APDG++GLG G+ SVPS LAK+GLI
Sbjct: 201 HLQSGGSLS-NSSVQAPVVLGCGMKQSGGYLDGVAPDGLLGLGPGESSVPSFLAKSGLIH 259
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
+SFS+CF+E+DSG +FFGDQGP QQSTSFLP+ Y Y +GVES C+GNSCL + F+
Sbjct: 260 DSFSLCFNEDDSGRIFFGDQGPTIQQSTSFLPLDGLYSTYIIGVESCCVGNSCLKMTSFK 319
Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
VDSG SFTFLP +Y + +FD+ V+ R S +G+ W+YCY SS+E+ KVP + L
Sbjct: 320 VQVDSGTSFTFLPGHVYGAIAEEFDQQVNGSRSSFEGSPWEYCYVPSSQELPKVPSLTLT 379
Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
F +N SFVV + +F F NEG FCL + T+GD G IGQNFM G+R+VFDR N KLAW
Sbjct: 380 FQQNNSFVVYDPVFVFYGNEGVIGFCLAIQPTEGDMGTIGQNFMTGYRLVFDRGNKKLAW 439
Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAASA 398
S S C+++ + L P S NPLPT EQQ T NG A A P+ A AP K AA +
Sbjct: 440 SRSNCQDLSLGKRMPLSPNET--SSNPLPTDEQQRT-NGHAVA-PAVAGRAPHKPSAAPS 495
Query: 399 QQLDS 403
+ + S
Sbjct: 496 RMISS 500
>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
Length = 632
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 197/385 (51%), Positives = 267/385 (69%), Gaps = 18/385 (4%)
Query: 15 NALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD 74
N + C P+++ + S + ++L+E+DPS+S++SK CSH LC+S +C+S K+
Sbjct: 126 NCVQCAPLSSAYY-------SSLATKDLNEFDPSASTTSKVFPCSHKLCESAPACESPKE 178
Query: 75 PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP 134
CPY Y++E+TSSSG LV+D+LHLA +S +A SSV++ V++GCG KQ+G +L G AP
Sbjct: 179 QCPYTVTYASENTSSSGLLVEDVLHLA-YSANA-SSSVKARVVVGCGEKQSGEFLKGIAP 236
Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK 194
DGVMGLG G++SVPS LAKAGL++NSFS+CFDE DSG ++FGD GP+TQQST FLP +
Sbjct: 237 DGVMGLGPGEISVPSFLAKAGLMRNSFSMCFDEEDSGRIYFGDVGPSTQQSTRFLPYKNE 296
Query: 195 YDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 254
+ AYFVGVE C+GNSCL QS F L+DSG SFTFLP EIY EV ++ D +++ ++
Sbjct: 297 FVAYFVGVEVCCVGNSCLKQSSFTTLIDSGQSFTFLPEEIYREVALEIDSHINATVKKIE 356
Query: 255 GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV-MSTDGD 313
G W+YCY S E KVP ++L FS N +FV+ +F +EG FCL + S +G
Sbjct: 357 GGPWEYCYETSFEP--KVPAIKLKFSSNNTFVIHKPLFVLQRSEGLVQFCLPISASEEGT 414
Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDK-SHVHLVPPPAGQSPNPLPTTEQQ 372
G+IGQN+M G+RIVFDREN+KL WS SKC+E DK + P + SPNPLPT EQQ
Sbjct: 415 GGVIGQNYMAGYRIVFDRENMKLGWSASKCQE--DKIAPPQEASPGSTSSPNPLPTEEQQ 472
Query: 373 STSNGQAAAPPSTAKTAPSKSIAAS 397
S ++ A P+ A PSK+ +AS
Sbjct: 473 SRTH---AVSPAIAGKTPSKTSSAS 494
>gi|15238055|ref|NP_196570.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
gi|75180764|sp|Q9LX20.1|ASPL1_ARATH RecName: Full=Aspartic proteinase-like protein 1; Flags: Precursor
gi|7960727|emb|CAB92049.1| putative protein [Arabidopsis thaliana]
gi|332004108|gb|AED91491.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
Length = 528
Score = 365 bits (937), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 189/374 (50%), Positives = 255/374 (68%), Gaps = 19/374 (5%)
Query: 15 NALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD 74
N + C P+T+ + S + ++L+EY+PSSSS+SK CSH LC S S C+S K+
Sbjct: 129 NCVQCAPLTSTYY-------SSLATKDLNEYNPSSSSTSKVFLCSHKLCDSASDCESPKE 181
Query: 75 PCPYIADYSTEDTSSSGYLVDDILHLASFSKHA---PQSSVQSSVIIGCGRKQTGSYLDG 131
CPY +Y + +TSSSG LV+DILHL + + SSV++ V+IGCG+KQ+G YLDG
Sbjct: 182 QCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDG 241
Query: 132 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPI 191
APDG+MGLG ++SVPS L+KAGL++NSFS+CFDE DSG ++FGD GP+ QQST FL +
Sbjct: 242 VAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQL 301
Query: 192 -GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 250
KY Y VGVE+ CIGNSCL Q+ F +DSG SFT+LP EIY +V ++ D+ +++
Sbjct: 302 DNNKYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHINATS 361
Query: 251 ISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST 310
+ +G SW+YCY +S+E KVP ++L FS N +FV+ +F F +++G FCL + S
Sbjct: 362 KNFEGVSWEYCYESSAEP--KVPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLPI-SP 418
Query: 311 DGDYGI--IGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPT 368
G GI IGQN+M G+R+VFDREN+KL WS SKC+E DK P + SPNPLPT
Sbjct: 419 SGQEGIGSIGQNYMRGYRMVFDRENMKLGWSPSKCQE--DKIEPPQASPGSTSSPNPLPT 476
Query: 369 TEQQSTSNGQAAAP 382
EQQS G A +P
Sbjct: 477 DEQQSR-GGHAVSP 489
>gi|218191589|gb|EEC74016.1| hypothetical protein OsI_08957 [Oryza sativa Indica Group]
Length = 520
Score = 363 bits (933), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 178/364 (48%), Positives = 246/364 (67%), Gaps = 8/364 (2%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
DR+L Y PS S++S+++ CSH LC S C + K PCPY DY +E+T+SSG L++D+L
Sbjct: 146 DRDLGIYKPSESTTSRHLPCSHELCSPASGCTNPKQPCPYNIDYFSENTTSSGLLIEDML 205
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
HL S HAP V +SVIIGCG+KQ+GSYL+G APDG++GLG+ D+SVPS LA+AGL++
Sbjct: 206 HLDSREGHAP---VNASVIIGCGKKQSGSYLEGIAPDGLLGLGMADISVPSFLARAGLVR 262
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
NSFS+CF ++DSG +FFGDQG TQQST F+P+ K Y V V+ YCIG+ C +GFQ
Sbjct: 263 NSFSMCFKKDDSGRIFFGDQGVPTQQSTPFVPMNGKLQTYAVNVDKYCIGHKCTEGAGFQ 322
Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
ALVD+G SFT LP + Y + ++FDK +++ R S S++YCY+ EM VP + L
Sbjct: 323 ALVDTGTSFTSLPLDAYKSITMEFDKQINASRASSDDYSFEYCYSTGPLEMPDVPTITLT 382
Query: 279 FSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
F++N+SF N I F + +G F VFCL V+ + GIIGQNFM+G+ +VFDREN+KL
Sbjct: 383 FAENKSFQAVNPILPFNDRQGEFAVFCLAVLPSPEPVGIIGQNFMVGYHVVFDRENMKLG 442
Query: 338 WSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAAS 397
W S+C ++ + + V L P +PLP+ EQQ++ A P+ A APS + +
Sbjct: 443 WYRSECHDLDNSTMVSLGPSQHNSPEDPLPSNEQQTS----PAVTPAVAGRAPSSGGSTT 498
Query: 398 AQQL 401
Q L
Sbjct: 499 LQNL 502
>gi|115448709|ref|NP_001048134.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|46390211|dbj|BAD15642.1| aspartyl protease-like [Oryza sativa Japonica Group]
gi|113537665|dbj|BAF10048.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|222623681|gb|EEE57813.1| hypothetical protein OsJ_08401 [Oryza sativa Japonica Group]
Length = 520
Score = 363 bits (932), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 178/364 (48%), Positives = 246/364 (67%), Gaps = 8/364 (2%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
DR+L Y PS S++S+++ CSH LC S C + K PCPY DY +E+T+SSG L++D+L
Sbjct: 146 DRDLGIYKPSESTTSRHLPCSHELCSPASGCTNPKQPCPYNIDYFSENTTSSGLLIEDML 205
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
HL S HAP V +SVIIGCG+KQ+GSYL+G APDG++GLG+ D+SVPS LA+AGL++
Sbjct: 206 HLDSREGHAP---VNASVIIGCGKKQSGSYLEGIAPDGLLGLGMADISVPSFLARAGLVR 262
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
NSFS+CF ++DSG +FFGDQG TQQST F+P+ K Y V V+ YCIG+ C +GFQ
Sbjct: 263 NSFSMCFKKDDSGRIFFGDQGVPTQQSTPFVPMNGKLQTYAVNVDKYCIGHKCTEGAGFQ 322
Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
ALVD+G SFT LP + Y + ++FDK +++ R S S++YCY+ EM VP + L
Sbjct: 323 ALVDTGTSFTSLPLDAYKSITMEFDKQINASRASSDDYSFEYCYSTGPLEMPDVPTITLT 382
Query: 279 FSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
F++N+SF N I F + +G F VFCL V+ + GIIGQNFM+G+ +VFDREN+KL
Sbjct: 383 FAENKSFQAVNPILPFNDRQGEFAVFCLAVLPSPEPVGIIGQNFMVGYHVVFDRENMKLG 442
Query: 338 WSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAAS 397
W S+C ++ + + V L P +PLP+ EQQ++ A P+ A APS + +
Sbjct: 443 WYRSECHDLDNSTTVSLGPSQHNSPEDPLPSNEQQTS----PAVTPAVAGRAPSSGGSTT 498
Query: 398 AQQL 401
Q L
Sbjct: 499 LQNL 502
>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
gi|194693730|gb|ACF80949.1| unknown [Zea mays]
gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
Length = 519
Score = 360 bits (924), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 178/364 (48%), Positives = 244/364 (67%), Gaps = 8/364 (2%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
DR+L Y P+ S++S+++ CSH LC+ S C + K PC Y DY +E+T+SSG L++D L
Sbjct: 144 DRDLGIYKPAESTTSRHLPCSHELCQPGSGCTNPKQPCTYNIDYFSENTTSSGLLIEDSL 203
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
HL S HAP V +SVIIGCGRKQ+G YLDG APDG++GLG+ D+SVPS LA+AGL++
Sbjct: 204 HLNSREGHAP---VNASVIIGCGRKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVR 260
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
NSFS+CF E+ SG +FFGDQG ++QQST F+P+ K Y V V+ CIG+ CL S FQ
Sbjct: 261 NSFSMCFKEDSSGRIFFGDQGVSSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGSSFQ 320
Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
ALVDSG SFT LP ++Y +FDK +++ R+ + ++WKYCY+AS EM VP + L
Sbjct: 321 ALVDSGTSFTSLPPDVYKAFTTEFDKQINASRVPYEDSTWKYCYSASPLEMPDVPTIILA 380
Query: 279 FSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
F+ N+SF N I F + +G FCL V+ + GIIGQNF++G+ +VFDRE++KL
Sbjct: 381 FAANKSFQAVNPILPFNDEQGALARFCLAVLPSTEPIGIIGQNFLVGYHVVFDRESMKLG 440
Query: 338 WSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAAS 397
W S+C +V + + V L P G S +PLP+ EQQ++ P+T TAP S +
Sbjct: 441 WYRSECRDVDNSTTVPLGPSQHGSSEDPLPSNEQQTS----PPVTPATTGTAPPSSATTN 496
Query: 398 AQQL 401
Q L
Sbjct: 497 RQML 500
>gi|357463449|ref|XP_003602006.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355491054|gb|AES72257.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 529
Score = 358 bits (919), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 188/382 (49%), Positives = 249/382 (65%), Gaps = 3/382 (0%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP-CPYIADYSTEDTSSSGYLVDDI 97
DR+L+EY PS S SSK++SCSH LC S+CK+ K CPY +Y +++TSSSG LV+DI
Sbjct: 145 DRDLNEYSPSRSLSSKHLSCSHRLCDMGSNCKTSKQQQCPYTINYLSDNTSSSGLLVEDI 204
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
HL S SSVQ+ V++GCG KQ+G YLDG APDG++GLG G+ SVPS LAK+GLI
Sbjct: 205 FHLQSGDGSTSNSSVQAPVVVGCGMKQSGGYLDGTAPDGLIGLGPGESSVPSFLAKSGLI 264
Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 217
++SFS+CF+E+DSG +FFGDQG QQST FL + + Y VGVE+ CIGNSC + F
Sbjct: 265 RDSFSLCFNEDDSGRLFFGDQGSTVQQSTPFLLVDGMFSTYIVGVETCCIGNSCPKVTSF 324
Query: 218 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 277
A DSG SFTFLP Y + +FDK V++ R + QG+ W+YCY SS+++ K+P + L
Sbjct: 325 NAQFDSGTSFTFLPGHAYGAIAEEFDKQVNATRSTFQGSPWEYCYVPSSQQLPKIPTLTL 384
Query: 278 IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
+F +N SFVV N +F +G FCL + T+G G IGQNFM G+R+VFDREN KLA
Sbjct: 385 MFQQNNSFVVYNPVFVSYNEQGVDGFCLAIQPTEGGMGTIGQNFMTGYRLVFDRENKKLA 444
Query: 338 WSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTS-NGQAAAPPSTAKTAPSKSIAA 396
WSHS C+++ + L PP G S + LP EQQ T + A A A PS + +
Sbjct: 445 WSHSNCQDLSLGKRMPL-SPPNGTSSSQLPADEQQRTKGHAVAPAVAVRAPQKPSVASSQ 503
Query: 397 SAQQLDSVLRVACSLLVLMCLL 418
++ + C L+L LL
Sbjct: 504 TSYMISYWRHWHCHWLLLFHLL 525
>gi|217426809|gb|ACK44517.1| AT5G10080-like protein [Arabidopsis arenosa]
Length = 506
Score = 358 bits (918), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 187/373 (50%), Positives = 252/373 (67%), Gaps = 20/373 (5%)
Query: 15 NALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD 74
N + C P+T+ + S + ++L+EY+PSSSS+SK CSH LC S S C+S K+
Sbjct: 129 NCVQCAPLTSTYY-------SSLATKDLNEYNPSSSSTSKVFLCSHKLCDSASDCESPKE 181
Query: 75 PCPYIADYSTEDTSSSGYLVDDILHLASFSKHA---PQSSVQSSVIIGCGRKQTGSYLDG 131
CPY +Y + +TSSSG LV+DILHL + + SSV++ V+IGCG+KQ+G YLDG
Sbjct: 182 QCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDG 241
Query: 132 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPI 191
APDG+MGLG ++SVPS L+KAGL++NSFS+CFDE DSG ++FGD GP+ QQST FL +
Sbjct: 242 VAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQL 301
Query: 192 GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
E Y VGVE+ CIGNSCL Q+ F +DSG SFT+LP EIY +V ++ D+ +++
Sbjct: 302 -ENNSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHINATSK 360
Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD 311
S +G SW+YCY +S E KVP ++L FS N +FV+ +F F +++G FCL + S
Sbjct: 361 SFEGVSWEYCYESSVEP--KVPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLPI-SPS 417
Query: 312 GDYGI--IGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTT 369
G GI IGQN+M G+R+VFDREN+KL WS SKC+E +K P + SP PLPT
Sbjct: 418 GQEGIGSIGQNYMRGYRMVFDRENMKLRWSASKCQE--EKIEPPQASPGSTSSPYPLPTE 475
Query: 370 EQQSTSNGQAAAP 382
EQQ S G A +P
Sbjct: 476 EQQ--SRGHAVSP 486
>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 564
Score = 355 bits (910), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 184/388 (47%), Positives = 249/388 (64%), Gaps = 9/388 (2%)
Query: 28 CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 87
C + G DR+L Y P+ S++S+++ CSH LC S C S K PCPY DY E+T
Sbjct: 176 CAPLAGYRETLDRDLGIYKPAESTTSRHLPCSHELCPPGSGCSSPKQPCPYSTDYLQENT 235
Query: 88 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 147
+SSG L++DILHL S HAP V++SV+IGCGRKQ+GSYLDG APDG++GLG+ D+SV
Sbjct: 236 TSSGLLIEDILHLDSRESHAP---VKASVVIGCGRKQSGSYLDGIAPDGLLGLGMADISV 292
Query: 148 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 207
PS LA+AGL++NSFS+CF E DSG +FFGDQG + QQST F+P+ KY Y V V+ C+
Sbjct: 293 PSFLARAGLVRNSFSMCFKE-DSGRIFFGDQGVSIQQSTPFVPLYGKYQTYAVNVDKSCV 351
Query: 208 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
G+ C + F+ALVDSG SFT LP +Y V V+FDK V + RI+ + S++YCY+AS
Sbjct: 352 GHKCFEATSFEALVDSGTSFTALPLNVYKAVAVEFDKQVHAPRITQEDASFEYCYSASPL 411
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHR 326
+M VP + L F+ N+SF N + EG FCL + + GIIGQNF+ G+
Sbjct: 412 KMPDVPTVTLTFAANKSFQAVNPTIVLKDGEGSVAGFCLALQKSPEPIGIIGQNFLTGYH 471
Query: 327 IVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTA 386
IVFD+EN+KL W S+C + + + V L P PLP++EQQ++ PP+ A
Sbjct: 472 IVFDKENMKLGWYRSECHDPDNSTTVPLGPSQHNSPGVPLPSSEQQTSPT---VTPPAVA 528
Query: 387 KTAPSKSIAASAQQLDSVLRVACSLLVL 414
AP+ S + L +L CSLL+L
Sbjct: 529 GKAPTSS-SGPPSNLHRLLANCCSLLLL 555
>gi|223946655|gb|ACN27411.1| unknown [Zea mays]
Length = 378
Score = 351 bits (901), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 176/379 (46%), Positives = 248/379 (65%), Gaps = 11/379 (2%)
Query: 37 VQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
+QDR+L Y P+ S++S+++ CSH LC+S C + K PCPY DY +E+T+SSG L++D
Sbjct: 1 MQDRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIED 60
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
LHL H P V +SVIIGCG+KQ+G YLDG APDG++GLG+ D+SVPS LA+AGL
Sbjct: 61 TLHLNYREDHVP---VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGL 117
Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
+QNSFS+CF E+ SG +FFGDQG +QQST F+P+ K Y V V+ CIG+ CL +
Sbjct: 118 VQNSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTS 177
Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 276
F+ALVDSG SFT LP ++Y ++FDK +++ R+ + +WKYCY+AS EM VP +
Sbjct: 178 FKALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTIT 237
Query: 277 LIFSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 335
L F+ ++S N I F + +G FCL V+ + GII QNF++G+ +VFDRE++K
Sbjct: 238 LTFAADKSLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHVVFDRESMK 297
Query: 336 LAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIA 395
L W S+C V D + V L P +PLP+ EQQ++ A P+TA TAP ++
Sbjct: 298 LGWYRSECRYVEDSTTVPLGPSQHDSPEDPLPSNEQQTS----PAVTPATAGTAP---LS 350
Query: 396 ASAQQLDSVLRVACSLLVL 414
+ L +L + LL+L
Sbjct: 351 CATTNLQMLLASSYPLLLL 369
>gi|195619700|gb|ACG31680.1| pepsin A [Zea mays]
Length = 485
Score = 350 bits (897), Expect = 9e-94, Method: Compositional matrix adjust.
Identities = 175/377 (46%), Positives = 247/377 (65%), Gaps = 11/377 (2%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
DR+L Y P+ S++S+++ CSH LC+S C + K PCPY DY +E+T+SSG L++D L
Sbjct: 110 DRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTL 169
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
HL H P V +SVIIGCG+KQ+G YLDG APDG++GLG+ D+SVPS LA+AGL+Q
Sbjct: 170 HLNYREDHVP---VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQ 226
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
NSFS+CF E+ SG +FFGDQG +QQST F+P+ K Y V V+ CIG+ CL + F+
Sbjct: 227 NSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFK 286
Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
ALVDSG SFT LP ++Y ++FDK +++ R+ + +WKYCY+AS EM VP + L
Sbjct: 287 ALVDSGTSFTSLPLDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLT 346
Query: 279 FSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
F+ ++S N I F + +G FCL V+ + GII QNF++G+ +VFDRE++KL
Sbjct: 347 FAADKSLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHVVFDRESMKLG 406
Query: 338 WSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAAS 397
W S+C +V D + V L P +PLP+ EQQ++ A P+TA TAP ++ +
Sbjct: 407 WYRSECHDVEDSTTVPLGPSQRDSPEDPLPSNEQQTS----PAVTPATAGTAP---LSCA 459
Query: 398 AQQLDSVLRVACSLLVL 414
L +L + LL+L
Sbjct: 460 TTNLQMLLASSYPLLLL 476
>gi|226495123|ref|NP_001141522.1| uncharacterized protein LOC100273634 precursor [Zea mays]
gi|194704920|gb|ACF86544.1| unknown [Zea mays]
gi|223949445|gb|ACN28806.1| unknown [Zea mays]
gi|413924531|gb|AFW64463.1| pepsin A [Zea mays]
Length = 515
Score = 348 bits (892), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 175/377 (46%), Positives = 246/377 (65%), Gaps = 11/377 (2%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
DR+L Y P+ S++S+++ CSH LC+S C + K PCPY DY +E+T+SSG L++D L
Sbjct: 140 DRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTL 199
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
HL H P V +SVIIGCG+KQ+G YLDG APDG++GLG+ D+SVPS LA+AGL+Q
Sbjct: 200 HLNYREDHVP---VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQ 256
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
NSFS+CF E+ SG +FFGDQG +QQST F+P+ K Y V V+ CIG+ CL + F+
Sbjct: 257 NSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFK 316
Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
ALVDSG SFT LP ++Y ++FDK +++ R+ + +WKYCY+AS EM VP + L
Sbjct: 317 ALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLT 376
Query: 279 FSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
F+ ++S N I F + +G FCL V+ + GII QNF++G+ +VFDRE++KL
Sbjct: 377 FAADKSLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHVVFDRESMKLG 436
Query: 338 WSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAAS 397
W S+C V D + V L P +PLP+ EQQ++ A P+TA TAP ++ +
Sbjct: 437 WYRSECRYVEDSTTVPLGPSQHDSPEDPLPSNEQQTS----PAVTPATAGTAP---LSCA 489
Query: 398 AQQLDSVLRVACSLLVL 414
L +L + LL+L
Sbjct: 490 TTNLQMLLASSYPLLLL 506
>gi|219887985|gb|ACL54367.1| unknown [Zea mays]
Length = 515
Score = 346 bits (887), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 174/377 (46%), Positives = 245/377 (64%), Gaps = 11/377 (2%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
DR+L Y P+ S++S+++ CSH LC+S C + K PCPY DY +E+T+SSG L++D L
Sbjct: 140 DRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTL 199
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
HL H P V +SVIIGCG+KQ+G YLDG APDG++ LG+ D+SVPS LA+AGL+Q
Sbjct: 200 HLNYREDHVP---VNASVIIGCGQKQSGDYLDGIAPDGLLALGMADISVPSFLARAGLVQ 256
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
NSFS+CF E+ SG +FFGDQG +QQST F+P+ K Y V V+ CIG+ CL + F+
Sbjct: 257 NSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFK 316
Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
ALVDSG SFT LP ++Y ++FDK +++ R+ + +WKYCY+AS EM VP + L
Sbjct: 317 ALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLT 376
Query: 279 FSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
F+ ++S N I F + +G FCL V+ + GII QNF++G+ +VFDRE++KL
Sbjct: 377 FAADKSLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHVVFDRESMKLG 436
Query: 338 WSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAAS 397
W S+C V D + V L P +PLP+ EQQ++ A P+TA TAP ++ +
Sbjct: 437 WYRSECRYVEDSTTVPLGPSQHDSPEDPLPSNEQQTS----PAVTPATAGTAP---LSCA 489
Query: 398 AQQLDSVLRVACSLLVL 414
L +L + LL+L
Sbjct: 490 TTNLQMLLASSYPLLLL 506
>gi|297807039|ref|XP_002871403.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317240|gb|EFH47662.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 529
Score = 345 bits (884), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 183/372 (49%), Positives = 249/372 (66%), Gaps = 18/372 (4%)
Query: 15 NALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD 74
N + C P+T+ + S + ++L+EY+PSSSSSSK CSH LC S S C S K+
Sbjct: 129 NCVQCAPLTSTYY-------SSLATKDLNEYNPSSSSSSKVFLCSHKLCGSASDCDSPKE 181
Query: 75 PCPYIADYSTEDTSSSGYLVDDILHLASFSKHA---PQSSVQSSVIIGCGRKQTGSYLDG 131
C Y Y + +TSSSG LV+DILHL + + SSV++ V++GCG+KQ+G YLDG
Sbjct: 182 QCTYTVKYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVVGCGKKQSGDYLDG 241
Query: 132 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPI 191
APDG+MGLG ++SVPS L+KAGL++NSFS+CFDE DSG ++FGD GP+ QQS FL +
Sbjct: 242 VAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSAPFLQL 301
Query: 192 GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
E Y VGVE+ CIGNSCL Q+ F +DSG SFT+LP EIY +V ++ D+ +++
Sbjct: 302 -ENNSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHINATSK 360
Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD 311
S +G SW+YCY +S E KVP ++L FS N +FV+ +F F +++G FCL + ++
Sbjct: 361 SFEGVSWEYCYESSVEP--KVPAIKLKFSHNNTFVIHKPLFVFQQSQGLVQFCLPISPSE 418
Query: 312 GD-YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTE 370
+ G IGQN+M G+R+VFDREN+KL WS SKC+E DK+ P + SP PLPT E
Sbjct: 419 QEGIGSIGQNYMRGYRMVFDRENMKLGWSPSKCQE--DKTEPPQASPGSTSSPYPLPTEE 476
Query: 371 QQSTSNGQAAAP 382
QQ S G A +P
Sbjct: 477 QQ--SRGHAVSP 486
>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 627
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 173/356 (48%), Positives = 236/356 (66%), Gaps = 9/356 (2%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
DR+L Y P+ S++S+++ CSH LC S C + K PCPY Y E+T+SSG LV+DIL
Sbjct: 252 DRDLGIYKPAESTTSRHLPCSHELCLLGSDCTNQKQPCPYNTKYLQENTTSSGLLVEDIL 311
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
HL S HAP V++SVIIGCGRKQ+GSYLDG APDG++GLG+ D+SVPS LA+AGL++
Sbjct: 312 HLDSRESHAP---VKASVIIGCGRKQSGSYLDGIAPDGLLGLGMADISVPSFLARAGLVR 368
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
NSFS+CF + DSG +FFGDQG +TQQST F+P+ K Y V V+ C+G+ C + FQ
Sbjct: 369 NSFSMCFTK-DSGRIFFGDQGVSTQQSTPFVPLYGKLQTYTVNVDKSCVGHKCFESTSFQ 427
Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
A+VDSG SFT LP +IY V ++FDK V++ R+ + S+ YCY+AS M VP + L
Sbjct: 428 AIVDSGTSFTALPLDIYKAVAIEFDKQVNASRLPQEATSFDYCYSASPLVMPDVPTVTLT 487
Query: 279 FSKNQSFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
F+ N+SF N F + EG FCL V+ + GII QNF++G+ +VFDREN+KL
Sbjct: 488 FAGNKSFQPVNPTFLLHDEEGAVAGFCLAVVQSPEPIGIIAQNFLLGYHVVFDRENMKLG 547
Query: 338 WSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKS 393
W S+C ++ + + V L P +PLP+ EQQ++ A P+ A A + S
Sbjct: 548 WYRSECHDLDNSTTVPLGPSQHNSPEDPLPSNEQQTS----PAVTPAVAGRARASS 599
>gi|357489329|ref|XP_003614952.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355516287|gb|AES97910.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 530
Score = 335 bits (860), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 182/391 (46%), Positives = 258/391 (65%), Gaps = 19/391 (4%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
DR+L++Y PS SSSS+++ C H LC S+CK KD CPYI +Y++++TSSSG+L++D L
Sbjct: 147 DRDLNQYSPSLSSSSRHLPCGHQLCNQNSNCKGFKDRCPYIKEYTSDNTSSSGFLIEDKL 206
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
HLAS +A ++S+Q+SVI+GCGRKQ+G +L+GAAP+G++GLG G +SVP+LLAKAGLI+
Sbjct: 207 HLAS--NNATKNSIQASVILGCGRKQSGYFLEGAAPNGMLGLGPGSISVPALLAKAGLIR 264
Query: 159 NSFSICFDENDSGSVFFGDQGPATQ-QSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 217
NS SIC +E SG + FGDQG ATQ +ST FL + YFVGVE +C+G+ C ++ F
Sbjct: 265 NSISICLNEKGSGRILFGDQGHATQRRSTPFLLDDGELLNYFVGVERFCVGSFCYKETEF 324
Query: 218 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMR 276
+A +D+G SFT+LP +Y VV +F+K V + RI+ Q S + CYNASS E P M+
Sbjct: 325 KAFIDTGTSFTYLPKGVYETVVAEFEKQVHATRITSQIQSDFNCCYNASSRESNNFPPMK 384
Query: 277 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-------YGIIGQNFMMGHRIVF 329
FSKNQSF+++N S + + T CL V+ +D + Y I QNF+MG+ +VF
Sbjct: 385 FTFSKNQSFIIQNPFISMDQED--TTICLAVVQSDDELITIGRKYTIACQNFLMGYDMVF 442
Query: 330 DRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTA-KT 388
DRENL+ W S C++ + +S + P G SP+ +P+ +QQ N + PP+ A KT
Sbjct: 443 DRENLRFGWFRSNCQDSMGES-ANFTSPSIGGSPDSIPSNQQQRVPNNTRSVPPAIAGKT 501
Query: 389 APSKSIAASAQQLDSVLRVACSLLVLMCLLL 419
+P S A +L L L+CLLL
Sbjct: 502 SPKPSAAKPGLNSWHLLNS----LSLICLLL 528
>gi|413924530|gb|AFW64462.1| hypothetical protein ZEAMMB73_591827, partial [Zea mays]
Length = 469
Score = 322 bits (826), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 152/307 (49%), Positives = 212/307 (69%), Gaps = 4/307 (1%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
DR+L Y P+ S++S+++ CSH LC+S C + K PCPY DY +E+T+SSG L++D L
Sbjct: 140 DRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTL 199
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
HL H P V +SVIIGCG+KQ+G YLDG APDG++GLG+ D+SVPS LA+AGL+Q
Sbjct: 200 HLNYREDHVP---VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQ 256
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
NSFS+CF E+ SG +FFGDQG +QQST F+P+ K Y V V+ CIG+ CL + F+
Sbjct: 257 NSFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFK 316
Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
ALVDSG SFT LP ++Y ++FDK +++ R+ + +WKYCY+AS EM VP + L
Sbjct: 317 ALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLT 376
Query: 279 FSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
F+ ++S N I F + +G FCL V+ + GII QNF++G+ +VFDRE++KL
Sbjct: 377 FAADKSLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHVVFDRESMKLG 436
Query: 338 WSHSKCE 344
W S+C+
Sbjct: 437 WYRSECK 443
>gi|110741881|dbj|BAE98882.1| predicted GPI-anchored protein [Arabidopsis thaliana]
Length = 313
Score = 298 bits (764), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 148/276 (53%), Positives = 196/276 (71%), Gaps = 9/276 (3%)
Query: 110 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 169
SSV++ V+IGCG+KQ+G YLDG APDG+MGLG ++SVPS L+KAGL++NSFS+CFDE D
Sbjct: 5 SSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEED 64
Query: 170 SGSVFFGDQGPATQQSTSFLPI-GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFT 228
SG ++FGD GP+ QQST FL + KY Y VGVE+ CIGNSCL Q+ F +DSG SFT
Sbjct: 65 SGRIYFGDMGPSIQQSTPFLQLDNNKYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSFT 124
Query: 229 FLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR 288
+LP EIY +V ++ D+ +++ + +G SW+YCY +S+E KVP ++L FS N +FV+
Sbjct: 125 YLPEEIYRKVALEIDRHINATSKNFEGVSWEYCYESSAEP--KVPAIKLKFSHNNTFVIH 182
Query: 289 NHIFSFPENEGFTVFCLTVMSTDGDYGI--IGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+F F +++G FCL + S G GI IGQN+M G+R+VFDREN+KL WS SKC+E
Sbjct: 183 KPLFVFQQSQGLVQFCLPI-SPSGQEGIGSIGQNYMRGYRMVFDRENMKLGWSPSKCQE- 240
Query: 347 IDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAP 382
DK P + SPNPLPT EQQS G A +P
Sbjct: 241 -DKIEPPQASPGSTSSPNPLPTDEQQSR-GGHAVSP 274
>gi|449533544|ref|XP_004173734.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like, partial [Cucumis sativus]
Length = 408
Score = 296 bits (757), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 143/255 (56%), Positives = 195/255 (76%), Gaps = 1/255 (0%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
D++L+EY PSSSS+SK++SCSH LC S SC+S K CPY+ DY TE+TSSSG L+ D+L
Sbjct: 148 DKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYITENTSSSGLLIQDVL 207
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
HL+S +++ ++Q+ VI+GCG KQ+G YL G APDG+ GLGLG++SV S LAK L+Q
Sbjct: 208 HLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVLSSLAKEELVQ 267
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
NSFS+CF+E+ SG +FFGD+GPA+QQ+TSF+P+ KY+ Y VGVE+ CI NSCL Q+ F+
Sbjct: 268 NSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIENSCLKQTSFK 327
Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 277
AL+DSG SFT+LP E Y +V++FDK L ++ +S +G WKYCY S++ M KVP + L
Sbjct: 328 ALIDSGTSFTYLPEEAYENIVIEFDKRLNTTSAVSFKGYPWKYCYKISADAMPKVPSVTL 387
Query: 278 IFSKNQSFVVRNHIF 292
+F N SFVV + +F
Sbjct: 388 LFPLNNSFVVHDPVF 402
>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 527
Score = 224 bits (570), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 130/351 (37%), Positives = 197/351 (56%), Gaps = 21/351 (5%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILHLASF 103
YD SS+SKNV+C+ LC+ ++ C S CPY +Y +E+TS++G+LV+D+LHL +
Sbjct: 163 YDNKESSTSKNVACNSSLCEQKTQCSSSSGGTCPYQVEYLSENTSTTGFLVEDVLHLITD 222
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
+ Q + + GCG+ QTG++LDGAAP+G+ GLG+ DVSVPS+LAK GL NSFS+
Sbjct: 223 NDDQTQHA-NPLITFGCGQVQTGAFLDGAAPNGLFGLGMSDVSVPSILAKQGLTSNSFSM 281
Query: 164 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDS 223
CF + G + FGD + Q + I + Y + V +G + F A+ D+
Sbjct: 282 CFAADGLGRITFGDNNSSLDQGKTPFNIRPSHSTYNITVTQIIVGGNSADLE-FNAIFDT 340
Query: 224 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNASSEEMLKVPDMRLIFS 280
G SFT+L Y ++ FD + +R S + ++YCY+ + + ++VP++ L
Sbjct: 341 GTSFTYLNNPAYKQITQSFDSKIKLQRHSFSNSDDLPFEYCYDLRTNQTIEVPNINLTMK 400
Query: 281 KNQSFVVRNHIF-SFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWS 339
++ V + I S N G V CL V+ ++ + IIGQNFM G+RIVFDREN+ L W
Sbjct: 401 GGDNYFVMDPIITSGGGNNG--VLCLAVLKSN-NVNIIGQNFMTGYRIVFDRENMTLGWK 457
Query: 340 HSKC--EEV----IDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPS 384
S C +E+ +++SH V P +P Q + SNG P S
Sbjct: 458 ESNCYDDELSSLPVNRSHAPAVSPAMAVNPEI-----QSNPSNGPQRLPSS 503
>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 547
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 134/365 (36%), Positives = 194/365 (53%), Gaps = 25/365 (6%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
N + Y P++SS+SK V CS LC C S D CPY Y +++TSS+GYLV+DILHL
Sbjct: 175 NFNIYSPNNSSTSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHL 234
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+ V + + +GCG+ Q+G++L AAP+G+ GLG+ +VSVPS+LA AGLI NS
Sbjct: 235 TT--NDVQSKPVNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNS 292
Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQAL 220
FS+CF G + FGD+G Q T F +G ++ Y V + +G ++ +
Sbjct: 293 FSLCFGPARMGRIEFGDKGSPGQNETPF-NLGRRHPTYNVSITQIGVGGH-ISDLDVAVI 350
Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS-SEEMLKVPDMRLI 278
DSG SFT+L Y+ KF +V K+ ++ + ++ CY S ++ P M L
Sbjct: 351 FDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLT 410
Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
FV+ NH E +FCL + +D IIGQNFM G+ IVFDRE + L W
Sbjct: 411 MKGGGHFVI-NHPIVLISTESKRLFCLAIARSD-SINIIGQNFMTGYHIVFDREKMVLGW 468
Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTA-KTAPSKSIAAS 397
S C D++ +L P G +P P AAAP +TA K + +I +
Sbjct: 469 KESNCTGYEDENTNNL---PVGPTPTP-------------AAAPGTTAIKPQANSNINNT 512
Query: 398 AQQLD 402
Q ++
Sbjct: 513 TQTIE 517
>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like [Cucumis sativus]
Length = 524
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 134/365 (36%), Positives = 194/365 (53%), Gaps = 25/365 (6%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
N + Y P++SS+SK V CS LC C S D CPY Y +++TSS+GYLV+DILHL
Sbjct: 152 NFNIYSPNNSSTSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHL 211
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+ V + + +GCG+ Q+G++L AAP+G+ GLG+ +VSVPS+LA AGLI NS
Sbjct: 212 TT--NDVQSKPVNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNS 269
Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQAL 220
FS+CF G + FGD+G Q T F +G ++ Y V + +G ++ +
Sbjct: 270 FSLCFGPARMGRIEFGDKGSPGQNETPF-NLGRRHPTYNVSITQIGVGGH-ISDLDVAVI 327
Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS-SEEMLKVPDMRLI 278
DSG SFT+L Y+ KF +V K+ ++ + ++ CY S ++ P M L
Sbjct: 328 FDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLT 387
Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
FV+ NH E +FCL + +D IIGQNFM G+ IVFDRE + L W
Sbjct: 388 MKGGGHFVI-NHPIVLISTESKRLFCLAIARSD-SINIIGQNFMTGYHIVFDREKMVLGW 445
Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTA-KTAPSKSIAAS 397
S C D++ +L P G +P P AAAP +TA K + +I +
Sbjct: 446 KESNCTGYEDENTNNL---PVGPTPTP-------------AAAPGTTAIKPQANSNINNT 489
Query: 398 AQQLD 402
Q ++
Sbjct: 490 TQTIE 494
>gi|255586856|ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223525945|gb|EEF28342.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 533
Score = 218 bits (556), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 129/352 (36%), Positives = 191/352 (54%), Gaps = 12/352 (3%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
Y P++SS+S+ + C++ LC +S C S + CPY Y + TSS+G LV+D+LHL +
Sbjct: 165 YRPNASSTSQTIPCNNTLCSRQSRCPSAQSTCPYQVQYLSNGTSSTGVLVEDLLHLTT-- 222
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
A ++ + +I GCGR QTGS+LDGAAP+G+ GLG+ ++SVPS LA+ G NSFS+C
Sbjct: 223 DDAQSRALDAKIIFGCGRVQTGSFLDGAAPNGLFGLGMTNISVPSTLAREGYTSNSFSMC 282
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
F + G + FGD G + Q T F + + + Y V + +G F A+ DSG
Sbjct: 283 FGRDGIGRISFGDTGSSGQGETPF-NLRQLHPTYNVSITKINVGGRDADLE-FSAIFDSG 340
Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYNASSEEM-LKVPDMRLIFSKN 282
SFT+L Y + F+ KR S+ ++YCY SS + L++P + L+
Sbjct: 341 TSFTYLNDPAYTLISESFNIGAKEKRYSSISDIPFEYCYEMSSNQTNLEIPTVNLVMQGG 400
Query: 283 QSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 342
F V + I G +++CL ++ + GD IIGQNFM G+RIVF+RE L W S
Sbjct: 401 SQFNVTDPIVIVILQGGASIYCLAIVKS-GDVNIIGQNFMTGYRIVFNRERNVLGWKASD 459
Query: 343 CEEVIDKSHVHLVPPPAGQSP----NPLPTTEQQSTSNGQAAAPPSTAKTAP 390
C + +D + + P G P NP T +T+ + PP AP
Sbjct: 460 CYDDMDTTTFPVDPISPGIPPATAVNPQATAGSGNTTE-VSGTPPPVGNNAP 510
>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 515
Score = 214 bits (545), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 137/393 (34%), Positives = 200/393 (50%), Gaps = 34/393 (8%)
Query: 28 CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 87
C ++ D +L+ Y+P+ SS+SK V+C++ LC RS C CPY+ Y + +T
Sbjct: 129 CAATDSSAFASDFDLNVYNPNGSSTSKKVTCNNSLCMHRSQCLGTLSNCPYMVSYVSAET 188
Query: 88 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 147
S+SG LV+D+LHL H V+++VI GCG+ Q+GS+LD AAP+G+ GLG+ +SV
Sbjct: 189 STSGILVEDVLHLTQEDNH--HDLVEANVIFGCGQIQSGSFLDVAAPNGLFGLGMEKISV 246
Query: 148 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 207
PS+L++ G +SFS+CF + G + FGD+G Q T F + + Y + V +
Sbjct: 247 PSMLSREGFTADSFSMCFGRDGIGRISFGDKGSFDQDETPF-NLNPSHPTYNITVTQVRV 305
Query: 208 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASS 266
G + L F AL DSG SFT+L Y + F V +R ++YCY+ S
Sbjct: 306 GTT-LIDVEFTALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIPFEYCYDMSP 364
Query: 267 EEMLK-VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 325
+ +P + L F V + I + V+CL V+ T + IIGQNFM G+
Sbjct: 365 DANTSLIPSVSLTMGGGSHFAVYDPIIII-STQSELVYCLAVVKT-AELNIIGQNFMTGY 422
Query: 326 RIVFDRENLKLAWSHSKCEEVID-------KSHVHLVPPPA-----GQSPNPLPTTEQQS 373
R+VFDRE L L W C ++ D + H H PPA G P PT ++S
Sbjct: 423 RVVFDREKLVLGWKKFDCYDIEDHNDAIPTRPHSHADVPPAVAAGLGNYPATDPT--RKS 480
Query: 374 TSNGQAAAPPSTAKTAPSKSIAASAQQLDSVLR 406
N Q K + + Q L S+LR
Sbjct: 481 KYNSQ------------RKWLTNTTQWLRSMLR 501
>gi|168000300|ref|XP_001752854.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696017|gb|EDQ82358.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 525
Score = 214 bits (544), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 125/344 (36%), Positives = 189/344 (54%), Gaps = 9/344 (2%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
L+ Y PS SS++K V CS PLC+ S+C + D CPY +Y + +TS+SG L +D ++
Sbjct: 159 LNPYTPSLSSTAKPVLCSDPLCEMSSTCMAPTDQCPYEINYVSANTSTSGALYEDYMY-- 216
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
F + + + V+ V +GCG+ QTGS L GAAP+G+MGLG D+SVP+ LA G + +SF
Sbjct: 217 -FMRESGGNPVKLPVYLGCGKVQTGSLLKGAAPNGLMGLGTTDISVPNKLASTGQLADSF 275
Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIG-EKYDAYFVGVESYCIGNSCLTQSGFQAL 220
S+C SG++ FGD+GPA Q++T +P D Y V ++S +GN+ L + AL
Sbjct: 276 SLCISPGGSGTLTFGDEGPAAQRTTPIIPKSVSMLDTYIVEIDSITVGNTNLLMAS-HAL 334
Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVS-SKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 279
D+G SFT+L +Y + V +D +S K + + W CY S+ +VP + L
Sbjct: 335 FDTGTSFTYLSKTVYPQFVQAYDAQMSLPKWNDPRFSKWDLCYQTSNTN-FQVPVVSLAL 393
Query: 280 SKNQSFVVRNHIFSF-PENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
S S V + + S +N C+TVM + IIGQNFM + I ++R + + W
Sbjct: 394 SGGNSLDVVSGLKSIVDDNNAMIAVCVTVMDSGAGLSIIGQNFMTNYSITYNRAKMTIGW 453
Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPN-PLPTTEQQSTSNGQAAA 381
+ S C + S+ PA P PLP + ++ N A
Sbjct: 454 TPSDCSTDLTLSNSTPGSVPAALPPTAPLPAVPRPASPNSTVTA 497
>gi|449529194|ref|XP_004171586.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Cucumis sativus]
Length = 417
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 124/340 (36%), Positives = 188/340 (55%), Gaps = 15/340 (4%)
Query: 28 CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 87
C G+ D LS Y P SS+SK V C++ LC R C CPY+ Y + +T
Sbjct: 37 CAPTEGSPYASDFELSVYSPKKSSTSKTVPCNNSLCAQRDQCTEAFGNCPYVVSYVSAET 96
Query: 88 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 147
S++G L++D+LHL + +KH+ +Q+ + GCG+ Q+GS+LD AAP+G+ GLG+ +SV
Sbjct: 97 STTGILIEDLLHLKTENKHS--EPIQAYITFGCGQVQSGSFLDVAAPNGLFGLGMEQISV 154
Query: 148 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 207
PS+L++ GL+ NSFS+CF ++ G + FGD+G Q+ T F + + + Y + V S +
Sbjct: 155 PSILSREGLMANSFSMCFSDDGVGRINFGDKGSLEQEETPF-NLNQLHPNYNITVTSIRV 213
Query: 208 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASS 266
G + L + AL DSG SF++ IY+++ F R ++YCYN S
Sbjct: 214 GTT-LIDADITALFDSGTSFSYFTDPIYSKLSASFHAQTRDGRHPPNPRIPFEYCYNMSP 272
Query: 267 EEMLKV-PDMRLIFSKNQSFVVRNHIFSF-PENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
+ + P + L F V + I +NE ++CL V+ + + IIGQNFM G
Sbjct: 273 DANASLTPGISLTMKGGGPFPVYDPIIVISTQNE--LIYCLAVVKS-AELNIIGQNFMTG 329
Query: 325 HRIVFDRENLKLAWSHSKCEEVIDKSHVHLVP-----PPA 359
+RIVFDRE L L W C ++ +KS + P PPA
Sbjct: 330 YRIVFDREKLVLGWKKFDCYDIEEKSLFPMKPDVTTVPPA 369
>gi|224096686|ref|XP_002310698.1| predicted protein [Populus trichocarpa]
gi|222853601|gb|EEE91148.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 127/319 (39%), Positives = 178/319 (55%), Gaps = 10/319 (3%)
Query: 28 CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 87
C GAS D LS Y+P SS+SK V+C++ +C R+ C CPYI Y + T
Sbjct: 130 CAPTHGASYASDFELSIYNPRESSTSKKVTCNNDMCAQRNRCLGTFSSCPYIVSYVSAQT 189
Query: 88 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 147
S+SG LV D+LHL + + + V++ V GCG+ Q+GS+LD AAP+G+ GLG+ +SV
Sbjct: 190 STSGILVKDVLHLTT--EDGGREFVEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISV 247
Query: 148 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 207
PS+L++ GLI +SFS+CF + G + FGD+G Q+ T F + + Y V V +
Sbjct: 248 PSVLSREGLIADSFSMCFGHDGIGRISFGDKGSPDQEETPF-NVNPAHPTYNVTVTQARV 306
Query: 208 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASS 266
G + L F AL DSG SFT++ Y+ V KF L KR ++YCY+ S
Sbjct: 307 G-TMLIDVEFTALFDSGTSFTYMVDPAYSRVSEKFHSLARDKRRPPDPRIPFEYCYDMSP 365
Query: 267 EEMLK-VPDMRLIFSKNQSFVVRNHIFSF-PENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
+ VP M L + F V + I +NE V+CL V+ + + IIGQNFM G
Sbjct: 366 DANASLVPSMSLTMKGGRHFTVYDPIIVISTQNE--IVYCLAVVKST-ELNIIGQNFMTG 422
Query: 325 HRIVFDRENLKLAWSHSKC 343
+R+VFDRE L L W C
Sbjct: 423 YRVVFDREKLVLGWKKFDC 441
>gi|449434466|ref|XP_004135017.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 525
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 124/340 (36%), Positives = 187/340 (55%), Gaps = 15/340 (4%)
Query: 28 CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 87
C G+ D LS Y P SS+SK V C++ LC R C CPY+ Y + +T
Sbjct: 145 CAPTEGSPYASDFELSVYSPKKSSTSKTVPCNNNLCAQRDQCTEAFGNCPYVVSYVSAET 204
Query: 88 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 147
S++G L++D+LHL + KH+ +Q+ + GCG+ Q+GS+LD AAP+G+ GLG+ +SV
Sbjct: 205 STTGILIEDLLHLKTEHKHS--EPIQAYITFGCGQVQSGSFLDVAAPNGLFGLGMEQISV 262
Query: 148 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 207
PS+L++ GL+ NSFS+CF ++ G + FGD+G Q+ T F + + + Y + V S +
Sbjct: 263 PSILSREGLMANSFSMCFSDDGVGRINFGDKGSLEQEETPF-NLNQLHPNYNITVTSIRV 321
Query: 208 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASS 266
G + L + AL DSG SF++ IY+++ F R ++YCYN S
Sbjct: 322 GTT-LIDADITALFDSGTSFSYFTDPIYSKLSASFHAQTRDGRHPPNPRIPFEYCYNMSP 380
Query: 267 EEMLKV-PDMRLIFSKNQSFVVRNHIFSF-PENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
+ + P + L F V + I +NE ++CL V+ + + IIGQNFM G
Sbjct: 381 DANASLTPGISLTMKGGGPFPVYDPIIVISTQNE--LIYCLAVVKS-AELNIIGQNFMTG 437
Query: 325 HRIVFDRENLKLAWSHSKCEEVIDKSHVHLVP-----PPA 359
+RIVFDRE L L W C ++ +KS + P PPA
Sbjct: 438 YRIVFDREKLVLGWKKFDCYDIEEKSLFPMKPDVTTVPPA 477
>gi|42567433|ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|190576481|gb|ACE79041.1| At4g35880 [Arabidopsis thaliana]
gi|222423134|dbj|BAH19546.1| AT4G35880 [Arabidopsis thaliana]
gi|332661184|gb|AEE86584.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 524
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 116/316 (36%), Positives = 182/316 (57%), Gaps = 8/316 (2%)
Query: 33 GASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGY 92
GA+ + LS Y+P S+++K V+C++ LC R+ C CPY+ Y + TS+SG
Sbjct: 145 GATYASEFELSIYNPKVSTTNKKVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQTSTSGI 204
Query: 93 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
L++D++HL + K+ + V++ V GCG+ Q+GS+LD AAP+G+ GLG+ +SVPS+LA
Sbjct: 205 LMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPSVLA 262
Query: 153 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
+ GL+ +SFS+CF + G + FGD+G + Q+ T F + + Y + V +G + L
Sbjct: 263 REGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPF-NLNPSHPNYNITVTRVRVGTT-L 320
Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLK 271
F AL D+G SFT+L +Y V F KR S ++YCY+ S++
Sbjct: 321 IDDEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSPDSRIPFEYCYDMSNDANAS 380
Query: 272 -VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
+P + L N F + + I EG V+CL ++ + + IIGQN+M G+R+VFD
Sbjct: 381 LIPSLSLTMKGNSHFTINDPIIVI-STEGELVYCLAIVKS-SELNIIGQNYMTGYRVVFD 438
Query: 331 RENLKLAWSHSKCEEV 346
RE L LAW C ++
Sbjct: 439 REKLVLAWKKFDCYDI 454
>gi|255586860|ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223525947|gb|EEF28344.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 518
Score = 211 bits (537), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 133/366 (36%), Positives = 192/366 (52%), Gaps = 19/366 (5%)
Query: 28 CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 87
C G + D LS YDP SS+SK V+C++ LC R+ C CPY+ Y + T
Sbjct: 134 CAPTQGVAYASDFELSIYDPKQSSTSKKVTCNNNLCAHRNRCLGTFSSCPYMVSYVSAQT 193
Query: 88 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 147
S+SG LV+D+LHL S + + Q S+++ V GCG+ Q+GS+L+ AAP+G+ GLG+ +SV
Sbjct: 194 STSGILVEDVLHLTS--EDSNQESIKAYVTFGCGQVQSGSFLNTAAPNGLFGLGMDQISV 251
Query: 148 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 207
PS+L++ GL +SFS+CF + G + FGD+G Q+ T F + +Y + V +
Sbjct: 252 PSILSREGLTADSFSMCFGHDGVGRISFGDKGSPDQEETPFNS-NPSHPSYNISVTQVRV 310
Query: 208 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS- 265
G + L F AL DSG SFT+L IYA V F KR ++YCY+ S
Sbjct: 311 GTT-LVDVDFTALFDSGTSFTYLINPIYAMVSENFHAQAQDKRRPPDPRIPFEYCYDMSP 369
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSF-PENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
+P M L F V + I +NE V+CL ++ + + IIGQNFM G
Sbjct: 370 GANSSLIPSMSLTMKGRGHFTVFDPIIVITTQNE--LVYCLAIVKST-ELNIIGQNFMTG 426
Query: 325 HRIVFDRENLKLAWSHSKCEE-----VIDKSHVHLVPPPA----GQSPNPLPTTEQQSTS 375
+R+VFDRE L L W + C + + H VPP G +P T + + S
Sbjct: 427 YRVVFDREKLVLGWKETDCYDQEYNSFPTEPHASDVPPAVAAGLGNYSSPHSTNQDRKKS 486
Query: 376 NGQAAA 381
A+
Sbjct: 487 QSSVAS 492
>gi|326505434|dbj|BAJ95388.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 529
Score = 211 bits (536), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 126/342 (36%), Positives = 189/342 (55%), Gaps = 10/342 (2%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
Y P SS+S+ V CS +C ++ C + + CPY +Y +++TSS G LV+D+++LA+ S
Sbjct: 157 YSPRKSSTSRKVPCSSNMCDLQTECSAASNSCPYKIEYLSDNTSSKGVLVEDVMYLATES 216
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
H+ Q+ + GCG+ QTGS+L AAP+G++GLG+ SVPSLLA G+ NSFS+C
Sbjct: 217 GHS--KITQAPITFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASQGVAANSFSMC 274
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
F E+ G + FGD G A Q T L I + Y + + G + + F A+VDSG
Sbjct: 275 FGEDGHGRINFGDTGSADQLETP-LNIYKHNPYYNISIVGAMAGGKTFS-TKFSAVVDSG 332
Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQ 283
SFT L +Y E+ FDK V KR + ++YCY SS+ + P++ L
Sbjct: 333 TSFTALSDPMYTEITSAFDKQVKEKRNPADSSLPFEYCYTISSKGAVSPPNISLTAKGGS 392
Query: 284 SFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 342
F V++ I + + V +CL +M ++G +IG+NFM G ++VFDRE L L W
Sbjct: 393 VFPVKDPIITITDISSSPVGYCLAIMKSEG-VNLIGENFMSGLKVVFDRERLVLGWKSFN 451
Query: 343 CEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPS 384
C V + + + P + P P+ +SN +AA PS
Sbjct: 452 CYSVDHSTKLPVSPNSSAIPPKPV---SGPGSSNPEAAKRPS 490
>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
Length = 515
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 120/305 (39%), Positives = 175/305 (57%), Gaps = 8/305 (2%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+L+ Y P++SS+S V C+ LC C S CPY Y + TSS+G LV+D+LHL
Sbjct: 151 DLNIYSPNASSTSSKVPCNSTLCTRVDRCASPLSDCPYQIRYLSNGTSSTGVLVEDVLHL 210
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
S K++ +++ + +GCG QTG + DGAAP+G+ GLGL D+SVPS+LAK G+ NS
Sbjct: 211 VSMEKNS--KPIRARITLGCGLVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANS 268
Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQAL 220
FS+CF ++ +G + FGD+G Q+ T L I + + Y V V +G + F A+
Sbjct: 269 FSMCFGDDGAGRISFGDKGSVDQRETP-LNIRQPHPTYNVTVTQISVGGNT-GDLEFDAV 326
Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS-SEEMLKVPDMRLI 278
D+G SFT+L Y + F+ L KR ++YCY S +++ + PD+ L
Sbjct: 327 FDTGTSFTYLTDAPYTLISESFNSLALDKRYQTDSELPFEYCYAVSPNKKSFEYPDVNLT 386
Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
S+ V + + P E V+CL +M ++ D IIGQNFM G+R+VFDRE L L W
Sbjct: 387 MKGGSSYPVYHPLIVVPI-EDTVVYCLAIMKSE-DISIIGQNFMTGYRVVFDREKLILGW 444
Query: 339 SHSKC 343
S C
Sbjct: 445 KESDC 449
>gi|297802338|ref|XP_002869053.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314889|gb|EFH45312.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 522
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 116/316 (36%), Positives = 182/316 (57%), Gaps = 8/316 (2%)
Query: 33 GASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGY 92
GA+ + LS Y+P S+++K V+C++ LC R+ C CPY+ Y + TS+SG
Sbjct: 143 GATYASEFELSIYNPKISTTNKKVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQTSTSGI 202
Query: 93 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
L++D++HL + K+ + V++ V GCG+ Q+GS+LD AAP+G+ GLG+ +SVPS+LA
Sbjct: 203 LMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPSVLA 260
Query: 153 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
+ GL+ +SFS+CF + G + FGD+G + Q+ T F + + Y + V +G + L
Sbjct: 261 REGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPF-NLNPSHPNYNITVTRVRVGTT-L 318
Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLK 271
F AL D+G SFT+L +Y V F KR S ++YCY+ S++
Sbjct: 319 IDDEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSPDSRIPFEYCYDMSNDANAS 378
Query: 272 -VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
+P + L N F + + I EG V+CL ++ + + IIGQN+M G+R+VFD
Sbjct: 379 LIPSLSLTMKGNSHFTINDPIIVI-STEGELVYCLAIVKS-SELNIIGQNYMTGYRVVFD 436
Query: 331 RENLKLAWSHSKCEEV 346
RE L LAW C ++
Sbjct: 437 REKLVLAWKKFDCYDI 452
>gi|326504502|dbj|BAJ91083.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 537
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 133/368 (36%), Positives = 197/368 (53%), Gaps = 26/368 (7%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP---CPYIADYSTEDTSSSGYLVDDI 97
+L Y P SS+SK V+C H LC+ ++C + + CPY Y + +TSSSG LV+D+
Sbjct: 154 DLRPYSPGKSSTSKAVTCEHALCERPNACAAAGNSSTSCPYTVRYVSANTSSSGVLVEDV 213
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
LHL+ + ++V + V++GCG+ QTG++LDGAA DG++GLG+ VSVPS+L AGL+
Sbjct: 214 LHLSREAAGGASTAVTAPVVLGCGQVQTGAFLDGAAVDGLLGLGMDKVSVPSVLHAAGLV 273
Query: 158 -QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
+SFS+CF + G + FGD G Q T F + + Y + V + + +
Sbjct: 274 ASDSFSMCFSPDGFGRINFGDSGRRGQAETPFT-VRNTHPTYNISVTAMSVSGKEVAAE- 331
Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPD 274
F A+VDSG SFT+L Y E+ F+ V +R +L + ++YCY + L VP+
Sbjct: 332 FAAIVDSGTSFTYLNDPAYTELATGFNSEVRERRANLSASIPFEYCYELGRGQTELFVPE 391
Query: 275 MRLIFSKNQSF-VVRNHIFSFPE-NEGFTV---FCLTVMSTDGDYGIIGQNFMMGHRIVF 329
+ L F V R + + E ++G V +CL V+ D IIGQNFM G ++VF
Sbjct: 392 VSLTTRGGAVFPVTRPIVVIYGETSDGRIVAAGYCLAVLKNDITIDIIGQNFMTGLKVVF 451
Query: 330 DRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTT----EQQSTSNGQ--AAAPP 383
DRE L W C + ++ + G +P P PTT Q +NG A P
Sbjct: 452 DRERSVLGWHEFDCYKDVETEEL-------GAAPGPSPTTRLKPRQSEVANGTPYPGAVP 504
Query: 384 STAKTAPS 391
T + A S
Sbjct: 505 VTPRQAGS 512
>gi|225431324|ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
gi|297739017|emb|CBI28369.3| unnamed protein product [Vitis vinifera]
Length = 518
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 127/339 (37%), Positives = 187/339 (55%), Gaps = 14/339 (4%)
Query: 28 CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 87
C G + D LS Y+P SS+S+ V+C + LC R+ C CPY+ Y + +T
Sbjct: 136 CAPTEGTTYASDFELSIYNPKGSSTSRKVTCDNSLCAHRNRCLGTFSNCPYMVSYVSAET 195
Query: 88 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 147
S+SG LV+D+LHL + + Q V++ V GCG+ QTGS+LD AAP+G+ GLGL +SV
Sbjct: 196 STSGILVEDVLHLTT--EDNRQEFVEAYVTFGCGQVQTGSFLDIAAPNGLFGLGLEKISV 253
Query: 148 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 207
PS+L+K G +SFS+CF + G + FGD+G Q+ T F + + Y + V +
Sbjct: 254 PSILSKEGFTADSFSMCFGPDGIGRISFGDKGSPDQEETPF-NLNALHPTYNITVTQVRV 312
Query: 208 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNAS- 265
G + L F AL DSG SFT+L IY V+ F + S+R +++CY+ S
Sbjct: 313 GTT-LIDLDFTALFDSGTSFTYLVDPIYTNVLKSFHSQAQDSRRPPDSRIPFEFCYDMSP 371
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 325
E +P M L F V + I ++ ++C+ V+ + + IIGQNFM G+
Sbjct: 372 GENTSLIPSMSLTMKGGSQFPVYDPIIII-SSQSELIYCMAVVRS-AELNIIGQNFMTGY 429
Query: 326 RIVFDRENLKLAWSHSKCEEVIDKSHVHLVP-----PPA 359
RI+FDRE L L W +C++ I+ S V + P PPA
Sbjct: 430 RIIFDREKLVLGWKEFECDD-IENSSVPIRPRATSVPPA 467
>gi|224133616|ref|XP_002327639.1| predicted protein [Populus trichocarpa]
gi|222836724|gb|EEE75117.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 130/375 (34%), Positives = 187/375 (49%), Gaps = 42/375 (11%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
+L+ Y P++SS+S+ V C+ LC R C S + CPY Y + TS++GY+V D+L
Sbjct: 107 DLNIYSPNTSSTSEKVPCNSTLCSQTQRDRCPSDQSNCPYQVVYLSNGTSTTGYIVQDLL 166
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
HL S + +V + + GCG+ QTGS+L G AP+G+ GLG+ ++SVPS LA G
Sbjct: 167 HL--ISDDSQSKAVDAKITFGCGKVQTGSFLTGGAPNGLFGLGMSNISVPSTLAHNGYTS 224
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
SFS+CF N G + FGD+G Q TSF + Y + + IG + +
Sbjct: 225 GSFSMCFSPNGIGRISFGDKGSTGQGETSFNQGQPRSSLYNISITQTSIGGQA-SDLVYS 283
Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS------------ 266
A+ DSG SFT+L Y + F+KLV R S + YCY+ S
Sbjct: 284 AIFDSGTSFTYLNDPAYTLIAESFNKLVKETRRSSTQVPFDYCYDIRSFISAQILPFSCA 343
Query: 267 ---EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 323
+ +P + L+ S F V + I +G V+CL ++ + GD IIGQNFM
Sbjct: 344 YANQTEPTIPAVTLVMSGGDYFNVTDPIVLVQLADGSAVYCLGMIKS-GDVNIIGQNFMT 402
Query: 324 GHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPP 383
GHRIVFDRE + L W S C + +D + + + P A PP
Sbjct: 403 GHRIVFDRERMILGWKPSNCYDNMDTNTLAVSP---------------------NTAVPP 441
Query: 384 STAKTAPSKSIAASA 398
+TA +K I AS+
Sbjct: 442 ATAVNPEAKQIPASS 456
>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 519
Score = 208 bits (530), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 127/357 (35%), Positives = 189/357 (52%), Gaps = 20/357 (5%)
Query: 35 SIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLV 94
+ D +L+ Y+P+ SS+SK V+C++ LC RS C CPY+ Y + +TS+SG LV
Sbjct: 140 AFASDFDLNVYNPNGSSTSKKVTCNNSLCTHRSQCLGTFSNCPYMVSYVSAETSTSGILV 199
Query: 95 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
+D+LHL H V+++VI GCG+ Q+GS+LD AAP+G+ GLG+ +SVPS+L++
Sbjct: 200 EDVLHLTQEDNH--HDLVEANVIFGCGQIQSGSFLDVAAPNGLFGLGMEKISVPSMLSRE 257
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ 214
G +SFS+CF + G + FGD+G Q T F + + Y + V +G + +
Sbjct: 258 GFTADSFSMCFGRDGIGRISFGDKGSFDQDETPF-NLNPSHPTYNITVTQVRVGTTVIDV 316
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLK-V 272
F AL DSG SFT+L Y + F V +R ++YCY+ S + +
Sbjct: 317 E-FTALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIPFEYCYDMSPDANTSLI 375
Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRE 332
P + L F V + I + V+CL V+ + + IIGQNFM G+R+VFDRE
Sbjct: 376 PSVSLTMGGGSHFAVYDPIIII-STQSELVYCLAVVKS-AELNIIGQNFMTGYRVVFDRE 433
Query: 333 NLKLAWSHSKCEEVID---------KSHVHLVPP--PAGQSPNPLPTTEQQSTSNGQ 378
L L W C ++ D +SH VPP AG P + ++S N Q
Sbjct: 434 KLVLGWKKFDCYDIEDHNDAIPTRPRSHAD-VPPAVAAGLGNYPATDSTRKSKYNSQ 489
>gi|218202547|gb|EEC84974.1| hypothetical protein OsI_32231 [Oryza sativa Indica Group]
Length = 513
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 118/323 (36%), Positives = 189/323 (58%), Gaps = 8/323 (2%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
Y P+ S++S+ V CS LC +++C+S + CPY Y +++TSSSG LV+D+L+L S S
Sbjct: 148 YSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDS 207
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
A V + ++ GCG+ QTGS+L AAP+G++GLG+ SVPSLLA GL NSFS+C
Sbjct: 208 --AQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMC 265
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
F ++ G + FGD G + Q+ T L + ++ Y + + +G+ ++ F A+VDSG
Sbjct: 266 FGDDGHGRINFGDTGSSDQKETP-LNVYKQNPYYNITITGITVGSKSISTE-FSAIVDSG 323
Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQ 283
SFT L +Y ++ FD + S R L + +++CY+ S+ ++ P++ L
Sbjct: 324 TSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGS 382
Query: 284 SFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 342
F V + I + +N V +CL +M ++G +IG+NFM G ++VFDRE + L W +
Sbjct: 383 IFPVNDPIITITDNAFNPVGYCLAIMKSEG-VNLIGENFMSGLKVVFDRERMVLGWKNFN 441
Query: 343 CEEVIDKSHVHLVPPPAGQSPNP 365
C + S + + P P+ P P
Sbjct: 442 CYNFDESSRLPVNPSPSAVPPKP 464
>gi|356540838|ref|XP_003538891.1| PREDICTED: peroxidase [Glycine max]
Length = 829
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 130/394 (32%), Positives = 210/394 (53%), Gaps = 29/394 (7%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
YD SS+S+ V C+ LC+ + C S CPY +Y + TS++G+LV+D+LHL +
Sbjct: 151 YDLKGSSTSQTVLCNSNLCELQRQCPSSDSICPYEVNYLSNGTSTTGFLVEDVLHLITDD 210
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
+ + + GCG+ QTG++LDGAAP+G+ GLG+G+ SVPS+LAK GL NSFS+C
Sbjct: 211 DETKDADTR--ITFGCGQVQTGAFLDGAAPNGLFGLGMGNESVPSILAKEGLTSNSFSMC 268
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
F + G + FGD Q T F + + Y + V +G + F A+ DSG
Sbjct: 269 FGSDGLGRITFGDNSSLVQGKTPF-NLRALHPTYNITVTQIIVGGNAADLE-FHAIFDSG 326
Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNASSEEMLKVPDMRLIFSK 281
SFT L Y ++ F+ + +R S + ++YCY+ SS + +++P + L
Sbjct: 327 TSFTHLNDPAYKQITNSFNSAIKLQRYSSSSSDELPFEYCYDLSSNKTVELP-INLTMKG 385
Query: 282 NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 341
+++V + I + EG + CL V+ ++ + IIGQNFM G+RIVFDREN+ L W S
Sbjct: 386 GDNYLVTDPIVTI-SGEGVNLLCLGVLKSN-NVNIIGQNFMTGYRIVFDRENMILGWRES 443
Query: 342 KC--EEV----IDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIA 395
C +E+ I++S+ + P +P E + SN +P + K P+ +
Sbjct: 444 NCYVDELSTLAINRSNSPAISPAIAVNPE-----ETSNQSNDPELSPNLSFKIKPTSAFM 498
Query: 396 AS--------AQQLDSVLRVACSLLVLMCLLLSS 421
+ + Q+ + VA L++M ++S+
Sbjct: 499 MALLVPKNHRSTQISMAVMVAFLNLIIMFSVVST 532
>gi|357517935|ref|XP_003629256.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523278|gb|AET03732.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 544
Score = 206 bits (525), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 131/358 (36%), Positives = 192/358 (53%), Gaps = 39/358 (10%)
Query: 37 VQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
V D N+ E D SS+ KNV C+ +CK ++ C S C Y +Y + DTSSSG+LV+D
Sbjct: 157 VIDLNIYELD--KSSTRKNVPCNSNMCK-QTQCHSSGSSCRYEVEYLSNDTSSSGFLVED 213
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
+LHL + + + + + IGCG+ QTG +L+GAAP+G+ GLG+ +VSVPS+LA+ GL
Sbjct: 214 VLHL--ITDNDQTKDIDTQITIGCGQVQTGVFLNGAAPNGLFGLGMENVSVPSILAQKGL 271
Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
I +SFS+CF + SG + FGD G + Q T F + E + Y V + +G
Sbjct: 272 ISDSFSMCFGSDGSGRITFGDTGSSDQGKTPF-NLRESHPTYNVTITQIIVGGYAADHE- 329
Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS-LQGNS---WKYCYNASSEEMLKV 272
F A+ DSG SFT+L Y + KF+ LV + R S L +S ++YCY+ S ++ ++V
Sbjct: 330 FHAIFDSGTSFTYLNDPAYTLISEKFNSLVKANRHSPLSPDSDLPFEYCYDMSPDQTIEV 389
Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ------------- 319
P + L + V + I + CL + +D + IIG+
Sbjct: 390 PFLNLTMKGGDDYYVTDPIVPVSSEVEGNLLCLGIQKSD-NLNIIGREYTTEEEFLHLKH 448
Query: 320 ---------NFMMGHRIVFDRENLKLAWSHSKC-EEVI----DKSHVHLVPPPAGQSP 363
NFM G+RIVFDREN+ L W S C EEV+ +KSH + P +P
Sbjct: 449 MIIKFFIQKNFMTGYRIVFDRENMNLGWKESNCTEEVLSIPTNKSHSPAISPAIAVNP 506
>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 513
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 119/306 (38%), Positives = 173/306 (56%), Gaps = 9/306 (2%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+L+ Y P++SS+S V C+ LC C S + CPY Y + TSS+G LV+D+LHL
Sbjct: 150 DLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHL 209
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
S K + ++ + V GCG+ QTG + DGAAP+G+ GLGL D+SVPS+LAK G+ NS
Sbjct: 210 VSNDKSS--KAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANS 267
Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQAL 220
FS+CF + +G + FGD+G Q+ T L I + + Y + V +G + F A+
Sbjct: 268 FSMCFGNDGAGRISFGDKGSVDQRETP-LNIRQPHPTYNITVTKISVGGNT-GDLEFDAV 325
Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCYNAS-SEEMLKVPDMRL 277
DSG SFT+L Y + F+ L KR + ++YCY S +++ + P + L
Sbjct: 326 FDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAVNL 385
Query: 278 IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
S+ V + + P + V+CL +M + D IIGQNFM G+R+VFDRE L L
Sbjct: 386 TMKGGSSYPVYHPLVVIPMKDT-DVYCLAIMKIE-DISIIGQNFMTGYRVVFDREKLILG 443
Query: 338 WSHSKC 343
W S C
Sbjct: 444 WKESDC 449
>gi|52076082|dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa Japonica
Group]
Length = 476
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 125/363 (34%), Positives = 203/363 (55%), Gaps = 16/363 (4%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
Y P+ S++S+ V CS LC +++C+S + CPY Y +++TSSSG LV+D+L+L S S
Sbjct: 111 YSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDS 170
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
A V + ++ GCG+ QTGS+L AAP+G++GLG+ SVPSLLA GL NSFS+C
Sbjct: 171 --AQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMC 228
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
F ++ G + FGD G + Q+ T L + ++ Y + + +G+ ++ F A+VDSG
Sbjct: 229 FGDDGHGRINFGDTGSSDQKETP-LNVYKQNPYYNITITGITVGSKSISTE-FSAIVDSG 286
Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQ 283
SFT L +Y ++ FD + S R L + +++CY+ S+ ++ P++ L
Sbjct: 287 TSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGS 345
Query: 284 SFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 342
F V + I + +N V +CL +M ++G +IG+NFM G ++VFDRE + L W +
Sbjct: 346 IFPVNDPIITITDNAFNPVGYCLAIMKSEG-VNLIGENFMSGLKVVFDRERMVLGWKNFN 404
Query: 343 CEEVIDKSHVHLVPPPAGQSPNP-------LPTTEQQSTSNG-QAAAPPSTAKTAPSKSI 394
C + S + + P P+ P P + + NG Q PS + +S+
Sbjct: 405 CYNFDESSRLPVNPSPSAVPSKPGLGPSSYTPEAAKGALPNGTQVNVMPSASSPLQPQSV 464
Query: 395 AAS 397
+A+
Sbjct: 465 SAT 467
>gi|357159746|ref|XP_003578546.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 530
Score = 206 bits (523), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 131/385 (34%), Positives = 199/385 (51%), Gaps = 13/385 (3%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
D Y P SS+S+ V CS LC ++ C + + CPY Y +E+TSS G LV+D+L
Sbjct: 142 DLKFDMYSPRKSSTSRKVPCSSSLCDPQADCSAASNSCPYSIQYLSENTSSKGVLVEDVL 201
Query: 99 HLASFSKHAPQSSV-QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
+L + S QS + Q+ + GCG+ Q+GS+L AAP+G++GLG+ SVPSLLA G+
Sbjct: 202 YLTTESG---QSKITQAPITFGCGQVQSGSFLGSAAPNGLLGLGMDSKSVPSLLASKGIA 258
Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 217
NSFS+CF E+ G + FGD G + Q T L I ++ Y + + +G + F
Sbjct: 259 ANSFSMCFGEDGHGRINFGDTGSSDQLETP-LNIYKQNPYYNISITGAMVGGKSF-DTKF 316
Query: 218 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMR 276
A+VDSG SFT L +Y E+ F+ V R L + ++YCY+ S++ + P++
Sbjct: 317 SAVVDSGTSFTALSDPMYTEITSTFNAQVKESRKHLDASMPFEYCYSISAQGAVNPPNIS 376
Query: 277 LIFSKNQSFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 335
L F V I + + + +CL +M ++G +IG+NFM G +IVFDRE L
Sbjct: 377 LTAKGGSIFPVNGPIITITDTSSRPIAYCLAIMKSEG-VNLIGENFMSGLKIVFDRERLV 435
Query: 336 LAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIA 395
L W C + S + + P+ P P + + A+P T P S +
Sbjct: 436 LGWKTFNCYNFDNSSKLPVNRNPSADPPKPALGPSSSNPEAAKGASPNITQIDVPHSS-S 494
Query: 396 ASAQQLD---SVLRVACSLLVLMCL 417
+S +L + L +LL L L
Sbjct: 495 SSETRLHLSGTFLSATIALLFLAAL 519
>gi|356559246|ref|XP_003547911.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 516
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 110/309 (35%), Positives = 171/309 (55%), Gaps = 10/309 (3%)
Query: 42 LSEYDPSSSSSSKNVSCSH-PLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+ YD SS+S VSC++ C+ R C S C Y DY + DTSS G++V+D+LHL
Sbjct: 153 FNTYDLDKSSTSNEVSCNNSTFCRQRQQCPSAGSTCRYQVDYLSNDTSSRGFVVEDVLHL 212
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+ + + + GCG+ QTG +L+GAAP+G+ GLG+ ++SVPS+LA+ GLI NS
Sbjct: 213 ITDDDQTKDADTR--IAFGCGQVQTGVFLNGAAPNGLFGLGMDNISVPSILAREGLISNS 270
Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQAL 220
FS+CF + +G + FGD G Q+ T F + + + Y + + + +S + F A+
Sbjct: 271 FSMCFGSDSAGRITFGDTGSPDQRKTPF-NVRKLHPTYNITITKIIVEDS-VADLEFHAI 328
Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS----WKYCYNASSEEMLKVPDMR 276
DSG SFT++ Y + ++ V +KR S Q + YCY+ S + ++VP +
Sbjct: 329 FDSGTSFTYINDPAYTRIGEMYNSKVKAKRHSSQSPDSNIPFDYCYDISISQTIEVPFLN 388
Query: 277 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 336
L + V + I E + CL + +D IIGQNFM G++IVFDR+N+ L
Sbjct: 389 LTMKGGDDYYVMDPIIQVSSEEEGDLLCLGIQKSDS-VNIIGQNFMTGYKIVFDRDNMNL 447
Query: 337 AWSHSKCEE 345
W + C +
Sbjct: 448 GWKETNCSD 456
>gi|115480451|ref|NP_001063819.1| Os09g0542100 [Oryza sativa Japonica Group]
gi|113632052|dbj|BAF25733.1| Os09g0542100, partial [Oryza sativa Japonica Group]
Length = 490
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 124/363 (34%), Positives = 203/363 (55%), Gaps = 16/363 (4%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
Y P+ S++S+ V CS LC +++C+S + CPY Y +++TSSSG LV+D+L+L S S
Sbjct: 125 YSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDS 184
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
+ V + ++ GCG+ QTGS+L AAP+G++GLG+ SVPSLLA GL NSFS+C
Sbjct: 185 AQS--KIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMC 242
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
F ++ G + FGD G + Q+ T L + ++ Y + + +G+ ++ F A+VDSG
Sbjct: 243 FGDDGHGRINFGDTGSSDQKETP-LNVYKQNPYYNITITGITVGSKSISTE-FSAIVDSG 300
Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQ 283
SFT L +Y ++ FD + S R L + +++CY+ S+ ++ P++ L
Sbjct: 301 TSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGS 359
Query: 284 SFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 342
F V + I + +N V +CL +M ++G +IG+NFM G ++VFDRE + L W +
Sbjct: 360 IFPVNDPIITITDNAFNPVGYCLAIMKSEG-VNLIGENFMSGLKVVFDRERMVLGWKNFN 418
Query: 343 CEEVIDKSHVHLVPPPAGQSPNP-------LPTTEQQSTSNG-QAAAPPSTAKTAPSKSI 394
C + S + + P P+ P P + + NG Q PS + +S+
Sbjct: 419 CYNFDESSRLPVNPSPSAVPSKPGLGPSSYTPEAAKGALPNGTQVNVMPSASSPLQPQSV 478
Query: 395 AAS 397
+A+
Sbjct: 479 SAT 481
>gi|224033419|gb|ACN35785.1| unknown [Zea mays]
gi|413934980|gb|AFW69531.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 543
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 135/374 (36%), Positives = 193/374 (51%), Gaps = 20/374 (5%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILH 99
+L Y P SS+SK V+C +PLC R+ C + + CPY Y + +TSSSG LV D+LH
Sbjct: 156 SLRPYSPRRSSTSKQVACDNPLCGQRNGCSAATNGSCPYEVQYVSANTSSSGVLVQDVLH 215
Query: 100 LASF--SKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAKAG 155
L A ++Q+ V+ GCG+ QTG++LDG A DG+MGLG+G VSVPS LA +G
Sbjct: 216 LTRERPGPGAAGEALQAPVVFGCGQVQTGAFLDGGGGAVDGLMGLGMGKVSVPSALAASG 275
Query: 156 LI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ 214
L+ +SFS+CF ++ G V FGD G Q T F + Y V S +G+ +
Sbjct: 276 LVASDSFSMCFGDDGVGRVNFGDAGSRGQAETPFT-VRSLNPTYNVSFTSIGVGSESVAA 334
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-----WKYCYNASSEEM 269
F A++DSG SFT+L Y ++ KF+ VS +R++ S ++YCY S +
Sbjct: 335 E-FAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVNFSSGSADPFPFEYCYRLSPNQT 393
Query: 270 -LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYG--IIGQNFMMGH 325
+ +PD+ L F V + G V +CL +M D G IIGQNFM G
Sbjct: 394 EVAMPDVSLTAKGGALFPVTQPFIPVGDTTGRAVGYCLAIMRNDMAIGIDIIGQNFMTGL 453
Query: 326 RIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPST 385
++VFDRE L W C + P +P PT ++G + P
Sbjct: 454 KVVFDRERSVLGWEKFDCYRNARVADAPDGSPGPSSAPAAGPTKITPRQNDGSGSGYPGA 513
Query: 386 A---KTAPSKSIAA 396
A ++A S++ AA
Sbjct: 514 APLPRSAGSRNAAA 527
>gi|242094226|ref|XP_002437603.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
gi|241915826|gb|EER88970.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
Length = 541
Score = 204 bits (518), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 131/364 (35%), Positives = 188/364 (51%), Gaps = 33/364 (9%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILHL 100
L Y P SS+SK V+C + LC + C + + CPY Y + +TS+SG LV D+LHL
Sbjct: 158 LRPYSPRESSTSKQVTCDNALCDRPNGCSAATNGSCPYEVQYLSANTSTSGVLVQDVLHL 217
Query: 101 ASFSKHAPQSS------VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
++ P ++ +Q+ V+ GCG+ QTG++LDGAA DG+MGLG +VSVPS+LA +
Sbjct: 218 ---TRERPGAAAEAGEALQAPVVFGCGQVQTGTFLDGAAFDGLMGLGRENVSVPSVLASS 274
Query: 155 GLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF--VGVESYCIGNSC 211
GL+ +SFS+CF ++ G + FGD G + Q T F Y+ F V VE+ +
Sbjct: 275 GLVASDSFSMCFGDDGVGRINFGDSGSSGQGETPFTGRRTLYNVSFTAVNVETKSVA--- 331
Query: 212 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-----WKYCY--NA 264
+ F A++DSG SFT+L Y E+ F+ LV +R + S ++YCY
Sbjct: 332 ---AEFAAVIDSGTSFTYLADPEYTELATNFNSLVRERRTNFSSGSADPFPFEYCYALGP 388
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD--GDYGIIGQNFM 322
+ E L +PD+ L F V + +CL +M D ++ IIGQNFM
Sbjct: 389 NQTEAL-IPDVSLTTKGGARFPVTQPVIGVASGRTVVGYCLAIMKNDLGVNFNIIGQNFM 447
Query: 323 MGHRIVFDRENLKLAWSHSKC---EEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQA 379
G ++VFDRE L W C V D P PA P + + +SNG
Sbjct: 448 TGLKVVFDREKSVLGWEKFDCYKNARVADAPDGSPSPAPAAD-PTKITPRQNDGSSNGFP 506
Query: 380 AAPP 383
AA P
Sbjct: 507 AAAP 510
>gi|32526671|dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
sativa Japonica Group]
Length = 732
Score = 204 bits (518), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 117/323 (36%), Positives = 188/323 (58%), Gaps = 8/323 (2%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
Y P+ S++S+ V CS LC +++C+S + CPY Y +++TSSSG LV+D+L+L S S
Sbjct: 148 YSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDS 207
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
A V + ++ GCG+ QTGS+L AAP+G++GLG+ SVPSLLA GL NSFS+C
Sbjct: 208 --AQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMC 265
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
F ++ G + FGD G + Q+ T L + ++ Y + + +G+ ++ F A+VDSG
Sbjct: 266 FGDDGHGRINFGDTGSSDQKETP-LNVYKQNPYYNITITGITVGSKSISTE-FSAIVDSG 323
Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQ 283
SFT L +Y ++ FD + S R L + +++CY+ S+ ++ P++ L
Sbjct: 324 TSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGS 382
Query: 284 SFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 342
F V + I + +N V +CL +M ++G +IG+NFM G ++VFDRE + L W +
Sbjct: 383 IFPVNDPIITITDNAFNPVGYCLAIMKSEG-VNLIGENFMSGLKVVFDRERMVLGWKNFN 441
Query: 343 CEEVIDKSHVHLVPPPAGQSPNP 365
C + S + + P P+ P
Sbjct: 442 CYNFDESSRLPVNPSPSAVPSKP 464
>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 119/307 (38%), Positives = 176/307 (57%), Gaps = 11/307 (3%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+L+ Y P++SS+S V C+ LC C S + CPY Y + TSS+G LV+D+LHL
Sbjct: 150 DLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESNCPYQIRYLSNGTSSTGVLVEDVLHL 209
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
S K + ++ + V +GCG+ QTG + DGAAP+G+ GLGL D+SVPS+LAK G+ NS
Sbjct: 210 VSNDKSS--KAIPARVTLGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANS 267
Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI-GNSCLTQSGFQA 219
FS+CF + +G + FGD+G Q+ T L I + + Y + V + GN+ + F A
Sbjct: 268 FSMCFGNDGAGRISFGDKGSVDQRETP-LNIRQPHPTYNITVTKISVEGNTGDLE--FDA 324
Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCYNAS-SEEMLKVPDMR 276
+ DSG SFT+L Y + F+ L KR + ++YCY S +++ + P +
Sbjct: 325 VFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAVN 384
Query: 277 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 336
L S+ V + + P + V+CL ++ + D IIGQNFM G+R+VFDRE L L
Sbjct: 385 LTMKGGSSYPVYHPLVVIPMKDT-DVYCLAILKIE-DISIIGQNFMTGYRVVFDREKLIL 442
Query: 337 AWSHSKC 343
W S C
Sbjct: 443 GWKESDC 449
>gi|242050026|ref|XP_002462757.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
gi|241926134|gb|EER99278.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
Length = 523
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 120/330 (36%), Positives = 190/330 (57%), Gaps = 7/330 (2%)
Query: 38 QDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
+D Y P SS+S+ V CS LC +S+C+S CPY +Y +++TSS+G LV+D+
Sbjct: 146 RDLKFDTYSPQKSSTSRKVPCSSNLCDLQSACRSASSSCPYSIEYLSDNTSSTGVLVEDV 205
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
L+L +++ V + + GCGR QTGS+L AAP+G++GLG+ +SVPSLLA G+
Sbjct: 206 LYL--ITEYGQPKIVTAPITFGCGRIQTGSFLGSAAPNGLLGLGMDSISVPSLLASEGVA 263
Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 217
NSFS+CF ++ G + FGD G + QQ T L I ++ Y + + +G+ + F
Sbjct: 264 ANSFSMCFGDDGRGRINFGDTGSSDQQETP-LNIYKQNPYYNISITGAMVGSKSF-NTNF 321
Query: 218 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMR 276
A+VDSG SFT L +Y+E+ F+ V K L + +++CY+ S + + P++
Sbjct: 322 NAIVDSGTSFTALSDPMYSEITSSFNSQVQDKPTQLDSSLPFEFCYSISPKGSVNPPNIS 381
Query: 277 LIFSKNQSFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 335
L+ F V + I + ++ + +CL VM ++G +IG+NFM G ++VFDRE
Sbjct: 382 LMAKGGSIFPVNDPIITITDDASNPMAYCLAVMKSEG-VNLIGENFMSGLKVVFDRERKV 440
Query: 336 LAWSHSKCEEVIDKSHVHLVPPPAGQSPNP 365
L W C V + S++ + P P+G P P
Sbjct: 441 LGWKKFNCYSVDNSSNLPVNPNPSGVPPKP 470
>gi|226499286|ref|NP_001147826.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|195613980|gb|ACG28820.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 545
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 134/373 (35%), Positives = 192/373 (51%), Gaps = 20/373 (5%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILHL 100
L Y P SS+S+ V+C +PLC R+ C + + CPY Y + +TSSSG LV D+LHL
Sbjct: 159 LRPYSPRRSSTSEQVACDNPLCGRRNGCSAATNGSCPYEVQYVSANTSSSGVLVQDVLHL 218
Query: 101 ASF--SKHAPQSSVQSSVIIGCGRKQTGSYLD--GAAPDGVMGLGLGDVSVPSLLAKAGL 156
A ++Q+ V+ GCG+ QTG++LD G A DG+MGLG+G VSVPS LA +GL
Sbjct: 219 TRERPGPGAAGEALQAPVVFGCGQVQTGAFLDDGGGAVDGLMGLGMGKVSVPSALAASGL 278
Query: 157 I-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS 215
+ +SFS+CF ++ G V FGD G Q T F + Y V S IG+ +
Sbjct: 279 VASDSFSMCFGDDGVGRVNFGDAGSRGQAETPFT-VRSLNPTYNVSFTSIGIGSESVAAE 337
Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-----WKYCYNASSEEM- 269
F A++DSG SFT+L Y ++ KF+ VS +R++ S ++YCY S +
Sbjct: 338 -FAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVNFSSGSADPFPFEYCYRLSPNQTE 396
Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYG--IIGQNFMMGHR 326
+ +PD+ L F V + G + +CL +M D G IIGQNFM G +
Sbjct: 397 VAMPDVSLTAKGGALFPVTQPFIPVGDTTGRAIGYCLAIMRNDMAIGIDIIGQNFMTGLK 456
Query: 327 IVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTA 386
+VFDRE L W C + P +P PT ++G + P A
Sbjct: 457 VVFDRERSVLGWEKFDCYRNARVADAPDGSPGPSSAPAAGPTKITPRQNDGSGSGYPGAA 516
Query: 387 ---KTAPSKSIAA 396
++A S++ AA
Sbjct: 517 PLPRSAGSRNAAA 529
>gi|357517921|ref|XP_003629249.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523271|gb|AET03725.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 553
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 128/380 (33%), Positives = 194/380 (51%), Gaps = 43/380 (11%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
D +LS Y+P+ SS+SK V+C++ LC R+ C CPY+ Y + +TS+SG LV+D+L
Sbjct: 149 DFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSNCPYMVSYVSAETSTSGILVEDVL 208
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
HL + V+++VI GCG+ Q+GS+LD AAP+G+ GLG+ +SVPS+L++ G
Sbjct: 209 HLTQPDDN--HDLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTA 266
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
+SFS+CF + G + FGD+G Q T F + + Y + + +G + L F
Sbjct: 267 DSFSMCFGRDGIGRISFGDKGSLDQDETPF-NVNPSHPTYNITINQVRVGTT-LIDVEFT 324
Query: 219 ALVDSGASFTFLPTEIYAEV--------------------------VVKFDKLVSSKRIS 252
AL DSG SFT+L Y+ + +++F V +R
Sbjct: 325 ALFDSGTSFTYLVDPTYSRLSESVSDKICFHLARCYLKIKVTIEVFMLQFHSQVEDRRRP 384
Query: 253 LQGN-SWKYCYNASSEEMLK-VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST 310
+ YCY+ S + +P M L FVV + I + V+CL V+ +
Sbjct: 385 PDSRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIII-STQSELVYCLAVVKS 443
Query: 311 DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKS-------HVHLVPPPAGQSP 363
+ IIGQNFM G+R+VFDRE L L W S C ++ D + H VPP
Sbjct: 444 -AELNIIGQNFMTGYRVVFDREKLILGWKKSDCYDIEDHNNAIPIGQHSDKVPPAVAAGL 502
Query: 364 NPLPTTE--QQSTSNGQAAA 381
PTT+ ++S N Q ++
Sbjct: 503 GDYPTTDSSRKSKYNSQHSS 522
>gi|25347778|pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
Length = 473
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 118/315 (37%), Positives = 173/315 (54%), Gaps = 18/315 (5%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+L+ Y P++SS+S V C+ LC C S + CPY Y + TSS+G LV+D+LHL
Sbjct: 101 DLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHL 160
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
S K + ++ + V GCG+ QTG + DGAAP+G+ GLGL D+SVPS+LAK G+ NS
Sbjct: 161 VSNDKSS--KAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANS 218
Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQAL 220
FS+CF + +G + FGD+G Q+ T L I + + Y + V +G + F A+
Sbjct: 219 FSMCFGNDGAGRISFGDKGSVDQRETP-LNIRQPHPTYNITVTKISVGGNT-GDLEFDAV 276
Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCY----------NASSEE 268
DSG SFT+L Y + F+ L KR + ++YCY + +++
Sbjct: 277 FDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALRLPLYSGHHHPNKD 336
Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
+ P + L S+ V + + P + V+CL +M + D IIGQNFM G+R+V
Sbjct: 337 SFQYPAVNLTMKGGSSYPVYHPLVVIPMKDT-DVYCLAIMKIE-DISIIGQNFMTGYRVV 394
Query: 329 FDRENLKLAWSHSKC 343
FDRE L L W S C
Sbjct: 395 FDREKLILGWKESDC 409
>gi|242072510|ref|XP_002446191.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
gi|241937374|gb|EES10519.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
Length = 499
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 118/303 (38%), Positives = 174/303 (57%), Gaps = 12/303 (3%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
Y P SS+SK V C+ C + C + CPY Y + TSSSG+LV+D+L+L++ +
Sbjct: 155 YIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKMVYVSAGTSSSGFLVEDVLYLSTEN 213
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
H PQ +++ +++GCG+ QTGS+LD AAP+G+ GLG+ +VSVPS+LA+ GL NSFS+C
Sbjct: 214 AH-PQI-LKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMC 271
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
F + G + FGDQG + Q+ T L I +++ Y + + IGN T F + D+G
Sbjct: 272 FGRDGIGRISFGDQGSSDQEETP-LNINQQHPTYAITISGITIGNKP-TDLDFITIFDTG 329
Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPDMRLIFSKN 282
SFT+L Y + F V + R + ++YCY+ +SSE +PD+ L
Sbjct: 330 TSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFPIPDIILRTVSG 389
Query: 283 QSFVVRN--HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 340
F V + + S E+E V+CL ++ + IIGQNFM G R+VFDRE L W
Sbjct: 390 SLFPVIDPGQVISIQEHE--YVYCLAIVKSR-KLNIIGQNFMTGLRVVFDRERKILGWKK 446
Query: 341 SKC 343
C
Sbjct: 447 FNC 449
>gi|356496606|ref|XP_003517157.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 508
Score = 198 bits (503), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 124/376 (32%), Positives = 199/376 (52%), Gaps = 21/376 (5%)
Query: 28 CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 87
C+ G S + + YD SS+S+ V C+ LC+ + C S CPY +Y + T
Sbjct: 134 CVHGIGLSNGEKIAFNIYDLKGSSTSQPVLCNSSLCELQRQCPSSDTICPYEVNYLSNGT 193
Query: 88 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 147
S++G+LV+D+LHL + + + + GCG+ QTG++LDGAAP+G+ GLG+ + SV
Sbjct: 194 STTGFLVEDVLHLITDDDKTKDADTR--ITFGCGQVQTGAFLDGAAPNGLFGLGMSNESV 251
Query: 148 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 207
PS+LAK GL NSFS+CF + G + FGD Q T F + + Y + V +
Sbjct: 252 PSILAKEGLTSNSFSMCFGSDGLGRITFGDNSSLVQGKTPF-NLRALHPTYNITVTQIIV 310
Query: 208 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNA 264
G + F A+ DSG SFT+L Y ++ F+ + +R S ++ ++YCY
Sbjct: 311 GEK-VDDLEFHAIFDSGTSFTYLNDPAYKQITNSFNSEIKLQRHSTSSSNELPFEYCYEL 369
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
S + +++ + L +++V + I + EG + CL V+ ++ + IIGQNFM G
Sbjct: 370 SPNQTVEL-SINLTMKGGDNYLVTDPIVTV-SGEGINLLCLGVLKSN-NVNIIGQNFMTG 426
Query: 325 HRIVFDRENLKLAWSHSKC--EEV----IDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQ 378
+RIVFDREN+ L W S C +E+ I++S+ + P +P + S SN
Sbjct: 427 YRIVFDRENMILGWRESNCYDDELSTLPINRSNTPAISPAIAVNPE-----ARSSQSNNP 481
Query: 379 AAAPPSTAKTAPSKSI 394
+P + K P+ +
Sbjct: 482 VLSPNLSFKIKPTSAF 497
>gi|125546587|gb|EAY92726.1| hypothetical protein OsI_14476 [Oryza sativa Indica Group]
Length = 530
Score = 197 bits (502), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 113/306 (36%), Positives = 175/306 (57%), Gaps = 12/306 (3%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
Y PS SS+S+ V C+ C+ R C + CPY Y + DTSSSG+LV+D+L+L++
Sbjct: 163 YIPSMSSTSQAVPCNSQFCELRKECSTTSQ-CPYKMVYVSADTSSSGFLVEDVLYLST-- 219
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
+ A +++ ++ GCG+ QTGS+LD AAP+G+ GLG+ +S+PS+LA+ GL NSF++C
Sbjct: 220 EDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMC 279
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
F + G + FGDQG + Q+ T L + ++ Y + + +GNS LT F + D+G
Sbjct: 280 FSRDGIGRISFGDQGSSDQEETP-LDVNPQHPTYTISISEMTVGNS-LTDLEFSTIFDTG 337
Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPDMRLIFSKN 282
SFT+L Y + F V + R + ++YCY+ +SSE+ ++ P + L
Sbjct: 338 TSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQTPSISLRTVGG 397
Query: 283 QSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 340
F V + S ++E V+CL ++ + IIGQNFM G R+VFDRE L W
Sbjct: 398 SVFPVIDEGQVISIQQHE--YVYCLAIVKS-AKLNIIGQNFMTGLRVVFDRERKILGWKK 454
Query: 341 SKCEEV 346
C +
Sbjct: 455 FNCYDT 460
>gi|116308959|emb|CAH66084.1| H0209A05.1 [Oryza sativa Indica Group]
Length = 530
Score = 197 bits (502), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 113/306 (36%), Positives = 175/306 (57%), Gaps = 12/306 (3%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
Y PS SS+S+ V C+ C+ R C + CPY Y + DTSSSG+LV+D+L+L++
Sbjct: 163 YIPSMSSTSQAVPCNSQFCELRKECSTTSQ-CPYKMVYVSADTSSSGFLVEDVLYLST-- 219
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
+ A +++ ++ GCG+ QTGS+LD AAP+G+ GLG+ +S+PS+LA+ GL NSF++C
Sbjct: 220 EDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMC 279
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
F + G + FGDQG + Q+ T L + ++ Y + + +GNS LT F + D+G
Sbjct: 280 FSRDGIGRISFGDQGSSDQEETP-LDVNPQHPTYTISISEITVGNS-LTDLEFSTIFDTG 337
Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPDMRLIFSKN 282
SFT+L Y + F V + R + ++YCY+ +SSE+ ++ P + L
Sbjct: 338 TSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQTPSISLRTVGG 397
Query: 283 QSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 340
F V + S ++E V+CL ++ + IIGQNFM G R+VFDRE L W
Sbjct: 398 SVFPVIDEGQVISIQQHE--YVYCLAIVKS-AKLNIIGQNFMTGLRVVFDRERKILGWKK 454
Query: 341 SKCEEV 346
C +
Sbjct: 455 FNCYDT 460
>gi|115457374|ref|NP_001052287.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|38346027|emb|CAE01958.2| OSJNBb0071D01.4 [Oryza sativa Japonica Group]
gi|113563858|dbj|BAF14201.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|215740420|dbj|BAG97076.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222626225|gb|EEE60357.1| hypothetical protein OsJ_13479 [Oryza sativa Japonica Group]
Length = 530
Score = 197 bits (502), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 113/306 (36%), Positives = 175/306 (57%), Gaps = 12/306 (3%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
Y PS SS+S+ V C+ C+ R C + CPY Y + DTSSSG+LV+D+L+L++
Sbjct: 163 YIPSMSSTSQAVPCNSQFCELRKECSTTSQ-CPYKMVYVSADTSSSGFLVEDVLYLST-- 219
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
+ A +++ ++ GCG+ QTGS+LD AAP+G+ GLG+ +S+PS+LA+ GL NSF++C
Sbjct: 220 EDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMC 279
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
F + G + FGDQG + Q+ T L + ++ Y + + +GNS LT F + D+G
Sbjct: 280 FSRDGIGRISFGDQGSSDQEETP-LDVNPQHPTYTISISEITVGNS-LTDLEFSTIFDTG 337
Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPDMRLIFSKN 282
SFT+L Y + F V + R + ++YCY+ +SSE+ ++ P + L
Sbjct: 338 TSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQTPSISLRTVGG 397
Query: 283 QSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 340
F V + S ++E V+CL ++ + IIGQNFM G R+VFDRE L W
Sbjct: 398 SVFPVIDEGQVISIQQHE--YVYCLAIVKS-AKLNIIGQNFMTGLRVVFDRERKILGWKK 454
Query: 341 SKCEEV 346
C +
Sbjct: 455 FNCYDT 460
>gi|326500240|dbj|BAK06209.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 194 bits (494), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 132/379 (34%), Positives = 197/379 (51%), Gaps = 28/379 (7%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
Y PS SS+S+ V C+ C R C S CPY Y + DTSSSG+LV+D+L+L++
Sbjct: 146 YIPSLSSTSQAVPCNSDFCGLRKEC-SKTSSCPYKMVYVSADTSSSGFLVEDVLYLSTED 204
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
H PQ +++ ++ GCG QTGS+LD AAP+G+ GLG+ +SVPS+LA+ GL NSFS+C
Sbjct: 205 TH-PQF-LKAQIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQKGLTSNSFSMC 262
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
F + G + FGDQG + Q+ T L I +K+ Y + + +GN+ L + D+G
Sbjct: 263 FGRDGIGRISFGDQGSSDQEETP-LDINQKHPTYAITITGIAVGNN-LMDLEVSTIFDTG 320
Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPDMRLIFSKN 282
SFT+L Y + F V + R + ++YCY+ +SSE ++ P + L
Sbjct: 321 TSFTYLADPAYTYITDGFHSQVQANRHAADSRIPFEYCYDLSSSEARIQTPSISLRTVGG 380
Query: 283 QSF--VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 340
F + + S ++E V+CL ++ + IIGQNFM G R+VFDRE L W
Sbjct: 381 SLFPAIDPGQVISIQQHE--YVYCLAIVKST-KLNIIGQNFMTGVRVVFDRERKILGWKK 437
Query: 341 SKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAASAQQ 400
C + S NPL + ST + +P T A + + +
Sbjct: 438 FNCYDT--------------DSLNPLSINSRNSTP--ENYSPQETKNPAGASQLGHVSSS 481
Query: 401 LDSVLRVACSLLVLMCLLL 419
V SLL++M +LL
Sbjct: 482 PPLVWWHNNSLLLMMFVLL 500
>gi|326499199|dbj|BAK06090.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 194 bits (493), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 132/379 (34%), Positives = 197/379 (51%), Gaps = 28/379 (7%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
Y PS SS+S+ V C+ C R C S CPY Y + DTSSSG+LV+D+L+L++
Sbjct: 146 YIPSLSSTSQAVPCNSDFCGLRKEC-SKTSSCPYKMVYVSADTSSSGFLVEDVLYLSTED 204
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
H PQ +++ ++ GCG QTGS+LD AAP+G+ GLG+ +SVPS+LA+ GL NSFS+C
Sbjct: 205 TH-PQF-LKAQIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQKGLTSNSFSMC 262
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
F + G + FGDQG + Q+ T L I +K+ Y + + +GN+ L + D+G
Sbjct: 263 FGRDGIGRISFGDQGSSDQEETP-LDINQKHPTYAITITGIAVGNN-LMDLEVSTIFDTG 320
Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPDMRLIFSKN 282
SFT+L Y + F V + R + ++YCY+ +SSE ++ P + L
Sbjct: 321 TSFTYLADPAYTYITDGFHSQVQANRHAADSRIPFEYCYDLSSSEARIQTPSISLRTVGG 380
Query: 283 QSF--VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 340
F + + S ++E V+CL ++ + IIGQNFM G R+VFDRE L W
Sbjct: 381 SLFPAIDPGQVISIQQHE--YVYCLAIVKST-KLNIIGQNFMTGVRVVFDRERKILGWKK 437
Query: 341 SKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAASAQQ 400
C + S NPL + ST + +P T A + + +
Sbjct: 438 FNCYDT--------------DSLNPLSINSRNSTP--ENYSPQETKNPAGASQLRHVSSS 481
Query: 401 LDSVLRVACSLLVLMCLLL 419
V SLL++M +LL
Sbjct: 482 PPLVWWHNNSLLLMMFVLL 500
>gi|226501154|ref|NP_001146408.1| uncharacterized protein LOC100279988 [Zea mays]
gi|219887047|gb|ACL53898.1| unknown [Zea mays]
gi|414587777|tpg|DAA38348.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 416
Score = 194 bits (492), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 116/306 (37%), Positives = 173/306 (56%), Gaps = 12/306 (3%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
Y P SS+SK V C+ C + C + CPY Y + TSSSG+LV+D+L+L++ +
Sbjct: 54 YIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKMVYVSAGTSSSGFLVEDVLYLSTEN 112
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
H PQ +++ +++GCG+ QTGS+LD AAP+G+ GLG+ +VSVPS+LA+ GL NSFS+C
Sbjct: 113 AH-PQI-LKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMC 170
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
F + G + FGDQ + Q+ T L I ++ Y + + +GN T F + D+G
Sbjct: 171 FGRDGIGRISFGDQESSDQEETP-LDINRQHPTYAITISGITVGNK-PTDMDFITIFDTG 228
Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPDMRLIFSKN 282
SFT+L Y + F V + R + ++YCY+ +SSE +PD+ L
Sbjct: 229 TSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFPIPDIILRTVTG 288
Query: 283 QSFVVRN--HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 340
F V + + S E+E V+CL ++ + IIGQNFM G R+VFDRE L W
Sbjct: 289 SMFPVIDPGQVISIQEHE--YVYCLAIVKS-MKLNIIGQNFMTGLRVVFDRERKILGWKK 345
Query: 341 SKCEEV 346
C +
Sbjct: 346 FNCYDT 351
>gi|194700652|gb|ACF84410.1| unknown [Zea mays]
gi|414587775|tpg|DAA38346.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 500
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 116/303 (38%), Positives = 172/303 (56%), Gaps = 12/303 (3%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
Y P SS+SK V C+ C + C + CPY Y + TSSSG+LV+D+L+L++ +
Sbjct: 156 YIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKMVYVSAGTSSSGFLVEDVLYLSTEN 214
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
H PQ +++ +++GCG+ QTGS+LD AAP+G+ GLG+ +VSVPS+LA+ GL NSFS+C
Sbjct: 215 AH-PQI-LKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMC 272
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
F + G + FGDQ + Q+ T L I ++ Y + + +GN T F + D+G
Sbjct: 273 FGRDGIGRISFGDQESSDQEETP-LDINRQHPTYAITISGITVGNKP-TDMDFITIFDTG 330
Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPDMRLIFSKN 282
SFT+L Y + F V + R + ++YCY+ +SSE +PD+ L
Sbjct: 331 TSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFPIPDIILRTVTG 390
Query: 283 QSFVVRN--HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 340
F V + + S E+E V+CL ++ + IIGQNFM G R+VFDRE L W
Sbjct: 391 SMFPVIDPGQVISIQEHE--YVYCLAIVKSM-KLNIIGQNFMTGLRVVFDRERKILGWKK 447
Query: 341 SKC 343
C
Sbjct: 448 FNC 450
>gi|414587774|tpg|DAA38345.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 520
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 116/306 (37%), Positives = 173/306 (56%), Gaps = 12/306 (3%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
Y P SS+SK V C+ C + C + CPY Y + TSSSG+LV+D+L+L++ +
Sbjct: 158 YIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKMVYVSAGTSSSGFLVEDVLYLSTEN 216
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
H PQ +++ +++GCG+ QTGS+LD AAP+G+ GLG+ +VSVPS+LA+ GL NSFS+C
Sbjct: 217 AH-PQI-LKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMC 274
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
F + G + FGDQ + Q+ T L I ++ Y + + +GN T F + D+G
Sbjct: 275 FGRDGIGRISFGDQESSDQEETP-LDINRQHPTYAITISGITVGNKP-TDMDFITIFDTG 332
Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPDMRLIFSKN 282
SFT+L Y + F V + R + ++YCY+ +SSE +PD+ L
Sbjct: 333 TSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFPIPDIILRTVTG 392
Query: 283 QSFVVRN--HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 340
F V + + S E+E V+CL ++ + IIGQNFM G R+VFDRE L W
Sbjct: 393 SMFPVIDPGQVISIQEHE--YVYCLAIVKS-MKLNIIGQNFMTGLRVVFDRERKILGWKK 449
Query: 341 SKCEEV 346
C +
Sbjct: 450 FNCYDT 455
>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 508
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 109/304 (35%), Positives = 164/304 (53%), Gaps = 9/304 (2%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
L+ Y ++SS+S V CS LC+ + C S K CPY Y +E++SS+GYLV DILH+A
Sbjct: 151 LNHYSSNASSTSIRVPCSSSLCELANQCSSNKSSCPYQTHYLSENSSSAGYLVQDILHMA 210
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
+ + V V +GCG+ QTG + + AP+G++GLG+G VSVPS LA GL +SF
Sbjct: 211 T--DDSQLKPVDVKVTLGCGKVQTGKFSNVTAPNGLIGLGMGKVSVPSFLASQGLTTDSF 268
Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV 221
S+CF G + FGD GP Q+ T F P Y+ + + I + T A++
Sbjct: 269 SMCFGYYGYGRIDFGDIGPVGQRETPFNPASLSYNVTILQI----IVTNRPTNVHLTAII 324
Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFS 280
DSGASFT+L Y+ + D + +RI + ++YCY S + + P++
Sbjct: 325 DSGASFTYLTDPFYSIITENMDAAMELERIKSDSDFPFEYCYRLSLATIFQQPNLNFTME 384
Query: 281 KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 340
+ F V S ++G CL ++ + D +IG NF G+R+VF+RE + L W
Sbjct: 385 GGRKFDVITSYVSVDTDDG-PALCLAIVKST-DINVIGHNFFGGYRVVFNREKMTLGWKE 442
Query: 341 SKCE 344
C+
Sbjct: 443 VDCD 446
>gi|357168101|ref|XP_003581483.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 510
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 135/387 (34%), Positives = 201/387 (51%), Gaps = 35/387 (9%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
S Y PS SS+S+ V C+ C R C S CPY Y + DTSSSG+LV+D+L+L++
Sbjct: 147 SFYIPSMSSTSQAVPCNSDFCDHRKDC-STTSSCPYKMVYVSADTSSSGFLVEDVLYLST 205
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
H PQ +++ ++ GCG+ QTGS+LD AAP+G+ GLG+ +SVPS+LA GL +SFS
Sbjct: 206 EDNH-PQI-LKAQIMFGCGQVQTGSFLDAAAPNGLFGLGIDMISVPSILAHKGLTSDSFS 263
Query: 163 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVD 222
+CF + G + FGDQG + Q+ T L I +K+ Y + + +G + F + D
Sbjct: 264 MCFGRDGIGRISFGDQGSSDQEETP-LDINQKHPTYAITITGITVGTEPMDLE-FSTIFD 321
Query: 223 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYN-ASSEEMLKVPDMRLIFS 280
+G +FT+L Y + F V + R + ++YCY+ +SSE ++ P +
Sbjct: 322 TGTTFTYLADPAYTYITQSFHTQVRANRHAADTRIPFEYCYDLSSSEARIQTPGVSFRTV 381
Query: 281 KNQSFVVRN--HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
F V + + S ++E V+CL ++ + IIGQNFM G R+VFDRE L W
Sbjct: 382 GGSLFPVIDLGQVISIQQHE--YVYCLAIVKST-KLNIIGQNFMTGVRVVFDRERKILGW 438
Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAASA 398
C + S NPL + S+ PST +K+ A +
Sbjct: 439 KKFNCYDT--------------DSTNPLSINSRNSS-----GFSPSTYSPQETKNPAGAT 479
Query: 399 Q--QLDSVLRVAC--SLLVLMCLLLSS 421
Q L+S V + LVLM LL+ S
Sbjct: 480 QLRHLNSSPPVMWHNNSLVLMFLLVHS 506
>gi|357117138|ref|XP_003560331.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Brachypodium distachyon]
Length = 509
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 126/363 (34%), Positives = 193/363 (53%), Gaps = 22/363 (6%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
L Y P SS+SK V+CSH LC ++C + CPY Y + +TSSSG LV+D+L++
Sbjct: 126 LKPYSPRQSSTSKPVTCSHSLCDRPNACGNGNGSCPYTVKYVSANTSSSGVLVEDVLYMT 185
Query: 102 SFSKHAPQ-------SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
S + +V + V+ GCG++QTG++LDGAA +G++GLG+ VSVPSLLA A
Sbjct: 186 RQSSSSRSGNGGNVGEAVGARVVFGCGQEQTGAFLDGAAMEGLLGLGMDRVSVPSLLAAA 245
Query: 155 GLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
GL+ +SFS+CF + +G + FG+ A Q+ + + + Y + V + +
Sbjct: 246 GLVGSDSFSMCFSPDGNGRINFGEPSDAGAQNETPFIVSKTRPTYNISVTAVNVKGKGAM 305
Query: 214 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEM-LK 271
+ F A+VDSG SFT+L Y+ + F+ V KR +L + ++YCY S + +
Sbjct: 306 AAEFAAVVDSGTSFTYLNDPAYSLLATSFNSQVREKRANLSASIPFEYCYALSRGQTEVL 365
Query: 272 VPDMRLIFSKNQSF-VVRNHIFSFPENEGFTV----FCLTVMSTDGDYGIIGQNFMMGHR 326
+P++ L F V R + E V +CL V +D IIGQNFM G +
Sbjct: 366 MPEVSLTTRGGAVFPVTRPFVIVAGETTDGQVHAVGYCLAVFKSDIPIDIIGQNFMTGLK 425
Query: 327 IVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTE---QQSTSNGQAAAPP 383
+VFDR+ L W+ C + + V PA +P P+P T+ +QS + A P
Sbjct: 426 VVFDRQRSVLGWTKFDCYKNM---KVEDDGSPAA-APGPMPVTQLRPRQSDTPFPGAVQP 481
Query: 384 STA 386
+A
Sbjct: 482 RSA 484
>gi|195647908|gb|ACG43422.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|414587776|tpg|DAA38347.1| TPA: aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 498
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 115/302 (38%), Positives = 170/302 (56%), Gaps = 12/302 (3%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
Y P SS+SK V C+ C + C + CPY Y + TSSSG+LV+D+L+L++ +
Sbjct: 156 YIPGMSSTSKAVPCNSNFCDLQKECSTALQ-CPYKMVYVSAGTSSSGFLVEDVLYLSTEN 214
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
H PQ +++ +++GCG+ QTGS+LD AAP+G+ GLG+ +VSVPS+LA+ GL NSFS+C
Sbjct: 215 AH-PQI-LKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMC 272
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
F + G + FGDQ + Q+ T L I ++ Y + + +GN T F + D+G
Sbjct: 273 FGRDGIGRISFGDQESSDQEETP-LDINRQHPTYAITISGITVGNKP-TDMDFITIFDTG 330
Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQ 283
SFT+L Y + F V + R + ++YCY+ SE +PD+ L
Sbjct: 331 TSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDL-SEARFPIPDIILRTVTGS 389
Query: 284 SFVVRN--HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 341
F V + + S E+E V+CL ++ + IIGQNFM G R+VFDRE L W
Sbjct: 390 MFPVIDPGQVISIQEHE--YVYCLAIVKSM-KLNIIGQNFMTGLRVVFDRERKILGWKKF 446
Query: 342 KC 343
C
Sbjct: 447 NC 448
>gi|125556778|gb|EAZ02384.1| hypothetical protein OsI_24487 [Oryza sativa Indica Group]
Length = 551
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 132/379 (34%), Positives = 203/379 (53%), Gaps = 41/379 (10%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
L +Y PS SS+SK V+C+ LC ++C + CPY Y+ +TSSSG LV+D+L+L
Sbjct: 155 LRQYSPSKSSTSKTVTCASNLCDQPNACATATSSCPYAVRYAMANTSSSGELVEDVLYLT 214
Query: 102 ---SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
+ A ++V++ V+ GCG+ QTGS+LDGAA DG+MGLG+ VSVPS+LA G+++
Sbjct: 215 REKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDGAAADGLMGLGMEKVSVPSILASTGVVK 274
Query: 159 -NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 217
NSFS+CF ++ G + FGD G A Q T F+ + + Y + + S +G+ L GF
Sbjct: 275 SNSFSMCFSKDGLGRINFGDTGSADQSETPFI-VKSTHSYYNISITSMSVGDKNLPL-GF 332
Query: 218 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS------WKYCYNASSEE-ML 270
A+ DSG SFT+L Y F+ +S +R + G++ ++YCY+ S ++ +
Sbjct: 333 YAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPFPFEYCYSLSPDQTTV 392
Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFP---ENEGFTV--FCLTVMSTDGDYGIIGQNFMMGH 325
++P + L + F V + ++ N + +CL V+ +D IIGQNFM G
Sbjct: 393 ELPIVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGYCLAVIKSDLPIDIIGQNFMTGL 452
Query: 326 RIVFDRENLKLAWSHSKC---EEVIDK--------------SHVHLVP----PPAGQSPN 364
++VF+RE L W C E++ D +HV P PAG++P
Sbjct: 453 KVVFNREKSVLGWQKFDCYKDEKMTDDGSSVGSPSPSPGPTTHVFPQPQESDSPAGRTPI 512
Query: 365 P--LPTTEQQSTSNGQAAA 381
P P S + G A
Sbjct: 513 PGAAPVPRSSSAAAGGRAG 531
>gi|18855042|gb|AAL79734.1|AC091774_25 putative chloroplast nucleoid DNA-binding protein [Oryza sativa
Japonica Group]
gi|54291046|dbj|BAD61723.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
Group]
gi|125598520|gb|EAZ38300.1| hypothetical protein OsJ_22678 [Oryza sativa Japonica Group]
Length = 551
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 132/379 (34%), Positives = 203/379 (53%), Gaps = 41/379 (10%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
L +Y PS SS+SK V+C+ LC ++C + CPY Y+ +TSSSG LV+D+L+L
Sbjct: 155 LRQYSPSKSSTSKTVTCASNLCDQPNACATATSSCPYAVRYAMANTSSSGELVEDVLYLT 214
Query: 102 ---SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
+ A ++V++ V+ GCG+ QTGS+LDGAA DG+MGLG+ VSVPS+LA G+++
Sbjct: 215 REKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDGAAADGLMGLGMEKVSVPSILASTGVVK 274
Query: 159 -NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 217
NSFS+CF ++ G + FGD G A Q T F+ + + Y + + S +G+ L GF
Sbjct: 275 SNSFSMCFSKDGLGRINFGDTGSADQSETPFI-VKSTHSYYNISITSMSVGDKNLPL-GF 332
Query: 218 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS------WKYCYNASSEE-ML 270
A+ DSG SFT+L Y F+ +S +R + G++ ++YCY+ S ++ +
Sbjct: 333 YAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPFPFEYCYSLSPDQTTV 392
Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFP---ENEGFTV--FCLTVMSTDGDYGIIGQNFMMGH 325
++P + L + F V + ++ N + +CL V+ +D IIGQNFM G
Sbjct: 393 ELPVVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGYCLAVIKSDLPIDIIGQNFMTGL 452
Query: 326 RIVFDRENLKLAWSHSKC---EEVIDK--------------SHVHLVP----PPAGQSPN 364
++VF+RE L W C E++ D +HV P PAG++P
Sbjct: 453 KVVFNREKSVLGWQKFDCYKDEKMTDDGSSVGSPSPSPGPTTHVFPQPQESDSPAGRTPI 512
Query: 365 P--LPTTEQQSTSNGQAAA 381
P P S + G A
Sbjct: 513 PGAAPVPRSSSAAAGGRAG 531
>gi|388505672|gb|AFK40902.1| unknown [Lotus japonicus]
Length = 207
Score = 181 bits (460), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 95/207 (45%), Positives = 130/207 (62%), Gaps = 3/207 (1%)
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 274
+ F+A VDSG SFTFLP Y + +FDK V++ R S +G+ W+YCY +SSE++ KVP
Sbjct: 2 TSFKAQVDSGTSFTFLPGHAYGAITEEFDKQVNASRSSFEGSPWEYCYPSSSEQLPKVPS 61
Query: 275 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 334
+ L+F +N SFVV N +F+F +N+G FCL + T+GD G IGQNFM G+R+VFDREN
Sbjct: 62 LTLMFQQNNSFVVYNPVFTFYDNQGVVGFCLAIQPTEGDMGTIGQNFMTGYRLVFDRENK 121
Query: 335 KLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSI 394
LAWS S C+++ + L PP S PLPT EQQ T NG A AP + +P S
Sbjct: 122 NLAWSPSNCQDLSLGKRMPLSPPNKTSS-APLPTDEQQRT-NGHAVAPAIAGRASPKPS- 178
Query: 395 AASAQQLDSVLRVACSLLVLMCLLLSS 421
AA ++ + + S L+ LLS+
Sbjct: 179 AAPSRIISCQVHYWHSYWFLLFQLLSA 205
>gi|147839328|emb|CAN63378.1| hypothetical protein VITISV_015700 [Vitis vinifera]
Length = 585
Score = 181 bits (458), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 120/332 (36%), Positives = 167/332 (50%), Gaps = 68/332 (20%)
Query: 33 GASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGY 92
G + D LS Y+P SS+S+ V+C++ LC R+ C CPY+ Y + +TS+SG
Sbjct: 141 GTTYASDFELSIYNPKGSSTSRKVTCNNSLCAHRNRCLGTFSNCPYMVSYVSAETSTSGI 200
Query: 93 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
LV+D+LHL + + Q V++ V GCG+ QTGS+LD AAP+G+ GLGL +SVPS+L+
Sbjct: 201 LVEDVLHLTT--EDNRQEFVEAYVTFGCGQVQTGSFLDIAAPNGLFGLGLEKISVPSILS 258
Query: 153 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
K G +SFS+CF + G + FGD+G Q+ T F + + Y + V +G + L
Sbjct: 259 KEGFTADSFSMCFGPDGIGRISFGDKGGPDQEETPF-NLNALHPTYNITVTQVRVGTT-L 316
Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
F AL DSG SFT+L IY V L SS+ I YC
Sbjct: 317 IDLDFTALFDSGTSFTYLVDPIYTNV------LKSSELI--------YCMA--------- 353
Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRE 332
VVR+ + IIGQNFM G+RI+FDRE
Sbjct: 354 -------------VVRS----------------------AELNIIGQNFMTGYRIIFDRE 378
Query: 333 NLKLAWSHSKCEEVIDKSHVHLVP-----PPA 359
L L W +C++ I+ S V + P PPA
Sbjct: 379 KLVLGWKEFECDD-IENSSVPIRPRATSVPPA 409
>gi|42565826|ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645261|gb|AEE78782.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 528
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 109/315 (34%), Positives = 171/315 (54%), Gaps = 10/315 (3%)
Query: 36 IVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+ Q L+ Y P++S++S ++ CS C C S CPY YS T + G L+
Sbjct: 145 VPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPSSICPYQISYSNS-TGTKGTLLQ 203
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
D+LHLA+ ++ + V+++V +GCG+KQTG + + +GV+GLG+ SVPSLLAKA
Sbjct: 204 DVLHLATEDENL--TPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKAN 261
Query: 156 LIQNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
+ NSFS+CF + G + FGD+G Q+ T F+ + AY V + + +
Sbjct: 262 ITANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFISVAPS-TAYGVNISGVSVAGDPVD 320
Query: 214 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS-SEEMLK 271
F A D+G+SFT L Y + FD+LV +R + +++CY+ S + ++
Sbjct: 321 IRLF-AKFDTGSSFTHLREPAYGVLTKSFDELVEDRRRPVDPELPFEFCYDLSPNATTIQ 379
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFD 330
P + + F ++ N F+ EG ++CL V+ + G +IGQNF+ G+RIVFD
Sbjct: 380 FPLVEMTFIGGSKIILNNPFFTARTQEGNVMYCLGVLKSVGLKINVIGQNFVAGYRIVFD 439
Query: 331 RENLKLAWSHSKCEE 345
RE + L W S C E
Sbjct: 440 RERMILGWKQSLCFE 454
>gi|449517142|ref|XP_004165605.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Cucumis sativus]
Length = 430
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 119/335 (35%), Positives = 173/335 (51%), Gaps = 29/335 (8%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
L+ Y P+ S++S V C+ LC + C S ++ CPY Y + +TSS GYLV+D+LHLA
Sbjct: 3 LNHYSPNDSTTSSTVPCTSSLC---NRCTSNQNVCPYEMRYLSANTSSIGYLVEDVLHLA 59
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
+ + V++ + GCG QTG + AAP+G++GLG+ +SVPS LA GL NSF
Sbjct: 60 T--DDSLLKPVEAKITFGCGTVQTGIFATTAAPNGLIGLGMEKISVPSFLADQGLTSNSF 117
Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV 221
S+CF + G + FGD GPA Q+ T F + E Y +Y V +G F A+
Sbjct: 118 SMCFGADGYGRIDFGDTGPADQKQTPFNTMLE-YQSYNVTFNVINVGGEP-NDVPFTAIF 175
Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCYN-ASSEEMLKVPDMRLI 278
DSG SFT+L Y+ + + D + KR SL G + ++YCY + + +
Sbjct: 176 DSGTSFTYLTEPAYSTITKQMDAGMKLKRYSLFGPNFPFEYCYEIPPGAKEFQYLTLNFT 235
Query: 279 FSKNQSFVVRNHIFSFPEN---------EGFTVFCLTVM-STDGDYGIIGQNFMMGHRIV 328
F + P + E V CL + STD D +IGQNFM G+RI
Sbjct: 236 MKGGDEFTPTDIFVFLPVDVSTMNIIFEETTHVACLAIAKSTDID--LIGQNFMTGYRIT 293
Query: 329 FDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSP 363
F+R+ + L WS S C + + V P+G +P
Sbjct: 294 FNRDQMVLGWSSSDCYD-------NGVGTPSGDTP 321
>gi|449434468|ref|XP_004135018.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 568
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 119/335 (35%), Positives = 173/335 (51%), Gaps = 29/335 (8%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
L+ Y P+ S++S V C+ LC + C S ++ CPY Y + +TSS GYLV+D+LHLA
Sbjct: 151 LNHYSPNDSTTSSTVPCTSSLC---NRCTSNQNVCPYEMRYLSANTSSIGYLVEDVLHLA 207
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
+ + V++ + GCG QTG + AAP+G++GLG+ +SVPS LA GL NSF
Sbjct: 208 T--DDSLLKPVEAKITFGCGTVQTGIFATTAAPNGLIGLGMEKISVPSFLADQGLTSNSF 265
Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV 221
S+CF + G + FGD GPA Q+ T F + E Y +Y V +G F A+
Sbjct: 266 SMCFGADGYGRIDFGDTGPADQKQTPFNTMLE-YQSYNVTFNVINVGGEP-NDVPFTAIF 323
Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCYNA-SSEEMLKVPDMRLI 278
DSG SFT+L Y+ + + D + KR SL G + ++YCY + + +
Sbjct: 324 DSGTSFTYLTEPAYSTITKQMDAGMKLKRYSLFGPNFPFEYCYEIPPGAKEFQYLTLNFT 383
Query: 279 FSKNQSFVVRNHIFSFPEN---------EGFTVFCLTVM-STDGDYGIIGQNFMMGHRIV 328
F + P + E V CL + STD D +IGQNFM G+RI
Sbjct: 384 MKGGDEFTPTDIFVFLPVDVSTMNIIFEETTHVACLAIAKSTDID--LIGQNFMTGYRIT 441
Query: 329 FDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSP 363
F+R+ + L WS S C + + V P+G +P
Sbjct: 442 FNRDQMVLGWSSSDCYD-------NGVGTPSGDTP 469
>gi|18409320|ref|NP_566948.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|27754243|gb|AAO22575.1| unknown protein [Arabidopsis thaliana]
gi|332645259|gb|AEE78780.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 529
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 117/315 (37%), Positives = 170/315 (53%), Gaps = 13/315 (4%)
Query: 38 QDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
Q R L+ Y P++SS+S ++ CS C S C S CPY Y ++DT ++G L +D+
Sbjct: 147 QSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPASSCPYQIQYLSKDTFTTGTLFEDV 206
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
LHL + + V++++ +GCG+ QTG AA +G++GLGL D SVPS+LAKA +
Sbjct: 207 LHLVT--EDEGLEPVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYSVPSILAKAKIT 264
Query: 158 QNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS 215
NSFS+CF + G + FGD+G Q T LP E Y V V +G +
Sbjct: 265 ANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPT-EPSPTYAVSVTEVSVGGDAV--- 320
Query: 216 GFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS-SEEMLK 271
G Q AL D+G SFT L Y + FD V+ KR + +++CY+ S ++ +
Sbjct: 321 GVQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTIL 380
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGDYGIIGQNFMMGHRIVFD 330
P + + F +RN +F + ++CL ++ S D IIGQNFM G+RIVFD
Sbjct: 381 FPRVAMTFEGGSQMFLRNPLFIVWNEDNSAMYCLGILKSVDFKINIIGQNFMSGYRIVFD 440
Query: 331 RENLKLAWSHSKCEE 345
RE + L W S C E
Sbjct: 441 RERMILGWKRSDCFE 455
>gi|297819836|ref|XP_002877801.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323639|gb|EFH54060.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 177 bits (449), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 107/317 (33%), Positives = 171/317 (53%), Gaps = 19/317 (5%)
Query: 38 QDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
Q L+ Y+PS S+SS V+C+ LC R+ C S CPY Y + + S+G LV+D+
Sbjct: 161 QRIRLNIYNPSISTSSSKVTCNSTLCALRNRCISPLSDCPYRIRYLSPGSKSTGVLVEDV 220
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
+H+++ A + + GC Q G + + A +G+MGL + D++VP++L KAG+
Sbjct: 221 IHMSTEEGEARDARIT----FGCSETQLGLFQE-VAVNGIMGLAMADIAVPNMLVKAGVA 275
Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF--VGVESYCIGNSCLTQS 215
+SFS+CF N G++ FGD+G + Q T P+G F V + + +G + ++
Sbjct: 276 SDSFSMCFGPNGKGTISFGDKGSSDQHET---PLGGTISPLFYDVSITKFKVGKVTV-ET 331
Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCY---NASSEEMLK 271
F A+ DSG + T+L Y + F V +R+ +S +++CY + S EE K
Sbjct: 332 KFSAIFDSGTAVTWLLDPYYTALTTNFHLSVPDRRLPANVDSTFEFCYIITSTSDEE--K 389
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTD-GDYGIIGQNFMMGHRIVF 329
+P + ++ V + I F ++G F V+CL V+ D D+ IIGQNFM +RIV
Sbjct: 390 LPSISFEMKGGAAYDVFSPILVFDTSDGSFQVYCLAVLKQDKADFNIIGQNFMTNYRIVH 449
Query: 330 DRENLKLAWSHSKCEEV 346
DRE + L W S C +
Sbjct: 450 DRERMILGWKKSNCNDT 466
>gi|242094534|ref|XP_002437757.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
gi|241915980|gb|EER89124.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
Length = 575
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 113/316 (35%), Positives = 161/316 (50%), Gaps = 18/316 (5%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSL---KDPCPYIADYSTEDTSSSGYLVDDILHLA 101
Y PS SS+SK V C HPLC+ +C + CPY Y + +T SSG LV+D+LHL
Sbjct: 162 YSPSLSSTSKTVPCGHPLCERPDACATAGKSSSSCPYEVKYVSANTGSSGVLVEDVLHLV 221
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNS 160
+VQ+ ++ GCG+ QTG++L GAA G+MGLGL VSVPS LA +GL+ +S
Sbjct: 222 DGGGGGGGKAVQAPIVFGCGQVQTGAFLRGAAAGGLMGLGLDKVSVPSALASSGLVASDS 281
Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQA 219
FS+CF + G + FGD G Q T + G +Y+ + V + + + + F A
Sbjct: 282 FSMCFSRDGVGRINFGDAGSPDQAETPLIAAGSLQPSYYNISVGAITVDSKAMAVE-FTA 340
Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVS--SKRISLQGNSWKYCYNASSEE--MLKVPDM 275
+VDSG SFT+L Y + F+ VS S+ +++CY S + M ++P M
Sbjct: 341 VVDSGTSFTYLDDPAYTFLTTNFNSRVSEASETYGSGYEKFEFCYRLSPGQTSMKRLPAM 400
Query: 276 RLIFSKNQSFVVRNHIFSF--PENEG---FTVFCLTVMST---DGDYGIIGQNFMMGHRI 327
L F + I N G +CL ++ T + IGQNFM G ++
Sbjct: 401 SLTTKGGAVFPITWPIIPVLASTNGGPYHPIGYCLGIIKTSILSTEDATIGQNFMTGLKV 460
Query: 328 VFDRENLKLAWSHSKC 343
VFDR L W C
Sbjct: 461 VFDRRKSVLGWEKFDC 476
>gi|297819834|ref|XP_002877800.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323638|gb|EFH54059.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 531
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 111/319 (34%), Positives = 172/319 (53%), Gaps = 14/319 (4%)
Query: 36 IVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+ Q L+ Y P++S++S ++ CS C C S K CPY YS T ++G L+
Sbjct: 145 VPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPKSICPYQISYSNS-TGTTGTLLQ 203
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
D+LHLA+ ++ + V+++V +GCG+KQTG + + +GV+GLG+ SVPSLLAKA
Sbjct: 204 DVLHLATEDENL--TPVKTNVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKAN 261
Query: 156 LIQNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
+ +SFS+CF + G + FGD+G Q+ T F+ + AY + V +G +
Sbjct: 262 ITADSFSMCFGRVIGNVGRISFGDKGYTDQEETPFISVAPS-TAYGLNVTGVSVGGDPVG 320
Query: 214 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEM-LK 271
F A D+G+SFT L Y + FD LV KR + +++CY+ S ++
Sbjct: 321 TRLF-AKFDTGSSFTHLMEPAYGVLTKSFDDLVEDKRRPVDPELPFEFCYDLSPNATSIE 379
Query: 272 VPDMRLIFSKNQSFVVRNHIFS----FPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHR 326
P + + F ++ N F+ EG ++CL V+ + G +IGQNF+ G+R
Sbjct: 380 FPFVEMTFVGGSKIILNNPFFTARTQARHGEGNVMYCLGVLKSVGLKINVIGQNFVAGYR 439
Query: 327 IVFDRENLKLAWSHSKCEE 345
IVFDRE + L W S C E
Sbjct: 440 IVFDRERMILGWKPSLCFE 458
>gi|222642011|gb|EEE70143.1| hypothetical protein OsJ_30189 [Oryza sativa Japonica Group]
Length = 671
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 104/285 (36%), Positives = 166/285 (58%), Gaps = 7/285 (2%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
Y P+ S++S+ V CS LC +++C+S + CPY Y +++TSSSG LV+D+L+L S S
Sbjct: 84 YSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDS 143
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
A V + ++ GCG+ QTGS+L AAP+G++GLG+ SVPSLLA GL NSFS+C
Sbjct: 144 --AQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMC 201
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSG 224
F ++ G + FGD G + Q+ T L + ++ Y + + +G+ ++ F A+VDSG
Sbjct: 202 FGDDGHGRINFGDTGSSDQKETP-LNVYKQNPYYNITITGITVGSKSISTE-FSAIVDSG 259
Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQ 283
SFT L +Y ++ FD + S R L + +++CY+ S+ ++ P++ L
Sbjct: 260 TSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSANGIVH-PNVSLTAKGGS 318
Query: 284 SFVVRNHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMGHRI 327
F V + I + +N V +CL +M ++G I G NF R+
Sbjct: 319 IFPVNDPIITITDNAFNPVGYCLAIMKSEGVNLIGGYNFDESSRL 363
>gi|297819828|ref|XP_002877797.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323635|gb|EFH54056.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 530
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 113/313 (36%), Positives = 167/313 (53%), Gaps = 9/313 (2%)
Query: 38 QDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
Q R L+ Y P++SS+S ++ C+ C S C S CPY Y ++DT ++G L +D+
Sbjct: 148 QSRPLNLYSPNTSSTSSSIRCNDDRCFGSSQCSSPASSCPYQIQYLSKDTFTTGTLFEDV 207
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
LHL + + V++++ +GCGR QTG AA +G++GLG+ D SVPS+LAKA +
Sbjct: 208 LHLVT--EDVDLKPVKANITLGCGRNQTGFLQSSAAINGLLGLGMKDYSVPSILAKAKIT 265
Query: 158 QNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS 215
NSFS+CF + G + FGD+G Q T LP E Y V V +G +
Sbjct: 266 ANSFSMCFGNIIDVIGRISFGDKGYTDQMETPLLPT-EPSPTYAVNVTEVSVGGDVVGVQ 324
Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS-SEEMLKVP 273
AL D+G SFT L Y + FD V+ KR + +++CY+ S + + P
Sbjct: 325 -LLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPEIPFEFCYDLSPNSTTILFP 383
Query: 274 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGDYGIIGQNFMMGHRIVFDRE 332
+ + F +RN +F + ++CL ++ S D IIGQNFM G+R+VFDRE
Sbjct: 384 RVAMTFEGGSLMFLRNPLFIVWNEDNTAMYCLGILKSVDFKINIIGQNFMSGYRVVFDRE 443
Query: 333 NLKLAWSHSKCEE 345
+ L W S C E
Sbjct: 444 RMILGWKRSDCFE 456
>gi|6562285|emb|CAB62655.1| putative protein [Arabidopsis thaliana]
Length = 519
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 114/313 (36%), Positives = 168/313 (53%), Gaps = 19/313 (6%)
Query: 38 QDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
Q R L+ Y P++SS+S ++ CS C S C S CPY Y ++DT ++G L +D+
Sbjct: 147 QSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPASSCPYQIQYLSKDTFTTGTLFEDV 206
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
LHL + + V++++ +GCG+ QTG AA +G++GLGL D SVPS+LAKA +
Sbjct: 207 LHLVT--EDEGLEPVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYSVPSILAKAKIT 264
Query: 158 QNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS 215
NSFS+CF + G + FGD+G Q T LP VG ++ +G L
Sbjct: 265 ANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPSVTEVSVGGDA--VGVQLL--- 319
Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS-SEEMLKVP 273
AL D+G SFT L Y + FD V+ KR + +++CY+ S ++ + P
Sbjct: 320 ---ALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTILFP 376
Query: 274 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGDYGIIGQNFMMGHRIVFDRE 332
+ + F +RN +F + ++CL ++ S D IIGQNFM G+RIVFDRE
Sbjct: 377 RVAMTFEGGSQMFLRNPLFI----DNSAMYCLGILKSVDFKINIIGQNFMSGYRIVFDRE 432
Query: 333 NLKLAWSHSKCEE 345
+ L W S C E
Sbjct: 433 RMILGWKRSDCFE 445
>gi|296084698|emb|CBI25840.3| unnamed protein product [Vitis vinifera]
Length = 306
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 97/278 (34%), Positives = 154/278 (55%), Gaps = 10/278 (3%)
Query: 120 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQG 179
CG+ QTGS+L+GAAP+G+ GLG+G +SVPS+LAK GL+ +SFS+CF + +G + FGD+G
Sbjct: 13 CGKVQTGSFLEGAAPNGLFGLGMGSISVPSILAKEGLVADSFSMCFGNDGTGRISFGDEG 72
Query: 180 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVV 239
+ Q+ T F P + Y + + +G + F A+ DSG SFT+L Y +
Sbjct: 73 SSGQEETPFNPSKSQL-LYNISITQISVGGTS-ADLNFDAIFDSGTSFTYLNDPAYTSIS 130
Query: 240 VKFDKLVSSKRISLQGN-SWKYCYNASSEE-MLKVPDMRLIFSKNQSFVVRNHIFSFPEN 297
F+ KR S + ++YCY+ S ++ ++ P + L +F V + I
Sbjct: 131 ESFNLRAKDKRSSSDSDLPFEYCYDISEQQTTVEYPIVNLTMKGGDNFFVTDPIVIVSIQ 190
Query: 298 EGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPP 357
G+ V+CL V+ + GD IIGQNFM G+RI+FDRE + L W+ S C + + + + + P
Sbjct: 191 GGY-VYCLGVVKS-GDINIIGQNFMTGYRIIFDREKMVLGWTKSNCYDTEESNTLPINPA 248
Query: 358 PAGQSPNPLPTTEQQSTSNGQAA----APPSTAKTAPS 391
+ P + + + NG + AP A +P+
Sbjct: 249 NSPVVPPTVSVEPEATAGNGNGSHISEAPSPLANGSPT 286
>gi|359496966|ref|XP_002269916.2| PREDICTED: aspartic proteinase-like protein 1-like, partial [Vitis
vinifera]
Length = 294
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 97/278 (34%), Positives = 155/278 (55%), Gaps = 10/278 (3%)
Query: 120 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQG 179
CG+ QTGS+L+GAAP+G+ GLG+G +SVPS+LAK GL+ +SFS+CF + +G + FGD+G
Sbjct: 1 CGKVQTGSFLEGAAPNGLFGLGMGSISVPSILAKEGLVADSFSMCFGNDGTGRISFGDEG 60
Query: 180 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVV 239
+ Q+ T F P + Y + + +G + + F A+ DSG SFT+L Y +
Sbjct: 61 SSGQEETPFNPSKSQL-LYNISITQISVGGTSADLN-FDAIFDSGTSFTYLNDPAYTSIS 118
Query: 240 VKFDKLVSSKRISLQGN-SWKYCYNASSEE-MLKVPDMRLIFSKNQSFVVRNHIFSFPEN 297
F+ KR S + ++YCY+ S ++ ++ P + L +F V + I
Sbjct: 119 ESFNLRAKDKRSSSDSDLPFEYCYDISEQQTTVEYPIVNLTMKGGDNFFVTDPIVIVSIQ 178
Query: 298 EGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPP 357
G+ V+CL V+ + GD IIGQNFM G+RI+FDRE + L W+ S C + + + + + P
Sbjct: 179 GGY-VYCLGVVKS-GDINIIGQNFMTGYRIIFDREKMVLGWTKSNCYDTEESNTLPINPA 236
Query: 358 PAGQSPNPLPTTEQQSTSNGQAA----APPSTAKTAPS 391
+ P + + + NG + AP A +P+
Sbjct: 237 NSPVVPPTVSVEPEATAGNGNGSHISEAPSPLANGSPT 274
>gi|42565828|ref|NP_190704.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645262|gb|AEE78783.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 488
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 111/349 (31%), Positives = 177/349 (50%), Gaps = 15/349 (4%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
L+ Y+PS S SS V+C+ LC R+ C S CPY Y + + S+G LV+D++H++
Sbjct: 137 LNIYNPSKSKSSSKVTCNSTLCALRNRCISPVSDCPYRIRYLSPGSKSTGVLVEDVIHMS 196
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
+ A + + GC Q G + + A +G+MGL + D++VP++L KAG+ +SF
Sbjct: 197 TEEGEARDARIT----FGCSESQLGLFKE-VAVNGIMGLAIADIAVPNMLVKAGVASDSF 251
Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF--VGVESYCIGNSCLTQSGFQA 219
S+CF N G++ FGD+G + Q T P+ F V + + +G + + F A
Sbjct: 252 SMCFGPNGKGTISFGDKGSSDQLET---PLSGTISPMFYDVSITKFKVGKVTV-DTEFTA 307
Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCY-NASSEEMLKVPDMRL 277
DSG + T+L Y + F V +R+S +S +++CY S+ + K+P +
Sbjct: 308 TFDSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKSVDSPFEFCYIITSTSDEDKLPSVSF 367
Query: 278 IFSKNQSFVVRNHIFSFPENEG-FTVFCLTVMS-TDGDYGIIGQNFMMGHRIVFDRENLK 335
++ V + I F ++G F V+CL V+ + D+ IIGQNFM +RIV DRE
Sbjct: 368 EMKGGAAYDVFSPILVFDTSDGSFQVYCLAVLKQVNADFSIIGQNFMTNYRIVHDRERRI 427
Query: 336 LAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPS 384
L W S C + + + P +P P T S+ AA S
Sbjct: 428 LGWKKSNCNDTNGFTGPTALAKPPSMAPTSSPRTINLSSRLNPLAAASS 476
>gi|186510920|ref|NP_190702.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645260|gb|AEE78781.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 530
Score = 164 bits (414), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 112/317 (35%), Positives = 166/317 (52%), Gaps = 20/317 (6%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
L+ Y P++S++S ++ CS C C S + CPY S+ +T ++G L+ D+LHL
Sbjct: 152 LNLYTPNASTTSSSIRCSDKRCFGSGKCSSPESICPYQIALSS-NTVTTGTLLQDVLHLV 210
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
+ + V ++V +GCG+ QTG++ A +GV+GL + + SVPSLLAKA + NSF
Sbjct: 211 T--EDEDLKPVNANVTLGCGQNQTGAFQTDIAVNGVLGLSMKEYSVPSLLAKANITANSF 268
Query: 162 SICFDENDS--GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQA 219
S+CF S G + FGD+G Q+ T + + E AY V V +G + F A
Sbjct: 269 SMCFGRIISVVGRISFGDKGYTDQEETPLVSL-ETSTAYGVNVTGVSVGGVPVDVPLF-A 326
Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLI 278
L D+G+SFT L Y FD L+ KR + + +++CY+ E + R +
Sbjct: 327 LFDTGSSFTLLLESAYGVFTKAFDDLMEDKRRPVDPDFPFEFCYDLREEHLNSDARPRHM 386
Query: 279 FSK-----NQSFVVR-----NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
SK F R S+ NEG ++CL ++ + + IIGQN M GHRIV
Sbjct: 387 QSKCYNPCRDDFRWRIQNDSQESVSY-SNEGTKMYCLGILKSI-NLNIIGQNLMSGHRIV 444
Query: 329 FDRENLKLAWSHSKCEE 345
FDRE + L W S C E
Sbjct: 445 FDRERMILGWKQSNCFE 461
>gi|6562286|emb|CAB62656.1| putative protein [Arabidopsis thaliana]
Length = 518
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 112/317 (35%), Positives = 166/317 (52%), Gaps = 20/317 (6%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
L+ Y P++S++S ++ CS C C S + CPY S+ +T ++G L+ D+LHL
Sbjct: 140 LNLYTPNASTTSSSIRCSDKRCFGSGKCSSPESICPYQIALSS-NTVTTGTLLQDVLHLV 198
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
+ + V ++V +GCG+ QTG++ A +GV+GL + + SVPSLLAKA + NSF
Sbjct: 199 T--EDEDLKPVNANVTLGCGQNQTGAFQTDIAVNGVLGLSMKEYSVPSLLAKANITANSF 256
Query: 162 SICFDENDS--GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQA 219
S+CF S G + FGD+G Q+ T + + E AY V V +G + F A
Sbjct: 257 SMCFGRIISVVGRISFGDKGYTDQEETPLVSL-ETSTAYGVNVTGVSVGGVPVDVPLF-A 314
Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLI 278
L D+G+SFT L Y FD L+ KR + + +++CY+ E + R +
Sbjct: 315 LFDTGSSFTLLLESAYGVFTKAFDDLMEDKRRPVDPDFPFEFCYDLREEHLNSDARPRHM 374
Query: 279 FSK-----NQSFVVR-----NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
SK F R S+ NEG ++CL ++ + + IIGQN M GHRIV
Sbjct: 375 QSKCYNPCRDDFRWRIQNDSQESVSY-SNEGTKMYCLGILKSI-NLNIIGQNLMSGHRIV 432
Query: 329 FDRENLKLAWSHSKCEE 345
FDRE + L W S C E
Sbjct: 433 FDRERMILGWKQSNCFE 449
>gi|3036792|emb|CAA18482.1| putative protein (fragment) [Arabidopsis thaliana]
Length = 335
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 88/244 (36%), Positives = 141/244 (57%), Gaps = 9/244 (3%)
Query: 28 CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 87
C GA+ + LS Y+P S+++K V+C++ LC R+ C CPY+ Y + T
Sbjct: 20 CAPTEGATYASEFELSIYNPKVSTTNKKVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQT 79
Query: 88 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV 147
S+SG L++D++HL + K+ + V++ V GCG+ Q+GS+LD AAP+G+ GLG+ +SV
Sbjct: 80 STSGILMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISV 137
Query: 148 PSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI 207
PS+LA+ GL+ +SFS+CF + G + FGD+G + Q+ T F + + Y + V +
Sbjct: 138 PSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPF-NLNPSHPNYNITVTRVRV 196
Query: 208 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASS 266
G + L F AL D+G SFT+L +Y V + KR S ++YCY+
Sbjct: 197 GTT-LIDDEFTALFDTGTSFTYLVDPMYTTV----SESAQDKRHSPDSRIPFEYCYDMRE 251
Query: 267 EEML 270
+ +L
Sbjct: 252 KLVL 255
>gi|3805854|emb|CAA21474.1| putative protein [Arabidopsis thaliana]
gi|7270540|emb|CAB81497.1| putative protein [Arabidopsis thaliana]
Length = 455
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 87/239 (36%), Positives = 140/239 (58%), Gaps = 9/239 (3%)
Query: 33 GASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGY 92
GA+ + LS Y+P S+++K V+C++ LC R+ C CPY+ Y + TS+SG
Sbjct: 145 GATYASEFELSIYNPKVSTTNKKVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQTSTSGI 204
Query: 93 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
L++D++HL + K+ + V++ V GCG+ Q+GS+LD AAP+G+ GLG+ +SVPS+LA
Sbjct: 205 LMEDVMHLTTEDKNPER--VEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPSVLA 262
Query: 153 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
+ GL+ +SFS+CF + G + FGD+G + Q+ T F + + Y + V +G + L
Sbjct: 263 REGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPF-NLNPSHPNYNITVTRVRVGTT-L 320
Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEML 270
F AL D+G SFT+L +Y V + KR S ++YCY+ + +L
Sbjct: 321 IDDEFTALFDTGTSFTYLVDPMYTTV----SESAQDKRHSPDSRIPFEYCYDMREKLVL 375
>gi|374255989|gb|AEZ00856.1| putative peptidase A1 protein, partial [Elaeis guineensis]
Length = 263
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 91/241 (37%), Positives = 133/241 (55%), Gaps = 6/241 (2%)
Query: 112 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG 171
V++ ++ GCG+ QTG++LD AAP+G+ GLG+ VSVPS+LA G NSFS+CF + G
Sbjct: 11 VKAPIVFGCGQVQTGAFLDSAAPNGLFGLGMDKVSVPSVLASKGYASNSFSMCFGSDGMG 70
Query: 172 SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLP 231
++FGD G + Q T F + + Y + + +GNS + + A+VDSG SFT L
Sbjct: 71 RIYFGDTGSSDQGETPF-DVNHSHPTYNISLIGMEVGNSSIDVNS-SAIVDSGTSFTCLA 128
Query: 232 TEIYAEVVVKFDKLVSSKR-ISLQGNSWKYCYNAS-SEEMLKVPDMRLIFSKNQSFVVRN 289
+Y ++ F V R S G ++YCY S ++ + +P + L F + +
Sbjct: 129 DPMYTKLSESFHAQVRENRHESDPGIPFEYCYGLSRNQNSILLPKINLTTKGGSQFPIND 188
Query: 290 HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDK 349
I +E + +CL ++ + IIGQNFM G RIVFDRE L L W S C E D
Sbjct: 189 PIIVI-SSEQSSFYCLGIVKSS-QLNIIGQNFMTGLRIVFDRERLVLGWKESDCYEAEDS 246
Query: 350 S 350
S
Sbjct: 247 S 247
>gi|297819832|ref|XP_002877799.1| hypothetical protein ARALYDRAFT_906483 [Arabidopsis lyrata subsp.
lyrata]
gi|297323637|gb|EFH54058.1| hypothetical protein ARALYDRAFT_906483 [Arabidopsis lyrata subsp.
lyrata]
Length = 414
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 94/296 (31%), Positives = 145/296 (48%), Gaps = 20/296 (6%)
Query: 65 SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ 124
S+ C S CPY Y TS+ G L +D+LHL + + V++++ +GCG+ Q
Sbjct: 123 SQGGCSSPASVCPYQIPYLFNTTSTRGTLFEDVLHLVT--EDEGLEPVKANITLGCGQNQ 180
Query: 125 TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE--NDSGSVFFGDQGPAT 182
TG Y A +G++GLG+ D SVPS+LAK + NSFS+CF + G + FGD+G
Sbjct: 181 TGLYRKSLAVNGLLGLGMKDYSVPSVLAKENITANSFSMCFGNIIDFIGRISFGDRGHTD 240
Query: 183 QQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF 242
Q T +PI E Y V V +G L + AL D+G SFT L Y + F
Sbjct: 241 QLQTPLVPI-EPNPTYAVNVTEVTVGGDIL-EIQMLALFDTGTSFTHLLEPAYGLLTKAF 298
Query: 243 DKLVSSKRISLQGN-SWKYCYNASSE-EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGF 300
D V+ KR + +++CY+ S + K P + + F +R+ +F+
Sbjct: 299 DDHVTDKRRPIDPEIPFEFCYDTSPNIKSFKFPRVNMTFVGGSKLTLRDPLFTVWNEARH 358
Query: 301 TVFCLTVMSTDGD------------YGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
+ ++ +D + ++ +N M G+RIVFDRE + L W S C+
Sbjct: 359 GAWMSSLTFSDREKKKKEYVLNAFHIWVVSENLMSGYRIVFDRERMILGWKRSDCK 414
>gi|414888271|tpg|DAA64285.1| TPA: hypothetical protein ZEAMMB73_923514, partial [Zea mays]
Length = 335
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 81/203 (39%), Positives = 123/203 (60%), Gaps = 4/203 (1%)
Query: 38 QDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
+D Y P SS+S+ V CS LC +S+C+S CPY Y +++TSS+G LV+D+
Sbjct: 130 RDLKFDTYSPQKSSTSRKVPCSSNLCDEQSACRSASSSCPYSIQYLSDNTSSTGVLVEDV 189
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL- 156
L+L + P+ V + + GCGR QTGS+L AAP+G++GLG+ +SVPSLLA G+
Sbjct: 190 LYLVTEYGRQPK-IVTAPITFGCGRTQTGSFLGTAAPNGLLGLGMDTISVPSLLASQGVA 248
Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
NSFS+CF ++ G + FGD G + QQ T L + ++ Y + + +G+ + +
Sbjct: 249 AANSFSMCFAQDGHGRINFGDTGSSDQQETP-LNMYKQNPYYNISITGATVGSKSI-HTK 306
Query: 217 FQALVDSGASFTFLPTEIYAEVV 239
F A+VDSG SFT L +Y ++
Sbjct: 307 FNAIVDSGTSFTALSDPMYTQIT 329
>gi|6580159|emb|CAB62657.2| putative protein [Arabidopsis thaliana]
Length = 475
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 93/317 (29%), Positives = 145/317 (45%), Gaps = 67/317 (21%)
Query: 36 IVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+ Q L+ Y P++S++S ++ CS C C S CPY YS T + G L+
Sbjct: 145 VPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPSSICPYQISYS-NSTGTKGTLLQ 203
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
D+LHLA+ ++ + V+++V +GCG+KQTG + + +GV+GLG+ SVPSLLAKA
Sbjct: 204 DVLHLATEDENL--TPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKAN 261
Query: 156 LIQNSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
+ NSFS+CF + G + FGD+G Q+ T F+ + +
Sbjct: 262 ITANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFISVAPR------------------- 302
Query: 214 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS-SEEMLKV 272
+ VD F F CY+ S + ++
Sbjct: 303 ----RRPVDPELPFEF-------------------------------CYDLSPNATTIQF 327
Query: 273 PDMRLIFSKNQSFVVRNHIFS----FPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
P + + F ++ N F+ EG ++CL V+ + G+ NF+ G+RIV
Sbjct: 328 PLVEMTFIGGSKIILNNPFFTARTQARHGEGNVMYCLGVLKS---VGLKINNFVAGYRIV 384
Query: 329 FDRENLKLAWSHSKCEE 345
FDRE + L W S C E
Sbjct: 385 FDRERMILGWKQSLCFE 401
>gi|115469998|ref|NP_001058598.1| Os06g0717900 [Oryza sativa Japonica Group]
gi|54291047|dbj|BAD61724.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
Group]
gi|113596638|dbj|BAF20512.1| Os06g0717900 [Oryza sativa Japonica Group]
Length = 307
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 84/263 (31%), Positives = 134/263 (50%), Gaps = 36/263 (13%)
Query: 137 VMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY 195
+MGLG+ VSVPS+LA G+++ NSFS+CF ++ G + FGD G A Q T F+ + +
Sbjct: 9 LMGLGMEKVSVPSILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFI-VKSTH 67
Query: 196 DAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 255
Y + + S +G+ L GF A+ DSG SFT+L Y F+ +S +R + G
Sbjct: 68 SYYNISITSMSVGDKNLPL-GFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSG 126
Query: 256 NS------WKYCYNASSEEM-LKVPDMRLIFSKNQSFVVRNHIFSFP---ENEGFTV--F 303
++ ++YCY+ S ++ +++P + L + F V + ++ N + +
Sbjct: 127 STRSGPFPFEYCYSLSPDQTTVELPVVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGY 186
Query: 304 CLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC---EEVID------------ 348
CL V+ +D IIGQNFM G ++VF+RE L W C E++ D
Sbjct: 187 CLAVIKSDLPIDIIGQNFMTGLKVVFNREKSVLGWQKFDCYKDEKMTDDGSSVGSPSPSP 246
Query: 349 --KSHVHLVP----PPAGQSPNP 365
+HV P PAG++P P
Sbjct: 247 GPTTHVFPQPQESDSPAGRTPIP 269
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 90/318 (28%), Positives = 152/318 (47%), Gaps = 22/318 (6%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
L+ YD +SS++K+VSCS C RS C S C Y+ Y + +S++GYLV D++
Sbjct: 128 LTPYDVDASSTAKSVSCSDNFCSYVNQRSECHS-GSTCQYVIMYG-DGSSTNGYLVKDVV 185
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLI 157
HL + + S ++I GCG KQ+G + AA DG+MG G + S S LA G +
Sbjct: 186 HLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKV 245
Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC--LTQS 215
+ SF+ C D N+ G +F G P+ K Y V + + +GNS L+ +
Sbjct: 246 KRSFAHCLDNNNGGGIF--AIGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLELSSN 303
Query: 216 GFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
F + ++DSG + +LP +Y ++ + L S ++L + ++++
Sbjct: 304 AFDSGDDKGVIIDSGTTLVYLPDAVYNPLLNEI--LASHPELTLHTVQESFTCFHYTDKL 361
Query: 270 LKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDG--DYGIIGQNFMMGH 325
+ P + F K+ S V R ++F E+ + + T G I+G +
Sbjct: 362 DRFPTVTFQFDKSVSLAVYPREYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNK 421
Query: 326 RIVFDRENLKLAWSHSKC 343
+V+D EN + W++ C
Sbjct: 422 LVVYDIENQVIGWTNHNC 439
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 90/318 (28%), Positives = 150/318 (47%), Gaps = 22/318 (6%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
L+ YD +SS++K+VSCS C RS C S C Y+ Y + +S++GYLV D++
Sbjct: 128 LTPYDADASSTAKSVSCSDNFCSYVNQRSECHS-GSTCQYVILYG-DGSSTNGYLVRDVV 185
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLI 157
HL + + S ++I GCG KQ+G + AA DG+MG G + S S LA G +
Sbjct: 186 HLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKV 245
Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-- 215
+ SF+ C D N+ G +F G P+ K Y V + + +GNS L S
Sbjct: 246 KRSFAHCLDNNNGGGIF--AIGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLQLSSD 303
Query: 216 GFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
F + ++DSG + +LP +Y ++ + L S + ++L + + +
Sbjct: 304 AFDSGDDKGVIIDSGTTLVYLPDAVYNPLMNQI--LASHQELNLHTVQDSFTCFHYIDRL 361
Query: 270 LKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDG--DYGIIGQNFMMGH 325
+ P + F K+ S V + ++F E+ + + T G I+G +
Sbjct: 362 DRFPTVTFQFDKSVSLAVYPQEYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNK 421
Query: 326 RIVFDRENLKLAWSHSKC 343
+V+D EN + W++ C
Sbjct: 422 LVVYDIENQVIGWTNHNC 439
>gi|6562288|emb|CAB62658.1| putative protein [Arabidopsis thaliana]
Length = 426
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 78/262 (29%), Positives = 136/262 (51%), Gaps = 17/262 (6%)
Query: 65 SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ 124
+++ C S CPY Y + + S+G LV+D++H+++ A + I G Q
Sbjct: 124 TKARCISPVSDCPYRIRYLSPGSKSTGVLVEDVIHMSTEEGEARDAR------ITFGESQ 177
Query: 125 TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQ 184
G + + A +G+MGL + D++VP++L KAG+ +SFS+CF N G++ FGD+G + Q
Sbjct: 178 LGLFKE-VAVNGIMGLAIADIAVPNMLVKAGVASDSFSMCFGPNGKGTISFGDKGSSDQL 236
Query: 185 STSFLPIGEKYDAYF--VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF 242
T P+ F V + + +G + + F A DSG + T+L Y + F
Sbjct: 237 ET---PLSGTISPMFYDVSITKFKVGKVTV-DTEFTATFDSGTAVTWLIEPYYTALTTNF 292
Query: 243 DKLVSSKRISLQGNS-WKYCY-NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEG- 299
V +R+S +S +++CY S+ + K+P + ++ V + I F ++G
Sbjct: 293 HLSVPDRRLSKSVDSPFEFCYIITSTSDEDKLPSVSFEMKGGAAYDVFSPILVFDTSDGS 352
Query: 300 FTVFCLTVMS-TDGDYGIIGQN 320
F V+CL V+ + D+ IIG+N
Sbjct: 353 FQVYCLAVLKQVNADFSIIGRN 374
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 85/328 (25%), Positives = 152/328 (46%), Gaps = 29/328 (8%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSL----KDPCPYIADYSTEDTSSSGYLVDDI 97
L+ YDP S +S+ VSC H C S + L ++PCPY Y + ++++GY V D
Sbjct: 113 LTLYDPKRSKTSEFVSCEHNFCSSTYEGRILGCKAENPCPYSISYG-DGSATTGYYVQDY 171
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAKAG 155
L + + ++ SS+I GCG Q+G++ + A DG++G G + SV S LA +G
Sbjct: 172 LTFNRVNGNPHTATQNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASG 231
Query: 156 LIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
++ FS C D N G +F G+ ++T +P Y+ +E + L
Sbjct: 232 KVKKIFSHCLDTNVGGGIFSIGEVVEPKVKTTPLVPNMAHYNVILKNIE---VDGDILQL 288
Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNAS 265
+++G ++DSG + +LP +Y +++ K L R+ + +Y C+ +
Sbjct: 289 PSDTFDSENGKGTVIDSGTTLAYLPRIVYDQLMSKV--LAKQPRLKVYLVEEQYSCFQYT 346
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL------TVMSTDGDYGIIGQ 319
P ++L F + S V H + F +G + +C+ + D ++G
Sbjct: 347 GNVDSGFPIVKLHFEDSLSLTVYPHDYLF-NYKGDSYWCIGWQKSASETKNGKDMTLLGD 405
Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVI 347
+ +V+D EN+ + W+ C I
Sbjct: 406 FVLSNKLVVYDLENMTIGWTDYNCSSSI 433
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 93/333 (27%), Positives = 151/333 (45%), Gaps = 36/333 (10%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKD-PCPYIADYSTEDTSSSGYLV 94
L+ YDP+SS +SK V C C S S CK KD CPY Y T+S Y+
Sbjct: 118 ELTLYDPNSSKTSKVVPCDDEFCTSTYDGPISGCK--KDMSCPYSITYGDGSTTSGSYIK 175
Query: 95 DDIL--HLASFSKHAPQSSVQSSVIIGCGRKQTG--SYLDGAAPDGVMGLGLGDVSVPSL 150
DD+ + + P ++ SVI GCG KQ+G S + DG++G G + SV S
Sbjct: 176 DDLTFDRVVGDLRTVPDNT---SVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQ 232
Query: 151 LAKAGLIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 209
LA AG ++ FS C D + G +F G+ ++T +P Y+ +E G+
Sbjct: 233 LAAAGKVKRVFSHCLDTVNGGGIFAIGEVVQPKVKTTPLVPRMAHYNVVLKDIE--VAGD 290
Query: 210 SCL-------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
+ SG ++DSG + +LP IY +++ K S + L + + C+
Sbjct: 291 PIQLPTDIFDSTSGRGTIIDSGTTLAYLPVSIYDQLLEKTLAQRSGMELYLVEDQFT-CF 349
Query: 263 NASSEEMLK--VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DY 314
+ S E+ L P ++ F + + H + FP E ++C+ T + DG D
Sbjct: 350 HYSDEKSLDDAFPTVKFTFEEGLTLTAYPHDYLFPFKE--DMWCIGWQKSTAQTKDGKDL 407
Query: 315 GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
++G + ++D +N+ + W+ C I
Sbjct: 408 ILLGDLVLTNKLFIYDLDNMSIGWTDYNCSSSI 440
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 88/332 (26%), Positives = 154/332 (46%), Gaps = 36/332 (10%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+L+ YDP S +S+ +SC C + CKS + PCPY Y + ++++GY V
Sbjct: 113 DLTLYDPKGSETSELISCDQEFCSATYDGPIPGCKS-EIPCPYSITYG-DGSATTGYYVQ 170
Query: 96 DIL---HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSL 150
D L H+ + APQ+S S+I GCG Q+G+ + A DG++G G + SV S
Sbjct: 171 DYLTYNHVNDNLRTAPQNS---SIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQ 227
Query: 151 LAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS 210
LA +G ++ FS C D G +F G + S P+ + Y V ++S +
Sbjct: 228 LAASGKVKKIFSHCLDNIRGGGIF--AIGEVVEPKVSTTPLVPRMAHYNVVLKSIEVDTD 285
Query: 211 CL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-C 261
L + +G ++DSG + +LP +Y E++ K + R+ L ++ C
Sbjct: 286 ILQLPSDIFDSGNGKGTIIDSGTTLAYLPAIVYDELIPKV--MARQPRLKLYLVEQQFSC 343
Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYG 315
+ + P ++L F + S V H + F +G ++C+ + +G D
Sbjct: 344 FQYTGNVDRGFPVVKLHFEDSLSLTVYPHDYLFQFKDG--IWCIGWQKSVAQTKNGKDMT 401
Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
++G + +++D EN+ + W+ C I
Sbjct: 402 LLGDLVLSNKLVIYDLENMAIGWTDYNCSSSI 433
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 101/368 (27%), Positives = 166/368 (45%), Gaps = 57/368 (15%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+ P SSS V C+ +C S K C Y Y+ E +SSSG L +DI+
Sbjct: 129 RFQPDLSSSYSPVKCN-----VDCTCDSDKKQCTYERQYA-EMSSSSGVLGEDIVSFGRE 182
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S+ PQ + I GC +TG A DG+MGLG G +S+ L + G+I +SFS+
Sbjct: 183 SELKPQHA-----IFGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSL 236
Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------Q 214
C+ D G V G P ++ P+ Y Y + ++ + L
Sbjct: 237 CYGGMDIGGGAMVLGGMLAPPDMIFSNSDPLRSPY--YNIELKEIHVAGKALRVESRIFN 294
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ-----GNSWK-YCYNASSEE 268
S ++DSG ++ +LP + + V F + V+SK SL+ S+K C+ +
Sbjct: 295 SKHGTVLDSGTTYAYLPEQAF----VAFKEAVTSKVHSLKKIRGPDPSYKDICFAGAGRN 350
Query: 269 MLKV----PDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GII 317
+ K+ PD+ ++F Q S N++F + +G +CL V D GII
Sbjct: 351 VSKLHEVFPDVDMVFGNGQKLSLTPENYLFRHSKVDG--AYCLGVFQNGKDPTTLLGGII 408
Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNG 377
+N + + +DR N K+ + + C E+ ++ H+ G +P+P P+++ S +
Sbjct: 409 VRNTL----VTYDRHNEKIGFWKTNCSELWERLHI-------GDTPSPAPSSDTSSEHDM 457
Query: 378 QAAAPPST 385
A PS
Sbjct: 458 SPAPAPSN 465
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 99/368 (26%), Positives = 164/368 (44%), Gaps = 57/368 (15%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+ P SSS V C+ +C S K C Y Y+ E +SSSG L +DI+
Sbjct: 130 RFQPDLSSSYSPVKCN-----VDCTCDSDKKQCTYERQYA-EMSSSSGVLGEDIVSFGRE 183
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S+ PQ +V GC +TG A DG+MGLG G +S+ L + G+I +SFS+
Sbjct: 184 SELKPQRAV-----FGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSL 237
Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------Q 214
C+ D G V G P+ + P+ Y Y + ++ + L
Sbjct: 238 CYGGMDIGGGAMVLGGVPAPSDMVFSHSDPLRSPY--YNIELKEIHVAGKALRVDSRVFN 295
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEE 268
S ++DSG ++ +LP + + V F V+SK SL+ N C+ +
Sbjct: 296 SKHGTVLDSGTTYAYLPEQAF----VAFKDAVTSKVHSLKKIRGPDPNYKDICFAGAGRN 351
Query: 269 MLKV----PDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GII 317
+ K+ PD+ ++F Q S N++F + +G +CL V D GII
Sbjct: 352 VSKLHEVFPDVDMVFGNGQKLSLTPENYLFRHSKVDG--AYCLGVFQNGKDPTTLLGGII 409
Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNG 377
+N + + +DR N K+ + + C E+ ++ H+ +P+P P+++ S ++
Sbjct: 410 VRNTL----VTYDRHNEKIGFWKTNCSELWERLHI-------SDAPSPAPSSDTNSETDM 458
Query: 378 QAAAPPST 385
A PS+
Sbjct: 459 SPAPAPSS 466
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 85/328 (25%), Positives = 148/328 (45%), Gaps = 28/328 (8%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+L+ YDP S +S VSC C + CKS + PCPY Y + ++++GY V
Sbjct: 113 DLTLYDPKGSETSDVVSCDQDFCSATFDGPIPGCKS-EIPCPYSITYG-DGSATTGYYVQ 170
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAK 153
D L + + S SS+I GCG Q+G+ + A DG++G G + SV S LA
Sbjct: 171 DYLTYNRINGNLRTSPQNSSIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAA 230
Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL- 212
+G ++ FS C D G +F G + S P+ + Y V ++S + L
Sbjct: 231 SGKVKKIFSHCLDNVRGGGIF--AIGEVVEPKVSTTPLVPRMAHYNVVLKSIEVDTDILQ 288
Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
+ +G ++DSG + +LP +Y E++ K ++ L ++ C+ +
Sbjct: 289 LPSDIFDSVNGKGTVIDSGTTLAYLPDIVYDELIQKVLARQPGLKLYLVEQQFR-CFLYT 347
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQ 319
P ++L F + S V H + F +G ++C+ + +G D ++G
Sbjct: 348 GNVDRGFPVVKLHFKDSLSLTVYPHDYLFQFKDG--IWCIGWQRSVAQTKNGKDMTLLGD 405
Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVI 347
+ +++D EN+ + W+ C I
Sbjct: 406 LVLSNKLVIYDLENMVIGWTDYNCSSSI 433
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 83/316 (26%), Positives = 137/316 (43%), Gaps = 18/316 (5%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
R L+ YDP SS SSK V C +C SR C ++ CPYI Y+ + + G L D+LH
Sbjct: 125 RKLTFYDPRSSVSSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLH 182
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQ 158
+ +SV GCG +Q+GS + A A DG++G G + + S LA AG +
Sbjct: 183 YHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTK 242
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL----- 212
FS C D + G +F G + PI + + Y V ++S + + L
Sbjct: 243 KIFSHCLDSTNGGGIF--AIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPAN 300
Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
T +DSG++ +LP IY+E+++ I++ C++
Sbjct: 301 IFGTTKTKGTFIDSGSTLVYLPEIIYSELILAV--FAKHPDITMGAMYNFQCFHFLGSVD 358
Query: 270 LKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
K P + F + + V +++ + N+ F + D I+G + +
Sbjct: 359 DKFPKITFHFENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVV 418
Query: 328 VFDRENLKLAWSHSKC 343
V+D E + W+ C
Sbjct: 419 VYDMEKQAIGWTEHNC 434
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 144/312 (46%), Gaps = 23/312 (7%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSR-SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+DP SS+ + + CS LC SC+ C Y +Y + +T G D + L +
Sbjct: 95 FDPRQSSTFREMDCSSQLCAELPGSCEPGSSTCSYSYEYGSGETE--GEFARDTISLGTT 152
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S + + S +GCG +G DG DG++GLG G VS+ S L+ A I + FS
Sbjct: 153 SDGSQKFP---SFAVGCGMVNSG--FDGV--DGLVGLGQGPVSLTSQLSAA--IDSKFSY 203
Query: 164 CF----DENDSGSVFFGDQGP---ATQQSTSFLPIGEKYDAYFV-GVESYCIGNSCLTQS 215
C +++S + FG QST P + Y Y++ V + +
Sbjct: 204 CLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSP 263
Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 275
G ++DSG + T++P+ +Y V+ + + +V+ R+ CY+ SS K P +
Sbjct: 264 G-TTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPAL 322
Query: 276 RLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDRENL 334
+ + ++ F ++ G TV CL + S G IIG G+ I++DR +
Sbjct: 323 TIRLAGATMTPPSSNYFLVVDDSGDTV-CLAMGSASGLPVSIIGNVMQQGYHILYDRGSS 381
Query: 335 KLAWSHSKCEEV 346
+L++ +KCE +
Sbjct: 382 ELSFVQAKCESL 393
>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
Length = 431
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 85/319 (26%), Positives = 140/319 (43%), Gaps = 19/319 (5%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
R L+ YDP SS SSK V C +C SR C ++ CPYI Y+ + + G L D+LH
Sbjct: 101 RKLTFYDPRSSVSSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLH 158
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQ 158
+ +SV GCG +Q+GS + A A DG++G G + + S LA AG +
Sbjct: 159 YHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTK 218
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL----- 212
FS C D + G +F G + PI + + Y V ++S + + L
Sbjct: 219 KIFSHCLDSTNGGGIF--AIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPAN 276
Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
T +DSG++ +LP IY+E+++ I++ C++
Sbjct: 277 IFGTTKTKGTFIDSGSTLVYLPEIIYSELILAV--FAKHPDITMGAMYNFQCFHFLGSVD 334
Query: 270 LKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
K P + F + + V +++ + N+ F + D I+G + +
Sbjct: 335 DKFPKITFHFENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVV 394
Query: 328 VFDRENLKLAWS-HSKCEE 345
V+D E + W+ H+ EE
Sbjct: 395 VYDMEKQAIGWTEHNSVEE 413
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 96/354 (27%), Positives = 156/354 (44%), Gaps = 36/354 (10%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ Y+ S S K VSC C S CK+ CPY+ Y + +S++GY V D
Sbjct: 124 LTLYNIDESDSGKLVSCDDDFCYQISGGPLSGCKA-NMSCPYLEIYG-DGSSTAGYFVKD 181
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA---APDGVMGLGLGDVSVPSLLAK 153
++ S + + SVI GCG +Q+G LD + A DG++G G + S+ S LA
Sbjct: 182 VVQYDSVAGDLKTQTANGSVIFGCGARQSGD-LDSSNEEALDGILGFGKANSSMISQLAS 240
Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
+G ++ F+ C D + G +F G Q + P+ Y V + + +G LT
Sbjct: 241 SGRVKKIFAHCLDGRNGGGIF--AIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLT 298
Query: 214 ------QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
Q G + A++DSG + +LP IY +V K + ++ + +K C+ S
Sbjct: 299 IPADLFQPGDRKGAIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYK-CFQYS 357
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTD-GDYGIIGQ 319
P++ F + V H + FP +EG ++C+ + S D + ++G
Sbjct: 358 GRVDEGFPNVTFHFENSVFLRVYPHDYLFP-HEG--MWCIGWQNSAMQSRDRRNMTLLGD 414
Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVID-----KSHVHLVPPPAGQSPNPLPT 368
+ +++D EN + W+ C I VHLV S PL T
Sbjct: 415 LVLSNKLVLYDLENQLIGWTEYNCSSSIKVKDEGTGTVHLVGSHFISSALPLDT 468
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 144/312 (46%), Gaps = 23/312 (7%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSR-SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+DP SS+ + + CS LC SC+ C Y +Y + +T G D + L +
Sbjct: 95 FDPRQSSTFREMDCSSQLCTELPGSCEPGSSACSYSYEYGSGETE--GEFARDTISLGTT 152
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S + + S +GCG +G DG DG++GLG G VS+ S L+ A I + FS
Sbjct: 153 SGGSQKFP---SFAVGCGMVNSG--FDGV--DGLVGLGQGPVSLTSQLSAA--IDSKFSY 203
Query: 164 CF----DENDSGSVFFGDQGP---ATQQSTSFLPIGEKYDAYFV-GVESYCIGNSCLTQS 215
C +++S + FG QST P + Y Y++ V + +
Sbjct: 204 CLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSP 263
Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 275
G ++DSG + T++P+ +Y V+ + + +V+ R+ CY+ SS K P +
Sbjct: 264 G-TTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPAL 322
Query: 276 RLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDRENL 334
+ + ++ F ++ G TV CL + S G IIG G+ I++DR +
Sbjct: 323 TIRLAGATMTPPSSNYFLVVDDSGDTV-CLAMGSAGGLPVSIIGNVMQQGYHILYDRGSS 381
Query: 335 KLAWSHSKCEEV 346
+L++ +KCE +
Sbjct: 382 ELSFVQAKCESL 393
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 94/337 (27%), Positives = 159/337 (47%), Gaps = 39/337 (11%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++ P SS+ + V C+ +C K+ C Y +Y+ E +SS G L +D++ +
Sbjct: 135 KFQPELSSTYQPVKCNM-----DCNCDDDKEQCVYEREYA-EHSSSKGVLGEDLISFGNE 188
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S+ PQ +V GC +TG A DG++GLG GD+S+ L GLI NSF +
Sbjct: 189 SQLTPQRAV-----FGCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGL 242
Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQ 214
C+ D G + G P+ T P Y Y + + + NS +
Sbjct: 243 CYGGMDVGGGSMILGGFDYPSDMIFTDSDPDRSPY--YNIDLTGIRVAGKKLSLNSRVFD 300
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWK---YCYNASSE--E 268
A++DSG ++ +LP +A + VS K+I ++K + AS++ E
Sbjct: 301 GEHGAVLDSGTTYAYLPDAAFAAFEEAVMREVSPLKQIDGPDPNFKDTCFLVAASNDVSE 360
Query: 269 MLKV-PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFM 322
+ K+ P + +IF QS+++ + F ++ +CL V D+ GI+ +N +
Sbjct: 361 LSKIFPSVEMIFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTL 420
Query: 323 MGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPA 359
+V+DREN K+ + + C E+ D+ H+ PPPA
Sbjct: 421 ----VVYDRENSKVGFWRTNCSELSDRLHIDGAPPPA 453
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 90/337 (26%), Positives = 158/337 (46%), Gaps = 39/337 (11%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++ P SS+ + V C+ +C ++ C Y +Y+ E +SS G L +D++ +
Sbjct: 134 KFQPEMSSTYQPVKCNM-----DCNCDDDREQCVYEREYA-EHSSSKGVLGEDLISFGNE 187
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S+ PQ +V GC +TG A DG++GLG GD+S+ L GLI NSF +
Sbjct: 188 SQLTPQRAV-----FGCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGL 241
Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------Q 214
C+ D G + G P+ T P Y Y + + + L+
Sbjct: 242 CYGGMDVGGGSMILGGFDYPSDMVFTDSDPDRSPY--YNIDLTGIRVAGKQLSLHSRVFD 299
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWK-YCYNASSE----E 268
A++DSG ++ +LP +A + VS+ K+I ++K C+ ++ E
Sbjct: 300 GEHGAVLDSGTTYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPNFKDTCFQVAASNYVSE 359
Query: 269 MLKV-PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFM 322
+ K+ P + ++F QS+++ + F ++ +CL V D+ GI+ +N +
Sbjct: 360 LSKIFPSVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTL 419
Query: 323 MGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPA 359
+V+DREN K+ + + C E+ D+ H+ PPPA
Sbjct: 420 ----VVYDRENSKVGFWRTNCSELSDRLHIDGAPPPA 452
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 90/328 (27%), Positives = 147/328 (44%), Gaps = 24/328 (7%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ +DP SS+++ VSCS +C S S+C + C Y+ Y + + +SGY V D
Sbjct: 127 LNFFDPGSSTTASLVSCSDQICALGVQSSDSACFGQSNQCAYVFQYG-DGSGTSGYYVMD 185
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 155
++HL + S+ +SV+ GC QTG A DG+ G G D+SV S L+ G
Sbjct: 186 MIHLDVVIDSSVTSNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRG 245
Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
+ FS C +DSG G + + + P+ Y + ++S + L
Sbjct: 246 IAPKVFSHCLKGDDSGGGIL-VLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQVLPIS 304
Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNAS 265
T S ++DSG + +L E Y VV +V S++ + L+GN CY S
Sbjct: 305 PAVFATSSSQGTIIDSGTTLAYLAEEAYNAFVVAVTNIVSQSTQSVVLKGNR---CYVTS 361
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE--GFTVFCLTVMSTDGD-YGIIGQNFM 322
S P + L F+ S V+ + +N G TV+C+ G I+G +
Sbjct: 362 SSVSDIFPQVSLNFAGGASLVLGAQDYLIQQNSVGGTTVWCIGFQKIPGQGITILGDLVL 421
Query: 323 MGHRIVFDRENLKLAWSHSKCEEVIDKS 350
++D N ++ W++ C ++ S
Sbjct: 422 KDKIFIYDLANQRIGWTNYDCSMSVNVS 449
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 88/335 (26%), Positives = 153/335 (45%), Gaps = 40/335 (11%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSC-KSLKDPCPYIADYSTEDTSSSGYLV 94
+L+ YDP+ S +SK V C C S S C K + CPY Y T+S Y+
Sbjct: 117 DLTLYDPNLSKTSKAVPCDDEFCTSTYDGQISGCTKGMS--CPYSITYGDGSTTSGSYIK 174
Query: 95 DDIL--HLASFSKHAPQSSVQSSVIIGCGRKQTG--SYLDGAAPDGVMGLGLGDVSVPSL 150
DD+ + + P ++ SVI GCG KQ+G S + DG++G G + SV S
Sbjct: 175 DDLTFDRVVGDLRTVPDNT---SVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQ 231
Query: 151 LAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS 210
LA AG ++ FS C D G +F G Q P+ + Y V ++ +
Sbjct: 232 LAAAGKVKRIFSHCLDSISGGGIFA--IGEVVQPKVKTTPLLQGMAHYNVVLKDIEVAGD 289
Query: 211 CL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
+ + SG ++DSG + +LP IY +++ K S ++ L + + C+
Sbjct: 290 PIQLPSDILDSSSGRGTIIDSGTTLAYLPVSIYDQLLEKILAQRSGMKLYLVEDQFT-CF 348
Query: 263 NASSEEMLK--VPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCL-----TVMSTDGD 313
+ S EE + P ++ F + + R+++F F E+ ++C+ + DG
Sbjct: 349 HYSDEESVDDLFPTVKFTFEEGLTLTTYPRDYLFLFKED----MWCVGWQKSMAQTKDGK 404
Query: 314 YGIIGQNFMMGHR-IVFDRENLKLAWSHSKCEEVI 347
I+ + ++ ++ +V+D +N+ + W+ C I
Sbjct: 405 ELILLGDLVLANKLVVYDLDNMAIGWADYNCSSSI 439
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 101 bits (251), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 82/312 (26%), Positives = 136/312 (43%), Gaps = 18/312 (5%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
R L+ YDP SS SSK V C +C SR C ++ CPYI Y+ + + G L D+LH
Sbjct: 125 RKLTFYDPRSSVSSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLH 182
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQ 158
+ +SV GCG +Q+GS + A A DG++G G + + S LA AG +
Sbjct: 183 YHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTK 242
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL----- 212
FS C D + G +F G + PI + + Y V ++S + + L
Sbjct: 243 KIFSHCLDSTNGGGIF--AIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPAN 300
Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
T +DSG++ +LP IY+E+++ I++ C++
Sbjct: 301 IFGTTKTKGTFIDSGSTLVYLPEIIYSELILAV--FAKHPDITMGAMYNFQCFHFLGSVD 358
Query: 270 LKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
K P + F + + V +++ + N+ F + D I+G + +
Sbjct: 359 DKFPKITFHFENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVV 418
Query: 328 VFDRENLKLAWS 339
V+D E + W+
Sbjct: 419 VYDMEKQAIGWT 430
>gi|351722911|ref|NP_001237772.1| uncharacterized protein LOC100500675 [Glycine max]
gi|255630909|gb|ACU15817.1| unknown [Glycine max]
Length = 244
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 65/234 (27%), Positives = 113/234 (48%), Gaps = 9/234 (3%)
Query: 163 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVD 222
+CF + +G + FGD G Q+ T F + + + Y + + + +S + F A+ D
Sbjct: 1 MCFGPDGAGRITFGDTGSPDQRKTPFN-VRKLHPTYNITITQIVVEDS-VADLEFHAIFD 58
Query: 223 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS----WKYCYNASSEEMLKVPDMRLI 278
SG SFT++ Y + ++ V + R S Q ++YCY+ S + ++VP + L
Sbjct: 59 SGTSFTYINDPAYTRLGEMYNSKVKANRHSSQSPDSNIPFEYCYDISINQTIEVPFLNLT 118
Query: 279 FSKNQSFVVRNHIFS-FPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
+ V + I F E EG + CL + +D IIGQNFM+G++IVFDR+N+ L
Sbjct: 119 MKGGDDYYVMDPIVQVFSEEEG-DLLCLGIQKSDS-VNIIGQNFMIGYKIVFDRDNMNLG 176
Query: 338 WSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPS 391
W + C + + + + P + +P +TSN P + + P+
Sbjct: 177 WKETNCSDDVLSNTSPINTPSPSPAVSPAIAVNPVATSNPSINPPNRSFRIKPT 230
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 83/321 (25%), Positives = 140/321 (43%), Gaps = 19/321 (5%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
R L+ YDP SS SSK V C +C SR C ++ CPYI Y+ + + G L D+LH
Sbjct: 101 RKLTFYDPRSSVSSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLH 158
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQ 158
+ +SV GCG +Q+GS + A A DG++G G + + S LA AG +
Sbjct: 159 YHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTK 218
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL----- 212
FS C D + G +F G + PI + + Y V ++S + + L
Sbjct: 219 KIFSHCLDSTNGGGIF--AIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPAN 276
Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
T +DSG++ +LP IY+E+++ I++ C++
Sbjct: 277 IFGTTKTKGTFIDSGSTLVYLPEIIYSELILAV--FAKHPDITMGAMYNFQCFHFLGSVD 334
Query: 270 LKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
K P + F + + V +++ + N+ F + D I+G + +
Sbjct: 335 DKFPKITFHFENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVV 394
Query: 328 VFDRENLKLAWS-HSKCEEVI 347
V+D E + W+ H+ ++
Sbjct: 395 VYDMEKQAIGWTEHNSMARIV 415
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 95/354 (26%), Positives = 154/354 (43%), Gaps = 36/354 (10%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ Y+ S S K VSC C S CK+ CPY+ Y + +S++GY V D
Sbjct: 124 LTLYNIDESDSGKLVSCDDDFCYQISGGPLSGCKA-NMSCPYLEIYG-DGSSTAGYFVKD 181
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA---APDGVMGLGLGDVSVPSLLAK 153
++ S + + SVI GCG +Q+G LD + A DG++G G + S+ S LA
Sbjct: 182 VVQYDSVAGDLKTQTANGSVIFGCGARQSGD-LDSSNEEALDGILGFGKANSSMISQLAS 240
Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
+G ++ F+ C D + G +F G Q + P+ Y V + + +G L
Sbjct: 241 SGRVKKIFAHCLDGRNGGGIF--AIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLN 298
Query: 214 ------QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
Q G + A++DSG + +LP IY +V K + ++ + +K C+ S
Sbjct: 299 IPADLFQPGDRKGAIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYK-CFQYS 357
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTD-GDYGIIGQ 319
P++ F + V H + FP EG ++C+ + S D + ++G
Sbjct: 358 GRVDEGFPNVTFHFENSVFLRVYPHDYLFPY-EG--MWCIGWQNSAMQSRDRRNMTLLGD 414
Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVID-----KSHVHLVPPPAGQSPNPLPT 368
+ +++D EN + W+ C I VHLV S PL T
Sbjct: 415 LVLSNKLVLYDLENQLIGWTEYNCSSSIKVKDEGTGTVHLVGSHFISSALPLDT 468
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 84/320 (26%), Positives = 143/320 (44%), Gaps = 18/320 (5%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
LS YD +SS+SKNV C C +S K PC Y Y + ++S G V D +
Sbjct: 121 LSLYDSKASSTSKNVGCEDAFCSFIMQSETCGAKKPCSYHVVYG-DGSTSDGDFVKDNIT 179
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
L + + + + V+ GCG+ Q+G +A DG+MG G + SV S LA G ++
Sbjct: 180 LDQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVK 239
Query: 159 NSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGV----ESYCIGNSCLT 213
FS C D + G +F G+ ++T +P Y+ G+ E + S +
Sbjct: 240 RIFSHCLDNMNGGGIFAIGEVESPVVKTTPLVPNQVHYNVILKGMDVDGEPIDLPPSLAS 299
Query: 214 QSG-FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLK 271
+G ++DSG + +LP +Y ++ +K+ + +++ L + C++ +S
Sbjct: 300 TNGDGGTIIDSGTTLAYLPQNLYNSLI---EKITAKQQVKLHMVQETFACFSFTSNTDKA 356
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT---VMSTDG-DYGIIGQNFMMGHRI 327
P + L F + V H + F E F + + DG D ++G + +
Sbjct: 357 FPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLV 416
Query: 328 VFDRENLKLAWSHSKCEEVI 347
V+D EN + W+ C I
Sbjct: 417 VYDLENEVIGWADHNCSSSI 436
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 85/337 (25%), Positives = 162/337 (48%), Gaps = 42/337 (12%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++ P SSS+ + V C+ +C S + C Y Y+ E ++SSG L +D++ +
Sbjct: 125 KFQPESSSTYQPVKCT-----IDCNCDSDRMQCVYERQYA-EMSTSSGVLGEDLISFGNQ 178
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S+ APQ +V GC +TG A DG+MGLG GD+S+ L +I +SFS+
Sbjct: 179 SELAPQRAV-----FGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKNVISDSFSL 232
Query: 164 CFDEND--SGSVFFGDQGPATQQSTSFL-PIGEKYDAYFVGVESYCIG------NSCLTQ 214
C+ D G++ G P + + ++ P+ Y Y + ++ + N+ +
Sbjct: 233 CYGGMDVGGGAMVLGGISPPSDMAFAYSDPVRSPY--YNIDLKEIHVAGKRLPLNANVFD 290
Query: 215 SGFQALVDSGASFTFLPTE---IYAEVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEML 270
++DSG ++ +LP + + +VK +L S K+IS ++ C++ + ++
Sbjct: 291 GKHGTVLDSGTTYAYLPEAAFLAFKDAIVK--ELQSLKKISGPDPNYNDICFSGAGIDVS 348
Query: 271 KV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNF 321
++ P + ++F Q + + + F ++ +CL V D GII +N
Sbjct: 349 QLSKSFPVVDMVFENGQKYTLSPENYMFRHSKVRGAYCLGVFQNGNDQTTLLGGIIVRNT 408
Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPP 358
+ +V+DRE K+ + + C E+ ++ + + PPP
Sbjct: 409 L----VVYDREQTKIGFWKTNCAELWERLQISVAPPP 441
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 95/345 (27%), Positives = 153/345 (44%), Gaps = 40/345 (11%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKS-LKDPCPYIADYSTEDTSSSGYLV 94
+L+ Y+ SSS K V C LCK + C S D CPY+ Y + +S++GY V
Sbjct: 116 DLTLYNIKESSSGKLVPCDQELCKEINGGLLTGCTSKTNDSCPYLEIYG-DGSSTAGYFV 174
Query: 95 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTG--SYLDGAAPDGVMGLGLGDVSVPSLLA 152
D++ S +S SVI GCG +Q+G SY + A DG++G G + S+ S L+
Sbjct: 175 KDVVLFDQVSGDLKTASANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLS 234
Query: 153 KAGLIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC 211
+G ++ F+ C + + G +F G T +T LP Y ++ +G++
Sbjct: 235 SSGKVKKMFAHCLNGVNGGGIFAIGHVVQPTVNTTPLLPDQPHYSVNMTAIQ---VGHTF 291
Query: 212 L---TQSGFQ-----ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CY 262
L T + Q ++DSG + +LP IY +V K L + +Q +Y C+
Sbjct: 292 LNLSTDASEQRDSKGTIIDSGTTLAYLPDGIYQPLVYKI--LSQQPNLKVQTLHDEYTCF 349
Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF-PENEGFTVFCL-----TVMSTDGDYGI 316
S P++ F S V H + F EN ++C+ S D
Sbjct: 350 QYSGSVDDGFPNVTFYFENGLSLKVYPHDYLFLSEN----LWCIGWQNSGAQSRDSKNMT 405
Query: 317 IGQNFMMGHRIVF-DRENLKLAWSHSKCEEVI-----DKSHVHLV 355
+ + ++ +++VF D EN + W+ C I VHLV
Sbjct: 406 LLGDLVLSNKLVFYDLENQVIGWTEYNCSSSIKVRDEKTGTVHLV 450
>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
Length = 746
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 94/326 (28%), Positives = 149/326 (45%), Gaps = 36/326 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRS-SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+DP +SS++ +SC+ P C S C C Y Y+ E +SSSG L++D+L L
Sbjct: 122 FDPEASSTASRISCTSPKCSCGSPRCGCSTQQCTYTRSYA-EQSSSSGILLEDVLALHDG 180
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
AP +I GC ++TG A DG+ GLG D SV + L KAG+I + FS+
Sbjct: 181 LPGAP-------IIFGCETRETGEIFRQRA-DGLFGLGNSDASVVNQLVKAGVIDDVFSL 232
Query: 164 CFD--ENDSGSVFFGDQ---GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS--- 215
CF E D G++ GD G + Q T L Y V + S + L S
Sbjct: 233 CFGMVEGD-GALLLGDAEVPGSISLQYTPLLTSTTHPFYYNVKMLSLAVEGQLLPVSQSL 291
Query: 216 ---GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS---KRISLQGNSW-KYCY-NASSE 267
G+ ++DSG +FT++P+ ++ +K S KR+ + C+ A S
Sbjct: 292 FDQGYGTVLDSGTTFTYMPSPVFKAFAGAVEKYALSHGLKRVPGPDPQFDDICFGQAPSH 351
Query: 268 EMLKV-----PDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQN 320
+ L+ P M + F + S V+ N++F N G +CL V ++G
Sbjct: 352 DDLEALSSVFPSMEVQFDQGTSLVLGPLNYLFVHTFNSG--KYCLGVFDNGRAGTLLGGI 409
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEV 346
+ +DR N ++ + + C+E+
Sbjct: 410 TFRNVLVRYDRANQRVGFGPALCKEL 435
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 93/342 (27%), Positives = 150/342 (43%), Gaps = 50/342 (14%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+ P SS+ V CS + +C S K C Y Y+ E +SSSG L +DI+ +
Sbjct: 126 RFQPDLSSTYSPVKCS-----ADCTCDSDKSQCTYERQYA-EMSSSSGVLGEDIVSFGTE 179
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S+ PQ +V GC +TG A DG+MGLG G +S+ L G+I +SFS+
Sbjct: 180 SELKPQRAV-----FGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVDKGVIGDSFSM 233
Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------Q 214
C+ D G V P + P+ Y Y + ++ + L
Sbjct: 234 CYGGMDIGGGAMVLGAMPAPPDMVFSRSDPVRSPY--YNIELKEIHVAGKALRLDPRIFD 291
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEE 268
S ++DSG ++ +LP + + V F V+SK L+ N C+ +
Sbjct: 292 SKHGTVLDSGTTYAYLPEQAF----VAFKDAVTSKVRPLKKIRGPDPNYKDICFAGAGRN 347
Query: 269 MLKV----PDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGD-----YGII 317
+ ++ PD+ ++F Q S N++F + EG +CL V D GI+
Sbjct: 348 VSQLSQAFPDVDMVFGDGQKLSLSPENYLFRHSKVEG--AYCLGVFQNGKDPTTLLGGIV 405
Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPA 359
+N + + +DR N K+ + + C E+ ++ HV P PA
Sbjct: 406 VRNTL----VTYDRHNEKIGFWKTNCSELWERLHVSGAPSPA 443
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 91/326 (27%), Positives = 134/326 (41%), Gaps = 26/326 (7%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSSCK----SLKDPCPYIADYSTEDTSSSGYLVDD 96
+L YDP SSS VSC C + K + PC Y Y + +S++GY V D
Sbjct: 126 DLRLYDPKGSSSGSTVSCDQKFCAATYGGKLPGCAKNIPCEYSVMYG-DGSSTTGYFVSD 184
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAG 155
L S +SVI GCG +Q G A DG++G G + S+ S LA AG
Sbjct: 185 SLQYNQVSGDGQTRHANASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAG 244
Query: 156 LIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
++ FS C D G +F GD +ST +P Y+ V +ES +G + L
Sbjct: 245 EVKKIFSHCLDTIKGGGIFAIGDVVQPKVKSTPLVPDMPHYN---VNLESINVGGTTLQL 301
Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNAS 265
T ++DSG + T+LP +Y +V+ F K + S+Q C
Sbjct: 302 PSHMFETGEKKGTIIDSGTTLTYLPELVYKDVLAAVFAKHPDTTFHSVQD---FLCIQYF 358
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT---VMSTDG-DYGIIGQNF 321
P + F + V H + F + F + S DG D ++G
Sbjct: 359 QSVDDGFPKITFHFEDDLGLNVYPHDYFFQNGDNLYCFGFQNGGLQSKDGKDMVLLGDLV 418
Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVI 347
+ +V+D EN + W+ C I
Sbjct: 419 LSNKVVVYDLENQVVGWTDYNCSSSI 444
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 85/320 (26%), Positives = 143/320 (44%), Gaps = 45/320 (14%)
Query: 63 CKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR 122
C +C + + C Y Y+ E +SSSG L +DI+ S+ PQ +V GC
Sbjct: 156 CNVDCTCDNERSQCTYERQYA-EMSSSSGVLGEDIMSFGKESELKPQRAV-----FGCEN 209
Query: 123 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS---VFFGDQG 179
+TG A DG+MGLG G +S+ L + G+I +SFS+C+ D G V G
Sbjct: 210 TETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVLGGMPA 268
Query: 180 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGASFTFLPTE 233
P + P+ Y Y + ++ + L S ++DSG ++ +LP +
Sbjct: 269 PPDMVFSHSNPVRSPY--YNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAYLPEQ 326
Query: 234 IYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV----PDMRLIFSKNQ 283
+ V F V++K SL+ N C+ + + ++ PD+ ++F Q
Sbjct: 327 AF----VAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQ 382
Query: 284 --SFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMMGHRIVFDRENLKL 336
S N++F + EG +CL V D GI+ +N + + +DR N K+
Sbjct: 383 KLSLSPENYLFRHSKVEG--AYCLGVFQNGKDPTTLLGGIVVRNTL----VTYDRHNEKI 436
Query: 337 AWSHSKCEEVIDKSHVHLVP 356
+ + C E+ ++ H+ VP
Sbjct: 437 GFWKTNCSELWERLHISEVP 456
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 85/320 (26%), Positives = 143/320 (44%), Gaps = 45/320 (14%)
Query: 63 CKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR 122
C +C + + C Y Y+ E +SSSG L +DI+ S+ PQ +V GC
Sbjct: 157 CNVDCTCDNERSQCTYERQYA-EMSSSSGVLGEDIMSFGKESELKPQRAV-----FGCEN 210
Query: 123 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS---VFFGDQG 179
+TG A DG+MGLG G +S+ L + G+I +SFS+C+ D G V G
Sbjct: 211 TETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVLGGMPA 269
Query: 180 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGASFTFLPTE 233
P + P+ Y Y + ++ + L S ++DSG ++ +LP +
Sbjct: 270 PPDMVFSHSNPVRSPY--YNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAYLPEQ 327
Query: 234 IYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV----PDMRLIFSKNQ 283
+ V F V++K SL+ N C+ + + ++ PD+ ++F Q
Sbjct: 328 AF----VAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQ 383
Query: 284 --SFVVRNHIFSFPENEGFTVFCLTVMSTDGD-----YGIIGQNFMMGHRIVFDRENLKL 336
S N++F + EG +CL V D GI+ +N + + +DR N K+
Sbjct: 384 KLSLSPENYLFRHSKVEG--AYCLGVFQNGKDPTTLLGGIVVRNTL----VTYDRHNEKI 437
Query: 337 AWSHSKCEEVIDKSHVHLVP 356
+ + C E+ ++ H+ VP
Sbjct: 438 GFWKTNCSELWERLHISEVP 457
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 85/320 (26%), Positives = 143/320 (44%), Gaps = 45/320 (14%)
Query: 63 CKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR 122
C +C + + C Y Y+ E +SSSG L +DI+ S+ PQ +V GC
Sbjct: 146 CNVDCTCDNERSQCTYERQYA-EMSSSSGVLGEDIMSFGKESELKPQRAV-----FGCEN 199
Query: 123 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS---VFFGDQG 179
+TG A DG+MGLG G +S+ L + G+I +SFS+C+ D G V G
Sbjct: 200 TETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVLGGMPA 258
Query: 180 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGASFTFLPTE 233
P + P+ Y Y + ++ + L S ++DSG ++ +LP +
Sbjct: 259 PPDMVFSHSNPVRSPY--YNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAYLPEQ 316
Query: 234 IYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV----PDMRLIFSKNQ 283
+ V F V++K SL+ N C+ + + ++ PD+ ++F Q
Sbjct: 317 AF----VAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQ 372
Query: 284 --SFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMMGHRIVFDRENLKL 336
S N++F + EG +CL V D GI+ +N + + +DR N K+
Sbjct: 373 KLSLSPENYLFRHSKVEG--AYCLGVFQNGKDPTTLLGGIVVRNTL----VTYDRHNEKI 426
Query: 337 AWSHSKCEEVIDKSHVHLVP 356
+ + C E+ ++ H+ VP
Sbjct: 427 GFWKTNCSELWERLHISEVP 446
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 86/329 (26%), Positives = 140/329 (42%), Gaps = 30/329 (9%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLV 94
+L+ YDP++S+SSK V+C C + + SC + PC Y Y + +S++G+ V
Sbjct: 132 DLTLYDPTASASSKTVTCGQEFCATATNGGVPPSCAA-NSPCQYSITYG-DGSSTTGFFV 189
Query: 95 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAK 153
D L S + +SV GCG K G+ A DG++G G + S+ S L
Sbjct: 190 ADFLQYDQVSGDGQTNLANASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTS 249
Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
AG + FS C D + G +F G Q P+ Y V +++ +G S L
Sbjct: 250 AGKVTKIFSHCLDTVNGGGIF--AIGNVVQPKVKTTPLVPGMPHYNVVLKTIDVGGSTLQ 307
Query: 214 ---------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
++DSG + +LP +Y V+ + ++L+ C+
Sbjct: 308 LPTNIFDIGGGSRGTIIDSGTTLAYLPEVVYKAVLSAV--FSNHPDVTLKNVQDFLCFQY 365
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIG 318
S P++ F + VV H + F E V+C+ V S DG D ++G
Sbjct: 366 SGSVDNGFPEVTFHFDGDLPLVVYPHDYLFQNTE--DVYCVGFQSGGVQSKDGKDMVLLG 423
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
+ +V+D EN + W++ C I
Sbjct: 424 DLALSNKLVVYDLENQVIGWTNYNCSSSI 452
>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
Length = 388
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 67/211 (31%), Positives = 102/211 (48%), Gaps = 14/211 (6%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
R L+ YDP SS SSK V C +C SR C ++ CPYI Y+ + + G L D+LH
Sbjct: 125 RKLTFYDPRSSVSSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLH 182
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQ 158
+ +SV GCG +Q+GS + A A DG++G G + + S LA AG +
Sbjct: 183 YHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTK 242
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL----- 212
FS C D + G +F G + PI + + Y V ++S + + L
Sbjct: 243 KIFSHCLDSTNGGGIF--AIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPAN 300
Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVV 240
T +DSG++ +LP IY+E+++
Sbjct: 301 IFGTTKTKGTFIDSGSTLVYLPEIIYSELIL 331
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 83/326 (25%), Positives = 137/326 (42%), Gaps = 28/326 (8%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ YDP SS+ VSC C + C + PC Y Y + +S++GY V D
Sbjct: 133 LTLYDPKDSSTGSKVSCDQGFCAATYGGLLPGCTT-SLPCEYSVTYG-DGSSTTGYFVSD 190
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
+L S S+V GCG +Q G A DG++G G + S+ S L+ AG
Sbjct: 191 LLQFDQVSGDGQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAG 250
Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
++ F+ C D + G +F G Q P+ Y V ++S +G + L
Sbjct: 251 KVKKIFAHCLDTINGGGIF--AIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLP 308
Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
T ++DSG + T+LP +Y E+++ K I+ C+
Sbjct: 309 SHMFDTGEKKGTIIDSGTTLTYLPEIVYKEIMLAV--FAKHKDITFHNVQEFLCFQYVGR 366
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDGDYGIIGQNFM 322
P + F + V H + F EN G ++C+ + S DG ++ + +
Sbjct: 367 VDDDFPKITFHFENDLPLNVYPHDYFF-EN-GDNLYCVGFQNGGLQSKDGKGMVLLGDLV 424
Query: 323 MGHR-IVFDRENLKLAWSHSKCEEVI 347
+ ++ +V+D EN + W+ C I
Sbjct: 425 LSNKLVVYDLENQVIGWTEYNCSSSI 450
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 90/335 (26%), Positives = 151/335 (45%), Gaps = 43/335 (12%)
Query: 38 QDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
QD N + P SS+ + + CS +C S C Y Y+ E +SSSG L +DI
Sbjct: 130 QDPN---FQPDWSSTYQPLKCS-----MECTCDSEMMHCVYDRQYA-EMSSSSGVLGEDI 180
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
+ S+ PQ +V GC +TG A DG+MGLG GD+S+ L + G+I
Sbjct: 181 VSFGKQSELKPQRTV-----FGCENVETGDIYSQRA-DGIMGLGRGDLSIVDQLVEKGVI 234
Query: 158 QNSFSICFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------ 208
NSFS+C+ D G V G PA T P Y Y + ++ I
Sbjct: 235 GNSFSLCYGGMDVGGGAMVLGGISPPAGMVFTHSDPARSAY--YNIDLKEIHIAGKQLPI 292
Query: 209 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNAS 265
N + + ++DSG ++ +LP + K ++S ++ +QG Y C++
Sbjct: 293 NPMVFDGKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSLKL-IQGPDRNYNDICFSGV 351
Query: 266 SEEMLKV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GI 316
++ ++ P + L+FS + + F ++ +CL + + D GI
Sbjct: 352 GSDVSQLSKTFPAVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGI 411
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSH 351
I +N + +++DRE+LK+ + + C E+ + H
Sbjct: 412 IVRNTL----VMYDREHLKIGFWKTNCSEIWEILH 442
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 90/335 (26%), Positives = 151/335 (45%), Gaps = 43/335 (12%)
Query: 38 QDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
QD N + P SS+ + + CS +C S C Y Y+ E +SSSG L +DI
Sbjct: 130 QDPN---FQPDWSSTYQPLKCS-----MECTCDSEMMHCVYDRQYA-EMSSSSGVLGEDI 180
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
+ S+ PQ +V GC +TG A DG+MGLG GD+S+ L + G+I
Sbjct: 181 VSFGKQSELKPQRTV-----FGCENVETGDIYSQRA-DGIMGLGRGDLSIVDQLVEKGVI 234
Query: 158 QNSFSICFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------ 208
NSFS+C+ D G V G PA T P Y Y + ++ I
Sbjct: 235 GNSFSLCYGGMDVGGGAMVLGGISPPAGMVFTHSDPARSAY--YNIDLKEIHIAGKQLPI 292
Query: 209 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNAS 265
N + + ++DSG ++ +LP + K ++S ++ +QG Y C++
Sbjct: 293 NPMVFDGKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSLKL-IQGPDRNYNDICFSGV 351
Query: 266 SEEMLKV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GI 316
++ ++ P + L+FS + + F ++ +CL + + D GI
Sbjct: 352 GSDVSQLSKTFPAVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGI 411
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSH 351
I +N + +++DRE+LK+ + + C E+ + H
Sbjct: 412 IVRNTL----VMYDREHLKIGFWKTNCSEIWEILH 442
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 82/322 (25%), Positives = 139/322 (43%), Gaps = 22/322 (6%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
LS YD +SS+SKNV C C +S K PC Y Y TS ++ D+I
Sbjct: 118 LSLYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNIT- 176
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
L + + + + V+ GCG+ Q+G +A DG+MG G + S+ S LA G +
Sbjct: 177 LEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTK 236
Query: 159 NSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS------- 210
FS C D + G +F G+ ++T +P Y+ G++ G+
Sbjct: 237 RIFSHCLDNMNGGGIFAVGEVESPVVKTTPIVPNQVHYNVILKGMD--VDGDPIDLPPSL 294
Query: 211 CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEM 269
T ++DSG + +LP +Y ++ +K+ + +++ L + C++ +S
Sbjct: 295 ASTNGDGGTIIDSGTTLAYLPQNLYNSLI---EKITAKQQVKLHMVQETFACFSFTSNTD 351
Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT---VMSTDG-DYGIIGQNFMMGH 325
P + L F + V H + F E F + + DG D ++G +
Sbjct: 352 KAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNK 411
Query: 326 RIVFDRENLKLAWSHSKCEEVI 347
+V+D EN + W+ C I
Sbjct: 412 LVVYDLENEVIGWADHNCSSSI 433
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 82/322 (25%), Positives = 139/322 (43%), Gaps = 22/322 (6%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
LS YD +SS+SKNV C C +S K PC Y Y TS ++ D+I
Sbjct: 122 LSLYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNIT- 180
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
L + + + + V+ GCG+ Q+G +A DG+MG G + S+ S LA G +
Sbjct: 181 LEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTK 240
Query: 159 NSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS------- 210
FS C D + G +F G+ ++T +P Y+ G++ G+
Sbjct: 241 RIFSHCLDNMNGGGIFAVGEVESPVVKTTPIVPNQVHYNVILKGMD--VDGDPIDLPPSL 298
Query: 211 CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEM 269
T ++DSG + +LP +Y ++ +K+ + +++ L + C++ +S
Sbjct: 299 ASTNGDGGTIIDSGTTLAYLPQNLYNSLI---EKITAKQQVKLHMVQETFACFSFTSNTD 355
Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT---VMSTDG-DYGIIGQNFMMGH 325
P + L F + V H + F E F + + DG D ++G +
Sbjct: 356 KAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNK 415
Query: 326 RIVFDRENLKLAWSHSKCEEVI 347
+V+D EN + W+ C I
Sbjct: 416 LVVYDLENEVIGWADHNCSSSI 437
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 83/326 (25%), Positives = 137/326 (42%), Gaps = 28/326 (8%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ YDP SS+ VSC C + C + PC Y Y + +S++GY V D
Sbjct: 48 LTLYDPKDSSTGSKVSCDQGFCAATYGGLLPGCTT-SLPCEYSVTYG-DGSSTTGYFVSD 105
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
+L S S+V GCG +Q G A DG++G G + S+ S L+ AG
Sbjct: 106 LLQFDQVSGDGQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAG 165
Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
++ F+ C D + G +F G Q P+ Y V ++S +G + L
Sbjct: 166 KVKKIFAHCLDTINGGGIF--AIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLP 223
Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
T ++DSG + T+LP +Y E+++ K I+ C+
Sbjct: 224 SHMFDTGEKKGTIIDSGTTLTYLPEIVYKEIMLAV--FAKHKDITFHNVQEFLCFQYVGR 281
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDGDYGIIGQNFM 322
P + F + V H + F EN G ++C+ + S DG ++ + +
Sbjct: 282 VDDDFPKITFHFENDLPLNVYPHDYFF-EN-GDNLYCVGFQNGGLQSKDGKGMVLLGDLV 339
Query: 323 MGHR-IVFDRENLKLAWSHSKCEEVI 347
+ ++ +V+D EN + W+ C I
Sbjct: 340 LSNKLVVYDLENQVIGWTEYNCSSSI 365
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 84/327 (25%), Positives = 140/327 (42%), Gaps = 28/327 (8%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSSCK----SLKDPCPYIADYSTEDTSSSGYLVDD 96
+L+ YDP +SS+ V C C + S PC Y Y + +S+ G V+D
Sbjct: 131 DLTLYDPKASSTGSTVMCDQGFCADTFGGRLPKCSANVPCEYSVTYG-DGSSTVGSFVND 189
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAG 155
L + +SVI GCG +Q G + A DG++G G + S+ S LA AG
Sbjct: 190 ALQFDQVTGDGQTQPANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAG 249
Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-- 213
++ F+ C D G +F G Q P+ Y V +++ +G + L
Sbjct: 250 KVKKIFAHCLDTIKGGGIF--AIGDVVQPKVKTTPLVADKPHYNVNLKTIDVGGTTLELP 307
Query: 214 ----QSGFQ--ALVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASS 266
+ G + ++DSG + T+LP ++ +V++ F+K + I+ C+ S
Sbjct: 308 ADIFKPGEKRGTIIDSGTTLTYLPELVFKKVMLAVFNK---HQDITFHDVQDFLCFEYSG 364
Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQN 320
P + F + + V H + FP G V+C+ + S DG D ++G
Sbjct: 365 SVDDGFPTLTFHFEDDLALHVYPHEYFFP--NGNDVYCVGFQNGALQSKDGKDIVLMGDL 422
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVI 347
+ +V+D EN + W+ C I
Sbjct: 423 VLSNKLVVYDLENRVIGWTDYNCSSSI 449
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 85/331 (25%), Positives = 146/331 (44%), Gaps = 37/331 (11%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKS------RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
L+ YDP SS+S+ + C C + + K L PC Y Y + +S++G+ V
Sbjct: 126 LTLYDPQSSTSATRIYCDDDFCAATYNGVLQGCTKDL--PCQYSVVYG-DGSSTAGFFVK 182
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKA 154
D L + + SS SVI GCG KQ+G A DG++G G + S+ S LA A
Sbjct: 183 DNLQFDRVTGNLQTSSANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAA 242
Query: 155 GLIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL- 212
G ++ F+ C D G +F G+ +T +P Y+ +E +G + L
Sbjct: 243 GKVKRVFAHCLDNVKGGGIFAIGEVVSPKVNTTPMVPNQPHYNVVMKEIE---VGGNVLE 299
Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK---YCY 262
T ++DSG + +LP +Y ++ K + S++ L+ ++ + C+
Sbjct: 300 LPTDIFDTGDRRGTIIDSGTTLAYLPEVVYESMMTK----IVSEQPGLKLHTVEEQFTCF 355
Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGI 316
+ P ++ F+ + S V H + F +E V+C + S DG D +
Sbjct: 356 QYTGNVNEGFPVVKFHFNGSLSLTVNPHDYLFQIHE--EVWCFGWQNSGMQSKDGRDMTL 413
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
+G + +++D EN + W+ C I
Sbjct: 414 LGDLVLSNKLVLYDLENQAIGWTDYNCSSSI 444
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 87/358 (24%), Positives = 155/358 (43%), Gaps = 36/358 (10%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLC--KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
+S +DP S+S ++SC+ C S S C CPY Y + +S++GYL++D+L
Sbjct: 92 ISIFDPEKSTSKTSISCTDEECYLASNSKCSFNSMSCPYSTLYG-DGSSTAGYLINDVLS 150
Query: 100 LASF-SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
S ++ +S + + GCG QTG++L DG++G G +VS+PS L+K +
Sbjct: 151 FNQVPSGNSTATSGTARLTFGCGSNQTGTWLT----DGLVGFGQAEVSLPSQLSKQNVSV 206
Query: 159 NSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
N F+ C D SG++ G T +P Y+ + + G + T +
Sbjct: 207 NIFAHCLQGDNKGSGTLVIGHIREPGLVYTPIVPKQSHYNVELLNIG--VSGTNVTTPTA 264
Query: 217 FQ------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 270
F ++DSG + T+L Y + K + S + + + + + E
Sbjct: 265 FDLSNSGGVIMDSGTTLTYLVQPAYDQFQAKVRDCMRSGVLPV-----AFQFFCTIEGYF 319
Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFPE--NEGFTVFCLTVMSTDGDYG-----IIGQNFMM 323
P++ L F+ + ++ + + E G + +C + + + YG I G N +
Sbjct: 320 --PNVTLYFAGGAAMLLSPSSYLYKEMLTTGLSAYCFSWLESTSVYGYLSYTIFGDNVLK 377
Query: 324 GHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPP----PAGQSPNPLPTTEQQSTSNG 377
+V+D N ++ W + C + I S P P+ P T + SNG
Sbjct: 378 DQLVVYDNVNNRIGWKNFDCTKEISVSSTATSMPVTVFPSKAGPPGAFVTTNNAHSNG 435
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 83/327 (25%), Positives = 136/327 (41%), Gaps = 28/327 (8%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSSCK----SLKDPCPYIADYSTEDTSSSGYLVDD 96
+L+ YDP +SSS VSC C + K + PC Y Y + +S++G+ V D
Sbjct: 127 DLTFYDPKASSSGSTVSCDQGFCAATYGGKLPGCTANVPCEYSVMYG-DGSSTTGFFVTD 185
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
L + ++V GCG +Q G A DG++G G + S+ S LA AG
Sbjct: 186 ALQFDQVTGDGQTQPGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAG 245
Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
++ F+ C D G +F G Q P+ Y V ++S +G + L
Sbjct: 246 KVKKIFAHCLDTIKGGGIF--AIGNVVQPKVKTTPLVADMPHYNVNLKSIDVGGTTLQLP 303
Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASS 266
T ++DSG + T+LP ++ EV+ F+K + I C+
Sbjct: 304 AHVFETGERKGTIIDSGTTLTYLPELVFKEVMAAIFNK---HQDIVFHNVQDFMCFQYPG 360
Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQN 320
P + F + + V H + FP G ++C+ + S DG D ++G
Sbjct: 361 SVDDGFPTITFHFEDDLALHVYPHEYFFP--NGNDMYCVGFQNGALQSKDGKDIVLMGDL 418
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVI 347
+ +++D EN + W+ C I
Sbjct: 419 VLSNKLVIYDLENQVIGWTDYNCSSSI 445
>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 432
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 91/322 (28%), Positives = 138/322 (42%), Gaps = 37/322 (11%)
Query: 51 SSSKNVSCSHPLCKS--------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+ SK V C H LC S + C+S + C Y+ Y+ + SS+G LV+D S
Sbjct: 110 TKSKLVPCVHRLCASLHNALTGGKHRCESPHEQCDYVIKYA-DQGSSTGVLVND-----S 163
Query: 103 FSKHAPQSSV-QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNS 160
F+ SV + SV GCG Q D ++P DGV+GLG G VS+ S L + G+ +N
Sbjct: 164 FALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNV 223
Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSCLTQSGFQ 218
C G +FFGD Q++T + P+ + Y G S G+ L +
Sbjct: 224 VGHCLSLRGGGFLFFGDDLVPYQRAT-WTPMARSAFRNYYSPGSASLYFGDRSLGVRLAK 282
Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
+ DSG+SFT+ + Y +V +S S C+ E V D+R
Sbjct: 283 VVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKG-QEPFKSVLDVRKE 341
Query: 279 FSKNQSFVV-----RNHIFSFPENEGFTVF-----CLTVMSTD----GDYGIIGQNFMMG 324
F +S V+ + + P V CL +++ D IIG M
Sbjct: 342 F---KSLVLNFASGKKTLMEIPPENYLIVTENGNACLGILNGSEIGLKDLSIIGDITMQD 398
Query: 325 HRIVFDRENLKLAWSHSKCEEV 346
H +++D E K+ W + C+
Sbjct: 399 HMVIYDNEKGKIGWIRAPCDRA 420
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 88/331 (26%), Positives = 140/331 (42%), Gaps = 34/331 (10%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYL 93
+L+ YDP SSS VSC + C + C + K PC Y A+Y + +S++G
Sbjct: 130 DLALYDPKGSSSGSAVSCDNKFCAATYGSGEKLPGCTAGK-PCEYRAEYG-DGSSTAGSF 187
Query: 94 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLA 152
V D L S +A +++VI GCG +Q G A DG++G G + S S LA
Sbjct: 188 VSDSLQYNQLSGNAQTRHAKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLA 247
Query: 153 KAGLIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC 211
AG ++ FS C D G +F G+ +ST LP Y+ V ++S + +
Sbjct: 248 SAGEVKKIFSHCLDTIKGGGIFAIGEVVQPKVKSTPLLPNMSHYN---VNLQSIDVAGNA 304
Query: 212 L--------TQSGFQALVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCY 262
L T ++DSG + T+LP +Y +++ F K ++QG C+
Sbjct: 305 LQLPPHIFETSEKRGTIIDSGTTLTYLPELVYKDILAAVFQKHQDITFRTIQG---FLCF 361
Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD------GDYGI 316
S P + F + V H + F G ++CL + D +
Sbjct: 362 EYSESVDDGFPKITFHFEDDLGLNVYPHDYFF--QNGDNLYCLGFQNGGFQPKDAKDMVL 419
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
+G + +V+D E + W+ C I
Sbjct: 420 LGDLVLSNKVVVYDLEKQVIGWTDYNCSSSI 450
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 88/345 (25%), Positives = 149/345 (43%), Gaps = 45/345 (13%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGY 92
++N S Y P SS+ +N+SC P C+ SS CK+ CPY DY+ ++ +
Sbjct: 207 EQNGSHYYPKDSSTYRNISCYDPRCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDF 266
Query: 93 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
+ ++ + V+ GCG G + GA+ G++GLG G +S PS +
Sbjct: 267 ASETFTVNLTWPNGKEKFKQVVDVMFGCGHWNKG-FFYGAS--GLLGLGRGPISFPSQIQ 323
Query: 153 KAGLIQNSFSICFDE-----NDSGSVFFGDQGPATQQS----TSFLPIGEKYDA--YFVG 201
+ +SFS C + + S + FG+ T+ L E D Y++
Sbjct: 324 --SIYGHSFSYCLTDLFSNTSVSSKLIFGEDKELLNNHNLNFTTLLAGEETPDETFYYLQ 381
Query: 202 VESYCIGNSCLTQS---------------GFQALVDSGASFTFLPTEIYAEVVVKFDKLV 246
++S +G L S G ++DSG++ TF P Y + F+K +
Sbjct: 382 IKSIMVGGEVLDISEQTWHWSSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKI 441
Query: 247 SSKRISLQGNSWKYCYNASSEEM-LKVPDMRLIFSKN--QSFVVRNHIFSFPENEGFTVF 303
++I+ CYN S M +++PD + F+ +F N+ + + +E V
Sbjct: 442 KLQQIAADDFVMSPCYNVSGAMMQVELPDFGIHFADGGVWNFPAENYFYQYEPDE---VI 498
Query: 304 CLTVMST--DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
CL +M T IIG I++D + +L +S +C EV
Sbjct: 499 CLAIMKTPNHSHLTIIGNLLQQNFHILYDVKRSRLGYSPRRCAEV 543
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 88/329 (26%), Positives = 153/329 (46%), Gaps = 38/329 (11%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++DP SSS+ K + C+ C S C Y Y+ E ++SSG L +D++ +
Sbjct: 124 KFDPESSSTYKPIKCNIDCI-----CDSDGVQCVYERQYA-EMSTSSGVLGEDVISFGNQ 177
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S+ PQ +V GC +TG A DG+MGLG GD+S+ L + G I +SFS+
Sbjct: 178 SELIPQRAV-----FGCENMETGDLFSQRA-DGIMGLGTGDLSLVDQLVEKGAINDSFSL 231
Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGV-ESYCIGNSCLTQSG--- 216
C+ D G V G P+ T P+ Y Y V + E + G SG
Sbjct: 232 CYGGMDIGGGAMVLGGISPPSDMIFTYSDPVRSPY--YNVDLKEIHVAGKKLPLSSGIFD 289
Query: 217 --FQALVDSGASFTFLPTEIYAEVV-VKFDKLVSSKRISLQGNSWK-YCYNA----SSEE 268
+ A++DSG ++ +LP E ++ D++ S K+I ++K C++ ++E
Sbjct: 290 GRYGAVLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAEL 349
Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMM 323
K P + ++F Q + + F ++ +CL + D GI+ +N +
Sbjct: 350 SNKFPTVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTL- 408
Query: 324 GHRIVFDRENLKLAWSHSKCEEVIDKSHV 352
+++DR N K+ + + C E+ ++ +
Sbjct: 409 ---VMYDRANSKIGFWKTNCSELWERLRI 434
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 88/329 (26%), Positives = 153/329 (46%), Gaps = 38/329 (11%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++DP SSS+ K + C+ C S C Y Y+ E ++SSG L +D++ +
Sbjct: 124 KFDPESSSTYKPIKCNIDCI-----CDSDGVQCVYERQYA-EMSTSSGVLGEDVISFGNQ 177
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S+ PQ +V GC +TG A DG+MGLG GD+S+ L + G I +SFS+
Sbjct: 178 SELIPQRAV-----FGCENMETGDLFSQRA-DGIMGLGTGDLSLVDQLVEKGAINDSFSL 231
Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGV-ESYCIGNSCLTQSG--- 216
C+ D G V G P+ T P+ Y Y V + E + G SG
Sbjct: 232 CYGGMDIGGGAMVLGGISPPSDMIFTYSDPVRSPY--YNVDLKEIHVAGKKLPLSSGIFD 289
Query: 217 --FQALVDSGASFTFLPTEIYAEVV-VKFDKLVSSKRISLQGNSWK-YCYNA----SSEE 268
+ A++DSG ++ +LP E ++ D++ S K+I ++K C++ ++E
Sbjct: 290 GRYGAVLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAEL 349
Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMM 323
K P + ++F Q + + F ++ +CL + D GI+ +N +
Sbjct: 350 SNKFPTVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTL- 408
Query: 324 GHRIVFDRENLKLAWSHSKCEEVIDKSHV 352
+++DR N K+ + + C E+ ++ +
Sbjct: 409 ---VMYDRANSKIGFWKTNCSELWERLRI 434
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 84/327 (25%), Positives = 145/327 (44%), Gaps = 22/327 (6%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ +DP SSS+S ++CS C S ++C S + C Y Y + + +SGY V D
Sbjct: 122 LNFFDPGSSSTSSMIACSDQRCNNGKQSSDATCSSQNNQCSYTFQYG-DGSGTSGYYVSD 180
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 155
++HL + + + ++ + V+ GC +QTG A DG+ G G ++SV S L+ G
Sbjct: 181 MMHLNTIFEGSMTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQG 240
Query: 156 LIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF----VGVESYCIGN 209
+ FS C D + G + G+ TS +P Y+ V ++ I +
Sbjct: 241 IAPRIFSHCLKGDSSGGGILVLGEIVEPNIVYTSLVPAQPHYNLNLQSISVNGQTLQIDS 300
Query: 210 SCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNASS 266
S S + +VDSG + +L E Y V + S + + +GN CY +S
Sbjct: 301 SVFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAITAAIPQSVRTVVSRGNQ---CYLITS 357
Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE--GFTVFCLTVMSTDGD-YGIIGQNFMM 323
P + L F+ S ++R + +N G V+C+ G I+G +
Sbjct: 358 SVTDVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLK 417
Query: 324 GHRIVFDRENLKLAWSHSKCEEVIDKS 350
+V+D ++ W++ C ++ S
Sbjct: 418 DKIVVYDLAGQRIGWANYDCSLSVNVS 444
>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
Length = 433
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 90/321 (28%), Positives = 137/321 (42%), Gaps = 36/321 (11%)
Query: 51 SSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+ SK V C H LC S + C S + C Y+ Y+ + SS+G L++D SF
Sbjct: 112 TKSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYA-DQGSSTGVLIND-----SF 165
Query: 104 SKHAPQSSV-QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSF 161
+ SV + SV GCG Q D ++P DGV+GLG G VS+ S L + G+ +N
Sbjct: 166 ALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVV 225
Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSCLTQSGFQA 219
C G +FFGD Q++T + P+ + Y G S G+ L +
Sbjct: 226 GHCLSLRGGGFLFFGDDLVPYQRAT-WTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKV 284
Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 279
+ DSG+SFT+ + Y +V +S S C+ E V D+R F
Sbjct: 285 VFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKG-QEPFKSVLDVRKEF 343
Query: 280 SKNQSFVV-----RNHIFSFPENEGFTVF-----CLTVMSTD----GDYGIIGQNFMMGH 325
+S V+ + + P V CL +++ D IIG M H
Sbjct: 344 ---KSLVLNFASGKKTLMEIPPENYLIVTENGNACLGILNGSEIGLKDLSIIGDITMQDH 400
Query: 326 RIVFDRENLKLAWSHSKCEEV 346
+++D E K+ W + C+
Sbjct: 401 MVIYDNEKGKIGWIRAPCDRA 421
>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
gi|194692946|gb|ACF80557.1| unknown [Zea mays]
Length = 424
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 90/320 (28%), Positives = 137/320 (42%), Gaps = 36/320 (11%)
Query: 51 SSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+ SK V C H LC S + C S + C Y+ Y+ + SS+G L++D SF
Sbjct: 103 TKSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYA-DQGSSTGVLIND-----SF 156
Query: 104 SKHAPQSSV-QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSF 161
+ SV + SV GCG Q D ++P DGV+GLG G VS+ S L + G+ +N
Sbjct: 157 ALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVV 216
Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSCLTQSGFQA 219
C G +FFGD Q++T + P+ + Y G S G+ L +
Sbjct: 217 GHCLSLRGGGFLFFGDDLVPYQRAT-WTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKV 275
Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 279
+ DSG+SFT+ + Y +V +S S C+ E V D+R F
Sbjct: 276 VFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKG-QEPFKSVLDVRKEF 334
Query: 280 SKNQSFVV-----RNHIFSFPENEGFTVF-----CLTVMSTD----GDYGIIGQNFMMGH 325
+S V+ + + P V CL +++ D IIG M H
Sbjct: 335 ---KSLVLNFASGKKTLMEIPPENYLIVTENGNACLGILNGSEIGLKDLSIIGDITMQDH 391
Query: 326 RIVFDRENLKLAWSHSKCEE 345
+++D E K+ W + C+
Sbjct: 392 MVIYDNEKGKIGWIRAPCDR 411
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 88/340 (25%), Positives = 147/340 (43%), Gaps = 36/340 (10%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ YDP S S + V+C C + SC S PC Y Y + +S++G+ V D
Sbjct: 134 LTMYDPRGSQSGELVTCDQQFCVANYGGVLPSCTSTS-PCEYSISYG-DGSSTAGFFVTD 191
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
L S + +SV GCG K G A DG++G G + S+ S LA AG
Sbjct: 192 FLQYNQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAG 251
Query: 156 LIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
++ F+ C D + G +F G+ ++T +P Y+ G++ +G + L
Sbjct: 252 KVRKMFAHCLDTVNGGGIFAIGNVVQPKVKTTPLVPDMPHYNVILKGID---VGGTALGL 308
Query: 213 ------TQSGFQALVDSGASFTFLPTEIY-AEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
+ + ++DSG + ++P +Y A + FDK + IS+Q C+ S
Sbjct: 309 PTNIFDSGNSKGTIIDSGTTLAYVPEGVYKALFAMVFDK---HQDISVQTLQDFSCFQYS 365
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQ 319
P++ F + S +V H + F G ++C+ V + DG D ++G
Sbjct: 366 GSVDDGFPEVTFHFEGDVSLIVSPHDYLF--QNGKNLYCMGFQNGGVQTKDGKDMVLLGD 423
Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVI----DKSHVHLV 355
+ +++D EN + W+ C I DK + V
Sbjct: 424 LVLSNKLVLYDLENQAIGWADYNCSSSIKISDDKGSTYTV 463
>gi|20466302|gb|AAM20468.1| putative aspartyl protease [Arabidopsis thaliana]
gi|23198124|gb|AAN15589.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 320
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 72/272 (26%), Positives = 128/272 (47%), Gaps = 17/272 (6%)
Query: 85 EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLG 143
+ +S++GYLV D++HL + + S ++I GCG KQ+G + AA DG+MG G
Sbjct: 4 DGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQS 63
Query: 144 DVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVE 203
+ S S LA G ++ SF+ C D N+ G +F G P+ K Y V +
Sbjct: 64 NSSFISQLASQGKVKRSFAHCLDNNNGGGIF--AIGEVVSPKVKTTPMLSKSAHYSVNLN 121
Query: 204 SYCIGNSC--LTQSGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 255
+ +GNS L+ + F + ++DSG + +LP +Y ++ + L S ++L
Sbjct: 122 AIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAVYNPLLNEI--LASHPELTLHT 179
Query: 256 NSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDG- 312
+ ++++ + P + F K+ S V R ++F E+ + + T G
Sbjct: 180 VQESFTCFHYTDKLDRFPTVTFQFDKSVSLAVYPREYLFQVREDTWCFGWQNGGLQTKGG 239
Query: 313 -DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
I+G + +V+D EN + W++ C
Sbjct: 240 ASLTILGDMALSNKLVVYDIENQVIGWTNHNC 271
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 79/333 (23%), Positives = 150/333 (45%), Gaps = 32/333 (9%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++ P SS+ ++V C+ +C K C Y Y+ E ++SSG L +DI+ +
Sbjct: 54 KFQPDLSSTYQSVKCN-----IDCNCDDEKQQCVYERQYA-EMSTSSGVLGEDIISFGNL 107
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S APQ +V GC +TG A DG+MG+G GD+S+ L G+I +SFS+
Sbjct: 108 SALAPQRAV-----FGCENMETGDLYSQHA-DGIMGMGRGDLSIVDHLVDKGVINDSFSL 161
Query: 164 CF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQ 214
C+ V G P+ + P+ Y Y + ++ + N +
Sbjct: 162 CYGGMGIGGGAMVLGGISPPSNMVFSQSDPVRSPY--YNIDLKEIHVAGKPLPLNPTVFD 219
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEEMLK 271
++DSG ++ +LP + K + S + ++G Y C++ + ++ +
Sbjct: 220 GKHGTILDSGTTYAYLPEAAFVSFKDAIMKELHSLK-PIRGPDPNYNDICFSGAGSDISQ 278
Query: 272 V----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-YGIIGQNFMMGHR 326
+ P + ++F Q ++ + F ++ +CL + D ++G +
Sbjct: 279 LSSSFPAVEMVFGNGQKLLLSPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIVVRNTL 338
Query: 327 IVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPA 359
+++DREN K+ + + C E+ ++ +V PPPA
Sbjct: 339 VLYDRENSKIGFWKTNCSELWERLNVDGAPPPA 371
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 83/330 (25%), Positives = 144/330 (43%), Gaps = 28/330 (8%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ +DP SSS+S ++CS C S ++C S + C Y Y + + +SGY V D
Sbjct: 119 LNFFDPGSSSTSSMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYG-DGSGTSGYYVSD 177
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 155
++HL + + + ++ + V+ GC +QTG A DG+ G G ++SV S L+ G
Sbjct: 178 MMHLNTIFEGSVTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQG 237
Query: 156 LIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL- 212
+ FS C D + G + G+ TS +P Y+ + ++S + L
Sbjct: 238 IAPRVFSHCLKGDSSGGGILVLGEIVEPNIVYTSLVPAQPHYN---LNLQSIAVNGQTLQ 294
Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYN 263
T + +VDSG + +L E Y V + S + +GN CY
Sbjct: 295 IDSSVFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAITASIPQSVHTVVSRGNQ---CYL 351
Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE--GFTVFCLTVMSTDGD-YGIIGQN 320
+S P + L F+ S ++R + +N G V+C+ G I+G
Sbjct: 352 ITSSVTEVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDL 411
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVIDKS 350
+ +V+D ++ W++ C ++ S
Sbjct: 412 VLKDKIVVYDLAGQRIGWANYDCSLSVNVS 441
>gi|15010764|gb|AAK74041.1| AT3g51330/F24M12_370 [Arabidopsis thaliana]
gi|23505835|gb|AAN28777.1| At3g51330/F24M12_370 [Arabidopsis thaliana]
Length = 260
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 62/180 (34%), Positives = 88/180 (48%), Gaps = 9/180 (5%)
Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ--ALVDSGASFT 228
G + FGD+G Q T LP E Y V V +G + G Q AL D+G SFT
Sbjct: 11 GRISFGDKGYTDQMETPLLPT-EPSPTYAVSVTEVSVGGDAV---GVQLLALFDTGTSFT 66
Query: 229 FLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNAS-SEEMLKVPDMRLIFSKNQSFV 286
L Y + FD V+ KR + +++CY+ S ++ + P + + F
Sbjct: 67 HLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTILFPRVAMTFEGGSQMF 126
Query: 287 VRNHIFSFPENEGFTVFCLTVM-STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
+RN +F + ++CL ++ S D IIGQNFM G+RIVFDRE + L W S C E
Sbjct: 127 LRNPLFIVWNEDNSAMYCLGILKSVDFKINIIGQNFMSGYRIVFDRERMILGWKRSDCFE 186
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 85/327 (25%), Positives = 144/327 (44%), Gaps = 35/327 (10%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ YDPS SS+ +SC C + SC S C Y Y + +S+ GY + D
Sbjct: 82 LTTYDPSRSSTDGALSCRDSNCGAALGSNEVSCTS-AGYCAYSTTYG-DGSSTQGYFIQD 139
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYL-DGAAPDGVMGLGLGDVSVPSLLAKAG 155
++ + Q + +SV GCG Q+G+ L A DG++G G VS+PS LA G
Sbjct: 140 VMTFQEIHNNT-QVNGTASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMG 198
Query: 156 LIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI-GNSCL 212
+ N F+ C D G++ G ++ + S+ PI + + Y VG+++ + G +
Sbjct: 199 KVGNRFAHCLQGDNQGGGTIVIGS---VSEPNISYTPIVSR-NHYAVGMQNIAVNGRNVT 254
Query: 213 TQSGFQA--------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
T + F ++DSG + +L Y + V VS+ S+ + + A
Sbjct: 255 TPASFDTTSTSAGGVIMDSGTTLAYLVDPAYTQFV----NAVSTFESSMFSSHSQCLQLA 310
Query: 265 SSEEMLKVPDMRLIFSKN--QSFVVRNHIFSFPENEGFTVFCL-----TVMSTDGDYGII 317
P ++L F + RN+++S P G +C+ T + Y I+
Sbjct: 311 WCSLQADFPTVKLFFDAGAVMNLTPRNYLYSQPLQNGQAAYCMGWQKSTTKAGYLSYSIL 370
Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCE 344
G + H +V+D +N + W C+
Sbjct: 371 GDIVLKDHLVVYDNDNRVVGWKSFDCK 397
>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 406
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 87/328 (26%), Positives = 146/328 (44%), Gaps = 29/328 (8%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+L+ YDP+ S +S V C C S CK CPY Y + +++SG V+
Sbjct: 45 DLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQ-DMSCPYSITYG-DGSTTSGSFVN 102
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAK 153
D L S + SSVI GCG KQ+GS + A DG++G G + SV S LA
Sbjct: 103 DSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAA 162
Query: 154 AGLIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
+G ++ FS C D + G +F G +T +P Y+ ++ G L
Sbjct: 163 SGKVKRIFSHCLDSHHGGGIFSIGQVMEPKFNTTPLVPRMAHYNVILKDMD--VDGEPIL 220
Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
+ SG ++DSG + +LP IY +++ K ++ + + + C++ S
Sbjct: 221 LPLYLFDSGSGRGTIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVEDQFT-CFHYS 279
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQ 319
+ P ++ F + S V H + F E ++C+ + + +G D +IG
Sbjct: 280 DKLDEGFPVVKFHF-EGLSLTVHPHDYLFLYKE--DIYCIGWQKSSTQTKEGRDLILIGD 336
Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVI 347
+ +V+D EN+ + W++ C I
Sbjct: 337 LVLSNKLVVYDLENMVIGWTNFNCSSSI 364
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 90/366 (24%), Positives = 161/366 (43%), Gaps = 47/366 (12%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++ P S + + V C+ P C +C + C Y Y+ E +SSSG L +D++ +
Sbjct: 130 KFQPDLSETYQPVKCT-PDC----NCDGDTNQCMYDRQYA-EMSSSSGVLGEDVVSFGNL 183
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S+ APQ +V GC +TG A DG+MGLG GD+S+ L +I +SFS+
Sbjct: 184 SELAPQRAV-----FGCENDETGDLYSQRA-DGIMGLGRGDLSIMDQLVDKKVISDSFSL 237
Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQ 214
C+ D G + G P T P Y Y + ++ + N +
Sbjct: 238 CYGGMDVGGGAMILGGISPPEDMVFTHSDPDRSPY--YNINLKEMHVAGKKLQLNPKVFD 295
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEE 268
++DSG ++ +LP + + F + + +R SL+ N C+ + +
Sbjct: 296 GKHGTVLDSGTTYAYLPETAF----LAFKRAIMKERNSLKQINGPDPNYKDICFTGAGID 351
Query: 269 MLKV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-YGIIGQNFMM 323
+ ++ P + ++F + + F ++ +CL V S D ++G F+
Sbjct: 352 VSQLAKSFPVVDMVFENGHKLSLSPENYLFRHSKVRGAYCLGVFSNGRDPTTLLGGIFVR 411
Query: 324 GHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPP 383
+++DREN K+ + + C E+ + H +P+PLP+ + +N A P
Sbjct: 412 NTLVMYDRENSKIGFWKTNCSELWETLHT-------SDAPSPLPSNSE--VTNLTKAFAP 462
Query: 384 STAKTA 389
S A +A
Sbjct: 463 SVAPSA 468
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 87/328 (26%), Positives = 146/328 (44%), Gaps = 29/328 (8%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+L+ YDP+ S +S V C C S CK CPY Y + +++SG V+
Sbjct: 115 DLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQ-DMSCPYSITYG-DGSTTSGSFVN 172
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAK 153
D L S + SSVI GCG KQ+GS + A DG++G G + SV S LA
Sbjct: 173 DSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAA 232
Query: 154 AGLIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
+G ++ FS C D + G +F G +T +P Y+ ++ G L
Sbjct: 233 SGKVKRIFSHCLDSHHGGGIFSIGQVMEPKFNTTPLVPRMAHYNVILKDMD--VDGEPIL 290
Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
+ SG ++DSG + +LP IY +++ K ++ + + + C++ S
Sbjct: 291 LPLYLFDSGSGRGTIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVEDQFT-CFHYS 349
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQ 319
+ P ++ F + S V H + F E ++C+ + + +G D +IG
Sbjct: 350 DKLDEGFPVVKFHF-EGLSLTVHPHDYLFLYKE--DIYCIGWQKSSTQTKEGRDLILIGD 406
Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVI 347
+ +V+D EN+ + W++ C I
Sbjct: 407 LVLSNKLVVYDLENMVIGWTNFNCSSSI 434
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 143/339 (42%), Gaps = 34/339 (10%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ YDP S S + V+C C + SC S PC Y Y + +S++G+ V D
Sbjct: 134 LTMYDPRGSQSGELVTCDQQFCVANYGGVLPSCTSTS-PCEYSISYG-DGSSTAGFFVTD 191
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
L S + +SV GCG K G A DG++G G + S+ S LA AG
Sbjct: 192 FLQYNQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAG 251
Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
++ F+ C D + G +F G Q P+ Y V ++ +G + L
Sbjct: 252 KVRKMFAHCLDTVNGGGIFA--IGNVVQPKVKTTPLVSDMPHYNVILKGIDVGGTALGLP 309
Query: 213 -----TQSGFQALVDSGASFTFLPTEIY-AEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
+ + ++DSG + ++P +Y A + FDK + IS+Q C+ S
Sbjct: 310 TNIFDSGNSKGTIIDSGTTLAYVPEGVYKALFAMVFDK---HQDISVQTLQDFSCFQYSG 366
Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQN 320
P++ F + S +V H + F G ++C+ V + DG D ++G
Sbjct: 367 SVDDGFPEVTFHFEGDVSLIVSPHDYLF--QNGKNLYCMGFQNGGVQTKDGKDMVLLGDL 424
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVI----DKSHVHLV 355
+ +++D EN + W+ C I DK + V
Sbjct: 425 VLSNKLVLYDLENQAIGWADYNCSSSIKISDDKGSTYTV 463
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 87/341 (25%), Positives = 144/341 (42%), Gaps = 34/341 (9%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+L+ YD SSS K V C CK + C + CPY+ Y + +S++GY V
Sbjct: 126 DLTLYDIKESSSGKLVPCDQEFCKEINGGLLTGCTA-NISCPYLEIYG-DGSSTAGYFVK 183
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTG--SYLDGAAPDGVMGLGLGDVSVPSLLAK 153
DI+ S S S++ GCG +Q+G S + A DG++G G + S+ S LA
Sbjct: 184 DIVLYDQVSGDLKTDSANGSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLAS 243
Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL- 212
+G ++ F+ C + + G +F G Q + P+ Y V + + +G++ L
Sbjct: 244 SGKVKKMFAHCLNGVNGGGIF--AIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHTFLS 301
Query: 213 --TQSGFQA-----LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
T + Q ++DSG + +LP IY +V K ++ + + C+ S
Sbjct: 302 LSTDTSAQGDRKGTIIDSGTTLAYLPEGIYEPLVYKMISQHPDLKVQTLHDEYT-CFQYS 360
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDGDYGIIGQN 320
P + F S V H + FP +C+ S D + +
Sbjct: 361 ESVDDGFPAVTFFFENGLSLKVYPHDYLFPS---VNFWCIGWQNSGTQSRDSKNMTLLGD 417
Query: 321 FMMGHRIVF-DRENLKLAWSHSKCEEVID-----KSHVHLV 355
++ +++VF D EN + W+ C I VHLV
Sbjct: 418 LVLSNKLVFYDLENQAIGWAEYNCSSSIKVRDERTGTVHLV 458
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 75/325 (23%), Positives = 137/325 (42%), Gaps = 34/325 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRS-SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+DP S+++K ++C PLC + SC D C Y Y+ E +SS G++++D
Sbjct: 55 FDPDKSTTAKKLACGDPLCNCGTPSCTCNNDRCYYSRTYA-ERSSSEGWMIEDTFGF--- 110
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
P S ++ GC +TG A DG+MG+G + S L + +I++ FS+
Sbjct: 111 ----PDSDSPVRLVFGCENGETGEIYRQMA-DGIMGMGNNHNAFQSQLVQRKVIEDVFSL 165
Query: 164 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG--------NSCLTQS 215
CF G + GD +T + P+ ++ V+ I ++ +
Sbjct: 166 CFGYPKDGILLLGDVTLPEGANTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFDR 225
Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKY---CYNASSEEMLK 271
G+ ++DSG +FT+LPT+ + + V K + S G +Y C+ + ++
Sbjct: 226 GYGTVLDSGTTFTYLPTDAFKAMAKAVGDYVEKKGLQSTPGADPQYNDICWKGAPDQFKD 285
Query: 272 V----PDMRLIFSKNQSFV---VRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
+ P +F +R S P +CL + ++G +
Sbjct: 286 LDKYFPPAEFVFGGGAKLTLPPLRYLFLSKPAE-----YCLGIFDNGNSGALVGGVSVRD 340
Query: 325 HRIVFDRENLKLAWSHSKCEEVIDK 349
+ +DR N K+ ++ C +V K
Sbjct: 341 VVVTYDRRNSKVGFTTMACADVARK 365
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 83/340 (24%), Positives = 149/340 (43%), Gaps = 32/340 (9%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSSCK----SLKDPCPYIADYSTEDTSSSGYLVDD 96
+L+ Y+ + S + K V C C + + + CPY+ Y + +S++GY V D
Sbjct: 121 DLTLYNINESDTGKLVPCDQEFCYEINGGQLPGCTANMSCPYLEIYG-DGSSTAGYFVKD 179
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY--LDGAAPDGVMGLGLGDVSVPSLLAKA 154
++ A S ++ SVI GCG +Q+G + A DG++G G + S+ S LA
Sbjct: 180 VVQYARVSGDLKTTAANGSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVT 239
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT- 213
G ++ F+ C D + G +F G Q + P+ Y V + + +G+ L+
Sbjct: 240 GKVKKIFAHCLDGTNGGGIFV--IGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGHEFLSL 297
Query: 214 -----QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
++G + A++DSG + +LP +Y +V K ++ + + C+ S
Sbjct: 298 PTDVFEAGDRKGAIIDSGTTLAYLPEMVYKPLVSKIISQQPDLKVHTVRDEYT-CFQYSD 356
Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTD-GDYGIIGQN 320
P++ F + V H + FP EG ++C+ V S D + ++G
Sbjct: 357 SLDDGFPNVTFHFENSVILKVYPHEYLFPF-EG--LWCIGWQNSGVQSRDRRNMTLLGDL 413
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVID-----KSHVHLV 355
+ +++D EN + W+ C I VHLV
Sbjct: 414 VLSNKLVLYDLENQAIGWTEYNCSSSIQVQDERTGTVHLV 453
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 82/327 (25%), Positives = 136/327 (41%), Gaps = 28/327 (8%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLK----DPCPYIADYSTEDTSSSGYLVDD 96
+L+ YDP +SS+ V C C + K K PC Y Y + +S+ G V D
Sbjct: 129 DLTLYDPKASSTGSMVMCDQAFCAATFGGKLPKCGANVPCEYSVTYG-DGSSTIGSFVTD 187
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
L ++ +SVI GCG +Q G A DG++G G + S+ S L AG
Sbjct: 188 ALQFDQVTRDGQTQPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAG 247
Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-- 213
++ F+ C D G +F G Q P+ Y V +++ +G + L
Sbjct: 248 KVKKIFAHCLDTIKGGGIF--SIGDVVQPKVKTTPLVADKPHYNVNLKTIDVGGTTLQLP 305
Query: 214 ----QSGFQ--ALVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASS 266
+ G + ++DSG + T+LP ++ EV++ F+K + I+ C+
Sbjct: 306 AHIFEPGEKKGTIIDSGTTLTYLPELVFKEVMLAVFNK---HQDITFHDVQGFLCFQYPG 362
Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV-----MSTDG-DYGIIGQN 320
P + F + + V H + F G V+C+ S DG D ++G
Sbjct: 363 SVDDGFPTITFHFEDDLALHVYPHEYFFA--NGNDVYCVGFQNGASQSKDGKDIVLMGDL 420
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVI 347
+ +++D EN + W+ C I
Sbjct: 421 VLSNKLVIYDLENRVIGWTDYNCSSSI 447
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 79/336 (23%), Positives = 158/336 (47%), Gaps = 40/336 (11%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++ P SSS+ + V C+ +C + C Y Y+ E ++SSG L +D++ +
Sbjct: 153 KFQPESSSTYQPVKCT-----IDCNCDGDRMQCVYERQYA-EMSTSSGVLGEDVISFGNQ 206
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S+ APQ +V GC +TG A DG+MGLG GD+S+ L +I +SFS+
Sbjct: 207 SELAPQRAV-----FGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKKVISDSFSL 260
Query: 164 CFDEND--SGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQS 215
C+ D G++ G P + + ++ ++ Y + ++ + N+ +
Sbjct: 261 CYGGMDVGGGAMVLGGISPPSDMTFAY-SDPDRSPYYNIDLKEMHVAGKRLPLNANVFDG 319
Query: 216 GFQALVDSGASFTFLPTE---IYAEVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEMLK 271
++DSG ++ +LP + + +VK +L S K+IS ++ C++ + ++ +
Sbjct: 320 KHGTVLDSGTTYAYLPEAAFLAFKDAIVK--ELQSLKQISGPDPNYNDICFSGAGNDVSQ 377
Query: 272 V----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFM 322
+ P + ++F + + + F ++ +CL + D GII +N +
Sbjct: 378 LSKSFPVVDMVFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGNDQTTLLGGIIVRNTL 437
Query: 323 MGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPP 358
+++DRE K+ + + C E+ ++ + PPP
Sbjct: 438 ----VMYDREQTKIGFWKTNCAELWERLQTSIAPPP 469
>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
from this gene [Arabidopsis thaliana]
Length = 388
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 77/271 (28%), Positives = 122/271 (45%), Gaps = 26/271 (9%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ Y+ S S K VSC C S CK+ CPY+ Y + +S++GY V D
Sbjct: 124 LTLYNIDESDSGKLVSCDDDFCYQISGGPLSGCKA-NMSCPYLEIYG-DGSSTAGYFVKD 181
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA---APDGVMGLGLGDVSVPSLLAK 153
++ S + + SVI GCG +Q+G LD + A DG++G G + S+ S LA
Sbjct: 182 VVQYDSVAGDLKTQTANGSVIFGCGARQSGD-LDSSNEEALDGILGFGKANSSMISQLAS 240
Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
+G ++ F+ C D + G +F G Q + P+ Y V + + +G LT
Sbjct: 241 SGRVKKIFAHCLDGRNGGGIF--AIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLT 298
Query: 214 ------QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
Q G + A++DSG + +LP IY +V K L ++ + +K C+ S
Sbjct: 299 IPADLFQPGDRKGAIIDSGTTLAYLPEIIYEPLVKKEPAL----KVHIVDKDYK-CFQYS 353
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPE 296
P++ F + V H + FP
Sbjct: 354 GRVDEGFPNVTFHFENSVFLRVYPHDYLFPH 384
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 86/341 (25%), Positives = 144/341 (42%), Gaps = 34/341 (9%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+L+ YD SSS K V C CK + C + CPY+ Y + +S++GY V
Sbjct: 128 DLTLYDIKESSSGKFVPCDQEFCKEINGGLLTGCTA-NISCPYLEIYG-DGSSTAGYFVK 185
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTG--SYLDGAAPDGVMGLGLGDVSVPSLLAK 153
DI+ S S S++ GCG +Q+G S + A G++G G + S+ S LA
Sbjct: 186 DIVLYDQVSGDLKTDSANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLAS 245
Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL- 212
+G ++ F+ C + + G +F G Q + P+ Y V + + +G++ L
Sbjct: 246 SGKVKKMFAHCLNGVNGGGIF--AIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHAFLS 303
Query: 213 --TQSGFQA-----LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
T + Q ++DSG + +LP IY +V K ++ + + C+ S
Sbjct: 304 LSTDTSTQGDRKGTIIDSGTTLAYLPEGIYEPLVYKIISQHPDLKVRTLHDEYT-CFQYS 362
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDGDYGIIGQN 320
P + F S V H + FP + +C+ S D + +
Sbjct: 363 ESVDDGFPAVTFYFENGLSLKVYPHDYLFPSGD---FWCIGWQNSGTQSRDSKNMTLLGD 419
Query: 321 FMMGHRIVF-DRENLKLAWSHSKCEEVID-----KSHVHLV 355
++ +++VF D EN + W+ C I VHLV
Sbjct: 420 LVLSNKLVFYDLENQVIGWTEYNCSSSIKVRDERTGTVHLV 460
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 90/383 (23%), Positives = 170/383 (44%), Gaps = 51/383 (13%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++ P SS+ + V C+ +C + + C Y Y+ E ++SSG L +D++ +
Sbjct: 122 KFQPDLSSTYQPVKCT-----LDCNCDNDRMQCVYERQYA-EMSTSSGVLGEDVVSFGNQ 175
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S+ APQ +V GC +TG A DG+MGLG GD+S+ L ++ +SFS+
Sbjct: 176 SELAPQRAV-----FGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKNVVSDSFSL 229
Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQ 214
C+ D G V G P+ P+ Y Y + ++ + N +
Sbjct: 230 CYGGMDVGGGAMVLGGISPPSDMVFAQSDPVRSPY--YNIDLKEIHVAGKRLPLNPSVFD 287
Query: 215 SGFQALVDSGASFTFLPTE---IYAEVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEML 270
+++DSG ++ +LP E + E +VK +L S +IS ++ C++ + ++
Sbjct: 288 GKHGSVLDSGTTYAYLPEEAFLAFKEAIVK--ELQSFSQISGPDPNYNDLCFSGAGIDVS 345
Query: 271 KV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-----YGIIGQNF 321
++ P + +IF + + + F ++ +CL + D GI+ +N
Sbjct: 346 QLSKTFPVVDMIFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGKDPTTLLGGIVVRNT 405
Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAA 381
+ +++DRE K+ + + C E+ ++ + PPP P TE +N +
Sbjct: 406 L----VLYDREQTKIGFWKTNCAELWERLQISSAPPPMP------PNTE---ATNSTKSV 452
Query: 382 PPSTAKTAPSKSIAASAQQLDSV 404
PS A + +I Q+ +
Sbjct: 453 DPSVAPSVSQHNIPRGEFQIAQI 475
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/332 (27%), Positives = 144/332 (43%), Gaps = 27/332 (8%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ +DP SS+++ VSCS C S S C S + C Y Y + + +SGY V D
Sbjct: 128 LTFFDPGSSTTAALVSCSDQRCTAGIQSSDSLCSSRTNQCGYTFQYG-DGSGTSGYYVAD 186
Query: 97 ILHLASFSKHAPQ-----SSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSL 150
++HL + + + + SSV C QTG A DG+ G G ++SV S
Sbjct: 187 LMHLDTLLLSSGELSQICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQ 246
Query: 151 LAKAGLIQNSFSICFDENDSGS--VFFGDQGPATQQSTSFLPIGEKYDAYF----VGVES 204
LA G+ FS C +DSG + G+ T +P Y+ Y V ++
Sbjct: 247 LASQGITPRVFSHCLKGDDSGGGVLVLGEIVEPNIVYTPLVPSQPHYNLYLQSISVAGQT 306
Query: 205 YCIGNSCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVS-SKRISL-QGNSWKYC 261
I S S Q +VDSG + +L Y V +VS + R L +GN C
Sbjct: 307 LAIDPSVFGASSNQGTIVDSGTTLAYLAEGAYDPFVSAITSVVSLNARTYLSKGNQ---C 363
Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE--GFTVFCLTVMSTDGDYGIIGQ 319
Y +S P + L F+ S ++ + +N G V+C+ T G I
Sbjct: 364 YLVTSSVNDVFPQVSLNFAGGASLILNPQDYLLQQNSVGGAAVWCVGFQKTPGQQITILG 423
Query: 320 NFMMGHRI-VFDRENLKLAWSHSKCEEVIDKS 350
+ ++ +I V+D N ++ W++ C ++ S
Sbjct: 424 DLVLKDKIFVYDIANQRVGWTNYDCSMSVNVS 455
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 98/384 (25%), Positives = 164/384 (42%), Gaps = 54/384 (14%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++ P SSS K + C +P C +C C Y Y+ E +SSSG L +D++ +
Sbjct: 121 KFQPELSSSYKALKC-NPDC----NCDDEGKLCVYERRYA-EMSSSSGVLSEDLISFGNE 174
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S+ PQ +V GC +TG A DG+MGLG G +SV L G+I++ FS+
Sbjct: 175 SQLTPQRAV-----FGCENVETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSL 228
Query: 164 CFD--ENDSGSVFFGDQGPATQQSTSFL-PIGEKYDAYFVGVESYCIGNSCLT------Q 214
C+ E G++ G P S P Y Y + ++ + L
Sbjct: 229 CYGGMEVGGGAMVLGKISPPAGMVFSHSDPFRSPY--YNIDLKQMHVAGKSLKLNPKVFN 286
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKY---CYNASSEEML 270
++DSG ++ + P E + + K + S KRI G Y C++ + ++
Sbjct: 287 GKHGTVLDSGTTYAYFPKEAFIAIKDAIIKEIPSLKRI--HGPDPNYDDVCFSGAGRDVA 344
Query: 271 KV----PDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
++ P++ + F Q ++ N++F + G +CL + ++G +
Sbjct: 345 EIHNFFPEIDMEFGNGQKLILSPENYLFRHTKVRG--AYCLGIFPDRDSTTLLGGIVVRN 402
Query: 325 HRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPS 384
+ +DREN KL + + C ++ + L P +SP P Q +SN PS
Sbjct: 403 TLVTYDRENDKLGFLKTNCSDLWRR----LAAP---ESPAPTSPISQNKSSN----ISPS 451
Query: 385 TAKTAPSKSIAASAQQLDSVLRVA 408
AK+ + L VLRV
Sbjct: 452 PAKS------ESPTTDLPGVLRVG 469
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 86/340 (25%), Positives = 147/340 (43%), Gaps = 36/340 (10%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ YDP S S + V+C C + SC S PC Y Y + +S++G+ V D
Sbjct: 134 LTMYDPRGSQSGELVTCDQQFCVANYGGVLPSCTSTS-PCEYSISYG-DGSSTAGFFVTD 191
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
L S + +SV GCG K G A DG++G G + S+ S LA AG
Sbjct: 192 FLQYNQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAG 251
Query: 156 LIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
++ F+ C D + G +F G+ ++T +P Y+ G++ +G + L
Sbjct: 252 KVRKMFAHCLDTVNGGGIFAIGNVVQPKVKTTPLVPDMPHYNVILKGID---VGGTALGL 308
Query: 213 ------TQSGFQALVDSGASFTFLPTEIY-AEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
+ + ++DSG + ++P +Y A + FDK + IS+Q C+ S
Sbjct: 309 PTNIFDSGNSKGTIIDSGTTLAYVPEGVYKALFAMVFDK---HQDISVQTLQDFSCFQYS 365
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM-----STDGDYGIIGQN 320
P++ F + S +V H + F G ++C+ + DG + +
Sbjct: 366 GSVDDGFPEVTFHFEGDVSLIVSPHDYLF--QNGKNLYCMGFQNGGGKTKDGKDLGLLGD 423
Query: 321 FMMGHRIV-FDRENLKLAWSHSKCEEVI----DKSHVHLV 355
++ +++V +D EN + W+ C I DK + V
Sbjct: 424 LVLSNKLVLYDLENQAIGWADYNCSSSIKISDDKGSTYTV 463
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 72/319 (22%), Positives = 134/319 (42%), Gaps = 19/319 (5%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
L+ +D +SSS+++ V CSHP+C S+ + C + C Y Y + + +SGY V
Sbjct: 124 QLNYFDTTSSSTARLVPCSHPICTSQIQTTATQCPPQSNQCSYAFQYG-DGSGTSGYYVS 182
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKA 154
D + + + ++ ++++ GC Q+G A DG+ G G G++SV S L+
Sbjct: 183 DTFYFDAVLGESLIANSSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSH 242
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
G+ FS C DSG G + + P+ Y + ++S + L
Sbjct: 243 GITPRVFSHCLKGEDSGGGIL-VLGEILEPGIVYSPLVPSQPHYNLDLQSIAVSGQLLPI 301
Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
T S ++D+G + +L E Y V V S+ + N CY S+
Sbjct: 302 DPAAFATSSNRGTIIDTGTTLAYLVEEAYDPFVSAITAAV-SQLATPTINKGNQCYLVSN 360
Query: 267 EEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
P + F+ + +++ ++ G ++C+ G I+G +
Sbjct: 361 SVSEVFPPVSFNFAGGATMLLKPEEYLMYLTNYAGAALWCIGFQKIQGGITILGDLVLKD 420
Query: 325 HRIVFDRENLKLAWSHSKC 343
V+D + ++ W++ C
Sbjct: 421 KIFVYDLAHQRIGWANYDC 439
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 89/348 (25%), Positives = 152/348 (43%), Gaps = 52/348 (14%)
Query: 63 CKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR 122
C +C S K+ C Y Y+ E +SSSG L +DI+ + S+ PQ +V GC
Sbjct: 143 CNVDCTCDSDKNQCTYERQYA-EMSSSSGVLGEDIVSFGTESELKPQRAV-----FGCEN 196
Query: 123 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS---VFFGDQG 179
+TG A DG+MGLG G +S+ L G+I +SFS+C+ D G V
Sbjct: 197 SETGDLFSQHA-DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPA 255
Query: 180 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGASFTFLPTE 233
P T + Y Y + ++ + L ++DSG ++ +LP +
Sbjct: 256 PPGMIYTHSNAVRSPY--YNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTYAYLPEQ 313
Query: 234 IYAEVVVKFDKLVSS-----KRISLQGNSWK-YCYNASSEEMLKV----PDMRLIFSKNQ 283
+ V F VSS K+I +++K C+ + + ++ P + ++F Q
Sbjct: 314 AF----VAFKDAVSSQVHPLKKIRGPDSNYKDICFAGAGRNVSQLSEVFPKVDMVFGNGQ 369
Query: 284 --SFVVRNHIFSFPENEGFTVFCLTVMSTDGD-----YGIIGQNFMMGHRIVFDRENLKL 336
S N++F + EG +CL V D GI+ +N + + +DR N K+
Sbjct: 370 KLSLSPENYLFRHSKVEG--AYCLGVFQNGKDPTTLLGGIVVRNTL----VTYDRHNEKI 423
Query: 337 AWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPS 384
+ + C E+ ++ +G +P+P P+ + ++ A PS
Sbjct: 424 GFWKTNCSELWERLQ-------SGGAPSPAPSNDPGPQADLSPAPAPS 464
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 82/337 (24%), Positives = 153/337 (45%), Gaps = 42/337 (12%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++ P S + + V C+ + +C + + C Y Y+ E ++SSG L +D++ +
Sbjct: 134 KFRPEDSETYQPVKCTW-----QCNCDNDRKQCTYERRYA-EMSTSSGALGEDVVSFGNQ 187
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
++ +PQ + I GC +TG + A DG+MGLG GD+S+ L + +I +SFS+
Sbjct: 188 TELSPQRA-----IFGCENDETGDIYNQRA-DGIMGLGRGDLSIMDQLVEKKVISDSFSL 241
Query: 164 CF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQ 214
C+ V G PA T P+ Y Y + ++ + N +
Sbjct: 242 CYGGMGVGGGAMVLGGISPPADMVFTRSDPVRSPY--YNIDLKEIHVAGKRLHLNPKVFD 299
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKY---CYNASSEEML 270
++DSG ++ +LP + K S KRIS G +Y C++ + ++
Sbjct: 300 GKHGTVLDSGTTYAYLPESAFLAFKHAIMKETHSLKRIS--GPDPRYNDICFSGAEIDVS 357
Query: 271 KV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNF 321
++ P + ++F + + F ++ +CL V S D GI+ +N
Sbjct: 358 QISKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNT 417
Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPP 358
+ +++DRE+ K+ + + C E+ ++ HV PPP
Sbjct: 418 L----VMYDREHTKIGFWKTNCSELWERLHVSDAPPP 450
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 89/348 (25%), Positives = 151/348 (43%), Gaps = 52/348 (14%)
Query: 63 CKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR 122
C +C S K+ C Y Y+ E +SSSG L +DI+ + S+ PQ +V GC
Sbjct: 143 CNVDCTCDSDKNQCTYERQYA-EMSSSSGVLGEDIVSFGTESELKPQRAV-----FGCEN 196
Query: 123 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS---VFFGDQG 179
+TG A DG+MGLG G +S+ L G+I +SFS+C+ D G V
Sbjct: 197 SETGDLFSQHA-DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPA 255
Query: 180 PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------QSGFQALVDSGASFTFLPTE 233
P T + Y Y + ++ + L ++DSG ++ +LP +
Sbjct: 256 PPGMIYTHSNAVRSPY--YNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTYAYLPEQ 313
Query: 234 IYAEVVVKFDKLVSS-----KRISLQGNSWK-YCYNASSEEMLKV----PDMRLIFSKNQ 283
+ V F VSS K+I ++K C+ + + ++ P + ++F Q
Sbjct: 314 AF----VAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPKVDMVFGNGQ 369
Query: 284 --SFVVRNHIFSFPENEGFTVFCLTVMSTDGD-----YGIIGQNFMMGHRIVFDRENLKL 336
S N++F + EG +CL V D GI+ +N + + +DR N K+
Sbjct: 370 KLSLSPENYLFRHSKVEG--AYCLGVFQNGKDPTTLLGGIVVRNTL----VTYDRHNEKI 423
Query: 337 AWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPS 384
+ + C E+ ++ +G +P+P P+ + ++ A PS
Sbjct: 424 GFWKTNCSELWERLQ-------SGGAPSPAPSNDPGPQADLSPAPAPS 464
>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
Length = 599
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 93/338 (27%), Positives = 148/338 (43%), Gaps = 52/338 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLCK-SRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP+SSSSS + C C R C S K C Y Y+ E +SS+G LV D L L
Sbjct: 106 FDPASSSSSAVIGCDSDKCICGRPPCGCSEKRECTYQRTYA-EQSSSAGLLVSDQLQLR- 163
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
+V+ V+ GC K+TG + A DG++GLG +VS+ + LA +G+I + F+
Sbjct: 164 ------DGAVE--VVFGCETKETGEIYNQEA-DGILGLGNSEVSLVNQLAGSGVIDDVFA 214
Query: 163 ICFD--ENDSGSVFFGDQGPATQ----QSTSFLPIGEKYDAYFVGVESYCIGNSCLT--- 213
+CF E D G++ GD A Q T+ L Y V +E+ +G L
Sbjct: 215 LCFGSVEGD-GALMLGDVDAAEYDVALQYTALLSSLAHPHYYSVQLEALWVGGQQLPVKP 273
Query: 214 ---QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK----------- 259
+ G+ ++DSG +FT+LP+E + F + VS+ + NS K
Sbjct: 274 ERYEEGYGTVLDSGTTFTYLPSEAFQ----LFKEAVSAYALEHGLNSVKGPDPKEKSFAQ 329
Query: 260 ---YCY-------NASSEEMLKV-PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM 308
C+ +A ++ KV P L F+ + F +CL V
Sbjct: 330 FHDICFGGAPHAGHADQSKLEKVFPVFELQFADGVRLRTGPLNYLFMHTGEMGAYCLGVF 389
Query: 309 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
++G + +DR N ++ + + C+E+
Sbjct: 390 DNGASGTLLGGISFRNILVQYDRRNRRVGFGAASCQEI 427
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 87/308 (28%), Positives = 143/308 (46%), Gaps = 34/308 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRS----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
++PS SSS KN+ C+ CK + SC + D C Y Y D S G L +D L L
Sbjct: 131 FNPSKSSSYKNIPCTSSTCKDTNDTHISCSNGGDVCEYSITYGG-DAKSQGDLSNDSLTL 189
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
S S S + +++IGCG D + GV+G+G G +S+ + + + +
Sbjct: 190 DSTSG---SSVLFPNIVIGCGHINV--LQDNSQSSGVVGMGRGPMSLIKQVGSSS-VGSK 243
Query: 161 FSICF-----DENDSGSVFFGDQGPATQQ---STSFLPIGEKYDAYFVGVESYCIGNSCL 212
FS C D N S + FG+ + + ST + + + + YF+ +E++ +GN+ +
Sbjct: 244 FSYCLIPYNSDSNSSSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRI 303
Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
S L+DSG T LP +++V + V RI + CYN +
Sbjct: 304 EYGERSNASTQNILIDSGTPLTMLPNLFLSKLVSYVAQEVKLPRIEPPDHHLSLCYNTTG 363
Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG--DYGIIGQNFMMG 324
++ L VPD+ F+ + N F FP +G + C +S++G +G I QN ++
Sbjct: 364 KQ-LNVPDITAHFNGADVKLNSNGTF-FPFEDG--IMCFGFISSNGLEIFGNIAQNNLL- 418
Query: 325 HRIVFDRE 332
I +D E
Sbjct: 419 --IDYDLE 424
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 84/341 (24%), Positives = 138/341 (40%), Gaps = 40/341 (11%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSSCK----SLKDPCPYIADYSTEDTSSSGYLVDD 96
+L+ YDP +SSS VSC C + K + PC Y Y + +S++G+ + D
Sbjct: 130 DLTFYDPKASSSGSTVSCDQGFCAATYGGKLPGCTANVPCEYSVMYG-DGSSTTGFFITD 188
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAG 155
L + +++ GCG +Q G + A DG++G G + S+ S LA AG
Sbjct: 189 ALQFDQVTGDGQTQPGNATITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAG 248
Query: 156 LIQNSFSICFDENDSGS--------------VFFGDQGPATQQSTSFLPIGEKYDAYFVG 201
+ F+ C D G VFF G + I Y V
Sbjct: 249 KAKKIFAHCLDTIKGGGIFAIGNVVQPKCYFVFFFAHGLLNIPLFLLVMILLSRPHYNVN 308
Query: 202 VESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR-IS 252
++S +G + L T ++DSG + T+LP ++ +V+ D + S R I+
Sbjct: 309 LKSIDVGGTTLQLPAHVFETGEKKGTIIDSGTTLTYLPELVFKQVM---DVVFSKHRDIA 365
Query: 253 LQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TV 307
C+ S P + F + + V H + FP G ++C+ +
Sbjct: 366 FHNLQDFLCFQYSGSVDDGFPTITFHFEDDLALHVYPHEYFFP--NGNDIYCVGFQNGAL 423
Query: 308 MSTDG-DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
S DG D ++G + +V+D EN + W+ C I
Sbjct: 424 QSKDGKDIVLMGDLVLSNKLVVYDLENQVIGWTDYNCSSSI 464
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 80/326 (24%), Positives = 134/326 (41%), Gaps = 29/326 (8%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ YDPS SSS V+C C + SC PC Y Y + +S++G+ V D
Sbjct: 125 LTLYDPSGSSSGTGVTCGQDFCVATHGGVIPSCVPAA-PCQYSISYG-DGSSTTGFFVTD 182
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAG 155
L S ++ + +S+ GCG K G + A DG++G G + S+ S LA AG
Sbjct: 183 FLQYNQVSGNSQTTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAG 242
Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
++ F+ C D + G +F G Q S P+ Y V +E+ +G L
Sbjct: 243 KVRKVFAHCLDTINGGGIF--AIGDVVQPKVSTTPLVPGMPHYNVNLEAIDVGGVKLQLP 300
Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
++DSG + +LP +Y ++ K + L+ + C+ S
Sbjct: 301 TNIFDIGESKGTIIDSGTTLAYLPGVVYNAIMSKV--FAQYGDMPLKNDQDFQCFRYSGS 358
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDGDYGIIGQNFM 322
P + F + H + F E ++C+ + + DG ++ +
Sbjct: 359 VDDGFPIITFHFEGGLPLNIHPHDYLFQNGE---LYCMGFQTGGLQTKDGKDMVLLGDLA 415
Query: 323 MGHRIV-FDRENLKLAWSHSKCEEVI 347
+R+V +D EN + W+ C I
Sbjct: 416 FSNRLVLYDLENQVIGWTDYNCSSSI 441
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 81/327 (24%), Positives = 140/327 (42%), Gaps = 28/327 (8%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSS----CK-SLKDPCPYIADYSTEDTSSSGYLVD 95
+L+ YD +S++S V C C CK L+ C Y Y + +S++GY V
Sbjct: 117 DLTLYDMKASTTSDAVGCDDNFCSLYDGPLPGCKPGLQ--CLYSVLYG-DGSSTTGYFVQ 173
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKA 154
D + S + + +V+ GCG KQ+G + A DG++G G + S+ S LA +
Sbjct: 174 DFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASS 233
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT- 213
G ++ FS C D D G +F G + + P+ + Y V ++ +G L
Sbjct: 234 GKVKKVFSHCLDNVDGGGIFA--IGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDV 291
Query: 214 -----QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
+SG + ++DSG + + P E+Y ++ K R+ ++ C++ +
Sbjct: 292 PSDAFESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAFT-CFDYTG 350
Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQN 320
P + L F K+ S V H + F E +C+ + DG D ++G
Sbjct: 351 NVDDGFPTVTLHFDKSISLTVYPHEYLFQVKE--FEWCIGWQNSGAQTKDGKDLTLLGDL 408
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVI 347
+ +V+D E + W C I
Sbjct: 409 VLSNKLVVYDLEKQGIGWVEYNCSSSI 435
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 82/320 (25%), Positives = 136/320 (42%), Gaps = 19/320 (5%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDIL 98
LS +D ++SS+SK V C C S S + C Y Y+ E TS G + D+L
Sbjct: 117 RLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIVYADESTSD-GKFIRDML 175
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLI 157
L + + V+ GCG Q+G +G +A DGVMG G + SV S LA G
Sbjct: 176 TLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDA 235
Query: 158 QNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVE----SYCIGNSCL 212
+ FS C D G +F G ++T +P Y+ +G++ S + S +
Sbjct: 236 KRVFSHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIV 295
Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLK 271
G +VDSG + + P +Y ++ + +++ + + L + C++ S+
Sbjct: 296 RNGG--TIVDSGTTLAYFPKVLYDSLI---ETILARQPVKLHIVEETFQCFSFSTNVDEA 350
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV--MSTDGDYGII--GQNFMMGHRI 327
P + F + V H + F E F ++TD +I G + +
Sbjct: 351 FPPVSFEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLV 410
Query: 328 VFDRENLKLAWSHSKCEEVI 347
V+D +N + W+ C I
Sbjct: 411 VYDLDNEVIGWADHNCSSSI 430
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 86/352 (24%), Positives = 155/352 (44%), Gaps = 44/352 (12%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++ P S+S + + C +P C +C C Y Y+ E +SSSG L +D++ +
Sbjct: 117 KFQPELSTSYQALKC-NPDC----NCDDEGKLCVYERRYA-EMSSSSGVLSEDLISFGNE 170
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S+ +PQ +V GC ++TG A DG+MGLG G +SV L G+I++ FS+
Sbjct: 171 SQLSPQRAV-----FGCENEETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSL 224
Query: 164 CFD--ENDSGSVFFGDQGPATQQSTSFL-PIGEKYDAYFVGVESYCIGNSCLT------Q 214
C+ E G++ G P S P Y Y + ++ + L
Sbjct: 225 CYGGMEVGGGAMVLGKISPPPGMVFSHSDPFRSPY--YNIDLKQMHVAGKSLKLNPKVFN 282
Query: 215 SGFQALVDSGASFTFLPTEIY---AEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEE 268
++DSG ++ + P E + + V+K ++ S KRI G Y C++ + +
Sbjct: 283 GKHGTVLDSGTTYAYFPKEAFIAIKDAVIK--EIPSLKRI--HGPDPNYDDVCFSGAGRD 338
Query: 269 MLKV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
+ ++ P++ + F Q ++ + F + +CL + ++G +
Sbjct: 339 VAEIHNFFPEIAMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRN 398
Query: 325 HRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSN 376
+ +DREN KL + + C ++ + L P +SP P Q +SN
Sbjct: 399 TLVTYDRENDKLGFLKTNCSDIWRR----LAAP---ESPAPTSPISQNKSSN 443
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 86/352 (24%), Positives = 155/352 (44%), Gaps = 44/352 (12%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++ P S+S + + C +P C +C C Y Y+ E +SSSG L +D++ +
Sbjct: 117 KFQPELSTSYQALKC-NPDC----NCDDEGKLCVYERRYA-EMSSSSGVLSEDLISFGNE 170
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S+ +PQ +V GC ++TG A DG+MGLG G +SV L G+I++ FS+
Sbjct: 171 SQLSPQRAV-----FGCENEETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSL 224
Query: 164 CFD--ENDSGSVFFGDQGPATQQSTSFL-PIGEKYDAYFVGVESYCIGNSCLT------Q 214
C+ E G++ G P S P Y Y + ++ + L
Sbjct: 225 CYGGMEVGGGAMVLGKISPPPGMVFSHSDPFRSPY--YNIDLKQMHVAGKSLKLNPKVFN 282
Query: 215 SGFQALVDSGASFTFLPTEIY---AEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEE 268
++DSG ++ + P E + + V+K ++ S KRI G Y C++ + +
Sbjct: 283 GKHGTVLDSGTTYAYFPKEAFIAIKDAVIK--EIPSLKRI--HGPDPNYDDVCFSGAGRD 338
Query: 269 MLKV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
+ ++ P++ + F Q ++ + F + +CL + ++G +
Sbjct: 339 VAEIHNFFPEIAMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRN 398
Query: 325 HRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSN 376
+ +DREN KL + + C ++ + L P +SP P Q +SN
Sbjct: 399 TLVTYDRENDKLGFLKTNCSDIWRR----LAAP---ESPAPTSPISQNKSSN 443
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 83/328 (25%), Positives = 142/328 (43%), Gaps = 24/328 (7%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ +DP SS ++ +SCS C S S C + + C Y Y + + +SGY V D
Sbjct: 96 LNFFDPGSSPTASLISCSDQRCSLGLQSSDSVCSAQNNLCGYNFQYG-DGSGTSGYYVSD 154
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 155
+LH + + ++ + ++ GC QTG A DG+ G G D+SV S LA G
Sbjct: 155 LLHFDTVLGGSVMNNSSAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQG 214
Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
+ +FS C +DSG G + + + P+ Y + ++S + L
Sbjct: 215 ISPRAFSHCLKGDDSGGGIL-VLGEIVEPNIVYTPLVPSQPHYNLNMQSISVNGQTLAID 273
Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS-SKRISL-QGNSWKYCYNAS 265
T S ++DSG + +L Y + +VS S R L +GN +CY S
Sbjct: 274 PSVFGTSSSQGTIIDSGTTLAYLAEAAYDPFISAITSIVSPSVRPYLSKGN---HCYLIS 330
Query: 266 SEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGD-YGIIGQNFM 322
S P + L F+ S ++ ++++ G ++C+ G I+G +
Sbjct: 331 SSINDIFPQVSLNFAGGASMILIPQDYLIQQSSIGGAALWCIGFQKIQGQGITILGDLVL 390
Query: 323 MGHRIVFDRENLKLAWSHSKCEEVIDKS 350
V+D N ++ W++ C ++ S
Sbjct: 391 KDKIFVYDIANQRIGWANYDCSMSVNVS 418
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 81/327 (24%), Positives = 140/327 (42%), Gaps = 28/327 (8%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSS----CK-SLKDPCPYIADYSTEDTSSSGYLVD 95
+L+ YD +S++S V C C CK L+ C Y Y + +S++GY V
Sbjct: 198 DLTLYDMKASTTSDAVGCDDNFCSLYDGPLPGCKPGLQ--CLYSVLYG-DGSSTTGYFVQ 254
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKA 154
D + S + + +V+ GCG KQ+G + A DG++G G + S+ S LA +
Sbjct: 255 DFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASS 314
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT- 213
G ++ FS C D D G +F G + + P+ + Y V ++ +G L
Sbjct: 315 GKVKKVFSHCLDNVDGGGIF--AIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDV 372
Query: 214 -----QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
+SG + ++DSG + + P E+Y ++ K R+ ++ C++ +
Sbjct: 373 PSDAFESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAFT-CFDYTG 431
Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQN 320
P + L F K+ S V H + F E +C+ + DG D ++G
Sbjct: 432 NVDDGFPTVTLHFDKSISLTVYPHEYLFQVKE--FEWCIGWQNSGAQTKDGKDLTLLGDL 489
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVI 347
+ +V+D E + W C I
Sbjct: 490 VLSNKLVVYDLEKQGIGWVEYNCSSSI 516
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 81/328 (24%), Positives = 138/328 (42%), Gaps = 24/328 (7%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
L+ +D SSSS++ V CS P+C S + C S D C Y Y + + +SGY V
Sbjct: 109 QLNFFDSSSSSTAGQVRCSDPICTSAVQTTATQCSSQTDQCSYTFQYG-DGSGTSGYYVS 167
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKA 154
D L+ + + + + ++ GC Q+G A DG+ G G G++SV S L+
Sbjct: 168 DTLYFDAILGQSLIDNSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTR 227
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
G+ FS C + SG G + + P+ Y + + S + L
Sbjct: 228 GITPRVFSHCLKGDGSGGGIL-VLGEILEPGIVYSPLVPSQPHYNLNLLSIAVNGQLLPI 286
Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR--ISLQGNSWKYCYNA 264
T + +VDSG + +L E Y V + +VS I+ +GN CY
Sbjct: 287 DPAAFATSNSQGTIVDSGTTLAYLVAEAYDPFVSAVNAIVSPSVTPITSKGNQ---CYLV 343
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
S+ P F+ S V++ +++ F + G ++C+ G I+G +
Sbjct: 344 STSVSQMFPLASFNFAGGASMVLKPEDYLIPFGSSGGSAMWCIGFQKVQG-VTILGDLVL 402
Query: 323 MGHRIVFDRENLKLAWSHSKCEEVIDKS 350
V+D ++ W++ C ++ S
Sbjct: 403 KDKIFVYDLVRQRIGWANYDCSLSVNVS 430
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 80/334 (23%), Positives = 144/334 (43%), Gaps = 31/334 (9%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+L Y+P SSS+S ++C P C + CK C Y Y + ++++GY V+
Sbjct: 116 DLQLYNPKSSSTSTLITCDQPFCSATYDAPIPGCKP-DLLCQYKVIYG-DGSATAGYFVN 173
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKA 154
D + L + S S++ GCG KQ+G + A DG++G G + S+ S LA
Sbjct: 174 DYIQLQRAVGNHKTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAAT 233
Query: 155 GLIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL- 212
G ++ F+ C D G +F G+ ++T +P Y+ GV+ +G++ L
Sbjct: 234 GKVKKIFAHCLDSISGGGIFAIGEVVEPKLKTTPVVPNQAHYNVVLNGVK---VGDTALD 290
Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNA 264
T A++DSG + +LP IY ++ K L + + L+ ++ C+
Sbjct: 291 LPLGLFETSYKRGAIIDSGTTLAYLPDSIYLPLMEKI--LGAQPDLKLRTVDDQFTCFVF 348
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIG 318
P + F ++ + H + F + V+C+ S DG + ++G
Sbjct: 349 DKNVDDGFPTVTFKFEESLILTIYPHEYLFQIRD--DVWCVGWQNSGAQSKDGNEVTLLG 406
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHV 352
+ + ++ EN + W+ C I V
Sbjct: 407 DLVLQNKLVYYNLENQTIGWTEYNCSSGIKLKDV 440
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 81/328 (24%), Positives = 141/328 (42%), Gaps = 32/328 (9%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVD 95
L++YDP+ S ++ V C C + S +C S PC + Y + +S++G+ V
Sbjct: 129 LTQYDPAGSGTT--VGCDQEFCVANSPNGLPPACPSTSSPCQFRIAYG-DGSSTTGFYVS 185
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKA 154
D + S + + +S+ GCG + G + A DG++G G D S+ S LA A
Sbjct: 186 DSVQYNQVSGNGQTTPSNASITFGCGAQLGGDLGSSSQALDGILGFGQADSSMLSQLAAA 245
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT- 213
++ F+ C D G +F G Q P+ + Y V ++ +G + L
Sbjct: 246 RKVRKIFAHCLDTVHGGGIF--AIGNVVQPKVKTTPLVQNVTHYNVNLQGISVGGATLQL 303
Query: 214 -QSGFQA------LVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNAS 265
S F + ++DSG + +LP E+Y ++ FDK + ++L C+ S
Sbjct: 304 PSSTFDSGDSKGTIIDSGTTLAYLPREVYRTLLTAVFDKY---QDLALHNYQDFVCFQFS 360
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIGQ 319
P + F + V H + F +NE ++C+ V + DG D ++G
Sbjct: 361 GSIDDGFPVVTFSFEGEITLNVYPHDYLF-QNEN-DLYCMGFLDGGVQTKDGKDMVLLGD 418
Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVI 347
+ +V+D E + W+ C I
Sbjct: 419 LVLSNKLVVYDLEKQVIGWADYNCSSSI 446
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 89/360 (24%), Positives = 153/360 (42%), Gaps = 43/360 (11%)
Query: 57 SCSHPL-CKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 115
S HP+ C +C C Y Y+ E +SSSG L +DI+ + S+ PQ +V
Sbjct: 136 STYHPVKCNMDCNCDHDGVNCVYERRYA-EMSSSSGVLGEDIISFGNQSEVVPQRAV--- 191
Query: 116 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--ENDSGSV 173
GC +TG A DG+MGLG G +S+ L +I +SFS+C+ G++
Sbjct: 192 --FGCENVETGDLYSQRA-DGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHVGGGAM 248
Query: 174 FFGDQGPATQQSTSFLPIGEKYDAYFVGVE---SYCIGNSC-LTQSGFQ----ALVDSGA 225
G P S + Y + + +E + G L+ S F ++DSG
Sbjct: 249 VLGGIPPPPDMVFSR---SDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKHGTVLDSGT 305
Query: 226 SFTFLPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV----PDM 275
++ +LP E + V F + K +L+ N C++ + ++ ++ P++
Sbjct: 306 TYAYLPEEAF----VAFRDAIIKKSHNLKQIHGPDPNYNDICFSGAGRDVSQLSKAFPEV 361
Query: 276 RLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDREN 333
++FS Q S N++F + G +CL + ++G + + +DREN
Sbjct: 362 DMVFSNGQKLSLTPENYLFQHTKVHG--AYCLGIFRNGDSTTLLGGIIVRNTLVTYDREN 419
Query: 334 LKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNP----LPTTEQQSTSNGQAAAPPSTAKTA 389
K+ + + C E+ + H+ P A P P P +N PP+ A +
Sbjct: 420 EKIGFWKTNCSELWKRLHIPGAPAAAPIVPTPKSVSAPAPVVSYNNNTTVGMPPTVAPSG 479
>gi|297841447|ref|XP_002888605.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
gi|297334446|gb|EFH64864.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
Length = 410
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 75/325 (23%), Positives = 136/325 (41%), Gaps = 35/325 (10%)
Query: 44 EYDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
+Y P ++ V CS P+C + C + K+ C Y +Y+ + +S ++D
Sbjct: 96 QYKPKGNT----VPCSDPICLALHFPNNPQCPNPKEQCDYEVNYADQGSSMGALVIDQFP 151
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD---GVMGLGLGDVSVPSLLAKAG 155
K S++Q + GCG Q SY P GV+GLG G + + + L AG
Sbjct: 152 F-----KLLNGSAMQPRLAFGCGYDQ--SYPSAHPPPATAGVLGLGRGKIGLLTQLVSAG 204
Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS 215
L +N C G +FFGD ++ P+ + Y G
Sbjct: 205 LTRNVVGHCLSSKGGGYLFFGDTL-IPSLGVAWTPLLPPDNHYTTGPAELLFNGKPTGLK 263
Query: 216 GFQALVDSGASFTFLPTEIYAEVV--VKFDKLVSSKRISLQGNSWKYCYNASS--EEMLK 271
G + + D+G+S+T+ ++ Y +V + D VS +++ + + C+ + + +L+
Sbjct: 264 GLKLIFDTGSSYTYFNSKTYQTIVNLIGNDLKVSPLKVAKEDKTLPICWKGAKPFKSVLE 323
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVF------CLTVMSTD----GDYGIIGQNF 321
V + + N + RN P E + + CL +++ + +IG
Sbjct: 324 VKNFFKTITINFTNARRNTQLQIPP-ESYLIISKTGNACLGLLNGSEVGLQNSNVIGDIS 382
Query: 322 MMGHRIVFDRENLKLAWSHSKCEEV 346
M G I++D E +L W S C ++
Sbjct: 383 MQGLLIIYDNEKQQLGWVSSNCNKL 407
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 85/324 (26%), Positives = 144/324 (44%), Gaps = 32/324 (9%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
YDP+ SS+ + C+ PLC++ S + + DY ++GYL D L +
Sbjct: 139 YDPARSSTFSKLPCASPLCQALPSAFRACNATGCVYDYRYAVGFTAGYLAADTLAIGDGD 198
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
SS + V GC G +DGA+ G++GLG S SLL++ G+ FS C
Sbjct: 199 GDGDASSSFAGVAFGCSTANGGD-MDGAS--GIVGLGR---SALSLLSQIGV--GRFSYC 250
Query: 165 FDEN-DSGS--VFFGDQGPATQ---QSTSFL--PIGEKYDA--YFVGVESYCIGNSCLTQ 214
+ D+G+ + FG T QST+ L P+ + A Y+V + +G++ L
Sbjct: 251 LRSDADAGASPILFGALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPV 310
Query: 215 S----GFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY--CY 262
+ GF A +VDSG +FT+L Y + F + + G + + C+
Sbjct: 311 TSSTFGFTAAGAGGVIVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDLCF 370
Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
A + + VP + F+ + V + +EG V CL V+ T G +IG
Sbjct: 371 EAGAADT-PVPRLVFRFAGGAEYAVPRQSYFDAVDEGGRVACLLVLPTRG-VSVIGNVMQ 428
Query: 323 MGHRIVFDRENLKLAWSHSKCEEV 346
M +++D + +++ + C +
Sbjct: 429 MDLHVLYDLDGATFSFAPADCASL 452
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 83/335 (24%), Positives = 145/335 (43%), Gaps = 53/335 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
YDP SSS+ + + C+ P C+ C + C Y+ Y + ++SSG L D L
Sbjct: 130 YDPRSSSTHRRIPCASPRCRDVLRYPGCDARTGGCVYMVVYG-DGSASSGDLATDRLVF- 187
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
P + +V +GCG G L+ AA G++G+G G +S P+ LA A + F
Sbjct: 188 ------PDDTHVHNVTLGCGHDNVG-LLESAA--GLLGVGRGQLSFPTQLAPA--YGHVF 236
Query: 162 SICFD------ENDSGSVFFGDQGPATQQSTSFLPIG---EKYDAYFVGVESYCIGNSCL 212
S C +N S + FG ST+F P+ + Y+V + + +G +
Sbjct: 237 SYCLGDRLSRAQNGSSYLVFGRT--PEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERV 294
Query: 213 TQSGFQ--------------ALVDSGASFTFLPTEIYAEVVVKFDKLVSS----KRISLQ 254
T GF +VDSG + + + YA V FD ++ ++++ +
Sbjct: 295 T--GFSNASLALNPATGRGGIVVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATK 352
Query: 255 GNSWKYCY----NASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVM 308
+ + CY N + ++VP + L F+ + N++ + T FCL +
Sbjct: 353 FSVFDACYDLRGNGAPAAAVRVPSIVLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQ 412
Query: 309 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
+ D ++G G +VFD E ++ ++ + C
Sbjct: 413 AADDGLNVLGNVQQQGFGLVFDVERGRIGFTPNGC 447
>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 451
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 138/331 (41%), Gaps = 42/331 (12%)
Query: 51 SSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+ +K V C +C + R C S K C Y Y+ + SS G LV D L
Sbjct: 104 TKNKLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYA-DQGSSLGVLVTDSFAL--- 159
Query: 104 SKHAPQSSVQSSVIIGCGR-KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
+ A S V+ + GCG +Q GS + +A DGV+GLG G VS+ S L + G+ +N
Sbjct: 160 -RLANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVG 218
Query: 163 ICFDENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV 221
C G +FFGD P ++ + + + + Y G + G L + +
Sbjct: 219 HCLSTRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVF 278
Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 281
DSG+SFT+ + Y +V +S + +S C+ + V D++ F
Sbjct: 279 DSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKG-KKPFKSVLDVKKEF-- 335
Query: 282 NQSFVVRNHIFSF-----------PEN----EGFTVFCLTVMSTD----GDYGIIGQNFM 322
R + SF PEN + CL +++ D I+G M
Sbjct: 336 ------RTVVLSFSNGKKALMEIPPENYLIVTKYGNACLGILNGSEVGLKDLNIVGDITM 389
Query: 323 MGHRIVFDRENLKLAWSHSKCEEVIDKSHVH 353
+++D E ++ W + C+ + + + +H
Sbjct: 390 QDQMVIYDNERGQIGWIRAPCDRIPNDNTIH 420
>gi|145356007|ref|XP_001422234.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582474|gb|ABP00551.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 87/365 (23%), Positives = 153/365 (41%), Gaps = 51/365 (13%)
Query: 45 YDPSSSSSSKNVSCSHP----LCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
YD S + + C LC+ + +C+S C Y+ Y+ E +SS GY+V D +
Sbjct: 80 YDYDRSMEFERLDCGEASDATLCEETMKGTCQS-DGRCSYVVSYA-EGSSSRGYVVRDRV 137
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
L + ++ + + GC +T + + A DG+ G G G +V + LA AGLI+
Sbjct: 138 RLG-------EGTLSAMLAFGCEEAETNAIYEQKA-DGLFGFGRGTATVHAQLASAGLIE 189
Query: 159 NSFSIC---FDENDS----GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC 211
N FS C F N G FG PA + T + + V S+ +G+S
Sbjct: 190 NVFSFCVEGFGANGGVLTLGRFDFGADAPALAR-TPLVADPANPAFHNVRTSSWKLGDSL 248
Query: 212 LTQ-SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL-QGNSWKY---CYNASS 266
+ + + +DSG +FTF+P ++ + D + + + G +Y CY S+
Sbjct: 249 IEHLNSYTTTLDSGTTFTFVPRSVWVSFKTRLDTQATQAGLEIVAGPDPQYDDVCYGVSA 308
Query: 267 EEMLKV----------PDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDY 314
M P + + + S + N++F+ N FC+ + + +
Sbjct: 309 AAMNMTLSQSTVSEWFPPLTIAYEGGVSLTLGPENYLFAHETNS--AAFCVGIFANPNNQ 366
Query: 315 GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQST 374
++GQ M + FD N ++ + + C + +K + H SP P P+ +
Sbjct: 367 ILLGQITMRDTLMEFDVANSRVGMAPANCRRLREK-YTH-------DSPEPTPSNSSTPS 418
Query: 375 SNGQA 379
G A
Sbjct: 419 GGGDA 423
>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 423
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 89/344 (25%), Positives = 146/344 (42%), Gaps = 44/344 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
Y+P + K V C P+C C S C Y +Y+ + +S+ G LV+D L
Sbjct: 83 YNPKKA---KVVDCHLPVCAQIQQGGSYECNSDVKQCDYEVEYA-DGSSTMGVLVEDTLT 138
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
+ + + +Q+ IIGCG Q G+ A+ DGV+GL V++P+ LA+ G+I+
Sbjct: 139 V----RLTNGTLIQTKAIIGCGYDQQGTLAKSPASTDGVIGLSSSKVALPAQLAEKGIIK 194
Query: 159 NSFSICFDE--NDSGSVFFGDQGPATQQSTSFLPIGE--------KYDAYFVGVESYCIG 208
N C + N G +FFGD+ + T +G+ + + G +S +
Sbjct: 195 NVLGHCLADGSNGGGYLFFGDELVPSWGMTWTPMMGKPEMLGYQARLQSIRYGGDSLVLN 254
Query: 209 N-SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS- 266
N LT+S + DSG SFT+L + YA V+ K R+ + YC+ S
Sbjct: 255 NDEDLTRSTSSVMFDSGTSFTYLVPQAYASVLSAVTKQSGLLRVK-SDTTLPYCWRGPSP 313
Query: 267 -EEMLKV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTV------FCLTVMSTDGD-- 313
+ + V + L F F + + P +G+ + CL ++ G
Sbjct: 314 FQSITDVHQYFKTLTLDFGGRNWFATDSTLDLSP--QGYLIVSTQGNVCLGILDASGASL 371
Query: 314 --YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLV 355
IIG M G+ +V+D ++ W C K+ V
Sbjct: 372 EVTNIIGDVSMRGYLVVYDNVRDRIGWIRRNCHSRPTKTSSQFV 415
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 79/324 (24%), Positives = 138/324 (42%), Gaps = 23/324 (7%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSS----CK-SLKDPCPYIADYSTEDTSSSGYLVD 95
+L+ YD +S++S V C C CK L+ C Y Y + +S++GY V
Sbjct: 198 DLTLYDMKASTTSDAVGCDDNFCSLYDGPLPGCKPGLQ--CLYSVLYG-DGSSTTGYFVQ 254
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKA 154
D + S + + +V+ GCG KQ+G + A DG++G G + S+ S LA +
Sbjct: 255 DFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASS 314
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT- 213
G ++ FS C D D G +F G + + P+ + Y V ++ +G L
Sbjct: 315 GKVKKVFSHCLDNVDGGGIF--AIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDV 372
Query: 214 -----QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
+SG + ++DSG + + P E+Y ++ K R+ ++ C++ +
Sbjct: 373 PSDAFESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAFT-CFDYTG 431
Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT--VMSTDG-DYGIIGQNFMM 323
P + L F K+ S V H + F + + + DG D ++G +
Sbjct: 432 NVDDGFPTVTLHFDKSISLTVYPHEYLFQHEFEWCIGWQNSGAQTKDGKDLTLLGDLVLS 491
Query: 324 GHRIVFDRENLKLAWSHSKCEEVI 347
+V+D E + W C I
Sbjct: 492 NKLVVYDLEKQGIGWVEYNCSSSI 515
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 78/325 (24%), Positives = 145/325 (44%), Gaps = 37/325 (11%)
Query: 60 HPL-CKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 118
HP+ C +C + D C Y Y+ E +SSSG L +D++ + S+ PQ +V
Sbjct: 47 HPVKCNPDCTCDTENDQCTYERQYA-EMSSSSGILGEDLVSFGNMSELKPQRAV-----F 100
Query: 119 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--ENDSGSVFFG 176
GC +TG A DG+MGLG GD+S+ L + G+I +SFS+C+ E G++ G
Sbjct: 101 GCENAETGDLFSQHA-DGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLG 159
Query: 177 DQGPATQQSTSFL-PIGEKYDAYFVGVESYCIG------NSCLTQSGFQALVDSGASFTF 229
P + S P Y Y + + + N + ++DSG ++ +
Sbjct: 160 QISPPSDMVFSHSDPDRSPY--YNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAY 217
Query: 230 LPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV----PDMRLIF 279
LP + + F + ++S+ L+ N C++ + E+ ++ P + ++F
Sbjct: 218 LPEAAF----LPFIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKTFPSVDMVF 273
Query: 280 SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-YGIIGQNFMMGHRIVFDRENLKLAW 338
+ + + + F ++ +CL V D ++G + + +DRE+ K+ +
Sbjct: 274 DNGEKYSLSPENYLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGF 333
Query: 339 SHSKCE---EVIDKSHVHLVPPPAG 360
+ C E ++ S + P P G
Sbjct: 334 WKTNCSVLWERLNASSISPAPAPLG 358
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 82/330 (24%), Positives = 141/330 (42%), Gaps = 28/330 (8%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L +D + SS++ VSC P+C + S C S + C Y Y + + ++GY V D
Sbjct: 127 LDFFDTAGSSTAALVSCGDPICSYAVQTATSECSSQANQCSYTFQYG-DGSGTTGYYVSD 185
Query: 97 ILHLAS-FSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKA 154
++ + + ++ S++I GC Q+G A DG+ G G G +SV S L+
Sbjct: 186 TMYFDTVLLGQSVVANSSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSR 245
Query: 155 GLIQNSFSICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
G+ FS C EN G + G+ + S + P+ Y + ++S + L
Sbjct: 246 GVTPKVFSHCLKGGENGGGVLVLGE---ILEPSIVYSPLVPSQPHYNLNLQSIAVNGQLL 302
Query: 213 --------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS--SKRISLQGNSWKYCY 262
T + +VDSG + +L E Y V VS SK I +GN CY
Sbjct: 303 PIDSNVFATTNNQGTIVDSGTTLAYLVQEAYNPFVKAITAAVSQFSKPIISKGNQ---CY 359
Query: 263 NASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQN 320
S+ P + L F S V+ +++ + +G ++C+ + + I+G
Sbjct: 360 LVSNSVGDIFPQVSLNFMGGASMVLNPEHYLMHYGFLDGAAMWCIGFQKVEQGFTILGDL 419
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVIDKS 350
+ V+D N ++ W+ C ++ S
Sbjct: 420 VLKDKIFVYDLANQRIGWADYDCSLSVNVS 449
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 80/334 (23%), Positives = 143/334 (42%), Gaps = 31/334 (9%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+L Y+P SSS+S ++C P C + CK C Y Y + ++++GY V+
Sbjct: 116 DLQLYNPKSSSTSTLITCDQPFCSATYDAPIPGCKP-DLLCQYKVIYG-DGSATAGYFVN 173
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKA 154
D + L + S S++ GCG KQ+G + A DG++G G + S+ S LA
Sbjct: 174 DYIQLQRAVGNHKTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAAT 233
Query: 155 GLIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL- 212
G ++ F+ C D G +F G+ +T +P Y+ GV+ +G++ L
Sbjct: 234 GKVKKIFAHCLDSISGGGIFAIGEVVEPKLXNTPVVPNQAHYNVVLNGVK---VGDTALD 290
Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNA 264
T A++DSG + +LP IY ++ K L + + L+ ++ C+
Sbjct: 291 LPLGLFETSYKRGAIIDSGTTLAYLPESIYLPLMEKI--LGAQPDLKLRTVDDQFTCFVF 348
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIG 318
P + F ++ + H + F + V+C+ S DG + ++G
Sbjct: 349 DKNVDDGFPTVTFKFEESLILTIYPHEYLFQIRD--DVWCVGWQNSGAQSKDGNEVTLLG 406
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHV 352
+ + ++ EN + W+ C I V
Sbjct: 407 DLVLQNKLVYYNLENQTIGWTEYNCSSGIKLKDV 440
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 78/325 (24%), Positives = 145/325 (44%), Gaps = 37/325 (11%)
Query: 60 HPL-CKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVII 118
HP+ C +C + D C Y Y+ E +SSSG L +D++ + S+ PQ +V
Sbjct: 47 HPVKCNPDCTCDTENDQCTYERQYA-EMSSSSGILGEDLVSFGNMSELKPQRAV-----F 100
Query: 119 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD--ENDSGSVFFG 176
GC +TG A DG+MGLG GD+S+ L + G+I +SFS+C+ E G++ G
Sbjct: 101 GCENAETGDLFSQHA-DGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLG 159
Query: 177 DQGPATQQSTSFL-PIGEKYDAYFVGVESYCIG------NSCLTQSGFQALVDSGASFTF 229
P + S P Y Y + + + N + ++DSG ++ +
Sbjct: 160 QISPPSDMVFSHSDPDRSPY--YNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAY 217
Query: 230 LPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNASSEEMLKV----PDMRLIF 279
LP + + F + ++S+ L+ N C++ + E+ ++ P + ++F
Sbjct: 218 LPEAAF----LPFIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKTFPSVDMVF 273
Query: 280 SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-YGIIGQNFMMGHRIVFDRENLKLAW 338
+ + + + F ++ +CL V D ++G + + +DRE+ K+ +
Sbjct: 274 DNGEKYSLSPENYLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGF 333
Query: 339 SHSKCE---EVIDKSHVHLVPPPAG 360
+ C E ++ S + P P G
Sbjct: 334 WKTNCSVLWERLNASSISPAPAPLG 358
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 143/331 (43%), Gaps = 36/331 (10%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLV 94
L++YDP+ S ++ V C C + S+ C S PC + Y + +S++G+ V
Sbjct: 129 LTQYDPAGSGTT--VGCEQEFCVANSAASGVPPACPSAASPCQFRITYG-DGSSTTGFYV 185
Query: 95 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAK 153
D + S + + S+ GCG + G + A DG++G G D S+ S LA
Sbjct: 186 TDFVQYNQVSGNGQTTPSNVSITFGCGAQLGGDLGSSSQALDGILGFGQSDASMLSQLAA 245
Query: 154 AGLIQNSFSICFDENDSGSVF-FGD-QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC 211
A ++ F+ C D G +F G+ P ++T +P Y+ G+ +G +
Sbjct: 246 ARKVRKIFAHCLDTVRGGGIFAIGNVVQPPIVKTTPLVPNATHYNVNLQGIS---VGGAT 302
Query: 212 LT--QSGFQA------LVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCY 262
L S F + ++DSG + +LP E+Y ++ FDK ++++ C+
Sbjct: 303 LQLPTSTFDSGDSKGTIIDSGTTLAYLPREVYRTLLTAVFDK---HPDLAVRNYEDFICF 359
Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGI 316
S + P + F + + V H + F G ++C+ V + DG D +
Sbjct: 360 QFSGSLDEEFPVITFSFEGDLTLNVYPHDYLF--QNGNDLYCMGFLDGGVQTKDGKDMVL 417
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
+G + +V+D E + W+ C I
Sbjct: 418 LGDLVLSNKLVVYDLEKQVIGWTDYNCSSSI 448
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 85/355 (23%), Positives = 147/355 (41%), Gaps = 33/355 (9%)
Query: 5 ICFGSHANAYNALLCLPV-TTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLC 63
+ FG+ A Y + + + CL G Q + +DP+ S++ V C HP C
Sbjct: 139 VGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPI--FDPTKSATYSVVPCGHPQC 196
Query: 64 KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRK 123
+ K C Y +Y + +SS+G L + L L S + GCG+
Sbjct: 197 AAADGSKCSNGTCLYKVEYG-DGSSSAGVLSHETLSLTS-------TRALPGFAFGCGQT 248
Query: 124 QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPA 181
G + D DG++GLG G +S+ S A + +FS C D G + G PA
Sbjct: 249 NLGDFGD---VDGLIGLGRGQLSLSSQAAAS--FGGTFSYCLPSDNTTHGYLTIGPTTPA 303
Query: 182 TQQSTSFLPIGEKYDA---YFVGVESYCIGNSCL-------TQSGFQALVDSGASFTFLP 231
+ + + +K D YFV + S IG L T G +DSG T+LP
Sbjct: 304 SNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFTDDG--TFLDSGTILTYLP 361
Query: 232 TEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNH- 290
E Y + +F ++ + + + + CY+ + + + +P + FS F +
Sbjct: 362 PEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFTGQSAIFIPAVSFKFSDGSVFDLSFFG 421
Query: 291 IFSFPENEGFTVFCLTVMSTDG--DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
I FP++ + CL ++ + I+G +++D K+ ++ + C
Sbjct: 422 ILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASASC 476
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 80/335 (23%), Positives = 150/335 (44%), Gaps = 38/335 (11%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++ P +S + + V C+ + +C + C Y Y+ E ++SSG L +D++ +
Sbjct: 134 KFRPEASETYQPVKCT-----WQCNCDDDRKQCTYERRYA-EMSTSSGVLGEDVVSFGNQ 187
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S+ +PQ + I GC +TG + A DG+MGLG GD+S+ L + +I ++FS+
Sbjct: 188 SELSPQRA-----IFGCENDETGDIYNQRA-DGIMGLGRGDLSIMDQLVEKKVISDAFSL 241
Query: 164 CF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG------NSCLTQ 214
C+ V G PA T P+ Y Y + ++ + N +
Sbjct: 242 CYGGMGVGGGAMVLGGISPPADMVFTHSDPVRSPY--YNIDLKEIHVAGKRLHLNPKVFD 299
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWK-YCYNASSEEMLKV 272
++DSG ++ +LP + K S KRIS + C++ + + ++
Sbjct: 300 GKHGTVLDSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPHYNDICFSGAEINVSQL 359
Query: 273 ----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMM 323
P + ++F + + F ++ +CL V S D GI+ +N +
Sbjct: 360 SKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTL- 418
Query: 324 GHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPP 358
+++DRE+ K+ + + C E+ ++ HV PPP
Sbjct: 419 ---VMYDREHSKIGFWKTNCSELWERLHVSNAPPP 450
>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
Length = 418
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 82/317 (25%), Positives = 134/317 (42%), Gaps = 33/317 (10%)
Query: 51 SSSKNVSCSHPLCKSRSSCKS------LKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+ +K V C++ +C + S S + C Y Y T+ SS G LV D L +
Sbjct: 103 TKNKLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKY-TDKASSLGVLVTDSFSLPLRN 161
Query: 105 KHAPQSSVQSSVIIGCGR-KQTGSYLDGAAP---DGVMGLGLGDVSVPSLLAKAGLIQNS 160
K S+V+ S+ GCG +Q G +GAAP DG++GLG G VS+ S L + G+ +N
Sbjct: 162 K----SNVRPSLSFGCGYDQQVGK--NGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNV 215
Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGNSCLTQSGFQ 218
C + G +FFGD T + T ++P+ + Y G + L+ +
Sbjct: 216 LGHCLSTSGGGFLFFGDDMVPTSRVT-WVPMVRSTSGNYYSPGSATLYFDRRSLSTKPME 274
Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
+ DSG+++T+ + Y + +S + S C+ + V D++
Sbjct: 275 VVFDSGSTYTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPLCWKG-QKAFKSVSDVKKD 333
Query: 279 FSKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTDG-----DYGIIGQNFMMGHRIV 328
F Q +N + P V CL ++ DG + IIG M ++
Sbjct: 334 FKSLQFIFGKNAVMEIPPENYLIVTKNGNVCLGIL--DGSAAKLSFSIIGDITMQDQMVI 391
Query: 329 FDRENLKLAWSHSKCEE 345
+D E +L W C
Sbjct: 392 YDNEKAQLGWIRGSCSR 408
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 79/319 (24%), Positives = 135/319 (42%), Gaps = 18/319 (5%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ +DP SS ++ +SCS C S S C + + C Y Y + + +SGY V D
Sbjct: 134 LNFFDPGSSPTASLISCSDQRCSLGLQSSDSVCAAQNNQCGYTFQYG-DGSGTSGYYVSD 192
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAG 155
+LH + + + + ++ GC QTG A DG+ G G D+SV S LA G
Sbjct: 193 LLHFDTILGGSVMKNSSAPIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQG 252
Query: 156 LIQNSFSICFDENDSGS--VFFGDQGPATQQSTSFLPIGEKYD----AYFVGVESYCIGN 209
+ FS C +DSG + G+ T +P Y+ + +V ++ I
Sbjct: 253 ITPRVFSHCLKGDDSGGGILVLGEIVEPNIVYTPLVPSQPHYNLNLQSIYVNGQTLAIDP 312
Query: 210 SCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
S S Q ++DSG + +L Y + V S +S + CY SS
Sbjct: 313 SVFATSSNQGTIIDSGTTLAYLTEAAYDPFISAITSTV-SPSVSPYLSKGNQCYLTSSSI 371
Query: 269 MLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGH 325
P + L F+ S ++ ++++ G ++C+ G + I+G +
Sbjct: 372 NDVFPQVSLNFAGGTSMILIPQDYLIQQSSINGAALWCVGFQKIQGQEITILGDLVLKDK 431
Query: 326 RIVFDRENLKLAWSHSKCE 344
V+D ++ W++ C+
Sbjct: 432 IFVYDIAGQRIGWANYDCK 450
>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 421
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 81/316 (25%), Positives = 134/316 (42%), Gaps = 26/316 (8%)
Query: 51 SSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+ +K V C LC S + C S K C Y Y+ + SS G L+ D SF
Sbjct: 104 TKNKIVPCVDQLCSSLHGGLSGKHKCDSPKQQCDYEIKYA-DQGSSLGVLLTD-----SF 157
Query: 104 SKHAPQSS-VQSSVIIGCGR-KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
+ SS V+ S+ GCG +Q GS + A DGV+GLG G +S+ S L + G+ +N
Sbjct: 158 AVRLANSSIVRPSLAFGCGYDQQVGSSTEVAPTDGVLGLGSGSISLLSQLKQHGITKNVV 217
Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV-GVESYCIGNSCLTQSGFQAL 220
C G +FFGD ++T + + Y+ G S G L + +
Sbjct: 218 GHCLSIRGGGFLFFGDNLVPYSRATWVPMVRSAFKNYYSPGTASLYFGGRSLGVRPMEVV 277
Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS--EEMLKVPD---- 274
+DSG+SFT+ + Y +V +S + S C+ + +L V
Sbjct: 278 LDSGSSFTYFGAQPYQALVTALKSDLSKTLKEVFDPSLPLCWKGKKPFKSVLDVKKEFKS 337
Query: 275 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMGHRIVFD 330
+ L FS + ++ ++ F CL +++ D I+G M +++D
Sbjct: 338 LVLSFSNGKKALMEIPPENYLIVTKFGNACLGILNGSEIGLKDLNIVGDITMQDQMVIYD 397
Query: 331 RENLKLAWSHSKCEEV 346
E ++ W + C+ +
Sbjct: 398 NERGQIGWIRAPCDRI 413
>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
Length = 573
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 86/317 (27%), Positives = 142/317 (44%), Gaps = 36/317 (11%)
Query: 54 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 113
K++ C L +++ C++ K C Y +Y+ + +SS G L D +H+ + + +
Sbjct: 257 KDLLCQE-LQGNQNYCETCKQ-CDYEIEYA-DRSSSMGVLARDDMHIITTNG----GREK 309
Query: 114 SSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDS 170
+ GC Q G L A DG++GL +S+PS LA G+I N F C D N
Sbjct: 310 LDFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRDPNGG 369
Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSG-----FQALVDSG 224
G +F GD TS PI D F + G+ L+ G Q + DSG
Sbjct: 370 GYMFLGDDYVPRWGMTS-TPIRSAPDNLFHTEAQKVYYGDQQLSMRGASGNSVQVIFDSG 428
Query: 225 ASFTFLPTEIYAEVVV-------KFDKLVSSKRISL-QGNSWKYCYNASSEEMLKVPDMR 276
+S+T+LP EIY ++ F + S + + L + Y +++ K +
Sbjct: 429 SSYTYLPDEIYKNLIAAIKYAYPNFVQDSSDRTLPLCLATDFPVRYLEDVKQLFK--PLN 486
Query: 277 LIFSKNQSFVVRNHIFSFPENEGFTV----FCLTVMS-TDGDYG---IIGQNFMMGHRIV 328
L F K + FV+ P+N CL ++ D D+G I+G N + G +V
Sbjct: 487 LHFGK-RWFVMPRTFTILPDNYLIISDKGNVCLGFLNGKDIDHGSTVIVGDNALRGKLVV 545
Query: 329 FDRENLKLAWSHSKCEE 345
+D + ++ W++S C +
Sbjct: 546 YDNQQRQIGWTNSDCTK 562
>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
Length = 574
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 86/317 (27%), Positives = 142/317 (44%), Gaps = 36/317 (11%)
Query: 54 KNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 113
K++ C L +++ C++ K C Y +Y+ + +SS G L D +H+ + + +
Sbjct: 258 KDLLCQE-LQGNQNYCETCKQ-CDYEIEYA-DRSSSMGVLARDDMHIITTNG----GREK 310
Query: 114 SSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDS 170
+ GC Q G L A DG++GL +S+PS LA G+I N F C D N
Sbjct: 311 LDFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRDPNGG 370
Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSG-----FQALVDSG 224
G +F GD TS PI D F + G+ L+ G Q + DSG
Sbjct: 371 GYMFLGDDYVPRWGMTS-TPIRSAPDNLFHTEAQKVYYGDQQLSMRGASGNSVQVIFDSG 429
Query: 225 ASFTFLPTEIYAEVVV-------KFDKLVSSKRISL-QGNSWKYCYNASSEEMLKVPDMR 276
+S+T+LP EIY ++ F + S + + L + Y +++ K +
Sbjct: 430 SSYTYLPDEIYKNLIAAIKYAYPNFVQDSSDRTLPLCLATDFPVRYLEDVKQLFK--PLN 487
Query: 277 LIFSKNQSFVVRNHIFSFPENEGFTV----FCLTVMS-TDGDYG---IIGQNFMMGHRIV 328
L F K + FV+ P+N CL ++ D D+G I+G N + G +V
Sbjct: 488 LHFGK-RWFVMPRTFTILPDNYLIISDKGNVCLGFLNGKDIDHGSTVIVGDNALRGKLVV 546
Query: 329 FDRENLKLAWSHSKCEE 345
+D + ++ W++S C +
Sbjct: 547 YDNQQRQIGWTNSDCTK 563
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 82/323 (25%), Positives = 145/323 (44%), Gaps = 27/323 (8%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ +DP SSS+S +SC C+S +SC + C Y Y + + +SGY V D
Sbjct: 121 LNYFDPGSSSTSSLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYG-DGSGTSGYYVSD 179
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAG 155
++H AS + ++ +SV+ GC QTG A DG+ G G +SV S L+ G
Sbjct: 180 LMHFASIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQG 239
Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
+ FS C ++SG G + + + P+ Y + ++S + +
Sbjct: 240 IAPRVFSHCLKGDNSGGGVL-VLGEIVEPNIVYSPLVPSQPHYNLNLQSISVNGQIVRIA 298
Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNAS 265
T + +VDSG + +L E Y V+ ++ S + + +GN CY +
Sbjct: 299 PSVFATSNNRGTIVDSGTTLAYLAEEAYNPFVIAIAAVIPQSVRSVLSRGNQ---CYLIT 355
Query: 266 SEEMLKV-PDMRLIFSKNQSFVVRNHIFSFPEN---EGFTVFCLTVMSTDGDYGIIGQNF 321
+ + + P + L F+ S V+R + +N EG +V+C+ G I +
Sbjct: 356 TSSNVDIFPQVSLNFAGGASLVLRPQDYLMQQNFIGEG-SVWCIGFQKISGQSITILGDL 414
Query: 322 MMGHRI-VFDRENLKLAWSHSKC 343
++ +I V+D ++ W++ C
Sbjct: 415 VLKDKIFVYDLAGQRIGWANYDC 437
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 80/330 (24%), Positives = 141/330 (42%), Gaps = 28/330 (8%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L +D + SS++ VSC+ P+C + S C S + C Y Y + + ++GY V D
Sbjct: 127 LDFFDTAGSSTAALVSCADPICSYAVQTATSGCSSQANQCSYTFQYG-DGSGTTGYYVSD 185
Query: 97 ILHLAS-FSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKA 154
++ + + ++ S+++ GC Q+G A DG+ G G G +SV S L+
Sbjct: 186 TMYFDTVLLGQSMVANSSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSR 245
Query: 155 GLIQNSFSICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
G+ FS C EN G + G+ + S + P+ Y + ++S + L
Sbjct: 246 GVTPKVFSHCLKGGENGGGVLVLGE---ILEPSIVYSPLVPSLPHYNLNLQSIAVNGQLL 302
Query: 213 --------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS--SKRISLQGNSWKYCY 262
T + +VDSG + +L E Y V VS SK I +GN CY
Sbjct: 303 PIDSNVFATTNNQGTIVDSGTTLAYLVQEAYNPFVDAITAAVSQFSKPIISKGNQ---CY 359
Query: 263 NASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQN 320
S+ P + L F S V+ +++ + + ++C+ + + I+G
Sbjct: 360 LVSNSVGDIFPQVSLNFMGGASMVLNPEHYLMHYGFLDSAAMWCIGFQKVERGFTILGDL 419
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVIDKS 350
+ V+D N ++ W+ C ++ S
Sbjct: 420 VLKDKIFVYDLANQRIGWADYNCSLAVNVS 449
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/320 (25%), Positives = 142/320 (44%), Gaps = 40/320 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+ P +SSS N SC+ LC + R +C S+++ C Y Y + + +
Sbjct: 50 FIPLASSSYSNASCTDSLCDALPRPTC-SMRNTCTYSYSYGDGSNTRGDF---------A 99
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
F S + + GCG Q G++ A DG++GLG G +S+PS L + + FS
Sbjct: 100 FETVTLNGSTLARIGFGCGHNQEGTF---AGADGLIGLGQGPLSLPSQLNSS--FTHIFS 154
Query: 163 ICF-DENDSGS---VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQ- 214
C D++ +G+ + FG+ A SF P+ + D Y+VGVES +GN +
Sbjct: 155 YCLVDQSTTGTFSPITFGNA--AENSRASFTPLLQNEDNPSYYYVGVESISVGNRRVPTP 212
Query: 215 -SGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
S F+ ++DSG + T+ + ++ + + +S CY+ S
Sbjct: 213 PSAFRIDANGVGGVILDSGTTITYWRLAAFIPILAELRRQISYPEADPTPYGLNLCYDIS 272
Query: 266 --SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 323
S L +P M + + + ++++ +N G TV T MST + IIG
Sbjct: 273 SVSASSLTLPSMTVHLTNVDFEIPVSNLWVLVDNFGETV--CTAMSTSDQFSIIGNVQQQ 330
Query: 324 GHRIVFDRENLKLAWSHSKC 343
+ IV D N ++ + + C
Sbjct: 331 NNLIVTDVANSRVGFLATDC 350
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 80/303 (26%), Positives = 142/303 (46%), Gaps = 34/303 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCK--SRSSCKS-LKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
++PS SS+ KN+ CS P+CK ++ C S K C Y Y + + S G + D L L
Sbjct: 132 FNPSKSSTYKNIRCSSPICKRGEKTRCSSNRKRKCEYEITY-LDRSGSQGDISKDTLTLN 190
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
S + +P S + ++IGCG K + + +G A G++G G G+ S+ S L + I F
Sbjct: 191 S-NDGSPISFPK--IVIGCGHKNSLT-TEGLA-SGIIGFGRGNFSIVSQLGSS--IGGKF 243
Query: 162 SICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGN----- 209
S C N S ++FGD + P+ + + YF +E++ +G+
Sbjct: 244 SYCLASLFSKANISSKLYFGDMAVVSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKL 303
Query: 210 ---SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
S + + A++DSG++ T LP ++Y+++ +V KR+ CY +
Sbjct: 304 KDSSLIPDNEGNAVIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYKTT- 362
Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG----QNFM 322
LK ++ +I + + V+ + F+ V C S+ + + G QNF+
Sbjct: 363 ---LKKYEVPIITAHFRGADVKLNAFNTFIQMNHEVMCFAFNSSAFPWVVYGNIAQQNFL 419
Query: 323 MGH 325
+G+
Sbjct: 420 VGY 422
>gi|356522749|ref|XP_003530008.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1336
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 76/292 (26%), Positives = 127/292 (43%), Gaps = 31/292 (10%)
Query: 76 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAP 134
C Y Y+ + +SS G LV D LHL + + S + +V+ GCG Q G L+ A
Sbjct: 271 CDYEIQYA-DHSSSLGVLVRDELHLVTTNG----SKTKLNVVFGCGYDQEGLILNTLAKT 325
Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS--VFFGDQGPATQQSTSFLPIG 192
DG+MGL VS+P LA GLI+N C + +G +F GD +++P+
Sbjct: 326 DGIMGLSRAKVSLPYQLASKGLIKNVVGHCLSNDGAGGGYMFLGDDF-VPYWGMNWVPMA 384
Query: 193 EKY--DAYFVGVESYCIGNSCLTQSG----FQALVDSGASFTFLPTEIYAEVVVKFDKLV 246
D Y + GN L G + DSG+S+T+ P E Y ++V +++
Sbjct: 385 YTLTTDLYQTEILGINYGNRQLKFDGQSKVGKVFFDSGSSYTYFPKEAYLDLVASLNEVS 444
Query: 247 SSKRISLQGNS-----WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT 301
+ ++ W+ + S + +K L + + + +F P EG+
Sbjct: 445 GLGLVQDDSDTTLPICWQANFQIRSIKDVKDYFKTLTLRFGSKWWILSTLFQIPP-EGYL 503
Query: 302 VF------CLTVMS----TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
+ CL ++ DG I+G + G+ +V+D K+ W + C
Sbjct: 504 IISNKGHVCLGILDGSKVNDGSSIILGDISLRGYSVVYDNVKQKIGWKRADC 555
>gi|357469591|ref|XP_003605080.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506135|gb|AES87277.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 425
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 76/321 (23%), Positives = 134/321 (41%), Gaps = 44/321 (13%)
Query: 56 VSCSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 108
V CS P+C + S C PC Y Y+ + S+ G LV D +H+ S P
Sbjct: 116 VKCSDPICVATQSTHVLGQICSKQSPPCVYNVQYA-DHASTLGVLVRDYMHIGS-----P 169
Query: 109 QSSVQSSVI-IGCGRKQ--TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 165
SS + ++ GCG +Q +G + P G++GLG G S+ S L G I N C
Sbjct: 170 SSSTKDPLVAFGCGYEQKFSGPTPPHSKPAGILGLGNGKTSILSQLTSIGFIHNVLGHCL 229
Query: 166 DENDSGSVFFGDQ---------GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
G +F GD+ P Q S EK+ Y G G
Sbjct: 230 SAEGGGYLFLGDKFVPSSGIVWTPIIQSSL------EKH--YNTGPVDLFFNGKPTPAKG 281
Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-----WKYC--YNASSEEM 269
Q + DSG+S+T+ + +Y V + + K +S + WK + + +E
Sbjct: 282 LQIIFDSGSSYTYFSSPVYTIVANMVNNDLKGKPLSRVKDPSLPICWKGVKPFKSLNEVN 341
Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMGH 325
+ L F+K+++ + ++ + CL +++ + G+ ++G +
Sbjct: 342 NYFKPLTLSFTKSKNLQFQLPPVAYLIITKYGNVCLGILNGNEAGLGNRNVVGDISLQDK 401
Query: 326 RIVFDRENLKLAWSHSKCEEV 346
+V+D E ++ W+ + C+++
Sbjct: 402 VVVYDNEKQQIGWASANCKQI 422
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/312 (25%), Positives = 134/312 (42%), Gaps = 19/312 (6%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDIL 98
LS +D ++SS+SK V C C S S + C Y Y+ E TS G + D+L
Sbjct: 117 RLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIVYADESTSD-GKFIRDML 175
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLI 157
L + + V+ GCG Q+G +G +A DGVMG G + SV S LA G
Sbjct: 176 TLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDA 235
Query: 158 QNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVE----SYCIGNSCL 212
+ FS C D G +F G ++T +P Y+ +G++ S + S +
Sbjct: 236 KRVFSHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIV 295
Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLK 271
G +VDSG + + P +Y ++ + +++ + + L + C++ S+
Sbjct: 296 RNGG--TIVDSGTTLAYFPKVLYDSLI---ETILARQPVKLHIVEETFQCFSFSTNVDEA 350
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV--MSTDGDYGII--GQNFMMGHRI 327
P + F + V H + F E F ++TD +I G + +
Sbjct: 351 FPPVSFEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLV 410
Query: 328 VFDRENLKLAWS 339
V+D +N + W+
Sbjct: 411 VYDLDNEVIGWA 422
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/329 (24%), Positives = 148/329 (44%), Gaps = 44/329 (13%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+ P SSS+ K + C+ P C +C C Y Y+ E +SSSG L +D+L +
Sbjct: 129 RFQPESSSTYKPMQCN-PSC----NCDDEGKQCTYERRYA-EMSSSSGLLAEDVLSFGNE 182
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S+ PQ + I GC +TG A DG+MGLG G +SV L ++ NSFS+
Sbjct: 183 SELTPQRA-----IFGCETVETGELFSQRA-DGIMGLGRGPLSVVDQLVIKEVVGNSFSL 236
Query: 164 CFDEND--SGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGV---ESYCIG-----NSCLT 213
C+ D G++ G+ P + + Y + + + E + G N +
Sbjct: 237 CYGGMDVVGGAMVLGNIPPPPDMVFAH---SDPYRSAYYNIELKELHVAGKRLKLNPRVF 293
Query: 214 QSGFQALVDSGASFTFLPTEIYA---EVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEM 269
++DSG ++ +LP E + + ++K K + K+I S+ C++ + ++
Sbjct: 294 DGKHGTVLDSGTTYAYLPEEAFVAFKDAIIKEIKFL--KQIHGPDPSYNDICFSGAGRDV 351
Query: 270 LKV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQN 320
++ P++ ++F Q + + F + +CL + D GI+ +N
Sbjct: 352 SQLSKIFPEVNMVFGNGQKLSLSPENYLFRHTKVSGAYCLGIFQNGKDPTTLLGGIVVRN 411
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVIDK 349
+ + +DR+N K+ + + C E+ +
Sbjct: 412 TL----VTYDRDNDKIGFWKTNCSELWKR 436
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 498
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 142/342 (41%), Gaps = 38/342 (11%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ YD S++ K VSC C S C + CPY+ Y + +S++GY V D
Sbjct: 131 LTPYDLEESTTGKLVSCDEQFCLEVNGGPLSGCTT-NMSCPYLQIYG-DGSSTAGYFVKD 188
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAKA 154
+ S ++ S+ GCG +Q+G A DG++G G + S+ S LA
Sbjct: 189 YVQYNRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLAST 248
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ 214
++ F+ C D + G +F G Q + P+ Y V + +G+ L
Sbjct: 249 RKVKKMFAHCLDGTNGGGIF--AMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNI 306
Query: 215 SG--FQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNAS 265
S F+A ++DSG + +LP IY +V K L + +Q +Y C+ S
Sbjct: 307 SADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVAKI--LSQQHNLEVQTIHGEYKCFQYS 364
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFP-ENEGFTVFCL-----TVMSTDGDYGIIGQ 319
P + F + V H + F EN ++C+ + S D +
Sbjct: 365 ERVDDGFPPVIFHFENSLLLKVYPHEYLFQYEN----LWCIGWQNSGMQSRDRKNVTLFG 420
Query: 320 NFMMGHRIV-FDRENLKLAWSHSKCEEVI-----DKSHVHLV 355
+ ++ +++V +D EN + W+ C I VHLV
Sbjct: 421 DLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQDEQTGTVHLV 462
>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 421
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/324 (25%), Positives = 134/324 (41%), Gaps = 42/324 (12%)
Query: 51 SSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+ +K V C +C + R C S K C Y Y+ + SS G LV D L
Sbjct: 104 TKNKLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYA-DQGSSLGVLVTDSFAL--- 159
Query: 104 SKHAPQSSVQSSVIIGCGR-KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
+ A S V+ + GCG +Q GS + +A DGV+GLG G VS+ S L + G+ +N
Sbjct: 160 -RLANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVG 218
Query: 163 ICFDENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV 221
C G +FFGD P ++ + + + + Y G + G L + +
Sbjct: 219 HCLSTRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVF 278
Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 281
DSG+SFT+ + Y +V +S + +S C+ + V D++ F
Sbjct: 279 DSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKG-KKPFKSVLDVKKEF-- 335
Query: 282 NQSFVVRNHIFSF-----------PEN----EGFTVFCLTVMSTD----GDYGIIGQNFM 322
R + SF PEN + CL +++ D I+G M
Sbjct: 336 ------RTVVLSFSNGKKALMEIPPENYLIVTKYGNACLGILNGSEVGLKDLNIVGDITM 389
Query: 323 MGHRIVFDRENLKLAWSHSKCEEV 346
+++D E ++ W + C+ +
Sbjct: 390 QDQMVIYDNERGQIGWIRAPCDRI 413
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/314 (26%), Positives = 139/314 (44%), Gaps = 33/314 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++P S+S +V C+ C + ++ C Y Y + T S G L + + + S
Sbjct: 122 FNPLKSTSFSHVPCNTQTCHAVDDGHCGVQGVCDYSYTYG-DRTYSKGDLGFEKITIGS- 179
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
SSV+S +IGCG +G + GV+GLG G +S+ S +++ I FS
Sbjct: 180 ------SSVKS--VIGCGHASSGGF---GFASGVIGLGGGQLSLVSQMSQTSGISRRFSY 228
Query: 164 CFD---ENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNS---CLTQS 215
C + +G + FG + P+ K Y++ +E+ IGN +
Sbjct: 229 CLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNERHMAFAKQ 288
Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY----NASSEEMLK 271
G ++DSG + +FLP E+Y VV K+V +KR+ GN W C+ N ++ +
Sbjct: 289 G-NVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIP 347
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM--STDGDYGIIGQNFMMGHRIVF 329
+ + N + + N N V CLT+ S ++GIIG + I +
Sbjct: 348 IITAQFSGGANVNLLPVNTFQKVANN----VNCLTLTPASPTDEFGIIGNLALANFLIGY 403
Query: 330 DRENLKLAWSHSKC 343
D E +L++ + C
Sbjct: 404 DLEAKRLSFKPTVC 417
>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
Length = 418
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/316 (25%), Positives = 131/316 (41%), Gaps = 31/316 (9%)
Query: 51 SSSKNVSCSHPLCKSRSSCKS------LKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+ +K V C++ +C + S S + C Y Y T+ SS G LV D L +
Sbjct: 103 TKNKLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKY-TDKASSLGVLVMDSFSLPLRN 161
Query: 105 KHAPQSSVQSSVIIGCGR-KQTGSYLDGAAP---DGVMGLGLGDVSVPSLLAKAGLIQNS 160
K S+V+ S+ GCG +Q G +GAAP DG++GLG G VS+ S L + G+ +N
Sbjct: 162 K----SNVRPSLSFGCGYDQQVGK--NGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNV 215
Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV-GVESYCIGNSCLTQSGFQA 219
C + G +FFGD T + T + Y+ G + L+ +
Sbjct: 216 LGHCLSTSGGGFLFFGDDMVPTSRVTWVSMVRSTSGNYYSPGSATLYFDRRSLSTKPMEV 275
Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 279
+ DSG+++T+ + Y + +S + S C+ + V D++ F
Sbjct: 276 VFDSGSTYTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPLCWKG-QKAFKSVSDVKKDF 334
Query: 280 SKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTDG-----DYGIIGQNFMMGHRIVF 329
Q +N + P + CL ++ DG + IIG M +++
Sbjct: 335 KSLQFIFGKNAVMDIPPENYLIITKNGNVCLGIL--DGSAAKLSFSIIGDITMQDQMVIY 392
Query: 330 DRENLKLAWSHSKCEE 345
D E +L W C
Sbjct: 393 DNEKAQLGWIRGSCSR 408
>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
Length = 321
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 61/213 (28%), Positives = 95/213 (44%), Gaps = 18/213 (8%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ YDP SS+ VSC C + C + PC Y Y + +S++GY V D
Sbjct: 77 LTLYDPKDSSTGSKVSCDQGFCAATYGGLLPGCTT-SLPCEYSVTYG-DGSSTTGYFVSD 134
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
+L S S+V GCG +Q G A DG++G G + S+ S L+ AG
Sbjct: 135 LLQFDQVSGDGQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAG 194
Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
++ F+ C D + G +F G Q P+ Y V ++S +G + L
Sbjct: 195 KVKKIFAHCLDTINGGGIF--AIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLP 252
Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVV 240
T ++DSG + T+LP +Y E+++
Sbjct: 253 SHMFDTGEKKGTIIDSGTTLTYLPEIVYKEIML 285
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 85/323 (26%), Positives = 136/323 (42%), Gaps = 45/323 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
YD +S+SS V CS P C S S C ++ C Y Y + + + GYLV+D+LH
Sbjct: 83 YDVKASASSSKVPCSDPSCTLITQISESGCND-QNQCGYSFQYG-DGSGTLGYLVEDVLH 140
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
+ ++VI GCG KQ+G A DG++G G D+S S LAK G
Sbjct: 141 Y--------MVNATATVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTP 192
Query: 159 NSFSICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--- 213
N F+ C D E G + G+ Q T +P Y+ V ++S + N+ LT
Sbjct: 193 NVFAHCLDGGERGGGILVLGNVIEPDIQYTPLVPYMYHYN---VVLQSISVNNANLTIDP 249
Query: 214 ----QSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
Q + DSG + +LP E Y ++ +SL + C S
Sbjct: 250 KLFSNDVMQGTIFDSGTTLAYLPDEAYQAF---------TQAVSLVVAPFLLCDTRLSRF 300
Query: 269 MLKV-PDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMS-----TDGDYGIIGQNF 321
+ K+ P++ L F + + ++ ++C+ S ++ Y I G
Sbjct: 301 IYKLFPNVVLYFEGASMTLTPAEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLV 360
Query: 322 MMGHRIVFDRENLKLAWSHSKCE 344
+ +V+D E ++ W C+
Sbjct: 361 LKNKLVVYDLERGRIGWRPFDCK 383
>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
Length = 429
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/315 (25%), Positives = 132/315 (41%), Gaps = 25/315 (7%)
Query: 51 SSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+ +K V C LC S + C S + C Y+ Y+ + SS+G LV+D L
Sbjct: 112 TKNKLVPCVDQLCASLHNGLNRKHKCDSPYEQCDYVIKYA-DQGSSTGVLVNDSFAL--- 167
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
+ A S V+ S+ GCG Q S + + DGV+GLG G VS+ S + G+ +N
Sbjct: 168 -RLANGSVVRPSLAFGCGYDQQVSSGEMSPTDGVLGLGTGSVSLLSQFKQHGVTKNVVGH 226
Query: 164 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV-GVESYCIGNSCLTQSGFQALVD 222
C G +FFGD Q+ T + Y+ G S G+ L + + D
Sbjct: 227 CLSLRGGGFLFFGDDLVPYQRVTWTPMVRSPLRNYYSPGSASLYFGDQSLRVKLTEVVFD 286
Query: 223 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF-SK 281
SG+SFT+ + Y +V +S + S C+ + V D++ F S
Sbjct: 287 SGSSFTYFAAQPYQALVTALKGDLSRTLKEVSDPSLPLCWKG-KKPFKSVLDVKKEFKSL 345
Query: 282 NQSFVVRNHIFSFPENEGFTVF------CLTVMSTD----GDYGIIGQNFMMGHRIVFDR 331
+F N F + + + CL +++ D I+G M +++D
Sbjct: 346 VLNFGNGNKAFMEIPPQNYLIVTKYGNACLGILNGSEVGLKDLSILGDITMQDQMVIYDN 405
Query: 332 ENLKLAWSHSKCEEV 346
E ++ W + C+ +
Sbjct: 406 EKGQIGWIRAPCDRI 420
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 85/323 (26%), Positives = 136/323 (42%), Gaps = 45/323 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
YD +S+SS V CS P C S S C ++ C Y Y + + + GYLV+D+LH
Sbjct: 83 YDVKASASSSKVPCSDPSCTLITQISESGCND-QNQCGYSFQYG-DGSGTLGYLVEDVLH 140
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
+ ++VI GCG KQ+G A DG++G G D+S S LAK G
Sbjct: 141 Y--------MVNATATVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTP 192
Query: 159 NSFSICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--- 213
N F+ C D E G + G+ Q T +P Y+ V ++S + N+ LT
Sbjct: 193 NVFAHCLDGGERGGGILVLGNVIEPDIQYTPLVPYMSHYN---VVLQSISVNNANLTIDP 249
Query: 214 ----QSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
Q + DSG + +LP E Y ++ +SL + C S
Sbjct: 250 KLFSNDVMQGTIFDSGTTLAYLPDEAYQAF---------TQAVSLVVAPFLLCDTRLSRF 300
Query: 269 MLKV-PDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMS-----TDGDYGIIGQNF 321
+ K+ P++ L F + + ++ ++C+ S ++ Y I G
Sbjct: 301 IYKLFPNVVLYFEGASMTLTPAEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLV 360
Query: 322 MMGHRIVFDRENLKLAWSHSKCE 344
+ +V+D E ++ W C+
Sbjct: 361 LKNKLVVYDLERGRIGWRPFDCK 383
>gi|226500708|ref|NP_001149229.1| aspartic proteinase nepenthesin-2 [Zea mays]
gi|195625632|gb|ACG34646.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 150/342 (43%), Gaps = 46/342 (13%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRSSC----------KSLKDPCPYIADYS-TEDTSSSG 91
+E + S S + + C P C+ R+SC + C Y Y + S++G
Sbjct: 139 TEKECSRSKTRSMLPCCSPKCEQRASCGCGRSELKAEAEKETKCTYAIIYGGNANDSTAG 198
Query: 92 YLVDDILHLASF-SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 150
+ +D L + + SK P S V IGC T + D + GV GLG S+P
Sbjct: 199 VMYEDKLTIVAVASKAVPSSQSFKEVAIGCSTSATLKFKDPSI-KGVFGLGRSATSLPRQ 257
Query: 151 LAKAGLIQNSFSIC---FDENDSGSVFFGDQGP----------ATQQSTSFLPIGEKYDA 197
L + FS C + E D S P A +T+ P +
Sbjct: 258 LNFS-----KFSYCLSSYQEPDLPSYLLLTAAPDMATGAVGGGAAVATTALQPNSDYKTL 312
Query: 198 YFVGVESYCIGNSCL----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 253
YFV +++ IG + T+SG VD+GASFT L ++A++V + D+++ ++
Sbjct: 313 YFVHLQNISIGGTRFPAVSTKSGGNMFVDTGASFTRLEGTVFAKLVTELDRIMKERKYVK 372
Query: 254 Q---GNSWKYCY---NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV 307
+ N+ + CY + +++E K+PDM L F+ + + V+ + + + CL +
Sbjct: 373 EQPGRNNGQICYSPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKTT---SKLCLAI 429
Query: 308 MSTD--GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
++ G ++G M ++ D N KL++ + C +VI
Sbjct: 430 YKSNIKGGISVLGNFQMQNTHMLLDTGNEKLSFVRADCSKVI 471
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 83/339 (24%), Positives = 149/339 (43%), Gaps = 39/339 (11%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGY 92
++N Y+P+ SSS +N+SC P C+ SS CK+ CPY DY+ ++ +
Sbjct: 206 EQNGPHYNPNESSSYRNISCYDPRCQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDF 265
Query: 93 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
++ ++ + V+ GCG G + ++GLG G +S PS L
Sbjct: 266 ALETFTVNLTWPNGKEKFKHVVDVMFGCGHWNKGFFHGAGG---LLGLGRGPLSFPSQLQ 322
Query: 153 KAGLIQNSFSICFDE---NDSGS---VFFGDQGPATQQSTSFLPI--GEKY---DAYFVG 201
+ +SFS C + N S S +F D+ + +F + GE+ Y++
Sbjct: 323 --SIYGHSFSYCLTDLFSNTSVSSKLIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQ 380
Query: 202 VESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
++S +G L ++ ++DSG++ TF P Y + F+K + ++I
Sbjct: 381 IKSIVVGGEVLDIPEKTWHWSSEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQI 440
Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKN--QSFVVRNHIFSFPENEGFTVFCLTVMS 309
+ CYN S +++PD + F+ +F N+ + + +E V CL ++
Sbjct: 441 AADDFIMSPCYNVSGAMQVELPDYGIHFADGAVWNFPAENYFYQYEPDE---VICLAILK 497
Query: 310 T--DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
T IIG I++D + +L +S +C EV
Sbjct: 498 TPNHSHLTIIGNLLQQNFHILYDVKRSRLGYSPRRCAEV 536
>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 564
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 80/302 (26%), Positives = 136/302 (45%), Gaps = 34/302 (11%)
Query: 67 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 126
++CK C Y +Y+ + +SS G L D +H+ + + + + GC Q G
Sbjct: 263 ATCKQ----CDYEIEYA-DRSSSMGVLAKDDMHMIATNG----GREKLDFVFGCAYDQQG 313
Query: 127 SYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQ 183
L A DG++GL +S+PS LA G+I N F C + N G +F GD +
Sbjct: 314 QLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHCITKEPNGGGYMFLGDDY-VPR 372
Query: 184 QSTSFLPI-GEKYDAYFVGVESYCIGNSCLTQSG-----FQALVDSGASFTFLPTEIYAE 237
++ PI G + Y + G+ L G Q + DSG+S+T+LP EIY +
Sbjct: 373 WGMTWAPIRGGPDNLYHTEAQKVNYGDQQLRMHGQAGSSIQVIFDSGSSYTYLPDEIYKK 432
Query: 238 VV--VKFD--KLVSSKRISLQGNSWKYCYNASSEEMLK--VPDMRLIFSKNQSFVVRNHI 291
+V +K+D V + WK ++ E +K + L F N+ FV+
Sbjct: 433 LVTAIKYDYPSFVQDTSDTTLPLCWKADFDVRYLEDVKQFFKPLNLHFG-NRWFVIPRTF 491
Query: 292 FSFPENEGFTV----FCLTVMS-TDGDYG---IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
P++ CL +++ + D+ I+G + G +V+D E ++ W+ S+C
Sbjct: 492 TILPDDYLIISDKGNVCLGLLNGAEIDHASTLIVGDVSLRGKLVVYDNERRQIGWADSEC 551
Query: 344 EE 345
+
Sbjct: 552 TK 553
>gi|18402471|ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15 [Arabidopsis thaliana]
gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein [Arabidopsis thaliana]
gi|14532748|gb|AAK64075.1| unknown protein [Arabidopsis thaliana]
gi|332194267|gb|AEE32388.1| aspartyl protease [Arabidopsis thaliana]
Length = 583
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 79/298 (26%), Positives = 131/298 (43%), Gaps = 40/298 (13%)
Query: 76 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-AP 134
C Y +Y+ + + S G L D HL K S +S ++ GCG Q G L+
Sbjct: 281 CDYEIEYA-DHSYSMGVLTKDKFHL----KLHNGSLAESDIVFGCGYDQQGLLLNTLLKT 335
Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFG-DQGPATQQSTSFLPI 191
DG++GL +S+PS LA G+I N C D N G +F G D P+ +++P+
Sbjct: 336 DGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMGSDLVPS--HGMTWVPM 393
Query: 192 --GEKYDAYFVGVESYCIGNSCLTQSGF-----QALVDSGASFTFLPTEIYAEVVVKFDK 244
+ DAY + V G L+ G + L D+G+S+T+ P + Y+++V +
Sbjct: 394 LHDSRLDAYQMQVTKMSYGQGMLSLDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTSLQE 453
Query: 245 LVSSKRISLQ--GNSWKYCYNASSE-EMLKVPDMRLIFSK------NQSFVVRNHIFSFP 295
VS ++ + C+ A + + D++ F ++ ++ + P
Sbjct: 454 -VSGLELTRDDSDETLPICWRAKTNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQP 512
Query: 296 E------NEGFTVFCLTVMS----TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
E N+G CL ++ DG I+G M GH IV+D ++ W S C
Sbjct: 513 EDYLIISNKGNV--CLGILDGSSVHDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDC 568
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 84/340 (24%), Positives = 143/340 (42%), Gaps = 34/340 (10%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ Y+ S S K V C C S C + CPY+ Y + +S++GY V D
Sbjct: 130 LTLYNIKDSVSGKLVPCDEEFCYEVNGGPLSGCTA-NMSCPYLEIYG-DGSSTAGYFVKD 187
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY--LDGAAPDGVMGLGLGDVSVPSLLAKA 154
++ S +S SVI GCG +Q+G A DG++G G + S+ S LA
Sbjct: 188 VVQYDRVSGDLQTTSSNGSVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAAT 247
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT- 213
++ F+ C D + G +F G Q + P+ Y V + + +G L
Sbjct: 248 RKVKKIFAHCLDGINGGGIF--AIGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGEDFLHL 305
Query: 214 -----QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
++G + A++DSG + +LP +Y +V K ++ + + + C+ S
Sbjct: 306 PTEEFEAGDRKGAIIDSGTTLAYLPEIVYEPLVSKIISQQPDLKVHIVRDEYT-CFQYSG 364
Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTD-GDYGIIGQN 320
P++ F + V H + FP EG ++C+ + S D + ++G
Sbjct: 365 SVDDGFPNVTFHFENSVFLKVHPHEYLFP-FEG--LWCIGWQNSGMQSRDRRNMTLLGDL 421
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVID-----KSHVHLV 355
+ +++D EN + W+ C I VHLV
Sbjct: 422 VLSNKLVLYDLENQAIGWTEYNCSSSIKVQDERTGTVHLV 461
>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
Length = 491
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 79/329 (24%), Positives = 138/329 (41%), Gaps = 34/329 (10%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVD 95
L++YDP+ S ++ V C C + S +C S PC + Y + ++++G+ V
Sbjct: 128 LTQYDPAGSGTT--VGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYG-DGSTTTGFYVT 184
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAK 153
D + S + ++ +S+ GCG Q G L + A DG++G G D S+ S LA
Sbjct: 185 DFVQYNQVSGNGQTTTSNASITFGCG-AQLGGDLGSSNQALDGILGFGQSDSSMLSQLAA 243
Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
A ++ F+ C D G +F G Q P+ Y V ++ +G + L
Sbjct: 244 ARRVRKIFAHCLDTVRGGGIF--AIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQ 301
Query: 214 --QSGFQA------LVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNA 264
S F + ++DSG + +LP E+Y ++ FDK + + L C+
Sbjct: 302 LPTSTFDSGDSKGTIIDSGTTLAYLPREVYRTLLAAVFDKY---QDLPLHNYQDFVCFQF 358
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIG 318
S P + F + + V + F ++C+ V + DG D ++G
Sbjct: 359 SGSIDDGFPVITFSFEGDLTLNVYPDDYLFQNRN--DLYCMGFLDGGVQTKDGKDMLLLG 416
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
+ +V+D E + W+ C I
Sbjct: 417 DLVLSNKLVVYDLEKEVIGWTDYNCSSSI 445
>gi|168027607|ref|XP_001766321.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682535|gb|EDQ68953.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 381
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 91/329 (27%), Positives = 140/329 (42%), Gaps = 43/329 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
YDP + + V C PLC +C C Y +Y+ + +S+ G L++D +
Sbjct: 66 YDPKKA---RLVDCRVPLCALVQQGGSYACGGPVRQCDYDVEYA-DGSSTMGVLMEDTIT 121
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
L + +S +++ IIGCG Q G+ A+ DGVMGL +S+PS LAK G+++
Sbjct: 122 L--LLTNGTRS--KTTAIIGCGYDQQGTLAQTPASTDGVMGLSSAKISLPSQLAKKGIVR 177
Query: 159 NSFSICF--DENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS 215
N C N G +FFGD PA ++ PI K +G +S +
Sbjct: 178 NVIGHCLAGGSNGGGYLFFGDSLVPAL--GMTWTPIMGKSITGNIGGKSGDADDKTGDIG 235
Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK---RISLQGNSWKYCYNASS--EEML 270
G + DSG SFT+L E Y V+ + V RI N+ +C+ S E +
Sbjct: 236 G--VMFDSGTSFTYLVPEAYNAVLSAMEMQVEKSGLVRIKTD-NTLPFCWRGPSPFESVA 292
Query: 271 KV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTV------FCLTVMSTDGD----YGI 316
V + L F K + + P EG+ + CL ++ G I
Sbjct: 293 DVQRYFKTVTLDFGKRNWYSASRVLELSP--EGYLIVSTQGNVCLGILDASGASLEVTNI 350
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
IG M G+ +V+D ++ W C
Sbjct: 351 IGDVSMRGYLVVYDNARNQIGWVRRNCHN 379
>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
Length = 491
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 79/329 (24%), Positives = 138/329 (41%), Gaps = 34/329 (10%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVD 95
L++YDP+ S ++ V C C + S +C S PC + Y + ++++G+ V
Sbjct: 128 LTQYDPAGSGTT--VGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYG-DGSTTTGFYVT 184
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAK 153
D + S + ++ +S+ GCG Q G L + A DG++G G D S+ S LA
Sbjct: 185 DFVQYNQVSGNGQTTTSNASITFGCG-AQLGGDLGSSNQALDGILGFGQSDSSMLSQLAA 243
Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
A ++ F+ C D G +F G Q P+ Y V ++ +G + L
Sbjct: 244 ARRVRKIFAHCLDTVRGGGIF--AIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQ 301
Query: 214 --QSGFQA------LVDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNA 264
S F + ++DSG + +LP E+Y ++ FDK + + L C+
Sbjct: 302 LPTSTFDSGDSKGTIIDSGTTLAYLPREVYRTLLAAVFDKY---QDLPLHNYQDFVCFQF 358
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTDG-DYGIIG 318
S P + F + + V + F ++C+ V + DG D ++G
Sbjct: 359 SGSIDDGFPVITFSFKGDLTLNVYPDDYLFQNRN--DLYCMGFLDGGVQTKDGKDMLLLG 416
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
+ +V+D E + W+ C I
Sbjct: 417 DLVLSNKLVVYDLEKEVIGWTDYNCSSSI 445
>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
Length = 421
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 78/315 (24%), Positives = 134/315 (42%), Gaps = 24/315 (7%)
Query: 51 SSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+ +K V C +C + R C S K C Y Y+ + SS G LV D L
Sbjct: 104 TKNKLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYA-DQGSSLGVLVTDSFAL--- 159
Query: 104 SKHAPQSSVQSSVIIGCGR-KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
+ A S V+ + GCG +Q GS + +A DGV+GLG G VS+ S L + G+ +N
Sbjct: 160 -RLANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVG 218
Query: 163 ICFDENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV 221
C G +FFGD P ++ + + + + Y G + G L + +
Sbjct: 219 HCLSTRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVF 278
Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS--EEMLKVPD----M 275
DSG+SFT+ + Y +V +S + +S C+ + +L V +
Sbjct: 279 DSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFKTV 338
Query: 276 RLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMGHRIVFDR 331
L FS + ++ ++ + CL +++ D I+G M +++D
Sbjct: 339 VLSFSNGKKALMEIPPENYLIVTKYGNACLGILNGSEVGLKDLNIVGDITMQDQMVIYDN 398
Query: 332 ENLKLAWSHSKCEEV 346
E ++ W + C+ +
Sbjct: 399 ERGQIGWIRAPCDRI 413
>gi|413923876|gb|AFW63808.1| hypothetical protein ZEAMMB73_793799 [Zea mays]
Length = 415
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 149/342 (43%), Gaps = 46/342 (13%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD----------PCPYIADYS-TEDTSSSG 91
+E + S S + + C P C+ R+SC + C Y Y + S++G
Sbjct: 83 TEKECSRSKTRSMLPCCSPKCEQRASCGCRRSELKAEAEKETKCTYAIKYGGNANDSTAG 142
Query: 92 YLVDDILHLASF-SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 150
L +D L + + SK P S V IGC T + D + GV GLG S+P
Sbjct: 143 VLYEDKLTIVAVASKAVPGSQSFEEVAIGCSTSATLKFKDPSI-KGVFGLGRSATSLPRQ 201
Query: 151 LAKAGLIQNSFSIC---FDENDSGSVFFGDQGP----------ATQQSTSFLPIGEKYDA 197
L + FS C + + D S P A +T+ P +
Sbjct: 202 LNFS-----KFSYCLSSYQKPDLPSYLLLTAAPDMATGAVGGAAAVATTALQPNSDYKTR 256
Query: 198 YFVGVESYCIGNSCL----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 253
YFV ++ IG + L T+SG VD+G SFT L ++A++V + D+++ ++
Sbjct: 257 YFVDLQGISIGGTRLPAVSTKSGGNMFVDTGTSFTRLEGTVFAKLVTELDRIMKERKYVK 316
Query: 254 Q---GNSWKYCY---NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV 307
+ N+ + CY + +++E K+PDM L F+ + + V+ + + + CL +
Sbjct: 317 EQPGRNNGQICYSPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKTT---SKLCLAI 373
Query: 308 MSTD--GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
++ G ++G M ++ D N KL++ + C +VI
Sbjct: 374 DKSNIKGGISVLGNFQMQNTHMLLDTGNEKLSFVRADCSKVI 415
>gi|15219354|ref|NP_175079.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12320825|gb|AAG50556.1|AC074228_11 nucellin, putative [Arabidopsis thaliana]
gi|332193902|gb|AEE32023.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 405
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 77/312 (24%), Positives = 133/312 (42%), Gaps = 29/312 (9%)
Query: 56 VSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 110
+ CS+P+C + + C + ++ C Y Y+ + SS G LV D L K S
Sbjct: 99 IPCSNPICTALHWPNKPHCPNPQEQCDYEVKYA-DQGSSMGALVTDQFPL----KLVNGS 153
Query: 111 SVQSSVIIGCGRKQTGSYLDGAAPD---GVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE 167
+Q V GCG Q SY P GV+GLG G + + + L AGL +N C
Sbjct: 154 FMQPPVAFGCGYDQ--SYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVVGHCLSS 211
Query: 168 NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASF 227
G +FFGD ++ P+ + + Y G G + + D+G+S+
Sbjct: 212 KGGGFLFFGDN-LVPSIGVAWTPLLSQDNHYTTGPADLLFNGKPTGLKGLKLIFDTGSSY 270
Query: 228 TFLPTEIYAEVV--VKFDKLVSSKRISLQGNSWKYCYNASS--EEMLKVPDMRLIFSKNQ 283
T+ ++ Y ++ + D VS +++ + + C+ + + +L+V + + N
Sbjct: 271 TYFNSKAYQTIINLIGNDLKVSPLKVAKEDKTLPICWKGAKPFKSVLEVKNFFKTITINF 330
Query: 284 SFVVRN-HIFSFPE------NEGFTVFCLTVMSTDG--DYGIIGQNFMMGHRIVFDRENL 334
+ RN ++ PE G L S G + +IG M G +++D E
Sbjct: 331 TNGRRNTQLYLAPELYLIVSKTGNVCLGLLNGSEVGLQNSNVIGDISMQGLMMIYDNEKQ 390
Query: 335 KLAWSHSKCEEV 346
+L W S C ++
Sbjct: 391 QLGWVSSDCNKL 402
>gi|145324889|ref|NP_001077691.1| aspartyl protease [Arabidopsis thaliana]
gi|332194268|gb|AEE32389.1| aspartyl protease [Arabidopsis thaliana]
Length = 410
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 79/298 (26%), Positives = 131/298 (43%), Gaps = 40/298 (13%)
Query: 76 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-AP 134
C Y +Y+ + + S G L D HL K S +S ++ GCG Q G L+
Sbjct: 108 CDYEIEYA-DHSYSMGVLTKDKFHL----KLHNGSLAESDIVFGCGYDQQGLLLNTLLKT 162
Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFG-DQGPATQQSTSFLPI 191
DG++GL +S+PS LA G+I N C D N G +F G D P+ +++P+
Sbjct: 163 DGILGLSRAKISLPSQLASRGIISNVVGHCLASDLNGEGYIFMGSDLVPS--HGMTWVPM 220
Query: 192 --GEKYDAYFVGVESYCIGNSCLTQSGF-----QALVDSGASFTFLPTEIYAEVVVKFDK 244
+ DAY + V G L+ G + L D+G+S+T+ P + Y+++V +
Sbjct: 221 LHDSRLDAYQMQVTKMSYGQGMLSLDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTSLQE 280
Query: 245 LVSSKRISLQ--GNSWKYCYNASSE-EMLKVPDMRLIFS------KNQSFVVRNHIFSFP 295
VS ++ + C+ A + + D++ F ++ ++ + P
Sbjct: 281 -VSGLELTRDDSDETLPICWRAKTNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQP 339
Query: 296 E------NEGFTVFCLTVMS----TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
E N+G CL ++ DG I+G M GH IV+D ++ W S C
Sbjct: 340 EDYLIISNKGNV--CLGILDGSSVHDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDC 395
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 85/317 (26%), Positives = 139/317 (43%), Gaps = 40/317 (12%)
Query: 46 DPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
DP+ S+S KN+SCS CK SC S C Y Y + + S G+ + L L
Sbjct: 177 DPTKSTSYKNISCSSAFCKLLDTEGGESCSS--PTCLYQVQYG-DGSYSIGFFATETLTL 233
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+S S+V + + GCG++ +G + GAA G++GLG +S+PS A+ +
Sbjct: 234 SS-------SNVFKNFLFGCGQQNSGLF-RGAA--GLLGLGRTKLSLPSQTAQK--YKKL 281
Query: 161 FSICFDENDS--GSVFFGDQGPATQQSTSFLPIGEKYDA----------YFVGVESYCIG 208
FS C + S G + FG Q ++ F P+ E + + VG I
Sbjct: 282 FSYCLPASSSSKGYLSFGGQ---VSKTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSID 338
Query: 209 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
S + SG ++DSG T LP+ Y+ + F KL++ + + + CY+ S E
Sbjct: 339 ASIFSTSG--TVIDSGTVITRLPSTAYSALSSAFQKLMTDYPSTDGYSIFDTCYDFSKNE 396
Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY--GIIGQNFMMGHR 326
+K+P + + F + +P N G CL D I G ++
Sbjct: 397 TIKIPKVGVSFKGGVEMDIDVSGILYPVN-GLKKVCLAFAGNGDDVKAAIFGNTQQKTYQ 455
Query: 327 IVFDRENLKLAWSHSKC 343
+V+D ++ ++ S C
Sbjct: 456 VVYDDAKGRVGFAPSGC 472
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 78/315 (24%), Positives = 129/315 (40%), Gaps = 31/315 (9%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
+DPS+S + K++SC+ C S C++ + C Y A Y + + S GYL D+
Sbjct: 56 FDPSASKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYG-DSSYSMGYLSQDL 114
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
L LA S + GCG+ G + A G++GLG +S+ ++
Sbjct: 115 LTLAP-------SQTLPGFVYGCGQDSEGLFGRAA---GILGLGRNKLSMLGQVSSK--F 162
Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGE---KYDAYFVGVESYCIGNSCLTQ 214
+FS C G + + F P+ YF+ + + +G L
Sbjct: 163 GYAFSYCLPTRGGGGFLSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGV 222
Query: 215 SGFQ----ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEM 269
+ Q ++DSG T LP +Y F K++SSK G S C+ + ++M
Sbjct: 223 AAAQYRVPTIIDSGTVITRLPMSVYTPFQQAFVKIMSSKYARAPGFSILDTCFKGNLKDM 282
Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 329
VP++RLIF +R +EG T CL +G IIG + ++
Sbjct: 283 QSVPEVRLIFQGGADLNLRPVNVLLQVDEGLT--CLAFAGNNG-VAIIGNHQQQTFKVAH 339
Query: 330 DRENLKLAWSHSKCE 344
D ++ ++ C
Sbjct: 340 DISTARIGFATGGCN 354
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 73/279 (26%), Positives = 125/279 (44%), Gaps = 17/279 (6%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ +DP SSS+S ++CS C S ++C S + C Y Y + + +SGY V D
Sbjct: 69 LNFFDPGSSSTSSMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYG-DGSGTSGYYVSD 127
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 155
++HL + + + ++ + V+ GC +QTG A DG+ G G ++SV S L+ G
Sbjct: 128 MMHLNTIFEGSVTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQG 187
Query: 156 LIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYD----AYFVGVESYCIGN 209
+ FS C D + G + G+ TS +P Y+ + V ++ I +
Sbjct: 188 IAPRVFSHCLKGDSSGGGILVLGEIVEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDS 247
Query: 210 SCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
S S + +VDSG + +L E Y V + + + + CY +S
Sbjct: 248 SVFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAITASI-PQSVHTAVSRGNQCYLITSSV 306
Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENE--GFTVFCL 305
P + L F+ S ++R + +N G V+C+
Sbjct: 307 TEVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCI 345
>gi|223950045|gb|ACN29106.1| unknown [Zea mays]
Length = 392
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 149/342 (43%), Gaps = 46/342 (13%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD----------PCPYIADYS-TEDTSSSG 91
+E + S S + + C P C+ R+SC + C Y Y + S++G
Sbjct: 60 TEKECSRSKTRSMLPCCSPKCEQRASCGCRRSELKAEAEKETKCTYAIKYGGNANDSTAG 119
Query: 92 YLVDDILHLASF-SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 150
L +D L + + SK P S V IGC T + D + GV GLG S+P
Sbjct: 120 VLYEDKLTIVAVASKAVPGSQSFEEVAIGCSTSATLKFKDPSI-KGVFGLGRSATSLPRQ 178
Query: 151 LAKAGLIQNSFSIC---FDENDSGSVFFGDQGP----------ATQQSTSFLPIGEKYDA 197
L + FS C + + D S P A +T+ P +
Sbjct: 179 LNFS-----KFSYCLSSYQKPDLPSYLLLTAAPDMATGAVGGAAAVATTALQPNSDYKTR 233
Query: 198 YFVGVESYCIGNSCL----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 253
YFV ++ IG + L T+SG VD+G SFT L ++A++V + D+++ ++
Sbjct: 234 YFVDLQGISIGGTRLPAVSTKSGGNMFVDTGTSFTRLEGTVFAKLVTELDRIMKERKYVK 293
Query: 254 Q---GNSWKYCY---NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV 307
+ N+ + CY + +++E K+PDM L F+ + + V+ + + + CL +
Sbjct: 294 EQPGRNNGQICYSPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKTT---SKLCLAI 350
Query: 308 MSTD--GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
++ G ++G M ++ D N KL++ + C +VI
Sbjct: 351 DKSNIKGGISVLGNFQMQNTHMLLDTGNEKLSFVRADCSKVI 392
>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 430
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 86/316 (27%), Positives = 132/316 (41%), Gaps = 39/316 (12%)
Query: 51 SSSKNVSCSHPLCKSRSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKHAP 108
+ +K V C+ LC S + K P C Y Y T+ SS G L+ D L+ +
Sbjct: 119 TKNKIVPCAASLCTSLTPNKKCAVPQQCDYQIKY-TDKASSLGVLIADNFTLSLRN---- 173
Query: 109 QSSVQSSVIIGCGR-KQTGSYLDGA---APDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
S+V++++ GCG +Q G +GA A DG++GLG G VS+ S L + G+ +N C
Sbjct: 174 SSTVRANLTFGCGYDQQVGK--NGAVQAATDGLLGLGKGAVSLLSQLKQQGVTKNVLGHC 231
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGNSCLTQSGFQALVD 222
F N G +FFGD T + T ++P+ + Y G + L + + D
Sbjct: 232 FSTNGGGFLFFGDDIVPTSRVT-WVPMARTTSGNYYSPGSGTLYFDRRSLGMKPMEVVFD 290
Query: 223 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS------SEEMLKVPDMR 276
SG+++ + E Y V +S + S C+ SE +
Sbjct: 291 SGSTYAYFAAEPYQATVSALKAGLSKSLKEVSDVSLPLCWKGQKVFKSVSEVKNDFKSLF 350
Query: 277 LIFSKNQSFVVRNHIFSFPEN----EGFTVFCLTVMSTDG-----DYGIIGQNFMMGHRI 327
L F KN + PEN + CL ++ DG + IIG M I
Sbjct: 351 LSFGKNSVMEIP------PENYLIVTKYGNVCLGIL--DGTTAKLKFNIIGDITMQDQMI 402
Query: 328 VFDRENLKLAWSHSKC 343
++D E +L W C
Sbjct: 403 IYDNEKGQLGWIRGSC 418
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 83/328 (25%), Positives = 137/328 (41%), Gaps = 42/328 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP SSS +SC LC S R SC C Y Y + + + G L + + L S
Sbjct: 82 FDPEGSSSYTTMSCGDTLCDSLPRKSCSP---DCDYSYGYG-DGSGTRGTLSSETVTLTS 137
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
+ ++ GCG GS+ D + G++GLG G++S S L L + FS
Sbjct: 138 TQG---EKLAAKNIAFGCGHLNRGSFNDAS---GLVGLGRGNLSFVSQLGD--LFGHKFS 189
Query: 163 ICF-----DENDSGSVFFGDQGPATQQST----SFLPIGEK---YDAYFVGVESYCIGNS 210
C + + +FFGD+ + +F P+ Y+V ++ I
Sbjct: 190 YCLVPWRDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGR 249
Query: 211 CL---------TQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
L G ++ DSG + T LP Y V+ +S +I
Sbjct: 250 ALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGSSAGLDL 309
Query: 261 CYNASSEEM---LKVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGI 316
CY+ S + +K+P M F + V N+ + N+ T+ CL ++S++ D GI
Sbjct: 310 CYDVSGSKASYKMKIPAMVFHFEGADYQLPVENYFIA--ANDAGTIVCLAMVSSNMDIGI 367
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCE 344
G R+++D + K+ W+ S+C+
Sbjct: 368 YGNMMQQNFRVMYDIGSSKIGWAPSQCD 395
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 86/314 (27%), Positives = 142/314 (45%), Gaps = 33/314 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP S+S +V C+ CK+ S C + + C Y Y + T + G L + + + S
Sbjct: 134 FDPLKSTSFSHVPCNSQNCKAIDDSHCGA-QGVCDYSYTYG-DQTYTKGDLGFEKITIGS 191
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
SSV+S +IGCG + G + + V+GLG G +S+ S +++ I FS
Sbjct: 192 -------SSVKS--VIGCGHESGGGFGFASG---VIGLGGGQLSLVSQMSQTSGISRRFS 239
Query: 163 ICFD---ENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSCLTQSGF 217
C + +G + FG + P+ K Y+V +E+ IGN S
Sbjct: 240 YCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNERHMASAK 299
Query: 218 QA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY----NASSEEMLK 271
Q ++DSG + +FLP E+Y VV K+V +KR+ GN W C+ N ++ +
Sbjct: 300 QGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIP 359
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM--STDGDYGIIGQNFMMGHRIVF 329
+ + N + + N N V CLT+ S ++GIIG + I +
Sbjct: 360 IITAQFSGGANVNLLPVNTFQKVANN----VNCLTLTPASPTDEFGIIGNLALANFLIGY 415
Query: 330 DRENLKLAWSHSKC 343
D E +L++ + C
Sbjct: 416 DLEAKRLSFKPTVC 429
>gi|449439393|ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 570
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 84/305 (27%), Positives = 136/305 (44%), Gaps = 36/305 (11%)
Query: 76 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAP 134
C Y Y+ + +SS G LV D L + + S + + I GC Q G L+ +
Sbjct: 275 CNYEVQYADQ-SSSLGVLVKDEFTL----RFSNGSLTKLNAIFGCAYDQQGLLLNTLSKT 329
Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFL--- 189
DG++GL VS+PS LA G+I N C D G +F GD Q +++
Sbjct: 330 DGILGLSRAKVSLPSQLASRGIINNVVGHCLTGDPAGGGYLFLGDDF-VPQWGMAWVAML 388
Query: 190 --PIGEKYDAYFVGVESYCIGNSCLT--QSGFQALVDSGASFTFLPTEIYAEVVVKFDKL 245
P + Y V ++ I S T S Q + DSG+S+T+ E Y ++V ++
Sbjct: 389 DSPSIDFYQTKVVRIDYGSIPLSLDTWGSSREQVVFDSGSSYTYFTKEAYYQLVANLEE- 447
Query: 246 VSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK------NQSFVVRNHIFSFPEN-- 297
VS+ + LQ +S C+ + + + V D++ F ++ ++V + PEN
Sbjct: 448 VSAFGLILQDSSDTICWK-TEQSIRSVKDVKHFFKPLTLQFGSRFWLVSTKLVILPENYL 506
Query: 298 ----EGFTVFCLTVMST----DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDK 349
EG CL ++ DG I+G N + G +V+D N ++ W+ S C
Sbjct: 507 LINKEGNV--CLGILDGSQVHDGSTIILGDNALRGKLVVYDNVNQRIGWTSSDCHNPRKI 564
Query: 350 SHVHL 354
H+ L
Sbjct: 565 KHLPL 569
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 83/322 (25%), Positives = 133/322 (41%), Gaps = 37/322 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKS-----------LKDPCPYIADYSTEDTSSSGYL 93
YDPS SSS K V C+ C+ + +K C Y+ Y + + + G L
Sbjct: 178 YDPSVSSSYKTVFCNSSTCQDLVAATGNSGPCGGFNGVVKTTCEYVVSYG-DGSYTRGDL 236
Query: 94 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
+ + L + +++ GCGR G + G+MGLG VS+ S K
Sbjct: 237 ASESIVLG--------DTKLENLVFGCGRNNKGLF---GGASGLMGLGRSSVSLVSQTLK 285
Query: 154 AGLIQNSFSIC---FDENDSGSVFFGDQGPATQQSTS--FLPIGEK---YDAYFVGVESY 205
FS C ++ SG++ FG+ + STS + P+ + Y + +
Sbjct: 286 T--FNGVFSYCLPSLEDGASGTLSFGNDFSVYKNSTSVFYTPLVQNPQLRSFYILNLTGA 343
Query: 206 CIGNSCLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
IG L F L+DSG T LP IY V +F K S + + C+N
Sbjct: 344 SIGGVELKTLSFGRGILIDSGTVITRLPPSIYKAVKTEFLKQFSGFPSAPGYSILDTCFN 403
Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNF 321
+S E + +P +++IF N V +F F + + V L +S + + GIIG
Sbjct: 404 LTSYEDISIPTIKMIFEGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQ 463
Query: 322 MMGHRIVFDRENLKLAWSHSKC 343
R+++D +L + C
Sbjct: 464 QKNQRVIYDTTQERLGIAGENC 485
>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
gi|219888509|gb|ACL54629.1| unknown [Zea mays]
gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
Length = 415
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 79/317 (24%), Positives = 135/317 (42%), Gaps = 32/317 (10%)
Query: 51 SSSKNVSCSHPLCKSRSSCKSLKDPCP------YIADYSTEDTSSSGYLVDDILHLASFS 104
++++ V C++ LC + S + + CP Y Y T+ SS G L++D SFS
Sbjct: 99 TANRLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKY-TDSASSQGVLIND-----SFS 152
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDG--AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S+++ + GCG Q AA DG++GLG G VS+ S L + G+ +N
Sbjct: 153 LPMRSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVG 212
Query: 163 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGNSCLTQSGFQAL 220
C N G +FFGD + + T ++P+ ++ + Y G + L + +
Sbjct: 213 HCLSTNGGGFLFFGDDVVPSSRVT-WVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVV 271
Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF- 279
DSG+++T+ + Y VV +S + + C+ + V D++ F
Sbjct: 272 FDSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCWKG-QKAFKSVFDVKNEFK 330
Query: 280 SKNQSFV-VRNHIFSFPENEGFTV-----FCLTVMSTDG-----DYGIIGQNFMMGHRIV 328
S SF +N P V CL ++ DG + +IG M ++
Sbjct: 331 SMFLSFASAKNAAMEIPPENYLIVTKNGNVCLGIL--DGTAAKLSFNVIGDITMQDQMVI 388
Query: 329 FDRENLKLAWSHSKCEE 345
+D E +L W+ C
Sbjct: 389 YDNEKSQLGWARGACTR 405
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 85/323 (26%), Positives = 146/323 (45%), Gaps = 27/323 (8%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ +DP SSS+S +SCS C+S +SC S + C Y Y + + +SGY V D
Sbjct: 121 LNYFDPRSSSTSSLISCSDRRCRSGVQTSDASCSSQNNQCTYTFQYG-DGSGTSGYYVSD 179
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAG 155
++H A + ++ +SV+ GC QTG A DG+ G G +SV S L+ G
Sbjct: 180 LMHFAGIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQG 239
Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
+ FS C ++SG G + + + P+ + Y + ++S + +
Sbjct: 240 IAPRVFSHCLKGDNSGGGVL-VLGEIVEPNIVYSPLVQSQPHYNLNLQSISVNGQIVPIA 298
Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNAS 265
T + +VDSG + +L E Y V LV S + + +GN CY +
Sbjct: 299 PAVFATSNNRGTIVDSGTTLAYLAEEAYNPFVNAITALVPQSVRSVLSRGNQ---CYLIT 355
Query: 266 SEEMLKV-PDMRLIFSKNQSFVVRNHIFSFPEN---EGFTVFCLTVMSTDGDYGIIGQNF 321
+ + + P + L F+ S V+R + +N EG +V+C+ G I +
Sbjct: 356 TSSNVDIFPQVSLNFAGGASLVLRPQDYLMQQNYIGEG-SVWCIGFQRIPGQSITILGDL 414
Query: 322 MMGHRI-VFDRENLKLAWSHSKC 343
++ +I V+D ++ W++ C
Sbjct: 415 VLKDKIFVYDLAGQRIGWANYDC 437
>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
Length = 357
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 79/317 (24%), Positives = 135/317 (42%), Gaps = 32/317 (10%)
Query: 51 SSSKNVSCSHPLCKSRSSCKSLKDPCP------YIADYSTEDTSSSGYLVDDILHLASFS 104
++++ V C++ LC + S + + CP Y Y T+ SS G L++D SFS
Sbjct: 41 TANRLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKY-TDSASSQGVLIND-----SFS 94
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDG--AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S+++ + GCG Q AA DG++GLG G VS+ S L + G+ +N
Sbjct: 95 LPMRSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVG 154
Query: 163 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGNSCLTQSGFQAL 220
C N G +FFGD + + T ++P+ ++ + Y G + L + +
Sbjct: 155 HCLSTNGGGFLFFGDDVVPSSRVT-WVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVV 213
Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF- 279
DSG+++T+ + Y VV +S + + C+ + V D++ F
Sbjct: 214 FDSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCWKG-QKAFKSVFDVKNEFK 272
Query: 280 SKNQSFV-VRNHIFSFPENEGFTV-----FCLTVMSTDG-----DYGIIGQNFMMGHRIV 328
S SF +N P V CL ++ DG + +IG M ++
Sbjct: 273 SMFLSFASAKNAAMEIPPENYLIVTKNGNVCLGIL--DGTAAKLSFNVIGDITMQDQMVI 330
Query: 329 FDRENLKLAWSHSKCEE 345
+D E +L W+ C
Sbjct: 331 YDNEKSQLGWARGACTR 347
>gi|449459186|ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 418
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 78/321 (24%), Positives = 135/321 (42%), Gaps = 29/321 (9%)
Query: 47 PSSSSSSKNVSCSHPLCKSRSSCKSLK----DPCPYIADYSTEDTSSSGYLVDDILHLAS 102
P S+ V C PLC S S + D C Y +Y+ + SS G LV D+ L +
Sbjct: 98 PLYQPSNDLVPCKDPLCMSLHSSMDHRCENPDQCDYEVEYA-DGGSSLGVLVRDVFPL-N 155
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
+ P ++ + +GCG Q DG++GLG G VS+ S L G+++N
Sbjct: 156 LTNGDP---IRPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVG 212
Query: 163 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQALV 221
CF+ G +FFGD G + P+ Y ++ G +
Sbjct: 213 HCFNSKGGGYLFFGD-GIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVF 271
Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIF 279
DSG+S+T+ + Y + ++ ++ K R ++ ++ C+ + + + D+R F
Sbjct: 272 DSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRG-RKPIKSLRDVRKYF 330
Query: 280 S----KNQSFVVRNHIFSFPENEGFTVF------CLTVMS-TD---GDYGIIGQNFMMGH 325
S +F P EG+ + CL +++ TD + IIG M
Sbjct: 331 KPLALSFSSGGRSKAVFEIP-TEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDK 389
Query: 326 RIVFDRENLKLAWSHSKCEEV 346
+V++ E + W+ + C+ V
Sbjct: 390 MVVYNNEKQAIGWATANCDRV 410
>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 415
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 79/317 (24%), Positives = 135/317 (42%), Gaps = 32/317 (10%)
Query: 51 SSSKNVSCSHPLCKSRSSCKSLKDPCP------YIADYSTEDTSSSGYLVDDILHLASFS 104
++++ V C++ LC + S + + CP Y Y T+ SS G L++D SFS
Sbjct: 99 TANRLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKY-TDSASSQGVLIND-----SFS 152
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDG--AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S+++ + GCG Q AA DG++GLG G VS+ S L + G+ +N
Sbjct: 153 LPMRSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVG 212
Query: 163 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGNSCLTQSGFQAL 220
C N G +FFGD + + T ++P+ ++ + Y G + L + +
Sbjct: 213 HCLSTNGGGFLFFGDDVVPSSRVT-WVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVV 271
Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF- 279
DSG+++T+ + Y VV +S + + C+ + V D++ F
Sbjct: 272 FDSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCWKG-QKAFKSVFDVKNEFK 330
Query: 280 SKNQSF-VVRNHIFSFPENEGFTV-----FCLTVMSTDG-----DYGIIGQNFMMGHRIV 328
S SF +N P V CL ++ DG + +IG M ++
Sbjct: 331 SMFLSFSSAKNAAMEIPPENYLIVTKNGNVCLGIL--DGTAAKLSFNVIGDITMQDQMVI 388
Query: 329 FDRENLKLAWSHSKCEE 345
+D E +L W+ C
Sbjct: 389 YDNEKSQLGWARGACTR 405
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 87/322 (27%), Positives = 143/322 (44%), Gaps = 36/322 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++PS SSS KN+ CS LC S R + S ++ C Y Y + + S G L D L L S
Sbjct: 129 FNPSKSSSYKNIPCSSKLCHSVRDTSCSDQNSCQYKISYG-DSSHSQGDLSVDTLSLEST 187
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S +P S + ++IGCG G++ G A G++GLG G VS+ + L + I FS
Sbjct: 188 SG-SPVSFPK--IVIGCGTDNAGTF--GGASSGIVGLGGGPVSLITQLGSS--IGGKFSY 240
Query: 164 CF------DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLTQSG 216
C + N S + FGD + P+ +K YF+ ++++ +GN + G
Sbjct: 241 CLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGG 300
Query: 217 F--------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
++DSG + T +P+++Y + LV R+ + CY+ S E
Sbjct: 301 SSEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSLCYSLKSNE 360
Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFS--FPENEGFTVFCLTVMSTDGD-YGIIG-QNFMMG 324
D +I + V H S P +G F G +G + QN ++G
Sbjct: 361 Y----DFPIITVHFKGADVELHSISTFVPITDGIVCFAFQPSPQLGSIFGNLAQQNLLVG 416
Query: 325 HRIVFDRENLKLAWSHSKCEEV 346
+D + +++ + C +V
Sbjct: 417 ----YDLQQKTVSFKPTDCTKV 434
>gi|413924529|gb|AFW64461.1| hypothetical protein ZEAMMB73_591827 [Zea mays]
Length = 217
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 39/81 (48%), Positives = 54/81 (66%), Gaps = 3/81 (3%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
DR+L Y P+ S++S+++ CSH LC+S C + K PCPY DY +E+T+SSG L++D L
Sbjct: 140 DRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTL 199
Query: 99 HLASFSKHAPQSSVQSSVIIG 119
HL H P V +SVIIG
Sbjct: 200 HLNYREDHVP---VNASVIIG 217
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 79/318 (24%), Positives = 145/318 (45%), Gaps = 32/318 (10%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLV 94
+N ++DP++S+S KNVSCS CK + + + + C Y Y + T G+L
Sbjct: 178 QNQPKFDPTTSTSYKNVSCSSEFCKLIAEGNYPAQDCISNTCLYGIQYGSGYTI--GFLA 235
Query: 95 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
+ L +AS S V + + GC + G++ +G G++GLG +++PS
Sbjct: 236 TETLAIAS-------SDVFKNFLFGCSEESRGTF-NGTT--GLLGLGRSPIALPSQTTNK 285
Query: 155 GLIQNSFSICFDENDS--GSVFFGDQGPATQQSTSFLP-IGEKYDAYFVGVESYCIGNSC 211
+N FS C + S G + FG + +ST P + + Y VG+ +
Sbjct: 286 --YKNLFSYCLPASPSSTGHLSFGVEVSQAAKSTPISPKLKQLYGLNTVGIS---VRGRE 340
Query: 212 LTQSGF--QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS--E 267
L +G + ++DSG +FTFLP+ Y+ + F +++++ ++ +S++ CY+ S+
Sbjct: 341 LPINGSISRTIIDSGTTFTFLPSPTYSALGSAFREMMANYTLTNGTSSFQPCYDFSNIGN 400
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGH 325
L +P + + F + P N G CL T D D+ I G +
Sbjct: 401 GTLTIPGISIFFEGGVEVEIDVSGIMIPVN-GLKEVCLAFADTGSDSDFAIFGNYQQKTY 459
Query: 326 RIVFDRENLKLAWSHSKC 343
+++D + ++ C
Sbjct: 460 EVIYDVAKGMVGFAPKGC 477
>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 413
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 83/315 (26%), Positives = 130/315 (41%), Gaps = 29/315 (9%)
Query: 51 SSSKNVSCSHPLCKSRSSCKSLKDPC--PYIADYS---TEDTSSSGYLVDDILHLASFSK 105
+ +K V C+ +C + S +S C P DY T+ SS G LV D L +
Sbjct: 98 TKNKLVPCAASICTTLHSAQSPNKKCAVPQQCDYQIKYTDSASSLGVLVTDNFTLPLRN- 156
Query: 106 HAPQSSVQSSVIIGCGRKQT--GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
SSV+ S GCG Q + + A DG++GLG G VS+ S L G+ +N
Sbjct: 157 ---SSSVRPSFTFGCGYDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGH 213
Query: 164 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGNSCLTQSGFQALV 221
C N G +FFGD T ++T ++P+ + Y G + L + +
Sbjct: 214 CLSTNGGGFLFFGDNVVPTSRAT-WVPMVRSTSGNYYSPGSGTLYFDRRSLGVKPMEVVF 272
Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK-VPDMRLIFS 280
DSG+++T+ + Y V +S + S C+ +++ K V D++ F
Sbjct: 273 DSGSTYTYFAAQPYQATVSALKAGLSKSLQQVSDPSLPLCWKG--QKVFKSVSDVKNDFK 330
Query: 281 KNQSFVVRNHIFSFPENEGFTVF-----CLTVMSTDGD-----YGIIGQNFMMGHRIVFD 330
V+N + P V CL ++ DG + IIG M I++D
Sbjct: 331 SLFLSFVKNSVLEIPPENYLIVTKNGNACLGIL--DGSAAKLTFNIIGDITMQDQLIIYD 388
Query: 331 RENLKLAWSHSKCEE 345
E +L W C
Sbjct: 389 NERGQLGWIRGSCSR 403
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 83/329 (25%), Positives = 144/329 (43%), Gaps = 40/329 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCK---------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+DP++S S +NV+C C+ R + DPCPY Y + ++
Sbjct: 191 FDPAASISYRNVTCGDDRCRLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTTGD---- 246
Query: 96 DILHLASFSKHAPQSSVQS--SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
L L +F+ + QS + V GCG + G + A ++GLG G +S S L +
Sbjct: 247 --LALEAFTVNLTQSGTRRVDGVAFGCGHRNRGLFHGAAG---LLGLGRGPLSFASQL-R 300
Query: 154 AGLIQNSFSICFDENDSGS---VFFGDQGPATQQS----TSFLPIGEKYDAYFVGVESYC 206
++FS C E+ S + + FG T+F P + Y++ ++S
Sbjct: 301 GVYGGHAFSYCLVEHGSAAGSKIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSIL 360
Query: 207 IGNSCL-----TQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKY 260
+G + T S ++DSG + ++ P Y + F D++ S + L
Sbjct: 361 VGGEAVNISSDTLSAGGTIIDSGTTLSYFPEPAYQAIRQAFIDRMSPSYPLILGFPVLSP 420
Query: 261 CYNASSEEMLKVPDMRLIFSKNQS--FVVRNHIFSFPENEGFTVFCLTVMST-DGDYGII 317
CYN S E ++VP++ L+F+ + F N+ E EG + CL V+ T II
Sbjct: 421 CYNVSGAEKVEVPELSLVFADGAAWEFPAENYFIRL-EPEG--IMCLAVLGTPRSGMSII 477
Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
G +++D E+ +L ++ +C +V
Sbjct: 478 GNYQQQNFHVLYDLEHNRLGFAPRRCADV 506
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 84/328 (25%), Positives = 135/328 (41%), Gaps = 42/328 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP SSS +SC LC S R SC C Y Y + + + G L + + L S
Sbjct: 82 FDPEGSSSYTTMSCGDTLCDSLPRKSCSP---NCDYSYGYG-DGSGTRGTLSSETVTLTS 137
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
+ ++ GCG GS+ D + G++GLG G++S S L L + FS
Sbjct: 138 TQG---EKLAAKNIAFGCGHLNRGSFNDAS---GLVGLGRGNLSFVSQLGD--LFGHKFS 189
Query: 163 ICF-----DENDSGSVFFGDQGPATQQST----SFLPIGEK---YDAYFVGVESYCIGNS 210
C + + +FFGD+ + +F P+ Y+V ++ I
Sbjct: 190 YCLVPWRDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGR 249
Query: 211 CL---------TQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
L G ++ DSG + T LP Y V+ VS I
Sbjct: 250 ALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPEIDGSSAGLDL 309
Query: 261 CYNASSEEM---LKVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGI 316
CY+ S + K+P M F + V N+ + N+ T+ CL ++S++ D GI
Sbjct: 310 CYDVSGSKASYKKKIPAMVFHFEGADHQLPVENYFIA--ANDAGTIVCLAMVSSNMDIGI 367
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCE 344
G R+++D + K+ W+ S+C+
Sbjct: 368 YGNMMQQNFRVMYDIGSSKIGWAPSQCD 395
>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
Length = 410
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 76/314 (24%), Positives = 135/314 (42%), Gaps = 33/314 (10%)
Query: 56 VSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 110
V CS+ LC++ S+ C + D C Y +Y+ + SS G L+ D L + + +
Sbjct: 104 VPCSNSLCQAVSTGENYHCDAPDDQCDYEIEYA-DLGSSIGVLLSDSFPL----RLSNGT 158
Query: 111 SVQSSVIIGCG--RKQTGSYLDGAAPD--GVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 166
+Q + GCG +K G + PD G++GLG G VS+ S L G+ QN CF
Sbjct: 159 LLQPKMAFGCGYDQKHLGPH---PPPDTAGILGLGRGKVSILSQLRTLGITQNVVGHCFS 215
Query: 167 ENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGA 225
G +FFGD P+++ + + + Y G G G Q + DSG+
Sbjct: 216 RARGGFLFFGDHLFPSSRITWTPMLRSSSDTLYSSGPAELLFGGKPTGIKGLQLIFDSGS 275
Query: 226 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-----WKYCYNASS----EEMLKVPDMR 276
S+T+ ++Y ++ K ++ K + WK S + K +
Sbjct: 276 SYTYFNAQVYQSILNLVRKDLAGKPLKDAPEKELAVCWKTAKPIKSILDIKSYFKPLTIS 335
Query: 277 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMGHRIVFDRE 332
+ +KN + + +G CL +++ G++ +IG FM +++D E
Sbjct: 336 FMNAKNVQLQLAPEDYLIITKDGNV--CLGILNGSEQQLGNFNVIGDIFMQDRVVIYDNE 393
Query: 333 NLKLAWSHSKCEEV 346
++ W + C+ +
Sbjct: 394 KQQIGWFPANCDRL 407
>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 578
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 85/326 (26%), Positives = 138/326 (42%), Gaps = 51/326 (15%)
Query: 56 VSCSHPLC------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 109
V S P C + C+S C Y +Y+ + + S G L D HL K
Sbjct: 251 VRSSEPFCVEVQRNQLTEHCESCHQ-CDYEIEYA-DHSYSMGVLTKDKFHL----KLHNG 304
Query: 110 SSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--D 166
S +S ++ GCG Q G L+ DG++GL +S+PS LA G+I N C D
Sbjct: 305 SLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLASD 364
Query: 167 ENDSGSVFFG-DQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCLTQSGF-----Q 218
N G +F G D P+ +++P+ + Y + V GN+ L+ G +
Sbjct: 365 LNGEGYIFMGSDLVPS--HGMTWVPMLHHPHLEVYQMQVTKMSYGNAMLSLDGENGRVGK 422
Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS------WKYCYNASSEEM--- 269
L D+G+S+T+ P + Y+++V + VS ++ + W+ N+ +
Sbjct: 423 VLFDTGSSYTYFPNQAYSQLVTSLQE-VSDLELTRDDSDEALPICWRAKTNSPISSLSDV 481
Query: 270 --------LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS----TDGDYGII 317
L++ LI SK +++ + N+G CL ++ DG II
Sbjct: 482 KKFFRPITLQIGSKWLIISK--KLLIQPEDYLIISNKGNV--CLGILDGSNVHDGSTIII 537
Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKC 343
G M G IV+D ++ W S C
Sbjct: 538 GDISMRGRLIVYDNVKQRIGWMKSDC 563
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 81/323 (25%), Positives = 137/323 (42%), Gaps = 25/323 (7%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDIL 98
+LS +D ++SS+SK V C C S S + C Y Y+ E TS G + D L
Sbjct: 117 HLSLFDVNASSTSKKVGCDDDFCSFISQSDSCQPAVGCSYHIVYADESTSE-GNFIRDKL 175
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
L + + V+ GCG Q+G +A DGVMG G + SV S LA G
Sbjct: 176 TLEQVTGDLQTGPLGQEVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDA 235
Query: 158 QNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVE----SYCIGNSCL 212
+ FS C D G +F G ++T +P Y+ +G++ + + S +
Sbjct: 236 KRVFSHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTALDLPPSIM 295
Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNASSEEMLK 271
G +VDSG + + P +Y ++ + +++ + + L + C++ S +
Sbjct: 296 RNGG--TIVDSGTTLAYFPKVLYDSLI---ETILARQPVKLHIVEDTFQCFSFSENVDVA 350
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFP-ENEGFTVFCLTVMS---TDGDYG---IIGQNFMMG 324
P + F + V H + F E E ++C + T G+ ++G +
Sbjct: 351 FPPVSFEFEDSVKLTVYPHDYLFTLEKE---LYCFGWQAGGLTTGERTEVILLGDLVLSN 407
Query: 325 HRIVFDRENLKLAWSHSKCEEVI 347
+V+D EN + W+ C I
Sbjct: 408 KLVVYDLENEVIGWADHNCSSSI 430
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 66/242 (27%), Positives = 116/242 (47%), Gaps = 30/242 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++PS SSS KN+ CS LC+S R + + ++ C Y ++S + + S G L + L L S
Sbjct: 129 FNPSKSSSYKNIPCSSNLCQSVRYTSCNKQNSCEYTINFS-DQSYSQGELSVETLTLDST 187
Query: 104 SKHA---PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+ H+ P++ +IGCG G + G++GLG+G VS+ + L + I
Sbjct: 188 TGHSVSFPKT------VIGCGHNNRGMF--QGETSGIVGLGIGPVSLTTQLKSS--IGGK 237
Query: 161 FSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSCL- 212
FS C D N + + FGD + P +K Y++ +E++ +GN +
Sbjct: 238 FSYCLLPLLVDSNKTSKLNFGDAAVVSGDGVVSTPFVKKDPQAFYYLTLEAFSVGNKRIE 297
Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
++ G ++DSG + T LP+ +Y + +LV R+ CY+ +S
Sbjct: 298 FEVLDDSEEG-NIILDSGTTLTLLPSHVYTNLESAVAQLVKLDRVDDPNQLLNLCYSITS 356
Query: 267 EE 268
++
Sbjct: 357 DQ 358
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 81/332 (24%), Positives = 148/332 (44%), Gaps = 46/332 (13%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+ P SS+ + V C+ + +C C Y Y+ E ++SSG L +D++
Sbjct: 130 RFQPELSSTYQPVKCN-----ADCNCDENGVQCTYERRYA-EMSTSSGVLAEDVMSFGKE 183
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S+ PQ +V GC ++G A DG+MGLG G +SV L G++ NSFS+
Sbjct: 184 SELVPQRAV-----FGCETMESGDLYTQRA-DGIMGLGRGTLSVMDQLVGKGVVSNSFSL 237
Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------Q 214
C+ D G V G P + P Y Y + ++ + L
Sbjct: 238 CYGGMDVGGGAMVLGGISSPPGMVFSHSDPSRSPY--YNIELKEIHVAGKPLKLNPRTFD 295
Query: 215 SGFQALVDSGASFTFLPTEIY---AEVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEML 270
+ A++DSG ++ + P + Y + ++K K+ K+IS ++K C++ + ++
Sbjct: 296 GKYGAILDSGTTYAYFPEKAYYAFKDAIMK--KISFLKQISGPDPNFKDICFSGAGRDVT 353
Query: 271 KV----PDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQ 319
++ P++ ++F+ Q S N++F + G +CL + D GII +
Sbjct: 354 ELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSG--AYCLGIFKNGNDQTTLLGGIIVR 411
Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVIDKSH 351
N + + ++REN + + + C E+ H
Sbjct: 412 NTL----VTYNRENSTIGFWKTNCSELWKNLH 439
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 89/326 (27%), Positives = 139/326 (42%), Gaps = 41/326 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP S S V C P+C+ S C ++ C Y Y + + ++G + L A
Sbjct: 170 FDPRRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYG-DGSVTAGDFASETLTFAR 228
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
++ VQ V IGCG G ++ A G++GLG G +S PS +A++ SFS
Sbjct: 229 GAR------VQR-VAIGCGHDNEGLFI---AASGLLGLGRGRLSFPSQIARS--FGRSFS 276
Query: 163 ICFDEN---------DSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS 210
C + S +V FG A SF P+G Y+V + + +G +
Sbjct: 277 YCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGA 336
Query: 211 ---CLTQSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS- 257
++QS + ++DSG S T L +Y V F R+S G S
Sbjct: 337 RVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSL 396
Query: 258 WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
+ CYN S ++KVP + + + S + + P + T FC + TDG II
Sbjct: 397 FDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGT-FCFAMAGTDGGVSII 455
Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKC 343
G G R+VFD + ++ + C
Sbjct: 456 GNIQQQGFRVVFDGDAQRVGFVPKSC 481
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 83/340 (24%), Positives = 143/340 (42%), Gaps = 36/340 (10%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ YD S + K VSC C + S C + C Y Y+ + +SS GY V D
Sbjct: 142 LTLYDIKESLTGKLVSCDQDFCYAINGGPPSYCIA-NMSCSYTEIYA-DGSSSFGYFVRD 199
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
I+ S +S SVI GC Q+G A DG++G G + S+ S LA +G
Sbjct: 200 IVQYDQVSGDLETTSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGK 259
Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--- 213
++ F+ C D + G +F G Q + P+ Y V +++ +G L
Sbjct: 260 VRKMFAHCLDGLNGGGIF--AIGHIVQPKVNTTPLVPNQTHYNVNMKAVEVGGYFLNLPT 317
Query: 214 -------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
+ G ++DSG + +LP +Y +++ K S ++ + + C+ S
Sbjct: 318 DVFDVGDKKG--TIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQFT-CFQYSE 374
Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTD-GDYGIIGQN 320
P + F + V H + F + ++C+ + S D + ++G
Sbjct: 375 SLDDGFPAVTFHFENSLYLKVHPHEYLFSYD---GLWCIGWQNSGMQSRDRRNITLLGDL 431
Query: 321 FMMGHRIVFDRENLKLAWSHSKCE---EVIDKSH--VHLV 355
+ +++D EN + W+ C +V+D+ VHLV
Sbjct: 432 ALSNKLVLYDLENQVIGWTEYNCSSSIKVVDEQSGTVHLV 471
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 81/333 (24%), Positives = 140/333 (42%), Gaps = 38/333 (11%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+ P SS+ + V C+ P C +C C Y Y+ E +SSSG + +D++ +
Sbjct: 118 RFQPDLSSTYRPVKCN-PSC----NCDDEGKQCTYERRYA-EMSSSSGVIAEDVVSFGNE 171
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S+ PQ +V GC +TG A DG+MGLG G +SV L G+I +SFS+
Sbjct: 172 SELKPQRAV-----FGCENVETGDLYSQRA-DGIMGLGRGRLSVVDQLVDKGVIGDSFSL 225
Query: 164 CFDEND--SGSVFFGDQGPATQQSTSFL-PIGEKYDAYFVGVESYCIGNSCLT------Q 214
C+ D G++ G P S P Y Y + ++ + L
Sbjct: 226 CYGGMDVGGGAMVLGQISPPPNMVFSHSNPYRSPY--YNIELKELHVAGKPLKLKPKVFD 283
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRI-SLQGNSWKYCYNASSEEMLKV 272
++DSG ++ + P + + K + K+I N C++ + E+ +
Sbjct: 284 EKHGTVLDSGTTYAYFPEAAFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAGREVSHL 343
Query: 273 ----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQNFMM 323
P++ ++F Q + + F + +CL + D GI+ +N +
Sbjct: 344 SKVFPEVNMVFGSGQKLSLSPENYLFRHTKVSGAYCLGIFQNGNDLTTLLGGIVVRNTL- 402
Query: 324 GHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVP 356
+ +DREN K+ + + C E+ V VP
Sbjct: 403 ---VTYDRENDKIGFWKTNCSELWKSLQVPGVP 432
>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
Length = 446
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 77/304 (25%), Positives = 129/304 (42%), Gaps = 38/304 (12%)
Query: 70 KSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY- 128
K C Y Y+ + S G+LV D + +K + + ++ + GCG Q S
Sbjct: 151 KEASQRCDYDVAYA-DHGYSEGFLVRDSVRALLTNK----TVLTANSVFGCGYNQRESLP 205
Query: 129 LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQQST 186
+ A DG++GLG G S+PS AK GLI+N C D G +FFGD +T T
Sbjct: 206 VSDARTDGILGLGSGMASLPSQWAKQGLIKNVIGHCIFGAGRDGGYMFFGDDLVSTSAMT 265
Query: 187 SFLPIGE-KYDAYFVGVESYCIGNSCLTQSGFQA-----LVDSGASFTFLPTEIYAEVVV 240
+G Y+VG GN L + G + DSG+++T+ + Y +
Sbjct: 266 WVPMLGRPSIKHYYVGAAQMNFGNKPLDKDGDGKKLGGIIFDSGSTYTYFTNQAYGAFLS 325
Query: 241 KFDKLVSSKRISLQGNS------W--KYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIF 292
+ +S K++ + W K + + +E + L F ++ +
Sbjct: 326 VVKENLSGKQLEQDSSDSFLSLCWRRKEGFRSVAEAAAYFKPLTLKFRSTKT----KQME 381
Query: 293 SFPENEGFTV------FCLTVMSTDG----DYGIIGQNFMMGHRIVFDRENLKLAWSHSK 342
FP EG+ V CL +++ D ++G G +V+D E ++ W+ S
Sbjct: 382 IFP--EGYLVVNKKGNVCLGILNGTAIGIVDTNVLGDISFQGQLVVYDNEKNQIGWARSD 439
Query: 343 CEEV 346
C+E+
Sbjct: 440 CQEI 443
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 81/332 (24%), Positives = 148/332 (44%), Gaps = 46/332 (13%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+ P SS+ + V C+ + +C C Y Y+ E ++SSG L +D++
Sbjct: 130 RFQPELSSTYQPVKCN-----ADCNCDENGVQCTYERRYA-EMSTSSGVLAEDVMSFGKE 183
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S+ PQ +V GC ++G A DG+MGLG G +SV L G++ NSFS+
Sbjct: 184 SELVPQRAV-----FGCETMESGDLYTQRA-DGIMGLGRGTLSVMDQLVGKGVVSNSFSL 237
Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------Q 214
C+ D G V G P + P Y Y + ++ + L
Sbjct: 238 CYGGMDVGGGAMVLGGISSPPGMVFSHSDPSRSPY--YNIELKEIHVAGKPLKLNPRTFD 295
Query: 215 SGFQALVDSGASFTFLPTEIY---AEVVVKFDKLVSSKRISLQGNSWK-YCYNASSEEML 270
+ A++DSG ++ + P + Y + ++K K+ K+IS ++K C++ + ++
Sbjct: 296 GKYGAILDSGTTYAYFPEKAYYAFKDAIMK--KISFLKQISGPDPNFKDICFSGAGRDVT 353
Query: 271 KV----PDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-----GIIGQ 319
++ P++ ++F+ Q S N++F + G +CL + D GII +
Sbjct: 354 ELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSG--AYCLGIFKNGNDQTTLLGGIIVR 411
Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVIDKSH 351
N + + ++REN + + + C E+ H
Sbjct: 412 NTL----VTYNRENSTIGFWKTNCSELWKNLH 439
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 88/333 (26%), Positives = 144/333 (43%), Gaps = 51/333 (15%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
++L ++PS S++ + VSCS P+C +SC S K C Y Y +++ S G D
Sbjct: 122 QDLPMFNPSKSTTYRKVSCSSPVCSFTGEDNSC-SFKPDCTYSISYG-DNSHSQGDFAVD 179
Query: 97 ILHLASFSKHA---PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
L + S S P+++ IGCG GS+ A G++GLGLG S+ +
Sbjct: 180 TLTMGSTSGRVVAFPRTA------IGCGHDNAGSF--DANVSGIVGLGLGPASLIKQMGS 231
Query: 154 AGLIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYF------- 199
A + FS C D+ S + FG + PI +K+ +++
Sbjct: 232 A--VGGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAV 289
Query: 200 -VGVES--YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN 256
VG + Y NS L ++DSG + T LP ++Y ++ +R
Sbjct: 290 SVGRNNTFYSTANSILGGKA-NIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQ 348
Query: 257 SWKYCYNASSEEMLKVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGD- 313
+YC+ ++++ KVP + + F N N + +N V CL + D D
Sbjct: 349 FLEYCFETTTDD-YKVPFIAMHFEGANLRLQRENVLIRVSDN----VICLAFAGAQDNDI 403
Query: 314 --YGIIGQ-NFMMGHRIVFDRENLKLAWSHSKC 343
YG I Q NF++G +D N+ L++ C
Sbjct: 404 SIYGNIAQINFLVG----YDVTNMSLSFKPMNC 432
>gi|326533540|dbj|BAK05301.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 78/320 (24%), Positives = 133/320 (41%), Gaps = 41/320 (12%)
Query: 55 NVSCSHPLCKS-RSSCK-----SLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKH 106
V C PLC + R S DP C Y Y T S G L DI+ + K
Sbjct: 93 KVVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVT--GKSEGDLATDIISVNGRDK- 149
Query: 107 APQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSIC 164
+ GCG KQ +P DG++GLG+G + + L +I +N C
Sbjct: 150 -------KRIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKENVIGHC 202
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDS 223
G ++ GD P T+ T + P+ E Y G+ I + F+A+ DS
Sbjct: 203 LSSKGKGVLYVGDFNPPTRGVT-WAPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDS 261
Query: 224 GASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYNASS--------EEMLKVPD 274
G+++T +P +IY E+V K +S + ++G + C+ + K
Sbjct: 262 GSTYTHVPAQIYNEIVSKVRVTLSESSLEEVKGRALPLCWKGKKPFGSVNDVKNQFKALS 321
Query: 275 MRLIFSKNQSFV-VRNHIFSFPENEGFTVFCLTVMSTDGD-------YGIIGQNFMMGHR 326
+++ ++ S + + + F + +G T CL ++ D + +IG M
Sbjct: 322 LKITHARGTSNLDIPPQNYLFVKEDGET--CLAILDASLDPVLKELNFILIGAVTMQDLF 379
Query: 327 IVFDRENLKLAWSHSKCEEV 346
+++D E +L W ++C+ V
Sbjct: 380 VIYDNEKKQLGWVRAQCDRV 399
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 89/326 (27%), Positives = 139/326 (42%), Gaps = 41/326 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP S S V C P+C+ S C ++ C Y Y + + ++G + L A
Sbjct: 164 FDPRRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYG-DGSVTAGDFASETLTFAR 222
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
++ VQ V IGCG G ++ A G++GLG G +S PS +A++ SFS
Sbjct: 223 GAR------VQR-VAIGCGHDNEGLFI---AASGLLGLGRGRLSFPSQIARS--FGRSFS 270
Query: 163 ICFDEN---------DSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS 210
C + S +V FG A SF P+G Y+V + + +G +
Sbjct: 271 YCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGA 330
Query: 211 ---CLTQSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS- 257
++QS + ++DSG S T L +Y V F R+S G S
Sbjct: 331 RVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSL 390
Query: 258 WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
+ CYN S ++KVP + + + S + + P + T FC + TDG II
Sbjct: 391 FDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGT-FCFAMAGTDGGVSII 449
Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKC 343
G G R+VFD + ++ + C
Sbjct: 450 GNIQQQGFRVVFDGDAQRVGFVPKSC 475
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 88/333 (26%), Positives = 144/333 (43%), Gaps = 51/333 (15%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
++L ++PS S++ + VSCS P+C +SC S K C Y Y +++ S G D
Sbjct: 122 QDLPMFNPSKSTTYRKVSCSSPVCSFTGEDNSC-SFKPDCTYSISYG-DNSHSQGDFAVD 179
Query: 97 ILHLASFSKHA---PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
L + S S P+++ IGCG GS+ A G++GLGLG S+ +
Sbjct: 180 TLTMGSTSGRVVAFPRTA------IGCGHDNAGSF--DANVSGIVGLGLGPASLIKQMGS 231
Query: 154 AGLIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYF------- 199
A + FS C D+ S + FG + PI +K+ +++
Sbjct: 232 A--VGGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAV 289
Query: 200 -VGVES--YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN 256
VG + Y NS L ++DSG + T LP ++Y ++ +R
Sbjct: 290 SVGRNNTFYSTANSILGGKA-NIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQ 348
Query: 257 SWKYCYNASSEEMLKVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGD- 313
+YC+ ++++ KVP + + F N N + +N V CL + D D
Sbjct: 349 FLEYCFETTTDD-YKVPFIAMHFEGANLRLQRENVLIRVSDN----VICLAFAGAQDNDI 403
Query: 314 --YGIIGQ-NFMMGHRIVFDRENLKLAWSHSKC 343
YG I Q NF++G +D N+ L++ C
Sbjct: 404 SIYGNIAQINFLVG----YDVTNMSLSFKPMNC 432
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 80/329 (24%), Positives = 132/329 (40%), Gaps = 26/329 (7%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ +DP SS ++ VSCS C S S C + C Y Y + + +SG+ V D
Sbjct: 125 LNFFDPGSSVTATPVSCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYG-DGSGTSGFYVSD 183
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 155
+L + + + V+ GC QTG + A DG+ G G +SV S LA G
Sbjct: 184 VLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQG 243
Query: 156 LIQNSFSICFD-ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
L FS C EN G + G + + F P+ Y V + S + L
Sbjct: 244 LAPRVFSHCLKGENGGGGILV--LGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPI 301
Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNA 264
T +G ++D+G + +L Y V VS + + +GN CY
Sbjct: 302 NPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ---CYVI 358
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE--GFTVFCLTVMSTDGDYGIIGQNFM 322
++ P + L F+ S + + +N G V+C+ I + +
Sbjct: 359 ATSVADIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLV 418
Query: 323 MGHRI-VFDRENLKLAWSHSKCEEVIDKS 350
+ +I V+D ++ W++ C ++ S
Sbjct: 419 LKDKIFVYDLVGQRIGWANYDCSMSVNVS 447
>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
Length = 372
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 79/328 (24%), Positives = 136/328 (41%), Gaps = 57/328 (17%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ YDP+SS S+ VSC C S + CK + PC Y Y + +S++GY V D
Sbjct: 71 LTLYDPASSVSATRVSCDDDFCTSTYNGLLPDCKK-ELPCQYNVVYG-DGSSTAGYFVSD 128
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
+ + + +V GCG +Q+G G A DG++G
Sbjct: 129 AVQFERVTGNLQTGLSNGTVTFGCGAQQSGGLGTSGEALDGILG---------------- 172
Query: 156 LIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT- 213
+F+ C D + G +F G+ +T +P Y+ Y +E +G + L
Sbjct: 173 ----AFAHCLDNVNGGGIFAIGELVSPKVNTTPMVPNQAHYNVYMKEIE---VGGTVLEL 225
Query: 214 -----QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR-----ISLQGNSWKY- 260
SG + ++DSG + +LP +Y D +++ R +SL ++
Sbjct: 226 PTDVFDSGDRRGTIIDSGTTLAYLPEVVY-------DSMMNEIRSQQPGLSLHTVEEQFI 278
Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT---VMSTDG-DYGI 316
C+ S PD++ F + + V H + F +E F + S DG D +
Sbjct: 279 CFKYSGNVDDGFPDIKFHFKDSLTLTVYPHDYLFQISEDIWCFGWQNGGMQSKDGRDMTL 338
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCE 344
+G + +++D EN + W+ C+
Sbjct: 339 LGDLVLSNKLVLYDIENQAIGWTEYNCK 366
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 80/327 (24%), Positives = 137/327 (41%), Gaps = 25/327 (7%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKD-PCPYIADYSTEDTSSSGYLVD 95
L ++P +SS+S + CS C S + C++ + PC Y Y + + +SGY V
Sbjct: 135 LEFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYG-DGSGTSGYYVS 193
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKA 154
D ++ S + ++ +S++ GC Q+G A DG+ G G +SV S L
Sbjct: 194 DTMYFDSVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSL 253
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC-------I 207
G+ FS C +D+G G + + P+ Y + +ES I
Sbjct: 254 GVSPKVFSHCLKGSDNGGGIL-VLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPI 312
Query: 208 GNSCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYNA 264
+S T S Q +VDSG + +L Y V VS SL +GN C+
Sbjct: 313 DSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQ---CFVT 369
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
SS P + L F + V+ N++ + ++C+ G I + +
Sbjct: 370 SSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLV 429
Query: 323 MGHRI-VFDRENLKLAWSHSKCEEVID 348
+ +I V+D N+++ W+ C ++
Sbjct: 430 LKDKIFVYDLANMRMGWTDYDCSTSVN 456
>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
Length = 383
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 73/240 (30%), Positives = 107/240 (44%), Gaps = 19/240 (7%)
Query: 51 SSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+ SK V C H LC S + C S + C Y+ Y+ + SS+G L++D SF
Sbjct: 112 TKSKLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYA-DQGSSTGVLIND-----SF 165
Query: 104 SKHAPQSSV-QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSF 161
+ SV + SV GCG Q D ++P DGV+GLG G VS+ S L + G+ +N
Sbjct: 166 ALRLTNGSVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVV 225
Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSCLTQSGFQA 219
C G +FFGD Q++T + P+ + Y G S G+ L +
Sbjct: 226 GHCLSLRGGGFLFFGDDLVPYQRAT-WTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKV 284
Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 279
+ DSG+SFT+ + Y +V +S S C+ E V D+R F
Sbjct: 285 VFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKG-QEPFKSVLDVRKEF 343
>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 551
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 78/296 (26%), Positives = 139/296 (46%), Gaps = 23/296 (7%)
Query: 62 LCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 121
L ++ C++ K C Y +Y+ + +SS G L D +HL + + + + GC
Sbjct: 252 LQGDQNYCETCKQ-CDYEIEYA-DRSSSMGVLAKDDMHLIATNG----GREKLDFVFGCA 305
Query: 122 RKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQ 178
Q G L A DG++GL +S+PS LA G+I N F C + N G +F GD
Sbjct: 306 YDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGHCITRETNGGGYMFLGDD 365
Query: 179 GPATQQSTSFLPI-GEKYDAYFVGVESYCIGNSCL-TQSGFQALVDSGASFTFLPTEIYA 236
+ ++ PI G + Y + G+ L + Q + DSG+S+T+LP E+Y
Sbjct: 366 Y-VPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQELHAGNSVQVIFDSGSSYTYLPEEMYK 424
Query: 237 EVV--VKFD--KLVSSKRISLQGNSWKYCYNASS-EEMLKVPDMRLIFSKNQSFVVRNHI 291
++ +K D V + WK ++ S + L + R F ++F +
Sbjct: 425 NLIDAIKEDSPSFVQDSSDTTLPLCWKADFSVRSFFKPLNLHFGRRWFVVPKTFTIVPDD 484
Query: 292 FSFPENEGFTVFCLTVMS-TDGDYG---IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
+ ++G CL +++ T+ ++G I+G + G +V+D E ++ W++S+C
Sbjct: 485 YLIISDKGNV--CLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNERRQIGWANSEC 538
>gi|356529585|ref|XP_003533370.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1388
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 75/301 (24%), Positives = 131/301 (43%), Gaps = 34/301 (11%)
Query: 76 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAP 134
C Y Y+ + +SS G LV D LHL + + S + +V+ GCG Q G L+
Sbjct: 269 CDYEIQYA-DHSSSLGVLVRDELHLVTTNG----SKTKLNVVFGCGYDQAGLLLNTLGKT 323
Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS--VFFGDQGPATQQSTSFLPIG 192
DG+MGL VS+P LA GLI+N C + +G +F GD +++P+
Sbjct: 324 DGIMGLSRAKVSLPYQLASKGLIKNVVGHCLSNDGAGGGYMFLGDDF-VPYWGMNWVPMA 382
Query: 193 EKY--DAYFVGVESYCIGNSCLTQSG----FQALVDSGASFTFLPTEIYAEVVVKFDKLV 246
D Y + GN L G + + DSG+S+T+ P E Y ++V +++
Sbjct: 383 YTLTTDLYQTEILGINYGNRQLRFDGQSKVGKMVFDSGSSYTYFPKEAYLDLVASLNEVS 442
Query: 247 SSKRISLQGNS-----WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT 301
+ ++ W+ + S + +K L + + + +F EG+
Sbjct: 443 GLGLVQDDSDTTLPICWQANFPIKSVKDVKDYFKTLTLRFGSKWWILSTLFQISP-EGYL 501
Query: 302 VF------CLTVMS----TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSH 351
+ CL ++ DG I+G + G+ +V+D K+ W + C +D+ +
Sbjct: 502 IISNKGHVCLGILDGSNVNDGSSIILGDISLRGYSVVYDNVKQKIGWKRADC---VDRCY 558
Query: 352 V 352
+
Sbjct: 559 I 559
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 90/327 (27%), Positives = 141/327 (43%), Gaps = 42/327 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
YDPS+SS+ V CS C +SR +C + PC YI YS + S G L + L +
Sbjct: 108 YDPSASSTFSPVPCSSATCLPTWRSR-NCSNPSSPCRYIYSYS-DGAYSVGILGTETLTI 165
Query: 101 ASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
S P +V SV GCG G L+ G +GLG G + SLLA+ G+
Sbjct: 166 GS---SVPGQTVSVGSVAFGCGTDNGGDSLNST---GTVGLGRGTL---SLLAQLGV--G 214
Query: 160 SFSIC----FDENDSGSVFFGD-----QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS 210
FS C F+ F G GP T QST L YFV ++ +G+
Sbjct: 215 KFSYCLTDFFNSTMDSPFFLGTLAELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDV 274
Query: 211 CL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
L +VDSG +FT L + EVV + +L+ ++
Sbjct: 275 RLPIPNGTFDLRADGNGGMMVDSGTTFTILAKSGFREVVDRVAQLLGQPPVNASSLD-SP 333
Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ 319
C+ + E +PD+ L F+ + R++ S+ NE + FCL ++ + + +G
Sbjct: 334 CFPSPDGEPF-MPDLVLHFAGGADMRLHRDNYMSY--NEDDSSFCLNIVGSPSTWSRLGN 390
Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEV 346
+++FD +L++ + C ++
Sbjct: 391 FQQQNIQMLFDMTVGQLSFLPTDCSKL 417
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 88/326 (26%), Positives = 139/326 (42%), Gaps = 41/326 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP S S V C P+C+ S C ++ C Y Y + + ++G + L A
Sbjct: 164 FDPRRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYG-DGSVTAGDFASETLTFAR 222
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
++ VQ V IGCG G ++ A G++GLG G +S P+ +A++ SFS
Sbjct: 223 GAR------VQR-VAIGCGHDNEGLFI---AASGLLGLGRGRLSFPTQIARS--FGRSFS 270
Query: 163 ICFDEN---------DSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS 210
C + S +V FG A SF P+G Y+V + + +G +
Sbjct: 271 YCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGA 330
Query: 211 ---CLTQSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS- 257
++QS + ++DSG S T L +Y V F R+S G S
Sbjct: 331 RVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSL 390
Query: 258 WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
+ CYN S ++KVP + + + S + + P + T FC + TDG II
Sbjct: 391 FDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGT-FCFAMAGTDGGVSII 449
Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKC 343
G G R+VFD + ++ + C
Sbjct: 450 GNIQQQGFRVVFDGDAQRVGFVPKSC 475
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 79/327 (24%), Positives = 137/327 (41%), Gaps = 25/327 (7%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKD-PCPYIADYSTEDTSSSGYLVD 95
L ++P +SS+S + CS C S + C++ + PC Y Y + + +SGY V
Sbjct: 135 LEFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYG-DGSGTSGYYVS 193
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKA 154
D ++ + + ++ +S++ GC Q+G A DG+ G G +SV S L
Sbjct: 194 DTMYFDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSL 253
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC-------I 207
G+ FS C +D+G G + + P+ Y + +ES I
Sbjct: 254 GVSPKVFSHCLKGSDNGGGIL-VLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPI 312
Query: 208 GNSCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYNA 264
+S T S Q +VDSG + +L Y V VS SL +GN C+
Sbjct: 313 DSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQ---CFVT 369
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
SS P + L F + V+ N++ + ++C+ G I + +
Sbjct: 370 SSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLV 429
Query: 323 MGHRI-VFDRENLKLAWSHSKCEEVID 348
+ +I V+D N+++ W+ C ++
Sbjct: 430 LKDKIFVYDLANMRMGWTDYDCSTSVN 456
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 79/327 (24%), Positives = 137/327 (41%), Gaps = 25/327 (7%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKD-PCPYIADYSTEDTSSSGYLVD 95
L ++P +SS+S + CS C S + C++ + PC Y Y + + +SGY V
Sbjct: 161 LEFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYG-DGSGTSGYYVS 219
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKA 154
D ++ + + ++ +S++ GC Q+G A DG+ G G +SV S L
Sbjct: 220 DTMYFDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSL 279
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC-------I 207
G+ FS C +D+G G + + P+ Y + +ES I
Sbjct: 280 GVSPKVFSHCLKGSDNGGGIL-VLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPI 338
Query: 208 GNSCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYNA 264
+S T S Q +VDSG + +L Y V VS SL +GN C+
Sbjct: 339 DSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQ---CFVT 395
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
SS P + L F + V+ N++ + ++C+ G I + +
Sbjct: 396 SSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLV 455
Query: 323 MGHRI-VFDRENLKLAWSHSKCEEVID 348
+ +I V+D N+++ W+ C ++
Sbjct: 456 LKDKIFVYDLANMRMGWTDYDCSTSVN 482
>gi|2290202|gb|AAB96882.1| nucellin [Hordeum vulgare subsp. vulgare]
gi|2290204|gb|AAB96883.1| nucellin [Hordeum vulgare subsp. vulgare]
gi|45357050|gb|AAS58479.1| nucellin [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 77/322 (23%), Positives = 131/322 (40%), Gaps = 45/322 (13%)
Query: 55 NVSCSHPLCKS-RSSCK-----SLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKH 106
V C PLC + R S DP C Y Y T S G L DI+ + K
Sbjct: 93 KVVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVT--GKSEGDLATDIISVNGRDK- 149
Query: 107 APQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSIC 164
+ GCG KQ +P DG++GLG+G + L +I +N C
Sbjct: 150 -------KRIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGHKMIKENVIGHC 202
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDS 223
G ++ GD P T+ ++ P+ E Y G+ I + F+A+ DS
Sbjct: 203 LSSKGKGVLYVGDFNPPTR-GVTWAPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDS 261
Query: 224 GASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYNASS--------EEMLKVPD 274
G+++T +P +IY E+V K +S + ++G + C+ + K
Sbjct: 262 GSTYTHVPAQIYNEIVSKVRGTLSESSLEEVKGRALPLCWKGKKPFGSVNDVKNQFKALS 321
Query: 275 MRLIFSK---NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-------YGIIGQNFMMG 324
+++ ++ N +N++F + E CL ++ D + +IG M
Sbjct: 322 LKITHARGTNNLDIPPQNYLFVKEDGE----TCLAILDASLDPVLKELNFILIGAVTMQD 377
Query: 325 HRIVFDRENLKLAWSHSKCEEV 346
+++D E +L W ++C+ V
Sbjct: 378 LFVIYDNEKKQLGWVRAQCDRV 399
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 81/315 (25%), Positives = 129/315 (40%), Gaps = 38/315 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST---EDTSSSGYLVDDILHLA 101
+DP++SSS VSC +C++ S DYS + + + G L + L L
Sbjct: 172 FDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLG 231
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
++VQ V IGCG + +G ++ A G++GLG G +S+ L G F
Sbjct: 232 G-------TAVQ-GVAIGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQL--GGAAGGVF 278
Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSC--------- 211
S C +G G T +P G + + Y+VG+ +G
Sbjct: 279 SYCLASRGAGGA-----GSLVLGRTEAVPRGRRASSFYYVGLTGIGVGGERLPLQDSLFQ 333
Query: 212 LTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 270
LT+ G +V D+G + T LP E YA + FD + + S + CY+ S +
Sbjct: 334 LTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASV 393
Query: 271 KVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
+VP + F + + RN + G VFCL + I+G G +I
Sbjct: 394 RVPTVSFYFDQGAVLTLPARNLLVEV----GGAVFCLAFAPSSSGISILGNIQQEGIQIT 449
Query: 329 FDRENLKLAWSHSKC 343
D N + + + C
Sbjct: 450 VDSANGYVGFGPNTC 464
>gi|255558640|ref|XP_002520345.1| nucellin, putative [Ricinus communis]
gi|223540564|gb|EEF42131.1| nucellin, putative [Ricinus communis]
Length = 424
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 77/313 (24%), Positives = 130/313 (41%), Gaps = 30/313 (9%)
Query: 56 VSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 110
+SC PLC + + C+S D C Y Y+ E SS G LV D L + S
Sbjct: 117 LSCIDPLCSAVQNSGTYQCQSATDQCDYEIQYADEG-SSLGVLVTDYFPLRLMNG----S 171
Query: 111 SVQSSVIIGCGRKQTGSYLDGAAPD-GVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 169
++ + GCG Q P GV+GLG G S+ S L G++ N C
Sbjct: 172 FLRPKMTFGCGYDQKSPGPVAPPPTTGVLGLGNGKTSIISQLQALGVMGNVIGHCLSRKG 231
Query: 170 SGSVFFGDQGPATQQSTSFLPIGEK-YDAYFV-GVESYCIGNSCLTQSGFQALVDSGASF 227
G +FFG Q P S+ P+ +K D Y+ G G + + DSG+S+
Sbjct: 232 GGFLFFG-QDPVPSFGISWAPMSQKSLDKYYASGPAELLYGGKPTGTKAEEFIFDSGSSY 290
Query: 228 TFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNAS------SEEMLKVPDMRLIF 279
T+ ++Y + K +S K R + + + C+ + +E L F
Sbjct: 291 TYFNAQVYQSTLNLIRKELSGKPLRDAPEEKALAICWKGTKRFKSVNEVKSYFKPFALSF 350
Query: 280 SKNQS--FVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMGHRIVFDREN 333
+K +S + + N+G CL +++ G++ +IG N +++D +
Sbjct: 351 TKAKSVQLQIPPEDYLIVTNDGNV--CLGILNGSEVGLGNFNVIGDNLFQDKLVIYDSDK 408
Query: 334 LKLAWSHSKCEEV 346
++ W + C+ +
Sbjct: 409 HQIGWIPANCDRL 421
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 78/320 (24%), Positives = 138/320 (43%), Gaps = 36/320 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
++PS+S S + V CS P C+S S C S C Y+ +Y + + + G L +
Sbjct: 175 FNPSTSPSYRTVLCSSPTCQSLQSATGNLGVCGSNPPSCNYVVNYG-DGSYTRGELGTEH 233
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
L L + S+ ++ I GCGR G + G++GLG +S+ S + +
Sbjct: 234 LDLGN-------STAVNNFIFGCGRNNQGLF---GGASGLVGLGRSSLSLIS--QTSAMF 281
Query: 158 QNSFSICF---DENDSGSVFFGDQGPATQQS-----TSFLPIGEKYDAYFVGVESYCIGN 209
FS C + SGS+ G + + T +P + YF+ + +G+
Sbjct: 282 GGVFSYCLPITETEASGSLVMGGNSSVYKNTTPISYTRMIP-NPQLPFYFLNLTGITVGS 340
Query: 210 SCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
+ F ++DSG T LP IY + +F K S + C+N S
Sbjct: 341 VAVQAPSFGKDGMMIDSGTVITRLPPSIYQALKDEFVKQFSGFPSAPAFMILDTCFNLSG 400
Query: 267 EEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTV--MSTDGDYGIIGQNFMM 323
+ +++P++++ F N V +F F + + V CL + +S + + GIIG
Sbjct: 401 YQEVEIPNIKMHFEGNAELNVDVTGVFYFVKTDASQV-CLAIASLSYENEVGIIGNYQQK 459
Query: 324 GHRIVFDRENLKLAWSHSKC 343
R+++D + L ++ C
Sbjct: 460 NQRVIYDTKGSMLGFAAEAC 479
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 78/304 (25%), Positives = 133/304 (43%), Gaps = 52/304 (17%)
Query: 75 PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP 134
PC Y DY+ + +S++G+L D A+ S + V GCG + G G
Sbjct: 141 PCGYAYDYA-DGSSTTGFLARDT---ATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTG- 195
Query: 135 DGVMGLGLGDVSVPSLLAKAG-LIQNSFSICFDENDSGS-------VFFGDQGPATQQST 186
GV+GLG G +S P A++G L +FS C + + G +F G P + +
Sbjct: 196 -GVIGLGQGQLSFP---AQSGSLFAQTFSYCLLDLEGGRRGRSSSFLFLGR--PERRAAF 249
Query: 187 SFLPIGEKYDA---YFVGVESYCIGNSCLTQSGFQ----------ALVDSGASFTFLPTE 233
++ P+ A Y+VGV + +GN L G + ++DSG++ T+L
Sbjct: 250 AYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLG 309
Query: 234 IYAEVVVKFDKLVSSKRIS-----LQGNSWKYCYNASSEEMLK-----VPDMRLIFSKNQ 283
Y +V F V RI QG + CYN SS L P + + F++
Sbjct: 310 AYLHLVSAFAASVHLPRIPSSATFFQG--LELCYNVSSSSSLAPANGGFPRLTIDFAQGL 367
Query: 284 SFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYG--IIGQNFMMGHRIVFDRENLKLAWS 339
S + N++ ++ V CL + T + ++G G+ + FDR + ++ ++
Sbjct: 368 SLELPTGNYLVDVADD----VKCLAIRPTLSPFAFNVLGNLMQQGYHVEFDRASARIGFA 423
Query: 340 HSKC 343
++C
Sbjct: 424 RTEC 427
>gi|326498555|dbj|BAJ98705.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 508
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 79/306 (25%), Positives = 137/306 (44%), Gaps = 33/306 (10%)
Query: 62 LCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 121
L +++ C + K C Y Y+ + +SS+G L D + L + A ++ GC
Sbjct: 190 LQGNQNYCDTCKQ-CDYEIAYA-DRSSSAGVLARDNMELIT----ADGERENMDLVFGCA 243
Query: 122 RKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS--VFFGDQ 178
Q G L A+ DG++GL G +S+P+ LAK G+I N F C + SGS +F GD
Sbjct: 244 HDQQGKLLGSPASSDGILGLSNGAMSLPTQLAKQGIISNVFGHCIATDPSGSAYMFLGDD 303
Query: 179 GPATQQSTSFLPIG----EKYDAYFVGVESYCIGNSCLTQSG--FQALVDSGASFTFLPT 232
+ +++P+ + Y V C + Q+G Q + DSG+S+T+ P
Sbjct: 304 Y-VPRWGMTWVPVRNGPEDVYSTVVQKVNYGCQELNVREQAGKLTQVIFDSGSSYTYFPH 362
Query: 233 EIYAEVVVKFDKLVSSKRISLQGNSWKYC----YNASSEEMLKVPDMRLIFSKNQSFVVR 288
EIY ++ + + + +C + S + +K L+ +++++V
Sbjct: 363 EIYTSLITSLEAVSPGFVRDESDQTLPFCMKPNFPVRSVDDVKQLHKPLLLHFSKTWLVI 422
Query: 289 NHIFSF-PEN----EGFTVFCLTVMSTDG-DYG-----IIGQNFMMGHRIVFDRENLKLA 337
F PEN G CL V+ DG + G +IG + G + +D + ++
Sbjct: 423 PRTFEISPENYLIISGKGNVCLGVL--DGTEIGHSSTIVIGDVSLRGKLVAYDNDANQIG 480
Query: 338 WSHSKC 343
W+ S C
Sbjct: 481 WAQSDC 486
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 71/297 (23%), Positives = 132/297 (44%), Gaps = 29/297 (9%)
Query: 76 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAP 134
C Y Y + +S++GY V D + L + + +S S++ GCG +Q+G AA
Sbjct: 156 CEYRVAYG-DGSSTAGYFVRDHVVLDRVTGNFQTTSTNGSIVFGCGAQQSGQLGATSAAL 214
Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVF-FGDQGPATQQSTSFLPIGE 193
DG++G G + S+ S LA +G ++ F+ C D + G +F G+ ++T +P
Sbjct: 215 DGILGFGQANSSMISQLASSGKVKRVFAHCLDNINGGGIFAIGEVVQPKVRTTPLVPQQA 274
Query: 194 KYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVK-FDK 244
Y+ + +E + N L T ++DSG + + P IY ++ K F +
Sbjct: 275 HYNVFMKAIE---VDNEVLNLPTDVFDTDLRKGTIIDSGTTLAYFPDVIYEPLISKIFAR 331
Query: 245 LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNH--IFSFPENEGFTV 302
+ K +++ + Y+ + ++ P + F + S V H +F N+
Sbjct: 332 QSTLKLHTVEEQFTCFEYDGNVDDGF--PTVTFHFEDSLSLTVYPHEYLFDIDSNK---- 385
Query: 303 FCL-----TVMSTDGDYGIIGQNFMMGHRIV-FDRENLKLAWSHSKCEEVIDKSHVH 353
+C+ S DG I+ + ++ +R+V +D EN + W+ C I H
Sbjct: 386 WCVGWQNSGAQSRDGKDMILLGDLVLQNRLVMYDLENQTIGWTEYNCSSSIKVRDEH 442
>gi|449529533|ref|XP_004171754.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 82/331 (24%), Positives = 139/331 (41%), Gaps = 51/331 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI-- 97
Y P++++ ++C PLC S CKS D C Y +Y+ + SS G LV+D
Sbjct: 98 YKPNNNA----LNCFEPLCTSLHPITNHHCKSADDQCQYEIEYA-DHGSSLGVLVNDHVP 152
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD-GVMGLGLGDVSVPSLLAKAGL 156
L L + S AP+ + GCG S D + P GV+GLG G+VS S L+ G+
Sbjct: 153 LKLTNGSLAAPR------IAFGCGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGV 206
Query: 157 IQNSFSICFDENDSGSVFFGDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
++N C + + G +FFGD+ T S S IG Y + G G
Sbjct: 207 VRNVVGHCLSD-EGGFLFFGDEFVPSSGVTWTSMSHESIGSYYSS---GPAEVYFGGKAT 262
Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASS---- 266
+ DSG+S+T+ ++ Y ++ + K + + + S C+ +
Sbjct: 263 GIKDLTLVFDSGSSYTYFNSQAYNSILALVKNNLRGKPLEDAPEDKSLPVCWKGTRPFKS 322
Query: 267 ----EEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN----EGFTVFCLTVMSTD----GDY 314
++ + +R +KN + PEN + C +++ GD
Sbjct: 323 LRDVKKYFNLLALRFTKTKNAQIQLP------PENYLIITKYGNVCFGILNGTEVGLGDL 376
Query: 315 GIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
IIG + +++D E ++ W + C +
Sbjct: 377 NIIGDISLKDKMVIYDNERRRIGWFPTNCNK 407
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 78/329 (23%), Positives = 130/329 (39%), Gaps = 26/329 (7%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ +DP SS ++ +SCS C S S C + C Y Y + + +SG+ V D
Sbjct: 125 LNFFDPGSSVTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYG-DGSGTSGFYVSD 183
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 155
+L + + + V+ GC QTG + A DG+ G G +SV S LA G
Sbjct: 184 VLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQG 243
Query: 156 LIQNSFSICFD-ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
+ FS C EN G + G + + F P+ Y V + S + L
Sbjct: 244 IAPRVFSHCLKGENGGGGILV--LGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPI 301
Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNA 264
T +G ++D+G + +L Y V VS + + +GN CY
Sbjct: 302 NPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ---CYVI 358
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE--GFTVFCLTVMSTDGD-YGIIGQNF 321
++ P + L F+ S + + +N G V+C+ I+G
Sbjct: 359 TTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLV 418
Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVIDKS 350
+ V+D ++ W++ C ++ S
Sbjct: 419 LKDKIFVYDLVGQRIGWANYDCSTSVNVS 447
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 82/332 (24%), Positives = 147/332 (44%), Gaps = 34/332 (10%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSR--SSC--KSLKDPCPYIADYSTEDTS-SSGYL 93
D++L DP++SS+ + C C++ +SC ++L + I Y D S + G +
Sbjct: 120 DQDLPVLDPAASSTYAALPCGAARCRALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEI 179
Query: 94 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
D S + +S + GCG G + G+ G G G S+PS L
Sbjct: 180 ATDRFTFGD-SGGSGESLHTRRLTFGCGHLNKGVFQSNET--GIAGFGRGRWSLPSQLNV 236
Query: 154 AGLIQNSFSICFD---ENDSGSVFFGDQGPATQ--------QSTSFLPIGEKYDAYFVGV 202
SFS CF E+ S V G A ++T L + YF+ +
Sbjct: 237 -----TSFSYCFTSMFESKSSLVTLGGSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSL 291
Query: 203 ESYCIGNSCLT--QSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK 259
+ +G + L ++ F++ ++DSGAS T LP E+Y V +F V ++G++
Sbjct: 292 KGISVGKTRLPVPETKFRSTIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALD 351
Query: 260 YCYNASSEEMLK---VPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYG 315
C+ + + VP + L + R N++F E+ G V C+ + + G+
Sbjct: 352 LCFALPVTALWRRPAVPSLTLHLEGADWELPRSNYVF---EDLGARVMCIVLDAAPGEQT 408
Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
+IG +V+D EN +L+++ ++C+ ++
Sbjct: 409 VIGNFQQQNTHVVYDLENDRLSFAPARCDRLV 440
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 85/322 (26%), Positives = 143/322 (44%), Gaps = 36/322 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++PS SSS KN+ C LC S R + S ++ C Y Y + + S G L D L L S
Sbjct: 129 FNPSKSSSYKNIPCLSKLCHSVRDTSCSDQNSCQYKISYG-DSSHSQGDLSVDTLSLEST 187
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S +P S ++ +IGCG G++ G A G++GLG G VS+ + L + I FS
Sbjct: 188 SG-SPVSFPKT--VIGCGTDNAGTF--GGASSGIVGLGGGPVSLITQLGSS--IGGKFSY 240
Query: 164 CF------DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLTQSG 216
C + N S + FGD + P+ +K YF+ ++++ +GN + G
Sbjct: 241 CLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGG 300
Query: 217 F--------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
++DSG + T +P+++Y + LV R+ + CY+ S E
Sbjct: 301 SSEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSLCYSLKSNE 360
Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFS--FPENEGFTVFCLTVMSTDGD-YGIIG-QNFMMG 324
D +I + + + H S P +G F G +G + QN ++G
Sbjct: 361 Y----DFPIITAHFKGADIELHSISTFVPITDGIVCFAFQPSPQLGSIFGNLAQQNLLVG 416
Query: 325 HRIVFDRENLKLAWSHSKCEEV 346
+D + +++ + C +V
Sbjct: 417 ----YDLQQKTVSFKPTDCTKV 434
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 87/320 (27%), Positives = 134/320 (41%), Gaps = 36/320 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP--------CPYIADYSTEDTS-SSGYLVD 95
YDPS S + K +SC+ C SR +L DP C Y A Y DTS S GYL
Sbjct: 29 YDPSVSKTYKKLSCASVEC-SRLKAATLNDPLCETDSNACLYTASYG--DTSFSIGYLSQ 85
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KA 154
D+L L S S+ PQ GCG+ G + A G++GL +S+ + L+ K
Sbjct: 86 DLLTLTS-SQTLPQ------FTYGCGQDNQGLFGRAA---GIIGLARDKLSMLAQLSTKY 135
Query: 155 GLIQNSFSICFDENDSGSVFFGDQ-----GPATQQSTSFLPIGEKYDAYFVGVESYCIGN 209
G ++FS C +SGS G P + + T L + YF+ + + +
Sbjct: 136 G---HAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSG 192
Query: 210 SCLTQSG----FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNA 264
L + L+DSG T LP +YA + F K++S+K S C+
Sbjct: 193 RPLDLAAAMYRVPTLIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKG 252
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
S + + VP++++IF +R ++G T S IIG
Sbjct: 253 SLKSISAVPEIKMIFQGGADLTLRAPSILIEADKGITCLAFAGSSGTNQIAIIGNRQQQT 312
Query: 325 HRIVFDRENLKLAWSHSKCE 344
+ I +D ++ ++ C
Sbjct: 313 YNIAYDVSTSRIGFAPGSCH 332
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 78/329 (23%), Positives = 130/329 (39%), Gaps = 26/329 (7%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ +DP SS ++ +SCS C S S C + C Y Y + + +SG+ V D
Sbjct: 125 LNFFDPGSSVTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYG-DGSGTSGFYVSD 183
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 155
+L + + + V+ GC QTG + A DG+ G G +SV S LA G
Sbjct: 184 VLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQG 243
Query: 156 LIQNSFSICFD-ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
+ FS C EN G + G + + F P+ Y V + S + L
Sbjct: 244 IAPRVFSHCLKGENGGGGILV--LGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPI 301
Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNA 264
T +G ++D+G + +L Y V VS + + +GN CY
Sbjct: 302 NPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ---CYVI 358
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE--GFTVFCLTVMSTDGD-YGIIGQNF 321
++ P + L F+ S + + +N G V+C+ I+G
Sbjct: 359 TTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLV 418
Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVIDKS 350
+ V+D ++ W++ C ++ S
Sbjct: 419 LKDKIFVYDLVGQRIGWANYDCSTSVNVS 447
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 85/318 (26%), Positives = 144/318 (45%), Gaps = 41/318 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++P S+S +V C+ C + ++ C Y Y + T S G L + + + S
Sbjct: 134 FNPLKSTSFSHVPCNTQTCHAVDDGHCGVQGVCDYSYTYG-DRTYSKGDLGFEKITIGS- 191
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
SSV+S +IGCG +G + GV+GLG G +S+ S +++ I FS
Sbjct: 192 ------SSVKS--VIGCGHASSGGF---GFASGVIGLGGGQLSLVSQMSQTSGISRRFSY 240
Query: 164 CFD---ENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNS---CLTQS 215
C + +G + FG+ + P+ K Y++ +E+ IGN +
Sbjct: 241 CLPTLLSHANGKINFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNERHMAFAKQ 300
Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN--ASSEEMLKVP 273
G ++DSG + T LP E+Y VV K+V +KR+ S C++ ++ L +P
Sbjct: 301 G-NVIIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDGINAAASLGIP 359
Query: 274 DMRLIFS--KNQSFVVRNHIFSFPENEGFTVFCLTV--MSTDGDYGIIGQ----NFMMGH 325
+ FS N + + N +N V CLT+ S ++GIIG NF++G
Sbjct: 360 VITAHFSGGANVNLLPINTFRKVADN----VNCLTLKAASPTTEFGIIGNLAQANFLIG- 414
Query: 326 RIVFDRENLKLAWSHSKC 343
+D E +L++ + C
Sbjct: 415 ---YDLEAKRLSFKPTVC 429
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 78/333 (23%), Positives = 138/333 (41%), Gaps = 51/333 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS---RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
YDP +S + + + C+ P C+ C + C Y+ Y + ++SSG L D L L
Sbjct: 134 YDPRNSKTHRRIPCASPQCRGVLRYPGCDARTGGCVYMVVYG-DGSASSGDLATDTLVL- 191
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
P + +V +GCG G A G++G G G +S P+ LA A + F
Sbjct: 192 ------PDDTRVHNVTLGCGHDNEGLLASAA---GLLGAGRGQLSFPTQLAPA--YGHVF 240
Query: 162 SICFDE------NDSGSVFFGDQGPATQQSTSFLPIG---EKYDAYFVGVESYCIGNSCL 212
S C + N S + FG ST+F P+ + Y+V + + +G +
Sbjct: 241 SYCLGDRMSRARNSSSYLVFGRT--PELPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERV 298
Query: 213 TQSGFQ--------------ALVDSGASFTFLPTEIYAEVVVKFDKLVSS---KRISLQG 255
+GF +VDSG + + + YA V F ++ +R+ +
Sbjct: 299 --AGFSNASLALNPATGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKF 356
Query: 256 NSWKYCYNASSE---EMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMST 310
+ + CY+ ++VP + L F+ + N++ + T FCL + +
Sbjct: 357 SVFDTCYDVHGNGPGTGVRVPSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAA 416
Query: 311 DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
D ++G G +VFD E ++ ++ + C
Sbjct: 417 DDGLNVLGNVQQQGFGVVFDVERGRIGFTPNGC 449
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 87/320 (27%), Positives = 134/320 (41%), Gaps = 36/320 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP--------CPYIADYSTEDTS-SSGYLVD 95
YDPS S + K +SC+ C SR +L DP C Y A Y DTS S GYL
Sbjct: 168 YDPSVSKTYKKLSCASVEC-SRLKAATLNDPLCETDSNACLYTASYG--DTSFSIGYLSQ 224
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KA 154
D+L L S S+ PQ GCG+ G + A G++GL +S+ + L+ K
Sbjct: 225 DLLTLTS-SQTLPQ------FTYGCGQDNQGLFGRAA---GIIGLARDKLSMLAQLSTKY 274
Query: 155 GLIQNSFSICFDENDSGSVFFGDQ-----GPATQQSTSFLPIGEKYDAYFVGVESYCIGN 209
G ++FS C +SGS G P + + T L + YF+ + + +
Sbjct: 275 G---HAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSG 331
Query: 210 SCLTQSG----FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNA 264
L + L+DSG T LP +YA + F K++S+K S C+
Sbjct: 332 RPLDLAAAMYRVPTLIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKG 391
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
S + + VP++++IF +R ++G T S IIG
Sbjct: 392 SLKSISAVPEIKMIFQGGADLTLRAPSILIEADKGITCLAFAGSSGTNQIAIIGNRQQQT 451
Query: 325 HRIVFDRENLKLAWSHSKCE 344
+ I +D ++ ++ C
Sbjct: 452 YNIAYDVSTSRIGFAPGSCH 471
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 77/324 (23%), Positives = 135/324 (41%), Gaps = 31/324 (9%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ YD S + K VSC C + S C + C Y Y+ + +SS GY V D
Sbjct: 142 LTLYDIKESLTGKLVSCDQDFCYAINGGPPSYCIA-NMSCSYTEIYA-DGSSSFGYFVRD 199
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
I+ S +S SVI GC Q+G A DG++G G + S+ S LA +G
Sbjct: 200 IVQYDQVSGDLETTSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGK 259
Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--- 213
++ F+ C D + G +F G Q + P+ Y V +++ +G L
Sbjct: 260 VRKMFAHCLDGLNGGGIF--AIGHIVQPKVNTTPLVPNQTHYNVNMKAVEVGGYFLNLPT 317
Query: 214 -------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
+ G ++DSG + +LP +Y +++ K S ++ + + C+ S
Sbjct: 318 DVFDVGDKKG--TIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQFT-CFQYSE 374
Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL-----TVMSTD-GDYGIIGQN 320
P + F + V H + F + ++C+ + S D + ++G
Sbjct: 375 SLDDGFPAVTFHFENSLYLKVHPHEYLFSYD---GLWCIGWQNSGMQSRDRRNITLLGDL 431
Query: 321 FMMGHRIVFDRENLKLAWSHSKCE 344
+ +++D EN + W+ C+
Sbjct: 432 ALSNKLVLYDLENQVIGWTEYNCK 455
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 84/341 (24%), Positives = 140/341 (41%), Gaps = 46/341 (13%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYL 93
D+ +DPS SS+ + V+C P+C+ S S+C C Y+ Y + + ++GY+
Sbjct: 124 DQPFPLFDPSVSSTFRAVACPDPICRPSSGLSVSACALKTFRCFYLCSYG-DKSITAGYI 182
Query: 94 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
D S + S + GCG TG + + G+ G G G +S+PS L +
Sbjct: 183 FKDTFTFMSPNGEGAPPVAVSGLAFGCGDYNTGVFASNES--GIAGFGRGPLSLPSQL-R 239
Query: 154 AGLIQNSFSICF------DENDSGSVFFG---------DQGPATQQSTSFLPIGEKYDAY 198
G FS C + N + +VF G GP +ST + Y
Sbjct: 240 VG----RFSYCLTSHDETESNKTSAVFLGTPPNGLRAHSSGPF--RSTPIIHSPSFPTFY 293
Query: 199 FVGVESYCIGNSCL-TQSGFQAL---------VDSGASFTFLPTEIYAEVVVKFDKLVSS 248
++ +E +G + L S AL +DSG T P ++ ++ +F +
Sbjct: 294 YLSLEGITVGKTRLPVDSSVFALKKDGSGGTVIDSGTGVTTFPAAVFEQLKNEFVAQLPL 353
Query: 249 KRISLQGNSWKYCYNASSEEMLKVPDMRLIF---SKNQSFVVRNHIFSFPENEGFTVFCL 305
R + +VP +LIF S + N+I PE+ V CL
Sbjct: 354 PRYDNTSEVGNLLCFQRPKGGKQVPVPKLIFHLASADMDLPRENYI---PEDTDSGVMCL 410
Query: 306 TVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+ + D +IG IV+D EN KL ++ ++C+++
Sbjct: 411 MINGAEVDMVLIGNFQQQNMHIVYDVENSKLLFASAQCDKM 451
>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
[Cucumis sativus]
Length = 420
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 70/269 (26%), Positives = 112/269 (41%), Gaps = 22/269 (8%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ YD S++ K VSC C S C + CPY+ Y + +S++GY V D
Sbjct: 131 LTPYDLEESTTGKLVSCDEQFCLEVNGGPLSGCTT-NMSCPYLQIYG-DGSSTAGYFVKD 188
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLAKA 154
+ S ++ S+ GCG +Q+G A DG++G G + S+ S LA
Sbjct: 189 YVQYNRVSGDLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLAST 248
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ 214
++ F+ C D + G +F G Q + P+ Y V + +G+ L
Sbjct: 249 RKVKKMFAHCLDGTNGGGIF--AMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNI 306
Query: 215 SG--FQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-CYNAS 265
S F+A ++DSG + +LP IY +V K L + +Q +Y C+ S
Sbjct: 307 SADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVAKI--LSQQHNLEVQTIHGEYKCFQYS 364
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSF 294
P + F + V H + F
Sbjct: 365 ERVDDGFPPVIFHFENSLLLKVYPHEYLF 393
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 84/339 (24%), Positives = 145/339 (42%), Gaps = 40/339 (11%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGY 92
++N YDP SSS KN++C P C+ SS CK CPY Y ++ +
Sbjct: 231 EQNGPYYDPKDSSSFKNITCHDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDF 290
Query: 93 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
++ + + P+ + +V+ GCG G + A ++GLG G +S + L
Sbjct: 291 ALETFTVNLTTPEGKPELKIVENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFATQL- 346
Query: 153 KAGLIQNSFSICFDENDSGS------VFFGDQGPATQQSTSFLP-IGEKYDA----YFVG 201
L +SFS C + +S S +F D+ + + +F +G K + Y+V
Sbjct: 347 -QSLYGHSFSYCLVDRNSNSSVSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVL 405
Query: 202 VESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
++S +G L Q G ++DSG + T+ Y + F + + +
Sbjct: 406 IKSIMVGGEVLKIPEETWHLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPL 465
Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ--SFVVRNHIFSF-PENEGFTVFCLTVM 308
K CYN S E +++P+ ++F+ F V N+ PE+ V CL ++
Sbjct: 466 VETFPPLKPCYNVSGVEKMELPEFAILFADGAMWDFPVENYFIQIEPED----VVCLAIL 521
Query: 309 ST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
T IIG I++D + +L ++ KC +V
Sbjct: 522 GTPRSALSIIGNYQQQNFHILYDLKKSRLGYAPMKCADV 560
>gi|357469587|ref|XP_003605078.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506133|gb|AES87275.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 418
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 76/321 (23%), Positives = 128/321 (39%), Gaps = 50/321 (15%)
Query: 56 VSCSHPLCKS--------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHA 107
V CS P+C + C PC Y +Y+ ++ S+G L D +H+ S
Sbjct: 116 VKCSDPICAAVQPPFSTFGQKCAKPIPPCVYKVEYA-DNAESTGALARDYMHIGS----- 169
Query: 108 PQSSVQSSVIIGCGRKQT-GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 166
P S V+ GCG +Q + GV+GLG G +S+ S L G I N C
Sbjct: 170 PSGSNVPLVVFGCGYEQKFSGPTPPPSTPGVLGLGNGKISILSQLHSMGFIHNVLGHCLS 229
Query: 167 ENDSGSVFFGDQ---------GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 217
G +F GD+ P Q S EK+ Y G G
Sbjct: 230 AEGGGYLFLGDKFIPSSGIFWTPIIQSSL------EKH--YSTGPVDLFFNGKPTPAKGL 281
Query: 218 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS------WKYC--YNASSEEM 269
Q + DSG+S+T+ +Y V + + K + + WK + + +E
Sbjct: 282 QIIFDSGSSYTYFSPRVYTIVANMVNNDLKGKPLRRETKDPSLPICWKGVKPFKSLNEVN 341
Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMGH 325
+ L F+K+ +N F P + F CL +++ + G+ ++G +
Sbjct: 342 NYFKPLTLSFTKS-----KNLQFQLPPVK-FGNVCLGILNGNEAGLGNRNVVGDISLQDK 395
Query: 326 RIVFDRENLKLAWSHSKCEEV 346
+V+D E ++ W+ + C+++
Sbjct: 396 VVVYDNEKQQIGWASANCKQI 416
>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 488
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 73/325 (22%), Positives = 134/325 (41%), Gaps = 33/325 (10%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSS----CKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ +D + SSS++ + C+ P+C + S+ C + D C Y Y + + +SG+ V D
Sbjct: 127 ELNLFDTTKSSSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYR-DRSGTSGFYVTD 185
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAG 155
+H + ++ ++++ GC Q G A DG+ G G G+ SV S L+ G
Sbjct: 186 SMHFDILLGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRG 245
Query: 156 LIQNSFSICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC-- 211
+ FS C EN G + G+ + S + P+ Y + ++S +
Sbjct: 246 ITPKVFSHCLKGGENGGGILVLGE---ILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFP 302
Query: 212 ------LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
++ +G + ++DSG + +L E+Y +V VS + C+ S
Sbjct: 303 NPTMFPISNAG-ETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQ-CFRVS 360
Query: 266 SEEMLKVPDMRLIFSKNQSFVVR-------NHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
P +R F S VV + I P ++C+ + I+G
Sbjct: 361 MSVADIFPVLRFNFEGIASMVVTPEEYLQFDSIVREP-----ALWCIGFQKAEDGLNILG 415
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKC 343
+ IV+D ++ W++ C
Sbjct: 416 DLVLKDKIIVYDLARQRIGWANYDC 440
>gi|357124567|ref|XP_003563970.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 395
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 84/331 (25%), Positives = 138/331 (41%), Gaps = 39/331 (11%)
Query: 51 SSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 106
+ K V PLC+ +++ C++ K C Y Y+ + +SS G L D + L +
Sbjct: 62 TEGKIVHPRDPLCEELQGNQNYCETCKQ-CDYEITYA-DRSSSKGVLARDNMQLTT---- 115
Query: 107 APQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 165
A + GC Q G LD + DG++GL G +S+ + LA +G+I N F C
Sbjct: 116 ADGEMKNVDFVFGCAHNQQGKLLDSPTSTDGILGLSNGAISLSTQLANSGIISNVFGHCM 175
Query: 166 --DENDSGSVFFGDQGPATQQSTSFLPIGE-KYDAYFVGVESYCIGNSCLTQSG-----F 217
D + G +F GD + +++PI + Y V G L G
Sbjct: 176 ATDPSSGGYMFLGDDY-VPRWGMTWVPIRNGPGNVYSTEVPKVNYGAQELNLRGQAGKLT 234
Query: 218 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 277
Q + DSG+S+T+ P EIY ++ + + +C + + V D+
Sbjct: 235 QVIFDSGSSYTYFPHEIYTNLIALLEDASPGFVRDESDQTLPFCMKPNV-PVRSVGDVEQ 293
Query: 278 IFS------KNQSFVVRNHIFSFPENEGFTV----FCLTVMSTDG-DYG-----IIGQNF 321
+F+ + + FV+ PEN CL V+ DG + G IIG
Sbjct: 294 LFNPLILQLRKRWFVIPTTFAISPENYLIISDKGNVCLGVL--DGTEIGHSSTIIIGDAS 351
Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVIDKSHV 352
+ G +V+D + ++ W S C +S V
Sbjct: 352 LRGKFVVYDNDENRIGWVQSDCTRPQKQSRV 382
>gi|255541790|ref|XP_002511959.1| protein with unknown function [Ricinus communis]
gi|223549139|gb|EEF50628.1| protein with unknown function [Ricinus communis]
Length = 583
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 79/294 (26%), Positives = 126/294 (42%), Gaps = 34/294 (11%)
Query: 76 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAP 134
C Y +Y+ + +SS G L D LHL A SS GC Q G L+
Sbjct: 284 CDYEIEYA-DHSSSMGVLARDELHLT----MANGSSTNLKFNFGCAYDQQGLLLNTLVKT 338
Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIG 192
DG++GL VS+PS LA G+I N C D G +F GD + S++P+
Sbjct: 339 DGILGLSKAKVSLPSQLANRGIINNVVGHCLANDVVGGGYMFLGDDF-VPRWGMSWVPML 397
Query: 193 E--KYDAYFVGVESYCIGNSCLTQSGFQALV-----DSGASFTFLPTEIYAEVVVKFDKL 245
+ D+Y + G+ L+ G + V DSG+S+T+ E Y+E+V ++
Sbjct: 398 DSPSIDSYQTQIMKLNYGSGPLSLGGQERRVRRIVFDSGSSYTYFTKEAYSELVASLKQV 457
Query: 246 VSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSK-----NQSFVVRNHIFSFPENEG 299
I + + +C+ A + V D++ F + + + F P EG
Sbjct: 458 SGEALIQDTSDPTLPFCWRAKF-PIRSVIDVKQYFKTLTLQFGSKWWIISTKFRIPP-EG 515
Query: 300 FTVF------CLTVMS----TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
+ + CL ++ DG I+G + G I++D N K+ W+ S C
Sbjct: 516 YLIISNKGNVCLGILDGSDVHDGSSIILGDISLRGQLIIYDNVNNKIGWTQSDC 569
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 70/316 (22%), Positives = 131/316 (41%), Gaps = 32/316 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
+DPS SS+ +++CS C+ + +C S K CPY Y+ +D+ + G L D L
Sbjct: 176 FDPSKSSTYSDITCSSRECQELGSSHKHNCSSDKK-CPYEITYA-DDSYTVGNLARDTLT 233
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
L +P +V + GCG GS+ DG++GLG G S+ S + A
Sbjct: 234 L------SPTDAVP-GFVFGCGHNNAGSF---GEIDGLLGLGRGKASLSSQV--AARYGA 281
Query: 160 SFSICFDENDSGSVFFGDQG-----PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT- 213
FS C + S + + G P Q T + G+ Y++ + + +
Sbjct: 282 GFSYCLPSSPSATGYLSFSGAAAAAPTNAQFTEMV-AGQHPSFYYLNLTGITVAGRAIKV 340
Query: 214 -----QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
+ ++DSG +F+ LP YA + + + + + CY+ + E
Sbjct: 341 PPSVFATAAGTIIDSGTAFSCLPPSAYAALRSSVRSAMGRYKRAPSSTIFDTCYDLTGHE 400
Query: 269 MLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
+++P + L+F+ + + + + N T D G++G +
Sbjct: 401 TVRIPSVALVFADGATVHLHPSGVLYTWSNVSQTCLAFLPNPDDTSLGVLGNTQQRTLAV 460
Query: 328 VFDRENLKLAWSHSKC 343
++D +N K+ + + C
Sbjct: 461 IYDVDNQKVGFGANGC 476
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 73/328 (22%), Positives = 136/328 (41%), Gaps = 24/328 (7%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ +D SS++ + CS P+C SR + C + C Y Y + + +SGY V D
Sbjct: 122 LNFFDTVGSSTAALIPCSDPICTSRVQGAAAECSPRVNQCSYTFQYG-DGSGTSGYYVSD 180
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAG 155
++ + P + ++++ GC Q+G A DG+ G G G +SV S L+ G
Sbjct: 181 AMYFSLIMGQPPAVNSSATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRG 240
Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
+ FS C + D G + S + P+ Y + ++S + L
Sbjct: 241 ITPKVFSHCL-KGDGDGGGVLVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLPIN 299
Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNA 264
+ + +VD G + +L E Y +V + V S+++ + +GN CY
Sbjct: 300 PAVFSISNNRGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKGNQ---CYLV 356
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN--EGFTVFCLTVMSTDGDYGIIGQNFM 322
S+ P + L F S V++ + +G ++C+ I+G +
Sbjct: 357 STSIGDIFPSVSLNFEGGASMVLKPEQYLMHNGYLDGAEMWCIGFQKFQEGASILGDLVL 416
Query: 323 MGHRIVFDRENLKLAWSHSKCEEVIDKS 350
+V+D ++ W++ C ++ S
Sbjct: 417 KDKIVVYDIAQQRIGWANYDCSLSVNVS 444
>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
Length = 407
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 81/319 (25%), Positives = 140/319 (43%), Gaps = 36/319 (11%)
Query: 53 SKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 105
K V C+ PLC + C+ D C Y +Y+ + T+S G L+ D L + S
Sbjct: 89 KKLVPCADPLCDALHKDLGTTKDCREEPDQCHYQINYA-DGTTSLGVLLLDKFSLPTGS- 146
Query: 106 HAPQSSVQSSVIIGCGRKQTGSYLDGAAP----DGVMGLGLGDVSVPSLLAKAGLI-QNS 160
++ GCG Q A DG++GLG G V + S L +G + +N
Sbjct: 147 -------ARNIAFGCGYDQMQGPKKKAPEKVPVDGILGLGRGSVDLVSQLKHSGAVSKNV 199
Query: 161 FSICFDENDSGSVFFGDQG-PATQQSTSFLP-IGEKYDAYFVGVESYCIGNSCLTQSGFQ 218
C G +F G++ P++ ++ I + + Y G + +G + + F+
Sbjct: 200 IGHCLSSKGGGYLFIGEENVPSSHLHIIYIYCISREPNHYSPGQATLHLGRNPIGTKPFK 259
Query: 219 ALVDSGASFTFLPTEIYAEVV--VKFDKLVSS-KRISLQGNSWKYCYNASSEEMLKVPDM 275
A+ DSG+++T+LP ++A++V +K + SS K +S C+ + V D+
Sbjct: 260 AIFDSGSTYTYLPENLHAQLVSALKASLIKSSLKLVSDTDTRLHLCWKG-PKPFKTVHDL 318
Query: 276 RLIFSKNQSFVVRNHIFSF---PEN----EGFTVFCLTVMSTDG-DYGIIGQNFMMGHRI 327
F K+ + +H + PEN G C ++ G D +IG M +
Sbjct: 319 PKEF-KSLVTLKFDHGVTMTIPPENYLIITGHGNACFGILELPGYDLFVIGGISMQEQLV 377
Query: 328 VFDRENLKLAWSHSKCEEV 346
+ D E +LAW S C+++
Sbjct: 378 IHDNEKGRLAWMPSPCDKM 396
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 77/329 (23%), Positives = 137/329 (41%), Gaps = 25/329 (7%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
L+ +D SSSS++ V CS P+C S + C + C Y Y + + +SGY V
Sbjct: 109 QLNFFDSSSSSTAGLVHCSDPICTSAVQTTVTQCSPQTNQCSYTFQYE-DGSGTSGYYVS 167
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKA 154
D L+ + + + + ++ GC Q+G + A DG+ G G G++SV S L+
Sbjct: 168 DTLYFDAILGESLVVNSSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTH 227
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
G+ FS C + + G + + P+ Y + ++S + L
Sbjct: 228 GITPRVFSHCL-KGEGIGGGILVLGEILEPGMVYSPLVPSQPHYNLNLQSIAVNGKLLPI 286
Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYNA 264
T + +VDSG + +L E Y V + +VS + +GN CY
Sbjct: 287 DPSVFATSNSQGTIVDSGTTLAYLVAEAYDPFVSAVNVIVSPSVTPIISKGNQ---CYLV 343
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVR--NHIFSF-PENEGFTVFCLTVMSTDGDYGIIGQNF 321
S+ P F+ S V++ +++ F P G ++C+ G I+G
Sbjct: 344 STSVSQMFPLASFNFAGGASMVLKPEDYLIPFGPSQGGSVMWCIGFQKVQG-VTILGDLV 402
Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVIDKS 350
+ V+D ++ W++ C ++ S
Sbjct: 403 LKDKIFVYDLVRQRIGWANYDCSLSVNVS 431
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 80/291 (27%), Positives = 127/291 (43%), Gaps = 41/291 (14%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+ P SSS V C+ +C S K C Y Y+ E +SSSG L +DI+
Sbjct: 130 RFQPDLSSSYSPVKCN-----VDCTCDSDKKQCTYERQYA-EMSSSSGVLGEDIVSFGRE 183
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S+ Q +V GC +TG A DG+MGLG G +S+ L + G+I +SFS+
Sbjct: 184 SELKAQRAV-----FGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVINDSFSL 237
Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------Q 214
C+ D G V G P+ + P+ Y Y + ++ + L
Sbjct: 238 CYGGMDIGGGAMVLGGVPTPSDMVFSRSDPLRSPY--YNIELKEIHVAGKALRVDSRIFD 295
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL---QGNSWKY---CYNASSEE 268
S ++DSG ++ +LP + + + F V+SK SL +G Y C+ +
Sbjct: 296 SKHGTVLDSGTTYAYLPEQAF----MAFKDAVTSKVHSLKKIRGPDPSYKDICFAGARRN 351
Query: 269 MLKV----PDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGD 313
+ K+ PD+ ++F Q S N++F + +G +CL V D
Sbjct: 352 VSKLHEVFPDVDMVFGNGQKLSLTPENYLFRHSKVDG--AYCLGVFQNGKD 400
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 84/331 (25%), Positives = 134/331 (40%), Gaps = 44/331 (13%)
Query: 40 RNLSE-YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
R LS YDP SS+ CS P C++ +C C Y Y + +S+SG L D L
Sbjct: 135 RQLSPLYDPRGSSTYAQTPCSPPQCRNPQTCDGTTGGCGYRIVYG-DASSTSGNLATDRL 193
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
++ +SV +V +GCG G + A G++G+ G+ S + +A +
Sbjct: 194 VFSN------DTSV-GNVTLGCGHDNEGLFGSAA---GLLGVARGNNSFATQVADS--YG 241
Query: 159 NSFSICF-DENDSGS----VFFGDQGPATQQSTSFLPIG---EKYDAYFVGVESYCIGNS 210
F+ C D SGS + FG P S F P+ + Y+V + + +G
Sbjct: 242 RYFAYCLGDRTRSGSSSSYLVFGRTAPEPPSSV-FTPLRSNPRRPSLYYVDMVGFSVGGE 300
Query: 211 CLTQSGFQ--------------ALVDSGASFTFLPTEIYAEVVVKFDKL---VSSKRISL 253
+T GF +VDSG S T + Y + FD V +++
Sbjct: 301 PVT--GFSNASLSLDPATGRGGVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGR 358
Query: 254 QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEG-FTVFCLTVMSTDG 312
+ + CY+ + P + L F+ + + PE G + F L DG
Sbjct: 359 GISVFDACYDLRGVAVADAPGVVLHFAGGADVALPPENYLVPEESGRYHCFALEAAGHDG 418
Query: 313 DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
+IG R+VFD EN ++ + + C
Sbjct: 419 -LSVIGNVLQQRFRVVFDVENERVGFEPNGC 448
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 80/322 (24%), Positives = 132/322 (40%), Gaps = 35/322 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
++PSSSSS K + CS LC + L + C Y ADY + + D+++ +F
Sbjct: 58 FNPSSSSSFKVLDCSSSLCLNLDVMGCLSNKCLYQADYGDGSFTMGELVTDNVVLDDAF- 116
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
P V +++ +GCG G++ A G++GLG G +S P+ L + +N FS C
Sbjct: 117 --GPGQVVLTNIPLGCGHDNEGTFGTAA---GILGLGRGPLSFPNNLDAS--TRNIFSYC 169
Query: 165 F-----DENDSGSVFFGDQG-PATQQ-STSFLPIGEKYDA---YFVGVESYCIGNSCLTQ 214
D N ++ FGD P T S F+P Y+V + +G + LT
Sbjct: 170 LPDRESDPNHKSTLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTN 229
Query: 215 ---SGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
S FQ + DSG + T L Y V F + + CY+
Sbjct: 230 IPASVFQLDSHGNGGTIFDSGTTITRLEARAYTAVRDAFRAATMHLTSAADFKIFDTCYD 289
Query: 264 ASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNF 321
+ + VP + F + + N+I N +FC ++ G +IG
Sbjct: 290 FTGMNSISVPTVTFHFQGDVDMRLPPSNYIVPVSNNN---IFCFAFAASMGP-SVIGNVQ 345
Query: 322 MMGHRIVFDRENLKLAWSHSKC 343
R+++D + ++ +C
Sbjct: 346 QQSFRVIYDNVHKQIGLLPDQC 367
>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 491
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 73/323 (22%), Positives = 135/323 (41%), Gaps = 26/323 (8%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSS----CKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ +D + SSS++ + C+ P+C + S+ C + D C Y Y + + +SG+ V D
Sbjct: 127 ELNLFDTTKSSSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYR-DRSGTSGFYVTD 185
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAG 155
+H + ++ ++++ GC Q G A DG+ G G G+ SV S L+ G
Sbjct: 186 SMHFDILLGESTIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRG 245
Query: 156 LIQNSFSICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC-- 211
+ FS C EN G + G+ + S + P+ Y + ++S +
Sbjct: 246 ITPKVFSHCLKGGENGGGILVLGE---ILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFP 302
Query: 212 ------LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
++ +G + ++DSG + +L E+Y +V VS + C+ S
Sbjct: 303 NPTMFPISNAG-ETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQ-CFRVS 360
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIF----SFPENEGF-TVFCLTVMSTDGDYGIIGQN 320
P +R F S VV + S F +++C+ + I+G
Sbjct: 361 MSVADIFPVLRFNFEGIASMVVTPEEYLQFDSIVSCYKFASLWCIGFQKAEDGLNILGDL 420
Query: 321 FMMGHRIVFDRENLKLAWSHSKC 343
+ IV+D ++ W++ C
Sbjct: 421 VLKDKIIVYDLAQQRIGWANYDC 443
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 81/318 (25%), Positives = 142/318 (44%), Gaps = 55/318 (17%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+ PS SS+ K + C+ P+CK+ + YL D L L S +
Sbjct: 132 FHPSKSSTYKTIPCTSPICKN----------------------ADGHYLGVDTLTLNS-N 168
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
P S +++IGCG + G L+G G +GL G +S S L + I FS C
Sbjct: 169 NGTPISF--KNIVIGCGHRNQGP-LEGYV-SGNIGLARGPLSFISQLNSS--IGGKFSYC 222
Query: 165 F-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----TQS 215
EN S + FGD+ + T PI E+ + YFV +E++ +G+ + + +
Sbjct: 223 LVPLFSKENVSSKLHFGDKSTVSGLGTVSTPIKEE-NGYFVSLEAFSVGDHIIKLENSDN 281
Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML-KVPD 274
+++DSG + T LP ++Y+ + +V KR+ + CY +S +L KV
Sbjct: 282 RGNSIIDSGTTMTILPKDVYSRLESVVLDMVKLKRVKDPSQQFNLCYQTTSTTLLTKVLI 341
Query: 275 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY-------GIIGQNFMMGHRI 327
+ FS ++ + + F +P + V C +S G++ ++ QNF++G
Sbjct: 342 ITAHFSGSEVHLNALNTF-YPITD--EVICFAFVS-GGNFSSLAIFGNVVQQNFLVG--- 394
Query: 328 VFDRENLKLAWSHSKCEE 345
FD +++ + C +
Sbjct: 395 -FDLNKKTISFKPTDCTK 411
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 84/325 (25%), Positives = 138/325 (42%), Gaps = 40/325 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP S S V C+ PLC+ S C + C Y Y + + ++G + L A
Sbjct: 182 FDPRRSRSYNAVGCAAPLCRRLDSGGCDLRRSACLYQVAYG-DGSVTAGDFATETLTFAG 240
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
++ A V +GCG G ++ A ++GLG G +S P+ +++ SFS
Sbjct: 241 GARVA-------RVALGCGHDNEGLFVAAAG---LLGLGRGSLSFPTQISR--RYGRSFS 288
Query: 163 ICFDEN--------DSGSVFFGDQGPATQQSTSFLPI--GEKYDAYFV---------GVE 203
C + S +V FG + ++SF P+ + + ++ G
Sbjct: 289 YCLVDRTSSANTASRSSTVTFGSGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGAR 348
Query: 204 SYCIGNSCLT---QSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-W 258
+ NS L SG +VDSG S T L Y+ + F + R+S G S +
Sbjct: 349 VPGVANSDLRLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRGAAAGLRLSPGGFSLF 408
Query: 259 KYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
CY+ S +++KVP + + F+ + + P + T FC TDG IIG
Sbjct: 409 DTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSKGT-FCFAFAGTDGGVSIIG 467
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKC 343
G R+VFD + ++A++ C
Sbjct: 468 NIQQQGFRVVFDGDGQRVAFTPKGC 492
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 77/319 (24%), Positives = 134/319 (42%), Gaps = 34/319 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
++PS S S + + C+ C+S C S C Y+ +Y + + + G L +
Sbjct: 107 FNPSGSPSYQTILCNSSTCQSLQYATGNLGVCGSNTPTCNYVVNYG-DGSYTRGDLGMEQ 165
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
L+L + H S+ I GCGR G + G+MGLG D+S+ S + +
Sbjct: 166 LNLGT--THV------SNFIFGCGRNNKGLF---GGASGLMGLGKSDLSLVS--QTSAIF 212
Query: 158 QNSFSICFDE---NDSGSVFFGDQGPATQQST-----SFLPIGEKYDAYFVGVESYCIGN 209
+ FS C + SGS+ G + +T + + YF+ + IG
Sbjct: 213 EGVFSYCLPTTAADASGSLILGGNSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGG 272
Query: 210 SCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
L ++ L+DSG T LP +Y ++ +F K S + + C+N +
Sbjct: 273 VALQAPNYRQSGILIDSGTVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNG 332
Query: 267 EEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTV-FCLTVMSTDGDYGIIGQNFMMG 324
+ + +P +R+ F N V IF F + + V L +S D + IIG
Sbjct: 333 YDEVDIPTIRMQFEGNAELTVDVTGIFYFVKTDASQVCLALASLSFDDEIPIIGNYQQRN 392
Query: 325 HRIVFDRENLKLAWSHSKC 343
R++++ + KL ++ C
Sbjct: 393 QRVIYNTKESKLGFAAEAC 411
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 76/327 (23%), Positives = 131/327 (40%), Gaps = 23/327 (7%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSR--------SSCKSLKDPCPYIADYSTEDTSSSGYL 93
L ++P SSS++ ++CS C + + S PC Y Y + + +SGY
Sbjct: 135 LESFNPDSSSTASRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYG-DGSGTSGYY 193
Query: 94 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLA 152
V D + + + ++ +S++ GC Q+G A DG+ G G +SV S L
Sbjct: 194 VSDTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLN 253
Query: 153 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC------ 206
G+ FS C +D+G G + + P+ Y + +ES
Sbjct: 254 SLGVSPKVFSHCLKGSDNGGGIL-VLGEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKL 312
Query: 207 -IGNSCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
I +S T S Q +VDSG + +L Y V VS SL + C+
Sbjct: 313 PIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQ-CFIT 371
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNF 321
SS P + L F + V+ N++ + ++C+ G + I+G
Sbjct: 372 SSSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLV 431
Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVID 348
+ V+D N+++ W+ C ++
Sbjct: 432 LKDKIFVYDLANMRMGWADYDCSMSVN 458
>gi|356515904|ref|XP_003526637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 77/315 (24%), Positives = 127/315 (40%), Gaps = 36/315 (11%)
Query: 56 VSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 110
V C PLCK+ S C + C Y +Y+ + SS G L+ D + L K S
Sbjct: 114 VKCGDPLCKAIQSAPNHHCAGPNEQCDYEVEYA-DQGSSLGVLLRDNIPL----KFTNGS 168
Query: 111 SVQSSVIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 169
+ + GCG Q + A+ GV+GLG G S+ S L GLI+N C E
Sbjct: 169 LARPILAFGCGYDQKHVGHNPSASTAGVLGLGNGKTSILSQLHSLGLIRNVVGHCLSERG 228
Query: 170 SGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQSGFQALVDSGASF 227
G +FFGDQ Q + P+ + Y G + G Q + DSG+S+
Sbjct: 229 GGFLFFGDQ-LVPQSGVVWTPLLQSSSTQHYKTGPADLFFDRKPTSVKGLQLIFDSGSSY 287
Query: 228 TFLPTEIYAEVVVKFDKLVSSKRIS--LQGNSWKYCYNASS------EEMLKVPDMRLIF 279
T+ ++ + +V + K +S + +S C+ + + L F
Sbjct: 288 TYFNSKAHKALVNLVTNDLRGKPLSRATEDSSLPICWRGPKPFKSLHDVTSNFKPLLLSF 347
Query: 280 SKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTD----GDYGIIGQNFMMGHRIVFD 330
+K+ +N + P V CL ++ G+ IIG + +++D
Sbjct: 348 TKS-----KNSLLQLPPEAYLIVTKHGNVCLGILDGTEIGLGNTNIIGDISLQDKLVIYD 402
Query: 331 RENLKLAWSHSKCEE 345
E ++ W+ + C+
Sbjct: 403 NEKQQIGWASANCDR 417
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 75/328 (22%), Positives = 145/328 (44%), Gaps = 24/328 (7%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+L +D S ++ +V+CS P+C S + C S + C Y Y + + +SGY +
Sbjct: 143 DLHFFDAPGSFTAGSVTCSDPICSSVFQTTAAQC-SENNQCGYSFRYG-DGSGTSGYYMT 200
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKA 154
D + + + ++ + ++ GC Q+G A DG+ G G G +SV S L+
Sbjct: 201 DTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSR 260
Query: 155 GLIQNSFSICFDENDSGSVFF--GDQGPATQQSTSFLPIGEKYDAYF--VGVESYCIGNS 210
G+ FS C + SG F G+ + LP Y+ +GV +
Sbjct: 261 GITPPVFSHCLKGDGSGGGVFVLGEILVPGMVYSPLLPSQPHYNLNLLSIGVNGQILP-- 318
Query: 211 CLTQSGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
+ + F+A +VD+G + T+L E Y + V S+ ++L ++ + CY
Sbjct: 319 -IDAAVFEASNTRGTIVDTGTTLTYLVKEAYDPFLNAISNSV-SQLVTLIISNGEQCYLV 376
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
S+ P + L F+ S ++R +++F + +G +++C+ + I+G +
Sbjct: 377 STSISDMFPPVSLNFAGGASMMLRPQDYLFHYGFYDGASMWCIGFQKAPEEQTILGDLVL 436
Query: 323 MGHRIVFDRENLKLAWSHSKCEEVIDKS 350
V+D ++ W++ C ++ S
Sbjct: 437 KDKVFVYDLARQRIGWANYDCSMSVNVS 464
>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
Length = 506
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 80/332 (24%), Positives = 140/332 (42%), Gaps = 43/332 (12%)
Query: 34 ASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSS-GY 92
+SI+ ++ YDP S ++ +CS PLC SC+ + C Y D S EDTSSS G
Sbjct: 132 SSIIMQGPITLYDPELSITASPATCSDPLCSEGGSCRGNNNSCAY--DISYEDTSSSTGI 189
Query: 93 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
D++HL ++S+ +++ +GC +G + DG+MG G VSVP+ LA
Sbjct: 190 YFRDVVHLGH------KASLNTTMFLGCATSISGLW----PVDGIMGFGRSKVSVPNQLA 239
Query: 153 KAGLIQNSFSICFD-ENDSGSVFF---GDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG 208
N F C E + G + D+ P + P+ Y V + S +
Sbjct: 240 AQAGSYNIFYHCLSGEKEGGGILVLGKNDEFP----EMVYTPMLANDIVYNVKLVSLSVN 295
Query: 209 NSCL--TQSGFQ---------ALVDSGASFTFLPTE---IYAEVVVKFDKLVSSKRISLQ 254
+ L S F+ ++DSG S P++ ++ + V KF + + +
Sbjct: 296 SKALPIEASEFEYNATVGNGGTIIDSGTSSATFPSKALALFVKAVSKFTTAIPTAPLESS 355
Query: 255 GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIF-------SFPENEGFTVFCLTV 307
G+ + + + P++ L F + + H + E+ F L
Sbjct: 356 GSPCFISISDRNSVEVDFPNVTLKFDGGATMELTAHNYLEAVVSRKLSESTHFQGVRLVC 415
Query: 308 MS-TDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
+S + G+ I+G + +V+D E ++ W
Sbjct: 416 ISWSVGNSTILGDAILKDKVVVYDMEKSRIGW 447
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 76/327 (23%), Positives = 131/327 (40%), Gaps = 23/327 (7%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSR--------SSCKSLKDPCPYIADYSTEDTSSSGYL 93
L ++P SSS++ ++CS C + + S PC Y Y + + +SGY
Sbjct: 133 LESFNPDSSSTASRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYG-DGSGTSGYY 191
Query: 94 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLA 152
V D + + + ++ +S++ GC Q+G A DG+ G G +SV S L
Sbjct: 192 VSDTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLN 251
Query: 153 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC------ 206
G+ FS C +D+G G + + P+ Y + +ES
Sbjct: 252 SLGVSPKVFSHCLKGSDNGGGIL-VLGEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKL 310
Query: 207 -IGNSCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
I +S T S Q +VDSG + +L Y V VS SL + C+
Sbjct: 311 PIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQ-CFIT 369
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNF 321
SS P + L F + V+ N++ + ++C+ G + I+G
Sbjct: 370 SSSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLV 429
Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVID 348
+ V+D N+++ W+ C ++
Sbjct: 430 LKDKIFVYDLANMRMGWADYDCSMSVN 456
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 90/322 (27%), Positives = 140/322 (43%), Gaps = 37/322 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSL-KDPCPYIADYSTEDTSSSGYLVDDILHLA 101
+DPS SS+ K + CS P CK+ + C S K C Y Y E S G L D L L
Sbjct: 131 FDPSKSSTYKTIPCSSPKCKNVENTHCSSDDKKVCEYSFTYGGE-AYSQGDLSIDTLTLN 189
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
S + P S +++IGCG + G L+G G +GLG G +S S L + I F
Sbjct: 190 S-NNDTPISF--KNIVIGCGHRNKGP-LEGYV-SGNIGLGRGPLSFISQLNSS--IGGKF 242
Query: 162 SICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---- 212
S C +E SG + FGD+ + T PI Y + + +G+ +
Sbjct: 243 SYCLVPLFSNEGISGKLHFGDKSVVSGVGTVSTPITAGEIGYSTTLNALSVGDHIIKFEN 302
Query: 213 ----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
+ ++DSG + T LP +Y+ + +V +R +K CY A+ +
Sbjct: 303 STSKNDNLGNTIIDSGTTLTILPENVYSRLESIVTSMVKLERAKSPNQQFKLCYKATLKN 362
Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG-IIG----QNFMM 323
L VP + F+ + + F ++E V C +S G IIG QNF++
Sbjct: 363 -LDVPIITAHFNGADVHLNSLNTFYPIDHE---VVCFAFVSVGNFPGTIIGNIAQQNFLV 418
Query: 324 GHRIVFDRENLKLAWSHSKCEE 345
G FD + +++ + C +
Sbjct: 419 G----FDLQKNIISFKPTDCTK 436
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 78/328 (23%), Positives = 137/328 (41%), Gaps = 29/328 (8%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ YDP SS++ VSCS PLC + + C + C YI Y + ++S GY V D
Sbjct: 46 LTMYDPRESSTTSLVSCSDPLCVRGRRFAEAQCSQATNNCEYIFSYG-DGSTSEGYYVRD 104
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
+ S + ++ S V+ GC +QTG A DG++G G ++SVP+ LA
Sbjct: 105 AMQYNVISSNG-LANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQ 163
Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
I FS C E + G + ++ P+ Y V + + ++ L
Sbjct: 164 NIPRVFSHCL-EGEKRGGGILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPID 222
Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
+ + ++DSG + + P+ Y V + S+ + +QG + C+ S
Sbjct: 223 AEDFSSTNDTGVIMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQ-CFLVSGR 281
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPEN--EGFT-VFCLTVMSTDGDYG--------I 316
P++ L F + ++ + G T V+C+ S+ G I
Sbjct: 282 LSDLFPNVTLNFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTI 341
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCE 344
+G + +V+D +N ++ W C+
Sbjct: 342 LGDIVLKDKLVVYDLDNSRIGWMSYNCK 369
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 76/327 (23%), Positives = 131/327 (40%), Gaps = 23/327 (7%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSR--------SSCKSLKDPCPYIADYSTEDTSSSGYL 93
L ++P SSS++ ++CS C + + S PC Y Y + + +SGY
Sbjct: 49 LESFNPDSSSTASRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYG-DGSGTSGYY 107
Query: 94 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLA 152
V D + + + ++ +S++ GC Q+G A DG+ G G +SV S L
Sbjct: 108 VSDTMFFETVMGNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLN 167
Query: 153 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC------ 206
G+ FS C +D+G G + + P+ Y + +ES
Sbjct: 168 SLGVSPKVFSHCLKGSDNGGGIL-VLGEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKL 226
Query: 207 -IGNSCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
I +S T S Q +VDSG + +L Y V VS SL + C+
Sbjct: 227 PIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQ-CFIT 285
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNF 321
SS P + L F + V+ N++ + ++C+ G + I+G
Sbjct: 286 SSSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLV 345
Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVID 348
+ V+D N+++ W+ C ++
Sbjct: 346 LKDKIFVYDLANMRMGWADYDCSMSVN 372
>gi|449464178|ref|XP_004149806.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 86/336 (25%), Positives = 143/336 (42%), Gaps = 61/336 (18%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD--I 97
Y P++++ ++C PLC S CKS D C Y +Y+ + SS G LV+D
Sbjct: 98 YKPNNNA----LNCFEPLCTSLHPITNHHCKSADDQCQYEIEYA-DHGSSLGVLVNDHVP 152
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD-GVMGLGLGDVSVPSLLAKAGL 156
L L + S AP+ + GCG S D + P GV+GLG G+VS S L+ G+
Sbjct: 153 LKLTNGSLAAPR------IAFGCGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGV 206
Query: 157 IQNSFSICFDENDSGSVFFGDQ----GPATQQSTSFLPIGEKY-----DAYFVGVESYCI 207
++N C + + G +FFGD+ T S S IG Y + YF G +
Sbjct: 207 VRNVVGHCLSD-EGGFLFFGDEFVPSSGVTWTSMSHESIGSYYSSGPAEVYFSGKAT--- 262
Query: 208 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNAS 265
G LT + DSG+S+T+ ++ Y ++ + K + + + S C+ +
Sbjct: 263 GIKDLT-----LVFDSGSSYTYFNSQAYNSILALVKNNLRGKPLEDAPEDKSLPVCWKGT 317
Query: 266 S--------EEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN----EGFTVFCLTVMSTD-- 311
++ +R +KN + PEN + C +++
Sbjct: 318 RPFKSLRDVKKYFNPLALRFTKTKNAQIQLP------PENYLIITKYGNVCFGILNGTEV 371
Query: 312 --GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
GD IIG + +++D E ++ W + C +
Sbjct: 372 GLGDLNIIGDISLKDKMVIYDNERRRIGWFPTNCNK 407
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 81/324 (25%), Positives = 129/324 (39%), Gaps = 47/324 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST---EDTSSSGYLVDDILHLA 101
+DP++SSS VSC +C++ S DYS + + + G L + L L
Sbjct: 172 FDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLG 231
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
++VQ V IGCG + +G ++ A G++GLG G +S+ L G F
Sbjct: 232 G-------TAVQ-GVAIGCGHRNSGLFVGAA---GLLGLGWGAMSLIGQL--GGAAGGVF 278
Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY----------DAYFVGVESYCIGNSC 211
S C +G G T +P+G + Y+VG+ +G
Sbjct: 279 SYCLASRGAGGA-----GSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGER 333
Query: 212 ---------LTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
LT+ G +V D+G + T LP E YA + FD + + S + C
Sbjct: 334 LPLQDGLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTC 393
Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ 319
Y+ S ++VP + F + + RN + G VFCL + I+G
Sbjct: 394 YDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEV----GGAVFCLAFAPSSSGISILGN 449
Query: 320 NFMMGHRIVFDRENLKLAWSHSKC 343
G +I D N + + + C
Sbjct: 450 IQQEGIQITVDSANGYVGFGPNTC 473
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 81/324 (25%), Positives = 129/324 (39%), Gaps = 47/324 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST---EDTSSSGYLVDDILHLA 101
+DP++SSS VSC +C++ S DYS + + + G L + L L
Sbjct: 172 FDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLG 231
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
++VQ V IGCG + +G ++ A G++GLG G +S+ L G F
Sbjct: 232 G-------TAVQ-GVAIGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQL--GGAAGGVF 278
Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY----------DAYFVGVESYCIGNSC 211
S C +G G T +P+G + Y+VG+ +G
Sbjct: 279 SYCLASRGAGGA-----GSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGER 333
Query: 212 ---------LTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
LT+ G +V D+G + T LP E YA + FD + + S + C
Sbjct: 334 LPLQDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTC 393
Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ 319
Y+ S ++VP + F + + RN + G VFCL + I+G
Sbjct: 394 YDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEV----GGAVFCLAFAPSSSGISILGN 449
Query: 320 NFMMGHRIVFDRENLKLAWSHSKC 343
G +I D N + + + C
Sbjct: 450 IQQEGIQITVDSANGYVGFGPNTC 473
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 136/331 (41%), Gaps = 40/331 (12%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDD 96
D++ +DP S S V CS PLC+ S C + C Y Y + + ++G +
Sbjct: 178 DQSGQVFDPRRSRSYGAVGCSAPLCRRLDSGGCDLRRKACLYQVAYG-DGSVTAGDFATE 236
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
L A ++ A + +GCG G ++ A ++GLG G +S P+ +++
Sbjct: 237 TLTFAGGARVA-------RIALGCGHDNEGLFVAAAG---LLGLGRGSLSFPAQISR--R 284
Query: 157 IQNSFSICFDENDSG--------SVFFGDQGPATQQSTSFLPIGEK------YDAYFVGV 202
SFS C + S +V FG + + SF P+ + Y VG+
Sbjct: 285 YGRSFSYCLVDRTSSANPASHSSTVTFGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGI 344
Query: 203 ESYCIGNSCLTQSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 253
S + S + +VDSG S T L Y+ + F + R+S
Sbjct: 345 SVGGARVSGVADSDLRLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRAAAAGLRLSP 404
Query: 254 QGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG 312
G S + CY+ S +++KVP + + F+ + + P + T FC TDG
Sbjct: 405 GGFSLFDTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSKGT-FCFAFAGTDG 463
Query: 313 DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
IIG G R+VFD + ++ + C
Sbjct: 464 GVSIIGNIQQQGFRVVFDGDGQRVGFVPKGC 494
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 73/330 (22%), Positives = 136/330 (41%), Gaps = 28/330 (8%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ +D SS++ + CS +C S + C + C Y Y + + +SGY V D
Sbjct: 112 LNFFDTVGSSTAALIPCSDLICTSGVQGAAAECSPRVNQCSYTFQYG-DGSGTSGYYVSD 170
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAG 155
++ P + ++++ GC Q+G A DG+ G G G +SV S L+ G
Sbjct: 171 AMYFNLIMGQPPAVNSTATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSQG 230
Query: 156 LIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL- 212
+ FS C D N G + G+ + S + P+ Y + ++S + L
Sbjct: 231 ITPKVFSHCLKGDGNGGGILVLGE---ILEPSIVYSPLVPSQPHYNLNLQSIAVNGQPLP 287
Query: 213 --------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCY 262
+ + +VD G + +L E Y +V + V S+++ + +GN CY
Sbjct: 288 INPAVFSISNNRGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKGNQ---CY 344
Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN--EGFTVFCLTVMSTDGDYGIIGQN 320
S+ P + L F S V++ + +G ++C+ I+G
Sbjct: 345 LVSTSIGDIFPLVSLNFEGGASMVLKPEQYLMHNGYLDGAEMWCVGFQKLQEGASILGDL 404
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVIDKS 350
+ +V+D ++ W++ C ++ S
Sbjct: 405 VLKDKIVVYDIAQQRIGWANYDCSLSVNVS 434
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 73/309 (23%), Positives = 126/309 (40%), Gaps = 24/309 (7%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DP+ SS+ NVSC+ P C + C Y Y + + S G+ D L L+S+
Sbjct: 222 FDPARSSTYANVSCAAPACSDLDTRGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY- 279
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSI 163
GCG + G + + A G++GLG G S+P K G + F+
Sbjct: 280 ------DAVKGFRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAH 327
Query: 164 CFDENDSGSVF--FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQ- 218
C +G+ + FG PA + +T+ + + Y+VG+ +G L QS F
Sbjct: 328 CLPARSTGTGYLDFGAGSPAARLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFAT 387
Query: 219 --ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPD 274
+VDSG T LP Y+ + F +S++ + + + CY+ + + +P
Sbjct: 388 AGTIVDSGTVITRLPPAAYSSLRSAFAAAMSARGYKKAPAVSLLDTCYDFAGMSQVAIPT 447
Query: 275 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 334
+ L+F V + + GD GI+G + + +D
Sbjct: 448 VSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKK 507
Query: 335 KLAWSHSKC 343
+++S C
Sbjct: 508 VVSFSPGAC 516
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 78/327 (23%), Positives = 136/327 (41%), Gaps = 29/327 (8%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ YDP SS++ VSCS PLC + + C + C YI Y + ++S GY V D
Sbjct: 73 LTMYDPRESSTTSLVSCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYG-DGSTSEGYYVRD 131
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
+ S + ++ S V+ GC +QTG A DG++G G ++SVP+ LA
Sbjct: 132 AMQYNVISSNG-LANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQ 190
Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
I FS C E + G + ++ P+ Y V + + ++ L
Sbjct: 191 NIPRVFSHCL-EGEKRGGGILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPID 249
Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
+ + ++DSG + + P+ Y V + S+ + +QG + C+ S
Sbjct: 250 AEDFSSTNDTGVIMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQ-CFLVSGR 308
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPEN--EGFT-VFCLTVMSTDGDYG--------I 316
P++ L F + ++ + G T V+C+ S+ G I
Sbjct: 309 LSDLFPNVTLNFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTI 368
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKC 343
+G + +V+D +N ++ W C
Sbjct: 369 LGDIVLKDKLVVYDLDNSRIGWMSYNC 395
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 80/320 (25%), Positives = 141/320 (44%), Gaps = 34/320 (10%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCKS--RSSCK--SLKDPCPYIADYSTEDTSSSGYLVD 95
+N ++DP+ S+S KN+SCS CKS + S + S + C Y Y T T G+L
Sbjct: 170 QNDEKFDPTKSTSYKNLSCSSEPCKSIGKESAQGCSSSNSCLYGVKYGTGYT--VGFLAT 227
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
+ L + S V + +IGCG + G + A G++GLG V++PS +
Sbjct: 228 ETLTIT-------PSDVFENFVIGCGERNGGRFSGTA---GLLGLGRSPVALPSQTSST- 276
Query: 156 LIQNSFSICFDENDS--GSVFFGDQGPATQQSTSFLPIGEKY-DAYFVGVESYCIGNSCL 212
+N FS C + S G + FG Q+ F PI K + Y + V +G L
Sbjct: 277 -YKNLFSYCLPASSSSTGHLSFGG---GVSQAAKFTPITSKIPELYGLDVSGISVGGRKL 332
Query: 213 --TQSGFQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS-- 265
S F+ ++DSG + T+LP+ ++ + F +++++ ++ + + CY+ S
Sbjct: 333 PIDPSVFRTAGTIIDSGTTLTYLPSTAHSALSSAFQEMMTNYTLTKGTSGLQPCYDFSKH 392
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM--STDGDYGIIGQNFMM 323
+ + + +P + + F + + N G CL D D I G
Sbjct: 393 ANDNITIPQISIFFEGGVEVDIDDSGIFIAAN-GLEEVCLAFKDNGNDTDVAIFGNVQQK 451
Query: 324 GHRIVFDRENLKLAWSHSKC 343
+ +V+D + ++ C
Sbjct: 452 TYEVVYDVAKGMVGFAPGGC 471
>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
Length = 422
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 73/315 (23%), Positives = 129/315 (40%), Gaps = 29/315 (9%)
Query: 52 SSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ 109
+ V C+ LC++ ++C + C Y +Y+ + SS G L+ D L +
Sbjct: 114 KNNRVPCASSLCQAIQNNNCDIPTEQCDYEVEYA-DLGSSLGVLLSDYFPL----RLNNG 168
Query: 110 SSVQSSVIIGCGRKQTGSYLDGAAPD---GVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 166
S +Q + GCG Q YL +P G++GLG G S+ S L G+ QN CF
Sbjct: 169 SLLQPRIAFGCGYDQ--KYLGPHSPPDTAGILGLGRGKASILSQLRTLGITQNVVGHCFS 226
Query: 167 ENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGA 225
G +FFGD P + + + + Y G G G Q + DSG+
Sbjct: 227 RVTGGFLFFGDHLLPPSGITWTPMLRSSSDTLYSSGPAELLFGGKPTGIKGLQLIFDSGS 286
Query: 226 SFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASS--------EEMLKVPDM 275
S+T+ ++Y ++ K +S + + + + C+ + + K +
Sbjct: 287 SYTYFNAQVYQSILNLVRKDLSGMPLKDAPEEKALAVCWKTAKPIKSILDIKSFFKPLTI 346
Query: 276 RLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMGHRIVFDR 331
I +KN + + +G CL +++ G+ +IG FM +V+D
Sbjct: 347 NFIKAKNVQLQLAPEDYLIITKDGNV--CLGILNGGEQGLGNLNVIGDIFMQDRVVVYDN 404
Query: 332 ENLKLAWSHSKCEEV 346
E ++ W + C +
Sbjct: 405 ERQQIGWFPTNCNRL 419
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 84/335 (25%), Positives = 145/335 (43%), Gaps = 61/335 (18%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSR----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
Y P+ S++ NVSC P+C++ S C C Y Y + TS+ G L + L
Sbjct: 135 YAPARSATYANVSCRSPMCQALQSPWSRCSPPDTGCAYYFSYG-DGTSTDGVLATETFTL 193
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
S + V GCG + GS + + G++G+G G + SL+++ G+ +
Sbjct: 194 GS-------DTAVRGVAFGCGTENLGSTDNSS---GLVGMGRGPL---SLVSQLGVTR-- 238
Query: 161 FSIC---FDENDSGSVFFGDQG--PATQQSTSFLP-----IGEKYDAYFVGVESYCIGNS 210
FS C F+ + +F G + ++T F+P + Y++ +E +G++
Sbjct: 239 FSYCFTPFNATAASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDT 298
Query: 211 C---------LTQSG-FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--- 257
LT G ++DSG +FT L + V L S R+ L +
Sbjct: 299 LLPIDPAVFRLTPMGDGGVIIDSGTTFTALEERAF---VALARALASRVRLPLASGAHLG 355
Query: 258 WKYCYNASSEEMLKVPDMRLIFS------KNQSFVVRNHIFSFPENEGFTVFCLTVMSTD 311
C+ A+S E ++VP + L F + +S+VV E+ V CL ++S
Sbjct: 356 LSLCFAAASPEAVEVPRLVLHFDGADMELRRESYVV--------EDRSAGVACLGMVSAR 407
Query: 312 GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
G ++G I++D E L++ +KC E+
Sbjct: 408 G-MSVLGSMQQQNTHILYDLERGILSFEPAKCGEL 441
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 77/314 (24%), Positives = 134/314 (42%), Gaps = 37/314 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+DP+ S+S K + CS LC+S R C S K C Y+ Y +++SS+G L + + +
Sbjct: 173 FDPTKSASFKGLPCSSKLCQSIRQGCSSPK--CTYLTAY-VDNSSSTGTLATETISFSHL 229
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
+++IGC + +G L G+MGL +S+ S A + FS
Sbjct: 230 KYDF------KNILIGCSDQVSGESL---GESGIMGLNRSPISLAS--QTANIYDKLFSY 278
Query: 164 CFDEN--DSGSVFFGDQGPATQQSTSFLPIGEK-----YDAYFVGVESYCIGNSCL--TQ 214
C +G + FG + P F P+ + YD G+ +G L
Sbjct: 279 CIPSTPGSTGHLTFGGKVP---NDVRFSPVSKTAPSSDYDIKMTGIS---VGGRKLLIDA 332
Query: 215 SGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
S F+ + +DSGA T LP + Y+ + F +++ + Q + CY+ S+ + +
Sbjct: 333 SAFKIASTIDSGAVLTRLPPKAYSALRSVFREMMKGYPLLDQDDFLDTCYDFSNYSTVAI 392
Query: 273 PDMRLIFSK--NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
P + + F V ++ P G V+CL D + I G + +VFD
Sbjct: 393 PSISVFFEGGVEMDIDVSGIMWQVP---GSKVYCLAFAELDDEVSIFGNFQQKTYTVVFD 449
Query: 331 RENLKLAWSHSKCE 344
++ ++ C+
Sbjct: 450 GAKERIGFAPGGCD 463
>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 78/330 (23%), Positives = 134/330 (40%), Gaps = 32/330 (9%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLV 94
R L P SS + C+ PLCK S C++ + C Y +Y+ + SS G LV
Sbjct: 94 RCLEAPHPLYQPSSDLIPCNDPLCKALHLNSNQRCET-PEQCDYEVEYA-DGGSSLGVLV 151
Query: 95 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
D+ + + + + +GCG Q DGV+GLG G VS+ S L
Sbjct: 152 RDVFSM----NYTKGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQ 207
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF---VGVESYCIGNSC 211
G ++N C G +FFGD + + S+ P+ +Y ++ +G E G
Sbjct: 208 GYVKNVIGHCLSSLGGGILFFGDDLYDSSR-VSWTPMSREYSKHYSPAMGGE-LLFGGRT 265
Query: 212 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNAS---- 265
+ DSG+S+T+ ++ Y V + +S K + + ++ C+
Sbjct: 266 TGLKNLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFM 325
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTD----GDYGI 316
S E +K L S + + +F P + CL +++ + +
Sbjct: 326 SIEEVKKYFKPLALSFKTGWRSKT-LFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNL 384
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
IG M I++D E + W + C+E+
Sbjct: 385 IGDISMQDQMIIYDNEKQSIGWMPADCDEL 414
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 68/236 (28%), Positives = 110/236 (46%), Gaps = 41/236 (17%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DPS SSS +N+ C C S +SC D Y++ + S++GY V S
Sbjct: 130 FDPSLSSSYQNIPCLSDTCHSMRTTSC----DVRGYLSVETLTLDSTTGYSV-------S 178
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
F K +IGCG + TG++ ++ G++GLG G +S+PS L + I FS
Sbjct: 179 FPK----------TMIGCGYRNTGTFHGPSS--GIVGLGSGPMSLPSQLGTS--IGGKFS 224
Query: 163 ICFDE---NDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQSG 216
C N + + FGD PI +K DA Y++ +E++ +GN + G
Sbjct: 225 YCLGPWLPNSTSKLNFGDAAIVYGDGAMTTPIVKK-DAQSGYYLTLEAFSVGNKLIEFGG 283
Query: 217 -------FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
L+DSG +FTFLP ++Y + ++ + + ++K CYN +
Sbjct: 284 PTYGGNEGNILIDSGTTFTFLPYDVYYRFESAVAEYINLEHVEDPNGTFKLCYNVA 339
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 84/335 (25%), Positives = 145/335 (43%), Gaps = 61/335 (18%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSR----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
Y P+ S++ NVSC P+C++ S C C Y Y + TS+ G L + L
Sbjct: 135 YAPARSATYANVSCRSPMCQALQSPWSRCSPPDTGCAYYFSYG-DGTSTDGVLATETFTL 193
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
S + V GCG + GS + + G++G+G G + SL+++ G+ +
Sbjct: 194 GS-------DTAVRGVAFGCGTENLGSTDNSS---GLVGMGRGPL---SLVSQLGVTR-- 238
Query: 161 FSIC---FDENDSGSVFFGDQG--PATQQSTSFLP-----IGEKYDAYFVGVESYCIGNS 210
FS C F+ + +F G + ++T F+P + Y++ +E +G++
Sbjct: 239 FSYCFTPFNATAASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDT 298
Query: 211 C---------LTQSG-FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--- 257
LT G ++DSG +FT L + V L S R+ L +
Sbjct: 299 LLPIDPAVFRLTPMGDGGVIIDSGTTFTALEESAF---VALARALASRVRLPLASGAHLG 355
Query: 258 WKYCYNASSEEMLKVPDMRLIFS------KNQSFVVRNHIFSFPENEGFTVFCLTVMSTD 311
C+ A+S E ++VP + L F + +S+VV E+ V CL ++S
Sbjct: 356 LSLCFAAASPEAVEVPRLVLHFDGADMELRRESYVV--------EDRSAGVACLGMVSAR 407
Query: 312 GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
G ++G I++D E L++ +KC E+
Sbjct: 408 G-MSVLGSMQQQNTHILYDLERGILSFEPAKCGEL 441
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 74/329 (22%), Positives = 135/329 (41%), Gaps = 25/329 (7%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ +D SS++ V CS P+C S + C + C Y Y + + +SG V D
Sbjct: 128 LNFFDTVGSSTAALVPCSDPMCASAIQGAAAQCSPQVNQCSYTFQYE-DGSGTSGVYVSD 186
Query: 97 ILHLASFSKHAPQSSVQSS--VIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAK 153
++ + ++V SS ++ GC Q+G A DG++G G G++SV S L+
Sbjct: 187 AMYFDMILGQSTPANVASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSS 246
Query: 154 AGLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC 211
G+ FS C D N G + G+ + S + P+ Y + ++S +
Sbjct: 247 RGITPKVFSHCLKGDGNGGGILVLGE---ILEPSIVYSPLVPSQPHYNLNLQSIAVNGQV 303
Query: 212 L--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
L T ++DSG + ++L E Y +V D VS S + CY
Sbjct: 304 LSINPAVFATSDKRGTIIDSGTTLSYLVQEAYDPLVNAVDTAVSQFATSFISKGSQ-CYL 362
Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNF 321
+ P + F S ++ ++ + +G ++C+ I+G
Sbjct: 363 VLTSIDDSFPTVSFNFEGGASMDLKPSQYLLNRGFQDGAKMWCIGFQKVQEGVTILGDLV 422
Query: 322 MMGHRIVFDRENLKLAWSHSKCEEVIDKS 350
+ +V+D ++ W++ C ++ S
Sbjct: 423 LKDKIVVYDLARQQIGWTNYDCSMSVNVS 451
>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
Length = 413
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 81/331 (24%), Positives = 135/331 (40%), Gaps = 34/331 (10%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLV 94
R L P SS + C+ PLCK S C++ + C Y +Y+ + SS G LV
Sbjct: 82 RCLEAPHPLYQPSSDLIPCNDPLCKALHLNSNQRCET-PEQCDYEVEYA-DGGSSLGVLV 139
Query: 95 DDILHLASFSKHAPQS-SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
D+ FS + Q + + +GCG Q DGV+GLG G VS+ S L
Sbjct: 140 RDV-----FSMNYTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHS 194
Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF---VGVESYCIGNS 210
G ++N C G +FFGD + + S+ P+ +Y ++ +G E G
Sbjct: 195 QGYVKNVIGHCLSSLGGGILFFGDDLYDSSR-VSWTPMSREYSKHYSPAMGGE-LLFGGR 252
Query: 211 CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNAS--- 265
+ DSG+S+T+ ++ Y V + +S K + + ++ C+
Sbjct: 253 TTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPF 312
Query: 266 -SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTD----GDYG 315
S E +K L S + + +F P + CL +++ +
Sbjct: 313 MSIEEVKKYFKPLALSFKTGWRSKT-LFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLN 371
Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+IG M I++D E + W C+E+
Sbjct: 372 LIGDISMQDQMIIYDNEKQSIGWMPVDCDEL 402
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 82/327 (25%), Positives = 139/327 (42%), Gaps = 23/327 (7%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP---CPYIADYSTEDTSSSGYLVDDI 97
LS +DP SSS+ VSCS C S +S P C Y Y + + +SG+ + D
Sbjct: 127 QLSFFDPGVSSSASLVSCSDRRCYSNFQTESGCSPNNLCSYSFKYG-DGSGTSGFYISDF 185
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGL 156
+ + + + + GC QTG A DG+ GLG G +SV S LA GL
Sbjct: 186 MSFDTVITSTLAINSSAPFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGL 245
Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---- 212
FS C + SG G + T + P+ Y V ++S + L
Sbjct: 246 APRVFSHCLKGDKSGGGIM-VLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDP 304
Query: 213 ----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
+G ++D+G + +LP E Y+ + VS + S++ C+ ++ +
Sbjct: 305 SVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAIANAVSQYGRPITYESYQ-CFEITAGD 363
Query: 269 MLKVPDMRLIFSKNQSFVVRNH----IFSFPENEGFTVFCLTVMS-TDGDYGIIGQNFMM 323
+ P++ L F+ S V+R H IFS + G +++C+ + I+G +
Sbjct: 364 VDVFPEVSLSFAGGASMVLRPHAYLQIFS---SSGSSIWCIGFQRMSHRRITILGDLVLK 420
Query: 324 GHRIVFDRENLKLAWSHSKCEEVIDKS 350
+V+D ++ W+ C ++ S
Sbjct: 421 DKVVVYDLVRQRIGWAEYDCSLEVNVS 447
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 83/330 (25%), Positives = 144/330 (43%), Gaps = 26/330 (7%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
LS +DPSSSS++ VSCSHP+C S + C + C Y Y + + ++GY V
Sbjct: 129 ELSFFDPSSSSTTSLVSCSHPICTSLVQTTAAECSPQSNQCSYSFHYG-DGSGTTGYYVS 187
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKA 154
D+L+ + + ++ +S++ GC Q+G A DG+ G G D+SV S L+
Sbjct: 188 DMLYFDTVLGDSLIANSSASIVFGCSTYQSGDLTKVDKAIDGIFGFGQQDLSVVSQLSSL 247
Query: 155 GLIQNSFSICFD-ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL- 212
G+ FS C E D G G + + + P+ Y + ++S + L
Sbjct: 248 GITPKVFSHCLKGEGDGGGKLV--LGEILEPNIIYSPLVPSQSHYNLNLQSISVNGQLLP 305
Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYN 263
T + +VDSG + T+L Y V VSS + +GN CY
Sbjct: 306 IDPAVFATSNNQGTIVDSGTTLTYLVETAYDPFVSAITATVSSSTTPVLSKGNQ---CYL 362
Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMST-DGDYGIIGQN 320
S+ P + L F+ S V++ ++ ++G ++C+ + I+G
Sbjct: 363 VSTSVDEIFPPVSLNFAGGASMVLKPGEYLMHLGFSDGAAMWCIGFQKVAEPGITILGDL 422
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVIDKS 350
+ V+D + ++ W++ C ++ S
Sbjct: 423 VLKDKIFVYDLAHQRIGWANYDCSLSVNVS 452
>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 81/331 (24%), Positives = 135/331 (40%), Gaps = 34/331 (10%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLV 94
R L P SS + C+ PLCK S C++ + C Y +Y+ + SS G LV
Sbjct: 94 RCLEAPHPLYQPSSDLIPCNDPLCKALHLNSNQRCET-PEQCDYEVEYA-DGGSSLGVLV 151
Query: 95 DDILHLASFSKHAPQS-SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
D+ FS + Q + + +GCG Q DGV+GLG G VS+ S L
Sbjct: 152 RDV-----FSMNYTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHS 206
Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF---VGVESYCIGNS 210
G ++N C G +FFGD + + S+ P+ +Y ++ +G E G
Sbjct: 207 QGYVKNVIGHCLSSLGGGILFFGDDLYDSSR-VSWTPMSREYSKHYSPAMGGE-LLFGGR 264
Query: 211 CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNAS--- 265
+ DSG+S+T+ ++ Y V + +S K + + ++ C+
Sbjct: 265 TTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPF 324
Query: 266 -SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTD----GDYG 315
S E +K L S + + +F P + CL +++ +
Sbjct: 325 MSIEEVKKYFKPLALSFKTGWRSKT-LFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLN 383
Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+IG M I++D E + W C+E+
Sbjct: 384 LIGDISMQDQMIIYDNEKQSIGWMPVDCDEL 414
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 67/269 (24%), Positives = 126/269 (46%), Gaps = 33/269 (12%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+++P SS+ + VSC+ +C + + C Y Y+ E +SSSG L +DI+ +
Sbjct: 131 KFEPELSSTYQPVSCN-----IDCTCDNERKQCVYERQYA-EMSSSSGVLGEDIISFGNQ 184
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S+ PQ + I GC ++TG A DG+MGLG GD+S+ L + G+I +SFS+
Sbjct: 185 SELVPQRA-----IFGCENQETGDLYSQRA-DGIMGLGRGDLSIVDQLVEKGVISDSFSL 238
Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------Q 214
C+ D G + G P+ P+ +Y Y + +++ + L
Sbjct: 239 CYGGMDIGGGAMILGGISPPSGMVFAESDPVRSQY--YNIDLKAIHVAGKQLHLDPSIFD 296
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEEMLK 271
++DSG ++ +LP + K ++S + + G Y C++ + ++ +
Sbjct: 297 GKHGTVLDSGTTYAYLPEAAFTAFKDAMMKELTSLK-QIHGPDPNYNDICFSGAESDVSQ 355
Query: 272 V----PDMRLIFSKNQSFVV--RNHIFSF 294
+ P + ++FS Q + N++F +
Sbjct: 356 LSNTFPAVEMVFSNGQKLSLSPENYLFQY 384
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 86/328 (26%), Positives = 131/328 (39%), Gaps = 47/328 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKS-----------LKDPCPYI-----ADYSTEDTS 88
YDPS SSS K V C+ C+ + S +K PC Y+ Y+ D +
Sbjct: 175 YDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLA 234
Query: 89 SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 148
S L+ D L +F + GCGR G + + G+ VS+
Sbjct: 235 SESILLGDT-KLENF-------------VFGCGRNNKGLFGGSSGLMGLG---RSSVSLV 277
Query: 149 SLLAKAGLIQNSFSIC---FDENDSGSVFFGDQGPATQQST--SFLPIGEK---YDAYFV 200
S K FS C ++ SGS+ FG+ ST S+ P+ + Y +
Sbjct: 278 SQTLKT--FNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYIL 335
Query: 201 GVESYCIGNSCLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 258
+ IG L S F L+DSG T LP IY V ++F K S + +
Sbjct: 336 NLTGASIGGVELKSSSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSIL 395
Query: 259 KYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTV-FCLTVMSTDGDYGI 316
C+N +S E + +P +++IF N V +F F + + V L +S + + GI
Sbjct: 396 DTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGI 455
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCE 344
IG R+++D +L C
Sbjct: 456 IGNYQQKNQRVIYDTTQERLGIVGENCR 483
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 88/333 (26%), Positives = 140/333 (42%), Gaps = 43/333 (12%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDD 96
D++ +DP +S S V C+ PLC+ S C + C Y Y + + ++G +
Sbjct: 183 DQSGQMFDPRASHSYGAVDCAAPLCRRLDSGGCDLRRKACLYQVAYG-DGSVTAGDFATE 241
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
L AS ++ P+ V +GCG G ++ A ++GLG G +S PS +++
Sbjct: 242 TLTFASGAR-VPR------VALGCGHDNEGLFVAAAG---LLGLGRGSLSFPSQISR--R 289
Query: 157 IQNSFSICFDENDSGS---------VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVES 204
SFS C + S S V FG + SF P+ + Y+V +
Sbjct: 290 FGRSFSYCLVDRTSSSASATSRSSTVTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMG 349
Query: 205 YCIGNSCL-------------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
+G + + T G +VDSG S T L YA + F + R+
Sbjct: 350 ISVGGARVPGVAVSDLRLDPSTGRG-GVIVDSGTSVTRLARPAYAALRDAFRAAAAGLRL 408
Query: 252 SLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST 310
S G S + CY+ S +++KVP + + F+ + + P + T FC T
Sbjct: 409 SPGGFSLFDTCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGT-FCFAFAGT 467
Query: 311 DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
DG IIG G R+VFD + +L + C
Sbjct: 468 DGGVSIIGNIQQQGFRVVFDGDGQRLGFVPKGC 500
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 79/317 (24%), Positives = 139/317 (43%), Gaps = 33/317 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLA 101
+ P+ SS+ +SC C+ S++SC + + C Y YS D S + G L +
Sbjct: 149 FQPTRSSTYSQLSCQSNACQALSQASCDADSE-CQY--QYSYGDGSRTIGVLSTETF--- 202
Query: 102 SFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
SF + V+ V GC G++ DG++GLG G S+ S L I
Sbjct: 203 SFVDGGGKGQVRVPRVNFGCSTASAGTFRS----DGLVGLGAGAFSLVSQLGATTHIDRK 258
Query: 161 FSIC----FDENDSGSVFFGDQGPATQQSTSFLP-IGEKYDAYF-VGVESYCIGNSCLTQ 214
S C +D N S ++ FG + ++ + P + D+Y+ V +ES +G +
Sbjct: 259 LSYCLIPSYDANSSSTLNFGSRAVVSEPGAASTPLVPSDVDSYYTVALESVAVGGQEVAT 318
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA---SSEEMLK 271
+ +VDSG + TFL + +V + ++ + +R+ + CY+ S +
Sbjct: 319 HDSRIIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQPPEQLLQLCYDVQGKSETDNFG 378
Query: 272 VPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIG----QNFMMGHR 326
+PD+ L F + +R + FS + EG L +S I+G QNF +G
Sbjct: 379 IPDVTLRFGGGAAVTLRPENTFSLLQ-EGTLCLVLVPVSESQPVSILGNIAQQNFHVG-- 435
Query: 327 IVFDRENLKLAWSHSKC 343
+D + + ++ + C
Sbjct: 436 --YDLDARTVTFAAADC 450
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 79/330 (23%), Positives = 140/330 (42%), Gaps = 50/330 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLCK--SRSSCKSLK-DPCPYIADYSTEDTSSSGYLVDDILHLA 101
+DP++S++ VSC +C+ S+C + C Y Y+ + + + G L + L L
Sbjct: 213 FDPATSATFSGVSCGSAICRILPTSACGDGELGGCEYEVSYA-DGSYTKGALALETLTLG 271
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
+ V+IGCG + G ++ A G+MGLG G +S+ L G + +F
Sbjct: 272 --------GTAVEGVVIGCGHRNRGLFVGAA---GLMGLGWGPMSLVGQL--GGEVGGAF 318
Query: 162 SICF----------DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIG 208
S C ++D+G + G + A + ++P+ A Y+VG+ +G
Sbjct: 319 SYCLASRGGYGSGAADDDAGWLVLG-RSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVG 377
Query: 209 NSCL-TQSG-FQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS- 257
+ L Q+G FQ ++D+G + T LP E YA + F ++ QG S
Sbjct: 378 DERLPLQAGLFQLTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSS 437
Query: 258 --WKYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGD 313
CY+ S ++VP + F + ++ RN + ++CL +
Sbjct: 438 SVLDTCYDLSGYASVRVPTVSFCFDGDARLILAARNVLLEVD----MGIYCLAFAPSSSG 493
Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
I+G G +I D N + + + C
Sbjct: 494 LSIMGNTQQAGIQITVDSANGYIGFGPANC 523
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 77/319 (24%), Positives = 135/319 (42%), Gaps = 32/319 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP SSSS N++C C S C + + C Y Y+ +++ + G L + L L S
Sbjct: 102 FDPRSSSSYTNITCGTESCNKLDSSLCSTDQKTCNYTYSYA-DNSITQGVLAQETLTLTS 160
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA-GLIQNSF 161
+ + +I GCG +G + D G++GLG G +S+ S + + G N F
Sbjct: 161 TTG---EPVAFQGIIFGCGHNNSG-FNDREM--GLIGLGRGPLSLISQIGSSLGAGGNMF 214
Query: 162 SICF-----DENDSGSVFFGDQGPATQQSTSFLPI----GEKYDAYFVGVE------SYC 206
S C D + + + FG T P+ G Y A +G+ +
Sbjct: 215 SQCLVPFNTDPSITSQMNFGKGSEVLGNGTVSTPLISKDGTGYFATLLGISVEDINLPFS 274
Query: 207 IGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
G+S T + L+DSG + T+LP E Y ++ + V+ + + G ++ CY +
Sbjct: 275 NGSSLGTITKGNILIDSGTTITYLPEEFYHRLIEQVRNKVALEPFRIDG--YELCYQTPT 332
Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
L P + + F + +F +++ FC V T+ +Y G +
Sbjct: 333 N--LNGPTLTIHFEGGDVLLTPAQMFIPVQDDN---FCFAVFDTNEEYVTYGNYAQSNYL 387
Query: 327 IVFDRENLKLAWSHSKCEE 345
I FD E +++ + C +
Sbjct: 388 IGFDLERQVVSFKATDCTK 406
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 80/319 (25%), Positives = 133/319 (41%), Gaps = 33/319 (10%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRS----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
++DPS S S + +C+ LC + +C + + C Y Y + ++ + I
Sbjct: 80 KFDPSKSRSFRKAACTDNLCNVSALPLKACAA--NVCQYQYTYGDQSNTNGDLAFETI-- 135
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
S + A SV + GCG + G++ A G++GLG G +S+ S L+ N
Sbjct: 136 --SLNNGAGTQSV-PNFAFGCGTQNLGTF---AGAAGLVGLGQGPLSLNSQLSHT--FAN 187
Query: 160 SFSICFDENDSGS---VFFGDQGPATQ-QSTSFLPIGEKYDAYFVGVESYCIGNSCLT-- 213
FS C +S S + FG A Q TS + Y+V + S +G L
Sbjct: 188 KFSYCLVSLNSLSASPLTFGSIAAAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLA 247
Query: 214 -------QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
QS + ++DSG + T L Y+ V+ ++ V+ R+ C+N
Sbjct: 248 PSVFAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAYESFVNYPRLDGSAYGLDLCFNI 307
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
+ VPDM F + F +R + T CL + + G + IIG
Sbjct: 308 AGVSNPSVPDMVFKF-QGADFQMRGENLFVLVDTSATTLCLAMGGSQG-FSIIGNIQQQN 365
Query: 325 HRIVFDRENLKLAWSHSKC 343
H +V+D E K+ ++ + C
Sbjct: 366 HLVVYDLEAKKIGFATADC 384
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 86/328 (26%), Positives = 131/328 (39%), Gaps = 47/328 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKS-----------LKDPCPYI-----ADYSTEDTS 88
YDPS SSS K V C+ C+ + S +K PC Y+ Y+ D +
Sbjct: 175 YDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLA 234
Query: 89 SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 148
S L+ D L +F + GCGR G + + G+ VS+
Sbjct: 235 SESILLGDT-KLENF-------------VFGCGRNNKGLFGGSSGLMGLG---RSSVSLV 277
Query: 149 SLLAKAGLIQNSFSIC---FDENDSGSVFFGDQGPATQQST--SFLPIGEK---YDAYFV 200
S K FS C ++ SGS+ FG+ ST S+ P+ + Y +
Sbjct: 278 SQTLKT--FNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYIL 335
Query: 201 GVESYCIGNSCLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 258
+ IG L S F L+DSG T LP IY V ++F K S + +
Sbjct: 336 NLTGASIGGVELKSSSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSIL 395
Query: 259 KYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTV-FCLTVMSTDGDYGI 316
C+N +S E + +P +++IF N V +F F + + V L +S + + GI
Sbjct: 396 DTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGI 455
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCE 344
IG R+++D +L C
Sbjct: 456 IGNYQQKNQRVIYDSTQERLGIVGENCR 483
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 86/328 (26%), Positives = 131/328 (39%), Gaps = 47/328 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKS-----------LKDPCPYI-----ADYSTEDTS 88
YDPS SSS K V C+ C+ + S +K PC Y+ Y+ D +
Sbjct: 127 YDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLA 186
Query: 89 SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 148
S L+ D L +F + GCGR G + + G+ VS+
Sbjct: 187 SESILLGDT-KLENF-------------VFGCGRNNKGLFGGSSGLMGLG---RSSVSLV 229
Query: 149 SLLAKAGLIQNSFSIC---FDENDSGSVFFGDQGPATQQST--SFLPIGEK---YDAYFV 200
S K FS C ++ SGS+ FG+ ST S+ P+ + Y +
Sbjct: 230 SQTLKT--FNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYIL 287
Query: 201 GVESYCIGNSCLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 258
+ IG L S F L+DSG T LP IY V ++F K S + +
Sbjct: 288 NLTGASIGGVELKSSSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSIL 347
Query: 259 KYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTV-FCLTVMSTDGDYGI 316
C+N +S E + +P +++IF N V +F F + + V L +S + + GI
Sbjct: 348 DTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGI 407
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCE 344
IG R+++D +L C
Sbjct: 408 IGNYQQKNQRVIYDTTQERLGIVGENCR 435
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 74/326 (22%), Positives = 137/326 (42%), Gaps = 20/326 (6%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+L +D S ++ +V+CS P+C S + C S + C Y Y + + +SGY +
Sbjct: 143 DLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQC-SENNQCGYSFRYG-DGSGTSGYYMT 200
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKA 154
D + + + ++ + ++ GC Q+G A DG+ G G G +SV S L+
Sbjct: 201 DTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSR 260
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ 214
G+ FS C + SG F G + P+ Y + + S + L
Sbjct: 261 GITPPVFSHCLKGDGSGGGVF-VLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPL 319
Query: 215 SG--FQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
F+A +VD+G + T+L E Y + VS + N + CY S+
Sbjct: 320 DAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNG-EQCYLVST 378
Query: 267 EEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
P + L F+ S ++R +++F + +G +++C+ + I+G +
Sbjct: 379 SISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKD 438
Query: 325 HRIVFDRENLKLAWSHSKCEEVIDKS 350
V+D ++ W+ C ++ S
Sbjct: 439 KVFVYDLARQRIGWASYDCSMSVNVS 464
>gi|348690234|gb|EGZ30048.1| pepsin-like aspartic protease A1 [Phytophthora sojae]
Length = 654
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 78/322 (24%), Positives = 144/322 (44%), Gaps = 37/322 (11%)
Query: 45 YDPSSSSSSKNVSCS----HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+ +SS+ +V+CS H CK C D C Y E +S +V+D+++L
Sbjct: 107 FQADNSSTLIHVTCSQQQSHFQCKE---CTEKSDTCAISQSY-MEGSSWKASVVEDVVYL 162
Query: 101 ---ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
+SF A + + GC +TG ++ A DG+MGL D + + L + I
Sbjct: 163 GGESSFHDEAMRDRYGTHFQFGCQSSETGLFVTQVA-DGIMGLSNSDTHIVAKLHRENKI 221
Query: 158 -QNSFSICFDENDSGSVFFGD-QGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCL 212
N FS+CF EN G++ G+ A + S+ + + A Y V ++ IG +
Sbjct: 222 PSNLFSLCFTEN-GGTMSVGEPNTKAHRGEISYAKVIKDRSAGHFYNVNMKDIRIGGKSI 280
Query: 213 TQ-----SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
+ +VDSG + ++LP + E + F ++ R G S C+ ++E
Sbjct: 281 NAKEEAYTRGHYIVDSGTTDSYLPRAMKNEFLQVFKEVAG--RDYQVGTS---CHGYTNE 335
Query: 268 EMLKVPDMRLIFSKNQSFVVRNH--IFSFPENEGF----TVFCLTVMSTDGDYGIIGQNF 321
++ +P ++L+ +++ N I P + +C ++ ++ G+IG N
Sbjct: 336 DLASLPKIQLVM---EAYGDENGEVIIDIPPEQYLLHNDNSYCGSIYLSENAGGVIGANL 392
Query: 322 MMGHRIVFDRENLKLAWSHSKC 343
MM ++FD N ++ + + C
Sbjct: 393 MMNRDVIFDNGNQRVGFVDADC 414
>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 414
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 86/335 (25%), Positives = 132/335 (39%), Gaps = 50/335 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
YDP + + V C P C + +C C Y DY + +S+ G LV+D +
Sbjct: 74 YDPKRA---RVVDCRRPTCAQVQRGGQFTCSGDVRQCDYEVDY-VDGSSTMGILVEDTIT 129
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
L + + Q+ +IGCG Q G+ A DGV+GL +S+PS LA G+
Sbjct: 130 LVLTNG----TRFQTRAVIGCGYDQQGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIAN 185
Query: 159 NSFSICF--DENDSGSVFFGDQ-GPATQQSTSFL---PIGEKYDAYFVGVESYCIGNSCL 212
N C N G +FFGD PA + + + P+ E Y A ++ G L
Sbjct: 186 NVIGHCLAGGSNGGGYLFFGDTLVPALGMTWTPMIGRPLVEGYQARLRSIK---YGGEVL 242
Query: 213 TQSGFQ-----ALVDSGASFTFLPTEIYAEV---VVKFDKLVSSKRISLQGNSWKYCYNA 264
G A+ DSG SFT+L Y V VV+ + +RI + +C+
Sbjct: 243 ELEGTTDDVGGAMFDSGTSFTYLVPNAYTAVLSAVVRQAQRSGLERIKTD-TTLPFCWRG 301
Query: 265 SSEEMLKVPDMRLIFSK------NQSFVVRNHIFSFPENEGFTV------FCLTVMSTDG 312
S V D+ F ++ + EG+ + CL V+
Sbjct: 302 PS-PFESVADVSAYFKTVTLDFGGSTWWSSGKLLEL-SPEGYLIVSTQGNVCLGVLDASV 359
Query: 313 D----YGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
I+G M G+ +V+D ++ W C
Sbjct: 360 ASLEVTNILGDISMRGYLVVYDNMREQIGWVRRNC 394
>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
gi|219888491|gb|ACL54620.1| unknown [Zea mays]
Length = 557
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 80/308 (25%), Positives = 135/308 (43%), Gaps = 37/308 (12%)
Query: 62 LCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 121
L +++ C++ K C Y +Y+ + +SS G L D +H+ + + + + GC
Sbjct: 248 LQGNQNYCETCKQ-CDYEIEYA-DQSSSMGVLARDDMHMIATNG----GREKLDFVFGCA 301
Query: 122 RKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQ 178
Q G L A DG++GL +S PS LA G+I N F C ++ G +F GD
Sbjct: 302 YDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQGGGGYMFLGDD 361
Query: 179 GPATQQSTSFLPIGEKYD-AYFVGVESYCIGNSCLTQ-----SGFQALVDSGASFTFLPT 232
+ ++ I D Y G+ L + S Q + DSG+S+T+LP
Sbjct: 362 Y-VPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFDSGSSYTYLPN 420
Query: 233 EIYAEVVVK-------FDKLVSSKRISLQGNSWKYCYNASSEEMLK--VPDMRLIFSKNQ 283
EIY +V F + S + + L WK + E +K + L F K
Sbjct: 421 EIYENLVAAIKYASPGFVQDTSDRTLPL---CWKADFPVRYLEDVKQFFEPLNLHFGKKW 477
Query: 284 SFVVRNHIFSFPENEGFTV----FCLTVMS-TDGDYG---IIGQNFMMGHRIVFDRENLK 335
F+ + S PE+ CL +++ T+ ++G I+G + G +V+D + +
Sbjct: 478 LFMSKTFTIS-PEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNQRKQ 536
Query: 336 LAWSHSKC 343
+ W+ S C
Sbjct: 537 IGWADSDC 544
>gi|2570402|gb|AAB97155.1| EEA1 [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 76/322 (23%), Positives = 130/322 (40%), Gaps = 45/322 (13%)
Query: 55 NVSCSHPLCKS-RSSCK-----SLKDP--CPYIADYSTEDTSSSGYLVDDILHLASFSKH 106
V C PLC + R S DP C Y Y T S G L DI+ + K
Sbjct: 93 KVVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVT--GKSEGDLATDIISVNGRDK- 149
Query: 107 APQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSIC 164
+ GCG KQ +P +G++GLG+G + L +I +N C
Sbjct: 150 -------KRIAFGCGYKQEEPPDSPPSPVNGILGLGMGKAGFAAQLKGLKMIKENVIGHC 202
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDS 223
G ++ GD P T+ ++ P+ E Y G+ I + F+A+ DS
Sbjct: 203 LSSKGKGVLYVGDFNPPTR-GVTWAPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDS 261
Query: 224 GASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYNASS--------EEMLKVPD 274
G+++T +P +IY E+V K S + ++G + C+ + K
Sbjct: 262 GSTYTHVPAQIYNEIVSKVRGTFSESSLEEVKGRALPLCWKGKKPFGSVNDVKNQFKALS 321
Query: 275 MRLIFSK---NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-------YGIIGQNFMMG 324
+++ ++ N +N++F + E CL ++ D + +IG M
Sbjct: 322 LKITHARGTNNLDIPPQNYLFVKEDGET----CLAILDASLDPVLKELNFILIGAVTMQD 377
Query: 325 HRIVFDRENLKLAWSHSKCEEV 346
+++D E +L W ++C+ V
Sbjct: 378 LFVIYDNEKKQLGWVRAQCDRV 399
>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
Length = 557
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 80/308 (25%), Positives = 135/308 (43%), Gaps = 37/308 (12%)
Query: 62 LCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 121
L +++ C++ K C Y +Y+ + +SS G L D +H+ + + + + GC
Sbjct: 248 LQGNQNYCETCKQ-CDYEIEYA-DQSSSMGVLARDDMHMIATNG----GREKLDFVFGCA 301
Query: 122 RKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQ 178
Q G L A DG++GL +S PS LA G+I N F C ++ G +F GD
Sbjct: 302 YDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQGGGGYMFLGDD 361
Query: 179 GPATQQSTSFLPIGEKYD-AYFVGVESYCIGNSCLTQ-----SGFQALVDSGASFTFLPT 232
+ ++ I D Y G+ L + S Q + DSG+S+T+LP
Sbjct: 362 Y-VPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFDSGSSYTYLPN 420
Query: 233 EIYAEVVVK-------FDKLVSSKRISLQGNSWKYCYNASSEEMLK--VPDMRLIFSKNQ 283
EIY +V F + S + + L WK + E +K + L F K
Sbjct: 421 EIYENLVAAIKYASPGFVQDTSDRTLPL---CWKADFPVRYLEDVKQFFEPLNLHFGKKW 477
Query: 284 SFVVRNHIFSFPENEGFTV----FCLTVMS-TDGDYG---IIGQNFMMGHRIVFDRENLK 335
F+ + S PE+ CL +++ T+ ++G I+G + G +V+D + +
Sbjct: 478 LFMSKTFTIS-PEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNQRKQ 536
Query: 336 LAWSHSKC 343
+ W+ S C
Sbjct: 537 IGWADSDC 544
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 80/314 (25%), Positives = 127/314 (40%), Gaps = 49/314 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST---EDTSSSGYLVDDILHLA 101
+DP++SSS VSC +C++ S DYS + + + G L + L L
Sbjct: 172 FDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLG 231
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
++VQ V IGCG + +G ++ A G++GLG G +S+ L G F
Sbjct: 232 G-------TAVQ-GVAIGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQL--GGAAGGVF 278
Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC---------L 212
S C +G A ++SF Y+VG+ +G L
Sbjct: 279 SYCLASRGAGG--------AGSLASSF---------YYVGLTGIGVGGERLPLQDSLFQL 321
Query: 213 TQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
T+ G +V D+G + T LP E YA + FD + + S + CY+ S ++
Sbjct: 322 TEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVR 381
Query: 272 VPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 329
VP + F + + RN + G VFCL + I+G G +I
Sbjct: 382 VPTVSFYFDQGAVLTLPARNLLVEV----GGAVFCLAFAPSSSGISILGNIQQEGIQITV 437
Query: 330 DRENLKLAWSHSKC 343
D N + + + C
Sbjct: 438 DSANGYVGFGPNTC 451
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 77/325 (23%), Positives = 139/325 (42%), Gaps = 33/325 (10%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+N +DP SS+ K V C C S+ +C C Y Y LV
Sbjct: 129 QNAPLFDPRKSSTFKTVPCDSQPCTLLPPSQRACVGKSGQCYYQYIYGDHT------LVS 182
Query: 96 DILHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
IL S + + ++++ + GC + + G++GLG+G +S+ S L
Sbjct: 183 GILGFESINFGSKNNAIKFPKLTFGCTFSNNDTVDESKRNMGLVGLGVGPLSLISQLGYQ 242
Query: 155 GLIQNSFSICF---DENDSGSVFFGDQGPATQ----QSTSFL--PIGEKYDAYFVGVESY 205
I FS CF N + + FG+ Q ST + IG Y Y++ +E
Sbjct: 243 --IGRKFSYCFPPLSSNSTSKMRFGNDAIVKQIKGVVSTPLIIKSIGPSY--YYLNLEGV 298
Query: 206 CIGNSCLTQSGFQA----LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
IGN + S Q L+DSG SFT L Y + V ++ + + + + +C
Sbjct: 299 SIGNKKVKTSESQTDGNILIDSGTSFTILKQSFYNKFVALVKEVYGVEAVKIPPLVYNFC 358
Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST-DGDYGIIGQN 320
+ + + PD+ +F+ + V +++F E E + C+ + T D D I G +
Sbjct: 359 FENKGKRK-RFPDVVFLFTGAKVRVDASNLF---EAEDNNLLCMVALPTSDEDDSIFGNH 414
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEE 345
+G+++ +D + ++++ + C +
Sbjct: 415 AQIGYQVEYDLQGGMVSFAPADCAK 439
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 74/326 (22%), Positives = 137/326 (42%), Gaps = 20/326 (6%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+L +D S ++ +V+CS P+C S + C S + C Y Y + + +SGY +
Sbjct: 148 DLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQC-SENNQCGYSFRYG-DGSGTSGYYMT 205
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKA 154
D + + + ++ + ++ GC Q+G A DG+ G G G +SV S L+
Sbjct: 206 DTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSR 265
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ 214
G+ FS C + SG F G + P+ Y + + S + L
Sbjct: 266 GITPPVFSHCLKGDGSGGGVF-VLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPL 324
Query: 215 SG--FQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
F+A +VD+G + T+L E Y + VS + N + CY S+
Sbjct: 325 DAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNG-EQCYLVST 383
Query: 267 EEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
P + L F+ S ++R +++F + +G +++C+ + I+G +
Sbjct: 384 SISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKD 443
Query: 325 HRIVFDRENLKLAWSHSKCEEVIDKS 350
V+D ++ W+ C ++ S
Sbjct: 444 KVFVYDLARQRIGWASYDCSMSVNVS 469
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 73/320 (22%), Positives = 135/320 (42%), Gaps = 20/320 (6%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+L +D S ++ +V+CS P+C S + C S + C Y Y + + +SGY +
Sbjct: 143 DLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQC-SENNQCGYSFRYG-DGSGTSGYYMT 200
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKA 154
D + + + ++ + ++ GC Q+G A DG+ G G G +SV S L+
Sbjct: 201 DTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSR 260
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ 214
G+ FS C + SG F G + P+ Y + + S + L
Sbjct: 261 GITPPVFSHCLKGDGSGGGVF-VLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPL 319
Query: 215 SG--FQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
F+A +VD+G + T+L E Y + VS + N + CY S+
Sbjct: 320 DAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNG-EQCYLVST 378
Query: 267 EEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
P + L F+ S ++R +++F + +G +++C+ + I+G +
Sbjct: 379 SISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKD 438
Query: 325 HRIVFDRENLKLAWSHSKCE 344
V+D ++ W+ C+
Sbjct: 439 KVFVYDLARQRIGWASYDCK 458
>gi|299471769|emb|CBN76990.1| aspartic protease PM5 [Ectocarpus siliculosus]
Length = 947
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 76/321 (23%), Positives = 136/321 (42%), Gaps = 27/321 (8%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+D S S+SS V+C C C+ K C + YS E +S Y V+D+L + +
Sbjct: 168 WDQSKSTSSHIVTCED--CHGSFRCQKDKR-CGFSQRYS-EGSSWRAYQVEDVLWVGELT 223
Query: 105 KHAPQ------SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI- 157
+ S+ + GC QTG + A DG+MG+ ++ LAKAG I
Sbjct: 224 LQQSEKINHDESAYSVEFMFGCIESQTGLFKTQLA-DGIMGMSADSHTLVWQLAKAGKIK 282
Query: 158 QNSFSICFDENDSGSVFFGDQGPATQ--QSTSFLPIGEKYDAYFVGVESYCIG------N 209
+ +FS+CF +N V G + + P + + V V + +
Sbjct: 283 ERTFSLCFGKNGGTMVIGGYDTRLNKPGHEMMYTPSTKTNGWFTVQVTDITVNRVSIAQD 342
Query: 210 SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
+ Q G +VDSG + T+LP + +++ S + + N +C +S E+
Sbjct: 343 PAIFQRGKGIIVDSGTTDTYLPRSVAKGFSAAWERATGSPYANCKDN--HFCMILTSAEL 400
Query: 270 LKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
+P + + VR ++ + ++ + + T+ G++G N M+ H +
Sbjct: 401 EALPTVTIHMDGGLEVNVRPSGYMDALGKD---NAYAPRIYLTESMGGVLGANVMLDHNV 457
Query: 328 VFDRENLKLAWSHSKCEEVID 348
VFD EN + ++ C+ D
Sbjct: 458 VFDYENHLVGFAEGVCDYRAD 478
>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 389
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 74/295 (25%), Positives = 122/295 (41%), Gaps = 28/295 (9%)
Query: 44 EYDPSSSSSSKNVSC--SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
+Y P++S + ++ C SHP + L C Y Y ++T+ G L +++
Sbjct: 100 KYRPAASITYRDAMCEDSHPKSNPHFAFDPLTRICTYQQHY-LDETNIKGTLAQEMI--- 155
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
+ H V GC GSY G G++GLG+G S+ G + F
Sbjct: 156 TVDTHDGGFKRVHGVYFGCNTLSDGSYFTGT---GILGLGVGKYSI------IGEFGSKF 206
Query: 162 SICFDE----NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 217
S C E S ++ GD G Q + + I E + + +ES +G
Sbjct: 207 SFCLGEISEPKASHNLILGD-GANVQGHPTVINITEGHTIF--QLESIIVGEEITLDDPV 263
Query: 218 QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 277
Q VD+G++ + L T +Y + V FD L+ S+ +S + CY A + E L+ D+
Sbjct: 264 QVFVDTGSTLSHLSTNLYYKFVDAFDDLIGSRPLSYEPT---LCYKADTIERLEKMDVGF 320
Query: 278 IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG--IIGQNFMMGHRIVFD 330
F V H F + + CL + + + IIG M G+ + +D
Sbjct: 321 KFDVGAELSVNIHNI-FIQQGPPEIRCLAIQNNKESFSHVIIGVIAMQGYNVGYD 374
>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
Length = 507
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 119/280 (42%), Gaps = 28/280 (10%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSS----CK-SLKDPCPYIADYSTEDTSSSGYLVD 95
+L+ YD +S++S V C C CK L+ C Y Y + +S++GY V
Sbjct: 121 DLTLYDMKASTTSDAVGCDDNFCSLYDGPLPGCKPGLQ--CLYSVLYG-DGSSTTGYFVQ 177
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKA 154
D + S + + +V+ GCG KQ+G + A DG++G G + S+ S LA +
Sbjct: 178 DFVQYNRISGNFQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASS 237
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGE--------KYDAYFVGVESYC 206
G ++ FS C D D G +F G + FL + Y V ++
Sbjct: 238 GKVKKVFSHCLDNVDGGGIFA--IGEVVEPKVRFLLMNSVMIVVLFLSRAHYNVVMKEIE 295
Query: 207 IGNSCLT------QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 258
+G L +SG + ++DSG + + P E+Y ++ K R+ ++
Sbjct: 296 VGGDPLDVPSDAFESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAF 355
Query: 259 KYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE 298
C++ + P + L F K+ S V H + F E
Sbjct: 356 T-CFDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQVKE 394
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 77/323 (23%), Positives = 137/323 (42%), Gaps = 35/323 (10%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+ P +SSS + VSC+ P C ++ C + C Y Y+ E +SS G L D+L +
Sbjct: 143 RFKPDNSSSYQTVSCNSPDCITKM-CDARVHQCKYERVYA-EMSSSKGVLGKDLLGFGNG 200
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGS-YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S+ P ++ GC +TG YL A DG+MGLG G +S+ L G +++SFS
Sbjct: 201 SRLQPHP-----LLFGCETAETGDLYLQHA--DGIMGLGRGPLSIVDQLVGTGAMEDSFS 253
Query: 163 ICFDENDSG--SVFFGDQGPATQQSTSFLPIGEKYDAYF------VGVESYCIGNSCLTQ 214
+C+ D G S+ G P + F Y+ + V+ +
Sbjct: 254 LCYGGMDEGGGSMVLGAIPPPP--AMVFAKSDPNRSNYYNLELSEIQVQGVSLNVPSEVF 311
Query: 215 SG-FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ---GNSWKY---CYNASSE 267
+G ++DSG ++ +LP + + F ++ + SLQ G Y C+ +
Sbjct: 312 NGRLGTVLDSGTTYAYLPDKAFD----AFKDAITQQLGSLQAVPGPDPSYPDVCFAGAGS 367
Query: 268 EMLKV----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 323
+ + P + +FS NQ + + F + +CL ++G +
Sbjct: 368 DSKALGKHFPPVDFVFSGNQKVFLAPENYLFKHTKVPGAYCLGFFKNQDATTLLGGIVVR 427
Query: 324 GHRIVFDRENLKLAWSHSKCEEV 346
+ +DR N ++ + + C +
Sbjct: 428 NTLVTYDRANHQIGFFKTNCTNL 450
>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
Length = 557
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 82/310 (26%), Positives = 139/310 (44%), Gaps = 37/310 (11%)
Query: 62 LCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 121
L +++ C++ K C Y +Y+ + +SS G L D +HL + + + + GC
Sbjct: 248 LQGNQNYCETCKQ-CDYEIEYA-DQSSSMGVLARDDMHLIATNG----GREKLDFVFGCA 301
Query: 122 RKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQ 178
Q G L A DG++GL +S+PS LA G+I N F C ++ G +F GD
Sbjct: 302 YDQQGQLLSSPAKTDGILGLSNAAISLPSQLASHGIISNIFGHCITREQGGGGYMFLGDD 361
Query: 179 GPATQQSTSFLPIGEKYD-AYFVGVESYCIGNSCLT---QSG--FQALVDSGASFTFLPT 232
+ ++ I D Y G+ L Q+G Q + DSG+S+T+LP
Sbjct: 362 Y-VPRWGITWTSIRSGPDNLYHTEAHHVKYGDQQLRMREQAGNTVQVIFDSGSSYTYLPD 420
Query: 233 EIYAEVVVK-------FDKLVSSKRISLQGNSWKYCYNASSEEMLK--VPDMRLIFSKNQ 283
EIY +V F + S + + L WK + E +K + L F K
Sbjct: 421 EIYENLVAAIKYASPGFVQDSSDRTLPL---CWKADFPVRYLEDVKQFFKPLNLHFGKKW 477
Query: 284 SFVVRNHIFSFPENEGFTV----FCLTVMS-TDGDYG---IIGQNFMMGHRIVFDRENLK 335
F+ + S PE+ CL +++ T+ ++G I+G + G +V+D + +
Sbjct: 478 LFMSKTFTIS-PEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNQRRQ 536
Query: 336 LAWSHSKCEE 345
+ W++S C +
Sbjct: 537 IGWTNSDCTK 546
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 82/338 (24%), Positives = 141/338 (41%), Gaps = 47/338 (13%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSR--SSCKSLK----DPCPYIADYSTEDTSSSGY 92
D+ L +DPS+SS+ SC LC+ +SC S K C Y Y + + ++G+
Sbjct: 71 DQALPYFDPSTSSTLSLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYG-DKSVTTGF 129
Query: 93 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
L D P V GCG G + G+ G G G +S+PS L
Sbjct: 130 LEVDKFTFVGAGASVP------GVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL- 180
Query: 153 KAGLIQNSFSICFDE-----------NDSGSVFFGDQGPATQQSTSFLPIGEKY---DAY 198
K G +FS CF + +F QG Q+T + + Y
Sbjct: 181 KVG----NFSHCFTTITGAIPSTVLLDLPADLFSNGQGAV--QTTPLIQYAKNEANPTLY 234
Query: 199 FVGVESYCIGNS---------CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 249
++ ++ +G++ LT ++DSG S T LP ++Y V +F +
Sbjct: 235 YLSLKGITVGSTRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLP 294
Query: 250 RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVM 308
+ C++A S+ VP + L F + R N++F P++ G ++ CL +
Sbjct: 295 VVPGNATGHYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAIN 354
Query: 309 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
D + IIG +++D +N L++ ++C+++
Sbjct: 355 KGD-ETTIIGNFQQQNMHVLYDLQNNMLSFVAAQCDKL 391
>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 82/324 (25%), Positives = 140/324 (43%), Gaps = 37/324 (11%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+ P +SSS + + C C + C S C Y Y+ E ++S G L D+L
Sbjct: 92 RFKPENSSSYQKIGCRSSDCIT-GLCDSNSHQCKYERMYA-EMSTSKGVLGKDLLDFG-- 147
Query: 104 SKHAPQSSVQSSVI-IGCGRKQTGS-YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
P S +QS ++ GC ++G YL A DG+MGLG G +S+ L G I++SF
Sbjct: 148 ----PASRLQSQLLSFGCETAESGDLYLQVA--DGIMGLGRGPLSIVDQLVGNGAIEDSF 201
Query: 162 SICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKY---DAYFVGVESYCIG-NSCLTQ 214
S+C+ DE V P+ P Y + + V+ + +S +
Sbjct: 202 SLCYGGMDEGGGSMVLGAIPAPSGMVFAKSDPRRSNYYNLELTEIQVQGASLKLDSNVFN 261
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG------NSWKYCYNAS--- 265
F ++DSG ++ +LP + F V ++ SLQ N CY +
Sbjct: 262 GKFGTILDSGTTYAYLPDRAFE----AFTDAVVAQLGSLQAVDGPDPNYPDICYAGAGTD 317
Query: 266 SEEMLK-VPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
++E+ K P + +F++NQ S N++F + G +CL ++G +
Sbjct: 318 TKELGKHFPLVDFVFAENQKVSLAPENYLFKHTKVPG--AYCLGFFKNQDATTLLGGIIV 375
Query: 323 MGHRIVFDRENLKLAWSHSKCEEV 346
+ +DR N ++ + + C E+
Sbjct: 376 RNMLVTYDRYNHQIGFLKTNCTEL 399
>gi|297842525|ref|XP_002889144.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
gi|297334985|gb|EFH65403.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
Length = 467
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 75/325 (23%), Positives = 127/325 (39%), Gaps = 33/325 (10%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
+Y P+ ++ + CSH LC C +D C Y YS + SS G LV D
Sbjct: 110 QYKPNHNT----LPCSHLLCSGLDLTQNRPCDDPEDQCDYEIGYS-DHASSIGALVTDEF 164
Query: 99 HLASFSKHAPQSSVQSSVIIGCG-RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
L K A S + + GCG +Q G++GLG G V + + L G+
Sbjct: 165 PL----KLANGSIMNPHLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGISTQLKSLGIT 220
Query: 158 QNSFSICFDENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
+N C G + GD+ P++ + + L Y G + G
Sbjct: 221 KNVIVHCLSHTGKGFLSIGDELVPSSGVTWTSLATNSASKNYMTGPAELLFNDKTTGVKG 280
Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASS-------- 266
+ DSG+S+T+ E Y ++ K ++ K + + S C+
Sbjct: 281 INVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEV 340
Query: 267 EEMLKVPDMRLIFSKN-QSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNF 321
++ K +R + KN Q F V + +G CL +++ Y I+G
Sbjct: 341 KKYFKTITLRFGYQKNGQLFQVPPESYLIITEKGNV--CLGILNGTEVGLDSYNIVGDIS 398
Query: 322 MMGHRIVFDRENLKLAWSHSKCEEV 346
G +++D E ++ W S C+++
Sbjct: 399 FQGIMVIYDNEKQRIGWISSDCDKI 423
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 82/326 (25%), Positives = 137/326 (42%), Gaps = 46/326 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+DP +SS+ ++ SC C K RS K K C + Y+ + + + G L + L +
Sbjct: 134 FDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKK--CTFRYSYA-DGSFTGGNLASETLTV 190
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
S A + GCG G + + G++GLG G++S+ S L I
Sbjct: 191 DS---TAGKPVSFPGFAFGCGHSSGGIF--DKSSSGIVGLGGGELSLISQLKST--INGL 243
Query: 161 FSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGNSCLT 213
FS C D + S + FG G + T P+ +K Y++ +E +G L
Sbjct: 244 FSYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGKKRLP 303
Query: 214 QSGF---------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
G+ +VDSG ++TFLP E Y+++ + KR+ + CYN
Sbjct: 304 YKGYSKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNT 363
Query: 265 SSEEMLKVPDMRLIFSK-NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ---- 319
++E + P + F N N E+ + C TV T D G++G
Sbjct: 364 TAE--INAPIITAHFKDANVELQPLNTFMRMQED----LVCFTVAPTS-DIGVLGNLAQV 416
Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEE 345
NF++G FD ++++ + C +
Sbjct: 417 NFLVG----FDLRKKRVSFKAADCTQ 438
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 77/304 (25%), Positives = 132/304 (43%), Gaps = 52/304 (17%)
Query: 75 PCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP 134
PC Y DY+ + +S++G+L D A+ S + V GCG + G G
Sbjct: 140 PCGYAYDYA-DGSSTTGFLARDT---ATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTG- 194
Query: 135 DGVMGLGLGDVSVPSLLAKAG-LIQNSFSICFDENDSGS-------VFFGDQGPATQQST 186
GV+GLG G +S P A++G L +FS C + + G +F G P + +
Sbjct: 195 -GVIGLGQGQLSFP---AQSGSLFAQTFSYCLLDLEGGRRGRSSSFLFLGR--PERRAAF 248
Query: 187 SFLPIGEKYDA---YFVGVESYCIGNSCLTQSGFQ----------ALVDSGASFTFLPTE 233
++ P+ A Y+VGV + +GN L G + ++DSG++ T+L
Sbjct: 249 AYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLG 308
Query: 234 IYAEVVVKFDKLVSSKRIS-----LQGNSWKYCYNASSEEMLK-----VPDMRLIFSKNQ 283
Y +V F V RI QG + CYN SS P + + F++
Sbjct: 309 AYLHLVSAFAASVHLPRIPSSATFFQG--LELCYNVSSSSSSAPANGGFPRLTIDFAQGL 366
Query: 284 SFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYG--IIGQNFMMGHRIVFDRENLKLAWS 339
S + N++ ++ V CL + T + ++G G+ + FDR + ++ ++
Sbjct: 367 SLELPTGNYLVDVADD----VKCLAIRPTLSPFAFNVLGNLMQQGYHVEFDRASARIGFA 422
Query: 340 HSKC 343
++C
Sbjct: 423 RTEC 426
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 78/338 (23%), Positives = 140/338 (41%), Gaps = 48/338 (14%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCK---SRSSCKSLKDP---CPYIADYSTEDTSSSGY 92
D+ L +D S SS++ + C CK + + C L C Y Y +++ + G
Sbjct: 71 DQPLPYFDTSRSSTNALLPCESTQCKLDPTVTVCVKLNQTVQTCAYYTSYG-DNSVTIGL 129
Query: 93 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
L D + + V GCG TG + + G+ G G G +S+PS L
Sbjct: 130 LAADKFTFVA-------GTSLPGVTFGCGLNNTGVF--NSNETGIAGFGRGPLSLPSQL- 179
Query: 153 KAGLIQNSFSICFDE-----------NDSGSVFFGDQGPATQQSTSFLPIGEKY---DAY 198
K G +FS CF + +F QG Q+T + + Y
Sbjct: 180 KVG----NFSHCFTTITGAIPSTVLLDLPADLFSNGQGAV--QTTPLIQYAKNEANPTLY 233
Query: 199 FVGVESYCIGNS---------CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 249
++ ++ +G++ LT ++DSG S T LP ++Y V +F +
Sbjct: 234 YLSLKGITVGSTRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLP 293
Query: 250 RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVM 308
+ C++A S+ VP + L F + R N++F P++ G ++ CL +
Sbjct: 294 VVPGNATGHYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAIN 353
Query: 309 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
D + IIG +++D +N L++ ++C+++
Sbjct: 354 KGD-ETTIIGNFQQQNMHVLYDLQNNMLSFVAAQCDKL 390
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 84/337 (24%), Positives = 142/337 (42%), Gaps = 29/337 (8%)
Query: 25 LLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLC-KSRSSCKSLKDPCPYIADYS 83
L W V + + RN +DP S++ +N+SC LC K + S + C Y Y+
Sbjct: 95 LTWTSCVPCNNCYKQRN-PMFDPQKSTTYRNISCDSKLCHKLDTGVCSPQKRCNYTYAYA 153
Query: 84 TEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLG 143
+ + G L + + L+S +S ++ GCG TG + D G++GLG G
Sbjct: 154 SAAITR-GVLAQETITLSSTKG---KSVPLKGIVFGCGHNNTGGFNDHEM--GIIGLGGG 207
Query: 144 DVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA- 197
VS+ S + + FS C D + S + FG + + P+ K D
Sbjct: 208 PVSLISQMGSS-FGGKRFSQCLVPFHTDVSVSSKMSFGKGSKVSGKGVVSTPLVAKQDKT 266
Query: 198 -YFVGVESYCIGNSCLTQSG-------FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 249
YFV + + N+ L +G +DSG T LPT++Y +VV + V+ K
Sbjct: 267 PYFVTLLGISVENTYLHFNGSSQNVEKGNMFLDSGTPPTILPTQLYDQVVAQVRSEVAMK 326
Query: 250 RISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM 308
++ + + CY ++ L+ P + F + F P++ VFCL
Sbjct: 327 PVTDDPDLGPQLCYR--TKNNLRGPVLTAHFEGADVKLSPTQTFISPKDG---VFCLGFT 381
Query: 309 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
+T D G+ G + I FD + +++ C +
Sbjct: 382 NTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPKDCTK 418
>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 85/352 (24%), Positives = 138/352 (39%), Gaps = 63/352 (17%)
Query: 37 VQDRNLSEYDPSSSSSSKNVSCSHPLC--------KSR-SSCKSLKDPC-----PYIADY 82
++ + + P SSSSK + C +P C +S+ C S C PY+ Y
Sbjct: 125 IKKTGIPTFLPKLSSSSKLIGCKNPRCSMIFGPEIQSKCQECDSTAQNCTQTCPPYVIQY 184
Query: 83 STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL 142
+ S++G L+ + L P ++GC S P+G+ G G
Sbjct: 185 GSG--STAGLLLSETLDF-------PNKKTIPDFLVGC------SIFSIKQPEGIAGFGR 229
Query: 143 GDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQG-------PATQQSTSFL--PIGE 193
S+PS L S FD+ + S D G A T FL P
Sbjct: 230 SPESLPSQLGLKKFSYCLVSHAFDDTPTSSDLVLDTGSGSGVTKTAGLSHTPFLKNPTTA 289
Query: 194 KYDAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFD 243
D Y+V + + IG++ + T +VDSG +FTF+ +Y V +F+
Sbjct: 290 FRDYYYVLLRNIVIGDTHVKVPYKFLVPGTDGNGGTIVDSGTTFTFMENPVYELVAKEFE 349
Query: 244 KLVSSKRISLQGNS---WKYCYNASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEG 299
K ++ ++ + + + CYN S E+ L VPD+ F + ++ FS ++
Sbjct: 350 KQMAHYTVATEIQNLTGLRPCYNISGEKSLSVPDLIFQFKGGAKMALPLSNYFSIVDSG- 408
Query: 300 FTVFCLTVMSTDGDYG--------IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
V CLT++S + I+G + FD EN K + C
Sbjct: 409 --VICLTIVSDNVAGPGLGGGPAIILGNYQQRNFYVEFDLENEKFGFKQQSC 458
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 79/334 (23%), Positives = 136/334 (40%), Gaps = 47/334 (14%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
L +D S+S + V C+ P+C++ C Y +Y +++ + G L D
Sbjct: 132 LPRFDTSASDTVHGVLCTDPICRALRPHACFLGGCTYQVNYG-DNSVTIGQLAKDSF--- 187
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
+F ++ GCG+ TG++ G+ G G G +S+P L + SF
Sbjct: 188 TFDGKGGGKVTVPDLVFGCGQYNTGNFHSNET--GIAGFGRGPLSLPRQLGVS-----SF 240
Query: 162 SICFD---ENDSGSVFFGD----------QGPATQQSTSFLPIGEKYDAYFVGVESYCIG 208
S CF E+ S VF G GP ST FLP +Y Y++ ++ +G
Sbjct: 241 SYCFTTIFESKSTPVFLGGAPADGLRAHATGPIL--STPFLPNHPEY--YYLSLKGITVG 296
Query: 209 NSCLT--QSGF--------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ--GN 256
+ L +S F ++DSG + T P ++ + F V S G
Sbjct: 297 KTRLAVPESAFVVKADGSGGTIIDSGTAITAFPRAVFRSLWEAFVAQVPLPHTSYNDTGE 356
Query: 257 SWKYCYNASS---EEMLKVPDMRL-IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG 312
C++ S + VP M L + + N++ +P+++ C+ V++ D
Sbjct: 357 PTLQCFSTESVPDASKVPVPKMTLHLEGADWELPRENYMAEYPDSD---QLCVVVLAGDD 413
Query: 313 DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
D +IG IV D KL ++C+++
Sbjct: 414 DRTMIGNFQQQNMHIVHDLAGNKLVIEPAQCDKM 447
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 82/319 (25%), Positives = 136/319 (42%), Gaps = 31/319 (9%)
Query: 45 YDPSSSSSSKNVSCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+DP SS+ N+SC PLC K + S + C Y Y +++ + G L D A+F
Sbjct: 110 FDPLKSSTYNNISCDSPLCHKLDTGVCSPEKRCNYTYGYG-DNSLTKGVLAQDT---ATF 165
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI--QNSF 161
+ + + S + GCG TG + D G++GLG G SL+++ G + F
Sbjct: 166 TSNTGKPVSLSRFLFGCGHNNTGGFNDHEM--GLIGLGGGPT---SLISQIGPLFGGKKF 220
Query: 162 SICF-----DENDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYFVGV------ESYCIG 208
S C D S + FG P+ EK +YFV + ++Y
Sbjct: 221 SQCLVPFLTDIKISSRMSFGKGSQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDTYFPM 280
Query: 209 NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSE 267
NS + ++ LVDSG LP ++Y +V + V+ K I+ + + CY +
Sbjct: 281 NSTIGKA--NMLVDSGTPPILLPQQLYDKVFAEVRNKVALKPITDDPSLGTQLCYRTQTN 338
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS-TDGDYGIIGQNFMMGHR 326
LK P + F + F P + +FCL + + T+ D G+ G +
Sbjct: 339 --LKGPTLTFHFVGANVLLTPIQTFIPPTPQTKGIFCLAIYNRTNSDPGVYGNFAQSNYL 396
Query: 327 IVFDRENLKLAWSHSKCEE 345
I FD + +++ + C +
Sbjct: 397 IGFDLDRQVVSFKPTDCTK 415
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 77/323 (23%), Positives = 140/323 (43%), Gaps = 42/323 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKD--PCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP +SS+ ++ SC C + + +S ++ C ++ Y+ + + + G L + L +AS
Sbjct: 134 FDPKNSSTYRDSSCGTSFCLALGNDRSCRNGKKCTFMYSYA-DGSFTGGNLAVETLTVAS 192
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
A + GC + G + + ++ G++GLG+ ++S+ S L I FS
Sbjct: 193 ---TAGKPVSFPGFAFGCVHRSGGIFDEHSS--GIVGLGVAELSMISQLKST--INGRFS 245
Query: 163 ICF-----DENDSGSVFFGDQGPATQQSTSFLPI---GEKYDAYFVGVESYCIGNSCLTQ 214
C D + S + FG G + T P+ G Y + +E + +G L+
Sbjct: 246 YCLLPVFTDSSMSSRINFGRSGIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSY 305
Query: 215 SGF---------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
GF +VDSG ++T+LP E Y ++ + KR+ CYN +
Sbjct: 306 KGFSKKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYNTT 365
Query: 266 SEEMLKVPDMRLIFSK-NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ----N 320
++ + P + F N N E+ + C TV+ T D GI+G N
Sbjct: 366 VDQ-IDAPIITAHFKDANVELQPWNTFLRMQED----LVCFTVLPTS-DIGILGNLAQVN 419
Query: 321 FMMGHRIVFDRENLKLAWSHSKC 343
F++G FD ++++ + C
Sbjct: 420 FLVG----FDLRKKRVSFKAADC 438
>gi|348690233|gb|EGZ30047.1| hypothetical protein PHYSODRAFT_474645 [Phytophthora sojae]
Length = 642
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 86/356 (24%), Positives = 146/356 (41%), Gaps = 40/356 (11%)
Query: 53 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQ--- 109
SK+ + + C SC+S + YI+ E + +VD+++ + FS A +
Sbjct: 140 SKSTTAKYLACHDFDSCRSCEQDRCYISQSYMEGSMWEAVMVDELVWVGGFSSPADEMEG 199
Query: 110 --SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFD 166
+ +GC K+TG ++ +G+MGLG +V S + AG + QN F++CF
Sbjct: 200 VLKTFGFRFPVGCQTKETGLFIT-QKENGIMGLGRHRSTVMSYMLNAGRVTQNLFTLCF- 257
Query: 167 ENDSGSVFFGDQGPATQQS-TSFLPIGEKYDAYF-VGVESYCIGNSCL------TQSGFQ 218
D G + FG + S + P+ AY+ V V+ + L SG
Sbjct: 258 AGDGGELVFGGVDYSHHTSDVGYTPLLSDKSAYYPVHVKDILLNGVSLGIDTGTINSGRG 317
Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLV----SSKRISLQGNSWKYCYNASSEEMLKVPD 274
+VDSG + TF + + F K S R+ L +SEE+ +P
Sbjct: 318 VIVDSGTTDTFFDGKGKRAFMSAFSKAAGRDYSESRMKL-----------TSEELAALPV 366
Query: 275 MRLIFSKNQSFVVRNHIFSFPENEGFT------VFCLTVMSTDGDYGIIGQNFMMGHRIV 328
+ +I S + + P ++ T + ++ G++G + M+G ++
Sbjct: 367 ISIILSGMKGDGTDDVQLDVPASQYLTPADDGKSYYGNFHFSERSGGVLGASAMVGFDVI 426
Query: 329 FDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPN-PLPTTEQQSTSNGQAAAPP 383
FD EN ++ ++ S C S+ P A S N P P T SN P
Sbjct: 427 FDVENKRVGFAESDCGR--SYSNATTAAPIASDSTNQPAPATPVSVDSNATEQPAP 480
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 145/331 (43%), Gaps = 40/331 (12%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD--PCPYIADYSTEDTSSSGYLVDDI 97
+N +DPS S++ KNV+CS P+C S D C Y Y +D+ S G L D
Sbjct: 120 QNAPMFDPSKSTTYKNVACSSPVCSYSGDGSSCSDDSECLYSIAYG-DDSHSQGNLAVDT 178
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
+ + S S + +IGCG G++ A G++GLG G S+ + L A
Sbjct: 179 VTMQSTSG---RPVAFPRTVIGCGHDNAGTF--NANVSGIVGLGRGPASLVTQLGPA--T 231
Query: 158 QNSFSICF------DENDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYF-VGVESYCIG 208
FS C NDS + FG + T PI +Y ++ + +E+ +G
Sbjct: 232 GGKFSYCLIPIGTGSTNDSTKLNFGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVG 291
Query: 209 NSCL------TQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
++ ++ G ++ ++DSG + T+LP+ + + +S Y
Sbjct: 292 DTKFNFPEGASKLGGESNIIIDSGTTLTYLPSALLNSFGSAISQSMSLPHAQDPSEFLDY 351
Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD----YGI 316
C+ A++ + ++P + + F + R ++F ++ CL S D YG
Sbjct: 352 CF-ATTTDDYEMPPVTMHFEGADVPLQRENLFVRLSDD---TICLAFGSFPDDNIFIYGN 407
Query: 317 IGQ-NFMMGHRIVFDRENLKLAWSHSKCEEV 346
I Q NF++G +D +NL +++ + C V
Sbjct: 408 IAQSNFLVG----YDIKNLAVSFQPAHCGAV 434
>gi|56692305|dbj|BAD80835.1| nucellin-like protein [Daucus carota]
Length = 426
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 81/319 (25%), Positives = 138/319 (43%), Gaps = 47/319 (14%)
Query: 56 VSCSHPLCKS----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI--LHLASFSKHAPQ 109
V C P+C S C D C Y +Y+ + SS G LV+D+ ++L S + P+
Sbjct: 117 VVCKDPICASLHPDNYRCDD-PDQCDYEVEYA-DGGSSIGVLVNDLFPVNLTSGMRARPR 174
Query: 110 SSVQSSVIIGCGRKQTGSYLDGAAP---DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 166
+ IGCG Q L G A DGV+GLG G S+ + L+ GL++N CF
Sbjct: 175 ------LTIGCGYDQ----LPGIAYHPLDGVLGLGRGSSSIVAQLSSQGLVRNVVGHCFS 224
Query: 167 ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV---DS 223
G +FFGD + P+ Y ++ + I N SG + L+ DS
Sbjct: 225 RRGGGYLFFGDD-IYDSSKVIWTPMSRDYLKHYTPGFAELILNG--RSSGLKNLLVVFDS 281
Query: 224 GASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASSEEMLKVPDMRLIFS- 280
G+S+T+ T+ Y ++ K + K + +++ ++ C+ + + D + F
Sbjct: 282 GSSYTYFNTQTYQTLLSFIKKDLHGKPLKEAVEDDTLPVCWRG-KKPFKSIRDAKKYFKP 340
Query: 281 ---------KNQS-FVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMGHR 326
K +S F ++ + ++G CL +++ +Y IIG M
Sbjct: 341 LALSFGSGWKTKSQFEIQQESYLIISSKGSV--CLGILNGTEVGLQNYNIIGDISMQEKL 398
Query: 327 IVFDRENLKLAWSHSKCEE 345
+++D E + W S C+
Sbjct: 399 VIYDNEKQVIGWQPSNCDR 417
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 95/340 (27%), Positives = 141/340 (41%), Gaps = 59/340 (17%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
YDPS+SS+ SCS C+S S C S C Y Y + +S+ G + L L S
Sbjct: 46 YDPSASSTFAKTSCSTSSCQSLPASGCSSSAKTCIYGYQYG-DSSSTQGDFALETLTLRS 104
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S + GCGR +GS+ GAA G++GLG G +S+ + L A I N FS
Sbjct: 105 ---SGGSSKAFPNFQFGCGRLNSGSF-GGAA--GIVGLGQGKISLSTQLGSA--INNKFS 156
Query: 163 IC---FDENDSGS--VFFGDQGPATQQ--STSFLPIGEKYDAYFVGVESYCIGNSCLT-- 213
C FD++ S + + FG ST +P + YFVG+E +G L+
Sbjct: 157 YCLVDFDDDSSKTSPLIFGSSASTGSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLA 216
Query: 214 ----------------------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
SG + DSG + T L +Y++V F VS +
Sbjct: 217 TRAIDFLSVRSKKKLRVRALEVNSG-GTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTV 275
Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGF-------TVFC 304
+ + CY+ S + K P + L F + FS P+ F TV C
Sbjct: 276 DASSSGFDLCYDVSKSKNFKFPALTLAF--------KGTKFSPPQKNYFVIVDTAETVAC 327
Query: 305 LTVMSTDGDYGIIGQNFM-MGHRIVFDRENLKLAWSHSKC 343
L + + I N M + +V+DR ++ S ++C
Sbjct: 328 LAMGGSGSLGLGIIGNLMQQNYHVVYDRGTSTISMSPAQC 367
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 90/301 (29%), Positives = 143/301 (47%), Gaps = 31/301 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKS-LKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
+DPS S++ K + S C+S +SC S + C Y Y + + S G L + L L
Sbjct: 128 FDPSKSNTYKILPFSSTTCQSVEDTSCSSDNRKMCEYTIYYG-DGSYSQGDLSVETLTLG 186
Query: 102 SFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-VPSLLAKAGLIQN 159
S SSV+ +IGCGR T S+ +G + G++GLG G VS + L ++ I
Sbjct: 187 S----TNGSSVKFRRTVIGCGRNNTVSF-EGKS-SGIVGLGNGPVSLINQLRRRSSSIGR 240
Query: 160 SFSICFDE--NDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCL--T 213
FS C N S + FGD + T PI + Y++ +E++ +GN+ + T
Sbjct: 241 KFSYCLASMSNISSKLNFGDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFT 300
Query: 214 QSGFQ------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
S F+ ++DSG + T LP +IY+++ LV R+ CY ++ +
Sbjct: 301 SSSFRFGEKGNIIIDSGTTLTLLPNDIYSKLESAVADLVELDRVKDPLKQLSLCYRSTFD 360
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD--YGIIG-QNFMMG 324
E L P + FS V N + +F E E V CL +S+ +G + QNF++G
Sbjct: 361 E-LNAPVIMAHFSGAD--VKLNAVNTFIEVEQ-GVTCLAFISSKIGPIFGNMAQQNFLVG 416
Query: 325 H 325
+
Sbjct: 417 Y 417
>gi|308813706|ref|XP_003084159.1| Aspartyl protease (ISS) [Ostreococcus tauri]
gi|116056042|emb|CAL58575.1| Aspartyl protease (ISS) [Ostreococcus tauri]
Length = 478
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 77/337 (22%), Positives = 141/337 (41%), Gaps = 50/337 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
YD +S+ V CS C C Y Y E + S GYLV D++ L
Sbjct: 77 YDYDASADFSRVECS--ACAGIGGKCGTSGVCRYDVHY-LEGSGSEGYLVRDVVSLGG-- 131
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
S ++V+ GC ++ GS + + DG+ G G ++ + LA A +I + FS+C
Sbjct: 132 -----SVGNATVVFGCEERELGS-IKQQSADGLFGFGRQAYALRAQLASASVIDDLFSMC 185
Query: 165 FDENDS------------GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
+ + G+ FG PA + P+ Y V S+ +GNS +
Sbjct: 186 VEGYEKLSGEHVGGLLTLGNFDFGADAPAL----VYTPMVSSAMYYQVTTTSWTLGNSVV 241
Query: 213 TQS-GFQALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQ-------------GN 256
S G ++DSG S+T++P ++A +F +L +++ L+ GN
Sbjct: 242 EGSRGVLTIIDSGTSYTYVPGNMHA----RFLQLAEDAARESGLEKVAPPEDYPDLCFGN 297
Query: 257 SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGI 316
S ++ SE P +++ + + + + + + + FC+ ++ D + +
Sbjct: 298 SGGLGWSTVSEYF---PALKIEYHGSARLTLSPETYLYWHQKNASAFCVGILEHDDNRIL 354
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVH 353
+GQ M FD ++ + + CE + +K H
Sbjct: 355 LGQITMRNTFTEFDVARSQVGMASANCEMLREKYVEH 391
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 79/302 (26%), Positives = 130/302 (43%), Gaps = 27/302 (8%)
Query: 45 YDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLA 101
+ PS S++ +SC C+ S++SC + + C Y Y+ D S + G L + A
Sbjct: 144 FHPSRSTTYSLLSCQSAACQALSQASCDADSE-CQY--QYAYGDGSRTIGVLSTETFSFA 200
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
+ V GC GS+ DG++GLG G +S+ S L A I F
Sbjct: 201 AAGGGGEGQVRVPRVSFGCSTGSAGSFRS----DGLVGLGAGALSLVSQLGAAARIARRF 256
Query: 162 SICF-----DENDSGSVFFGDQGPATQQSTSFLP-IGEKYDAYF-VGVESYCI-GNSCLT 213
S C N S ++ FG + + + P + + D+Y+ V +ES + G +
Sbjct: 257 SYCLVPPYAAANSSSTLSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAGQDVAS 316
Query: 214 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA---SSEEML 270
+ + +VDSG + TFL + +V + ++ + R + CY+ S E
Sbjct: 317 ANSSRIIVDSGTTLTFLDPALLRPLVAELERRIRLPRAQPPEQLLQLCYDVQGKSQAEDF 376
Query: 271 KVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIG----QNFMMGH 325
+PD+ L F S +R + FS E EG L +S I+G QNF +G+
Sbjct: 377 GIPDVTLRFGGGASVTLRPENTFSLLE-EGTLCLVLVPVSESQPVSILGNIAQQNFHVGY 435
Query: 326 RI 327
+
Sbjct: 436 DL 437
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 86/361 (23%), Positives = 144/361 (39%), Gaps = 53/361 (14%)
Query: 15 NALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLC------KSRSS 68
N L C P CL F D+ +DP +S+S +NV+C C + +
Sbjct: 174 NWLQCAP------CLDCF------DQRGPVFDPMASTSYRNVTCGDTRCGLVSPPAAPRT 221
Query: 69 CKSLK-DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 127
C+S + DPCPY Y + ++ D L + + A S V++GCG + G
Sbjct: 222 CRSSRSDPCPYYYWYGDQSNTTG----DLALEAFTVNLTASSSRRVDGVVLGCGHRNRGL 277
Query: 128 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG---SVFFGDQGPATQQ 184
+ A G+ L S L A G ++FS C ++ S + FGD
Sbjct: 278 FHGAAGLLGLGRGPLSFAS--QLRAVYG---HAFSYCLVDHGSAVGSKIVFGDDNVLLSH 332
Query: 185 S----TSFLPIGEKYDAYFVGVESYCIGNSCL-----------TQSGFQALVDSGASFTF 229
T+F P + Y+V ++ +G L ++DSG + ++
Sbjct: 333 PQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKEDGSGGTIIDSGTTLSY 392
Query: 230 LPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ--SFV 286
P Y + F D++ + + CYN S E ++VP+ L+F+ F
Sbjct: 393 FPEPAYKAIRQAFVDRMDKAYPLIADFPVLSPCYNVSGVERVEVPEFSLLFADGAVWDFP 452
Query: 287 VRNHIFSFPENEGFTVFCLTVMST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
N+ + EG + CL V+ T IIG +++D + +L ++ +C E
Sbjct: 453 AENYFIRL-DTEG--IMCLAVLGTPRSAMSIIGNYQQQNFHVLYDLHHNRLGFAPRRCAE 509
Query: 346 V 346
V
Sbjct: 510 V 510
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 83/355 (23%), Positives = 144/355 (40%), Gaps = 32/355 (9%)
Query: 5 ICFGSHANAYNALLCLPVTTLLW--CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPL 62
+ FG+ A Y L+ + + W CL G Q + +DP+ S++ V C HP
Sbjct: 124 VGFGTPAQTYT-LMFDTGSDVSWIQCLPCSGHCYKQHDPI--FDPTKSATYSAVPCGHPQ 180
Query: 63 CKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGR 122
C + S C Y Y + +S++G L + L L S + GCG
Sbjct: 181 CAAAGGKCSSNGTCLYKVQYG-DGSSTAGVLSHETLSLTS-------ARALPGFAFGCGE 232
Query: 123 KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPAT 182
G + D DG++GLG G +S+ S A + S+ + G + G PA+
Sbjct: 233 TNLGDFGDV---DGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNTSHGYLTIGTTTPAS 289
Query: 183 -QQSTSFLPIGEKYDA---YFVGVESYCIGNSCL-------TQSGFQALVDSGASFTFLP 231
+ + +K D YFV + S +G L T+ G L+DSG T+LP
Sbjct: 290 GSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRDG--TLLDSGTVLTYLP 347
Query: 232 TEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNH- 290
E Y + +F ++ + + + + CY+ + + + +P + FS SF +
Sbjct: 348 PEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFAGQNAIFMPLVSFKFSDGSSFDLSPFG 407
Query: 291 IFSFPENEGFTVFCLTVMSTDGD--YGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
+ FP++ CL + + I+G +++D K+ + C
Sbjct: 408 VLIFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFVSGSC 462
>gi|357520119|ref|XP_003630348.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355524370|gb|AET04824.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 435
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 73/320 (22%), Positives = 131/320 (40%), Gaps = 27/320 (8%)
Query: 47 PSSSSSSKNVSCSHPLCKSRSSCK--SLKDP--CPYIADYSTEDTSSSGYLVDDILHLAS 102
P S+ + C PLC S + +DP C Y Y+ + S+ G L++D+ +L +
Sbjct: 115 PLYKPSNDFIPCKDPLCASLQPTDDYTCEDPNQCDYEIKYA-DQYSTLGVLLNDV-YLLN 172
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
F+ ++ + +GCG Q S DG++GLG G S+ S L GL++N
Sbjct: 173 FTNGV---QLKVRMALGCGYDQIFSPSTYHPLDGILGLGRGKASLISQLNSQGLVRNVMG 229
Query: 163 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVD 222
C G +FFG+ +++ S + + + Y G G + D
Sbjct: 230 HCLSSRGGGYIFFGNVYDSSRMSWTPISSIDSGKHYSAGPAELVFGGRKTGVGSLNIIFD 289
Query: 223 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS------W--KYCYNASSEEMLKVPD 274
+G+S+T+ ++ Y ++ +K + K I + W K + + +E
Sbjct: 290 TGSSYTYFNSQAYQAMISLLNKELHRKPIKAAPDDQTLPMCWHGKRPFRSINEVKKYFKP 349
Query: 275 MRLIFSK----NQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMGHR 326
+ L F+ F + + N G CL +++ G+ +IG M+
Sbjct: 350 LTLSFTNGGRVKPQFEIPPEAYLIISNMGNV--CLGILNGPEVGLGELNLIGDISMLDKV 407
Query: 327 IVFDRENLKLAWSHSKCEEV 346
+VFD E + W + C V
Sbjct: 408 MVFDNEKQLIGWGPADCNSV 427
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 84/338 (24%), Positives = 140/338 (41%), Gaps = 30/338 (8%)
Query: 25 LLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLC-KSRSSCKSLKDPCPYIADYS 83
L W V + RN +DP S+S +N+SC LC K + S + C Y Y+
Sbjct: 48 LTWTSCVPCNKCYKQRN-PIFDPQKSTSYRNISCDSKLCHKLDTGVCSPQKHCNYTYAYA 106
Query: 84 TEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLG 143
+ + G L + + L+S +S ++ GCG TG + D G++GLG G
Sbjct: 107 SAAI-TQGVLAQETITLSSTKG---ESVPLKGIVFGCGHNNTGGFNDREM--GIIGLGGG 160
Query: 144 DVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA- 197
VS S + + FS C D + S + G + + P+ K D
Sbjct: 161 PVSFISQIGSS-FGGKRFSQCLVPFHTDVSVSSKMSLGKGSEVSGKGVVSTPLVAKQDKT 219
Query: 198 -YFVGVESYCIGNSCLTQSG--------FQALVDSGASFTFLPTEIYAEVVVKFDKLVSS 248
YFV + +GN+ L +G +DSG T LPT++Y +V + V+
Sbjct: 220 PYFVTLLGISVGNTYLHFNGSSSQSVEKGNVFLDSGTPPTILPTQLYDRLVAQVRSEVAM 279
Query: 249 KRISLQGN-SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV 307
K ++ + + CY ++ L+ P + F ++ F P++ VFCL
Sbjct: 280 KPVTNDLDLGPQLCYR--TKNNLRGPVLTAHFEGGDVKLLPTQTFVSPKDG---VFCLGF 334
Query: 308 MSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
+T D G+ G + I FD + +++ C +
Sbjct: 335 TNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPMDCTK 372
>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
Length = 395
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 63/223 (28%), Positives = 98/223 (43%), Gaps = 14/223 (6%)
Query: 51 SSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+ +K V C +C + R C S K C Y Y+ + SS G LV D L
Sbjct: 104 TKNKLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYA-DQGSSLGVLVTDSFAL--- 159
Query: 104 SKHAPQSSVQSSVIIGCGR-KQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
+ A S V+ + GCG +Q GS + +A DGV+GLG G VS+ S L + G+ +N
Sbjct: 160 -RLANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVG 218
Query: 163 ICFDENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV 221
C G +FFGD P ++ + + + + Y G + G L + +
Sbjct: 219 HCLSTRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVF 278
Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
DSG+SFT+ + Y +V +S + +S C+
Sbjct: 279 DSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKG 321
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 87/331 (26%), Positives = 140/331 (42%), Gaps = 43/331 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDI- 97
YDP SSS +N++C P CK SS CK CPY Y ++ + ++
Sbjct: 234 YDPKESSSFENITCHDPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFT 293
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
++L + + + Q V++ V+ GCG G + A ++GLG G +S S L +
Sbjct: 294 VNLTTPNGKSEQKHVEN-VMFGCGHWNRGLFHGAAG---LLGLGRGPLSFASQLQ--SIY 347
Query: 158 QNSFSICF-DENDSGSV----FFGDQGPATQQS----TSFLPIGEKYDA---YFVGVESY 205
+SFS C D N SV FG+ TSF+ GE+ Y+VG++S
Sbjct: 348 GHSFSYCLVDRNSDTSVSSKLIFGEDKELLSHPNLNFTSFVG-GEENSVDTFYYVGIKSI 406
Query: 206 CIGNSCLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 255
+ L + G ++DSG + T+ Y + F K + +
Sbjct: 407 MVDGEVLKIPEETWHLSKEGGGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVEGF 466
Query: 256 NSWKYCYNASSEEMLKVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMST-DG 312
K CYN S E +++PD ++FS F V N+ + + CL ++ T
Sbjct: 467 PPLKPCYNVSGIEKMELPDFGILFSDGAMWDFPVENYFIQIEPD----LVCLAILGTPKS 522
Query: 313 DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
IIG I++D + +L ++ KC
Sbjct: 523 ALSIIGNYQQQNFHILYDMKKSRLGYAPMKC 553
>gi|30699261|ref|NP_850981.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17065172|gb|AAL32740.1| nucellin-like protein [Arabidopsis thaliana]
gi|24899795|gb|AAN65112.1| nucellin-like protein [Arabidopsis thaliana]
gi|332197863|gb|AEE35984.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 466
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 77/329 (23%), Positives = 130/329 (39%), Gaps = 31/329 (9%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
+Y P+ ++ + CSH LC C +D C Y YS + SS G LV D +
Sbjct: 109 QYKPNHNT----LPCSHILCSGLDLPQDRPCADPEDQCDYEIGYS-DHASSIGALVTDEV 163
Query: 99 HLASFSKHAPQSSVQSSVIIGCG-RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
L K A S + + GCG +Q G++GLG G V + + L G+
Sbjct: 164 PL----KLANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGIT 219
Query: 158 QNSFSICFDENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
+N C G + GD+ P++ + + L Y G + G
Sbjct: 220 KNVIVHCLSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKG 279
Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS--LQGNSWKYCYNASS-------- 266
+ DSG+S+T+ E Y ++ K ++ K ++ S C+
Sbjct: 280 INVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEV 339
Query: 267 EEMLKVPDMRLIFSKN-QSFVVRNHIFSFPENEG---FTVFCLTVMSTDGDYGIIGQNFM 322
++ K +R KN Q F V + +G + T + +G Y IIG
Sbjct: 340 KKYFKTITLRFGNQKNGQLFQVPPESYLIITEKGRVCLGILNGTEIGLEG-YNIIGDISF 398
Query: 323 MGHRIVFDRENLKLAWSHSKCEEVIDKSH 351
G +++D E ++ W S C+++ + +H
Sbjct: 399 QGIMVIYDNEKQRIGWISSDCDKLPNVNH 427
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 73/314 (23%), Positives = 127/314 (40%), Gaps = 32/314 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DP+ SS+ N+SC+ P C + C Y Y + + S G+ D L L+S+
Sbjct: 229 FDPARSSTDANISCAAPACSDLYTKGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY- 286
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSI 163
GCG + G + + A G++GLG G S+P K G + F+
Sbjct: 287 ------DAIKGFRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQAYDKYGGV---FAH 334
Query: 164 CFDENDSGSVFFGDQGPATQQS-----TSFLPIGEKYDAYFVGVESYCIGN-------SC 211
CF SG+ + D GP + + T+ + + Y+VG+ +G S
Sbjct: 335 CFPARSSGTGYL-DFGPGSSPAVSTKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSV 393
Query: 212 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEM 269
T +G +VDSG T LP Y+ + F ++++ + + + CY+ +
Sbjct: 394 FTTAG--TIVDSGTVITRLPPAAYSSLRSAFASAIAARGYKKAPALSLLDTCYDFTGMSQ 451
Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 329
+ +P + L+F S V + + D D GI+G + +V+
Sbjct: 452 VAIPTVSLLFQGGASLDVDASGIIYAASVSQACLGFAANEEDDDVGIVGNTQLKTFGVVY 511
Query: 330 DRENLKLAWSHSKC 343
D + +S C
Sbjct: 512 DIGKKVVGFSPGAC 525
>gi|356554625|ref|XP_003545645.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 452
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 82/318 (25%), Positives = 128/318 (40%), Gaps = 40/318 (12%)
Query: 56 VSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 110
V C LC +C S D C Y +Y+ + SS G LV D + + S
Sbjct: 114 VQCVDQLCSEVQLSMEYTCASPDDQCDYEVEYA-DHGSSLGVLVRDYIPF----QFTNGS 168
Query: 111 SVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 169
V+ V GCG Q S + A GV+GLG G S+ S L GLI N C
Sbjct: 169 VVRPRVAFGCGYDQKYSGSNSPPATSGVLGLGNGRASILSQLHSLGLIHNVVGHCLSARG 228
Query: 170 SGSVFFGDQGPATQQ--STSFLP-IGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGAS 226
G +FFGD + TS LP EK+ Y G G + + DSG+S
Sbjct: 229 GGFLFFGDDFIPSSGIVWTSMLPSSSEKH--YSSGPAELVFNGKATVVKGLELIFDSGSS 286
Query: 227 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNS------WKYCYNASSEEMLKVPDMRLIFS 280
+T+ ++ Y VV + + K++ + WK + S +K L S
Sbjct: 287 YTYFNSQAYQAVVDLVTQDLKGKQLKRATDDPSLPICWKGAKSFKSLSDVKKYFKPLALS 346
Query: 281 KNQSFVVRNHIFSFPENEGFTVF------CLTVMSTDG------DYGIIGQNFMMGHRIV 328
++ +++ H+ E + + CL ++ DG + IIG + ++
Sbjct: 347 FTKTKILQMHL----PPEAYLIITKHGNVCLGIL--DGTEVGLENLNIIGDISLQDKMVI 400
Query: 329 FDRENLKLAWSHSKCEEV 346
+D E ++ W S C+ +
Sbjct: 401 YDNEKQQIGWVSSNCDRL 418
>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 686
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 78/295 (26%), Positives = 128/295 (43%), Gaps = 37/295 (12%)
Query: 76 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAP 134
C Y +Y+ + +SS G L D LHL A S + ++ GC Q G L+ A
Sbjct: 390 CDYEIEYA-DHSSSMGVLASDDLHLML----ANGSLTKLGIMFGCAYDQQGLLLNSLAKT 444
Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPI- 191
DG++GL VS+PS LA +I N C D G +F GD +++P+
Sbjct: 445 DGILGLSKAKVSLPSQLASQRIINNVLGHCLTSDATGGGYMFLGDDF-VPYWGMAWVPML 503
Query: 192 ---GEKYDAYFVGVESYCIGNSCLTQSGF--QALVDSGASFTFLPTEIYAEVVVKFDKLV 246
Y + + + S Q G + + D+G+S+T+ P E Y +V K V
Sbjct: 504 NSHSPNYHSQIMKISHGSRQLSLGRQDGRTERVVFDTGSSYTYFPKEAYYALVASL-KDV 562
Query: 247 SSKRISLQGN--SWKYCYNASSEEMLKVPDMRLIFS------KNQSFVVRNHIFSFPENE 298
S + + G+ + C+ A + V D++ F +++ ++V F P E
Sbjct: 563 SDEGLIQDGSDPTLPVCWRAKF-PIRSVIDVKQFFQPLTLQFRSKWWIVSTK-FRIPP-E 619
Query: 299 GFTVF------CLTVMST----DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
G+ + CL ++ DG I+G + G +V+D N K+ W+ S C
Sbjct: 620 GYLIISNKGNVCLGILDGSNVHDGSTIILGDISLRGKLVVYDNVNQKIGWAQSTC 674
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 75/324 (23%), Positives = 133/324 (41%), Gaps = 45/324 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
++PS S S + V C+ C+S C S C Y+ +Y + + +SG + +
Sbjct: 106 FNPSKSPSYRTVLCNSLTCRSLQLATGNSGVCGSNPPTCNYVVNYG-DGSYTSGEVGMEH 164
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
L+L + + ++ I GCGRK G + G++GLG D+S+ S ++ +
Sbjct: 165 LNLGN--------TTVNNFIFGCGRKNQGLF---GGASGLVGLGRTDLSLISQISP--MF 211
Query: 158 QNSFSICFDEND---SGSVFFGDQGPATQQSTS----------FLPIGEKYDAYFVGVES 204
FS C + SGS+ G + +T LP YF+ +
Sbjct: 212 GGVFSYCLPTTEAEASGSLVMGGNSSVYKNTTPISYTRMIHNPLLPF------YFLNLTG 265
Query: 205 YCIGNSCLTQSGF---QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
+G + F + ++DSG + LP IY + +F K S + C
Sbjct: 266 ITVGGVEVQAPSFGKDRMIIDSGTVISRLPPSIYQALKAEFVKQFSGYPSAPSFMILDSC 325
Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQ 319
+N S + +K+PD+++ F + V + + CL + S + + GIIG
Sbjct: 326 FNLSGYQEVKIPDIKMYFEGSAELNVDVTGVFYSVKTDASQVCLAIASLPYEDEVGIIGN 385
Query: 320 NFMMGHRIVFDRENLKLAWSHSKC 343
RI++D + L ++ C
Sbjct: 386 YQQKNQRIIYDTKGSMLGFAEEAC 409
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 77/326 (23%), Positives = 133/326 (40%), Gaps = 21/326 (6%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ +DP SSS++ +SCS C S + C S + C Y Y + + +SGY V D
Sbjct: 127 LNFFDPGSSSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYG-DGSGTSGYYVSD 185
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 155
+L+ + + +S +S++ GC QTG A DG+ G G D+SV S ++ G
Sbjct: 186 LLNFDAIVGSSVTNS-SASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQG 244
Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
+ FS C + G ++ + P+ Y + ++S + L
Sbjct: 245 ITPKVFSHCLKGDGG-GGGILVLGEIVEEDIVYSPLVPSQPHYNLNLQSISVNGKSLAID 303
Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
T + +VDSG + +L E Y V + VS L + CY +S
Sbjct: 304 PEVFATSTNRGTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQ-CYLITSS 362
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENE--GFTVFCLTVMSTDGD-YGIIGQNFMMG 324
P + L F+ S ++ + +N V+C+ G I+G +
Sbjct: 363 VKGIFPTVSLNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKD 422
Query: 325 HRIVFDRENLKLAWSHSKCEEVIDKS 350
V+D ++ W++ C ++ S
Sbjct: 423 KIFVYDLAGQRIGWANYDCSMSVNVS 448
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 80/326 (24%), Positives = 137/326 (42%), Gaps = 41/326 (12%)
Query: 34 ASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD----PCPYIADYSTEDTSS 89
AS ++ L +DPS+SSS ++ CS P C++ C D PC Y Y + + S
Sbjct: 121 ASACFNQTLPLFDPSASSSFASLPCSSPACETTPPCGGGNDATSRPCNYSISYG-DGSVS 179
Query: 90 SGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS 149
G + ++ AS + ++V ++ GCG G + G+ G G G +S+PS
Sbjct: 180 RGEIGREVFTFASGTGEGSSAAV-PGLVFGCGHANRGVFTSNET--GIAGFGRGSLSLPS 236
Query: 150 LLAKAGLIQNSFSICFDE---NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC 206
L K G +FS CF + + +V G G A ++ P+G + +Y
Sbjct: 237 QL-KVG----NFSHCFTTITGSKTSAVLLGLPGVAPPSAS---PLGRRRGSY-------- 280
Query: 207 IGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS- 265
C + +SG S T LP Y V +F V + C++A
Sbjct: 281 ---RCRSTPRSS---NSGTSITSLPPRTYRAVREEFAAQVKLPVVPGNATDPFTCFSAPL 334
Query: 266 SEEMLKVPDMRLIFS-KNQSFVVRNHIFSFPENEGF----TVFCLTVMSTDGDYGIIGQN 320
VP M L F N++F +++ + CL V+ +G I+G
Sbjct: 335 RGPKPDVPTMALHFEGATMRLPQENYVFEVVDDDDAGNSSRIICLAVI--EGGEIILGNI 392
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEV 346
+++D +N KL++ ++C+++
Sbjct: 393 QQQNMHVLYDLQNSKLSFVPAQCDQL 418
>gi|356500374|ref|XP_003519007.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Glycine max]
Length = 454
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 80/316 (25%), Positives = 126/316 (39%), Gaps = 36/316 (11%)
Query: 56 VSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 110
V C LC +C S DPC Y +Y+ + SS G LV D + + S
Sbjct: 114 VQCVDQLCSEVHLSMAYNCPSPDDPCDYEVEYA-DHGSSLGVLVRDYIPF----QFTNGS 168
Query: 111 SVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 169
V+ V GCG Q S + A GV+GLG G S+ S L GLI+N C
Sbjct: 169 VVRPRVAFGCGYDQKYSGSNSPPATSGVLGLGNGRASILSQLHSLGLIRNVVGHCLSAQG 228
Query: 170 SGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFT 228
G +FFGD P++ + + Y G G + + DSG+S+T
Sbjct: 229 GGFLFFGDDFIPSSGIVWTSMLSSSSEKHYSSGPAELVFNGKATAVKGLELIFDSGSSYT 288
Query: 229 FLPTEIYAEVVVKFDKLVSSKRISLQGNS------WKYCYNASSEEMLKVPDMRLIFSKN 282
+ ++ Y VV K + K++ + WK + S +K L S
Sbjct: 289 YFNSQAYQAVVDLVTKDLKGKQLKRATDDPSLPICWKGAKSFESLSDVKKYFKPLALSFK 348
Query: 283 QSFVVRNHIFSFPENEGFTVF------CLTVMSTDG------DYGIIGQNFMMGHRIVFD 330
+S ++ H+ E + + CL ++ DG + IIG + +++D
Sbjct: 349 KSXNLQMHL----PPESYLIITKHGNVCLGIL--DGTEVGLENLNIIGDITLQDKMVIYD 402
Query: 331 RENLKLAWSHSKCEEV 346
E ++ W S C+ +
Sbjct: 403 NEKQQIGWVSSNCDRL 418
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 81/325 (24%), Positives = 139/325 (42%), Gaps = 17/325 (5%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
L+ +D SSSSSS VSCS P+C S + C + + C Y Y + + +SGY V
Sbjct: 122 QLNFFDASSSSSSSLVSCSDPICNSAFQTTATQCLTQSNQCSYTFQYG-DGSGTSGYYVS 180
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKA 154
+ ++ + ++ +SV+ GC Q+G A DG+ G G GD+SV S L+
Sbjct: 181 ESMYFDMVMGQSMIANSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSAR 240
Query: 155 GLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF----VGVESYCIG 208
G+ FS C + N G + G+ + +P Y+ Y V ++ I
Sbjct: 241 GITPKVFSHCLKGEGNGGGILVLGEVLEPGIVYSPLVPSQPHYNLYLQSISVNGQTLPID 300
Query: 209 NSCLTQS-GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
S S ++DSG + +L E Y V V S+ ++ + CY S+
Sbjct: 301 PSVFATSINRGTIIDSGTTLAYLVEEAYTPFVSAITAAV-SQSVTPTISKGNQCYLVSTS 359
Query: 268 EMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 325
P + L F+ + S V++ ++ +G ++C+ I+G M
Sbjct: 360 VGEIFPLVSLNFAGSASMVLKPEEYLMHLGFYDGAALWCIGFQKVQEGVTILGDLVMKDK 419
Query: 326 RIVFDRENLKLAWSHSKCEEVIDKS 350
V+D ++ W+ C + ++ S
Sbjct: 420 IFVYDLARQRIGWASYDCSQAVNVS 444
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 83/311 (26%), Positives = 134/311 (43%), Gaps = 31/311 (9%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+++PSSSS+ +NVSCS P+C+ SC + C Y Y + + + G+L + L
Sbjct: 174 KFNPSSSSTYQNVSCSSPMCEDAESCSA--SNCVYSIGYG-DKSFTQGFLAKEKFTLT-- 228
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS-FS 162
S V V GCG G + DGV GL SL A+ N+ FS
Sbjct: 229 -----NSDVLEDVYFGCGENNQGLF------DGVAGLLGLGPGKLSLPAQTTTTYNNIFS 277
Query: 163 IC---FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVE--SYCIGNS--CLTQS 215
C F N +G + FG G +S F PI A+ G++ +G+ +T +
Sbjct: 278 YCLPSFTSNSTGHLTFGSAG--ISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPN 335
Query: 216 GFQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
F A++DSG FT LPT++YAE+ F + +SS + + + CY+ + + +
Sbjct: 336 SFSTEGAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTY 395
Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRE 332
P + F+ + S P + CL D I G +V+D
Sbjct: 396 PTIAFSFAGGTVVELDGSGISLPIK--ISQVCLAFAGNDDLPAIFGNVQQTTLDVVYDVA 453
Query: 333 NLKLAWSHSKC 343
++ ++ + C
Sbjct: 454 GGRVGFAPNGC 464
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 79/316 (25%), Positives = 136/316 (43%), Gaps = 34/316 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP-------CPYIADYSTEDTSSSGYLVDDI 97
++PS+S++ + + CS C S +L DP C Y A Y + + S GYL D+
Sbjct: 163 FEPSASNTYRPLYCSSSEC-SLLKAATLNDPLCTASGVCVYTASYG-DASYSMGYLSRDL 220
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGL 156
L L S S GCG+ G + A G++GL +S+ + L+ K G
Sbjct: 221 LTLTP-------SQTLPSFTYGCGQDNEGLFGKAA---GIVGLARDKLSMLAQLSPKYGY 270
Query: 157 IQNSFSICFDENDS---GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS--C 211
+FS C + S G + G P++ + T + + YF+ + + +
Sbjct: 271 ---AFSYCLPTSTSSGGGFLSIGKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVG 327
Query: 212 LTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEE 268
+ +G+Q ++DSG T LP IYA + F K++S + S C+ S +
Sbjct: 328 VAAAGYQVPTIIDSGTVVTRLPISIYAALREAFVKIMSRRYEQAPAYSILDTCFKGSLKS 387
Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
M P++R+IF +R ++G + CL S++ IIG + + I
Sbjct: 388 MSGAPEIRMIFQGGADLSLRAPNILIEADKG--IACLAFASSN-QIAIIGNHQQQTYNIA 444
Query: 329 FDRENLKLAWSHSKCE 344
+D K+ ++ C
Sbjct: 445 YDVSASKIGFAPGGCR 460
>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 535
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 83/379 (21%), Positives = 145/379 (38%), Gaps = 76/379 (20%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+L+ +D +SSS++ VSCS P+C + S C S + C Y Y + + +SGY V
Sbjct: 114 DLNYFDTASSSTAALVSCSDPVCSYAVQTATSQCSSQANQCSYTFQYG-DGSGTSGYYVY 172
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKA 154
D ++ + S+ S+V+ GC Q+G A DG+ G G G +SV S ++
Sbjct: 173 DAMYFDVIMGQSVFSNSSSTVVFGCSTYQSGDLARTEKAVDGIFGFGPGALSVVSQVSSQ 232
Query: 155 GLIQNSFSICFDENDSGS--VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
G+ FS C SG + G+ T +P+ Y+ + ++S + L
Sbjct: 233 GMAPKVFSHCLKGQGSGGGILVLGEILEPNIVYTPLVPLQPHYN---LNLQSIAVNGQIL 289
Query: 213 --------TQSGFQALVDSGASFTFLPTEIY----------------------------- 235
T + +VDSG + +L E Y
Sbjct: 290 PIDQDVFATGNNRGTIVDSGTTLAYLVQEAYDPFLNAGSPCHFFTHFNEPTNNIKYEDGN 349
Query: 236 ----------------AEVVVKFDKLVS------SKRISLQGNSWKYCYNASSEEMLKVP 273
+V+K +++ SK I +GN CY + P
Sbjct: 350 NNHQSRVKRHYYDEVTLRLVLKHSAIITTTVSQFSKPIISKGNQ---CYLVPTSLGDIFP 406
Query: 274 DMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 331
+ L F S V++ ++ + +G ++C+ Y I+G + V+D
Sbjct: 407 LVSLNFMGGASMVLKPEQYLIHYGFLDGAAMWCIGFQKVQKGYTILGDLVLKDKIFVYDL 466
Query: 332 ENLKLAWSHSKCEEVIDKS 350
N ++ W+ C ++ S
Sbjct: 467 ANQRIGWTDYDCSLAVNVS 485
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 82/323 (25%), Positives = 135/323 (41%), Gaps = 39/323 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
++PS+S + K V CS C + +C + C Y A Y + + S GYL D+
Sbjct: 146 FNPSASKTYKTVPCSSSQCSSLKSATLNEPTCSKQSNACVYKASYG-DSSFSLGYLSQDV 204
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
L L S SS + GCG+ G + DG++GL ++S+ S L +G
Sbjct: 205 LTLT-------PSQTLSSFVYGCGQDNQGLF---GRTDGIIGLANNELSMLSQL--SGKY 252
Query: 158 QNSFSICF-------DENDSGSVFFGDQGPATQQSTSFLPIGEKYD---AYFVGVESYCI 207
N+FS C + G + G S F P+ + + YF+ +ES +
Sbjct: 253 GNAFSYCLPTSFSTPNSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITV 312
Query: 208 GNSCL--TQSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCY 262
L S ++ ++DSG T LPT +Y + + ++S K G S C+
Sbjct: 313 AGRPLGVAASSYKVPTIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCF 372
Query: 263 NASSEEMLKV-PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNF 321
S + +V PD+R+IF ++ H G T CL M+ IIG
Sbjct: 373 KGSLAGISEVAPDIRIIFKGGADLQLKGHNSLVELETGIT--CL-AMAGSSSIAIIGNYQ 429
Query: 322 MMGHRIVFDRENLKLAWSHSKCE 344
++ +D N ++ ++ C+
Sbjct: 430 QQTVKVAYDVGNSRVGFAPGGCQ 452
>gi|4490316|emb|CAB38807.1| nucellin-like protein [Arabidopsis thaliana]
gi|7270297|emb|CAB80066.1| nucellin-like protein [Arabidopsis thaliana]
Length = 420
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 84/349 (24%), Positives = 140/349 (40%), Gaps = 53/349 (15%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLV 94
R L P SS + C+ PLCK S C++ + C Y +Y+ + SS G LV
Sbjct: 72 RCLEAPHPLYQPSSDLIPCNDPLCKALHLNSNQRCET-PEQCDYEVEYA-DGGSSLGVLV 129
Query: 95 DDILHLASFSKHAPQS-SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
D+ FS + Q + + +GCG Q DGV+GLG G VS+ S L
Sbjct: 130 RDV-----FSMNYTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHS 184
Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF---VGVESYCIGNS 210
G ++N C G +FFGD + + S+ P+ +Y ++ +G E G
Sbjct: 185 QGYVKNVIGHCLSSLGGGILFFGDDLYDSSR-VSWTPMSREYSKHYSPAMGGE-LLFGGR 242
Query: 211 CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNA---- 264
+ DSG+S+T+ ++ Y V + +S K + + ++ C+
Sbjct: 243 TTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPF 302
Query: 265 -SSEEMLKV---------------------PDMRLIFSKNQSF-VVRNHIFSFPENEGFT 301
S EE+ K P+ LI S S +++ + +G
Sbjct: 303 MSIEEVKKYFKPLALSFKTGWRSKTLFEIPPEAYLIISVWFSHTMLKGRFIKMLQMKGNV 362
Query: 302 VFCLTVMSTD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
CL +++ + +IG M I++D E + W C+E+
Sbjct: 363 --CLGILNGTEIGLQNLNLIGDISMQDQMIIYDNEKQSIGWMPVDCDEL 409
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 77/326 (23%), Positives = 133/326 (40%), Gaps = 21/326 (6%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ +DP SSS++ +SCS C S + C S + C Y Y + + +SGY V D
Sbjct: 112 LNFFDPGSSSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYG-DGSGTSGYYVSD 170
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 155
+L+ + + +S +S++ GC QTG A DG+ G G D+SV S ++ G
Sbjct: 171 LLNFDAIVGSSVTNS-SASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQG 229
Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
+ FS C + G ++ + P+ Y + ++S + L
Sbjct: 230 ITPKVFSHCLKGDGG-GGGILVLGEIVEEDIVYSPLVPSQPHYNLNLQSISVNGKSLAID 288
Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
T + +VDSG + +L E Y V + VS L + CY +S
Sbjct: 289 PEVFATSTNRGTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQ-CYLITSS 347
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENE--GFTVFCLTVMSTDGD-YGIIGQNFMMG 324
P + L F+ S ++ + +N V+C+ G I+G +
Sbjct: 348 VKGIFPTVSLNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKD 407
Query: 325 HRIVFDRENLKLAWSHSKCEEVIDKS 350
V+D ++ W++ C ++ S
Sbjct: 408 KIFVYDLAGQRIGWANYDCSMSVNVS 433
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 86/349 (24%), Positives = 146/349 (41%), Gaps = 55/349 (15%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLC---------KSRSSCKSLKDPCPYIADYSTEDTSSS 90
R+ +DP++S S + V C LC S C + C Y Y + +S+
Sbjct: 30 RSRPVFDPAASQSYRQVPCISQLCLAVQQQTSNGSSQPCVNSSAACTYSLSYG-DSRNST 88
Query: 91 GYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 150
G D++ L S + + Q+ V GC G +D + G++G G++S+PS
Sbjct: 89 GDFSQDVIFLNS-TNSSSQAVQFRDVAFGCAHSPQGFLVDLGSL-GIVGFNRGNLSLPSQ 146
Query: 151 LAKAGLIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGE------KYDAYF 199
L K L + FS CF +G +F GD G ++ S+ P+ + + Y+
Sbjct: 147 L-KDRLGGSKFSYCFPSQPWQPRATGVIFLGDSG-LSKSKVSYTPLLDNPVTPARSQLYY 204
Query: 200 VGVESYCIGNSCLT--QSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSS 248
VG+ S + L +S F+ ++DSG +FT + + Y F +S
Sbjct: 205 VGLTSISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAF---AAS 261
Query: 249 KRISLQGN-----SWKYCYNASSEEML-KVPDMRLIFSKNQSFVVR-NHIF---SFPENE 298
R L+ + CYN S+ L VP++RL N +R H+F S NE
Sbjct: 262 NRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNE 321
Query: 299 GFTVFCLTVMSTD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
CL ++S+ G ++G + + +D E ++ + + C
Sbjct: 322 --VTVCLAILSSQKSGFGKINVLGNYQQSNYLVEYDNERSRVGFERADC 368
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 76/321 (23%), Positives = 129/321 (40%), Gaps = 23/321 (7%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSR--------SSCKSLKDPCPYIADYSTEDTSSSGYL 93
L ++P SSS+S + CS C + S S PC Y Y + + +SG+
Sbjct: 133 LEFFNPDSSSTSSRIPCSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTYG-DGSGTSGFY 191
Query: 94 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLA 152
V D ++ + + ++ +SV+ GC Q+G + A DG+ G G +SV S L
Sbjct: 192 VSDTMYFDTVMGNEQTANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLY 251
Query: 153 KAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC------ 206
G+ +FS C +D+G G + F P+ Y + +ES
Sbjct: 252 SLGVSPKTFSHCLKGSDNGGGIL-VLGEIVEPGLVFTPLVPSQPHYNLNLESIAVSGQKL 310
Query: 207 -IGNSCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
I +S S Q +VDSG + +L Y + VS S+ + C+
Sbjct: 311 PIDSSLFATSNTQGTIVDSGTTLVYLVDGAYDPFINAIAAAVSPSVRSVVSKGIQ-CFVT 369
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
+S P L F S V+ N++ + ++C+ + G I+G +
Sbjct: 370 TSSVDSSFPTATLYFKGGVSMTVKPENYLLQQGSVDNNVLWCIGWQRSQG-ITILGDLVL 428
Query: 323 MGHRIVFDRENLKLAWSHSKC 343
V+D N+++ W+ C
Sbjct: 429 KDKIFVYDLANMRMGWADYDC 449
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 141/338 (41%), Gaps = 40/338 (11%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYL 93
+N YDP SSS KN+ C P C SS CK+ CPY Y ++ +
Sbjct: 229 QNGPYYDPKESSSFKNIGCHDPRCHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFA 288
Query: 94 VDDI-LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
++ ++L S + + V+ +V+ GCG G + A ++GLG G +S S L
Sbjct: 289 LETFTVNLTSPAGKSEFKRVE-NVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL- 343
Query: 153 KAGLIQNSFSICF-----DENDSGSVFFG-DQGPATQQSTSF--LPIGEKYDA---YFVG 201
L +SFS C D N S + FG D+ +F L G++ Y+V
Sbjct: 344 -QSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQ 402
Query: 202 VESYCIGNSCLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
++S +G L + +VDSG + ++ Y + F K V +
Sbjct: 403 IKSIMVGGEVLKIPEETWHLSPEGAGGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPV 462
Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMS 309
CYN S E +++P+ R++F +F V N+ E + CL ++
Sbjct: 463 IKDFPILDPCYNVSGVEKMELPEFRILFEDGAVWNFPVENYFIKLEPEE---IVCLAILG 519
Query: 310 T-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
T IIG I++D + +L ++ KC +V
Sbjct: 520 TPRSALSIIGNYQQQNFHILYDTKKSRLGYAPMKCADV 557
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 82/323 (25%), Positives = 135/323 (41%), Gaps = 39/323 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
++PS+S + K V CS C + +C + C Y A Y + + S GYL D+
Sbjct: 146 FNPSASKTYKTVPCSSSQCSSLKSATLNEPTCSKQSNACVYKASYG-DSSFSLGYLSQDV 204
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
L L S SS + GCG+ G + DG++GL ++S+ S L +G
Sbjct: 205 LTLT-------PSQTLSSFVYGCGQDNQGLF---GRTDGIIGLANNELSMLSQL--SGKY 252
Query: 158 QNSFSICF-------DENDSGSVFFGDQGPATQQSTSFLPIGEKYD---AYFVGVESYCI 207
N+FS C + G + G S F P+ + + YF+ +ES +
Sbjct: 253 GNAFSYCLPTSFSTPNSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITV 312
Query: 208 GNSCL--TQSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCY 262
L S ++ ++DSG T LPT +Y + + ++S K G S C+
Sbjct: 313 AGRPLGVAASSYKVPTIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCF 372
Query: 263 NASSEEMLKV-PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNF 321
S + +V PD+R+IF ++ H G T CL M+ IIG
Sbjct: 373 KGSLAGISEVAPDIRIIFKGGADLQLKGHNSLVELETGIT--CL-AMAGSSSIAIIGNYQ 429
Query: 322 MMGHRIVFDRENLKLAWSHSKCE 344
++ +D N ++ ++ C+
Sbjct: 430 QQTVKVAYDVGNSRVGFAPGGCQ 452
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 81/312 (25%), Positives = 124/312 (39%), Gaps = 30/312 (9%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DP+SSSS + C P C++ D C Y Y Y V D A+ +
Sbjct: 202 FDPASSSSFSRLGCQTPQCRNLDVFACRNDSCLYQVSYG-----DGSYTVGD---FATET 253
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
S V IGCG G ++ A G+ G L SL ++ + +SFS C
Sbjct: 254 VSFGNSGSVDKVAIGCGHDNEGLFVGAAGLIGLGGGPL------SLTSQ--IKASSFSYC 305
Query: 165 F---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA 219
D DS ++ F P+ + + Y+VG+ +G L S F+
Sbjct: 306 LVNRDSVDSSTLEFNSAKPSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEV 365
Query: 220 --------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
+VD G + T L T+ Y + F KL + + CYN SS ++
Sbjct: 366 DGSGKGGIIVDCGTAVTRLQTQAYNALRDTFVKLTKDLPSTSGFALFDTCYNLSSRTSVR 425
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 331
VP + +F +S + + P + T FCL T IIG G R+ +D
Sbjct: 426 VPTVAFLFDGGKSLPLPPSNYLIPVDSAGT-FCLAFAPTTASLSIIGNVQQQGTRVTYDL 484
Query: 332 ENLKLAWSHSKC 343
N ++++S KC
Sbjct: 485 ANSQVSFSSRKC 496
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 83/311 (26%), Positives = 135/311 (43%), Gaps = 31/311 (9%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+++PSSSS+ +NVSCS P+C+ SC + C Y Y + + + G+L + L
Sbjct: 174 KFNPSSSSTYQNVSCSSPMCEDAESCSA--SNCVYSIVYG-DKSFTQGFLAKEKFTLT-- 228
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS-FS 162
S V V GCG G + DGV GL SL A+ N+ FS
Sbjct: 229 -----NSDVLEDVYFGCGENNQGLF------DGVAGLLGLGPGKLSLPAQTTTTYNNIFS 277
Query: 163 IC---FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVE--SYCIGNS--CLTQS 215
C F N +G + FG G +S F PI A+ G++ +G+ +T +
Sbjct: 278 YCLPSFTSNSTGHLTFGSAG--ISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPN 335
Query: 216 GFQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
F A++DSG FT LPT++YAE+ F + +SS + + + CY+ + + +
Sbjct: 336 SFSTEGAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTY 395
Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRE 332
P + F+ + + S P + CL D I G +V+D
Sbjct: 396 PTIAFSFAGSTVVELDGSGISLPIK--ISQVCLAFAGNDDLPAIFGNVQQTTLDVVYDVA 453
Query: 333 NLKLAWSHSKC 343
++ ++ + C
Sbjct: 454 GGRVGFAPNGC 464
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 59/250 (23%), Positives = 101/250 (40%), Gaps = 30/250 (12%)
Query: 116 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS----- 170
V GCG + GS++ GV+GLG G +S S A +N F+ C S
Sbjct: 149 VAFGCGNRNQGSFVSAG---GVLGLGQGALSFTSQAGYA--FENKFAYCLTSYLSPTSVF 203
Query: 171 GSVFFGDQGPATQQSTSFLPIGEKY---DAYFVGVESYCIGNSCL--TQSGFQ------- 218
S+ FGD +T F P+ Y+V + C G L S ++
Sbjct: 204 SSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDSAWKIDSVGNG 263
Query: 219 -ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 277
+ DSG + T+ + YA ++ F+K V R C N S + P +
Sbjct: 264 GTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLPLCVNVSGIDHPIYPSFTI 323
Query: 278 IFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGD-YGIIGQNFMMGHRIVFDRENL 334
F + ++ N+ N + CL ++ + D + +IG + + +DRE
Sbjct: 324 EFDQGATYRPNQGNYFIEVSPN----IDCLAMLESSSDGFNVIGNIIQQNYLVQYDREEH 379
Query: 335 KLAWSHSKCE 344
++ ++H+ C+
Sbjct: 380 RIGFAHANCD 389
>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
Length = 473
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 78/295 (26%), Positives = 128/295 (43%), Gaps = 37/295 (12%)
Query: 76 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAP 134
C Y +Y+ + +SS G L D LHL A S + ++ GC Q G L+ A
Sbjct: 177 CDYEIEYA-DHSSSMGVLASDDLHLML----ANGSLTKLGIMFGCAYDQQGLLLNSLAKT 231
Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPI- 191
DG++GL VS+PS LA +I N C D G +F GD +++P+
Sbjct: 232 DGILGLSKAKVSLPSQLASQRIINNVLGHCLTSDATGGGYMFLGDDF-VPYWGMAWVPML 290
Query: 192 ---GEKYDAYFVGVESYCIGNSCLTQSGF--QALVDSGASFTFLPTEIYAEVVVKFDKLV 246
Y + + + S Q G + + D+G+S+T+ P E Y +V K V
Sbjct: 291 NSHSPNYHSQIMKISHGSRQLSLGRQDGRTERVVFDTGSSYTYFPKEAYYALVASL-KDV 349
Query: 247 SSKRISLQGN--SWKYCYNASSEEMLKVPDMRLIFS------KNQSFVVRNHIFSFPENE 298
S + + G+ + C+ A + V D++ F +++ ++V F P E
Sbjct: 350 SDEGLIQDGSDPTLPVCWRAKF-PIRSVIDVKQFFQPLTLQFRSKWWIVSTK-FRIPP-E 406
Query: 299 GFTVF------CLTVMST----DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
G+ + CL ++ DG I+G + G +V+D N K+ W+ S C
Sbjct: 407 GYLIISNKGNVCLGILDGSNVHDGSTIILGDISLRGKLVVYDNVNQKIGWAQSTC 461
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 84/346 (24%), Positives = 141/346 (40%), Gaps = 61/346 (17%)
Query: 46 DPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
DP++SS+ V C P+C++ R + C Y+ Y + + + G L D
Sbjct: 138 DPAASSTHAAVRCDAPVCRALPFTSCGRGGSSWGERSCVYVYHYG-DKSITVGKLASDRF 196
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
+ + GCG G + A G+ G G G S+PS L
Sbjct: 197 TFGPGDNADGGGVSERRLTFGCGHFNKGIFQ--ANETGIAGFGRGRWSLPSQLGV----- 249
Query: 159 NSFSICFD---ENDSGSVFFGDQGPAT------QQSTSFLPIGEKYDAYFVGVESYCIGN 209
SFS CF E+ S V G PA QST L + YF+ +++ +G
Sbjct: 250 TSFSYCFTSMFESTSSLVTLG-VAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGA 308
Query: 210 SCLTQSGFQ-------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
+ + + A++DSGAS T LP ++Y V +F V +++G++ C+
Sbjct: 309 TRIPIPERRQRLREASAIIDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSALDLCF 368
Query: 263 NASSEEM-----------------LKVPDMRLIF----SKNQSFVVRNHIFSFPENEGFT 301
S ++VP RL+F + N++F E+ G
Sbjct: 369 ALPSAAAPKSAFGWRWRGRGRAMPVRVP--RLVFHLGGGADWELPRENYVF---EDYGAR 423
Query: 302 VFCLTV--MSTDGDYGIIGQNFMMGH-RIVFDRENLKLAWSHSKCE 344
V CL + + GD ++ N+ + +V+D EN L+++ ++CE
Sbjct: 424 VMCLVLDAATGGGDQTVVIGNYQQQNTHVVYDLENDVLSFAPARCE 469
>gi|12323376|gb|AAG51657.1|AC010704_1 nucellin-like protein; 27671-25467 [Arabidopsis thaliana]
Length = 427
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 76/325 (23%), Positives = 128/325 (39%), Gaps = 31/325 (9%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
++Y P+ ++ + CSH LC C +D C Y YS + SS G LV D
Sbjct: 103 TKYKPNHNT----LPCSHILCSGLDLPQDRPCADPEDQCDYEIGYS-DHASSIGALVTDE 157
Query: 98 LHLASFSKHAPQSSVQSSVIIGCG-RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
+ L K A S + + GCG +Q G++GLG G V + + L G+
Sbjct: 158 VPL----KLANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGI 213
Query: 157 IQNSFSICFDENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS 215
+N C G + GD+ P++ + + L Y G +
Sbjct: 214 TKNVIVHCLSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVK 273
Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASS------- 266
G + DSG+S+T+ E Y ++ K ++ K + + S C+
Sbjct: 274 GINVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDE 333
Query: 267 -EEMLKVPDMRLIFSKN-QSFVVRNHIFSFPENEG---FTVFCLTVMSTDGDYGIIGQNF 321
++ K +R KN Q F V + +G + T + +G Y IIG
Sbjct: 334 VKKYFKTITLRFGNQKNGQLFQVPPESYLIITEKGRVCLGILNGTEIGLEG-YNIIGDIS 392
Query: 322 MMGHRIVFDRENLKLAWSHSKCEEV 346
G +++D E ++ W S C+++
Sbjct: 393 FQGIMVIYDNEKQRIGWISSDCDKL 417
>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
Length = 358
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 61/206 (29%), Positives = 99/206 (48%), Gaps = 23/206 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP------YIADYSTEDTSSSGYLVDDIL 98
Y P+++S V C++ LC + S + CP Y Y T+ SS G L++D
Sbjct: 97 YRPTANSL---VPCANALCTALHSGHGSNNKCPSPKQCDYQIKY-TDSASSQGVLIND-- 150
Query: 99 HLASFSKHAPQSSVQSSVIIGCGR-KQTGSYLDGA---APDGVMGLGLGDVSVPSLLAKA 154
+FS S+++ + GCG +Q G +GA A DG++GLG G VS+ S L +
Sbjct: 151 ---NFSLPMRSSNIRPGLTFGCGYDQQVGK--NGAVQAATDGMLGLGRGSVSLVSQLKQQ 205
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV-GVESYCIGNSCLT 213
G+ +N C N G +FFGD T + T ++P+ + Y+ G + L
Sbjct: 206 GITKNVLGHCLSTNGGGFLFFGDDIVPTSRVT-WVPMAKISGNYYSPGSGTLYFDRRSLG 264
Query: 214 QSGFQALVDSGASFTFLPTEIYAEVV 239
+ + DSG+++T+ + Y VV
Sbjct: 265 VKPMEVVFDSGSTYTYFTAQPYQAVV 290
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 75/313 (23%), Positives = 126/313 (40%), Gaps = 30/313 (9%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DP+ SS+ NVSC+ P C + C Y Y + + S G+ D L L+S+
Sbjct: 225 FDPARSSTYANVSCAAPACSDLYTRGCSGGHCLYSVQYG-DGSYSIGFFAMDTLTLSSY- 282
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSI 163
GCG + G + + A G++GLG G S+P K G + F+
Sbjct: 283 ------DAVKGFRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAH 330
Query: 164 CFDENDSGSVF--FGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNSCLT--QS 215
C SG+ + FG PA + P+ G + Y+VG+ +G L+ QS
Sbjct: 331 CLPARSSGTGYLDFGPGSPAAVGARQTTPMLTDNGPTF--YYVGMTGIRVGGQLLSIPQS 388
Query: 216 GFQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEML 270
F +VDSG T LP Y+ + F ++++ + + + CY+ + +
Sbjct: 389 VFSTAGTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPALSLLDTCYDFTGMSEV 448
Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
+P + L+F V + + D D GI+G + +V+D
Sbjct: 449 AIPKVSLLFQGGAYLDVNASGIMYAASLSQVCLGFAANEDDDDVGIVGNTQLKTFGVVYD 508
Query: 331 RENLKLAWSHSKC 343
+ +S C
Sbjct: 509 IGKKTVGFSPGAC 521
>gi|224130234|ref|XP_002328687.1| predicted protein [Populus trichocarpa]
gi|222838863|gb|EEE77214.1| predicted protein [Populus trichocarpa]
Length = 603
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 83/329 (25%), Positives = 134/329 (40%), Gaps = 64/329 (19%)
Query: 74 DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-A 132
D C Y +Y+ + +SS G L D L L A S + + I GC Q G L
Sbjct: 264 DQCDYEIEYA-DHSSSMGVLATDKLLLMV----ANGSLTKLNFIFGCAYDQQGLLLKTLV 318
Query: 133 APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLP 190
DG++GL VS+PS LA G+I N C D G +F GD + +++P
Sbjct: 319 KTDGILGLSRAKVSLPSQLASQGIINNVIGHCLTTDLGGGGYMFLGDDF-VPRWGMAWVP 377
Query: 191 IGE--KYDAYFVGVESYCIGNSCLTQSGFQA-----LVDSGASFTFLPTEIYAEVVVKFD 243
+ + + Y V G+S L+ G ++ L DSG+S+T+ P E Y+E+V +
Sbjct: 378 MLDSPSMEFYHTEVVKLNYGSSPLSLGGMESRVKHILFDSGSSYTYFPKEAYSELVASLN 437
Query: 244 KLVSSKRI-SLQGNSWKYCYNAS-------SEEMLKVP---------------------- 273
++ + + S + C+ A+ L P
Sbjct: 438 EVSGAGLVQSTSDTTLPLCWRANFPIRKFIYRTELTRPIRRRRRRRRRRRRRRRRRRQHI 497
Query: 274 --DMR-----LIFSKNQSFVVRNHIFSFPENEGFTVF------CLTVMS----TDGDYGI 316
D++ L F ++V + F P EG+ + CL ++ DG I
Sbjct: 498 KGDVKKFFKTLTFQFGTKWLVISTKFRIPP-EGYLMMSDKGNVCLGILEGSKVHDGSTII 556
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
+G + G +V+D N K+ W+ S C +
Sbjct: 557 LGDISLRGQLVVYDNVNKKIGWTPSDCAK 585
>gi|147802609|emb|CAN73001.1| hypothetical protein VITISV_037997 [Vitis vinifera]
Length = 424
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 77/312 (24%), Positives = 128/312 (41%), Gaps = 33/312 (10%)
Query: 56 VSCSHPLCKSRS----SCKSLKDPCPYIADYSTEDTSSSGYLVDDI--LHLASFSKHAPQ 109
V C P+C C+ + C Y +Y+ + SS G LV D+ L+ + + AP+
Sbjct: 117 VICKDPMCAXLHPPGYKCEH-PEQCDYEVEYA-DGGSSLGVLVKDVFPLNFTNGLRLAPR 174
Query: 110 SSVQSSVIIGCGRKQT--GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE 167
+ +GCG Q SY DGV+GLG G S+ S L G+I+N C
Sbjct: 175 ------LALGCGYDQIPGXSY---HPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSS 225
Query: 168 NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASF 227
+ G +FFGD + + + +++ Y G +G DSG+S+
Sbjct: 226 HGGGFLFFGDDLYDSSRVVWTPMLRDQHTHYSSGYAELILGGKTTVFKNLLVTFDSGSSY 285
Query: 228 TFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSK-NQS 284
T+L + Y +V K +S K R +L + C+ V D+R F S
Sbjct: 286 TYLNSLAYQALVHLVRKELSEKPVREALDDQTLPLCWRG-KRPFKSVRDVRKFFKPLALS 344
Query: 285 FVVRNHI---FSFPENEGFTV---FCLTVMSTD----GDYGIIGQNFMMGHRIVFDRENL 334
F + P + CL +++ D+ +IG M +V+D E
Sbjct: 345 FAGGGRTKTQYDIPLESYLIISGNVCLGILNGTEAGLQDFNLIGDISMQDKMVVYDNEKN 404
Query: 335 KLAWSHSKCEEV 346
++ W+ + C+ +
Sbjct: 405 QIGWAPTNCDRL 416
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 78/342 (22%), Positives = 136/342 (39%), Gaps = 46/342 (13%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCKS----------RSSCKSLKDPCPYIADYSTEDTSS 89
+ L DP++SS+ + C P C++ RSS + C YI Y + + +
Sbjct: 129 QGLPLLDPAASSTYAALPCGAPRCRALPFTSCGGGGRSSWGNGNRSCAYIYHYG-DKSVT 187
Query: 90 SGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS 149
G + D + + GCG G + G+ G G G S+PS
Sbjct: 188 VGEIATDRFTFGGDNGDGDSRLPTRRLTFGCGHFNKGVFQSNET--GIAGFGRGRWSLPS 245
Query: 150 LLAKAGLIQNSFSICFD---ENDSGSVFFGDQGPATQ-------------QSTSFLPIGE 193
L +FS CF E+ S V G PA ++T L
Sbjct: 246 QLNV-----TTFSYCFTSMFESKSSLVTLGG-APAAALLYSHAAHISGEVRTTPLLKNPS 299
Query: 194 KYDAYFVGVESYCIGNSCLTQSGFQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 250
+ YF+ ++ +G + L + ++DSGAS T LP +Y V +F V
Sbjct: 300 QPSLYFLSLKGISVGKTRLAVPEAKLRSTIIDSGASITTLPEAVYEAVKAEFAAQVGLPP 359
Query: 251 ISL-QGNSWKYCYNASSEEMLK---VPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCL 305
+ +G++ C+ + + VP + L + R N++F E+ V C+
Sbjct: 360 TGVVEGSALDLCFALPVTALWRRPPVPSLTLHLDGADWELPRGNYVF---EDLAARVMCV 416
Query: 306 TVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
+ + GD +IG +V+D EN L+++ ++C+ ++
Sbjct: 417 VLDAAPGDQTVIGNFQQQNTHVVYDLENDWLSFAPARCDSLV 458
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 66/243 (27%), Positives = 110/243 (45%), Gaps = 43/243 (17%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
++ ++ PS SS+ KN+ CS LCKS G L D L
Sbjct: 123 NQTTPKFKPSKSSTYKNIPCSSDLCKS----------------------GQQGNLSVDTL 160
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
L S + H P S ++ +IGCG T S+ +GA+ G++GLG G S+ + L + I
Sbjct: 161 TLESSTGH-PISFPKT--VIGCGTDNTVSF-EGAS-SGIVGLGGGPASLITQLGSS--ID 213
Query: 159 NSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSC 211
FS C + N + + FGD + PI +K Y++ +E++ +GN
Sbjct: 214 AKFSYCLLPNPVESNTTSKLNFGDTAVVSGDGVVSTPIVKKDPIVFYYLTLEAFSVGNKR 273
Query: 212 LTQSGF-------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
+ G ++DSG + T +PT++Y + +LV KR++ + CY+
Sbjct: 274 IEFEGSSNGGHEGNIIIDSGTTLTVIPTDVYNNLESAVLELVKLKRVNDPTRLFNLCYSV 333
Query: 265 SSE 267
+S+
Sbjct: 334 TSD 336
>gi|30699263|ref|NP_177872.3| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332197862|gb|AEE35983.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 432
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 76/324 (23%), Positives = 127/324 (39%), Gaps = 31/324 (9%)
Query: 44 EYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
+Y P+ ++ + CSH LC C +D C Y YS + SS G LV D +
Sbjct: 109 QYKPNHNT----LPCSHILCSGLDLPQDRPCADPEDQCDYEIGYS-DHASSIGALVTDEV 163
Query: 99 HLASFSKHAPQSSVQSSVIIGCG-RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
L K A S + + GCG +Q G++GLG G V + + L G+
Sbjct: 164 PL----KLANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGIT 219
Query: 158 QNSFSICFDENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
+N C G + GD+ P++ + + L Y G + G
Sbjct: 220 KNVIVHCLSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKG 279
Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASS-------- 266
+ DSG+S+T+ E Y ++ K ++ K + + S C+
Sbjct: 280 INVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEV 339
Query: 267 EEMLKVPDMRLIFSKN-QSFVVRNHIFSFPENEG---FTVFCLTVMSTDGDYGIIGQNFM 322
++ K +R KN Q F V + +G + T + +G Y IIG
Sbjct: 340 KKYFKTITLRFGNQKNGQLFQVPPESYLIITEKGRVCLGILNGTEIGLEG-YNIIGDISF 398
Query: 323 MGHRIVFDRENLKLAWSHSKCEEV 346
G +++D E ++ W S C+++
Sbjct: 399 QGIMVIYDNEKQRIGWISSDCDKL 422
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 82/330 (24%), Positives = 140/330 (42%), Gaps = 40/330 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+DPS SS+ +V CS P C ++ C + C Y Y E + + G L ++ L
Sbjct: 166 FDPSKSSTYVDVPCSAPECHIGGVQQTRCGATS--CEYSVKYGDE-SETHGSLAEETFTL 222
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
+ S AP + + V+ GC + + D G G++GLG GD S+ L++ N
Sbjct: 223 SPPSPLAPAA---TGVVFGCSHEYISVFNDTGMGVAGLLGLGRGDSSI---LSQTRRSIN 276
Query: 160 S----FSICFDENDS--GSVFFGDQGPATQQ---STSFLP----IGEKYDAYFVGVESYC 206
S FS C S G + G A QQ + SF P I + AY V +
Sbjct: 277 SGGGVFSYCLPPRGSSTGYLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVS 336
Query: 207 IGNSC--LTQSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKY 260
+ + + S F A++DSG T +P Y + +F + S ++ +G+
Sbjct: 337 VNGAAVDIPASAFSLGAVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLDT 396
Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEG----FTVFCLTVMSTD-GD 313
CY+ + ++++ P + L F V + P +G T+ CL + T+
Sbjct: 397 CYDVTGQDVVTAPRVALEFGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFLPTNSAG 456
Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
I+G + +VFD + ++ + + C
Sbjct: 457 LVIVGNMQQRAYNVVFDVDGGRIGFGPNGC 486
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 80/330 (24%), Positives = 129/330 (39%), Gaps = 25/330 (7%)
Query: 23 TTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADY 82
TT + C A Q L +DP+SSS+ NVSC+ P C C Y Y
Sbjct: 206 TTWVQCQPCVVACYEQREKL--FDPASSSTYANVSCAAPACSDLDVSGCSGGHCLYGVQY 263
Query: 83 STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL 142
+ + S G+ D L L+S+ GCG + G + + A G++GLG
Sbjct: 264 G-DGSYSIGFFAMDTLTLSSY-------DAVKGFRFGCGERNDGLFGEAA---GLLGLGR 312
Query: 143 GDVSVPSLLAKAGLIQNSFSICFDENDSGSVF--FGDQGPATQQSTSFLPIGEKYDAYFV 200
G S+P + G F+ C +G+ + FG P +T L G Y+V
Sbjct: 313 GKTSLP--VQTYGKYGGVFAHCLPARSTGTGYLDFGAGSPPATTTTPML-TGNGPTFYYV 369
Query: 201 GVESYCIGNSCL--TQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISL 253
G+ +G L S F A +VDSG T LP Y+ + F ++++ R +
Sbjct: 370 GMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAA 429
Query: 254 QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD 313
+ CY+ + + +P + L+F + V + + GD
Sbjct: 430 AVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTVSASQVCLAFAGNEDGGD 489
Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
GI+G + + +D + +S C
Sbjct: 490 VGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|281200780|gb|EFA74998.1| putative aspartyl protease [Polysphondylium pallidum PN500]
Length = 394
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 89/365 (24%), Positives = 158/365 (43%), Gaps = 58/365 (15%)
Query: 1 MLGAICFGSHANAYNALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSC-- 58
++G F + ++L+ +P+ C DR YDP+ S SK VSC
Sbjct: 46 IVGNHTFTVQVDTGSSLMAIPMVNCNTC---------HDR--PSYDPTHSQYSKVVSCFS 94
Query: 59 --------SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 110
+ P CK+R+ +D C ++ Y + + SG + D+++L+ S A
Sbjct: 95 EHCLGSGSAPPQCKNRA-----EDDCDFVILYG-DGSRVSGKIYQDVVNLSGLSGIAN-- 146
Query: 111 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLG-DVSVPSL---LAKAGLIQNSFSICFD 166
G R +TG + + DG++G G VP++ L +A ++N F++ D
Sbjct: 147 -------FGANRIETGDF-EYPRADGIVGFGRSCKTCVPTVFESLVQAHGLKNIFAMSMD 198
Query: 167 ENDSGSVFFGDQGPATQ-QSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS--GFQALVDS 223
G++ G+ P+ + P+ E Y + ++ + ++ + G Q +VDS
Sbjct: 199 YEGRGTLSLGELNPSNHIGEIQYTPLFEDGPFYNIKPTNFKVDDTVILPRLLGRQVIVDS 258
Query: 224 GASFTFLPTEIYAEVVVKFDK-------LVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 276
G+S L + Y +V F K + S I L G+ CYN++S L +P +
Sbjct: 259 GSSALSLASGAYDALVHHFRKNYCHVAGICDSPSI-LDGS---ICYNSASSLDL-LPTIY 313
Query: 277 LIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 334
L F V +N++ P G + +C + D I+G FM G+ VFD E
Sbjct: 314 LTFEGGVKVAVPPKNYLTKAPLTNGASGYCWMIDRADPSTTILGDVFMRGYYTVFDNEEK 373
Query: 335 KLAWS 339
++ ++
Sbjct: 374 RIGFA 378
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 80/330 (24%), Positives = 129/330 (39%), Gaps = 25/330 (7%)
Query: 23 TTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADY 82
TT + C A Q L +DP+SSS+ NVSC+ P C C Y Y
Sbjct: 203 TTWVQCQPCVVACYEQREKL--FDPASSSTYANVSCAAPACSDLDVSGCSGGHCLYGVQY 260
Query: 83 STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL 142
+ + S G+ D L L+S+ GCG + G + + A G++GLG
Sbjct: 261 G-DGSYSIGFFAMDTLTLSSY-------DAVKGFRFGCGERNDGLFGEAA---GLLGLGR 309
Query: 143 GDVSVPSLLAKAGLIQNSFSICFDENDSGSVF--FGDQGPATQQSTSFLPIGEKYDAYFV 200
G S+P + G F+ C +G+ + FG P +T L G Y+V
Sbjct: 310 GKTSLP--VQTYGKYGGVFAHCLPPRSTGTGYLDFGAGSPPATTTTPML-TGNGPTFYYV 366
Query: 201 GVESYCIGNSCL--TQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISL 253
G+ +G L S F A +VDSG T LP Y+ + F ++++ R +
Sbjct: 367 GMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAA 426
Query: 254 QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD 313
+ CY+ + + +P + L+F + V + + GD
Sbjct: 427 AVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTVSASQVCLAFAGNEDGGD 486
Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
GI+G + + +D + +S C
Sbjct: 487 VGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 516
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 80/330 (24%), Positives = 129/330 (39%), Gaps = 25/330 (7%)
Query: 23 TTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADY 82
TT + C A Q L +DP+SSS+ NVSC+ P C C Y Y
Sbjct: 202 TTWVQCQPCVVACYEQREKL--FDPASSSTYANVSCAAPACSDLDVSGCSGGHCLYGVQY 259
Query: 83 STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL 142
+ + S G+ D L L+S+ GCG + G + + A G++GLG
Sbjct: 260 G-DGSYSIGFFAMDTLTLSSY-------DAVKGFRFGCGERNDGLFGEAA---GLLGLGR 308
Query: 143 GDVSVPSLLAKAGLIQNSFSICFDENDSGSVF--FGDQGPATQQSTSFLPIGEKYDAYFV 200
G S+P + G F+ C +G+ + FG P +T L G Y+V
Sbjct: 309 GKTSLP--VQTYGKYGGVFAHCLPARSTGTGYLDFGAGSPPATTTTPML-TGNGPTFYYV 365
Query: 201 GVESYCIGNSCL--TQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISL 253
G+ +G L S F A +VDSG T LP Y+ + F ++++ R +
Sbjct: 366 GMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAA 425
Query: 254 QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD 313
+ CY+ + + +P + L+F + V + + GD
Sbjct: 426 AVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTVSASQVCLAFAGNEDGGD 485
Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
GI+G + + +D + +S C
Sbjct: 486 VGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 515
>gi|357464807|ref|XP_003602685.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355491733|gb|AES72936.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 440
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 79/329 (24%), Positives = 135/329 (41%), Gaps = 51/329 (15%)
Query: 47 PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASF 103
P S+ V C HPLC S + + + DY E SS G LV+D+ ++ +F
Sbjct: 126 PLYRPSNDLVPCRHPLCASVHQTDNYECEVEHQCDYEVEYADHYSSLGVLVNDV-YVLNF 184
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
+ ++ + +GCG Q DG++GLG G S+ S L GL++N
Sbjct: 185 TNGV---QLKVRMALGCGYDQIFPDSSYHPVDGMLGLGRGKSSLISQLNGQGLVRNVVGH 241
Query: 164 CFDENDSGSVFFGDQGPATQQSTSFLPIGEK-YDAYFVGVESYCIGNSCLTQSGFQALVD 222
C G +FFGD +++ ++ P+ + Y Y G +G A+ D
Sbjct: 242 CLSAQGGGYIFFGDVYDSSR--LAWTPMSSRDYKHYSAGAAELVLGGKRTGFGNLLAVFD 299
Query: 223 SGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASSEEMLKVPDMRLIFS 280
+G+S+T+ + Y + K ++ K I + + + C+ K P R ++
Sbjct: 300 AGSSYTYFNSNAY-----QLTKELAGKPIKEAPEDQTLPLCWYG------KRP-FRSVYE 347
Query: 281 KNQSFVVRNHIFSFPEN-----------EGFTVF------CLTVMSTDG------DYGII 317
+ F + SFP + E + + CL ++ DG D +I
Sbjct: 348 VKKYF--KPIALSFPGSRRSKAQFEIPPEAYLIISNMGNVCLGIL--DGSEVGVEDLNLI 403
Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
G M+ +VFD E + W+ + C V
Sbjct: 404 GDISMLDKVMVFDNEKQLIGWTAADCNRV 432
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 81/334 (24%), Positives = 140/334 (41%), Gaps = 43/334 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDI- 97
YDP SSS +N+SC P C+ S+ CK+ CPY Y ++ + ++
Sbjct: 239 YDPKDSSSFRNISCHDPRCQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFT 298
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
++L + + + V+ +V+ GCG G + A G+ L S L
Sbjct: 299 VNLTTPNGTSELKHVE-NVMFGCGHWNRGLFHGAAGLLGLGKGPLSFAS-----QMQSLY 352
Query: 158 QNSFSICF-DENDSGSV----FFG-DQGPATQQSTSFLPIGEKYDA-----YFVGVESYC 206
SFS C D N + SV FG D+ + + +F G D Y+V ++S
Sbjct: 353 GQSFSYCLVDRNSNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVM 412
Query: 207 IGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN 256
+ + L ++ ++DSG + T+ Y + F + + ++
Sbjct: 413 VDDEVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLP 472
Query: 257 SWKYCYNASSEEMLKVPDMRLIFSKNQ--SFVVRNH-IFSFPENEGFTVFCLTVMST-DG 312
K CYN S E +++PD ++F+ +F V N+ I+ PE V CL ++
Sbjct: 473 PLKPCYNVSGIEKMELPDFGILFADEAVWNFPVENYFIWIDPE-----VVCLAILGNPRS 527
Query: 313 DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
IIG I++D + +L ++ KC +V
Sbjct: 528 ALSIIGNYQQQNFHILYDMKKSRLGYAPMKCADV 561
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 79/313 (25%), Positives = 127/313 (40%), Gaps = 31/313 (9%)
Query: 45 YDPSSSSSSKNVSCSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
+DP+ SS+ V C+ P C+ SRS + K C Y Y + + + G L D L L
Sbjct: 188 FDPARSSTYSAVPCASPECQGLDSRSCSRDKK--CRYEVVYG-DQSQTDGALARDTLTLT 244
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
QS V + GCG + TG L G A DG++GLG VS+ S A F
Sbjct: 245 -------QSDVLPGFVFGCGEQDTG--LFGRA-DGLVGLGREKVSLSSQAASK--YGAGF 292
Query: 162 SICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YF-----VGVESYCIGNSCLT 213
S C + S + + GPA + F + ++D+ Y+ V V + S +
Sbjct: 293 SYCLPSSPSAAGYLSLGGPAPANA-RFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIV 351
Query: 214 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS---KRISLQGNSWKYCYNASSEEML 270
S ++DSG T LP +YA + F + + KR + CY+ + +
Sbjct: 352 FSAAGTVIDSGTVITRLPPRVYAALRSAFARSMGRYGYKRAPAL-SILDTCYDFTGHTTV 410
Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
++P + L+F+ + + + D GIIG +V+D
Sbjct: 411 RIPSVALVFAGGAAVGLDFSGVLYVAKVSQACLAFAPNGDGADAGIIGNTQQKTLAVVYD 470
Query: 331 RENLKLAWSHSKC 343
K+ + + C
Sbjct: 471 VARQKIGFGANGC 483
>gi|79495937|ref|NP_567922.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660833|gb|AEE86233.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 401
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 62/219 (28%), Positives = 96/219 (43%), Gaps = 18/219 (8%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLV 94
R L P SS + C+ PLCK S C++ + C Y +Y+ + SS G LV
Sbjct: 91 RCLEAPHPLYQPSSDLIPCNDPLCKALHLNSNQRCET-PEQCDYEVEYA-DGGSSLGVLV 148
Query: 95 DDILHLASFSKHAPQS-SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
D+ FS + Q + + +GCG Q DGV+GLG G VS+ S L
Sbjct: 149 RDV-----FSMNYTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHS 203
Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF---VGVESYCIGNS 210
G ++N C G +FFGD + + S+ P+ +Y ++ +G E G
Sbjct: 204 QGYVKNVIGHCLSSLGGGILFFGDDLYDSSR-VSWTPMSREYSKHYSPAMGGE-LLFGGR 261
Query: 211 CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 249
+ DSG+S+T+ ++ Y V + +S K
Sbjct: 262 TTGLKNLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGK 300
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 85/338 (25%), Positives = 139/338 (41%), Gaps = 38/338 (11%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGY 92
++N YDP SSS +N+ C C SS CK+ CPY Y ++ +
Sbjct: 217 EQNGPHYDPGQSSSYRNIGCHDSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDF 276
Query: 93 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
++ + S P+ +V+ GCG G + A ++GLG G +S S L
Sbjct: 277 ALETFTVNLTMSSGKPELRRVENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQLQ 333
Query: 153 KAGLIQNSFSICF-----DENDSGSVFFG-DQGPATQQSTSF--LPIGEKYDA---YFVG 201
L +SFS C D N S + FG D+ + +F L G++ Y+V
Sbjct: 334 S--LYGHSFSYCLVDRNSDANVSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQ 391
Query: 202 VESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
++S +G + T ++DSG + ++ Y + F V +
Sbjct: 392 IKSIVVGGEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPV 451
Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMS 309
+ CYN + E +PD ++FS +F V N+ F E E V CL ++
Sbjct: 452 VKDFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVENY---FIEIEPREVVCLAILG 508
Query: 310 T-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
T IIG I++D + +L ++ +KC +V
Sbjct: 509 TPPSALSIIGNYQQQNFHILYDTKKSRLGFAPTKCADV 546
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 83/335 (24%), Positives = 140/335 (41%), Gaps = 47/335 (14%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDD 96
D++ +DP SSS V C+ PLC+ S C + C Y Y + + ++G +
Sbjct: 176 DQSGPVFDPRRSSSYGAVDCAAPLCRRLDSGGCDLRRRACLYQVAYG-DGSVTAGDFATE 234
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
L A ++ A V +GCG G ++ A ++GLG G +S P+ +++
Sbjct: 235 TLTFAGGARVA-------RVALGCGHDNEGLFVAAAG---LLGLGRGSLSFPTQISR--R 282
Query: 157 IQNSFSICFDENDSG------------SVFFGDQGPATQQSTSFLPIGEKYDA---YFVG 201
SFS C + S +V FG P + + SF P+ Y+V
Sbjct: 283 YGKSFSYCLVDRTSSSSSGAASRSRSSTVTFG---PPSASAASFTPMVRNPRMETFYYVQ 339
Query: 202 VESYCIGNS---CLTQSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 249
+ +G + + +S + +VDSG S T L Y+ + F +
Sbjct: 340 LVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGL 399
Query: 250 RISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM 308
R+S G S + CY+ +++KVP + + F+ + + P + T FC
Sbjct: 400 RLSPGGFSLFDTCYDLGGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGT-FCFAFA 458
Query: 309 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
TDG IIG G R+VFD + ++ ++ C
Sbjct: 459 GTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 493
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 76/315 (24%), Positives = 133/315 (42%), Gaps = 35/315 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+DP++S++ V C +C++ R+S C Y Y + + + G L + L L
Sbjct: 169 FDPATSATFSAVPCGSAVCRTLRTSGCGDSGGCDYEVSYG-DGSYTKGALALETLTLG-- 225
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
+ V IGCG + G ++ A G++GLG G +S+ L A +FS
Sbjct: 226 ------GTAVEGVAIGCGHRNRGLFVGAA---GLLGLGWGPMSLVGQLGGA--AGGAFSY 274
Query: 164 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSC--------- 211
C +GS+ G + A + ++P+ A Y+VG+ +G+
Sbjct: 275 CLASRGAGSLVLG-RSEAVPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQ 333
Query: 212 LTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 270
LT+ G +V D+G + T LP E YA + F V + + + CY+ S +
Sbjct: 334 LTEDGAGGVVMDTGTAVTRLPQEAYAALRDAFVAAVGALPRAPGVSLLDTCYDLSGYTSV 393
Query: 271 KVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
+VP + F + + RN + E +G ++CL + I+G G +I
Sbjct: 394 RVPTVSFYFDGAATLTLPARNLLL---EVDG-GIYCLAFAPSSSGPSILGNIQQEGIQIT 449
Query: 329 FDRENLKLAWSHSKC 343
D N + + + C
Sbjct: 450 VDSANGYIGFGPTTC 464
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 80/320 (25%), Positives = 129/320 (40%), Gaps = 44/320 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DPS SSSS+N+ C P CK +C + K C + Y +S L D L LA
Sbjct: 131 FDPSKSSSSRNLQCDAPQCKQAPNPTCTAGKS-CGFNMTYGGSTIEAS--LTQDTLTLA- 186
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
+ V S GC K TG+ L G+MGLG G +S+ S L ++FS
Sbjct: 187 -------NDVIKSYTFGCISKATGTSLPA---QGLMGLGRGPLSLIS--QTQNLYMSTFS 234
Query: 163 ICF----DENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----- 212
C N SGS+ G + P ++T L + Y+V + +GN +
Sbjct: 235 YCLPNSKSSNFSGSLRLGPKYQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTS 294
Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
+G + DSG FT L Y V +F + + + + G + CY+ S
Sbjct: 295 ALAFDASTGAGTIFDSGTVFTRLVEPAYVAVRNEFRRRIKNANATSLG-GFDTCYSGS-- 351
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD----YGIIGQNFMM 323
+ P + +F+ + +++ + + CL + + + +I
Sbjct: 352 --VVYPSVTFMFAGMNVTLPPDNLLI--HSSSGSTSCLAMAAAPNNVNSVLNVIASMQQQ 407
Query: 324 GHRIVFDRENLKLAWSHSKC 343
HR++ D N +L S C
Sbjct: 408 NHRVLIDLPNSRLGISRETC 427
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 83/322 (25%), Positives = 143/322 (44%), Gaps = 33/322 (10%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
+++PS SSS KN+SCS LC+S +SC K+ C Y +Y + + S G L + L L
Sbjct: 128 KFNPSKSSSYKNISCSSKLCQSVRDTSCNDKKN-CEYSINYGNQ-SHSQGDLSLETLTLE 185
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSV---PSLLAKAGLIQ 158
S + P S ++ +IGCG GS+ ++ +G G + PS+ K
Sbjct: 186 S-TTGRPVSFPKT--VIGCGTNNIGSFKRVSSGVVGLGGGPASLITQLGPSIGGKFSYCL 242
Query: 159 NSFSICFDENDSGS--VFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQ 214
SI GS + FGD + + PI +K + Y++ +E++ +G+ +
Sbjct: 243 VRMSITLKNMSMGSSKLNFGDVAIVSGHNVLSTPIVKKDHSFFYYLTIEAFSVGDKRVEF 302
Query: 215 SGF-------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
+G ++DS TF+P+++Y ++ LV+ +R+ + CYN SS+
Sbjct: 303 AGSSKGVEEGNIIIDSSTIVTFVPSDVYTKLNSAIVDLVTLERVDDPNQQFSLCYNVSSD 362
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG----QNFMM 323
E P M F + + F + V C ++G I G Q+FM+
Sbjct: 363 EEYDFPYMTAHFKGADILLYATNTFVEVARD---VLCFAFAPSNGG-AIFGSFSQQDFMV 418
Query: 324 GHRIVFDRENLKLAWSHSKCEE 345
G +D + +++ C E
Sbjct: 419 G----YDLQQKTVSFKSVDCTE 436
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 79/333 (23%), Positives = 135/333 (40%), Gaps = 41/333 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDI- 97
YDP SSS +N+SC P C+ SS CK+ CPY Y ++ + ++
Sbjct: 237 YDPKDSSSFRNISCHDPRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFT 296
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
++L + + + V+ +V+ GCG G + A G+ L S L
Sbjct: 297 VNLTTPNGKSELKHVE-NVMFGCGHWNRGLFHGAAGLLGLGKGPLSFAS-----QMQSLY 350
Query: 158 QNSFSICF-DENDSGSV----FFG-DQGPATQQSTSFLPIGEKYDA-----YFVGVESYC 206
SFS C D N + SV FG D+ + + +F G D Y+V + S
Sbjct: 351 GQSFSYCLVDRNSNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVM 410
Query: 207 IGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN 256
+ + L ++ ++DSG + T+ Y + F + + +
Sbjct: 411 VDDEVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVEGLP 470
Query: 257 SWKYCYNASSEEMLKVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMST-DGD 313
K CYN S E +++PD ++F+ +F V N+ + V CL ++
Sbjct: 471 PLKPCYNVSGIEKMELPDFGILFADGAVWNFPVENYFIQIDPD----VVCLAILGNPRSA 526
Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
IIG I++D + +L ++ KC +V
Sbjct: 527 LSIIGNYQQQNFHILYDMKKSRLGYAPMKCADV 559
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 87/359 (24%), Positives = 147/359 (40%), Gaps = 55/359 (15%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLC---------KSRSSCKSLKDPCPYIADYSTEDTSSS 90
R+ +DP++S S + V C LC S C + C Y Y + +S+
Sbjct: 131 RSRPVFDPAASQSYRQVPCISQLCLAVQQQTSNGSSQPCVNSSATCTYSLSYG-DSRNST 189
Query: 91 GYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 150
G D++ L S + + Q+ V GC G +D + G++G G++S+PS
Sbjct: 190 GDFSQDVIFLNS-TNSSGQAVQFRDVAFGCAHSPQGFLVDLGSL-GIVGFNRGNLSLPSQ 247
Query: 151 LAKAGLIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGE------KYDAYF 199
L K L + FS CF +G +F GD G ++ + P+ + + Y+
Sbjct: 248 L-KDRLGGSKFSYCFPSQPWQPRATGVIFLGDSG-LSKSKVGYTPLLDNPVTPARSQLYY 305
Query: 200 VGVESYCIGNSCLT--QSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSS 248
VG+ S + L +S F+ ++DSG +FT + + Y F +S
Sbjct: 306 VGLTSISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAF---AAS 362
Query: 249 KRISLQGN-----SWKYCYNASSEEML-KVPDMRLIFSKNQSFVVR-NHIF---SFPENE 298
R L+ + CYN S+ L VP++RL N +R H+F S NE
Sbjct: 363 NRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNE 422
Query: 299 GFTVFCLTVMSTD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVH 353
CL ++S+ G ++G + + +D E ++ + + C VH
Sbjct: 423 --VTVCLAILSSQKSGFGKINVLGNYQQSNYLVEYDNERSRVGFERADCSGAAGSFLVH 479
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 77/318 (24%), Positives = 128/318 (40%), Gaps = 40/318 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DPS SSSS+ + C P CK SC ++ C + Y ++ YL D L LA
Sbjct: 128 FDPSKSSSSRTLQCEAPQCKQAPNPSC-TVSKSCGFNMTYG--GSTIEAYLTQDTLTLA- 183
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S V + GC K +G+ L G+MGLG G +S+ S L Q++FS
Sbjct: 184 -------SDVIPNYTFGCINKASGTSLPA---QGLMGLGRGPLSLIS--QSQNLYQSTFS 231
Query: 163 ICF----DENDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----- 212
C N SGS+ G + P ++T L + Y+V + +GN +
Sbjct: 232 YCLPNSKSSNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTS 291
Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
+G + DSG +T L Y V +F + V + + G + CY+ S
Sbjct: 292 ALAFDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLG-GFDTCYSGS-- 348
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV--MSTDGDYGIIGQNFMMGH 325
+ P + +F+ + +++ + + ++ + +I H
Sbjct: 349 --VVFPSVTFMFAGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNH 406
Query: 326 RIVFDRENLKLAWSHSKC 343
R++ D N +L S C
Sbjct: 407 RVLIDVPNSRLGISRETC 424
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 69/281 (24%), Positives = 111/281 (39%), Gaps = 23/281 (8%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ +DP SS ++ +SCS C S S C + C Y Y + + +SG+ V D
Sbjct: 125 LNFFDPGSSVTASPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYG-DGSGTSGFYVSD 183
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAG 155
+L + + + V+ GC QTG + A DG+ G G +SV S LA G
Sbjct: 184 VLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQG 243
Query: 156 LIQNSFSICFD-ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
+ FS C EN G + G + + F P+ Y V + S + L
Sbjct: 244 IAPRVFSHCLKGENGGGGILV--LGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPI 301
Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNA 264
T +G ++D+G + +L Y V VS + + +GN CY
Sbjct: 302 NPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ---CYVI 358
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL 305
++ P + L F+ S + + +N + C
Sbjct: 359 TTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVASALCF 399
>gi|115467508|ref|NP_001057353.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|53791766|dbj|BAD53531.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|53793187|dbj|BAD54393.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113595393|dbj|BAF19267.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|125596798|gb|EAZ36578.1| hypothetical protein OsJ_20919 [Oryza sativa Japonica Group]
gi|215767941|dbj|BAH00170.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 538
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 75/296 (25%), Positives = 130/296 (43%), Gaps = 36/296 (12%)
Query: 76 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA-P 134
C Y Y+ + +SS G L D + L + A + GCG Q G+ L A
Sbjct: 233 CDYEITYA-DRSSSMGILARDNMQLIT----ADGERENLDFVFGCGYDQQGNLLSSPANT 287
Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIG 192
DG++GL +S+P+ LA G+I N F C D ++ G +F GD + +++PI
Sbjct: 288 DGILGLSNAAISLPTQLASQGIISNVFGHCIAADPSNGGYMFLGDDY-VPRWGMTWMPIR 346
Query: 193 E-KYDAYFVGVESYCIGNSCLT---QSG--FQALVDSGASFTFLPTEIYAEVVVKFDKLV 246
+ Y V+ G+ L ++G Q + DSG+S+T+LP + Y ++ L
Sbjct: 347 NGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQVIFDSGSSYTYLPHDDYTNLIASLKSLS 406
Query: 247 SSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN-----EGFT 301
S + +C + + + D++ +F K S V + +F P E +
Sbjct: 407 PSLLQDESDRTLPFCMKPNF-PVRSMDDVKHLF-KPLSLVFKKRLFILPRTFVIPPEDYL 464
Query: 302 V------FCLTVMSTDG-DYG-----IIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
+ CL V+ DG + G +IG + G +V++ + ++ W S C +
Sbjct: 465 IISDKNNICLGVL--DGTEIGHDSAIVIGDVSLRGKLVVYNNDEKQIGWVQSDCAK 518
>gi|125554848|gb|EAZ00454.1| hypothetical protein OsI_22475 [Oryza sativa Indica Group]
Length = 538
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 75/296 (25%), Positives = 130/296 (43%), Gaps = 36/296 (12%)
Query: 76 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA-P 134
C Y Y+ + +SS G L D + L + A + GCG Q G+ L A
Sbjct: 233 CDYEITYA-DRSSSMGILARDNMQLIT----ADGERENLDFVFGCGYDQQGNLLSSPANT 287
Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIG 192
DG++GL +S+P+ LA G+I N F C D ++ G +F GD + +++PI
Sbjct: 288 DGILGLSNAAISLPTQLASQGIISNVFGHCIAADPSNGGYMFLGDDY-VPRWGMTWMPIR 346
Query: 193 E-KYDAYFVGVESYCIGNSCLT---QSG--FQALVDSGASFTFLPTEIYAEVVVKFDKLV 246
+ Y V+ G+ L ++G Q + DSG+S+T+LP + Y ++ L
Sbjct: 347 NGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQVIFDSGSSYTYLPHDDYTNLIASLKSLS 406
Query: 247 SSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN-----EGFT 301
S + +C + + + D++ +F K S V + +F P E +
Sbjct: 407 PSLLQDESDRTLPFCMKPNF-PVRSMDDVKHLF-KPLSLVFKKRLFILPRTFVIPPEDYL 464
Query: 302 V------FCLTVMSTDG-DYG-----IIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
+ CL V+ DG + G +IG + G +V++ + ++ W S C +
Sbjct: 465 IISDKNNICLGVL--DGTEIGHDSAIVIGDVSLRGKLVVYNNDEKQIGWVQSDCAK 518
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 77/318 (24%), Positives = 128/318 (40%), Gaps = 40/318 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DPS SSSS+ + C P CK SC ++ C + Y ++ YL D L LA
Sbjct: 128 FDPSKSSSSRTLQCEAPQCKQAPNPSC-TVSKSCGFNMTYG--GSTIEAYLTQDTLTLA- 183
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S V + GC K +G+ L G+MGLG G +S+ S L Q++FS
Sbjct: 184 -------SDVIPNYTFGCINKASGTSLPA---QGLMGLGRGPLSLIS--QSQNLYQSTFS 231
Query: 163 ICF----DENDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----- 212
C N SGS+ G + P ++T L + Y+V + +GN +
Sbjct: 232 YCLPNSKSSNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTS 291
Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
+G + DSG +T L Y V +F + V + + G + CY+ S
Sbjct: 292 ALAFDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLG-GFDTCYSGS-- 348
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV--MSTDGDYGIIGQNFMMGH 325
+ P + +F+ + +++ + + ++ + +I H
Sbjct: 349 --VVFPSVTFMFAGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNH 406
Query: 326 RIVFDRENLKLAWSHSKC 343
R++ D N +L S C
Sbjct: 407 RVLIDVPNSRLGISRETC 424
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 76/292 (26%), Positives = 118/292 (40%), Gaps = 31/292 (10%)
Query: 69 CKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 127
C + C Y A Y DTS S GYL D+L L P ++ S + GCG+ G
Sbjct: 181 CSNATGACVYKASYG--DTSFSIGYLSQDVLTLT------PSAAPSSGFVYGCGQDNQGL 232
Query: 128 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF------DENDSGSVFFG-DQGP 180
+ A G++GL +S+ L+ N+FS C N S S F
Sbjct: 233 FGRSA---GIIGLANDKLSMLGQLSNK--YGNAFSYCLPSSFSAQPNSSVSGFLSIGASS 287
Query: 181 ATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQSG----FQALVDSGASFTFLPTE 233
+ F P+ + YF+G+ + + L S ++DSG T LP
Sbjct: 288 LSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSYNVPTIIDSGTVITRLPVA 347
Query: 234 IYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIF 292
IY + F ++S K G S C+ S +EM VP++R+IF ++ H
Sbjct: 348 IYNALKKSFVMIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIRIIFRGGAGLELKVHNS 407
Query: 293 SFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
+G T CL + ++ IIG + +D N K+ ++ C+
Sbjct: 408 LVEIEKGTT--CLAIAASSNPISIIGNYQQQTFTVAYDVANSKIGFAPGGCQ 457
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 75/317 (23%), Positives = 125/317 (39%), Gaps = 38/317 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DP+ SS+ N+SC+ P C + C Y Y + + S G+ D L L+S+
Sbjct: 223 FDPARSSTYANISCAAPACSDLDTRGCSGGNCLYGVQYG-DGSYSIGFFAMDTLTLSSY- 280
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSI 163
GCG + G + + A G++GLG G S+P K G + F+
Sbjct: 281 ------DAVKGFRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAH 328
Query: 164 CFDENDSGSVF--FGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNSCLT--QS 215
C SG+ + FG PA + P+ G + Y+VG+ +G L+ QS
Sbjct: 329 CLPARSSGTGYLDFGPGSPAAAGARLTTPMLTDNGPTF--YYVGMTGIRVGGQLLSIPQS 386
Query: 216 GFQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK------RISLQGNSWKYCYNASS 266
F +VDSG T LP Y+ + F ++++ +SL CY+ +
Sbjct: 387 VFTTAGTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPAVSL----LDTCYDFTG 442
Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
+ +P + L+F V + + GD GI+G +
Sbjct: 443 MSQVAIPTVSLLFQGGARLDVDASGIMYAASVSQVCLGFAANEDGGDVGIVGNTQLKTFG 502
Query: 327 IVFDRENLKLAWSHSKC 343
+ +D + +S C
Sbjct: 503 VAYDIGKKVVGFSPGAC 519
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 76/323 (23%), Positives = 134/323 (41%), Gaps = 44/323 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
+DPS SS+ ++CS C +C + + C Y Y + + + GY
Sbjct: 67 FDPSKSSTYNKIACSSSACADLLGTQTCSAAAN-CIYAYGYG-DGSVTRGY--------- 115
Query: 102 SFSKHA--PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
FSK + V G TG++ D +G++GLG G VS+PS L ++ N
Sbjct: 116 -FSKETITATDTAGEEVKFGASVYNTGTFGDTGG-EGILGLGQGPVSMPSQLGS--VLGN 171
Query: 160 SFSICFDE-----NDSGSVFFGDQG-PATQ-QSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
FS C + +++ +++FGD P+ + Q T +P + Y++ V+ +G S L
Sbjct: 172 KFSYCLVDWLSAGSETSTMYFGDAAVPSGEVQYTPIVPNADHPTYYYIAVQGISVGGSLL 231
Query: 213 --TQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
QS ++ ++DSG + T+L E++ +V + V + C+
Sbjct: 232 DIDQSVYEIDSGGSGGTIIDSGTTITYLQQEVFNALVAAYTSQVRYPTTT-SATGLDLCF 290
Query: 263 NASSEEMLKVPDMRL-IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST-DGDYGIIGQN 320
N P M + + + N S N + CL S D I G
Sbjct: 291 NTRGTGSPVFPAMTIHLDGVHLELPTANTFISLETN----IICLAFASALDFPIAIFGNI 346
Query: 321 FMMGHRIVFDRENLKLAWSHSKC 343
IV+D +N+++ ++ + C
Sbjct: 347 QQQNFDIVYDLDNMRIGFAPADC 369
>gi|356507437|ref|XP_003522473.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 440
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 75/328 (22%), Positives = 129/328 (39%), Gaps = 43/328 (13%)
Query: 47 PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASF 103
P S+ V C H LC S + P+ DY + SS G L+ D+ L +F
Sbjct: 120 PLYRPSNDLVPCRHALCASLHLSDNYDCEVPHQCDYEVQYADHYSSLGVLLHDVYTL-NF 178
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
+ ++ + +GCG Q DG++GLG G S+ S L GL++N
Sbjct: 179 TNGV---QLKVRMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGH 235
Query: 164 CFDENDSGSVFFGDQGPATQQSTSFLPIGEK-YDAYFV-GVESYCIGNSCLTQSGFQALV 221
C G +FFGD + + ++ P+ + Y Y V G G A+
Sbjct: 236 CLSAQGGGYIFFGDVYDSFR--LTWTPMSSRDYKHYSVAGAAELLFGGKKSGVGNLHAVF 293
Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASSEEMLKVPDMRLIF 279
D+G+S+T+ + Y ++ K K + + + C+ R I+
Sbjct: 294 DTGSSYTYFNSYAYQVLISWLKKESGGKPLKEAHDDQTLPLCWRGRRP-------FRSIY 346
Query: 280 SKNQSFVVRNHIFSFPEN-----------EGFTV------FCLTVMSTD----GDYGIIG 318
+ F + + SF N E + + CL +++ GD +IG
Sbjct: 347 EVRKYF--KPIVLSFTSNGRSKAQFEMLPEAYLIVSNMGNVCLGILNGSEVGMGDLNLIG 404
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEV 346
M+ +VFD + + W+ + C++V
Sbjct: 405 DISMLNKVMVFDNDKQLIGWAPADCDQV 432
>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
Length = 426
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 78/322 (24%), Positives = 130/322 (40%), Gaps = 51/322 (15%)
Query: 56 VSCSHPLCKSRS----SCKSLKDPCPYIADYSTEDTSSSGYLVDDI--LHLASFSKHAPQ 109
V C P+C S C+ + C Y +Y+ + SS G LV D+ L+ + + AP+
Sbjct: 117 VICKDPMCASLHPPGYKCEH-PEQCDYEVEYA-DGGSSLGVLVKDVFPLNFTNGLRLAPR 174
Query: 110 SSVQSSVIIGCGRKQT--GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE 167
+ +GCG Q SY DGV+GLG G S+ S L G+I+N C
Sbjct: 175 ------LALGCGYDQIPGQSY---HPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSS 225
Query: 168 NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASF 227
G +FFGD + + + +++ Y G +G DSG+S+
Sbjct: 226 RGGGFLFFGDDLYDSSRVVWTPMLRDQHTHYSSGYAELILGGKTTVFKNLLVTFDSGSSY 285
Query: 228 TFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF 285
T+L + Y +V K +S K R +L + C+ V D++ F
Sbjct: 286 TYLNSLAYQALVHLVRKELSEKPVREALDDQTLPLCWRG-KRPFKSVRDVKKFF------ 338
Query: 286 VVRNHIFSFPEN-----------EGFTVF------CLTVMSTD----GDYGIIGQNFMMG 324
+ SFP E + + CL +++ D+ +IG M
Sbjct: 339 --KPLALSFPGGGRTKTQYDIPLESYLIISLKGNVCLGILNGTEAGLQDFNLIGDISMQD 396
Query: 325 HRIVFDRENLKLAWSHSKCEEV 346
+V+D E ++ W+ + C+ +
Sbjct: 397 KMVVYDNEKNQIGWAPTNCDRL 418
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 81/314 (25%), Positives = 142/314 (45%), Gaps = 58/314 (18%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSR--SSCKS-LKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
+DPS S + KN+ CS CKS +SC S + C + +Y + + S G L+ + + L
Sbjct: 130 FDPSYSKTYKNLPCSSTTCKSVQGTSCSSDERKICEHTVNYK-DGSHSQGDLIVETVTLG 188
Query: 102 SFSK---HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
S++ H P++ +IGC R S+ G++GLG G VS+ L+ + I
Sbjct: 189 SYNDPFVHFPRT------VIGCIRNTNVSF----DSIGIVGLGGGPVSLVPQLSSS--IS 236
Query: 159 NSFSICFD--ENDSGSVFFGD----QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
FS C + S + FGD G T + +K+ Y++ +E++ +GN+ +
Sbjct: 237 KKFSYCLAPISDRSSKLKFGDAAMVSGDGTVSTRIVFKDWKKF--YYLTLEAFSVGNNRI 294
Query: 213 --------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
+ ++DSG +FT LP ++Y+++ +V +R + CY
Sbjct: 295 EFRSSSSRSSGKGNIIIDSGTTFTVLPDDVYSKLESAVADVVKLERAEDPLKQFSLCYK- 353
Query: 265 SSEEMLKVPDMRLIFSKN-------QSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
S+ + + VP + FS +F+V +H V CL +S+ I
Sbjct: 354 STYDKVDVPVITAHFSGADVKLNALNTFIVASH----------RVVCLAFLSSQSG-AIF 402
Query: 318 G----QNFMMGHRI 327
G QNF++G+ +
Sbjct: 403 GNLAQQNFLVGYDL 416
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 78/315 (24%), Positives = 129/315 (40%), Gaps = 36/315 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
YDP+ SS+ + C P CK S C D C YI +Y + +++G V D L
Sbjct: 200 YDPAKSSTFAPIPCGSPACKELGSSYGNGCSPTTDECKYIVNYG-DGKATTGTYVTDTLT 258
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
++ + V GC GS+ + A G++ LG G S+ L A N
Sbjct: 259 MSP-------TIVVKDFRFGCSHAVRGSFSNQNA--GILALGGGRGSL--LEQTADAYGN 307
Query: 160 SFSICFDENDSGSVFFGDQGPATQQ-STSFLPIGEKYDA---YFVGVESYCIGNSCL--- 212
+FS C + S F GP S+ P+ + A Y V +E+ + L
Sbjct: 308 AFSYCIPKPSSAG-FLSLGGPVEASLKFSYTPLIKNKHAPTFYIVHLEAIIVAGKQLAVP 366
Query: 213 -TQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNASSEEML 270
T A++DSGA T LP ++YA + F + + ++ + CY+ + +
Sbjct: 367 PTAFATGAVMDSGAVVTQLPPQVYAALRAAFRSAMAAYGPLAAPVRNLDTCYDFTRFPDV 426
Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD--YGIIGQNFMMGHRIV 328
KVP + L+F+ + + +G CL +T G+ G IG + ++
Sbjct: 427 KVPKVSLVFAGGATLDLEPASIIL---DG----CLAFAATPGEESVGFIGNVQQQTYEVL 479
Query: 329 FDRENLKLAWSHSKC 343
+D K+ + C
Sbjct: 480 YDVGGGKVGFRRGAC 494
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 77/318 (24%), Positives = 137/318 (43%), Gaps = 32/318 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS------RSSCKSLKDP-CPYIADYSTEDTSSSGYLVDDI 97
+DPS SSS N++C+ LC +S C S D C Y A Y ++++S G+L +
Sbjct: 89 FDPSKSSSYTNITCTSSLCTQLTSDGIKSECSSSTDASCIYDAKYG-DNSTSVGFLSQER 147
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
L + + + + + GCG+ G + +G+A G+MGLG +S+ + +
Sbjct: 148 LTITA-------TDIVDDFLFGCGQDNEGLF-NGSA--GLMGLGRHPISI--VQQTSSNY 195
Query: 158 QNSFSICFDENDS--GSVFFGDQGPATQQSTSFLPIGE-KYDAYFVGVE--SYCIGNSCL 212
FS C S G + FG AT S + P+ D F G++ S +G + L
Sbjct: 196 NKIFSYCLPATSSSLGHLTFG-ASAATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKL 254
Query: 213 ---TQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
+ S F A ++DSG T L +YA + F + + ++ + CY+ S
Sbjct: 255 PAVSSSTFSAGGSIIDSGTVITRLAPTVYAALRSAFRRXMEKYPVANEAGLLDTCYDLSG 314
Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
+ + VP + FS + + + E+E +D D + G
Sbjct: 315 YKEISVPRIDFEFSGGVTVELXHRGILXVESEQQVCLAFAANGSDNDITVFGNVQQKTLE 374
Query: 327 IVFDRENLKLAWSHSKCE 344
+V+D + ++ + + C+
Sbjct: 375 VVYDVKGGRIGFGAAGCK 392
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 81/306 (26%), Positives = 130/306 (42%), Gaps = 33/306 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS---RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
+DPS S + K + CS +C+S +SC S D C Y Y +++ S G L + L L
Sbjct: 136 FDPSQSKTYKTLPCSSNICQSVQSAASCSSNNDECEYTITYG-DNSHSQGDLSVETLTLG 194
Query: 102 SFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
S SSVQ +IGCG G++ +G +GLG V + + I
Sbjct: 195 S----TDGSSVQFPKTVIGCGHNNKGTF----QREGSGIVGLGGGPVSLISQLSSSIGGK 246
Query: 161 FSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSCLT 213
FS C N S + FGD+ + + T PI K YF+ +E++ +G++ +
Sbjct: 247 FSYCLAPLFSQSNSSSKLNFGDEAVVSGRGTVSTPIVPKNGLGFYFLTLEAFSVGDNRIE 306
Query: 214 QSGF---------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
++DSG + T LP + Y + + +R+ + CY
Sbjct: 307 FGSSSFESSGGEGNIIIDSGTTLTILPEDDYLNLESAVADAIELERVEDPSKFLRLCYRT 366
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPE-NEGFTVFCLTVMSTDGDYGIIG-QNFM 322
+S + L VP + F V N I +F E +EG F +G + QN +
Sbjct: 367 TSSDELNVPVITAHFKGAD--VELNPISTFIEVDEGVVCFAFRSSKIGPIFGNLAQQNLL 424
Query: 323 MGHRIV 328
+G+ +V
Sbjct: 425 VGYDLV 430
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 79/317 (24%), Positives = 130/317 (41%), Gaps = 33/317 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DP+ S++ V C HP C + S C Y Y + +S++G L + L L+S +
Sbjct: 204 FDPTKSATYSAVPCGHPQCAAAGGKCSNSGTCLYKVTYG-DGSSTAGVLSHETLSLSS-T 261
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
+ P GCG+ G + ++GLG G +S+PS A +FS C
Sbjct: 262 RDLP------GFAFGCGQTNLGEFGGVDG---LVGLGRGALSLPS--QAAATFGATFSYC 310
Query: 165 FDENDS--GSVFFGDQGPATQ------QSTSFLPIGEKYDAYFVGVESYCIGNSCL---- 212
D+ G + G PA Q T+ + + YFV V S IG L
Sbjct: 311 LPSYDTTHGYLTMGSTTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPP 370
Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
T+ G L DSG T+LP E YA + +F ++ + + + + CY+ +
Sbjct: 371 TVFTRDG--TLFDSGTILTYLPPEAYASLRDRFKFTMTQYKPAPAYDPFDTCYDFTGHNA 428
Query: 270 LKVPDMRLIFSKNQSFVVRN-HIFSFPENEGFTVFCLTVMSTDGD--YGIIGQNFMMGHR 326
+ +P + FS F + I +P++ CL + + IIG G
Sbjct: 429 IFMPAVAFKFSDGAVFDLSPVAILIYPDDTAPATGCLAFVPRPSTMPFNIIGNTQQRGTE 488
Query: 327 IVFDRENLKLAWSHSKC 343
+++D K+ + C
Sbjct: 489 VIYDVAAEKIGFGQFTC 505
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 90/339 (26%), Positives = 143/339 (42%), Gaps = 40/339 (11%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGY 92
++N YDP SSS +N+ C P C SS CK+ CPY Y ++ +
Sbjct: 126 EQNGPYYDPKESSSFRNIGCHDPRCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDF 185
Query: 93 LVDDI-LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
+ ++L S + + V+ +V+ GCG G + GA+ G++GLG G +S S L
Sbjct: 186 ATETFTVNLTSPTGKSEFKRVE-NVMFGCGHWNRGLF-HGAS--GLLGLGRGPLSFSSQL 241
Query: 152 AKAGLIQNSFSICF-----DENDSGSVFFG-DQGPATQQSTSFLP-IGEKYDA----YFV 200
L +SFS C D N S + FG D+ +F +G K + Y+V
Sbjct: 242 QS--LYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYV 299
Query: 201 GVESYCIGNSCL---------TQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 250
++S +G L T G +VDSG + ++ Y + F K V
Sbjct: 300 QIKSIMVGGEVLNIPESTWNMTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYP 359
Query: 251 ISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVM 308
I CYN S E + +PD ++F+ +F V N+ E V CL ++
Sbjct: 360 IVQDFPILDPCYNVSGVEKIDLPDFGILFADGAVWNFPVENYFIRLDPEE---VVCLAIL 416
Query: 309 ST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
T IIG +++D + +L ++ C +V
Sbjct: 417 GTPRSALSIIGNYQQQNFHVLYDTKKSRLGYAPMNCADV 455
>gi|302853254|ref|XP_002958143.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
nagariensis]
gi|300256504|gb|EFJ40768.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
nagariensis]
Length = 475
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 62/262 (23%), Positives = 104/262 (39%), Gaps = 27/262 (10%)
Query: 73 KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA 132
+ C Y Y+ E +SS G++V+D P ++ GC +TG
Sbjct: 4 NEKCYYSRTYA-ERSSSEGWMVEDAFGF-------PDDQPPVRMVFGCENGETGEIYRQL 55
Query: 133 APDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIG 192
A DG+MG+G + S L G+I++ FS+CF G + GD +T + P+
Sbjct: 56 A-DGIMGMGNNHNAFQSQLVARGVIEDVFSLCFGYPKDGILLLGDVPMPKGANTVYTPLL 114
Query: 193 EKYDAYFVGVESYCIG--------NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK 244
++ V I N+ + G+ ++DSG +FT+LPTE + +
Sbjct: 115 NNLHLHYYNVRMDGIAVNGVELSLNARIFTRGYGVVLDSGTTFTYLPTEAFNAMAAAIGS 174
Query: 245 LVSSKRI-SLQGNSWKY---CYNASSEEMLKV----PDMRLIFSKNQSFVVRNHIFSFPE 296
S + S G +Y C+ + + + P +F N + + F
Sbjct: 175 YALSHGLQSTPGADPQYNDICWKGAPDNFQGLENHFPSAEFVFGDNARLSLPPLRYLFVS 234
Query: 297 NEGFTVFCLTVMSTDGDYGIIG 318
G +CL V G +IG
Sbjct: 235 RPG--EYCLGVFDNGGSGTLIG 254
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 85/328 (25%), Positives = 141/328 (42%), Gaps = 34/328 (10%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCK---SRSSC-KSLKDPCPYIADYSTEDTSSSGYLV 94
+++L YD S SS+ SC CK S + C C Y YS D S++ +
Sbjct: 71 NQSLPYYDASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAY--SYSYGDKSATIGFL 128
Query: 95 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
D + SF A SV V+ GCG TG + G+ G G G +S+PS L K
Sbjct: 129 D--VETVSFVAGA---SV-PGVVFGCGLNNTGIFRSNET--GIAGFGRGPLSLPSQL-KV 179
Query: 155 GLIQNSFSICFDENDSGSVF-----FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 209
G + F+ S +F G T Q+T + Y++ ++ +G+
Sbjct: 180 GNFSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGS 239
Query: 210 SCLT--QSGFQ-------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
+ L +S F ++DSG +FT LP +Y V +F V +
Sbjct: 240 TRLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLL 299
Query: 261 CYNASS-EEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
C++A + VP + L F + R N++F ++ G CL ++ +G+ IIG
Sbjct: 300 CFSAPPLGKAPHVPKLVLHFEGATMHLPRENYVFE-AKDGGNCSICLAII--EGEMTIIG 356
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+++D +N KL++ +KC+++
Sbjct: 357 NFQQQNMHVLYDLKNSKLSFVRAKCDKL 384
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 85/328 (25%), Positives = 141/328 (42%), Gaps = 34/328 (10%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCK---SRSSC-KSLKDPCPYIADYSTEDTSSSGYLV 94
+++L YD S SS+ SC CK S + C C Y YS D S++ +
Sbjct: 127 NQSLPYYDASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAY--SYSYGDKSATIGFL 184
Query: 95 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
D + SF A SV V+ GCG TG + G+ G G G +S+PS L K
Sbjct: 185 D--VETVSFVAGA---SV-PGVVFGCGLNNTGIFRSNET--GIAGFGRGPLSLPSQL-KV 235
Query: 155 GLIQNSFSICFDENDSGSVF-----FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 209
G + F+ S +F G T Q+T + Y++ ++ +G+
Sbjct: 236 GNFSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGS 295
Query: 210 SCLT--QSGFQ-------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
+ L +S F ++DSG +FT LP +Y V +F V +
Sbjct: 296 TRLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLL 355
Query: 261 CYNASS-EEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
C++A + VP + L F + R N++F ++ G CL ++ +G+ IIG
Sbjct: 356 CFSAPPLGKAPHVPKLVLHFEGATMHLPRENYVFE-AKDGGNCSICLAII--EGEMTIIG 412
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+++D +N KL++ +KC+++
Sbjct: 413 NFQQQNMHVLYDLKNSKLSFVRAKCDKL 440
>gi|356518800|ref|XP_003528065.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 438
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 74/328 (22%), Positives = 128/328 (39%), Gaps = 43/328 (13%)
Query: 47 PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASF 103
P S+ V C H LC S + P+ DY + SS G L+ D+ L +F
Sbjct: 118 PLYRPSNDFVPCRHSLCASLHHSDNYDCEVPHQCDYEVQYADHYSSLGVLLHDVYTL-NF 176
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
+ ++ + +GCG Q DG++GLG G S+ S L GL++N
Sbjct: 177 TNGV---QLKVRMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGH 233
Query: 164 CFDENDSGSVFFGDQGPATQQSTSFLPIGEK-YDAY-FVGVESYCIGNSCLTQSGFQALV 221
C G +FFGD +++ ++ P+ + Y Y G G A+
Sbjct: 234 CLSAQGGGYIFFGDVYDSSR--LTWTPMSSRDYKHYSAAGAAELLFGGKKSGIGSLHAVF 291
Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIF 279
D+G+S+T+ Y ++ K K + + + C+ R I+
Sbjct: 292 DTGSSYTYFNPYAYQALISWLGKESGGKPLKEAHDDQTLPLCWRGRRP-------FRSIY 344
Query: 280 SKNQSFVVRNHIFSFPEN-----------EGFTVF------CLTVMSTD----GDYGIIG 318
+ F + + SF N E + + CL +++ GD +IG
Sbjct: 345 EVRKYF--KPIVLSFTSNGRSKAQFEMPPEAYLIISNMGNVCLGILNGSEVGMGDLNLIG 402
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEV 346
M+ +VFD + + W+ + C++V
Sbjct: 403 DISMLNKVMVFDNDKQLIGWTPADCDQV 430
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 73/293 (24%), Positives = 118/293 (40%), Gaps = 33/293 (11%)
Query: 69 CKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 127
C + C Y A Y DTS S GYL D+L L P + S + GCG+ G
Sbjct: 187 CSNATGACVYKASYG--DTSFSIGYLSQDVLTLT------PSEAPSSGFVYGCGQDNQGL 238
Query: 128 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND--------SGSVFFGDQG 179
+ G++GL +S+ L+K N+FS C + SG + G
Sbjct: 239 F---GRSSGIIGLANDKISMLGQLSKK--YGNAFSYCLPSSFSAPNSSSLSGFLSIGASS 293
Query: 180 PATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLTQSG----FQALVDSGASFTFLPT 232
T F P+ + YF+ + + + L S ++DSG T LP
Sbjct: 294 -LTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVPTIIDSGTVITRLPV 352
Query: 233 EIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHI 291
+Y + F ++S K G S C+ S +EM VP++++IF ++ H
Sbjct: 353 AVYNALKKSFVLIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIQIIFRGGAGLELKAHN 412
Query: 292 FSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
+G T CL + ++ IIG ++ +D N K+ ++ C+
Sbjct: 413 SLVEIEKGTT--CLAIAASSNPISIIGNYQQQTFKVAYDVANFKIGFAPGGCQ 463
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 131/314 (41%), Gaps = 33/314 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DPS S+S +V+C +P C ++C++ C Y Y + + + G + L L
Sbjct: 205 FDPSLSTSYASVACDNPRCHDLDAAACRNSTGACLYEVAYG-DGSYTVGDFATETLTLG- 262
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S+ SSV IGCG G ++ A + G L S PS ++ +FS
Sbjct: 263 ------DSAPVSSVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----TTFS 308
Query: 163 ICFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGF 217
C + DS S + FGD A + + + Y+VG+ +G L+ S F
Sbjct: 309 YCLVDRDSPSSSTLQFGDAADA-EVTAPLIRSPRTSTFYYVGLSGISVGGQILSIPPSAF 367
Query: 218 Q--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
+VDSG + T L + YA + F + S + + + CY+ S
Sbjct: 368 AMDGTGAGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTS 427
Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 329
++VP + L F+ + + P + G +CL T+ IIG G R+ F
Sbjct: 428 VEVPAVSLRFAGGGELRLPAKNYLIPVD-GAGTYCLAFAPTNAAVSIIGNVQQQGTRVSF 486
Query: 330 DRENLKLAWSHSKC 343
D + ++ +KC
Sbjct: 487 DTAKSTVGFTSNKC 500
>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
Length = 424
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 73/321 (22%), Positives = 127/321 (39%), Gaps = 28/321 (8%)
Query: 47 PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASF 103
P S+ + C+ PLCK+ + + P DY E SS G LV D+ L
Sbjct: 98 PLYQPSNDLIPCNDPLCKALHFNGNHRCETPEQCDYEVEYADGGSSLGVLVRDVFSL--- 154
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
+ + + +GCG Q DGV+GLG G VS+ S L G ++N
Sbjct: 155 -NYTKGLRLTPRLALGCGYDQIPGASGHHPLDGVLGLGRGKVSILSQLHSQGYVKNVVGH 213
Query: 164 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF---VGVESYCIGNSCLTQSGFQAL 220
C G +FFG+ + + S+ P+ + ++ +G E G +
Sbjct: 214 CLSSLGGGILFFGNDLYDSSR-VSWTPMARENSKHYSPAMGGE-LLFGGRTTGLKNLLTV 271
Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNAS----SEEMLKVPD 274
DSG+S+T+ ++ Y V + +S K + + ++ C+ S E +K
Sbjct: 272 FDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKYF 331
Query: 275 MRLIFSKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTD----GDYGIIGQNFMMGH 325
L S + + +F P + CL +++ + +IG M
Sbjct: 332 KPLALSFKTGWRSKT-LFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQ 390
Query: 326 RIVFDRENLKLAWSHSKCEEV 346
I++D E + W + C+E+
Sbjct: 391 MIIYDNEKQSIGWIPADCDEI 411
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 73/299 (24%), Positives = 122/299 (40%), Gaps = 28/299 (9%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DP+ SS+ N+SC+ P C C Y Y + + S G+ D L L+S+
Sbjct: 204 FDPARSSTYANISCAAPACSDLYIKGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY- 261
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSI 163
GCG + G Y + A G++GLG G S+P K G + F+
Sbjct: 262 ------DAIKGFRFGCGERNEGLYGEAA---GLLGLGRGKTSLPVQAYDKYGGV---FAH 309
Query: 164 CFDENDSGSVFFGDQGPA-----TQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSG 216
CF SG+ + D GP + + T+ + + Y+VG+ +G L+ QS
Sbjct: 310 CFPARSSGTGYL-DFGPGSLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSV 368
Query: 217 FQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLK 271
F +VDSG T LP Y+ + F ++ + + + + CY+ + +
Sbjct: 369 FTTSGTIVDSGTVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYDFTGMSEVA 428
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
+P + L+F S V + + D D GI+G + +V+D
Sbjct: 429 IPTVSLLFQGGASLDVHASGIIYAASVSQACLGFAGNKEDDDVGIVGNTQLKTFGVVYD 487
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 139/333 (41%), Gaps = 41/333 (12%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSR--SSCKSLK----DPCPYIADYSTEDTSSSGY 92
D+ L +DPS+SS+ SC LC+ +SC S K C Y Y + + ++G+
Sbjct: 118 DQALPYFDPSTSSTLSLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYG-DKSVTTGF 176
Query: 93 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
L D P V GCG G + G+ G G G +S+PS L
Sbjct: 177 LEVDKFTFVGAGASVP------GVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL- 227
Query: 153 KAGLIQNSFSICFDENDS---GSVFFG------DQGPATQQSTSFLPIGEKYDAYFVGVE 203
K G +FS CF + +V G QST + Y++ ++
Sbjct: 228 KVG----NFSHCFTAVNGLKPSTVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLK 283
Query: 204 SYCIGNSCLT--QSGFQ-------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 254
+G++ L +S F ++DSG + T LPT +Y V F V +S
Sbjct: 284 GITVGSTRLPVPESEFTLKNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGN 343
Query: 255 GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGD 313
+C +A VP + L F + R N++F E+ G ++ CL ++ G+
Sbjct: 344 TTDPYFCLSAPLRAKPYVPKLVLHFEGATMDLPRENYVFEV-EDAGSSILCLAIIE-GGE 401
Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
IG +++D +N KL++ ++C+++
Sbjct: 402 VTTIGNFQQQNMHVLYDLQNSKLSFVPAQCDKL 434
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 131/314 (41%), Gaps = 33/314 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DPS S+S +V+C +P C ++C++ C Y Y + + + G + L L
Sbjct: 209 FDPSLSTSYASVACDNPRCHDLDAAACRNSTGACLYEVAYG-DGSYTVGDFATETLTLG- 266
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S+ SSV IGCG G ++ A + G L S PS ++ +FS
Sbjct: 267 ------DSAPVSSVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----TTFS 312
Query: 163 ICFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGF 217
C + DS S + FGD A + + + Y+VG+ +G L+ S F
Sbjct: 313 YCLVDRDSPSSSTLQFGDAADA-EVTAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAF 371
Query: 218 Q--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
+VDSG + T L + YA + F + S + + + CY+ S
Sbjct: 372 AMDSTGAGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTS 431
Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 329
++VP + L F+ + + P +G +CL T+ IIG G R+ F
Sbjct: 432 VEVPAVSLRFAGGGELRLPAKNYLIPV-DGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSF 490
Query: 330 DRENLKLAWSHSKC 343
D + ++ +KC
Sbjct: 491 DTAKSTVGFTTNKC 504
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 82/321 (25%), Positives = 140/321 (43%), Gaps = 45/321 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+DPS SS+ ++S P+C + K + + C Y A Y+ TSS +DI+ F
Sbjct: 101 FDPSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIV----F 156
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
+ SSV+ GCG G + DG G++GL GD S+ S L + FS
Sbjct: 157 ETSDQGTVTVSSVVFGCGHSNRGRF-DGQQS-GILGLSAGDQSIVSRLG------SRFSY 208
Query: 164 C----FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------- 212
C FD + + + G + S++ P Y+V +E +G + L
Sbjct: 209 CIGDLFDPHYTHNQLVLGDGVKMEGSST--PFHTFNGFYYVTLEGISVGETRLDINPEVF 266
Query: 213 --TQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNASSE 267
T+SG +V DSG + TFL + + + + +LV +++ + CY
Sbjct: 267 QRTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVN 326
Query: 268 EMLK-VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD-----GDYGIIGQNF 321
E L+ P++ F++ V+ + +N+ VFCL V+ ++ GI+ Q
Sbjct: 327 EDLRGFPELAFHFAEGADLVLDANSLFVQKNQ--DVFCLAVLESNLKNIGSVIGIMAQQH 384
Query: 322 ------MMGHRIVFDRENLKL 336
++G R+ F R + +L
Sbjct: 385 YNVAYDLIGKRVYFQRTDCEL 405
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 63/242 (26%), Positives = 108/242 (44%), Gaps = 19/242 (7%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DP+ SS+ +N+SC+ C SS C Y Y + +S+ G+L + LA+
Sbjct: 59 FDPTLSSTYRNISCTSAACTGLSSRGCSGSTCVYGVTYG-DGSSTVGFLATETFTLAA-- 115
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
+V ++ I GCG+ G + GAA G++GLG S+ S LA + + N FS C
Sbjct: 116 -----GNVFNNFIFGCGQNNQGLF-TGAA--GLIGLGRSPYSLNSQLATS--LGNIFSYC 165
Query: 165 FDENDSGSVFFGDQGP-ATQQSTSFLPIGEKYDAYFVGVESYCIGNS--CLTQSGFQA-- 219
S + + P T T+ L YF+ + +G + L+ + FQ+
Sbjct: 166 LPSTSSATGYLNIGNPLRTPGYTAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQSVG 225
Query: 220 -LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
++DSG T LP Y + F ++ + + CY+ S + P ++L
Sbjct: 226 TIIDSGTVITRLPPTAYGALRTAFRAAMTQYTRAAAASILDTCYDFSRTTTVTFPTIKLH 285
Query: 279 FS 280
++
Sbjct: 286 YT 287
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 82/321 (25%), Positives = 140/321 (43%), Gaps = 45/321 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+DPS SS+ ++S P+C + K + + C Y A Y+ TSS +DI+ F
Sbjct: 101 FDPSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIV----F 156
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
+ SSV+ GCG G + DG G++GL GD S+ S L + FS
Sbjct: 157 ETSDQGTVTVSSVVFGCGHSNRGRF-DGQQS-GILGLSAGDQSIVSRLG------SRFSY 208
Query: 164 C----FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------- 212
C FD + + + G + S++ P Y+V +E +G + L
Sbjct: 209 CIGDLFDPHYTHNQLVLGDGVKMEGSST--PFHTFNGFYYVTLEGISVGETRLDINPEVF 266
Query: 213 --TQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNASSE 267
T+SG +V DSG + TFL + + + + +LV +++ + CY
Sbjct: 267 QRTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVN 326
Query: 268 EMLK-VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD-----GDYGIIGQNF 321
E L+ P++ F++ V+ + +N+ VFCL V+ ++ GI+ Q
Sbjct: 327 EDLRGFPELAFHFAEGADLVLDANSLFVQKNQ--DVFCLAVLESNLKNIGSVIGIMAQQH 384
Query: 322 ------MMGHRIVFDRENLKL 336
++G R+ F R + +L
Sbjct: 385 YNVAYDLIGKRVYFQRTDCEL 405
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 82/321 (25%), Positives = 140/321 (43%), Gaps = 45/321 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+DPS SS+ ++S P+C + K + + C Y A Y+ TSS +DI+ F
Sbjct: 133 FDPSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIV----F 188
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
+ SSV+ GCG G + DG G++GL GD S+ S L + FS
Sbjct: 189 ETSDQGTVTVSSVVFGCGHSNRGRF-DGQQS-GILGLSAGDQSIVSRLG------SRFSY 240
Query: 164 C----FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------- 212
C FD + + + G + S++ P Y+V +E +G + L
Sbjct: 241 CIGDLFDPHYTHNQLVLGDGVKMEGSST--PFHTFNGFYYVTLEGISVGETRLDINPEVF 298
Query: 213 --TQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNASSE 267
T+SG +V DSG + TFL + + + + +LV +++ + CY
Sbjct: 299 QRTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVN 358
Query: 268 EMLK-VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD-----GDYGIIGQNF 321
E L+ P++ F++ V+ + +N+ VFCL V+ ++ GI+ Q
Sbjct: 359 EDLRGFPELAFHFAEGADLVLDANSLFVQKNQ--DVFCLAVLESNLKNIGSVIGIMAQQH 416
Query: 322 ------MMGHRIVFDRENLKL 336
++G R+ F R + +L
Sbjct: 417 YNVAYDLIGKRVYFQRTDCEL 437
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 79/312 (25%), Positives = 132/312 (42%), Gaps = 42/312 (13%)
Query: 56 VSCSHPLCKSRS--SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 113
V C LC+ S SC + D C Y+ Y + +S+SG L D+ ++S S
Sbjct: 93 VLCQSSLCQPPSIFSCNNDGD-CEYVYPYG-DRSSTSGILSDETFSISSQSL-------- 142
Query: 114 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DEND 169
++ GCG G D G++G G G +S+ S L + + N FS C D +
Sbjct: 143 PNITFGCGHDNQG--FDKVG--GLVGFGRGSLSLVSQLGPS--MGNKFSYCLVSRTDSSK 196
Query: 170 SGSVFFGDQG--PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--------QSGFQA 219
+ +F G+ AT ++ L + Y++ +E +G L QS
Sbjct: 197 TSPLFIGNTASLEATTVGSTPLVQSSSTNHYYLSLEGISVGGQSLAIPTGTFDIQSDGSG 256
Query: 220 --LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRL 277
++DSG + TFL Y V + +VSS + C+N P M
Sbjct: 257 GLIIDSGTTLTFLQQTAYDAVK---EAMVSSINLPQADGQLDLCFNQQGSSNPGFPSMTF 313
Query: 278 IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD---GDYGIIGQNFMMGHRIVFDRENL 334
F K + V + FP++ + CL +M T+ G+ I G ++I++D EN
Sbjct: 314 HF-KGADYDVPKENYLFPDSTS-DIVCLAMMPTNSNLGNMAIFGNVQQQNYQILYDNENN 371
Query: 335 KLAWSHSKCEEV 346
L+++ + C+ +
Sbjct: 372 VLSFAPTACDTL 383
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 139/333 (41%), Gaps = 41/333 (12%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSR--SSCKSLK----DPCPYIADYSTEDTSSSGY 92
D+ L +DPS+SS+ SC LC+ +SC S K C Y Y + + ++G+
Sbjct: 118 DQALPYFDPSTSSTLSLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYG-DKSVTTGF 176
Query: 93 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
L D P V GCG G + G+ G G G +S+PS L
Sbjct: 177 LEVDKFTFVGAGASVP------GVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL- 227
Query: 153 KAGLIQNSFSICFDENDS---GSVFFG------DQGPATQQSTSFLPIGEKYDAYFVGVE 203
K G +FS CF + +V G QST + Y++ ++
Sbjct: 228 KVG----NFSHCFTAVNGLKPSTVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLK 283
Query: 204 SYCIGNSCLT--QSGFQ-------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 254
+G++ L +S F ++DSG + T LPT +Y V F V +S
Sbjct: 284 GITVGSTRLPVPESEFALKNGTGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGN 343
Query: 255 GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGD 313
+C +A VP + L F + R N++F E+ G ++ CL ++ G+
Sbjct: 344 TTDPYFCLSAPLRAKPYVPKLVLHFEGATMDLPRENYVFEV-EDAGSSILCLAIIE-GGE 401
Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
IG +++D +N KL++ ++C+++
Sbjct: 402 VTTIGNFQQQNMHVLYDLQNSKLSFVPAQCDKL 434
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 80/320 (25%), Positives = 138/320 (43%), Gaps = 36/320 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLA 101
+DP SS + ++ SC C +S+C + C Y YS D S + G + D + L
Sbjct: 137 FDPKSSKTYRDFSCDARQCSLLDQSTCSG--NICQY--QYSYGDRSYTMGNVASDTITLD 192
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
S + +P S ++ +IGCG + G++ D + G++GLG G +S+ S + + + F
Sbjct: 193 S-TTGSPVSFPKT--VIGCGHENDGTFSDKGS--GIVGLGAGPLSLISQMGSS--VGGKF 245
Query: 162 SICF-----DENDSGSVFFGDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGN--- 209
S C +S + FG GP Q ST L YF+ +E+ +GN
Sbjct: 246 SYCLVPLSSRAGNSSKLNFGSNAVVSGPGVQ-STPLLSSETMSSFYFLTLEAMSVGNERI 304
Query: 210 ----SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
S L ++DSG + T +P + ++ + V +R CY+A+
Sbjct: 305 KFGDSSLGTGEGNIIIDSGTTLTIVPDDFFSNLSTAVGNQVEGRRAEDPSGFLSVCYSAT 364
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 325
S+ LKVP + F+ + + F ++ V CL ST I G M
Sbjct: 365 SD--LKVPAITAHFTGADVKLKPINTFVQVSDD---VVCLAFASTTSGISIYGNVAQMNF 419
Query: 326 RIVFDRENLKLAWSHSKCEE 345
+ ++ + L++ + C +
Sbjct: 420 LVEYNIQGKSLSFKPTDCTK 439
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 82/314 (26%), Positives = 129/314 (41%), Gaps = 34/314 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSC-----KSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
+DP+ S+S NVSCS PLC S S + C Y Y + + S G+L + L
Sbjct: 168 FDPTKSTSYANVSCSTPLCSSVISATGNPSRCAASTCVYGIQYG-DGSYSIGFLGKERLT 226
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
+ S + + ++ GCG+ G + A G++GLG +SV S A
Sbjct: 227 IGS-------TDIFNNFYFGCGQDVDGLFGKAA---GLLGLGRDKLSVVSQTAPK--YNQ 274
Query: 160 SFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF--------VGVESYCIGNSC 211
FS C S S F G + +S F P+ +++ VG + I S
Sbjct: 275 LFSYCLPS--SSSTGFLSFGSSQSKSAKFTPLSSGPSSFYNLDLTGITVGGQKLAIPLSV 332
Query: 212 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
+ +G ++DSG T LP Y+ + F K ++S + + CY+ S + +K
Sbjct: 333 FSTAG--TIIDSGTVVTRLPPAAYSALRSAFRKAMASYPMGKPLSILDTCYDFSKYKTIK 390
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG--DYGIIGQNFMMGHRIVF 329
VP + + FS V + F N G CL G D I G +V+
Sbjct: 391 VPKIVISFSGGVDVDV-DQAGIFVAN-GLKQVCLAFAGNTGARDTAIFGNTQQRNFEVVY 448
Query: 330 DRENLKLAWSHSKC 343
D K+ ++ + C
Sbjct: 449 DVSGGKVGFAPASC 462
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 85/328 (25%), Positives = 141/328 (42%), Gaps = 41/328 (12%)
Query: 45 YDPSSSSSSKNVSCSH----PLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
YDPS+SS+ V CS P+ +SR +C + C Y YS + S+G L + L L
Sbjct: 119 YDPSASSTFSPVPCSSATCLPVLRSR-NCSTPSSLCRYGYSYS-DGAYSAGILGTETLTL 176
Query: 101 ASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
S P +V S V GCG G L+ G +GLG G + SLLA+ G+
Sbjct: 177 GS---SVPGQAVSVSDVAFGCGTDNGGDSLNST---GTVGLGRGTL---SLLAQLGV--G 225
Query: 160 SFSIC----FDENDSGSVFFGD-----QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS 210
FS C F+ G GP QST L Y V ++ +G+
Sbjct: 226 KFSYCLTDFFNSTLDSPFLLGTLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDV 285
Query: 211 CL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG-NSWK 259
L S +VDSG +F+ LP + VV +++ ++ +S
Sbjct: 286 RLPIPNKTFDLHANSTGGMVVDSGTTFSILPESGFRVVVDHVAQVLGQPPVNASSLDSPC 345
Query: 260 YCYNASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
+ A ++ +PD+ L F+ + R++ S+ N+ + FCL ++ T + ++G
Sbjct: 346 FPAPAGERQLPFMPDLVLHFAGGADMRLHRDNYMSY--NQEDSSFCLNIVGTTSTWSMLG 403
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+++FD +L++ + C ++
Sbjct: 404 NFQQQNIQMLFDMTVGQLSFLPTDCSKL 431
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 76/318 (23%), Positives = 128/318 (40%), Gaps = 40/318 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DPS SSSS+ + C P CK SC ++ C + Y ++ YL D L LA
Sbjct: 128 FDPSKSSSSRTLQCEAPQCKQAPNPSC-TVSKSCGFNMTYG--GSAIEAYLTQDTLTLA- 183
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
+ V + GC K +G+ L G+MGLG G +S+ S L Q++FS
Sbjct: 184 -------TDVIPNYTFGCINKASGTSLPA---QGLMGLGRGPLSLIS--QSQNLYQSTFS 231
Query: 163 ICF----DENDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----- 212
C N SGS+ G + P ++T L + Y+V + +GN +
Sbjct: 232 YCLPNSKSSNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTS 291
Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
+G + DSG +T L Y + +F + V + + G + CY+ S
Sbjct: 292 ALAFDPATGAGTIFDSGTVYTRLVEPAYVAMRNEFRRRVKNANATSLG-GFDTCYSGS-- 348
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD--YGIIGQNFMMGH 325
+ P + +F+ + +++ + + T+ + +I H
Sbjct: 349 --VVFPSVTFMFAGMNVTLPPDNLLIHSSAGNLSCLAMAAAPTNVNSVLNVIASMQQQNH 406
Query: 326 RIVFDRENLKLAWSHSKC 343
R++ D N +L S C
Sbjct: 407 RVLIDVPNSRLGISRETC 424
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 82/343 (23%), Positives = 134/343 (39%), Gaps = 47/343 (13%)
Query: 40 RNLSEYDPSSS---SSSKNVSCSH---------PLCKS-RSSCKSLKDPCPYIADYSTED 86
RN + + P S+ S S +H PL K R + L PC Y +YS D
Sbjct: 121 RNCTRHTPGSAFLARHSTTFSPNHCYDSACQLVPLPKHHRCNHARLHSPCRY--EYSYGD 178
Query: 87 TS-SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA---PDGVMGLGL 142
S +SG+ + L + S + + + GC + +G + GA+ GVMGLG
Sbjct: 179 GSKTSGFFSKETTTLNTSSG---REAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGR 235
Query: 143 GDVSVPSLLAKAGLIQNSFSICFDEND-----SGSVFFG----DQGPATQQSTSFLPIGE 193
G +S+ S L N FS C ++D + + G D P ++ F P+
Sbjct: 236 GPISLSSQLGHR--FGNKFSYCLMDHDISPSPTSYLLIGSTQNDVAPG-KRRMRFTPLHI 292
Query: 194 KYDA---YFVGVESYCI-GNSCLTQSGFQAL---------VDSGASFTFLPTEIYAEVVV 240
+ Y++G+ES + G AL VDSG + TFLP Y +++
Sbjct: 293 NPLSPTFYYIGIESVSVDGIKLPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILT 352
Query: 241 KFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGF 300
+ V + + C N S E ++P + + F + +E
Sbjct: 353 VIKRRVRLPSPAEPTPGFDLCVNVSEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDEDV 412
Query: 301 TVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
L + T + +IG G + FD++ +L +S C
Sbjct: 413 KCLALQAVMTPSGFSVIGNLMQQGFLLEFDKDRTRLGFSRHGC 455
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 79/325 (24%), Positives = 137/325 (42%), Gaps = 19/325 (5%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP---CPYIADYSTEDTSSSGYLVDDI 97
LS +DP SSS+ VSCS C S +S P C Y Y + + +SGY + D
Sbjct: 127 QLSFFDPGVSSSASLVSCSDRRCYSNFQTESGCSPNNLCSYSFKYG-DGSGTSGYYISDF 185
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAGL 156
+ + + + + GC Q+G A DG+ GLG G +SV S LA GL
Sbjct: 186 MSFDTVITSTLAINSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGL 245
Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---- 212
FS C + SG G + T + P+ Y V ++S + L
Sbjct: 246 APRVFSHCLKGDKSGGGIM-VLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDP 304
Query: 213 ----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
+G ++D+G + +LP E Y+ + VS + S++ C+ ++ +
Sbjct: 305 SVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAVANAVSQYGRPITYESYQ-CFEITAGD 363
Query: 269 MLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMS-TDGDYGIIGQNFMMGH 325
+ P + L F+ S V+ R ++ F + G +++C+ + I+G +
Sbjct: 364 VDVFPQVSLSFAGGASMVLGPRAYLQIF-SSSGSSIWCIGFQRMSHRRITILGDLVLKDK 422
Query: 326 RIVFDRENLKLAWSHSKCEEVIDKS 350
+V+D ++ W+ C ++ S
Sbjct: 423 VVVYDLVRQRIGWAEYDCSLEVNVS 447
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 82/329 (24%), Positives = 131/329 (39%), Gaps = 40/329 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLK-DPCPYIADYSTEDTSSS---GYLVDDIL 98
+DP S+S ++ P C++ RS K C Y Y S+S G LV++ L
Sbjct: 176 FDPRHSTSYGEMNYDAPDCQALGRSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETL 235
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
A + Q+ + IGCG G L GA G++GLG G +S+P +A G
Sbjct: 236 TFAGGVR-------QAYLSIGCGHDNKG--LFGAPAAGILGLGRGQISIPHQIAFLGY-N 285
Query: 159 NSFSICFDENDSG------SVFFGDQGPATQQSTSFLP------IGEKYDAYFVGVESYC 206
SFS C + SG ++ FG T SF P + Y +GV
Sbjct: 286 ASFSYCLVDFISGPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGG 345
Query: 207 IGNSCLTQSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGN 256
+ +T+ Q ++DSG + T L Y F +S ++S G
Sbjct: 346 VRVPGVTERDLQLDPYTGRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGP 405
Query: 257 S--WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY 314
S + CY +KVP + + F+ ++ + P + TV + D
Sbjct: 406 SGLFDTCYTVGGRAGVKVPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAGTGDRSV 465
Query: 315 GIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
+IG G R+V+D ++ ++ + C
Sbjct: 466 SVIGNILQQGFRVVYDLAGQRVGFAPNNC 494
>gi|302774304|ref|XP_002970569.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
gi|300162085|gb|EFJ28699.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
Length = 490
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 91/391 (23%), Positives = 158/391 (40%), Gaps = 65/391 (16%)
Query: 26 LWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTE 85
++C F +QD S P+ SSS K + C + C + S K Y Y+ E
Sbjct: 63 MFCSFFF----LQDPRFS---PALSSSYKPLECGNE-CSTGFCDGSRK----YQRQYA-E 109
Query: 86 DTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDV 145
++SSG L D++ ++ S Q ++ GC +TG D A DG++GLG G +
Sbjct: 110 KSTSSGVLGKDVISFSNSSDLGGQR-----LVFGCETAETGDLYDQTA-DGIIGLGRGPL 163
Query: 146 SVPSLLAKAGLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGV 202
S+ L + +++ FS+C+ DE + G Q P TS P Y Y + +
Sbjct: 164 SIIDQLVEKNAMEDVFSLCYGGMDEGGGAMILGGFQPPKDMVFTSSDPHRSPY--YNLML 221
Query: 203 ESYCIGNSCLT------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN 256
+ +G S L + ++DSG ++ + P + + V S + + G
Sbjct: 222 KGIRVGGSPLRLKPEVFDGKYGTVLDSGTTYAYFPGAAFQAFKSAVKEQVGSLK-EVPGP 280
Query: 257 SWKY---CYNASSEEMLKV----PDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTV 307
K+ CY + + + P + +F QS + N++F + G +CL V
Sbjct: 281 DEKFKDICYAGAGTNVSNLSQFFPSVDFVFGDGQSVTLSPENYLFRHTKISG--AYCLGV 338
Query: 308 MSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLP 367
++G + + ++R + + +KC ++ + LP
Sbjct: 339 FENGDPTTLLGGIIVRNMLVTYNRGKASIGFLKTKCNDLWSR----------------LP 382
Query: 368 TTEQ--QSTSNGQAAAPPSTAKTAPSKSIAA 396
T + ST Q PP APS S+ A
Sbjct: 383 ETNEPGHSTQPAQFLLPP-----APSPSVGA 408
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 78/332 (23%), Positives = 134/332 (40%), Gaps = 48/332 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
+DP SS+ + V CS P C++ S + C Y+ Y + +SS+G L D L
Sbjct: 128 FDPRRSSTYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYG-DGSSSTGDLATDKLA 186
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
A+ + ++V +GCGR G + D AA G++G+G G +S+ + +A A +
Sbjct: 187 FAN-------DTYVNNVTLGCGRDNEGLF-DSAA--GLLGVGRGKISISTQVAPA--YGS 234
Query: 160 SFSICFDENDSGS------VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
F C + S S VF P + T+ L + Y+V + + +G +T
Sbjct: 235 VFEYCLGDRTSRSTRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVT 294
Query: 214 QSGFQ--------------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL---QGN 256
GF +VDSG + + + YA + FD + + + +
Sbjct: 295 --GFSNASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHS 352
Query: 257 SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTD 311
+ CY+ P + L F+ + + P + G CL + D
Sbjct: 353 VFDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAAD 412
Query: 312 GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
+IG G R+VFD E ++ ++ C
Sbjct: 413 DGLSVIGNVQQQGFRVVFDVEKERIGFAPKGC 444
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 76/338 (22%), Positives = 139/338 (41%), Gaps = 30/338 (8%)
Query: 25 LLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADY 82
L+W + S + +N +DPS S+S K VSC C+ SC + C + Y
Sbjct: 114 LMWTQCLPCLSCYKQKN-PMFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGY 172
Query: 83 STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL 142
+ + + G + + L L S ++ Q +++ GCG +G++ + G+ G G
Sbjct: 173 G-DGSLAQGVIATETLTLNS---NSGQPXSIXNIVFGCGHNNSGTFNENEM--GLFGTGG 226
Query: 143 GDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA 197
+S+ S + FS C D + + + FG + + P+ K D
Sbjct: 227 RPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSXVVSTPLVTKDDP 286
Query: 198 --YFVGVESYCIGN--------SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 247
YFV ++ +G+ S + G +D+G T LP + Y +V + +
Sbjct: 287 TYYFVTLDGISVGDKLFPFSSSSPMATKG-NVFIDAGTPPTLLPRDFYNRLVQGVKEAIP 345
Query: 248 SKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV 307
+ + + CY +++ ++ P + F + + F P+ EG V+C +
Sbjct: 346 MEPVQDPDLQPQLCYRSAT--LIDGPILTAHFDGADVQLKPLNTFISPK-EG--VYCFAM 400
Query: 308 MSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
DGD GI G M I FD + K+++ C +
Sbjct: 401 QPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCTK 438
>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
Length = 370
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 80/349 (22%), Positives = 138/349 (39%), Gaps = 65/349 (18%)
Query: 47 PSSSSSSKNVSCSHPLCKS-------------RSSCKSLKDPCP-YIADYSTEDTSSSGY 92
P SSS V+C+ CK+ S K+ + CP Y Y S++G
Sbjct: 33 PRMSSSLHLVTCADSNCKTLYGNNTELLCQSCAGSLKNCSETCPPYGIQYGR--GSTAGL 90
Query: 93 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
L+ + L+L ++ + + +GC S + P G+ G G G +S+PS L
Sbjct: 91 LLTETLNLPL--ENGEGARAITHFAVGC------SIVSSQQPSGIAGFGRGALSMPSQLG 142
Query: 153 KAGLIQNSFSIC-----FDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDA--------- 197
+ + ++ F+ C FDE + S+ GD+ ++ P A
Sbjct: 143 EH-IGKDRFAYCLQSHRFDEENKKSLMVLGDKALPNNIPLNYTPFLTNSRAPPSSQYGVY 201
Query: 198 YFVGVESYCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV 246
Y++G+ IG L T+ ++DSG +FT EI+ + F +
Sbjct: 202 YYIGLRGVSIGGKRLKQLPSKLLRFDTKGNGGTIIDSGTTFTVFSDEIFKHIAAGFASQI 261
Query: 247 SSKRIS--LQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV--VRNHIFSFPENEGFTV 302
+R CY+ + E + +P+ F V V N+ F F
Sbjct: 262 GYRRAGEVEDKTGMGLCYDVTGLENIVLPEFAFHFKGGSDMVLPVANYFSYFSS---FDS 318
Query: 303 FCLTVMSTDG----DYG---IIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
CLT++S+ G D G I+G + +++DRE +L ++ C+
Sbjct: 319 ICLTMISSRGLLEVDSGPAVILGNDQQQDFYLLYDREKNRLGFTQQTCK 367
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 86/326 (26%), Positives = 140/326 (42%), Gaps = 42/326 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSR--SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
++PS SSS KN+ C LC+S +SC K+ C Y + Y +++ S G L D L L S
Sbjct: 129 FNPSKSSSYKNIPCPSKLCQSMEDTSCND-KNYCEY-STYYGDNSHSGGDLSVDTLTLES 186
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
+ +++IGCG SY +GA+ G++G G G S + L + FS
Sbjct: 187 TNGLTVSF---PNIVIGCGTNNILSY-EGAS-SGIVGFGSGPASFITQLGSS--TGGKFS 239
Query: 163 ICF---------DENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSC 211
C N + + FGD + PI +K Y++ +E++ +GN
Sbjct: 240 YCLTPLFSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGNRR 299
Query: 212 LTQSGF-------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
+ G ++DSG + T L + Y+ + LV +R+ + CY+
Sbjct: 300 VEIGGVPNGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVKLERVDDPTQTLNLCYSV 359
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG----QN 320
+E D +I + V H S + VFCL S+ D+ I G QN
Sbjct: 360 KAEGY----DFPIITMHFKGADVDLHPISTFVSVADGVFCLAFESSQ-DHAIFGNLAQQN 414
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEV 346
M+G +D + +++ S C +V
Sbjct: 415 LMVG----YDLQQKIVSFKPSDCTKV 436
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 76/338 (22%), Positives = 140/338 (41%), Gaps = 30/338 (8%)
Query: 25 LLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADY 82
L+W + S + +N +DPS S+S K VSC C+ SC + C + Y
Sbjct: 114 LMWTQCLPCLSCYKQKN-PMFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGY 172
Query: 83 STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL 142
+ + + G + + L L S ++ Q + +++ GCG +G++ + G+ G G
Sbjct: 173 G-DGSLAQGVIATETLTLNS---NSGQPTSILNIVFGCGHNNSGTFNENEM--GLFGTGG 226
Query: 143 GDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA 197
+S+ S + FS C D + + + FG + + P+ K D
Sbjct: 227 RPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLVTKDDP 286
Query: 198 --YFVGVESYCIGN--------SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 247
YFV ++ +G+ S + G +D+G T LP + Y +V + +
Sbjct: 287 TYYFVTLDGISVGDKLFPFSSSSPMATKG-NVFIDAGTPPTLLPRDFYNRLVQGVKEAIP 345
Query: 248 SKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV 307
+ + + CY +++ ++ P + F + + F P+ EG V+C +
Sbjct: 346 MEPVQDPDLQPQLCYRSAT--LIDGPILTAHFDGADVQLKPLNTFISPK-EG--VYCFAM 400
Query: 308 MSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
DGD GI G M I FD + K+++ C +
Sbjct: 401 QPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCTK 438
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 78/314 (24%), Positives = 129/314 (41%), Gaps = 35/314 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP++SSS ++C C+ S+C++ K C Y Y + Y+ + + S
Sbjct: 199 FDPTASSSYNPLTCDAQQCQDLEMSACRNGK--CLYQVSYGDGSFTVGEYVTETV----S 252
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
F + + V IGCG G + V GL + L + + SFS
Sbjct: 253 FGAGS-----VNRVAIGCGHDNEGLF--------VGSAGLLGLGGGPLSLTSQIKATSFS 299
Query: 163 ICFDENDSG---SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------ 213
C + DSG ++ F P L + Y+V + +G +T
Sbjct: 300 YCLVDRDSGKSSTLEFNSPRPGDSVVAPLLKNQKVNTFYYVELTGVSVGGEIVTVPPETF 359
Query: 214 ---QSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
QSG +VDSG + T L T+ Y V F + S+ R + + CY+ SS +
Sbjct: 360 AVDQSGAGGVIVDSGTAITRLRTQAYNSVRDAFKRKTSNLRPAEGVALFDTCYDLSSLQS 419
Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 329
++VP + FS ++++ + + P +G +C T IIG G R+ F
Sbjct: 420 VRVPTVSFHFSGDRAWALPAKNYLIPV-DGAGTYCFAFAPTTSSMSIIGNVQQQGTRVSF 478
Query: 330 DRENLKLAWSHSKC 343
D N + +S +KC
Sbjct: 479 DLANSLVGFSPNKC 492
>gi|325183199|emb|CCA17657.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 873
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 65/307 (21%), Positives = 140/307 (45%), Gaps = 23/307 (7%)
Query: 52 SSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 111
++K+ S + CK C + +D I +E + ++ D++ + + +
Sbjct: 90 ATKSTSINFVQCKYEEGCDTCRDNLCVIHQRYSEGSMWEAVVMQDLIWVGNVDSDRAEMI 149
Query: 112 VQSSVI---IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDE 167
++ I GC ++TG ++ +G+MGLG+G ++ + + KA ++ + F++CF +
Sbjct: 150 MRRYGIRFKFGCQTRETGLFI-TQVENGIMGLGIGRNNIATEMYKAKRVEEHKFALCFGQ 208
Query: 168 NDSGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLT------QSGFQAL 220
V G ++ P+ + + Y + V+ IG L +SG A+
Sbjct: 209 KGGSFVIGGVDYSHHTTKIAYTPLAKHGTSNYPIEVKDVRIGGISLQVDAEHFKSGRGAI 268
Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS-LQGNSWKYCYNASSEEMLKVPDMRLIF 279
VDSG + T+ P+ F KRI+ ++ N K N + E + +P++ LI
Sbjct: 269 VDSGTTDTYFPSAAATPFQEAF------KRITGVEYNENKM--NLTPEMVETLPNVSLII 320
Query: 280 S--KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLA 337
+ + F + + + N+ F T+ ++ ++G + MMG+ ++FD E ++
Sbjct: 321 AGEDGEDFEISLNASDYILNDSNHHFFGTLHFSERRGAVLGASIMMGYDVIFDLEKKRVG 380
Query: 338 WSHSKCE 344
++ + C+
Sbjct: 381 FAEATCD 387
>gi|298707682|emb|CBJ25999.1| aspartyl protease [Ectocarpus siliculosus]
Length = 547
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 77/316 (24%), Positives = 134/316 (42%), Gaps = 24/316 (7%)
Query: 45 YDPSSSSSSKNVSCSH-PLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA-- 101
+DPS SS++ V+C C C+S K C + ++ TE +S VDD+L +
Sbjct: 150 WDPSQSSTAHIVTCDETERCHGAYKCQSDKK-C-VLREHYTEGSSWRAKQVDDLLWVGER 207
Query: 102 --SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-Q 158
S S+ S+ GC TG + A DG+MGL ++ + LA AG I +
Sbjct: 208 TLSDSQKHDDSAFSVDFTFGCIESLTGLFKTQLA-DGIMGLNADSRTLITQLATAGKISE 266
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTS---FLPIGEKYDAYFVGVESYCIGNSCLT-- 213
FS+CF E G++ G P + S + P + A V V + +T
Sbjct: 267 RKFSLCFSET-GGTMVIGGYDPLLNKPGSEMQYTPSTGEISAPTVKVTDVTLNGVSITTD 325
Query: 214 ----QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
Q G + SG + T+LP + ++ S + + N ++C ++ E+
Sbjct: 326 ASVFQKGTGIKIVSGTTNTYLPRAVAEGFSAAWEAATGSPYATCKMN--EFCMTRTTVEL 383
Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVF-CLTVMSTDGDYGIIGQNFMMGHRIV 328
+P + + VR + ++ V+ L + G G++G N + H +V
Sbjct: 384 EALPVLMIHMDGGVEVNVRPEAYMDASSDEENVYPSLPPPCSMG--GVLGANLLRDHNVV 441
Query: 329 FDRENLKLAWSHSKCE 344
FD +N + ++ C+
Sbjct: 442 FDYDNHVVGFADGACD 457
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 83/355 (23%), Positives = 125/355 (35%), Gaps = 69/355 (19%)
Query: 40 RNLSEYDPSS------SSSSKNVSCSHPLCK-----SRSSCK--SLKDPCPYIADYSTED 86
RN S + PSS SSS C P C+ C L PC ++ Y+ +
Sbjct: 120 RNCSHHPPSSAFLPRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYA-DG 178
Query: 87 TSSSGYLVDDILHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGA---APDGVMGLGL 142
+ SSG+ + L S S S + + GCG + +G + GA GVMGLG
Sbjct: 179 SLSSGFFSKETTTLKSLSG----SEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGR 234
Query: 143 GDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA----- 197
G +S S L + N FS C + + TSFL IG +
Sbjct: 235 GSISFSSQLGRR--FGNKFSYCLMDYT-----------LSPPPTSFLMIGGGLHSLPLTN 281
Query: 198 ------------------YFVGVESYCIGNSCL----------TQSGFQALVDSGASFTF 229
Y++ + S I L Q +VDSG + T+
Sbjct: 282 ATKISYTPLQINPLSPTFYYITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTY 341
Query: 230 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE-EMLKVPDMRLIFSKNQSFVVR 288
L Y EV+ + V + + C NAS E +P +R F
Sbjct: 342 LTKTAYEEVLKSVRRRVKLPNAAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPP 401
Query: 289 NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
+ EG + + + + +IG G + FD+E +L ++ C
Sbjct: 402 PRNYFLETEEGVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 456
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 78/334 (23%), Positives = 129/334 (38%), Gaps = 40/334 (11%)
Query: 38 QDRNLSEYDPSSSSSSKNVSCSHPLCK--------SRSSCKSLKDPCPYIADYSTEDTSS 89
D+ +DPSSS S V C+ C S +C C Y Y + + S
Sbjct: 146 HDQQEPLFDPSSSPSYAAVPCNSSSCDALRVATGMSGQACDDQPAACSYTLSYR-DGSYS 204
Query: 90 SGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS 149
G L D L LA +Q + GCG G + G+MGLG +S+ S
Sbjct: 205 RGVLAHDRLSLAG-------EDIQG-FVFGCGTSNQGPF---GGTSGLMGLGRSQLSLIS 253
Query: 150 -LLAKAGLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----YFV 200
+ + G + FS C + SGS+ GD + ST + D Y
Sbjct: 254 QTMDQFGGV---FSYCLPPKESGSSGSLVLGDDASVYRNSTPIVYTAMVSDPLQGPFYLA 310
Query: 201 GVESYCIGNSCLTQSGF------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 254
+ +G + GF +A+VDSG T L +YA V +F ++ +
Sbjct: 311 NLTGITVGGEDVQSPGFSAGGGGKAIVDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAAP 370
Query: 255 GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY 314
+ C++ + ++VP ++L+F V + + + CL + S +Y
Sbjct: 371 FSILDTCFDLTGLREVQVPSLKLVFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEY 430
Query: 315 G--IIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
IIG R++FD ++ ++ C+ +
Sbjct: 431 DTPIIGNYQQKNLRVIFDTVGSQIGFAQETCDYI 464
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 136/329 (41%), Gaps = 47/329 (14%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
+N + +DP +SSS + +SCS P CK +C S + C Y Y + + + G L D
Sbjct: 51 QNDAVFDPRASSSFRRLSCSTPQCKLLDVKACASTDNRCLYQVSYG-DGSFTVGDLASDS 109
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
L S + +P V+ GCG G ++ A G+ L S PS L+
Sbjct: 110 F-LVSRGRTSP-------VVFGCGHDNEGLFVGAAGLLGLGAGKL---SFPSQLS----- 153
Query: 158 QNSFSICFDENDSG-----SVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGN 209
FS C D+G ++ FGD T S ++ + K D Y+ G+ IG
Sbjct: 154 SRKFSYCLVSRDNGVRASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGG 213
Query: 210 SCLT--QSGFQ---------ALVDSGASFTFLPTEIYAEVVVKF----DKLVSSKRISLQ 254
+ L+ + F+ ++DSG S T LPT Y + F KL + SL
Sbjct: 214 TLLSIPSTAFKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSL- 272
Query: 255 GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY 314
+ CY+ S+ + +P + F S + + P + T FC T D
Sbjct: 273 ---FDTCYDFSALTSVTIPTVSFHFEGGASVQLPPSNYLVPVDTSGT-FCFAFSKTSLDL 328
Query: 315 GIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
IIG R+ D ++ ++ ++ +C
Sbjct: 329 SIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 79/324 (24%), Positives = 135/324 (41%), Gaps = 44/324 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+DP+SS++ VSC +C++ R+S C Y Y + + + G L + L L
Sbjct: 167 FDPASSATFSAVSCGSAICRTLRTSGCGDSGGCEYEVSYG-DGSYTKGTLALETLTLG-- 223
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
+ V IGCG + G ++ A G++GLG G +S+ L A +FS
Sbjct: 224 ------GTAVEGVAIGCGHRNRGLFVGAA---GLLGLGWGPMSLVGQLGGA--AGGAFSY 272
Query: 164 CFDE---------NDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSC 211
C + +GS+ G + A + ++P+ A Y+VGV +G+
Sbjct: 273 CLASRGGSGSGAADAAGSLVLG-RSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDER 331
Query: 212 ---------LTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
LT+ G +V D+G + T LP E YA + F V + + + C
Sbjct: 332 LPLQDGLFQLTEDGGGGVVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSLLDTC 391
Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ 319
Y+ S ++VP + F + + RN + E +G ++CL + I+G
Sbjct: 392 YDLSGYTSVRVPTVSFYFDGAATLTLPARNLLL---EVDG-GIYCLAFAPSSSGLSILGN 447
Query: 320 NFMMGHRIVFDRENLKLAWSHSKC 343
G +I D N + + + C
Sbjct: 448 IQQEGIQITVDSANGYIGFGPATC 471
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 83/327 (25%), Positives = 134/327 (40%), Gaps = 51/327 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS---CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
Y+ SSS+ +V C P C++ S C + C Y +Y +S+ + V+ +
Sbjct: 172 YNRLKSSSASDVGCYAPACRALGSSGGCVQFLNECQYKVEYGDGSSSAGDFGVETLTF-- 229
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
P V IGCG G + AA G++GLG G +S PS + AG SF
Sbjct: 230 ------PPGVRVPGVAIGCGSDNQGLFPAPAA--GILGLGRGSLSFPSQI--AGRYGRSF 279
Query: 162 SICFDENDSG----SVFFGDQGPA------TQQSTSFLPIGEKYDAYFVGVESYCIGN-- 209
S C +G ++ FG A T L Y Y+VG+ +G
Sbjct: 280 SYCLAGQGTGGRSSTLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVR 339
Query: 210 -SCLTQSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL--QGNS 257
+T+S + +VDSG + T L YA F ++ + K + G
Sbjct: 340 VRGVTESDLRLDPSTGHGGVIVDSGTAVTRLSGPAYAAFRDAF-RVAAVKELGWPSPGGP 398
Query: 258 WKY---CYNA-SSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTD 311
+ + CY++ M KVP + + F+ + +N++ N+G F +
Sbjct: 399 FAFFDTCYSSVRGRVMKKVPAVSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAF---AGS 455
Query: 312 GDYG--IIGQNFMMGHRIVFDRENLKL 336
GD G IIG + G R+V+D + ++
Sbjct: 456 GDRGVSIIGNIQLQGFRVVYDVDGQRV 482
>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
Length = 488
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 81/360 (22%), Positives = 147/360 (40%), Gaps = 40/360 (11%)
Query: 12 NAYNALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSR--SSC 69
+A N+++CL + ++ L +D S+SS+ SC LC+ +SC
Sbjct: 144 DAGNSIICLAINKGDETTIIGNFQQQNMHALPYFDRSTSSTLLLTSCDSTLCQGLLVASC 203
Query: 70 KSLK----DPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQT 125
+ K C Y Y+ + ++ +L + F+ A S V GCG
Sbjct: 204 GNTKFWPNQTCVYTYYYNDKSVTTG------LLEVDKFTFGAGASV--PGVAFGCGLFNN 255
Query: 126 GSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS---GSVFFG------ 176
G + G+ G G G +S+PS L K G +FS CF + +V
Sbjct: 256 GVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSHCFTAVNGLKQSTVLLDLLADLY 308
Query: 177 DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS---------CLTQSGFQALVDSGASF 227
G QST + Y++ ++ +G++ LT ++DSG S
Sbjct: 309 KNGRGAVQSTPLIQNSANPTLYYLSLKGITVGSTRLPVPESAFALTNGTGGTIIDSGTSI 368
Query: 228 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 287
T LP ++Y V +F + + C++A S+ VP + L F +
Sbjct: 369 TSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSAPSQAKPDVPKLVLHFEGATMDLP 428
Query: 288 R-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
R N++F P++ G ++ CL + + IG +++D +N L++ ++C+++
Sbjct: 429 RENYVFEVPDDAGNSMICLAINELGDERATIGNFQQQNMHVLYDLQNNMLSFVAAQCDKL 488
Score = 46.6 bits (109), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 54/124 (43%), Gaps = 6/124 (4%)
Query: 212 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
LT ++DSG S T LP ++Y V +F + + C++A S+
Sbjct: 58 LTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSAPSQAKPD 117
Query: 272 VPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM---MGHRI 327
VP + L F + R N++F P++ G ++ CL + GD I NF M
Sbjct: 118 VPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAI--NKGDETTIIGNFQQQNMHALP 175
Query: 328 VFDR 331
FDR
Sbjct: 176 YFDR 179
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 84/328 (25%), Positives = 141/328 (42%), Gaps = 34/328 (10%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCK---SRSSC-KSLKDPCPYIADYSTEDTSSSGYLV 94
+++L YD S SS+ SC CK S + C C + YS D S++ +
Sbjct: 127 NQSLPYYDASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAF--SYSYGDKSATIGFL 184
Query: 95 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
D + SF A SV V+ GCG TG + G+ G G G +S+PS L K
Sbjct: 185 D--VETVSFVAGA---SV-PGVVFGCGLNNTGIFRSNET--GIAGFGRGPLSLPSQL-KV 235
Query: 155 GLIQNSFSICFDENDSGSVF-----FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 209
G + F+ S +F G T Q+T + Y++ ++ +G+
Sbjct: 236 GNFSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGS 295
Query: 210 SCLT--QSGFQ-------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
+ L +S F ++DSG +FT LP +Y V +F V +
Sbjct: 296 TRLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLL 355
Query: 261 CYNASS-EEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
C++A + VP + L F + R N++F ++ G CL ++ +G+ IIG
Sbjct: 356 CFSAPPLGKAPHVPKLVLHFEGATMHLPRENYVFE-AKDGGNCSICLAII--EGEMTIIG 412
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+++D +N KL++ +KC+++
Sbjct: 413 NFQQQNMHVLYDLKNSKLSFVRAKCDKL 440
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 76/317 (23%), Positives = 125/317 (39%), Gaps = 38/317 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DP+ SS+ NVSC+ P C + C Y Y + + S G+ D L L+S+
Sbjct: 222 FDPARSSTYANVSCAAPACFDLDTRGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY- 279
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSI 163
GCG + G + + A G++GLG G S+P K G + F+
Sbjct: 280 ------DAVKGFRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAH 327
Query: 164 CFDENDSGSVF--FGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNSCLT--QS 215
C SG+ + FG PA + P+ G + Y+VG+ +G L+ QS
Sbjct: 328 CLPARSSGTGYLDFGPGSPAAAGARLTTPMLTDNGPTF--YYVGMTGIRVGGQLLSIPQS 385
Query: 216 GFQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK------RISLQGNSWKYCYNASS 266
F +VDSG T LP Y+ + F ++++ +SL CY+ +
Sbjct: 386 VFATAGTIVDSGTVITRLPPPAYSSLRSAFVSAMAARGYKKAPAVSL----LDTCYDFTG 441
Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
+ +P + L+F V + + GD GI+G +
Sbjct: 442 MSQVAIPTVSLLFQGGAILDVDASGIMYAASVSQVCLGFAANEDGGDVGIVGNTQLKTFG 501
Query: 327 IVFDRENLKLAWSHSKC 343
+ +D + +S C
Sbjct: 502 VAYDIGKKVVGFSPGAC 518
>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 86/344 (25%), Positives = 144/344 (41%), Gaps = 68/344 (19%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+ +DPS SSS ++ CSHPLCK R +SC S + C Y Y+ + T + G LV
Sbjct: 122 TSFDPSLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRL-CHYSYFYA-DGTFAEGNLVK 179
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
+ ++ P +I+GC ++ T G++G+ LG + S +++A
Sbjct: 180 EKFTFSNSQTTPP-------LILGCAKESTDE-------KGILGMNLGRL---SFISQAK 222
Query: 156 LIQNSFSICFDEN-----DSGSVFFGDQG-------------PATQQSTSFLPIGEKYDA 197
+ + S+ I N +GS + GD P +Q+ + P+ A
Sbjct: 223 ISKFSYCIPTRSNRPGLASTGSFYLGDNPNSRGFKYVSLLTFPQSQRMPNLDPL-----A 277
Query: 198 YFVGVESYCIGNSCLTQSGF----------QALVDSGASFTFLPTEIYAEVVVKFDKLVS 247
Y V ++ IG L G Q +VDSG+ FT L Y +V + +LV
Sbjct: 278 YTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVG 337
Query: 248 S--KRISLQGNSWKYCY--NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVF 303
S K+ + G++ C+ N S E + D+ F + +V S N G +
Sbjct: 338 SRLKKGYVYGSTADMCFDGNHSMEIGRLIGDLVFEFGRGVEILVEKQ--SLLVNVGGGIH 395
Query: 304 CLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
C+ + + IIG + FD N ++ +S ++C
Sbjct: 396 CVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFSKAECR 439
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 74/316 (23%), Positives = 125/316 (39%), Gaps = 37/316 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
++ S++ K V C P CK + K C + Y + +++ L D++ LA+ S
Sbjct: 135 FNNVKSTTFKTVGCEAPQCKQVPNSKCGGSACAFNMTYGSSSIAAN--LSQDVVTLATDS 192
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
S GC + TGS + P G++GLG G +S+ L L Q++FS C
Sbjct: 193 I--------PSYTFGCLTEATGSSIP---PQGLLGLGRGPMSL--LSQTQNLYQSTFSYC 239
Query: 165 FDE----NDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------- 212
N SGS+ G G P ++T L + Y+V + + +G +
Sbjct: 240 LPSFRSLNFSGSLRLGPVGQPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSAL 299
Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
+G + DSG FT L Y V F K V + ++ G + CY +
Sbjct: 300 AFNPTTGAGTIFDSGTVFTRLVAPAYTAVRDAFRKRVGNATVTSLGG-FDTCYTSP---- 354
Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM--STDGDYGIIGQNFMMGHRI 327
+ P + +FS + +++ T + + + +I HRI
Sbjct: 355 IVAPTITFMFSGMNVTLPPDNLLIHSTASSITCLAMAAAPDNVNSVLNVIANMQQQNHRI 414
Query: 328 VFDRENLKLAWSHSKC 343
+FD N +L + C
Sbjct: 415 LFDVPNSRLGVAREPC 430
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 85/316 (26%), Positives = 134/316 (42%), Gaps = 36/316 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSR--SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP+SSS+ K+++CS P C S S+C+S K C Y Y + Y D +
Sbjct: 206 FDPTSSSTFKSLTCSDPKCASLDVSACRSNK--CLYQVSYGDGSFTVGNYATDTVTF--- 260
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
+S + V +GCG G + A G+ G L + + AK SFS
Sbjct: 261 -----GESGKVNDVALGCGHDNEGLFTGAAGLLGLGGGALSMTN--QIKAK------SFS 307
Query: 163 ICFDENDSG---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNS--CLTQSG 216
C + DS S+ F +T+ L K D Y+VG+ + +G + S
Sbjct: 308 YCLVDRDSAKSSSLDFNSVQIGAGDATAPLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSL 367
Query: 217 FQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKYCYNASSE 267
F+ ++D G + T L T+ Y + F KL + K+ + + + CY+ SS
Sbjct: 368 FEVDASGAGGVILDCGTAVTRLQTQAYNSLRDAFVKLTTDFKKGTSPISLFDTCYDFSSL 427
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
+KVP + F+ +S + + P ++ T FC T IIG G RI
Sbjct: 428 STVKVPTVTFHFTGGKSLNLPAKNYLIPIDDAGT-FCFAFAPTSSSLSIIGNVQQQGTRI 486
Query: 328 VFDRENLKLAWSHSKC 343
+D N + S +KC
Sbjct: 487 TYDLANNLIGLSANKC 502
>gi|302141796|emb|CBI18999.3| unnamed protein product [Vitis vinifera]
Length = 390
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 79/322 (24%), Positives = 134/322 (41%), Gaps = 32/322 (9%)
Query: 47 PSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI--LH 99
P + ++C+ P+C S+ CK+ + C Y Y+ + SS G LV DI L
Sbjct: 76 PPYKPNKGPITCNDPMCSALHWPSKPPCKASHEQCDYEVSYA-DHGSSLGVLVHDIFSLQ 134
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP---DGVMGLGLGDVSVPSLLAKAGL 156
L + + AP+ + GCG Q SY AP DGV+GLG G S+ + L GL
Sbjct: 135 LTNGTLAAPR------LAFGCGYDQ--SYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGL 186
Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSCLTQ 214
I++ C G +F GD +T + P+ K AY +G
Sbjct: 187 IRSIVGHCLSGRGGGFLFLGDGL-STTPGIIWTPMSRKSGESAYALGPADLLFNGQNSGV 245
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS--EEMLKV 272
G + + DSG+S+T+ + Y + K ++ K S C+ + + + +V
Sbjct: 246 KGLRLVFDSGSSYTYFNAQAYKTTLSLVRKYLNGKLKETADESLPVCWRGAKPFKSIFEV 305
Query: 273 PD----MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMG 324
+ L F+K +S ++ S+ CL +++ GD +IG
Sbjct: 306 KNYFKPFALSFTKAKSAQLQLPPESYLIISKHGNACLGILNGSEVGLGDSNVIGDIAFQD 365
Query: 325 HRIVFDRENLKLAWSHSKCEEV 346
+++D E ++ W C ++
Sbjct: 366 KMVIYDNERQQIGWVPKDCNKL 387
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 83/371 (22%), Positives = 147/371 (39%), Gaps = 54/371 (14%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+ P+ SSS K + C C + S K Y Y+ E ++SSG L D++ ++
Sbjct: 76 RFSPALSSSYKPLECGSE-CSTGFCDGSRK----YQRQYA-EKSTSSGVLGKDVIGFSNS 129
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S Q ++ GC +TG D A DG++GLG G +S+ L + +++ FS+
Sbjct: 130 SDLGGQR-----LVFGCETAETGDLYDQTA-DGIIGLGRGPLSIIDQLVEKNAMEDVFSL 183
Query: 164 CF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------Q 214
C+ DE + G Q P T+ P Y Y + ++ +G S L
Sbjct: 184 CYGGMDEGGGAMILGGFQPPKDMVFTASDPHRSPY--YNLMLKGIRVGGSPLRLKPEVFD 241
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNASSEEMLK 271
+ ++DSG ++ + P + + V S + + G K+ CY + +
Sbjct: 242 GKYGTVLDSGTTYAYFPGAAFQAFKSAVKEQVGSLK-EVPGPDEKFKDICYAGAGTNVSN 300
Query: 272 V----PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
+ P + +F QS + + F + +CL V ++G + +
Sbjct: 301 LSQFFPSVDFVFGDGQSVTLSPENYLFRHTKISGAYCLGVFENGDPTTLLGGIIVRNMLV 360
Query: 328 VFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQ--QSTSNGQAAAPPST 385
++R + + +KC ++ + LP T + ST Q PP
Sbjct: 361 TYNRGKASIGFLKTKCNDLWSR----------------LPETNEPGHSTQPAQFLLPP-- 402
Query: 386 AKTAPSKSIAA 396
APS S+ A
Sbjct: 403 ---APSPSVGA 410
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 86/329 (26%), Positives = 137/329 (41%), Gaps = 47/329 (14%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
+N + +DP +SSS + +SCS P CK +C S + C Y Y + + + G L D
Sbjct: 51 QNDAVFDPRASSSFRRLSCSTPQCKLLDVKACASTDNRCLYQVSYG-DGSFTVGDLASD- 108
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
SFS ++ S V+ GCG G ++ A G+ L S PS L+
Sbjct: 109 ----SFSVSRGRT---SPVVFGCGHDNEGLFVGAAGLLGLGAGKL---SFPSQLSS---- 154
Query: 158 QNSFSICFDENDSG-----SVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGN 209
FS C D+G ++ FGD T S ++ + K D Y+ G+ IG
Sbjct: 155 -RKFSYCLVSRDNGVRASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGG 213
Query: 210 SCLT--QSGFQ---------ALVDSGASFTFLPTEIYAEVVVKF----DKLVSSKRISLQ 254
+ L+ + F+ ++DSG S T LPT Y + F KL + SL
Sbjct: 214 TLLSIPSTAFKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSL- 272
Query: 255 GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY 314
+ CY+ S+ + +P + F S + + P + T FC T D
Sbjct: 273 ---FDTCYDFSALTSVTIPTVSFHFEGGASVQLPPSNYLVPVDTSGT-FCFAFSKTSLDL 328
Query: 315 GIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
IIG R+ D ++ ++ ++ +C
Sbjct: 329 SIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 86/338 (25%), Positives = 139/338 (41%), Gaps = 54/338 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLC-------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
+DP++SSS +NV+C C R+ + +D CPY Y + ++
Sbjct: 191 FDPAASSSYRNVTCGDQRCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGD------ 244
Query: 98 LHLASFSKH--APQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
L L SF+ + AP +S + V+ GCG + G + A G+ L S L A
Sbjct: 245 LALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFAS--QLRAVY 302
Query: 155 GLIQNSFSICFDEN--DSGS-VFFGDQ----GPATQQSTSFLPIGEKYDA-YFVGVESYC 206
G ++FS C E+ D+GS V FG+ + T+F P D Y+V ++
Sbjct: 303 G---HTFSYCLVEHGSDAGSKVVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVL 359
Query: 207 IGNSCLTQSGFQ----------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG- 255
+G L S ++DSG + ++ Y + F L+S +
Sbjct: 360 VGGDLLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRLYPLIPDF 419
Query: 256 NSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT------VFCLTVMS 309
CYN S E +VP++ L+F+ ++ FP F + CL V
Sbjct: 420 PVLNPCYNVSGVERPEVPELSLLFADGA-------VWDFPAENYFVRLDPDGIMCLAVRG 472
Query: 310 T-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
T IIG +V+D +N +L ++ +C EV
Sbjct: 473 TPRTGMSIIGNFQQQNFHVVYDLQNNRLGFAPRRCAEV 510
>gi|359492489|ref|XP_002285867.2| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 453
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 79/320 (24%), Positives = 136/320 (42%), Gaps = 34/320 (10%)
Query: 56 VSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI--LHLASFSKHAP 108
++C+ P+C S+ CK+ + C Y Y+ + SS G LV DI L L + + AP
Sbjct: 118 ITCNDPMCSALHWPSKPPCKASHEQCDYEVSYA-DHGSSLGVLVHDIFSLQLTNGTLAAP 176
Query: 109 QSSVQSSVIIGCGRKQTGSYLDGAAP---DGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 165
+ + GCG Q SY AP DGV+GLG G S+ + L GLI++ C
Sbjct: 177 R------LAFGCGYDQ--SYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCL 228
Query: 166 DENDSGSVFFGDQGPATQQSTSFLPIGEK--YDAYFVGVESYCIGNSCLTQSGFQALVDS 223
G +F GD +T + P+ K AY +G G + + DS
Sbjct: 229 SGRGGGFLFLGDGL-STTPGIIWTPMSRKSGESAYALGPADLLFNGQNSGVKGLRLVFDS 287
Query: 224 GASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS--EEMLKVPD----MRL 277
G+S+T+ + Y + K ++ K S C+ + + + +V + L
Sbjct: 288 GSSYTYFNAQAYKTTLSLVRKYLNGKLKETADESLPVCWRGAKPFKSIFEVKNYFKPFAL 347
Query: 278 IFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMGHRIVFDREN 333
F+K +S ++ S+ CL +++ GD +IG +++D E
Sbjct: 348 SFTKAKSAQLQLPPESYLIISKHGNACLGILNGSEVGLGDSNVIGDIAFQDKMVIYDNER 407
Query: 334 LKLAWSHSKCEEV--IDKSH 351
++ W C ++ +D+ +
Sbjct: 408 QQIGWVPKDCNKLPKVDRDY 427
>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 417
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 97/398 (24%), Positives = 155/398 (38%), Gaps = 68/398 (17%)
Query: 7 FGSHANAYNALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSR 66
GSH + L + L+W I+ + + P + + S VSC P C +
Sbjct: 25 LGSHPSQSITLYMDTGSDLVWFPCAPFECILCEGKFNATKPLNITRSHRVSCQSPACSTA 84
Query: 67 SSCKSLKDPCPY----IADYSTEDTSSSG-----YLVDDILHLASFSKHAPQSSVQSSVI 117
S S D C + + T D SS+ Y D SF H + ++ S +
Sbjct: 85 HSSVSSHDLCAIARCPLDNIETSDCSSATCPPFYYAYGD----GSFIAHLHRDTLSMSQL 140
Query: 118 ------IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSIC-----F 165
GC A P GV G G G +S+P+ LA + + N FS C F
Sbjct: 141 FLKNFTFGCAHTAL------AEPTGVAGFGRGLLSLPAQLATLSPNLGNRFSYCLVSHSF 194
Query: 166 DE---NDSGSVFFGDQGPATQQSTSFL---PIGEKYDAYF--VGVESYCIGNSCL----- 212
D+ + G + + F+ + +YF VG+ +G +
Sbjct: 195 DKERVRKPSPLILGHYDDYSSERVEFVYTSMLRNPKHSYFYCVGLTGISVGKRTILAPEM 254
Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRIS--LQGNSWKYCYN 263
+ +VDSG +FT LP +Y VV +FD+ V KR S + CY
Sbjct: 255 LRRVDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRVGRVHKRASEVEEKTGLGPCYF 314
Query: 264 ASSEEMLKVPDMRLIFSKNQSFVV---RNHIFSFPENEG---FTVFCLTVMS-------T 310
E +++VP + F N S V+ N+ + F + E V CL +M+ +
Sbjct: 315 L--EGLVEVPTVTWHFLGNNSNVMLPRMNYFYEFLDGEDEARRKVGCLMLMNGGDDTELS 372
Query: 311 DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVID 348
G I+G G +V+D EN ++ ++ +C + D
Sbjct: 373 GGPGAILGNYQQQGFEVVYDLENQRVGFAKRQCASLWD 410
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 77/307 (25%), Positives = 122/307 (39%), Gaps = 23/307 (7%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DPS SS+ +NVSC+ P C S+ C Y Y + +S+ G+L D L
Sbjct: 59 FDPSLSSTYRNVSCTEPACVGLSTRGCSSSTCLYGVFYG-DGSSTIGFLAMDTFMLTPAQ 117
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL-GLGDVSVPSLLAK-AGLIQNSFS 162
K + I GCG+ TG + G GL GLG S SL ++ A + N FS
Sbjct: 118 KF-------KNFIFGCGQNNTGLF------QGTAGLVGLGRSSTYSLNSQVAPSLGNVFS 164
Query: 163 ICFDENDSGSVFFGDQGPA-TQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQA 219
C S + + P T T+ L YF+ + +G + L+ S FQ+
Sbjct: 165 YCLPSTSSATGYLNIGNPQNTPGYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQS 224
Query: 220 ---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 276
++DSG T LP Y+ + ++ ++ CY+ S + P +
Sbjct: 225 VGTIIDSGTVITRLPPTAYSALKTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIV 284
Query: 277 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 336
L F+ + +F F N + GIIG + + +D E ++
Sbjct: 285 LHFAGLDVRIPATGVF-FVFNSSQVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRI 343
Query: 337 AWSHSKC 343
+S C
Sbjct: 344 GFSAGAC 350
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 76/330 (23%), Positives = 135/330 (40%), Gaps = 42/330 (12%)
Query: 46 DPSSSSSSKNVSCSHPLCKSR--SSC--KSLKD-PCPYIADYSTEDTSSSGYLVDDILHL 100
DP++SS+ + C PLC++ +SC +S D C Y+ Y + + + G L D
Sbjct: 134 DPAASSTHAALPCDAPLCRALPFTSCGGRSWGDRSCVYVYHYG-DRSLTVGQLATDSFTF 192
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
++ + V GCG G + A G+ G G G S+PS L S
Sbjct: 193 GGDDNAGGLAARR--VTFGCGHINKGIFQ--ANETGIAGFGRGRWSLPSQLNV-----TS 243
Query: 161 FSICF----DENDSGSVFFGDQGP-----------ATQQSTSFLPIGEKYDAYFVGVESY 205
FS CF D S V G ++T + + YFV +
Sbjct: 244 FSYCFTSMFDTKSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGI 303
Query: 206 CIGNS--CLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
+G + + +S ++ ++DSGAS T LP ++Y V +F V + + C
Sbjct: 304 SVGGARVAVPESRLRSSTIIDSGASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAALDLC 363
Query: 262 YNASSEEMLKVP-----DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGI 316
+ + + P + L + N++F E+ V C+ + + G+ +
Sbjct: 364 FALPVAALWRRPAVPALTLHLDGGADWELPRGNYVF---EDYAARVLCVVLDAAAGEQVV 420
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
IG +V+D EN L+++ ++C+++
Sbjct: 421 IGNYQQQNTHVVYDLENDVLSFAPARCDKL 450
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 56/209 (26%), Positives = 99/209 (47%), Gaps = 21/209 (10%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
+S +DP S++ ++SC+ C + C + CPY Y + +S++GY ++D+
Sbjct: 85 MSTFDPRKSTTKISISCTDAECGVLNKKLQCSPERLSCPYSLLYG-DGSSTAGYYLNDVF 143
Query: 99 HLASF-SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
S ++ S + ++ GCG QTGS+ + DG++G G VS+P+ LA+ +
Sbjct: 144 TFNQVPSDNSTAKSGTARLVFGCGGTQTGSW----SVDGLLGFGPTTVSLPNQLAQQNIS 199
Query: 158 QNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI-GNSCLTQ 214
N F+ C D + GS+ G + + P+ D Y V + + I G + T
Sbjct: 200 VNIFAHCLQGDVSGRGSLVIGT---IREPDLVYTPMVFGEDHYNVQLLNIGISGRNVTTP 256
Query: 215 SGFQ------ALVDSGASFTFLPTEIYAE 237
+ F ++DSG + T+L Y E
Sbjct: 257 ASFDLEYTGGVIIDSGTTLTYLVQPAYDE 285
>gi|357152725|ref|XP_003576216.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like,
partial [Brachypodium distachyon]
Length = 354
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 69/300 (23%), Positives = 126/300 (42%), Gaps = 34/300 (11%)
Query: 64 KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRK 123
+ + CK + C Y Y+ + SS G L+ D L P + ++ GCG
Sbjct: 66 RFKHDCKENPNQCDYDVRYAGGE-SSLGVLIADKFSL-------PGRDARPTLTFGCGYD 117
Query: 124 QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPAT 182
Q G + DGV+G+G G + S L + G I +N C G +FFG +
Sbjct: 118 QEGGKAEMPV-DGVLGIGRGTRDLASQLKQQGAIAENVIGHCLRIQGGGYLFFGHE-KVP 175
Query: 183 QQSTSFLPIGEKYDAYFVGVESY----CIGNSCLTQSGFQALVDSGASFTFLPTEIYAEV 238
+++P+ Y G+ + +GN ++ + + ++DSG+++T++PTE Y +
Sbjct: 176 SSVVTWVPMVPNNHYYSPGLAALHFNGNLGNP-ISVAPMEVVIDSGSTYTYMPTETYRRL 234
Query: 239 V-VKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF--- 294
V V L S ++ + C+ A E + D++ F + ++ +
Sbjct: 235 VFVVIASLSKSSLTLVRDPALPVCW-AGKEPFKXIGDVKDKFKPLELAFIQGTSQAIMEI 293
Query: 295 -PEN----EGFTVFCLTVMSTDG------DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
PEN G C+ ++ DG +IG M +++D E ++ W + C
Sbjct: 294 PPENYLIISGEGNVCMGIL--DGTQAGLRKLNVIGDISMQNQLVIYDNERARIGWVRAPC 351
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 81/315 (25%), Positives = 128/315 (40%), Gaps = 45/315 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+DP +SS+ ++ SC C K RS K K C + Y+ + + + G L + L +
Sbjct: 134 FDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKK--CTFRYSYA-DGSFTGGNLASETLTV 190
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
S A + GCG G + + G++GLG G++S+ S L I
Sbjct: 191 DS---TAGKPVSFPGFAFGCGHSSGGIF--DKSSSGIVGLGGGELSLISQLKST--INGL 243
Query: 161 FSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS 215
FS C D + S + FG G + T P+ Y Y E +
Sbjct: 244 FSYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPLRLPYKGYSKKTE---------VEE 294
Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 275
G +VDSG ++TFLP E Y+++ + KR+ + CYN ++E + P +
Sbjct: 295 G-NIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTAE--INAPII 351
Query: 276 RLIFSK-NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ----NFMMGHRIVFD 330
F N N E+ + C TV T D G++G NF++G FD
Sbjct: 352 TAHFKDANVELQPLNTFMRMQED----LVCFTVAPTS-DIGVLGNLAQVNFLVG----FD 402
Query: 331 RENLKLAWSHSKCEE 345
+ ++ EE
Sbjct: 403 LRKKRGFSKKAEVEE 417
Score = 41.6 bits (96), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 33/129 (25%), Positives = 57/129 (44%), Gaps = 15/129 (11%)
Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIF 279
+VDSG ++T+LP E Y ++ + KR+ CYN + ++ + P + F
Sbjct: 421 IVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYNTTVDQ-IDAPIITAHF 479
Query: 280 SK-NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ----NFMMGHRIVFDRENL 334
N N E+ + C TV+ T D GI+G NF++G FD
Sbjct: 480 KDANVELQPWNTFLRMQED----LVCFTVLPTS-DIGILGNLAQVNFLVG----FDLRKK 530
Query: 335 KLAWSHSKC 343
++++ + C
Sbjct: 531 RVSFKAADC 539
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 138/317 (43%), Gaps = 41/317 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
++P+SS+S +SC+ C+S + D C Y Y + + + G V + + L S
Sbjct: 191 FEPASSASFSTLSCNTRQCRSLDVSECRNDTCLYEVSYG-DGSYTVGDFVTETITLGS-- 247
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
AP +V IGCG G ++ A ++GLG G +S PS + SFS C
Sbjct: 248 --APVDNVA----IGCGHNNEGLFVGAAG---LLGLGGGSLSFPSQINAT-----SFSYC 293
Query: 165 FDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ- 218
+ DS S + F P S L Y+VG+ +G ++ +S FQ
Sbjct: 294 LVDRDSESASTLEFNSTLPPNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQI 353
Query: 219 -------ALVDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNASSE 267
+VDSG + T L T++Y + F K L S+ I+L + CY+ SS+
Sbjct: 354 DESGNGGVIVDSGTAITRLQTDVYNSLRDAFVKRTRDLPSTNGIAL----FDTCYDLSSK 409
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFP-ENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
++VP + F + + + P ++EG FC T IIG G R
Sbjct: 410 GNVEVPTVSFHFPDGKELPLPAKNYLVPLDSEG--TFCFAFAPTASSLSIIGNVQQQGTR 467
Query: 327 IVFDRENLKLAWSHSKC 343
+V+D N + + +KC
Sbjct: 468 VVYDLVNHLVGFVPNKC 484
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 81/320 (25%), Positives = 134/320 (41%), Gaps = 32/320 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+DP SS+ N+SC PLC K S + C Y Y+ + + + G L + + L S
Sbjct: 106 FDPLKSSTYTNISCDSPLCYKPYIGECSPEKRCDYTYGYA-DSSLTKGVLAQETVTLTSN 164
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI--QNSF 161
+ S+Q ++ GCG TG++ D G++GLG G SL+++ G + F
Sbjct: 165 TGKP--ISLQ-GILFGCGHNNTGNFNDHEM--GLIGLGGGPT---SLVSQIGPLFGGKKF 216
Query: 162 SICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEK------YDAYFVGV---ESYCI 207
S C D S + FG + P+ ++ Y +G+ ++Y
Sbjct: 217 SQCLVPFLTDITISSQMSFGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLP 276
Query: 208 GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASS 266
NS + + LVDSG LP ++Y V V+ V + I+ + + CY +
Sbjct: 277 MNSTIEKGNM--LVDSGTPPNILPQQLYDRVYVEVKNKVPLEPITDDPSLGPQLCYRTQT 334
Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS-TDGDYGIIGQNFMMGH 325
LK P + F + F P E VFCL + + + D GI G +
Sbjct: 335 N--LKGPTLTYHFEGANLLLTPIQTFIPPTPETKGVFCLAITNCANSDPGIYGNFAQTNY 392
Query: 326 RIVFDRENLKLAWSHSKCEE 345
I FD + +++ + C +
Sbjct: 393 LIGFDLDRQIVSFKPTDCTK 412
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 77/325 (23%), Positives = 129/325 (39%), Gaps = 36/325 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPC----PYIADYS---TEDTSSSGY 92
+DPSSS S V C P C + + + PC P Y+ + + S G
Sbjct: 183 FDPSSSPSYAAVPCDSPSCDALQQQLATGAGAGAPPCDAGRPAACSYALSYRDGSYSRGV 242
Query: 93 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
L D L LA V + GCG G G + G+MGLG +S+ S
Sbjct: 243 LAHDRLSLA--------GEVIDGFVFGCGTSNQGPPFGGTS--GLMGLGRSQLSLVSQTV 292
Query: 153 K--AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--------YFVGV 202
G+ + + + SGS+ GD A + ST + ++ Y V +
Sbjct: 293 DQFGGVFSYCLPLSRESDASGSLVLGDDPSAYRNSTPVVYTSMVSNSDPLLQGPFYLVNL 352
Query: 203 ESYCIGNSCLTQSGF--QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
+G + +GF +A+VDSG T L +Y V +F ++ + +
Sbjct: 353 TGITVGGQEVESTGFSARAIVDSGTVITSLVPSVYNAVRAEFMSQLAEYPQAPGFSILDT 412
Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS--TDGDYGIIG 318
C+N + + ++VP + L+F V + + + + CL V S ++ + IIG
Sbjct: 413 CFNMTGLKEVQVPSLTLVFDGGAEVEVDSGGVLYFVSSDSSQVCLAVASLKSEDETSIIG 472
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKC 343
R+VFD ++ ++ C
Sbjct: 473 NYQQKNLRVVFDTSASQVGFAQETC 497
>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 488
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 88/363 (24%), Positives = 143/363 (39%), Gaps = 49/363 (13%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+ ++P S++ +V C+ C+ + +C + C Y Y +++G L +
Sbjct: 132 APFNPVRSTTVADVPCTDDACQQFAPQTCGAGASECAYTYMYGGGAANTTGLLGTEAFTF 191
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+ V+ GCG K G + + GV+GLG G++S+ S L +
Sbjct: 192 GD--------TRIDGVVFGCGLKNVGDF---SGVSGVIGLGRGNLSLVSQLQV-----DR 235
Query: 161 FSICFDENDS----GSVFFGDQG-PATQQ--STSFLPIGEKYDAYFVGVESYCI-GNSCL 212
FS F +DS + FGD P T ST L Y+V + + G
Sbjct: 236 FSYHFAPDDSVDTQSFILFGDDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLA 295
Query: 213 TQSG-FQALVDSGASFTFLP----TEIYAEVVVKFDKLVSSKRISL---QGNSW--KYCY 262
SG F G+ FL + E K + + +I L G++ CY
Sbjct: 296 IPSGTFDLRNKDGSGGVFLSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGLDLCY 355
Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVM-STDGDYGIIGQN 320
S KVP M L+F+ + + F G CLT++ S+ GD ++G
Sbjct: 356 TGESLAKAKVPSMALVFAGGAVMELELGNYFYMDSTTGLA--CLTILPSSAGDGSVLGSL 413
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAA 380
+G +++D KL + E + + PPP+G S T QQ+ A+
Sbjct: 414 IQVGTHMMYDINGSKLVF-----ESLAQAA----APPPSGSSQQTSSKTNQQAGGRRSAS 464
Query: 381 APP 383
APP
Sbjct: 465 APP 467
>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 492
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 87/388 (22%), Positives = 152/388 (39%), Gaps = 71/388 (18%)
Query: 19 CLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPY 78
C P T C+L G N S + S+ + C+ P C + S D C
Sbjct: 113 CAPFT----CMLCEGKPTPPGNNNSSNPLPPPTDSRRIPCASPFCSAAHSSAPPADLCAA 168
Query: 79 ----IADYSTEDTSSSG------YLVDDILHLASFSKHAPQSSVQSSVII-----GCGRK 123
+ D T ++S Y D +A + + + +SV + C
Sbjct: 169 ARCPLDDIETGSCAASHACPPLYYAYGDGSLVARLRRG--RVGIAASVAVENFTFACAHT 226
Query: 124 QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND--------SGSVFF 175
G P GV G G G +S+P+ LA A L FS C + +
Sbjct: 227 ALGE------PVGVAGFGRGPLSLPAQLAPAAL-SGRFSYCLVAHSFRADRPIRPSPLIL 279
Query: 176 GD---QGPATQQSTSFLPI--GEKYDAYF-VGVESYCIGNSCLT---------QSGFQAL 220
G + PA++ + P+ K+ ++ V +E+ +G + + ++G +
Sbjct: 280 GRSPGEDPASETGIVYTPLLHNPKHPYFYSVALEAVSVGGTRIPARPELGRVGRAGDGGM 339
Query: 221 V-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---------WKYCYNASSEE-- 268
V DSG +FT LP E YA V +F + +++ R + + Y ++AS+ E
Sbjct: 340 VVDSGTTFTMLPNETYARVAEEFGRAMAAARFERAEAAEDQTGLAPCYYYDHDASAAEEG 399
Query: 269 -MLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMS-----TDGDYGIIGQN 320
VP + + F + V+ RN+ F E V CL +M+ G G +G
Sbjct: 400 SARAVPPLAMHFRGEATVVLPRRNYFMGFRSEERRRVGCLMLMNGGEDDGGGPAGTLGNF 459
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVID 348
G +V+D + ++ ++ +C ++ D
Sbjct: 460 QQQGFEVVYDVDAGRVGFARRRCTDLWD 487
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 77/332 (23%), Positives = 133/332 (40%), Gaps = 48/332 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
+DP SS+ + V CS P C++ S + C Y+ Y + +SS+G L D L
Sbjct: 128 FDPRRSSTYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYG-DGSSSTGELATDKLA 186
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
A+ + ++V +GCGR G + D AA G++G+ G +S+ + +A A +
Sbjct: 187 FAN-------DTYVNNVTLGCGRDNEGLF-DSAA--GLLGVARGKISISTQVAPA--YGS 234
Query: 160 SFSICFDENDSGS------VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT 213
F C + S S VF P + T+ L + Y+V + + +G +T
Sbjct: 235 VFEYCLGDRTSRSTRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVT 294
Query: 214 QSGFQ--------------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL---QGN 256
GF +VDSG + + + YA + FD + + + +
Sbjct: 295 --GFSNASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHS 352
Query: 257 SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTD 311
+ CY+ P + L F+ + + P + G CL + D
Sbjct: 353 VFDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAAD 412
Query: 312 GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
+IG G R+VFD E ++ ++ C
Sbjct: 413 DGLSVIGNVQQQGFRVVFDVEKERIGFAPKGC 444
>gi|388518245|gb|AFK47184.1| unknown [Lotus japonicus]
Length = 245
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 57/237 (24%), Positives = 96/237 (40%), Gaps = 21/237 (8%)
Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK 194
DG++GLG G S+ S L GL++N C G +FFGD +++ ++ P+ +
Sbjct: 13 DGMLGLGRGKSSLVSQLNSQGLVRNVVGHCLSAQGGGYIFFGDVYDSSR--LTWTPMSSR 70
Query: 195 -YDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 253
Y G G G + D+G+S+T+ + Y V+ K ++ K +
Sbjct: 71 DLKHYVAGAAELIFGGKKTGIGGLLPVFDTGSSYTYFNSNAYQAVISWLKKELAGKPLKE 130
Query: 254 QGNS------W--KYCYNASSEEMLKVPDMRLIFS----KNQSFVVRNHIFSFPENEGFT 301
+ W K + + E M L F+ N F + + N G
Sbjct: 131 APDDQTLPLCWHGKRPFRSVYEVRKYFKSMALSFTSSGRTNTQFEIPPEAYLIVSNMGNV 190
Query: 302 VFCLTVMSTD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHL 354
CL ++ GD +IG M+ +VFD E + W+ + C V + HV +
Sbjct: 191 --CLGILDGSEVGMGDLNLIGDISMLDKVMVFDNEKRLIGWAPADCNRVPNSRHVSI 245
>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
Length = 434
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 81/338 (23%), Positives = 131/338 (38%), Gaps = 60/338 (17%)
Query: 58 CSHPLCKSRSSCKSLKDPCPYIADYSTED---------------TSSSGYLVDDILHLAS 102
C PLC S + DPC +A S T +G +V L +
Sbjct: 92 CVSPLCSDVHSSDNSYDPCA-VAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLTRDT 150
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
+ H S V C +Y + P G+ G G G +S+PS L G +Q FS
Sbjct: 151 LTTHGSSPSFTREVPNFCFGCVGSTYRE---PIGIAGFGRGVLSLPSQL---GFLQKGFS 204
Query: 163 ICF-------DENDSGSVFFGDQGPATQ---QSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
CF + N S + GD ++ Q TS L + Y++G+E+ +GN+
Sbjct: 205 HCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNATA 264
Query: 213 TQ-----------SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ--GNSWK 259
Q ++DSG ++T LP Y +++ +++ R Q +
Sbjct: 265 IQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQEQEARTGFD 324
Query: 260 YCY------NASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSF--PENEGFTVFCLTVMS 309
CY N ++ +P + FS N S V+ NH ++ P N V CL + +
Sbjct: 325 LCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNST-VVKCLLLQN 383
Query: 310 TD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
D G G+ G ++V+D E ++ + C
Sbjct: 384 MDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 421
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 88/326 (26%), Positives = 139/326 (42%), Gaps = 42/326 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
YDPS+SS+ + CS C SR+ S C Y Y + S+G L + L L
Sbjct: 113 YDPSASSTFSPLPCSSATCLPIWSRNCTPS--SLCRYRYAYG-DGAYSAGILGTETLTLG 169
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
S AP S V GCG G L+ G +GLG G +S LLA+ G+ F
Sbjct: 170 PSS--APVSV--GGVAFGCGTDNGGDSLNST---GTVGLGRGTLS---LLAQLGV--GKF 217
Query: 162 SIC----FDENDSGSVFFGD-----QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
S C F+ G GP+T QST L + YFV ++ +G+ L
Sbjct: 218 SYCLTDFFNSALDSPFLLGTLAELAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRL 277
Query: 213 T----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
+VDSG +FT L + EVV + +++ ++ C+
Sbjct: 278 PIPNGTFDLRGDGTGGMIVDSGTTFTILAESGFREVVGRVARVLGQPPVNASSLDAP-CF 336
Query: 263 NASSEEMLKVPDMRLIFSKNQSF-VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNF 321
A + E +PD+ L F+ + R++ S+ NE + FCL + T + + NF
Sbjct: 337 PAPAGEPPYMPDLVLHFAGGADMRLYRDNYMSY--NEEDSSFCLNIAGTTPESTSVLGNF 394
Query: 322 MMGH-RIVFDRENLKLAWSHSKCEEV 346
+ +++FD +L++ + C ++
Sbjct: 395 QQQNIQMLFDTTVGQLSFLPTDCSKL 420
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 85/316 (26%), Positives = 142/316 (44%), Gaps = 36/316 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
++P+SSS+ K+++CS P C S+C+S K C Y Y + + + G L D + +
Sbjct: 204 FNPTSSSTYKSLTCSAPQCSLLETSACRSNK--CLYQVSYG-DGSFTVGELATDTVTFGN 260
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
K + V +GCG G + G++GLG G +S+ + + SFS
Sbjct: 261 SGKI-------NDVALGCGHDNEGLF---TGAAGLLGLGGGALSITNQMKAT-----SFS 305
Query: 163 ICFDENDSG---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLTQ---- 214
C + DSG S+ F + +T+ L +K D Y+VG+ + +G +
Sbjct: 306 YCLVDRDSGKSSSLDFNSVQLGSGDATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAI 365
Query: 215 -----SGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKYCYNASSE 267
SG ++ D G + T L T+ Y + F KL ++ K+ + + + CY+ SS
Sbjct: 366 FDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTTNLKKGTSSISLFDTCYDFSSL 425
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
+KVP + F+ +S + + P ++ T FC T IIG G RI
Sbjct: 426 SSVKVPTVAFHFTGGKSLDLPAKNYLIPVDDNGT-FCFAFAPTSSSLSIIGNVQQQGTRI 484
Query: 328 VFDRENLKLAWSHSKC 343
+D N + S +KC
Sbjct: 485 TYDLANKIIGLSGNKC 500
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 84/313 (26%), Positives = 136/313 (43%), Gaps = 33/313 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
++P+SS+S ++SC CKS + C Y Y + + + G V + + L S S
Sbjct: 193 FEPTSSASFTSLSCETEQCKSLDVSECRNGTCLYEVSYG-DGSYTVGDFVTETVTLGSTS 251
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
++ IGCG G ++ A ++GLG G +S PS L + SFS C
Sbjct: 252 L--------GNIAIGCGHNNEGLFIGAAG---LLGLGGGSLSFPSQLNAS-----SFSYC 295
Query: 165 FDENDSGSVFFGD-QGPATQQS-TSFLPIGEKYDAYF-VGVESYCIGNSCLT--QSGFQA 219
+ DS S D P T + T+ L D +F +G+ +G + L ++ FQ
Sbjct: 296 LVDRDSDSTSTLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQM 355
Query: 220 --------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
+VDSG + T L T +Y + F K + + + CY+ SS+ ++
Sbjct: 356 SEDGNGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVE 415
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFP-ENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
VP + F+ + + P ++EG FC TD I+G G R+ FD
Sbjct: 416 VPTVSFHFANGNELPLPAKNYLIPVDSEG--TFCFAFAPTDSTLSILGNAQQQGTRVGFD 473
Query: 331 RENLKLAWSHSKC 343
N + +S +KC
Sbjct: 474 LANSLVGFSPNKC 486
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 84/313 (26%), Positives = 136/313 (43%), Gaps = 33/313 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
++P+SS+S ++SC CKS + C Y Y + + + G V + + L S S
Sbjct: 193 FEPTSSASFTSLSCETEQCKSLDVSECRNGTCLYEVSYG-DGSYTVGDFVTETVTLGSTS 251
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
++ IGCG G ++ A ++GLG G +S PS L + SFS C
Sbjct: 252 L--------GNIAIGCGHNNEGLFIGAAG---LLGLGGGSLSFPSQLNAS-----SFSYC 295
Query: 165 FDENDSGSVFFGD-QGPATQQS-TSFLPIGEKYDAYF-VGVESYCIGNSCLT--QSGFQA 219
+ DS S D P T + T+ L D +F +G+ +G + L ++ FQ
Sbjct: 296 LVDRDSDSTSTLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQM 355
Query: 220 --------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
+VDSG + T L T +Y + F K + + + CY+ SS+ ++
Sbjct: 356 SEDGNGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVE 415
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFP-ENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
VP + F+ + + P ++EG FC TD I+G G R+ FD
Sbjct: 416 VPTVSFHFANGNELPLPAKNYLIPVDSEG--TFCFAFAPTDSTLSILGNAQQQGTRVGFD 473
Query: 331 RENLKLAWSHSKC 343
N + +S +KC
Sbjct: 474 LANSLVGFSPNKC 486
>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 417
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 81/338 (23%), Positives = 131/338 (38%), Gaps = 60/338 (17%)
Query: 58 CSHPLCKSRSSCKSLKDPCPYIADYSTED---------------TSSSGYLVDDILHLAS 102
C PLC S + DPC +A S T +G +V L +
Sbjct: 75 CVSPLCSDVHSSDNSYDPCA-VAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLTRDT 133
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
+ H S V C +Y + P G+ G G G +S+PS L G +Q FS
Sbjct: 134 LTTHGSSPSFTREVPNFCFGCVGSTYRE---PIGIAGFGRGVLSLPSQL---GFLQKGFS 187
Query: 163 ICF-------DENDSGSVFFGDQGPATQ---QSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
CF + N S + GD ++ Q TS L + Y++G+E+ +GN+
Sbjct: 188 HCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNATA 247
Query: 213 TQ-----------SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ--GNSWK 259
Q ++DSG ++T LP Y +++ +++ R Q +
Sbjct: 248 IQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQEQEARTGFD 307
Query: 260 YCY------NASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSF--PENEGFTVFCLTVMS 309
CY N ++ +P + FS N S V+ NH ++ P N V CL + +
Sbjct: 308 LCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNST-VVKCLLLQN 366
Query: 310 TD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
D G G+ G ++V+D E ++ + C
Sbjct: 367 MDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 404
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 84/311 (27%), Positives = 141/311 (45%), Gaps = 48/311 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP SS+ + VSCS C++ +SC + ++ C Y Y +++ + G + D + + S
Sbjct: 128 FDPKESSTYRKVSCSSSQCRALEDASCSTDENTCSYTITYG-DNSYTKGDVAVDTVTMGS 186
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S P S ++IIGCG + TG++ A G++GLG G S+ S L K+ I FS
Sbjct: 187 -SGRRPVS--LRNMIIGCGHENTGTF--DPAGSGIIGLGGGSTSLVSQLRKS--INGKFS 239
Query: 163 ICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCL--T 213
C + + + FG G + + +K A YF+ +E+ +G+ + T
Sbjct: 240 YCLVPFTSETGLTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFT 299
Query: 214 QSGF-----QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
+ F ++DSG + T LP+ Y E+ + ++R+ CY SS
Sbjct: 300 STIFGTGEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSLCYRDSSS- 358
Query: 269 MLKVPDMRLIFSKN-------QSFVVRNH---IFSFPENEGFTVFCLTVMSTDGDYGIIG 318
KVPD+ + F +FV + F+F NE T+F G +
Sbjct: 359 -FKVPDITVHFKGGDVKLGNLNTFVAVSEDVSCFAFAANEQLTIF-----------GNLA 406
Query: 319 Q-NFMMGHRIV 328
Q NF++G+ V
Sbjct: 407 QMNFLVGYDTV 417
>gi|356509401|ref|XP_003523438.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 407
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 86/343 (25%), Positives = 134/343 (39%), Gaps = 51/343 (14%)
Query: 33 GASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDT 87
G ++ +DR +Y P + V C PLC + S C + + C Y +Y+ +
Sbjct: 82 GCTLPRDR---QYKPHGNL----VKCVDPLCAAIQSAPNPPCVNPNEQCDYEVEYA-DQG 133
Query: 88 SSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVS 146
SS G LV DI+ L K + S + GCG QT + + GV+GLG G S
Sbjct: 134 SSLGVLVRDIIPL----KLTNGTLTHSMLAFGCGYDQTHVGHNPPPSAAGVLGLGNGRAS 189
Query: 147 VPSLLAKAGLIQNSFSICFDENDSGSVFFGDQ---------GPATQQSTSFLPIGEKYDA 197
+ S L GLI+N C G +FFGDQ P Q S+S L
Sbjct: 190 ILSQLNSKGLIRNVVGHCLSGTGGGFLFFGDQLIPQSGVVWTPILQSSSSLL------KH 243
Query: 198 YFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS 257
Y G + G + DSG+S+T+ + + +V + K +S
Sbjct: 244 YKTGPADMFFNGKATSVKGLELTFDSGSSYTYFNSLAHKALVDLITNDIKGKPLSRATED 303
Query: 258 ------WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-----FCLT 306
WK S + L+ S +S +N +F P V CL
Sbjct: 304 PSLPICWKGPKPFKSLHDVTSNFKPLVLSFTKS---KNSLFQVPPEAYLIVTKHGNVCLG 360
Query: 307 VMSTD----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
++ G+ IIG + +++D E ++ W+ + C+
Sbjct: 361 ILDGTEIGLGNTNIIGDISLQDKLVIYDNEKQRIGWASANCDR 403
>gi|414887401|tpg|DAA63415.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 242
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 67/248 (27%), Positives = 111/248 (44%), Gaps = 35/248 (14%)
Query: 87 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 146
+SSSG L +DI+ S+ Q +V GC +TG A DG+MGLG G +S
Sbjct: 2 SSSSGVLGEDIVSFGRESELKAQRAV-----FGCENSETGDLFSQHA-DGIMGLGRGQLS 55
Query: 147 VPSLLAKAGLIQNSFSICFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVE 203
+ L + G+I +SFS+C+ D G V G P+ + P+ Y Y + ++
Sbjct: 56 IMDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGVPTPSDMVFSRSDPLRSPY--YNIELK 113
Query: 204 SYCIGNSCLT------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ--- 254
+ L S ++DSG ++ +LP + + + F V+SK SL+
Sbjct: 114 EIHVAGKALRVDSRIFDSKHGTVLDSGTTYAYLPEQAF----MAFKDAVTSKVHSLKKIR 169
Query: 255 --GNSWK-YCYNASSEEMLKV----PDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCL 305
S+K C+ + + K+ PD+ ++F Q S N++F + +G +CL
Sbjct: 170 GPDPSYKDICFAGARRNVSKLHEVFPDVDMVFGNGQKLSLTPENYLFRHSKVDG--AYCL 227
Query: 306 TVMSTDGD 313
V D
Sbjct: 228 GVFQNGKD 235
>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 475
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 74/315 (23%), Positives = 124/315 (39%), Gaps = 34/315 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
+DP++SS++ V C P C+S S +S C Y+ +YS +D +++G + D L
Sbjct: 179 FDPTTSSTAAAVRCRSPACRSLGPYGNGCSNRSANAECRYLIEYS-DDRATAGTYMTDTL 237
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
++ ++ + GC G + D A G M LG G S+ + A++ +
Sbjct: 238 TISG-------TTAVRNFRFGCSHAVRGRFSDLTA--GTMSLGGGAQSLLAQTARS--LG 286
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA------YFVGVESYCIGNSCL 212
N+FS C + S S F GPAT ST+ + Y V ++ + L
Sbjct: 287 NAFSYCVPQA-SASGFLSIGGPATTNSTTVFATTPLVRSAINPSLYLVRLQGIVVAGRRL 345
Query: 213 ----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
A++DS A T LP Y + F + + S + CY+
Sbjct: 346 GIPPVAFSAGAVMDSSAVITQLPPTAYRALRRAFRNAMRAYPRSGATGTLDTCYDFLGLT 405
Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
++VP + L+F V+ P T S+D G IG H ++
Sbjct: 406 NVRVPAVSLVFGGGAVVVLDP-----PAVMIGGCLAFTATSSDLALGFIGNVQQQTHEVL 460
Query: 329 FDRENLKLAWSHSKC 343
+D + + C
Sbjct: 461 YDVAAGGVGFRRGAC 475
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 82/323 (25%), Positives = 126/323 (39%), Gaps = 33/323 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLC-KSRSSCKSL--KDPCPYIADYSTEDTSSSGYLVDDILHLA 101
+ PSSSS+ V C P C ++R SC S D CPY Y + + + G+L +D L L
Sbjct: 129 FAPSSSSTFSAVRCGEPECPRARQSCSSSPGDDRCPYEVVYG-DKSRTVGHLGNDTLTLG 187
Query: 102 ---SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
S + S+ + GCG TG L G A DG+ GLG G VS+ S AG
Sbjct: 188 TTPSTNASENNSNKLPGFVFGCGENNTG--LFGKA-DGLFGLGRGKVSLSS--QAAGKYG 242
Query: 159 NSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCL 212
FS C N G + G PA + F P+ + + Y+V + + +
Sbjct: 243 EGFSYCLPSSSSNAHGYLSLGTPAPAPAHA-RFTPMLNRSNTPSFYYVKLVGIRVAGRAI 301
Query: 213 TQSGFQAL------VDSGASFTFLPTEIYAEVVVKFDKLVS------SKRISLQGNSWKY 260
S AL VDSG T L Y+ + F + + R+S+ Y
Sbjct: 302 KVSSRPALWPAGLIVDSGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSILDTC--Y 359
Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQN 320
+ A + + +P + L+F+ + V + GI+G
Sbjct: 360 DFTAHANATVSIPAVALVFAGGATISVDFSGVLYVAKVAQACLAFAPNGNGRSAGILGNT 419
Query: 321 FMMGHRIVFDRENLKLAWSHSKC 343
+V+D K+ ++ C
Sbjct: 420 QQRTVAVVYDVGRQKIGFAAKGC 442
>gi|224096119|ref|XP_002310541.1| predicted protein [Populus trichocarpa]
gi|222853444|gb|EEE90991.1| predicted protein [Populus trichocarpa]
Length = 379
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 79/325 (24%), Positives = 131/325 (40%), Gaps = 38/325 (11%)
Query: 47 PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASF 103
P S+ V+C P+C+S + + P DY E SS G LV D +L +F
Sbjct: 61 PYYKPSNNLVACKDPICQSLHTGGDQRCENPGQCDYEVEYADGGSSLGVLVKDAFNL-NF 119
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
+ QS + + + G + G+Y DGV+GLG G S+ S L+ GL++N
Sbjct: 120 TSEKRQSPLLALGLCGYDQLPGGTY---HPIDGVLGLGRGKPSIVSQLSGLGLVRNVIGH 176
Query: 164 CFDENDSGSVFFGDQGPATQQSTS---FLPIGEKYDAYFVGVESYCIGNSCLTQSGFQAL 220
C SG +S + P+ Y G +GF+ L
Sbjct: 177 CL----SGRGGGFLFFGDDLYDSSRVAWTPMSPNAKHYSPGFAELTFDGKT---TGFKNL 229
Query: 221 V---DSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDM 275
+ DSGAS+T+L +++Y ++ + +S+K R +L + C+ + V D+
Sbjct: 230 IVAFDSGASYTYLNSQVYQGLISLIKRELSTKPLREALDDQTLPICWKG-RKPFKSVRDV 288
Query: 276 RLIFSKNQSFVVRNH-----IFSFPENEGFTV-----FCLTVMSTD----GDYGIIGQNF 321
+ F K + N FP V CL V++ D +IG
Sbjct: 289 KKYF-KTFALSFANDGKSKTQLEFPPEAYLIVSSKGNACLGVLNGTEVGLNDLNVIGDIS 347
Query: 322 MMGHRIVFDRENLKLAWSHSKCEEV 346
M +++D E + W+ C+ +
Sbjct: 348 MQDRVVIYDNEKQLIGWAPRNCDRI 372
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 69/309 (22%), Positives = 119/309 (38%), Gaps = 25/309 (8%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DPS SS+ V+C P C+ S C S C Y Y + + + G LV D L L++
Sbjct: 191 FDPSLSSTYAAVACGAPECQELDASGCSS-DSRCRYEVQYG-DQSQTDGNLVRDTLTLSA 248
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S + GCG + G + DG+ GLG VS+PS A + F+
Sbjct: 249 -------SDTLPGFVFGCGDQNAGLF---GQVDGLFGLGREKVSLPSQGAPS--YGPGFT 296
Query: 163 ICFDENDSGSVF--FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------TQ 214
C + SG + G PA Q T+ L G Y++ + +G +
Sbjct: 297 YCLPSSSSGRGYLSLGGAPPANAQFTA-LADGATPSFYYIDLVGIKVGGRAIRIPATAFA 355
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 274
+ ++DSG T LP YA + F + ++ + + + CY+ + ++P
Sbjct: 356 AAGGTVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHRTAQIPT 415
Query: 275 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 334
+ L F+ + + + + D I+G + +D N
Sbjct: 416 VELAFAGGATVSLDFTGVLYVSKVSQACLAFAPNADDSSIAILGNTQQKTFAVAYDVANQ 475
Query: 335 KLAWSHSKC 343
++ + C
Sbjct: 476 RIGFGAKGC 484
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 84/316 (26%), Positives = 137/316 (43%), Gaps = 36/316 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
++P+SSS+ K+++CS P C S+C+S K C Y Y + + + G L D + +
Sbjct: 204 FNPTSSSTYKSLTCSAPQCSLLETSACRSNK--CLYQVSYG-DGSFTVGELATDTVTFGN 260
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
K ++V +GCG G + A G+ V S+ + + SFS
Sbjct: 261 SGKI-------NNVALGCGHDNEGLFTGAAGLLGLG------GGVLSITNQ--MKATSFS 305
Query: 163 ICFDENDSG---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNS--CLTQSG 216
C + DSG S+ F +T+ L +K D Y+VG+ + +G L +
Sbjct: 306 YCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAI 365
Query: 217 FQ--------ALVDSGASFTFLPTEIYAEVVVKFDKL-VSSKRISLQGNSWKYCYNASSE 267
F ++D G + T L T+ Y + F KL V+ K+ S + + CY+ SS
Sbjct: 366 FDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSL 425
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
+KVP + F+ +S + + P ++ T FC T IIG G RI
Sbjct: 426 STVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGT-FCFAFAPTSSSLSIIGNVQQQGTRI 484
Query: 328 VFDRENLKLAWSHSKC 343
+D + S +KC
Sbjct: 485 TYDLSKNVIGLSGNKC 500
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 89/349 (25%), Positives = 137/349 (39%), Gaps = 60/349 (17%)
Query: 40 RNLSEYDPSS------SSSSKNVSCSHPLCK--------SRSSCKSLKDPCPYIADYSTE 85
RN S + P++ SS+ C P+C+ R + + CPY +Y
Sbjct: 115 RNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPKPGRAPRCNHTRIHSTCPY--EYGYA 172
Query: 86 DTS-SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA---PDGVMGLG 141
D S +SG + L + S + + SV GCG + +G + G + +GVMGLG
Sbjct: 173 DGSLTSGLFARETTSLKTSSG---KEAKLKSVAFGCGFRISGQSVSGTSFNGANGVMGLG 229
Query: 142 LGDVSVPSLLAKAGLIQNSFSICFDE-----NDSGSVFFGDQGPATQQ--STSFL--PIG 192
G +S S L + N FS C + + + GD G A + T L P+
Sbjct: 230 RGPISFASQLGRR--FGNKFSYCLMDYTLSPPPTSYLIIGDGGDAVSKLFFTPLLTNPLS 287
Query: 193 EKYDAYFVGVESYCIGNSCLT---------QSGFQALV-DSGASFTFLPTEIYAEVVVKF 242
+ Y+V ++S + + L SG V DSG + FL Y V+
Sbjct: 288 PTF--YYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVMDSGTTLAFLADPAYRLVIAAV 345
Query: 243 DKLVSSKRISLQGNSWKYCYNASS----EEMLKVPDMRLIFSKNQSFVV--RNHIFSFPE 296
+ + + C N S E++L P ++ FS FV RN+ E
Sbjct: 346 KQRIKLPNADELTPGFDLCVNVSGVTKPEKIL--PRLKFEFSGGAVFVPPPRNYFIETEE 403
Query: 297 NEGFTVFCLTVMSTDGDYG--IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
+ CL + S D G +IG G FDR+ +L +S C
Sbjct: 404 Q----IQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGC 448
>gi|301119611|ref|XP_002907533.1| aspartyl protease family A01B, putative [Phytophthora infestans
T30-4]
gi|262106045|gb|EEY64097.1| aspartyl protease family A01B, putative [Phytophthora infestans
T30-4]
Length = 681
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 70/317 (22%), Positives = 138/317 (43%), Gaps = 28/317 (8%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL---A 101
+ ++SS+ +++C+ C D C Y E +S +V+DI++L +
Sbjct: 109 FQAANSSTLVHITCAQKSLFQCKECHVQSDTCGISQSY-MEGSSWKASVVEDIVYLGGES 167
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNS 160
SF ++ + GC + G ++ A DG+MGL + + + L + I N
Sbjct: 168 SFDDKEMRNRYGTHFQFGCQSSEKGLFVTQVA-DGIMGLSNTENHIIAKLHRENKIASNL 226
Query: 161 FSICFDENDSGSVFFGDQGPATQQ-STSFLPI------GEKYDAYF----VGVESYCIGN 209
FS+CF EN G++ G A + S++ + G Y+ + +G +S
Sbjct: 227 FSLCFTEN-GGTMSVGQPHKAAHRGEISYVKVIADRSAGHFYNVHMKDIRIGGKSINAKE 285
Query: 210 SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
T+ + +VDSG + ++LP + E + F ++ R GNS C +++++
Sbjct: 286 EAYTRGHY--IVDSGTTDSYLPRALKTEFLQMFKEIAG--RDYQVGNS---CKGFTNKDL 338
Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPEN---EGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
+P ++L+ + PE E +C + ++ G+IG N MM
Sbjct: 339 ASLPTIQLVMEAYGDENAEVILDVPPEQYLLESNGAYCGGIYLSENSGGVIGANLMMNRD 398
Query: 327 IVFDRENLKLAWSHSKC 343
++FD + ++ + + C
Sbjct: 399 VIFDLGDQRVGFVDADC 415
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 84/316 (26%), Positives = 137/316 (43%), Gaps = 36/316 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
++P+SSS+ K+++CS P C S+C+S K C Y Y + + + G L D + +
Sbjct: 204 FNPTSSSTYKSLTCSAPQCSLLETSACRSNK--CLYQVSYG-DGSFTVGELATDTVTFGN 260
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
K ++V +GCG G + A G+ V S+ + + SFS
Sbjct: 261 SGKI-------NNVALGCGHDNEGLFTGAAGLLGLG------GGVLSITNQ--MKATSFS 305
Query: 163 ICFDENDSG---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNS--CLTQSG 216
C + DSG S+ F +T+ L +K D Y+VG+ + +G L +
Sbjct: 306 YCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAI 365
Query: 217 FQ--------ALVDSGASFTFLPTEIYAEVVVKFDKL-VSSKRISLQGNSWKYCYNASSE 267
F ++D G + T L T+ Y + F KL V+ K+ S + + CY+ SS
Sbjct: 366 FDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSL 425
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
+KVP + F+ +S + + P ++ T FC T IIG G RI
Sbjct: 426 STVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGT-FCFAFAPTSSSLSIIGNVQQQGTRI 484
Query: 328 VFDRENLKLAWSHSKC 343
+D + S +KC
Sbjct: 485 TYDLSKNVIGLSGNKC 500
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 76/318 (23%), Positives = 145/318 (45%), Gaps = 34/318 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
+DP +SS+ K+VSCS C ++++SC + C Y+ Y+ + + + G D L L
Sbjct: 136 FDPKASSTYKDVSCSSSQCTALENQASCSTEDKTCSYLVSYA-DGSYTMGKFAVDTLTLG 194
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG-LIQNS 160
S Q ++IIGCG+ ++ + ++ +G SL+ + G I
Sbjct: 195 STDNRPVQ---LKNIIIGCGQNNAVTFRNKSSGVVGLG-----GGAVSLIKQLGDSIDGK 246
Query: 161 FSICF-DENDSGS-VFFGDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
FS C END S + FG GP T + L + + Y++ ++S +G+ +
Sbjct: 247 FSYCLVPENDQTSKINFGTNAVVSGPGTVSTP--LVVKSRDTFYYLTLKSISVGSKNMQT 304
Query: 213 --TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 270
+ ++DSG + T LP + Y E+ L+++ + + CYNA+++ L
Sbjct: 305 PDSNIKGNMVIDSGTTLTLLPVKYYIEIENAVASLINADKSKDERIGSSLCYNATAD--L 362
Query: 271 KVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ-NFMMGHRIV 328
+P + + F + N F E+ F ++ +G YG + Q NF++G
Sbjct: 363 NIPVITMHFEGADVKLYPYNSFFKVTEDLVCLAFGMSFYR-NGIYGNVAQKNFLVG---- 417
Query: 329 FDRENLKLAWSHSKCEEV 346
+D + +++ + C ++
Sbjct: 418 YDTASKTMSFKPTDCAKM 435
>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
Length = 456
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 79/303 (26%), Positives = 129/303 (42%), Gaps = 39/303 (12%)
Query: 57 SCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQS 114
+C P+C+ S C ++ C Y Y + + ++G + L A ++ VQ
Sbjct: 177 NCVAPICRRLDSAGCDRRRNSCLYQVAYG-DGSVTAGDFASETLTFARGAR------VQR 229
Query: 115 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDSGSV 173
V IGCG G ++ A G++GLG G +S PS +A++ SFS C D S
Sbjct: 230 -VAIGCGHDNEGLFI---AASGLLGLGRGRLSFPSQIARS--FGRSFSYCLVDRTSSRRA 283
Query: 174 FFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS---CLTQSGFQ---------ALV 221
+ T + +F Y+V + + +G + ++QS + ++
Sbjct: 284 RPSRRWGGTPRMATF---------YYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVIL 334
Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFS 280
DSG S T L +Y V F R+S G S + CYN S ++KVP + + +
Sbjct: 335 DSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLA 394
Query: 281 KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSH 340
S + + P + T FC + TDG IIG G R+VFD + ++ +
Sbjct: 395 GGASVALPPENYLIPVDTSGT-FCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVP 453
Query: 341 SKC 343
C
Sbjct: 454 KSC 456
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 69/309 (22%), Positives = 119/309 (38%), Gaps = 25/309 (8%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DPS SS+ V+C P C+ S C S C Y Y + + + G LV D L L++
Sbjct: 191 FDPSLSSTYAAVACGAPECQELDASGCSS-DSRCRYEVQYG-DQSQTDGNLVRDTLTLSA 248
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S + GCG + G + DG+ GLG VS+PS A + F+
Sbjct: 249 -------SDTLPGFVFGCGDQNAGLF---GQVDGLFGLGREKVSLPSQGAPS--YGPGFT 296
Query: 163 ICFDENDSGSVF--FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------TQ 214
C + SG + G PA Q T+ L G Y++ + +G +
Sbjct: 297 YCLPSSSSGRGYLSLGGAPPANAQFTA-LADGATPSFYYIDLVGIKVGGRAIRIPATAFA 355
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 274
+ ++DSG T LP YA + F + ++ + + + CY+ + ++P
Sbjct: 356 AAGGTVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHRTAQIPT 415
Query: 275 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 334
+ L F+ + + + + D I+G + +D N
Sbjct: 416 VELAFAGGATVSLDFTGVLYVSKVSQACLAFAPNADDSSIAILGNTQQKTFAVTYDVANQ 475
Query: 335 KLAWSHSKC 343
++ + C
Sbjct: 476 RIGFGAKGC 484
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 83/340 (24%), Positives = 137/340 (40%), Gaps = 33/340 (9%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP SSS+ N++ C +SC ++ C Y Y +D+ + G L + L L S
Sbjct: 101 FDPQSSSTYSNIAYGSESCSKLYSTSCSPDQNNCNYTYSYE-DDSITEGVLAQETLTLTS 159
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
+ + VI GCG G + D G++GLG G +S+ S + + FS
Sbjct: 160 TTG---KPVALKGVIFGCGHNNNGVFNDKEM--GIIGLGRGPLSLVSQIGSS-FGGKMFS 213
Query: 163 IC---FDENDS--GSVFFGDQGPATQQSTSFLPIGEK--YDAYF------VGVESYCI-- 207
C F N S + FG P+ K + A++ + VE +
Sbjct: 214 QCLVPFHTNPSITSPMSFGKGSEVLGNGVVSTPLVSKNTHQAFYFVTLLGISVEDINLPF 273
Query: 208 --GNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNA 264
G+S + ++DSG T LP + Y +V + V+ I + ++ CY
Sbjct: 274 NDGSSLEPITKGNMVIDSGTPTTLLPEDFYHRLVEEVRNKVALDPIPIDPTLGYQLCYRT 333
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST-DGDYGIIGQNFMM 323
+ LK + F + IF P +G +FC ST +YGI G +
Sbjct: 334 PTN--LKGTTLTAHFEGADVLLTPTQIF-IPVQDG--IFCFAFTSTFSNEYGIYGNHAQS 388
Query: 324 GHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSP 363
+ I FD E +++ + C + D ++ V P +P
Sbjct: 389 NYLIGFDLEKQLVSFKATDCTNLQDAPSINGVLPNVLSAP 428
>gi|413953655|gb|AFW86304.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 535
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 72/332 (21%), Positives = 140/332 (42%), Gaps = 37/332 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
Y P+ ++ + + S PLC+ + + C Y Y+ +S Y+ D + +
Sbjct: 204 YRPARTADA--LPASDPLCEG--AQHENPNQCDYEISYADGSSSMGVYVRDSMQFVGEDG 259
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
+ + ++ GCG Q G L+ DGV+GL +S+P+ LA G+I N+F
Sbjct: 260 ERE-----NADIVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGH 314
Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCLTQSG-- 216
C + SG+ +F GD + +++PI G D V+ G+ L G
Sbjct: 315 CMSTDPSGAGGYLFLGDDY-IPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKL 373
Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDM 275
Q + D+G+++T+ P E ++ + S + + + + +C S + V D+
Sbjct: 374 TQVVFDTGSTYTYFPDEALTRLISSLKEAASPRFVQDDSDKTLPFCMK-SDFPVRSVEDV 432
Query: 276 RLIFSK-----------NQSFVVRNHIFSFPENEGFTVFCLTVMS-TDGDYG---IIGQN 320
+ F +++F +R + ++G CL V++ T Y I+G
Sbjct: 433 KHFFKPLSLQFEKRFFFSRTFNIRPEHYLVISDKGNV--CLGVLNGTTIGYDSVVIVGDV 490
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVIDKSHV 352
+ G + +D + ++ W C +S +
Sbjct: 491 SLRGKLVAYDNDKNEVGWVDFDCTNPRKRSRI 522
>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
Length = 469
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 78/353 (22%), Positives = 136/353 (38%), Gaps = 65/353 (18%)
Query: 37 VQDRNLSEYDPSSSSSSKNVSCSHPLC------KSRSSCKSLKDPC---------PYIAD 81
++ + + P SSSS + C + C K +S C+ DP PY+
Sbjct: 134 IEVTGIPTFIPKQSSSSNLIGCKNHKCSWLFGPKVQSKCQEC-DPTTQNCTQSCPPYVIQ 192
Query: 82 YSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLG 141
Y S++G L+ + L P ++GC S P+G+ G G
Sbjct: 193 YGLG--STAGLLLSETLDF-------PHKKTIPGFLVGC------SLFSIRQPEGIAGFG 237
Query: 142 LGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQST----SFLPIGEK--- 194
S+PS L S FD+ + S D G + + S+ P +
Sbjct: 238 RSPESLPSQLGLKKFSYCLVSHAFDDTPASSDLVLDTGSGSDDTKTPGLSYTPFQKNPTA 297
Query: 195 --YDAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKF 242
D Y+V + + IG++ + + +VDSG +FTF+ +Y V +F
Sbjct: 298 AFRDYYYVLLRNIVIGDTHVKVPYKFLVPGSDGNGGTIVDSGTTFTFMEKPVYELVAKEF 357
Query: 243 DKLVSSKRISLQ---GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENE 298
+K V+ ++ + + C+N S E+ + VP+ F + + FSF ++
Sbjct: 358 EKQVAHYTVATEVQNQTGLRPCFNISGEKSVSVPEFIFHFKGGAKMALPLANYFSFVDS- 416
Query: 299 GFTVFCLTVMSTD--------GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
V CLT++S + G I+G + FD +N + + C
Sbjct: 417 --GVICLTIVSDNMSGSGIGGGPAIILGNYQQRNFHVEFDLKNERFGFKQQNC 467
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 86/345 (24%), Positives = 144/345 (41%), Gaps = 55/345 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLC------------KSRSSCKSLKDPCPYIADYSTEDTSSSGY 92
+DP++SSS +NV+C C R+ + +DPCPY Y + ++
Sbjct: 193 FDPAASSSYRNVTCGDHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGD- 251
Query: 93 LVDDILHLASFSKH--APQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS 149
L L SF+ + AP +S + V+ GCG + G + A G+ L S
Sbjct: 252 -----LALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFAS--Q 304
Query: 150 LLAKAGLIQNSFSICFDEN--DSGS-VFFGDQGPATQ-------QSTSFLPIGEKYDA-- 197
L A G ++FS C ++ D GS V FG+ A + T+F P
Sbjct: 305 LRAVYG---HTFSYCLVDHGSDVGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPAD 361
Query: 198 --YFVGVESYCIGNSCLTQSGFQ----------ALVDSGASFTFLPTEIYAEVVVKF-DK 244
Y+V ++ +G L S ++DSG + ++ Y + F D+
Sbjct: 362 TFYYVKLKGVLVGGELLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDR 421
Query: 245 LVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTV 302
+ S + + CYN S E +VP++ L+F+ F N+ + +G ++
Sbjct: 422 MSRSYPLVPEFPVLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRL-DPDGGSI 480
Query: 303 FCLTVMST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
CL V+ T IIG +V+D +N +L ++ +C EV
Sbjct: 481 MCLAVLGTPRTGMSIIGNFQQQNFHVVYDLQNNRLGFAPRRCAEV 525
>gi|47497551|dbj|BAD19623.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
gi|47847593|dbj|BAD21980.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
Length = 297
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 45/139 (32%), Positives = 65/139 (46%), Gaps = 8/139 (5%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ YDP S S + V+C C + SC S PC Y Y + +S++G+ V D
Sbjct: 134 LTMYDPRGSQSGELVTCDQQFCVANYGGVLPSCTS-TSPCEYSISYG-DGSSTAGFFVTD 191
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
L S + +SV GCG K G A DG++G G + S+ S LA AG
Sbjct: 192 FLQYNQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAG 251
Query: 156 LIQNSFSICFDENDSGSVF 174
++ F+ C D + G +F
Sbjct: 252 KVRKMFAHCLDTVNGGGIF 270
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 77/316 (24%), Positives = 130/316 (41%), Gaps = 34/316 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DPS S+S VSC P C+ ++C++ C Y Y + + + G + L L
Sbjct: 211 FDPSLSASYAAVSCDSPRCRDLDTAACRNATGACLYEVAYG-DGSYTVGDFATETLTLG- 268
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S+ ++V IGCG G ++ A + G L S PS ++ ++FS
Sbjct: 269 ------DSTPVTNVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----STFS 314
Query: 163 ICFDENDS---GSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLT--QSG 216
C + DS ++ FG G T+ L + Y+V + +G L+ S
Sbjct: 315 YCLVDRDSPAASTLQFGADGAEADTVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSA 374
Query: 217 FQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
F +VDSG + T L + YA + F + S + + + CY+ S
Sbjct: 375 FAMDATSGSGGVIVDSGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGVSLFDTCYDLSDR 434
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
++VP + L F + + + P +G +CL T+ IIG G R+
Sbjct: 435 TSVEVPAVSLRFEGGGALRLPAKNYLIPV-DGAGTYCLAFAPTNAAVSIIGNVQQQGTRV 493
Query: 328 VFDRENLKLAWSHSKC 343
FD + ++ +KC
Sbjct: 494 SFDTAKGVVGFTPNKC 509
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 138/339 (40%), Gaps = 53/339 (15%)
Query: 40 RNLSEYDPSSSSSSKNVSC-----SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLV 94
++L +DPS S + +N SC S P + + +S C Y Y + T S G L
Sbjct: 122 QSLPIFDPSRSYTHRNESCRTSQYSMPSLRFNAKTRS----CEYSMRY-MDGTGSKGILA 176
Query: 95 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
++L + + +++ V+ GCG G L G G++GLG G+ SL+ +
Sbjct: 177 KEMLMFNTIYDESSSAALH-DVVFGCGHDNYGEPLVGT---GILGLGYGEF---SLVHRF 229
Query: 155 GLIQNSFSICFDENDSGS-----VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 209
G FS CF D S + GD G T+ L I + Y+V +E+ +
Sbjct: 230 G---TKFSYCFGSLDDPSYPHNVLVLGDDGANILGDTTPLEIYNGF--YYVTIEAISVDG 284
Query: 210 SCL----------TQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL---QG 255
L Q+G ++D+G S T L E Y + K + + + Q
Sbjct: 285 IILPIDPWVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNKIEDYFEGRFTAADVNQD 344
Query: 256 NSWKY-CYNASSEEML---KVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMS 309
+ +K CYN + E L P + FS S V++ N VFCL V
Sbjct: 345 DMFKVECYNGNLERDLVESGFPIVTFHFSDGAELSLDVKSVFMKLSPN----VFCLAV-- 398
Query: 310 TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVID 348
T G+ IG + I +D E K+++ C + D
Sbjct: 399 TPGNMNSIGATAQQSYNIGYDLEAKKISFERIDCGVLFD 437
>gi|306015415|gb|ADM76761.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015421|gb|ADM76764.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015423|gb|ADM76765.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015427|gb|ADM76767.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015429|gb|ADM76768.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015445|gb|ADM76776.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015449|gb|ADM76778.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015455|gb|ADM76781.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015457|gb|ADM76782.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015469|gb|ADM76788.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015475|gb|ADM76791.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015479|gb|ADM76793.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015485|gb|ADM76796.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015487|gb|ADM76797.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015489|gb|ADM76798.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015491|gb|ADM76799.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015505|gb|ADM76806.1| aspartyl protease-like protein, partial [Picea sitchensis]
Length = 114
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 48/77 (62%), Gaps = 8/77 (10%)
Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQ----SPNPLPTTEQ 371
IIGQNFM +R+VFDRENLKL WS S C + +D++ + P P+ Q + PL +Q
Sbjct: 1 IIGQNFMTSYRLVFDRENLKLGWSPSDCYQ-LDENEGAVAPAPSPQNGWRTRTPL---QQ 56
Query: 372 QSTSNGQAAAPPSTAKT 388
Q TS G+A AP +T
Sbjct: 57 QQTSPGRAVAPAIAGRT 73
>gi|306015413|gb|ADM76760.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015419|gb|ADM76763.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015425|gb|ADM76766.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015431|gb|ADM76769.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015433|gb|ADM76770.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015435|gb|ADM76771.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015437|gb|ADM76772.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015439|gb|ADM76773.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015441|gb|ADM76774.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015443|gb|ADM76775.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015447|gb|ADM76777.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015451|gb|ADM76779.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015453|gb|ADM76780.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015459|gb|ADM76783.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015461|gb|ADM76784.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015463|gb|ADM76785.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015465|gb|ADM76786.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015467|gb|ADM76787.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015471|gb|ADM76789.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015473|gb|ADM76790.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015477|gb|ADM76792.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015481|gb|ADM76794.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015483|gb|ADM76795.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015493|gb|ADM76800.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015495|gb|ADM76801.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015497|gb|ADM76802.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015499|gb|ADM76803.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015501|gb|ADM76804.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015503|gb|ADM76805.1| aspartyl protease-like protein, partial [Picea sitchensis]
gi|306015507|gb|ADM76807.1| aspartyl protease-like protein, partial [Picea sitchensis]
Length = 114
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 48/77 (62%), Gaps = 8/77 (10%)
Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQ----SPNPLPTTEQ 371
IIGQNFM +R+VFDRENLKL WS S C + +D++ + P P+ Q + PL +Q
Sbjct: 1 IIGQNFMTSYRLVFDRENLKLGWSPSDCYQ-LDENEGAVAPAPSPQNGWKTRTPL---QQ 56
Query: 372 QSTSNGQAAAPPSTAKT 388
Q TS G+A AP +T
Sbjct: 57 QQTSPGRAVAPAIAGRT 73
>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
Length = 416
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 82/349 (23%), Positives = 138/349 (39%), Gaps = 60/349 (17%)
Query: 57 SCSHPLCKSRSSCKSLKDPCPYIADYSTED---------------TSSSGYLVDDILHLA 101
SC+ P C S + DPC +A S T +G +V L
Sbjct: 74 SCASPYCTDIHSSDNSFDPCT-VAGCSLSTLIKATCARPCPSFAYTYGAGGVVTGTLTRD 132
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
+ H + V + C +Y + P G+ G G +S PS L GL++ F
Sbjct: 133 TLRVHEGPARVTKDIPKFCFGCVGSTYHE---PIGIAGFVRGTLSFPSQL---GLLKKGF 186
Query: 162 SICF-------DENDSGSVFFGDQGPATQQSTSFLPIGEK---YDAYFVGVESYCIGNSC 211
S CF + N S + GD +++ + F P+ + + Y++G+E+ +GN
Sbjct: 187 SHCFLAFKYANNPNISSPLVIGDTALSSKDNMQFTPMLKSPMYPNYYYIGLEAITVGNVS 246
Query: 212 LT-----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR---ISLQGNS 257
T Q L+DSG ++T LP Y++++ F +++ R + ++
Sbjct: 247 ATTVPLNLREFDSQGNGGMLIDSGTTYTHLPEPFYSQLLSIFKAIITYPRATEVEMRA-G 305
Query: 258 WKYCY------NASSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVF-CLTVM 308
+ CY N +++ P + F N SFV+ NH ++ TV CL
Sbjct: 306 FDLCYKVPCPNNRLTDDDNLFPSITFHFLNNVSFVLPQGNHFYAMSAPSNSTVVKCLLFQ 365
Query: 309 S-TDGDY---GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVH 353
S D DY G+ G +IV+D E ++ + C +H
Sbjct: 366 SMADSDYGPAGVFGSFQQQNVQIVYDLEKERIGFQPMDCASAAVSQGLH 414
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 90/326 (27%), Positives = 129/326 (39%), Gaps = 39/326 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
+DP +SSS + + C PLCK S S + C Y Y + + S G D+
Sbjct: 171 FDPRNSSSFQRIPCLSPLCKALEIHSCSGSRGATSRCSYQVAYG-DGSFSVGDFSSDLFT 229
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
L + SK SV GCG G + A G+ L S + N
Sbjct: 230 LGTGSKAM-------SVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTAN 282
Query: 160 SFSICFDEND------SGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCL 212
SFS C + S S+ FG + + S L K D Y+ + +G + L
Sbjct: 283 SFSYCLVDRSNPMTRSSSSLIFGAAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQL 342
Query: 213 ---------TQSGFQA-LVDSGASFTFLPTEIYAEVVVKFD----KLVSSKRISLQGNSW 258
+QSG ++DSG S T PT +YA + F L S+ R SL +
Sbjct: 343 PISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTNLPSAPRYSL----F 398
Query: 259 KYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
CYN S + + VP + L F + + P N + FCL T + GIIG
Sbjct: 399 DTCYNFSGKASVDVPALVLHFENGADLQLPPTNYLIPINTAGS-FCLAFAPTSMELGIIG 457
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCE 344
RI FD + LA++ +C+
Sbjct: 458 NIQQQSFRIGFDLQKSHLAFAPQQCK 483
>gi|306015417|gb|ADM76762.1| aspartyl protease-like protein, partial [Picea sitchensis]
Length = 114
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 48/77 (62%), Gaps = 8/77 (10%)
Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQ----SPNPLPTTEQ 371
IIGQNFM +R+VFDRENLKL WS S C + +D++ + P P+ Q + PL +Q
Sbjct: 1 IIGQNFMTSYRLVFDRENLKLGWSPSDCYQ-LDENEGAVAPAPSPQNGWRTRTPL---QQ 56
Query: 372 QSTSNGQAAAPPSTAKT 388
Q TS G+A AP +T
Sbjct: 57 QQTSPGRAVAPAIAGRT 73
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 77/333 (23%), Positives = 133/333 (39%), Gaps = 49/333 (14%)
Query: 46 DPSSSSSSKNVSCSHPLCKSR----SSCKSLK----DPCPYIADYSTEDTSSSGYLVDDI 97
+PS + PL SR +SC S K C Y Y + + ++G+L D
Sbjct: 24 NPSPECFEQAFPYFEPLTFSRGLPFASCGSPKFWPNQTCVYTYSYG-DKSVTTGFLEVDK 82
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
P V GCG G + G+ G G G +S+PS L K G
Sbjct: 83 FTFVGAGASVP------GVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG-- 131
Query: 158 QNSFSICFDE-----------NDSGSVFFGDQGPATQQSTSFLPIGEKY---DAYFVGVE 203
+FS CF + +F QG Q+T + + Y++ ++
Sbjct: 132 --NFSHCFTTITGAIPSTVLLDLPADLFSNGQGAV--QTTPLIQYAKNEANPTLYYLSLK 187
Query: 204 SYCIGNS---------CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 254
+G++ LT ++DSG S T LP ++Y V +F + +
Sbjct: 188 GITVGSTRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGN 247
Query: 255 GNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGD 313
C++A S+ VP + L F + R N++F P++ G ++ CL + D +
Sbjct: 248 ATGHYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGD-E 306
Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
IIG +++D +N L++ ++C+++
Sbjct: 307 TTIIGNFQQQNMHVLYDLQNNMLSFVAAQCDKL 339
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 82/333 (24%), Positives = 139/333 (41%), Gaps = 56/333 (16%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
++P+ S+S ++ CS +C + S ++ C Y A Y + SS+G L ++ +F
Sbjct: 130 FEPAKSTSYASLPCSSAMCNALYSPLCFQNACVYQAFYG-DSASSAGVLANETF---TFG 185
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
++ + +V V GCG G+ +G+ G++G G G +S+ S L FS C
Sbjct: 186 TNSTRVAVP-RVSFGCGNMNAGTLFNGS---GMVGFGRGALSLVSQLGSP-----RFSYC 236
Query: 165 ---FDENDSGSVFFG-----------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS 210
F + ++FG GP QST F+ YF+ + +
Sbjct: 237 LTSFMSPATSRLYFGAYATLNSTNTSSSGPV--QSTPFIVNPALPTMYFLNMTGISVAGD 294
Query: 211 CL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSW 258
L T ++DSG + TFL YA V F V R + +++
Sbjct: 295 LLPIDPSVFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTF 354
Query: 259 KYCYN--ASSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG 315
C+ M+ +P+M L F + + N++ + G CL ++ +D D
Sbjct: 355 DTCFKWPPPPRRMVTLPEMVLHFDGADMELPLENYMV---MDGGTGNLCLAMLPSD-DGS 410
Query: 316 IIG----QNFMMGHRIVFDRENLKLAWSHSKCE 344
IIG QNF M ++D EN L++ + C
Sbjct: 411 IIGSFQHQNFHM----LYDLENSLLSFVPAPCN 439
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 84/337 (24%), Positives = 133/337 (39%), Gaps = 45/337 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
++P SSS + C+ +C RS +C C + Y + + + G + +I
Sbjct: 41 FNPGLSSSFISEPCTSSVCLGRSKLGFQSACNRSTGSCSFQVAY-LDGSEAYGVIAREIF 99
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL---AKAG 155
L S+ A S VI GC K +D ++ G +GL G S P+ + +K+G
Sbjct: 100 SLQSWDGAA---STLGDVIFGCASKDLQRPVDFSS--GTLGLNRGSFSFPAQIGSRSKSG 154
Query: 156 LIQNSFSICFDE-----NDSGSVFFGDQG-PATQQSTSFL----PIGEKYDAYFVGVESY 205
L + FS CF N SG + FGD G PA L PI D Y+VG++
Sbjct: 155 L-SDRFSYCFPNRAEHLNSSGVIIFGDSGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGI 213
Query: 206 CIGNSCL--TQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLV-SSKRISLQ 254
+G L +S F+ DSG + +FL + +V F + V R S
Sbjct: 214 SVGGELLHIPRSAFKIDRLGNGGTYFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGS 273
Query: 255 GNSWKYCYN--ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFP--ENEGFTVFCLTVMS- 309
+ + CY+ A + P + L F N +R P CL ++
Sbjct: 274 DFTKELCYDVAAGDARLPTAPLVTLHFKNNVDMELREASVWVPLARTPQVVTICLAFVNA 333
Query: 310 ---TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
G +IG + I D E ++ ++ + C
Sbjct: 334 GAVAQGGVNVIGNYQQQDYLIEHDLERSRIGFAPANC 370
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 86/335 (25%), Positives = 141/335 (42%), Gaps = 45/335 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDIL 98
YD ++S+S V C+ C SR+ + PC Y Y+ +D + S+G L + L
Sbjct: 137 YDTAASASFSPVPCASATCLPIWRSSRNCTATTTSPCRY--RYAYDDGAYSAGVLGTETL 194
Query: 99 HLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
A S AP V V GCG G + G +GLG G +S L+A+ G+
Sbjct: 195 TFAGSSPGAPGPGVSVGGVAFGCGVDNGGLSYNST---GTVGLGRGSLS---LVAQLGV- 247
Query: 158 QNSFSIC----FDENDSGSVFFGDQ---------GPATQQSTSFLPIGEKYDAYFVGVES 204
FS C F+ + V FG G A QST + Y+V +E
Sbjct: 248 -GKFSYCLTDFFNTSLGSPVLFGSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEG 306
Query: 205 YCIGNSCLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 254
+G++ L +VDSG FT L + VV +++ ++
Sbjct: 307 ISLGDARLPIPNGTFDLRDDGSGGMIVDSGTIFTVLVESAFRVVVNHVAGVLNQPVVNAS 366
Query: 255 G-NSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMSTDG 312
+S + A +++ +PDM L F+ + R++ SF N+ + FCL +
Sbjct: 367 SLDSPCFPATAGEQQLPDMPDMLLHFAGGADMRLHRDNYMSF--NQESSSFCLNIAGAPS 424
Query: 313 DYGIIGQNFMMGH-RIVFDRENLKLAWSHSKCEEV 346
YG I NF + +++FD +L++ + C ++
Sbjct: 425 AYGSILGNFQQQNIQMLFDITVGQLSFVPTDCSKL 459
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 77/329 (23%), Positives = 132/329 (40%), Gaps = 41/329 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS-RSSCKSLKDPCP----------YIADYSTEDTSSSGYL 93
+DPSSS S V C+ C + R + + PC Y Y + + S G L
Sbjct: 160 FDPSSSPSYAAVPCNSSSCDALRVAMAAGTSPCADDNEQQPACSYALSYR-DGSYSRGVL 218
Query: 94 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-VPSLLA 152
D L LA + GCG G+ G + G+MGLG VS V +
Sbjct: 219 ARDKLRLAGQDIEG--------FVFGCGTSNQGAPFGGTS--GLMGLGRSHVSLVSQTMD 268
Query: 153 KAGLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-------YFVGV 202
+ G + FS C + SGS+ GD A + ST + D+ YF+ +
Sbjct: 269 QFGGV---FSYCLPMRESGSSGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNL 325
Query: 203 ESYCIGNSCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK 259
+G + F A ++DSG T L +Y V +F ++ + +
Sbjct: 326 TGITVGGQEVESPWFSAGRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQAPAFSILD 385
Query: 260 YCYNASSEEMLKVPDMRLIFSKNQSFVVRNH-IFSFPENEGFTV-FCLTVMSTDGDYGII 317
C+N + + ++VP ++ +F + V + + F ++ V L + ++ D II
Sbjct: 386 TCFNLTGLKEVQVPSLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEYDTSII 445
Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
G R++FD ++ ++ C+ +
Sbjct: 446 GNYQQKNLRVIFDTLGSQIGFAQETCDYI 474
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 77/340 (22%), Positives = 144/340 (42%), Gaps = 46/340 (13%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSL---KDPCPYIADYSTEDTSSSGYLV 94
++L ++PS S + + C +C+ + SSC C Y Y+ + + ++G+L
Sbjct: 148 QSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWGNGICVYAYAYA-DHSITTGHLD 206
Query: 95 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
D AS + HA + + GCG G ++ G+ G G +S+P A
Sbjct: 207 SDTFSFAS-ADHAIGGASVPDLTFGCGLFNNGIFVSNET--GIAGFSRGALSMP-----A 258
Query: 155 GLIQNSFSICFDE---NDSGSVFFG----------DQGPATQQSTSFLPI-GEKYDAYFV 200
L ++FS CF ++ VF G G QST+ + + AY++
Sbjct: 259 QLKVDNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYI 318
Query: 201 GVESYCIGNSCLT--QSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 250
++ +G + L +S F +VDSG T LP +Y V D V+ +
Sbjct: 319 SLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVC---DAFVAQTK 375
Query: 251 ISLQGNS---WKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLT 306
+++ ++ + C++ VP + L F + R N++F E G + CL
Sbjct: 376 LTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLA 435
Query: 307 VMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+ + + D +IG +++D N L++ ++C ++
Sbjct: 436 INAGE-DLSVIGNFQQQNMHVLYDLANDMLSFVPARCNKI 474
>gi|242067693|ref|XP_002449123.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
gi|241934966|gb|EES08111.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
Length = 408
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 77/328 (23%), Positives = 135/328 (41%), Gaps = 50/328 (15%)
Query: 51 SSSKNVSCSHPLCK-------SRSSCKSL-KDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+ K V C+ PLC + C + K+ C Y Y +S L+D
Sbjct: 88 TRKKLVPCADPLCDALHKDLGTTKKCTDVRKNQCDYKVKYQDGLSSLGVLLLD------- 140
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP----DGVMGLGLGDVSVPSLLAKAGLI- 157
K + + ++ GCG Q A DG++GLG G V + S L +G +
Sbjct: 141 --KFSLPTGGARNIAFGCGYDQMKGSKKKAPEKVPVDGILGLGRGSVDLASQLKHSGAVS 198
Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPI-----GEKYDAYFVGVESYCIGNSCL 212
+N C G +F G++ + T ++P+ GE + Y G + + ++ +
Sbjct: 199 KNVIGHCLSSKGGGYLFIGEENVPSSHVT-WVPMAPTTPGEP-NHYSPGQATLHLDSNPI 256
Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
+A+ DSG+++T+LP ++A+ LVS+ + SL +S K + + K
Sbjct: 257 GTKPLKAIFDSGSTYTYLPENLHAQ-------LVSALKASLSKSSLKQVSDPALPLCWKG 309
Query: 273 PD-MRLIFSKNQSFV--------VRNHIFSFPEN----EGFTVFCLTVMSTDG-DYGIIG 318
P + + + F + + PEN G C ++ G D IIG
Sbjct: 310 PKPFKTVHDTPKEFKSLVTLKFDLGVTMIIPPENYLIITGHGNACFGILDMPGLDQYIIG 369
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEV 346
M +++D E +LAW S C+++
Sbjct: 370 DITMQEQLVIYDNEKGRLAWMPSPCDKI 397
>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 414
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 56/182 (30%), Positives = 85/182 (46%), Gaps = 23/182 (12%)
Query: 118 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-GSVFFG 176
GCGR G + GA DG++GLG G +S S A + FS C E DS GS+ FG
Sbjct: 171 FGCGRNNEGDFGSGA--DGMLGLGQGQLSTVSQTASK--FKKVFSYCLPEEDSIGSLLFG 226
Query: 177 DQGPATQQSTSFLPIG--------EKYDAYFVGVESYCIGNSCLT--QSGFQA---LVDS 223
++ +Q S F + E+ YFV + +GN L S F + ++DS
Sbjct: 227 EKA-TSQSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNVPSSVFASPGTIIDS 285
Query: 224 GASFTFLPTEIYAEVVVKFDKLVSSKRIS----LQGNSWKYCYNASSEEMLKVPDMRLIF 279
G T LP Y+ + F K ++ +S +G+ CYN S + + +P++ L F
Sbjct: 286 GTVITCLPQRAYSALTAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHF 345
Query: 280 SK 281
+
Sbjct: 346 GE 347
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 142/342 (41%), Gaps = 57/342 (16%)
Query: 45 YDPSSSSSSKNVSCSHPLC--------KSRSSCKSL-KDPCPYIADYSTEDTSSSGYLVD 95
+DP++SSS +N++C P C + +C+ +DPCPY Y + S+
Sbjct: 188 FDPAASSSYRNLTCGDPRCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGD---- 243
Query: 96 DILHLASFSKH--APQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
L L SF+ + AP +S + V+ GCG + G + A ++GLG G +S S L
Sbjct: 244 --LALESFTVNLTAPGASSRVDGVVFGCGHRNRGLFHGAAG---LLGLGRGPLSFASQL- 297
Query: 153 KAGLIQNSFSICFDENDS---GSVFFGDQ------GPATQQSTSFLPIGEKYDA-YFVGV 202
+A ++FS C ++ S V FG+ + T+F P D Y+V +
Sbjct: 298 RAVYGGHTFSYCLVDHGSDVASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRL 357
Query: 203 ESYCIGNSCLTQSGFQ----------ALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRI 251
+G L S ++DSG + ++ Y + F D++ S
Sbjct: 358 TGVLVGGELLNISSDTWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPP 417
Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT------VFCL 305
CYN S E +VP++ L+F+ ++ FP F + CL
Sbjct: 418 VPDFPVLSPCYNVSGVERPEVPELSLLFADGA-------VWDFPAENYFIRLDPDGIMCL 470
Query: 306 TVMST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
V+ T IIG + +D N +L ++ +C EV
Sbjct: 471 AVLGTPRTGMSIIGNFQQQNFHVAYDLHNNRLGFAPRRCAEV 512
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 77/316 (24%), Positives = 129/316 (40%), Gaps = 35/316 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP++SS+ V+C C S SSC+S + C Y +Y Y D A+
Sbjct: 203 FDPTASSTYAPVTCQSQQCSSLEMSSCRSGQ--CLYQVNYG-----DGSYTFGD---FAT 252
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S S +V +GCG G + V GL + L L SFS
Sbjct: 253 ESVSFGNSGSVKNVALGCGHDNEGLF--------VGAAGLLGLGGGPLSLTNQLKATSFS 304
Query: 163 ICFDENDSG---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLT--QSG 216
C DS ++ F T+ L K D Y+VG+ +G ++ +S
Sbjct: 305 YCLVNRDSAGSSTLDFNSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPEST 364
Query: 217 FQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
F+ +VD G + T L T+ Y + F ++ + +++ + CY+ S +
Sbjct: 365 FRLDESGNGGIIVDCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQA 424
Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
++VP + F+ +S+ + + P + T +C T IIG G R+
Sbjct: 425 SVRVPTVSFHFADGKSWNLPAANYLIPVDSAGT-YCFAFAPTTSSLSIIGNVQQQGTRVT 483
Query: 329 FDRENLKLAWSHSKCE 344
FD N ++ +S +KC+
Sbjct: 484 FDLANNRMGFSPNKCQ 499
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 79/315 (25%), Positives = 126/315 (40%), Gaps = 35/315 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
YDPS S+S V C P C+ ++C++ C Y Y + + + G + L L
Sbjct: 205 YDPSVSTSYATVGCDSPRCRDLDAAACRNSTGSCLYEVAYG-DGSYTVGDFATETLTLG- 262
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S+ S+V IGCG G ++ A + G L S PS ++ +FS
Sbjct: 263 ------DSAPVSNVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----TTFS 308
Query: 163 ICFDENDSGS---VFFGD-QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSG 216
C + DS S + FGD + PA P + Y+V + +G L+ S
Sbjct: 309 YCLVDRDSPSSSTLQFGDSEQPAVTAPLIRSPRTNTF--YYVALSGISVGGEALSIPSSA 366
Query: 217 FQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
F +VDSG + T L + Y + F + S + + + CY+ +
Sbjct: 367 FAMDDAGSGGVIVDSGTAVTRLQSGAYGALREAFVQGTQSLPRASGVSLFDTCYDLAGRS 426
Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
++VP + L F + + P + T +CL T G IIG G R+
Sbjct: 427 SVQVPAVALWFEGGGELKLPAKNYLIPVDAAGT-YCLAFAGTSGPVSIIGNVQQQGVRVS 485
Query: 329 FDRENLKLAWSHSKC 343
FD + ++ KC
Sbjct: 486 FDTAKNTVGFTADKC 500
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 79/302 (26%), Positives = 136/302 (45%), Gaps = 29/302 (9%)
Query: 45 YDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
+DP +SS+ K+VSCS C ++++SC + + C Y Y + + + G + D L L
Sbjct: 136 FDPKASSTYKDVSCSSSQCTALENQASCSTEDNTCSYSTSYG-DRSYTKGNIAVDTLTLG 194
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
S Q ++IIGCG G++ G++GLG G VS+ + L + I F
Sbjct: 195 STDTRPVQ---LKNIIIGCGHNNAGTF--NKKGSGIVGLGGGAVSLITQLGDS--IDGKF 247
Query: 162 SICF----DENDSGS-VFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLTQ 214
S C END S + FG + P+ K Y++ ++S +G+ +
Sbjct: 248 SYCLVPLTSENDRTSKINFGTNAVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQY 307
Query: 215 SGFQA-------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
G + ++DSG + T LPTE Y+E+ + +++ CY+A+ +
Sbjct: 308 PGSDSGSGEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQTGLSLCYSATGD 367
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ-NFMMGHR 326
LKVP + + F + ++ F +E F + YG + Q NF++G+
Sbjct: 368 --LKVPAITMHFDGADVNLKPSNCF-VQISEDLVCFAFRGSPSFSIYGNVAQMNFLVGYD 424
Query: 327 IV 328
V
Sbjct: 425 TV 426
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 77/340 (22%), Positives = 144/340 (42%), Gaps = 46/340 (13%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSL---KDPCPYIADYSTEDTSSSGYLV 94
++L ++PS S + + C +C+ + SSC C Y Y+ + + ++G+L
Sbjct: 148 QSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWGNGICVYAYAYA-DHSITTGHLD 206
Query: 95 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
D AS + HA + + GCG G ++ G+ G G +S+P A
Sbjct: 207 SDTFSFAS-ADHAIGGASVPDLTFGCGLFNNGIFVSNET--GIAGFSRGALSMP-----A 258
Query: 155 GLIQNSFSICFDE---NDSGSVFFG----------DQGPATQQSTSFLPI-GEKYDAYFV 200
L ++FS CF ++ VF G G QST+ + + AY++
Sbjct: 259 QLKVDNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYI 318
Query: 201 GVESYCIGNSCLT--QSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 250
++ +G + L +S F +VDSG T LP +Y V D V+ +
Sbjct: 319 SLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVC---DAFVAQTK 375
Query: 251 ISLQGNS---WKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLT 306
+++ ++ + C++ VP + L F + R N++F E G + CL
Sbjct: 376 LTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLA 435
Query: 307 VMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+ + + D +IG +++D N L++ ++C ++
Sbjct: 436 INAGE-DLSVIGNFQQQNMHVLYDLANDMLSFVPARCNKI 474
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 77/340 (22%), Positives = 144/340 (42%), Gaps = 46/340 (13%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSL---KDPCPYIADYSTEDTSSSGYLV 94
++L ++PS S + + C +C+ + SSC C Y Y+ + + ++G+L
Sbjct: 122 QSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWGNGICVYAYAYA-DHSITTGHLD 180
Query: 95 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
D AS + HA + + GCG G ++ G+ G G +S+P A
Sbjct: 181 SDTFSFAS-ADHAIGGASVPDLTFGCGLFNNGIFVSNET--GIAGFSRGALSMP-----A 232
Query: 155 GLIQNSFSICFDE---NDSGSVFFG----------DQGPATQQSTSFLPI-GEKYDAYFV 200
L ++FS CF ++ VF G G QST+ + + AY++
Sbjct: 233 QLKVDNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYI 292
Query: 201 GVESYCIGNSCLT--QSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 250
++ +G + L +S F +VDSG T LP +Y V D V+ +
Sbjct: 293 SLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVC---DAFVAQTK 349
Query: 251 ISLQGNS---WKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLT 306
+++ ++ + C++ VP + L F + R N++F E G + CL
Sbjct: 350 LTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLA 409
Query: 307 VMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+ + + D +IG +++D N L++ ++C ++
Sbjct: 410 INAGE-DLSVIGNFQQQNMHVLYDLANDMLSFVPARCNKI 448
>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
Length = 437
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 81/324 (25%), Positives = 135/324 (41%), Gaps = 37/324 (11%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
LS Y+ S+SS+S SCS PLC SRS S C Y++ Y + S Y+ D
Sbjct: 127 LSIYNLSASSTSSVSSCSDPLCTGEEVVCSRSGNNS---ACAYVSSYQDKSASVGAYVRD 183
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
D+ ++ H ++ S + GC TGS+ DG+MG GL +VP+ +A
Sbjct: 184 DMHYVL----HGGNATT-SRIFFGCATNITGSW----PVDGIMGFGLISKTVPNQIATQR 234
Query: 156 LIQNSFSICF-DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
+ FS C E G + + P T + F P+ Y V + S + + L
Sbjct: 235 NMSRVFSHCLGGEKHGGGILEFGEAPNTTEMV-FTPLLNVTTHYNVDLLSISVNSKVLPI 293
Query: 213 ----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR-ISLQGNSWKYC 261
+ + ++DSG +F L T+ + + L ++K L+G Y
Sbjct: 294 DPKEFSYVRNSTNNTGVIIDSGTTFVLLTTKANRMLFQEIKSLTTAKLGPKLEGLECFYL 353
Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ 319
+ + E P++ L FS + ++ N++ + +C S DG I G+
Sbjct: 354 KSGLTMET-SFPNVTLTFSGGSTMKLKPDNYLVMAEYKKKRNGYCYAWSSADG-LTIFGE 411
Query: 320 NFMMGHRIVFDRENLKLAWSHSKC 343
+ + +D EN ++ W C
Sbjct: 412 IVLKDKLVFYDVENRRIGWKGQNC 435
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 82/333 (24%), Positives = 139/333 (41%), Gaps = 56/333 (16%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
++P+ S+S ++ CS +C + S ++ C Y A Y + SS+G L ++ +F
Sbjct: 127 FEPAKSTSYASLPCSSAMCNALYSPLCFQNACVYQAFYG-DSASSAGVLANETF---TFG 182
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
++ + +V V GCG G+ +G+ G++G G G +S+ S L FS C
Sbjct: 183 TNSTRVAVP-RVSFGCGNMNAGTLFNGS---GMVGFGRGALSLVSQLGSP-----RFSYC 233
Query: 165 ---FDENDSGSVFFG-----------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS 210
F + ++FG GP QST F+ YF+ + +
Sbjct: 234 LTSFMSPATSRLYFGAYATLNSTNTSSSGPV--QSTPFIVNPALPTMYFLNMTGISVAGD 291
Query: 211 CL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSW 258
L T ++DSG + TFL YA V F V R + +++
Sbjct: 292 LLPIDPSVFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTF 351
Query: 259 KYCYN--ASSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG 315
C+ M+ +P+M L F + + N++ + G CL ++ +D D
Sbjct: 352 DTCFKWPPPPRRMVTLPEMVLHFDGADMELPLENYMV---MDGGTGNLCLAMLPSD-DGS 407
Query: 316 IIG----QNFMMGHRIVFDRENLKLAWSHSKCE 344
IIG QNF M ++D EN L++ + C
Sbjct: 408 IIGSFQHQNFHM----LYDLENSLLSFVPAPCN 436
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 72/312 (23%), Positives = 133/312 (42%), Gaps = 21/312 (6%)
Query: 45 YDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+DP+ SS+ +V C C +++ C S K C Y+ Y T D+ + G L D +
Sbjct: 130 FDPTQSSTYVDVPCESQPCTLFPQNQRECGSSKQ-CIYLHQYGT-DSFTIGRLGYDTISF 187
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+S ++ SV GC ++ +G +GLG G +S+ S L I +
Sbjct: 188 SSTGMGQGGATFPKSVF-GCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQ--IGHK 244
Query: 161 FSIC---FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFV-GVESYCIG-NSCLT-Q 214
FS C F +G + FG P + ++ I Y +Y+V +E +G LT Q
Sbjct: 245 FSYCMVPFSSTSTGKLKFGSMAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKVLTGQ 304
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 274
G ++DS T L IY + + + ++ + ++YC + L P+
Sbjct: 305 IGGNIIIDSVPILTHLEQGIYTDFISSVKEAINVEVAEDAPTPFEYCVRNPTN--LNFPE 362
Query: 275 MRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 334
F+ + ++F +N + C+TV+ + G I G + ++ +D
Sbjct: 363 FVFHFTGADVVLGPKNMFIALDNN---LVCMTVVPSKG-ISIFGNWAQVNFQVEYDLGEK 418
Query: 335 KLAWSHSKCEEV 346
K++++ + C +
Sbjct: 419 KVSFAPTNCSTI 430
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 76/312 (24%), Positives = 133/312 (42%), Gaps = 29/312 (9%)
Query: 46 DPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHL 100
+PS+S+S KN+SCS LCK +S K C Y Y + + S G+ + L L
Sbjct: 175 NPSTSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYG-DGSYSIGFFATETLTL 233
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+S S+V + + GCG++ G + A G+ L ++PS AK +
Sbjct: 234 SS-------SNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKL---ALPSQTAKT--YKKL 281
Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT--QS 215
FS C + S + G +S F P+ +D+ Y + + +G L+ +S
Sbjct: 282 FSYCLPASSSSKGYL-SLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDES 340
Query: 216 GFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 273
F A ++DSG T L Y+E+ F L++ + + + CY+ S + +++P
Sbjct: 341 AFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIP 400
Query: 274 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRIVFDR 331
+ + F + +P N G CL D D I G +++V+D
Sbjct: 401 KVGVTFKGGVEMDIDVSGILYPVN-GLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDG 459
Query: 332 ENLKLAWSHSKC 343
++ ++ C
Sbjct: 460 AKGRVGFAPGGC 471
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 140/333 (42%), Gaps = 53/333 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+ P +SSS + + C+ LC SC+ D C Y Y + T++ G + +S
Sbjct: 146 FSPGASSSYEPMRCAGELCNDILHHSCQR-PDTCTYRYSYG-DGTTTRGVYATERFTFSS 203
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S + + + + GCG GS +G+ G++G G +S+ S LA FS
Sbjct: 204 SSSGGETTKLSAPLGFGCGTMNKGSLNNGS---GIVGFGRAPLSLVSQLAI-----RRFS 255
Query: 163 ICFDENDSG---SVFFG-------DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
C SG ++ FG D AT Q+T L + Y+V +G L
Sbjct: 256 YCLTPYASGRKSTLLFGSLRGGVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRL 315
Query: 213 TQ--SGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK--- 259
S F A+VDSG + T P + AEVV F S R+ N
Sbjct: 316 RIPISAFALRPDGSGGAIVDSGTALTLFPAPVLAEVVRAFR---SQLRLPFAANGSSGPD 372
Query: 260 --YCYNASSEEMLK---VPDMRLIF---SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD 311
C+ A++ + + VP R++F + RN++ +++ CL +++
Sbjct: 373 DGVCFAAAASRVPRPAVVP--RMVFHLQGADLDLPRRNYVL---DDQRKGNLCL-LLADS 426
Query: 312 GDYGIIGQNFM-MGHRIVFDRENLKLAWSHSKC 343
GD G NF+ R+++D E L+++ ++C
Sbjct: 427 GDSGTTIGNFVQQDMRVLYDLEADTLSFAPAQC 459
>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 467
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 89/354 (25%), Positives = 144/354 (40%), Gaps = 79/354 (22%)
Query: 45 YDPSSSSSSKNVSCSHPLC------KSRSSCKSLKDPCP--------YIADYSTEDTSSS 90
+ P SSSSSK + C +P C K +S C+ + P Y+ Y + T
Sbjct: 139 FIPKSSSSSKVLGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSGITG-- 196
Query: 91 GYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 150
G ++ + L L K P + I+GC S L + P G+ G G G S+PS
Sbjct: 197 GIMLSETLDLPG--KGVP------NFIVGC------SVLSTSQPAGISGFGRGPPSLPSQ 242
Query: 151 LAKAGLIQNSFSICF------DENDSGSVFFGDQGPATQQST--SFLPIGEKYDA----- 197
L GL FS C D +S S+ + + +++ S+ P +
Sbjct: 243 L---GL--KKFSYCLLSRRYDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHA 297
Query: 198 ----YFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFD 243
Y++G+ +G + ++DSG +FT++ EI+ V +F+
Sbjct: 298 FSVYYYLGLRHITVGGKHVKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFE 357
Query: 244 KLVSSKRIS-LQG-NSWKYCYNASSEEMLKVPDMRLIF--SKNQSFVVRNHIFSFPENEG 299
K V SKR + ++G + C+N S P++ L F + N++ G
Sbjct: 358 KQVQSKRATEVEGITGLRPCFNISGLNTPSFPELTLKFRGGAEMELPLANYVAFL---GG 414
Query: 300 FTVFCLTVMSTDGDYG--------IIGQNFMMGHRIV-FDRENLKLAWSHSKCE 344
V CLT++ TDG G II NF + V +D N +L + C+
Sbjct: 415 DDVVCLTIV-TDGAAGKEFSGGPAIILGNFQQQNFYVEYDLRNERLGFRQQSCK 467
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 91/354 (25%), Positives = 138/354 (38%), Gaps = 61/354 (17%)
Query: 25 LLW-----CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPY- 78
LLW CL + QD L Y PS+SS+ V C P C + + PC +
Sbjct: 88 LLWVQCAPCLQCY----AQDTPL--YAPSNSSTFNPVPCLSPECLLIPATEGF--PCDFH 139
Query: 79 -----IADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA 133
+Y DTS S + ++ V GCGR GS+ AA
Sbjct: 140 YPGACAYEYRYADTSLSKGVF-------AYESATVDDVRIDKVAFGCGRDNQGSF---AA 189
Query: 134 PDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDSGSV----FFGDQGPATQQSTSF 188
GV+GLG G +S S + A N F+ C + D SV FGD+ +T F
Sbjct: 190 AGGVLGLGQGPLSFGSQVGYA--YGNKFAYCLVNYLDPTSVSSWLIFGDELISTIHDLQF 247
Query: 189 LPI---GEKYDAYFVGVESYCIGNSCL--TQSGFQ--------ALVDSGASFTFLPTEIY 235
PI Y+V +E +G L + S + ++ DSG + T+ Y
Sbjct: 248 TPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLGNGGSIFDSGTTVTYWLPPAY 307
Query: 236 AEVVVKFDKLVSSKRI-SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR--NHIF 292
++ FDK V R S+QG C + + + P ++ F + N+
Sbjct: 308 RNILAAFDKNVRYPRAASVQG--LDLCVDVTGVDQPSFPSFTIVLGGGAVFQPQQGNYFV 365
Query: 293 SFPENEGFTVFCLTVM---STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
N V CL + S+ G + IG + +DRE ++ ++ +KC
Sbjct: 366 DVAPN----VQCLAMAGLPSSVGGFNTIGNLLQQNFLVQYDREENRIGFAPAKC 415
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 76/312 (24%), Positives = 133/312 (42%), Gaps = 29/312 (9%)
Query: 46 DPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHL 100
+PS+S+S KN+SCS LCK +S K C Y Y + + S G+ + L L
Sbjct: 163 NPSTSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYG-DGSYSIGFFATETLTL 221
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+S S+V + + GCG++ G + A G+ L ++PS AK +
Sbjct: 222 SS-------SNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKL---ALPSQTAKT--YKKL 269
Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT--QS 215
FS C + S + G +S F P+ +D+ Y + + +G L+ +S
Sbjct: 270 FSYCLPASSSSKGYL-SLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDES 328
Query: 216 GFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 273
F A ++DSG T L Y+E+ F L++ + + + CY+ S + +++P
Sbjct: 329 AFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIP 388
Query: 274 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRIVFDR 331
+ + F + +P N G CL D D I G +++V+D
Sbjct: 389 KVGVTFKGGVEMDIDVSGILYPVN-GLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDG 447
Query: 332 ENLKLAWSHSKC 343
++ ++ C
Sbjct: 448 AKGRVGFAPGGC 459
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 79/327 (24%), Positives = 133/327 (40%), Gaps = 45/327 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLC-------KSRSSCKSLKDP-CPYIADYSTEDTSSSGYLVDD 96
++PS+SSS ++ C+ P C S C + C Y DY + + S G L
Sbjct: 106 FNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYG-DGSYSRGEL--- 161
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
F K + + I GCGR G + G+MGL ++S+ S + L
Sbjct: 162 -----GFEKLTLGKTEIDNFIFGCGRNNKGLF---GGASGLMGLARSELSLVS--QTSSL 211
Query: 157 IQNSFSICFDEN---DSGSVFFGDQGPATQQSTSFLPIG--------EKYDAYFVGVESY 205
+ FS C SGS+ G + ++ S PI + + YF+ +
Sbjct: 212 FGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNIS--PISYTRMIQNPQMSNFYFLNLTGI 269
Query: 206 CIGNSCL------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK 259
IG L + G +L+DSG T L IY +F+K S R + +
Sbjct: 270 SIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILN 329
Query: 260 YCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMST--DGDYGI 316
C+N + E + +P ++ IF N +V +F F +++ + CL S + I
Sbjct: 330 TCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQI-CLAFASLGYEDQTMI 388
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKC 343
IG R++++ + K+ ++ C
Sbjct: 389 IGNYQQKNQRVIYNSKESKVGFAGEPC 415
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 78/312 (25%), Positives = 124/312 (39%), Gaps = 31/312 (9%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
++PSSSSS + +SC P C + + C Y Y + + + G + L + S
Sbjct: 190 FEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYG-DGSYTVGDFATETLTIGS-- 246
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
++ +V +GCG G + V GL + L + L SFS C
Sbjct: 247 ------TLVQNVAVGCGHSNEGLF--------VGAAGLLGLGGGLLALPSQLNTTSFSYC 292
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT--QSGFQA 219
+ DS S D G + P+ + Y++G+ +G L QS F+
Sbjct: 293 LVDRDSDSASTVDFGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEM 352
Query: 220 --------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
++DSG + T L TEIY + F K + + CYN S++ ++
Sbjct: 353 DESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVE 412
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 331
VP + F + + + P + T FCL T IIG G R+ FD
Sbjct: 413 VPTVAFHFPGGKMLALPAKNYMIPVDSVGT-FCLAFAPTASSLAIIGNVQQQGTRVTFDL 471
Query: 332 ENLKLAWSHSKC 343
N + +S +KC
Sbjct: 472 ANSLIGFSSNKC 483
>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
Length = 483
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 80/335 (23%), Positives = 128/335 (38%), Gaps = 55/335 (16%)
Query: 58 CSHPLCKSRSSCKSLKDPCPYIA-DYST-------------EDTSSSGYLVDDILHLASF 103
C+ P C S + DPC ST T +G +V L +
Sbjct: 143 CTSPFCIDVHSSDNPLDPCTMAGCSLSTLVKATCSWPCPPFAYTYGAGGVVTGTLTRDTL 202
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
H V + C SY + P G+ G G G +S+PS L G ++ FS
Sbjct: 203 RVHGRNLGVTQEIPRFCFGCVASSYRE---PIGIAGFGRGALSLPSQL---GFLRKGFSH 256
Query: 164 CF-------DENDSGSVFFGDQGPATQQSTSFLPIGEK---YDAYFVGVESYCIGNSCLT 213
CF + N S + GD ++ F P+ + + Y+VG+E+ +GN T
Sbjct: 257 CFLAFKYANNPNISSPLIIGDIALTSKDDMQFTPMLKSPMYPNYYYVGLEAITVGNVSAT 316
Query: 214 Q-----------SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS--LQGNSWKY 260
+ LVDSG ++T LP Y++V+ +++ R + +
Sbjct: 317 EVPSSLREFDSLGNGGMLVDSGTTYTHLPEPFYSQVLSVLQSIINYPRATDMEMRTGFDL 376
Query: 261 CYNA--SSEEMLK---VPDMRLIFSKNQSFVVRN--HIFSFPENEGFTVF-CLTVMST-D 311
CY + +L +P + F N S V+ H ++ TV CL S D
Sbjct: 377 CYKVPCQNNSILTGDLLPSITFHFLNNASLVLSRGSHFYAMSAPSNSTVVKCLLFQSMDD 436
Query: 312 GDY---GIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
GDY G++G +V+D E ++ + C
Sbjct: 437 GDYGPAGVLGSFQQQDVEVVYDMEKERIGFRPMDC 471
>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 90/377 (23%), Positives = 148/377 (39%), Gaps = 71/377 (18%)
Query: 17 LLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLC----KSRSSCKS- 71
++ P T+ C +S + + P SSSSK + C +P C S +C
Sbjct: 90 IVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKESSSSKLLGCKNPKCSWIHHSNINCDQD 149
Query: 72 ------LKDPCP-YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ 124
L CP Y+ Y + T G + + LHL S SK + ++GC
Sbjct: 150 CSIKSCLNQTCPPYMIFYGSGTTG--GVALSETLHLHSLSK--------PNFLVGC---- 195
Query: 125 TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS---VFFGDQGPA 181
S P G+ G G G S+PS L S FD++ S V +Q +
Sbjct: 196 --SVFSSHQPAGIAGFGRGLSSLPSQLGLGKFSYCLLSHRFDDDTKKSSSLVLDMEQLDS 253
Query: 182 TQQSTS--FLPI--GEKYDA-------YFVGVESYCIGNSCLT----------QSGFQAL 220
+++ + + P K D Y++G+ +G + +
Sbjct: 254 DKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVGGHHVKVPYKYLSPGEDGNGGVI 313
Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ---GNSWKYCYNASSEEMLKVPDMRL 277
+DSG +FTF+ E + + +F + + R + + C+N S + + P++RL
Sbjct: 314 IDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIEDAIGLRPCFNVSDAKTVSFPELRL 373
Query: 278 IFS--KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG---------IIGQNFMMGHR 326
F + + V N+ F+F E V CLTV+ TDG G I+G M
Sbjct: 374 YFKGGADVALPVENY-FAFVGGE---VACLTVV-TDGVAGPERVGGPGMILGNFQMQNFY 428
Query: 327 IVFDRENLKLAWSHSKC 343
+ +D N +L + KC
Sbjct: 429 VEYDLRNERLGFKQEKC 445
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 79/327 (24%), Positives = 133/327 (40%), Gaps = 45/327 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLC-------KSRSSCKSLKDP-CPYIADYSTEDTSSSGYLVDD 96
++PS+SSS ++ C+ P C S C + C Y DY + + S G L
Sbjct: 185 FNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYG-DGSYSRGEL--- 240
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
F K + + I GCGR G + G+MGL ++S+ S + L
Sbjct: 241 -----GFEKLTLGKTEIDNFIFGCGRNNKGLF---GGASGLMGLARSELSLVS--QTSSL 290
Query: 157 IQNSFSICFDEN---DSGSVFFGDQGPATQQSTSFLPIG--------EKYDAYFVGVESY 205
+ FS C SGS+ G + ++ S PI + + YF+ +
Sbjct: 291 FGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNIS--PISYTRMIQNPQMSNFYFLNLTGI 348
Query: 206 CIGNSCL------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK 259
IG L + G +L+DSG T L IY +F+K S R + +
Sbjct: 349 SIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILN 408
Query: 260 YCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMST--DGDYGI 316
C+N + E + +P ++ IF N +V +F F +++ + CL S + I
Sbjct: 409 TCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQI-CLAFASLGYEDQTMI 467
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKC 343
IG R++++ + K+ ++ C
Sbjct: 468 IGNYQQKNQRVIYNSKESKVGFAGEPC 494
>gi|118486628|gb|ABK95151.1| unknown [Populus trichocarpa]
Length = 393
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 82/318 (25%), Positives = 135/318 (42%), Gaps = 43/318 (13%)
Query: 56 VSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHL--ASFSKHAPQS 110
V C P+C+S S + P DY E SS G LV D +L S +H+P
Sbjct: 84 VPCMDPICQSLHSNGDHRCENPGQCDYEVEYADGGSSFGVLVTDTFNLNFTSEKRHSPL- 142
Query: 111 SVQSSVIIGCGRKQ--TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN 168
+ +GCG Q GS+ DGV+GLG G S+ S L+ GL++N C +
Sbjct: 143 -----LALGCGYDQFPGGSH---HPIDGVLGLGKGKSSIVSQLSSLGLVRNVIGHCLSGH 194
Query: 169 DSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV---DSGA 225
G +FFGD + + ++ P+ Y G+ +GF+ L+ DSGA
Sbjct: 195 GGGFLFFGDDLYDSSR-VAWTPMSPDAKHYSPGLAELTFDGK---TTGFKNLLTTFDSGA 250
Query: 226 SFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQ 283
S+T+L ++ Y ++ K +S K R +L + C+ + + D++ F K
Sbjct: 251 SYTYLNSQAYQGLISLLKKELSGKPLREALDDQTLPLCWKG-RKPFKSIRDVKKYF-KTF 308
Query: 284 SFVVRNHI-----FSFPENEGFTVF------CLTVMSTD----GDYGIIGQNFMMGHRIV 328
+ N FP E + + CL +++ D +IG M ++
Sbjct: 309 ALSFTNERKSKTELEFPP-EAYLIISSKGNACLGILNGTEVGLNDLNVIGDISMQDRVVI 367
Query: 329 FDRENLKLAWSHSKCEEV 346
+D E ++ W+ C +
Sbjct: 368 YDNEKERIGWAPGNCNRL 385
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 76/312 (24%), Positives = 133/312 (42%), Gaps = 29/312 (9%)
Query: 46 DPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHL 100
+PS+S+S KN+SCS LCK +S K C Y Y + + S G+ + L L
Sbjct: 115 NPSTSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYG-DGSYSIGFFATETLTL 173
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+S S+V + + GCG++ G + A G+ L ++PS AK +
Sbjct: 174 SS-------SNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKL---ALPSQTAKT--YKKL 221
Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT--QS 215
FS C + S + G +S F P+ +D+ Y + + +G L+ +S
Sbjct: 222 FSYCLPASSSSKGYL-SLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRQLSIDES 280
Query: 216 GFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 273
F A ++DSG T L Y+E+ F L++ + + + CY+ S + +++P
Sbjct: 281 AFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIP 340
Query: 274 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRIVFDR 331
+ + F + +P N G CL D D I G +++V+D
Sbjct: 341 KVGVTFKGGVEMDIDVSGILYPVN-GLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDG 399
Query: 332 ENLKLAWSHSKC 343
++ ++ C
Sbjct: 400 AKGRVGFAPGGC 411
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 77/318 (24%), Positives = 130/318 (40%), Gaps = 35/318 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
++PS S+S N+SCS P C + SC + C Y Y + + S G+ D
Sbjct: 181 FNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSA--STCVYGIQYG-DQSYSVGFFAQDK 237
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGL 156
L L S + V ++ + GCG+ G ++ A G++GLG +S+ S A K G
Sbjct: 238 LALTS-------TDVFNNFLFGCGQNNRGLFVGVA---GLIGLGRNALSLVSQTAQKYGK 287
Query: 157 IQNSFSICFDENDS--GSVFFGDQGPATQQSTSFLPI---GEKYDAYFVGVESYCIGNSC 211
+ FS C S G + FG G T ++ F P + YF+ + + +G
Sbjct: 288 L---FSYCLPSTSSSTGYLTFGSGG-GTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRK 343
Query: 212 LTQSG-----FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
L+ S ++DSG + LP Y+++ F + +S + + CY+ S
Sbjct: 344 LSTSASVFSTAGTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASILDTCYDFSQ 403
Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
+ + VP + L FS + + N S D I+G
Sbjct: 404 YDTVDVPKINLYFSDGAEMDLDPSGIFYILNISQVCLAFAGNSDATDIAILGNVQQKTFD 463
Query: 327 IVFDRENLKLAWSHSKCE 344
+V+D ++ ++ CE
Sbjct: 464 VVYDVAGGRIGFAPGGCE 481
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 85/341 (24%), Positives = 140/341 (41%), Gaps = 52/341 (15%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSL---KDPCPYIADYSTEDTSSSGYLV 94
R L DPS+SS+ + CS P+C + SSC C Y+ Y + G +
Sbjct: 452 RALGPLDPSNSSTFDVLPCSSPVCDNLTWSSCGKHNWGNQTCVYVYAY------ADGSIT 505
Query: 95 DDILHLASFSKHAPQSSVQSSV---IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
L +F+ A + Q++V GCG G + G+ G G G +S+PS L
Sbjct: 506 TGHLDAETFTFAAADGTGQATVPDLAFGCGLFNNGIFTSNET--GIAGFGRGALSLPSQL 563
Query: 152 AKAGLIQNSFSICFDE---NDSGSVFFG------DQGPATQQSTSFLPIGEKYDAYFVGV 202
++FS CF ++ SV G QST + AY++ +
Sbjct: 564 KV-----DNFSHCFTAITGSEPSSVLLGLPANLYSDADGAVQSTPLVQNFSSLRAYYLSL 618
Query: 203 ESYCIGNS---------CLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS 252
+ +G++ L Q G ++DSG T LP + Y V D + R+
Sbjct: 619 KGITVGSTRLPIPESTFALKQDGTGGTIIDSGTGMTTLPQDAYKLV---HDAFTAQVRLP 675
Query: 253 LQGNS----WKYCYNASSEEMLK--VPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCL 305
+ + + C++ S K VP + L F + R N++F F E+ G +V CL
Sbjct: 676 VDNATSSSLSRLCFSFSVPRRAKPDVPKLVLHFEGATLDLPRENYMFEF-EDAGGSVTCL 734
Query: 306 TVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+ + D D IIG +++D L++ ++C +
Sbjct: 735 AINAGD-DLTIIGNYQQQNLHVLYDLVRNMLSFVPAQCNRL 774
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 76/320 (23%), Positives = 133/320 (41%), Gaps = 39/320 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLA 101
++P SSS + VSC+ C+S S C C Y YS D S + G L D + +
Sbjct: 132 FNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSCSY--GYSYGDRSFTYGDLASDQITIG 189
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
SF P++ +IGCG + G++ G + G V + AG ++ F
Sbjct: 190 SFK--LPKT------VIGCGHQNGGTF-GGVTSGIIGLGGGSLSLVSQMRTIAG-VKPRF 239
Query: 162 SICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGN----- 209
S C + N +G++ FG + + + P+ + YF+ +E+ +G
Sbjct: 240 SYCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKA 299
Query: 210 ----SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
S +T G ++DSG + T LP +Y V +++ +KR+ + CY+A
Sbjct: 300 ANGISAMTNHG-NIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAG 358
Query: 266 SEEMLKVPDMRLIFS--KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 323
+ L +P + F+ + + N +N V CLT + I G +
Sbjct: 359 QVDDLNIPIITAHFAGGADVKLLPVNTFAPVADN----VTCLT-FAPATQVAIFGNLAQI 413
Query: 324 GHRIVFDRENLKLAWSHSKC 343
+ +D N +L++ C
Sbjct: 414 NFEVGYDLGNKRLSFEPKLC 433
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 81/327 (24%), Positives = 134/327 (40%), Gaps = 40/327 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSR--------SSCKSLKDPCPYIADYSTEDTSSS-GYLVD 95
+ P++S S + CS CKS S+ + PC Y DY +D SS+ G +
Sbjct: 157 FRPANSKSWAPIPCSSDTCKSYVPFSLANCSAGTTPPAPCGY--DYRYKDKSSARGVVGT 214
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
D +A + + + V++GC G + DGV+ LG ++S S A
Sbjct: 215 DAATIALSGSGSDRKAKLQEVVLGCTTSYDGQSFQSS--DGVLSLGNSNISFASR--AAA 270
Query: 156 LIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGN 209
FS C N + + FG G A S + L + + ++ V V++ +
Sbjct: 271 RFGGRFSYCLVDHLAPRNATSYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSVAG 330
Query: 210 SCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKY 260
L + A++DSG S T L T Y VV K L R+++ + ++Y
Sbjct: 331 KALNIPAEVWDVKKNGGAILDSGTSLTILATPAYKAVVAALSKQLARVPRVTM--DPFEY 388
Query: 261 CYN-ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY---GI 316
CYN ++ VP + + F+ S +R S+ + V C+ + +G + +
Sbjct: 389 CYNWTATRRPPAVPRLEVRFAG--SARLRPPTKSYVIDAAPGVKCIGLQ--EGVWPGVSV 444
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKC 343
IG H FD N L + S+C
Sbjct: 445 IGNILQQEHLWEFDLANRWLRFQESRC 471
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 80/312 (25%), Positives = 124/312 (39%), Gaps = 31/312 (9%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
++PSSSSS + +SC P C + + C Y Y + + + G + L + S
Sbjct: 193 FEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYG-DGSYTVGDFATETLTIGS-- 249
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
++ +V +GCG G + V GL + L + L SFS C
Sbjct: 250 ------TLVQNVAVGCGHSNEGLF--------VGAAGLLGLGGGLLALPSQLNTTSFSYC 295
Query: 165 FDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA 219
+ DS S V FG P L + Y++G+ +G L QS F+
Sbjct: 296 LVDRDSDSASTVEFGTSLPPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEM 355
Query: 220 --------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
++DSG + T L T IY + F K S + + CYN S++ ++
Sbjct: 356 DESGSGGIIIDSGTAVTRLQTGIYNSLRDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIE 415
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 331
VP + F + + + P + T FCL T IIG G R+ FD
Sbjct: 416 VPTVAFHFPGGKMLALPAKNYMIPVDSVGT-FCLAFAPTASSLAIIGNVQQQGTRVTFDL 474
Query: 332 ENLKLAWSHSKC 343
N + +S +KC
Sbjct: 475 ANSLIGFSSNKC 486
>gi|449508697|ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Cucumis sativus]
Length = 418
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 74/321 (23%), Positives = 130/321 (40%), Gaps = 29/321 (9%)
Query: 47 PSSSSSSKNVSCSHPLCKSRSSCKSLK----DPCPYIADYSTEDTSSSGYLVDDILHLAS 102
P S+ V C PLC S S + D C Y +Y+ + SS G LV D+ L +
Sbjct: 98 PLYQPSNDLVPCKDPLCMSLHSSMDHRCENPDQCDYEVEYA-DGGSSLGVLVRDVFPL-N 155
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
+ P ++ + +GCG Q DG++GLG G VS+ S L G+++N
Sbjct: 156 LTNGDP---IRPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVG 212
Query: 163 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSCLTQSGFQALV 221
CF+ F G + P+ Y ++ G +
Sbjct: 213 HCFNSKGG-GYXFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVF 271
Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIF 279
DSG+S+T+ + Y + ++ ++ K R ++ ++ C+ + + + D+R F
Sbjct: 272 DSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRG-RKPIKSLRDVRKYF 330
Query: 280 S----KNQSFVVRNHIFSFPENEGFTVF------CLTVMS-TD---GDYGIIGQNFMMGH 325
S +F P EG+ + CL +++ TD + IIG M
Sbjct: 331 KPLALSFSSGGRSKAVFEIP-TEGYMIISSMGNVCLGILNGTDVGLENSNIIGDISMQDK 389
Query: 326 RIVFDRENLKLAWSHSKCEEV 346
+V++ E + W+ + C+ V
Sbjct: 390 MVVYNNEKQAIGWATANCDRV 410
>gi|357168204|ref|XP_003581534.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Brachypodium distachyon]
Length = 436
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 67/299 (22%), Positives = 118/299 (39%), Gaps = 41/299 (13%)
Query: 60 HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG 119
H +C + S D C Y Y+ +++GY V D +H F + +S +SVI G
Sbjct: 149 HAICHTSHSSG---DQCGYNQIYADGVLATTGYYVSDDIHFDIFMGNESFASSSASVIFG 205
Query: 120 CGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-NDSGSVFFGDQ 178
C + ++G DGV+G G S+ S L G + ++FS C D+ +D G V D+
Sbjct: 206 CSKSRSGH----LQADGVIGFGKDAPSLISQLNSQG-VSHAFSRCLDDSDDGGGVLILDE 260
Query: 179 GPATQQSTSFLPIGEKYDAYFVGVESYCIGN-------SCLTQSGFQA-LVDSGASFTFL 230
+ F + Y + ++S + N S T S Q +DSG S +
Sbjct: 261 --VGEPGLEFTSLVASRPCYNLNMKSIAVNNQNVPIDSSLFTTSSTQGTFLDSGTSLAYF 318
Query: 231 PTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV--R 288
P +Y V+ + S R P + F + V
Sbjct: 319 PDGVYDPVIRAILFIYFSTR-----------------SFSSFPTVTXYFEGGAAMKVGPE 361
Query: 289 NHIFSFPENEGFTVFCLTVMSTDGDYG---IIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
N++ + + C+ ++GDY I+G + V++ + +++ W + C+
Sbjct: 362 NYLLRRGSYDNDSYMCIAFQRSEGDYKQTTILGDLILHDKIFVYNLKKMQIGWVNYNCK 420
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 90/325 (27%), Positives = 128/325 (39%), Gaps = 39/325 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
+DP +SSS + + C PLCK S S + C Y Y + + S G D+
Sbjct: 96 FDPRNSSSFQRIPCLSPLCKALEVHSCSGSRGATSRCSYQVAYG-DGSFSVGDFSSDLFT 154
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
L + SK SV GCG G + A G+ L S + N
Sbjct: 155 LGTGSKAM-------SVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTAN 207
Query: 160 SFSICFDEND------SGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCL 212
SFS C + S S+ FG + + S L K D Y+ + +G + L
Sbjct: 208 SFSYCLVDRSNPMTRSSSSLIFGVAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQL 267
Query: 213 ---------TQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSW 258
+QSG ++DSG S T PT +YA + F L S+ R SL +
Sbjct: 268 PISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATINLPSAPRYSL----F 323
Query: 259 KYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
CYN S + + VP + L F + + P N + FCL T + GIIG
Sbjct: 324 DTCYNFSGKASVDVPALVLHFENGADLQLPPTNYLIPINTAGS-FCLAFAPTSMELGIIG 382
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKC 343
RI FD + LA++ +C
Sbjct: 383 NIQQQSFRIGFDLQKSHLAFAPQQC 407
>gi|255079464|ref|XP_002503312.1| predicted protein [Micromonas sp. RCC299]
gi|226518578|gb|ACO64570.1| predicted protein [Micromonas sp. RCC299]
Length = 649
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 56/211 (26%), Positives = 98/211 (46%), Gaps = 26/211 (12%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRSS---CKSLK----DPCPYIADYSTEDTSSSGYLVD 95
+ +DP+ K ++C CK+ C + + C Y Y+ E + SG LV
Sbjct: 154 TRFDPTG----KWLTCQEKQCKAAGGPGICAGGRGAAANRCTYSRTYA-EGSGVSGDLVR 208
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD-VSVPSLLAKA 154
D +H AP ++ V+ GC ++G+ D A DG++GLG S+P+ LA
Sbjct: 209 DKMHFGG--DIAPATNGTLDVVFGCTNAESGTIHDQEA-DGLIGLGNNQFASIPNQLADT 265
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSF----LPIGEKYDAYFV-GVESYCIGN 209
+ FS+CF + G + PAT + + + E + AY+V + IG+
Sbjct: 266 HGLPRVFSLCFGSFEGGGALSFGRLPATPHTPPLVYTDMRVNEAHPAYYVVSTAAMKIGD 325
Query: 210 SCLTQS-----GFQALVDSGASFTFLPTEIY 235
+ G+ ++DSG +FT++PT+++
Sbjct: 326 VAVATPSDLAVGYGTVMDSGTTFTYVPTKVF 356
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 79/321 (24%), Positives = 134/321 (41%), Gaps = 46/321 (14%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
++N +DPS SS+ K C CPY DY + T + G L + +
Sbjct: 101 EQNAPIFDPSKSSTFKEKRCD-------------GHSCPYEVDYF-DHTYTMGTLATETI 146
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
L S S + V IIGCG S+ + G++GL G S+ + G
Sbjct: 147 TLHSTSG---EPFVMPETIIGCGHNN--SWFKPSF-SGMVGLNWGPSSL--ITQMGGEYP 198
Query: 159 NSFSICFDENDSGSVFFGDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ 214
S CF + + FG G +T F+ K Y++ +++ +GN+ +
Sbjct: 199 GLMSYCFSGQGTSKINFGANAIVAGDGVVSTTMFMTTA-KPGFYYLNLDAVSVGNTRIET 257
Query: 215 SG--FQAL-----VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
G F AL +DSG + T+ P V + +V++ R + + CYN+ +
Sbjct: 258 MGTTFHALEGNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYNSDTI 317
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM----STDGDYGIIGQ-NFM 322
++ V M FS V+ + N G VFCL ++ + + +G Q NF+
Sbjct: 318 DIFPVITMH--FSGGVDLVLDKYNMYMESNNG-GVFCLAIICNSPTQEAIFGNRAQNNFL 374
Query: 323 MGHRIVFDRENLKLAWSHSKC 343
+G +D +L +++S + C
Sbjct: 375 VG----YDSSSLLVSFSPTNC 391
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 79/315 (25%), Positives = 132/315 (41%), Gaps = 35/315 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP++SS+ V+C C S SSC+S + C Y +Y Y D A+
Sbjct: 62 FDPTASSTYAPVTCQSQQCSSLEMSSCRSGQ--CLYQVNYG-----DGSYTFGD---FAT 111
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S S +V +GCG G ++ A G+ G L SL + L SFS
Sbjct: 112 ESVSFGNSGSVKNVALGCGHDNEGLFVGAAGLLGLGGGPL------SLTNQ--LKATSFS 163
Query: 163 ICFDENDSG---SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLT--QSG 216
C DS ++ F T+ L K D Y+VG+ +G ++ +S
Sbjct: 164 YCLVNRDSAGSSTLDFNSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPEST 223
Query: 217 FQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
F+ +VD G + T L T+ Y + F ++ + +++ + CY+ S +
Sbjct: 224 FRLDESGNGGIIVDCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQA 283
Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
++VP + F+ +S+ + + P + T +C T IIG G R+
Sbjct: 284 SVRVPTVSFHFADGKSWNLPAANYLIPVDSAGT-YCFAFAPTTSSLSIIGNVQQQGTRVT 342
Query: 329 FDRENLKLAWSHSKC 343
FD N ++ +S +KC
Sbjct: 343 FDLANNRMGFSPNKC 357
>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 77/352 (21%), Positives = 142/352 (40%), Gaps = 59/352 (16%)
Query: 35 SIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS---------CKSLKDPC-----PYIA 80
S + + + ++P SSSSK + C +P C + SS C C PY
Sbjct: 127 SDAEPKKVPIFNPKLSSSSKILGCRNPKCVNTSSPDVHLGCPPCNGNSKNCSHACPPYSL 186
Query: 81 DYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL 140
Y T SS +L++++ + P ++ ++GC G A + G
Sbjct: 187 QYGT-GASSGDFLLENL--------NFPGKTIH-EFLVGCTTSAVGEVTSAA----LAGF 232
Query: 141 GLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYD---- 196
G S+P + S +D+ + S D + S+ P +
Sbjct: 233 GRSMFSLPMQMGVKKFAYCLNSHDYDDTRNSSKLILDYSDGETKGLSYAPFLKNPPDFPI 292
Query: 197 AYFVGVESYCIGNSCLT-QSGFQA---------LVDSGASFTFLPTEIYAEVVVKFDKLV 246
Y++GV+ IGN L S + A ++DSG ++ ++ ++ +V + K +
Sbjct: 293 YYYLGVKDIKIGNKLLRIPSKYLAPGSDGRGGLMIDSGFAYGYMTGPVFKKVTNELKKRM 352
Query: 247 SSKRISLQGNSW---KYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFT 301
S R SL+ + CYN + ++ +K+PD+ F + VV +N+ PE +
Sbjct: 353 SKYRRSLEAEAEIGVTPCYNFTGQKSIKIPDLIYQFRGGATMVVPGKNYFVLIPE---IS 409
Query: 302 VFCL---------TVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
+ C T+ T G I+G + + + + FD +N +L + C+
Sbjct: 410 LACFPLTTDAGTNTLEFTPGPSIILGNSQHVDYYVEFDLKNERLGFRQQTCQ 461
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 89/348 (25%), Positives = 140/348 (40%), Gaps = 65/348 (18%)
Query: 38 QDRNLSEYDPSSSSSSKNVSCSHPLCKSRS----SCKSLKDPCPYIADYSTEDTSSSGYL 93
Q R YDP+ SSS C LC++ S +C ++ C Y +Y + T G L
Sbjct: 124 QHREKPLYDPAKSSSFAAAPCDGRLCETGSFNTKNCS--RNKCIYTYNYGSATT--KGEL 179
Query: 94 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
+ +F +H V S+ GCG+ +GS L GA+ G++G+ +S+ S L
Sbjct: 180 ASETF---TFGEH---RRVSVSLDFGCGKLTSGS-LPGAS--GILGISPDRLSLVSQLQI 230
Query: 154 AGLIQNSFSIC----FDENDSGSVFFG---------DQGPATQQSTSFLPIGEKYDAYF- 199
FS C D N + +FFG GP S P G Y Y
Sbjct: 231 P-----RFSYCLTPFLDRNTTSHIFFGAMADLSKYRTTGPIQTTSLVTNPDGSNYYYYVP 285
Query: 200 ------------VGVESYCIGNSCLTQSGFQALVDSGASFTFLPT---EIYAEVVVKFDK 244
V V S+ IG SG VDSG + LP+ E E +V+ K
Sbjct: 286 LIGISVGTKRLNVPVSSFAIGRD---GSG-GTFVDSGDTTGMLPSVVMEALKEAMVEAVK 341
Query: 245 LVSSKRISLQGNSWKYCYN------ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE 298
L G ++ C+ + E ++VP + F + ++R + +
Sbjct: 342 LPVVNATD-HGYEYELCFQLPRNGGGAVETAVQVPPLVYHFDGGAAMLLRRDSYMVEVSA 400
Query: 299 GFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
G CL V+S+ IIG ++FD EN + +++ ++C ++
Sbjct: 401 G--RMCL-VISSGARGAIIGNYQQQNMHVLFDVENHEFSFAPTQCNQI 445
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 82/330 (24%), Positives = 131/330 (39%), Gaps = 46/330 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP SSS V C LC+ S C + C Y Y + + ++G V + L A
Sbjct: 28 FDPRRSSSYGAVGCGAALCRRLDSGGCDLRRGACMYQVAYG-DGSVTAGDFVTETLTFAG 86
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
++ A V +GCG G ++ A G+ +S P+ +++ SFS
Sbjct: 87 GARVA-------RVALGCGHDNEGLFVAAAGLLGLGRG---GLSFPTQISR--RYGRSFS 134
Query: 163 ICF-DENDSG-----------SVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCI 207
C D SG +V FG G S SF P+ Y+V + +
Sbjct: 135 YCLVDRTSSGAGAAPGSHRSSTVSFG-AGSVGASSASFTPMVRNPRMETFYYVQLVGISV 193
Query: 208 GNS---CLTQSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK-RISLQ 254
G + + +S + +VDSG S T L Y+ + F + R+S
Sbjct: 194 GGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPG 253
Query: 255 GNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD 313
G S + CY+ ++KVP + + F+ + + P + T FC TDG
Sbjct: 254 GFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGT-FCFAFAGTDGG 312
Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
IIG G R+VFD + ++ ++ C
Sbjct: 313 VSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 82/330 (24%), Positives = 131/330 (39%), Gaps = 46/330 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP SSS V C LC+ S C + C Y Y + + ++G V + L A
Sbjct: 171 FDPRRSSSYGAVGCGAALCRRLDSGGCDLRRGACMYQVAYG-DGSVTAGDFVTETLTFAG 229
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
++ A V +GCG G ++ A G+ +S P+ +++ SFS
Sbjct: 230 GARVA-------RVALGCGHDNEGLFVAAAGLLGLGRG---GLSFPTQISR--RYGRSFS 277
Query: 163 ICF-DENDSG-----------SVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCI 207
C D SG +V FG G S SF P+ Y+V + +
Sbjct: 278 YCLVDRTSSGAGAAPGSHRSSTVSFG-AGSVGASSASFTPMVRNPRMETFYYVQLVGISV 336
Query: 208 GNS---CLTQSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK-RISLQ 254
G + + +S + +VDSG S T L Y+ + F + R+S
Sbjct: 337 GGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPG 396
Query: 255 GNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD 313
G S + CY+ ++KVP + + F+ + + P + T FC TDG
Sbjct: 397 GFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGT-FCFAFAGTDGG 455
Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
IIG G R+VFD + ++ ++ C
Sbjct: 456 VSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 485
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 77/323 (23%), Positives = 126/323 (39%), Gaps = 42/323 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
+DPS+S + N+SC+ C S C S C Y Y + + + G+ D
Sbjct: 197 FDPSASKTYSNISCTSTACSGLKSATGNSPGCSS--SNCVYGIQYG-DSSFTVGFFAKDT 253
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
L L Q+ V + GCG+ G + A G++GLG +S+ A+
Sbjct: 254 LTLT-------QNDVFDGFMFGCGQNNRGLFGKTA---GLIGLGRDPLSIVQQTAQK--F 301
Query: 158 QNSFSICF--DENDSGSVFFGD-----QGPATQQSTSFLPIGEKYDA--YFVGVESYCIG 208
FS C +G + FG+ A + +F P A YF+ V +G
Sbjct: 302 GKYFSYCLPTSRGSNGHLTFGNGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVG 361
Query: 209 NSCLTQSG--FQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
L+ S FQ ++DSG T LP+ +Y + F + +S + + CY+
Sbjct: 362 GKALSISPMLFQNAGTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALSLLDTCYD 421
Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGD--YGIIGQN 320
S+ + +P + F+ N + + N I G + CL D GI G
Sbjct: 422 LSNYTSISIPKISFNFNGNANVDLEPNGILI---TNGASQVCLAFAGNGDDDTIGIFGNI 478
Query: 321 FMMGHRIVFDRENLKLAWSHSKC 343
+V+D +L + + C
Sbjct: 479 QQQTLEVVYDVAGGQLGFGYKGC 501
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 80/327 (24%), Positives = 133/327 (40%), Gaps = 46/327 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DPSSSS+ + CS LC S K C Y Y + +S+ G L + LA
Sbjct: 144 FDPSSSSTYAALPCSSTLCSDLPSSKCTSAKCGYTYTYG-DSSSTQGVLAAETFTLA--- 199
Query: 105 KHAPQSSVQSSVIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
+ V GCG G + GA G++GLG G + SL+++ GL N FS
Sbjct: 200 -----KTKLPDVAFGCGDTNEGDGFTQGA---GLVGLGRGPL---SLVSQLGL--NKFSY 246
Query: 164 C---FDENDSGSVFFGDQGPATQ--------QSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
C D+ + G ++ Q+T + + Y+V ++ +G++ +
Sbjct: 247 CLTSLDDTSKSPLLLGSLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHI 306
Query: 213 T--QSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
T S F +VDSG S T+L + Y + F + G C+
Sbjct: 307 TLPSSAFAVQDDGTGGVIVDSGTSITYLELQGYRALKKAFAAQMKLPAADGSGIGLDTCF 366
Query: 263 NASSEEMLKVPDMRLIF---SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ 319
A + + +V +L+F + N++ + G CLTVM + G IIG
Sbjct: 367 EAPASGVDQVEVPKLVFHLDGADLDLPAENYMV---LDSGSGALCLTVMGSRG-LSIIGN 422
Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEV 346
+ V+D L+++ +C ++
Sbjct: 423 FQQQNIQFVYDVGENTLSFAPVQCAKL 449
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 86/369 (23%), Positives = 154/369 (41%), Gaps = 62/369 (16%)
Query: 15 NALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS------ 68
N L CLP C F +N + YDP +S+S KN++C+ P C SS
Sbjct: 186 NWLQCLP------CYDCF------HQNEAFYDPKTSASFKNITCNDPRCSLISSPEPPVQ 233
Query: 69 CKSLKDPCPYIADYSTEDTSSSGYLVDDI-LHLASFSKHAPQSSVQSSVIIGCGRKQTGS 127
CKS CPY Y ++ + V+ ++L + + + V+ +++ GCG G
Sbjct: 234 CKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYKVE-NMMFGCGHWNRGL 292
Query: 128 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQGPAT 182
+ + ++GLG G +S S L L +SFS C D N S + FG+
Sbjct: 293 FSGASG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLL 347
Query: 183 QQS----TSFLPIGEK--YDAYFVGVESYCIGNSCLT----------QSGFQALVDSGAS 226
+ TSF+ E Y++ ++S +G L ++DSG +
Sbjct: 348 NHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPEETWNISPDGAGGTIIDSGTT 407
Query: 227 FTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNAS--SEEMLKVPDMRLIFSKNQ 283
++ Y + KF +K+ + + C+N S E + +P++ + F+
Sbjct: 408 LSYFAEPAYEIIKNKFAEKMKENYLVFRDFPVLDPCFNVSGIEENNIHLPELGIAFADGA 467
Query: 284 SFVVRNHIFSFPENEGFT-----VFCLTVMST-DGDYGIIGQNFMMGHRIVFDRENLKLA 337
+++FP F + CL ++ T + IIG I++D + +L
Sbjct: 468 -------VWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKMSRLG 520
Query: 338 WSHSKCEEV 346
++ +KC ++
Sbjct: 521 FTPTKCADI 529
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 77/337 (22%), Positives = 132/337 (39%), Gaps = 51/337 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLC-------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
+DP++S S +NV+C P C R+ + DPCPY Y + ++ D
Sbjct: 194 FDPATSLSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTG----DLA 249
Query: 98 LHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
L + + AP +S + V+ GCG G + A G+ L S L A G
Sbjct: 250 LEAFTVNLTAPGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFAS--QLRAVYG- 306
Query: 157 IQNSFSICFDENDS---GSVFFGDQG-----PATQQSTSFLPIGEKYDA-YFVGVESYCI 207
++FS C ++ S + FGD P + D Y+V ++ +
Sbjct: 307 --HAFSYCLVDHGSSVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLV 364
Query: 208 GNSCLTQS----------GFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGN 256
G L S ++DSG + ++ Y + F +++ + +
Sbjct: 365 GGEKLNISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFP 424
Query: 257 SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT------VFCLTVMST 310
CYN S E ++VP+ L+F+ ++ FP F + CL V+ T
Sbjct: 425 VLSPCYNVSGVERVEVPEFSLLFADGA-------VWDFPAENYFVRLDPDGIMCLAVLGT 477
Query: 311 -DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
IIG +++D +N +L ++ +C EV
Sbjct: 478 PRSAMSIIGNFQQQNFHVLYDLQNNRLGFAPRRCAEV 514
>gi|301119613|ref|XP_002907534.1| aspartyl protease family A01B, putative [Phytophthora infestans
T30-4]
gi|262106046|gb|EEY64098.1| aspartyl protease family A01B, putative [Phytophthora infestans
T30-4]
Length = 350
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 65/278 (23%), Positives = 125/278 (44%), Gaps = 43/278 (15%)
Query: 93 LVDDILHLASFSKHAPQSSVQSSVI-------IGCGRKQTGSYLDGAAPDGVMGLGLGDV 145
+VD+++ + FS P ++ + +GC K+TG ++ +G+MGLG
Sbjct: 1 MVDELVWVGGFS--TPSDEMEGILKTFGFRFPVGCQTKETGLFIT-QKENGIMGLGRHRS 57
Query: 146 SVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQS-TSFLPIGEKYDAYF---- 199
+V S + AG + QN F++CF D G + FG + S + P+ + AY+
Sbjct: 58 TVMSYMLNAGRVTQNLFTLCF-AGDGGELVFGGVDYSHHTSDVGYTPLLDDKSAYYPVHV 116
Query: 200 --VGVESYCIG-NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFD----KLVSSKRIS 252
+ + +G ++ SG +VDSG + TF ++ + F + S KR+
Sbjct: 117 KDIRMNGVSLGIDAGTINSGRGVIVDSGTTDTFFDSKGSRAFMKAFQNAAGREYSEKRMD 176
Query: 253 LQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG 312
L +++E+ +P + +I S + + P + T V S +G
Sbjct: 177 L-----------TADELAALPTISIILSGMKGDGTEDIQLDIPASSYLTP-SDKVGSYNG 224
Query: 313 DY-------GIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
++ G++G + M+G ++FD EN ++ ++ S C
Sbjct: 225 NFHFSERSGGVLGASTMIGFDVIFDTENKRVGFAESDC 262
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 85/326 (26%), Positives = 129/326 (39%), Gaps = 42/326 (12%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYL 93
D+ ++PS S+S NVSCS C S SS C Y Y + + S G+L
Sbjct: 170 DQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYG-DQSFSVGFL 228
Query: 94 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
D L S S V V GCG G + A G++GLG +S PS A
Sbjct: 229 AKDKFTLTS-------SDVFDGVYFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTAT 278
Query: 154 AGLIQNSFSICFDENDS--GSVFFGDQGPATQQSTSFLPIGEKYD----------AYFVG 201
A FS C + S G + FG G +S F PI D A VG
Sbjct: 279 A--YNKIFSYCLPSSASYTGHLTFGSAG--ISRSVKFTPISTITDGTSFYGLNIVAITVG 334
Query: 202 VESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
+ I ++ + G AL+DSG T LP + YA + F +S + + C
Sbjct: 335 GQKLPIPSTVFSTPG--ALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTC 392
Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVM--STDGDYGII 317
++ S + + +P + FS + + ++F ++ CL S D + I
Sbjct: 393 FDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYAFKISQ----VCLAFAGNSDDSNAAIF 448
Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKC 343
G +V+D ++ ++ + C
Sbjct: 449 GNVQQQTLEVVYDGAGGRVGFAPNGC 474
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 77/337 (22%), Positives = 132/337 (39%), Gaps = 51/337 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLC-------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
+DP++S S +NV+C P C R+ + DPCPY Y + ++ D
Sbjct: 194 FDPAASLSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTG----DLA 249
Query: 98 LHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
L + + AP +S + V+ GCG G + A G+ L S L A G
Sbjct: 250 LEAFTVNLTAPGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFAS--QLRAVYG- 306
Query: 157 IQNSFSICFDENDS---GSVFFGDQG-----PATQQSTSFLPIGEKYDA-YFVGVESYCI 207
++FS C ++ S + FGD P + D Y+V ++ +
Sbjct: 307 --HAFSYCLVDHGSSVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLV 364
Query: 208 GNSCLTQS----------GFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGN 256
G L S ++DSG + ++ Y + F +++ + +
Sbjct: 365 GGEKLNISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFP 424
Query: 257 SWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT------VFCLTVMST 310
CYN S E ++VP+ L+F+ ++ FP F + CL V+ T
Sbjct: 425 VLSPCYNVSGVERVEVPEFSLLFADGA-------VWDFPAENYFVRLDPDGIMCLAVLGT 477
Query: 311 -DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
IIG +++D +N +L ++ +C EV
Sbjct: 478 PRSAMSIIGNFQQQNFHVLYDLQNNRLGFAPRRCAEV 514
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 83/316 (26%), Positives = 139/316 (43%), Gaps = 49/316 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DPS SS+ K + C + CPY Y + + + G LV + + + S S
Sbjct: 107 FDPSKSSTFKEIRC-----------DTHDHSCPYELVYGGK-SYTKGTLVTETVTIHSTS 154
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
Q V IIGCGR +G + G A GV+GL G S+ + G S C
Sbjct: 155 G---QPFVMPETIIGCGRNNSG-FKPGFA--GVVGLDRGPKSL--ITQMGGEYPGLMSYC 206
Query: 165 FDENDSGSVFFGDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQ 218
F + + FG G +T F+ K Y++ +++ +GN+ + G F
Sbjct: 207 FAGKGTSKINFGANAIVAGDGVVSTTVFVKTA-KPGFYYLNLDAVSVGNTRIETVGTPFH 265
Query: 219 AL-----VDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
AL +DSG++ T+ P E Y +V K +++V++ R S CY + + ++ V
Sbjct: 266 ALKGNIVIDSGSTLTYFP-ESYCNLVRKAVEQVVTAVRFP---RSDILCYYSKTIDIFPV 321
Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGDYGIIG----QNFMMGHRI 327
M FS V+ + N G VFCL ++ ++ + I G NF++G
Sbjct: 322 ITMH--FSGGADLVLDKYNMYVASNTG-GVFCLAIICNSPIEEAIFGNRAQNNFLVG--- 375
Query: 328 VFDRENLKLAWSHSKC 343
+D +L +++ + C
Sbjct: 376 -YDSSSLLVSFKPTNC 390
>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
thaliana]
gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 491
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 91/364 (25%), Positives = 143/364 (39%), Gaps = 81/364 (22%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD-------------------PCPYIADYS 83
S + P SS+S SC+ C S + D PCP A
Sbjct: 133 SVFSPLHSSTSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTY 192
Query: 84 TEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLG 143
E SG L DIL + ++ P+ S GC T +Y + P G+ G G G
Sbjct: 193 GEGGLISGILTRDILK--ARTRDVPRFS------FGC---VTSTYRE---PIGIAGFGRG 238
Query: 144 DVSVPSLLAKAGLIQNSFSICF-------DENDSGSVFFGDQGPATQ-----QSTSFLPI 191
+S+PS L G ++ FS CF + N S + G + Q T L
Sbjct: 239 LLSLPSQL---GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNT 295
Query: 192 GEKYDAYFVGVESYCIGNSC------LTQSGFQA------LVDSGASFTFLPTEIYAEVV 239
++Y++G+ES IG + LT F + LVDSG ++T LP Y++++
Sbjct: 296 PMYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLL 355
Query: 240 VKFDKLVSSKRI--SLQGNSWKYCY-------NASSEE---MLKVPDMRLIFSKNQSFVV 287
++ R + + CY N +S E M+ P + F N + ++
Sbjct: 356 TTLQSTITYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLL 415
Query: 288 RN----HIFSFPENEGFTVFCLTVMST-DGDY---GIIGQNFMMGHRIVFDRENLKLAWS 339
+ S P ++G V CL + DGDY G+ G ++V+D E ++ +
Sbjct: 416 PQGNSFYAMSAP-SDGSVVQCLLFQNMEDGDYGPAGVFGSFQQQNVKVVYDLEKERIGFQ 474
Query: 340 HSKC 343
C
Sbjct: 475 AMDC 478
>gi|413936885|gb|AFW71436.1| hypothetical protein ZEAMMB73_738128, partial [Zea mays]
Length = 320
Score = 62.0 bits (149), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 41/142 (28%), Positives = 69/142 (48%), Gaps = 12/142 (8%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYLV 94
L++YDP+ S ++ V C C + S +C S PC + Y + ++++G+ V
Sbjct: 127 ELTQYDPAGSGTT--VGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYG-DGSTTTGFYV 183
Query: 95 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVPSLLA 152
D + S + ++ +S+ GCG Q G L + A DG++G G D S+ S LA
Sbjct: 184 TDFVQYNQVSGNGQTTTSNASITFGCG-AQLGGDLGSSNQALDGILGFGQSDSSMLSQLA 242
Query: 153 KAGLIQNSFSICFDENDSGSVF 174
A ++ F+ C D G +F
Sbjct: 243 AARRVRKIFAHCLDTVRGGGIF 264
>gi|328875414|gb|EGG23778.1| putative aspartyl protease [Dictyostelium fasciculatum]
Length = 507
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 78/321 (24%), Positives = 139/321 (43%), Gaps = 47/321 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSC------KSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
Y PSS+S+ V+CS CK S S + C + Y + + SGY+ +D++
Sbjct: 161 YHPSSTST--KVACSSDQCKGSGSTPPSCSRTSSGESCDFQIRYG-DGSHVSGYIYEDVV 217
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-VP----SLLAK 153
+LA +Q G ++TG + + DG++G G S VP SL++
Sbjct: 218 NLAG---------LQGKANFGANDEETGDF-EYPRADGIIGFGRTCSSCVPTVWDSLVSD 267
Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQS-TSFLPIGEKYDAYF------VGVESYC 206
GL +N F + + GS+ G+ + + P+ +K ++ + + Y
Sbjct: 268 LGL-KNQFGMLLNYEGGGSLSLGEINTSYYTGDIRYTPLVQKNTPFYSVKSTGIRINDYT 326
Query: 207 IGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG-----NSWKYC 261
I S L Q + +VDSG++ L + Y ++ F + S+QG N ++
Sbjct: 327 IPGSKLGQ---EVIVDSGSTALSLASGAYDQLRNYFQ----THYCSIQGVCENPNIFQGS 379
Query: 262 YNASSEEML-KVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
SS+++L K P + F + +N++ P G +C + D I+G
Sbjct: 380 ICYSSDDVLSKFPTLYFTFDGGVQVAIPPKNYLVKAPLTNGKYGYCFMIERADSTMTILG 439
Query: 319 QNFMMGHRIVFDRENLKLAWS 339
FM G+ VFD N ++ ++
Sbjct: 440 DVFMRGYYTVFDNVNDRVGFA 460
>gi|325183198|emb|CCA17656.1| aspartyl protease family A01B putative [Albugo laibachii Nc14]
Length = 656
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 95/409 (23%), Positives = 172/409 (42%), Gaps = 51/409 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL---A 101
++ + SSS + +SC+H S + C + +PC E +S S +++DI++L A
Sbjct: 137 FNTNLSSSIQPISCNHRTYFSCAYCTNPTEPCRTY----MEGSSWSAKVMEDIVYLGDVA 192
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGL-GLGDVSVPSLLAKAGLIQNS 160
S S + + GC K+TG ++ A DG+MG+ G+ V L + + N+
Sbjct: 193 SAKDTNLHHSYSTRYMFGCQNKETGLFIPQVA-DGIMGIHNNGNDIVTKLFREKKIPSNT 251
Query: 161 FSICFDENDSGSVFFGDQGPATQQ-STSFLPI----GEKYDAYF---VGVESYCIGNSCL 212
F++CF G G + ++ I GE Y A F + V + I
Sbjct: 252 FTLCFSPR-GGYFALGAMDTSRHAGEVTYARINDAYGENYYAVFMTDIRVGGHSIDIDMK 310
Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
+ ++ +VDSG + + + ++ + L K L N C S ++ ++
Sbjct: 311 ATNSYRYIVDSGTTNSIISGRAGQALMDLYRNLTHLKN-PLNDND---CILLSPSQIEQL 366
Query: 273 PDMRLIFS-----KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
P ++ + + ++ + EN T F + V T G+IG + MM H +
Sbjct: 367 PTLQFVMEGVNGDRAILEILASQYLQKGENNK-TCFNILV-DTRKIGGVIGASMMMNHDV 424
Query: 328 VFDRENLKLAWSHSKCEEVID---KSHVHLVP--------PPAGQSPNP--LPTTEQQST 374
+FDR K+ + + C D SH + +P P + QS N E++
Sbjct: 425 IFDRSQNKVGFVPANCTFAGDTEPNSHKNAIPSDDANGALPVSKQSNNKSNENAEEKKGL 484
Query: 375 SNGQAAAPPSTAKTAPS-----KSIAASAQQLDS----VLRVACSLLVL 414
SN P ++PS KS Q+++ ++++ +LLVL
Sbjct: 485 SNDTHTDPVVEPVSSPSLEGETKSANVKLQEVEKERPIIVKLVGTLLVL 533
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 82/339 (24%), Positives = 139/339 (41%), Gaps = 39/339 (11%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGY 92
++N YDP S S +N++C+ P C+ SS CK CPY Y ++ +
Sbjct: 232 EQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDF 291
Query: 93 LVDDI-LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
++ ++L S + + +V+ GCG G + A ++GLG G +S S L
Sbjct: 292 ALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL 348
Query: 152 AKAGLIQNSFSICFDENDSGS------VFFGDQGPATQQSTSFL--------PIGEKY-- 195
L +SFS C + DS + +F D+ T +F P+ Y
Sbjct: 349 --QSLYGHSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYL 406
Query: 196 --DAYFVGVESYCIGNSCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 250
+ FVG E I S A ++DSG + ++ Y + F + V +
Sbjct: 407 QIKSIFVGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYK 466
Query: 251 ISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN--QSFVVRNHIFSFPENEGFTVFCLTVM 308
+ CYN S + L P+ + F+ +F V N+ F + + CL ++
Sbjct: 467 LVEDFPILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENY---FIRIQQLDIVCLAML 523
Query: 309 ST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
T IIG I++D +N +L ++ +C E+
Sbjct: 524 GTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMRCAEI 562
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 83/316 (26%), Positives = 139/316 (43%), Gaps = 49/316 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DPS SS+ K + C + CPY Y + + + G LV + + + S S
Sbjct: 101 FDPSKSSTFKEIRC-----------DTHDHSCPYELVYGGK-SYTKGTLVTETVTIHSTS 148
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
Q V IIGCGR +G + G A GV+GL G S+ + G S C
Sbjct: 149 G---QPFVMPETIIGCGRNNSG-FKPGFA--GVVGLDRGPKSL--ITQMGGEYPGLMSYC 200
Query: 165 FDENDSGSVFFGDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQ 218
F + + FG G +T F+ K Y++ +++ +GN+ + G F
Sbjct: 201 FAGKGTSKINFGANAIVAGDGVVSTTVFVKTA-KPGFYYLNLDAVSVGNTRIETVGTPFH 259
Query: 219 AL-----VDSGASFTFLPTEIYAEVVVK-FDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
AL +DSG++ T+ P E Y +V K +++V++ R S CY + + ++ V
Sbjct: 260 ALKGNIVIDSGSTLTYFP-ESYCNLVRKAVEQVVTAVRFP---RSDILCYYSKTIDIFPV 315
Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGDYGIIG----QNFMMGHRI 327
M FS V+ + N G VFCL ++ ++ + I G NF++G
Sbjct: 316 ITMH--FSGGADLVLDKYNMYVASNTG-GVFCLAIICNSPIEEAIFGNRAQNNFLVG--- 369
Query: 328 VFDRENLKLAWSHSKC 343
+D +L +++ + C
Sbjct: 370 -YDSSSLLVSFKPTNC 384
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 77/328 (23%), Positives = 132/328 (40%), Gaps = 47/328 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--------RSSCKSLKDP-CPYIADYSTEDTSSSGYLVD 95
+DP+SS S + C+ C + +C + P C Y Y + + S G L
Sbjct: 166 FDPASSPSYAVLPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSYR-DGSYSQGVLAH 224
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS-LLAKA 154
D L LA V + GCG G + G+MGLG +S+ S + +
Sbjct: 225 DKLSLAG--------EVIDGFVFGCGTSNQGPF---GGTSGLMGLGRSQLSLISQTMDQF 273
Query: 155 GLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFL-------PIGEKYDAYFVGVES 204
G + FS C + SGS+ GD + ST + P+ + YFV +
Sbjct: 274 GGV---FSYCLPLKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPF--YFVNLTG 328
Query: 205 YCIGNSCLTQSGFQALVDSGASFTFLPTEIY----AEVVVKFDKLVSSKRISLQGNSWKY 260
IG + S + +VDSG T L +Y AE + +F + + S+
Sbjct: 329 ITIGGQEVESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSI----LDT 384
Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY--GIIG 318
C+N + +++P ++ +F N V + + + + CL + S +Y IIG
Sbjct: 385 CFNLTGFREVQIPSLKFVFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIG 444
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEV 346
R++FD ++ ++ C+ +
Sbjct: 445 NYQQKNLRVIFDTLGSQIGFAQETCDYI 472
>gi|110738505|dbj|BAF01178.1| hypothetical protein [Arabidopsis thaliana]
Length = 284
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 42/128 (32%), Positives = 66/128 (51%), Gaps = 12/128 (9%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++ P SS+ + V C+ +C ++ C Y +Y+ E +SS G L +D++ +
Sbjct: 134 KFQPEMSSTYQPVKCNM-----DCNCDDDREQCVYEREYA-EHSSSKGVLGEDLISFGNE 187
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S+ PQ +V GC +TG A DG++GLG GD+S+ L GLI NSF +
Sbjct: 188 SQLTPQRAV-----FGCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGL 241
Query: 164 CFDENDSG 171
C+ D G
Sbjct: 242 CYGGMDVG 249
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 77/328 (23%), Positives = 132/328 (40%), Gaps = 47/328 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--------RSSCKSLKDP-CPYIADYSTEDTSSSGYLVD 95
+DP+SS S + C+ C + +C + P C Y Y + + S G L
Sbjct: 167 FDPASSPSYAVLPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSYR-DGSYSQGVLAH 225
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS-LLAKA 154
D L LA V + GCG G + G+MGLG +S+ S + +
Sbjct: 226 DKLSLAG--------EVIDGFVFGCGTSNQGPF---GGTSGLMGLGRSQLSLISQTMDQF 274
Query: 155 GLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFL-------PIGEKYDAYFVGVES 204
G + FS C + SGS+ GD + ST + P+ + YFV +
Sbjct: 275 GGV---FSYCLPLKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPF--YFVNLTG 329
Query: 205 YCIGNSCLTQSGFQALVDSGASFTFLPTEIY----AEVVVKFDKLVSSKRISLQGNSWKY 260
IG + S + +VDSG T L +Y AE + +F + + S+
Sbjct: 330 ITIGGQEVESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSI----LDT 385
Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY--GIIG 318
C+N + +++P ++ +F N V + + + + CL + S +Y IIG
Sbjct: 386 CFNLTGFREVQIPSLKFVFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIG 445
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEV 346
R++FD ++ ++ C+ +
Sbjct: 446 NYQQKNLRVIFDTLGSQIGFAQETCDYI 473
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 81/329 (24%), Positives = 137/329 (41%), Gaps = 48/329 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DPSSSS+ V CS C S C S C Y Y + +S+ G L + LA
Sbjct: 137 FDPSSSSTYATVPCSSASCSDLPTSKCTS-ASKCGYTYTYG-DSSSTQGVLATETFTLAK 194
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S V+ GCG G A G++GLG G +S L+++ GL + FS
Sbjct: 195 --------SKLPGVVFGCGDTNEGDGFSQGA--GLVGLGRGPLS---LVSQLGL--DKFS 239
Query: 163 ICF---DENDSGSVFFGDQGPATQ--------QSTSFLPIGEKYDAYFVGVESYCIGNS- 210
C D+ ++ + G ++ Q+T + + Y+V +++ +G++
Sbjct: 240 YCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTR 299
Query: 211 -CLTQSGFQA--------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
L S F +VDSG S T+L + Y + F ++ G C
Sbjct: 300 ISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLC 359
Query: 262 YNASSEEMLKVPDMRLIF----SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
+ A ++ + +V RL+F + N++ + G CLTVM + G II
Sbjct: 360 FRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMV---LDGGSGALCLTVMGSRG-LSII 415
Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
G + V+D + L+++ +C ++
Sbjct: 416 GNFQQQNFQFVYDVGHDTLSFAPVQCNKL 444
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 83/325 (25%), Positives = 134/325 (41%), Gaps = 43/325 (13%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
D+ S +DPS SSS +SC C SSC S C Y Y + T++ G L+++
Sbjct: 223 DQPDSIFDPSQSSSYTLLSCETKHCNLLPNSSC-SDDGYCRYNITYK-DGTNTEGVLINE 280
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
+ S S V +GC K G ++ DG GLG G +S PS + +
Sbjct: 281 TVSFES-------SGWVDRVSLGCSNKNQGPFV---GSDGTFGLGRGSLSFPSRINAS-- 328
Query: 157 IQNSFSICFDENDSG----SVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG---- 208
S S C E+ G ++ F + L + + Y+VG++ +G
Sbjct: 329 ---SMSYCLVESKDGYSSSTLEFNSPPCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKI 385
Query: 209 ---NSCLTQSGFQ---ALVDSGASFTFLPTEIYAEV----VVKFDKLVSSKRISLQGNSW 258
NS T + +V S + T L + Y V V K L K LQ ++
Sbjct: 386 DVPNSTFTIDPYGNGGMIVSSSSLITMLENDTYNVVRDAFVAKTQHLERLKAF-LQFDT- 443
Query: 259 KYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
CYN SS +++P + + +S+++ + + ++ T FC + G + I+G
Sbjct: 444 --CYNLSSNNTVELPILEFEVNDGKSWLLPKESYLYAVDKNGT-FCFAFAPSKGSFSILG 500
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKC 343
G R+ FD N + H+ C
Sbjct: 501 TLQQYGTRVTFDLVN-SFVYLHTLC 524
>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
Length = 419
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 91/364 (25%), Positives = 143/364 (39%), Gaps = 81/364 (22%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKD-------------------PCPYIADYS 83
S + P SSSS SC+ C S + D PCP A
Sbjct: 61 SIFSPLHSSSSFRASCASSFCAEIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTY 120
Query: 84 TEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLG 143
E SG L DIL + ++ P+ S GC T +Y + P G+ G G G
Sbjct: 121 GEGGLVSGILTRDILK--ARTRDVPRFS------FGC---VTSTYHE---PIGIAGFGRG 166
Query: 144 DVSVPSLLAKAGLIQNSFSICF-------DENDSGSVFFGDQGPATQ-----QSTSFLPI 191
+S+PS L G ++ FS CF + N S + G + Q T L
Sbjct: 167 LLSLPSQL---GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNT 223
Query: 192 GEKYDAYFVGVESYCIGNSC------LTQSGFQA------LVDSGASFTFLPTEIYAEVV 239
++Y++G+ES IG + LT F + LVDSG ++T LP Y++++
Sbjct: 224 PVYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQLL 283
Query: 240 VKFDKLVSSKRI--SLQGNSWKYCY-------NASSEE---MLKVPDMRLIFSKNQSFVV 287
++ R + + CY N +S E M+ P + F N + ++
Sbjct: 284 TILQSTITYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMVFPSITFNFLNNATLLL 343
Query: 288 RN----HIFSFPENEGFTVFCLTVMST-DGDY---GIIGQNFMMGHRIVFDRENLKLAWS 339
+ S P ++G V CL + DG+Y G+ G ++V+D E ++ +
Sbjct: 344 PQGNSFYAMSAP-SDGSVVQCLLFQNMEDGNYGPAGVFGSFQQQNVKVVYDLEKERIGFQ 402
Query: 340 HSKC 343
C
Sbjct: 403 AMDC 406
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 78/318 (24%), Positives = 127/318 (39%), Gaps = 43/318 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DP SS+S + C P CKS + C Y Y + + + G + + L
Sbjct: 191 FDPISSNSYSPIRCDEPQCKSLDLSECRNGTCLYEVSYG-DGSYTVGEFATETVTLG--- 246
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
S+ +V IGCG G ++ A G+ G L S P A + SFS C
Sbjct: 247 -----SAAVENVAIGCGHNNEGLFVGAAGLLGLGGGKL---SFP-----AQVNATSFSYC 293
Query: 165 FDENDSGSVF---FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQA 219
DS +V F P + + E Y++G++ +G L +S F+
Sbjct: 294 LVNRDSDAVSTLEFNSPLPRNAATAPLMRNPELDTFYYLGLKGISVGGEALPIPESSFEV 353
Query: 220 --------LVDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNASSE 267
++DSG + T L +E+Y + F K + + +SL + CY+ SS
Sbjct: 354 DAIGGGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSL----FDTCYDLSSR 409
Query: 268 EMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 325
E +++P + F + + + RN++ + FC T IIG G
Sbjct: 410 ESVEIPTVSFRFPEGRELPLPARNYLIPV---DSVGTFCFAFAPTTSSLSIIGNVQQQGT 466
Query: 326 RIVFDRENLKLAWSHSKC 343
R+ FD N + +S C
Sbjct: 467 RVGFDIANSLVGFSVDSC 484
>gi|172034220|gb|ACB69715.1| putative nucellin-like aspartic protease [Hordeum vulgare]
Length = 310
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 63/248 (25%), Positives = 115/248 (46%), Gaps = 19/248 (7%)
Query: 113 QSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF--DEND 169
++S ++G Q G L A G++GL +S+PS LA G+I N F C + N
Sbjct: 11 KASFVLGVTFDQQGQLLSSPAKTSGILGLSSAAISLPSQLASKGIISNVFGHCITRETNG 70
Query: 170 SGSVFFGDQGPATQQSTSFLPI-GEKYDAYFVGVESYCIGNSCLTQSGF--QALVDSGAS 226
G +F GD + ++ PI G + Y + G+ L +G Q + G S
Sbjct: 71 GGYMFLGDD-YVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQEL-HAGIPVQVISRCGTS 128
Query: 227 FTFLPTEIYAEVV--VKFD--KLVSSKRISLQGNSWKYCYNASS-EEMLKVPDMRLIFSK 281
+T+LP E+Y ++ +K D V + WK ++ S + L + R F
Sbjct: 129 YTYLPEEMYKNLIDAIKEDSPSFVQDSSDTTLPLCWKADFSVRSFFKPLNLHFGRRWFVV 188
Query: 282 NQSFVVRNHIFSFPENEGFTVFCLTVMS-TDGDYG---IIGQNFMMGHRIVFDRENLKLA 337
++F + + ++G CL +++ T+ ++G I+G + G +V+D E ++
Sbjct: 189 PKTFTIVPDDYLIISDKGNV--CLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNERRQIG 246
Query: 338 WSHSKCEE 345
W++S+C +
Sbjct: 247 WANSECTK 254
>gi|213998824|gb|ACJ60779.1| nucellin [Hordeum chilense]
Length = 140
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 66/135 (48%), Gaps = 4/135 (2%)
Query: 116 VIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDSGSV 173
+ GCG KQ +P DG++GLG+G + L +I N C G +
Sbjct: 1 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 60
Query: 174 FFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTFLPT 232
+FGD P ++ T ++P+ E Y G+ I N + F+A+ DSG+++T +P
Sbjct: 61 YFGDFNPPSRGVT-WVPMKESXXYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPA 119
Query: 233 EIYAEVVVKFDKLVS 247
+IY E+V K +S
Sbjct: 120 QIYNEIVSKVRGTLS 134
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 82/339 (24%), Positives = 139/339 (41%), Gaps = 39/339 (11%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGY 92
++N YDP S S +N++C+ P C+ SS CK CPY Y ++ +
Sbjct: 232 EQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDF 291
Query: 93 LVDDI-LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
++ ++L S + + +V+ GCG G + A ++GLG G +S S L
Sbjct: 292 ALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL 348
Query: 152 AKAGLIQNSFSICFDENDSGS------VFFGDQGPATQQSTSFL--------PIGEKY-- 195
L +SFS C + DS + +F D+ T +F P+ Y
Sbjct: 349 --QSLYGHSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYL 406
Query: 196 --DAYFVGVESYCIGNSCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 250
+ FVG E I S A ++DSG + ++ Y + F + V +
Sbjct: 407 QIKSIFVGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYK 466
Query: 251 ISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN--QSFVVRNHIFSFPENEGFTVFCLTVM 308
+ CYN S + L P+ + F+ +F V N+ F + + CL ++
Sbjct: 467 LVEDFPILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENY---FIRIQQLDIVCLAML 523
Query: 309 ST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
T IIG I++D +N +L ++ +C E+
Sbjct: 524 GTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMRCAEI 562
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 80/319 (25%), Positives = 129/319 (40%), Gaps = 44/319 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
YDP +SS+ V CS C + S+C S+++ C Y A Y + + S GYL D
Sbjct: 177 YDPRASSTYATVPCSASQCDELQAATLNPSAC-SVRNVCIYQASYG-DSSFSVGYLSRDT 234
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
+ S S + GCG+ G + A G++GL +S+ LA + +
Sbjct: 235 VSFGSGSY--------PNFYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--L 281
Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK-YDA--YFVGVESYCIGNSCLT- 213
SFS C S + GP T S+ P+ DA YFV + +G S L
Sbjct: 282 GYSFSYCLPT--PASTGYLSIGPYTSGHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAV 339
Query: 214 ----QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG----NSWKYCYNAS 265
S ++DSG T LPT +Y K V++ + +Q + C+
Sbjct: 340 SPAEYSSLPTIIDSGTVITRLPTAVY----TALSKAVAAAMVGVQSAPAFSILDTCFQGQ 395
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 325
+ + L+VP + + F+ + + ++ T CL TD IIG
Sbjct: 396 ASQ-LRVPAVAMAFAGGATLKLATQNVLIDVDDSTT--CLAFAPTDSTT-IIGNTQQQTF 451
Query: 326 RIVFDRENLKLAWSHSKCE 344
+V+D ++ ++ C
Sbjct: 452 SVVYDVAQSRIGFAAGGCS 470
>gi|452821303|gb|EME28335.1| aspartyl protease isoform 2 [Galdieria sulphuraria]
Length = 532
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 78/356 (21%), Positives = 151/356 (42%), Gaps = 70/356 (19%)
Query: 37 VQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSL--------------KDPCPYIADY 82
V DR Y+ ++S++ +SC+ P C + ++C C + +Y
Sbjct: 193 VPDR----YNLANSTTGTVISCNSPTCGA-NTCNQQICSSCSSSQACCSENGICGFFIEY 247
Query: 83 STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL 142
+ T+++G L DI+ + +S A + + +T ++L G A GV+GL
Sbjct: 248 G-DGTTATGALYQDIVTVGEYSVQATFAGADT---------ETANFLVGKAA-GVLGLAY 296
Query: 143 GDVS--------VPSLLAKAGLIQNSFSICFDENDSGSVFFGD------QGPATQQSTSF 188
+S V L ++ + N FS+ ++ D G+ G +GP S +
Sbjct: 297 SSLSCNPTCISPVFHQLVESFSLPNIFSVLINQ-DIGAFVVGGVNSSLYEGPIEYSSLAN 355
Query: 189 LPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS 248
+ YD V +ES + ++ L+ F A+VD+G + I+ + F +
Sbjct: 356 EQNPQFYD---VTIESVQVNSNSLSIPSFNAIVDTGTTLIVASPYIFDALKEYFQTNFCN 412
Query: 249 -----KRISLQGNSW---KYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENE 298
S G +W YC N + EE+ ++PD+ + + + +++F N
Sbjct: 413 VPGLCPSSSNPGVTWFGTDYCVNLTPEELSQLPDIEFSLAGGVTLSLGPEHYMFHVSSNN 472
Query: 299 GFTV----FCLTVM--------STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 342
F+ +CL + ++DG+ I+G + + +VFDREN ++ ++ K
Sbjct: 473 IFSAASGSYCLGIQPSSQNLGPTSDGNEMILGNTLQLKYYLVFDRENKRIGFAKGK 528
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 81/329 (24%), Positives = 137/329 (41%), Gaps = 48/329 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DPSSSS+ V CS C S C S C Y Y + +S+ G L + LA
Sbjct: 116 FDPSSSSTYATVPCSSASCSDLPTSKCTS-ASKCGYTYTYG-DSSSTQGVLATETFTLAK 173
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S V+ GCG G A G++GLG G +S L+++ GL + FS
Sbjct: 174 --------SKLPGVVFGCGDTNEGDGFSQGA--GLVGLGRGPLS---LVSQLGL--DKFS 218
Query: 163 ICF---DENDSGSVFFGDQGPATQ--------QSTSFLPIGEKYDAYFVGVESYCIGNS- 210
C D+ ++ + G ++ Q+T + + Y+V +++ +G++
Sbjct: 219 YCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTR 278
Query: 211 -CLTQSGFQA--------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
L S F +VDSG S T+L + Y + F ++ G C
Sbjct: 279 ISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLC 338
Query: 262 YNASSEEMLKVPDMRLIF----SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
+ A ++ + +V RL+F + N++ + G CLTVM + G II
Sbjct: 339 FRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMV---LDGGSGALCLTVMGSRG-LSII 394
Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
G + V+D + L+++ +C ++
Sbjct: 395 GNFQQQNFQFVYDVGHDTLSFAPVQCNKL 423
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 81/329 (24%), Positives = 137/329 (41%), Gaps = 48/329 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DPSSSS+ V CS C S C S C Y Y + +S+ G L + LA
Sbjct: 147 FDPSSSSTYATVPCSSASCSDLPTSKCTS-ASKCGYTYTYG-DSSSTQGVLATETFTLAK 204
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S V+ GCG G A G++GLG G +S L+++ GL + FS
Sbjct: 205 --------SKLPGVVFGCGDTNEGDGFSQGA--GLVGLGRGPLS---LVSQLGL--DKFS 249
Query: 163 ICF---DENDSGSVFFGDQGPATQ--------QSTSFLPIGEKYDAYFVGVESYCIGNS- 210
C D+ ++ + G ++ Q+T + + Y+V +++ +G++
Sbjct: 250 YCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTR 309
Query: 211 -CLTQSGFQA--------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
L S F +VDSG S T+L + Y + F ++ G C
Sbjct: 310 ISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLC 369
Query: 262 YNASSEEMLKVPDMRLIF----SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
+ A ++ + +V RL+F + N++ + G CLTVM + G II
Sbjct: 370 FRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMV---LDGGSGALCLTVMGSRG-LSII 425
Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
G + V+D + L+++ +C ++
Sbjct: 426 GNFQQQNFQFVYDVGHDTLSFAPVQCNKL 454
>gi|297820902|ref|XP_002878334.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
lyrata]
gi|297324172|gb|EFH54593.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
lyrata]
Length = 362
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 65/128 (50%), Gaps = 12/128 (9%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++ P SS+ + V C+ +C K+ C Y +Y+ E +SS G L +D++ +
Sbjct: 163 KFQPELSSTYQPVKCNM-----DCNCDDDKEQCVYEREYA-EHSSSKGVLGEDLISFGNE 216
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S PQ +V GC +TG A DG++GLG GD+S+ L GLI NSF +
Sbjct: 217 SHLTPQRAV-----FGCKTVETGDLYSQRA-DGIIGLGQGDLSLVGQLVDKGLISNSFGL 270
Query: 164 CFDENDSG 171
C+ D G
Sbjct: 271 CYGGLDVG 278
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 81/329 (24%), Positives = 137/329 (41%), Gaps = 48/329 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DPSSSS+ V CS C S C S C Y Y + +S+ G L + LA
Sbjct: 209 FDPSSSSTYATVPCSSASCSDLPTSKCTSASK-CGYTYTYG-DSSSTQGVLATETFTLAK 266
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S V+ GCG G A G++GLG G +S L+++ GL + FS
Sbjct: 267 --------SKLPGVVFGCGDTNEGDGFSQGA--GLVGLGRGPLS---LVSQLGL--DKFS 311
Query: 163 ICF---DENDSGSVFFGD--------QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS- 210
C D+ ++ + G ++ Q+T + + Y+V +++ +G++
Sbjct: 312 YCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTR 371
Query: 211 -CLTQSGFQA--------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
L S F +VDSG S T+L + Y + F ++ G C
Sbjct: 372 ISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLC 431
Query: 262 YNASSEEMLKVPDMRLIF----SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
+ A ++ + +V RL+F + N++ + G CLTVM + G II
Sbjct: 432 FRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMV---LDGGSGALCLTVMGSRG-LSII 487
Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
G + V+D + L+++ +C ++
Sbjct: 488 GNFQQQNFQFVYDVGHDTLSFAPVQCNKL 516
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 78/327 (23%), Positives = 124/327 (37%), Gaps = 43/327 (13%)
Query: 48 SSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSS-GYLVDDILHLA 101
++S S ++CS C S ++C S PC Y DY D S++ G + D +A
Sbjct: 150 AASKSWAPIACSSDTCTSYVPFSLANCSSPASPCAY--DYRYRDGSAARGVVGTDSATIA 207
Query: 102 --------SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
+ + V++GC G + DGV+ LG ++S S
Sbjct: 208 LSSGSGRGGGDSSGGRRAKLQGVVLGCAATYDGQSFQSS--DGVLSLGNSNISFASR--A 263
Query: 154 AGLIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG 208
A FS C N + + FG A T L Y V V++ +
Sbjct: 264 AARFGGRFSYCLVDHLAPRNATSYLTFGPGATAPAAQTPLLLDRRMTPFYAVTVDAVYVA 323
Query: 209 NSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWK 259
L A++DSG S T L T Y VV K L R+++ + ++
Sbjct: 324 GEALDIPADVWDVDRNGGAILDSGTSLTILATPAYRAVVTALSKHLAGLPRVTM--DPFE 381
Query: 260 YCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY---GI 316
YCYN + L++P M + F+ + + G V C+ V +G + +
Sbjct: 382 YCYNWTDAGALEIPKMEVHFAGSARLEPPAKSYVIDAAPG--VKCIGVQ--EGSWPGVSV 437
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKC 343
IG H FD + L + H++C
Sbjct: 438 IGNILQQEHLWEFDLRDRWLRFKHTRC 464
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 83/346 (23%), Positives = 145/346 (41%), Gaps = 67/346 (19%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDI 97
S +DPS SSS + C+HPLCK R +L C + + + + T + G LV +
Sbjct: 122 SVFDPSLSSSFSVLPCNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREK 181
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
+ +FS+ S +I+GC + + + G++G+ LG +S S +A L
Sbjct: 182 I---TFSR----SQSTPPLILGCAEESSDA-------KGILGMNLGRLSFAS---QAKLT 224
Query: 158 QNSFSICFDEND-------SGSVFFGDQGPA-------------TQQSTSFLPIGEKYDA 197
+ FS C +GS + G+ + +Q+ + P+ A
Sbjct: 225 K--FSYCVPTRQVRPGFTPTGSFYLGENPNSGGFRYINLLTFSQSQRMPNLDPL-----A 277
Query: 198 YFVGVESYCIGNSCLT--QSGF--------QALVDSGASFTFLPTEIYAEVVVKFDKLVS 247
Y V ++ IGN L S F Q ++DSG+ FT+L E Y +V + +LV
Sbjct: 278 YTVAMQGIRIGNQKLNIPISAFRPDPSGAGQTMIDSGSEFTYLVDEAYNKVREEVVRLVG 337
Query: 248 S--KRISLQGNSWKYCYNASSEEMLK-VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFC 304
+ K+ + G C+N ++ E+ + + +M F K VV + G V C
Sbjct: 338 ARLKKGYVYGGVSDMCFNGNAIEIGRLIGNMVFEFDKGVEIVVEKE--RVLADVGGGVHC 395
Query: 305 LTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
+ + ++ IIG + FD N ++ + + C +
Sbjct: 396 VGIGRSEMLGAASNIIGNFHQQNIWVEFDLANRRVGFGKADCSRSV 441
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 76/326 (23%), Positives = 124/326 (38%), Gaps = 36/326 (11%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
L +D ++S++ ++V+CS PLC + S C Y++ Y S +L D
Sbjct: 132 LPRFDTAASNTVRSVACSDPLCNAHSEHGCFLHGCTYVSGYGDGSLSFGHFLRDSF---- 187
Query: 102 SFSKHAPQSSVQSSVI-IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+F V I GCG G +L G+ G G G +S+PS L
Sbjct: 188 TFDDGKGGGKVTVPDIGFGCGMYNAGRFLQ--TETGIAGFGRGPLSLPSQLK-----VRQ 240
Query: 161 FSICFD---ENDSGSVFFGDQGPATQQ------STSF---LPIGEKYDAYFVGVESYCIG 208
FS CF E S VF G G ST F LP G Y + + +G
Sbjct: 241 FSYCFTTRFEAKSSPVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVG 300
Query: 209 NSCLTQSGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
+ L +A +DSG T P ++ ++ F + ++ + C+
Sbjct: 301 KTRLPVPEIKADGSGATFIDSGTDITTFPDAVFRQLKSAFIAQAALP-VNKTADEDDICF 359
Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG--DYGIIGQN 320
+ ++ +P + + R + + G C+ V ST G D +IG
Sbjct: 360 SWDGKKTAAMPKLVFHLEGADWDLPRENYVTEDRESG--QVCVAV-STSGQMDRTLIGNF 416
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEV 346
IV+D KL ++C+++
Sbjct: 417 QQQNTHIVYDLAAGKLLLVPAQCDKL 442
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 68/250 (27%), Positives = 105/250 (42%), Gaps = 22/250 (8%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRSSCKS--LKDPCPYIADYSTEDTSSSGYLVDDILHL 100
S YDPS S SS SCS P C + + + C Y+ Y + +S+SG + D+L L
Sbjct: 188 SFYDPSRSPSSAPFSCSSPTCTALGPYANGCANNQCQYLVRYP-DGSSTSGAYIADLLTL 246
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+ + S GC + GS+ AA G+M LG G S+ L A N+
Sbjct: 247 DA-------GNAVSGFKFGCSHAEQGSFDARAA--GIMALGGGPESL--LSQTASRYGNA 295
Query: 161 FSICFDENDSGSVFFGDQGPATQQS----TSFLPIGEKYDAYFVGVESYCIGNSCL--TQ 214
FS C S S FF P S T + + Y V + + +G L
Sbjct: 296 FSYCIPATASDSGFFTLGVPRRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAP 355
Query: 215 SGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
+ F A ++DS + T LP Y + F ++ R + CY+ + +++
Sbjct: 356 AVFAAGSVLDSRTAITRLPPTAYQALRSAFRSSMTMYRSAPPKGYLDTCYDFTGVVNIRL 415
Query: 273 PDMRLIFSKN 282
P + L+F +N
Sbjct: 416 PKISLVFDRN 425
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 82/343 (23%), Positives = 137/343 (39%), Gaps = 59/343 (17%)
Query: 45 YDPSSSSSSKNVSCSHPLC-------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
+DP++SSS +NV+C C R+ + +D CPY Y + ++
Sbjct: 193 FDPAASSSYRNVTCGDQRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGD------ 246
Query: 98 LHLASFSKH--APQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
L L SF+ + AP +S + V+ GCG G + A G+ L S L A
Sbjct: 247 LALESFTVNLTAPGASRRVDDVVFGCGHWNRGLFHGAAGLLGLGRGPLSFAS--QLRAVY 304
Query: 155 GLIQNSFSICFDENDS---GSVFFGDQGPATQQS-------TSFLPIGEKYDA-YFVGVE 203
G ++FS C ++ S V FG+ + T+F P D Y+V ++
Sbjct: 305 G---HTFSYCLVDHGSDVASKVVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLK 361
Query: 204 SYCIGNSCLTQSG------------FQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKR 250
+G L S ++DSG + ++ Y + F D++ S
Sbjct: 362 GVLVGGELLNISSDTWGVGEGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYP 421
Query: 251 ISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT------VFC 304
+ CYN S + +VP++ L+F+ ++ FP F + C
Sbjct: 422 LIPDFPVLSPCYNVSGVDRPEVPELSLLFADGA-------VWDFPAENYFIRLDPDGIMC 474
Query: 305 LTVMST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
L V+ T IIG +V+D +N +L ++ +C EV
Sbjct: 475 LAVLGTPRTGMSIIGNFQQQNFHVVYDLKNNRLGFAPRRCAEV 517
>gi|115484513|ref|NP_001065918.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|122221757|sp|Q0IU52.1|ASP1_ORYSJ RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
gi|33340111|gb|AAQ14543.1|AF308691_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|33340113|gb|AAQ14544.1|AF308692_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|62954898|gb|AAY23267.1| nucellin-like protein [Oryza sativa Japonica Group]
gi|77548967|gb|ABA91764.1| Aspartic proteinase Asp1 precursor, putative, expressed [Oryza
sativa Japonica Group]
gi|113644622|dbj|BAF27763.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|215766817|dbj|BAG99045.1| unnamed protein product [Oryza sativa Japonica Group]
gi|385717694|gb|AFI71282.1| aspartic proteinase [Oryza sativa Japonica Group]
Length = 410
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 68/324 (20%), Positives = 133/324 (41%), Gaps = 42/324 (12%)
Query: 54 KNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 106
K V+C+ LC C S K C Y+ Y D+SS G LV D FS
Sbjct: 87 KLVTCADSLCTDLYTDLGKPKRCGSQKQ-CDYVIQYV--DSSSMGVLVID-----RFSLS 138
Query: 107 APQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSIC 164
A + +++ GCG Q + P D ++GL G V++ S L G+I ++ C
Sbjct: 139 ASNGTNPTTIAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHC 198
Query: 165 FDENDSGSVFFGD-QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDS 223
G +FFGD Q P + + + + KY + G + + ++ + + DS
Sbjct: 199 ISSKGGGFLFFGDAQVPTSGVTWTPMNREHKYYSPGHGTLHFDSNSKAISAAPMAVIFDS 258
Query: 224 GASFTFLPTEIYAEVVVKFDKLVSSK-----RISLQGNSWKYCYNASSEEMLKVPDMRLI 278
GA++T+ + Y + ++S+ ++ + + C+ ++++ + +++
Sbjct: 259 GATYTYFAAQPYQATLSVVKSTLNSECKFLTEVTEKDRALTVCWKGK-DKIVTIDEVKKC 317
Query: 279 FS----------KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY------GIIGQNFM 322
F K + + + EG CL ++ ++ +IG M
Sbjct: 318 FRSLSLEFADGDKKATLEIPPEHYLIISQEGHV--CLGILDGSKEHLSLAGTNLIGGITM 375
Query: 323 MGHRIVFDRENLKLAWSHSKCEEV 346
+ +++D E L W + +C+ +
Sbjct: 376 LDQMVIYDSERSLLGWVNYQCDRI 399
>gi|449449906|ref|XP_004142705.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449500739|ref|XP_004161182.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 410
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 86/339 (25%), Positives = 142/339 (41%), Gaps = 47/339 (13%)
Query: 33 GASIVQDRNLSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDT 87
G ++ DR Y P ++ V C PLC S+S CK+ D C Y +Y+ +
Sbjct: 89 GCTLPHDR---LYKPHNNV----VRCGEPLCSALFSASKSPCKNPNDQCDYEVEYA-DHG 140
Query: 88 SSSGYLVDDI--LHLASFSKHAPQSSVQSSVIIGCGRKQ--TGSYLDGAAPDGVMGLGLG 143
SS G LV D L L + + AP ++ GCG Q GS L GV+GLG
Sbjct: 141 SSIGVLVKDPVPLRLTNGTILAP------NLGFGCGYDQHNGGSQLPPLT-AGVLGLGNS 193
Query: 144 DVSVPSLLAKAGLIQNSFSIC-FDENDSGSVFFGDQGPATQQSTSFLPI----GEKYDAY 198
++ + L+ ++N C + F GD P++ S++PI G KY A
Sbjct: 194 KATMATQLSALSHVRNVLGHCFSGQGGGFLFFGGDLVPSS--GMSWMPILRTPGGKYSA- 250
Query: 199 FVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGN 256
G G + + G DSG+S+T+ +++Y V+ + + R + +
Sbjct: 251 --GPAEVYFGGNPVGIRGLILTFDSGSSYTYFNSQVYGAVLNLLRNGLKGQPLRDAPEDK 308
Query: 257 SWKYCYNASSEEMLKVPDMRLIFSK-NQSFVVRNHIFSFPENEGFTV-----FCLTVMST 310
+ C+ S+ V D+R F SF F P + CL +++
Sbjct: 309 TLPICWKG-SKAFKSVADVRNFFKPLALSFGNSKVQFQIPPEAYLIISNLGNVCLGILNG 367
Query: 311 D----GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
G+ +IG M+ +V+D E ++ W+ + C +
Sbjct: 368 SQVGLGNVNLIGDISMLDKMMVYDNERQQIGWAPANCSK 406
>gi|255637574|gb|ACU19113.1| unknown [Glycine max]
Length = 290
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 46/136 (33%), Positives = 69/136 (50%), Gaps = 7/136 (5%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ +DP SSS+S +SC C+S +SC + C Y Y + + +SGY V D
Sbjct: 121 LNYFDPGSSSTSSLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYG-DGSGTSGYYVSD 179
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA-APDGVMGLGLGDVSVPSLLAKAG 155
++H AS + ++ +SV+ GC QTG A DG+ G G +SV S L+ G
Sbjct: 180 LMHFASIFEGTLTTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQG 239
Query: 156 LIQNSFSICFDENDSG 171
+ FS C ++SG
Sbjct: 240 IAPRVFSHCLKGDNSG 255
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 74/314 (23%), Positives = 125/314 (39%), Gaps = 31/314 (9%)
Query: 44 EYDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
++DP+ S+S NVSCS C S C + C Y Y + + S G+ + L
Sbjct: 177 KFDPTKSTSYNNVSCSSASCNLLPTSERGCSASNSTCLYQIIYG-DQSYSQGFFATETLT 235
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
++S S V ++ + GCG+ G + A G+ SV A Q
Sbjct: 236 ISS-------SDVFTNFLFGCGQSNNGLFGQAAGLLGLS-----SSSVSLPSQTAEKYQK 283
Query: 160 SFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYF----VGV----ESYCIGNSC 211
FS C S + + + G Q+ F PI + +++ VG+ I S
Sbjct: 284 QFSYCLPSTPSSTGYL-NFGGKVSQTAGFTPISPAFSSFYGIDIVGISVAGSQLPIDPSI 342
Query: 212 LTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
T SG A++DSG T LP Y + FD+ +S+ + CY+ S+ +
Sbjct: 343 FTTSG--AIIDSGTVITRLPPTAYKALKEAFDEKMSNYPKTNGDELLDTCYDFSNYTTVS 400
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRIVF 329
P + + F + + N G + CL + D ++GI G + + +V+
Sbjct: 401 FPKVSVSFKGGVEVDIDASGILYLVN-GVKMVCLAFAANKDDSEFGIFGNHQQKTYEVVY 459
Query: 330 DRENLKLAWSHSKC 343
D + ++ C
Sbjct: 460 DGAKGMIGFAAGAC 473
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 65/243 (26%), Positives = 103/243 (42%), Gaps = 22/243 (9%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKD-PCPYIADYSTEDTSSSGYLVD 95
L ++P +SS+S + CS C S + C++ + PC Y Y + + +SGY V
Sbjct: 135 LEFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYG-DGSGTSGYYVS 193
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKA 154
D ++ + + ++ +S++ GC Q+G A DG+ G G +SV S L
Sbjct: 194 DTMYFDTVMGNEQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSL 253
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC-------I 207
G+ FS C +D+G G + + P+ Y + +ES I
Sbjct: 254 GVSPKVFSHCLKGSDNGGGIL-VLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPI 312
Query: 208 GNSCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL--QGNSWKYCYNA 264
+S T S Q +VDSG + +L Y V VS SL +GN C+
Sbjct: 313 DSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQ---CFVT 369
Query: 265 SSE 267
SS
Sbjct: 370 SSR 372
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 85/354 (24%), Positives = 141/354 (39%), Gaps = 75/354 (21%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPC-----PYIADYSTEDTS 88
+S + P SSSSK + C +P C + C + C PY+ Y + T
Sbjct: 119 RISPFLPKHSSSSKIIGCKNPKCSWIHQTDLRCTDCDNNSRNCSQICPPYLILYGSGTT- 177
Query: 89 SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 148
G + + LHL + + ++GC S P G+ G G G S+P
Sbjct: 178 -GGVALSETLHLHGL--------IVPNFLVGC------SVFSSRQPAGIAGFGRGPSSLP 222
Query: 149 SLLAKAGLIQNSFSICF------DENDSGSVFFGDQGPATQQSTSFL-------PIGEKY 195
S L GL + FS C D +S S+ Q + +++ + + P +
Sbjct: 223 SQL---GLTK--FSYCLLSHKFDDTQESSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDK 277
Query: 196 DA----YFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVK 241
A Y+V + IG + ++DSG +FT++ TE + + +
Sbjct: 278 PAFSVYYYVSLRRISIGGRSVKIPYKYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNE 337
Query: 242 FDKLVSSKRISLQGNS---WKYCYNASSEEMLKVPDMRLIFS--KNQSFVVRNHIFSFPE 296
F V + +L + K C+N S + L++P +RL F + + N+
Sbjct: 338 FISQVKNYERALMVEALSGLKPCFNVSGAKELELPQLRLHFKGGADVELPLENYFAFLGS 397
Query: 297 NEGFTVFCLTVMSTDGDY-----GIIGQNFMMGHRIV-FDRENLKLAWSHSKCE 344
E V C TV+ TDG G+I NF M + V +D +N +L + C+
Sbjct: 398 RE---VACFTVV-TDGAEKASGPGMILGNFQMQNFYVEYDLQNERLGFKKESCK 447
>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 78/318 (24%), Positives = 136/318 (42%), Gaps = 39/318 (12%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+ +DP++S+S + V C PLC ++C C + Y+ D+S L D L +
Sbjct: 152 APFDPAASASYRTVPCGSPLCAQAPNAACPPGGKACGFSLTYA--DSSLQAALSQDSLAV 209
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
A + A GC ++ TG+ A P G++GLG G +S L + + +
Sbjct: 210 AGNAVKA--------YTFGCLQRATGT---AAPPQGLLGLGRGPLSF--LSQTKDMYEAT 256
Query: 161 FSICFDE----NDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
FS C N SG++ G G P ++T L + Y+V + +G +
Sbjct: 257 FSYCLPSFKSLNFSGTLRLGRNGQPQRIKTTPLLANPHRSSLYYVNMTGVRVGRKVVPIP 316
Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
+G ++DSG FT L Y V + + V + SL G + C+N ++
Sbjct: 317 AFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRVGAPVSSLGG--FDTCFNTTA--- 371
Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST-DG---DYGIIGQNFMMGH 325
+ P M L+F Q + ++ + T+ CL + + DG +I H
Sbjct: 372 VAWPPMTLLFDGMQVTLPEENVVI--HSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNH 429
Query: 326 RIVFDRENLKLAWSHSKC 343
R++FD N ++ ++ +C
Sbjct: 430 RVLFDVPNGRVGFARERC 447
>gi|452821304|gb|EME28336.1| aspartyl protease isoform 1 [Galdieria sulphuraria]
Length = 456
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 78/356 (21%), Positives = 151/356 (42%), Gaps = 70/356 (19%)
Query: 37 VQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP--------------CPYIADY 82
V DR Y+ ++S++ +SC+ P C + ++C C + +Y
Sbjct: 117 VPDR----YNLANSTTGTVISCNSPTCGA-NTCNQQICSSCSSSQACCSENGICGFFIEY 171
Query: 83 STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL 142
+ T+++G L DI+ + +S A + + +T ++L G A GV+GL
Sbjct: 172 G-DGTTATGALYQDIVTVGEYSVQATFAGADT---------ETANFLVGKAA-GVLGLAY 220
Query: 143 GDVS--------VPSLLAKAGLIQNSFSICFDENDSGSVFFGD------QGPATQQSTSF 188
+S V L ++ + N FS+ ++ D G+ G +GP S +
Sbjct: 221 SSLSCNPTCISPVFHQLVESFSLPNIFSVLINQ-DIGAFVVGGVNSSLYEGPIEYSSLAN 279
Query: 189 LPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS 248
+ YD V +ES + ++ L+ F A+VD+G + I+ + F +
Sbjct: 280 EQNPQFYD---VTIESVQVNSNSLSIPSFNAIVDTGTTLIVASPYIFDALKEYFQTNFCN 336
Query: 249 -----KRISLQGNSW---KYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENE 298
S G +W YC N + EE+ ++PD+ + + + +++F N
Sbjct: 337 VPGLCPSSSNPGVTWFGTDYCVNLTPEELSQLPDIEFSLAGGVTLSLGPEHYMFHVSSNN 396
Query: 299 GFTV----FCLTVM--------STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSK 342
F+ +CL + ++DG+ I+G + + +VFDREN ++ ++ K
Sbjct: 397 IFSAASGSYCLGIQPSSQNLGPTSDGNEMILGNTLQLKYYLVFDRENKRIGFAKGK 452
>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
Length = 493
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 93/380 (24%), Positives = 142/380 (37%), Gaps = 76/380 (20%)
Query: 28 CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDT 87
C+L G + ++ S P SS++++V C C + S D C IAD E
Sbjct: 116 CILCEGKA--ENTTASTPPPRLSSTARSVHCKSSACSAAHSNLPTSDLCA-IADCPLESI 172
Query: 88 SSS----------------GYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDG 131
+S G LV + H + A S + GC
Sbjct: 173 ETSDCHSFSCPSFYYAYGDGSLVARLYHDSIKLPLATPSLSLHNFTFGCAHTAL------ 226
Query: 132 AAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFDENDSGS--------VFFGDQGPAT 182
A P GV G G G +S+P+ LA A + N FS C + S + G
Sbjct: 227 AEPVGVAGFGRGVLSLPAQLASFAPQLGNRFSYCLVSHSFNSDRLRLPSPLILGHSDDKE 286
Query: 183 QQ---------STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF----------QALVDS 223
++ TS L + Y VG+E IG + F +VDS
Sbjct: 287 KRVNKDDVQFVYTSMLDNPKHPYFYCVGLEGISIGKKKIPAPEFLKRVDREGSGGVVVDS 346
Query: 224 GASFTFLPTEIYAEVVVKFDKLV-----SSKRISLQGNSWKYCYNASSEEMLKVPDMRLI 278
G +FT LP +Y VV +FD V +K + CY + ++ +P + L
Sbjct: 347 GTTFTMLPASLYNSVVAEFDNRVGRVYERAKEVE-DKTGLGPCYYY--DTVVNIPSLVLH 403
Query: 279 FSKNQSFVV---RNHIFSFPE-----NEGFTVFCLTVMS-------TDGDYGIIGQNFMM 323
F N+S VV +N+ + F + V CL +M+ T G +G
Sbjct: 404 FVGNESSVVLPKKNYFYDFLDGGDGVRRKRRVGCLMLMNGGEEAELTGGPGATLGNYQQH 463
Query: 324 GHRIVFDRENLKLAWSHSKC 343
G +V+D E ++ ++ KC
Sbjct: 464 GFEVVYDLEQRRVGFARRKC 483
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 79/323 (24%), Positives = 136/323 (42%), Gaps = 43/323 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP++S+S ++V C PLC ++C C + Y+ D+S L D L +A
Sbjct: 152 FDPAASTSYRSVPCGSPLCAQAPNAACPPGGKACGFSLTYA--DSSLQAALSQDSLAVA- 208
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
+ GC +K TG+ A P G++GLG G +S L + Q +FS
Sbjct: 209 -------GDAVKTYTFGCLQKATGT---AAPPQGLLGLGRGPLSF--LSQTRDMYQGTFS 256
Query: 163 ICFDE----NDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----- 212
C N SG++ G G P ++T L + Y+V + +G +
Sbjct: 257 YCLPSFKSLNFSGTLRLGRNGQPPRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPPP 316
Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
+G ++DSG FT L Y V + + V + SL G + C+N ++
Sbjct: 317 ALAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRVGAPVSSLGG--FDTCFNTTA- 373
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST-DG---DYGIIGQNFMM 323
+ P + L+F Q + ++ + T+ CL + + DG +I
Sbjct: 374 --VAWPPVTLLFDGMQVTLPEENVVI--HSTYGTISCLAMAAAPDGVNTVLNVIASMQQQ 429
Query: 324 GHRIVFDRENLKLAWSHSKCEEV 346
HR++FD N ++ ++ +C V
Sbjct: 430 NHRVLFDVPNGRVGFARERCTAV 452
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 73/316 (23%), Positives = 138/316 (43%), Gaps = 26/316 (8%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLK-------DPCPYIADYSTEDTSSSGYLVDDI 97
+DP++SS+ V C C+ +S S + CPY Y +D+ + G L D
Sbjct: 181 FDPTASSTYSAVPCGARECQELASSSSSRNCSSDNNKNCPYEVSYD-DDSHTVGDLARDT 239
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
L L+ +P +V + GCG G++ + DG++GLGLG S+PS + A
Sbjct: 240 LTLSPSPSPSPADTVPG-FVFGCGHSNAGTFGE---VDGLLGLGLGKASLPSQV--AARY 293
Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCLT-- 213
+FS C + S + + G A + + F + G+ +Y++ + + +
Sbjct: 294 GAAFSYCLPSSPSAAGYLSFGGAAARANAQFTEMVTGQDPTSYYLNLTGIVVAGRAIKVP 353
Query: 214 QSGFQ----ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCYNASSE 267
S F ++DSG +F+ LP YA + F + R +S + CY+ +
Sbjct: 354 ASAFATAAGTIIDSGTAFSRLPPSAYAALRSSFRSAMGRYRYKRAPSSPIFDTCYDFTGH 413
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
E +++P + L+F+ + + + N+ CL + + D GI+G +
Sbjct: 414 ETVRIPAVELVFADGATVHLHPSGVLYTWND-VAQTCLAFVP-NHDLGILGNTQQRTLAV 471
Query: 328 VFDRENLKLAWSHSKC 343
++D + ++ + C
Sbjct: 472 IYDVGSQRIGFGRKGC 487
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 67/250 (26%), Positives = 105/250 (42%), Gaps = 22/250 (8%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRSSCKS--LKDPCPYIADYSTEDTSSSGYLVDDILHL 100
S YDPS S +S SCS P C + + + C Y+ Y + +S+SG + D+L L
Sbjct: 58 SFYDPSRSPTSAAFSCSSPTCTALGPYANGCANNQCQYLVRYP-DGSSTSGAYIADLLTL 116
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+ + S GC + GS+ AA G+M LG G S+ L A N+
Sbjct: 117 DA-------GNAVSGFKFGCSHAEQGSFDARAA--GIMALGGGPESL--LSQTASRYGNA 165
Query: 161 FSICFDENDSGSVFFGDQGPATQQS----TSFLPIGEKYDAYFVGVESYCIGNSCL--TQ 214
FS C S S FF P S T + + Y V + + +G L
Sbjct: 166 FSYCIPATASDSGFFTLGVPRRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAP 225
Query: 215 SGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
+ F A ++DS + T LP Y + F ++ R + CY+ + +++
Sbjct: 226 AVFAAGSVLDSRTAITRLPPTAYQALRAAFRSSMTMYRSAPPKGYLDTCYDFTGVVNIRL 285
Query: 273 PDMRLIFSKN 282
P + L+F +N
Sbjct: 286 PKISLVFDRN 295
>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 330
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 77/318 (24%), Positives = 127/318 (39%), Gaps = 41/318 (12%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSR--SSCKSLK----DPCPYIADYSTEDTSSSGYLVD 95
L +D S+SS+ SC LC+ +SC + K C Y Y+ + ++ VD
Sbjct: 22 LPYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLIEVD 81
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
A S V GCG G + G+ G G G +S+PS L K G
Sbjct: 82 KFTFGAGASV--------PGVAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG 130
Query: 156 LIQNSFSICFDENDS---GSVFFG------DQGPATQQSTSFLPIGEKYDAYFVGVESYC 206
+FS CF + +V G QST + Y++ ++
Sbjct: 131 ----NFSHCFTAVNGLKQSTVLLDLPADLYKNGRGAVQSTPLIQNSANPTFYYLSLKGIT 186
Query: 207 IGNS---------CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS 257
+G++ LT ++DSG S T LP ++Y V +F + +
Sbjct: 187 VGSTRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATG 246
Query: 258 WKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGI 316
C++A S+ VP + L F + R N++F P++ G ++ CL + D + I
Sbjct: 247 PYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGD-ETTI 305
Query: 317 IGQNFMMGHRIVFDRENL 334
IG +++D +N+
Sbjct: 306 IGNFQQQNMHVLYDLQNM 323
>gi|222615640|gb|EEE51772.1| hypothetical protein OsJ_33215 [Oryza sativa Japonica Group]
Length = 775
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 68/324 (20%), Positives = 133/324 (41%), Gaps = 42/324 (12%)
Query: 54 KNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKH 106
K V+C+ LC C S K C Y+ Y D+SS G LV D FS
Sbjct: 452 KLVTCADSLCTDLYTDLGKPKRCGSQKQ-CDYVIQYV--DSSSMGVLVID-----RFSLS 503
Query: 107 APQSSVQSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSIC 164
A + +++ GCG Q + P D ++GL G V++ S L G+I ++ C
Sbjct: 504 ASNGTNPTTIAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHC 563
Query: 165 FDENDSGSVFFGD-QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDS 223
G +FFGD Q P + + + + KY + G + + ++ + + DS
Sbjct: 564 ISSKGGGFLFFGDAQVPTSGVTWTPMNREHKYYSPGHGTLHFDSNSKAISAAPMAVIFDS 623
Query: 224 GASFTFLPTEIYAEVVVKFDKLVSSK-----RISLQGNSWKYCYNASSEEMLKVPDMRLI 278
GA++T+ + Y + ++S+ ++ + + C+ ++++ + +++
Sbjct: 624 GATYTYFAAQPYQATLSVVKSTLNSECKFLTEVTEKDRALTVCWKGK-DKIVTIDEVKKC 682
Query: 279 FS----------KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY------GIIGQNFM 322
F K + + + EG CL ++ ++ +IG M
Sbjct: 683 FRSLSLEFADGDKKATLEIPPEHYLIISQEGHV--CLGILDGSKEHLSLAGTNLIGGITM 740
Query: 323 MGHRIVFDRENLKLAWSHSKCEEV 346
+ +++D E L W + +C+ +
Sbjct: 741 LDQMVIYDSERSLLGWVNYQCDRI 764
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 70/318 (22%), Positives = 118/318 (37%), Gaps = 31/318 (9%)
Query: 76 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQ-TGSYLDGAAP 134
C Y Y+ + S+ G L+ D L P+ + + ++ GCG Q G +P
Sbjct: 29 CDYEIKYA-DGASTIGALIVDQFSL-------PRIATRPNLPFGCGYNQGIGENFQQTSP 80
Query: 135 -DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIG 192
+G++GL G VS S L G+I ++ C G +F GD + + +
Sbjct: 81 VNGILGLDRGKVSFVSQLKMLGIITKHVVGHCLSSGGGGLLFVGDG------DGNLVLLH 134
Query: 193 EKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS 252
Y Y G + L + + DSG+++T+ + Y V +SS +
Sbjct: 135 ANY--YSPGSATLYFDRHSLGMNPMDVVFDSGSTYTYFTAQPYQATVYAIKGGLSSTSLE 192
Query: 253 -LQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV-----FCLT 306
+ S C+ + V D++ F Q N + P V CL
Sbjct: 193 QVSDPSLPLCWKGQ-KAFESVFDVKKEFKSLQLNFGNNAVMEIPPENYLIVTEYGNVCLG 251
Query: 307 VM-STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNP 365
++ ++ IIG M +++D E +L W C D S P+ +
Sbjct: 252 ILHGCRLNFNIIGDITMQDQMVIYDNEREQLGWIRGSC----DGSQEAPTQAPSAEEVVG 307
Query: 366 LPTTEQQSTSNGQAAAPP 383
+ S + G APP
Sbjct: 308 AAARREASQATGSYLAPP 325
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 72/308 (23%), Positives = 134/308 (43%), Gaps = 42/308 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS---RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
+DP SS + +++SC C++ SSC S + C Y + Y + + ++G L D + L
Sbjct: 135 FDPKSSKTYRDLSCDTRQCQNLGESSSCSS-EQLCQY-SYYYGDRSFTNGNLAVDTVTLP 192
Query: 102 SFSK---HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
S + + P++ +IGCGR+ G++ G++GLG G +S+ S + + +
Sbjct: 193 STNGGPVYFPKT------VIGCGRRNNGTF--DKKDSGIIGLGGGPMSLISQMGSS--VG 242
Query: 159 NSFSICF------DENDSGSVFFGDQGPATQQSTSFLPIGEKY--DAYFVGVESYCIGNS 210
FS C +S + FG + P+ K Y++ +E+ +G+
Sbjct: 243 GKFSYCLVPFSSESAGNSSKLHFGRNAVVSGSGVQSTPLISKNPDTFYYLTLEAMSVGDK 302
Query: 211 CL-------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCY 262
+ S ++DSG S T P + E + +++ +R +CY
Sbjct: 303 KIEFGGSSFGGSEGNIIIDSGTSLTLFPVNFFTEFATAVENAVINGERTQDASGLLSHCY 362
Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD--YGIIGQ- 319
+ + LKVP + F+ + + F ++ V CL ST +G + Q
Sbjct: 363 RPTPD--LKVPVITAHFNGADVVLQTLNTFILISDD---VLCLAFNSTQSGAIFGNVAQM 417
Query: 320 NFMMGHRI 327
NF++G+ I
Sbjct: 418 NFLIGYDI 425
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 89/317 (28%), Positives = 146/317 (46%), Gaps = 40/317 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP SSSS +SC+ CK +++C S D C Y Y + + ++G L + L +
Sbjct: 193 FDPKSSSSYSPLSCNSQQCKLLDKANCNS--DTCIYQVHYG-DGSFTTGELATETLSFGN 249
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S P ++ IGCG G + GA ++GLG G +S+ S L + SFS
Sbjct: 250 -SNSIP------NLPIGCGHDNEGLFAGGAG---LIGLGGGAISLSSQLKAS-----SFS 294
Query: 163 IC---FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL------ 212
C D + S ++ F P+ TS L +++ +Y +V V +G L
Sbjct: 295 YCLVNLDSDSSSTLEFNSNMPS-DSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTR 353
Query: 213 ---TQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
+SG +VDSG + LP+++Y + F KL SS + + + CYN S +
Sbjct: 354 FEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQS 413
Query: 269 MLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
++VP + + S+ S + RN++ + G +CL + T IIG G R
Sbjct: 414 NVEVPTIAFVLSEGTSLRLPARNYLIML-DTAG--TYCLAFIKTKSSLSIIGSFQQQGIR 470
Query: 327 IVFDRENLKLAWSHSKC 343
+ +D N + +S +KC
Sbjct: 471 VSYDLTNSLVGFSTNKC 487
>gi|213998848|gb|ACJ60790.1| nucellin [Psathyrostachys stoloniformis]
Length = 154
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 70/138 (50%), Gaps = 4/138 (2%)
Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDS 170
+ ++ GCG KQ +P DG++GLG+G + L +I +N C
Sbjct: 6 KKNIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITENVIGHCLSSKGK 65
Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTF 229
G ++ GD P T+ T ++P+ E Y G+ + I + + F+A+ DSG+++T+
Sbjct: 66 GVLYVGDFNPPTRGVT-WVPMRESLFYYSPGLAALFIDKQPIRGNPTFEAVFDSGSTYTY 124
Query: 230 LPTEIYAEVVVKFDKLVS 247
+P +IY E+V K +S
Sbjct: 125 MPAQIYNELVSKIRGTLS 142
>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
Length = 458
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 140/371 (37%), Gaps = 74/371 (19%)
Query: 21 PVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS------------ 68
P TT C S + + ++P SSS K + C P C + SS
Sbjct: 114 PCTTHYTCT---NCSFSNPKKVPIFNPELSSSDKILGCRDPKCANTSSPDVHLGCPRCNG 170
Query: 69 -CKSLKDPCP-YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 126
K CP Y Y T ++SG+ + + L + H ++GC T
Sbjct: 171 NSKKCSHACPQYTLQYGTG--AASGFFLLENLDFPGKTIH--------KFLVGC----TT 216
Query: 127 SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND------SGSVFFGDQGP 180
S + D + G G S+P + F+ C + +D SG + D
Sbjct: 217 SADREPSSDALAGFGRTMFSLPMQMG-----VKKFAYCLNSHDYDDTRNSGKLIL-DYSD 270
Query: 181 ATQQSTSFLPIGEKYD----AYFVGVESYCIGNSCLTQSGFQ----------ALVDSGAS 226
Q S+ P + Y++GV+ IGN L G ++DSG +
Sbjct: 271 GETQGLSYAPFLKNPPDYPFYYYLGVKDMKIGNKLLRIPGKYLTPGSDSRGGVMIDSGFA 330
Query: 227 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNASSEEMLKVPDMRLIFSKNQ 283
+ ++ ++ V + K +S R SL+ + CYN + + +K+PD+ F+
Sbjct: 331 YGYMTLPVFKIVTNELKKQMSKYRRSLEAETQSGLTPCYNFTGHKSIKIPDLIYQFTGGA 390
Query: 284 SFVV--RNHIFSFPENEGFTVFCLTVMS---------TDGDYGIIGQNFMMGHRIVFDRE 332
+ VV N+ F E ++ C V + T G I+G + H + FD +
Sbjct: 391 NMVVPGMNYFLLFSEA---SLGCFPVTTDSPTNNLEFTPGPSIILGNYQQVDHYVEFDLK 447
Query: 333 NLKLAWSHSKC 343
N +L + C
Sbjct: 448 NERLGFRQQTC 458
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 75/315 (23%), Positives = 130/315 (41%), Gaps = 50/315 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSR----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
Y P+ S++ NVSC P+C++ S C C Y Y + TS+ G L + L
Sbjct: 135 YAPARSATYANVSCRSPMCQALQSPWSRCSPPDTGCAYYFSYG-DGTSTDGVLATETFTL 193
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
S + V GCG + GS + + G++G+G G + SL+++ G+ +
Sbjct: 194 GS-------DTAVRGVAFGCGTENLGSTDNSS---GLVGMGRGPL---SLVSQLGVTRPR 240
Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQAL 220
S G P T + +G+ + ++ + + G +
Sbjct: 241 RSCRARAAARGGGA-----PTTTSPLEGITVGDT----LLPIDPAVFRLTPMGDGGV--I 289
Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS---WKYCYNASSEEMLKVPDMRL 277
+DSG +FT L + V L S R+ L + C+ A+S E ++VP + L
Sbjct: 290 IDSGTTFTALEERAF---VALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVL 346
Query: 278 IFS------KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 331
F + +S+VV E+ V CL ++S G ++G I++D
Sbjct: 347 HFDGADMELRRESYVV--------EDRSAGVACLGMVSARG-MSVLGSMQQQNTHILYDL 397
Query: 332 ENLKLAWSHSKCEEV 346
E L++ +KC E+
Sbjct: 398 ERGILSFEPAKCGEL 412
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 81/322 (25%), Positives = 127/322 (39%), Gaps = 44/322 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
YDP +SS+ V CS P C + SSC S C Y A Y + + S GYL D
Sbjct: 151 YDPRASSTYAAVPCSAPQCAELQAATLNPSSC-SGSGVCQYQASYG-DGSFSFGYLSKDT 208
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
+ L+S S GCG+ G + A G++GL +S+ S LA + +
Sbjct: 209 VSLSS-------SGSFPGFYYGCGQDNVGLFGRAA---GLIGLARNKLSLLSQLAPS--V 256
Query: 158 QNSFSICFDEN---DSGSVFFG----DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS 210
NSF+ C + +G + FG ++ P TS + YFV + + S
Sbjct: 257 GNSFAYCLPTSAAASAGYLSFGSNSDNKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAGS 316
Query: 211 CLT-----QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW---KYCY 262
L ++DSG T LPT +Y K V + + ++ + C+
Sbjct: 317 PLAVPSSEYGSLPTIIDSGTVITRLPTPVY----TALSKAVGAALAAPSAPAYSILQTCF 372
Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
+ L VP + + F+ + + NE T CL TD IIG
Sbjct: 373 KGQVAK-LPVPAVNMAFAGGATLRLTPGNVLVDVNE--TTTCLAFAPTDST-AIIGNTQQ 428
Query: 323 MGHRIVFDRENLKLAWSHSKCE 344
+V+D + ++ ++ C
Sbjct: 429 QTFSVVYDVKGSRIGFAAGGCS 450
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 68/313 (21%), Positives = 123/313 (39%), Gaps = 33/313 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DP+ SS+ NVSC+ C + C Y Y + + + G+ D L +A
Sbjct: 206 FDPAKSSTYANVSCTDSACADLDTNGCTGGHCLYAVQYG-DGSYTVGFFAQDTLTIA--- 261
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
GCG K G + A G+MGLG G S+ + +F+ C
Sbjct: 262 -----HDAIKGFRFGCGEKNNGLFGKTA---GLMGLGRGKTSL--TVQAYNKYGGAFAYC 311
Query: 165 FDENDSGSVFFGDQGPATQQSTSFL-PI----GEKYDAYFVGVESYCIGN-------SCL 212
+G+ + D GP + + + L P+ G+ + Y+VG+ +G S
Sbjct: 312 LPALTTGTGYL-DFGPGSAGNNARLTPMLTDKGQTF--YYVGMTGIRVGGQQVPVAESVF 368
Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEML 270
+ +G LVDSG T LP Y + FDK++ ++ + + + CY+ + +
Sbjct: 369 STAG--TLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDV 426
Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
++P + L+F V + +E D I+G + +++D
Sbjct: 427 ELPTVSLVFQGGACLDVDVSGIVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYD 486
Query: 331 RENLKLAWSHSKC 343
+ ++ C
Sbjct: 487 LGKKTVGFAPGSC 499
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 79/316 (25%), Positives = 132/316 (41%), Gaps = 33/316 (10%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRSS-CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
++DP+ SSS V C P+C + C C Y Y + +S++G L D L S
Sbjct: 179 DFDPAKSSSYAAVPCGTPVCAAAGGMCNGTT--CLYGVQYG-DGSSTTGVLSRDTLTFNS 235
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
SK + GCG K G D DG++GLG G +S+PS A + FS
Sbjct: 236 SSKF-------TGFTFGCGEKNIG---DFGEVDGLLGLGRGKLSLPSQAAPS--FGGVFS 283
Query: 163 ICFDENDS--GSVFFGDQGPATQ---QSTSFLPIGEKYDAYFVGVESYCIGN-------S 210
C ++ G + G P + Q T+ + + YF+ + S IG S
Sbjct: 284 YCLPSYNTTPGYLNIGATKPTSTVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPS 343
Query: 211 CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 270
T++G L+DSG T+LP Y + +F + + + CY+ + + +
Sbjct: 344 VFTKTG--TLLDSGTILTYLPPPAYTSLRDRFKFTMQGNKPAPPYEPLDTCYDFTGQGAI 401
Query: 271 KVPDMRLIFSKNQSFVVRNH-IFSFPENEGFTVFCLTVMSTDG--DYGIIGQNFMMGHRI 327
+P + FS F + + I FP++ + CL +S + I+G +
Sbjct: 402 VIPAVSFNFSDGAVFDLDFYGIMIFPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEV 461
Query: 328 VFDRENLKLAWSHSKC 343
++D + K+ + C
Sbjct: 462 IYDVPSQKIGFIPISC 477
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 71/318 (22%), Positives = 116/318 (36%), Gaps = 41/318 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DP+ S++ N+SCS C C Y Y + + + G+ D L LA
Sbjct: 204 FDPTKSATYANISCSSSYCSDLYVSGCSGGHCLYGIQYG-DGSYTIGFYAQDTLTLA--- 259
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSI 163
+ GCG K G + A G++GLG G S+P K G + F+
Sbjct: 260 -----YDTIKNFRFGCGEKNRGLFGRAA---GLLGLGRGKTSLPVQAYDKYGGV---FAY 308
Query: 164 CFDENDSGSVF--FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG----- 216
C +G+ F G PA + + + Y+VG+ +G L G
Sbjct: 309 CLPATSAGTGFLDLGPGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFST 368
Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK---------YCYNASSE 267
LVDSG T LP YA + F K ++QG + CY+ +
Sbjct: 369 AGTLVDSGTVITRLPPSAYAPLRSAFSK-------AMQGLGYSAAPAFSILDTCYDLTGH 421
Query: 268 E--MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 325
+ + +P + L+F V + + + D D I+G H
Sbjct: 422 KGGSIALPAVSLVFQGGACLDVDASGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTH 481
Query: 326 RIVFDRENLKLAWSHSKC 343
+++D + ++ C
Sbjct: 482 GVLYDIGKKIVGFAPGAC 499
>gi|145351657|ref|XP_001420185.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580418|gb|ABO98478.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 498
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 63/283 (22%), Positives = 121/283 (42%), Gaps = 39/283 (13%)
Query: 91 GYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA--APDGVMGLGLGDVSVP 148
GY+ +D L + AP + + GCG Y DG+ DG+ G G+ +
Sbjct: 166 GYMAEDTFTLGD--ELAP-----AKITFGCGGMY---YPDGSNLRQDGMAGFSRGNTAFH 215
Query: 149 SLLAKAGLIQ-NSFSICFDENDS-------GSVFFGDQGPATQQSTSFLPIGEKYDAYFV 200
+ LAKAG+I + F C + ++ G FG + P + +GE D V
Sbjct: 216 TQLAKAGVIDAHVFGFCSEGMETSTAMLTLGRYNFGRRVPELAWTRM---LGE--DDLAV 270
Query: 201 GVESYCIGNSCL-TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK 259
S+ +G+ + + S ++DSG + T LP+ ++ + + ++ S +S+
Sbjct: 271 RTMSWKLGDKTIASSSNVYTVLDSGTTLTVLPSAMHHDFMTHLNETARSAGLSVVVRGTH 330
Query: 260 YCYNASSEEMLK-------VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST-- 310
Y + L P + + + + + V+R + F + FC +MS
Sbjct: 331 CFYENQRQSSLTQYTLTRWFPSLTITYDPDVTLVLRPENYLFADTVNLHAFCAGIMSASD 390
Query: 311 ----DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDK 349
+G+ I+GQ + + +D EN ++ + +CE++ +K
Sbjct: 391 AALANGEQIILGQQTLRNTFVEYDLENSRVGMATVQCEKLREK 433
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 68/313 (21%), Positives = 123/313 (39%), Gaps = 33/313 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DP+ SS+ NVSC+ C + C Y Y + + + G+ D L +A
Sbjct: 206 FDPAKSSTYANVSCTDSACADLDTNGCTGGHCLYAVQYG-DGSYTVGFFAQDTLTIA--- 261
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
GCG K G + A G+MGLG G S+ + +F+ C
Sbjct: 262 -----HDAIKGFRFGCGEKNNGLFGKTA---GLMGLGRGKTSL--TVQAYNKYGGAFAYC 311
Query: 165 FDENDSGSVFFGDQGPATQQSTSFL-PI----GEKYDAYFVGVESYCIGN-------SCL 212
+G+ + D GP + + + L P+ G+ + Y+VG+ +G S
Sbjct: 312 LPALTTGTGYL-DFGPGSAGNNARLTPMLTDKGQTF--YYVGMTGIRVGGQQVPVAESVF 368
Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEML 270
+ +G LVDSG T LP Y + FDK++ ++ + + + CY+ + +
Sbjct: 369 STAG--TLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDV 426
Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
++P + L+F V + +E D I+G + +++D
Sbjct: 427 ELPTVSLVFQGGACLDVDVSGIVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYD 486
Query: 331 RENLKLAWSHSKC 343
+ ++ C
Sbjct: 487 LGKKTVGFAPGSC 499
>gi|340500865|gb|EGR27703.1| plasmepsin 5, putative [Ichthyophthirius multifiliis]
Length = 602
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 76/348 (21%), Positives = 135/348 (38%), Gaps = 53/348 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA--- 101
YD S ++K C + C + C + Y+ E +S SGY+ D + L
Sbjct: 88 YDLEKSLTAKKEKCKSTKLSCQGYCNNFSQECNWSVSYA-EGSSISGYMAGDYVVLGDEM 146
Query: 102 ----------SFSKHAPQSSV----QSSVII--GCGRKQTGSYLDGAAPDGVMGLGLGDV 145
S+ Q + SV + GC +T +L PDG++GL D
Sbjct: 147 QDYIEKLTKNQISEKEEQEYLTYIKHESVFLNFGCTTNETNLFL-SQVPDGIIGLAPSDK 205
Query: 146 S--------VPSLLAKAGLIQNS----FSICFDENDSGSVFFGDQGPATQQS---TSFLP 190
S V + K QN+ FS+C + G + G + T +P
Sbjct: 206 SGRANTGNIVDEIFKKHK--QNNETHVFSLCLNAEKGGYMSVGGYNYELHEKNARTQIIP 263
Query: 191 IGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 250
Y V ++ I N+ + + ++DSG + P+ I ++ K ++L S++
Sbjct: 264 FDSDSGYYSVSIKQILIQNNVIVTNIGYTIIDSGTTIVLGPSRIINPIIQKINELCESEQ 323
Query: 251 ISLQGNSW-------KYCYNASSEE------MLKVPDMRLIFSKNQSFVVR--NHIFSFP 295
S G+ K+ YN S E P++ F Q V + +++
Sbjct: 324 YSCGGSKKNGDKQQSKFLYNPSKYENNVNNFFDSFPNIDFKFENGQVIVWKPSAYLYIDR 383
Query: 296 ENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
+N ++ + + +G FM + I+FDR+N ++ ++ SKC
Sbjct: 384 KNGYKNLYQFGFEAYESGKLYLGGPFMKNYDILFDRDNQEIHFTASKC 431
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 77/325 (23%), Positives = 134/325 (41%), Gaps = 38/325 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRS----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+DPS SS+ +V C P CK +C C Y Y + + + G L + L
Sbjct: 169 FDPSKSSTYVDVPCGTPQCKIGGGQDLTCGGTT--CEYSVKYG-DQSVTRGNLAQEAFTL 225
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD----GVMGLGLGDVSVPSLLAKAGL 156
+P + + V+ GC + + S + GA + G++GLG GD S+ S + G
Sbjct: 226 ------SPSAPPAAGVVFGCSHEYS-SGVKGAEEEMSVAGLLGLGRGDSSILS-QTRRGN 277
Query: 157 IQNSFSICFDENDS--GSVFFGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNS 210
+ FS C S G + G P Q + SF P+ + Y V + + +
Sbjct: 278 SGDVFSYCLPPRGSSAGYLTIGAAAPP-QSNLSFTPLVTDNSQLSSVYVVNLVGISVSGA 336
Query: 211 CL--TQSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN--SWKYCYNA 264
L S F ++DSG T +P Y + +F + + + +G+ S CY+
Sbjct: 337 ALPIDASAFYIGTVIDSGTVITHMPAAAYYVLRDEFRRHMGGYTMLPEGHVESLDTCYDV 396
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNH----IFSF-PENEGFTVFCLTVMSTD-GDYGIIG 318
+ +++ P + L F V +F+ + T+ CL + T+ + IIG
Sbjct: 397 TGHDVVTAPPVALEFGGGARIDVDASGILLVFAVDASGQSLTLACLAFVPTNLPGFVIIG 456
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKC 343
+ +VFD E ++ + + C
Sbjct: 457 NMQQRAYNVVFDVEGRRIGFGANGC 481
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 87/349 (24%), Positives = 157/349 (44%), Gaps = 68/349 (19%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLK---DP---CPYIADYSTEDTSSSGYLVDD 96
S ++P +S + + CS CK+R+S +L DP C +I Y+ + +S G+L +
Sbjct: 103 SIFNPLASKTYTKIPCSSQTCKTRTSDLTLPVTCDPAKLCHFIISYA-DASSVEGHLAFE 161
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAG 155
S ++ A + GC + S + A G+MG+ G +S + + G
Sbjct: 162 TFRFGSLTRPA--------TVFGCMDSGSSSNTEEDAKTTGLMGMNRGSLS---FVNQMG 210
Query: 156 LIQNSFSICFDENDS-GSVFFGDQ----------GPATQQSTSFLPIGEKYDAYFVGVES 204
FS C DS G + G+ P Q ST LP ++ AY V +E
Sbjct: 211 F--RKFSYCISGLDSTGFLLLGEARYSWLKPLNYTPLVQISTP-LPYFDRV-AYSVQLEG 266
Query: 205 YCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKF-------DKLV 246
+ N L T +G Q +VDSG FTFL +Y+ + +F +++
Sbjct: 267 IKVNNKVLPLPKSVFVPDHTGAG-QTMVDSGTQFTFLLGPVYSALRKEFLLQTAGVLRVL 325
Query: 247 SSKRISLQGNSWKYCY--NASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFP-ENEGF-T 301
+ + QG + CY +++S + +P ++L+F + V + ++ P E G +
Sbjct: 326 NEPQYVFQG-AMDLCYLIDSTSSTLPNLPVVKLMFRGAEMSVSGQRLLYRVPGEVRGKDS 384
Query: 302 VFCLTVMSTDGDYGIIGQNFMMGHR------IVFDRENLKLAWSHSKCE 344
V+C T ++D + GI +F++GH + +D EN ++ ++ +C+
Sbjct: 385 VWCFTFGNSD-ELGI--SSFLIGHHQQQNVWMEYDLENSRIGFAELRCD 430
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 73/315 (23%), Positives = 126/315 (40%), Gaps = 23/315 (7%)
Query: 45 YDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
++P SS+ K +C C S+ C L C Y Y + + S G L + L
Sbjct: 131 FEPLKSSTYKYATCDSQPCTLLQPSQRDCGKLGQ-CIYGIMYG-DKSFSVGILGTETLSF 188
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
S Q+ + I GCG + G+ GLG G +S+ S L I +
Sbjct: 189 GS--TGGAQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQ--IGHK 244
Query: 161 FSIC---FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT- 213
FS C +D + + FG + T P+ K YF+ +E+ IG ++
Sbjct: 245 FSYCLLPYDSTSTSKLKFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVVST 304
Query: 214 -QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
Q+ ++DSG T+L Y V + + K + + K C+ + L +
Sbjct: 305 GQTDGNIVIDSGTPLTYLENTFYNNFVASLQETLGVKLLQDLPSPLKTCF--PNRANLAI 362
Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDR 331
PD+ F+ S +R P + + CL V+ + G + G ++ +D
Sbjct: 363 PDIAFQFT-GASVALRPKNVLIPLTDS-NILCLAVVPSSGIGISLFGSIAQYDFQVEYDL 420
Query: 332 ENLKLAWSHSKCEEV 346
E K++++ + C +V
Sbjct: 421 EGKKVSFAPTDCAKV 435
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 76/291 (26%), Positives = 119/291 (40%), Gaps = 39/291 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+DP+ SSS V C C S+C + + C Y+ Y + ++++G D L L
Sbjct: 181 FDPAQSSSYAAVPCGRSACAGLGIYASACSAAQ--CGYVVSYG-DGSNTTGVYSSDTLTL 237
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQN 159
A+ ++VQ + GCG Q+G G DG++G G PSL+ + AG
Sbjct: 238 AA------NATVQ-GFLFGCGHAQSGGLFTGI--DGLLGFGR---EQPSLVQQTAGAYGG 285
Query: 160 SFSICFDENDSGSVFFGDQGPATQ----QSTSFLPIGEKYDAYFVGVESYCIGNSCLT-- 213
FS C S + + GP+ +T LP Y V + +G L+
Sbjct: 286 VFSYCLPTKSSTTGYLTLGGPSGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVP 345
Query: 214 QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
S F A +VD+G T LP YA + F ++S + CY+ + +
Sbjct: 346 ASAFAAGTVVDTGTVITRLPPAAYAALRSAFRSGMASYPSAPPIGILDTCYSFAGYGTVN 405
Query: 272 VPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMS--TDGDYGIIGQ 319
+ + L FS + + + I SF CL S +DG I+G
Sbjct: 406 LTSVALTFSSGATMTLGADGIMSF--------GCLAFASSGSDGSMAILGN 448
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 74/324 (22%), Positives = 121/324 (37%), Gaps = 34/324 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSR--------SSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
+DP+ S++ V C+ C + SC + C Y Y + + S G L D
Sbjct: 232 FDPAGSATYAAVRCNASACAASLKAATGTPGSCGGGNERCYYALAYG-DGSFSRGVLATD 290
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS--LLAKA 154
+ L S + GCG G + G+MGLG ++S+ S L
Sbjct: 291 TVALGGAS--------LDGFVFGCGLSNRGLF---GGTAGLMGLGRTELSLVSQTALRYG 339
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----YFVGVESYCIGN 209
G+ + SGS+ G + + +T D YF+ V +G
Sbjct: 340 GVFSYCLPATTSGDASGSLSLGGDASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGG 399
Query: 210 SCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNS-WKYCYNA 264
+ L G A L+DSG T L +Y V +F + ++ + G S CY+
Sbjct: 400 TALAAQGLGASNVLIDSGTVITRLAPSVYRGVRAEFTRQFAAAGYPTAPGFSILDTCYDL 459
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV--MSTDGDYGIIGQNFM 322
+ + +KVP + L V F + + CL + +S + IIG
Sbjct: 460 TGHDEVKVPLLTLRLEGGAEVTVDAAGMLFVVRKDGSQVCLAMASLSYEDQTPIIGNYQQ 519
Query: 323 MGHRIVFDRENLKLAWSHSKCEEV 346
R+V+D +L ++ C V
Sbjct: 520 KNKRVVYDTVGSRLGFADEDCNYV 543
>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
Length = 499
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 92/357 (25%), Positives = 137/357 (38%), Gaps = 66/357 (18%)
Query: 47 PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPY----IADYSTEDTS-----SSGYLVDDI 97
P + S S +SC C + + S D C + + T D S S Y D
Sbjct: 139 PLNVSKSSLISCKSRACSTAHNSPSTSDLCAIAKCPLDEIETSDCSNYHCPSFYYAYGDG 198
Query: 98 LHLASFSKH---APQSSVQ----SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 150
+A KH P +S + GC G P GV G G G +S+P+
Sbjct: 199 SLIAKLHKHNLIMPSTSNKPFSLKDFTFGCAHSALGE------PIGVAGFGFGSLSLPAQ 252
Query: 151 LAKAGL-IQNSFSIC-----FDENDS--------GSVFFGDQGPATQQSTSFLPIGEKYD 196
LA + N FS C FD G V D TQ + + K+
Sbjct: 253 LANLSPDLGNQFSYCLVSHSFDSTKLHHPSPLILGKVKERDFDEITQFVYTPMLDNPKHP 312
Query: 197 AYF-VGVESYCIGNSCLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKL 245
++ V +E+ +G+S + +VDSG ++T LPT Y V + D+
Sbjct: 313 YFYSVSMEAISVGSSRVRAPNALIRIDRDGNGGVVVDSGTTYTMLPTGFYNSVATELDRR 372
Query: 246 V------SSKRISLQGNSWKYCYNASSEEMLK--VPDMRLIFSKNQSFVV--RNHIFSF- 294
V +S+ S G S Y + E L VP + F N S V+ RN+ + F
Sbjct: 373 VGRVFKRASETESKTGLSPCYYLEGNGVERLGLVVPRLAFHFGGNYSVVLPRRNYFYEFL 432
Query: 295 ---PENEGFTVFCLTVM-----STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
E +G V CL +M S G +G G ++V+D E ++ ++ KC
Sbjct: 433 DGEDEKKGRKVGCLMLMDGGDESEGGPGATLGNYQQQGFQVVYDLEERRVGFAPRKC 489
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 75/303 (24%), Positives = 136/303 (44%), Gaps = 30/303 (9%)
Query: 45 YDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
+DP +SS+ K+VSCS C ++++SC + + C Y Y +++ + G + D L L
Sbjct: 132 FDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYG-DNSYTKGNIAVDTLTLG 190
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
S Q ++IIGCG G++ G++GLG G VS+ L + I F
Sbjct: 191 SSDTRPMQ---LKNIIIGCGHNNAGTF--NKKGSGIVGLGGGPVSLIKQLGDS--IDGKF 243
Query: 162 SICF-----DENDSGSVFFGDQGPATQQ---STSFLPIGEKYDAYFVGVESYCIGNSCL- 212
S C ++ + + FG + ST + + Y++ ++S +G+ +
Sbjct: 244 SYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQ 303
Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
S ++DSG + T LPTE Y+E+ + +++ + CY+A+
Sbjct: 304 YSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATG 363
Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ-NFMMGH 325
+ LKVP + + F + ++ F +E F + YG + Q NF++G+
Sbjct: 364 D--LKVPVITMHFDGADVKLDSSNAF-VQVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGY 420
Query: 326 RIV 328
V
Sbjct: 421 DTV 423
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 70/300 (23%), Positives = 121/300 (40%), Gaps = 30/300 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DP+ SS+ NVSC+ P C + C Y Y + + S G+ D L L+S+
Sbjct: 223 FDPARSSTYANVSCAAPACSDLNIHGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY- 280
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSI 163
GCG + G + + A G++GLG G S+P K G + F+
Sbjct: 281 ------DAVKGFRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAH 328
Query: 164 CFDENDSGSVF--FGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNSCLT--QS 215
C +G+ + FG A ++ P+ G + Y+VG+ +G L+ QS
Sbjct: 329 CLPARSTGTGYLDFGAGSLAAARARLTTPMLTENGPTF--YYVGMTGIRVGGQLLSIPQS 386
Query: 216 GFQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEML 270
F +VDSG T LP Y+ + F ++++ + + + CY+ + +
Sbjct: 387 VFATAGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQV 446
Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
+P + L+F V + + GD GI+G + + +D
Sbjct: 447 AIPTVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYD 506
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 75/316 (23%), Positives = 126/316 (39%), Gaps = 34/316 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DPS S+S VSC C+ ++C++ C Y Y + + + G + L L
Sbjct: 208 FDPSLSASYAAVSCDSQRCRDLDTAACRNATGACLYEVAYG-DGSYTVGDFATETLTLG- 265
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S+ +V IGCG G ++ A + G L S PS ++ ++FS
Sbjct: 266 ------DSTPVGNVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----STFS 311
Query: 163 ICFDENDS---GSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCL------ 212
C + DS ++ FGD T+ L + Y+V + +G L
Sbjct: 312 YCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASA 371
Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
T +VDSG + T L + YA + F + S + + + CY+ S
Sbjct: 372 FAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSLFDTCYDLSDR 431
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
++VP + L F + + + P +G +CL T+ IIG G R+
Sbjct: 432 TSVEVPAVSLRFEGGGALRLPAKNYLIPV-DGAGTYCLAFAPTNAAVSIIGNVQQQGTRV 490
Query: 328 VFDRENLKLAWSHSKC 343
FD + ++ +KC
Sbjct: 491 SFDTARGAVGFTPNKC 506
>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
Length = 437
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 82/327 (25%), Positives = 137/327 (41%), Gaps = 43/327 (13%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
LS Y+ S+SS+S SCS PLC SRS S C Y Y + TS Y+ D
Sbjct: 127 LSIYNLSASSTSSVSSCSDPLCTGEQAVCSRSGSNS---ACAYGISYQDKSTSIGAYVKD 183
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
D+ ++ ++ S + GC TGS+ DG+MG G +VP+ +A
Sbjct: 184 DMHYVLQ-----GGNATTSHIFFGCAINITGSW----PADGIMGFGQISKTVPNQIATQR 234
Query: 156 LIQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL- 212
+ FS C +++ G + FG++ T+ F P+ Y V + S + + L
Sbjct: 235 NMSRVFSHCLGGEKHGGGILEFGEEPNTTEM--VFTPLLNVTTHYNVDLLSISVNSKVLP 292
Query: 213 -------------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR-ISLQGNSW 258
++G ++DSG SF L T+ + + L ++K L+G
Sbjct: 293 IDSKEFSYVSNSTNETG--VIIDSGTSFALLATKANRILFSEIKNLTTAKLGPKLEGLQC 350
Query: 259 KYCYNASSEEMLKVPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGDYGI 316
Y + + E P++ L FS + ++ N++ + +C S DG I
Sbjct: 351 FYLKSGLTVET-SFPNVTLTFSGGSTMKLKPDNYLVMVELKKKRNGYCYAWSSADG-LTI 408
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKC 343
G+ + + +D EN ++ W C
Sbjct: 409 FGEIVLKDKLVFYDVENRRIGWKGQNC 435
>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
gi|194688798|gb|ACF78483.1| unknown [Zea mays]
gi|194703430|gb|ACF85799.1| unknown [Zea mays]
gi|194707192|gb|ACF87680.1| unknown [Zea mays]
gi|223944599|gb|ACN26383.1| unknown [Zea mays]
gi|223948667|gb|ACN28417.1| unknown [Zea mays]
gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 450
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 78/318 (24%), Positives = 136/318 (42%), Gaps = 39/318 (12%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+ +DP+SS+S + V C PLC ++C C + Y+ D+S L D L +
Sbjct: 152 APFDPASSASYRTVPCGSPLCAQAPNAACPPGGKACGFSLTYA--DSSLQAALSQDSLAV 209
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
A + A GC ++ TG+ A P G++GLG G +S L + + +
Sbjct: 210 AGNAVKA--------YTFGCLQRATGT---AAPPQGLLGLGRGPLSF--LSQTKDMYEAT 256
Query: 161 FSICFDE----NDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
FS C N SG++ G G P ++T L + Y+V + +G +
Sbjct: 257 FSYCLPSFKSLNFSGTLRLGRNGQPQRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIP 316
Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
+G ++DSG FT L Y V + + V + SL G + C+N ++
Sbjct: 317 AFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRVGAPVSSLGG--FDTCFNTTA--- 371
Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST-DG---DYGIIGQNFMMGH 325
+ P + L+F Q + ++ + T+ CL + + DG +I H
Sbjct: 372 VAWPPVTLLFDGMQVTLPEENVVI--HSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNH 429
Query: 326 RIVFDRENLKLAWSHSKC 343
R++FD N ++ ++ +C
Sbjct: 430 RVLFDVPNGRVGFARERC 447
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 79/325 (24%), Positives = 122/325 (37%), Gaps = 37/325 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
+ P +S S V CS CK S ++C S PC Y Y + G + D
Sbjct: 130 FRPEASKSWAPVPCSSDTCKLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSAT 189
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
+A Q V++GC G DGV+ LG +S S A
Sbjct: 190 IALPGGKVAQ---LQDVVLGCSSTHDGQSFKSV--DGVLSLGNAKISFASR--AAARFGG 242
Query: 160 SFSICF-----DENDSGSVFFG----DQGPATQQSTSFLPI----GEKYDAYFVGVESYC 206
SFS C N +G + FG + PATQ P G K DA V ++
Sbjct: 243 SFSYCLVDHLAPRNATGYLAFGPGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALD 302
Query: 207 IGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQGNSWKYCYN-- 263
I ++DSG + T L T Y VV KL++ ++ +++CYN
Sbjct: 303 IPAEVWDPKSGGVILDSGTTLTVLATPAYKAVVAALTKLLAGVPKVDFP--PFEHCYNWT 360
Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY---GIIGQN 320
A ++P + + F+ + G V C+ + +G++ +IG
Sbjct: 361 APRPGAPEIPKLAVQFTGCARLEPPAKSYVIDVKPG--VKCIGLQ--EGEWPGVSVIGNI 416
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEE 345
H FD +N+++ + S C
Sbjct: 417 MQQEHLWEFDLKNMEVRFMPSTCTR 441
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 72/305 (23%), Positives = 133/305 (43%), Gaps = 34/305 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
+DP +SS+ K+VSCS C ++++SC + + C Y Y +++ + G + D L L
Sbjct: 132 FDPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYG-DNSYTKGNIAVDTLTLG 190
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAG-LIQN 159
S Q ++IIGCG G++ + +G P SL+ + G I
Sbjct: 191 SSDTRPMQ---LKNIIIGCGHNNAGTF------NKKGSGIVGLGGGPVSLIKQLGDSIDG 241
Query: 160 SFSICF-----DENDSGSVFFGDQGPATQQ---STSFLPIGEKYDAYFVGVESYCIGNSC 211
FS C ++ + + FG + ST + + Y++ ++S +G+
Sbjct: 242 KFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQ 301
Query: 212 L-------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
+ S ++DSG + T LPTE Y+E+ + +++ + CY+A
Sbjct: 302 IQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSA 361
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ-NFMM 323
+ + LKVP + + F + ++ F +E F + YG + Q NF++
Sbjct: 362 TGD--LKVPVITMHFDGADVKLDSSNAF-VQVSEDLVCFAFRGSPSFSIYGNVAQMNFLV 418
Query: 324 GHRIV 328
G+ V
Sbjct: 419 GYDTV 423
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 76/319 (23%), Positives = 119/319 (37%), Gaps = 28/319 (8%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSL--KDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+ PS SS+ V C C++R SC D CPY Y + + + G+L +D L L +
Sbjct: 198 FAPSDSSTFSAVRCGARECRARQSCGGSPGDDRCPYEVVYG-DKSRTQGHLGNDTLTLGT 256
Query: 103 FS---KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
+ A + + GCG TG L G A DG+ GLG G VS+ S AG
Sbjct: 257 MAPANASAENDNKLPGFVFGCGENNTG--LFGQA-DGLFGLGRGKVSLSS--QAAGKFGE 311
Query: 160 SFSICFDENDSGSVFFGDQG-----PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ 214
FS C + S + + G PA Q T L Y+V + + +
Sbjct: 312 GFSYCLPSSSSSAPGYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRV 371
Query: 215 S----GFQALVDSGASFTFLPTEIYAEVVVKFDKLVS------SKRISLQGNSWKYCYNA 264
S +VDSG T L Y + F + + R+S+ Y + A
Sbjct: 372 SSPRVALPLIVDSGTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSILDTC--YDFTA 429
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
+ + +P + L+F+ + V + GI+G
Sbjct: 430 HANATVSIPAVALVFAGGATISVDFSGVLYVAKVAQACLAFAPNGDGRSAGILGNTQQRT 489
Query: 325 HRIVFDRENLKLAWSHSKC 343
+V+D K+ ++ C
Sbjct: 490 LAVVYDVARQKIGFAAKGC 508
>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
Length = 468
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 82/362 (22%), Positives = 148/362 (40%), Gaps = 83/362 (22%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLC--------KSR-SSCKSLKDPC-----PYIADYSTED 86
+ ++ P SSSSK + C +P C +S+ +C C PYI Y
Sbjct: 130 KIPKFMPRLSSSSKLIGCKNPKCAWVFGSSVQSKCHNCNPQAQNCTQACPPYIIQYGLG- 188
Query: 87 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 146
S++G L+ + ++ P ++ S + GC S L P+G+ G G S
Sbjct: 189 -STAGLLLSETINF-------PNKTI-SDFLAGC------SLLSTRQPEGIAGFGRSQES 233
Query: 147 VPSLLAKAGLIQNSFSIC-----FDENDSGSVFFGDQGPATQQST----SFLPIGEKY-- 195
+P L GL FS C FD++ S D GP+T S S+ P +
Sbjct: 234 LPLQL---GL--KKFSYCLVSRRFDDSPVSSDLILDMGPSTSDSKTTGLSYTPFQKNLAS 288
Query: 196 -------DAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEV 238
+ Y+V + +G + + + +VDSG++FTF+ ++ +
Sbjct: 289 QSNPAFQEYYYVMLRKIIVGKTHVKVPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFELL 348
Query: 239 VVKFDKLVSSKRISL---QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSF 294
+F+K +++ ++ + + C++ S E+ + +PD+ F + ++ F+F
Sbjct: 349 AKEFEKQMANYTVATNVQKLTGLRPCFDISGEKSVVIPDLTFQFKGGAKMQLPLSNYFAF 408
Query: 295 PENEGFTVFCLTVMSTD-----GDYG--------IIGQNFMMGHRIVFDRENLKLAWSHS 341
+ V CLT++S + GD G I+G I +D EN + +
Sbjct: 409 VD---MGVVCLTIVSDNAAALGGDGGVRSSGPAIILGNFQQQNFYIEYDLENDRFGFKEQ 465
Query: 342 KC 343
C
Sbjct: 466 SC 467
>gi|213998826|gb|ACJ60780.1| nucellin [Hordeum intercedens]
Length = 148
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 67/138 (48%), Gaps = 4/138 (2%)
Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS 170
+ V GCG KQ +P DG++GLG+G + L +I N C
Sbjct: 6 KKKVAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGK 65
Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTF 229
G ++ GD P ++ T ++P+ E Y G+ I N + + F+A+ DSG+++T
Sbjct: 66 GVLYVGDFNPPSRGVT-WVPMKESLFYYSAGLAELLIDNQPIRGNPTFEAVFDSGSTYTH 124
Query: 230 LPTEIYAEVVVKFDKLVS 247
+P +IY E+V K +S
Sbjct: 125 VPAQIYNEIVSKVRGTLS 142
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 56/242 (23%), Positives = 98/242 (40%), Gaps = 17/242 (7%)
Query: 114 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF---DENDS 170
S+ + GCGR G + G+MGLG ++S+ S FS C D S
Sbjct: 241 SNFVFGCGRNNKGLF---GGVSGIMGLGRSNLSMISQTNTT--FGGVFSYCLPTTDSGAS 295
Query: 171 GSVFFGDQGPATQQ-----STSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQ---ALVD 222
GS+ G++ + TS + + + Y + + +G + + F L+D
Sbjct: 296 GSLVIGNESSLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGGVAIQDTSFGNGGILID 355
Query: 223 SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKN 282
SG T L +Y + +F K S I+ + C+N + E + +P + + F N
Sbjct: 356 SGTVITRLAPSLYNALKAEFLKQFSGYPIAPALSILDTCFNLTGIEEVSIPTLSMHFENN 415
Query: 283 QSFVVRN-HIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 341
V I P++ L +S + D IIG R+++D + K+ ++
Sbjct: 416 VDLNVDAVGILYMPKDGSQVCLALASLSDENDMAIIGNYQQRNQRVIYDAKQSKIGFARE 475
Query: 342 KC 343
C
Sbjct: 476 DC 477
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 73/259 (28%), Positives = 104/259 (40%), Gaps = 34/259 (13%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYL 93
D+ ++PS S+S NVSCS C S SS C Y Y + + S G+L
Sbjct: 141 DQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYG-DQSFSVGFL 199
Query: 94 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
+ L S V V GCG G + A G++GLG +S PS A
Sbjct: 200 AKEKFTLT-------NSDVFDGVYFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTAT 249
Query: 154 AGLIQNSFSICFDENDS--GSVFFGDQGPATQQSTSFLPIGEKYD----------AYFVG 201
A FS C + S G + FG G +S F PI D A VG
Sbjct: 250 A--YNKIFSYCLPSSASYTGHLTFGSAG--ISRSVKFTPISTITDGTSFYGLNIVAITVG 305
Query: 202 VESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
+ I ++ + G AL+DSG T LP + YA + F +S + + C
Sbjct: 306 GQKLPIPSTVFSTPG--ALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTC 363
Query: 262 YNASSEEMLKVPDMRLIFS 280
++ S + + +P + FS
Sbjct: 364 FDLSGFKTVTIPKVAFSFS 382
>gi|242035209|ref|XP_002464999.1| hypothetical protein SORBIDRAFT_01g030210 [Sorghum bicolor]
gi|241918853|gb|EER91997.1| hypothetical protein SORBIDRAFT_01g030210 [Sorghum bicolor]
Length = 107
Score = 59.7 bits (143), Expect = 3e-06, Method: Composition-based stats.
Identities = 33/76 (43%), Positives = 45/76 (59%), Gaps = 1/76 (1%)
Query: 115 SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSV 173
+V C TGS+LDG A +G+MGLG VSV +L +GL+ +SFS+CF E+ G +
Sbjct: 13 AVAKACRCGPTGSFLDGGAFNGLMGLGKEKVSVAGMLTASGLVASDSFSMCFSEDVVGRI 72
Query: 174 FFGDQGPATQQSTSFL 189
FGD G Q F+
Sbjct: 73 NFGDAGIRGQGEMPFI 88
>gi|224083514|ref|XP_002307058.1| predicted protein [Populus trichocarpa]
gi|222856507|gb|EEE94054.1| predicted protein [Populus trichocarpa]
Length = 376
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 78/314 (24%), Positives = 133/314 (42%), Gaps = 34/314 (10%)
Query: 56 VSCSHPLCKSRSSCKSLKDPCPYIADYSTE---DTSSSGYLVDDILHLASFSKHAPQSSV 112
V C P+C+S S + P DY E SS G LV D +L +F+ S +
Sbjct: 70 VPCMDPICQSLHSNGDHRCENPGQCDYEVEYADGGSSFGVLVRDTFNL-NFTSEKRHSPL 128
Query: 113 QSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 172
+ + G + GS+ DGV+GLG G S+ S L+ GL++N C + G
Sbjct: 129 LALGLCGYDQFPGGSH---HPIDGVLGLGKGKSSIVSQLSSLGLVRNVIGHCLSGHGGGF 185
Query: 173 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV---DSGASFTF 229
+FFGD + + ++ P+ Y G+ +GF+ L+ DSGAS+T+
Sbjct: 186 LFFGDDLYDSSR-VAWTPMSPDAKHYSPGLAELTFDGK---TTGFKNLLTTFDSGASYTY 241
Query: 230 LPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV 287
L ++ Y ++ K +S K R +L + C+ + + D++ F K +
Sbjct: 242 LNSQAYQGLISLLKKELSGKPLREALDDQTLPLCWKG-RKPFKSIRDVKKYF-KTFALSF 299
Query: 288 RNHI-----FSFPENEGFTVF------CLTVMSTD----GDYGIIGQNFMMGHRIVFDRE 332
N FP E + + CL +++ D +IG M +++D E
Sbjct: 300 TNERKSKTELEFPP-EAYLIISSKGNACLGILNGTEVGLNDLNVIGDISMQDRVVIYDNE 358
Query: 333 NLKLAWSHSKCEEV 346
++ W+ C +
Sbjct: 359 KERIGWAPGNCNRL 372
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 80/333 (24%), Positives = 134/333 (40%), Gaps = 42/333 (12%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSS-G 91
R+ + + SSS K + C +CK S ++C + PC Y DY D S++ G
Sbjct: 129 RHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGY--DYRYSDGSTALG 186
Query: 92 YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
+ ++ + + K + + + V+IGC G A DGVMGLG S +
Sbjct: 187 FFANETVTVEL--KEGRKMKLHN-VLIGCSESFQGQSFQAA--DGVMGLGYSKYSFA--I 239
Query: 152 AKAGLIQNSFSICF-----DENDSGSVFFG----DQGPATQQSTSFLPIGEKYDAYFVGV 202
A FS C +N S + FG + + + L +G Y V +
Sbjct: 240 KAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNM 299
Query: 203 ESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFD-KLVSSKRISL 253
IG + L + ++DSG+S TFL Y V+ L+ +++ +
Sbjct: 300 MGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEM 359
Query: 254 QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV--VRNHIFSFPENEGFTVFCLTVMSTD 311
+YC+N++ E VP + F+ F V++++ S + V CL +S
Sbjct: 360 DIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADG----VRCLGFVSVA 415
Query: 312 G-DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
++G H FD KL ++ S C
Sbjct: 416 WPGTSVVGNIMQQNHLWEFDLGLKKLGFAPSSC 448
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 80/333 (24%), Positives = 134/333 (40%), Gaps = 42/333 (12%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSS-G 91
R+ + + SSS K + C +CK S ++C + PC Y DY D S++ G
Sbjct: 58 RHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGY--DYRYSDGSTALG 115
Query: 92 YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
+ ++ + + K + + + V+IGC G A DGVMGLG S +
Sbjct: 116 FFANETVTVEL--KEGRKMKLHN-VLIGCSESFQGQSFQAA--DGVMGLGYSKYSFA--I 168
Query: 152 AKAGLIQNSFSICF-----DENDSGSVFFG----DQGPATQQSTSFLPIGEKYDAYFVGV 202
A FS C +N S + FG + + + L +G Y V +
Sbjct: 169 KAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNM 228
Query: 203 ESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFD-KLVSSKRISL 253
IG + L + ++DSG+S TFL Y V+ L+ +++ +
Sbjct: 229 MGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEM 288
Query: 254 QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV--VRNHIFSFPENEGFTVFCLTVMSTD 311
+YC+N++ E VP + F+ F V++++ S + V CL +S
Sbjct: 289 DIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADG----VRCLGFVSVA 344
Query: 312 G-DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
++G H FD KL ++ S C
Sbjct: 345 WPGTSVVGNIMQQNHLWEFDLGLKKLGFAPSSC 377
>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 444
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 82/346 (23%), Positives = 145/346 (41%), Gaps = 68/346 (19%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+ +DPS SSS ++ CSHPLCK R +SC S C Y Y+ + T + G LV
Sbjct: 123 TSFDPSLSSSFSDLPCSHPLCKPRIPDFTLPTSCDS-NRLCHYSYFYA-DGTFAEGNLVK 180
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
+ ++ P +I+GC ++ T G++G+ LG + S +++A
Sbjct: 181 EKFTFSNSQTTPP-------LILGCAKESTDV-------KGILGMNLGRL---SFISQAK 223
Query: 156 LIQNSFSICFDEN-----DSGSVFFGDQG-------------PATQQSTSFLPIGEKYDA 197
+ + S+ I N +GS + G+ P +Q+ + P+ A
Sbjct: 224 ISKFSYCIPTRSNRPGLASTGSFYLGENPNSRGFKYVSLLTFPQSQRMPNLDPL-----A 278
Query: 198 YFVGVESYCIGNSCLT--QSGF--------QALVDSGASFTFLPTEIYAEVVVKFDKLVS 247
Y V + IG L S F Q +VDSG+ FT L Y +V + +LV
Sbjct: 279 YTVPLLGIRIGQKRLNIPSSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVG 338
Query: 248 S--KRISLQGNSWKYCYNASSEEMLK--VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVF 303
S K+ + G++ C++ + + ++ + D+ F + +V N G +
Sbjct: 339 SRLKKGYVYGSTADMCFDGNHQMVIGRLIGDLVFEFGRGVEILVEKQ--RLLVNVGGGIH 396
Query: 304 CLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
C+ + + IIG + FD N ++ +S ++C +
Sbjct: 397 CVGIGRSSMLGAASNIIGNVHQQNLWVEFDVANRRVGFSKAECSRL 442
>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 440
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 77/341 (22%), Positives = 138/341 (40%), Gaps = 57/341 (16%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDI 97
+ +DPS SSS + C+HPLCK R +L C + + + + T + G LV +
Sbjct: 121 TSFDPSLSSSFSVLPCNHPLCKPRIPDFTLPTTCDQNRLCHYSYFYADGTYAEGSLVREK 180
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS----------- 146
+ +S P +I+GC T G++G+ LG S
Sbjct: 181 ITFSSSQSTPP-------LILGCAEASTDE-------KGILGMNLGRRSFASQAKISKFS 226
Query: 147 --VPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPAT--QQSTSFLPIGEKYDAYFVGV 202
VP+ A+AGL + +SG + + T Q+S + P+ AY + +
Sbjct: 227 YCVPTRQARAGLSSTGSFYLGNNPNSGRFQYINLLTFTPSQRSPNLDPL-----AYTIPM 281
Query: 203 ESYCIGNSCLTQSGF----------QALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KR 250
+ +GN+ L S Q ++DSG+ FT+L E Y +V + +LV K+
Sbjct: 282 QGIRMGNARLNISATLFRPDPSGAGQTIIDSGSEFTYLVDEAYNKVREEVVRLVGPKLKK 341
Query: 251 ISLQGNSWKYCYNASSEEMLK-VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS 309
+ G C++ + E+ + + +M F K V+ + + G V C+ +
Sbjct: 342 GYVYGGVSDMCFDGNPMEIGRLIGNMVFEFEKGVEIVIDK--WRVLADVGGGVHCIGIGR 399
Query: 310 TD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
++ IIG + +D N ++ + C +
Sbjct: 400 SEMLGAASNIIGNFHQQNLWVEYDLANRRIGLGKADCSRSV 440
>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
Length = 458
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 140/371 (37%), Gaps = 74/371 (19%)
Query: 21 PVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSS------------ 68
P TT C S + + ++P SSS K + C P C SS
Sbjct: 114 PCTTHYTCT---NCSFSNPKKVPIFNPELSSSDKILGCRDPKCADTSSPBVHLGXPRCNG 170
Query: 69 -CKSLKDPCP-YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 126
K CP Y Y T ++SG+ + + L + H ++GC T
Sbjct: 171 NSKKCSHACPQYTLQYGTG--AASGFFLLENLDFPGKTIH--------KFLVGC----TT 216
Query: 127 SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND------SGSVFFGDQGP 180
S + D + G G S+P + F+ C + +D SG + D
Sbjct: 217 SADREPSSDALAGFGRTMFSLPMQMG-----VKKFAYCLNSHDYDDTRNSGKLIL-DYSD 270
Query: 181 ATQQSTSFLPIGEKYD----AYFVGVESYCIGNSCLTQSGFQ----------ALVDSGAS 226
Q S+ P + Y++GV+ IGN L G ++DSG +
Sbjct: 271 GETQGLSYAPFXKNPPDYPIYYYLGVKDMKIGNKVLRIPGKYLTPGSDSRGGVVIDSGFA 330
Query: 227 FTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK---YCYNASSEEMLKVPDMRLIFSKNQ 283
++++ ++ V + K +S R SL+ + CYN + + +K+PD+ F+
Sbjct: 331 YSYMTLPVFKIVTNELKKQMSKYRRSLELEAQTGVTPCYNFTGHKSIKIPDLIYQFTGGA 390
Query: 284 SFVV--RNHIFSFPENEGFTVFCLTVMS---------TDGDYGIIGQNFMMGHRIVFDRE 332
+ VV N+ F E ++ C V + T G I+G + H + FD +
Sbjct: 391 NMVVPGMNYFLLFSEA---SLGCFPVTTDSPTSNLEFTPGPSIILGNYQQVDHYVEFDLK 447
Query: 333 NLKLAWSHSKC 343
N +L + C
Sbjct: 448 NERLGFRQQTC 458
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 89/317 (28%), Positives = 146/317 (46%), Gaps = 40/317 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCK--SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP SSSS +SC+ CK +++C S D C Y Y + + ++G L + L +
Sbjct: 193 FDPKSSSSYSPLSCNSQQCKLLDKANCNS--DTCIYQVHYG-DGSFTTGELATETLSFGN 249
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S P ++ IGCG G + GA ++GLG G +S+ S L + SFS
Sbjct: 250 -SNSIP------NLPIGCGHDNEGLFAGGAG---LIGLGGGAISLSSQLKAS-----SFS 294
Query: 163 IC---FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAY-FVGVESYCIGNSCL------ 212
C D + S ++ F P+ TS L +++ +Y +V V +G L
Sbjct: 295 YCLVNLDSDSSSTLEFNSYMPS-DSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTR 353
Query: 213 ---TQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
+SG +VDSG + LP+++Y + F KL SS + + + CYN S +
Sbjct: 354 FEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQS 413
Query: 269 MLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
++VP + + S+ S + RN++ + G +CL + T IIG G R
Sbjct: 414 NVEVPTIAFVLSEGTSLRLPARNYLIML-DTAG--TYCLAFIKTKSSLSIIGSFQQQGIR 470
Query: 327 IVFDRENLKLAWSHSKC 343
+ +D N + +S +KC
Sbjct: 471 VSYDLTNSIVGFSTNKC 487
>gi|213998845|gb|ACJ60789.1| nucellin [Psathyrostachys fragilis subsp. fragilis]
Length = 150
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 70/138 (50%), Gaps = 4/138 (2%)
Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDS 170
+ ++ GCG KQ +P DG++GLG+G + L +I +N C
Sbjct: 4 KKNIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITENVIGHCLSSKGK 63
Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTF 229
G ++ GD P T+ T ++P+ E Y G+ + I + + F+A+ DSG+++T+
Sbjct: 64 GVLYVGDFNPPTRGVT-WVPMRESLFYYSPGLAALFIDKQPIRGNPTFEAVFDSGSTYTY 122
Query: 230 LPTEIYAEVVVKFDKLVS 247
+P +IY E+V K +S
Sbjct: 123 VPAQIYNELVSKIRGTLS 140
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 80/333 (24%), Positives = 134/333 (40%), Gaps = 42/333 (12%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSS-G 91
R+ + + SSS K + C +CK S ++C + PC Y DY D S++ G
Sbjct: 129 RHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGY--DYRYSDGSTALG 186
Query: 92 YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
+ ++ + + K + + + V+IGC G A DGVMGLG S +
Sbjct: 187 FFANETVTVEL--KEGRKMKLHN-VLIGCSESFQGQSFQAA--DGVMGLGYSKYSFA--I 239
Query: 152 AKAGLIQNSFSICF-----DENDSGSVFFG----DQGPATQQSTSFLPIGEKYDAYFVGV 202
A FS C +N S + FG + + + L +G Y V +
Sbjct: 240 KAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNM 299
Query: 203 ESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKFD-KLVSSKRISL 253
IG + L + ++DSG+S TFL Y V+ L+ +++ +
Sbjct: 300 MGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEM 359
Query: 254 QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFV--VRNHIFSFPENEGFTVFCLTVMSTD 311
+YC+N++ E VP + F+ F V++++ S + V CL +S
Sbjct: 360 DIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADG----VRCLGFVSVA 415
Query: 312 G-DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
++G H FD KL ++ S C
Sbjct: 416 WPGTSVVGNIMQQNHLWEFDLGLKKLGFAPSSC 448
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 81/313 (25%), Positives = 126/313 (40%), Gaps = 29/313 (9%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS-CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+DPS SS+ V C P C + C C Y+ Y + +S++G L D L L S
Sbjct: 189 FDPSKSSTYAAVHCGEPQCAAAGDLCSEDNTTCLYLVRYG-DGSSTTGVLSRDTLALTS- 246
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S + GCG + G + DG++GLG G++S+PS A + FS
Sbjct: 247 ------SRALTGFPFGCGTRNLGDF---GRVDGLLGLGRGELSLPSQAAAS--FGAVFSY 295
Query: 164 CFDENDSGSVFFG-DQGPATQ----QSTSFLPIGEKYDAYFVGVESYCIGNSCL------ 212
C ++S + + PAT Q T+ L + YFV + S IG L
Sbjct: 296 CLPSSNSTTGYLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAV 355
Query: 213 -TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
T+ G L+DSG T+LP + YA + +F + + + CY+ + E +
Sbjct: 356 FTRGG--TLLDSGTVLTYLPAQAYALLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVV 413
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFD 330
VP + F F + +E M T G IIG +++D
Sbjct: 414 VPAVSFRFGDGAVFELDFFGVMIFLDENVGCLAFAAMDTGGLPLSIIGNTQQRSAEVIYD 473
Query: 331 RENLKLAWSHSKC 343
K+ + + C
Sbjct: 474 VAAEKIGFVPASC 486
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 79/325 (24%), Positives = 127/325 (39%), Gaps = 46/325 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
+DPS+S + N+SC+ C S S C S C Y Y + + + G+ D
Sbjct: 197 FDPSTSKTYSNISCTSAACSSLKSATGNSPGCSS--SNCVYGIQYG-DSSFTIGFFAKDK 253
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
L L Q+ V + GCG+ G + A G++GLG +S+ A+
Sbjct: 254 LTLT-------QNDVFDGFMFGCGQNNKGLFGKTA---GLIGLGRDPLSIVQQTAQK--F 301
Query: 158 QNSFSICF--DENDSGSVFFGD-----QGPATQQSTSFLPI----GEKYDAYFVGVESYC 206
FS C +G + FG+ A + +F P G Y YF+ V
Sbjct: 302 GKYFSYCLPTSRGSNGHLTFGNGNGVKASKAVKNGITFTPFASSQGTAY--YFIDVLGIS 359
Query: 207 IGNSCLTQSG--FQ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
+G L+ S FQ ++DSG T LP+ Y + F + +S + + C
Sbjct: 360 VGGKALSISPMLFQNAGTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSLLDTC 419
Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMST--DGDYGIIG 318
Y+ S+ + +P + F+ N + + N I G + CL D GI G
Sbjct: 420 YDLSNYTSISIPKISFNFNGNANVELDPNGIL---ITNGASQVCLAFAGNGDDDSIGIFG 476
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKC 343
+V+D +L + + C
Sbjct: 477 NIQQQTLEVVYDVAGGQLGFGYKGC 501
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 83/323 (25%), Positives = 134/323 (41%), Gaps = 42/323 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+DP++S+S V C +C++ S C C Y Y + + + G L + L
Sbjct: 175 FDPAASASFTAVPCDSGVCRTLPGGSSGCAD-SGACRYQVSYG-DGSYTQGVLAMETL-- 230
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+F P VQ V IGCG + G ++ A G++GLG G +S+ L A +
Sbjct: 231 -TFGDSTP---VQG-VAIGCGHRNRGLFVGAA---GLLGLGWGPMSLVGQLGGA--AGGA 280
Query: 161 FSICFD----ENDSGSVFFG--DQGPATQQSTSFLPIGEKYDAYFVGVESYCI------- 207
FS C + +GS+ FG D P L ++ Y+VG+ +
Sbjct: 281 FSYCLASRGADAGAGSLVFGRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPL 340
Query: 208 --GNSCLTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW-KYCYN 263
G LT+ G +V D+G + T LP + YA + F + G S CY+
Sbjct: 341 QDGLFDLTEDGGGGVVMDTGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGVSLLDTCYD 400
Query: 264 ASSEEMLKVPDMRLIFSKNQSFVV---RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQN 320
S ++VP + L F ++ + + RN + G V+CL ++ I+G
Sbjct: 401 LSGYASVRVPTVALYFGRDGAALTLPARNLLVEM----GGGVYCLAFAASASGLSILGNI 456
Query: 321 FMMGHRIVFDRENLKLAWSHSKC 343
G +I D N + + S C
Sbjct: 457 QQQGIQITVDSANGYVGFGPSTC 479
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 77/356 (21%), Positives = 131/356 (36%), Gaps = 73/356 (20%)
Query: 40 RNLSEYDPSS------SSSSKNVSCSHPLCK-----SRSSCKS--LKDPCPYIADYSTED 86
RN S P S S++ + C P C+ + C L PC Y Y+ +
Sbjct: 118 RNCSHRSPGSAFFARHSTTYSAIHCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYA-DS 176
Query: 87 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA---PDGVMGLGLG 143
++++G+ + L L + + + + + GCG + +G L GA+ GVMGLG
Sbjct: 177 STTTGFFSKEALTLNTSTGKVKK---LNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRA 233
Query: 144 DVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA------ 197
+S S L + + FS C + + TSFL IG +
Sbjct: 234 PISFSSQLGRR--FGSKFSYCLMDYT-----------LSPPPTSFLTIGGAQNVAVSKKG 280
Query: 198 ----------------YFVGVESYCIGNSCLTQS----------GFQALVDSGASFTFLP 231
Y++ ++ + L + ++DSG + TF+
Sbjct: 281 IMSFTPLLINPLSPTFYYIAIKGVYVNGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFIT 340
Query: 232 TEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF--VVRN 289
Y E++ F K V + + C N S +P M + F RN
Sbjct: 341 EPAYTEILKAFKKRVKLPSPAEPTPGFDLCMNVSGVTRPALPRMSFNLAGGSVFSPPPRN 400
Query: 290 HIFSFPENEGFTVFCLTV--MSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
+ G + CL V +S DG + ++G G + FDR+ +L ++ C
Sbjct: 401 YFI----ETGDQIKCLAVQPVSQDGGFSVLGNLMQQGFLLEFDRDKSRLGFTRRGC 452
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 80/329 (24%), Positives = 128/329 (38%), Gaps = 43/329 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSSSGYLVD-DIL 98
+ P+ S S + C CKS ++C S DPC Y DY +D SS+ +V D
Sbjct: 151 FRPAGSKSWSPLPCDSDTCKSYVPFSLANCSSPPDPCSY--DYRYKDNSSARGVVGLDSA 208
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
++ + + V++GC G + DGV+ LG ++S S A
Sbjct: 209 TVSLSGNDGTRKAKLQEVVLGCTTSYDGQSFKSS--DGVLSLGNSNISFAS--RAASRFG 264
Query: 159 NSFSICF-----DENDSGSVFFGDQGPATQQSTS-------FLPIGEKYDAYFVGVESYC 206
FS C N + + FG+ + +S L YFV V++
Sbjct: 265 GRFSYCLVDHLAPRNATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVT 324
Query: 207 IGNSCLT--------QSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNS 257
+ L + A++DSG S T L T Y VV K R+++ +
Sbjct: 325 VAGERLEILPDVWDFRKNGGAILDSGTSLTILATPAYDAVVKAISKQFAGVPRVNM--DP 382
Query: 258 WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY--- 314
++YCYN + ++P M L F+ + + G V C+ V+ +G +
Sbjct: 383 FEYCYNWTGVSA-EIPRMELRFAGAATLAPPGKSYVIDTAPG--VKCIGVV--EGAWPGV 437
Query: 315 GIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
+IG H FD N L + S+C
Sbjct: 438 SVIGNILQQEHLWEFDLANRWLRFKQSRC 466
>gi|213998798|gb|ACJ60766.1| nucellin [Hordeum brevisubulatum subsp. violaceum]
Length = 141
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 66/136 (48%), Gaps = 4/136 (2%)
Query: 116 VIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSV 173
+ GCG KQ +P DG++GLG+G + L +I +N C G +
Sbjct: 1 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMIKENVIGHCLSSKGKGVL 60
Query: 174 FFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTFLPT 232
+ GD P ++ T ++P+ E Y G+ I N + F+A+ DSG+++T +P
Sbjct: 61 YVGDFNPPSRGVT-WVPMRESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPA 119
Query: 233 EIYAEVVVKFDKLVSS 248
+IY E+V K +S
Sbjct: 120 QIYNEIVSKVRGTLSE 135
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 71/318 (22%), Positives = 116/318 (36%), Gaps = 41/318 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DP+ S++ N+SCS C C Y Y + + + G+ D L LA
Sbjct: 139 FDPTKSATYANISCSSSYCSDLYVSGCSGGHCLYGIQYG-DGSYTIGFYAQDTLTLA--- 194
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSI 163
+ GCG K G + A G++GLG G S+P K G + F+
Sbjct: 195 -----YDTIKNFRFGCGEKNRGLFGRAA---GLLGLGRGKTSLPVQAYDKYGGV---FAY 243
Query: 164 CFDENDSGSVF--FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG----- 216
C +G+ F G PA + + + Y+VG+ +G L G
Sbjct: 244 CLPATSAGTGFLDLGPGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFST 303
Query: 217 FQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW---------KYCYNASSE 267
LVDSG T LP YA + F K ++QG + CY+ +
Sbjct: 304 AGTLVDSGTVITRLPPSAYAPLRSAFSK-------AMQGLGYSAAPAFSILDTCYDLTGH 356
Query: 268 E--MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 325
+ + +P + L+F V + + + D D I+G H
Sbjct: 357 KGGSIALPAVSLVFQGGACLDVDASGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTH 416
Query: 326 RIVFDRENLKLAWSHSKC 343
+++D + ++ C
Sbjct: 417 GVLYDIGKKIVGFAPGAC 434
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 70/318 (22%), Positives = 128/318 (40%), Gaps = 39/318 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
++P SSS + C C+ S + C Y Y + +++ GY+ +
Sbjct: 138 FNPQDSSSFSTLPCESQYCQDLPSETCNNNECQYTYGYG-DGSTTQGYMATETFTF---- 192
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
++S ++ GCG G A G++G+G G +S+PS L FS C
Sbjct: 193 ----ETSSVPNIAFGCGEDNQGFGQGNGA--GLIGMGWGPLSLPSQLGVG-----QFSYC 241
Query: 165 FDENDSGS---VFFGDQG---PATQQSTSFLPIGEKYDAYFVGVESYCIG--NSCLTQSG 216
S S + G P ST+ + Y++ ++ +G N + S
Sbjct: 242 MTSYGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSST 301
Query: 217 FQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE- 267
FQ ++DSG + T+LP + Y V F ++ + + C+ S+
Sbjct: 302 FQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLSTCFQQPSDG 361
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG--IIGQNFMMGH 325
++VP++ + F + +I P EG V CL M + G I G
Sbjct: 362 STVQVPEISMQFDGGVLNLGEQNILISPA-EG--VICL-AMGSSSQLGISIFGNIQQQET 417
Query: 326 RIVFDRENLKLAWSHSKC 343
++++D +NL +++ ++C
Sbjct: 418 QVLYDLQNLAVSFVPTQC 435
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 78/321 (24%), Positives = 132/321 (41%), Gaps = 42/321 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRS--SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP+ S S N+ C PLC+ C + K C Y Y D +
Sbjct: 187 FDPTKSRSFANIPCGSPLCRRLDYPGCSTKKQICLYQVSYG-----------DGSFTVGE 235
Query: 103 FSKHAP--QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
FS + + V++GCG G ++ A ++GLG G +S PS + + +
Sbjct: 236 FSTETLTFRGTRVGRVVLGCGHDNEGLFVGAAG---LLGLGRGRLSFPSQIGRR--FNSK 290
Query: 161 FSICFDENDS----GSVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGN---S 210
FS C + + S+ FGD A ++T F P+ K D Y+V + +G S
Sbjct: 291 FSYCLGDRSASSRPSSIVFGDS--AISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVS 348
Query: 211 CLTQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
++ S F+ ++DSG S T L Y + F S+ + + + + + C+
Sbjct: 349 GISASLFKLDSTGNGGVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAPEFSLFDTCF 408
Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
+ S + +KVP + L F + ++ +N G FC T IIG
Sbjct: 409 DLSGKTEVKVPTVVLHFRGADVPLPASNYLIPVDNSG--SFCFAFAGTASGLSIIGNIQQ 466
Query: 323 MGHRIVFDRENLKLAWSHSKC 343
G R+V+D ++ ++ C
Sbjct: 467 QGFRVVYDLATSRVGFAPRGC 487
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 68/307 (22%), Positives = 114/307 (37%), Gaps = 22/307 (7%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DPS S++ V C C +C S K C Y Y + + + G L D L L
Sbjct: 230 FDPSQSTTYSAVPCGAQECLDSGTCSSGK--CRYEVVYG-DMSQTDGNLARDTLTL---- 282
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
P S + GCG TG + DG+ GLG VS+ S A FS C
Sbjct: 283 --GPSSDQLQGFVFGCGDDDTGLF---GRADGLFGLGRDRVSLAS--QAAARYGAGFSYC 335
Query: 165 FDE--NDSGSVFFGD-QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC--LTQSGFQA 219
G + G P Q T+ + + Y++ + + + + F+A
Sbjct: 336 LPSSWRAEGYLSLGSAAAPPHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKA 395
Query: 220 ---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMR 276
++DSG T LP+ Y+ + F + + + + CY+ + +++P +
Sbjct: 396 PGTVIDSGTVITRLPSRAYSALRSSFAGFMRRYKRAPALSILDTCYDFTGRTKVQIPSVA 455
Query: 277 LIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKL 336
L+F + + + N D GI+G +V+D N K+
Sbjct: 456 LLFDGGATLNLGFGGVLYVANRSQACLAFASNGDDTSVGILGNMQQKTFAVVYDLANQKI 515
Query: 337 AWSHSKC 343
+ C
Sbjct: 516 GFGAKGC 522
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 75/316 (23%), Positives = 126/316 (39%), Gaps = 34/316 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DPS S+S VSC C+ ++C++ C Y Y + + + G + L L
Sbjct: 28 FDPSLSASYAAVSCDSQRCRDLDTAACRNATGACLYEVAYG-DGSYTVGDFATETLTLG- 85
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S+ +V IGCG G ++ A + G L S PS ++ ++FS
Sbjct: 86 ------DSTPVGNVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----STFS 131
Query: 163 ICFDENDS---GSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCL------ 212
C + DS ++ FGD T+ L + Y+V + +G L
Sbjct: 132 YCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASA 191
Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
T +VDSG + T L + YA + F + S + + + CY+ S
Sbjct: 192 FAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSLFDTCYDLSDR 251
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
++VP + L F + + + P +G +CL T+ IIG G R+
Sbjct: 252 TSVEVPAVSLRFEGGGALRLPAKNYLIPV-DGAGTYCLAFAPTNAAVSIIGNVQQQGTRV 310
Query: 328 VFDRENLKLAWSHSKC 343
FD + ++ +KC
Sbjct: 311 SFDTARGAVGFTPNKC 326
>gi|213998812|gb|ACJ60773.1| nucellin [Hordeum euclaston]
Length = 154
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 65/132 (49%), Gaps = 4/132 (3%)
Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS 170
+ + GCG KQ +P DG++GLG+G + L +I N C
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGK 65
Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTF 229
G ++ GD P ++ T ++P+ E Y G+ I N + + F+A+ DSG+++T
Sbjct: 66 GVLYVGDFNPPSRGVT-WVPMKESLFYYSAGLAELLIDNQPIRGNPTFEAVFDSGSTYTH 124
Query: 230 LPTEIYAEVVVK 241
+P +IY E+V K
Sbjct: 125 VPAQIYNEIVSK 136
>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
Length = 492
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 86/367 (23%), Positives = 141/367 (38%), Gaps = 53/367 (14%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKS------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
+ ++P S++ +V C+ C+ + + C Y Y +++G L +
Sbjct: 132 APFNPVRSTTVADVPCTDDACQQFAPQTCGAGAGAGSSECAYTYMYGGGAANTTGLLGTE 191
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
+ V+ GCG + G + + GV+GLG G++S+ S L
Sbjct: 192 AFTFGD--------TRIDGVVFGCGLQNVGDF---SGVSGVIGLGRGNLSLVSQLQV--- 237
Query: 157 IQNSFSICFDENDS----GSVFFGDQG-PATQQ--STSFLPIGEKYDAYFVGVESYCI-G 208
+ FS F +DS + FGD P T ST L Y+V + + G
Sbjct: 238 --DRFSYHFAPDDSVDTQSFILFGDDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDG 295
Query: 209 NSCLTQSG-FQALVDSGASFTFLP----TEIYAEVVVKFDKLVSSKRISL---QGNSW-- 258
SG F G+ FL + E K + + +I L G++
Sbjct: 296 KDLAIPSGTFDLRNKDGSGGVFLSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGL 355
Query: 259 KYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVM-STDGDYGI 316
CY S KVP M L+F+ + + F G CLT++ S+ GD +
Sbjct: 356 DLCYTGESLAKAKVPSMALVFAGGAVMELELGNYFYMDSTTGLA--CLTILPSSAGDGSV 413
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSN 376
+G +G +++D KL + E + + PPP+G S T QQ+
Sbjct: 414 LGSLIQVGTHMMYDINGSKLVF-----ESLAQAA----APPPSGSSQQTSSKTNQQAGGR 464
Query: 377 GQAAAPP 383
A+APP
Sbjct: 465 RSASAPP 471
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 73/259 (28%), Positives = 104/259 (40%), Gaps = 34/259 (13%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYL 93
D+ ++PS S+S NVSCS C S SS C Y Y + + S G+L
Sbjct: 169 DQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYG-DQSFSVGFL 227
Query: 94 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
+ L S V V GCG G + A G++GLG +S PS A
Sbjct: 228 AKEKFTLT-------NSDVFDGVYFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTAT 277
Query: 154 AGLIQNSFSICFDENDS--GSVFFGDQGPATQQSTSFLPIGEKYD----------AYFVG 201
A FS C + S G + FG G +S F PI D A VG
Sbjct: 278 A--YNKIFSYCLPSSASYTGHLTFGSAG--ISRSVKFTPISTITDGTSFYGLNIVAITVG 333
Query: 202 VESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
+ I ++ + G AL+DSG T LP + YA + F +S + + C
Sbjct: 334 GQKLPIPSTVFSTPG--ALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTC 391
Query: 262 YNASSEEMLKVPDMRLIFS 280
++ S + + +P + FS
Sbjct: 392 FDLSGFKTVTIPKVAFSFS 410
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 62/252 (24%), Positives = 107/252 (42%), Gaps = 32/252 (12%)
Query: 116 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-------- 167
V GCG G + G+ G G G +S+PS L K G +FS CF
Sbjct: 175 VAFGCGLFNNGVFKSNET--GIAGFGRGPLSLPSQL-KVG----NFSHCFTTITGAIPST 227
Query: 168 ---NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQ---- 218
+ +F QG Q+T + Y++ ++ +G++ L +S F
Sbjct: 228 VLLDLPADLFSNGQGAV--QTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNG 285
Query: 219 ---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDM 275
++DSG + T LPT +Y V F V +S +C +A VP +
Sbjct: 286 TGGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKL 345
Query: 276 RLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENL 334
L F + R N++F E+ G ++ CL ++ G+ IG +++D +N
Sbjct: 346 VLHFEGATMDLPRENYVFEV-EDAGSSILCLAIIE-GGEVTTIGNFQQQNMHVLYDLQNS 403
Query: 335 KLAWSHSKCEEV 346
KL++ ++C+++
Sbjct: 404 KLSFVPAQCDKL 415
>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
Length = 461
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 87/351 (24%), Positives = 139/351 (39%), Gaps = 64/351 (18%)
Query: 45 YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSS-GYLVDDIL 98
+ P S + + CS C S ++C + PC Y +Y +D S++ G + D
Sbjct: 125 FRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAY--EYRYKDGSAARGTVGTDSA 182
Query: 99 HLASFSKHAPQSSVQSS---VIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
+A + A + ++ V++GC TG S+L A DGV+ LG +VS S A
Sbjct: 183 TIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFL---ASDGVLSLGYSNVSFASR--AA 237
Query: 155 GLIQNSFSICF-----DENDSGSVFFGDQ------------------GPATQQSTSFLPI 191
FS C N + + FG P +Q T L
Sbjct: 238 ARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRTACAGSAAAPGARQ-TPLLLD 296
Query: 192 GEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVKF- 242
Y V V + L Q G A++DSG S T L + Y VV
Sbjct: 297 HRMRPFYAVAVNGVSVDGELLRIPRLVWDVQKGGGAILDSGTSLTVLVSPAYRAVVAALG 356
Query: 243 DKLVSSKRISLQGNSWKYCYNASS----EEM-LKVPDMRLIFSKNQSFVVRNHIFSFPEN 297
KLV R+++ + + YCYN +S E++ + VP + + F+ + +
Sbjct: 357 KKLVGLPRVAM--DPFDYCYNWTSPLTGEDLAVAVPALAVHFAGSARLQPPPKSYVIDAA 414
Query: 298 EGFTVFCLTVMSTDGDY---GIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
G V C+ + +GD+ +IG H FD +N +L + S+C +
Sbjct: 415 PG--VKCIGLQ--EGDWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRCMQ 461
>gi|414888272|tpg|DAA64286.1| TPA: hypothetical protein ZEAMMB73_677781 [Zea mays]
Length = 118
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 62/118 (52%), Gaps = 5/118 (4%)
Query: 303 FCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVID-KSHVHLVPPPAGQ 361
+CL VM ++G +IG+NFM G ++VFDRE L W + C V + +S++ + P P+G
Sbjct: 3 YCLAVMKSEG-VNLIGENFMSGLKVVFDRERKVLGWKNFDCYSVGNSRSNLPVNPNPSGV 61
Query: 362 SPNPLPTTEQQSTSNGQAAAPPSTAKTA--PSKSIAASAQQLDSVLRVACSLLVLMCL 417
P P + + A+P T PS S + + +VL VA +LL L+ L
Sbjct: 62 PPKPALGPNSYTPEATKGASPNGTQVNVLQPSASFSPKLRCNRNVL-VAAALLFLVIL 118
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 80/313 (25%), Positives = 127/313 (40%), Gaps = 41/313 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLK-DPCPYIADYSTEDTSSSGYLVDDILHLA 101
+DP S+S + +S + C++ RS K C Y Y + +++ G +++ L A
Sbjct: 180 FDPRHSTSYREMSFNAADCQALGRSGGGDAKRGTCVYTVGYG-DGSTTVGDFIEETLTFA 238
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
P+ S IGCG G L GA G++GLG G +S P+ + G +F
Sbjct: 239 G-GVRLPRIS------IGCGHDNKG--LFGAPAAGILGLGRGLMSFPNQIDHNG----TF 285
Query: 162 SICFDENDSG------SVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGN--- 209
S C + SG ++ FG T SF P + Y+V + +G
Sbjct: 286 SYCLVDFLSGPGSLSSTLTFGAGAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRV 345
Query: 210 SCLTQSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKL-VSSKRISLQGNS-- 257
+T+ Q +VDSG + T L Y F + V ++S+ G S
Sbjct: 346 PGVTERDLQLDPYTGRGGVIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGF 405
Query: 258 WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
+ CY M KVP + + F+ + ++ + P + TV + D II
Sbjct: 406 FDTCYTVGGRGMKKVPTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHSVSII 465
Query: 318 GQNFMMGHRIVFD 330
G G RIV+D
Sbjct: 466 GNIQQQGFRIVYD 478
>gi|213998804|gb|ACJ60769.1| nucellin [Hordeum muticum]
gi|213998808|gb|ACJ60771.1| nucellin [Hordeum erectifolium]
gi|213998820|gb|ACJ60777.1| nucellin [Hordeum patagonicum subsp. mustersii]
gi|213998822|gb|ACJ60778.1| nucellin [Hordeum patagonicum subsp. santacrucense]
gi|333069937|gb|AEF13570.1| nucellin, partial [Hordeum pubiflorum]
Length = 154
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 67/138 (48%), Gaps = 4/138 (2%)
Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS 170
+ + GCG KQ +P DG++GLG+G + L +I N C
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGK 65
Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTF 229
G ++ GD P ++ T ++P+ E Y G+ I N + + F+A+ DSG+++T
Sbjct: 66 GVLYVGDFNPPSRGVT-WVPMKESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTH 124
Query: 230 LPTEIYAEVVVKFDKLVS 247
+P +IY E+V K +S
Sbjct: 125 VPAQIYNEIVSKVRGTLS 142
>gi|213998834|gb|ACJ60784.1| nucellin [Hordeum bulbosum]
Length = 154
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 66/132 (50%), Gaps = 4/132 (3%)
Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDS 170
+ + GCG KQ +P DG++GLG+G + L +I +N C
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLRGHKMIKENVIGHCLSSKGK 65
Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTF 229
G ++ GD P T+ T ++P+ E Y G+ I + + F+A+ DSG+++T
Sbjct: 66 GVLYVGDFNPPTRGVT-WVPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTYTH 124
Query: 230 LPTEIYAEVVVK 241
+P +IY+E+V K
Sbjct: 125 VPAQIYSEIVSK 136
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 78/320 (24%), Positives = 126/320 (39%), Gaps = 33/320 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DP SS+ + C+ C + + + C Y DY + + S+G D + L S S
Sbjct: 79 FDPYKSSTYSTLGCNSRQCLNLDVGGCVGNKCLYQVDYG-DGSFSTGEFATDAVSLNSTS 137
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
V + + +GCG G ++ A G+ +S P+ + FS C
Sbjct: 138 GGG--QVVLNKIPLGCGHDNEGYFVGAAGLLGLGKG---PLSFPNQINSEN--GGRFSYC 190
Query: 165 F-----DENDSGSVFFGDQG--PATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCLT- 213
D + S+ FGD PA F P Y++ + +G S LT
Sbjct: 191 LTGRDTDSTERSSLIFGDAAVPPA---GVRFTPQASNLRVSTFYYLKMTGISVGGSILTI 247
Query: 214 -QSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
S FQ ++DSG S T L YA + F S ++ + + + CYN
Sbjct: 248 PTSAFQLDSLGNGGVIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSLFDTCYNL 307
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
S + VP + L F + + P + T FCL T G IIG G
Sbjct: 308 SDLSSVDVPTVTLHFQGGADLKLPASNYLVPVDNSST-FCLAFAGTTGP-SIIGNIQQQG 365
Query: 325 HRIVFDRENLKLAWSHSKCE 344
R+++D + ++ + S+C+
Sbjct: 366 FRVIYDNLHNQVGFVPSQCD 385
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 71/314 (22%), Positives = 125/314 (39%), Gaps = 33/314 (10%)
Query: 47 PSSSSSSKNVSCSHPLCKSRSSCK------SLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
PS S++ N+SCS P C S S C Y Y + + S GY + L L
Sbjct: 176 PSQSTTYSNISCSSPDCSQLESGTGNQPGCSAARACIYGIQYG-DQSFSVGYFAKETLTL 234
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGLIQN 159
S + V + + GCG+ G + A G++GLG +S+ A K G +
Sbjct: 235 TS-------TDVIENFLFGCGQNNRGLFGSAA---GLIGLGQDKISIVKQTAQKYGQV-- 282
Query: 160 SFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYD-AYFVGVE---------SYCIGN 209
FS C + S + + G + + PI + + A F GV+ I +
Sbjct: 283 -FSYCLPKTSSSTGYLTFGGGGGGGALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISS 341
Query: 210 SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
S + SG A++DSG T LP + Y+ + F+K ++ + + + CY+ S
Sbjct: 342 SVFSTSG--AIIDSGTVITRLPPDAYSALKSAFEKGMAKYPKAPELSILDTCYDLSKYST 399
Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 329
+++P + +F + + + + IIG ++V+
Sbjct: 400 IQIPKVGFVFKGGEELDLDGIGIMYGASTSQVCLAFAGNQDPSTVAIIGNVQQKTLQVVY 459
Query: 330 DRENLKLAWSHSKC 343
D K+ + ++ C
Sbjct: 460 DVGGGKIGFGYNGC 473
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 74/332 (22%), Positives = 126/332 (37%), Gaps = 29/332 (8%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ +DP SS++ +SC C S S C + + C Y +Y + + + GY V D
Sbjct: 85 LNFFDPRGSSTASPLSCIDSKCVSSNQISESVCTTDRY-CGYSFEYG-DGSGTLGYYVSD 142
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAG 155
+ ++ + + GC Q+G A DG+ G G D+SV S L G
Sbjct: 143 EFDYNQYVNQYVTNNASAKITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQG 202
Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
L FS C + D G G T+ + PI Y + ++ + L
Sbjct: 203 LAPKIFSHCLEGADPGGGILV-LGEITEPGMVYTPIVPSQPHYNLNLQGIAVNGQQLSID 261
Query: 213 -----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV--SSKRISLQGNSWKYCYNAS 265
T + ++D G + +L E Y V V S++ L+GN C+
Sbjct: 262 PQVFATTNTRGTIIDCGTTLAYLAEEAYEPFVNTIIAAVSQSTQPFMLKGNP---CFLTV 318
Query: 266 SEEMLKVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTV-----MSTD-GDYGIIG 318
P + L F ++++ + V+C+ +TD I+G
Sbjct: 319 HSIDEIFPSVTLYFEGAPMDLKPKDYLIQQLSPDSSPVWCIGWQKSGQQATDSSKMTILG 378
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEVIDKS 350
+ V+D EN ++ W+ C ++ S
Sbjct: 379 DLVLKDKVFVYDLENQRIGWTSFDCSSTVNVS 410
>gi|255563835|ref|XP_002522918.1| nucellin, putative [Ricinus communis]
gi|223537845|gb|EEF39461.1| nucellin, putative [Ricinus communis]
Length = 433
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 76/323 (23%), Positives = 129/323 (39%), Gaps = 32/323 (9%)
Query: 47 PSSSSSSKNVSCSHPLCKSRS--SCKSLKDP--CPYIADYSTEDTSSSGYLVDDILHLAS 102
P S+ V C PLC S + +DP C Y +Y+ + SS G LV D+ L +
Sbjct: 112 PLYRPSNNLVICEDPLCASLQPPGVHNCQDPDQCDYEVEYA-DGGSSLGVLVKDVFVL-N 169
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
F+ + + +GCG Q + DG++GLG G S+PS L+ GL+ N
Sbjct: 170 FTN---GKRLNPLLALGCGYDQLPGRSNHPL-DGILGLGRGISSIPSQLSSQGLVSNVIG 225
Query: 163 ICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY-DAYFVGVESYCIGNSCLTQSGFQALV 221
C G +FFG+ ++ P+ + Y G +
Sbjct: 226 HCLSGRGGGFLFFGED-IYDSSGVTWTPMSRDHLKHYSPGFAELIFDGKSTGIRNLLVVF 284
Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRIS--LQGNSWKYCYNASSEEMLKVPDMRLIF 279
DSG+S+T+L + Y +V + +S K IS L + C+ + D++ F
Sbjct: 285 DSGSSYTYLNAQAYQHLVFSLKRELSRKPISEALDDQTLPLCWKG-KRPFKSIRDVKKYF 343
Query: 280 S------KNQSFVVRNHIFSFPENEGFTVF------CLTVMSTD----GDYGIIGQNFMM 323
K S F F E + + CL +++ D +IG M+
Sbjct: 344 KPFALVFKTSSGRSSKTQFEF-SPEAYLIISSKGNACLGILNGTEVGLRDLNVIGDVSML 402
Query: 324 GHRIVFDRENLKLAWSHSKCEEV 346
++++ E + W+ + C+ +
Sbjct: 403 DRLVIYNNEKQMIGWAAASCDRL 425
>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
Length = 430
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 75/347 (21%), Positives = 149/347 (42%), Gaps = 69/347 (19%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+ +DPS SSS + CSHPLCK R +SC S + C Y Y+ + T + G LV
Sbjct: 111 TSFDPSLSSSFSTLPCSHPLCKPRIPDFTLPTSCDSNRL-CHYSYFYA-DGTFAEGNLVK 168
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
+ + ++ + + +I+GC + + G++G+ G + S +++A
Sbjct: 169 EKITFSN-------TEITPPLILGCATESSDD-------RGILGMNRGRL---SFVSQAK 211
Query: 156 LIQNSFSICFDEND-----SGSVFFGDQG-------------PATQQSTSFLPIGEKYDA 197
+ + S+ I N +GS + GD P +Q+ + P+ Y
Sbjct: 212 ISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLA--YTV 269
Query: 198 YFVGVESYCIGNSCLTQSGF--------QALVDSGASFTFLPTEIY----AEVVVKFDKL 245
+G+ + + ++ S F Q +VDSG+ FT L Y AE++ + +
Sbjct: 270 PMIGIR-FGLKKLNISGSVFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRR 328
Query: 246 VSSKRISLQGNSWKYCYNASSEEMLK-VPDMRLIFSKN-QSFVVRNHIFSFPENEGFTVF 303
+ K+ + G + C++ + + + + D+ +F++ + FV + + N G +
Sbjct: 329 L--KKGYVYGGTADMCFDGNVAMIPRLIGDLVFVFTRGVEIFVPKERVLV---NVGGGIH 383
Query: 304 CLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
C+ + + IIG + FD N ++ ++ + C V+
Sbjct: 384 CVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFAKADCSRVV 430
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 58.5 bits (140), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 76/338 (22%), Positives = 140/338 (41%), Gaps = 48/338 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
YDP +S+S KN++C+ P C SS C+S CPY Y ++ + V+
Sbjct: 202 YDPKTSASFKNITCNDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFT 261
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
+ ++ +++ GCG G + + ++GLG G +S S L L
Sbjct: 262 VNLTTTEGGSSEYKVGNMMFGCGHWNRGLFSGASG---LLGLGRGPLSFSSQL--QSLYG 316
Query: 159 NSFSICF-----DENDSGSVFFGDQGPATQQS----TSFLPIGEK--YDAYFVGVESYCI 207
+SFS C + N S + FG+ + TSF+ E Y++ ++S +
Sbjct: 317 HSFSYCLVDRNSNTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILV 376
Query: 208 GNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGN 256
G L + ++DSG + ++ Y + KF +K+ + I
Sbjct: 377 GGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFP 436
Query: 257 SWKYCYNAS--SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT-----VFCLTVMS 309
C+N S E + +P++ + F V +++FP F + CL ++
Sbjct: 437 VLDPCFNVSGIEENNIHLPELGIAF-------VDGTVWNFPAENSFIWLSEDLVCLAILG 489
Query: 310 T-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
T + IIG I++D + +L ++ +KC ++
Sbjct: 490 TPKSTFSIIGNYQQQNFHILYDTKRSRLGFTPTKCADI 527
>gi|281210961|gb|EFA85127.1| hypothetical protein PPL_02125 [Polysphondylium pallidum PN500]
Length = 601
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 60/268 (22%), Positives = 121/268 (45%), Gaps = 24/268 (8%)
Query: 93 LVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAP---DGVMGLG---LGDVS 146
LV+D + + +S + +V +++ K+ + D P DG+ GL + D +
Sbjct: 165 LVEDTVRIGGYSIDSIFGNVNKILLLAFQYKECPA-PDVYTPRSFDGIFGLSTKVIDDTA 223
Query: 147 VPSLLAKAGL---IQNSFSICFDENDSGSVF-FGDQGPA-TQQSTSFLPIGEKYDAYFVG 201
+L + L + NSFS+CF E+ G F G P + ++P+ + Y Y +
Sbjct: 224 GEDILTQISLKYNLSNSFSLCFGESGYGGQFKIGGYDPELIVEPMRYIPVAKPY-TYNLT 282
Query: 202 VESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVV-VKFDK--LVSSKRISLQGNSW 258
+ IG L + + A +DSG++ +PT +Y ++ ++K L + + S+
Sbjct: 283 ISQVHIGQYKLEHTTYNAWIDSGSASIVIPTPLYNNMINTMYEKFPLAGFQDGAFWNTSF 342
Query: 259 KYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN-----EGFTVFCLTVMSTDGD 313
C +++ P + F + H+ P+N E + L + + D +
Sbjct: 343 P-CAFIDEKDIPNYPKFNISFVDTDGEIF--HLSVLPQNYLVYNEEEKCYELLLRTVDNN 399
Query: 314 YGIIGQNFMMGHRIVFDRENLKLAWSHS 341
Y IIG ++G+ I FD++N ++ ++ +
Sbjct: 400 YFIIGDLGLIGYNIHFDKQNQRIGFAKA 427
>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 82/325 (25%), Positives = 136/325 (41%), Gaps = 35/325 (10%)
Query: 45 YDPSSSSSSKNVSCS-HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
Y S S S K VSC+ H C+ + L C Y Y + +SG L ++ +
Sbjct: 134 YTSSQSKSYKPVSCNQHSFCEPNQCKEGL---CAYNVTYG-PGSYTSGNLANETFTF--Y 187
Query: 104 SKHAPQSSVQSSVIIGCG---RKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-Q 158
S H ++++S + GC R ++L P GV+G+G G S LA+ G I
Sbjct: 188 SNHGKHTALKS-ISFGCSTDSRNMIYAFLLDKNPVSGVLGMGWGPRS---FLAQLGSISH 243
Query: 159 NSFSICFDENDSGSVF--FGDQGPATQ--QSTSFLPI--GEKYDAYFVGVESYCIG---- 208
FS C N++ + + FG ++ Q+T + + Y +G+ +
Sbjct: 244 GKFSYCITANNTHNTYLRFGKHVVKSKNLQTTKIMQVKPSAAYHVNLLGISVNGVKLNIT 303
Query: 209 --NSCLTQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSS----KRISLQGNSWKYC 261
+ + + G + ++D+G T L I+ + +SS KR + C
Sbjct: 304 KTDLAVRKDGSRGCIIDAGTLATLLVKPIFDTLHTALSNHLSSNQNLKRWVIHKLHKDLC 363
Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQN 320
Y S+ K + +N V+ IF F E EG VFCL+++S D IIG
Sbjct: 364 YEQLSDAGRKNLPVVTFHLENADLEVKPEAIFLFREFEGKNVFCLSMLSDD-SKTIIGAY 422
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEE 345
M + V+D + L++ CE+
Sbjct: 423 QQMKQKFVYDTKARVLSFGPEDCEK 447
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 75/320 (23%), Positives = 132/320 (41%), Gaps = 33/320 (10%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+N YDP +SS+ + C C S+ C D C Y Y +++ S G L
Sbjct: 135 QNTPLYDPLNSSTFTLLPCDSQPCTQLPYSQYVCSDYGD-CIYAYTYG-DNSYSYGGLSS 192
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
D + L H S + GCG + + G++GLG G +S+ S L
Sbjct: 193 DSIRLMLLQLH-----YNSKICFGCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDE- 246
Query: 156 LIQNSFSIC---FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNS 210
I + FS C F N + + FG+ P+ K D Y++ +E +G
Sbjct: 247 -IGHKFSYCLLPFSSNSNSKLKFGEAAIVQGNGVVSTPLIIKPDLPFYYLNLEGITVGAK 305
Query: 211 CLT--QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
+ Q+ ++DSG++ T+L Y E V + V+ + + +C+ E
Sbjct: 306 TVKTGQTDGNIIIDSGSTLTYLEESFYNEFVSLVKETVAVEEDQYIPYPFDFCF-TYKEG 364
Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD----YGIIGQ-NFMM 323
M PD+ F+ + + E+ + C TV+ + D +G +GQ +F +
Sbjct: 365 MSTPPDVVFHFTGGDVVLKPMNTLVLIEDN---LICSTVVPSHFDGIAIFGNLGQIDFHV 421
Query: 324 GHRIVFDRENLKLAWSHSKC 343
G +D + K++++ + C
Sbjct: 422 G----YDIQGGKVSFAPTDC 437
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 70/311 (22%), Positives = 120/311 (38%), Gaps = 22/311 (7%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DPS S++ V C C+ S C Y Y + + + G L D L L S
Sbjct: 180 FDPSQSTTYSAVPCGAQECRRLDSGSCSSGKCRYEVVYG-DMSQTDGNLARDTLTLGPSS 238
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS-LLAKAGLIQNSFSI 163
+ +Q + GCG TG + DG+ GLG VS+ S AK G FS
Sbjct: 239 SSSSSDQLQ-EFVFGCGDDDTGLF---GKADGLFGLGRDRVSLASQAAAKYGA---GFSY 291
Query: 164 CFDENDS--GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG--FQ- 218
C + + G + G P + T+ + + Y++ + + + S F+
Sbjct: 292 CLPSSSTAEGYLSLGSAAPPNARFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRT 351
Query: 219 --ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG----NSWKYCYNASSEEMLKV 272
++DSG T LP+ YA + F L+ +R S + + CY+ + +++
Sbjct: 352 PGTVIDSGTVITRLPSRAYAALRSSFAGLM--RRYSYKRAPALSILDTCYDFTGRNKVQI 409
Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRE 332
P + L+F + + + N+ D I+G +V+D
Sbjct: 410 PSVALLFDGGATLNLGFGEVLYVANKSQACLAFASNGDDTSIAILGNMQQKTFAVVYDVA 469
Query: 333 NLKLAWSHSKC 343
N K+ + C
Sbjct: 470 NQKIGFGAKGC 480
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 76/314 (24%), Positives = 124/314 (39%), Gaps = 34/314 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+ P++SSS ++C C S SSC++ + C Y +Y + + + G V + +
Sbjct: 201 FTPAASSSYSPLTCDSQQCNSLQMSSCRNGQ--CRYQVNYG-DGSFTFGDFVTETMSFGG 257
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S +S+ +GCG G + V GL + L + L SFS
Sbjct: 258 -------SGTVNSIALGCGHDNEGLF--------VGAAGLLGLGGGPLSLTSQLKATSFS 302
Query: 163 ICFDENDSG--SVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLT--QSGF 217
C DS S + P + L K D Y+VG+ +G L Q F
Sbjct: 303 YCLVNRDSAASSTLDFNSAPVGDSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVF 362
Query: 218 Q--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
+ +VD G + T L +E Y + F + R + + CY+ S +
Sbjct: 363 KLDDSGDGGVIVDCGTAITRLQSEAYNSLRDSFVSMSRHLRSTSGVALFDTCYDLSGQSS 422
Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 329
+KVP + F +S+ + + P + T +C T IIG G R+ F
Sbjct: 423 VKVPTVSFHFDGGKSWDLPAANYLIPVDSAGT-YCFAFAPTTSSLSIIGNVQQQGTRVSF 481
Query: 330 DRENLKLAWSHSKC 343
D N ++ +S +KC
Sbjct: 482 DLANNRVGFSTNKC 495
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 75/326 (23%), Positives = 128/326 (39%), Gaps = 36/326 (11%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
+N S + P++S+S ++C LC + C Y Y + + S+G V D +
Sbjct: 40 QNDSLFIPNTSTSFTKLACGTELCNGLPYPMCNQTTCVYWYSYG-DGSLSTGDFVYDTIT 98
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
+ + Q + GCG GS+ A DG++GLG G +S PS L +
Sbjct: 99 MDGINGQKQQVP---NFAFGCGHDNEGSF---AGADGILGLGQGPLSFPSQLKT--VFNG 150
Query: 160 SFSICFDE-----NDSGSVFFGDQGPATQQSTSFL-----PIGEKYDAYFVGVESYCIGN 209
FS C + + + FGD T ++ P Y Y+V + +G
Sbjct: 151 KFSYCLVDWLAPPTQTSPLLFGDAAVPTFPGVKYISLLTNPKVPTY--YYVKLNGISVGG 208
Query: 210 SCL--TQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFD-KLVSSKRISLQGNSW 258
L + + F + DSG + T L E++ EV+ + + R S +
Sbjct: 209 KLLNISSTAFDIDSVGRAGTIFDSGTTVTQLAGEVHQEVLAAMNASTMDYPRKSDDSSGL 268
Query: 259 KYCYNASSEEML-KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
C +E L VP M F + ++ F F E+ +C +++S+ D II
Sbjct: 269 DLCLGGFAEGQLPTVPSMTFHFEGGDMELPPSNYFIFLESS--QSYCFSMVSSP-DVTII 325
Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKC 343
G ++ +D K+ + C
Sbjct: 326 GSIQQQNFQVYYDTVGRKIGFVPKSC 351
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 86/350 (24%), Positives = 144/350 (41%), Gaps = 70/350 (20%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRSSCKSL------KDPCPYIADYSTEDTSSSGYLVDD 96
S +DP SSS + C+ P C++R+ S+ K C I Y+ + +S G L D
Sbjct: 99 SVFDPLRSSSYSPIPCTSPTCRTRTRDFSIPVSCDKKKLCHAIISYA-DASSIEGNLASD 157
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAG 155
H+ +S + I GC S D + G++G+ G + S + + G
Sbjct: 158 TFHIG--------NSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSL---SFVTQMG 206
Query: 156 LIQNSFSICFDEND-SGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVES 204
L FS C D SG + FG+ P Q ST LP ++ AY V +E
Sbjct: 207 L--QKFSYCISGQDSSGILLFGESSFSWLKALKYTPLVQISTP-LPYFDRV-AYTVQLEG 262
Query: 205 YCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFD-------KLV 246
+ NS L T +G Q +VDSG FTFL +Y + +F K++
Sbjct: 263 IKVANSMLQLPKSVYAPDHTGAG-QTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVL 321
Query: 247 SSKRISLQGNSWKYCYNA--SSEEMLKVPDMRLIFS-KNQSFVVRNHIFSFPE--NEGFT 301
QG + CY + + +P + L+F S ++ P +
Sbjct: 322 EDPNFVFQG-AMDLCYRVPLTRRTLPPLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDS 380
Query: 302 VFCLTVMSTDGDYGIIG-QNFMMGHR------IVFDRENLKLAWSHSKCE 344
V+C T G+ ++G +++++GH + FD ++ ++ +C+
Sbjct: 381 VYCFTF----GNSELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAEVRCD 426
>gi|413953656|gb|AFW86305.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 406
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 90/197 (45%), Gaps = 18/197 (9%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
Y P+ ++ + + S PLC+ + + C Y Y+ +S Y+ D + +
Sbjct: 204 YRPARTADA--LPASDPLCEG--AQHENPNQCDYEISYADGSSSMGVYVRDSMQFVGEDG 259
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
+ + ++ GCG Q G L+ DGV+GL +S+P+ LA G+I N+F
Sbjct: 260 ERE-----NADIVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGH 314
Query: 164 CFDENDSGS---VFFGDQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCLTQSG-- 216
C + SG+ +F GD + +++PI G D V+ G+ L G
Sbjct: 315 CMSTDPSGAGGYLFLGDDY-IPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKL 373
Query: 217 FQALVDSGASFTFLPTE 233
Q + D+G+++T+ P E
Sbjct: 374 TQVVFDTGSTYTYFPDE 390
>gi|213998816|gb|ACJ60775.1| nucellin [Hordeum patagonicum subsp. patagonicum]
Length = 152
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 67/138 (48%), Gaps = 4/138 (2%)
Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS 170
+ + GCG KQ +P DG++GLG+G + L +I N C
Sbjct: 4 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKVITGNVIGHCLSSKGK 63
Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTF 229
G ++ GD P ++ T ++P+ E Y G+ I N + + F+A+ DSG+++T
Sbjct: 64 GVLYVGDFNPPSRGVT-WVPMKESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTH 122
Query: 230 LPTEIYAEVVVKFDKLVS 247
+P +IY E+V K +S
Sbjct: 123 VPAQIYNEIVSKVRGTLS 140
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 58.2 bits (139), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 57/253 (22%), Positives = 104/253 (41%), Gaps = 24/253 (9%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
D+ + DP++SS+ + C P C++ C Y+ Y + + + G + D
Sbjct: 122 DQGIPLLDPAASSTYAALPCGAPRCRALPFTSCGGRSCVYVYHYG-DKSVTVGKIATDRF 180
Query: 99 HLASFSKHAPQSSVQSS--VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
+ S+ ++ + GCG G + G+ G G G S+PS L
Sbjct: 181 TFGDNGRRNGDGSLPATRRLTFGCGHFNKGVFQSNE--TGIAGFGRGRWSLPSQLNA--- 235
Query: 157 IQNSFSICFD---ENDSGSVFFGDQGPATQ--------QSTSFLPIGEKYDAYFVGVESY 205
SFS CF ++ S V G A ++T + YF+ ++
Sbjct: 236 --TSFSYCFTSMFDSKSSIVTLGGAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGI 293
Query: 206 CIGNSCLT--QSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
+G + L ++ F++ ++DSGAS T LP E+Y V +F V ++G++ C+
Sbjct: 294 SVGKTRLPVPETKFRSTIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDVCF 353
Query: 263 NASSEEMLKVPDM 275
+ + P +
Sbjct: 354 ALPVSALWRRPAV 366
>gi|213998836|gb|ACJ60785.1| nucellin [Hordeum bogdanii]
Length = 154
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 67/138 (48%), Gaps = 4/138 (2%)
Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS 170
+ + GCG KQ +P DG++GLG+G + L +I N C
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGK 65
Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTF 229
G ++ GD P ++ T ++P+ E Y G+ I N + + F+A+ DSG+++T
Sbjct: 66 GVLYVGDFNPPSRGVT-WVPMRESLFYYSPGLAELLIDNQPIGGNPTFEAVFDSGSTYTH 124
Query: 230 LPTEIYAEVVVKFDKLVS 247
+P +IY E+V K +S
Sbjct: 125 VPAQIYNEIVSKVRGTLS 142
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 88/358 (24%), Positives = 145/358 (40%), Gaps = 72/358 (20%)
Query: 37 VQDRNLSEYDPSSSSSSKNVSCSHPLC--------KSR-----SSCKSLKDPCP-YIADY 82
V +S++ P SSS K V C +P C KSR S + D CP Y Y
Sbjct: 174 VDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQY 233
Query: 83 STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL 142
+ T+ G L+ + L L + K P ++GC S + P G+ G G
Sbjct: 234 GSGATA--GILLSETLDLEN--KRVPD------FLVGC------SVMSVHQPAGIAGFGR 277
Query: 143 GDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTS----FLPIGEK---- 194
G S+PS + S FD++ S D G + +S + + P E
Sbjct: 278 GPESLPSQMRLKRFSHCLVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVS 337
Query: 195 ----YDAYFVGVESYCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVV 239
+ Y++ + IG + T +G A++DSG++FTFL I+ +
Sbjct: 338 NAAFREYYYLSLRRILIGGKPVKFPYKYLVPDSTGNG-GAIIDSGSTFTFLDKPIFEAIA 396
Query: 240 VKFDKLV----SSKRISLQGNSWKYCYN-ASSEEMLKVPDMRLIFS--KNQSFVVRNHIF 292
+ +K + +K + Q + + C+N EE + PD+ L F S N++
Sbjct: 397 DELEKQLVKYPRAKDVEAQ-SGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYL- 454
Query: 293 SFPENEGFTVFCLTVMSTDGDY------GIIGQNFMMGHRIV-FDRENLKLAWSHSKC 343
+ +EG V CLT+M+ + II F + +V +D ++ + KC
Sbjct: 455 AMVTDEG--VVCLTMMTDEAVVGGGGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKC 510
>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
Length = 451
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 75/328 (22%), Positives = 123/328 (37%), Gaps = 52/328 (15%)
Query: 44 EYDPSSSSSSKNVSCSHPLCKSRS--SCKS-LKDPCPYIADYSTEDTSS-----SGYLVD 95
+DP+ SS+ + V C P C SC L C + Y+ + + L D
Sbjct: 146 SFDPTRSSTYRPVRCGAPQCSQAPAPSCPGGLGSSCAFNLSYAASTFQALLGQDALALHD 205
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
D+ +A+++ GC TG + P G++G G G +S PS
Sbjct: 206 DVDAVAAYT-------------FGCLHVVTGGSVP---PQGLVGFGRGPLSFPSQTKD-- 247
Query: 156 LIQNSFSICF----DENDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNS 210
+ + FS C N SG++ G G P ++T L + Y+V + +G
Sbjct: 248 VYGSVFSYCLPSYKSSNFSGTLRLGPAGQPKRIKTTPLLSNPHRPSLYYVNMVGIRVGGR 307
Query: 211 CLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
+ SG +VD+G FT L +YA V F V + G +
Sbjct: 308 PVPVPASALAFDPTSGRGTIVDAGTMFTRLSAPVYAAVRDVFRSRVRAPVAGPLGG-FDT 366
Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS-----TDGDYG 315
CYN + + VP + F S + + G + CL + + D
Sbjct: 367 CYNVT----ISVPTVTFSFDGRVSVTLPEENVVIRSSSG-GIACLAMAAGPPDGVDAALN 421
Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
++ HR++FD N ++ +S C
Sbjct: 422 VLASMQQQNHRVLFDVANGRVGFSRELC 449
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 83/349 (23%), Positives = 141/349 (40%), Gaps = 66/349 (18%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCKSRSS----CKSLKDPCPYIADYSTEDTSSSGYLVD 95
R + P+SSS+ + C+ LC+ +S C + C Y Y T+ GYL
Sbjct: 127 RPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATG--CVYYYPYGMGFTA--GYLAT 182
Query: 96 DILHL--ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
+ LH+ ASF A S ++ V G + G++GLG +S L+++
Sbjct: 183 ETLHVGGASFPGVAFGCSTENGV--------------GNSSSGIVGLGRSPLS---LVSQ 225
Query: 154 AGLIQNSFSICF----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----YFVGVES 204
G+ FS C D DS + FG T + P+ E + Y+V +
Sbjct: 226 VGV--GRFSYCLRSDADAGDS-PILFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTG 282
Query: 205 YCIGNSCL----TQSGFQ----------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR 250
+G + L T GF +VDSG + T+L E YA V F +++
Sbjct: 283 ITVGATDLPVTSTTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATAN 342
Query: 251 ISLQGNSWKY----CYNASSE---EMLKVPDMRLIFSKNQSFVVRNH----IFSFPENEG 299
++ N ++ C++A++ + VP + L F+ + VR + +
Sbjct: 343 LTTTVNGTRFGFDLCFDATAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVAVDSQGR 402
Query: 300 FTVFCLTVM--STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
V CL V+ S IIG M +++D + +++ + C V
Sbjct: 403 AAVECLLVLPASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCANV 451
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 79/326 (24%), Positives = 126/326 (38%), Gaps = 42/326 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
+ P +S S + CS CK + ++C S PC Y DY ++ S+ I+
Sbjct: 154 FRPKTSRSWAPIPCSSDTCKLDVPFTLANCSSPASPCTY--DYRYKEGSAG---ARGIVG 208
Query: 100 LASFSKHAPQSSVQ--SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
S + P V V++GC G A DGV+ LG +S + A
Sbjct: 209 TESATIALPGGKVAQLKDVVLGCSSSHDGQSFRSA--DGVLSLGNAKISFAT--QAAARF 264
Query: 158 QNSFSICF-----DENDSGSVFFGD----QGPATQQSTSFLP----IGEKYDAYFVGVES 204
SFS C N +G + FG + PATQ P G K DA V ++
Sbjct: 265 GGSFSYCLVDHLAPRNATGYLAFGPGQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKA 324
Query: 205 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-LVSSKRISLQGNSWKYCYN 263
I ++DSG + T L Y VV K L ++S +++CYN
Sbjct: 325 LDIPAEVWDAKSGGVILDSGNTLTVLAAPAYKAVVAALSKHLDGVPKVSFP--PFEHCYN 382
Query: 264 ASSEEMLK---VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY---GII 317
++ +P + + F+ + + G V C+ V +G++ +I
Sbjct: 383 WTARRPGAPEIIPKLAVQFAGSARLEPPAKSYVIDVKPG--VKCIGVQ--EGEWPGLSVI 438
Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKC 343
G H FD +N+++ + S C
Sbjct: 439 GNIMQQEHLWEFDLKNMQVRFKQSNC 464
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 78/341 (22%), Positives = 139/341 (40%), Gaps = 59/341 (17%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDI 97
+ +DPS SS+ + C+HP+CK R +L C + + + + T + G LV +
Sbjct: 137 ASFDPSLSSTFSTLPCTHPVCKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREK 196
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS----------- 146
+FS+ S +I+GC + T P G++G+ G +S
Sbjct: 197 F---TFSR----SLFTPPLILGCATESTD-------PRGILGMNRGRLSFASQSKITKFS 242
Query: 147 --VPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFL-PIGEKYDAYFVGV 202
VP+ + + G SF + + N + + A Q L P+ AY V +
Sbjct: 243 YCVPTRVTRPGYTPTGSFYLGHNPNSNTFRYIEMLTFARSQRMPNLDPL-----AYTVAL 297
Query: 203 ESYCIGNSCLTQS----------GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KR 250
+ IG L S Q ++DSG+ FT+L E Y +V + + V K+
Sbjct: 298 QGIRIGGRKLNISPAVFRADAGGSGQTMLDSGSEFTYLVNEAYDKVRAEVVRAVGPRMKK 357
Query: 251 ISLQGNSWKYCYNASSEEMLK-VPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVM 308
+ G C++ ++ E+ + + DM F K VV + + + E V C+ +
Sbjct: 358 GYVYGGVADMCFDGNAIEIGRLIGDMVFEFEKGVQIVVPKERVLATVEGG---VHCIGIA 414
Query: 309 STD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
++D IIG + FD N ++ + + C +
Sbjct: 415 NSDKLGAASNIIGNFHQQNLWVEFDLVNRRMGFGTADCSRL 455
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 88/352 (25%), Positives = 150/352 (42%), Gaps = 75/352 (21%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP--------CPYIADYSTEDTSSSGYLV 94
S ++P SSSS + CS P+C++R+ + L +P C I Y+ + +S G L
Sbjct: 76 SVFNPLSSSSYSPIPCSSPVCRTRT--RDLPNPVTCDPKKLCHAIVSYA-DASSLEGNLA 132
Query: 95 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAK 153
D + SS + GC S + A G+MG+ G + S + +
Sbjct: 133 SDNFRIG--------SSALPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSL---SFVTQ 181
Query: 154 AGLIQNSFSICFDEND-SGSVFFGDQ----------GPATQQSTSFLPIGEKYDAYFVGV 202
GL + FS C D SG + FGD P Q ST LP ++ AY V +
Sbjct: 182 LGLPK--FSYCISGRDSSGVLLFGDSHLSWLGNLTYTPLVQISTP-LPYFDRV-AYTVQL 237
Query: 203 ESYCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
+ +GN L T +G Q +VDSG FTFL +Y + +F +
Sbjct: 238 DGIRVGNKILPLPKSIFAPDHTGAG-QTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLA 296
Query: 252 SLQGNSWKY------CYNA-SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT--- 301
L ++ + CY + ++ ++P + L+F + VV + + + G
Sbjct: 297 PLGDPNFVFQGAMDLCYRVPAGGKLPELPAVSLMF-RGAEMVVGGEVLLY-KVPGMMKGK 354
Query: 302 --VFCLTVMSTDGDYGIIG-QNFMMGHR------IVFDRENLKLAWSHSKCE 344
V+CLT ++D ++G + F++GH + FD ++ + ++C+
Sbjct: 355 EWVYCLTFGNSD----LLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRCD 402
>gi|158513711|sp|A2ZC67.2|ASP1_ORYSI RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
Length = 410
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 62/299 (20%), Positives = 123/299 (41%), Gaps = 36/299 (12%)
Query: 73 KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA 132
K+ C Y Y SS G L+ D SFS A + +S+ GCG Q + +
Sbjct: 112 KNQCHYGIQYV--GGSSIGVLIVD-----SFSLPASNGTNPTSIAFGCGYNQGKNNHNVP 164
Query: 133 AP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLP 190
P +G++GLG G V++ S L G+I ++ C G +FFGD T T + P
Sbjct: 165 TPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGKGFLFFGDAKVPTSGVT-WSP 223
Query: 191 IGEKYDAY--FVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS 248
+ ++ Y G + + ++ + + + DSGA++T+ + Y + +S
Sbjct: 224 MNREHKHYSPRQGTLQFNSNSKPISAAPMEVIFDSGATYTYFALQPYHATLSVVKSTLSK 283
Query: 249 K-----RISLQGNSWKYCYNASSEEMLKVPDMRLIFS----------KNQSFVVRNHIFS 293
+ + + + C+ +++ + +++ F K + + +
Sbjct: 284 ECKFLTEVKEKDRALTVCWKG-KDKIRTIDEVKKCFRSLSLKFADGDKKATLEIPPEHYL 342
Query: 294 FPENEGFTVFCLTVMSTDGDY------GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
EG CL ++ ++ +IG M+ +++D E L W + +C+ +
Sbjct: 343 IISQEGHV--CLGILDGSKEHPSLAGTNLIGGITMLDQMVIYDSERSLLGWVNYQCDRI 399
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 86/349 (24%), Positives = 143/349 (40%), Gaps = 70/349 (20%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRSSCKSL------KDPCPYIADYSTEDTSSSGYLVDD 96
S +DP SSS + C+ P C++R+ S+ K C I Y+ + +S G L D
Sbjct: 92 SVFDPLRSSSYSPIPCTSPTCRTRTRDFSIPVSCDKKKLCHAIISYA-DASSIEGNLASD 150
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKAG 155
H+ +S + I GC S D + G++G+ G + S + + G
Sbjct: 151 TFHIG--------NSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSL---SFVTQMG 199
Query: 156 LIQNSFSICFDEND-SGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVES 204
L FS C D SG + FG+ P Q ST LP ++ AY V +E
Sbjct: 200 L--QKFSYCISGQDSSGILLFGESSFSWLKALKYTPLVQISTP-LPYFDRV-AYTVQLEG 255
Query: 205 YCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFD-------KLV 246
+ NS L T +G Q +VDSG FTFL +Y + +F K++
Sbjct: 256 IKVANSMLQLPKSVYAPDHTGAG-QTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVL 314
Query: 247 SSKRISLQGNSWKYCYNA--SSEEMLKVPDMRLIFS-KNQSFVVRNHIFSFPE--NEGFT 301
QG + CY + + +P + L+F S ++ P +
Sbjct: 315 EDPNFVFQG-AMDLCYRVPLTRRTLPPLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDS 373
Query: 302 VFCLTVMSTDGDYGIIG-QNFMMGHR------IVFDRENLKLAWSHSKC 343
V+C T G+ ++G +++++GH + FD ++ ++ +C
Sbjct: 374 VYCFTF----GNSELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAEVRC 418
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 86/355 (24%), Positives = 130/355 (36%), Gaps = 72/355 (20%)
Query: 40 RNLSEYDPSS------SSSSKNVSCSHPLCKSRSSCKSLKDP-CPYIADYST---EDTSS 89
RN S + P++ SS+ C P+C R K + P C + +ST E +
Sbjct: 116 RNCSHHSPATVFFPRHSSTFSPAHCYDPVC--RLVPKPDRAPICNHTRIHSTCHYEYGYA 173
Query: 90 SGYLVDDIL--HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAA---PDGVMGLGLGD 144
G L + S + + + SV GCG + +G + G + +GVMGLG G
Sbjct: 174 DGSLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGP 233
Query: 145 VSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA------- 197
+S S L + N FS C + + TS+L IG D
Sbjct: 234 ISFASQLGRR--FGNKFSYCLMDYT-----------LSPPPTSYLIIGNGGDGISKLFFT 280
Query: 198 -----------YFVGVESYCIGNSCLT----------QSGFQALVDSGASFTFLPTEIYA 236
Y+V ++S + + L +VDSG + FL Y
Sbjct: 281 PLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYR 340
Query: 237 EVVVKFDKLVSSKRISLQGNSWKYCYNASS----EEMLKVPDMRLIFSKNQSFV--VRNH 290
V+ + V + C N S E++L P ++ FS FV RN+
Sbjct: 341 SVIAAVRRRVKLPIADALTPGFDLCVNVSGVTKPEKIL--PRLKFEFSGGAVFVPPPRNY 398
Query: 291 IFSFPENEGFTVFCLTVMSTDGDYG--IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
E + CL + S D G +IG G FDR+ +L +S C
Sbjct: 399 FIETEEQ----IQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGC 449
>gi|37542275|gb|AAK81698.1| aspartyl proteinase [Oryza sativa]
Length = 410
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 62/299 (20%), Positives = 123/299 (41%), Gaps = 36/299 (12%)
Query: 73 KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA 132
K+ C Y Y SS G L+ D SFS A + +S+ GCG Q + +
Sbjct: 112 KNQCHYGIQYV--GGSSIGVLIVD-----SFSLPASNGTNPTSIAFGCGYNQGKNNHNVP 164
Query: 133 AP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLP 190
P +G++GLG G V++ S L G+I ++ C G +FFGD T T + P
Sbjct: 165 TPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGKGFLFFGDAKVPTSGVT-WSP 223
Query: 191 IGEKYDAY--FVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS 248
+ ++ Y G + + ++ + + + DSGA++T+ + Y + +S
Sbjct: 224 MNREHKHYSPRQGTLHFNSNSKPISAAPMEVIFDSGATYTYFALQPYHATLSVVKSTLSK 283
Query: 249 K-----RISLQGNSWKYCYNASSEEMLKVPDMRLIFS----------KNQSFVVRNHIFS 293
+ + + + C+ +++ + +++ F K + + +
Sbjct: 284 ECKFLTEVKEKDRALTVCWKG-KDKIRTIDEVKKCFRSLSLKFADGDKKATLEIPPEHYL 342
Query: 294 FPENEGFTVFCLTVMSTDGDY------GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
EG CL ++ ++ +IG M+ +++D E L W + +C+ +
Sbjct: 343 IISQEGHV--CLGILDGSKEHPSLAGTNLIGGITMLDQMVIYDSERSLLGWVNYQCDRI 399
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 80/328 (24%), Positives = 132/328 (40%), Gaps = 45/328 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DPSSSS+ + CS LC S+C S C Y Y + +S+ G L + LA
Sbjct: 160 FDPSSSSTYSTLPCSSSLCSDLPTSTCTSAAKDCGYTYTYG-DASSTQGVLAAETFTLA- 217
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
+ V GCG G + GA G++GLG G + SL+++ GL F
Sbjct: 218 -------KTKLPGVAFGCGDTNEGDGFTQGA---GLVGLGRGPL---SLVSQLGL--GKF 262
Query: 162 SIC---FDENDSGSVFFGD--------QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNS 210
S C D+ + G A Q+T + + Y+V +++ +G++
Sbjct: 263 SYCLTSLDDTSKSPLLLGSLAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGST 322
Query: 211 C--LTQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
L S F +VDSG S T+L + Y + F +
Sbjct: 323 RIPLPGSAFAVQDDGTGGVIVDSGTSITYLELQGYRPLKKAFAAQMKLPVADGSAVGLDL 382
Query: 261 CYN--ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
C+ AS + ++VP + L F + + ++ CLTVM + G IIG
Sbjct: 383 CFKAPASGVDDVEVPKLVLHFDGGADLDLPAENYMVLDSAS-GALCLTVMGSRG-LSIIG 440
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+ V+D + L+++ +C ++
Sbjct: 441 NFQQQNIQFVYDVDKDTLSFAPVQCAKL 468
>gi|213998830|gb|ACJ60782.1| nucellin [Hordeum pusillum]
Length = 147
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 64/129 (49%), Gaps = 4/129 (3%)
Query: 116 VIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDSGSV 173
+ GCG KQ +P DG++GLG+G + L +I N C G +
Sbjct: 2 IAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVL 61
Query: 174 FFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTFLPT 232
+ GD P ++ T ++P+ E Y G+ I N + + F+A+ DSG+++T +P
Sbjct: 62 YVGDFNPPSRGVT-WVPMKESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPA 120
Query: 233 EIYAEVVVK 241
+IY E+V K
Sbjct: 121 QIYNEIVSK 129
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 55/182 (30%), Positives = 84/182 (46%), Gaps = 22/182 (12%)
Query: 118 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-GSVFFG 176
GCGR G + GA DG++GLG G +S S A + FS C E DS GS+ FG
Sbjct: 258 FGCGRNNEGDFGSGA--DGMLGLGQGQLSTVSQTASK--FKKVFSYCLPEEDSIGSLLFG 313
Query: 177 DQGPATQQSTSFLPIG--------EKYDAYFVGVESYCIGNSCLT--QSGFQA---LVDS 223
++ + S F + E+ YFV + +GN L S F + ++DS
Sbjct: 314 EKATSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVFASPGTIIDS 373
Query: 224 GASFTFLPTEIYAEVVVKFDKLVSSKRIS----LQGNSWKYCYNASSEEMLKVPDMRLIF 279
G T LP Y+ + F K ++ +S +G+ CYN S + + +P++ L F
Sbjct: 374 GTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHF 433
Query: 280 SK 281
+
Sbjct: 434 GE 435
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 84/369 (22%), Positives = 145/369 (39%), Gaps = 55/369 (14%)
Query: 7 FGSHANAYNALLCLPVTTLLW-----CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHP 61
G+ A Y+A+L + L+W CLL D+ +DP++SS+ +++ CS P
Sbjct: 98 IGTPARFYSAILDTG-SDLIWTQCAPCLLCV------DQPTPYFDPANSSTYRSLGCSAP 150
Query: 62 LCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 121
C + + C Y Y + S++G L ++ + + GCG
Sbjct: 151 ACNALYYPLCYQKTCVYQYFYG-DSASTAGVLANETFTFGTNDTRVTLPRIS----FGCG 205
Query: 122 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSVFFG-- 176
GS +G+ G++G G G +S+ S L FS C F ++FG
Sbjct: 206 NLNAGSLANGS---GMVGFGRGSLSLVSQLGSP-----RFSYCLTSFLSPVRSRLYFGAY 257
Query: 177 ----DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-----------TQSGFQALV 221
+T QST F+ YF+ + +G + L T ++
Sbjct: 258 ATLNSTNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTII 317
Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISL---QGNSWKYCYN--ASSEEMLKVPDMR 276
DSG + T+L Y V F ++S L + + C+ + + +P +
Sbjct: 318 DSGTTITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVTLPQLV 377
Query: 277 LIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLK 335
L F + ++N++ P G CL M+T D IIG +++D EN
Sbjct: 378 LHFDGADWELPLQNYMLVDPSTGG---LCL-AMATSSDGSIIGSYQHQNFNVLYDLENSL 433
Query: 336 LAWSHSKCE 344
L++ + C
Sbjct: 434 LSFVPAPCN 442
>gi|213998838|gb|ACJ60786.1| nucellin [Hordeum vulgare subsp. vulgare]
Length = 154
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 63/132 (47%), Gaps = 4/132 (3%)
Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDS 170
+ + GCG KQ +P DG++GLG+G + L +I +N C
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGHKMIKENVIGHCLSSKGK 65
Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTF 229
G ++ GD P T+ T + P+ E Y G+ I + F+A+ DSG+++T
Sbjct: 66 GVLYVGDFNPPTRGVT-WAPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTYTH 124
Query: 230 LPTEIYAEVVVK 241
+P +IY E+V K
Sbjct: 125 VPAQIYNEIVSK 136
>gi|218185383|gb|EEC67810.1| hypothetical protein OsI_35379 [Oryza sativa Indica Group]
Length = 423
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 62/299 (20%), Positives = 123/299 (41%), Gaps = 36/299 (12%)
Query: 73 KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA 132
K+ C Y Y SS G L+ D SFS A + +S+ GCG Q + +
Sbjct: 125 KNQCHYGIQYV--GGSSIGVLIVD-----SFSLPASNGTNPTSIAFGCGYNQGKNNHNVP 177
Query: 133 AP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLP 190
P +G++GLG G V++ S L G+I ++ C G +FFGD T T + P
Sbjct: 178 TPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGKGFLFFGDAKVPTSGVT-WSP 236
Query: 191 IGEKYDAY--FVGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS 248
+ ++ Y G + + ++ + + + DSGA++T+ + Y + +S
Sbjct: 237 MNREHKHYSPRQGTLQFNSNSKPISAAPMEVIFDSGATYTYFALQPYHATLSVVKSTLSK 296
Query: 249 K-----RISLQGNSWKYCYNASSEEMLKVPDMRLIFS----------KNQSFVVRNHIFS 293
+ + + + C+ +++ + +++ F K + + +
Sbjct: 297 ECKFLTEVKEKDRALTVCWKG-KDKIRTIDEVKKCFRSLSLKFADGDKKATLEIPPEHYL 355
Query: 294 FPENEGFTVFCLTVMSTDGDY------GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
EG CL ++ ++ +IG M+ +++D E L W + +C+ +
Sbjct: 356 IISQEGHV--CLGILDGSKEHPSLAGTNLIGGITMLDQMVIYDSERSLLGWVNYQCDRI 412
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 73/352 (20%), Positives = 137/352 (38%), Gaps = 63/352 (17%)
Query: 37 VQDRNLSEYDPSSSSSSKNVSCSHPLCK------SRSSCKSLK-------DPCP-YIADY 82
+ + + P SSSSK V C +P C +S C+S CP Y+ Y
Sbjct: 123 IDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQY 182
Query: 83 STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL 142
+ S++G L+ + L K P + ++GC S+L P G+ G G
Sbjct: 183 GS--GSTAGLLLSETLDFPD--KKIP------NFVVGC------SFLSIHQPSGIAGFGR 226
Query: 143 GDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK-------- 194
G S+PS + S FD++ D ++ P +
Sbjct: 227 GSESLPSQMGLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAY 286
Query: 195 YDAYFVGVESYCIGNSCLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDK 244
+ Y++ + +GN + +++DSG++FTF+ + V +F+K
Sbjct: 287 KEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEK 346
Query: 245 LVSSKRISLQGNS---WKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGF 300
+++ + + + C++ S E+ +K P++ F + + N+ F+ + G
Sbjct: 347 QLANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSG- 405
Query: 301 TVFCLTVMSTDGDYG---------IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
V CLTV++ + G I+G + +D N +L + C
Sbjct: 406 -VACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTC 456
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 87/357 (24%), Positives = 147/357 (41%), Gaps = 78/357 (21%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
++P+ SSS +SCS P C +R+ SC S + C Y+ + +SS G L D
Sbjct: 107 FNPNISSSYTPISCSSPTCTTRTRDFPIPASCDS-NNLCHATLSYA-DASSSEGNLASDT 164
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD----GVMGLGLGDVSVPSLLAK 153
SS ++ GC SY + D G+MG+ LG +S+ S L
Sbjct: 165 FGFG--------SSFNPGIVFGC---MNSSYSTNSESDSNTTGLMGMNLGSLSLVSQLKI 213
Query: 154 AGLIQNSFSICFDEND-SGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGV 202
FS C +D SG + G+ P Q ST LP ++ AY V +
Sbjct: 214 P-----KFSYCISGSDFSGILLLGESNFSWGGSLNYTPLVQISTP-LPYFDR-SAYTVRL 266
Query: 203 ESYCIGNSCLTQSGF----------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS 252
E I + L SG Q + D G F++L +Y + +F + +
Sbjct: 267 EGIKISDKLLNISGNLFVPDHTGAGQTMFDLGTQFSYLLGPVYNALRDEFLNQTNGTLRA 326
Query: 253 LQGNSWKY------CYN--ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGF---- 300
L ++ + CY + E+ ++P + L+F + V + + GF
Sbjct: 327 LDDPNFVFQIAMDLCYRVPVNQSELPELPSVSLVFEGAEMRVFGDQLLY--RVPGFVWGN 384
Query: 301 -TVFCLTVMSTDGDYGIIG-QNFMMGHR------IVFDRENLKLAWSHSKCEEVIDK 349
+V+C T ++D ++G + F++GH + FD ++ +H++C+ V K
Sbjct: 385 DSVYCFTFGNSD----LLGVEAFIIGHHHQQSMWMEFDLVEHRVGLAHARCDLVGQK 437
>gi|213998840|gb|ACJ60787.1| nucellin [Hordeum patagonicum subsp. magellanicum]
Length = 154
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 67/138 (48%), Gaps = 4/138 (2%)
Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS 170
+ + GCG KQ +P DG++GLG+G + L +I N C
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGK 65
Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTF 229
G ++ GD P ++ T ++P+ E Y G+ I N + + F+A+ DSG+++T
Sbjct: 66 GVLYVGDFNPPSRGVT-WVPMKESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTH 124
Query: 230 LPTEIYAEVVVKFDKLVS 247
+P +IY E++ K +S
Sbjct: 125 VPAQIYNEILSKVRGTLS 142
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 90/329 (27%), Positives = 143/329 (43%), Gaps = 73/329 (22%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP--------CPYIADYSTEDTSSSGYLV 94
S ++P SSSS + CS P+C++R+ + L +P C I Y+ + +S G L
Sbjct: 1036 SVFNPLSSSSYSPIPCSSPICRTRT--RDLPNPVTCDPKKLCHAIVSYA-DASSLEGNLA 1092
Query: 95 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAK 153
D + SS + GC S + A G+MG+ G + S + +
Sbjct: 1093 SDNFRIG--------SSALPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSL---SFVTQ 1141
Query: 154 AGLIQNSFSICFDEND-SGSVFFGD----------QGPATQQSTSFLPIGEKYDAYFVGV 202
GL + FS C D SG + FGD P Q ST LP ++ AY V +
Sbjct: 1142 LGLPK--FSYCISGRDSSGVLLFGDLHLSWLGNLTYTPLVQISTP-LPYFDRV-AYTVQL 1197
Query: 203 ESYCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKL------ 245
+ +GN L T +G Q +VDSG FTFL +Y + +F +
Sbjct: 1198 DGIRVGNKILPLPKSIFAPDHTGAG-QTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLA 1256
Query: 246 -VSSKRISLQGNSWKYCYN-ASSEEMLKVPDMRLIFSKNQSFVVRNHI--FSFPE----N 297
+ QG + CY+ A+ ++ +P + L+F + VV + + PE N
Sbjct: 1257 PLGDPNFVFQG-AMDLCYSVAAGGKLPTLPSVSLMF-RGAEMVVGGEVLLYRVPEMMKGN 1314
Query: 298 EGFTVFCLTVMSTDGDYGIIG-QNFMMGH 325
E V+CLT ++D ++G + F++GH
Sbjct: 1315 E--WVYCLTFGNSD----LLGIEAFVIGH 1337
>gi|297852200|ref|XP_002893981.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
gi|297339823|gb|EFH70240.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
Length = 354
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 65/145 (44%), Gaps = 21/145 (14%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
+ +Y P ++ V C P+C ++ C + K+ C Y +Y+ + SS G LV D
Sbjct: 94 IRQYKPKGNT----VPCLDPICLALHFPNKPQCPNPKEQCDYEVNYA-DQGSSMGALVID 148
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD----GVMGLGLGDVSVPSLLA 152
L + S++Q + GCG Q L A P GV+GLG G + V L
Sbjct: 149 QFPLKLLNG----SAMQPRLAFGCGYDQI---LPKAHPPPATAGVLGLGRGKIGVLPQLV 201
Query: 153 KAGLIQNSFSICFDENDSGSVFFGD 177
AGL +N C G +FFGD
Sbjct: 202 AAGLTRNVVGHCLSSKGGGYLFFGD 226
>gi|213998818|gb|ACJ60776.1| nucellin [Hordeum patagonicum subsp. setifolium]
Length = 149
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 66/138 (47%), Gaps = 4/138 (2%)
Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS 170
+ + GCG KQ +P DG++GLG+G + L +I N C
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGK 65
Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTF 229
G ++ GD P ++ T ++P+ E Y G+ I N + F+A+ DSG+++T
Sbjct: 66 GVLYVGDFNPPSRGVT-WVPMKESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTH 124
Query: 230 LPTEIYAEVVVKFDKLVS 247
+P +IY E++ K +S
Sbjct: 125 VPAQIYNEILSKVRGTLS 142
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 76/341 (22%), Positives = 139/341 (40%), Gaps = 46/341 (13%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCKSRS------SCKSLKDPCPYIADYSTEDTSSSGYL 93
+N + YDP +S+S KN++C+ P C S CKS CPY Y ++ +
Sbjct: 192 QNGAFYDPKASASYKNITCNDPRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFA 251
Query: 94 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
V+ + S + + +++ GCG G + A ++GLG G +S S L
Sbjct: 252 VETFTVNLTTSGGSSELYNVENMMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL-- 306
Query: 154 AGLIQNSFSICF-----DENDSGSVFFGDQGPATQQS----TSFLPIGEKY--DAYFVGV 202
L +SFS C D N S + FG+ TSF+ E Y+V +
Sbjct: 307 QSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQI 366
Query: 203 ESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR-I 251
+S + L + ++DSG + ++ Y + K + K +
Sbjct: 367 KSIIVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPV 426
Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT-----VFCLT 306
C+N S + +++P++ + F+ +++FP F + CL
Sbjct: 427 YRDFPILDPCFNVSGIDSIQLPELGIAFADGA-------VWNFPTENSFIWLNEDLVCLA 479
Query: 307 VMST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
++ T + IIG I++D + +L ++ +KC ++
Sbjct: 480 ILGTPKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKCADI 520
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 77/354 (21%), Positives = 138/354 (38%), Gaps = 67/354 (18%)
Query: 37 VQDRNLSEYDPSSSSSSKNVSCSHPLCK------SRSSCKSLK-------DPCP-YIADY 82
+ + + P SSSSK V C +P C +S C+S CP Y+ Y
Sbjct: 123 IDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQY 182
Query: 83 STEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGL 142
+ S++G L+ + L K P + ++GC S+L P G+ G G
Sbjct: 183 GS--GSTAGLLLSETLDFPD--KXIP------NFVVGC------SFLSIHQPSGIAGFGR 226
Query: 143 GDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK-------- 194
G S+PS + S FD++ D ++ P +
Sbjct: 227 GSESLPSQMGLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAY 286
Query: 195 YDAYFVGVESYCIGNSCLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDK 244
+ Y++ + +GN + +++DSG++FTF+ + V +F+K
Sbjct: 287 KEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEK 346
Query: 245 -LVSSKRI----SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENE 298
L + R +L G + C++ S E+ +K P++ F + + N+ F+ +
Sbjct: 347 QLANWTRATDVETLTG--LRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSS 404
Query: 299 GFTVFCLTVMSTDGDYG---------IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
G V CLTV++ + G I+G + +D N +L + C
Sbjct: 405 G--VACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTC 456
>gi|357440767|ref|XP_003590661.1| Basic 7S globulin [Medicago truncatula]
gi|355479709|gb|AES60912.1| Basic 7S globulin [Medicago truncatula]
Length = 500
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 67/276 (24%), Positives = 114/276 (41%), Gaps = 50/276 (18%)
Query: 51 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK-HAPQ 109
S +K+ SC + C + C I D + +++ G L +D+L + S S + Q
Sbjct: 97 SLAKSDSCGDCFSSPKPGCN---NTCGLIPDNTITHSATRGDLAEDVLSIQSTSGFNTGQ 153
Query: 110 SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 169
+ V S + C L G A G+ GLG +++PS LA A + + F+ CF +D
Sbjct: 154 NVVVSRFLFSCAPTSLLRGLAGGA-SGMAGLGRTKIALPSQLASAFIFKRKFAFCFSSSD 212
Query: 170 SGSVFFGDQGPAT--------------QQSTSFLPI-------------GEKYDAYFVGV 202
G + FGD GP + +S ++ P+ GE YF+GV
Sbjct: 213 -GVIIFGD-GPYSFLADNPSLPNVVFDSKSLTYTPLLINHVSTASAFLQGESSVEYFIGV 270
Query: 203 ESYCI-GNSCLTQSGFQALVDSGAS---------FTFLPTEIYAEVVVKFDKLVSSKRIS 252
++ I G S ++ + G +T L IY V F K ++ I+
Sbjct: 271 KTIKIDGKVVSLNSSLLSIDNKGVGGTKISTVDPYTVLEASIYKAVTDAFVKASVARNIT 330
Query: 253 LQGNS--WKYCYN----ASSEEMLKVPDMRLIFSKN 282
+ +S +++CY+ + VP + L+ N
Sbjct: 331 TEDSSPPFEFCYSFDNLPGTPLGASVPTIELLLQNN 366
>gi|226530663|ref|NP_001146528.1| uncharacterized protein LOC100280120 [Zea mays]
gi|219887685|gb|ACL54217.1| unknown [Zea mays]
Length = 292
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 65/264 (24%), Positives = 117/264 (44%), Gaps = 30/264 (11%)
Query: 114 SSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 172
+ ++ GCG Q G L+ DGV+GL +S+P+ LA G+I N+F C + SG+
Sbjct: 21 ADIVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGHCMSTDPSGA 80
Query: 173 ---VFFGDQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCLTQSG--FQALVDSGA 225
+F GD + +++PI G D V+ G+ L G Q + D+G+
Sbjct: 81 GGYLFLGDD-YIPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKLTQVVFDTGS 139
Query: 226 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK---YC----YNASSEEMLK--VPDMR 276
++T+ P E ++ + S + + Q +S K +C + S E +K +
Sbjct: 140 TYTYFPDEALTRLISSLKEAASPRFV--QDDSDKTLPFCMKSDFPVRSVEDVKHFFKPLS 197
Query: 277 LIFSKN----QSFVVRNHIFSFPENEGFTVFCLTVMS-TDGDYG---IIGQNFMMGHRIV 328
L F K ++F +R + ++G CL V++ T Y I+G + G +
Sbjct: 198 LQFEKRFFFSRTFNIRPEHYLVISDKGNV--CLGVLNGTTIGYDSVVIVGDVSLRGKLVA 255
Query: 329 FDRENLKLAWSHSKCEEVIDKSHV 352
+D + ++ W C +S +
Sbjct: 256 YDNDKNEVGWVDFDCTNPRKRSRI 279
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 78/309 (25%), Positives = 132/309 (42%), Gaps = 41/309 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS-RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+DPS S + K + CS C+S R++ S + C Y DY + + S G L + L L S
Sbjct: 133 FDPSKSKTYKTLPCSSNTCESLRNTACSSDNVCEYSIDYG-DGSHSDGDLSVETLTLGST 191
Query: 104 ---SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
S H P++ +IGCG G++ + +G +GLG V + + I
Sbjct: 192 DGSSVHFPKT------VIGCGHNNGGTFQE----EGSGIVGLGGGPVSLISQLSSSIGGK 241
Query: 161 FSICF-----DENDSGSVFFGDQGPATQQSTSFLPI----GEKYDAYFVGVESYCIGNSC 211
FS C + N S + FGD + + T P+ G+ + YF+ +E++ +G++
Sbjct: 242 FSYCLAPIFSESNSSSKLNFGDAAVVSGRGTVSTPLDPLNGQVF--YFLTLEAFSVGDNR 299
Query: 212 L----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
+ ++DSG + T LP E Y + ++ +R C
Sbjct: 300 IEFSGSSSSGSGSGDGNIIIDSGTTLTLLPQEDYLNLESAVSDVIKLERARDPSKLLSLC 359
Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF-PENEGFTVFCLTVMSTDGDYGIIG-Q 319
Y +S+E L +P + F V N I +F P +G F +G + Q
Sbjct: 360 YKTTSDE-LDLPVITAHFKGAD--VELNPISTFVPVEKGVVCFAFISSKIGAIFGNLAQQ 416
Query: 320 NFMMGHRIV 328
N ++G+ +V
Sbjct: 417 NLLVGYDLV 425
>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 485
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 89/367 (24%), Positives = 141/367 (38%), Gaps = 75/367 (20%)
Query: 47 PSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPY----IADYSTEDTSS-----------SG 91
P + +SS +VSC P C + + S D C + T D SS G
Sbjct: 124 PPNITSSASVSCKSPACSAAHTSLSSSDLCAMARCPLELIETSDCSSFSCPPFYYAYGDG 183
Query: 92 YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
LV L+ S S A V + GC G P GV G G G +S+P+ L
Sbjct: 184 SLVAR-LYRDSLSMPASSPLVLHNFTFGCAHTALGE------PVGVAGFGRGVLSLPAQL 236
Query: 152 AK-AGLIQNSFSIC-----FD-----------------ENDSGSVFFGDQGPATQQSTSF 188
A + + N FS C FD +++ D+G T+
Sbjct: 237 ASFSPHLGNQFSYCLVSHSFDADRVRRPSPLILGRYSLDDEKKKRVGHDRGEFVY--TAM 294
Query: 189 LPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEV 238
L + Y VG+E +GN + + +VDSG +FT LP +Y +
Sbjct: 295 LDNPKHPYFYCVGLEGITVGNRKIPVPEILKRVDRRGNGGMVVDSGTTFTMLPAGLYESL 354
Query: 239 VVKFDKLVSS--KRISL--QGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIF 292
V +F+ + KR + + CY S + KVP + L F N + ++ N+ +
Sbjct: 355 VTEFNHRMGRVYKRATQIEERTGLGPCY-YSDDSAAKVPAVALHFVGNSTVILPRNNYYY 413
Query: 293 SF-----PENEGFTVFCLTVMS------TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 341
F + + V CL +M+ + G +G G +V+D E ++ ++
Sbjct: 414 EFFDGRDGQKKKRKVGCLMLMNGGDEAESGGPAATLGNYQQQGFEVVYDLEKHRVGFARR 473
Query: 342 KCEEVID 348
KC + D
Sbjct: 474 KCALLWD 480
>gi|255588450|ref|XP_002534607.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223524923|gb|EEF27776.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 260
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/121 (33%), Positives = 61/121 (50%), Gaps = 12/121 (9%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+ SSS+ + V+C HP C C L+ C Y Y + + S G L +DI+ + S
Sbjct: 93 FQTESSSTYQPVNC-HPSCD----CDYLRSQCSYKMHYG-DGSYSRGVLAEDIISFGNES 146
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
+ APQ ++ GC GS L DG++GLG G ++ L G+I +SFS+C
Sbjct: 147 EFAPQR-----LVFGCELDAIGS-LYSLRADGIIGLGRGRSTIVDQLVDKGVISDSFSLC 200
Query: 165 F 165
+
Sbjct: 201 Y 201
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 79/328 (24%), Positives = 128/328 (39%), Gaps = 39/328 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKD-PCPYIADYSTEDTSSSGYLVDDILHLA 101
+DP S+S + + P C++ RS K C Y Y + +++ G +++ L A
Sbjct: 176 FDPRHSTSYREMGYDAPDCQALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFA 235
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
P S IGCG G + AA G++GLG G +S PS +A G SF
Sbjct: 236 G-GVQVPHMS------IGCGHDNKGLFAAPAA--GILGLGRGQISCPSQIAALGYNVTSF 286
Query: 162 SICFDE--------NDSGSVFFGDQGPATQQSTSFLPIGEK------YDAYFVGVESYCI 207
S C + + S ++ GD A SF P + Y VGV +
Sbjct: 287 SYCLADFFLSSPGRSVSSTLTIGDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGV 346
Query: 208 GNSCLTQSGFQ---------ALVDSGASFTFLPTEIY-AEVVVKFDKLVSSKRISLQGNS 257
+T+ + ++DSG + T L Y A V ++S+ G S
Sbjct: 347 RVPGVTEDDLKLDPYTGRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPS 406
Query: 258 --WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG 315
+ CY M KVP + + F+ + + P + TV + D
Sbjct: 407 GFFDTCYTMGGRAM-KVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDRSVS 465
Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
IIG G R+V++ ++ ++ + C
Sbjct: 466 IIGNIQQQGFRVVYNIGGGRVGFAPNSC 493
>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 460
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 67/246 (27%), Positives = 109/246 (44%), Gaps = 27/246 (10%)
Query: 118 IGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-GSVFFG 176
GCGR G + G+ DG++GLG G +S S A FS C E DS GS+ FG
Sbjct: 224 FGCGRNNKGDF--GSGVDGMLGLGQGQLSTVSQTASK--FNKVFSYCLPEEDSIGSLLFG 279
Query: 177 DQGPATQQSTSF----LPIG----EKYDAYFVGVESYCIGNSCLT--QSGFQA---LVDS 223
++ AT QS+S L G ++ YFV + +GN L S F + ++DS
Sbjct: 280 EK--ATSQSSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVFASPGTIIDS 337
Query: 224 GASFTFLPTEIYAEVVVKFDKLVSSKRIS----LQGNSWKYCYNASSEEMLKVPDMRLIF 279
T LP Y+ + F K ++ +S +G+ CYN S + + +P++ L F
Sbjct: 338 RTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHF 397
Query: 280 SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWS 339
VR + + + CL T + IIG + +++D + ++ +
Sbjct: 398 GGGAD--VRLNGTNIVWGSDASRLCLAFAGTS-ELTIIGNRQQLSLTVLYDIQGRRIGFG 454
Query: 340 HSKCEE 345
+ C +
Sbjct: 455 GNGCSK 460
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 80/322 (24%), Positives = 130/322 (40%), Gaps = 44/322 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP S S +++C PLC S C + K C Y Y D
Sbjct: 168 FDPRKSRSFASIACRSPLCHRLDSPGCNTQKQTCMYQVSYG-----------DGSFTFGD 216
Query: 103 FSKHAP--QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
FS + + + V +GCG G ++ A ++GLG G +S PS + +
Sbjct: 217 FSTETLTFRRTRVARVALGCGHDNEGLFVGAAG---LLGLGRGRLSFPSQTGRR--FNHK 271
Query: 161 FSICFDENDSGS----VFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNS--- 210
FS C + + S + FGD A ++ F P+ K D Y+V + +G +
Sbjct: 272 FSYCLVDRSASSKPSSMVFGDS--AVSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVP 329
Query: 211 CLTQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
+T S F+ ++DSG S T L Y F S+ + + Q + + C+
Sbjct: 330 GITASLFKLDQTGNGGVIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQFSLFDTCF 389
Query: 263 NASSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNF 321
+ S + +KVP + L F + S N++ + F CL T G IIG
Sbjct: 390 DLSGKTEVKVPTVVLHFRGADVSLPASNYLIPVDTSGNF---CLAFAGTMGGLSIIGNIQ 446
Query: 322 MMGHRIVFDRENLKLAWSHSKC 343
G R+V+D ++ ++ C
Sbjct: 447 QQGFRVVYDLAGSRVGFAPHGC 468
>gi|255552245|ref|XP_002517167.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223543802|gb|EEF45330.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 435
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 66/240 (27%), Positives = 98/240 (40%), Gaps = 44/240 (18%)
Query: 46 DPSSSSSSKNVSCSHPLCKSRSS------CKSLKDP------CPYIADYSTEDTSSSGYL 93
D SSS V C LCK S C S P C +I S+SG +
Sbjct: 78 DNYVSSSYTPVRCDSALCKLADSHSCTTECYSSPKPGCYNNTCSHIPYNPVVHVSTSGDI 137
Query: 94 VDDILHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPD--GVMGLGLGDVSVPSL 150
D++ L S P +V +V CG TG L+ A GV GLG G++S+P+
Sbjct: 138 GLDVVSLQSMDGKYPGRNVSVPNVPFVCG---TGFMLENLADGVLGVAGLGRGNISLPAY 194
Query: 151 LAKAGLIQNSFSICFDE--NDSGSVFFGDQ-GPATQQSTSFLPI-------------GEK 194
+ A +Q+ F+IC N SG ++FGD GP + + P+ G+
Sbjct: 195 FSSALGLQSKFAICLSSLTNSSGVIYFGDSIGPLSSDFLIYTPLVRNPVSTAGAYFEGQS 254
Query: 195 YDAYFVGVESYCIGNSCLTQSGFQALVDSGAS----------FTFLPTEIYAEVVVKFDK 244
YF+ V++ +G + + +D+ +T L T IY V+ F K
Sbjct: 255 STDYFIAVKTLRVGGKEIKFNKTLLSIDNEGKGGTRISTVHPYTLLHTSIYKAVIKAFAK 314
>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 488
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 65/236 (27%), Positives = 102/236 (43%), Gaps = 39/236 (16%)
Query: 45 YDPSSSSSSKNVSCSHPLC-----KSRSSCKSLK-----DPCP-YIADYSTEDTSSSGYL 93
+ P +SSSS+ V C +P C KS S+C S D CP Y+ Y + TS G L
Sbjct: 141 FHPKNSSSSRLVGCRNPACRWIHSKSPSTCGSTGNNGNGDVCPPYLVVYGSGSTS--GLL 198
Query: 94 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
+ D L L+ S + + ++ IGC P G+ G G G SVPS L
Sbjct: 199 ISDTLRLSPSSSSSAPAPFRN-FAIGCSIVSVHQ-----PPSGLAGFGRGAPSVPSQLKV 252
Query: 154 AGLIQNSFSICFDEND--SGSVFFGD-QGPATQQSTS--FLPIGEKYDA-------YFVG 201
S FD+N SG + GD PA ++ T+ ++P+ + Y++
Sbjct: 253 PKFSYCLLSRRFDDNSAVSGELVLGDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYYYLA 312
Query: 202 VESYCIG--------NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 249
+ +G + + SG A++DSG +FT+L ++ V + V +
Sbjct: 313 LTGISVGGKPVNLPSRAFVPSSGGGAIIDSGTTFTYLDPTVFKPVAAAMESAVGGR 368
>gi|213998842|gb|ACJ60788.1| nucellin [Hordeum cordobense]
Length = 154
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/138 (28%), Positives = 66/138 (47%), Gaps = 4/138 (2%)
Query: 113 QSSVIIGCGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDS 170
+ + GCG KQ +P DG++GLG+G + L +I N C
Sbjct: 6 KKKIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGK 65
Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGASFTF 229
G ++ GD P ++ T ++P+ E Y G+ I N + + F+ + DSG+++T
Sbjct: 66 GVLYVGDFNPPSRGVT-WVPMKESLFYYSPGLAELLIDNQPIRGNPTFEVVFDSGSTYTH 124
Query: 230 LPTEIYAEVVVKFDKLVS 247
+P +IY E+V K +S
Sbjct: 125 VPAQIYNEIVSKVRGTLS 142
>gi|326520109|dbj|BAK03979.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 73/316 (23%), Positives = 122/316 (38%), Gaps = 34/316 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
++P++SS+ K V C LC + R SC + + C Y Y + + S G + D
Sbjct: 166 FNPNASSTYKVVGCGSALCNAVPSATMARKSCMAPTEGCSYRQSYH-DYSLSVGVVSSDT 224
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
L S+ I GC G G G++G+ + S+ S + G
Sbjct: 225 LTYGLGSQ---------KFIFGCCNLFRGV---GGRYSGILGMSVNKFSLFSQMT-VGHR 271
Query: 158 QNSFSICFDE-NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQ 214
+ S CF + G + FG + + F P+ + YFV V + + L
Sbjct: 272 YRAMSYCFPHPRNQGFLQFG-RYDEHKSLLRFTPLYIDGNNYFVHVSNVMVETMSLDVQS 330
Query: 215 SGFQAL---VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS---EE 268
SG Q + D+G +T LP ++ + LV + ++ + C+ A E
Sbjct: 331 SGNQTMRCFFDTGTPYTMLPQSLFVSLSDTVGNLVEGY-YRVGASTGQTCFQADGNWIEG 389
Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
L +P +++ F + + F E VFCL DG ++G +MG V
Sbjct: 390 DLYMPTVKIEFQNGARITLNSEDLMFMEEP--NVFCLAFKMNDGGDIVLGSRHLMGVHTV 447
Query: 329 FDRENLKLAWSHSKCE 344
D E + + C
Sbjct: 448 VDLEMMTMGLRGQGCN 463
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 82/290 (28%), Positives = 117/290 (40%), Gaps = 34/290 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
YDPS S SS++ +CS P C+ SS + C Y Y + +++SG LV D
Sbjct: 213 YDPSKSRSSESFACSSPTCRQLGPYANGCSSSSNSAGQCQYRVRYP-DGSTTSGTLVADQ 271
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGL 156
L L +P S V GC GS+ + G+M LG G S+ S + K G
Sbjct: 272 LSL------SPTSQV-PKFEFGCSHAARGSF-SRSKTAGIMALGRGVQSLVSQTSTKYGQ 323
Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQST-SFLPIGEKYDAYFVGVESYCIGNSCL--- 212
+ FS CF S FF P S + P+ + Y V +E+ + L
Sbjct: 324 V---FSYCFPPTASHKGFFVLGVPRRSSSRYAVTPMLKTPMLYQVRLEAIAVAGQRLDVP 380
Query: 213 -TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
T A +DS T LP Y + F +S R + CY+ + +
Sbjct: 381 PTVFAAGAALDSRTVITRLPPTAYQALRSAFRDKMSMYRPAAANGQLDTCYDFTGVSSIM 440
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD---YGIIG 318
+P + L+F + + V + P F CL ST GD GIIG
Sbjct: 441 LPTISLVFDRTGAGVQLD-----PSGVLFGS-CLAFASTAGDDRATGIIG 484
>gi|357481195|ref|XP_003610883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512218|gb|AES93841.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 315
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 76/308 (24%), Positives = 126/308 (40%), Gaps = 32/308 (10%)
Query: 57 SCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSS 115
SC PLC K + S + C Y Y +++ + G L D A+F+ + + S
Sbjct: 20 SCDSPLCHKLDTGVCSPEKRCNYTYGYG-DNSLTKGVLAQDT---ATFTSNTGKLVSLSR 75
Query: 116 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI--QNSFSICF-----DEN 168
+ GCG TG + D G++GLG G SL+++ G + FS C D
Sbjct: 76 FLFGCGHNNTGGFNDHEM--GLIGLGGGPT---SLISQIGPLFGGKKFSQCLVPFLTDIK 130
Query: 169 DSGSVFFGDQGPATQQSTSFLPIGEK---YDAYFVGV------ESYCIGNSCLTQSGFQA 219
S + FG P+ ++ +YFV + ++Y NS + +
Sbjct: 131 ISSRMSFGKGSQVLGDGVVTTPLVQREQDMTSYFVTLLGISVEDTYLPMNSTIEKG--NM 188
Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN-SWKYCYNASSEEMLKVPDMRLI 278
LVDSG LP ++Y V V+ V + I+ + + CY + LK P +
Sbjct: 189 LVDSGTPPNILPQQLYDRVYVEVKNNVPLELITNDPSLGPQLCYRTQTN--LKGPTLTYH 246
Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMS-TDGDYGIIGQNFMMGHRIVFDRENLKLA 337
F + F P E VFCL + + T+ + G+ G + I FD + ++
Sbjct: 247 FEGANLLLTPIQTFIPPTPETKGVFCLAINNYTNSNGGVYGNFAQSNYLIGFDLDRQVVS 306
Query: 338 WSHSKCEE 345
+ + C +
Sbjct: 307 FKATDCTK 314
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 77/318 (24%), Positives = 126/318 (39%), Gaps = 43/318 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DP SS+S + C P CKS + C Y Y + + + G + + L
Sbjct: 191 FDPVSSNSYSPIRCDAPQCKSLDLSECRNGTCLYEVSYG-DGSYTVGEFATETVTLG--- 246
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
++ +V IGCG G ++ A G+ G L S P A + SFS C
Sbjct: 247 -----TAAVENVAIGCGHNNEGLFVGAAGLLGLGGGKL---SFP-----AQVNATSFSYC 293
Query: 165 FDENDSGSVF---FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSGFQA 219
DS +V F P + E Y++G++ +G L +S F+
Sbjct: 294 LVNRDSDAVSTLEFNSPLPRNVVTAPLRRNPELDTFYYLGLKGISVGGEALPIPESIFEV 353
Query: 220 --------LVDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNASSE 267
++DSG + T L +E+Y + F K + + +SL + CY+ SS
Sbjct: 354 DAIGGGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSL----FDTCYDLSSR 409
Query: 268 EMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 325
E ++VP + F + + + RN++ + FC T I+G G
Sbjct: 410 ESVQVPTVSFHFPEGRELPLPARNYLIPV---DSVGTFCFAFAPTTSSLSIMGNVQQQGT 466
Query: 326 RIVFDRENLKLAWSHSKC 343
R+ FD N + +S C
Sbjct: 467 RVGFDIANSLVGFSADSC 484
>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
Length = 414
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 79/318 (24%), Positives = 130/318 (40%), Gaps = 40/318 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+ P S++ KNVSC+ P CK + C + Y + +S + LV D + LA +
Sbjct: 117 FAPEKSTTFKNVSCAAPECKQVPNPGCGVSSCNFNLTYGS--SSIAANLVQDTITLA--T 172
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
P S GC K TG+ A P G++GLG G +S+ S L Q++FS C
Sbjct: 173 DPVP------SYTFGCVSKTTGT---SAPPQGLLGLGRGPLSLLS--QTQNLYQSTFSYC 221
Query: 165 FDE----NDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------- 212
N SGS+ G P + T L + Y+V +E+ +G +
Sbjct: 222 LPSFKSLNFSGSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAAL 281
Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
+G + DSG FT L +Y V +F + V K + CYN
Sbjct: 282 AFNPTTGAGTIFDSGTVFTRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYNVP---- 337
Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM----STDGDYGIIGQNFMMGH 325
+ VP + IF+ + +++I + + CL + + + +I H
Sbjct: 338 IVVPTITFIFTGMNVTLPQDNILI--HSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNH 395
Query: 326 RIVFDRENLKLAWSHSKC 343
R+++D N ++ + C
Sbjct: 396 RVLYDVPNSRVGVARELC 413
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 71/319 (22%), Positives = 131/319 (41%), Gaps = 42/319 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
++P SSS + C C+ S +S + C Y Y + +S+ GY+ +
Sbjct: 138 FNPQDSSSFSTLPCESQYCQDLPS-ESCYNDCQYTYGYG-DGSSTQGYMATETFTF---- 191
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
++S ++ GCG G A G++G+G G +S+PS L FS C
Sbjct: 192 ----ETSSVPNIAFGCGEDNQGFGQGNGA--GLIGMGWGPLSLPSQLGVG-----QFSYC 240
Query: 165 FDENDS--------GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIG--NSCLTQ 214
+ S GS G P ST+ + Y++ ++ +G N +
Sbjct: 241 MTSSGSSSPSTLALGSAASGV--PEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPS 298
Query: 215 SGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
S FQ ++DSG + T+LP + Y V F ++ + + C+ S
Sbjct: 299 STFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLSPVDESSSGLSTCFQLPS 358
Query: 267 E-EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV-MSTDGDYGIIGQNFMMG 324
+ ++VP++ + F + ++ P EG V CL + S+ I G
Sbjct: 359 DGSTVQVPEISMQFDGGVLNLGEENVLISPA-EG--VICLAMGSSSQQGISIFGNIQQQE 415
Query: 325 HRIVFDRENLKLAWSHSKC 343
++++D +NL +++ ++C
Sbjct: 416 TQVLYDLQNLAVSFVPTQC 434
>gi|50878437|gb|AAT85211.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 435
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 78/324 (24%), Positives = 125/324 (38%), Gaps = 43/324 (13%)
Query: 53 SKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP-QSS 111
+K+ +C+ C +S L D C +Y+ S+ G ++ D L L + + P +
Sbjct: 102 AKSAACATG-CSGAASPGCLNDTCTGFPEYTITRVSTGGNIITDKLSLYTTCRPMPVPRA 160
Query: 112 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND-S 170
+ CG L GAA G+M L ++P+ +A F++C + S
Sbjct: 161 TAPGFLFTCGATSLTKGL-GAAATGMMSLSRARFALPTQVASIFRFSRKFALCLAPAESS 219
Query: 171 GSVFFGDQ----GPATQQSTSFL-------PI----GEKYDAYFVGVESYCI-GNSCLTQ 214
G V FGD P S S + P+ G+K YF+GV + G +
Sbjct: 220 GVVVFGDAPYEFQPVMDLSKSLIYTPLLVNPVTTTGGDKSTEYFIGVTGIKVNGRAVPLN 279
Query: 215 SGFQALVDSGAS---------FTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN-- 263
+ A+ SG +T L T IY V F + +K CY+
Sbjct: 280 ATLLAIAKSGVGGTKLSMLSPYTVLETSIYKAVTDAFAAETAMIPRVPAVAPFKLCYDGT 339
Query: 264 --ASSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG----- 315
S+ VP + L+ SK S+VV +G C V+ DG
Sbjct: 340 MVGSTRAGPAVPTVELVLQSKAVSWVVFGANSMVATKDG--ALCFGVV--DGGVAPETSV 395
Query: 316 IIGQNFMMGHRIVFDRENLKLAWS 339
+IG + M + + FD E +L ++
Sbjct: 396 VIGGHMMEDNLLEFDLEGSRLGFT 419
>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 430
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 74/347 (21%), Positives = 148/347 (42%), Gaps = 69/347 (19%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+ +DPS SSS + CSHPLCK R +SC S + C Y Y+ + T + G LV
Sbjct: 111 TSFDPSLSSSFSTLPCSHPLCKPRIPDFTLPTSCDSNRL-CHYSYFYA-DGTFAEGNLVK 168
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
+ + ++ + + +I+GC + + G++G+ G + S +++A
Sbjct: 169 EKITFSN-------TEITPPLILGCATESSDD-------RGILGMNRGRL---SFVSQAK 211
Query: 156 LIQNSFSICFDEND-----SGSVFFGDQG-------------PATQQSTSFLPIGEKYDA 197
+ + S+ I N +GS + GD P +Q+ + P+ Y
Sbjct: 212 ISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLA--YTV 269
Query: 198 YFVGVESYCIGNSCLTQSGF--------QALVDSGASFTFLPTEIY----AEVVVKFDKL 245
+G+ + + ++ S F Q +VDSG+ FT L Y AE++ + +
Sbjct: 270 PMIGIR-FGLKKLNISGSVFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRR 328
Query: 246 VSSKRISLQGNSWKYCYNASSEEMLK-VPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVF 303
+ K+ + G + C++ + + + + D+ +F++ +V + + N G +
Sbjct: 329 L--KKGYVYGGTADMCFDGNVAMIPRLIGDLVFVFTRGVEILVPKERVLV---NVGGGIH 383
Query: 304 CLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
C+ + + IIG + FD N ++ ++ + C V+
Sbjct: 384 CVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFAKADCSRVV 430
>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 450
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 73/305 (23%), Positives = 125/305 (40%), Gaps = 28/305 (9%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS---RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
+DPS S + K + CS +C+S SC S K C Y Y + + S G L + L L
Sbjct: 139 FDPSKSKTYKTLPCSSNMCQSVISTPSCSSDKIGCKYTIKYG-DGSHSQGDLSVETLTLG 197
Query: 102 SFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
S + SSVQ + +IGCG G++ + +G G + + G
Sbjct: 198 STNG----SSVQFPNTVIGCGHNNKGTFQGEGSGVVGLGGGPVSLISQLSSSIGGKFSYC 253
Query: 161 FSICFDENDSGSVF-FGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCL---- 212
+ F +++S S FGD + P+ K + Y++ +E++ +G+ +
Sbjct: 254 LAPMFSQSNSSSKLNFGDAAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGDKRIEFVG 313
Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
+ ++DSG + T LP E Y+ + + + R+S N CY +
Sbjct: 314 GSSSSGSSNGEGNIIIDSGTTLTLLPQEDYSNLESAVADAIQANRVSDPSNFLSLCYQTT 373
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPE-NEGFTVFCLTVMSTDGDYGIIGQ-NFMM 323
L VP + F V N I +F + EG F +G + Q N ++
Sbjct: 374 PSGQLDVPVITAHFKGAD--VELNPISTFVQVAEGVVCFAFHSSEVVSIFGNLAQLNLLV 431
Query: 324 GHRIV 328
G+ ++
Sbjct: 432 GYDLM 436
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 76/312 (24%), Positives = 123/312 (39%), Gaps = 30/312 (9%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
YDP+ SSSS SC+ P C + C + + C Y Y + TS++G + D+L +
Sbjct: 175 YDPTKSSSSGVFSCNSPTCTQLGPYANGCTN-NNQCQYRVRYP-DGTSTAGTYISDLLTI 232
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
P ++V+ S GC GS+ G++ G+M LG G S+ S A
Sbjct: 233 ------TPATAVR-SFQFGCSHGVQGSFSFGSSAAGIMALGGGPESLVS--QTAATYGRV 283
Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----YFVGVESYCIGNSCL--- 212
FS CF + FF P L K A Y V +E+ + +
Sbjct: 284 FSHCFPP-PTRRGFFTLGVPRVAAWRYVLTPMLKNPAIPPTFYMVRLEAIAVAGQRIAVP 342
Query: 213 -TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
T A +DS + T LP Y + F ++ + + CY+ +
Sbjct: 343 PTVFAAGAALDSRTAITRLPPTAYQALRQAFRDRMAMYQPAPPKGPLDTCYDMAGVRSFA 402
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 331
+P + L+F KN + + F +G F T D GIIG + ++++
Sbjct: 403 LPRITLVFDKNAAVELDPSGVLF---QGCLAF--TAGPNDQVPGIIGNIQLQTLEVLYNI 457
Query: 332 ENLKLAWSHSKC 343
+ + H+ C
Sbjct: 458 PAALVGFRHAAC 469
>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
Length = 315
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 80/334 (23%), Positives = 142/334 (42%), Gaps = 57/334 (17%)
Query: 50 SSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
SS+ K V+C P+C+ S S+C C Y+ Y + + ++G++ D +F+
Sbjct: 2 SSTFKAVACPDPICRPSSGVSVSACAMENFQCFYLCSYG-DRSITAGHIFKD-----TFT 55
Query: 105 KHAPQSS--VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
+P S + GCG TG ++ + G+ G G G S+PS L K G FS
Sbjct: 56 FMSPNGVPVAVSELAFGCGDYNTGLFVSNES--GIAGFGRGPQSLPSQL-KVG----RFS 108
Query: 163 ICFD---ENDSGSVFFGD-----------QGPATQQSTSFLPIGEKYDAYFVGVESYCIG 208
C E+ S V G GP + P+ + Y++ +E +G
Sbjct: 109 YCLTLVTESKSSVVILGTPPDPDGLRAHTTGPFQSTPIIYNPLIPTF--YYLSLEGITVG 166
Query: 209 NSCLT--QSGFQ--------ALVDSGASFTFLPTEIYA----EVVVKFDKLVSSKRISLQ 254
+ L +S F ++DSG S T LP ++ E+V +F L
Sbjct: 167 KTRLPFDKSVFALKKDGSGGTVIDSGTSLTTLPEAVFELLQEELVAQF-PLPRYDNTPEV 225
Query: 255 GNSWKYCYN-ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD 313
G+ + C+ + + VP + L + + R++ F + G V CL + +
Sbjct: 226 GD--RLCFRRPKGGKQVPVPKLILHLAGADMDLPRDNYFVEEPDSG--VMCLQINGAEDT 281
Query: 314 YGIIGQNFMMGH-RIVFDRENLKLAWSHSKCEEV 346
++ NF + +V+D EN KL ++ ++C+++
Sbjct: 282 TMVLIGNFQQQNMHVVYDVENNKLLFAPAQCDKL 315
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 76/312 (24%), Positives = 122/312 (39%), Gaps = 30/312 (9%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
YDP+ SSSS SC+ P C + C + + C Y Y + TS++G + D+L +
Sbjct: 200 YDPTKSSSSGVFSCNSPTCTQLGPYANGCTN-NNQCQYRVRYP-DGTSTAGTYISDLLTI 257
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
P ++V+ S GC GS+ G++ G+M LG G S+ S A
Sbjct: 258 ------TPATAVR-SFQFGCSHGVQGSFSFGSSAAGIMALGGGPESLVS--QTAATYGRV 308
Query: 161 FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----YFVGVESYCIGNSCL--- 212
FS CF FF P L K A Y V +E+ + +
Sbjct: 309 FSHCFPPPTRRG-FFTLGVPRVAAWRYVLTPMLKNPAIPPTFYMVRLEAIAVAGQRIAVP 367
Query: 213 -TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
T A +DS + T LP Y + F ++ + + CY+ +
Sbjct: 368 PTVFAAGAALDSRTAITRLPPTAYQALRQAFRDRMAMYQPAPPKGPLDTCYDMAGVRSFA 427
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 331
+P + L+F KN + + F +G F T D GIIG + ++++
Sbjct: 428 LPRITLVFDKNAAVELDPSGVLF---QGCLAF--TAGPNDQVPGIIGNIQLQTLEVLYNI 482
Query: 332 ENLKLAWSHSKC 343
+ + H+ C
Sbjct: 483 PAALVGFRHAAC 494
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 66/284 (23%), Positives = 120/284 (42%), Gaps = 28/284 (9%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+++DPS SS+ VSC C++ R++C + C Y+ Y + ++++G L +
Sbjct: 144 TQFDPSRSSTYGRVSCQTDACEALGRATCDDGSN-CAYLYAYG-DGSNTTGVLSTETFTF 201
Query: 101 A-SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
S +P+ V GC GS+ G VS+ + L A +
Sbjct: 202 DDGGSGRSPRQVRVGGVKFGCSTATAGSFPADGLVGLGGGA----VSLVTQLGGATSLGR 257
Query: 160 SFSICF---DENDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYFVGVESYCIGNSCLTQ 214
FS C N S ++ FG T+ + P+ G+ Y V ++S +GN +
Sbjct: 258 RFSYCLVPHSVNASSALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKTVAS 317
Query: 215 SGF-QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM---L 270
+ + +VDSG + TFL + +V + + ++ + + CYN + E+
Sbjct: 318 AASSRIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGE 377
Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV----FCLTVMST 310
+PD+ L F + ++ PEN V CL +++T
Sbjct: 378 SIPDLTLEFGGGAAVALK------PENAFVAVQEGTLCLAIVAT 415
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 80/319 (25%), Positives = 129/319 (40%), Gaps = 39/319 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
+DPS+S S NVSC P C+ S C S C Y Y + + S G+ +
Sbjct: 190 FDPSTSLSYSNVSCDSPSCEKLESATGNSPGCSS--STCLYGIRYG-DGSYSIGFFAREK 246
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGL 156
L L S + V ++ GCG+ G + G A G++GL +S+ S A K G
Sbjct: 247 LSLTS-------TDVFNNFQFGCGQNNRGLF-GGTA--GLLGLARNPLSLVSQTAQKYGK 296
Query: 157 IQNSFSICF--DENDSGSVFFGDQGPATQQSTSFLP--IGEKYDAYF--------VGVES 204
+ FS C + +G + FG G ++ F P + Y +++ VG
Sbjct: 297 V---FSYCLPSSSSSTGYLSFG-SGDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERK 352
Query: 205 YCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
I S + +G ++DSG + LP +Y+ V F +L+S + CY+
Sbjct: 353 LPIPKSVFSTAG--TIIDSGTVISRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDL 410
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
S + +KVP + L FS + + S D + IIG
Sbjct: 411 SKYKTVKVPKIILYFSGGAEMDLAPEGIIYVLKVSQVCLAFAGNSDDDEVAIIGNVQQKT 470
Query: 325 HRIVFDRENLKLAWSHSKC 343
+V+D ++ ++ S C
Sbjct: 471 IHVVYDDAEGRVGFAPSGC 489
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 60/250 (24%), Positives = 104/250 (41%), Gaps = 27/250 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
+DPS S+S N++C+ LC S+ C + C Y Y + + S GY +
Sbjct: 188 FDPSKSTSYSNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQYG-DSSFSVGYFSRER 246
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
L + + + + + + GCG+ G + A G++GLG +S + A +
Sbjct: 247 LSVTA-------TDIVDNFLFGCGQNNQGLFGGSA---GLIGLGRHPISF--VQQTAAVY 294
Query: 158 QNSFSICFDENDS--GSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
+ FS C S G + FG + + T F I Y + + +G + L
Sbjct: 295 RKIFSYCLPATSSSTGRLSFGTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVS 354
Query: 213 --TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 270
T S A++DSG T LP Y + F + +S + + + CY+ S E+
Sbjct: 355 SSTFSTGGAIIDSGTVITRLPPTAYTALRSAFRQGMSKYPSAGELSILDTCYDLSGYEVF 414
Query: 271 KVPDMRLIFS 280
+P + F+
Sbjct: 415 SIPKIDFSFA 424
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 87/330 (26%), Positives = 139/330 (42%), Gaps = 45/330 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCK---SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
YD + SSS V C+ C S +C + PC Y Y + S+G L + L
Sbjct: 135 YDTAVSSSFSPVPCASATCLPIWSSRNCTASSSPCRYRYAYG-DGAYSAGVLGTETLTFP 193
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
AP SV + GCG G + G +GLG G +S L+A+ G+ F
Sbjct: 194 G----APGVSV-GGIAFGCGVDNGGLSYNST---GTVGLGRGSLS---LVAQLGV--GKF 240
Query: 162 SIC----FDENDSGSVFFGD----QGPATQ---QSTSFLPIGEKYDAYFVGVESYCIGNS 210
S C F+ + V FG P+T QST + Y+V +E +G++
Sbjct: 241 SYCLTDFFNTSLGSPVLFGALAELAAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDA 300
Query: 211 CLT----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
L +VDSG +FTFL E VVV V + + +
Sbjct: 301 RLPIPNGTFDLRDDGSGGMIVDSGTTFTFL-VESAFRVVVDHVAGVLRQPVVNASSLDSP 359
Query: 261 CYNASS--EEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVM-STDGDYGI 316
C+ A++ +++ +PDM L F+ + R++ SF + E + FCL + S D I
Sbjct: 360 CFPAATGEQQLPAMPDMVLHFAGGADMRLHRDNYMSFNQEE--SSFCLNIAGSPSADVSI 417
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+G +++FD +L++ + C ++
Sbjct: 418 LGNFQQQNIQMLFDITVGQLSFMPTDCGKL 447
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 84/371 (22%), Positives = 144/371 (38%), Gaps = 57/371 (15%)
Query: 15 NALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRS-----SC 69
N + L + L LL A + + P +SS+ V C+ C+SR +C
Sbjct: 97 NVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRASSTFAAVPCASAQCRSRDLPSPPAC 156
Query: 70 KSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYL 129
C Y+ + +SS G L D+ + S GC S
Sbjct: 157 DGASSRCSVSLSYA-DGSSSDGALATDVFAVGS--------GPPLRAAFGCMSSAFDSSP 207
Query: 130 DGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDSGSVFFGDQGPATQQSTSF 188
DG A G++G+ G +S S + FS C D +D+G + G T ++
Sbjct: 208 DGVASAGLLGMNRGALSFVSQAST-----RRFSYCISDRDDAGVLLLGHSDLPTFLPLNY 262
Query: 189 LPIGEK------YD--AYFVGVESYCIGNSCL-----------TQSGFQALVDSGASFTF 229
P+ + +D AY V + +G L T +G Q +VDSG FTF
Sbjct: 263 TPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAG-QTMVDSGTQFTF 321
Query: 230 LPTEIYAEVVVKFDKLVSSKRISLQGNSWKY------CYN---ASSEEMLKVPDMRLIFS 280
L + Y+ + +F + +L S+ + C+ S ++P + L+F+
Sbjct: 322 LLGDAYSALKAEFTRQARPLLPALDDPSFAFQEAFDTCFRVPQGRSPPTARLPGVTLLFN 381
Query: 281 KNQSFVVRNH-IFSFP--ENEGFTVFCLTVMSTD----GDYGIIGQNFMMGHRIVFDREN 333
+ V + ++ P G V+CLT + D Y +IG + M + +D E
Sbjct: 382 GAEMAVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPIMAY-VIGHHHQMNVWVEYDLER 440
Query: 334 LKLAWSHSKCE 344
++ + +C+
Sbjct: 441 GRVGLAPVRCD 451
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 69/266 (25%), Positives = 109/266 (40%), Gaps = 32/266 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+DP+ SSS V C P+C SSC + + C Y+ Y + + ++G D L L
Sbjct: 184 FDPAQSSSYAAVPCGGPVCGGLGIYASSCSAAQ--CGYVVSYG-DGSKTTGVYSSDTLTL 240
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+P +V+ GCG Q+G + DG++GLG + S+ + AG
Sbjct: 241 ------SPNDAVR-GFFFGCGHAQSGFTGN----DGLLGLGREEASL--VEQTAGTYGGV 287
Query: 161 FSICFDENDSGSVFFGDQGPATQ-----QSTSFLPIGEKYDAYFVGVESYCIGNSCLT-- 213
FS C S + + GP+ +T L Y V + +G L+
Sbjct: 288 FSYCLPTRPSTTGYLTLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVP 347
Query: 214 QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASSEEM 269
S F +VD+G T LP YA + F ++S + CYN S
Sbjct: 348 SSVFAGGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPSAPATGILDTCYNFSGYGT 407
Query: 270 LKVPDMRLIFSKNQSFVV-RNHIFSF 294
+ +P++ L FS + + + I SF
Sbjct: 408 VTLPNVALTFSGGATVTLGADGILSF 433
>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
Length = 486
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 82/354 (23%), Positives = 135/354 (38%), Gaps = 74/354 (20%)
Query: 59 SHPLCKSRSSCKSLKD-------------------PCPYIADYSTEDTSSSGYLVDDILH 99
+ P C S + D PCP A T +G +V IL
Sbjct: 146 ASPFCIDIHSSDNPLDTCTVAGCSLSTLVKATCSRPCPSFA-----YTYGAGGVVTGILT 200
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
+ + V + C +Y + P G+ G G G + S++++ G +Q
Sbjct: 201 RDTLRVNGSSPGVAKEIPKFCFGCVGSAYRE---PIGIAGFGRGTL---SMVSQLGFLQK 254
Query: 160 SFSICF-------DENDSGSVFFGDQGPATQQSTSFLPI--GEKY-DAYFVGVESYCIGN 209
FS CF + N S + GD ++ F P+ Y + Y+VG+E+ +GN
Sbjct: 255 GFSHCFLAFKYANNPNISSPLVVGDIALTSKDDMQFTPMLNSPMYPNFYYVGLEAITVGN 314
Query: 210 SCLTQ-----SGFQAL------VDSGASFTFLPTEIYAEVVVKFDKLVSSKR---ISLQG 255
T+ F +L +DSG ++T LP Y++V+ ++ R + +Q
Sbjct: 315 VSATEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQVLSILQSTINYPRDTGMEMQ- 373
Query: 256 NSWKYCYNA--------SSEEMLKVPDMRLIFSKNQSFVVR--NHIF--SFPENEGFTVF 303
+ CY +S+++L P + F N S V+ NH + S P N V
Sbjct: 374 TGFDLCYKVPRPNNNTLTSDDLL--PSITFHFLNNVSLVLPQGNHFYPVSAPGNPA-VVK 430
Query: 304 CLTVMST----DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVH 353
CL ST DG G+ G +V+D E ++ + C +H
Sbjct: 431 CLMFQSTDDGDDGPAGVFGSFQQQNVEVVYDLEKERIGFQPMDCASAASSQGLH 484
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 72/325 (22%), Positives = 126/325 (38%), Gaps = 34/325 (10%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
+N + + P++S+S ++C LC + C Y Y + + ++G V D +
Sbjct: 50 QNDALFLPNTSTSFTKLACGSALCNGLPFPMCNQTTCVYWYSYG-DGSLTTGDFVYDTIT 108
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
+ + Q + GCG GS+ A DG++GLG G +S S L +
Sbjct: 109 MDGINGQKQQV---PNFAFGCGHDNEGSF---AGADGILGLGQGPLSFHSQLKS--VYNG 160
Query: 160 SFSICFDE-----NDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSC 211
FS C + + + FGD +LPI Y+V + +G++
Sbjct: 161 KFSYCLVDWLAPPTQTSPLLFGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNL 220
Query: 212 LTQS----------GFQALVDSGASFTFLPTEIYAEVVVKFD--KLVSSKRISLQGNSWK 259
L S G + DSG + T L Y EV+ + + S++I +
Sbjct: 221 LNISSTVFDIDSVGGAGTIFDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKID-DISRLD 279
Query: 260 YCYNASSEEML-KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
C + ++ L VP M F + ++ F + E+ +C M++ D IIG
Sbjct: 280 LCLSGFPKDQLPTVPAMTFHFEGGDMVLPPSNYFIYLESS--QSYCF-AMTSSPDVNIIG 336
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKC 343
++ +D KL + C
Sbjct: 337 SVQQQNFQVYYDTAGRKLGFVPKDC 361
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 75/329 (22%), Positives = 128/329 (38%), Gaps = 34/329 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLC-----KSRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDIL 98
YD SSSSS + + C+ C SSC S+K P P Y D S ++G L + +
Sbjct: 72 YDKSSSSSYREIPCTDDECLFLPAPIGSSC-SIKSPSPCDYTYGYSDQSRTTGILAYETI 130
Query: 99 HLASFSK-------HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
+ S + H ++ +V +GC R+ G+ GA+ GV+GLG G +S+ +
Sbjct: 131 SMKSRKRSGKRAGNHKTRTIRIKNVALGCSRESVGASFLGAS--GVLGLGQGPISLATQT 188
Query: 152 AKAGLIQNSFSICFDENDSGSVF--FGDQGPATQQSTSFLPIGEKYDA---YFVGVESYC 206
L FS C + GS F G + + PI A Y+V V
Sbjct: 189 RHTAL-GGIFSYCLVDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVA 247
Query: 207 IGNSCL-----TQSGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 255
+ + + G + DSG + ++L Y++V+ + + R
Sbjct: 248 VDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIP 307
Query: 256 NSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG 315
++ CYN + E +P + + F + + + E L ++T
Sbjct: 308 EGFELCYNVTRMEK-GMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSN 366
Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
I+G H I +D ++ + S C
Sbjct: 367 ILGNLLQQDHHIEYDLAKARIGFKWSPCH 395
>gi|213998802|gb|ACJ60768.1| nucellin [Hordeum murinum subsp. glaucum]
Length = 142
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 63/131 (48%), Gaps = 4/131 (3%)
Query: 120 CGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGD 177
CG KQ +P DG++GLG+G L +I +N C G ++ GD
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGFAVQLKGQKMIKENIIGHCLSSKGKGVLYVGD 60
Query: 178 QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTFLPTEIYA 236
P ++ T ++P+ E Y G+ I N + F+A+ DSG+++T +P IY+
Sbjct: 61 FNPPSRGVT-WVPMRESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPAHIYS 119
Query: 237 EVVVKFDKLVS 247
E+V K +S
Sbjct: 120 EIVSKVRGTLS 130
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 76/340 (22%), Positives = 141/340 (41%), Gaps = 55/340 (16%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDI 97
+ +DPS SS+ + C+HPLCK R +L C + + + + T + G LV +
Sbjct: 111 ASFDPSLSSTFSILPCTHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREK 170
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS----------- 146
+FS+ S +I+GC + T P G++G+ LG +S
Sbjct: 171 F---TFSR----SVSTPPLILGCATESTD-------PRGILGMNLGRLSFAKQSKITKFS 216
Query: 147 --VPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVE 203
VP + G SF + + + G + G + Q+ +F P+ Y VG+
Sbjct: 217 YCVPPRQTRPGFTPTGSFYLGNNPSSKGFKYVGMMTSSRQRMPNFDPLA--YTIPMVGIR 274
Query: 204 --------SYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISL 253
S + + SG Q ++DSG+ FT+L +E Y +V + + V K+ +
Sbjct: 275 IAGKKLNISPAVFRADAGGSG-QTMIDSGSEFTYLVSEAYDKVRAQVVRAVGPRLKKGYV 333
Query: 254 QGNSWKYCYNA--SSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMST 310
G C+++ + E + +M F + V+ + + + + G V C+ + S+
Sbjct: 334 YGGVADMCFDSVKAVEIGRLIGEMVFEFERGVEVVIPKERVLA---DVGGGVHCVGIGSS 390
Query: 311 D---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
D IIG + FD ++ + + C ++
Sbjct: 391 DKLGAASNIIGNFHQQNLWVEFDLVRRRVGFGKADCSRLV 430
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 79/335 (23%), Positives = 129/335 (38%), Gaps = 54/335 (16%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS-----------RSSCKSLKD---PCPYIADYSTEDTSSS 90
+DPSSS S V C+ C + ++C+ C Y Y + + S
Sbjct: 193 FDPSSSPSYAAVPCNSSSCDALQLATGGTSGGAAACQGQDQSAAACSYTLSYR-DGSYSR 251
Query: 91 GYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS-VPS 149
G L D L LA V + GCG G G + G+MGLG +S V
Sbjct: 252 GVLAHDRLSLAG--------EVIDGFVFGCGTSNQGPPFGGTS--GLMGLGRSQLSLVSQ 301
Query: 150 LLAKAGLIQNSFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----YFVG 201
+ + G + FS C + + SGS+ GD + ST + D YFV
Sbjct: 302 TMDQFGGV---FSYCLPLKESDSSGSLVIGDDSSVYRNSTPIVYASMVSDPLQGPFYFVN 358
Query: 202 VESYCIGNSCL-------TQSGFQALVDSGASFTFLPTEIY----AEVVVKFDKLVSSKR 250
+ +G + G +A++DSG T L IY AE + +F + +
Sbjct: 359 LTGITVGGQEVESSGFSSGGGGGKAIIDSGTVITSLVPSIYNAVKAEFLSQFAEYPQAPG 418
Query: 251 ISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST 310
S+ C+N + ++VP ++L+F V + + + + CL +
Sbjct: 419 FSI----LDTCFNMTGLREVQVPSLKLVFDGGVEVEVDSGGVLYFVSSDSSQVCLAMAPL 474
Query: 311 DGDY--GIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
+Y IIG R++FD ++ ++ C
Sbjct: 475 KSEYETNIIGNYQQKNLRVIFDTSGSQVGFAQETC 509
>gi|291002744|gb|ADD71504.1| xyloglucanase inhibitor 2 [Humulus lupulus]
Length = 445
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 61/240 (25%), Positives = 100/240 (41%), Gaps = 43/240 (17%)
Query: 87 TSSSGYLVDDILHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAP--DGVMGLGLG 143
TS+SG L DI+ + S + P V +VI CG L+G A G+ GLG
Sbjct: 129 TSTSGELAQDIISIQSTNGSNPSKVVSFPNVIFTCGST---FLLEGLASGVTGIAGLGRK 185
Query: 144 DVSVPSLLAKAGLIQNSFSICFDEND--SGSVFFGDQGPA-------TQQSTSFLPI--- 191
+++PS A A + F++C + +G VFFGD GP Q+ + P+
Sbjct: 186 KIALPSQFAAAFSFKRKFALCLSSSTRATGVVFFGD-GPYIMLPNKDVSQNLIYTPLILN 244
Query: 192 ----------GEKYDAYFVGVESYCI-GNSCLTQSGFQALVDSGAS---------FTFLP 231
GE YF+GV+ + G + ++ G +T L
Sbjct: 245 PVSTAGASFEGEPSADYFIGVKGIKVNGEDVKLNTSLLSIAKDGTGGTKISTTQPYTSLE 304
Query: 232 TEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK----VPDMRLIFSKNQSFVV 287
T IY V+ F K V+ ++ C+N++S + VP + L+ N+++ +
Sbjct: 305 TSIYKAVIGAFGKAVAKVPRVTAVAPFELCFNSTSFSSTRVGPGVPQIDLVLPNNKAWTI 364
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 79/315 (25%), Positives = 130/315 (41%), Gaps = 43/315 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+DPS SS+ SCS C + + C S C YI Y+ + +S++G D L L
Sbjct: 173 FDPSLSSTYSPFSCSSAACAQLGQDGNGCSS-SSQCQYIVRYA-DGSSTTGTYSSDTLAL 230
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQN 159
S + S+ GC ++G + D DG+MGLG G PSL ++ AG
Sbjct: 231 GS--------NTISNFQFGCSHVESG-FND--LTDGLMGLGGG---APSLASQTAGTFGT 276
Query: 160 SFSICFDENDSGSVFFG-DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSG 216
+FS C S S F G + T L Y V +E+ +G + L+ S
Sbjct: 277 AFSYCLPPTPSSSGFLTLGAGTSGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSV 336
Query: 217 FQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 274
F A ++DSG T LP Y+ + F + R + + C++ S + +++P
Sbjct: 337 FSAGMVMDSGTIITRLPRTAYSALSSAFKAGMKQYRPAPPRSIMDTCFDFSGQSSVRLPS 396
Query: 275 MRLIFSK------NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
+ L+FS + + ++ + +F N S D GI+G ++
Sbjct: 397 VALVFSGGAVVNLDANGIILGNCLAFAAN-----------SDDSSPGIVGNVQQRTFEVL 445
Query: 329 FDRENLKLAWSHSKC 343
+D + + C
Sbjct: 446 YDVGGGAVGFKAGAC 460
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 82/317 (25%), Positives = 128/317 (40%), Gaps = 37/317 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS-CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+DPS SS+ V C P C + C C Y+ Y + +S++G L D L L S
Sbjct: 194 FDPSKSSTYAAVHCGEPQCAAAGGLCSEDNTTCLYLVHYG-DGSSTTGVLSRDTLALTS- 251
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
S + GCG + G + DG++GLG G++S+PS A + FS
Sbjct: 252 ------SRALAGFPFGCGTRNLGDF---GRVDGLLGLGRGELSLPSQAAAS--FGAVFSY 300
Query: 164 CFDENDSGSVFFG-DQGPATQ----QSTSFLPIGEKYDAYFVGVESYCIGNSCL------ 212
C ++S + + PAT Q T+ L + YFV + S IG L
Sbjct: 301 CLPSSNSTTGYLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAV 360
Query: 213 -TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
T+ G L+DSG T+LP + Y + +F + + + CY+ + E +
Sbjct: 361 FTRGG--TLLDSGTVLTYLPAQAYELLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVI 418
Query: 272 VPDMRLIFSKNQSFVVR--NHIFSFPENEGFTVFCLTVMSTDGD---YGIIGQNFMMGHR 326
VP + F F + + EN G CL + D IIG
Sbjct: 419 VPAVSFRFGDGAVFELDFFGVMIFLDENVG----CLAFAAMDAGGLPLSIIGNTQQRSAE 474
Query: 327 IVFDRENLKLAWSHSKC 343
+++D K+ + + C
Sbjct: 475 VIYDVAAEKIGFVPASC 491
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 34/125 (27%), Positives = 55/125 (44%), Gaps = 2/125 (1%)
Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLI 278
++DSG S T L +Y V F R++ G S + CY+ ++KVP + +
Sbjct: 339 ILDSGTSVTRLARPVYVAVREAFRAAAGGLRLAPGGFSLFDTCYDLRGRRVVKVPTVSVH 398
Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
+ + + P + T FCL + TDG I+G G R+VFD + ++A
Sbjct: 399 LAGGAEVALPPENYLIPVDTRGT-FCLALAGTDGGVSIVGNIQQQGFRVVFDGDRQRVAL 457
Query: 339 SHSKC 343
C
Sbjct: 458 VPKSC 462
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 83/341 (24%), Positives = 147/341 (43%), Gaps = 63/341 (18%)
Query: 47 PSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
P+ SS+ + C+ C+ SR + C Y +Y+ ++GYL + L +
Sbjct: 137 PARSSTFSRLPCNGSFCQYLPTSSRPRTCNATAACAY--NYTYGSGYTAGYLATETLTVG 194
Query: 102 --SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
+F K V GC T + +D ++ G++GLG G +S+ S LA
Sbjct: 195 DGTFPK----------VAFGC---STENGVDNSS--GIVGLGRGPLSLVSQLAVG----- 234
Query: 160 SFSICF--DENDSGS--VFFGDQGPATQ----QSTSFL--PIGEKYDAYFVGVESYCIGN 209
FS C D D G+ + FG T+ QST L P ++ Y+V + + +
Sbjct: 235 RFSYCLRSDMADGGASPILFGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDS 294
Query: 210 SCL---------TQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 258
+ L TQ+G +VDSG + T+L + YA V F +++ + +
Sbjct: 295 TELPVTGSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGA 354
Query: 259 KY----CYNASS---EEMLKVPDMRLIFSKNQSF--VVRNHIFSF-PENEG-FTVFCLTV 307
Y CY S+ + ++VP + L F+ + V+N+ +++G TV CL V
Sbjct: 355 PYDLDLCYKPSAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLV 414
Query: 308 MSTDGDY--GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+ D IIG M +++D + +++ + C ++
Sbjct: 415 LPATDDLPISIIGNLMQMDMHLLYDIDGGMFSFAPADCAKL 455
>gi|213998832|gb|ACJ60783.1| nucellin [Hordeum vulgare subsp. spontaneum]
Length = 127
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 62/125 (49%), Gaps = 4/125 (3%)
Query: 120 CGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGD 177
CG KQ +P DG++GLG+G + + L +I +N C G ++ GD
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKENVIGHCLSSKGKGVLYVGD 60
Query: 178 QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTFLPTEIYA 236
P T+ T ++P+ E Y G+ I + F+A+ DSG+++T +P +IY
Sbjct: 61 FNPPTRGVT-WVPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTYTHVPAQIYN 119
Query: 237 EVVVK 241
E+V K
Sbjct: 120 EIVSK 124
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 65/256 (25%), Positives = 102/256 (39%), Gaps = 43/256 (16%)
Query: 116 VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDS 170
V GCG GS+ AA GV+GLG G +S S + A N F+ C + S
Sbjct: 174 VAFGCGSDNQGSF---AAAGGVLGLGQGPLSFGSQVGYA--YGNKFAYCLVNYLDPTSVS 228
Query: 171 GSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCL--TQSGFQ------- 218
S+ FGD+ +T + PI + Y+V +E +G L + S ++
Sbjct: 229 SSLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLLGNG 288
Query: 219 -ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI-SLQGNSWKYCYNASSEEMLKVPDMR 276
++ DSG + T+ Y+ ++ FD V R S+QG C + + P
Sbjct: 289 GSIFDSGTTLTYWFPSAYSHILAAFDSGVHYPRAESVQG--LDLCVELTGVDQPSFPSFT 346
Query: 277 LIFSKNQSFVVRNHIFSFPENEGF------TVFCLT---VMSTDGDYGIIGQNFMMGHRI 327
+ F F PE E + V CL + S G + IG +
Sbjct: 347 IEFDDGAVFQ--------PEAENYFVDVAPNVRCLAMAGLASPLGGFNTIGNLLQQNFFV 398
Query: 328 VFDRENLKLAWSHSKC 343
+DRE + ++ +KC
Sbjct: 399 QYDREENLIGFAPAKC 414
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 83/335 (24%), Positives = 142/335 (42%), Gaps = 34/335 (10%)
Query: 24 TLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYS 83
T L CL G + ++ +DP SSS VSC C+ + C Y +Y
Sbjct: 21 TWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQLLDEAGCNVNSCIYKVEYG 80
Query: 84 TEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLG 143
+ + + G L + L S+ ++ IGCG G ++ ++GLG G
Sbjct: 81 -DGSFTIGELATETLTFV-------HSNSIPNISIGCGHDNEGLFVGADG---LIGLGGG 129
Query: 144 DVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGD--QGPATQQSTSFLPIGEKYDAY-FV 200
+S+ S L + SFS C + DS S D P + S L +++ ++ +V
Sbjct: 130 AISISSQLKAS-----SFSYCLVDIDSPSFSTLDFNTDPPSDSLISPLVKNDRFPSFRYV 184
Query: 201 GVESYCIGNSCL---------TQSGFQAL-VDSGASFTFLPTEIYAEVVVKFDKLVSSKR 250
V +G L +SG + VDSG + T LP+++Y + F L ++
Sbjct: 185 KVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEVLREAFLGLTTNLP 244
Query: 251 ISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVM 308
+ + + + CY+ SS+ ++VP + I S + +N + + + FCL +
Sbjct: 245 PAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLI---QVDSAGTFCLAFV 301
Query: 309 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
S IIG G R+ +D N + +S +KC
Sbjct: 302 SATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336
>gi|156065227|ref|XP_001598535.1| hypothetical protein SS1G_00624 [Sclerotinia sclerotiorum 1980]
gi|154691483|gb|EDN91221.1| hypothetical protein SS1G_00624 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 482
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 77/328 (23%), Positives = 133/328 (40%), Gaps = 42/328 (12%)
Query: 69 CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG---CGRKQT 125
C + PC YS +S+ YL D A V + IG + Q
Sbjct: 107 CSERRSPCQTAGTYSANSSSTYAYLASDFNISYVDGSGASGDYVTDTFTIGSTTLDKLQF 166
Query: 126 GSYLDGAAPDGVMGLG--LGDVSV-----------PSLLAKAGLIQ-NSFSICFDENDS- 170
G ++P+G++G+G + +V V P+ + GLI N+FS+ ++ DS
Sbjct: 167 GIGYTSSSPEGILGIGYEINEVQVGRARKSAYKNLPAQMVADGLINSNAFSLWLNDLDSS 226
Query: 171 -GSVFFGDQGPATQQST-SFLPIGEK---YDAYFVGVESYCIGNSCLTQ-SGFQALVDSG 224
GSV FG A LPI ++ Y + + + +GN + Q L+DSG
Sbjct: 227 TGSVLFGGVDTARYHGQLETLPIQKESGYYAEFLITLTEVTLGNLVIAQDQSLAVLLDSG 286
Query: 225 ASFTFLP----TEIYAEVVVKFDKLVSSKRI--SLQGNSWKYCYNASSEEMLKVPDMRLI 278
+S T+LP IY +V ++D + + SL NS + +S + D L+
Sbjct: 287 SSLTYLPDAMAEAIYEQVDAQYDYSEGAAYVPCSLASNSSALNFTFTSPTIQVTMD-ELV 345
Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-YGIIGQNFMMGHRIVFDRENLKLA 337
S F + T CL ++ G+ ++G F+ +V+D N +++
Sbjct: 346 IPVTSS---NGQQLRFTDG---TAACLFGIAPAGESTAVLGDTFIRSAYVVYDLANNEIS 399
Query: 338 WSHSKCEEVIDKSHVHLVPPPAGQSPNP 365
+ + + + ++V G S P
Sbjct: 400 LAQTN----FNATATNVVEITTGTSAVP 423
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 75/314 (23%), Positives = 124/314 (39%), Gaps = 32/314 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCK------SLKDPCPYIADYSTEDTSSSGYLVDDIL 98
+DP +SSS VSCS P C S+ S + C Y A Y + + S GYL D +
Sbjct: 160 FDPKTSSSYAAVSCSSPQCDGLSTATLNPAVCSPSNVCIYQASYG-DSSFSVGYLSKDTV 218
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
+ S + GCG+ G + A G+MGL +S+ L A +
Sbjct: 219 SFGANSV--------PNFYYGCGQDNEGLFGRSA---GLMGLARNKLSL--LYQLAPTLG 265
Query: 159 NSFSICF-DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 217
SFS C + SG + G P T + YF+ + + L S
Sbjct: 266 YSFSYCLPSTSSSGYLSIGSYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSS 325
Query: 218 Q-----ALVDSGASFTFLPTEIYAEV--VVKFDKLVSSKRISLQGNSWKYCYNASSEEML 270
+ ++DSG T LPT +Y + V S+KR + + C+ + ++
Sbjct: 326 EYTSLPTIIDSGTVITRLPTSVYTALSKAVAAAMKGSTKRAAAY-SILDTCFEGQASKLR 384
Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
VP + + FS + + + +G T CL + IIG +V+D
Sbjct: 385 AVPAVSMAFSGGATLKLSAGNL-LVDVDGATT-CL-AFAPARSAAIIGNTQQQTFSVVYD 441
Query: 331 RENLKLAWSHSKCE 344
++ ++ ++ + C
Sbjct: 442 VKSNRIGFAAAGCS 455
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 80/325 (24%), Positives = 129/325 (39%), Gaps = 51/325 (15%)
Query: 57 SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD------ILHLASFSKHAPQS 110
+CS L S S+C + PC Y DY +D S++ V + +S SK+ +
Sbjct: 165 TCSKSLPFSLSTCPTPGSPCAY--DYRYKDGSAARGTVGTESATIALSSSSSSSKNKVKK 222
Query: 111 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----- 165
+ +++GC TG + A DGV+ LG +VS S A FS C
Sbjct: 223 AKLQGLVLGCTGSYTGPSFE--ASDGVLSLGYSNVSFAS--HAASRFGGRFSYCLVDHLS 278
Query: 166 DENDSGSVFFGDQ-----------GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
N + + FG GP +Q T + Y V +++ + L
Sbjct: 279 PRNATSYLTFGPNSALSGPCPAAAGPGARQ-TPLVLDSRMRPFYDVSIKAISVDGELLKI 337
Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNAS 265
G +VDSG S T L Y VV KL R+++ + ++YCYN +
Sbjct: 338 PRDVWEVDGGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLARFPRVAM--DPFEYCYNWT 395
Query: 266 S----EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY---GIIG 318
S +E +P + + F+ + + + G V C+ V +G + +IG
Sbjct: 396 SPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDAAPG--VKCIGVQ--EGPWPGISVIG 451
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKC 343
H FD +N +L + S+C
Sbjct: 452 NILQQEHLWEFDLKNRRLRFKRSRC 476
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 83/341 (24%), Positives = 147/341 (43%), Gaps = 63/341 (18%)
Query: 47 PSSSSSSKNVSCSHPLCK-----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
P+ SS+ + C+ C+ SR + C Y +Y+ ++GYL + L +
Sbjct: 137 PARSSTFSRLPCNGSFCQYLPTSSRPRTCNATAACAY--NYTYGSGYTAGYLATETLTVG 194
Query: 102 --SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
+F K V GC T + +D ++ G++GLG G +S+ S LA
Sbjct: 195 DGTFPK----------VAFGC---STENGVDNSS--GIVGLGRGPLSLVSQLAVG----- 234
Query: 160 SFSICF--DENDSGS--VFFGDQGPATQ----QSTSFL--PIGEKYDAYFVGVESYCIGN 209
FS C D D G+ + FG T+ QST L P ++ Y+V + + +
Sbjct: 235 RFSYCLRSDMADGGASPILFGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDS 294
Query: 210 SCL---------TQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 258
+ L TQ+G +VDSG + T+L + YA V F +++ + +
Sbjct: 295 TELPVTGSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGA 354
Query: 259 KY----CYNASS---EEMLKVPDMRLIFSKNQSF--VVRNHIFSF-PENEG-FTVFCLTV 307
Y CY S+ + ++VP + L F+ + V+N+ +++G TV CL V
Sbjct: 355 PYDLDLCYKPSAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLV 414
Query: 308 MSTDGDY--GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+ D IIG M +++D + +++ + C ++
Sbjct: 415 LPATDDLPISIIGNLMQMDMHLLYDIDGGMFSFAPADCAKL 455
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 91/320 (28%), Positives = 134/320 (41%), Gaps = 42/320 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSR--SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
Y+P+ SSS K V C LC+ S C S C Y Y + + + G + L L
Sbjct: 187 YNPALSSSYKLVGCQANLCQQLDVSGC-SRNGSCLYQVSYG-DGSYTQGNFATETLTLGG 244
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA-KAGLIQNSF 161
AP +V IGCG G ++ A ++GLG G +S PS L + G I F
Sbjct: 245 ----APLQNV----AIGCGHDNEGLFVGAAG---LLGLGGGSLSFPSQLTDENGKI---F 290
Query: 162 SICFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLTQS-- 215
S C + DS S + FG + + + D Y+V + +G L+ S
Sbjct: 291 SYCLVDRDSESSSTLQFGRAAVPNGAVLAPMLKNSRLDTFYYVSLSGISVGGKMLSISDS 350
Query: 216 --GFQA------LVDSGASFTFLPTEIYAEVVVKF----DKLVSSKRISLQGNSWKYCYN 263
G A +VDSG + T L T Y + F L S+ +SL + CY+
Sbjct: 351 VFGIDASGNGGVIVDSGTAVTRLQTAAYDSLRDAFRAGTKNLPSTDGVSL----FDTCYD 406
Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 323
SS+E + VP + FS S + + P + FC T I+G
Sbjct: 407 LSSKESVDVPTVVFHFSGGGSMSLPAKNYLVPVDS-MGTFCFAFAPTSSSLSIVGNIQQQ 465
Query: 324 GHRIVFDRENLKLAWSHSKC 343
G R+ FDR N ++ ++ +KC
Sbjct: 466 GIRVSFDRANNQVGFAVNKC 485
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 79/317 (24%), Positives = 129/317 (40%), Gaps = 40/317 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP SSSS ++ C C++ S C++ K C Y Y + + + G V + L +
Sbjct: 197 FDPRSSSSFASLPCESQQCQALETSGCRASK--CLYQVSYG-DGSFTVGEFVTETLTFGN 253
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S + + V +GCG G + V GL + L + + +SFS
Sbjct: 254 -------SGMINDVAVGCGHDNEGLF--------VGSAGLLGLGGGPLSLTSQMKASSFS 298
Query: 163 ICFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------ 213
C + DS S + F P+ + L G+ Y+VG+ +G L+
Sbjct: 299 YCLVDRDSSSSSDLEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLF 358
Query: 214 ---QSGFQAL-VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNASS 266
SG+ + VDSG + T L T+ Y + D VS + N + CY+ SS
Sbjct: 359 QMDDSGYGGIIVDSGTAITRLQTQAYNTLR---DAFVSRTPYLKKTNGFALFDTCYDLSS 415
Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
+ + +P + F+ +S + + P + T FC T IIG G R
Sbjct: 416 QSRVTIPTVSFEFAGGKSLQLPPKNYLIPVDSVGT-FCFAFAPTTSSLSIIGNVQQQGTR 474
Query: 327 IVFDRENLKLAWSHSKC 343
+ +D N + +S KC
Sbjct: 475 VHYDLANSVVGFSPHKC 491
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 82/333 (24%), Positives = 132/333 (39%), Gaps = 52/333 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+ P S+S + + C+ LC SC+ D C Y +Y + T + G + AS
Sbjct: 138 FAPGQSASYEPMRCAGTLCSDILHHSCER-PDTCTYRYNYG-DGTMTVGVYATERFTFAS 195
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S ++ + GCG GS +G+ G++G G +S+ S L+ FS
Sbjct: 196 -SGGGGLTTTTVPLGFGCGSVNVGSLNNGS---GIVGFGRNPLSLVSQLSI-----RRFS 246
Query: 163 ICFDENDS--------GSVFFGDQGPATQ--QSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
C S GS+ G G AT Q+T L + Y+V +G L
Sbjct: 247 YCLTSYASRRQSTLLFGSLSDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRL 306
Query: 213 T--QSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN------ 256
+S F +VDSG + T LP + AEVV F + + + GN
Sbjct: 307 RIPESAFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAFRQQLRLP-FANGGNPEDGVC 365
Query: 257 -----SWKYCYNASSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMST 310
+W+ +SS + VP M L F + RN++ ++ CL + +
Sbjct: 366 FLVPAAWR---RSSSTSQMPVPRMVLHFQGADLDLPRRNYVL---DDHRRGRLCLLLADS 419
Query: 311 DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
D IG R+++D E L+ + ++C
Sbjct: 420 GDDGSTIGNLVQQDMRVLYDLEAETLSIAPARC 452
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 80/320 (25%), Positives = 129/320 (40%), Gaps = 40/320 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DPS S S + C PLC+ S C + C Y Y + + + + +
Sbjct: 172 FDPSKSKSFAGIPCYSPLCRRLDSPGCSLKNNLCQYQVSYGDGSFTFGDFSTETL----T 227
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
F + A V IGCG G ++ A ++GLG G +S P+ N FS
Sbjct: 228 FRRAA-----VPRVAIGCGHDNEGLFVGAAG---LLGLGRGGLSFPT--QTGTRFNNKFS 277
Query: 163 ICFDENDS----GSVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNS---CL 212
C + + S+ FGD A ++ F P+ K D Y+V + +G + +
Sbjct: 278 YCLTDRTASAKPSSIVFGDS--AVSRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGI 335
Query: 213 TQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
+ S F+ ++DSG S T L Y + F S + + + + + CY+
Sbjct: 336 SASFFRLDSTGNGGVIIDSGTSVTRLTRPAYVSLRDAFRVGASHLKRAPEFSLFDTCYDL 395
Query: 265 SSEEMLKVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 323
S +KVP + L F + S N++ +N G FC T IIG
Sbjct: 396 SGLSEVKVPTVVLHFRGADVSLPAANYLVPV-DNSG--SFCFAFAGTMSGLSIIGNIQQQ 452
Query: 324 GHRIVFDRENLKLAWSHSKC 343
G R+VFD ++ ++ C
Sbjct: 453 GFRVVFDLAGSRVGFAPRGC 472
>gi|449458942|ref|XP_004147205.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449505000|ref|XP_004162350.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 480
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 80/324 (24%), Positives = 122/324 (37%), Gaps = 54/324 (16%)
Query: 67 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 126
S C S P Y Y+ D S L D L L + + P + + GC G
Sbjct: 165 SECSSFSCPPFY---YAYGDGSLVARLYRDSLSLPTPAPSPPINV--RNFTFGCAHTTLG 219
Query: 127 SYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFDENDSGS--------VFFGD 177
P GV G G G +S+PS LA + + N FS C + + + G
Sbjct: 220 E------PVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADRVRRPSPLILGR 273
Query: 178 --QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF----------QALVDSGA 225
G TS L + Y VG+ +GN + F +VDSG
Sbjct: 274 YYTGETEFIYTSLLENPKHPYFYSVGLAGISVGNIRIPAPEFLTKVDEGGSGGVVVDSGT 333
Query: 226 SFTFLPTEIYAEVVVKFD----KLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 281
+FT LP +Y VV +F+ K+ + R + CY E + VP + L F
Sbjct: 334 TFTMLPAGLYESVVAEFENRTGKVANRARRIEENTGLSPCY--YYENSVGVPRVVLHFVG 391
Query: 282 NQSFVV---RNHIFSFPE------NEGFTVFCLTVMS-------TDGDYGIIGQNFMMGH 325
+S VV +N+ + F + V CL +M+ G +G G
Sbjct: 392 EKSNVVLPRKNYFYEFLDGGDGVVGRKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGF 451
Query: 326 RIVFDRENLKLAWSHSKCEEVIDK 349
+V+D E ++ ++ +C + D
Sbjct: 452 EVVYDLEKNRVGFARRQCSTLWDN 475
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 86/345 (24%), Positives = 140/345 (40%), Gaps = 60/345 (17%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+ +DP+ S+S + + CS P C +R+ SC S + C Y+ + +SS G L
Sbjct: 67 TTFDPTRSTSYQTIPCSSPTCTNRTQDFPIPASCDS-NNLCHATLSYA-DASSSDGNLAS 124
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLD-GAAPDGVMGLGLGDVSVPSLLAKA 154
D+ H+ SS S ++ GC S D + G+MG+ G +S S L
Sbjct: 125 DVFHIG--------SSDISGLVFGCMDSVFSSNSDEDSKSTGLMGMNRGSLSFVSQLGFP 176
Query: 155 GLIQNSFSICFDEND-SGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVE 203
FS C D SG + G+ P Q ST LP ++ AY V +E
Sbjct: 177 -----KFSYCISGTDFSGLLLLGESNLTWSVPLNYTPLIQISTP-LPYFDRV-AYTVQLE 229
Query: 204 SYCIGNSCL--TQSGF--------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 253
+ + L +S F Q +VDSG FTFL +Y + F SS L
Sbjct: 230 GIKVLDKLLPIPKSTFEPDHTGAGQTMVDSGTQFTFLLGPVYNALRSAFLNQTSSVLRVL 289
Query: 254 QGNSWKY------CYNA--SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE---GFTV 302
+ + + CY S + +P + L+F + V + + E +V
Sbjct: 290 EDPDFVFQGAMDLCYLVPLSQRVLPLLPTVTLVFRGAEMTVSGDRVLYRVPGELRGNDSV 349
Query: 303 FCLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
CL+ ++D + +IG + + FD E ++ + +C+
Sbjct: 350 HCLSFGNSDLLGVEAYVIGHHHQQNVWMEFDLEKSRIGLAQVRCD 394
>gi|213998814|gb|ACJ60774.1| nucellin [Hordeum cf. pusillum GP-2003]
Length = 142
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 61/125 (48%), Gaps = 4/125 (3%)
Query: 120 CGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDSGSVFFGD 177
CG KQ +P DG++GLG+G + L +I N C G ++ GD
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVLYVGD 60
Query: 178 QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTFLPTEIYA 236
P ++ T ++P+ E Y G+ I N + F+A+ DSG+++T +P +IY
Sbjct: 61 FNPPSRGVT-WVPMKESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTHVPAQIYN 119
Query: 237 EVVVK 241
E+V K
Sbjct: 120 EIVSK 124
>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 445
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 80/348 (22%), Positives = 141/348 (40%), Gaps = 72/348 (20%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDI 97
+ +DPS SSS + C+HPLCK R +L C + + + + T + G LV +
Sbjct: 124 ASFDPSLSSSFYVLPCTHPLCKPRVPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREK 183
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
L + S +I+GC + + G++G+ LG +S P AK
Sbjct: 184 LAFSP-------SQTTPPLILGCSSESRDA-------RGILGMNLGRLSFP-FQAKV--- 225
Query: 158 QNSFSICF------DEND--SGSVFFGDQG-------------PATQQSTSFLPIGEKYD 196
FS C + N+ +GS + G+ P +Q+ + P+
Sbjct: 226 -TKFSYCVPTRQPANNNNFPTGSFYLGNNPNSARFRYVSMLTFPQSQRMPNLDPL----- 279
Query: 197 AYFVGVESYCIGNSCLT-----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKL 245
AY V ++ IG L SG Q +VDSG+ FTFL Y V + ++
Sbjct: 280 AYTVPMQGIRIGGRKLNIPPSVFRPNAGGSG-QTMVDSGSEFTFLVDVAYDRVREEIIRV 338
Query: 246 VSS--KRISLQGNSWKYCYNASSEEMLK-VPDMRLIFSKNQSFVV-RNHIFSFPENEGFT 301
+ K+ + G C++ ++ E+ + + D+ F K VV + + + + G
Sbjct: 339 LGPRVKKGYVYGGVADMCFDGNAMEIGRLLGDVAFEFEKGVEIVVPKERVLA---DVGGG 395
Query: 302 VFCLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
V C+ + ++ IIG + FD N ++ + + C +
Sbjct: 396 VHCVGIGRSERLGAASNIIGNFHQQNLWVEFDLANRRIGFGVADCSRL 443
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 55.5 bits (132), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 73/329 (22%), Positives = 128/329 (38%), Gaps = 34/329 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCK-----SRSSCK-SLKDPCPYIADYSTEDTSSSGYLVDDIL 98
YD SSSSS + + C+ C+ SSC + PC Y YS + + ++G L + +
Sbjct: 104 YDKSSSSSYREIPCTDDECQFLPAPIGSSCSITSPSPCDYTYGYS-DQSRTTGILAYETI 162
Query: 99 HLASFSK-------HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
+ S + H + +V +GC R+ G+ GA+ GV+GLG G +S+ +
Sbjct: 163 SMKSRKRSGKRAGNHKTRRIRIKNVALGCSRESVGASFLGAS--GVLGLGQGPISLATQT 220
Query: 152 AKAGLIQNSFSICFDENDSGS--VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYC 206
L FS C + GS F G + + PI A Y+V V
Sbjct: 221 RHTAL-GGIFSYCLVDYLRGSNASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVA 279
Query: 207 IGNSCL-----TQSGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 255
+ + + G + DSG + ++L Y++V+ + + R
Sbjct: 280 VDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIP 339
Query: 256 NSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG 315
++ CYN + E +P + + F + + + E L ++T
Sbjct: 340 EGFELCYNVTRMEK-GMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSN 398
Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
I+G H I +D ++ + S C
Sbjct: 399 ILGNLLQQDHHIEYDLAKARIGFKWSPCH 427
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 81/329 (24%), Positives = 131/329 (39%), Gaps = 45/329 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DPSSSS+ V CS LC S+C S C Y Y + +S+ G L + L
Sbjct: 142 FDPSSSSTYATVPCSSALCSDLPTSTCTSASK-CGYTYTYG-DASSTQGVLASETFTLGK 199
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
K P V GCG G + GA G++GLG G + SL+++ GL + F
Sbjct: 200 EKKKLP------GVAFGCGDTNEGDGFTQGA---GLVGLGRGPL---SLVSQLGL--DKF 245
Query: 162 SICFDENDSGS----VFFGDQGPATQ--------QSTSFLPIGEKYDAYFVGVESYCIGN 209
S C D G + G A Q+T + + Y+V + +G+
Sbjct: 246 SYCLTSLDDGDGKSPLLLGGSAAAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGS 305
Query: 210 SCLT--QSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK 259
+ +T S F +VDSG S T+L + Y + F ++ +
Sbjct: 306 TRITLPASAFAIQDDGTGGVIVDSGTSITYLELQGYRALKKAFVAQMALPTVDGSEIGLD 365
Query: 260 YCYN--ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
C+ A + ++VP + L F + + ++ CLTV + G II
Sbjct: 366 LCFQGPAKGVDEVQVPKLVLHFDGGADLDLPAENYMVLDSAS-GALCLTVAPSRG-LSII 423
Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
G + V+D L+++ +C ++
Sbjct: 424 GNFQQQNFQFVYDVAGDTLSFAPVQCNKL 452
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 67/298 (22%), Positives = 118/298 (39%), Gaps = 26/298 (8%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DP+ SS+ NVSC+ P C + C Y Y + + S G+ D L L+S+
Sbjct: 223 FDPARSSTYANVSCAAPACSDLNIHGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY- 280
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSI 163
GCG + G + + A G++GLG G S+P K G + F+
Sbjct: 281 ------DAVKGFRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAH 328
Query: 164 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA----YFVGVESYCIGNSCLT--QSGF 217
C +G+ + + +++ L D Y+VG+ +G L+ QS F
Sbjct: 329 CLPARSTGTGYLDFGAGSLAAASARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVF 388
Query: 218 Q---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKV 272
+VDSG T LP Y+ + F ++++ + + + CY+ + + +
Sbjct: 389 ATAGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAI 448
Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
P + L+F V + + GD GI+G + + +D
Sbjct: 449 PTVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYD 506
>gi|449449755|ref|XP_004142630.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449500674|ref|XP_004161165.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 413
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 79/319 (24%), Positives = 129/319 (40%), Gaps = 45/319 (14%)
Query: 56 VSCSHPLCKSRSSC-----KSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 110
VS PLC + SS K+ D C Y +Y+ + SS G LV D++ + +
Sbjct: 103 VSREDPLCAALSSLGKFIFKNPNDQCAYEVEYA-DHGSSVGVLVKDLVPM----RLTNGK 157
Query: 111 SVQSSVIIGCGRKQ-TGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND 169
+ ++ GCG Q G + GV+GL ++ S L+ G + N C
Sbjct: 158 RISPNLGFGCGYDQENGDLQQPPSIAGVLGLSSSKATIVSQLSDLGHVSNVVGHCLTGRG 217
Query: 170 SGSVFFG-DQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLTQSGFQALVDSGASF 227
G +FFG D P++ S+ PI + Y G + G DSG+S+
Sbjct: 218 GGFLFFGGDVVPSS--GMSWTPILRNSEGKYSSGPAEVYFNGRAVGIGGLTLTFDSGSSY 275
Query: 228 TFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML--------KVPDMRLIF 279
T+ +++Y + +KL+ + L+GN K + + E+ V D+R F
Sbjct: 276 TYFNSQVYRAI----EKLLKN---DLKGNPLKLASDDKTLELCWKGPKPFESVVDVRNFF 328
Query: 280 ---------SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD----GDYGIIGQNFMMGHR 326
SKN F + + F CL ++ G+ IIG M+
Sbjct: 329 KPLAMSFKNSKNVQFQIPPEAYLIISE--FGNVCLGILDGSKEGMGNVNIIGDISMLNKI 386
Query: 327 IVFDRENLKLAWSHSKCEE 345
+V+D E ++ W+ S C
Sbjct: 387 VVYDNERERIGWASSNCNR 405
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 75/317 (23%), Positives = 120/317 (37%), Gaps = 34/317 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTED-TSSSGYLVDDILHLASF 103
+DP SS+ VSC+ C S +S C Y DY D +S+SG L S
Sbjct: 122 FDPVKSSTYDTVSCASNFCSSLP-FQSCTTSCKY--DYMYGDGSSTSGAL--------ST 170
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
+ +V GCG GS+ A G++GLG G +S+ S + + FS
Sbjct: 171 ETVTVGTGTIPNVAFGCGHTNLGSF---AGAAGIVGLGQGPLSLIS--QASSITSKKFSY 225
Query: 164 CF---DENDSGSVFFGDQGPATQQSTSFLPIGEK----YDAYFVGVE------SYCIGNS 210
C + + GD A + + L Y A G+ +Y +G
Sbjct: 226 CLVPLGSTKTSPMLIGDSAAAGGVAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTF 285
Query: 211 CLTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
+ SG + DSG + T+L T + +V V YC++ +
Sbjct: 286 SIDASGQGGFILDSGTTLTYLETGAFNALVAALKAEVPFPEADGSLYGLDYCFSTAGVAN 345
Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF 329
P M F + ++F + G CL + ++ G + I+G H IV
Sbjct: 346 PTYPTMTFHFKGADYELPPENVFVALDTGG--SICLAMAASTG-FSIMGNIQQQNHLIVH 402
Query: 330 DRENLKLAWSHSKCEEV 346
D N ++ + + CE +
Sbjct: 403 DLVNQRVGFKEANCETI 419
>gi|302143530|emb|CBI22091.3| unnamed protein product [Vitis vinifera]
Length = 360
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 74/309 (23%), Positives = 126/309 (40%), Gaps = 32/309 (10%)
Query: 62 LCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 121
+C + CK+ CPY Y ++ + ++ + S P+ +V+ GCG
Sbjct: 60 VCLVTNPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRVENVMFGCG 119
Query: 122 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFG 176
G + A ++GLG G +S S L L +SFS C D N S + FG
Sbjct: 120 HWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVDRNSDANVSSKLIFG 174
Query: 177 -DQGPATQQSTSF--LPIGEKYDA---YFVGVESYCIGNSCL----------TQSGFQAL 220
D+ + +F L G++ Y+V ++S +G + T +
Sbjct: 175 EDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPEEKWQIATDGSGGTI 234
Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 280
+DSG + ++ Y + F V + + CYN + E +PD ++FS
Sbjct: 235 IDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLEPCYNVTGVEQPDLPDFGIVFS 294
Query: 281 KNQ--SFVVRNHIFSFPENEGFTVFCLTVMST-DGDYGIIGQNFMMGHRIVFDRENLKLA 337
+F V N+ F E E V CL ++ T IIG I++D + +L
Sbjct: 295 DGAVWNFPVENY---FIEIEPREVVCLAILGTPPSALSIIGNYQQQNFHILYDTKKSRLG 351
Query: 338 WSHSKCEEV 346
++ +KC +V
Sbjct: 352 FAPTKCADV 360
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 80/317 (25%), Positives = 131/317 (41%), Gaps = 40/317 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP SSSS ++ C C++ S C++ K C Y Y + + + G V + L +
Sbjct: 197 FDPRSSSSFASLPCESQQCQALETSGCRASK--CLYQVSYG-DGSFTVGEFVIETLTFGN 253
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S + ++V +GCG G + V GL + SL + + +SFS
Sbjct: 254 -------SGMINNVAVGCGHDNEGLF--------VGSAGLLGLGGGSLSLTSQMKASSFS 298
Query: 163 ICFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT------ 213
C + DS S + F P+ + L G+ Y+VG+ +G L+
Sbjct: 299 YCLVDRDSSSSSDLEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLF 358
Query: 214 ---QSGFQAL-VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY---CYNASS 266
SG+ + VDSG + T L T+ Y + D VS + N + CY+ SS
Sbjct: 359 QMDDSGYGGIIVDSGTAITRLQTQAYNTLR---DAFVSRTPYLKKTNGFALFDTCYDLSS 415
Query: 267 EEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
+ + +P + F+ +S + + P + T FC T IIG G R
Sbjct: 416 QSRVTIPTVSFEFAGGKSLQLPPKNYLIPVDSVGT-FCFAFAPTTSSLSIIGNVQQQGTR 474
Query: 327 IVFDRENLKLAWSHSKC 343
+ +D N + +S KC
Sbjct: 475 VHYDLANSVVGFSPHKC 491
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 77/333 (23%), Positives = 131/333 (39%), Gaps = 54/333 (16%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+ P S+S + + C+ LC C+ + D C Y +Y + Y + +
Sbjct: 144 FAPGESASYEPMRCAGQLCSDILHHGCE-MPDTCTYRYNYGDGTMTMGVYATERF----T 198
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
F+ + + GCG GS +G+ G++G G +S+ S L+ FS
Sbjct: 199 FTSSGGDRLMTVPLGFGCGSMNVGSLNNGS---GIVGFGRNPLSLVSQLSI-----RRFS 250
Query: 163 ICFDENDSG---SVFFGD-----QGPATQ--QSTSFLPIGEKYDAYFVGVESYCIGNSCL 212
C SG ++ FG G AT Q+T L + Y+V + +G L
Sbjct: 251 YCLTSYGSGRKSTLLFGSLSGGVYGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVGARRL 310
Query: 213 T--QSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN------ 256
+S F +VDSG + T LP + AEVV F + + + GN
Sbjct: 311 RIPESAFALRPDGSGGVIVDSGTALTLLPGAVLAEVVRAFRQQLRLP-FANGGNPEDGVC 369
Query: 257 -----SWKYCYNASSEEMLKVPDMRLIFSK-NQSFVVRNHIFSFPENEGFTVFCLTVMST 310
+W+ +SS + VP M F + RN++ ++ CL + +
Sbjct: 370 FLVPAAWR---RSSSTSQVPVPRMVFHFQDADLDLPRRNYVL---DDHRKGRLCLLLADS 423
Query: 311 DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
D IG R+++D E L+++ ++C
Sbjct: 424 GDDGSTIGNLVQQDMRVLYDLEAETLSFAPAQC 456
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 71/320 (22%), Positives = 127/320 (39%), Gaps = 46/320 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
+DPS SS+ ++C+ C+ + C S C Y +Y+ + + S G ++ L
Sbjct: 175 FDPSKSSTYAPIACNTDACRKLGDHYHNGCTSGGTQCGYSVEYA-DGSHSRGVYSNETLT 233
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
L AP +V+ GCGR Q G DG++GLG VS+ ++ + +
Sbjct: 234 L------APGITVE-DFHFGCGRDQRGP---SDKYDGLLGLGGAPVSL--VVQTSSVYGG 281
Query: 160 SFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKY-----DAYFVGVESYCIGNSCL-- 212
+FS C +S + F P + ++F+ ++ Y V + +G L
Sbjct: 282 AFSYCLPALNSEAGFLVLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHI 341
Query: 213 TQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 270
QS F+ ++DSG T LP Y + K + + + + + + CYN + +
Sbjct: 342 PQSAFRGGMIIDSGTVDTELPETAYNALEAALRKALKAYPL-VPSDDFDTCYNFTGYSNI 400
Query: 271 KVPDMRLIFSKNQSF-------VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 323
VP + FS + ++ N +F E+ D GIIG
Sbjct: 401 TVPRVAFTFSGGATIDLDVPNGILVNDCLAFQES-----------GPDDGLGIIGNVNQR 449
Query: 324 GHRIVFDRENLKLAWSHSKC 343
+++D + + C
Sbjct: 450 TLEVLYDAGRGNVGFRAGAC 469
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 77/320 (24%), Positives = 134/320 (41%), Gaps = 40/320 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
++P+ S S N+ C PLC+ S C + K C Y Y + + + G + L
Sbjct: 189 FNPTKSRSFANIPCGSPLCRRLDSPGCSTKKHICLYQVSYG-DGSFTYGEFSTETLTF-- 245
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
+ + V +GCG G ++ A ++GLG G +S PS + + FS
Sbjct: 246 ------RGTRVGRVALGCGHDNEGLFIGAAG---LLGLGRGRLSFPSQIGRR--FSRKFS 294
Query: 163 ICFDENDSGS----VFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNS---CL 212
C + + S + FGD A ++ F P+ K D Y+V + +G + +
Sbjct: 295 YCLVDRSASSKPSYMVFGDS--AISRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGI 352
Query: 213 TQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
T S F+ ++DSG S T L Y + F S+ + + + + + C++
Sbjct: 353 TASLFKLDSTGNGGVIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEFSLFDTCFDL 412
Query: 265 SSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 323
S + +KVP + L F + S N++ +N G FC T I+G
Sbjct: 413 SGKTEVKVPTVVLHFRGADVSLPASNYLIPV-DNSG--SFCFAFAGTMSGLSIVGNIQQQ 469
Query: 324 GHRIVFDRENLKLAWSHSKC 343
G R+V+D ++ ++ C
Sbjct: 470 GFRVVYDLAASRVGFAPRGC 489
>gi|159463556|ref|XP_001690008.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158283996|gb|EDP09746.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 547
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 73/161 (45%), Gaps = 11/161 (6%)
Query: 45 YDPSSSSSSKNVSCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+ P SS+S CS C +SC + C Y Y E +S+SG+L +D+L +
Sbjct: 123 FKPELSSTSSTFGCSDARCFCGANSCSCNNEQCGYSIRY-LEGSSTSGFLAEDMLAVGDG 181
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
A + + GC + ++G L DGV G+G S+ L + G+I ++FS+
Sbjct: 182 GPAA-------NFVFGCAQSESG-LLYSQIADGVFGMGRTPASLYGQLVQQGVIDDAFSM 233
Query: 164 CFDENDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVE 203
CF G + G+ PA + P+ + + + +E
Sbjct: 234 CFGAPREGVLLLGNVALPADAPAPVVTPVVGNTNKFNIQIE 274
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 82/337 (24%), Positives = 133/337 (39%), Gaps = 50/337 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLK-DPCPYIADYSTED-----TSSSGYLVDD 96
+DP S+S ++ P C++ RS K C Y Y D ++S G LV++
Sbjct: 183 FDPRHSTSYGEMNYDAPDCQALGRSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEE 242
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
L A + Q+ + IGCG G L GA G++GL G +S+P +A G
Sbjct: 243 TLTFAGGVR-------QAYLSIGCGHDNKG--LFGAPAAGILGLSRGQISIPHQIAFLGY 293
Query: 157 IQNSFSICFDENDSG------SVFFGDQGPATQQSTSFLP------IGEKYDAYFVGVES 204
SFS C + SG ++ FG T SF P + Y +GV
Sbjct: 294 -NASFSYCLVDFISGPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSV 352
Query: 205 YCIGNSCLTQSGFQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSS-KRISLQ 254
+ +T+ Q ++DSG + T L Y F + ++S
Sbjct: 353 GGVRVPGVTERDLQLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTG 412
Query: 255 GNS--WKYCYNASSEEML----KVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLT 306
G S + CY L KVP + + F+ S +N++ + ++ G F
Sbjct: 413 GPSGLFDTCYTVGGRAGLRHCVKVPAVSMHFAGGVELSLQPKNYLITV-DSRGTVCFAF- 470
Query: 307 VMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
+ D +IG G R+V+D ++ ++ + C
Sbjct: 471 AGTGDRSVSVIGNILQQGFRVVYDIGGQRVGFAPNSC 507
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 77/318 (24%), Positives = 129/318 (40%), Gaps = 41/318 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
+DP +SS+ +V CS C + S+C S + C Y A Y + + S GYL D
Sbjct: 177 FDPRASSTYTSVRCSASQCDELQAATLNPSAC-SASNVCIYQASYG-DSSFSVGYLSTDT 234
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
+ S S S GCG+ G + A G++GL +S+ LA + +
Sbjct: 235 VSFGSTSY--------PSFYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--L 281
Query: 158 QNSFSICFDENDSGSVFFGDQGP-ATQQSTSFLPIGEK-YDA--YFVGVESYCIGNSCLT 213
SFS C + S + GP T S+ P+ DA YF+ + +G S L
Sbjct: 282 GYSFSYCLPT--AASTGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLA 339
Query: 214 -----QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
S ++DSG T LPT ++ + + ++ + + + C+ + +
Sbjct: 340 VSPSEYSSLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQ 399
Query: 269 MLKVPDMRLIFSKNQS--FVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
L+VP + + F+ S RN + ++ CL TD IIG
Sbjct: 400 -LRVPTVVMAFAGGASMKLTTRNVLIDVDDS----TTCLAFAPTDST-AIIGNTQQQTFS 453
Query: 327 IVFDRENLKLAWSHSKCE 344
+++D ++ +S C
Sbjct: 454 VIYDVAQSRIGFSAGGCS 471
>gi|37542277|gb|AAK81699.1| aspartyl proteinase [Oryza sativa]
Length = 411
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 62/300 (20%), Positives = 123/300 (41%), Gaps = 37/300 (12%)
Query: 73 KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA 132
K+ C Y Y SS G L+ D SFS A + +S+ GCG Q + +
Sbjct: 112 KNQCHYGIQYV--GGSSIGVLIVD-----SFSLPASNGTNPTSIAFGCGYNQGKNNHNVP 164
Query: 133 AP-DGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQSTSFLP 190
P +G++GLG G V++ S L G+I ++ C G +FFGD T T + P
Sbjct: 165 TPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGKGFLFFGDAKVPTSGVT-WSP 223
Query: 191 IGEKYDAYFVGVESYCIGN---SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 247
+ ++ Y + + S ++ + + + DSGA++T+ + Y + +S
Sbjct: 224 MNREHKHYSPRQGTLHFNSNKQSPISAAPMEVIFDSGATYTYFALQPYHATLSVVKSTLS 283
Query: 248 SK-----RISLQGNSWKYCYNASSEEMLKVPDMRLIFS----------KNQSFVVRNHIF 292
+ + + + C+ +++ + +++ F K + + +
Sbjct: 284 KECKFLTEVKEKDRALTVCWKG-KDKIRTIDEVKKCFRSLSLKFADGDKKATLEIPPEHY 342
Query: 293 SFPENEGFTVFCLTVMSTDGDY------GIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
EG CL ++ ++ +IG M+ +++D E L W + +C+ +
Sbjct: 343 LIISQEGHV--CLGILDGSKEHPSLAGTNLIGGITMLDQMVIYDSERSLLGWVNYQCDRI 400
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 77/321 (23%), Positives = 132/321 (41%), Gaps = 42/321 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
++P S S + CS PLC+ S C + + C Y Y + + ++G + L
Sbjct: 152 FNPYKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYG-DGSFTTGDFATETLTF-- 208
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL-IQNSF 161
+ + + V +GCG G ++ A ++GLG G +S PS + G+ + F
Sbjct: 209 ------RGNKIAKVALGCGHHNEGLFVGAAG---LLGLGRGRLSFPS---QTGIRFNHKF 256
Query: 162 SICFDENDSGS----VFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGN---SC 211
S C + + S + FGD A + F P+ K D Y+VG+ +G
Sbjct: 257 SYCLVDRSASSKPSSMVFGDA--AISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRG 314
Query: 212 LTQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
++ S F+ ++DSG S T L Y + F + + + + CY+
Sbjct: 315 VSPSLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRVGARHLKRGPEFSLFDTCYD 374
Query: 264 ASSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
S + +KVP + L F + + N++ EN F C T IIG
Sbjct: 375 LSGQSSVKVPTVVLHFRGADMALPATNYLIPVDENGSF---CFAFAGTISGLSIIGNIQQ 431
Query: 323 MGHRIVFDRENLKLAWSHSKC 343
G R+V+D ++ ++ C
Sbjct: 432 QGFRVVYDLAGSRIGFAPRGC 452
>gi|224101053|ref|XP_002334311.1| predicted protein [Populus trichocarpa]
gi|222871031|gb|EEF08162.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 70/292 (23%), Positives = 111/292 (38%), Gaps = 59/292 (20%)
Query: 108 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSIC-- 164
P + + ++ GC A P GV G G G +S+P+ LA + + N FS C
Sbjct: 208 PTNLIVNNFTFGCAHTAL------AEPIGVAGFGRGVLSLPAQLATLSPQLGNQFSYCLV 261
Query: 165 -------------------FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESY 205
+D ++ G P TS L E Y VG+E
Sbjct: 262 SHSFDSDRLRRPSPLILGRYDHDEKERRVNGVNKPRFVY-TSMLDNLEHPYFYCVGLEGI 320
Query: 206 CIGNSCLTQSGFQA----------LVDSGASFTFLPTEIYAEVVVKFDKLV----SSKRI 251
IG + GF +VDSG +FT LP +Y VV +F+ V R+
Sbjct: 321 SIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGRVNERARV 380
Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV---RNHIFSF-----PENEGFTVF 303
+ CY + + + L F N S VV RN+ + F + + V
Sbjct: 381 IEEDTGLSPCYYFDNNVVNVP-SVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKGKKRKVG 439
Query: 304 CLTVMS-------TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVID 348
CL +M+ + G +G G +V+D EN ++ ++ +C + +
Sbjct: 440 CLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENKRVGFARRQCASLWE 491
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 86/345 (24%), Positives = 130/345 (37%), Gaps = 39/345 (11%)
Query: 25 LLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCP 77
L W + A +R + D S S K V C CK S ++C + PC
Sbjct: 107 LTWVNCRYRARGKDNRRVFRAD--ESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCS 164
Query: 78 YIADYSTEDTSSS-GYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDG 136
Y DY D S++ G + + + + + +IGC TG GA DG
Sbjct: 165 Y--DYRYADGSAAQGVFAKETITVGLTNGRMARLPGH---LIGCSSSFTGQSFQGA--DG 217
Query: 137 VMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQGPATQ--QSTSFL 189
V+GL D S S L FS C ++N S + FG + T+ L
Sbjct: 218 VLGLAFSDFSFTS--TATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPL 275
Query: 190 PIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEIYAEVVVK 241
+ Y + V +G L SG ++DSG S T L Y +VV
Sbjct: 276 DLTRIPPFYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTG 335
Query: 242 FDK-LVSSKRISLQGNSWKYCYNASSE-EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEG 299
+ LV KR+ +G +YC++ +S + K+P + F H S+ +
Sbjct: 336 LARYLVELKRVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARF--EPHRKSYLVDAA 393
Query: 300 FTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
V CL +S +IG + FD L+++ S C
Sbjct: 394 PGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMASTLSFAPSAC 438
>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 469
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 79/352 (22%), Positives = 141/352 (40%), Gaps = 71/352 (20%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK----SRSSCKSLKD-------PCP-YIADYSTEDTSS 89
+ + P +SSSS+ + C +P C+ + C+ PCP YI Y S+
Sbjct: 137 IPRFIPKNSSSSRVIGCQNPKCQFLFGANVQCRGCDPNTRNCTVPCPPYILQYGL--GST 194
Query: 90 SGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS 149
+G L+ + L P +V ++GC S + P G+ G G G S+PS
Sbjct: 195 AGILISEKLDF-------PDLTVPD-FVVGC------SVISTRTPAGIAGFGRGPESLPS 240
Query: 150 LLAKAGLIQNSFSICFDEN--------DSGSVFFGDQGPATQQSTSFLPIGEK------- 194
+ S FD+ D+GS G + + S+ P +
Sbjct: 241 QMKLKSFSHCLVSRRFDDTNVTTDLGLDTGS---GHKSGSKTPGLSYTPFRKNPNVSNTA 297
Query: 195 -YDAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFD 243
+ Y++ + +G+ + T ++VDSG++FTF+ ++ V +F
Sbjct: 298 FLEYYYLNLRRIYVGSKHVKIPYKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFA 357
Query: 244 KLVS--SKRISLQGNSW-KYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEG 299
+S ++ L+ S C+N S + + VP++ F + ++ FSF N
Sbjct: 358 TQMSNYTREKDLEKVSGIAPCFNISGKGDVTVPELIFEFKGGAKMELPLSNYFSFVGNA- 416
Query: 300 FTVFCLTVMSTD--------GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
CLTV+S + G I+G + + +D EN + ++ KC
Sbjct: 417 -DTVCLTVVSDNTVNPGGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 74/319 (23%), Positives = 126/319 (39%), Gaps = 35/319 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
+DPS SSS N++C+ LC +S C S C Y Y + T S G+L + L
Sbjct: 179 FDPSKSSSYINITCTSSLCTQLTSAGIKSRCSSSTTACIYGIQYGDKST-SVGFLSQERL 237
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
+ + + + + GCG+ G + A G++GLG +S + + +
Sbjct: 238 TITA-------TDIVDDFLFGCGQDNEGLFSGSA---GLIGLGRHPISF--VQQTSSIYN 285
Query: 159 NSFSICFDENDS--GSVFFGDQGPATQQSTSFLPIGE-KYDAYFVGVE--SYCIGNSCL- 212
FS C S G + FG AT + + P+ D F G++ +G + L
Sbjct: 286 KIFSYCLPSTSSSLGHLTFG-ASAATNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLP 344
Query: 213 --TQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
+ S F A ++DSG T L YA + F + + ++ + + CY+ S
Sbjct: 345 AVSSSTFSAGGSIIDSGTVITRLAPTAYAALRSAFRQGMEKYPVANEDGLFDTCYDFSGY 404
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS--TDGDYGIIGQNFMMGH 325
+ + VP + F+ V + CL + D D I G
Sbjct: 405 KEISVPKIDFEFAGG--VTVELPLVGILIGRSAQQVCLAFAANGNDNDITIFGNVQQKTL 462
Query: 326 RIVFDRENLKLAWSHSKCE 344
+V+D E ++ + + C
Sbjct: 463 EVVYDVEGGRIGFGAAGCN 481
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 74/315 (23%), Positives = 121/315 (38%), Gaps = 30/315 (9%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDP-CPYIADYSTEDTSSSGYLVDDILHLA 101
+DPS SSS N+ C+ LC S C S D C Y Y +++ S G+L + L +
Sbjct: 183 FDPSKSSSYTNIKCTSSLCTQFRSAGCSSSTDASCIYDVKYG-DNSISRGFLSQERLTIT 241
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
+ + + + GCG+ G + A G+MGL +S + + + F
Sbjct: 242 A-------TDIVHDFLFGCGQDNEGLFRGTA---GLMGLSRHPISF--VQQTSSIYNKIF 289
Query: 162 SICFDENDS--GSVFFGDQGP--ATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL----- 212
S C S G + FG A + T F I + Y + + +G + L
Sbjct: 290 SYCLPSTPSSLGHLTFGASAATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSS 349
Query: 213 -TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
T S +++DSG T LP YA + F + + ++ CY+ S + +
Sbjct: 350 STFSAGGSIIDSGTVITRLPPTAYAALRSAFRQFMMKYPVAYGTRLLDTCYDFSGYKEIS 409
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS--TDGDYGIIGQNFMMGHRIVF 329
VP R+ F V + E CL + D I G +V+
Sbjct: 410 VP--RIDFEFAGGVKVELPLVGILYGESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVY 467
Query: 330 DRENLKLAWSHSKCE 344
D E ++ + + C
Sbjct: 468 DVEGGRIGFGAAGCN 482
>gi|224138580|ref|XP_002326638.1| predicted protein [Populus trichocarpa]
gi|222833960|gb|EEE72437.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 70/292 (23%), Positives = 111/292 (38%), Gaps = 59/292 (20%)
Query: 108 PQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSIC-- 164
P + + ++ GC A P GV G G G +S+P+ LA + + N FS C
Sbjct: 208 PTNLIVNNFTFGCAHTAL------AEPIGVAGFGRGVLSLPAQLATLSPQLGNQFSYCLV 261
Query: 165 -------------------FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESY 205
+D ++ G P TS L E Y VG+E
Sbjct: 262 SHSFDSDRLRRPSPLILGRYDHDEKERRVNGVNKPRFVY-TSMLDNLEHPYFYCVGLEGI 320
Query: 206 CIGNSCLTQSGFQA----------LVDSGASFTFLPTEIYAEVVVKFDKLV----SSKRI 251
IG + GF +VDSG +FT LP +Y VV +F+ V R+
Sbjct: 321 SIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGRVNERARV 380
Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV---RNHIFSF-----PENEGFTVF 303
+ CY + + + L F N S VV RN+ + F + + V
Sbjct: 381 IEEDTGLSPCYYFDNNVVNVP-SVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKGKKRKVG 439
Query: 304 CLTVMS-------TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVID 348
CL +M+ + G +G G +V+D EN ++ ++ +C + +
Sbjct: 440 CLMLMNGGEEAELSGGPGATLGNYQQQGFEVVYDLENKRVGFARRQCASLWE 491
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 122/313 (38%), Gaps = 35/313 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+DP+ S++ SC C + C LK C YI Y + ++++G D L L
Sbjct: 173 FDPAMSATYSAFSCGSAQCAQLGDEGNGC--LKSQCQYIVKYG-DGSNTAGTYGSDTLSL 229
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQN 159
S S S GC + G + DG+MGLG GD SL+++ A
Sbjct: 230 TS-------SDAVKSFQFGCSHRAAGFVGE---LDGLMGLG-GDTE--SLVSQTAATYGK 276
Query: 160 SFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGV--ESYCIGNSCLT- 213
+FS C + G + G G A+ S P+ F GV + + + L
Sbjct: 277 AFSYCLPPPSSSGGGFLTLGAAGGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGTMLNV 336
Query: 214 -QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEML 270
S F ++VDSG T LP Y + F K + + + S C++ S +
Sbjct: 337 PASVFSGASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGSLDTCFDFSGFNTI 396
Query: 271 KVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
VP + L FS+ + + + F T + DGD GI+G ++FD
Sbjct: 397 TVPTVTLTFSRGAAMDLDISGILYAGCLAF-----TATAHDGDTGILGNVQQRTFEMLFD 451
Query: 331 RENLKLAWSHSKC 343
+ + C
Sbjct: 452 VGGRTIGFRSGAC 464
>gi|213998806|gb|ACJ60770.1| nucellin [Hordeum flexuosum]
Length = 136
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 65/135 (48%), Gaps = 10/135 (7%)
Query: 113 QSSVIIGCGRKQTGSYLDGAAP----DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDE 167
+ + GCG KQ +P DG++GLG+G + L +I N C
Sbjct: 6 KKKIAFGCGYKQEEP---ADSPPSLVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSS 62
Query: 168 NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGAS 226
G ++ GD P ++ +++P+ E Y G+ I N + + F+A+ DSG++
Sbjct: 63 KGKGVLYVGDFNPPSR-GVTWVPMKESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSGST 121
Query: 227 FTFLPTEIYAEVVVK 241
+T +P +IY E+V K
Sbjct: 122 YTHVPAQIYNEIVSK 136
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 82/319 (25%), Positives = 121/319 (37%), Gaps = 50/319 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
YDPS SS+ V C+ +CK S C S K C + Y+ + TS+ G D L
Sbjct: 157 YDPSHSSTYSAVPCASDVCKKLAADAYGSGCTSGKQ-CGFAISYA-DGTSTVGAYSQDKL 214
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
L AP + VQ + GCG G + DGV+GLG SL A+ G +
Sbjct: 215 TL------APGAIVQ-NFYFGCGH---GKHAVRGLFDGVLGLGR---LRESLGARYGGV- 260
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGE---KYDAYFVGVESYCIGNSC--LT 213
FS C S F F P+G + V + +G L
Sbjct: 261 --FSYCLPSVSSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLR 318
Query: 214 QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
S F +VDSG T L + Y + F K + + R+ G+ CYN + + +
Sbjct: 319 PSAFSGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGD-LDTCYNLTGYKNVV 377
Query: 272 VPDMRLIFSKNQSF-------VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
VP + L F+ + ++ N +F E+ DG G++G
Sbjct: 378 VPKIALTFTGGATINLDVPNGILVNGCLAFAES-----------GPDGSAGVLGNVNQRA 426
Query: 325 HRIVFDRENLKLAWSHSKC 343
++FD K + C
Sbjct: 427 FEVLFDTSTSKFGFRAKAC 445
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 78/334 (23%), Positives = 128/334 (38%), Gaps = 48/334 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCK--------SRSSCKSL---KDPCPYIADYSTEDTSSSGYL 93
+DP+ S++ V C+ C + SC S + C Y Y + + S G L
Sbjct: 190 FDPAGSATYAAVRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYG-DGSFSRGVL 248
Query: 94 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
D + L S + GCG G + G+MGLG ++S+ S A
Sbjct: 249 ATDTVALGGAS--------LGGFVFGCGLSNRGLF---GGTAGLMGLGRTELSLVSQTAS 297
Query: 154 AGLIQNSFSICFDENDSG------SVFFGDQGPATQQSTSFLPIG--------EKYDAYF 199
FS C SG S+ GD ++ ++T+ P+ + YF
Sbjct: 298 --RYGGVFSYCLPAATSGDASGSLSLGGGDDAASSYRNTT--PVAYTRMIADPAQPPFYF 353
Query: 200 VGVESYCIGNSCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQG 255
+ V +G + L G A L+DSG T L +Y V +F + ++ + G
Sbjct: 354 LNVTGAAVGGTALAAQGLGASNVLIDSGTVITRLAPSVYRAVRAEFMRQFGAAGYPAAPG 413
Query: 256 NS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTV--MSTDG 312
S CY+ + + +KVP + L V F + + CL + +S +
Sbjct: 414 FSILDTCYDLTGHDEVKVPLLTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYED 473
Query: 313 DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+ IIG R+V+D +L ++ C V
Sbjct: 474 ETPIIGNYQQKNKRVVYDTLGSRLGFADEDCNYV 507
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 82/319 (25%), Positives = 121/319 (37%), Gaps = 50/319 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
YDPS SS+ V C+ +CK S C S K C + Y+ + TS+ G D L
Sbjct: 123 YDPSHSSTYSAVPCASDVCKKLAADAYGSGCTSGKQ-CGFAISYA-DGTSTVGAYSQDKL 180
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
L AP + VQ + GCG G + DGV+GLG SL A+ G +
Sbjct: 181 TL------APGAIVQ-NFYFGCGH---GKHAVRGLFDGVLGLGR---LRESLGARYGGV- 226
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGE---KYDAYFVGVESYCIGNSC--LT 213
FS C S F F P+G + V + +G L
Sbjct: 227 --FSYCLPSVSSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLR 284
Query: 214 QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
S F +VDSG T L + Y + F K + + R+ G+ CYN + + +
Sbjct: 285 PSAFSGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGD-LDTCYNLTGYKNVV 343
Query: 272 VPDMRLIFSKNQSF-------VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
VP + L F+ + ++ N +F E+ DG G++G
Sbjct: 344 VPKIALTFTGGATINLDVPNGILVNGCLAFAES-----------GPDGSAGVLGNVNQRA 392
Query: 325 HRIVFDRENLKLAWSHSKC 343
++FD K + C
Sbjct: 393 FEVLFDTSTSKFGFRAKAC 411
>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
Length = 492
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 61/260 (23%), Positives = 110/260 (42%), Gaps = 46/260 (17%)
Query: 132 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND--------SGSVFFG---DQGP 180
A P GV G G G +S+P+ LA + + FS C + S + G D
Sbjct: 231 AEPVGVAGFGRGPLSLPAQLAPS--LSGRFSYCLVAHSFRADRLIRSSPLILGRSTDAAA 288
Query: 181 ATQQSTSFL--PI--GEKYDAYF-VGVESYCIGNSCL----------TQSGFQALVDSGA 225
T F+ P+ K+ ++ V +E+ +G + +VDSG
Sbjct: 289 IGASETDFVYTPLLHNPKHPYFYSVALEAVSVGGKRIQAQPELGDVDRDGNGGMVVDSGT 348
Query: 226 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-----CYNASSEEMLKVPDMRLIFS 280
+FT LP++ +A V +F + +++ R + + CY+ S + VP + L F
Sbjct: 349 TFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTGLAPCYHYSPSDR-AVPPVALHFR 407
Query: 281 KNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGD----------YGIIGQNFMMGHRIV 328
N + + RN+ F EG +V CL +M+ G+ G +G G +V
Sbjct: 408 GNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVV 467
Query: 329 FDRENLKLAWSHSKCEEVID 348
+D + ++ ++ +C ++ D
Sbjct: 468 YDVDAGRVGFARRRCTDLWD 487
>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
Length = 519
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 62/270 (22%), Positives = 113/270 (41%), Gaps = 46/270 (17%)
Query: 132 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND--------SGSVFFG---DQGP 180
A P GV G G G +S+P+ LA + + FS C + S + G D
Sbjct: 231 AEPVGVAGFGRGPLSLPAQLAPS--LSGRFSYCLVAHSFRADRLIRSSPLILGRSTDAAA 288
Query: 181 ATQQSTSFL--PI--GEKYDAYF-VGVESYCIGNSCL----------TQSGFQALVDSGA 225
T F+ P+ K+ ++ V +E+ +G + +VDSG
Sbjct: 289 IGASETDFVYTPLLHNPKHPYFYSVALEAVSVGGKRIQAQPELGDVDRDGNGGMVVDSGT 348
Query: 226 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY-----CYNASSEEMLKVPDMRLIFS 280
+FT LP++ +A V +F + +++ R + + CY+ S + VP + L F
Sbjct: 349 TFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTGLAPCYHYSPSDR-AVPPVALHFR 407
Query: 281 KNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGD----------YGIIGQNFMMGHRIV 328
N + + RN+ F EG +V CL +M+ G+ G +G G +V
Sbjct: 408 GNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVV 467
Query: 329 FDRENLKLAWSHSKCEEVIDKSHVHLVPPP 358
+D + ++ ++ +C ++ D ++ P
Sbjct: 468 YDVDAGRVGFARRRCTDLWDTLSRRIIDQP 497
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 52/179 (29%), Positives = 81/179 (45%), Gaps = 22/179 (12%)
Query: 119 GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-DENDSGSVFFGD 177
GCGR G + G+ DG++GLG G +S S A + FS C +EN GS+ FG+
Sbjct: 224 GCGRNNEGDF--GSGADGMLGLGQGQLSTVSQTASK--FKKVFSYCLPEENSIGSLLFGE 279
Query: 178 QGPATQQSTSFLPIG--------EKYDAYFVGVESYCIGNSCLT--QSGFQA---LVDSG 224
+ + S F + E+ YFV + +GN L S F + ++DSG
Sbjct: 280 KATSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVFASPGTIIDSG 339
Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRIS----LQGNSWKYCYNASSEEMLKVPDMRLIF 279
T LP Y+ + F K ++ +S + + CYN S + + +P+ L F
Sbjct: 340 TVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKENDMLDTCYNLSGRKDVLLPEXVLHF 398
>gi|356509399|ref|XP_003523437.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 76/317 (23%), Positives = 122/317 (38%), Gaps = 40/317 (12%)
Query: 56 VSCSHPLCKSRSS-----CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 110
V C PLC + S C + C Y +Y+ + SS G L+ D + L K S
Sbjct: 114 VKCVDPLCAAIQSAPNHHCAGPNEQCDYEVEYA-DQGSSLGVLLRDNIPL----KFTNGS 168
Query: 111 SVQSSVIIGCGRKQTGSYLDGAAPD----GVMGLGLGDVSVPSLLAKAGLIQNSFSICFD 166
+ + GCG QT G P GV+GLG G S+ S L GLI+N C
Sbjct: 169 LARPMLAFGCGYDQTHH---GQNPPPSTAGVLGLGNGRTSILSQLHSLGLIRNVVGHCLS 225
Query: 167 ENDSGSVFFGDQ-GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGA 225
G +FFGDQ P + + L Y G + G + + DSG+
Sbjct: 226 GRGGGFLFFGDQLIPPSGVVWTPLLQSSSAQHYKTGPADLFFDRKTTSVKGLELIFDSGS 285
Query: 226 SFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS------WK--YCYNASSEEMLKVPDMRL 277
S+T+ ++ + +V + K +S WK + + + + L
Sbjct: 286 SYTYFNSQAHKALVNLIANDLRGKPLSRATGDPSLPICWKGPKPFKSLHDVTSNFKPLLL 345
Query: 278 IFSKNQSFVVRNHIFSFPENEGFTV-----FCLTVMSTD----GDYGIIGQNFMMGHRIV 328
F+K+ +N P V CL ++ G+ IIG + ++
Sbjct: 346 SFTKS-----KNSPLQLPPEAYLIVTKHGNVCLGILDGTEIGLGNTNIIGDISLQDKLVI 400
Query: 329 FDRENLKLAWSHSKCEE 345
+D E ++ W+ + C+
Sbjct: 401 YDNEKQQIGWASANCDR 417
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 88/352 (25%), Positives = 130/352 (36%), Gaps = 53/352 (15%)
Query: 25 LLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCP 77
L W + A +R + D S S K V C CK S ++C + PC
Sbjct: 129 LTWVNCRYRARGKDNRRVFRAD--ESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCS 186
Query: 78 YIADYSTEDTSSS-GYLVDDIL-------HLASFSKHAPQSSVQSSVIIGCGRKQTGSYL 129
Y DY D S++ G + + +A H +IGC TG
Sbjct: 187 Y--DYRYADGSAAQGVFAKETITVGLTNGRMARLPGH----------LIGCSSSFTGQSF 234
Query: 130 DGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQGPATQ- 183
GA DGV+GL D S S L FS C ++N S + FG
Sbjct: 235 QGA--DGVLGLAFSDFSFTS--TATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTA 290
Query: 184 -QSTSFLPIGEKYDAYFVGVESYCIGNSCL--------TQSGFQALVDSGASFTFLPTEI 234
+ T+ L + Y + V +G L SG ++DSG S T L
Sbjct: 291 FRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAA 350
Query: 235 YAEVVVKFDK-LVSSKRISLQGNSWKYCYNASSE-EMLKVPDMRLIFSKNQSFVVRNHIF 292
Y +VV + LV KR+ +G +YC++ +S + K+P + F H
Sbjct: 351 YKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARF--EPHRK 408
Query: 293 SFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
S+ + V CL +S +IG + FD L+++ S C
Sbjct: 409 SYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMASTLSFAPSAC 460
>gi|388520263|gb|AFK48193.1| unknown [Lotus japonicus]
Length = 157
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/126 (23%), Positives = 58/126 (46%), Gaps = 1/126 (0%)
Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLI 278
++DSG T LP +Y + F +++S K G S C+ + +EM +VP++++I
Sbjct: 32 IIDSGTVITRLPMPVYTALKNSFVRIMSKKYAQAPGISILDTCFKGNVKEMSEVPEIQMI 91
Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
F ++ H ++G T + S + IIG ++ +D N K+ +
Sbjct: 92 FGGGADLPLKAHNTLIELDKGVTCLAIAGSSENNPIAIIGNYQQQTFKVAYDVANSKIGF 151
Query: 339 SHSKCE 344
+ C+
Sbjct: 152 AAGGCQ 157
>gi|213998800|gb|ACJ60767.1| nucellin [Hordeum marinum subsp. marinum]
Length = 142
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 64/131 (48%), Gaps = 4/131 (3%)
Query: 120 CGRKQTGSYLDGAAP-DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDSGSVFFGD 177
CG KQ +P DG++GLG+G + L +I N C G ++ G+
Sbjct: 1 CGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVLYVGN 60
Query: 178 QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTFLPTEIYA 236
P ++ T ++P+ E Y G+ I N + F+A+ DSG+++T +P++IY
Sbjct: 61 FNPPSRGVT-WVPMRESSFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTLVPSQIYN 119
Query: 237 EVVVKFDKLVS 247
E+V K +S
Sbjct: 120 EIVSKVRGTLS 130
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 78/316 (24%), Positives = 133/316 (42%), Gaps = 38/316 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
++PS SSS ++C CKS + D C Y Y Y V D A+ +
Sbjct: 197 FEPSFSSSYAPLTCETHQCKSLDVSECRNDSCLYEVSY-----GDGSYTVGD---FATET 248
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
S+ ++V IGCG G ++ A ++GLG G +S PS + + SFS C
Sbjct: 249 ITLDGSASLNNVAIGCGHDNEGLFVGAAG---LLGLGGGSLSFPSQINAS-----SFSYC 300
Query: 165 F---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQA 219
D + + ++ F P+ + L + Y++G+ +G L+ +S F+
Sbjct: 301 LVNRDTDSASTLEFNSPIPSHSVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEV 360
Query: 220 --------LVDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNASSE 267
+VDSG + T L +++Y + F + L S+ ++L + CY+ SS
Sbjct: 361 DESGNGGIIVDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVAL----FDTCYDLSSR 416
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
++VP + F + + + P + T FC T IIG G R+
Sbjct: 417 SSVEVPTVSFHFPDGKYLALPAKNYLIPVDSAGT-FCFAFAPTTSALSIIGNVQQQGTRV 475
Query: 328 VFDRENLKLAWSHSKC 343
+D N + +S + C
Sbjct: 476 SYDLSNSLVGFSPNGC 491
>gi|154311375|ref|XP_001555017.1| hypothetical protein BC1G_06540 [Botryotinia fuckeliana B05.10]
gi|114149215|gb|AAR87747.3| aspartic proteinase precursor [Botryotinia fuckeliana]
gi|347829155|emb|CCD44852.1| similar to aspartic-type endopeptidase opsB [Botryotinia
fuckeliana]
Length = 482
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 76/341 (22%), Positives = 143/341 (41%), Gaps = 50/341 (14%)
Query: 65 SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG--- 121
S + C +PC Y+ +S+ Y+ D A V + IG
Sbjct: 103 SSTLCSRKTNPCQTAGTYTANSSSTYAYVASDFNISYVDGSGASGDYVTDTFTIGSATLD 162
Query: 122 RKQTGSYLDGAAPDGVMGLG--LGDVSV-----------PSLLAKAGLIQ-NSFSICFDE 167
+ Q G ++P+G++G+G + +V V P+ + GLI N+FS+ ++
Sbjct: 163 KLQFGIGYTSSSPEGILGIGYEINEVQVGRAGKKAYNNLPAQMVADGLINSNAFSLWLND 222
Query: 168 ND--SGSVFFGDQGPATQQ---STSFLPIGEK---YDAYFVGVESYCIGNSCLTQ-SGFQ 218
D +GS+ FG G T Q LPI ++ Y + + + +G++ + Q
Sbjct: 223 LDASTGSILFG--GVDTAQFHGQLETLPIEKESGYYAEFLITLTEVMLGDTVIAQDQALA 280
Query: 219 ALVDSGASFTFLP----TEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV-- 272
L+DSG+S T+LP IY +V ++D + +G ++ C A++ L
Sbjct: 281 VLLDSGSSLTYLPDAMAEAIYEQVEAQYD--------ASEGAAYVPCSLATNTSALNFTF 332
Query: 273 --PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD-YGIIGQNFMMGHRIVF 329
P +++ ++ V +G T CL ++ GD ++G F+ IV+
Sbjct: 333 TSPTIQVTMNELVIPVTSTTGQQLQFTDG-TAACLFGIAPAGDSTSVLGDTFIRSAYIVY 391
Query: 330 DRENLKLAWSHSK----CEEVIDKSHVHLVPPPAGQSPNPL 366
D +N +++ + + V++ + P A NP+
Sbjct: 392 DLDNNEISLAQTNFNATSTSVVEITTGTTAVPSATLVANPV 432
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 77/326 (23%), Positives = 125/326 (38%), Gaps = 50/326 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
+DP+ SSS V C P C++ ++ + C Y Y Y V D
Sbjct: 238 FDPALSSSYATVPCDSPHCRALDASACHNNAANGNSSCVYEVAYG-----DGSYTVGDFA 292
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
+ + S+ V IGCG G ++ A + G L S PS ++
Sbjct: 293 -TETLTLGGDGSAAVHDVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA----- 343
Query: 159 NSFSICFDENDSGS---VFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCL 212
FS C + DS S + FG A+ ST P+ + Y+V + +G L
Sbjct: 344 TEFSYCLVDRDSPSASTLQFG----ASDSSTVTAPLMRSPRSNTFYYVALNGISVGGETL 399
Query: 213 T-----------QSGFQALVDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNS 257
+ Q +VDSG + T L + Y+ + F + L + +SL
Sbjct: 400 SDIPPAAFAMDEQGSGGVIVDSGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSL---- 455
Query: 258 WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
+ CY+ + ++VP + L F + + P +G +CL +T G I+
Sbjct: 456 FDTCYDLAGRSSVQVPAVSLRFEGGGELKLPAKNYLIPV-DGAGTYCLAFAATGGAVSIV 514
Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKC 343
G G R+ FD + +S +KC
Sbjct: 515 GNVQQQGIRVSFDTAKNTVGFSPNKC 540
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 71/301 (23%), Positives = 121/301 (40%), Gaps = 58/301 (19%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
D+ +DPS SS+ K C+ P CPY Y + + + G L + +
Sbjct: 101 DQKAPIFDPSKSSTFKETRCNTP-----------DHSCPYKLVYD-DKSYTQGTLATETV 148
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD--GVMGLGLGDVSVPSLLAKAGL 156
+ S S V IIGC R +GS G P G++GL G +S+ S +
Sbjct: 149 TIHSTSG---VPFVMPETIIGCSRNNSGS---GFRPSSSGIVGLSRGSLSLISQM----- 197
Query: 157 IQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
G + GD ST+ K Y++ +++ +G++ + G
Sbjct: 198 --------------GGAYPGDG----VVSTTMFAKTAKRGQYYLNLDAVSVGDTRIETVG 239
Query: 217 --FQAL-----VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
F AL +DSG T+ P V +++V++ R+ + CY +++ E+
Sbjct: 240 TPFHALNGNIVIDSGTPLTYFPVSYCNLVRKAVERVVTADRVVDPSRNDMLCYYSNTIEI 299
Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD-GDYGIIG----QNFMMG 324
P + + FS V+ + N G VFCL ++ + I G NF++G
Sbjct: 300 F--PVITVHFSGGADLVLDKYNMYMELNRG-GVFCLAIICNNPTQVAIFGNRAQNNFLVG 356
Query: 325 H 325
+
Sbjct: 357 Y 357
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 82/324 (25%), Positives = 135/324 (41%), Gaps = 48/324 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILHLASF 103
++P S + + C C S + C Y YS D+S + G L + + +F
Sbjct: 124 FEPLRSKTYSPIPCESEQCSFFGYSCSPQKMCAY--SYSYADSSVTKGVLAREAI---TF 178
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNS-- 160
S V +I GCG +G++ + M P SL+++ G + S
Sbjct: 179 SSTDGDPVVVGDIIFGCGHSNSGTFNENDMGIIGM------GGGPLSLVSQIGTLYGSKR 232
Query: 161 FSICF-----DENDSGSVFFGDQGPATQQSTSFLPIG--EKYDAYFVGVESYCIG----- 208
FS C D + SG++ FG++ + + P+ E +Y V +E +G
Sbjct: 233 FSQCLVPFHTDAHTSGTINFGEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVR 292
Query: 209 -NSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN---SWKYCYNA 264
NS T S ++DSG T++P E Y +V + V S + ++ + + CY
Sbjct: 293 FNSSETLSKGNIMIDSGTPATYIPQEFYERLVEELK--VQSSLLPIEDDPDLGTQLCYR- 349
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGDYGIIGQ---- 319
SE L+ P + F ++ F P +G VFC + STDGDY I G
Sbjct: 350 -SETNLEGPILTAHFEGADVQLLPIQTF-IPPKDG--VFCFAMAGSTDGDY-IFGNFAQS 404
Query: 320 NFMMGHRIVFDRENLKLAWSHSKC 343
N +MG FD + +++ + C
Sbjct: 405 NILMG----FDLDRKTISFKPTDC 424
>gi|196003874|ref|XP_002111804.1| hypothetical protein TRIADDRAFT_55203 [Trichoplax adhaerens]
gi|190585703|gb|EDV25771.1| hypothetical protein TRIADDRAFT_55203 [Trichoplax adhaerens]
Length = 428
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 83/347 (23%), Positives = 128/347 (36%), Gaps = 82/347 (23%)
Query: 67 SSCKSLKDPCPYIADYSTEDTSS------------------SGYLVDDILHLASFSKHAP 108
S C + P P + Y + SS SG LV D+LHL H
Sbjct: 81 SFCGIMAAPSPVVKHYFHMNRSSTLEETNLRIDSSYVKGYWSGQLVSDMLHLG-IGLHK- 138
Query: 109 QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP----------SLLAKAGLIQ 158
Q +Q + I Q + + DG++GL ++V ++ +AG I+
Sbjct: 139 QVRIQFAAIT----NQKEFFTETTRFDGILGLAYPSLAVQGNFYQKPVFNEIVQQAG-IR 193
Query: 159 NSFSICFDENDSGSVFFGDQ-------------------GPATQQSTSFLPIGEKYDAYF 199
+ F++ + + FG+Q GP + PI EKY F
Sbjct: 194 DIFTLTYCASKMRKDLFGNQYITGGGFMTLGGIDNNLLAGPVF-----YTPIVEKYYYQF 248
Query: 200 ----VGVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 255
V V+ IG S + ALVDSG S P +Y ++ F + + + + G
Sbjct: 249 QLTNVLVDGQSIGFSPYDYMHYPALVDSGTSILRFPPFMYKRLMPIFLRSIQDRSVFSHG 308
Query: 256 NSWK---YCYNASSEEMLKVPDMRLI----------FSKNQSFVV-----RNHIFSFPEN 297
++ C S + P +RL F + F + + I S E
Sbjct: 309 FFYRGHAVCMEESQLLQHRFPTIRLSIRLASFEKTNFKTPRQFTLVLSPMQYFILSGKER 368
Query: 298 EGFTVFCLTVMSTDGDYGII-GQNFMMGHRIVFDRENLKLAWSHSKC 343
G + + T G +GII G M G + FDR N L ++ SKC
Sbjct: 369 HGKPCYHFGIAGTSGAFGIILGDVVMKGFSVTFDRVNSMLGFAVSKC 415
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 75/323 (23%), Positives = 129/323 (39%), Gaps = 44/323 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRS--SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+ P++SS+ ++ CS P C SC + + D+S S L D L LA
Sbjct: 138 FSPNTSSTYASLQCSVPQCTQVRGLSCPTTGTAACFFNQTYGGDSSFSAMLSQDSLGLA- 196
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG-LIQNSF 161
S GC +GS L P G++GLG G + SLL+++G L F
Sbjct: 197 -------VDTLPSYSFGCVNAVSGSTL---PPQGLLGLGRGPM---SLLSQSGSLYSGVF 243
Query: 162 SICFDEND----SGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL---- 212
S CF SGS+ G G P ++T L + Y+V + +G +
Sbjct: 244 SYCFPSFKSYYFSGSLRLGPLGQPKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAP 303
Query: 213 ------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
+G ++DSG T +YA + +F K V ++ ++ C+ A++
Sbjct: 304 ELLAFDPNTGAGTIIDSGTVITRFVEPVYAAIRDEFRKQVKGPFATI--GAFDTCFAATN 361
Query: 267 EEMLKVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD----YGIIGQNF 321
E++ P + F+ + + N + + ++ CL + + + +I
Sbjct: 362 EDI--APPVTFHFTGMDLKLPLENTLI---HSSAGSLACLAMAAAPNNVNSVLNVIANLQ 416
Query: 322 MMGHRIVFDRENLKLAWSHSKCE 344
RI+FD N +L + C
Sbjct: 417 QQNLRIMFDVTNSRLGIARELCN 439
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 65/262 (24%), Positives = 106/262 (40%), Gaps = 51/262 (19%)
Query: 45 YDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
+DP SS+ SCS C + R + SL C Y Y + ++++G D L L
Sbjct: 165 FDPGKSSTYTPFSCSSAACTRLEGRDNGCSLNSTCQYTVRYG-DGSNTTGTYGSDTLALN 223
Query: 102 SFSKHAPQSSVQSSVIIGCGRK-QTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQN 159
S K + GC G LD DG+MGLG G PSL+++ A +
Sbjct: 224 STEK-------VENFQFGCSETSDPGEGLDEDQTDGLMGLGGG---APSLVSQTAATYGS 273
Query: 160 SFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-----------------YFVGV 202
+FS C PAT +S+ FL +G YFV +
Sbjct: 274 AFSYCL--------------PATTRSSGFLTLGASTGTSGFVTTPMFRSRRAPTFYFVIL 319
Query: 203 ESYCIGNS--CLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 258
+ +G ++ + F A ++DSG T LP Y+ + F + + +
Sbjct: 320 QGINVGGDPVAISPTVFAAGSIMDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSIL 379
Query: 259 KYCYNASSEEMLKVPDMRLIFS 280
C++ + ++ + +P + L+FS
Sbjct: 380 DTCFDFTGQDNVSIPAVELVFS 401
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 83/338 (24%), Positives = 145/338 (42%), Gaps = 59/338 (17%)
Query: 45 YDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
YDP SS+ + CS LC+ S +C S K+ C Y ED S V +L
Sbjct: 137 YDPGESSTFAFLPCSDRLCQEGQFSFKNCTS-KNRCVY------EDVYGSAAAVG-VLAS 188
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+F+ A + +V + GCG GS + G++GL +S+ + L IQ
Sbjct: 189 ETFTFGA-RRAVSLRLGFGCGALSAGSLIGAT---GILGLSPESLSLITQLK----IQR- 239
Query: 161 FSIC---FDENDSGSVFFGDQGPATQ-------QSTSFLPIGEKYDAYFVGVESYCIGNS 210
FS C F + + + FG ++ Q+T+ + K Y+V + +G+
Sbjct: 240 FSYCLTPFADKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHK 299
Query: 211 CLT----------QSGFQALVDSGASFTFLPT---EIYAEVVVKFDKLVSSKRISLQGNS 257
L G +VDSG++ +L E E V+ +L + R
Sbjct: 300 RLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTV---ED 356
Query: 258 WKYCY------NASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTV-MS 309
++ C+ A++ E ++VP + L F + V+ R++ F P + CL V +
Sbjct: 357 YELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAG---LMCLAVGKT 413
Query: 310 TDGD-YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
TDG IIG ++FD ++ K +++ ++C+++
Sbjct: 414 TDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQCDQI 451
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 75/311 (24%), Positives = 122/311 (39%), Gaps = 41/311 (13%)
Query: 49 SSSSSKNVSCSHPLCKSRSSCK---SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 105
+SSS K + C+ C SS ++ C Y +Y + + +SG + D + S
Sbjct: 53 ASSSYKKLPCNSTHCSGMSSAGIGPRCEETCKYKYEYG-DGSRTSGDVGSDRISFRSHGA 111
Query: 106 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 165
S + GCGRK G D G++GLG S+ L + FS C
Sbjct: 112 GEDHRSFFDGFLFGCGRKLKG---DWNFTQGLIGLGQKSHSLIQQLGDK--LGYKFSYCL 166
Query: 166 DENDS-----GSVFFGDQGPATQQSTSFLPI--GEKYDA--YFVGVESYCIGNSCLT--- 213
DS +F G PI G+ D Y+V ++S +G +
Sbjct: 167 VSYDSPPSAKSFLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYD 226
Query: 214 -QSGF----------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKY 260
+SG + ++DSG ++T L +Y + ++ V + GNS
Sbjct: 227 KESGHNTSVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTL---GNSAGLDL 283
Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ 319
C+N+S + P + F+ V+ +IF + V CL++ S+ GD IIG
Sbjct: 284 CFNSSGDTSYGFPSVTFYFANQVQLVLPFENIFQVTSRD---VVCLSMDSSGGDLSIIGN 340
Query: 320 NFMMGHRIVFD 330
I++D
Sbjct: 341 MQQQNFHILYD 351
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 74/312 (23%), Positives = 126/312 (40%), Gaps = 31/312 (9%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
++P+SS+S +SC C+S + + C Y Y + + + G V + + L S S
Sbjct: 186 FEPASSTSYSPLSCDTKQCQSLDVSECRNNTCLYEVSYG-DGSYTVGDFVTETITLGSAS 244
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
+V IGCG G ++ A G+ G L S PS + + SFS C
Sbjct: 245 V--------DNVAIGCGHNNEGLFIGAAGLLGLGGGKL---SFPSQINAS-----SFSYC 288
Query: 165 F--DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCLT--QSGFQA 219
++DS S + T+ L + D Y+VG+ +G L+ +S F+
Sbjct: 289 LVDRDSDSASTLEFNSALLPHAITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEM 348
Query: 220 --------LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
++DSG + T L T Y + F K ++ + + CY+ S + ++
Sbjct: 349 DESGNGGIIIDSGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEVALFDTCYDLSRKTSVE 408
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDR 331
VP + + + + + P + T FC T IIG G R+ FD
Sbjct: 409 VPTVTFHLAGGKVLPLPATNYLIPVDSDGT-FCFAFAPTSSALSIIGNVQQQGTRVGFDL 467
Query: 332 ENLKLAWSHSKC 343
N + + +C
Sbjct: 468 ANSLVGFEPRQC 479
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 79/320 (24%), Positives = 131/320 (40%), Gaps = 41/320 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
++P S S V C PLC+ S C + C Y Y + + ++G V + L
Sbjct: 171 FNPVKSGSFAKVLCRTPLCRRLESPGCNQ-RQTCLYQVSYG-DGSYTTGEFVTETLTF-- 226
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
+ + V +GCG G ++ A ++GLG G +S PS + FS
Sbjct: 227 ------RRTKVEQVALGCGHDNEGLFVGAAG---LLGLGRGGLSFPSQAGRT--FNQKFS 275
Query: 163 ICFDENDSGS----VFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGN---SCL 212
C + + S V FG+ A ++ F P+ + D Y+V + +G S +
Sbjct: 276 YCLVDRSASSKPSSVVFGNS--AVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGI 333
Query: 213 TQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
T S F+ ++D G S T L Y + F SS + + + + + CY+
Sbjct: 334 TASHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDL 393
Query: 265 SSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 323
S + +KVP + L F + S N++ +G FC T IIG
Sbjct: 394 SGKTTVKVPTVVLHFRGADVSLPASNYLIPV---DGSGRFCFAFAGTTSGLSIIGNIQQQ 450
Query: 324 GHRIVFDRENLKLAWSHSKC 343
G R+V+D + ++ +S C
Sbjct: 451 GFRVVYDLASSRVGFSPRGC 470
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 73/339 (21%), Positives = 132/339 (38%), Gaps = 53/339 (15%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVD 95
S ++P SSS C+ +C +R+ SC C I Y+ + +S+ G L
Sbjct: 95 STFNPLLSSSYTPTPCNSSVCMTRTRDLTIPASCDPNNKLCHVIVSYA-DASSAEGTLAA 153
Query: 96 DILHLASFSKHAPQSSVQSSVIIGC--GRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
+ LA + Q + GC T + A G+MG+ G +S+ +
Sbjct: 154 ETFSLAG--------AAQPGTLFGCMDSAGYTSDINEDAKTTGLMGMNRGSLSLVT---- 201
Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPI------GEKYD--AYFVGVESY 205
++ FS C D+ V GP+ + P+ +D AY V +E
Sbjct: 202 -QMVLPKFSYCISGEDAFGVLLLGDGPSAPSPLQYTPLVTATTSSPYFDRVAYTVQLEGI 260
Query: 206 CIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 254
+ L T +G Q +VDSG FTFL +Y + +F + ++
Sbjct: 261 KVSEKLLQLPKSVFVPDHTGAG-QTMVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTRIE 319
Query: 255 GNSWKY------CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM 308
++ + CY+A + + VP + L+FS + V + V+C T
Sbjct: 320 DPNFVFEGAMDLCYHAPA-SLAAVPAVTLVFSGAEMRVSGERLLYRVSKGRDWVYCFTFG 378
Query: 309 STD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
++D + +IG + + FD ++ ++ + C+
Sbjct: 379 NSDLLGIEAYVIGHHHQQNVWMEFDLVKSRVGFTETTCD 417
>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
Length = 469
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 84/380 (22%), Positives = 148/380 (38%), Gaps = 74/380 (19%)
Query: 15 NALLCLPVTTLLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCK----SRSSCK 70
++L+CLP T+ C S + + + P +SSSSK + C P C+ C+
Sbjct: 111 SSLVCLPCTSRYLCSGC-DFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFLYGPNVQCR 169
Query: 71 SLKDP--------CP-YIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCG 121
DP CP YI Y S++G L+ + L + ++GC
Sbjct: 170 GC-DPNTRNCTVGCPPYILQYGLG--STAGVLITEKLDFPDLT--------VPDFVVGC- 217
Query: 122 RKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEN--------DSGSV 173
S + P G+ G G G VS+PS + S FD+ D+GS
Sbjct: 218 -----SIISTRQPAGIAGFGRGPVSLPSQMNLKRFSHCLVSRRFDDTNVTTDLDLDTGS- 271
Query: 174 FFGDQGPATQQSTSFLPIGEK--------YDAYFVGVESYCIGNSCL----------TQS 215
G + ++ P + + Y++ + +G + T
Sbjct: 272 --GHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIPYKYLAPGTNG 329
Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVS--SKRISLQGNS-WKYCYNASSEEMLKV 272
++VDSG++FTF+ ++ V +F +S ++ L+ + C+N S + + V
Sbjct: 330 DGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGLGPCFNISGKGDVTV 389
Query: 273 PDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTD--------GDYGIIGQNFMM 323
P++ F + ++ F+F N CLTV+S G I+G
Sbjct: 390 PELIFEFKGGAKLELPLSNYFTFVGNT--DTVCLTVVSDKTVNPSGGTGPAIILGSFQQQ 447
Query: 324 GHRIVFDRENLKLAWSHSKC 343
+ + +D EN + ++ KC
Sbjct: 448 NYLVEYDLENDRFGFAKKKC 467
>gi|328768800|gb|EGF78845.1| hypothetical protein BATDEDRAFT_12639 [Batrachochytrium
dendrobatidis JAM81]
Length = 355
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 75/301 (24%), Positives = 122/301 (40%), Gaps = 43/301 (14%)
Query: 58 CSHPLCKSRSSCKSLKDPC------PYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 111
CS P C S L + Y T S SG + D ++A + S
Sbjct: 82 CSDPACVKHSQFNRLLSSTWTSLTQTFSIQYGTG--SLSGVMSSDTFYMAGLT--VTNQS 137
Query: 112 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAKAGLI-QNSFSIC 164
SV Q G+ DGV+GLG+ ++S+ ++ + GLI F +
Sbjct: 138 FAESV------SQPGTTFINTKYDGVLGLGMREISINNVATPMENMHAQGLIPAGVFGLY 191
Query: 165 FDENDS-GSVF-FGDQGPA-TQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV 221
+N + GSV G P+ S ++LP+ K + VG+ S + L Q+ QA+
Sbjct: 192 LTKNSAPGSVLTIGGYDPSHVDGSITWLPL-SKRQFWQVGLTSVTFNGTTLIQNA-QAVF 249
Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 281
DSG S +PT VS+ I Q + Y +P + +
Sbjct: 250 DSGTSLIAIPT-------------VSATLIHQQLGAIPYQNGLQLIPCTGLPSVTFML-N 295
Query: 282 NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 341
N SF +RN + P G+ V + G + I+G +FM + +FD +N ++ ++S
Sbjct: 296 NVSFTLRNEDYVIPFGFGYCVSAFVGLDMHG-FWILGDSFMKLYYTIFDSDNNRIGIANS 354
Query: 342 K 342
+
Sbjct: 355 R 355
>gi|328768784|gb|EGF78829.1| hypothetical protein BATDEDRAFT_12559 [Batrachochytrium
dendrobatidis JAM81]
Length = 355
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 75/301 (24%), Positives = 122/301 (40%), Gaps = 43/301 (14%)
Query: 58 CSHPLCKSRSSCKSLKDPC------PYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSS 111
CS P C S L + Y T S SG + D ++A + S
Sbjct: 82 CSDPACVKHSQFNRLLSSTWTSLTQTFSIQYGTG--SLSGVMSSDTFYMAGLT--VTNQS 137
Query: 112 VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAKAGLI-QNSFSIC 164
SV Q G+ DGV+GLG+ ++S+ ++ + GLI F +
Sbjct: 138 FAESV------SQPGTTFINTKYDGVLGLGMREISINNVATPMENMHAQGLIPAGVFGLY 191
Query: 165 FDENDS-GSVF-FGDQGPA-TQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGFQALV 221
+N + GSV G P+ S ++LP+ K + VG+ S + L Q+ QA+
Sbjct: 192 LTKNSAPGSVLTIGGYDPSHVDGSITWLPL-SKRQFWQVGLTSVTFNGTTLIQNA-QAVF 249
Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSK 281
DSG S +PT VS+ I Q + Y +P + +
Sbjct: 250 DSGTSLIAIPT-------------VSATLIHQQLGAIPYQNGLQLIPCTGLPSVTFML-N 295
Query: 282 NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHS 341
N SF +RN + P G+ V + G + I+G +FM + +FD +N ++ ++S
Sbjct: 296 NVSFTLRNEDYVIPFGFGYCVSAFVGLDMHG-FWILGDSFMKLYYTIFDSDNNRIGIANS 354
Query: 342 K 342
+
Sbjct: 355 R 355
>gi|32487305|emb|CAE05796.1| OSJNBb0046K02.6 [Oryza sativa Japonica Group]
gi|38344664|emb|CAE02326.2| OSJNBb0112E13.8 [Oryza sativa Japonica Group]
gi|125547764|gb|EAY93586.1| hypothetical protein OsI_15371 [Oryza sativa Indica Group]
gi|125589862|gb|EAZ30212.1| hypothetical protein OsJ_14269 [Oryza sativa Japonica Group]
Length = 174
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 43/88 (48%), Gaps = 2/88 (2%)
Query: 32 FGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSG 91
FG V R L+ YDP SS SSK V C +C SR C ++ CPYIA YS + + G
Sbjct: 89 FGHVCVCLRKLTFYDPRSSVSSKEVKCDDTICTSRPPC-NMTLRCPYIAAYS-DGGLTMG 146
Query: 92 YLVDDILHLASFSKHAPQSSVQSSVIIG 119
L D+LH + +SV G
Sbjct: 147 ILFTDLLHYHQLYGNGQTQPTSTSVTFG 174
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 70/267 (26%), Positives = 112/267 (41%), Gaps = 31/267 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS---RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLA 101
+DP+ SSS V C P+C ++ C Y+ Y + ++++G D L L+
Sbjct: 185 FDPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYG-DGSNTTGVYSSDTLTLS 243
Query: 102 SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNS 160
+ SS GCG Q+G + +G DG++GLG PSL+ + AG
Sbjct: 244 A-------SSAVQGFFFGCGHAQSGLF-NGV--DGLLGLGR---EQPSLVEQTAGTYGGV 290
Query: 161 FSICFDENDS--GSVFFGDQGPATQ----QSTSFLPIGEKYDAYFVGVESYCIGNSCLT- 213
FS C S G + G GP+ +T LP Y V + +G L+
Sbjct: 291 FSYCLPTKPSTAGYLTLGLGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSV 350
Query: 214 -QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKYCYNASSEE 268
S F +VD+G T LP YA + F ++S ++ CYN +
Sbjct: 351 PASAFAGGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYG 410
Query: 269 MLKVPDMRLIFSKNQSFVV-RNHIFSF 294
+ +P++ L F + ++ + I SF
Sbjct: 411 TVTLPNVALTFGSGATVMLGADGILSF 437
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 87/340 (25%), Positives = 149/340 (43%), Gaps = 63/340 (18%)
Query: 45 YDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
YDP SS+ + CS LC+ S +C S K+ C Y ED S V +L
Sbjct: 59 YDPGESSTFAFLPCSDRLCQEGQFSFKNCTS-KNRCVY------EDVYGSAAAVG-VLAS 110
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+F+ A + +V + GCG GS L GA G++GL +S+ + L IQ
Sbjct: 111 ETFTFGA-RRAVSLRLGFGCGALSAGS-LIGAT--GILGLSPESLSLITQLK----IQR- 161
Query: 161 FSIC---FDENDSGSVFFGDQGPATQ-------QSTSFL--PIGEKYDAYFVGVESYCIG 208
FS C F + + + FG ++ Q+T+ + P+ Y Y+V + +G
Sbjct: 162 FSYCLTPFADKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVETVY--YYVPLVGISLG 219
Query: 209 NSCLT----------QSGFQALVDSGASFTFLPT---EIYAEVVVKFDKLVSSKRISLQG 255
+ L G +VDSG++ +L E E V+ +L + R
Sbjct: 220 HKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTV--- 276
Query: 256 NSWKYCY------NASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTV- 307
++ C+ A++ E ++VP + L F + V+ R++ F P + CL V
Sbjct: 277 EDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRA---GLMCLAVG 333
Query: 308 MSTDGD-YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+TDG IIG ++FD ++ K +++ ++C+++
Sbjct: 334 KTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQCDQI 373
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 72/319 (22%), Positives = 125/319 (39%), Gaps = 34/319 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
+ PS+SSS ++VSC+ C+S +C S C Y+ +Y + + ++G L +
Sbjct: 105 FKPSTSSSYQSVSCNSSTCQSLQFATGNTGACGSNPSTCNYVVNYG-DGSYTNGELGVEQ 163
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
L S S + GCGR G + G+MGLG +S+ S
Sbjct: 164 LSFGGVSV--------SDFVFGCGRNNKGLF---GGVSGLMGLGRSYLSLVS--QTNATF 210
Query: 158 QNSFSICF---DENDSGSVFFGDQGPATQQS-----TSFLPIGEKYDAYFVGVESYCIGN 209
FS C + SGS+ G++ + T LP + + Y + + +
Sbjct: 211 GGVFSYCLPTTESGASGSLVMGNESSVFKNVTPITYTRMLPNPQLSNFYILNLTGIDVDG 270
Query: 210 SCLTQSGF---QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
L F L+DSG T LP+ +Y + F K + + + C+N +
Sbjct: 271 VALQVPSFGNGGVLIDSGTVITRLPSSVYKALKALFLKQFTGFPSAPGFSILDTCFNLTG 330
Query: 267 EEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
+ + +P + + F N V + E+ L +S D IIG
Sbjct: 331 YDEVSIPTISMHFEGNAELKVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRN 390
Query: 325 HRIVFDRENLKLAWSHSKC 343
R+++D + K+ ++ C
Sbjct: 391 QRVIYDTKQSKVGFAEESC 409
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 76/320 (23%), Positives = 124/320 (38%), Gaps = 43/320 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
++ S++ K + C P CK + C + Y + S+ L D + L+
Sbjct: 74 FNTVKSTTFKTLGCGAPQCKQVPNPICGGSTCTWNTTYGSSTILSN--LTRDTIALSM-- 129
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
P + GC +K TGS + P G++G G G +S L L +++FS C
Sbjct: 130 DPVPYYA------FGCIQKATGSSVP---PQGLLGFGRGPLSF--LSQTQNLYKSTFSYC 178
Query: 165 FDE----NDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------- 212
N SGS+ G G P ++T L + Y+V + +G +
Sbjct: 179 LPSFRTLNFSGSLRLGPVGQPPRIKTTPLLKNPRRSSLYYVKLNGIRVGRKIVDIPRSAL 238
Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
+G + DSG FT L Y V +F K V + +S G + CY+
Sbjct: 239 AFNPTTGAGTIFDSGTVFTRLVAPAYIAVRNEFRKRVGNATVSSLGG-FDTCYSVP---- 293
Query: 270 LKVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGD----YGIIGQNFMMG 324
+ P + +FS N + N + G T CL + + + +I
Sbjct: 294 IVPPTITFMFSGMNVTMPPENLLIH--STAGVTS-CLAMAAAPDNVNSVLNVIASMQQQN 350
Query: 325 HRIVFDRENLKLAWSHSKCE 344
HRI+FD N +L + +C
Sbjct: 351 HRILFDVPNSRLGVAREQCS 370
>gi|356500210|ref|XP_003518926.1| PREDICTED: basic 7S globulin-like [Glycine max]
Length = 435
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 76/291 (26%), Positives = 117/291 (40%), Gaps = 54/291 (18%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP------CPYIADYSTEDTSSSGYLVDD 96
S Y P+ S++ CS S +C S P C D + T++SG L D
Sbjct: 80 STYRPARCGSAQ---CSLARSDSCGNCFSAPKPGCNNNTCGVTPDNTVTGTATSGELAQD 136
Query: 97 ILHLASFSKHAP-QSSVQSSVIIGCGRKQTGSYLDGAAP--DGVMGLGLGDVSVPSLLAK 153
++ L S + P Q++ S + C L G A G+ GLG +++PS LA
Sbjct: 137 VVSLQSTNGFNPIQNATVSRFLFSCAPT---FLLQGLATGVSGMAGLGRTRIALPSQLAS 193
Query: 154 AGLIQNSFSICFDENDSGSVFFGDQGPAT-------QQSTSFLPI-------------GE 193
A + F++C ++ G FFGD GP Q +F P+ GE
Sbjct: 194 AFSFRRKFAVCLSSSN-GVAFFGD-GPYVLLPNVDASQLLTFTPLLINPVSTASAFSQGE 251
Query: 194 KYDAYFVGVESYCIG------NSCLTQSGFQAL----VDSGASFTFLPTEIYAEVVVKFD 243
YF+GV+S I N+ L + + + S +T L I+ V F
Sbjct: 252 PSAEYFIGVKSIKIDEKTVPLNTTLLSINSKGVGGTKISSVNPYTVLEDSIFKAVTEAFV 311
Query: 244 KLVSSKRISLQGNSWKYCYNASSEEML------KVPDMRLIFSKNQSFVVR 288
K S++ I+ + + S E +L VP + L+ +NQ V R
Sbjct: 312 KASSARNITRVASVAPFEVCFSRENVLATRLGAAVPTIELVL-QNQKTVWR 361
>gi|242095592|ref|XP_002438286.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
gi|241916509|gb|EER89653.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
Length = 495
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 76/321 (23%), Positives = 129/321 (40%), Gaps = 35/321 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DPS SSS ++V C P C S S C + ST +G +V D L L+
Sbjct: 185 FDPSMSSSFRSVLCGSPDCGGHSC--SAGGSCTFTLQNSTF-VFGNGTIVMDTLTLSP-- 239
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS-LLAKAGLIQNSFSI 163
S+ + +GC + + DG A G + L L S+ + +L + +FS
Sbjct: 240 -----SATFENFAVGCMQLDNDLFTDGVA-VGNIDLSLSRHSLATRVLNSSPPGMAAFSY 293
Query: 164 CFDENDSGSVF---------FGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQ 214
C + F + D P G + Y+V + + I L
Sbjct: 294 CLPADTDTHGFLTIAPALSDYSDHAGVKYVPLVTNPTGPNF--YYVDLVAIAINGEDLPI 351
Query: 215 -----SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
+G ++DS ++FT+L IYA + +F K + + CYN + E
Sbjct: 352 PPALFTGNGTMIDSQSAFTYLNPPIYAALRDEFRKAMLQYQPVPAFGGLDTCYNFTLAEN 411
Query: 270 LKVPDMRLIFSKNQSFVV--RNHIFSFPEN--EGFTVFCLTVMST---DGDYGIIGQNFM 322
+ +PD+ L FS ++ + R ++ F E+ +GF CL + + + +G
Sbjct: 412 IYLPDITLRFSNGETMDLDDRQFMYFFREHLTDGFPFGCLAFAAAPDQNFPWNYLGSQVQ 471
Query: 323 MGHRIVFDRENLKLAWSHSKC 343
IV+D +A+ S+C
Sbjct: 472 RTKEIVYDVRGGMVAFVPSRC 492
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 79/353 (22%), Positives = 142/353 (40%), Gaps = 55/353 (15%)
Query: 35 SIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSR-----SSCKSLKDPCPYIADYSTEDTSS 89
++ D+ + + S S + V CS PLC S C + C Y Y + + +
Sbjct: 126 TVCFDQPVPVFRASVSHTFSRVPCSDPLCGHAVYLPLSGCAARDRSCFYAYGY-MDHSIT 184
Query: 90 SGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPS 149
+G + +D + A ++ ++ GCG G + + G+ G G G +S+PS
Sbjct: 185 TGKMAEDTFTFKA-PDRADTAAAVPNIRFGCGMMNYGLFTPNQS--GIAGFGTGPLSLPS 241
Query: 150 LLAKAGLIQNSFSICF---DENDSGSVFFGDQ---------GPATQQSTSFLP------I 191
L FS CF +E+ V G + GP QST F P +
Sbjct: 242 QLKV-----RRFSYCFTAMEESRVSPVILGGEPENIEAHATGPI--QSTPFAPGPAGAPV 294
Query: 192 GEKYDAYFVGVESYCIGNSCL--TQSGFQ--------ALVDSGASFTFLPTEIYA---EV 238
G + YF+ + +G + L S F +DSG + TF P ++ E
Sbjct: 295 GSQ-PFYFLSLRGVTVGETRLPFNASTFALKGDGSGGTFIDSGTAITFFPQAVFRSLREA 353
Query: 239 VVKFDKLVSSKRISLQGNSWKYCYNASSEEML-KVPDMRLIFSKNQSFVVRNHIFSFPEN 297
V L +K + N C++ +++ VP + L + R + ++
Sbjct: 354 FVAQVPLPVAKGYTDPDN--LLCFSVPAKKKAPAVPKLILHLEGADWELPRENYVLDNDD 411
Query: 298 EGFTV---FCLTVMSTDGDYGIIGQNFMMGH-RIVFDRENLKLAWSHSKCEEV 346
+G C+ ++S G I NF + IV+D E+ K+ ++ ++C+++
Sbjct: 412 DGSGAGRKLCVVILSAGNSNGTIIGNFQQQNMHIVYDLESNKMVFAPARCDKL 464
>gi|316927704|gb|ADU58605.1| xyloglucan-specific endoglucanase inhibitor 4 [Solanum tuberosum]
Length = 440
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 87/350 (24%), Positives = 133/350 (38%), Gaps = 63/350 (18%)
Query: 50 SSSSKNVSCSHPLCKSR------SSCKSLKDP------CPYIADYSTEDTSSSGYLVDDI 97
SSS K V C CK SC P C +I TS+ G L D+
Sbjct: 79 SSSYKPVPCGSIPCKRSLSGACVESCVGPPSPGCNNNTCSHIPYNHFIRTSTGGELAQDV 138
Query: 98 LHLASFSKHAPQSSVQSS-VIIGCGRKQTGSYLDGAAP--DGVMGLGLGDVSVPSLLAKA 154
+ L S P+ + ++ V+ C S L+G A G++GLG G V P+ LA A
Sbjct: 139 VSLQSTDGSNPRKYLSTNGVVFDCAPH---SLLEGLAKGVKGILGLGNGYVGFPTQLANA 195
Query: 155 GLIQNSFSICFDENDS--GSVFFGDQ------GPATQQSTSFLPI-------------GE 193
+ F+IC + + G +FFGD G + + P+ GE
Sbjct: 196 FSVPRKFAICLTSSTTSRGVIFFGDSPYVFLPGMDVSKRLVYTPLLKNPVSTSGSYFEGE 255
Query: 194 KYDAYFVGVESYCI-GNSCLTQSGFQALVDSGAS---------FTFLPTEIYAEVVVKFD 243
YF+GV S I GN + + G +T L T IY + F
Sbjct: 256 PSTDYFIGVTSIKINGNVVPINTTLLNITKDGKGGTKISTVDPYTKLETSIYNALTKAFV 315
Query: 244 K-LVSSKRISLQGNSWKYCYNASSEEMLK----VPDMRLIFSKNQSFVVRNHIFSFPENE 298
K L R+ +K CYN +S + VP + L+ N++ I+
Sbjct: 316 KSLAKVPRVKPVA-PFKVCYNRTSLGSTRVGRGVPPIELVLG-NKNATTSWTIWGVNSMV 373
Query: 299 GFT--VFCLTVMSTDGDYG-----IIGQNFMMGHRIVFDRENLKLAWSHS 341
V CL + ++ +IG + + + + FD N +L ++ S
Sbjct: 374 AMNNDVLCLGFLDGGVEFEPTTSIVIGAHQIEDNLLQFDIANKRLGFTSS 423
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 78/318 (24%), Positives = 137/318 (43%), Gaps = 33/318 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+D S S + K + C C+S C S K C Y Y + + S G L + L L S
Sbjct: 131 FDSSKSQTYKTLPCPSNTCQSVQGTFCSSRKH-CLYSIHY-VDGSQSLGDLSVETLTLGS 188
Query: 103 FSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
+ S VQ +IGCGR + + G++GLG G +S+ + L+ + F
Sbjct: 189 TNG----SPVQFPGTVIGCGRYNAIGIEEKNS--GIVGLGRGPMSLITQLSPS--TGGKF 240
Query: 162 SICFD---ENDSGSVFFGDQGPATQQSTSFLPIGEKYDA--YFVGVESYCIGNSCLT--- 213
S C S + FG+ + + T P+ K YF+ +E++ +G + +
Sbjct: 241 SYCLVPGLSTASSKLNFGNAAVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGS 300
Query: 214 -QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM- 269
SG + ++DSG + T LP +Y+++ K V +R+ CY + +++
Sbjct: 301 PGSGGKGNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQRVRDPNQVLGLCYKVTPDKLD 360
Query: 270 LKVPDMRLIFSKNQSFVVRNHIFSFPE-NEGFTVFCLTVMSTDGDYGIIG-QNFMMGHRI 327
VP + FS V N I +F + + F T +G + QN ++G
Sbjct: 361 ASVPVITAHFSGAD--VTLNAINTFVQVADDVVCFAFQPTETGAVFGNLAQQNLLVG--- 415
Query: 328 VFDRENLKLAWSHSKCEE 345
+D + +++ H+ C +
Sbjct: 416 -YDLQMNTVSFKHTDCTK 432
>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 460
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 81/340 (23%), Positives = 129/340 (37%), Gaps = 60/340 (17%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSL---KDPCPYIADYSTEDTSSSGYLVDDILH 99
++P+SS++ + V C P C SC SL K+ C + Y D+S L D L
Sbjct: 135 FNPASSATFRPVPCGAPPCSQAPNPSCTSLAKSKNSCGFSLSYG--DSSLDATLSQDNLA 192
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA-GLIQ 158
+ + V GC K GS AAP GL +A+ G+ +
Sbjct: 193 VTA------NGGVIKGYTFGCLTKSNGS----AAP--AQGLLGLGRGPLGFVAQTKGIYE 240
Query: 159 NSFSICFDE------NDSGSVFFGDQG---PATQQSTSFLPIGEKYDAYFVGVESYCIGN 209
+FS C N SGS+ G +G P ++T L + Y+V + IG
Sbjct: 241 GTFSYCLPSYYRSAANFSGSLTLGRKGQPAPEKMKTTPLLASPHRPSLYYVAMTGVRIGK 300
Query: 210 SCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS-----------S 248
+ +G ++DSG F L YA V + + V+ S
Sbjct: 301 KSVPIPPSALAFDAATGAGTVLDSGTMFARLAQPAYAAVRDEVRRRVAGSLRRRGGGGAS 360
Query: 249 KRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM 308
+S G + CYN S+ + P + L+F + G T CL +
Sbjct: 361 VSVSSLGG-FDTCYNVST---VAWPAVTLVFGGGMEVRLPEENVVIRSTYGSTS-CLAMA 415
Query: 309 STDGD-----YGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
++ D +IG HR++FD N ++ ++ +C
Sbjct: 416 ASPADGVNAALNVIGSLQQQNHRVLFDVPNARVGFARERC 455
>gi|357476865|ref|XP_003608718.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355509773|gb|AES90915.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 482
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 61/264 (23%), Positives = 105/264 (39%), Gaps = 47/264 (17%)
Query: 132 AAPDGVMGLGLGDVSVPSLLA-KAGLIQNSFSICFDENDSGS--------VFFGDQGPAT 182
+ P GV G G G +S+P+ LA + + N FS C + S + G
Sbjct: 214 SEPTGVAGFGRGLLSLPAQLATHSPQLGNRFSYCLVSHSFRSERIRKPSPLILGRYNDEK 273
Query: 183 QQS---------TSFLPIGEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDS 223
Q + TS L + Y VG++ +G + + +VDS
Sbjct: 274 QSNGDEVVEFVYTSMLENPKHSYFYTVGLKGISVGKKTVPAPKILRRVNKKGDGGVVVDS 333
Query: 224 GASFTFLPTEIYAEVVVKFDKLV--SSKRIS--LQGNSWKYCYNASSEEMLKVPDMRLIF 279
G +FT LP + Y VV FD+ S++R Q CY ++ ++ +R +
Sbjct: 334 GTTFTMLPEKFYNSVVEGFDRRARKSNRRAPEIEQKTGLSPCYYLNTAAIVPAVTLRFV- 392
Query: 280 SKNQSFVV--RNHIFSFPE-----NEGFTVFCLTVMS-------TDGDYGIIGQNFMMGH 325
N S V+ +N+ + F + V CL M+ + G G++G G
Sbjct: 393 GMNSSVVLPRKNYFYEFMDGGDGVRRKERVGCLMFMNGGDEAEMSGGPGGVLGNYQQQGF 452
Query: 326 RIVFDRENLKLAWSHSKCEEVIDK 349
+ +D E ++ ++ KC + D+
Sbjct: 453 EVEYDLEKKRVGFARRKCASLWDR 476
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 81/321 (25%), Positives = 126/321 (39%), Gaps = 42/321 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSR--SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
++P++SS+ + V C+ PLCK S C++ K C Y Y D +
Sbjct: 195 FNPAASSTYRKVPCATPLCKKLDISGCRN-KRYCEYQVSYG-----------DGSFTVGD 242
Query: 103 FSKHAP--QSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
FS + V V +GCG G ++ A G+ G +S PS
Sbjct: 243 FSTETLTFRGQVIRRVALGCGHDNEGLFIGAAGLLGLG---RGSLSFPS--QTGAQFSKR 297
Query: 161 FSICF-DENDSG---SVFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNSCLT 213
FS C D + SG S+ FG A +S F P+ K D Y+V + +G LT
Sbjct: 298 FSYCLVDRSASGTASSLIFGK--AAIPKSAIFTPLLSNPKLDTFYYVELVGISVGGRRLT 355
Query: 214 Q---SGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
S F+ ++DSG S T L Y+ + F + + + + + CY
Sbjct: 356 SIPASVFRMDATGNGGVIIDSGTSVTRLVDSAYSTMRDAFRVGTGNLKSAGGFSLFDTCY 415
Query: 263 NASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
+ S + +KVP + F + + P + T FC G IIG
Sbjct: 416 DLSGLKTVKVPTLVFHFQGGAHISLPATNYLIPVDSSAT-FCFAFAGNTGGLSIIGNIQQ 474
Query: 323 MGHRIVFDRENLKLAWSHSKC 343
G+R+VFD ++ + C
Sbjct: 475 QGYRVVFDSLANRVGFKAGSC 495
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 67/268 (25%), Positives = 106/268 (39%), Gaps = 51/268 (19%)
Query: 114 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-----N 168
S++ +GC GA+ G++G+ +S PS L+ FS CF + N
Sbjct: 254 SNITLGCADIDREGLPTGAS--GLLGMDRRPISFPSQLSSR--YARKFSHCFPDKIAHLN 309
Query: 169 DSGSVFFGD--------------QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
SG VFFG+ Q PA ++ D Y+VG+ + S L
Sbjct: 310 SSGLVFFGESDIISPYLRYTPLVQNPAVPSAS--------LDYYYVGLVGISVDESRLPL 361
Query: 213 ----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
T SG ++DSG +FT+L + + +F S + + CY
Sbjct: 362 SHKNFDIDKVTGSG-GTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCY 420
Query: 263 NASSE----EMLKVPDMRLIFSKNQSFVVRNHIFSFP--ENEGFTVFCLT-VMSTDGDYG 315
N +S E +P + L F V+ + P +E T CL +MS D +
Sbjct: 421 NITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFLMSGDIPFN 480
Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
IIG + +D E L+L + ++C
Sbjct: 481 IIGNYQQQNLWVEYDLEKLRLGIAPAQC 508
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 76/320 (23%), Positives = 127/320 (39%), Gaps = 41/320 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
++P S S V C PLC+ S C + C Y Y + + ++G V + L
Sbjct: 84 FNPVKSGSFAKVLCRTPLCRRLESPGCNQ-RQTCLYQVSYG-DGSYTTGEFVTETLTF-- 139
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
+ + V +GCG G ++ A G+ +S PS + FS
Sbjct: 140 ------RRTKVEQVALGCGHDNEGLFVGAAGLLGLGRG---GLSFPSQAGRT--FNQKFS 188
Query: 163 ICFDENDSGS----VFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGN---SCL 212
C + + S V FG+ A ++ F P+ + D Y+V + +G S +
Sbjct: 189 YCLVDRSASSKPSSVVFGNS--AVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGI 246
Query: 213 TQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
T S F+ ++D G S T L Y + F SS + + + + + CY+
Sbjct: 247 TASHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDL 306
Query: 265 SSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 323
S + +KVP + L F + S N++ +G FC T IIG
Sbjct: 307 SGKTTVKVPTVVLHFRGADVSLPASNYLIPV---DGSGRFCFAFAGTTSGLSIIGNIQQQ 363
Query: 324 GHRIVFDRENLKLAWSHSKC 343
G R+V+D + ++ +S C
Sbjct: 364 GFRVVYDLASSRVGFSPRGC 383
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 84/317 (26%), Positives = 130/317 (41%), Gaps = 37/317 (11%)
Query: 43 SEYDPSSSSSSKNVSCSHPLC------KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
S +DPS+SS+ SCS C + + C S + C YI Y + +S++G D
Sbjct: 171 SLFDPSASSTYSPFSCSSAACVQLSQSQQGNGCSSSQ--CQYIVSY-VDGSSTTGTYSSD 227
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AG 155
L L S + Q GC + ++G + D DG+MGLG GD SL+++ AG
Sbjct: 228 TLTLGSNAIKGFQ--------FGCSQSESGGFSD--QTDGLMGLG-GDAQ--SLVSQTAG 274
Query: 156 LIQNSFSICFDENDSGSVFFGDQGPATQQS---TSFLPIGEKYDAYFVGVESYCIGNSCL 212
+FS C GS F G A++ T L + Y V +E+ +G L
Sbjct: 275 TFGKAFSYCLPPTP-GSSGFLTLGAASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQL 333
Query: 213 T--QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
S F A ++DSG T LP Y+ + F + + C++ S +
Sbjct: 334 NIPTSVFSAGSVMDSGTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGILDTCFDFSGQS 393
Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM--STDGDYGIIGQNFMMGHR 326
+ +P + L+FS V N F+ E +CL S D G IG
Sbjct: 394 SVSIPSVALVFSGG---AVVNLDFNGIMLE-LDNWCLAFAANSDDSSLGFIGNVQQRTFE 449
Query: 327 IVFDRENLKLAWSHSKC 343
+++D + + C
Sbjct: 450 VLYDVGGGAVGFRAGAC 466
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 66/298 (22%), Positives = 117/298 (39%), Gaps = 26/298 (8%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DP SS+ NVSC+ P C + C Y Y + + S G+ D L L+S+
Sbjct: 221 FDPVRSSTYANVSCAAPACSDLNIHGCSGGHCLYGVQYG-DGSYSIGFFAMDTLTLSSY- 278
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAGLIQNSFSI 163
GCG + G + + A G++GLG G S+P K G + F+
Sbjct: 279 ------DAVKGFRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDKYGGV---FAH 326
Query: 164 CFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA----YFVGVESYCIGNSCLT--QSGF 217
C +G+ + + +++ L D Y++G+ +G L+ QS F
Sbjct: 327 CLPARSTGTGYLDFGAGSPAAASARLTTPMLTDNGPTFYYIGMTGIRVGGQLLSIPQSVF 386
Query: 218 Q---ALVDSGASFTFLPTEIYAEVVVKFDKLVSSK--RISLQGNSWKYCYNASSEEMLKV 272
+VDSG T LP Y+ + F ++++ + + + CY+ + + +
Sbjct: 387 ATAGTIVDSGTVITRLPPPAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAI 446
Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD 330
P + L+F V + + GD GI+G + + +D
Sbjct: 447 PTVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYD 504
>gi|219120652|ref|XP_002181060.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407776|gb|EEC47712.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 453
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 71/327 (21%), Positives = 131/327 (40%), Gaps = 41/327 (12%)
Query: 46 DPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 105
DP SS+ + C L C + + C Y TE +S + V D L
Sbjct: 128 DPQRSSTLRYTQCGSCLLSGIQECAA-EQKCGINQRY-TEGSSWTAVEVSDTFVLGGPEI 185
Query: 106 HAPQSSVQSSVII--GCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFS 162
+ + V ++I GC +K G + A +G++GL D+S+ L K +I + SFS
Sbjct: 186 SSLEQYVSFTIIFAFGCQQKVRGLFRTQYA-NGILGLERSDLSLIKRLWKENVIPRESFS 244
Query: 163 ICFDENDSGSVFFGDQGPATQQSTS---FLPIGEKYDAYFVGVESYCIGNSCLTQS---- 215
+C + + G GP + T + P Y V V +G+ CLT +
Sbjct: 245 LCMTPFEG---YIGLGGPLRDKHTESMKYTPFTSTQSWYAVHVVRVFVGDECLTSNDQHD 301
Query: 216 -------------GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
G ++DSG + T+LP + + + +L ++ Q +S Y
Sbjct: 302 TVVEHALVEAFAEGKGTILDSGTTDTYLPKAVAGRMREIWARLSNTP---FQPSS---TY 355
Query: 263 NASSEEMLKVPDMRLIFSKNQSF--VVRNHIFSFPEN----EGFTVFCLTVMSTDGDYGI 316
+ +E +P + + N + + +N + PE G + + + +
Sbjct: 356 AYTYDEFRSLPIVTFELANNVTLQALPKNFMEDLPEPLRPWTGRRKLMNRLYADEVQGAV 415
Query: 317 IGQNFMMGHRIVFDRENLKLAWSHSKC 343
+G N M+G+ ++FD + + + + C
Sbjct: 416 VGLNTMVGYDLLFDVQGNRFGVAPALC 442
>gi|453087366|gb|EMF15407.1| candidapepsin-4 precursor [Mycosphaerella populorum SO2202]
Length = 471
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 76/303 (25%), Positives = 129/303 (42%), Gaps = 42/303 (13%)
Query: 69 CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIG---CGRKQT 125
C++ DPC Y+ D+S+ YL D + V +V IG +Q
Sbjct: 107 CQARGDPCSISGTYNANDSSTYTYLNSDFNISYVDGSGSAGDYVSDTVKIGDTTLTGQQF 166
Query: 126 GSYLDGAAPDGVMGLG--LGDVS-----------VPSLLAKAGLIQ-NSFSICFDEND-- 169
G + ++ +G++G+G + +V+ VP L KAG I N++S+ ++ D
Sbjct: 167 GIGYESSSQEGILGIGYPINEVAVQYNGGKTYSNVPQSLVKAGAINTNAYSLWLNDLDAS 226
Query: 170 SGSVFFGDQGPATQQSTSFL---PIGEKYDAY------FVGVESYCIGNSCLTQSGFQAL 220
+GS+ FG G T++ T L PI E Y V + S + + AL
Sbjct: 227 TGSILFG--GVNTEKYTGSLETIPIVETQGVYAEFIIALTAVGANGTAGSIVNKQAIPAL 284
Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 280
+DSG+S +LP +I + +D V + S QG ++ C A+S+ L L FS
Sbjct: 285 LDSGSSLMYLPNDITQSI---YDS-VGASYDSEQGAAFVDCDLANSDGSLD-----LTFS 335
Query: 281 KNQSFVVRNHIFSFPE-NEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFD--RENLKLA 337
V N + + G V L + ++G F+ +V+D + + LA
Sbjct: 336 SPTIKVPMNELVIVAGIDRGKEVCILGIGPAGSSTPVLGDTFLRSAYVVYDLAKNEISLA 395
Query: 338 WSH 340
++
Sbjct: 396 QTN 398
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 87/371 (23%), Positives = 164/371 (44%), Gaps = 64/371 (17%)
Query: 5 ICFGSHANAYNALLCLPVTTLLW-----CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCS 59
+ G+ A Y+A++ + L+W C + F D+ +DP SSS + CS
Sbjct: 101 LAIGTPAETYSAIMDTG-SDLIWTQCKPCKVCF------DQPTPIFDPEKSSSFSKLPCS 153
Query: 60 HPLCKSR--SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 117
LC + SSC D C Y Y + +S+ G L + S S +
Sbjct: 154 SDLCVALPISSC---SDGCEYRYSYG-DHSSTQGVLATETFTFGDAS--------VSKIG 201
Query: 118 IGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG--SVF 174
GCG G +Y GA G++GLG G + SL+++ G+ + S+ + ++ G ++
Sbjct: 202 FGCGEDNRGRAYSQGA---GLVGLGRGPL---SLISQLGVPKFSYCLTSIDDSKGISTLL 255
Query: 175 FGDQGPATQQSTSFLPIGE---KYDAYFVGVESYCIGNSCL--TQSGFQA--------LV 221
G + AT +S P+ + + Y++ +E +G++ L +S F ++
Sbjct: 256 VGSE--ATVKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLII 313
Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN---SWKYCYNASSE-EMLKVPDMRL 277
DSG + T+L +A + +F +S ++ + + + C+ + ++VP +
Sbjct: 314 DSGTTITYLKDNAFAALKKEF---ISQMKLDVDASGSTELELCFTLPPDGSPVEVPQLVF 370
Query: 278 IFSK-NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF-DRENLK 335
F + N+I E+ V CLT+ S+ G I G NF + +V D E
Sbjct: 371 HFEGVDLKLPKENYII---EDSALRVICLTMGSSSG-MSIFG-NFQQQNIVVLHDLEKET 425
Query: 336 LAWSHSKCEEV 346
++++ ++C ++
Sbjct: 426 ISFAPAQCNQL 436
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 82/318 (25%), Positives = 134/318 (42%), Gaps = 39/318 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++PS SSS K ++C+ +C K + S K+ C Y Y + + + + SF
Sbjct: 123 FNPSLSSSFKPLACASSICGKLKIKGCSRKNECMYQVSYGDGSFTVGDFSTETL----SF 178
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
+HA + SV +GCGR G + A ++GLG G +S PS + + FS
Sbjct: 179 GEHAVR-----SVAMGCGRNNQGLFHGAAG---LLGLGRGPLSFPSQTGTS--YASVFSY 228
Query: 164 CFDENDS---GSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------- 212
C +S S+ FG P + T LP Y+VG+ + S +
Sbjct: 229 CLPRRESAIAASLVFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAF 288
Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV---SSKRISLQGNSWKYCYNASS 266
++ +VDSG + + L T Y + F LV S+ ISL + CY+ SS
Sbjct: 289 AMGSRGTGGVIVDSGTAISRLTTPAYTALRDAFRSLVTFPSAPGISL----FDTCYDLSS 344
Query: 267 EEMLKVPDMRLIFSKNQSF-VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 325
+ +P + L F S + + I ++EG +CL + + IIG
Sbjct: 345 MKTATLPAVVLDFDGGASMPLPADGILVNVDDEG--TYCLAFAPEEEAFSIIGNVQQQTF 402
Query: 326 RIVFDRENLKLAWSHSKC 343
RI D + ++ + +C
Sbjct: 403 RISIDNQKEQMGIAPDQC 420
>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
Length = 414
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 84/348 (24%), Positives = 140/348 (40%), Gaps = 65/348 (18%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCKSRSS----CKSLKDPCPYIADYSTEDTSSSGYLVD 95
R + P+SSS+ + C+ LC+ +S C + C Y Y T+ GYL
Sbjct: 91 RPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATG--CVYYYPYGMGFTA--GYLAT 146
Query: 96 DILHL--ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK 153
+ LH+ ASF A S ++ V G + G++GLG +S L+++
Sbjct: 147 ETLHVGGASFPGVAFGCSTENGV--------------GNSSSGIVGLGRSPLS---LVSQ 189
Query: 154 AGLIQNSFSICF----DENDSGSVFFGDQGPAT--QQSTSFL--PIGEKYDAYFVGVESY 205
G+ FS C D DS + FG T + S + L P Y+V +
Sbjct: 190 VGV--GRFSYCLRSDADAGDS-PILFGSLAKVTGGKSSPAILENPEMPSSSYYYVNLTGI 246
Query: 206 CIGNSCL----TQSGFQ----------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
+G + L T GF +VDSG + T+L E YA V F +++ +
Sbjct: 247 TVGATDLPVTSTTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANL 306
Query: 252 SLQGNSWKY----CYNASSE---EMLKVPDMRLIFSKNQSFVVRNH----IFSFPENEGF 300
+ N ++ C++A++ + VP + L F+ + VR +
Sbjct: 307 TTTVNGTRFGFDLCFDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRA 366
Query: 301 TVFCLTVM--STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
V CL V+ S IIG M +++D + +++ + C V
Sbjct: 367 AVECLLVLPASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCANV 414
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 79/331 (23%), Positives = 125/331 (37%), Gaps = 42/331 (12%)
Query: 44 EYDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDP-CPYIADYSTEDTSSSGYLVDDIL 98
+DP+ SS+ + V C P C + SC + C + Y++ + L D L
Sbjct: 142 SFDPTQSSTYRPVRCGAPQCAQVPPATPSCPAGPGASCAFNLSYASSTLHA--VLGQDAL 199
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
L+ + A GC R TGS P G++G G G + S L++
Sbjct: 200 SLSDSNGAA---VPDDHYTFGCLRVVTGSG-GSVPPQGLVGFGRGPL---SFLSQTKATY 252
Query: 159 NS-FSICF----DENDSGSVFFGDQG-PATQQSTSFLP------------IGEKYDAYFV 200
S FS C N SG++ G G P ++T L +G + + V
Sbjct: 253 GSIFSYCLPSYKSSNFSGTLRLGPAGQPRRIKTTPLLSNPHRPSLYYVAMVGVRVNGKAV 312
Query: 201 GVESYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKY 260
+ + + T G +VD+G FT L YA + F + VS+ G +
Sbjct: 313 PIPASALALDAATGRG-GTIVDAGTMFTRLSPPAYAALRNAFRRGVSAPAAPALGG-FDT 370
Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQN 320
CY + + VP + +F+ + G V CL + + D G N
Sbjct: 371 CYYVNGTK--SVPAVAFVFAGGARVTLPEENVVISSTSG-GVACLAMAAGPSDGVNAGLN 427
Query: 321 FM-----MGHRIVFDRENLKLAWSHSKCEEV 346
+ HR+VFD N ++ +S C V
Sbjct: 428 VLASMQQQNHRVVFDVGNGRVGFSRELCTAV 458
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 85/322 (26%), Positives = 126/322 (39%), Gaps = 47/322 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLK-DPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++P SSS K++SC C ++ + C Y +Y + + S G + L L S
Sbjct: 180 FEPQQSSSYKHLSCLSSACTELTTMNHCRLGGCVYEINYG-DGSRSQGDFSQETLTLGSD 238
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL-AKAGLIQNSFS 162
S S GCG TG + A G++GLG +S PS +K G FS
Sbjct: 239 SF--------PSFAFGCGHTNTGLFKGSA---GLLGLGRTALSFPSQTKSKYG---GQFS 284
Query: 163 IC---FDENDSGSVFFGDQG--PATQQSTSFLPI--GEKYDA-YFVGVESYCIGN----- 209
C F + S F QG PAT +F+P+ Y + YFVG+ +G
Sbjct: 285 YCLPDFVSSTSTGSFSVGQGSIPAT---ATFVPLVSNSNYPSFYFVGLNGISVGGERLSI 341
Query: 210 --SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKF----DKLVSSKRISLQGNSWKYCYN 263
+ L + G +VDSG T L + Y + F L S+K S+ CY+
Sbjct: 342 PPAVLGRGG--TIVDSGTVITRLVPQAYDALKTSFRSKTRNLPSAKPFSI----LDTCYD 395
Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY--GIIGQNF 321
SS +++P + F N V F + CL S IIG
Sbjct: 396 LSSYSQVRIPTITFHFQNNADVAVSAVGILFTIQSDGSQVCLAFASASQSISTNIIGNFQ 455
Query: 322 MMGHRIVFDRENLKLAWSHSKC 343
R+ FD ++ ++ C
Sbjct: 456 QQRMRVAFDTGAGRIGFAPGSC 477
>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 466
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 82/360 (22%), Positives = 143/360 (39%), Gaps = 80/360 (22%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLC--------------KSRSSCKSLKDPCP-YIADYSTE 85
N ++ P +SSSSK V C++P C + +++ + CP Y Y
Sbjct: 128 NTPKFIPKNSSSSKFVGCTNPKCAWVFGPDVKSHCCRQDKAAFNNCSQTCPAYTVQYGLG 187
Query: 86 DTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDV 145
S++G+L+ + L+ + S ++GC S + P G+ G G G+
Sbjct: 188 --STAGFLLSENLNFPT--------KKYSDFLLGC------SVVSVYQPAGIAGFGRGEE 231
Query: 146 SVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQS----------TSFL--PIGE 193
S+PS + L + S+ + + D + + T S T FL P +
Sbjct: 232 SLPS---QMNLTRFSYCLLSHQFDDSATITSNLVLETASSRDGKTNGVSYTPFLKNPTTK 288
Query: 194 KYDA----YFVGVESYCIGNSCLT------------QSGFQALVDSGASFTFLPTEIYAE 237
K A Y++ ++ +G + GF +VDSG++FTF+ I+
Sbjct: 289 KNPAFGAYYYITLKRIVVGEKRVRVPRRLLEPNVDGDGGF--IVDSGSTFTFMERPIFDL 346
Query: 238 VVVKFDKLVSSKRISLQGNSWKY--CYN-ASSEEMLKVPDMRLIFS--KNQSFVVRNHIF 292
V +F K VS R + C+ A E P++R F V N+
Sbjct: 347 VAQEFAKQVSYTRAREAEKQFGLSPCFVLAGGAETASFPELRFEFRGGAKMRLPVANYFS 406
Query: 293 SFPENEGFTVFCLTVMSTD--GDYGIIGQNFMMGH------RIVFDRENLKLAWSHSKCE 344
+ + V CLT++S D G G +G ++G+ + +D EN + + C+
Sbjct: 407 LVGKGD---VACLTIVSDDVAGSGGTVGPAVILGNYQQQNFYVEYDLENERFGFRSQSCQ 463
>gi|297805186|ref|XP_002870477.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316313|gb|EFH46736.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 78/173 (45%), Gaps = 7/173 (4%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
+N++ +DP +SSS+ ++CS C S KS P Y +YS + + +SGY + D++
Sbjct: 119 QNVTFFDPGASSSAVKLACSDKRCFSDLHKKSGCSPLEYKVEYS-DGSFTSGYYISDLIS 177
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
+ + + GC G L + G++GLG G + V S L+ L
Sbjct: 178 FETVMSSNLTVKSSAPFVFGCSNLHAGLISLPETSIHGIVGLGKGRLLVVSQLSSQRLAP 237
Query: 159 NSFSICFD--ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGN 209
FS+C + G + G+ +T + P+ Y V ++++ + +
Sbjct: 238 EVFSLCLSGGQEGGGVIILGEN---RLPNTVYTPLVRSQTHYNVNLKTFAVND 287
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 81/325 (24%), Positives = 126/325 (38%), Gaps = 43/325 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCK-------------SRSSCKSLKDPCPYIADYSTEDTSSSG 91
+DP++S + V C P C +RS+ S + C Y Y + + S G
Sbjct: 225 FDPAASPTFAAVPCGSPACAASLKDATGAPGSCARSAGNS-EQRCYYALSYG-DGSFSRG 282
Query: 92 YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
L D L L + +K + GCG G + A G+MGLG D+S+ S
Sbjct: 283 VLAQDTLGLGTTTK-------LDGFVFGCGLSNRGLFGGTA---GLMGLGRTDLSLVS-- 330
Query: 152 AKAGLIQNSFSICF--DENDSGSVFFGDQGPAT----QQSTSFLPIGEKYDAYFVGVE-S 204
A FS C +GS+ G GP++ T + + YF+ + +
Sbjct: 331 QTAARFGGVFSYCLPATTTSTGSLSLG-PGPSSSFPNMAYTRMIADPTQPPFYFINITGA 389
Query: 205 YCIGNSCLTQSGFQA---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKY 260
G + LT GF A LVDSG T L +Y V +F + + + G S
Sbjct: 390 AVGGGAALTAPGFGAGNVLVDSGTVITRLAPSVYKAVRAEFARRF--EYPAAPGFSILDA 447
Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIG 318
CY+ + + + VP + L V F + + CL + S + IIG
Sbjct: 448 CYDLTGRDEVNVPLLTLTLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIG 507
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKC 343
R+V+D +L ++ C
Sbjct: 508 NYQQRNKRVVYDTVGSRLGFADEDC 532
>gi|222629275|gb|EEE61407.1| hypothetical protein OsJ_15596 [Oryza sativa Japonica Group]
Length = 466
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 56/244 (22%), Positives = 105/244 (43%), Gaps = 40/244 (16%)
Query: 132 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPI 191
A P GV G G G +S+P+ LA + L ++ + +++ V+ T L
Sbjct: 231 AEPVGVAGFGRGPLSLPAQLAPS-LSGSTDAAAIGASETDFVY-----------TPLLHN 278
Query: 192 GEKYDAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVK 241
+ Y V +E+ +G + +VDSG +FT LP++ +A V +
Sbjct: 279 PKHPYFYSVALEAVSVGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADE 338
Query: 242 FDKLVSSKRISLQGNSWKY-----CYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSF 294
F + +++ R + + CY+ S + VP + L F N + + RN+ F
Sbjct: 339 FARAMAAARFTRAEGAEAQTGLAPCYHYSPSDR-AVPPVALHFRGNATVALPRRNYFMGF 397
Query: 295 PENEGFTVFCLTVMSTDGD----------YGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
EG +V CL +M+ G+ G +G G +V+D + ++ ++ +C
Sbjct: 398 KSEEGRSVGCLMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCT 457
Query: 345 EVID 348
++ D
Sbjct: 458 DLWD 461
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 81/346 (23%), Positives = 152/346 (43%), Gaps = 62/346 (17%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRSSCKSLK---DP---CPYIADYSTEDTSSSGYLVDD 96
S ++P +S + + CS P C++R+ L DP C +I Y+ + +S G L +
Sbjct: 103 SIFNPLASKTYTKIPCSSPTCETRTRDLPLPVSCDPAKLCHFIISYA-DASSVEGNLAFE 161
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKAG 155
+ S + A + GC S + A G+MG+ G +S + + G
Sbjct: 162 TFRVGSVTGPA--------TVFGCMDSGFSSNSEEDAKTTGLMGMNRGSLS---FVNQMG 210
Query: 156 LIQNSFSICF-DENDSGSVFFGDQGPATQQSTSFLPIGEK------YD--AYFVGVESYC 206
FS C D + SG + G+ + + ++ P+ E +D AY V +E
Sbjct: 211 F--RKFSYCISDRDSSGVLLLGEASFSWLKPLNYTPLVEMSTPLPYFDRVAYSVQLEGIR 268
Query: 207 IGNSCLT--QSGF--------QALVDSGASFTFLPTEIYAEVVVKF-------DKLVSSK 249
+ + L+ +S F Q +VDSG FTFL +Y+ + +F ++++
Sbjct: 269 VSDKVLSLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALKQEFLLQTKGVLRVLNEP 328
Query: 250 RISLQGNSWKYCY--NASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFP-ENEGF-TVFC 304
R QG + CY + + +P + L+F + V + ++ P E G +V+C
Sbjct: 329 RYVFQG-AMDLCYLIEPTRAALPNLPVVNLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWC 387
Query: 305 LTVMSTDGDYGIIGQNFMMGHR------IVFDRENLKLAWSHSKCE 344
T ++D GI ++F++GH + +D E ++ ++ +C+
Sbjct: 388 FTFGNSD-SLGI--ESFVIGHHQQQNVWMEYDLEKSRIGFAEVRCD 430
>gi|213998796|gb|ACJ60765.1| nucellin [Hordeum marinum subsp. gussoneanum]
Length = 133
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 33/120 (27%), Positives = 60/120 (50%), Gaps = 3/120 (2%)
Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGE 193
DG++GLG+G + L +I N C G ++ G+ P ++ T ++P+ E
Sbjct: 10 DGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSSKGKGVLYVGNFNPPSRGVT-WVPMRE 68
Query: 194 KYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS 252
Y G+ I N + F+A+ DSG+++T +P++IY E+V K +S ++
Sbjct: 69 SSFYYSPGLAELLIDNQPIRGNPTFEAVFDSGSTYTLVPSQIYNEIVPKVRGTLSESSLA 128
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 81/322 (25%), Positives = 121/322 (37%), Gaps = 43/322 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPC-----PYIADYSTEDTSSSGYLVDDILH 99
++P SSS K + C C + +S PC Y +Y + +SS G + L
Sbjct: 179 FEPKQSSSYKTLPCLSATCTELITSESNPTPCLLGGCVYEINYG-DGSSSQGDFSQETLT 237
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL-LAKAGLIQ 158
L S S + GCG TG + G++GLG +S PS +K G
Sbjct: 238 LGSDSFQ--------NFAFGCGHTNTGLF---KGSSGLLGLGQNSLSFPSQSKSKYG--- 283
Query: 159 NSFSICF-DENDSGSVFFGDQGPAT-QQSTSFLPIGEKY---DAYFVGVESYCIGNSCLT 213
F+ C D S S G + S F P+ + YFVG+ +G L+
Sbjct: 284 GQFAYCLPDFGSSTSTGSFSVGKGSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLS 343
Query: 214 -----QSGFQALVDSGASFTFLPTEIYAEVVVKFDK----LVSSKRISLQGNSWKYCYNA 264
+VDSG T L + Y + F L S+K S+ CY+
Sbjct: 344 IPPAVLGRGSTIVDSGTVITRLLPQAYNALKTSFRSKTRDLPSAKPFSI----LDTCYDL 399
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST---DGDYGIIGQNF 321
S +++P + F N V + P G + CL S DG + IIG
Sbjct: 400 SRHSQVRIPTITFHFQNNADVAVSDVGILVPVQNGGSQVCLAFASASQMDG-FNIIGNFQ 458
Query: 322 MMGHRIVFDRENLKLAWSHSKC 343
R+ FD ++ ++ C
Sbjct: 459 QQRMRVAFDTGAGRIGFASGSC 480
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 88/373 (23%), Positives = 152/373 (40%), Gaps = 60/373 (16%)
Query: 1 MLGAICFGSHANAYNALLCLPVTTLLW-----CLLVFGASIVQDRNLSEYDPSSSSSSKN 55
L + G+ A Y+A++ + L+W C F D+ +DP SSS
Sbjct: 97 FLMKLAIGTPAETYSAIMDTG-SDLIWTQCKPCKDCF------DQPTPIFDPKKSSSFSK 149
Query: 56 VSCSHPLCKSR--SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQ 113
+ CS LC + SSC D C Y+ Y + +S+ G L + S
Sbjct: 150 LPCSSDLCAALPISSC---SDGCEYLYSYG-DYSSTQGVLATETFAFGDASV-------- 197
Query: 114 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF----DEND 169
S + GCG GS A G++GLG G +S+ S L + FS C D
Sbjct: 198 SKIGFGCGEDNDGSGFSQGA--GLVGLGRGPLSLISQLG-----EPKFSYCLTSMDDSKG 250
Query: 170 SGSVFFGDQGPATQQSTSFLPIGE---KYDAYFVGVESYCIGNSCL--TQSGFQA----- 219
S+ G + AT ++ P+ + + Y++ +E +G++ L +S F
Sbjct: 251 ISSLLVGSE--ATMKNAITTPLIQNPSQPSFYYLSLEGISVGDTLLPIEKSTFSIQNDGS 308
Query: 220 ---LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE-EMLKVPDM 275
++DSG + T+L +A + +F + C+ + + VP +
Sbjct: 309 GGLIIDSGTTITYLEDSAFAALKKEFISQLKLDVDESGSTGLDLCFTLPPDASTVDVPQL 368
Query: 276 RLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF-DREN 333
F + N+I + + G V CLT+ S+ G I G NF + +V D E
Sbjct: 369 VFHFEGADLKLPAENYIIA---DSGLGVICLTMGSSSG-MSIFG-NFQQQNIVVLHDLEK 423
Query: 334 LKLAWSHSKCEEV 346
++++ ++C ++
Sbjct: 424 ETISFAPAQCNQL 436
>gi|147821119|emb|CAN68736.1| hypothetical protein VITISV_030193 [Vitis vinifera]
Length = 441
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/147 (30%), Positives = 67/147 (45%), Gaps = 13/147 (8%)
Query: 43 SEYDPSSSSSSKNV------SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
S Y P+ SS+ SC H L + R C + C ++ S+G L +D
Sbjct: 81 SSYRPARCHSSQCFLAHGPKSCDHCLSRGRPKCNN--GTCILFSENVFTSKVSAGDLSED 138
Query: 97 ILHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
+L L S P+S+V + C + L G A +G+ GLG G + +P+LL+ A
Sbjct: 139 VLSLQSTDGLNPRSAVAIPHFLFSCAPEVLLQGLAGGA-EGIAGLGHGRIGLPTLLSSAL 197
Query: 156 LIQNSFSICF--DENDSGSVFFGDQGP 180
F++C SG +FFGD GP
Sbjct: 198 NFTRKFAVCLPPTTTSSGVIFFGD-GP 223
>gi|225451013|ref|XP_002284868.1| PREDICTED: basic 7S globulin-like [Vitis vinifera]
Length = 441
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 45/147 (30%), Positives = 67/147 (45%), Gaps = 13/147 (8%)
Query: 43 SEYDPSSSSSSKNV------SCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
S Y P+ SS+ SC H L + R C + C ++ S+G L +D
Sbjct: 81 SSYRPAQCHSSQCFLAHGPKSCDHCLSRGRPKCNN--GTCILFSENVFTSKVSAGDLSED 138
Query: 97 ILHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
+L L S P+S+V + C + L G A +G+ GLG G + +P+LL+ A
Sbjct: 139 VLSLQSTDGLNPRSAVAIPHFLFSCAPEVLLQGLAGGA-EGIAGLGHGRIGLPTLLSSAL 197
Query: 156 LIQNSFSICF--DENDSGSVFFGDQGP 180
F++C SG +FFGD GP
Sbjct: 198 NFTRKFAVCLPPTTTSSGVIFFGD-GP 223
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 82/318 (25%), Positives = 134/318 (42%), Gaps = 39/318 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLC-KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
++PS SSS K ++C+ +C K + S K+ C Y Y + + + + SF
Sbjct: 56 FNPSLSSSFKPLACASSICGKLKIKGCSRKNKCMYQVSYGDGSFTVGDFSTETL----SF 111
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSI 163
+HA + SV +GCGR G + A ++GLG G +S PS + + FS
Sbjct: 112 GEHAVR-----SVAMGCGRNNQGLFHGAAG---LLGLGRGPLSFPSQTGTS--YASVFSY 161
Query: 164 CFDENDS---GSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------- 212
C +S S+ FG P + T LP Y+VG+ + S +
Sbjct: 162 CLPRRESAIAASLVFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAF 221
Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLV---SSKRISLQGNSWKYCYNASS 266
++ +VDSG + + L T Y + F LV S+ ISL + CY+ SS
Sbjct: 222 AMGSRGTGGVIVDSGTAISRLTTPAYTALRDAFRSLVTFPSAPGISL----FDTCYDLSS 277
Query: 267 EEMLKVPDMRLIFSKNQSF-VVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGH 325
+ +P + L F S + + I ++EG +CL + + IIG
Sbjct: 278 MKTATLPAVVLDFDGGASMPLPADGILVNVDDEG--TYCLAFAPEEEAFSIIGNVQQQTF 335
Query: 326 RIVFDRENLKLAWSHSKC 343
RI D + ++ + +C
Sbjct: 336 RISIDNQKEQMGIAPDQC 353
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 77/376 (20%), Positives = 146/376 (38%), Gaps = 79/376 (21%)
Query: 23 TTLLW--CLLVFGASIVQDRNLSEYDPSS--------SSSSKNVSCSHPLCK----SRSS 68
++L+W C + Q+ S DP+ SS+ +++ C P C S +
Sbjct: 95 SSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSSTVQSLPCRSPKCNWVFGSDLN 154
Query: 69 CKSLKDPCPYIA-DYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS 127
C + K CPY +Y S++G LV D+L L+ ++ + GC S
Sbjct: 155 CSTTKR-CPYYGLEYGLG--STTGQLVSDVLGLSKLNRIP-------DFLFGC------S 198
Query: 128 YLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC---FDENDSGSVFFGDQG----P 180
+ P+G+ G G G S+P A+ GL + S+ + FD+ +G
Sbjct: 199 LVSNRQPEGIAGFGRGLASIP---AQLGLTKFSYCLVSHRFDDTPQSGDLVLHRGRRHAD 255
Query: 181 ATQQSTSFLPIGEK------YDAYFVGVESYCIGNSCL----------TQSGFQALVDSG 224
A ++ P + + Y++ + +G + + +VDSG
Sbjct: 256 AAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSKEGDGGMIVDSG 315
Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQ---GNSWKYCYNASSEEMLKVPDMRLIFSK 281
++FTF+ I+ V + +K ++ + + + + CYN + + + VP + F
Sbjct: 316 STFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLGPCYNITGQSEVDVPKLTFSFKG 375
Query: 282 NQSFVVRNHIFSFPENEGFT-----VFCLTVM-------STDGDYGIIGQNFMMGHRIVF 329
+ P + F+ V C+TV+ ST G I+G I +
Sbjct: 376 GAN-------MDLPLTDYFSLVTDGVVCMTVLTDPDEPGSTTGPAIILGNYQQQNFYIEY 428
Query: 330 DRENLKLAWSHSKCEE 345
D + + + +C+
Sbjct: 429 DLKKQRFGFKPQQCDR 444
>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
Length = 334
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 72/336 (21%), Positives = 127/336 (37%), Gaps = 65/336 (19%)
Query: 25 LLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST 84
L+W + S + +N +DPS S+S K VSC C+
Sbjct: 47 LMWTQCLPCLSCYKQKN-PMFDPSKSTSFKEVSCESQQCR-------------------L 86
Query: 85 EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 144
DT +S IL+ ++ GCG +G++ + G+ G G
Sbjct: 87 LDTPTS------ILN----------------IVFGCGHNNSGTFNENEM--GLFGTGGRP 122
Query: 145 VSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-- 197
+S+ S + FS C D + + + FG + + P+ K D
Sbjct: 123 LSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLVTKDDPTY 182
Query: 198 YFVGVESYCIGN--------SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 249
YFV ++ +G+ S + G +D+G T LP + Y +V + + +
Sbjct: 183 YFVTLDGISVGDKLFPFSSSSPMATKG-NVFIDAGTPPTLLPRDFYNRLVQGVKEAIPME 241
Query: 250 RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS 309
+ + CY +++ ++ P + F + + F P+ EG V+C +
Sbjct: 242 PVQDPDLQPQLCYRSAT--LIDGPILTAHFDGADVQLKPLNTFISPK-EG--VYCFAMQP 296
Query: 310 TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
DGD GI G M I FD + K+++ C +
Sbjct: 297 IDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCTK 332
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 73/324 (22%), Positives = 143/324 (44%), Gaps = 40/324 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTS-SSGYLVDDILH 99
+DPS SSS +++ C C S +C + C Y YS D S ++G L +
Sbjct: 136 FDPSRSSSYRHMLCGSRFCNALDVSEQACTMDTNICEY--HYSYGDKSYTNGNLATEKFT 193
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQ 158
+ S S S ++ GCG G++ + G+ SL+++ + +I+
Sbjct: 194 IGSTSSRPVH---LSPIVFGCGTGNGGTF-----DELGSGIVGLGGGALSLVSQLSSIIK 245
Query: 159 NSFSICF-----DENDSGSVFFG-DQGPATQQSTSFLPIGEKYDAYF-VGVESYCIGNSC 211
FS C N + + FG D + Q S + ++ D Y+ V +E+ +GN
Sbjct: 246 GKFSYCLVPLSEQSNVTSKIKFGTDSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGNKR 305
Query: 212 L----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYC 261
L + G ++DSG + TFL +E + E+ ++ V ++R+S + C
Sbjct: 306 LPYTNGLLNGNVEKG-NVIIDSGTTLTFLDSEFFTELERVLEETVKAERVSDPRGLFSVC 364
Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNF 321
+ ++ + + +P + + F N + V + +F + + + C T++S++ GI G
Sbjct: 365 FRSAGD--IDLPVIAVHF--NDADVKLQPLNTFVKADE-DLLCFTMISSN-QIGIFGNLA 418
Query: 322 MMGHRIVFDRENLKLAWSHSKCEE 345
M + +D E +++ + C +
Sbjct: 419 QMDFLVGYDLEKRTVSFKPTDCTK 442
>gi|302783208|ref|XP_002973377.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
gi|300159130|gb|EFJ25751.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
Length = 472
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 65/257 (25%), Positives = 110/257 (42%), Gaps = 33/257 (12%)
Query: 39 DRNLSEYDP----SSSSSSKNVSCSHPLCK-----SRSSCKSL---KDPCPYIADYSTED 86
D N+S DP +SS+S + C+ P C S ++C S C Y YST D
Sbjct: 121 DCNVSTNDPLFSSASSTSYTRIPCTSPFCSTSPGFSTNACGSSAVGSTTCLYNFSYST-D 179
Query: 87 TSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 146
SS+G + D++ + + K S++ S +GCGR+ T + L G++G D S
Sbjct: 180 YSSAGEMASDVVAMKTPRKTRGNKSLRMS--LGCGREST-TLLGILNTSGLVGFAKTDKS 236
Query: 147 VPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA-YFVGVESY 205
LA+ + SG + G+ ++ S S+ P+ A Y++G+ S
Sbjct: 237 FIGQLAEMDYTSKFIYCVPSDTFSGKIVLGNYKISSHSSLSYTPMIVNSTALYYIGLRSI 296
Query: 206 CIGNS-------CLTQSGFQALVDSGASFTFLPTEIYAEVV-------VKFDKLVSSKRI 251
I ++ L ++DS +F++ + Y +V K+ S++
Sbjct: 297 SITDTLTFPVQGILADGTGGTIIDSTFAFSYFTPDSYTPLVQAIQNLNSNLTKVSSNETA 356
Query: 252 SLQGNSWKYCYNASSEE 268
+L GN CYN S +
Sbjct: 357 ALLGN--DICYNVSVND 371
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 77/326 (23%), Positives = 130/326 (39%), Gaps = 46/326 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDP--CPYIADYSTEDTSSSGYLVD 95
+ PS+SSS ++VSC+ C+S +C S +P C Y+ +Y + + ++G L
Sbjct: 105 FKPSTSSSYQSVSCNSSTCQSLQFATGNTGACGS-SNPSTCNYVVNYG-DGSYTNGELGV 162
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG 155
+ L S S + GCGR G + G+MGLG +S+ S
Sbjct: 163 EALSFGGVSV--------SDFVFGCGRNNKGLF---GGVSGLMGLGRSYLSLVS--QTNA 209
Query: 156 LIQNSFSICF---DENDSGSVFFGDQGPATQQS-----TSFLPIGEKYDAYFVGVESYCI 207
FS C + SGS+ G++ + + T L + + Y + + +
Sbjct: 210 TFGGVFSYCLPTTEAGSSGSLVMGNESSVFKNANPITYTRMLSNPQLSNFYILNLTGIDV 269
Query: 208 GNSCLTQ----SGFQALVDSGASFTFLPTEIY----AEVVVKFDKLVSSKRISLQGNSWK 259
G L L+DSG T LP+ +Y AE + KF S+ S+
Sbjct: 270 GGVALKAPLSFGNGGILIDSGTVITRLPSSVYKALKAEFLKKFTGFPSAPGFSI----LD 325
Query: 260 YCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPENEGFTVFCLTVMSTDGDYGII 317
C+N + + + +P + L F N V + E+ L +S D II
Sbjct: 326 TCFNLTGYDEVSIPTISLRFEGNAQLNVDATGTFYVVKEDASQVCLALASLSDAYDTAII 385
Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKC 343
G R+++D + K+ ++ C
Sbjct: 386 GNYQQRNQRVIYDTKQSKVGFAEEPC 411
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 80/342 (23%), Positives = 141/342 (41%), Gaps = 63/342 (18%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
+DPS SSS + C+HPLCK R +SC L C Y Y+ + T + G LV +
Sbjct: 119 FDPSLSSSFSVLPCNHPLCKPRIPDFTLPTSC-DLNRLCHYSYFYA-DGTLAEGNLVREK 176
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
+ ++ P +I+GC D + G++G+ LG +S S +A +
Sbjct: 177 ITFSTSQSTPP-------LILGCAE-------DASDDKGILGMNLGRLSFAS---QAKIT 219
Query: 158 QNSFSICFDEND-------SGSVFFGDQ-GPATQQSTSFLPIGEKYD-------AYFVGV 202
+ FS C +GS + G+ A Q S L + A+ V +
Sbjct: 220 K--FSYCVPTRQVRPGFTPTGSFYLGENPNSAGFQYISLLTFSQSQRMPNLDPLAHTVAL 277
Query: 203 ESYCIGNSCLT--QSGF--------QALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KR 250
+ IGN L S F Q+++DSG+ FT+L Y +V + +L K+
Sbjct: 278 QGIRIGNKKLNIPVSAFRADPSGAGQSMIDSGSEFTYLVDVAYNKVREEVVRLAGPRLKK 337
Query: 251 ISLQGNSWKYCYNASSEEMLK-VPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVM 308
+ C++ ++ E+ + + +M F K V+ + + + + G V C+ +
Sbjct: 338 GYVYSGVSDMCFDGNAMEIGRLIGNMVFEFDKGVEIVIEKGRVLA---DVGGGVHCVGIG 394
Query: 309 STD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVI 347
++ IIG + FD N ++ + + C +
Sbjct: 395 RSEMLGAASNIIGNFHQQNLWVEFDIANRRVGFGKADCSRSV 436
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 67/268 (25%), Positives = 105/268 (39%), Gaps = 51/268 (19%)
Query: 114 SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDE-----N 168
S++ +GC GA+ G++G+ +S PS L+ FS CF + N
Sbjct: 253 SNITLGCADIDREGLPTGAS--GLLGMDRRPISFPSQLSSR--YARKFSHCFPDKIAHLN 308
Query: 169 DSGSVFFGD--------------QGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
SG VFFG+ Q PA ++ D Y+VG+ + S L
Sbjct: 309 SSGLVFFGESDIISPYLRYTPLVQNPAVPSAS--------LDYYYVGLVGISVDESRLPL 360
Query: 213 ----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
T SG ++DSG +FT+L + + +F S + + CY
Sbjct: 361 SHKNFDIDKVTGSG-GTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCY 419
Query: 263 NASSE----EMLKVPDMRLIFSKNQSFVVRNHIFSFP--ENEGFTVFCLTV-MSTDGDYG 315
N +S E +P + L F V+ + P +E T CL MS D +
Sbjct: 420 NITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQMSGDIPFN 479
Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
IIG + +D E L+L + ++C
Sbjct: 480 IIGNYQQQNLWVEYDLEKLRLGIAPAQC 507
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 80/328 (24%), Positives = 135/328 (41%), Gaps = 39/328 (11%)
Query: 45 YDPSSSSSSKNVSCS--------HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
+DPS S+S K + C+ H C+ SS S K C Y Y + + +SG L +
Sbjct: 129 FDPSQSTSFKIIPCNAAACDLVVHDECRDNSSKTSPKT-CKYFYWYG-DSSRTSGDLALE 186
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
L + S S H P S ++IGCG G + ++GLG G +S PS L ++
Sbjct: 187 SLSV-SLSDH-PSSLEIRDMVIGCGHSNKGLFQGAGG---LLGLGQGALSFPSQL-RSSP 240
Query: 157 IQNSFSICFDEND-----SGSVFFGDQGPATQQ--STSFLPIGEKYDA----YFVGVESY 205
I SFS C + S ++ FG ++ F P ++ Y++G++
Sbjct: 241 IGQSFSYCLVDRTNNLSVSSAISFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGI 300
Query: 206 CIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 255
I L T ++DSG + T+L + Y V F +S R
Sbjct: 301 KIDQELLPIPAERFAIATNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARISYPRAD-PF 359
Query: 256 NSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG 315
+ CYNA+ + P + ++F + + + CL ++ TDG
Sbjct: 360 DILGICYNATGRAAVPFPALSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDG-MS 418
Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
IIG ++D ++ +L ++++ C
Sbjct: 419 IIGNFQQQNIHFLYDVQHARLGFANTDC 446
>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
Length = 488
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 75/336 (22%), Positives = 135/336 (40%), Gaps = 50/336 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP------CPYIADYSTEDTSSSGYLVDDIL 98
+DPS S + + +SC P+C+ C ++ D C + Y + + SG LV D+
Sbjct: 169 HDPSKSRTFRRLSCFDPMCEL---CTAVVDGGGGSAGCLFRRRYG-DGGAVSGELVSDVF 224
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
H + + ++ V GC + + G + G++ LG+G PS + + G+
Sbjct: 225 HFGA-AGDGGGYQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGK---PSFVTQLGV-- 277
Query: 159 NSFSICF-------------DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESY 205
+ FS C +E + + FG T + F G Y V Y
Sbjct: 278 DRFSYCIPASEITDDDDDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSV-VY 336
Query: 206 CIGNSCLTQ-------SGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVS-SKRI 251
G Q +G +A LVDSG + +LP ++ + + ++ +S ++R
Sbjct: 337 QHGGRLNQQQPVPVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRY 396
Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF-VVRNHIFSFPENEGFTVFCLTVMST 310
L S YCY + ++ V + L F + +F EN CL V +
Sbjct: 397 DLTHPSL-YCYLGNMTDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAA- 453
Query: 311 DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
G+ I+G + +D +++A+ +C+ V
Sbjct: 454 -GNRAILGVYPQRNINVGYDLSTMEIAFDRDQCDRV 488
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 77/342 (22%), Positives = 142/342 (41%), Gaps = 48/342 (14%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCKSRSS------CKSLKDPCPYIADYSTEDTSSSGYL 93
+N + YDP +S+S KN++C+ C SS CKS CPY Y ++ +
Sbjct: 207 QNGAFYDPKASASYKNITCNDQRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFA 266
Query: 94 VDDI-LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLA 152
V+ ++L + + +V+ +++ GCG G + A ++GLG G +S S L
Sbjct: 267 VETFTVNLTTNGGSSELYNVE-NMMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL- 321
Query: 153 KAGLIQNSFSICF-----DENDSGSVFFGDQGPATQQS----TSFLPIGEKY--DAYFVG 201
L +SFS C D N S + FG+ TSF+ E Y+V
Sbjct: 322 -QSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQ 380
Query: 202 VESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKR- 250
++S + L + ++DSG + ++ Y + K + K
Sbjct: 381 IKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYP 440
Query: 251 ISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFT-----VFCL 305
+ C+N S +++P++ + F+ +++FP F + CL
Sbjct: 441 VYRDFPILDPCFNVSGIHNVQLPELGIAFADGA-------VWNFPTENSFIWLNEDLVCL 493
Query: 306 TVMST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
++ T + IIG I++D + +L ++ +KC ++
Sbjct: 494 AMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKCADI 535
>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
Length = 467
Score = 52.8 bits (125), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 75/336 (22%), Positives = 135/336 (40%), Gaps = 50/336 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDP------CPYIADYSTEDTSSSGYLVDDIL 98
+DPS S + + +SC P+C+ C ++ D C + Y + + SG LV D+
Sbjct: 148 HDPSKSRTFRRLSCFDPMCEL---CTAVVDGGGGSAGCLFRRRYG-DGGAVSGELVSDVF 203
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
H + + ++ V GC + + G + G++ LG+G PS + + G+
Sbjct: 204 HFGA-AGDGGGYQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGK---PSFVTQLGV-- 256
Query: 159 NSFSICF-------------DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESY 205
+ FS C +E + + FG T + F G Y V Y
Sbjct: 257 DRFSYCIPASEITDDDDDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSV-VY 315
Query: 206 CIGNSCLTQ-------SGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVS-SKRI 251
G Q +G +A LVDSG + +LP ++ + + ++ +S ++R
Sbjct: 316 QHGGRLNQQQPVPVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRY 375
Query: 252 SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF-VVRNHIFSFPENEGFTVFCLTVMST 310
L S YCY + ++ V + L F + +F EN CL V +
Sbjct: 376 DLTHPSL-YCYLGNMTDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAA- 432
Query: 311 DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
G+ I+G + +D +++A+ +C+ V
Sbjct: 433 -GNRAILGVYPQRNINVGYDLSTMEIAFDRDQCDRV 467
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 90/351 (25%), Positives = 141/351 (40%), Gaps = 66/351 (18%)
Query: 28 CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTE 85
C L FG QD + YD ++SSS + CS C S C + C Y
Sbjct: 114 CKLCFG----QDTPI--YDTTTSSSFSPLPCSSATCLPIWSSRCSTPSATCRYR------ 161
Query: 86 DTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDV 145
Y DD ++S SV + GCG G + G +GLG G +
Sbjct: 162 ------YAYDD----GAYSPECAGISV-GGIAFGCGVDNGGLSYNST---GTVGLGRGSL 207
Query: 146 SVPSLLAKAGLIQNSFSIC----FDENDSGSVFFGDQGPATQ----------QSTSFLPI 191
S L+A+ G+ FS C F+ + S VFFG QST +
Sbjct: 208 S---LVAQLGV--GKFSYCLTDFFNTSLSSPVFFGSLAELAASSASADAAVVQSTPLVQS 262
Query: 192 GEKYDAYFVGVESYCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVV 240
Y+V +E +G++ L +VDSG FT L E VVV
Sbjct: 263 PYNPSRYYVSLEGISLGDARLPIPNGTFDLNDDDGSGGMIVDSGTIFTIL-VETGFRVVV 321
Query: 241 KFDKLVSSKRISLQGNSWKYCYNASS---EEMLKVPDMRLIFSKNQSFVV-RNHIFSFPE 296
V + + + + C+ A + +E+ +PDM L F+ + R++ SF E
Sbjct: 322 DHVAGVLGQPVVNASSLDRPCFPAPAAGVQELPDMPDMVLHFAGGADMRLHRDNYMSFNE 381
Query: 297 NEGFTVFCLTVMSTDGDYGIIGQNFMMGH-RIVFDRENLKLAWSHSKCEEV 346
E + FCL ++ T+ G + NF + +++FD +L++ + C ++
Sbjct: 382 EE--SSFCLNIVGTESASGSVLGNFQQQNIQMLFDITVGQLSFMPTDCSKL 430
>gi|328865865|gb|EGG14251.1| hypothetical protein DFA_12021 [Dictyostelium fasciculatum]
Length = 698
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 68/281 (24%), Positives = 117/281 (41%), Gaps = 36/281 (12%)
Query: 132 AAP---DGVMGLGL-------GDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPA 181
AAP DG+MGL GD + SLL K I NSFS+C + + G + G P
Sbjct: 246 AAPRKRDGIMGLSYQSLDPNNGD-DIFSLLVKTHEIHNSFSMCLSD-EGGMLVLGGVDPK 303
Query: 182 TQQS-TSFLPI-GEKYDAYFVGVESYCIGNSCLTQSGFQ--ALVDSGASFTFLPTEIYAE 237
+ + PI E+Y Y V I + L FQ ++VDSG + FL +I+ +
Sbjct: 304 MNSTLMKYTPITNERY--YSVNCTGLRIDGNNLNSKSFQSISIVDSGTTIMFLKLDIFND 361
Query: 238 VVVKFDKLVSS-KRISLQGNS-WKY-CYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF 294
++ + S I+ Q S W + C+ S ++ K P + ++F + + +
Sbjct: 362 LIYYLVQHYSHLPGITTQSESLWNHQCFTLSDRQLEKYPTISMVFPNTEGGLFE---VAI 418
Query: 295 PEN------EGFTVFCLTVMSTDGDYGI-IGQNFMMGHRIVFDRENLKLAWSH--SKCEE 345
P N + F + Y + IG + G+ + ++RE+ + ++ C
Sbjct: 419 PPNLYMIKIDDMYCFGFEKLPIKSPYSVLIGDVALQGYNVHYNREDGSIGFAKVTDNCGM 478
Query: 346 VIDKS--HVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPS 384
D + HV ++ Q + L +NG+ P+
Sbjct: 479 GQDNNQYHVEMISEEV-QENDSLVVKIHAIDANGRDGGAPN 518
>gi|66817422|ref|XP_642564.1| hypothetical protein DDB_G0277581 [Dictyostelium discoideum AX4]
gi|60470632|gb|EAL68608.1| hypothetical protein DDB_G0277581 [Dictyostelium discoideum AX4]
Length = 492
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 80/328 (24%), Positives = 129/328 (39%), Gaps = 43/328 (13%)
Query: 41 NLSEYDPSSSSSSKNVSCSHPLC----KSRSSCK---SLKDPCPYIADYSTEDTSSSGYL 93
N YDP+ SSSS+ + CS C + SCK + K C +I Y D S
Sbjct: 133 NRPVYDPALSSSSQLIPCSSDKCLGSGSASPSCKLHQNAKSTCDFIILYG--DGSK---- 186
Query: 94 VDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLG-------LGDVS 146
+ FS S V S++ G ++ G++ + DG+MGLG L
Sbjct: 187 ----IKGKVFSDEITVSGVSSTIYFGANVEEVGAF-EYPRADGIMGLGRTSNNKNLVPTI 241
Query: 147 VPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQ-QSTSFLPIGEKYDAYFVGVESY 205
S++ I+N F I D + G + G S + PI Y + S+
Sbjct: 242 FDSMVRSNSSIKNIFGIYLDYHGQGYLSLGKINHHYYIGSIQYTPIQPAGPFYAIKPTSF 301
Query: 206 CIGNSCL-TQSGFQALVDSGASFTFLPTEIYAEVVVKF-------DKLVSSKRISLQGNS 257
+ N+ S Q +VDSG S L + +Y ++ F D + S I S
Sbjct: 302 RVDNTSFPANSMGQVIVDSGTSDLILTSRVYDHLIQYFRKHYCHIDMVCSYPSIF----S 357
Query: 258 WKYCYNASSEEMLKVPDMRLIFSKNQSFVV--RNHIFSFPEN-EGFTVFCLTVMSTDGDY 314
+ C+ E+ P + F + +N++ N +G +C + D D
Sbjct: 358 SRVCF-EKEEDFATFPWLHFGFEGGVRIAIPPKNYMIKTESNQQGVYGYCWGIDRGD-DM 415
Query: 315 GIIGQNFMMGHRIVFDRENLKLAWSHSK 342
I+G FM G+ +FD ++ ++ K
Sbjct: 416 TILGDVFMRGYYTIFDNIENRVGFAIGK 443
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 78/326 (23%), Positives = 144/326 (44%), Gaps = 56/326 (17%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS-RSSCKSLKDP--CPYIADYSTEDTSSSGYLVDDILHLA 101
+DPS+S++ + C+ C + S +S DP C Y Y + + ++GYL D + +
Sbjct: 122 FDPSNSTTFHKLPCTTAPCNALDESARSCTDPTTCGYTYSYG-DHSYTTGYLASDTVTVG 180
Query: 102 SFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+ +SVQ +V GCG + G++ + + G++GLG G++S S L I
Sbjct: 181 N-------ASVQIRNVAFGCGTRNGGNFDEQGS--GIVGLGGGNLSFVSQLGDT--IGKK 229
Query: 161 FSICF------------DENDSGSVFFGDQGPATQQSTSFL-----PIGEKYDA--YFVG 201
FS C D + + FGD + ST+ + P+ K + Y++
Sbjct: 230 FSYCLLPLENEISSQPSDSPATSRIVFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLT 289
Query: 202 VESYCIGNSCLT-----------QSGFQA-------LVDSGASFTFLPTEIYAEVVVKFD 243
+E+ +G L SG ++ ++DSG + TFL E Y +
Sbjct: 290 IEAITVGRKKLLYSSSSSKTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALV 349
Query: 244 KLVSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTV 302
+ + +R++ NS + C+ + EE +++P M++ F ++ EG
Sbjct: 350 EEIKMERVNDVKNSMFSLCFKSGKEE-VELPLMKVHFRGGADVELKPVNTFVRAEEGLVC 408
Query: 303 FCLTVMSTDGDYGIIGQ-NFMMGHRI 327
F + + G YG + Q NF++G+ +
Sbjct: 409 FTMLPTNDVGIYGNLAQMNFVVGYDL 434
>gi|407926291|gb|EKG19258.1| Peptidase A1 [Macrophomina phaseolina MS6]
Length = 477
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 53/225 (23%), Positives = 99/225 (44%), Gaps = 49/225 (21%)
Query: 146 SVPSLLAKAGLIQ-NSFSICFDENDS--GSVFFGDQGPATQQST-SFLPIGEKYDAYFVG 201
++P L+ G+IQ N++S+ ++ D+ GS+ FG T + LPI ++Y +Y
Sbjct: 200 NLPQLMVDKGIIQSNAYSLWLNDLDASRGSILFGGVDTEKYHGTLATLPIIQEYGSY--- 256
Query: 202 VESYCIGNSCLTQSGFQA---------------LVDSGASFTFLPTEIYAEVVVKFDKLV 246
+ I + L +G L+DSG+S T+LP + A + FD
Sbjct: 257 -REFIIALTGLGANGNNGSYFSSNDSSSNVVPVLLDSGSSLTYLPDSVVANIYSDFDATY 315
Query: 247 SSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE-----GFT 301
S+ QG ++ C A+S++ L+ F + S P NE G++
Sbjct: 316 DSE----QGAAFVDCDKANSDDTLE-------------FTFSSPTISVPMNELVLLAGYS 358
Query: 302 ---VFCLTVMSTDGD-YGIIGQNFMMGHRIVFDRENLKLAWSHSK 342
C+ ++ GD ++G F+ +V+D N +++ + +
Sbjct: 359 RGQAICILGIAPAGDSTSVLGDTFLRSAYVVYDLANNEISLAQTN 403
>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 78/322 (24%), Positives = 133/322 (41%), Gaps = 43/322 (13%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRS---SCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
+ ++P++S S + V C P C SR+ SC C + Y+ D+S L D L
Sbjct: 146 TPFNPAASKSYRAVPCGSPAC-SRAPNPSCSLNTKSCGFSLTYA--DSSLEAALSQDSLA 202
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
+A + V S GC +K TG+ P G++GLG G +S L + +
Sbjct: 203 VA--------NDVVKSYTFGCLQKATGT---ATPPQGLLGLGRGPLSF--LSQTKDMYEG 249
Query: 160 SFSICFDE----NDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
+FS C N SG++ G +G P ++T L + Y+V + +G +
Sbjct: 250 TFSYCLPSFKSLNFSGTLRLGRKGQPLRIKTTPLLVNPHRSSLYYVSMTGIRVGKKVVPI 309
Query: 213 --------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
+G ++DSG FT L Y V + + + +S G + CYN
Sbjct: 310 PPAALAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRIRGAPLSSLGG-FDTCYNT 368
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG---DYGIIGQNF 321
+ +K P + +F+ Q + +++ G T + DG +I
Sbjct: 369 T----VKWPPVTFMFTGMQVTLPADNLV-IHSTYGTTSCLAMAAAPDGVNTVLNVIASMQ 423
Query: 322 MMGHRIVFDRENLKLAWSHSKC 343
HRI+FD N ++ ++ +C
Sbjct: 424 QQNHRILFDVPNGRVGFAREQC 445
>gi|403343737|gb|EJY71200.1| Aspartic protease PM5 [Oxytricha trifallax]
Length = 518
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 70/298 (23%), Positives = 123/298 (41%), Gaps = 33/298 (11%)
Query: 73 KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGA 132
+D C + Y E +S SG+LV D ++ H + + GC ++T +
Sbjct: 64 QDKCMFNQRYG-EGSSYSGFLVKDQVYFGD-KYHDKDDAF--NFTFGCVAEETHLFYSQE 119
Query: 133 APDGVMGLGLGDVSVPSL------LAKAGLI-QNSFSICFDENDSGSVFFGDQGPATQQS 185
A DG++G+ S PS+ + + LI + FS+C +N G G +
Sbjct: 120 A-DGILGM-TRRTSNPSMKPIYESMYENNLIDKKMFSLCLGKNGGYFQLGGFDGQSHLDD 177
Query: 186 TSFLPIGEKYDAYFVGVESYCIGNSCLT--QSGFQALVDSGASFTFLPTEIYAEVVVKFD 243
+LP+ +K Y + ++ + N ++ +S Q +DSG +FT++P ++ + FD
Sbjct: 178 VLWLPLIDK-STYIIKLQGISMNNHMMSGIESITQGFIDSGTTFTYIPQKLIDTLKQHFD 236
Query: 244 KL--------VSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR---NHIF 292
KRI Q + C+ + E+ P +F V N +
Sbjct: 237 WFCKVDPENNCKGKRIDPQ-QEQQICFEYNEEQNPDGPKKFFQSYPLLTFKVDDNGNTLD 295
Query: 293 SFPENEGFT----VFCLTVMSTDG-DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
+P + +CL + T D I+G FM +FD EN K+ + + C E
Sbjct: 296 WYPSEYLYRDQKHKYCLAIEVTQRPDQIILGGTFMRQKNFIFDVENNKVGIARASCNE 353
>gi|24647679|ref|NP_650621.1| CG17283 [Drosophila melanogaster]
gi|7300253|gb|AAF55416.1| CG17283 [Drosophila melanogaster]
Length = 465
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 74/318 (23%), Positives = 127/318 (39%), Gaps = 56/318 (17%)
Query: 51 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSS----------SGYLVDDILHL 100
+ S N+ P CKS++ CK K P + ++ S +G L D + +
Sbjct: 169 TGSSNIWVPGPHCKSKA-CKKHKQYHPAKSSTYVKNGKSFAITYGSGSVAGVLAKDTVRI 227
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN- 159
A V ++ K+ G+ + DG++GLG ++V ++ L+QN
Sbjct: 228 AGL--------VVTNQTFAMTTKEPGTTFVTSNFDGILGLGYRSIAVDNVKT---LVQNM 276
Query: 160 ---------SFSICFDENDSG----SVFFGDQGPAT---QQSTSFLPIGEKYDAYFVGVE 203
F+IC S ++ FG + S ++ P+ +K F +
Sbjct: 277 CSEDVITSCKFAICMKGGGSSSRGGAIIFGSSNTSAYSGSNSYTYTPVTKKGYWQFTLQD 336
Query: 204 SYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
Y +G + ++ S QA+VDSG S PT IY K +K++ R + G W C
Sbjct: 337 IY-VGGTKVSGS-VQAIVDSGTSLITAPTAIYN----KINKVIGC-RATSSGECWMKCAK 389
Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFP--ENEGFTVFCLTVMSTDGDYGIIGQNF 321
K+PD + + + FVV+ + N G TV V + I+G F
Sbjct: 390 -------KIPDFTFVIA-GKKFVVKGNKMKLKVRTNRGRTVCISAVTEVPDEPVILGDAF 441
Query: 322 MMGHRIVFDRENLKLAWS 339
+ FD N ++ ++
Sbjct: 442 IRHFCTEFDLANNRIGFA 459
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 69/318 (21%), Positives = 129/318 (40%), Gaps = 31/318 (9%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRS--SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
++P SS+ N+SC C S + C + + C Y Y + +S+ G L + +H S
Sbjct: 132 FEPHKSSTFANLSCDSQPCTSSNIYYCPLVGNLCLYTNTYG-DGSSTKGVLCTESIHFGS 190
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
+ P++ I GCG + G++GLG G +S+ S L I + FS
Sbjct: 191 QTVTFPKT------IFGCGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQ--IGHKFS 242
Query: 163 IC---FDENDSGSVFFGDQGPATQQSTSFLP--IGEKYDA-YFVGVESYCIGNSCLT--- 213
C F + + FG+ T P I Y + YF+ + IG L
Sbjct: 243 YCLLPFTSTSTIKLKFGNDTTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQVRT 302
Query: 214 --QSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCYNASSEEML 270
+ ++D G T+L Y V + L S+ + +C+ ++ +
Sbjct: 303 TDHTNGNIIIDLGTVLTYLEVNFYHNFVTLLREALGISETKDDIPYPFDFCF--PNQANI 360
Query: 271 KVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRI 327
P + F+ + F+ +N F F + + CL V+ + + G + ++
Sbjct: 361 TFPKIVFQFTGAKVFLSPKNLFFRF---DDLNMICLAVLPDFYAKGFSVFGNLAQVDFQV 417
Query: 328 VFDRENLKLAWSHSKCEE 345
+DR+ K++++ + C +
Sbjct: 418 EYDRKGKKVSFAPADCSK 435
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 84/338 (24%), Positives = 141/338 (41%), Gaps = 57/338 (16%)
Query: 45 YDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
Y+P SSS + CS LC+ S +C + + C Y Y + + + G L +
Sbjct: 133 YEPRRSSSFAYLPCSDRLCQEGQFSYKNC-ARNNRCMYDELYGSAE--AGGVLASETFTF 189
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+K V + GCG G L GA+ G+MGL G +S+ S L+
Sbjct: 190 GVNAK------VSLPLGFGCGALSAGD-LVGAS--GLMGLSPGIMSLVSQLSVP-----R 235
Query: 161 FSIC---FDENDSGSVFFGD-------QGPATQQSTSFL--PIGEKYDAYFVGVESYCIG 208
FS C F E + + FG + T Q+TS L P E Y+V + +G
Sbjct: 236 FSYCLTPFAERKTSPLLFGAMADLRRYRTTGTVQTTSILRNPAMETA-YYYVPLVGLSLG 294
Query: 209 NSCL----TQSGF-------QALVDSGASFTFLPTEIYAEV---VVKFDKLVSSKRISLQ 254
L T G +VDSG++ ++L + V VV+ +L +
Sbjct: 295 TKRLDVPATSLGMIKPDGSGGTIVDSGSTMSYLEETAFRAVKKAVVEAVRLPVANGTDED 354
Query: 255 GNSWKYCY---NASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMST 310
+ ++ C+ + E +K P + L F + + R++ F P + CL V ++
Sbjct: 355 YDDYELCFALPTGVAMEAVKTPPLVLHFDGGAAMTLPRDNYFQEPRA---GLMCLAVGTS 411
Query: 311 DGDYG--IIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+G IIG ++FD N K +++ +KC+++
Sbjct: 412 PDGFGVSIIGNVQQQNMHVLFDVRNQKFSFAPTKCDDI 449
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 75/311 (24%), Positives = 121/311 (38%), Gaps = 41/311 (13%)
Query: 49 SSSSSKNVSCSHPLCKSRSSCK---SLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 105
+SSS K + C+ C SS ++ C Y +Y + + +SG + D + S
Sbjct: 53 ASSSYKKLPCNSTHCSGMSSAGIGPRCEETCKYKYEYG-DGSRTSGDVGSDRISFRSHGA 111
Query: 106 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF 165
S + GC RK G D G++GLG S+ L + FS C
Sbjct: 112 GEDHRSFFDGFLFGCARKLKG---DWNFTQGLIGLGQKSHSLIQQLGDK--LGYKFSYCL 166
Query: 166 DENDS-----GSVFFGDQGPATQQSTSFLPI--GEKYDA--YFVGVESYCIGNSCLT--- 213
DS +F G PI G+ D Y+V ++S IG +
Sbjct: 167 VSYDSPPSAKSFLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYD 226
Query: 214 -QSGF----------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS--WKY 260
+SG + ++DSG ++T L +Y + ++ V + GNS
Sbjct: 227 KESGHNTSVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTL---GNSAGLDL 283
Query: 261 CYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ 319
C+N+S + P + F+ V+ +IF + V CL++ S+ GD IIG
Sbjct: 284 CFNSSGDTSYGFPSVTFYFANQVQLVLPFENIFQVTSRD---VVCLSMDSSGGDLSIIGN 340
Query: 320 NFMMGHRIVFD 330
I++D
Sbjct: 341 MQQQNFHILYD 351
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 77/313 (24%), Positives = 126/313 (40%), Gaps = 34/313 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+DP+ SS+ + VSC+ C + + C + C Y Y + ++++G D L L
Sbjct: 171 FDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYG-DGSTTNGTYSRDTLTL 229
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+ S GC ++G + D DG+MGLG G S+ S A A NS
Sbjct: 230 SGASDAV------KGFQFGCSHLESG-FSD--QTDGLMGLGGGAQSLVSQTAAA--YGNS 278
Query: 161 FSICFDENDSGSVFFGDQGPATQQ----STSFLPIGEKYDAYFVGVESYCIGNS--CLTQ 214
FS C SGS F G +T L + Y ++ +G L+
Sbjct: 279 FSYCLPPT-SGSSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLSP 337
Query: 215 SGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
S F A +VDSG T LP Y+ + F + R + + C++ + + + +
Sbjct: 338 SVFAAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISI 397
Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRIVFD 330
P + L+FS + + + + CL +T DG GIIG +++D
Sbjct: 398 PTVALVFSGGAAIDLDPNGIMYGN-------CLAFAATGDDGTTGIIGNVQQRTFEVLYD 450
Query: 331 RENLKLAWSHSKC 343
+ L + C
Sbjct: 451 VGSSTLGFRSGAC 463
>gi|213998810|gb|ACJ60772.1| nucellin [Hordeum comosum]
Length = 154
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 64/135 (47%), Gaps = 10/135 (7%)
Query: 113 QSSVIIGCGRKQTGSYLDGAAP----DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDE 167
+ + GCG KQ +P DG++GLG+G + L +I N C
Sbjct: 6 KKKIAFGCGYKQEEP---ADSPPSLVDGILGLGMGKAGFAAQLKGQKMITGNVIGHCLSS 62
Query: 168 NDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGAS 226
G ++ GD P ++ T ++P+ E Y G+ I N + + F+A+ DS ++
Sbjct: 63 KGKGVLYVGDFNPPSRGVT-WVPMKESLFYYSPGLAELLIDNQPIRGNPTFEAVFDSDST 121
Query: 227 FTFLPTEIYAEVVVK 241
+T +P +IY E+V K
Sbjct: 122 YTHVPAQIYNEIVSK 136
>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
Length = 450
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 79/356 (22%), Positives = 137/356 (38%), Gaps = 70/356 (19%)
Query: 35 SIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSR---------SSCKSLKDPCPYIADYSTE 85
S + + +DP SSSSK + C +P C S C C Y YST+
Sbjct: 118 SAADPKKVPIFDPKLSSSSKILDCRNPKCVSTYFPYVHLGCPRCNGNSKHCSYACPYSTQ 177
Query: 86 --DTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLG 143
+SSGY + + L P+ +++ + ++GC T S + D + G G
Sbjct: 178 YGTGASSGYFLLENLKF-------PRKTIR-NFLLGC----TTSAARELSSDALAGFGRS 225
Query: 144 DVSVPSLLAKAGLIQNSFSICFDEND------SGSVFFGDQGPATQQSTSFLPIGEKYDA 197
S+P + F+ C + +D SG + D + S+ P + A
Sbjct: 226 MFSLPIQMG-----VKKFAYCLNSHDYDDTRNSGKLIL-DYRDGKTKGLSYTPFLKSPPA 279
Query: 198 ----YFVGVESYCIGNSCLT------------QSGFQALVDSG-ASFTFLPTEIYAEVVV 240
Y +GV+ IGN L +SG ++DSG ++ ++ V
Sbjct: 280 SAFYYHLGVKDIKIGNKLLRIPSKYLAPGSDGRSG--VIIDSGYGGAGYMTGPVFKIVTN 337
Query: 241 KFDKLVSSKRISLQGNS---WKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPE 296
+ K +S R SL+ + CYN + + +K+P + F + VV + F
Sbjct: 338 ELKKQMSKYRRSLEAETQTGLTPCYNFTGHKSIKIPPLIYQFRGGANMVVPGKNYFGISP 397
Query: 297 NEGFTVFCLTVMSTDGDYG---------IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
E F +M T+G I+G + + + + +D +N + + C
Sbjct: 398 QESLACF---LMDTNGTNALEITPDPSIILGNSQHVDYYVEYDLKNDRFGFRRQTC 450
>gi|357440775|ref|XP_003590665.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
truncatula]
gi|355479713|gb|AES60916.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
truncatula]
Length = 435
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 64/252 (25%), Positives = 104/252 (41%), Gaps = 40/252 (15%)
Query: 73 KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP-QSSVQSSVIIGCGRKQTGSYLDG 131
+ C D S T++SG L +D+L + S + P Q+ V S + C L
Sbjct: 115 NNTCGVTPDNSITHTATSGELAEDVLSIQSSNGFNPGQNVVVSRFLFSCAPTFLLKGLAT 174
Query: 132 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPA--------TQ 183
A G+ GLG +++PS LA A F+IC + G V FGD GP
Sbjct: 175 GA-SGMAGLGRTKIALPSQLASAFSFARKFAICLSSSK-GVVLFGD-GPYGFLPNVVFDS 231
Query: 184 QSTSFLPI-------------GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGA----- 225
S ++ P+ G+ YF+GV++ I ++ + +D+
Sbjct: 232 DSLTYTPLLINPVSTASAFSQGQPSAEYFIGVKTIKIDEKVVSLNTSLLSIDNNGVGGTK 291
Query: 226 -----SFTFLPTEIYAEVVVKFDKLVSSKRISLQGN--SWKYCYNASSEEML--KVPDMR 276
+T L IY V F K +++ I G+ +++CY + L VP +
Sbjct: 292 ISTVDPYTVLEASIYKAVTDAFVKASAARNIKRVGSVAPFEFCYTNLTGTRLGAAVPTIE 351
Query: 277 LIFSKNQSFVVR 288
L F +N++ V R
Sbjct: 352 L-FLQNENVVWR 362
>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 442
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 76/324 (23%), Positives = 134/324 (41%), Gaps = 54/324 (16%)
Query: 56 VSCSHPLCKSRSSCKSLKDPCP-----YIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 110
+ C+HPLCK R SL C + + + + T + G LV + + + P
Sbjct: 138 LPCNHPLCKPRVPDFSLPTDCDANSLCHYSYFYADGTYAEGNLVREKIAFSPSQTTPP-- 195
Query: 111 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND- 169
+I+GC + + G++G+ LG + PS +A + + S+ + +
Sbjct: 196 -----IILGCATQSDDA-------RGILGMNLGRLGFPS---QAKITKFSYCVPTKQAQP 240
Query: 170 -SGSVFFGDQGPATQ--QSTSFLPIGEKYD-------AYFVGVESYCIGNSCLT------ 213
SGS + G+ PA+ + + L G+ AY + ++ IG L
Sbjct: 241 ASGSFYLGNN-PASSSFRYVNLLTFGQSQRMPNLDPLAYTLPLQGISIGGKKLNIPPSVF 299
Query: 214 -----QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRISLQGNSWKYCYNASS 266
SG Q ++DSG+ FT+L E Y + + K V K+ + G C++ +
Sbjct: 300 KPNAGGSG-QTMIDSGSEFTYLVDEAYNVIREELVKKVGPKIKKGYMYGGVADICFDGDA 358
Query: 267 EEMLK-VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD--GDYGIIGQNFMM 323
E+ + V DM F K V+ + G V CL + ++ G G I NF
Sbjct: 359 IEIGRLVGDMVFEFEKGVQIVIPKERVLATVDGG--VHCLGMGRSERLGAGGNIIGNFHQ 416
Query: 324 GHRIV-FDRENLKLAWSHSKCEEV 346
+ V FD N ++ + + C ++
Sbjct: 417 QNLWVEFDLANRRVGFGEADCSKL 440
>gi|304361786|gb|ADM26243.1| MIP25078p [Drosophila melanogaster]
Length = 467
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 74/318 (23%), Positives = 127/318 (39%), Gaps = 56/318 (17%)
Query: 51 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSS----------SGYLVDDILHL 100
+ S N+ P CKS++ CK K P + ++ S +G L D + +
Sbjct: 171 TGSSNIWVPGPHCKSKA-CKKHKQYHPAKSSTYVKNGKSFAITYGSGSVAGVLAKDTVRI 229
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN- 159
A V ++ K+ G+ + DG++GLG ++V ++ L+QN
Sbjct: 230 AGL--------VVTNQTFAMTTKEPGTTFVTSNFDGILGLGYRSIAVDNVKT---LVQNM 278
Query: 160 ---------SFSICFDENDSG----SVFFGDQGPAT---QQSTSFLPIGEKYDAYFVGVE 203
F+IC S ++ FG + S ++ P+ +K F +
Sbjct: 279 CSEDVITSCKFAICMKGGGSSSRGGAIIFGSSNTSAYSGSNSYTYTPVTKKGYWQFTLQD 338
Query: 204 SYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
Y +G + ++ S QA+VDSG S PT IY K +K++ R + G W C
Sbjct: 339 IY-VGGTKVSGS-VQAIVDSGTSLITAPTAIYN----KINKVIGC-RATSSGECWMKCAK 391
Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFP--ENEGFTVFCLTVMSTDGDYGIIGQNF 321
K+PD + + + FVV+ + N G TV V + I+G F
Sbjct: 392 -------KIPDFTFVIA-GKKFVVKGNKMKLKVRTNRGRTVCISAVTEVPDEPVILGDAF 443
Query: 322 MMGHRIVFDRENLKLAWS 339
+ FD N ++ ++
Sbjct: 444 IRHFCTEFDLANNRIGFA 461
>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 82/352 (23%), Positives = 146/352 (41%), Gaps = 67/352 (19%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLV 94
++ +DP+ SSS + CS P C++R+ SC S K C Y+ + +SS G L
Sbjct: 110 VNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIPASCDSDK-LCHATLSYA-DASSSEGNLA 167
Query: 95 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS-YLDGAAPDGVMGLGLGDVSVPSLLAK 153
+I H + S+ S++I GC +GS + G++G+ G + S +++
Sbjct: 168 AEIFHFGN-------STNDSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSL---SFISQ 217
Query: 154 AGLIQNSFSICFDENDSGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVE 203
G + S+ I ++ G + GD P + ST LP ++ AY V +
Sbjct: 218 MGFPKFSYCISGTDDFPGFLLLGDSNFTWLTPLNYTPLIRISTP-LPYFDRV-AYTVQLT 275
Query: 204 SYCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-------L 245
+ L T +G Q +VDSG FTFL +Y + F +
Sbjct: 276 GIKVNGKLLPIPKSVLLPDHTGAG-QTMVDSGTQFTFLLGPVYTALRSDFLNQTNGILTV 334
Query: 246 VSSKRISLQGNSWKYCYNASSEEML-----KVPDMRLIFSKNQSFVV-RNHIFSFPE--- 296
QG + CY S + ++P + L+F + V + ++ P
Sbjct: 335 YEDPEFVFQG-TMDLCYRISPFRIRTGILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTA 393
Query: 297 -NEGFTVFCLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
N+ +V+C T ++D + +IG + I FD + ++ + +C+
Sbjct: 394 GND--SVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEFDLQRSRIGLAPVQCD 443
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 74/317 (23%), Positives = 127/317 (40%), Gaps = 37/317 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRS-SCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASF 103
+DP+ SSS V C C + C C Y +Y + +S++G L + L +S
Sbjct: 155 FDPAKSSSYAVVPCGTTECAAAGGECNGTT--CVYGVEYG-DGSSTTGVLARETLTFSS- 210
Query: 104 SKHAPQSSVQSSVIIGCGRKQTGSY--LDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
SS + I GCG G + +DG G L L + P+ G+ F
Sbjct: 211 ------SSEFTGFIFGCGETNLGDFGEVDGLLGLGRGSLSLSSQAAPAF---GGI----F 257
Query: 162 SICFDENDS--GSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNSCL---- 212
S C ++ G + G Q + + K D YF+ + S IG L
Sbjct: 258 SYCLPSYNTTPGYLSIGATPVTGQIPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPP 317
Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
T++G L+DSG T+LP Y + +F + + + + CY+ + +
Sbjct: 318 SEFTKTG--TLLDSGTILTYLPPPAYTALRDRFKFTMQGSKPAPPYDELDTCYDFTGQSG 375
Query: 270 LKVPDMRLIFSKNQSFVVRNH-IFSFPENEGFTVFCLTVMSTDGD--YGIIGQNFMMGHR 326
+ +P + FS F + I +FP++ V CL +S D + ++G
Sbjct: 376 ILIPGVSFNFSDGAVFNLNFFGIMTFPDDTKPAVGCLAFVSRPADMPFSVVGSTTQRSAE 435
Query: 327 IVFDRENLKLAWSHSKC 343
+++D K+ + + C
Sbjct: 436 VIYDVPAQKIGFIPASC 452
>gi|297800470|ref|XP_002868119.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313955|gb|EFH44378.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 499
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 84/346 (24%), Positives = 127/346 (36%), Gaps = 89/346 (25%)
Query: 69 CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY 128
C + PCP Y+ D S L D L L S S ++ GC
Sbjct: 172 CNTSSYPCPPFY-YAYGDGSLVAKLFSDSLSLPSVS--------VANFTFGCAHTTL--- 219
Query: 129 LDGAAPDGVMGLGLGDVSVPSLLA-KAGLIQNSFSICFDEN--DSGSVFFGDQGPATQQS 185
A P GV G G G +S+P+ L+ + + NSFS C + DS V + P+
Sbjct: 220 ---AEPIGVAGFGRGRLSLPAQLSVHSPHLGNSFSYCLVSHSFDSDRV----RRPSPLIL 272
Query: 186 TSFLPIGEKYDA--------------------------------YFVGVESYCIGNSCLT 213
F+ EK A Y V ++ IG +
Sbjct: 273 GRFVDKKEKRVATTDDDDDGDETKKKKNEFVFTEMLVNPKHPYFYSVSLQGISIGKRNIP 332
Query: 214 QSGFQALVD----------SGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK---- 259
+D SG +FT LP + Y VV +FD V R+ + + +
Sbjct: 333 APAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVG--RVHERADRVEPSSG 390
Query: 260 --YCYNASSEEMLKVPDMRLIFSKNQSFVV---RNHIFSFPE-----NEGFTVFCLTVMS 309
CY + + +KVP + L F+ N S V RN+ + F + E V CL +M+
Sbjct: 391 MSPCYYLN--QTVKVPALVLHFAGNGSTVTLPRRNYFYEFMDGGDGKEEKRKVGCLMLMN 448
Query: 310 -------TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVID 348
G I+G G +V+D N ++ ++ KC + D
Sbjct: 449 GGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCASLWD 494
>gi|388516731|gb|AFK46427.1| unknown [Medicago truncatula]
Length = 435
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 64/252 (25%), Positives = 104/252 (41%), Gaps = 40/252 (15%)
Query: 73 KDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAP-QSSVQSSVIIGCGRKQTGSYLDG 131
+ C D S T++SG L +D+L + S + P Q+ V S + C L
Sbjct: 115 NNTCGVTPDNSITHTATSGELAEDVLSIQSSNGFNPGQNVVVSRFLFSCAPTFLLKGLAT 174
Query: 132 AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPA--------TQ 183
A G+ GLG +++PS LA A F+IC + G V FGD GP
Sbjct: 175 GA-SGMAGLGRTKIALPSQLASAFSFARKFAICLSSSK-GVVLFGD-GPYGFLPNVVFDS 231
Query: 184 QSTSFLPI-------------GEKYDAYFVGVESYCIGNSCLTQSGFQALVDSGA----- 225
S ++ P+ G+ YF+GV++ I ++ + +D+
Sbjct: 232 DSLTYTPLLINPVSTASAFSQGQPSAEYFIGVKTIKIDEKVVSLNTSLLSIDNNGVGGTK 291
Query: 226 -----SFTFLPTEIYAEVVVKFDKLVSSKRISLQGN--SWKYCYNASSEEML--KVPDMR 276
+T L IY V F K +++ I G+ +++CY + L VP +
Sbjct: 292 ISTVDPYTVLEASIYKAVTDAFVKAPAARNIKRVGSVAPFEFCYTNLTGTRLGAAVPTIE 351
Query: 277 LIFSKNQSFVVR 288
L F +N++ V R
Sbjct: 352 L-FLQNENVVWR 362
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 52/205 (25%), Positives = 84/205 (40%), Gaps = 27/205 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS-------RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
+DPS+S + K++SC+ C S C++ + C Y A Y + + S GYL D+
Sbjct: 161 FDPSASKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYG-DSSYSMGYLSQDL 219
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
L LA S + GCG+ G + A G++GLG +S+ L +
Sbjct: 220 LTLA-------PSQTLPGFVYGCGQDSDGLFGRAA---GILGLGRNKLSM--LGQVSSKF 267
Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGE---KYDAYFVGVESYCIGNSCLTQ 214
+FS C G + + F P+ YF+ + + +G L
Sbjct: 268 GYAFSYCLPTRGGGGFLSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGV 327
Query: 215 SGFQ----ALVDSGASFTFLPTEIY 235
+ Q ++DSG T LP +Y
Sbjct: 328 AAAQYRVPTIIDSGTVITRLPMSVY 352
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 87/371 (23%), Positives = 163/371 (43%), Gaps = 64/371 (17%)
Query: 5 ICFGSHANAYNALLCLPVTTLLW-----CLLVFGASIVQDRNLSEYDPSSSSSSKNVSCS 59
+ G+ A Y+A++ + L+W C + F D+ +DP SSS + CS
Sbjct: 101 LAIGTPAETYSAIMDTG-SDLIWTQCKPCKVCF------DQPTPIFDPEKSSSFSKLPCS 153
Query: 60 HPLCKSR--SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVI 117
LC + SSC D C Y Y + +S+ G L + S S +
Sbjct: 154 SDLCVALPISSC---SDGCEYRYSYG-DHSSTQGVLATETFTFGDAS--------VSKIG 201
Query: 118 IGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSG--SVF 174
GCG G +Y GA G++GLG G + SL+++ G+ + S+ + ++ G ++
Sbjct: 202 FGCGEDNRGRAYSQGA---GLVGLGRGPL---SLISQLGVPKFSYCLTSIDDSKGISTLL 255
Query: 175 FGDQGPATQQSTSFLPIGE---KYDAYFVGVESYCIGNSCL--TQSGFQA--------LV 221
G + AT +S P+ + + Y++ +E +G++ L +S F ++
Sbjct: 256 VGSE--ATVKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLII 313
Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN---SWKYCYNASSE-EMLKVPDMRL 277
DSG + T+L +A + +F +S ++ + + + C+ + + VP +
Sbjct: 314 DSGTTITYLKDSAFAALKKEF---ISQMKLDVDASGSTELELCFTLPPDGSPVDVPQLVF 370
Query: 278 IFSK-NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVF-DRENLK 335
F + N+I E+ V CLT+ S+ G I G NF + +V D E
Sbjct: 371 HFEGVDLKLPKENYII---EDSALRVICLTMGSSSG-MSIFG-NFQQQNIVVLHDLEKET 425
Query: 336 LAWSHSKCEEV 346
++++ ++C ++
Sbjct: 426 ISFAPAQCNQL 436
>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
Length = 489
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 75/338 (22%), Positives = 135/338 (39%), Gaps = 52/338 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKD------PCPYIADYSTEDTSSSGYLVDDIL 98
+DPS S + + +SC P+C+ C ++ D C + Y + + SG LV D+
Sbjct: 168 HDPSKSRTFRRLSCFDPMCE---LCTAVVDGGGGSAGCLFRRRYG-DGGAVSGELVSDVF 223
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
H + + ++ V GC + + G + G++ LG+G PS + + G+
Sbjct: 224 HFGA-AGDGGGYQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGK---PSFVTQLGV-- 276
Query: 159 NSFSICF---------------DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVE 203
+ FS C +E + + FG T + F G Y V
Sbjct: 277 DRFSYCIPASEITDDDDDDDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSV- 335
Query: 204 SYCIGNSCLTQ-------SGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVS-SK 249
Y G Q +G +A LVDSG + +LP ++ + + ++ +S ++
Sbjct: 336 VYQHGGRLNQQQPVPVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTR 395
Query: 250 RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF-VVRNHIFSFPENEGFTVFCLTVM 308
R L S YCY + ++ V + L F + +F EN CL V
Sbjct: 396 RYDLTHPSL-YCYLGNMTDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVA 453
Query: 309 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+ G+ I+G + +D +++A+ +C+ V
Sbjct: 454 A--GNRAILGVYPQRNINVGYDLSTMEIAFDRDQCDRV 489
>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
SURVIVAL 1; Flags: Precursor
gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 453
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 83/352 (23%), Positives = 147/352 (41%), Gaps = 67/352 (19%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLV 94
++ +DP+ SSS + CS P C++R+ SC S K C Y+ + +SS G L
Sbjct: 110 VNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIPASCDSDKL-CHATLSYA-DASSSEGNLA 167
Query: 95 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGS-YLDGAAPDGVMGLGLGDVSVPSLLAK 153
+I H + S+ S++I GC +GS + G++G+ G + S +++
Sbjct: 168 AEIFHFGN-------STNDSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSL---SFISQ 217
Query: 154 AGLIQNSFSICFDENDSGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVE 203
G + S+ I ++ G + GD P + ST LP ++ AY V +
Sbjct: 218 MGFPKFSYCISGTDDFPGFLLLGDSNFTWLTPLNYTPLIRISTP-LPYFDRV-AYTVQLT 275
Query: 204 SYCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDK-------L 245
+ L T +G Q +VDSG FTFL +Y + F +
Sbjct: 276 GIKVNGKLLPIPKSVLVPDHTGAG-QTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTV 334
Query: 246 VSSKRISLQGNSWKYCYNAS-----SEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPE--- 296
QG + CY S S + ++P + L+F + V + ++ P
Sbjct: 335 YEDPDFVFQG-TMDLCYRISPVRIRSGILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTV 393
Query: 297 -NEGFTVFCLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
N+ +V+C T ++D + +IG + I FD + ++ + +C+
Sbjct: 394 GND--SVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEFDLQRSRIGLAPVECD 443
>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 57/227 (25%), Positives = 96/227 (42%), Gaps = 23/227 (10%)
Query: 25 LLWCLLVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYST 84
L+W + + +L +DP SS+ KNV C C+ ++ C Y D
Sbjct: 121 LVWIPCLSFKPCTHNCDLRFFDPMESSTYKNVPCDSYRCQITNAATCQFSDCFYSCDPRH 180
Query: 85 EDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGD 144
+D+ G L D L L S + +S + + CG + G Y G++GLG G
Sbjct: 181 QDSCPDGDLAMDTLTLNSTTG---KSFMLPNTGFICGNRIGGDY----PGVGILGLGHGS 233
Query: 145 VSVPSLLAKAGLIQNSFSIC---FDENDSGSVFFGDQGPATQQ---STSFLPIGEKYDAY 198
+S+ + ++ LI FS C + N + + FGD+ + ST G Y +Y
Sbjct: 234 LSLLNRISH--LIDGKFSHCIVPYSSNQTSKLSFGDKAVVSGSAMFSTRLDMTGGPY-SY 290
Query: 199 FVGVESYCIGNSCLTQSGFQAL-------VDSGASFTFLPTEIYAEV 238
+ +GN ++ G + +DSG FT+ P Y+++
Sbjct: 291 TLSFYGISVGNKSISAGGIGSDYYMNGLGMDSGTMFTYFPEYFYSQL 337
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 83/350 (23%), Positives = 141/350 (40%), Gaps = 56/350 (16%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
+ P +S++ V C C SR SC + C Y+ + ++S G L D+
Sbjct: 102 FRPRASATFAAVPCGSARCSSRDLPAPPSCDAASRRCRVSLSYA-DGSASDGALATDVFA 160
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
+ AP ++S+ GC S D A G++G+ G +S + +A
Sbjct: 161 VGD----AP--PLRSA--FGCMSAAYDSSPDAVATAGLLGMNRGALS---FVTQAS--TR 207
Query: 160 SFSICF-DENDSGSVFFGDQG--------PATQQSTSFLPIGEKYDAYFVGVESYCIGNS 210
FS C D +D+G + G Q T LP ++ AY V + +G
Sbjct: 208 RFSYCISDRDDAGVLLLGHSDLPFLPLNYTPLYQPTPPLPYFDRV-AYSVQLLGIRVGGK 266
Query: 211 CL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK 259
L T +G Q +VDSG FTFL + Y+ V +F K +L+ S+
Sbjct: 267 PLPIPPSVLAPDHTGAG-QTMVDSGTQFTFLLGDAYSAVKAEFLKQTKPLLPALEDPSFA 325
Query: 260 Y------CYN---ASSEEMLKVPDMRLIFSKNQSFVVRNH-IFSFP-ENEGFT-VFCLTV 307
+ C+ ++P + L+F+ Q V + ++ P E G V+CLT
Sbjct: 326 FQEAFDTCFRVPKGRPPPSARLPPVTLLFNGAQMSVAGDRLLYKVPGERRGADGVWCLTF 385
Query: 308 MSTDG---DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHL 354
+ D +IG + M + +D E ++ + KC+ ++ + L
Sbjct: 386 GNADMVPLTAYVIGHHHQMNLWVEYDLERGRVGLAPVKCDVASERLGLML 435
>gi|222629462|gb|EEE61594.1| hypothetical protein OsJ_16002 [Oryza sativa Japonica Group]
Length = 468
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 75/338 (22%), Positives = 135/338 (39%), Gaps = 52/338 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKD------PCPYIADYSTEDTSSSGYLVDDIL 98
+DPS S + + +SC P+C+ C ++ D C + Y + + SG LV D+
Sbjct: 147 HDPSKSRTFRRLSCFDPMCE---LCTAVVDGGGGSAGCLFRRRYG-DGGAVSGELVSDVF 202
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
H + + ++ V GC + + G + G++ LG+G PS + + G+
Sbjct: 203 HFGA-AGDGGGYQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGK---PSFVTQLGV-- 255
Query: 159 NSFSICF---------------DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVE 203
+ FS C +E + + FG T + F G Y V
Sbjct: 256 DRFSYCIPASEITDDDDDDDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSV- 314
Query: 204 SYCIGNSCLTQ-------SGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVS-SK 249
Y G Q +G +A LVDSG + +LP ++ + + ++ +S ++
Sbjct: 315 VYQHGGRLNQQQPVPVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTR 374
Query: 250 RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF-VVRNHIFSFPENEGFTVFCLTVM 308
R L S YCY + ++ V + L F + +F EN CL V
Sbjct: 375 RYDLTHPSL-YCYLGNMTDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVA 432
Query: 309 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+ G+ I+G + +D +++A+ +C+ V
Sbjct: 433 A--GNRAILGVYPQRNINVGYDLSTMEIAFDRDQCDRV 468
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 89/350 (25%), Positives = 147/350 (42%), Gaps = 70/350 (20%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVD 95
S ++P SS + V C P CK+R+ SC + K C I Y+ + TS G L
Sbjct: 105 SVFNPLSSKTYSKVPCLSPTCKTRTRDLTIPVSCDATK-LCHVIVSYA-DATSIEGNLAF 162
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKA 154
+ L S +K A I GC S + + G++G+ G +S + +
Sbjct: 163 ETFRLGSLTKPA--------TIFGCMDSGFSSNSEEDSKTTGLIGMNRGSLSFVNQMGYP 214
Query: 155 GLIQNSFSICFDENDS-GSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVE 203
FS C DS G + G+ P Q ST LP ++ AY V +E
Sbjct: 215 -----KFSYCISGFDSAGVLLLGNASFPWLKPLSYTPLVQISTP-LPYFDRV-AYTVQLE 267
Query: 204 SYCIGNSCLT--QSGF--------QALVDSGASFTFLPTEIYAEVVVKF-------DKLV 246
+ N L+ +S F Q +VDSG FTFL +Y + +F K++
Sbjct: 268 GIKVKNKVLSLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFLSQTRGILKVL 327
Query: 247 SSKRISLQGNSWKYCY--NASSEEMLKVPDMRLIFSKNQSFVVRNH-IFSFP-ENEGF-T 301
+ QG + CY ++S + +P + L+F + V ++ P E G +
Sbjct: 328 NDDNFVFQG-AMDLCYLLDSSRPNLQNLPVVSLMFQGAEMSVSGERLLYRVPGEVRGRDS 386
Query: 302 VFCLTVMSTDGDYGIIG-QNFMMGHR------IVFDRENLKLAWSHSKCE 344
V+C T ++D ++G + F++GH + FD E ++ + +C+
Sbjct: 387 VWCFTFGNSD----LLGVEAFVIGHHHQQNVWMEFDLEKSRIGLADVRCD 432
>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 470
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 85/365 (23%), Positives = 135/365 (36%), Gaps = 82/365 (22%)
Query: 37 VQDRNLSEYDPSSSSSSKNVSCSHPLC------KSRSSCKSLKDP--------CP-YIAD 81
+ + + P +SS++K + C +P C S C K P CP YI
Sbjct: 130 IDPTKIPTFIPKNSSTAKLLGCRNPKCGYLFGPDVESRCPQCKKPGSQNCSLTCPSYIIQ 189
Query: 82 YSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLG 141
Y T+ G+L+ D L+ K PQ ++GC S L P G+ G G
Sbjct: 190 YGLGATA--GFLLLDNLNFPG--KTVPQ------FLVGC------SILSIRQPSGIAGFG 233
Query: 142 LGDVSVPSLLAKAGLIQNSFSIC-----FDENDSGS---VFFGDQGPATQQSTSFLPIGE 193
G S+PS + FS C FD+ S + G S+ P
Sbjct: 234 RGQESLPSQMN-----LKRFSYCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRS 288
Query: 194 K-------YDAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYA 236
+ Y+V + +G + + +VDSG++FTF+ +Y
Sbjct: 289 NPSNNSVFREYYYVTLRKLIVGGVDVKIPYKFLEPGSDGNGGTIVDSGSTFTFMERPVYN 348
Query: 237 EVVVKFDKLVSSKRISLQGN-----SWKYCYNASSEEMLKVPDMRLIFS--KNQSFVVRN 289
V +F + + K+ S + N C+N S + + P+ F S + N
Sbjct: 349 LVAQEFLRQL-GKKYSREENVEAQSGLSPCFNISGVKTISFPEFTFQFKGGAKMSQPLLN 407
Query: 290 HIFSFPENEGFTVFCLTVMSTDGDYG---------IIGQNFMMGHRIVFDRENLKLAWSH 340
+ FSF + V C TV+S DG G I+G + +D EN + +
Sbjct: 408 Y-FSFVGDA--EVLCFTVVS-DGGAGQPKTAGPAIILGNYQQQNFYVEYDLENERFGFGP 463
Query: 341 SKCEE 345
C+
Sbjct: 464 RNCKR 468
>gi|115460260|ref|NP_001053730.1| Os04g0595000 [Oryza sativa Japonica Group]
gi|113565301|dbj|BAF15644.1| Os04g0595000, partial [Oryza sativa Japonica Group]
Length = 471
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 75/338 (22%), Positives = 135/338 (39%), Gaps = 52/338 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKD------PCPYIADYSTEDTSSSGYLVDDIL 98
+DPS S + + +SC P+C+ C ++ D C + Y + + SG LV D+
Sbjct: 150 HDPSKSRTFRRLSCFDPMCE---LCTAVVDGGGGSAGCLFRRRYG-DGGAVSGELVSDVF 205
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
H + + ++ V GC + + G + G++ LG+G PS + + G+
Sbjct: 206 HFGA-AGDGGGYQLERDVAFGCAHVEDSKAVRGYS-TGILALGIGK---PSFVTQLGV-- 258
Query: 159 NSFSICF---------------DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVE 203
+ FS C +E + + FG T + F G Y V
Sbjct: 259 DRFSYCIPASEITDDDDDDDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSV- 317
Query: 204 SYCIGNSCLTQ-------SGFQA------LVDSGASFTFLPTEIYAEVVVKFDKLVS-SK 249
Y G Q +G +A LVDSG + +LP ++ + + ++ +S ++
Sbjct: 318 VYQHGGRLNQQQPVPVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTR 377
Query: 250 RISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSF-VVRNHIFSFPENEGFTVFCLTVM 308
R L S YCY + ++ V + L F + +F EN CL V
Sbjct: 378 RYDLTHPSL-YCYLGNMTDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVA 435
Query: 309 STDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+ G+ I+G + +D +++A+ +C+ V
Sbjct: 436 A--GNRAILGVYPQRNINVGYDLSTMEIAFDRDQCDRV 471
>gi|342871686|gb|EGU74178.1| hypothetical protein FOXB_15313 [Fusarium oxysporum Fo5176]
Length = 656
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 67/285 (23%), Positives = 123/285 (43%), Gaps = 29/285 (10%)
Query: 146 SVPSLLAKAGLI-QNSFSICFD--ENDSGSVFFGDQGPATQQSTS---FLPIGE---KYD 196
++P+ LA GLI N++S+ + E+ +G++ FG G +Q T LPI + ++
Sbjct: 199 NLPAKLASKGLIASNAYSLYLNDLESATGTILFG--GVDQEQYTGDLVTLPINKINGEFA 256
Query: 197 AYFVGVESYCIGNSCLTQS-GFQALVDSGASFTFLP----TEIYAEVVVKFDKLVSSKRI 251
+ ++S + + + ++DSG++ ++LP ++IY V ++++ S +
Sbjct: 257 ELSITLQSVSADSETIADNLDLAVILDSGSTLSYLPATLTSDIYDIVGAQYEEGESVAYV 316
Query: 252 --SLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS 309
L +S + + VP L+ V SF + F +
Sbjct: 317 PCDLGNDSGNLTFKFKDPAEISVPLSELVLDFTD---VTGRQLSFDNGQAACTFG--IAP 371
Query: 310 TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTT 369
T GD I+G F+ +VFD EN +++ + S D + H++ G+ P P T
Sbjct: 372 TTGDISILGDTFLRSAYVVFDLENNEISLAQSN----FDATKSHILEIGTGKHPVPTATG 427
Query: 370 EQQSTSNGQAAA--PPSTAKTAPSKSIAASAQQLDSVLRVACSLL 412
S + AAA P A S A A + + + +A S L
Sbjct: 428 SGSSDNKENAAASLAPLGGDAAISMVAGAFALGMTAYIELAASWL 472
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 126/313 (40%), Gaps = 34/313 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+DP+ SS+ + VSC+ C + + C + C Y Y + ++++G D L L
Sbjct: 171 FDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYG-DGSTTNGTYSRDTLTL 229
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+ S GC ++G + D DG+MGLG G S+ S A A NS
Sbjct: 230 SGASDAV------KGFQFGCSHVESG-FSD--QTDGLMGLGGGAQSLVSQTAAA--YGNS 278
Query: 161 FSICFDENDSGSVFFGDQGPATQQS----TSFLPIGEKYDAYFVGVESYCIGNS--CLTQ 214
FS C SGS F G S T L + Y ++ +G L+
Sbjct: 279 FSYCLPPT-SGSSGFLTLGGGGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLSP 337
Query: 215 SGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
S F A +VDSG T LP Y+ + F + R + + C++ + + + +
Sbjct: 338 SVFAAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISI 397
Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMST--DGDYGIIGQNFMMGHRIVFD 330
P + L+FS + + + + CL +T DG GIIG +++D
Sbjct: 398 PTVALVFSGGAAIDLDPNGIMYGN-------CLAFAATGDDGTTGIIGNVQQRTFEVLYD 450
Query: 331 RENLKLAWSHSKC 343
+ L + C
Sbjct: 451 VGSSTLGFRSGAC 463
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 80/345 (23%), Positives = 137/345 (39%), Gaps = 56/345 (16%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
+ P +S++ V C C SR SC C Y+ + ++S G L D+
Sbjct: 111 FRPRASATFAAVPCGSTQCSSRDLPAPPSCDGASRQCHVSLSYA-DGSASDGALATDVF- 168
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
+ + P S GC S DG A G++G+ G +S + +A
Sbjct: 169 --AVGEAPPLRSA-----FGCMSTAYDSSPDGVATAGLLGMNRGTLS---FVTQAS--TR 216
Query: 160 SFSICF-DENDSGSVFFGDQG--------PATQQSTSFLPIGEKYDAYFVGVESYCIGNS 210
FS C D +D+G + G Q T LP ++ AY V + +G
Sbjct: 217 RFSYCISDRDDAGVLLLGHSDLPFLPLNYTPLYQPTLPLPYFDRV-AYSVQLLGIRVGGK 275
Query: 211 CL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK 259
L T +G Q +VDSG FTFL + Y+ + +F K +L S+
Sbjct: 276 ALPIPASVLAPDHTGAG-QTMVDSGTQFTFLLGDAYSALKAEFLKQTKPLLRALDDPSFA 334
Query: 260 Y------CYNASS---EEMLKVPDMRLIFSKNQSFVVRNH-IFSFP-ENEGFT-VFCLTV 307
+ C+ + ++P + L+F+ + V + ++ P E+ G V+CLT
Sbjct: 335 FQEALDTCFRVPAGRPPPSARLPPVTLLFNGAEMSVAGDRLLYKVPGEHRGADGVWCLTF 394
Query: 308 MSTDG---DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDK 349
+ D +IG + M + +D E ++ + KC+ ++
Sbjct: 395 GNADMVPLTAYVIGHHHQMNLWVEYDLERGRVGLAPVKCDVASER 439
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 82/327 (25%), Positives = 132/327 (40%), Gaps = 65/327 (19%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVD 95
S ++P SSS+ V CS P+C++R+ SC C ++A + TS G L
Sbjct: 97 SVFNPVSSSTYSPVPCSSPICRTRTRDLPIPASCDPKTHFC-HVAISYADATSIEGNLAH 155
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGS-YLDGAAPDGVMGLGLGDVSVPSLLAKA 154
D + S ++ + GC S + A G+MG+ G +S + L +
Sbjct: 156 DTFVIGSVTRPG--------TLFGCMDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQLGFS 207
Query: 155 GLIQNSFSICFDEND-SGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVE 203
FS C +D SG + GD P Q+T LP ++ AY V +E
Sbjct: 208 -----KFSYCISGSDSSGILLLGDASYSWLGPIQYTPLVLQTTP-LPYFDRV-AYTVQLE 260
Query: 204 SYCIGNSCLT--QSGF--------QALVDSGASFTFLPTEIYAEVVVKFD-------KLV 246
+G+ L+ +S F Q +VDSG FTFL +Y + +F ++V
Sbjct: 261 GIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFIAQTKSVLRIV 320
Query: 247 SSKRISLQGNSWKYCYNASSE---EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGF--- 300
QG + CY S +P + L+F + V + G
Sbjct: 321 DDPNFVFQG-TMDLCYRVGSSTRPNFTGLPVISLMFRGAEMSVSGQKLLYRVNGAGSEGK 379
Query: 301 -TVFCLTVMSTDGDYGIIG-QNFMMGH 325
V+C T ++D ++G + F++GH
Sbjct: 380 EEVYCFTFGNSD----LLGIEAFVIGH 402
>gi|225719388|gb|ACO15540.1| Cathepsin D precursor [Caligus clemensi]
Length = 362
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 75/318 (23%), Positives = 132/318 (41%), Gaps = 43/318 (13%)
Query: 47 PSSSSSSKNVSC-SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 105
PSS+ + NV C +H S +S +KD + Y +SG+L D K
Sbjct: 69 PSSTCGAPNVPCKTHNQYDSGNSSTHVKDGSKFNVKYKI--GKASGFLSQD--------K 118
Query: 106 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS-FSIC 164
G ++ DGV+GLG G S + L G I++ FS+
Sbjct: 119 VCVDGVCMEEQTFGEATSESMDPFANVYHDGVLGLGFGKDSFLNSLLDQGRIESPLFSLW 178
Query: 165 FD------ENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCI-----GNSCLT 213
+ +N+S V G + S++P+ D + VG++S I G +T
Sbjct: 179 VNRQPFRSKNNSRLVLGGIDTGHYSGNISYIPLNSD-DVWRVGMKSISIKGVHRGCGFIT 237
Query: 214 QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 273
+ G + D+G+ FT+ P + A+ + ++ + + +I+ S+ Y Y E+L +P
Sbjct: 238 RPGCDVVFDAGSRFTYGPI-LEAKTI---NRWIGATQIA---PSYGY-YKVRCNEILTLP 289
Query: 274 DMRLIFS------KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
++ L+F K + ++V I T V T + +G NF +
Sbjct: 290 NVELVFEDLTLVLKPKDYIVETKILGMK-----TCMSGFVGLTKQESWTLGANFFGAYFS 344
Query: 328 VFDRENLKLAWSHSKCEE 345
V+D EN ++ + S+ E
Sbjct: 345 VYDIENKRIGLATSRRAE 362
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 72/319 (22%), Positives = 117/319 (36%), Gaps = 41/319 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DP+ SSS VSC +C + C Y Y + + + G L + L +
Sbjct: 185 FDPADSSSFAGVSCGSDVCDRLENTGCNAGRCRYEVSYG-DGSYTKGTLALETLTVGQV- 242
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
+ V IGCG G ++ A G+ S+ + G +FS C
Sbjct: 243 -------MIRDVAIGCGHTNQGMFIGAAGLLGLG-----GGSMSFIGQLGGQTGGAFSYC 290
Query: 165 FDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA----------YFVGVESYCIGNSC--- 211
+GS A + LP+G + + Y++G+ +G
Sbjct: 291 LVSRGTGST------GALEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSV 344
Query: 212 ------LTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
LT+ G +V D+G + T PT Y F S+ + + + CY+
Sbjct: 345 PEETFQLTEYGTNGVVMDTGTAVTRFPTAAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDL 404
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
+ E ++VP + FS + F P + G T FCL + IIG G
Sbjct: 405 NGFESVRVPTVSFYFSDGPVLTLPARNFLIPVDGGGT-FCLAFAPSPSGLSIIGNIQQEG 463
Query: 325 HRIVFDRENLKLAWSHSKC 343
+I FD N + + + C
Sbjct: 464 IQISFDGANGFVGFGPNIC 482
>gi|301103993|ref|XP_002901082.1| aspartyl protease family A01B, putative [Phytophthora infestans
T30-4]
gi|262101420|gb|EEY59472.1| aspartyl protease family A01B, putative [Phytophthora infestans
T30-4]
Length = 446
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 62/297 (20%), Positives = 127/297 (42%), Gaps = 29/297 (9%)
Query: 76 CPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD 135
C Y Y E + Y D++ L+S S ++ + GC +Q+G +LD + D
Sbjct: 119 CKYGQTY-IEGDHWTAYKASDVMQLSS--------SFEARIEFGCIYEQSGVFLDQPS-D 168
Query: 136 GVMGLGLGDVSVPSLLAKAGLIQNS-FSICFDENDSGSVFFGDQGPATQQSTSFLPIGEK 194
G+MG S+ + + + FS C E G + + P+
Sbjct: 169 GIMGFSRHPDSIFEQFYRQKVTHSRIFSQCLAEGGGLLTIGGVDLARHTEPVRYTPLRNT 228
Query: 195 -YDAYFVGVESYCIGNSCLT----QSGFQA----LVDSGASFTFLPTEIYAEVVVKFDKL 245
Y + V + S +G++ T + F A ++DSG +F ++P + + +
Sbjct: 229 GYQYWTVTLLSVSVGDANNTVQVDRKEFNADRGCVLDSGTTFLYMPESTKQPFRLAWSRA 288
Query: 246 VSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFC 304
V S + N++ Y +S+++ +PD+ F + + + F+ N ++
Sbjct: 289 VGSFSFVPESNTF---YFMTSKQVAALPDICFWFKNDVHICLPSSRYFALVGN---GIYT 342
Query: 305 LTVMSTDGDYG-IIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAG 360
T+ T G I+G + + GH +++D +N ++ + + C++ + ++ V L P G
Sbjct: 343 GTIFFTAGPKATILGASVLEGHDVIYDVDNHRVGIAEAMCDQPL-QAEVELSLDPGG 398
>gi|413952262|gb|AFW84911.1| hypothetical protein ZEAMMB73_904583 [Zea mays]
Length = 312
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 58/247 (23%), Positives = 97/247 (39%), Gaps = 14/247 (5%)
Query: 114 SSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDSGS 172
+S++ GC Q+G A DG+ G G +SV S L G+ FS C +D+G
Sbjct: 17 ASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGG 76
Query: 173 VFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYC-------IGNSCLTQSGFQA-LVDSG 224
G + + P+ Y + +ES I +S T S Q +VDSG
Sbjct: 77 GIL-VLGEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQGTIVDSG 135
Query: 225 ASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 284
+ +L Y V VS SL + C+ SS P + L F +
Sbjct: 136 TTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQ-CFITSSSVDSSFPTVTLYFMGGVA 194
Query: 285 FVVR--NHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDRENLKLAWSHS 341
V+ N++ + ++C+ G + I+G + V+D N+++ W+
Sbjct: 195 MSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWADY 254
Query: 342 KCEEVID 348
C ++
Sbjct: 255 DCSMSVN 261
>gi|147859621|emb|CAN83119.1| hypothetical protein VITISV_043393 [Vitis vinifera]
Length = 431
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 62/270 (22%), Positives = 109/270 (40%), Gaps = 25/270 (9%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
L+ YD S + K VSC C + S C + C Y Y+ + +SS GY V
Sbjct: 117 LTLYDIKESLTGKLVSCDQDFCYAINGGPPSYCIA-NMSCSYTEIYA-DGSSSFGYFVKG 174
Query: 97 ILHLASFSK--HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
+ ++ H + + V + C Q+G A DG++G G + S+ S LA +
Sbjct: 175 YCTASKYNSIPHLNNNPLLE-VPLRCSATQSGDLSSEEALDGILGFGKSNTSMISQLASS 233
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT- 213
G ++ F+ C D + G +F G Q + P+ Y V +++ +G L
Sbjct: 234 GKVRKMFAHCLDGLNGGGIF--AIGHIVQPKVNTTPLVPNQTHYNVNMKAVEVGGYFLNL 291
Query: 214 ---------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNA 264
+ G ++DSG + +LP +Y +++ K S ++ + + C+
Sbjct: 292 PTDVFDVGDKKG--TIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQFT-CFQY 348
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNHIFSF 294
S P + F + V H + F
Sbjct: 349 SESLDDGFPAVTFHFENSLYLKVHPHEYLF 378
>gi|213998828|gb|ACJ60781.1| nucellin [Hordeum brachyantherum subsp. californicum]
Length = 133
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 33/115 (28%), Positives = 56/115 (48%), Gaps = 3/115 (2%)
Query: 135 DGVMGLGLGDVSVPSLLAKAGLIQ-NSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGE 193
DG++GLG+G L +I N C G ++ GD P ++ T ++P+ E
Sbjct: 10 DGILGLGMGKAGFAVQLKGQKMITGNVIGHCLSSQGKGVLYVGDFNPPSRGVT-WVPMKE 68
Query: 194 KYDAYFVGVESYCIGNSCLT-QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVS 247
Y G+ I N + F+A+ DSG+++T +P ++Y E+V K +S
Sbjct: 69 SLFYYSPGLAEPLIDNQPIRGNPTFEAVFDSGSTYTHVPAQVYNEIVSKVRGTLS 123
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 76/325 (23%), Positives = 121/325 (37%), Gaps = 46/325 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
YDPS SSSS CS P C++ + C D C Y Y + ++S+G + D+L L
Sbjct: 187 YDPSKSSSSAAFPCSSPACRNLGPYANGCTPAGDQCQYRVQYP-DGSASAGTYISDVLTL 245
Query: 101 ASFSKHAPQSSVQSSVIIGCGRK--QTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
A +S S GC Q GS+ + + G+M LG G S+P+
Sbjct: 246 ----NPAKPASAISEFRFGCSHALLQPGSFSNKTS--GIMALGRGAQSLPT--QTKATYG 297
Query: 159 NSFSICFDENDSGSVFFGDQGPATQQST-SFLPIGEKYDA---YFVGVESYCIGNSCLTQ 214
+ FS C S FF P S + P+ A Y V + + + L
Sbjct: 298 DVFSYCLPPTPVHSGFFILGVPRVAASRYAVTPMLRSKAAPMLYLVRLIAIEVAGKRLPV 357
Query: 215 S----GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS----- 265
A++DS T LP Y + F + + R + CY+ S
Sbjct: 358 PPAVFAAGAVMDSRTIVTRLPPTAYMALRAAFVAEMRAYRAAAPKEHLDTCYDFSGAAPG 417
Query: 266 SEEMLKVPDMRLIFSK-------NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIG 318
+K+P + L+F + S V+ + +F N + D GIIG
Sbjct: 418 GGGGVKLPKITLVFDGPNGAVELDPSGVLLDGCLAFAPN-----------TDDQMTGIIG 466
Query: 319 QNFMMGHRIVFDRENLKLAWSHSKC 343
++++ + + + C
Sbjct: 467 NVQQQALEVLYNVDGATVGFRRGAC 491
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 71/321 (22%), Positives = 125/321 (38%), Gaps = 42/321 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP S + + CS P C+ S C + + C Y Y + + + + +
Sbjct: 184 FDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETL----T 239
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG-LIQNSF 161
F ++ + V +GCG G ++ A G+ S + G F
Sbjct: 240 FRRNRVKG-----VALGCGHDNEGLFVGAAGLLGLG------KGKLSFPGQTGHRFNQKF 288
Query: 162 SICFDENDSGS----VFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNS---C 211
S C + + S V FG+ A + F P+ K D Y+VG+ +G +
Sbjct: 289 SYCLVDRSASSKPSSVVFGNA--AVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPG 346
Query: 212 LTQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
+T S F+ ++DSG S T L Y + F + + + + + C++
Sbjct: 347 VTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSLFDTCFD 406
Query: 264 ASSEEMLKVPDMRLIFSK-NQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
S+ +KVP + L F + + S N++ N F C T G IIG
Sbjct: 407 LSNMNEVKVPTVVLHFRRADVSLPATNYLIPVDTNGKF---CFAFAGTMGGLSIIGNIQQ 463
Query: 323 MGHRIVFDRENLKLAWSHSKC 343
G R+V+D + ++ ++ C
Sbjct: 464 QGFRVVYDLASSRVGFAPGGC 484
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 81/343 (23%), Positives = 128/343 (37%), Gaps = 57/343 (16%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
D+ + P+ S++ + V C PLC + + Y ++ S++G L +
Sbjct: 128 DQPTPYFRPARSATYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASE-- 185
Query: 99 HLASFSKHAPQSS--VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
+F+ A SS + S V GCG +G + + G++GLG G +S+ S L +
Sbjct: 186 ---TFTFGAANSSKVMVSDVAFGCGNINSGQLANSS---GMVGLGRGPLSLVSQLGPS-- 237
Query: 157 IQNSFSIC---FDENDSGSVFFG----------DQGPATQQSTSFLPIGEKYDAYFVGVE 203
FS C F + + FG + QST + YF+ ++
Sbjct: 238 ---RFSYCLTSFLSPEPSRLNFGVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLK 294
Query: 204 SYCIGNSCLTQSGF----------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 253
+G L +DSG S T+L + Y V +LVS R
Sbjct: 295 GISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDAYDAVR---HELVSVLRPLP 351
Query: 254 QGNSWK------YCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN----EGFTVF 303
N + + + + VPDM L F + V PEN +G T F
Sbjct: 352 PTNDTEIGLETCFPWPPPPSVAVTVPDMELHFDGGANMTVP------PENYMLIDGATGF 405
Query: 304 CLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
M GD IIG I++D N L++ + C V
Sbjct: 406 LCLAMIRSGDATIIGNYQQQNMHILYDIANSLLSFVPAPCNIV 448
>gi|302757745|ref|XP_002962296.1| hypothetical protein SELMODRAFT_27319 [Selaginella moellendorffii]
gi|300170955|gb|EFJ37556.1| hypothetical protein SELMODRAFT_27319 [Selaginella moellendorffii]
Length = 163
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 61/134 (45%), Gaps = 10/134 (7%)
Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE-------MLKV 272
+ DSG + TFLP +Y +V+ F + ++ ++ CYN S + L
Sbjct: 32 IFDSGTTLTFLPLGVYIQVISVFSRRINLPLVNGTSVGLDLCYNISLQRDYTFPSLALHF 91
Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDR 331
PD + ++ VV + + NE +V CL +MS+ IIG G+ I+FD
Sbjct: 92 PDAWMNLHQDNYIVVPSRADAEAWNE--SVACLAIMSSASIGINIIGNVMQQGYHIMFDN 149
Query: 332 ENLKLAWSHSKCEE 345
E + ++ + C E
Sbjct: 150 EKSTVTFAPASCSE 163
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 51.6 bits (122), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 75/321 (23%), Positives = 128/321 (39%), Gaps = 46/321 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS-----RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
+DPS SS+ ++C C R+ C S C Y +Y + +S+ G ++ +
Sbjct: 169 FDPSKSSTYAPIACGADACNKLGDHYRNGCTSGGTQCGYRVEYG-DGSSTRGVYSNETIT 227
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
AP +V+ GCG Q G DG++GLG S+ ++ A +
Sbjct: 228 F------APGITVK-DFHFGCGHDQRGP---SDKFDGLLGLGGAPESL--VVQTASVYGG 275
Query: 160 SFSICFD--ENDSGSVFFGDQGPATQQSTSF-------LPIGEKYDAYFVGVESYCIGNS 210
+FS C +++G + G + A +++F LP+ +Y V + +G
Sbjct: 276 AFSYCLPALNSEAGFLALGVRPSAATNTSAFVFTPMWHLPMDAT--SYMVNMTGISVGGK 333
Query: 211 CLT--QSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASS 266
L +S F+ L+DSG T LP Y + K ++ + + + CYN +
Sbjct: 334 PLDIPRSAFRGGMLIDSGTIVTELPETAYNALNAALRKAFAAYPM-VASEDFDTCYNFTG 392
Query: 267 EEMLKVPDMRLIFSKNQS--FVVRNHIFSFPENEGFTVFCLTVMSTDGD--YGIIGQNFM 322
+ VP + L FS + V N I CL + D GIIG
Sbjct: 393 YSNVTVPRVALTFSGGATIDLDVPNGI--------LVKDCLAFRESGPDVGLGIIGNVNQ 444
Query: 323 MGHRIVFDRENLKLAWSHSKC 343
+++D + K+ + C
Sbjct: 445 RTLEVLYDAGHGKVGFRAGAC 465
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 82/333 (24%), Positives = 133/333 (39%), Gaps = 58/333 (17%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DPS SS+ N+SCS C + C + CPY +Y SS G + L L +
Sbjct: 135 FDPSKSSTYSNLSCSE--C---NKCDVVNGECPYSVEY-VGSGSSQGIYAREQLTLETID 188
Query: 105 KHAPQSSVQSSVIIGCGRK----QTGSYLDGAAPDGVMGLGLGDVS-VPSLLAKAGLIQN 159
+ + S+I GCGRK G G +GV GLG G S +PS K
Sbjct: 189 ESIIKV---PSLIFGCGRKFSISSNGYPYQGI--NGVFGLGSGRFSLLPSFGKK------ 237
Query: 160 SFSICFDENDSGSVFF-----GDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
FS C + + F GD+ ST+ I Y+V +E+ IG L
Sbjct: 238 -FSYCIGNLRNTNYKFNRLVLGDKANMQGDSTTLNVIN---GLYYVNLEAISIGGRKLDI 293
Query: 213 ---------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ---GNSWKY 260
T + ++DSGA T+L + + + + L+ + Q N +
Sbjct: 294 DPTLFERSITDNNSGVIIDSGADHTWLTKYGFEVLSFEVENLLEGVLVLAQQDKHNPYTL 353
Query: 261 CYNA-SSEEMLKVPDMRLIFSKNQ--SFVVRNHIFSFPENEGFTVFCLTVMSTD--GD-- 313
CY+ S+++ P + F++ V + ENE FC+ ++ + GD
Sbjct: 354 CYSGVVSQDLSGFPLVTFHFAEGAVLDLDVTSMFIQTTENE----FCMAMLPGNYFGDDY 409
Query: 314 --YGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
+ IG + + +D +++ + CE
Sbjct: 410 ESFSSIGMLAQQNYNVGYDLNRMRVYFQRIDCE 442
>gi|255552237|ref|XP_002517163.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223543798|gb|EEF45326.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 469
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 68/246 (27%), Positives = 97/246 (39%), Gaps = 50/246 (20%)
Query: 50 SSSSKNVSCSHPLCKSRSS------CKSLKDP------CPYIADYSTEDTSSSGYLVDDI 97
SSS VSC LCK +S C S P C + + +SG + D+
Sbjct: 114 SSSYTPVSCDSLLCKLANSLACATECNSTPKPGCHNNTCAHSPENPVIRLGTSGQIGQDV 173
Query: 98 LHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGL-GLGD--VSVPSLLAK 153
+ L SF+ P V + CG ++L DGV GL GLG+ +S+P+ +
Sbjct: 174 VSLQSFNGKTPDRIVSVPNFPFVCGP----TFLLENLADGVTGLAGLGNSNISLPAQFSS 229
Query: 154 AGLIQNSFSICFDE--NDSGSVFFGD----------------QGPATQQSTSFLPIGEKY 195
A F++C +G +FFGD P + S+L GE
Sbjct: 230 AFGFPKKFAVCLSNSTKSNGLIFFGDGPYSNLPNDLTYTPLIHNPVSTAGGSYL--GEAS 287
Query: 196 DAYFVGVESYCIGNSCLTQSGFQALVDSGAS----------FTFLPTEIYAEVVVKFDKL 245
YF+GV+S IG + + +DS +T L T IY VV F K
Sbjct: 288 VEYFIGVKSIRIGGKDVKFNKTLLSIDSEGKGGTKISTVDPYTVLHTSIYKAVVKAFVKE 347
Query: 246 VSSKRI 251
+ K I
Sbjct: 348 MDKKFI 353
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 79/328 (24%), Positives = 135/328 (41%), Gaps = 39/328 (11%)
Query: 45 YDPSSSSSSKNVSCS--------HPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDD 96
+DPS S+S K + C+ H C+ SS S K C Y Y + + +SG L +
Sbjct: 213 FDPSQSTSFKIIPCNAAACDLVVHDECRDNSSKTSPKT-CKYFYWYG-DSSRTSGDLALE 270
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
L + S S H P S ++IGCG G + ++GLG G +S PS L ++
Sbjct: 271 SLSV-SLSDH-PSSLEIRDMVIGCGHSNKGLFQGAGG---LLGLGQGALSFPSQL-RSSP 324
Query: 157 IQNSFSICFDEND-----SGSVFFGDQGPATQQ--STSFLPIGEKYDA----YFVGVESY 205
I SFS C + S ++ FG ++ F P ++ Y++G++
Sbjct: 325 IGQSFSYCLVDRTNNLSVSSAISFGAGFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGI 384
Query: 206 CIGNSCLTQSGFQ----------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG 255
I L + ++DSG + T+L + Y V F +S R
Sbjct: 385 KIDQELLPIPAERFAIAPNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARISYPRAD-PF 443
Query: 256 NSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYG 315
+ CYNA+ + P + ++F + + + CL ++ TDG
Sbjct: 444 DILGICYNATGRTAVPFPTLSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDG-MS 502
Query: 316 IIGQNFMMGHRIVFDRENLKLAWSHSKC 343
IIG ++D ++ +L ++++ C
Sbjct: 503 IIGNFQQQNIHFLYDVQHARLGFANTDC 530
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 77/324 (23%), Positives = 139/324 (42%), Gaps = 42/324 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDP-CPYIADYSTEDTS-SSGYLVDDILH 99
+DP SSS+ +++SCS C K +SC + C Y YS D S +SG + D +
Sbjct: 134 FDPKSSSTYRDISCSTKQCDLLKEGASCSGEGNKTCHY--SYSYGDRSFTSGNVAADTIT 191
Query: 100 LASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP-SLLAKAG-LI 157
L S S + + IIGCG GS+ + + + P SL+++ G I
Sbjct: 192 LGSTSG---RPVLLPKAIIGCGHNNGGSFTEKGSGIVGL------GGGPISLISQLGSTI 242
Query: 158 QNSFSICF-----DENDSGSVFFGDQGPATQQSTSFLP-IGEKYDA-YFVGVESYCIGN- 209
FS C + +S + FG G + P I + D YF+ +E+ +G+
Sbjct: 243 DGKFSYCLVPLSSNATNSSKLNFGSNGIVSGGGVQSTPLISKDPDTFYFLTLEAVSVGSE 302
Query: 210 ------SCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
S S ++DSG + T P + ++E+ V+ + CY+
Sbjct: 303 RIKFPGSSFGTSEGNIIIDSGTTLTLFPEDFFSELSSAVQDAVAGTPVEDPSGILSLCYS 362
Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPE-NEGFTVFCLTVMSTDGDYGIIGQ-NF 321
++ LK P + F + + V N + +F + ++ F +++ +G + Q NF
Sbjct: 363 IDAD--LKFPSITAHF--DGADVKLNPLNTFVQVSDTVLCFAFNPINSGAIFGNLAQMNF 418
Query: 322 MMGHRIVFDRENLKLAWSHSKCEE 345
++G +D E +++ + C +
Sbjct: 419 LVG----YDLEGKTVSFKPTDCTQ 438
>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 439
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 76/298 (25%), Positives = 123/298 (41%), Gaps = 35/298 (11%)
Query: 67 SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTG 126
+ CK+ Y Y + TS Y D + S V G GR G
Sbjct: 154 TQCKACTVENNYNMTYGDDSTSVGNYGCDTMT--------LEPSDVFQKFQFGRGRNNKG 205
Query: 127 SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDENDS-GSVFFGDQGPATQQS 185
+ G+ DG++GLG G +S S A FS C E DS GS+ FG++ AT QS
Sbjct: 206 DF--GSGVDGMLGLGQGQLSTVSQTASK--FNKVFSYCLPEEDSIGSLLFGEK--ATSQS 259
Query: 186 TSF----LPIG----EKYDAYFVGVESYCIGNSCLT--QSGFQA---LVDSGASFTFLPT 232
+S L G ++ YFV + +GN L S F + ++DS T LP
Sbjct: 260 SSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVFASPGTIIDSRTVITRLPQ 319
Query: 233 EIYAEVVVKFDKLVSSKRIS----LQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVR 288
Y+ + F K ++ +S +G+ CYN S + + +P++ L F +
Sbjct: 320 RAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGGADVRLN 379
Query: 289 --NHIFSFPENEGFTVFCLTVMST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
N ++ E+ F ST + + IIG + +++D + ++ + + C
Sbjct: 380 GTNIVWGSDESRLCLAFAGNSKSTMNPELTIIGNRQQLSLTVLYDIQGGRIGFRSNGC 437
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 81/343 (23%), Positives = 128/343 (37%), Gaps = 57/343 (16%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
D+ + P+ S++ + V C PLC + + Y ++ S++G L +
Sbjct: 128 DQPTPYFRPARSATYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASE-- 185
Query: 99 HLASFSKHAPQSS--VQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
+F+ A SS + S V GCG +G + + G++GLG G +S+ S L +
Sbjct: 186 ---TFTFGAANSSKVMVSDVAFGCGNINSGQLANSS---GMVGLGRGPLSLVSQLGPS-- 237
Query: 157 IQNSFSIC---FDENDSGSVFFG----------DQGPATQQSTSFLPIGEKYDAYFVGVE 203
FS C F + + FG + QST + YF+ ++
Sbjct: 238 ---RFSYCLTSFLSPEPSRLNFGVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLK 294
Query: 204 SYCIGNSCLTQSGF----------QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 253
+G L +DSG S T+L + Y V +LVS R
Sbjct: 295 GISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDAYDAVR---RELVSVLRPLP 351
Query: 254 QGNSWK------YCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPEN----EGFTVF 303
N + + + + VPDM L F + V PEN +G T F
Sbjct: 352 PTNDTEIGLETCFPWPPPPSVAVTVPDMELHFDGGANMTVP------PENYMLIDGATGF 405
Query: 304 CLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
M GD IIG I++D N L++ + C V
Sbjct: 406 LCLAMIRSGDATIIGNYQQQNMHILYDIANSLLSFVPAPCNIV 448
>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 486
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 73/332 (21%), Positives = 131/332 (39%), Gaps = 36/332 (10%)
Query: 47 PSSSSSSKNVSCSHPLCKSRSSCKSL--KDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
PS+SS+ V C C++ SS S C Y+ Y + + +SG L + ++ +
Sbjct: 157 PSASSTYGRVGCDTKACRALSSAASCSPDGSCEYLYSYG-DGSRASGQLSTETFTFSTIA 215
Query: 105 KHAPQSSVQ--------------SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL 150
+ +S + + GC TG++ DG++GLG G VS+ S
Sbjct: 216 DSSKTNSHGNNNNNSSSHGQVEIAKLDFGCSTTTTGTF----RADGLVGLGGGPVSLASQ 271
Query: 151 LAKAGLIQNSFSICF----DENDSGSVFFGDQGPATQQSTSFLPI--GEKYDAYFVGVES 204
L + FS C + N S ++ FG + ++ + P+ GE Y + ++S
Sbjct: 272 LGATTSLGRKFSYCLAPYANTNASSALNFGSRAVVSEPGAASTPLITGEVETYYTIALDS 331
Query: 205 YCIGNSCLTQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY 262
+ + + QA +VDSG + T+L + + +V + + R CY
Sbjct: 332 INVAGTKRPTTAAQAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKLPRAESPEKILDLCY 391
Query: 263 NAS---SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQ 319
+ S E+ L +PD+ L+ ++ EG L S I+G
Sbjct: 392 DISGVRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVVQEGVLCLALVATSERQSVSILGN 451
Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVIDKSH 351
+ +D E + ++ + C KSH
Sbjct: 452 IAQQNLHVGYDLEKGTVTFAAADCA----KSH 479
>gi|348685429|gb|EGZ25244.1| pepsin-like aspartic protease A1 [Phytophthora sojae]
Length = 467
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 68/328 (20%), Positives = 136/328 (41%), Gaps = 44/328 (13%)
Query: 51 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSKHAPQS 110
S + S P C + C++ K C Y Y E S Y D++ L+
Sbjct: 117 SMTLQTSWGEPACMA---CENGK--CKYGQTY-VEGDHWSAYKASDMMQLSP-------- 162
Query: 111 SVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS-FSICFDEND 169
S ++ + GC +Q+G +LD + DG+MG S+ + + + FS C E
Sbjct: 163 SFEARIEFGCIYEQSGVFLDQPS-DGIMGFSRHPDSIFEQFYRQKVTHSRIFSQCLTEGG 221
Query: 170 SGSVFFGDQGPATQQSTSFLPIGEK-YDAYFVGVESYCIGNSCLT--------QSGFQAL 220
G + + P+ Y + V ++S +GN T + +
Sbjct: 222 GMLTIGGVDLTRHTEPVRYTPLRSTGYQYWTVTLQSVSVGNQSNTLQVDTYEYNADRGCV 281
Query: 221 VDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFS 280
+DSG +F ++P + + + V S Q +++ Y+ + +++ +PD+
Sbjct: 282 LDSGTTFLYMPERTKEPFRLAWSRAVGSFSYIPQSDTF---YSMTPDQVAALPDI----- 333
Query: 281 KNQSFVVRNHI-FSFPENEGFT-----VFCLTVMSTDGDYG-IIGQNFMMGHRIVFDREN 333
F ++N + P + F V+ T+ + G I+G + + GH I++D +N
Sbjct: 334 ---CFWLKNDVHICLPPSRYFAQVGDGVYTGTIFFSPGPRATILGASVLEGHDIIYDVDN 390
Query: 334 LKLAWSHSKCEEVIDKSHVHLVPPPAGQ 361
++ + + C++ + ++ V L P G+
Sbjct: 391 NRVGIAEAMCDQPM-QAAVELSLDPGGE 417
>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 373
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 74/323 (22%), Positives = 135/323 (41%), Gaps = 42/323 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSR-------SSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
++ SSSS+ + V CS +C S C +D C Y Y++ + S+GYL D
Sbjct: 69 FNTSSSSTYRRVGCSAQVCHDMHVSQNIPSGCVEEEDSCIYSLRYASGEY-SAGYLSQDR 127
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
L LA+ S+Q I GCG + + +G + G++G G S + +A+
Sbjct: 128 LTLAN------SYSIQ-KFIFGCG---SDNRYNGHSA-GIIGFGNKSYSFFNQIAQL-TN 175
Query: 158 QNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSGF 217
++FS CF N F GP + S + + + +D Y + Y + + +G
Sbjct: 176 YSAFSYCFPSNQENEGFL-SIGPYVRDSNKLI-LTQLFD-YGAHLPVYALQQFDMMVNGM 232
Query: 218 Q------------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCY--N 263
+ +VDSG TF+ + ++ + K + ++ +S + C+ N
Sbjct: 233 RLQVDPPVYTTRMTVVDSGTVETFVLSPVFRALDRALTKAMVAEGYVRGSDSKEICFHSN 292
Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG---DYGIIGQN 320
S + K+P + + FS++ + ++F + ++G C T D I+G
Sbjct: 293 GDSVDWSKLPVVEIKFSRSILKLPAENVFYYETSDG--SICSTFQPDDAGVPGVQILGNR 350
Query: 321 FMMGHRIVFDRENLKLAWSHSKC 343
R+VFD + + C
Sbjct: 351 ATRSFRVVFDIQQRNFGFEAGAC 373
>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
Length = 537
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 80/345 (23%), Positives = 118/345 (34%), Gaps = 55/345 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLV---DDILHLA 101
Y P+ SSS + + CS C PY S S Y D + +
Sbjct: 190 YRPAKSSSWRRIRCSQKECAV----------LPYNTCQSPSKAESCSYFQKTQDGTVTIG 239
Query: 102 SFSKHAPQSSVQSS-------VIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
+ K +V +I+GC + G +D A DGV+ LG GD+S AK
Sbjct: 240 IYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVD--AHDGVLSLGNGDMSFAVHAAKR 297
Query: 155 GLIQNSFSICF-----DENDSGSVFFGDQ----GPATQQSTSFLPI------GEKYDAYF 199
FS C + S + FG GP T ++ + G K
Sbjct: 298 --FGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAKVTGVL 355
Query: 200 VGVESYCIGNSCLTQSGF---QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGN 256
VG E I + F ++D+ S T L E YA V D+ +S +
Sbjct: 356 VGGERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVYELE 415
Query: 257 SWKYCYN-------ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMS 309
++YCY + +P + + PE E V CL
Sbjct: 416 GFEYCYKWTFTGDGVXPAHNVTIPSFTVEMAGGARLEPEAKSVVMPEVEP-GVACLAFRK 474
Query: 310 -TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVH 353
G GI+G FM + D + K+ + KC + H+H
Sbjct: 475 LLRGGPGILGNVFMQEYIWEIDHGDGKIRFRKDKC----NTHHLH 515
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 77/331 (23%), Positives = 136/331 (41%), Gaps = 53/331 (16%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP SSS V CS LC + RS+C KD C Y+ Y + +S+ G L +
Sbjct: 149 FDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDACEYLYTYG-DYSSTRGLLATETFTFED 207
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
++S+ S + GCG + G DG + G++GLG G +S+ S L + F
Sbjct: 208 ------ENSI-SGIGFGCGVENEG---DGFSQGSGLVGLGRGPLSLISQLK-----ETKF 252
Query: 162 SICF----DENDSGSVFFGD-------------QGPATQQSTSFLPIGEKYDAYFVGVES 204
S C D S S+F G G T ++ S L ++ Y++ ++
Sbjct: 253 SYCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVT-KTMSLLRNPDQPSFYYLELQG 311
Query: 205 YCIGNSCLT--QSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 254
+G L+ +S F+ ++DSG + T+L + + +F +S
Sbjct: 312 ITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSG 371
Query: 255 GNSWKYCYN-ASSEEMLKVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG 312
C+ + + + VP M F + N++ + + V CL + S++G
Sbjct: 372 STGLDLCFKLPDAAKNIAVPKMIFHFKGADLELPGENYMVA---DSSTGVLCLAMGSSNG 428
Query: 313 DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
I G ++ D E +++ ++C
Sbjct: 429 -MSIFGNVQQQNFNVLHDLEKETVSFVPTEC 458
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 77/317 (24%), Positives = 129/317 (40%), Gaps = 49/317 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIA----DYSTEDTSSSGYLVDDILHL 100
+ P S++ KNVSC+ P CK + +P ++ + + +S + LV D + L
Sbjct: 132 FAPEKSTTFKNVSCAAP------ECKQVPNPGCGVSSRNFNLTYGSSSIAANLVQDTITL 185
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
A + S GC K TG+ A P G++GLG G +S+ S L Q++
Sbjct: 186 A--------TDPVPSYTFGCVSKTTGT---SAPPQGLLGLGRGPLSLLS--QTQNLYQST 232
Query: 161 FSICFDE----NDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
FS C N SGS+ G P + T L + Y+V +E+ +G +
Sbjct: 233 FSYCLPSFKSLNFSGSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIP 292
Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
+G + DSG FT L +Y V +F + V K + CYN
Sbjct: 293 PAALAFNPTTGAGTIFDSGTVFTRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYNVP 352
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM----STDGDYGIIGQNF 321
+ VP + IF+ + +++I + + CL + + + +I
Sbjct: 353 ----IVVPTITFIFTGMNVTLPQDNILI--HSTAGSTTCLAMAGAPDNVNSVLNVIANMQ 406
Query: 322 MMGHRIVFDRENLKLAW 338
HR+++D N + W
Sbjct: 407 QQNHRVLYDVPNSR-GW 422
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 78/323 (24%), Positives = 121/323 (37%), Gaps = 43/323 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS----------RSSCKSLKDPCPYIADYSTEDTSSSGYLV 94
+DPS SS+ + C+ CK ++ + C Y +Y + G
Sbjct: 169 FDPSKSSTFATIPCASDACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYG-NGAITEGVYS 227
Query: 95 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
+ L L S S+V S GCG Q G Y DG++GLG S+ S A
Sbjct: 228 TETLALGS-------SAVVKSFRFGCGSDQHGPY---DKFDGLLGLGGAPESLVSQTAS- 276
Query: 155 GLIQNSFSICFDENDSGSVFFGDQGP-ATQQSTS---FLPIG----EKYDAYFVGVESYC 206
+ +FS C +SG+ F P +T S S F P+ + Y V +
Sbjct: 277 -VYGGAFSYCLPPLNSGAGFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGIS 335
Query: 207 IGNSCL--TQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYC 261
+G L + F +VDSG T +PT Y + F ++ + +S C
Sbjct: 336 VGGKALDIPPAVFAKGNIVDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPADSALDTC 395
Query: 262 YNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVM-STDGDYGIIGQN 320
YN + + VP + L F + + E+ CL + DG +GIIG
Sbjct: 396 YNFTGHGTVTVPKVALTFVGGATVDLDVPSGVLVED------CLAFADAGDGSFGIIGNV 449
Query: 321 FMMGHRIVFDRENLKLAWSHSKC 343
+++D L + C
Sbjct: 450 NTRTIEVLYDSGKGHLGFRAGAC 472
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 75/318 (23%), Positives = 128/318 (40%), Gaps = 41/318 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCK-------SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
+DP +SS+ +V CS C + S+C S + C Y A Y + + S G L D
Sbjct: 177 FDPRASSTYASVRCSASQCDELQAATLNPSAC-SASNVCIYQASYG-DSSFSVGSLSTDT 234
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
+ S + S GCG+ G + A G++GL +S+ LA + +
Sbjct: 235 VSFGS--------TRYPSFYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--L 281
Query: 158 QNSFSICFDENDSGSVFFGDQGP-ATQQSTSFLPIGEK-YDA--YFVGVESYCIGNSCLT 213
SFS C + S + GP T S+ P+ DA YF+ + +G S L
Sbjct: 282 GYSFSYCLPT--AASTGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLA 339
Query: 214 -----QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
S ++DSG T LPT ++ + + ++ + + + C+ + +
Sbjct: 340 VSPSEYSSLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQ 399
Query: 269 MLKVPDMRLIFSKNQS--FVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
L+VP + + F+ S RN + ++ CL TD IIG
Sbjct: 400 -LRVPTVAMAFAGGASMKLTTRNVLIDVDDS----TTCLAFAPTDST-AIIGNTQQQTFS 453
Query: 327 IVFDRENLKLAWSHSKCE 344
+++D ++ +S C
Sbjct: 454 VIYDVAQSRIGFSAGGCS 471
>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
Length = 501
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 81/339 (23%), Positives = 131/339 (38%), Gaps = 54/339 (15%)
Query: 39 DRNLSEYDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDD 96
D++ +DP +S S V C+ PLC+ S C + C Y Y + + ++G +
Sbjct: 183 DQSGQMFDPRASHSYGAVDCAAPLCRRLDSGGCDLRRKACLYQVAYG-DGSVTAGDFATE 241
Query: 97 ILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGL 156
L AS ++ P+ V +GCG G ++ A ++GLG G +S PS +++
Sbjct: 242 TLTFASGAR-VPR------VALGCGHDNEGLFVAAAG---LLGLGRGSLSFPSQISR--R 289
Query: 157 IQNSFSICFDE---------NDSGSVFFGDQGPATQQSTSFLPIGEK---YDAYFVGVES 204
SFS C + + S +V FG P GE+ D
Sbjct: 290 FGRSFSYCLVDRTSSSASATSRSSTVTFGSGARGALGRRVLHPDGEEPQDGDVLLRAAHG 349
Query: 205 YCIGNSCL-------------TQSGFQALVDSGASFTFLPTEIYAEV------VVKFDKL 245
+ T G +VDSG P+ +A +
Sbjct: 350 HQRRRRARPGRGRVRPPPDPSTGRG-GVIVDSG-----RPSPAWARAGRTPPCATRSRAA 403
Query: 246 VSSKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFC 304
+ R+S G S + CY+ S +++KVP + + F+ + + P + T FC
Sbjct: 404 AAGLRLSPGGFSLFDTCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGT-FC 462
Query: 305 LTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
TDG IIG G R+VFD + +L + C
Sbjct: 463 FAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRLGFVPKGC 501
>gi|242092874|ref|XP_002436927.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
gi|241915150|gb|EER88294.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
Length = 484
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/131 (25%), Positives = 62/131 (47%), Gaps = 3/131 (2%)
Query: 215 SGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPD 274
+G +++ +FT+L ++YA + +F K +S ++ S CYN ++ VP
Sbjct: 355 AGGGTILELHTTFTYLKPKVYAALRDEFRKSMSQYPVAPPQGSLDTCYNFTALSSYSVPA 414
Query: 275 MRLIFSKNQSF-VVRNHIFSFPE-NEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRE 332
+ L F F + + + FPE F+V CL ++ DG +IG M +V+D
Sbjct: 415 VTLKFDGGAEFDLWIDEMMYFPEPGSYFSVGCLAFVAQDGG-AVIGSMAQMSTEVVYDVR 473
Query: 333 NLKLAWSHSKC 343
K+ + +C
Sbjct: 474 GGKVGFVPYRC 484
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 77/331 (23%), Positives = 136/331 (41%), Gaps = 53/331 (16%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP SSS V CS LC + RS+C KD C Y+ Y + +S+ G L +
Sbjct: 41 FDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDACEYLYTYG-DYSSTRGLLATETFTFED 99
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
++S+ S + GCG + G DG + G++GLG G +S+ S L + F
Sbjct: 100 ------ENSI-SGIGFGCGVENEG---DGFSQGSGLVGLGRGPLSLISQLK-----ETKF 144
Query: 162 SICF----DENDSGSVFFGD-------------QGPATQQSTSFLPIGEKYDAYFVGVES 204
S C D S S+F G G T ++ S L ++ Y++ ++
Sbjct: 145 SYCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVT-KTMSLLRNPDQPSFYYLELQG 203
Query: 205 YCIGNSCLT--QSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 254
+G L+ +S F+ ++DSG + T+L + + +F +S
Sbjct: 204 ITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSG 263
Query: 255 GNSWKYCYN-ASSEEMLKVPDMRLIFS-KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG 312
C+ + + + VP M F + N++ + + V CL + S++G
Sbjct: 264 STGLDLCFKLPDAAKNIAVPKMIFHFKGADLELPGENYMVA---DSSTGVLCLAMGSSNG 320
Query: 313 DYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
I G ++ D E +++ ++C
Sbjct: 321 -MSIFGNVQQQNFNVLHDLEKETVSFVPTEC 350
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 78/314 (24%), Positives = 124/314 (39%), Gaps = 36/314 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKS--LKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP+ S++ SCS C + L C YI Y + ++++G D L L +
Sbjct: 174 FDPAKSATYSAFSCSSAQCAQLGGEGNGCLNSHCQYIVKY-VDHSNTTGTYGSDTLGLTT 232
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAK-AGLIQNSF 161
S + GC + G DG+MGLG GD SL+++ A +F
Sbjct: 233 -------SDAVKNFQFGCSHRANGFV---GQLDGLMGLG-GDTE--SLVSQTAATYGKAF 279
Query: 162 SICFDENDSGSVFFGDQGPATQQSTS----FLPIGEKYDAYFVGV--ESYCIGNSCLT-- 213
S C + S + F G A ++S P+ F GV ++ + + L
Sbjct: 280 SYCLPPSSSSAGGFLTLGAAAGGTSSSRYSRTPLVRFNVPTFYGVFLQAITVAGTKLNVP 339
Query: 214 QSGFQ--ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLK 271
S F ++VDSG T LP Y + F K + + + C++ S + ++
Sbjct: 340 ASVFSGASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGILDTCFDFSGIKTVR 399
Query: 272 VPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCL--TVMSTDGDYGIIGQNFMMGHRIVF 329
VP + L FS R + + F CL T + DGD GI+G ++F
Sbjct: 400 VPVVTLTFS-------RGAVMDLDVSGIFYAGCLAFTATAQDGDTGILGNVQQRTFEMLF 452
Query: 330 DRENLKLAWSHSKC 343
D L + C
Sbjct: 453 DVGGSTLGFRPGAC 466
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 76/351 (21%), Positives = 134/351 (38%), Gaps = 63/351 (17%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSR----SSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+D +S ++ V CS P+C S S C + C Y+ DY+ + + +SG +V+D
Sbjct: 142 FDALASQTTLAVPCSDPICTSGKYPLSGCTFNDNTCFYLYDYA-DKSITSGRIVED---- 196
Query: 101 ASFSKHAPQSSVQS---------SVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
+F+ +PQ + S +V GCG+ G + + G+ G G +S+PS L
Sbjct: 197 -TFTFRSPQGNNGSKAHAGVAVPNVRFGCGQYNKGIFKSNES--GIAGFSRGPMSLPSQL 253
Query: 152 AKAGLIQNSFSICFDENDSGSVFFGDQGP--------ATQQSTSFLPIGEKYDAYFVGVE 203
K + F+ D S G GP QST F Y++ ++
Sbjct: 254 -KVARFSHCFTAIADARTSPVFLGGAPGPDNLGAHATGPVQSTPF--ANSNGSLYYLTLK 310
Query: 204 SYCIGNSCLTQSGFQ------------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRI 251
+G + L + ++DSG LP +Y + F V+ ++
Sbjct: 311 GITVGKTRLPLNALAFAGKGTGSGSGGTIIDSGTGIRTLPGPMYRSLRAAF---VARVKL 367
Query: 252 SLQGNSW-----KYCYNASSEEMLKVPDMRLIFSK--------NQSFVVRNHIFSFPENE 298
+ S C+ A+ L K + +++ E+E
Sbjct: 368 PVANESAADAESTLCFEAARSASLPPEAPAPALPKVVLHVAGADWDLPRESYVLDLLEDE 427
Query: 299 --GFTVFCLTVMST-DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+ CL + S D D IIG + +D E KL + ++C+++
Sbjct: 428 DGSGSGLCLVMNSAGDSDLTIIGNFQQQNMHVAYDLEKNKLVFVPARCDKM 478
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 80/334 (23%), Positives = 134/334 (40%), Gaps = 58/334 (17%)
Query: 45 YDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+ P+SSS+ + C+ C+ S +C + C Y Y + T+ GYL + L +
Sbjct: 128 FQPASSSTFSKLPCTSSFCQFLPNSIRTCNATG--CVYNYKYGSGYTA--GYLATETLKV 183
Query: 101 --ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
ASF SV GC + G + G+ GLG G +S L+ + G+
Sbjct: 184 GDASFP----------SVAFGCSTENG----VGNSTSGIAGLGRGALS---LIPQLGV-- 224
Query: 159 NSFSICFDENDSGS---VFFGDQGPATQ---QSTSFL---PIGEKYDAYFVGVESYCIGN 209
FS C + + FG T QST F+ + Y Y+V + +G
Sbjct: 225 GRFSYCLRSGSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHPSY--YYVNLTGITVGE 282
Query: 210 SCL---------TQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 258
+ L TQ+G +VDSG + T+L + Y V F ++
Sbjct: 283 TDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTANVTTVNGTRGL 342
Query: 259 KYCYNAS-SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE---GFTVFCLTVMSTDGD- 313
C+ ++ + VP + L F + V + F+ E + TV CL ++ GD
Sbjct: 343 DLCFKSTGGGGGIAVPSLVLRFDGGAEYAVPTY-FAGVETDSQGSVTVACLMMLPAKGDQ 401
Query: 314 -YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+IG M +++D + ++S + C +V
Sbjct: 402 PMSVIGNVMQMDMHLLYDLDGGIFSFSPADCAKV 435
>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 481
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 66/269 (24%), Positives = 110/269 (40%), Gaps = 58/269 (21%)
Query: 132 AAPDGVMGLGLGDVSVPSLLAK-AGLIQNSFSICFDENDSGSVFFGDQ------------ 178
A P GV G G G +S+P+ L+ + + N FS C + F GD+
Sbjct: 212 AEPTGVAGFGRGILSLPAQLSTLSPHLGNRFSYCLVSHS----FDGDRLRRPSPLILGRH 267
Query: 179 -----GPATQQSTSFLPI----GEKYDAYF-VGVESYCIGNSCL----------TQSGFQ 218
G +S F+ K+ Y+ VG+ +G + +
Sbjct: 268 NDTITGAGDGESVEFVYTSMLSNPKHPYYYCVGLAGISVGKRTVPAPEILKRVDEKGNGG 327
Query: 219 ALVDSGASFTFLPTEIYAEVVVKFDKLVSS--KRIS--LQGNSWKYCYNASSEEMLKVPD 274
+VDSG +FT LP Y VV +FDK V+ KR S CY + + ++P
Sbjct: 328 MVVDSGTTFTMLPESFYNAVVNEFDKRVNRFHKRASEIETKTGLGPCYYLNG--LSQIPV 385
Query: 275 MRLIFSKNQSFVV---RNHIFSFPE-NEGF----TVFCLTVMSTD-------GDYGIIGQ 319
++L F N S VV +N+ + F + +G V C+ +M+ + G +G
Sbjct: 386 LKLHFVGNNSDVVLPRKNYFYEFMDGGDGIRRKGKVGCMMLMNGEDETELDGGPGATLGN 445
Query: 320 NFMMGHRIVFDRENLKLAWSHSKCEEVID 348
G +V+D E ++ ++ +C + D
Sbjct: 446 YQQQGFEVVYDLEKERVGFAKKECALLWD 474
>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 469
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 77/353 (21%), Positives = 136/353 (38%), Gaps = 73/353 (20%)
Query: 42 LSEYDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDP--------CP-YIADYSTEDTS 88
+ + P +SSSSK + C P C+ C+ DP CP YI Y S
Sbjct: 137 IPRFIPKNSSSSKIIGCQSPKCQFLYGPNVQCRGC-DPNTRNCTVGCPPYILQYGLG--S 193
Query: 89 SSGYLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVP 148
++G L+ + L + ++GC S + P G+ G G G VS+P
Sbjct: 194 TAGVLITEKLDFPDLT--------VPDFVVGC------SIISTRQPAGIAGFGRGPVSLP 239
Query: 149 SLLAKAGLIQNSFSICFDEN--------DSGSVFFGDQGPATQQSTSFLPIGEK------ 194
S + S FD+ D+GS G + ++ P +
Sbjct: 240 SQMNLKRFSHCLVSRRFDDTNVTTDLDLDTGS---GHNSGSKTPGLTYTPFRKNPNVSNK 296
Query: 195 --YDAYFVGVESYCIGNSCL----------TQSGFQALVDSGASFTFLPTEIYAEVVVKF 242
+ Y++ + +G + T ++VDSG++FTF+ ++ V +F
Sbjct: 297 AFLEYYYLNLRRIYVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEF 356
Query: 243 DKLVS--SKRISLQGNS-WKYCYNASSEEMLKVPDMRLIFSKNQSFVVR-NHIFSFPENE 298
+S ++ L+ + C+N S + + VP++ F + ++ F+F N
Sbjct: 357 ASQMSNYTREKDLEKETGLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNT 416
Query: 299 GFTVFCLTVMSTD--------GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
CLTV+S G I+G + + +D EN + ++ KC
Sbjct: 417 --DTVCLTVVSDKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 78/320 (24%), Positives = 132/320 (41%), Gaps = 45/320 (14%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DPS SS+ K C + CPY Y+ E + S+G L + + + S S
Sbjct: 103 FDPSKSSTFKEKRCH-------------GNSCPYEIIYADE-SYSTGILATETVTIQSTS 148
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDG--AAPDGVMGLGLGDVSVPSL--LAKAGLIQNS 160
+ V + IGCG + G A+ G++GL +G S+ S L GLI
Sbjct: 149 G---EPFVMAETSIGCGLNNSNLMTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLI--- 202
Query: 161 FSICFDENDSGSVFFGDQ----GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
S CF + + FG G T + F+ + + Y++ +++ +G+ + G
Sbjct: 203 -SYCFSSQGTSKINFGTNAVVAGDGTVAADMFIKKDQPF--YYLNLDAVSVGDKRIETLG 259
Query: 217 --FQA-----LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK--YCYNASSE 267
F A +DSG ++T+LPT Y +V + + S + CYN +
Sbjct: 260 TPFHAQDGNIFIDSGTTYTYLPTS-YCNLVREAVAASVVAANQVPDPSSENLLCYNWDTM 318
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRI 327
E+ P + L F+ V+ + G T FCL + D I N + +
Sbjct: 319 EIF--PVITLHFAGGADLVLDKYNMYVETITGGT-FCLAIGCVDPSMPAIFGNRAHNNLL 375
Query: 328 V-FDRENLKLAWSHSKCEEV 346
V +D L +++S + C +
Sbjct: 376 VGYDSSTLVISFSPTNCSAL 395
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 64/317 (20%), Positives = 127/317 (40%), Gaps = 37/317 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
++P SSS + CS LC++ S + C Y Y + + + G + + L S S
Sbjct: 137 FNPQGSSSFSTLPCSSQLCQALQSPTCSNNSCQYTYGYG-DGSETQGSMGTETLTFGSVS 195
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
++ GCG G A G++G+G G +S+PS L FS C
Sbjct: 196 I--------PNITFGCGENNQGFGQGNGA--GLVGMGRGPLSLPSQLDVT-----KFSYC 240
Query: 165 F---DENDSGSVFFG---DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------ 212
++S ++ G + A +T+ + + Y++ + +G++ L
Sbjct: 241 MTPIGSSNSSTLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSV 300
Query: 213 ----TQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
+ +G ++DSG + T+ Y V F ++ ++ + + C+ S+
Sbjct: 301 FKLNSNNGTGGIIIDSGTTLTYFVDNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSD 360
Query: 268 EM-LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
+ L++P + F + + F P N + CL + S+ I G
Sbjct: 361 QSNLQIPTFVMHFDGGDLVLPSENYFISPSNG---LICLAMGSSSQGMSIFGNIQQQNLL 417
Query: 327 IVFDRENLKLAWSHSKC 343
+V+D N +++ ++C
Sbjct: 418 VVYDTGNSVVSFLSAQC 434
>gi|452820752|gb|EME27790.1| aspartyl protease [Galdieria sulphuraria]
Length = 559
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 69/339 (20%), Positives = 131/339 (38%), Gaps = 58/339 (17%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSR-------SSCKS--------LKDPCPYIADYSTEDT 87
S+Y S S V C+ PLC S S C S + C + Y
Sbjct: 161 SKYSSHLQSKSSIVGCNDPLCSSNICEALGCSECSSSGACCANKMPQACGFFLRYGDGSG 220
Query: 88 SSSGYLVDDI-LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLG---LG 143
+ LVD + + ASF H G + + + ++ DG++G+G LG
Sbjct: 221 AEGALLVDQVQVGNASFVAHFG------------GILEDTTNFEQSSVDGILGMGYPALG 268
Query: 144 ------DVSVPSLLAKAGLIQNSFSICFDENDSGSVFFGDQGPATQQSTSFLPIGEKYDA 197
+ + S+ ++ + QN FS+C V G + +F+P+
Sbjct: 269 CTPSCIEPLIDSMFRQSKIEQNMFSLCISVRGGHLVLGGYDSNMAASNITFVPMILSSPP 328
Query: 198 YFVGVE---SYCIGNSCLTQSGF-QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL 253
F V S + N L+ GF + +VDSG + + + + ++ + + +
Sbjct: 329 TFYAVSLGGSIRVDNEELSLDGFDKGIVDSGTTLLVISEQAF----IQLKNYLQTHYCQV 384
Query: 254 QG-----NSW---KYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFP-ENEGFTVFC 304
G +SW C + +P + + + ++ + + + GF+++C
Sbjct: 385 PGLCDYQHSWFDSASCVILEESHLQHLPTLTIHVANRVDLILTPYDYMLQVQRNGFSLYC 444
Query: 305 LTVM---STDGD-YGIIGQNFMMGHRIVFDRENLKLAWS 339
L + S DG + I+G M + +FDR N ++ ++
Sbjct: 445 LGIQSLPSKDGSPFVILGNTVMTKYLTIFDRRNHRIGFA 483
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 60/251 (23%), Positives = 102/251 (40%), Gaps = 28/251 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS-------CKSLKDPCPYIADYSTEDTSSSGYLVDDI 97
+DPS S+S N++C+ LC S+ C + C Y Y + + S GY +
Sbjct: 189 FDPSKSTSYSNITCTSALCTQLSTATGNDPGCSASTKACIYGIQYG-DSSFSVGYFSRER 247
Query: 98 LHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI 157
L + + + V + + GCG+ G + A G++GLG +S + A
Sbjct: 248 LTVTA-------TDVVDNFLFGCGQNNQGLFGGSA---GLIGLGRHPISF--VQQTAAKY 295
Query: 158 QNSFSICFDENDS--GSVFFGDQGPATQ-QSTSFLPIGEKYDAYFVGVESYCIGNSCL-- 212
+ FS C S G + FG + T F I Y + + + +G L
Sbjct: 296 RKIFSYCLPSTSSSTGHLSFGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPV 355
Query: 213 ---TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM 269
T S A++DSG T LP Y + F + +S + + + CY+ S ++
Sbjct: 356 SSSTFSTGGAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSILDTCYDLSGYKV 415
Query: 270 LKVPDMRLIFS 280
+P + F+
Sbjct: 416 FSIPTIEFSFA 426
>gi|50552716|ref|XP_503768.1| YALI0E10175p [Yarrowia lipolytica]
gi|49649637|emb|CAG79359.1| YALI0E10175p [Yarrowia lipolytica CLIB122]
Length = 534
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 71/312 (22%), Positives = 134/312 (42%), Gaps = 63/312 (20%)
Query: 135 DGVMGLGLGDV------------------SVPSLLAKAGLIQ-NSFSICFDE--NDSGSV 173
+GVMG+GL + ++P + GLI+ N++S+ + +DSG+V
Sbjct: 192 NGVMGIGLAGLESTITYRGNDQISGNPYENLPMKMKAEGLIKANAYSLWLNNLSSDSGNV 251
Query: 174 FFGDQGPAT-------------QQSTSFLPIGEKYDAYFVGVESYCIGN-----SCLTQS 215
FG A Q+S S PI A++VG++S I + +T+
Sbjct: 252 LFGGVDYAKIDGDLFTVKLVNPQRSVSSKPI-----AFYVGLDSVSITDVKGVSGFITKQ 306
Query: 216 GFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISL----QGNSWKYCYNASSEEMLK 271
AL+DSG + T+LP + + VV + + G S YN S +
Sbjct: 307 PVPALLDSGTTLTYLPQDAFNYVVRAMGATYDPQNGYVCPCKNGYSGHLDYNFSGAN-IS 365
Query: 272 VPDMRLIFS---KNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
VP +L + ++QS V N F ++ CL +M D+ I+G +F+ +V
Sbjct: 366 VPLYQLTYPIQLQSQSGRVVNAQFRNGDDA-----CLLLMQASQDHVILGDSFLRAAYVV 420
Query: 329 FDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTS-NGQAA---APPS 384
++ ++ +++ +K + +++ + ++ NP P T+ N + P
Sbjct: 421 YNLDSYEVSMGQTKYG--VTDTNIVEIDSNGVKNANPAPEYSSSFTNVNSETTILRGAPG 478
Query: 385 TAKTAPSKSIAA 396
+A + PS +++
Sbjct: 479 SADSNPSTTLSG 490
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 66/317 (20%), Positives = 126/317 (39%), Gaps = 37/317 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
++P SSS + CS LC++ SS + C Y Y + + + G + + L S S
Sbjct: 137 FNPQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYTYGYG-DGSETQGSMGTETLTFGSVS 195
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
++ GCG G A G++G+G G +S+PS L FS C
Sbjct: 196 I--------PNITFGCGENNQGFGQGNGA--GLVGMGRGPLSLPSQLDVT-----KFSYC 240
Query: 165 FDENDSGS---VFFG---DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--TQSG 216
S + + G + A +T+ + + Y++ + +G++ L S
Sbjct: 241 MTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSA 300
Query: 217 FQ---------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
F ++DSG + T+ Y V +F ++ ++ + + C+ S+
Sbjct: 301 FALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSD 360
Query: 268 -EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
L++P + F + + F P N + CL + S+ I G
Sbjct: 361 PSNLQIPTFVMHFDGGDLELPSENYFISPSNG---LICLAMGSSSQGMSIFGNIQQQNML 417
Query: 327 IVFDRENLKLAWSHSKC 343
+V+D N ++++ ++C
Sbjct: 418 VVYDTGNSVVSFASAQC 434
>gi|18414692|ref|NP_567506.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15809800|gb|AAL06828.1| AT4g16560/dl4305c [Arabidopsis thaliana]
gi|18377815|gb|AAL67094.1| AT4g16560/dl4305c [Arabidopsis thaliana]
gi|332658370|gb|AEE83770.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 68/148 (45%), Gaps = 25/148 (16%)
Query: 222 DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWK------YCYNASSEEMLKVPDM 275
DSG +FT LP + Y VV +FD V R+ + + + CY + + +KVP +
Sbjct: 353 DSGTTFTMLPAKFYNSVVEEFDSRVG--RVHERADRVEPSSGMSPCYYLN--QTVKVPAL 408
Query: 276 RLIFSKNQSFVV---RNHIFSFPE-----NEGFTVFCLTVMS-------TDGDYGIIGQN 320
L F+ N+S V RN+ + F + E + CL +M+ G I+G
Sbjct: 409 VLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEKRKIGCLMLMNGGDESELRGGTGAILGNY 468
Query: 321 FMMGHRIVFDRENLKLAWSHSKCEEVID 348
G +V+D N ++ ++ KC + D
Sbjct: 469 QQQGFEVVYDLLNRRVGFAKRKCASLWD 496
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 83/328 (25%), Positives = 132/328 (40%), Gaps = 65/328 (19%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVD 95
S ++P SSS+ V CS P+C++R+ SC C ++A + TS G L
Sbjct: 101 SVFNPVSSSTYSPVPCSSPICRTRTRDLPIPASCDPKTHLC-HVAISYADATSIEGNLAH 159
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSY-LDGAAPDGVMGLGLGDVSVPSLLAKA 154
+ + S ++ + GC S + A G+MG+ G +S + L +
Sbjct: 160 ETFVIGSVTRPG--------TLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFS 211
Query: 155 GLIQNSFSICFDEND-SGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVE 203
FS C +D SG + GD P QST LP ++ AY V +E
Sbjct: 212 -----KFSYCISGSDSSGFLLLGDASYSWLGPIQYTPLVLQSTP-LPYFDRV-AYTVQLE 264
Query: 204 SYCIGNSCLT--QSGF--------QALVDSGASFTFLPTEIYAEVVVKFD-------KLV 246
+G+ L+ +S F Q +VDSG FTFL +Y + +F +LV
Sbjct: 265 GIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLV 324
Query: 247 SSKRISLQGNSWKYCYNASSE---EMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGF--- 300
QG + CY S +P + L+F + V + G
Sbjct: 325 DDPDFVFQG-TMDLCYKVGSTTRPNFSGLPMVSLMFRGAEMSVSGQKLLYRVNGAGSEGK 383
Query: 301 -TVFCLTVMSTDGDYGIIG-QNFMMGHR 326
V+C T ++D ++G + F++GH
Sbjct: 384 EEVYCFTFGNSD----LLGIEAFVIGHH 407
>gi|194745306|ref|XP_001955129.1| GF16404 [Drosophila ananassae]
gi|190628166|gb|EDV43690.1| GF16404 [Drosophila ananassae]
Length = 463
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 65/307 (21%), Positives = 127/307 (41%), Gaps = 52/307 (16%)
Query: 51 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIA----------DYSTEDTSSSGYLVDDILHL 100
+ S N+ P CKS++ C+S K P + + + S G L +D + +
Sbjct: 170 TGSSNIWVPGPKCKSKA-CRSHKKFHPAKSSTYKKKTKAFEITYGSGSVKGRLAEDTVSI 228
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+ ++ SS + G + + DG++GLG +SV ++ L+QN
Sbjct: 229 GGLTVDNQTFAMTSS--------EPGEAFEESKFDGILGLGYQAISVDNVKT---LMQNM 277
Query: 161 ----------FSICFDENDS----GSVFFGDQGPAT---QQSTSFLPIGEKYDAYFVGVE 203
F+IC + GS+F G++ S + P+ +K + + ++
Sbjct: 278 CSQNVITSCIFAICLRGGGTSAKGGSLFIGNKNTTAYTGSNSYVYTPVTKK-GYWQMKLD 336
Query: 204 SYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
+ +G++ ++ + QA+VDSG S P Y E V + +S G W C
Sbjct: 337 GFYVGSTKVSGTA-QAIVDSGTSLIAAPLHAYKEFVKETGCTPTS-----SGECWVKCSK 390
Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMM 323
+ + + D +++ +++ + +G TV L V + ++ I+G F+
Sbjct: 391 TIPDIVFVIADKKIVIKGDKAKM------KVKTQKGHTVCLLVVTYEETNFWILGDPFLR 444
Query: 324 GHRIVFD 330
+ VFD
Sbjct: 445 NNCAVFD 451
>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
Length = 570
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 76/362 (20%), Positives = 136/362 (37%), Gaps = 66/362 (18%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+++DPS SS+ VSC C++ R++C + C Y+ Y + ++++G L +
Sbjct: 144 TQFDPSRSSTYGRVSCQTDACEALGRATCDDGSN-CAYLYAYG-DGSNTTGVLSTETFTF 201
Query: 101 A-SFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN 159
+ +P+ V GC GS+ G VS+ + L A +
Sbjct: 202 DDGGAGRSPRQVRIGGVKFGCSTATAGSFPADGLVGLGGGA----VSLVTQLGGATSLGR 257
Query: 160 SFSICF---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLTQSG 216
FS C N S ++ FG T+ + P+ +GN + +
Sbjct: 258 RFSYCLVPHSVNASSALNFGALADVTEPGAASTPL---------------VGNKTVASAA 302
Query: 217 F-QALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEM---LKV 272
+ +VDSG + TFL + +V + + ++ + + CYN + E+ +
Sbjct: 303 SSRIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGESI 362
Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTV----FCLTVMST------------------ 310
PD+ L F + ++ PEN V CL +++T
Sbjct: 363 PDLTLEFGGGAAVALK------PENAFVAVQEGTLCLAIVATTEQQPVSILGNLAQQNIH 416
Query: 311 ---DGDYGIIGQNFM---MGHRIVFDRENLKLAWSHSKCEEVIDKSHVHLVPPPAGQSPN 364
D D G +G + RI+ D S ++D+ + PP QSP+
Sbjct: 417 VGYDLDAGTVGNKTVASAASSRIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPV-QSPD 475
Query: 365 PL 366
L
Sbjct: 476 GL 477
>gi|414869114|tpg|DAA47671.1| TPA: hypothetical protein ZEAMMB73_872184 [Zea mays]
Length = 492
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 76/326 (23%), Positives = 127/326 (38%), Gaps = 51/326 (15%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+D S S++ +V C P C S ++C S CP+ + G D+L +
Sbjct: 191 FDTSQSTTFTHVPCDSPDCPSTANC-SAGSVCPFNLFF------VEGTFSQDVLTV---- 239
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDV-----SVPSLLAKAGLIQN 159
AP +VQ + C LD A DG+ +G D+ S+PS L AG
Sbjct: 240 --APSVAVQDFTFV-C--------LDAGASDGMPEVGTLDLSRDRNSLPSRL--AGSASA 286
Query: 160 SFSICFDE--NDSGSVFFGDQGPATQQS-TSFLPIGEKYDA-----YFVGVESYCIGNSC 211
+FS C + + G + GD + T+ P+ D YF+ V +G+
Sbjct: 287 AFSYCMPQYPDSPGFLSLGDDATVRGDNCTAHAPLLSSDDPDLANMYFIDVVGMSLGDVD 346
Query: 212 LT------QSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQG-NSWKYCYNA 264
L + +V++G +FT L + Y + F + ++ S+ G + CYN
Sbjct: 347 LPIPSGTFGNNASTIVEAGTTFTMLAPDAYTPLRDAFRQAMAQYNRSVPGFYDFDTCYNF 406
Query: 265 SSEEMLKVPDMRLIFSKNQSFVVRNH---IFSFPENEGFTVFCLTVMS----TDGDYGII 317
+ + L VP + F S ++ + P FTV CL + D +I
Sbjct: 407 TGLQELTVPLVEFKFGNGDSLLIDGDQMLYYDIPSEGPFTVTCLAFSTLDVDDDDVSAVI 466
Query: 318 GQNFMMGHRIVFDRENLKLAWSHSKC 343
G + +V+D + + C
Sbjct: 467 GAYSLATTEVVYDVAGGTVGFIPESC 492
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 79/320 (24%), Positives = 129/320 (40%), Gaps = 39/320 (12%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
++PS SS+ ++++C LC+ ++ C Y Y D + FS
Sbjct: 123 FNPSFSSTFQSITCGSSLCQQLLIRGCRRNQCLYQVSYG-----------DGSFTVGEFS 171
Query: 105 KHAPQ--SSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFS 162
S+ +SV IGCG G + A ++GLG G +S PS + + L + FS
Sbjct: 172 TETLSFGSNAVNSVAIGCGHNNQGLFTGAAG---LLGLGKGLLSFPSQVGQ--LYGSVFS 226
Query: 163 ICFDENDS-GSV--FFGDQGPATQQSTSFLPIGEKYDAYF--------VGVESYCIGNSC 211
C +S GSV FG+Q A+ + L K D ++ VG S I
Sbjct: 227 YCLPTRESTGSVPLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGS 286
Query: 212 L-----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNAS 265
L T +G ++DSG + T L T Y + F + S G S + CY+ S
Sbjct: 287 LSLDSSTGNG-GVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLS 345
Query: 266 SEEMLKVPDMRLIFSKNQSFVVRNHIFSFP-ENEGFTVFCLTVMSTDGDYGIIGQNFMMG 324
+ +P + +F+ + + P +N G +CL ++ IIG
Sbjct: 346 GRSSIMLPAVSFVFNGGATMALPAQNIMVPVDNSG--TYCLAFAPNSENFSIIGNIQQQS 403
Query: 325 HRIVFDRENLKLAWSHSKCE 344
R+ FD ++ ++C
Sbjct: 404 FRMSFDSTGNRVGIGANQCN 423
>gi|289740593|gb|ADD19044.1| aspartyl protease [Glossina morsitans morsitans]
Length = 394
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 73/316 (23%), Positives = 130/316 (41%), Gaps = 43/316 (13%)
Query: 47 PSSSSSSKNVSC-SHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFSK 105
PS N++C H + S K+ + Y + S SGYL D +++A
Sbjct: 102 PSKQCYFTNIACLMHNKYDANKSSSYKKNGTEFAIHYGS--GSLSGYLSTDTVNIAGLGI 159
Query: 106 HAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSL------LAKAGLI-Q 158
Q+ ++ + G GA DG++GLG ++V + + + GLI Q
Sbjct: 160 EG-QTFAEA-------LSEPGLVFIGAKFDGILGLGYSSIAVDGVKPPFYQMYEQGLISQ 211
Query: 159 NSFSICFDEN----DSGSVFFGDQGPATQQST-SFLPIGEKYDAYF-VGVESYCIGNSCL 212
FS + + + G + FG P + ++LP+ K AY+ + ++S +GN L
Sbjct: 212 PVFSFYLNRDPKAPEGGEIIFGGSDPNHYKGEFTYLPVTRK--AYWQIKMDSASMGNLNL 269
Query: 213 TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKV 272
Q G Q + D+G S LP +K + I + G Y + E + K+
Sbjct: 270 CQGGCQVIADTGTSLIALP----PSEATSINKAIGGTPI-MGGQ-----YMVACENIPKL 319
Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT-VMSTD-----GDYGIIGQNFMMGHR 326
P +R + ++F + + + CL+ M D G I+G F+ +
Sbjct: 320 PVIRFVLG-GKTFELEGKDYILRIAQMGKTICLSGFMGIDIPPPNGPIWILGDVFIGKYY 378
Query: 327 IVFDRENLKLAWSHSK 342
FD N ++ ++ +K
Sbjct: 379 TEFDMGNDRVGFAEAK 394
>gi|408397130|gb|EKJ76280.1| hypothetical protein FPSE_03535 [Fusarium pseudograminearum CS3096]
Length = 467
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 63/303 (20%), Positives = 131/303 (43%), Gaps = 36/303 (11%)
Query: 117 IIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLI-QNSFSICFD--ENDSGSV 173
+IG G + +D P+ + P+ LA G+I N++S+ D E+ +G +
Sbjct: 176 VIGIGYTSNEAVVDQPDPEFYKNM-------PARLASDGVIASNAYSLYLDDLESATGKI 228
Query: 174 FFGDQGPATQQ------STSFLPIGEKYDAYFVGVESYCIGNSCLTQS-GFQALVDSGAS 226
FG G Q + + I ++Y ++V ++S G+ + + ++DSG++
Sbjct: 229 LFG--GVDEQHFIGDLVTVPIMKINDEYSEFYVKLQSINSGSEIVGEDLDLGVVLDSGST 286
Query: 227 FTFLPTE----IYAEVVVKFDKLVSSKRI----SLQGNSWKYCYNASSEEMLKVPDMRLI 278
T+LP IY V +++ ++ + + QG + + + + +E + + ++ L
Sbjct: 287 LTYLPASVTDSIYQLVGADYEEGQTTAYVPCDLANQGGNLTFKFTSPAEITVPLSELILD 346
Query: 279 FSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDRENLKLAW 338
F+ + SF + F + ++ I+G F+ +VFD +N +++
Sbjct: 347 FTD-----ITGRQMSFTNGQAACSFGIAPSTSQ--VSILGDTFLRSAYVVFDLDNNEISL 399
Query: 339 SHSKCEEVIDKSHVHLVPPPAGQSPNPLPTTEQQSTSNGQAAAPPSTAKTAPSKSIAASA 398
+ S E SH+ + P+ + QS+ + AA S ++ + SI A A
Sbjct: 400 AQSNSEAT--GSHILEISKGKNAVPSATGSEGPQSSGSENAAGSLSPLESTGAVSILAGA 457
Query: 399 QQL 401
L
Sbjct: 458 MAL 460
>gi|302763589|ref|XP_002965216.1| hypothetical protein SELMODRAFT_27315 [Selaginella moellendorffii]
gi|300167449|gb|EFJ34054.1| hypothetical protein SELMODRAFT_27315 [Selaginella moellendorffii]
Length = 163
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 61/134 (45%), Gaps = 10/134 (7%)
Query: 220 LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE-------MLKV 272
+ DSG + TFLP +Y +V+ F + ++ ++ CYN S + L
Sbjct: 32 IFDSGTTLTFLPLGVYIQVISVFSRRINLPLVNGTSVGLDLCYNISLQRDYTFPSLALHF 91
Query: 273 PDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDG-DYGIIGQNFMMGHRIVFDR 331
PD + ++ +V + + NE +V CL +MS+ IIG G+ I+FD
Sbjct: 92 PDAWMNLHQDNYIIVPSRADAEAWNE--SVACLAIMSSASIGINIIGNVMQEGYHIMFDN 149
Query: 332 ENLKLAWSHSKCEE 345
E + ++ + C E
Sbjct: 150 EKSTVTFAPASCSE 163
>gi|242076594|ref|XP_002448233.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
gi|241939416|gb|EES12561.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
Length = 508
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 61/263 (23%), Positives = 105/263 (39%), Gaps = 52/263 (19%)
Query: 134 PDGVMGLGLGDVSVPSLLAKAGLIQNSFSICFDEND--------SGSVFFGDQGPATQQS 185
P GV G G G +S+P LA + FS C + + G A ++
Sbjct: 245 PVGVAGFGRGPLSLPGQLAPQ--LSGRFSYCLVSHSFRADRLIRPSPLILGRSPDAAAET 302
Query: 186 TSFL--PI--GEKYDAYF-VGVESYCIGNSCLT---------QSGFQAL-VDSGASFTFL 230
F+ P+ K+ ++ V +E+ +G + + ++G + VDSG +FT L
Sbjct: 303 GGFVYTPLLHNPKHPYFYSVALEAVSVGATRIQARPELARVDRAGNGGMVVDSGTTFTML 362
Query: 231 PTEIYAEV------VVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVPDMRLIFSKNQS 284
P E YA V + ++R Q CY+ ++ + VP + L F N +
Sbjct: 363 PNETYARVAEAFARAMAAAGFARAERAEEQ-TGLTPCYHYAASDR-GVPPLALHFRGNAT 420
Query: 285 FVV--RNHIFSFPENEGF-------TVFCLTVMS----------TDGDYGIIGQNFMMGH 325
+ RN+ F E V CL +M+ DG G +G G
Sbjct: 421 VALPRRNYFMGFKSEEEAGGAGRKDDVGCLMLMNGGDVSGEDGGDDGPAGTLGNFQQQGF 480
Query: 326 RIVFDRENLKLAWSHSKCEEVID 348
+V+D + ++ ++ +C E+ D
Sbjct: 481 EVVYDVDAGRVGFARRRCTELWD 503
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 76/318 (23%), Positives = 132/318 (41%), Gaps = 35/318 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
++PS SS+ ++++C LC+ ++ C Y Y + + + G + L
Sbjct: 123 FNPSFSSTFQSITCGSSLCQQLLIRGCRRNQCLYQVSYG-DGSFTVGEFSTETLSFG--- 178
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
S+ +SV IGCG G + A ++GLG G +S PS + + L + FS C
Sbjct: 179 -----SNAVNSVAIGCGHNNQGLFTGAAG---LLGLGKGLLSFPSQVGQ--LYGSVFSYC 228
Query: 165 FDENDS-GSV--FFGDQGPATQQSTSFLPIGEKYDA-YFVGVESYCIGNSCL-------- 212
+S GSV FG+Q A+ + L K D Y+V + +G + +
Sbjct: 229 LPTRESTGSVPLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLS 288
Query: 213 ----TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNS-WKYCYNASSE 267
T +G ++DSG + T L T Y + F + S G S + CY+ S
Sbjct: 289 LDSSTGNG-GVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGR 347
Query: 268 EMLKVPDMRLIFSKNQSFVVRNHIFSFP-ENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
+ +P + +F+ + + P +N G +CL ++ IIG R
Sbjct: 348 SSIMLPAVSFVFNGGATMALPAQNIMVPVDNSG--TYCLAFAPNSENFSIIGNIQQQSFR 405
Query: 327 IVFDRENLKLAWSHSKCE 344
+ FD ++ ++C
Sbjct: 406 MSFDSTGNRVGIGANQCN 423
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 72/315 (22%), Positives = 123/315 (39%), Gaps = 33/315 (10%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
+DP+ S+S VSCS +C + C Y Y + + + G L L +F
Sbjct: 182 FDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSYG-DGSYTKGTLA---LETLTFG 237
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
+ ++ SV IGCG + G ++ A G+ S+ + G +FS C
Sbjct: 238 R-----TMVRSVAIGCGHRNRGMFVGAAGLLGLG-----GGSMSFVGQLGGQTGGAFSYC 287
Query: 165 F---DENDSGSVFFGDQGPATQQSTSFLPIGEKYDA---YFVGVESYCIGNS-------- 210
+ SGS+ FG + A +++P+ A Y++G+ +G
Sbjct: 288 LVSRGTDSSGSLVFGRE--ALPAGAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEV 345
Query: 211 -CLTQSGFQALV-DSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEE 268
LT+ G +V D+G + T LPT Y F ++ + + CY+
Sbjct: 346 FRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVAIFDTCYDLLGFV 405
Query: 269 MLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIV 328
++VP + FS + F P ++ T FC + I+G G +I
Sbjct: 406 SVRVPTVSFYFSGGPILTLPARNFLIPMDDAGT-FCFAFAPSTSGLSILGNIQQEGIQIS 464
Query: 329 FDRENLKLAWSHSKC 343
FD N + + + C
Sbjct: 465 FDGANGYVGFGPNIC 479
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 64/317 (20%), Positives = 126/317 (39%), Gaps = 37/317 (11%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLASFS 104
++P SSS + CS LC++ S + C Y Y + + + G + + L S S
Sbjct: 137 FNPQGSSSFSTLPCSSQLCQALQSPTCSNNSCQYTYGYG-DGSETQGSMGTETLTFGSVS 195
Query: 105 KHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSIC 164
++ GCG G A G++G+G G +S+PS L FS C
Sbjct: 196 I--------PNITFGCGENNQGFGQGNGA--GLVGMGRGPLSLPSQLDVT-----KFSYC 240
Query: 165 F---DENDSGSVFFG---DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL------ 212
+ S ++ G + A +T+ + + Y++ + +G++ L
Sbjct: 241 MTPIGSSTSSTLLLGSLANSVTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSV 300
Query: 213 ----TQSGFQA-LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSE 267
+ +G ++DSG + T+ Y V F ++ ++ + + C+ S+
Sbjct: 301 FKLNSNNGTGGIIIDSGTTLTYFADNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSD 360
Query: 268 EM-LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHR 326
+ L++P + F + + F P N + CL + S+ I G
Sbjct: 361 QSNLQIPTFVMHFDGGDLVLPSENYFISPSNG---LICLAMGSSSQGMSIFGNIQQQNLL 417
Query: 327 IVFDRENLKLAWSHSKC 343
+V+D N +++ ++C
Sbjct: 418 VVYDTGNSVVSFLFAQC 434
>gi|413950927|gb|AFW83576.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 316
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 70/271 (25%), Positives = 106/271 (39%), Gaps = 53/271 (19%)
Query: 116 VIIGCGRKQTG-SYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNSFSICF-----DEND 169
V++GC TG S+L A DGV+ LG +VS S A FS C N
Sbjct: 58 VVLGCTTSYTGESFL---ASDGVLSLGYSNVSFAS--RAAARFGGRFSYCLVDHLAPRNA 112
Query: 170 SGSVFFGDQ------------------GPATQQSTSFLPIGEKYDAYFVGVESYCIGNSC 211
+ + FG P +Q T L Y V V +
Sbjct: 113 TSYLTFGPNPAVSSASASRTACAGSAAAPGARQ-TPLLLDHRMRPFYAVAVNGVSVDGEL 171
Query: 212 L--------TQSGFQALVDSGASFTFLPTEIYAEVVVKF-DKLVSSKRISLQGNSWKYCY 262
L Q G A++DSG S T L + Y VV KLV R+++ + + YCY
Sbjct: 172 LRIPRLVWDVQKGGGAILDSGTSLTVLVSPAYRAVVAALGKKLVGLPRVAM--DPFDYCY 229
Query: 263 NASS----EEM-LKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDY--- 314
N +S E++ + VP + + F+ + + G V C+ + +GD+
Sbjct: 230 NWTSPLTGEDLAVAVPALAVHFAGSARLQPPPKSYVIDAAPG--VKCIGLQ--EGDWPGV 285
Query: 315 GIIGQNFMMGHRIVFDRENLKLAWSHSKCEE 345
+IG H FD +N +L + S+C +
Sbjct: 286 SVIGNILQQEHLWEFDLKNRRLRFKRSRCMQ 316
>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
Length = 534
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 80/360 (22%), Positives = 124/360 (34%), Gaps = 55/360 (15%)
Query: 30 LVFGASIVQDRNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSS 89
+ G ++ + + Y P+ SSS + + CS C PY S S
Sbjct: 172 MSMGGEGAKEASKNWYRPAKSSSWRRIRCSQKECAV----------LPYNTCQSPSKAES 221
Query: 90 SGYLV---DDILHLASFSKHAPQSSVQSS-------VIIGCGRKQTGSYLDGAAPDGVMG 139
Y D + + + K +V +I+GC + G +D A DGV+
Sbjct: 222 CSYFQKTQDGTVTIGIYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVD--AHDGVLS 279
Query: 140 LGLGDVSVPSLLAKAGLIQNSFSICF-----DENDSGSVFFGDQ----GPATQQSTSFLP 190
LG GD+S AK FS C + S + FG GP T ++
Sbjct: 280 LGNGDMSFAVHAAKR--FGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYN 337
Query: 191 I------GEKYDAYFVGVESYCIGNSCLTQSGF---QALVDSGASFTFLPTEIYAEVVVK 241
+ G + VG E I + F ++D+ S T L E YA V
Sbjct: 338 VDVKPAYGAQVTGVLVGGERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAA 397
Query: 242 FDKLVSSKRISLQGNSWKYCYN-------ASSEEMLKVPDMRLIFSKNQSFVVRNHIFSF 294
D+ +S + ++YCY + +P + +
Sbjct: 398 LDRHLSHLPRVYELEGFEYCYKWTFTGDGVDPAHNVTIPSFTVEMAGGARLEPEAKSVVM 457
Query: 295 PENEGFTVFCLTVMS-TDGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVH 353
PE E V CL G GI+G FM + D + K+ + KC + H+H
Sbjct: 458 PEVEP-GVACLAFRKLLRGGPGILGNVFMQEYIWEIDHGDGKIRFRKDKC----NTHHLH 512
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 79/335 (23%), Positives = 133/335 (39%), Gaps = 59/335 (17%)
Query: 45 YDPSSSSSSKNVSCSHPLCK----SRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+ P+SSS+ + C+ C+ S +C + C Y Y + T+ GYL + L +
Sbjct: 128 FQPASSSTFSKLPCTSSFCQFLPNSIRTCNATG--CVYNYKYGSGYTA--GYLATETLKV 183
Query: 101 --ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
ASF SV GC + G + G+ GLG G +S L+ + G+
Sbjct: 184 GDASFP----------SVAFGCSTENG----VGNSTSGIAGLGRGALS---LIPQLGV-- 224
Query: 159 NSFSICFDENDSGS---VFFGDQGPATQ---QSTSFL---PIGEKYDAYFVGVESYCIGN 209
FS C + + FG T QST F+ + Y Y+V + +G
Sbjct: 225 GRFSYCLRSGSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHPSY--YYVNLTGITVGE 282
Query: 210 SCL---------TQSGFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSW 258
+ L TQ+G +VDSG + T+L + Y V F +
Sbjct: 283 TDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGL 342
Query: 259 KYCYNAS--SEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENE---GFTVFCLTVMSTDGD 313
C+ ++ + VP + L F + V + F+ E + TV CL ++ GD
Sbjct: 343 DLCFKSTGGGGGGIAVPSLVLRFDGGAEYAVPTY-FAGVETDSQGSVTVACLMMLPAKGD 401
Query: 314 --YGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEV 346
+IG M +++D + +++ + C +V
Sbjct: 402 QPMSVIGNVMQMDMHLLYDLDGGIFSFAPADCAKV 436
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 78/333 (23%), Positives = 138/333 (41%), Gaps = 57/333 (17%)
Query: 45 YDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP SSS V CS LC + RS+C KD C Y+ Y + +S+ G L +
Sbjct: 150 FDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDSCEYLYTYG-DYSSTRGLLATETFTFED 208
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDG-AAPDGVMGLGLGDVSVPSLLAKAGLIQNSF 161
++S+ S + GCG + G DG + G++GLG G +S+ S L + F
Sbjct: 209 ------ENSI-SGIGFGCGVENEG---DGFSQGSGLVGLGRGPLSLISQLK-----ETKF 253
Query: 162 SICF----DENDSGSVFFGD-------------QGPATQQSTSFLPIGEKYDAYFVGVES 204
S C D S S+F G G T ++ S L ++ Y++ ++
Sbjct: 254 SYCLTSIEDSEASSSLFIGSLASGIVNKTGANLDGEVT-KTMSLLRNPDQPSFYYLELQG 312
Query: 205 YCIGNSCLT--QSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQ 254
+G L+ +S F+ ++DSG + T+L + + +F +S
Sbjct: 313 ITVGAKRLSVEKSTFELSEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSG 372
Query: 255 GNSWKYCYN-ASSEEMLKVPDMRLIF---SKNQSFVVRNHIFSFPENEGFTVFCLTVMST 310
C+ ++ + + VP +LIF + N++ + + V CL + S+
Sbjct: 373 STGLDLCFKLPNAAKNIAVP--KLIFHFKGADLELPGENYMVA---DSSTGVLCLAMGSS 427
Query: 311 DGDYGIIGQNFMMGHRIVFDRENLKLAWSHSKC 343
+G I G ++ D E + + ++C
Sbjct: 428 NG-MSIFGNVQQQNFNVLHDLEKETVTFVPTEC 459
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 71/321 (22%), Positives = 124/321 (38%), Gaps = 42/321 (13%)
Query: 45 YDPSSSSSSKNVSCSHPLCKSRSS--CKSLKDPCPYIADYSTEDTSSSGYLVDDILHLAS 102
+DP S + + CS P C+ S C + + C Y Y + + + + +
Sbjct: 184 FDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETL----T 239
Query: 103 FSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAG-LIQNSF 161
F ++ + V +GCG G ++ A G+ S + G F
Sbjct: 240 FRRNRVKG-----VALGCGHDNEGLFVGAAGLLGLG------KGKLSFPGQTGHRFNQKF 288
Query: 162 SICFDENDSGS----VFFGDQGPATQQSTSFLPI--GEKYDA-YFVGVESYCIGNS---C 211
S C + + S V FG+ A + F P+ K D Y+VG+ +G +
Sbjct: 289 SYCLVDRSASSKPSSVVFGNA--AVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPG 346
Query: 212 LTQSGFQ--------ALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
+T S F+ ++DSG S T L Y + F + + + + + C++
Sbjct: 347 VTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFD 406
Query: 264 ASSEEMLKVPDMRLIF-SKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFM 322
S+ +KVP + L F + S N++ N F C T G IIG
Sbjct: 407 LSNMNEVKVPTVVLHFRGADVSLPATNYLIPVDTNGKF---CFAFAGTMGGLSIIGNIQQ 463
Query: 323 MGHRIVFDRENLKLAWSHSKC 343
G R+V+D + ++ ++ C
Sbjct: 464 QGFRVVYDLASSRVGFAPGGC 484
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 85/346 (24%), Positives = 143/346 (41%), Gaps = 62/346 (17%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKSRS-------SCKSLKDPCPYIADYSTEDTSSSGYLVD 95
+ +DP+ SSS V CS C R+ SC S C I Y+ + +SS G L
Sbjct: 121 TTFDPNRSSSYSPVPCSSLTCTDRTRDFPIPASCDS-NQLCHAILSYA-DASSSEGNLAS 178
Query: 96 DILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPD-GVMGLGLGDVSVPSLLAKA 154
D ++ + S I GC + + + + G+MG+ G +S S +
Sbjct: 179 DTFYIGN--------SDMPGTIFGCMDSSFSTNTEEDSKNTGLMGMNRGSLSFVSQMDFP 230
Query: 155 GLIQNSFSICFDEND-SGSVFFGDQG----------PATQQSTSFLPIGEKYDAYFVGVE 203
FS C ++D SG + GD P Q ST LP ++ AY V +E
Sbjct: 231 -----KFSYCISDSDFSGVLLLGDANFSWLMPLNYTPLIQISTP-LPYFDRV-AYTVQLE 283
Query: 204 SYCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS 252
+ + L T +G Q +VDSG FTFL +Y+ + +F S
Sbjct: 284 GIKVSSKLLPLPKSVFVPDHTGAG-QTMVDSGTQFTFLLGPVYSALRNEFLNQTSQILRV 342
Query: 253 LQGNSWKY------CYNA--SSEEMLKVPDMRLIFSKNQSFVVRNH-IFSFP-ENEGF-T 301
L+ ++ + CY S + +P + L+F + V + ++ P E G +
Sbjct: 343 LEDPNYVFQGGMDLCYRVPLSQTSLPWLPTVSLMFRGAEMKVSGDRLLYRVPGEVRGSDS 402
Query: 302 VFCLTVMSTD---GDYGIIGQNFMMGHRIVFDRENLKLAWSHSKCE 344
V+C T ++D + +IG + + FD E ++ ++ +C+
Sbjct: 403 VYCFTFGNSDLLAVEAYVIGHHHQQNVWMEFDLEKSRIGFAQVQCD 448
>gi|195501954|ref|XP_002098017.1| GE10127 [Drosophila yakuba]
gi|194184118|gb|EDW97729.1| GE10127 [Drosophila yakuba]
Length = 465
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 77/319 (24%), Positives = 134/319 (42%), Gaps = 58/319 (18%)
Query: 51 SSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSS----------SGYLVDDILHL 100
+ S N+ P CKS++ C+ K P + ++ S +G L D + +
Sbjct: 169 TGSSNIWVPGPHCKSKA-CQKHKKYHPAKSSTYVKNGKSFAITYGSGSVAGVLAKDTVRI 227
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQN- 159
A + A Q+ ++ K+ G+ + DG++GLG +SV ++ L++N
Sbjct: 228 AGLT-VANQTFAMTT-------KEPGTTFVTSNFDGILGLGYRSISVDNVKT---LVENM 276
Query: 160 ---------SFSICFDENDSG----SVFFGDQGPAT---QQSTSFLPIGEKYDAYFVGVE 203
F+IC S ++ FG + S ++ P+ K F +
Sbjct: 277 CSEDVITSCKFAICMKGGGSSSRGGALIFGSSNTSAYSGSNSYTYTPVTTKGYWQFTLQD 336
Query: 204 SYCIGNSCLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYN 263
Y +G++ ++ S QA+VDSG S PT IY K +K++ S G W C
Sbjct: 337 IY-VGSTKVSGS-VQAIVDSGTSLITAPTAIYN----KINKVIGCTATS-SGECWMKCAK 389
Query: 264 ASSEEMLKVPDMRLIFSKNQSFVVRNHIF--SFPENEGFTVFCLTVMSTDGDYGII-GQN 320
K+PD + + + FVV+ + N G TV C++ +S D +I G
Sbjct: 390 -------KIPDFTFVIA-GKKFVVKGNKMKVKVKTNRGKTV-CISAVSEVPDEPVILGDA 440
Query: 321 FMMGHRIVFDRENLKLAWS 339
F+ VFD N ++ ++
Sbjct: 441 FIRHFCTVFDLANNRIGFA 459
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 69/322 (21%), Positives = 123/322 (38%), Gaps = 42/322 (13%)
Query: 44 EYDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+ P+ SS+ + V C P C S S + C + Y+ ++ L D L L
Sbjct: 142 SFSPTQSSTYRTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAA--STFQAVLGQDSLAL 199
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+++V S GC R +G + P G++G G G +S L +
Sbjct: 200 --------ENNVVVSYTFGCLRVVSG---NSVPPQGLIGFGRGPLSF--LSQTKDTYGSV 246
Query: 161 FSICFDE----NDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
FS C N SG++ G G P ++T L + Y+V + +G+ +
Sbjct: 247 FSYCLPNYRSSNFSGTLKLGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVP 306
Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
+G ++D+G FT L +YA V F V + G + CYN +
Sbjct: 307 QSALAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRVRTPVAPPLGG-FDTCYNVT 365
Query: 266 SEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMSTDG---DYGIIGQNF 321
+ VP + +F+ + + ++ + G + +DG ++
Sbjct: 366 ----VSVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQ 421
Query: 322 MMGHRIVFDRENLKLAWSHSKC 343
R++FD N ++ +S C
Sbjct: 422 QQNQRVLFDVANGRVGFSRELC 443
>gi|125589909|gb|EAZ30259.1| hypothetical protein OsJ_14308 [Oryza sativa Japonica Group]
Length = 178
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 39/80 (48%), Gaps = 2/80 (2%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCKSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILH 99
R L+ YDP SS SSK V C +C SR C ++ CPYI Y+ + + G L D+LH
Sbjct: 101 RKLTFYDPRSSVSSKEVKCDDTICTSRPPC-NMTLRCPYITGYA-DGGLTMGILFTDLLH 158
Query: 100 LASFSKHAPQSSVQSSVIIG 119
+ +SV G
Sbjct: 159 YHQLYGNGQTQPTSTSVTFG 178
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 69/322 (21%), Positives = 123/322 (38%), Gaps = 42/322 (13%)
Query: 44 EYDPSSSSSSKNVSCSHPLC---KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDILHL 100
+ P+ SS+ + V C P C S S + C + Y+ ++ L D L L
Sbjct: 123 SFSPTQSSTYRTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAA--STFQAVLGQDSLAL 180
Query: 101 ASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQNS 160
+++V S GC R +G + P G++G G G +S L +
Sbjct: 181 --------ENNVVVSYTFGCLRVVSG---NSVPPQGLIGFGRGPLSF--LSQTKDTYGSV 227
Query: 161 FSICF----DENDSGSVFFGDQG-PATQQSTSFLPIGEKYDAYFVGVESYCIGNSCL--- 212
FS C N SG++ G G P ++T L + Y+V + +G+ +
Sbjct: 228 FSYCLPNYRSSNFSGTLKLGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVP 287
Query: 213 -------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNAS 265
+G ++D+G FT L +YA V F V + G + CYN +
Sbjct: 288 QSALAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRVRTPVAPPLGG-FDTCYNVT 346
Query: 266 SEEMLKVPDMRLIFSKNQSFVV-RNHIFSFPENEGFTVFCLTVMSTDG---DYGIIGQNF 321
+ VP + +F+ + + ++ + G + +DG ++
Sbjct: 347 ----VSVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQ 402
Query: 322 MMGHRIVFDRENLKLAWSHSKC 343
R++FD N ++ +S C
Sbjct: 403 QQNQRVLFDVANGRVGFSRELC 424
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 78/361 (21%), Positives = 138/361 (38%), Gaps = 69/361 (19%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLV 94
R+ + P +S + +V C C+SR +C C Y+ + +SS G L
Sbjct: 104 RSALSFRPRASLTFASVPCGSAQCRSRDLPSPPACDGASKQCRVSLSYA-DGSSSDGALA 162
Query: 95 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
++ + + P + GC + DG A G++G+ G +S S +
Sbjct: 163 TEVF---TVGQGPPLRAA-----FGCMATAFDTSPDGVATAGLLGMNRGALSFVSQAST- 213
Query: 155 GLIQNSFSICF-DENDSGSVFFGDQGPATQQSTSFLPIG-----------EKYD--AYFV 200
FS C D +D+G + G FLP+ +D AY V
Sbjct: 214 ----RRFSYCISDRDDAGVLLLG------HSDLPFLPLNYTPLYQPAMPLPYFDRVAYSV 263
Query: 201 GVESYCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 249
+ +G L T +G Q +VDSG FTFL + Y+ + +F +
Sbjct: 264 QLLGIRVGGKPLPIPASVLAPDHTGAG-QTMVDSGTQFTFLLGDAYSALKAEFSRQTKPW 322
Query: 250 RISLQGNSWKYCYNASSEEMLKVPDMR----------LIFSKNQSFVVRNH-IFSFP--E 296
+L N + + + + +VP R L+F+ Q V + ++ P
Sbjct: 323 LPAL--NDPNFAFQEAFDTCFRVPQGRAPPARLPAVTLLFNGAQMTVAGDRLLYKVPGER 380
Query: 297 NEGFTVFCLTVMSTDG---DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVH 353
G V+CLT + D +IG + M + +D E ++ + +C+ ++ +
Sbjct: 381 RGGDGVWCLTFGNADMVPITAYVIGHHHQMNVWVEYDLERGRVGLAPIRCDVASERLGLM 440
Query: 354 L 354
L
Sbjct: 441 L 441
>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
Length = 469
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 75/327 (22%), Positives = 131/327 (40%), Gaps = 47/327 (14%)
Query: 43 SEYDPSSSSSSKNVSCSHPLCKS--RSSCKSLKDPCP---------YIADYSTEDTSSSG 91
+ + P+ S++ + CS +C R +C Y Y ++SG
Sbjct: 132 TAFRPNGSATFSPLPCSSDMCLPVLRETCGRAGAAANATAGARCDSYSLTYGGSAANTSG 191
Query: 92 YLVDDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLL 151
YL D + + V+ GC SY D A GV+G+G G++S L+
Sbjct: 192 YLATDTFTFGA--------TAVPGVVFGC---SDASYGDFAGASGVIGIGRGNLS---LI 237
Query: 152 AKAGLIQNSFSICFDE-NDSGS----VFFGDQG-PATQ--QSTSFLPIGEKYDAYFVGVE 203
++ + S+ + E D GS + FGD P T+ QST L D Y+V +
Sbjct: 238 SQLQFGKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGQSTPLLSSTLYPDFYYVNLT 297
Query: 204 SYCIGNSCLTQ--SGFQALVDSGASFTFL----PTEIYAEVVVKFDKLVSSKRISL---Q 254
+ + L +G L +G L P + + + RI L
Sbjct: 298 GVRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGLPAVN 357
Query: 255 GNS---WKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTD 311
G++ CYNASS +KVP + L+F + + + +N+ + CLT++ +
Sbjct: 358 GSAALELDLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDND-TGLECLTMLPSQ 416
Query: 312 GDYGIIGQNFMMGHRIVFDRENLKLAW 338
G ++G G +++D + +L +
Sbjct: 417 GG-SVLGTLLQTGTNMIYDVDAGRLTF 442
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 78/361 (21%), Positives = 138/361 (38%), Gaps = 69/361 (19%)
Query: 40 RNLSEYDPSSSSSSKNVSCSHPLCKSRS-----SCKSLKDPCPYIADYSTEDTSSSGYLV 94
R+ + P +S + +V C C+SR +C C Y+ + +SS G L
Sbjct: 105 RSALSFRPRASLTFASVPCDSAQCRSRDLPSPPACDGASKQCRVSLSYA-DGSSSDGALA 163
Query: 95 DDILHLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKA 154
++ + + P + GC + DG A G++G+ G +S S +
Sbjct: 164 TEVF---TVGQGPPLRAA-----FGCMATAFDTSPDGVATAGLLGMNRGALSFVSQAST- 214
Query: 155 GLIQNSFSICF-DENDSGSVFFGDQGPATQQSTSFLPIG-----------EKYD--AYFV 200
FS C D +D+G + G FLP+ +D AY V
Sbjct: 215 ----RRFSYCISDRDDAGVLLLG------HSDLPFLPLNYTPLYQPAMPLPYFDRVAYSV 264
Query: 201 GVESYCIGNSCL-----------TQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSK 249
+ +G L T +G Q +VDSG FTFL + Y+ + +F +
Sbjct: 265 QLLGIRVGGKPLPIPASVLAPDHTGAG-QTMVDSGTQFTFLLGDAYSALKAEFSRQTKPW 323
Query: 250 RISLQGNSWKYCYNASSEEMLKVPDMR----------LIFSKNQSFVVRNH-IFSFP--E 296
+L N + + + + +VP R L+F+ Q V + ++ P
Sbjct: 324 LPAL--NDPNFAFQEAFDTCFRVPQGRAPPARLPAVTLLFNGAQMTVAGDRLLYKVPGER 381
Query: 297 NEGFTVFCLTVMSTDG---DYGIIGQNFMMGHRIVFDRENLKLAWSHSKCEEVIDKSHVH 353
G V+CLT + D +IG + M + +D E ++ + +C+ ++ +
Sbjct: 382 RGGDGVWCLTFGNADMVPITAYVIGHHHQMNVWVEYDLERGRVGLAPIRCDVASERLGLM 441
Query: 354 L 354
L
Sbjct: 442 L 442
>gi|431910128|gb|ELK13201.1| Cathepsin D [Pteropus alecto]
Length = 375
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 68/275 (24%), Positives = 120/275 (43%), Gaps = 34/275 (12%)
Query: 88 SSSGYLVDDILHLASFSKHAPQSSVQ-SSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVS 146
S SGYL D + + S +P SSV+ I G KQ G A DG++G+ +S
Sbjct: 111 SLSGYLSQDTVSVPCKSAPSPPSSVKVERQIFGEATKQPGITFIAAKFDGILGMAYPRIS 170
Query: 147 V-------PSLLAKAGLIQNSFSICFDENDSGS-----VFFGDQGPATQQSTSFLPIGEK 194
V +L+ + + +N FS + + + + G S S+L + K
Sbjct: 171 VNNVLPVFDNLMQQKLVDKNIFSFYLNRDPNAQPGGELMLGGTDSKYYTGSLSYLNVTRK 230
Query: 195 YDAYF-VGVESYCIGNS-CLTQSGFQALVDSGASFTFLPTEIYAEVVVKFDKLVSSKRIS 252
AY+ V +E +GNS L ++G +A+VD+G S P E V K + + +
Sbjct: 231 --AYWQVHMEQVDVGNSLTLCKAGCEAIVDTGTSLVVGPV----EEVRALQKAIGAVPL- 283
Query: 253 LQGNSWKYCYNASSEEMLKVPDMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLT-VMSTD 311
+QG Y E++ +P++ L + + + ++ ++G CL+ M D
Sbjct: 284 IQGE-----YMIPCEKVSSLPEVTLKLG-GKGYKLGAEDYTLKVSQGGKTICLSGFMGMD 337
Query: 312 -----GDYGIIGQNFMMGHRIVFDRENLKLAWSHS 341
G I+G F+ + VFDR+ ++ + +
Sbjct: 338 IPPPGGPLWILGDVFIGRYYTVFDRDENRVGLAEA 372
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 76/310 (24%), Positives = 118/310 (38%), Gaps = 22/310 (7%)
Query: 43 SEYDPSSSSSSKNVSCSHPLC----KSRSSCKSLKDPCPYIADYSTEDTSSSGYLVDDIL 98
S +DPSSSS+ SCS C +S+ + C YI +Y +++ D L
Sbjct: 162 SLFDPSSSSTYSPFSCSSAPCAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTG-TYSSDTL 220
Query: 99 HLASFSKHAPQSSVQSSVIIGCGRKQTGSYLDGAAPDGVMGLGLGDVSVPSLLAKAGLIQ 158
L SS + GC + ++G + D DG+MGLG G S+ S AG
Sbjct: 221 TLG--------SSAMTDFQFGCSQSESGGFND--QTDGLMGLGGGAQSLAS--QTAGTFG 268
Query: 159 NSFSICFDENDSGSVFFG-DQGPATQQSTSFLPIGEKYDAYFVGVESYCIGNSCLT--QS 215
+FS C S F G + T L + Y V +ES +G+ L S
Sbjct: 269 TAFSYCLPPTSGSSGFLTLGTGSSGFVKTPMLRSTQIPTYYVVLLESIKVGSQQLNLPTS 328
Query: 216 GFQA--LVDSGASFTFLPTEIYAEVVVKFDKLVSSKRISLQGNSWKYCYNASSEEMLKVP 273
F A L+DSG T LP Y+ + F + + C++ S + + +P
Sbjct: 329 VFSAGSLMDSGTIITRLPPTAYSALSSAFKAGMQQYPPATPSGILDTCFDFSGQSSISIP 388
Query: 274 DMRLIFSKNQSFVVRNHIFSFPENEGFTVFCLTVMSTDGDYGIIGQNFMMGHRIVFDREN 333
+ L+FS + + + T D GIIG +++D
Sbjct: 389 TVTLVFSGGAAVDLAFDGIMLEISSSIRCLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGG 448
Query: 334 LKLAWSHSKC 343
+ + C
Sbjct: 449 GAVGFKAGAC 458
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.132 0.393
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,666,950,588
Number of Sequences: 23463169
Number of extensions: 281859798
Number of successful extensions: 1093697
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 229
Number of HSP's successfully gapped in prelim test: 1890
Number of HSP's that attempted gapping in prelim test: 1090535
Number of HSP's gapped (non-prelim): 2435
length of query: 422
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 277
effective length of database: 8,957,035,862
effective search space: 2481098933774
effective search space used: 2481098933774
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)