BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 047535
(376 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 442 bits (1136), Expect = e-121, Method: Compositional matrix adjust.
Identities = 228/372 (61%), Positives = 277/372 (74%), Gaps = 16/372 (4%)
Query: 11 NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
N + VS+ NGEY+MK SIGTPP D+YGI DTGSDLMW QCLPC+ CYKQ P+++P+
Sbjct: 78 NTPEPPVSSNNGEYLMKISIGTPPF-DVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPS 136
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNN-- 127
S+S+KE+SC+S+QC LLDTVSCS Q+LC+++YGY D SL +GV+ATE +T NSN+
Sbjct: 137 KSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTL-NSNSGQ 195
Query: 128 --FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGA-NKFSYCLVPFHTD 184
N+VFGCGHNN+G FNENEMGL G G LSL SQI+S LG+ KFS CLVPF TD
Sbjct: 196 PXSIXNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTD 255
Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
SITSK+ FG +EVSG VVST LV+K+D TYYFVTL+GISVG+ KL P+ +SS
Sbjct: 256 PSITSKIIFGPEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISVGD-----KLFPFSSSSP 310
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAP 304
+KGN+FID G PPTLLP+DFYNRL + V+ AI + P QDP L QLCY++ ++ P
Sbjct: 311 MATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSATLID-GP 369
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
ILTAHFD GA V L +TFI P EGV+CFAMQPIDGD GIFGNF Q + IG+D D +
Sbjct: 370 ILTAHFD-GADVQLKPLNTFISPK-EGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGK 427
Query: 365 MVSFKPTDCTKQ 376
VSFK DCTKQ
Sbjct: 428 KVSFKAVDCTKQ 439
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 441 bits (1135), Expect = e-121, Method: Compositional matrix adjust.
Identities = 228/372 (61%), Positives = 277/372 (74%), Gaps = 16/372 (4%)
Query: 11 NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
N + VS+ NGEY+MK SIGTPP D+YGI DTGSDLMW QCLPC+ CYKQ P+++P+
Sbjct: 78 NTPEPPVSSNNGEYLMKISIGTPPF-DVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPS 136
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNN-- 127
S+S+KE+SC+S+QC LLDTVSCS Q+LC+++YGY D SL +GV+ATE +T NSN+
Sbjct: 137 KSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTL-NSNSGQ 195
Query: 128 --FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGA-NKFSYCLVPFHTD 184
N+VFGCGHNN+G FNENEMGL G G LSL SQI+S LG+ KFS CLVPF TD
Sbjct: 196 PTSILNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTD 255
Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
SITSK+ FG +EVSG VVST LV+K+D TYYFVTL+GISVG+ KL P+ +SS
Sbjct: 256 PSITSKIIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGD-----KLFPFSSSSP 310
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAP 304
+KGN+FID G PPTLLP+DFYNRL + V+ AI + P QDP L QLCY++ ++ P
Sbjct: 311 MATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSATLID-GP 369
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
ILTAHFD GA V L +TFI P EGV+CFAMQPIDGD GIFGNF Q + IG+D D +
Sbjct: 370 ILTAHFD-GADVQLKPLNTFISPK-EGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGK 427
Query: 365 MVSFKPTDCTKQ 376
VSFK DCTKQ
Sbjct: 428 KVSFKAVDCTKQ 439
>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
Length = 334
Score = 390 bits (1001), Expect = e-106, Method: Compositional matrix adjust.
Identities = 207/367 (56%), Positives = 250/367 (68%), Gaps = 45/367 (12%)
Query: 11 NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
N + VS+ NGEY+MK SIGTPP D+YGI DTGSDLMW QCLPC+ CYKQ P+++P+
Sbjct: 11 NTPEPPVSSNNGEYLMKISIGTPPF-DVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPS 69
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
S+S+KE+SC+S+QC LLDT + +L
Sbjct: 70 KSTSFKEVSCESQQCRLLDTPT--------------------SIL--------------- 94
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGA-NKFSYCLVPFHTDSSITS 189
N+VFGCGHNN+G FNENEMGL G G LSL SQI+S LG+ KFS CLVPF TD SITS
Sbjct: 95 NIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITS 154
Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
K+ FG +EVSG VVST LV+K+D TYYFVTL+GISVG+ KL P+ +SS +KG
Sbjct: 155 KIIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGD-----KLFPFSSSSPMATKG 209
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAH 309
N+FID G PPTLLP+DFYNRL + V+ AI + P QDP L QLCY++ ++ PILTAH
Sbjct: 210 NVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSATLID-GPILTAH 268
Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
FD GA V L +TFI P EGV+CFAMQPIDGD GIFGNF Q + IG+D D + VSFK
Sbjct: 269 FD-GADVQLKPLNTFI-SPKEGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFK 326
Query: 370 PTDCTKQ 376
DCTKQ
Sbjct: 327 AVDCTKQ 333
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 197/367 (53%), Positives = 249/367 (67%), Gaps = 12/367 (3%)
Query: 14 QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
QS + G Y+M+ SIGTPP IYGI DTGSDL W C+PC +CYKQ PI++P S+
Sbjct: 15 QSPIYAYLGHYLMEVSIGTPPF-KIYGIADTGSDLTWTSCVPCNKCYKQRNPIFDPQKST 73
Query: 74 SYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF---FD 130
SY+ +SC S+ CH LDT CS Q+ CNYTY YA +++T+GVLA E IT ++
Sbjct: 74 SYRNISCDSKLCHKLDTGVCSPQKHCNYTYAYASAAITQGVLAQETITLSSTKGESVPLK 133
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
+VFGCGHNNTG FN+ EMG++GLG +S SQI S G +FS CLVPFHTD S++SK
Sbjct: 134 GIVFGCGHNNTGGFNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVPFHTDVSVSSK 193
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
M G GSEVSG GVVST LV+K+DKT YFVTL GISVGN + L +SS ++ KGN
Sbjct: 194 MSLGKGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGN----TYLHFNGSSSQSVEKGN 249
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY-QDPRLGSQLCYKTPSMAGIAPILTAH 309
+F+D+G PPT+LP Y+RL QVR+ + + P D LG QLCY+T + P+LTAH
Sbjct: 250 VFLDSGTPPTILPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQLCYRTKNNL-RGPVLTAH 308
Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
F+GG V L+ T TF+ P +GVFC D G++GNFAQS+ IG+D D Q+VSFK
Sbjct: 309 FEGG-DVKLLPTQTFVSPK-DGVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFK 366
Query: 370 PTDCTKQ 376
P DCTK
Sbjct: 367 PMDCTKH 373
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 194/367 (52%), Positives = 248/367 (67%), Gaps = 13/367 (3%)
Query: 14 QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
QS + G Y+M+ SIGTPP IYGI DTGSDL W C+PC CYKQ P+++P S+
Sbjct: 62 QSPIYAYLGHYLMELSIGTPPF-KIYGIADTGSDLTWTSCVPCNNCYKQRNPMFDPQKST 120
Query: 74 SYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF---FD 130
+Y+ +SC S+ CH LDT CS Q+ CNYTY YA +++T+GVLA E IT ++
Sbjct: 121 TYRNISCDSKLCHKLDTGVCSPQKRCNYTYAYASAAITRGVLAQETITLSSTKGKSVPLK 180
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
+VFGCGHNNTG FN++EMG++GLG +SL SQ+ S G +FS CLVPFHTD S++SK
Sbjct: 181 GIVFGCGHNNTGGFNDHEMGIIGLGGGPVSLISQMGSSFGGKRFSQCLVPFHTDVSVSSK 240
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
M FG GS+VSG GVVST LV+K+DKT YFVTL GISV N + + SS + KGN
Sbjct: 241 MSFGKGSKVSGKGVVSTPLVAKQDKTPYFVTLLGISVEN-----TYLHFNGSSQNVEKGN 295
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY-QDPRLGSQLCYKTPSMAGIAPILTAH 309
MF+D+G PPT+LP Y+++ QVR+ + + P DP LG QLCY+T + P+LTAH
Sbjct: 296 MFLDSGTPPTILPTQLYDQVVAQVRSEVAMKPVTDDPDLGPQLCYRTKNNLR-GPVLTAH 354
Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
F+ GA V L T TFI P +GVFC D G++GNFAQS+ IG+D D Q+VSFK
Sbjct: 355 FE-GADVKLSPTQTFISPK-DGVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFK 412
Query: 370 PTDCTKQ 376
P DCTK
Sbjct: 413 PKDCTKH 419
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 363 bits (931), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 191/373 (51%), Positives = 248/373 (66%), Gaps = 18/373 (4%)
Query: 11 NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
N+VQ+ ++ G+++M+ IGTPP+ I G+VDTGSDL+W+QC PC+ CYKQ+KP+++P
Sbjct: 55 NIVQAPINAYIGQHLMEIYIGTPPI-KITGLVDTGSDLIWIQCAPCLGCYKQIKPMFDPL 113
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN--- 127
SS+Y +SC S CH LDT CS ++ CNYTYGY D+SLTKGVLA + TF ++
Sbjct: 114 KSSTYNNISCDSPLCHKLDTGVCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKPV 173
Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
+FGCGHNNTG FN++EMGL+GLG SL SQI G KFS CLVPF TD I
Sbjct: 174 SLSRFLFGCGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKI 233
Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
+S+M FG GS+V G GVV+T LV +E T YFVTL GISV + Y+ + I
Sbjct: 234 SSRMSFGKGSQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDT--------YFPMNSTIG 285
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY-QDPRLGSQLCYKTPSMAGIAPIL 306
K NM +D+G PP LLP+ Y+++ +VRN + L P DP LG+QLCY+T + P L
Sbjct: 286 KANMLVDSGTPPILLPQQLYDKVFAEVRNKVALKPITDDPSLGTQLCYRTQTNLK-GPTL 344
Query: 307 TAHFDGGAKVPLIHTSTFIPPP--VEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFDS 363
T HF GA V L TFIPP +G+FC A+ + D G++GNFAQS+ IG+D D
Sbjct: 345 TFHFV-GANVLLTPIQTFIPPTPQTKGIFCLAIYNRTNSDPGVYGNFAQSNYLIGFDLDR 403
Query: 364 QMVSFKPTDCTKQ 376
Q+VSFKPTDCTKQ
Sbjct: 404 QVVSFKPTDCTKQ 416
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 354 bits (909), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 197/371 (53%), Positives = 250/371 (67%), Gaps = 20/371 (5%)
Query: 14 QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
Q+ VS + +Y+M+ SIGTPP+ Y VDTGSDL+W+QC+PC CYKQ+ P+++P SSS
Sbjct: 49 QTPVSVHHYDYLMELSIGTPPV-KTYAQVDTGSDLIWLQCIPCTNCYKQLNPMFDPQSSS 107
Query: 74 SYKELSCQSEQCHLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FF 129
+Y ++ SE C L + SCS Q CNYTY Y D S+T+GVLA E +T ++
Sbjct: 108 TYSNIAYGSESCSKLYSTSCSPDQNNCNYTYSYEDDSITEGVLAQETLTLTSTTGKPVAL 167
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
V+FGCGHNN GVFN+ EMG++GLGR LSL SQI S G FS CLVPFHT+ SITS
Sbjct: 168 KGVIFGCGHNNNGVFNDKEMGIIGLGRGPLSLVSQIGSSFGGKMFSQCLVPFHTNPSITS 227
Query: 190 KMYFGNGSEVSGGGVVSTSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSG--AI 246
M FG GSEV G GVVST LVSK + +YFVTL GISV +++ +P+ + S I
Sbjct: 228 PMSFGKGSEVLGNGVVSTPLVSKNTHQAFYFVTLLGISVEDIN-----LPFNDGSSLEPI 282
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ-DPRLGSQLCYKTPS-MAGIAP 304
+KGNM ID+G P TLLP+DFY+RL E+VRN + L P DP LG QLCY+TP+ + G
Sbjct: 283 TKGNMVIDSGTPTTLLPEDFYHRLVEEVRNKVALDPIPIDPTLGYQLCYRTPTNLKGTT- 341
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFDS 363
LTAHF+ GA V L T FIP +G+FCFA + GI+GN AQS+ IG+D +
Sbjct: 342 -LTAHFE-GADVLLTPTQIFIPVQ-DGIFCFAFTSTFSNEYGIYGNHAQSNYLIGFDLEK 398
Query: 364 QMVSFKPTDCT 374
Q+VSFK TDCT
Sbjct: 399 QLVSFKATDCT 409
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 353 bits (907), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 189/367 (51%), Positives = 257/367 (70%), Gaps = 17/367 (4%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
+ V++ NG+Y+MK ++G+PP+ DIYG+VDTGSDL+W QC PC CY+Q P++ P S +
Sbjct: 73 TRVTSNNGDYLMKLTLGSPPV-DIYGLVDTGSDLVWAQCTPCGGCYRQKSPMFEPLRSKT 131
Query: 75 YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDN 131
Y + C+SEQC SCS Q++C Y+Y YADSS+TKGVLA E ITF +++ +
Sbjct: 132 YSPIPCESEQCSFFG-YSCSPQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPVVVGD 190
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
++FGCGH+N+G FNEN+MG++G+G LSL SQI + G+ +FS CLVPFHTD+ + +
Sbjct: 191 IIFGCGHSNSGTFNENDMGIIGMGGGPLSLVSQIGTLYGSKRFSQCLVPFHTDAHTSGTI 250
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
FG S+VSG GVV+T L S+E +T Y VTLEGISVG+ +NSS +SKGN+
Sbjct: 251 NFGEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGD------TFVRFNSSETLSKGNI 304
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ-DPRLGSQLCYKTPSMAGIAPILTAHF 310
ID+G P T +P++FY RL E+++ L P + DP LG+QLCY++ + PILTAHF
Sbjct: 305 MIDSGTPATYIPQEFYERLVEELKVQSSLLPIEDDPDLGTQLCYRSETNLE-GPILTAHF 363
Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQ-PIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
+ GA V L+ TFIPP +GVFCFAM DGD IFGNFAQS++ +G+D D + +SFK
Sbjct: 364 E-GADVQLLPIQTFIPPK-DGVFCFAMAGSTDGDY-IFGNFAQSNILMGFDLDRKTISFK 420
Query: 370 PTDCTKQ 376
PTDCT Q
Sbjct: 421 PTDCTNQ 427
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 350 bits (898), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 189/374 (50%), Positives = 247/374 (66%), Gaps = 19/374 (5%)
Query: 11 NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
++VQ+ ++ G+Y+M+ IGTPP+ I G VDTGSDL+WVQC+PC+ CY Q+ P+++P
Sbjct: 51 DIVQAPINAYIGQYLMELYIGTPPI-KISGTVDTGSDLIWVQCVPCLGCYNQINPMFDPL 109
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN--- 127
SS+Y +SC S C+ CS ++ C+YTYGYADSSLTKGVLA E +T ++
Sbjct: 110 KSSTYTNISCDSPLCYKPYIGECSPEKRCDYTYGYADSSLTKGVLAQETVTLTSNTGKPI 169
Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
++FGCGHNNTG FN++EMGL+GLG SL SQI G KFS CLVPF TD +I
Sbjct: 170 SLQGILFGCGHNNTGNFNDHEMGLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPFLTDITI 229
Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
+S+M FG GSEV G GVV+T LV +E D T Y+VTL GISV + Y + I
Sbjct: 230 SSQMSFGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDT--------YLPMNSTI 281
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY-QDPRLGSQLCYKTPSMAGIAPI 305
KGNM +D+G PP +LP+ Y+R+ +V+N + L P DP LG QLCY+T + P
Sbjct: 282 EKGNMLVDSGTPPNILPQQLYDRVYVEVKNKVPLEPITDDPSLGPQLCYRTQTNLK-GPT 340
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVE--GVFCFAMQP-IDGDVGIFGNFAQSDLFIGYDFD 362
LT HF+ GA + L TFIPP E GVFC A+ + D GI+GNFAQ++ IG+D D
Sbjct: 341 LTYHFE-GANLLLTPIQTFIPPTPETKGVFCLAITNCANSDPGIYGNFAQTNYLIGFDLD 399
Query: 363 SQMVSFKPTDCTKQ 376
Q+VSFKPTDCTKQ
Sbjct: 400 RQIVSFKPTDCTKQ 413
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 347 bits (890), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 195/376 (51%), Positives = 256/376 (68%), Gaps = 21/376 (5%)
Query: 8 YPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIY 67
Y + +QS VS + EY+M+ SIGTPP+ IY DTGSDL+W QC+PC +CYKQ P++
Sbjct: 44 YKPSTIQSPVSAYDCEYLMELSIGTPPI-KIYAEADTGSDLVWFQCIPCTKCYKQQNPMF 102
Query: 68 NPASSSSYKELSCQSEQCHLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSN 126
+P SSSSY ++C +E C+ LD+ CS+ Q+ CNYTY YAD+S+T+GVLA E +T ++
Sbjct: 103 DPRSSSSYTNITCGTESCNKLDSSLCSTDQKTCNYTYSYADNSITQGVLAQETLTLTSTT 162
Query: 127 N---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGA--NKFSYCLVPF 181
F ++FGCGHNN+G FN+ EMGL+GLGR LSL SQI S LGA N FS CLVPF
Sbjct: 163 GEPVAFQGIIFGCGHNNSG-FNDREMGLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPF 221
Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
+TD SITS+M FG GSEV G G VST L+SK D T YF TL GISV +++ +P+ N
Sbjct: 222 NTDPSITSQMNFGKGSEVLGNGTVSTPLISK-DGTGYFATLLGISVEDIN-----LPFSN 275
Query: 242 SS--GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM 299
S G I+KGN+ ID+G T LP++FY+RL EQVRN + L P++ G +LCY+TP+
Sbjct: 276 GSSLGTITKGNILIDSGTTITYLPEEFYHRLIEQVRNKVALEPFRID--GYELCYQTPTN 333
Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
P LT HF+GG V L FIP + FCFA+ + + +GN+AQS+ IG+
Sbjct: 334 LN-GPTLTIHFEGG-DVLLTPAQMFIPVQDDN-FCFAVFDTNEEYVTYGNYAQSNYLIGF 390
Query: 360 DFDSQMVSFKPTDCTK 375
D + Q+VSFK TDCTK
Sbjct: 391 DLERQVVSFKATDCTK 406
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 346 bits (888), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 187/371 (50%), Positives = 255/371 (68%), Gaps = 14/371 (3%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
+N V + V++ NG+Y+MK ++GTPP+ D+YG+VDTGSDL+W QC PC CY+Q P++ P
Sbjct: 36 SNGVFTRVTSNNGDYLMKLTLGTPPV-DVYGLVDTGSDLVWAQCTPCQGCYRQKSPMFEP 94
Query: 70 ASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-- 127
S++Y + C SE+C+ L SCS Q+LC Y+Y YADSS+TKGVLA E +TF +++
Sbjct: 95 LRSNTYTPIPCDSEECNSLFGHSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEP 154
Query: 128 -FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
++VFGCGH+N+G FNEN+MG++GLG LSL SQ + G+ +FS CLVPFH D
Sbjct: 155 VVVGDIVFGCGHSNSGTFNENDMGIIGLGGGPLSLVSQFGNLYGSKRFSQCLVPFHADPH 214
Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
+ FG+ S+VSG GV +T LVS+E +T Y VTLEGISVG+ S +NSS +
Sbjct: 215 TLGTISFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVS------FNSSEML 268
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY-QDPRLGSQLCYKTPSMAGIAPI 305
SKGN+ ID+G P T LP++FY+RL ++++ + P DP LG+QLCY++ + PI
Sbjct: 269 SKGNIMIDSGTPATYLPQEFYDRLVKELKVQSNMLPIDDDPDLGTQLCYRSETNLE-GPI 327
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
L AHF+ GA V L+ TFIPP +GVFCFAM IFGNFAQS++ IG+D D +
Sbjct: 328 LIAHFE-GADVQLMPIQTFIPPK-DGVFCFAMAGTTDGEYIFGNFAQSNVLIGFDLDRKT 385
Query: 366 VSFKPTDCTKQ 376
VSFK TDC+ Q
Sbjct: 386 VSFKATDCSNQ 396
>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 364
Score = 327 bits (837), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 185/371 (49%), Positives = 251/371 (67%), Gaps = 25/371 (6%)
Query: 8 YPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIY 67
Y +N + V++ NG+Y+MK ++GTPP+ D+YG+VDT SDL+W QC PC CYKQ P++
Sbjct: 15 YASNGPFTRVTSNNGDYLMKLTLGTPPV-DVYGLVDTDSDLVWAQCTPCQGCYKQKNPMF 73
Query: 68 NPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
+P ++C+ SCS ++ C+Y Y YAD S TKG+LA E TF +++
Sbjct: 74 DPL------------KECNSFFDHSCSPEKACDYVYAYADDSATKGMLAKEIATFSSTDG 121
Query: 128 --FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
++++FGCGHNNTGVFNEN+MGL+GLG LSL SQ+ + G+ +FS CLVPFH D
Sbjct: 122 KPIVESIIFGCGHNNTGVFNENDMGLIGLGGGPLSLVSQMGNLYGSKRFSQCLVPFHADP 181
Query: 186 SITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
+ + G S+VSG GVV+T LVS+E +T Y VTLEGISVG+ +P +NSS
Sbjct: 182 HTSGTISLGEASDVSGEGVVTTPLVSEEGQTPYLVTLEGISVGD-----TFVP-FNSSEM 235
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTP-YQDPRLGSQLCYKTPSMAGIAP 304
+SKGN+ ID+G P T LP++FY+RL E+++ I L P + DP LG+QLCYK+ + P
Sbjct: 236 LSKGNIMIDSGTPETYLPQEFYDRLVEELKVQINLPPIHVDPDLGTQLCYKSETNLE-GP 294
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
ILTAHF+ GA V L+ TFIPP +GVFCFAM + IFGNFAQS++ IG+D D +
Sbjct: 295 ILTAHFE-GADVKLLPLQTFIPPK-DGVFCFAMTGTTDGLYIFGNFAQSNVLIGFDLDKR 352
Query: 365 MVSFKPTDCTK 375
+V FKPTD TK
Sbjct: 353 IVFFKPTDFTK 363
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 173/389 (44%), Positives = 240/389 (61%), Gaps = 24/389 (6%)
Query: 1 MSPATYFYPN-------NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC 53
MS +F P + QS + + GEY+MKFS+GTP DI I DTGSDL+W QC
Sbjct: 62 MSRVHHFSPTKNSDIFTDTAQSEMISNQGEYLMKFSLGTPAF-DILAIADTGSDLIWTQC 120
Query: 54 LPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLL-DTVSCSSQ--QLCNYTYGYADSSL 110
PC QCY+Q P+++P SSS+Y+++SC ++QC LL + SCS + + C+Y+Y Y D S
Sbjct: 121 KPCDQCYEQDAPLFDPKSSSTYRDISCSTKQCDLLKEGASCSGEGNKTCHYSYSYGDRSF 180
Query: 111 TKGVLATERITFGNSNN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILS 167
T G +A + IT G+++ + GCGHNN G F E G+VGLG +SL SQ+ S
Sbjct: 181 TSGNVAADTITLGSTSGRPVLLPKAIIGCGHNNGGSFTEKGSGIVGLGGGPISLISQLGS 240
Query: 168 QLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISV 227
+ KFSYCLVP ++++ +SK+ FG+ VSGGGV ST L+SK+ T+YF+TLE +SV
Sbjct: 241 TIDG-KFSYCLVPLSSNATNSSKLNFGSNGIVSGGGVQSTPLISKDPDTFYFLTLEAVSV 299
Query: 228 GNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPR 287
G S+ I + SS S+GN+ ID+G TL P+DF++ L V++A+ TP +DP
Sbjct: 300 G-----SERIKFPGSSFGTSEGNIIIDSGTTLTLFPEDFFSELSSAVQDAVAGTPVEDPS 354
Query: 288 LGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIF 347
LCY + P +TAHFD GA V L +TF+ + V CFA PI+ IF
Sbjct: 355 GILSLCYSIDADLKF-PSITAHFD-GADVKLNPLNTFVQVS-DTVLCFAFNPINSG-AIF 410
Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDCTKQ 376
GN AQ + +GYD + + VSFKPTDCT+
Sbjct: 411 GNLAQMNFLVGYDLEGKTVSFKPTDCTQD 439
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 166/367 (45%), Positives = 221/367 (60%), Gaps = 15/367 (4%)
Query: 14 QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
+S V G Y+M +S+GTPP IYGI DTGSD++W+QC PC QCY Q PI+NP+ SS
Sbjct: 77 ESTVIPDRGGYLMTYSVGTPPT-KIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSS 135
Query: 74 SYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFD 130
SYK + C S+ CH + SCS Q C Y Y DSS ++G L+ + ++ +++ F
Sbjct: 136 SYKNIPCSSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFP 195
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP-FHTDSSITS 189
+V GCG +N G F G+VGLG +SL +Q+ S +G KFSYCLVP + +S+ +S
Sbjct: 196 KIVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGG-KFSYCLVPLLNKESNASS 254
Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS-GAISK 248
+ FG+ + VSG GVVST L+ K+D +YF+TL+ SVGN K + + SS G +
Sbjct: 255 ILSFGDAAVVSGDGVVSTPLI-KKDPVFYFLTLQAFSVGN-----KRVEFGGSSEGGDDE 308
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTA 308
GN+ ID+G TL+P D Y LE V + +KL DP LCY S PI+T
Sbjct: 309 GNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSLCYSLKSNEYDFPIITV 368
Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
HF GA V L STF+ P +G+ CFA QP IFGN AQ +L +GYD + VSF
Sbjct: 369 HFK-GADVELHSISTFV-PITDGIVCFAFQPSPQLGSIFGNLAQQNLLVGYDLQQKTVSF 426
Query: 369 KPTDCTK 375
KPTDCTK
Sbjct: 427 KPTDCTK 433
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 291 bits (745), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 166/367 (45%), Positives = 221/367 (60%), Gaps = 15/367 (4%)
Query: 14 QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
+S V G Y+M +S+GTPP IYGI DTGSD++W+QC PC QCY Q PI+NP+ SS
Sbjct: 77 ESTVIPDRGGYLMTYSVGTPPT-KIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSS 135
Query: 74 SYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFD 130
SYK + C S+ CH + SCS Q C Y Y DSS ++G L+ + ++ +++ F
Sbjct: 136 SYKNIPCLSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFP 195
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP-FHTDSSITS 189
V GCG +N G F G+VGLG +SL +Q+ S +G KFSYCLVP + +S+ +S
Sbjct: 196 KTVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGG-KFSYCLVPLLNKESNASS 254
Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS-GAISK 248
+ FG+ + VSG GVVST L+ K+D +YF+TL+ SVGN K + + SS G +
Sbjct: 255 ILSFGDAAVVSGDGVVSTPLI-KKDPVFYFLTLQAFSVGN-----KRVEFGGSSEGGDDE 308
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTA 308
GN+ ID+G TL+P D Y LE V + +KL DP LCY S PI+TA
Sbjct: 309 GNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSLCYSLKSNEYDFPIITA 368
Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
HF GA + L STF+ P +G+ CFA QP IFGN AQ +L +GYD + VSF
Sbjct: 369 HFK-GADIELHSISTFV-PITDGIVCFAFQPSPQLGSIFGNLAQQNLLVGYDLQQKTVSF 426
Query: 369 KPTDCTK 375
KPTDCTK
Sbjct: 427 KPTDCTK 433
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 290 bits (742), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 178/384 (46%), Positives = 236/384 (61%), Gaps = 15/384 (3%)
Query: 1 MSPATYFYPN----NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC 56
+S A +F N N +QS V + NGEY+M S+GTPP+ ++GI DTGSDL+W QC PC
Sbjct: 68 ISRANHFRANGVSTNSIQSPVISNNGEYLMNISLGTPPV-SMHGIADTGSDLLWRQCKPC 126
Query: 57 VQCYKQVKPIYNPASSSSYKELSCQSEQC-HLLDTVSCSSQQLCNYTYGYADSSLTKGVL 115
CY+Q++PI++PA S +Y+ LSC+ + C +L CS C Y+Y Y D S T G L
Sbjct: 127 DSCYEQIEPIFDPAKSKTYQILSCEGKSCSNLGGQGGCSDDNTCIYSYSYGDGSHTSGDL 186
Query: 116 ATERITFGNSNNF---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGAN 172
A + +T G++ VVFGCGHNN G F + GLVGLG LS+ SQ+ +G
Sbjct: 187 AVDTLTIGSTTGRPVSVPKVVFGCGHNNGGTFELHGSGLVGLGGGPLSMISQLRPLIGG- 245
Query: 173 KFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSN 232
+FSYCLVP D S++SKM+FG+ VSG G VST L S++ T+Y++TLE +SVG+
Sbjct: 246 RFSYCLVPLGNDPSVSSKMHFGSRGIVSGAGAVSTPLASRQPDTFYYLTLESMSVGSKKL 305
Query: 233 SSKLIPYYNSSGA-ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ 291
+ K S A +GN+ ID+G TLLP+DFY LE V +AI P +DP
Sbjct: 306 AYKGFSKVGSPLADADEGNIIIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVRDPNNVFS 365
Query: 292 LCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFA 351
LCY S I P +TAHF GA + L +TF+ E +FCFAM P+ D+ IFGN A
Sbjct: 366 LCYSNLSGLRI-PTITAHFV-GADLELKPLNTFVQVQ-EDLFCFAMIPVS-DLAIFGNLA 421
Query: 352 QSDLFIGYDFDSQMVSFKPTDCTK 375
Q + +GYD S+ VSFKPTDCTK
Sbjct: 422 QMNFLVGYDLKSRTVSFKPTDCTK 445
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 162/370 (43%), Positives = 226/370 (61%), Gaps = 17/370 (4%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+QS + + GEY+M IGTPP+ + IVDTGSDL W QC PC CYKQV P+++P +S
Sbjct: 81 IQSRIVPSAGEYLMNLYIGTPPV-PVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNS 139
Query: 73 SSYKELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---F 128
S+Y++ SC + C L SCS ++ C + Y YAD S T G LA+E +T ++
Sbjct: 140 STYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKPVS 199
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
F FGCGH++ G+F+++ G+VGLG LSL SQ+ S + FSYCL+P TDSSI+
Sbjct: 200 FPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTING-LFSYCLLPVSTDSSIS 258
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPY--YNSSGAI 246
S++ FG VSG G VST LV K T+Y++TLEGISVG K +PY Y+ +
Sbjct: 259 SRINFGASGRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGK-----KRLPYKGYSKKTEV 313
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPIL 306
+GN+ +D+G T LP++FY++LE+ V N+IK +DP LCY T + API+
Sbjct: 314 EEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTAEIN-APII 372
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
TAHF A V L +TF+ E + CF + P D+G+ GN AQ + +G+D + V
Sbjct: 373 TAHFK-DANVELQPLNTFMRMQ-EDLVCFTVAPTS-DIGVLGNLAQVNFLVGFDLRKKRV 429
Query: 367 SFKPTDCTKQ 376
SFK DCT+
Sbjct: 430 SFKAADCTQH 439
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 163/369 (44%), Positives = 216/369 (58%), Gaps = 16/369 (4%)
Query: 11 NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
N +S V GEY+M +S+GTPP ++YG+VDTGSD++W+QC PC QCYKQ PI+NP+
Sbjct: 74 NTPESTVYVNGGEYLMTYSVGTPPF-NVYGVVDTGSDIVWLQCKPCEQCYKQTTPIFNPS 132
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF-- 128
SSSYK + C S C + SC+ Q C YT ++D S ++G L+ E +T ++
Sbjct: 133 KSSSYKNIPCSSNLCQSVRYTSCNKQNSCEYTINFSDQSYSQGELSVETLTLDSTTGHSV 192
Query: 129 -FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
F V GCGHNN G+F G+VGLG +SL +Q+ S +G KFSYCL+P DS+
Sbjct: 193 SFPKTVIGCGHNNRGMFQGETSGIVGLGIGPVSLTTQLKSSIGG-KFSYCLLPLLVDSNK 251
Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
TSK+ FG+ + VSG GVVST V K+ + +Y++TLE SVGN K I + +
Sbjct: 252 TSKLNFGDAAVVSGDGVVSTPFVKKDPQAFYYLTLEAFSVGN-----KRIEFEVLDDS-E 305
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILT 307
+GN+ +D+G TLLP Y LE V +KL DP LCY S PI+T
Sbjct: 306 EGNIILDSGTTLTLLPSHVYTNLESAVAQLVKLDRVDDPNQLLNLCYSITSDQYDFPIIT 365
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG-IFGNFAQSDLFIGYDFDSQMV 366
AHF GA + L STF +GV C A G IFGN AQ +L +GYD +V
Sbjct: 366 AHFK-GADIKLNPISTF-AHVADGVVCLAF--TSSQTGPIFGNLAQLNLLVGYDLQQNIV 421
Query: 367 SFKPTDCTK 375
SFKP+DC K
Sbjct: 422 SFKPSDCIK 430
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 449
Score = 283 bits (724), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 174/396 (43%), Positives = 233/396 (58%), Gaps = 31/396 (7%)
Query: 1 MSPATYFYPNNV-----VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP 55
+S A F PN++ VQS++ GEY+M+ SIG P + +I I DTGSDL+WVQC P
Sbjct: 65 ISRANRFKPNSISARALVQSDIVPGGGEYLMRISIGNPQV-EILAIADTGSDLIWVQCQP 123
Query: 56 CVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD--TVSCSSQ---QLCNYTYGYADSSL 110
C CYKQ PI++P SSSY+ + C +E C+ LD SC ++ + C YTY Y D S
Sbjct: 124 CEMCYKQNSPIFDPRRSSSYRNVLCGNEFCNKLDGEARSCDARGFVKTCGYTYSYGDQSF 183
Query: 111 TKGVLATERITFGNSNN-------FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLAS 163
+ G LA ER G++N+ +F V FGCG N G F+E G++GLG +SL S
Sbjct: 184 SDGHLAIERFGIGSTNSNTSAAIAYFQEVAFGCGTKNGGTFDELGSGIIGLGGGSMSLVS 243
Query: 164 QILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGG--GVVSTSLVSKEDKTYYFVT 221
Q+ +L + KFSYCLVP S+ TSK+ FGN +SG VVST L+ K+ +TYY++T
Sbjct: 244 QLGPKL-SGKFSYCLVPTSEQSNYTSKINFGNDINISGSNYNVVSTPLLPKKPETYYYLT 302
Query: 222 LEGISVGNLSNSSKLIPYYN-SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKL 280
LE ISV N K +PY N +G + KGN+ ID+G T L +F+N L+ V A+K
Sbjct: 303 LEAISVEN-----KRLPYTNLWNGEVEKGNIIIDSGTTLTFLDSEFFNNLDSAVEEAVKG 357
Query: 281 TPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI 340
DP +C+K + PI+TAHF GA V L +TF E + CF M P
Sbjct: 358 ERVSDPHGLFNICFKDEKAIEL-PIITAHFT-GADVELQPVNTFAKVE-EDLLCFTMIP- 413
Query: 341 DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTKQ 376
D+ IFGN AQ + +GYD + + VSF PTDCTKQ
Sbjct: 414 SNDIAIFGNLAQMNFLVGYDLEKKAVSFLPTDCTKQ 449
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 281 bits (718), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 166/369 (44%), Positives = 224/369 (60%), Gaps = 15/369 (4%)
Query: 11 NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
N QS +++ GEY+M SIGTPP+ I I DTGSDL+W QC PC CY+Q P+++P
Sbjct: 73 NSPQSFITSNRGEYLMNISIGTPPV-PILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPK 131
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNF- 128
SS+Y+++SC S QC L+ SCS+ + C+YT Y D+S TKG +A + +T G+S
Sbjct: 132 ESSTYRKVSCSSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRP 191
Query: 129 --FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
N++ GCGH NTG F+ G++GLG SL SQ+ + KFSYCLVPF +++
Sbjct: 192 VSLRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSING-KFSYCLVPFTSETG 250
Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
+TSK+ FG VSG GVVSTS+V K+ TYYF+ LE ISVG SK I + ++
Sbjct: 251 LTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVG-----SKKIQFTSTIFGT 305
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPIL 306
+GN+ ID+G TLLP +FY LE V + IK QDP LCY+ S + P +
Sbjct: 306 GEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSLCYRDSSSFKV-PDI 364
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
T HF GG V L + +TF+ E V CFA + + IFGN AQ + +GYD S V
Sbjct: 365 TVHFKGG-DVKLGNLNTFVAVS-EDVSCFAFAA-NEQLTIFGNLAQMNFLVGYDTVSGTV 421
Query: 367 SFKPTDCTK 375
SFK TDC++
Sbjct: 422 SFKKTDCSQ 430
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 280 bits (717), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 168/374 (44%), Positives = 224/374 (59%), Gaps = 19/374 (5%)
Query: 11 NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
N Q+++ GEY MK SIGTP L+++ I DTGSDL WVQCLPC CY+Q P+++P+
Sbjct: 81 NSFQNDLVPNGGEYFMKMSIGTP-LVEVIVIADTGSDLTWVQCLPCDPCYRQKSPLFDPS 139
Query: 71 SSSSYKELSCQSEQCHLLDTV--SCS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
SSSY+ + C S C+ LD +C+ +C Y Y Y D S T G LATE+ T G++++
Sbjct: 140 RSSSYRHMLCGSRFCNALDVSEQACTMDTNICEYHYSYGDKSYTNGNLATEKFTIGSTSS 199
Query: 128 ---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
+VFGCG N G F+E G+VGLG LSL SQ LS + KFSYCLVP
Sbjct: 200 RPVHLSPIVFGCGTGNGGTFDELGSGIVGLGGGALSLVSQ-LSSIIKGKFSYCLVPLSEQ 258
Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS-- 242
S++TSK+ FG S +SG VVST LVSK+ TYY+VTLE ISVGN K +PY N
Sbjct: 259 SNVTSKIKFGTDSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGN-----KRLPYTNGLL 313
Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI 302
+G + KGN+ ID+G T L +F+ LE + +K DPR +C+++ +
Sbjct: 314 NGNVEKGNVIIDSGTTLTFLDSEFFTELERVLEETVKAERVSDPRGLFSVCFRSAGDIDL 373
Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFD 362
P++ HF+ A V L +TF+ E + CF M +GIFGN AQ D +GYD +
Sbjct: 374 -PVIAVHFN-DADVKLQPLNTFVKAD-EDLLCFTMIS-SNQIGIFGNLAQMDFLVGYDLE 429
Query: 363 SQMVSFKPTDCTKQ 376
+ VSFKPTDCTK
Sbjct: 430 KRTVSFKPTDCTKH 443
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 160/367 (43%), Positives = 218/367 (59%), Gaps = 13/367 (3%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+QS + + GEY+M SIGTPP+ + IVDTGSDL W QC PC CYKQV P ++P +S
Sbjct: 81 IQSRLVPSAGEYIMNLSIGTPPV-PVIAIVDTGSDLTWTQCRPCTHCYKQVVPFFDPKNS 139
Query: 73 SSYKELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---F 128
S+Y++ SC + C L + SC + + C + Y YAD S T G LA E +T ++
Sbjct: 140 STYRDSSCGTSFCLALGNDRSCRNGKKCTFMYSYADGSFTGGNLAVETLTVASTAGKPVS 199
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
F FGC H + G+F+E+ G+VGLG LS+ SQ+ S + +FSYCL+P TDSS++
Sbjct: 200 FPGFAFGCVHRSGGIFDEHSSGIVGLGVAELSMISQLKSTING-RFSYCLLPVFTDSSMS 258
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYF-VTLEGISVGNLSNSSKLIPYYNSSGAIS 247
S++ FG VSG G VST LV K TYY+ +TLEG SVG S K ++ +
Sbjct: 259 SRINFGRSGIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYK---GFSKKAEVE 315
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILT 307
+GN+ +D+G T LP +FY +LEE V ++IK +DP S LCY T API+T
Sbjct: 316 EGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYNTTVDQIDAPIIT 375
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
AHF A V L +TF+ E + CF + P D+GI GN AQ + +G+D + VS
Sbjct: 376 AHFK-DANVELQPWNTFLRMQ-EDLVCFTVLPTS-DIGILGNLAQVNFLVGFDLRKKRVS 432
Query: 368 FKPTDCT 374
FK DCT
Sbjct: 433 FKAADCT 439
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 156/370 (42%), Positives = 221/370 (59%), Gaps = 15/370 (4%)
Query: 11 NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
+S+V++ GEY+M S+GTPP I GI DTGSDL+W QC PC +CYKQV P+++P
Sbjct: 82 KAAESDVTSNRGEYLMSLSLGTPPF-KIMGIADTGSDLIWTQCKPCERCYKQVDPLFDPK 140
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN--- 127
SS +Y++ SC + QC LLD +CS +C Y Y Y D S T G +A++ IT ++
Sbjct: 141 SSKTYRDFSCDARQCSLLDQSTCSG-NICQYQYSYGDRSYTMGNVASDTITLDSTTGSPV 199
Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
F V GCGH N G F++ G+VGLG LSL SQ+ S +G KFSYCLVP + +
Sbjct: 200 SFPKTVIGCGHENDGTFSDKGSGIVGLGAGPLSLISQMGSSVGG-KFSYCLVPLSSRAGN 258
Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
+SK+ FG+ + VSG GV ST L+S E ++YF+TLE +SVGN + I + +SS
Sbjct: 259 SSKLNFGSNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMSVGN-----ERIKFGDSSLGT 313
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPIL 306
+GN+ ID+G T++P DF++ L V N ++ +DP +CY S + P +
Sbjct: 314 GEGNIIIDSGTTLTIVPDDFFSNLSTAVGNQVEGRRAEDPSGFLSVCYSATSDLKV-PAI 372
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
TAHF GA V L +TF+ + V C A + I+GN AQ + + Y+ + +
Sbjct: 373 TAHFT-GADVKLKPINTFVQVS-DDVVCLAFASTTSGISIYGNVAQMNFLVEYNIQGKSL 430
Query: 367 SFKPTDCTKQ 376
SFKPTDCTK+
Sbjct: 431 SFKPTDCTKK 440
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 161/371 (43%), Positives = 229/371 (61%), Gaps = 16/371 (4%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
+N Q ++++ +GEY+M S+GTPP I I DTGSDL+W QC PC CY QV P+++P
Sbjct: 80 DNAPQIDLTSNSGEYLMNISLGTPPF-PIMAIADTGSDLLWTQCKPCDDCYTQVDPLFDP 138
Query: 70 ASSSSYKELSCQSEQCHLLDT-VSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNN 127
+SS+YK++SC S QC L+ SCS++ C+Y+ Y D S TKG +A + +T G+++
Sbjct: 139 KASSTYKDVSCSSSQCTALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTDT 198
Query: 128 F---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
N++ GCGHNN G FN+ G+VGLG +SL +Q+ + KFSYCLVP ++
Sbjct: 199 RPVQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDG-KFSYCLVPLTSE 257
Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
+ TSK+ FG + VSG GVVST L++K +T+Y++TL+ ISVG SK + Y S
Sbjct: 258 NDRTSKINFGTNAVVSGTGVVSTPLIAKSQETFYYLTLKSISVG-----SKEVQYPGSDS 312
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAP 304
+GN+ ID+G TLLP +FY+ LE+ V ++I QDP+ G LCY + P
Sbjct: 313 GSGEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQTGLSLCYSATGDLKV-P 371
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
+T HFD GA V L ++ F+ E + CFA + I+GN AQ + +GYD S+
Sbjct: 372 AITMHFD-GADVNLKPSNCFVQIS-EDLVCFAFRG-SPSFSIYGNVAQMNFLVGYDTVSK 428
Query: 365 MVSFKPTDCTK 375
VSFKPTDC K
Sbjct: 429 TVSFKPTDCAK 439
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 278 bits (710), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 168/373 (45%), Positives = 224/373 (60%), Gaps = 12/373 (3%)
Query: 7 FYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI 66
F + +S V + GEY+M++S+G+PP + GIVDTGSD++W+QC PC CYKQ PI
Sbjct: 74 FVSTDSAESTVVASQGEYLMRYSVGSPPF-QVLGIVDTGSDILWLQCEPCEDCYKQTTPI 132
Query: 67 YNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
++P+ S +YK L C S C L +CSS +C Y+ Y D S + G L+ E +T G+++
Sbjct: 133 FDPSKSKTYKTLPCSSNTCESLRNTACSSDNVCEYSIDYGDGSHSDGDLSVETLTLGSTD 192
Query: 127 N---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHT 183
F V GCGHNN G F E G+VGLG +SL SQ+ S +G KFSYCL P +
Sbjct: 193 GSSVHFPKTVIGCGHNNGGTFQEEGSGIVGLGGGPVSLISQLSSSIGG-KFSYCLAPIFS 251
Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
+S+ +SK+ FG+ + VSG G VST L + +YF+TLE SVG+ N + +S
Sbjct: 252 ESNSSSKLNFGDAAVVSGRGTVSTPLDPLNGQVFYFLTLEAFSVGD--NRIEFSGSSSSG 309
Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA 303
GN+ ID+G TLLP++ Y LE V + IKL +DP LCYKT S
Sbjct: 310 SGSGDGNIIIDSGTTLTLLPQEDYLNLESAVSDVIKLERARDPSKLLSLCYKTTSDELDL 369
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG-IFGNFAQSDLFIGYDFD 362
P++TAHF GA V L STF+P +GV CFA I +G IFGN AQ +L +GYD
Sbjct: 370 PVITAHFK-GADVELNPISTFVPVE-KGVVCFAF--ISSKIGAIFGNLAQQNLLVGYDLV 425
Query: 363 SQMVSFKPTDCTK 375
+ VSFKPTDCTK
Sbjct: 426 KKTVSFKPTDCTK 438
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 165/370 (44%), Positives = 226/370 (61%), Gaps = 19/370 (5%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V+S + GEY+M S+GTPP +I I DTGSDL+W QC PC +CYKQ+ P+++P SS
Sbjct: 82 VESEIIANGGEYLMSLSLGTPPF-EILAIADTGSDLIWTQCTPCDKCYKQIAPLFDPKSS 140
Query: 73 SSYKELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---F 128
+Y++LSC + QC L ++ SCSS+QLC Y+Y Y D S T G LA + +T ++N +
Sbjct: 141 KTYRDLSCDTRQCQNLGESSSCSSEQLCQYSYYYGDRSFTNGNLAVDTVTLPSTNGGPVY 200
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS-I 187
F V GCG N G F++ + G++GLG +SL SQ+ S +G KFSYCLVPF ++S+
Sbjct: 201 FPKTVIGCGRRNNGTFDKKDSGIIGLGGGPMSLISQMGSSVGG-KFSYCLVPFSSESAGN 259
Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
+SK++FG + VSG GV ST L+SK T+Y++TLE +SVG+ K I + SS S
Sbjct: 260 SSKLHFGRNAVVSGSGVQSTPLISKNPDTFYYLTLEAMSVGD-----KKIEFGGSSFGGS 314
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNA-IKLTPYQDPRLGSQLCYK-TPSMAGIAPI 305
+GN+ ID+G TL P +F+ V NA I QD CY+ TP + P+
Sbjct: 315 EGNIIIDSGTSLTLFPVNFFTEFATAVENAVINGERTQDASGLLSHCYRPTPDLK--VPV 372
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
+TAHF+ GA V L +TFI + V C A IFGN AQ + IGYD +
Sbjct: 373 ITAHFN-GADVVLQTLNTFILIS-DDVLCLAFNSTQSG-AIFGNVAQMNFLIGYDIQGKS 429
Query: 366 VSFKPTDCTK 375
VSFKPTDCT+
Sbjct: 430 VSFKPTDCTQ 439
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 159/385 (41%), Positives = 225/385 (58%), Gaps = 20/385 (5%)
Query: 1 MSPATYFYP---NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV 57
++ A +FY N+ QS V GEY+M +S+GTPP +YGIVDTGSD++W+QC PC
Sbjct: 61 INRANHFYKYSLANIPQSTVIPDIGEYLMTYSVGTPPF-KLYGIVDTGSDIVWLQCEPCQ 119
Query: 58 QCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLAT 117
+CY Q P++NP+ SSSYK + C S+ C ++ SC+ + C Y+ Y D+S + G L+
Sbjct: 120 ECYNQTTPMFNPSKSSSYKNIPCPSKLCQSMEDTSCNDKNYCEYSTYYGDNSHSGGDLSV 179
Query: 118 ERITFGNSNNF---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKF 174
+ +T ++N F N+V GCG NN + G+VG G S +Q+ S G KF
Sbjct: 180 DTLTLESTNGLTVSFPNIVIGCGTNNILSYEGASSGIVGFGSGPASFITQLGSSTGG-KF 238
Query: 175 SYCLVPFHTDSSI----TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNL 230
SYCL P + ++I TSK+ FG+ + VSG GVV+T ++ K+ +T+Y++TLE SVGN
Sbjct: 239 SYCLTPLFSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGNR 298
Query: 231 SNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS 290
+P ++GN+ ID+G T L KD Y+ LE V + +KL DP
Sbjct: 299 RVEIGGVP-----NGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVKLERVDDPTQTL 353
Query: 291 QLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNF 350
LCY + PI+T HF GA V L STF+ +GVFC A + D IFGN
Sbjct: 354 NLCYSVKAEGYDFPIITMHFK-GADVDLHPISTFV-SVADGVFCLAFES-SQDHAIFGNL 410
Query: 351 AQSDLFIGYDFDSQMVSFKPTDCTK 375
AQ +L +GYD ++VSFKP+DCTK
Sbjct: 411 AQQNLMVGYDLQQKIVSFKPSDCTK 435
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 164/370 (44%), Positives = 228/370 (61%), Gaps = 11/370 (2%)
Query: 11 NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
N +QS+V + G Y+M S+GTPP+ + GI DTGSDL+W QCLPC CY+QV+P+++P
Sbjct: 81 NDIQSDVISGGGAYLMNISLGTPPV-PMLGIADTGSDLIWRQCLPCPNCYEQVEPLFDPK 139
Query: 71 SSSSYKELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-- 127
S +YK L C +E C L SC C Y+Y Y D S T+G L+++ +T G++
Sbjct: 140 ESETYKTLDCDNEFCQDLGQQGSCDDDNTCTYSYSYGDRSYTRGDLSSDTLTIGSTEGDP 199
Query: 128 -FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
F + FGCGH+N G FNE + GL+GLG LSL Q+ S++G +FSYCLVP +DS+
Sbjct: 200 ASFPGIAFGCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVGG-QFSYCLVPLSSDST 258
Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS-GA 245
++SK+ FG VSG G VST L+ T+Y++TLEG+SVG+ + + K SS A
Sbjct: 259 VSSKINFGKSGVVSGSGTVSTPLIKGTPDTFYYLTLEGLSVGSETVAFKGFSENKSSPAA 318
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPI 305
+ +GN+ ID+G TLLP+DFY +E + NAI DP LCY + + I P
Sbjct: 319 VEEGNIIIDSGTTLTLLPQDFYTDVESALTNAIGGQTTTDPNGIFSLCYSSVNNLEI-PT 377
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
+TAHF GA V L +TF+ E + CF+M P ++ IFGN AQ + +GYD +
Sbjct: 378 ITAHFT-GADVQLPPLNTFVQVQ-EDLVCFSMIP-SSNLAIFGNLAQINFLVGYDLKNNK 434
Query: 366 VSFKPTDCTK 375
VSFK TDCT+
Sbjct: 435 VSFKQTDCTE 444
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 156/386 (40%), Positives = 231/386 (59%), Gaps = 23/386 (5%)
Query: 1 MSPATYFY-PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC 59
++ A +F+ + ++ ++ +GEY++ +S+G PP +YGI+DTGSD++W+QC PC +C
Sbjct: 62 VNRANHFHKAHKAAKATITQNDGEYLISYSVGIPPF-QLYGIIDTGSDMIWLQCKPCEKC 120
Query: 60 YKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSS--QQLCNYTYGYADSSLTKGVLAT 117
Y Q I++P+ S++YK L S C ++ SCSS +++C YT Y D S ++G L+
Sbjct: 121 YNQTTRIFDPSKSNTYKILPFSSTTCQSVEDTSCSSDNRKMCEYTIYYGDGSYSQGDLSV 180
Query: 118 ERITFGNSNNF---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGA--N 172
E +T G++N F V GCG NNT F G+VGLG +SL +Q+ + +
Sbjct: 181 ETLTLGSTNGSSVKFRRTVIGCGRNNTVSFEGKSSGIVGLGNGPVSLINQLRRRSSSIGR 240
Query: 173 KFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSN 232
KFSYCL S+I+SK+ FG+ + VSG G VST +V+ + K +Y++TLE SVGN
Sbjct: 241 KFSYCLASM---SNISSKLNFGDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGN--- 294
Query: 233 SSKLIPYYNSSGAI-SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ 291
I + +SS KGN+ ID+G TLLP D Y++LE V + ++L +DP
Sbjct: 295 --NRIEFTSSSFRFGEKGNIIIDSGTTLTLLPNDIYSKLESAVADLVELDRVKDPLKQLS 352
Query: 292 LCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG-IFGNF 350
LCY++ AP++ AHF GA V L +TFI +GV C A I +G IFGN
Sbjct: 353 LCYRSTFDELNAPVIMAHF-SGADVKLNAVNTFIEVE-QGVTCLAF--ISSKIGPIFGNM 408
Query: 351 AQSDLFIGYDFDSQMVSFKPTDCTKQ 376
AQ + +GYD ++VSFKPTDC+KQ
Sbjct: 409 AQQNFLVGYDLQKKIVSFKPTDCSKQ 434
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 272 bits (695), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 164/368 (44%), Positives = 226/368 (61%), Gaps = 17/368 (4%)
Query: 14 QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
Q ++++ +GEY+M SIGTPP I I DTGSDL+W QC PC CY QV P+++P +SS
Sbjct: 80 QIDLTSNSGEYLMNVSIGTPPF-PIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSS 138
Query: 74 SYKELSCQSEQCHLLDT-VSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNF--- 128
+YK++SC S QC L+ SCS+ C+Y+ Y D+S TKG +A + +T G+S+
Sbjct: 139 TYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQ 198
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
N++ GCGHNN G FN+ G+VGLG +SL Q+ + KFSYCLVP + T
Sbjct: 199 LKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDG-KFSYCLVPLTSKKDQT 257
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
SK+ FG + VSG GVVST L++K +T+Y++TL+ ISVG SK I Y S S
Sbjct: 258 SKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVG-----SKQIQYSGSDSESS 312
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILT 307
+GN+ ID+G TLLP +FY+ LE+ V ++I QDP+ G LCY + P++T
Sbjct: 313 EGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKV-PVIT 371
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
HFD GA V L ++ F+ E + CFA + I+GN AQ + +GYD S+ VS
Sbjct: 372 MHFD-GADVKLDSSNAFVQVS-EDLVCFAFRG-SPSFSIYGNVAQMNFLVGYDTVSKTVS 428
Query: 368 FKPTDCTK 375
FKPTDC K
Sbjct: 429 FKPTDCAK 436
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 164/368 (44%), Positives = 226/368 (61%), Gaps = 17/368 (4%)
Query: 14 QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
Q ++++ +GEY+M SIGTPP I I DTGSDL+W QC PC CY QV P+++P +SS
Sbjct: 80 QIDLTSNSGEYLMNVSIGTPPF-PIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSS 138
Query: 74 SYKELSCQSEQCHLLDT-VSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNF--- 128
+YK++SC S QC L+ SCS+ C+Y+ Y D+S TKG +A + +T G+S+
Sbjct: 139 TYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQ 198
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
N++ GCGHNN G FN+ G+VGLG +SL Q+ + KFSYCLVP + T
Sbjct: 199 LKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDG-KFSYCLVPLTSKKDQT 257
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
SK+ FG + VSG GVVST L++K +T+Y++TL+ ISVG SK I Y S S
Sbjct: 258 SKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVG-----SKQIQYSGSDSESS 312
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILT 307
+GN+ ID+G TLLP +FY+ LE+ V ++I QDP+ G LCY + P++T
Sbjct: 313 EGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKV-PVIT 371
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
HFD GA V L ++ F+ E + CFA + I+GN AQ + +GYD S+ VS
Sbjct: 372 MHFD-GADVKLDSSNAFVQVS-EDLVCFAFRG-SPSFSIYGNVAQMNFLVGYDTVSKTVS 428
Query: 368 FKPTDCTK 375
FKPTDC K
Sbjct: 429 FKPTDCAK 436
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 169/389 (43%), Positives = 232/389 (59%), Gaps = 27/389 (6%)
Query: 1 MSPATYFYPNNV-----VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP 55
+S A F PN+V ++ ++ GEY M+ SIGTPP+ ++ I DTGSDL+WVQC P
Sbjct: 66 ISRANRFTPNSVSAAKTLEYDIIPGGGEYFMRISIGTPPI-EVLVIADTGSDLIWVQCQP 124
Query: 56 CVQCYKQVKPIYNPASSSSYKELSCQSEQCHLL--DTVSCSSQ---QLCNYTYGYADSSL 110
C +CYKQ PI+NP SS+Y+ + C++ C+ L D +CS+ + C Y+Y Y D S
Sbjct: 125 CQECYKQKSPIFNPKQSSTYRRVLCETRYCNALNSDMRACSAHGFFKACGYSYSYGDHSF 184
Query: 111 TKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLG 170
T G LATER G++NN + FGCG++N G F+E G+VGLG LSL SQ+ +++
Sbjct: 185 TMGYLATERFIIGSTNNSIQELAFGCGNSNGGNFDEVGSGIVGLGGGSLSLISQLGTKID 244
Query: 171 ANKFSYCLVPFHTDSSIT-SKMYFGNGSEVSGGGV-VSTSLVSKEDKTYYFVTLEGISVG 228
NKFSYCLVP S+ + K+ FG+ S +SG VST LVSKE +T+Y++TLE ISVG
Sbjct: 245 -NKFSYCLVPILEKSNFSLGKIVFGDNSFISGSDTYVSTPLVSKEPETFYYLTLEAISVG 303
Query: 229 NLSNSSKLIPYYNSS--GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP 286
N + + Y NS G + KGN+ ID+G T L YN+LE + A++ DP
Sbjct: 304 N-----ERLAYENSRNDGNVEKGNIIIDSGTTLTFLDSKLYNKLELVLEKAVEGERVSDP 358
Query: 287 RLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG 345
+C++ GI PI+T HF A V L +TF E + CF M P +G +
Sbjct: 359 NGIFSICFR--DKIGIELPIITVHFT-DADVELKPINTFAKAE-EDLLCFTMIPSNG-IA 413
Query: 346 IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
IFGN AQ + +GYD D VSF PTDC+
Sbjct: 414 IFGNLAQMNFLVGYDLDKNCVSFMPTDCS 442
>gi|357481195|ref|XP_003610883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512218|gb|AES93841.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 315
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 157/306 (51%), Positives = 194/306 (63%), Gaps = 18/306 (5%)
Query: 79 SCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF-GNSNNF--FDNVVFG 135
SC S CH LDT CS ++ CNYTYGY D+SLTKGVLA + TF N+ +FG
Sbjct: 20 SCDSPLCHKLDTGVCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKLVSLSRFLFG 79
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
CGHNNTG FN++EMGL+GLG SL SQI G KFS CLVPF TD I+S+M FG
Sbjct: 80 CGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSRMSFGK 139
Query: 196 GSEVSGGGVVSTSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFID 254
GS+V G GVV+T LV +E D T YFVTL GISV + Y + I KGNM +D
Sbjct: 140 GSQVLGDGVVTTPLVQREQDMTSYFVTLLGISVEDT--------YLPMNSTIEKGNMLVD 191
Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPY-QDPRLGSQLCYKTPSMAGIAPILTAHFDGG 313
+G PP +LP+ Y+R+ +V+N + L DP LG QLCY+T + P LT HF+ G
Sbjct: 192 SGTPPNILPQQLYDRVYVEVKNNVPLELITNDPSLGPQLCYRTQTNLK-GPTLTYHFE-G 249
Query: 314 AKVPLIHTSTFIPPPVE--GVFCFAMQP-IDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
A + L TFIPP E GVFC A+ + + G++GNFAQS+ IG+D D Q+VSFK
Sbjct: 250 ANLLLTPIQTFIPPTPETKGVFCLAINNYTNSNGGVYGNFAQSNYLIGFDLDRQVVSFKA 309
Query: 371 TDCTKQ 376
TDCTKQ
Sbjct: 310 TDCTKQ 315
>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 445
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 163/371 (43%), Positives = 221/371 (59%), Gaps = 11/371 (2%)
Query: 11 NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
N +QSNV + G Y+M S+GTPP+ + GI DTGSDL+W QCLPC CYKQV+P+++P
Sbjct: 81 NDIQSNVISGGGSYLMNISLGTPPV-SMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPK 139
Query: 71 SSSSYKELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-- 127
S +YK L C ++ C L SC C +Y Y D S T+ L++E T G++
Sbjct: 140 KSKTYKTLGCNNDFCQDLGQQGSCGDDNTCTSSYSYGDQSYTRRDLSSETFTIGSTEGDP 199
Query: 128 -FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
F + FGCGH+N G FNE + GL+GLG LSL Q+ S++G +FSYCLVP +DS+
Sbjct: 200 ASFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGG-QFSYCLVPLSSDST 258
Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS-GA 245
+SK+ FG + VSG G VST L+ T+Y++TLEG+S+G+ + K SS A
Sbjct: 259 ASSKINFGKSAVVSGSGTVSTPLIKGTPDTFYYLTLEGMSLGSEKVAFKGFSKNKSSPAA 318
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPI 305
+ N+ ID+G TLLP+DFY +E + I DPR LCY I P
Sbjct: 319 AEESNIIIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCYSGVKKLEI-PT 377
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
+TAHF GA V L +TF+ E + CF+M P ++ IFGN +Q + +GYD +
Sbjct: 378 ITAHFI-GADVQLPPLNTFVQAQ-EDLVCFSMIP-SSNLAIFGNLSQMNFLVGYDLKNNK 434
Query: 366 VSFKPTDCTKQ 376
VSFKPTDCTKQ
Sbjct: 435 VSFKPTDCTKQ 445
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 160/375 (42%), Positives = 217/375 (57%), Gaps = 18/375 (4%)
Query: 7 FYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI 66
F N ++ V +A GEY++ +S+GTP L ++GI+DTGSD++W+QC PC +CY+Q PI
Sbjct: 72 FVSPNSPETTVISALGEYLISYSVGTPSL-QVFGILDTGSDIIWLQCQPCKKCYEQTTPI 130
Query: 67 YNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
++ + S +YK L C S C + CSS++ C Y+ Y D S + G L+ E +T G++N
Sbjct: 131 FDSSKSQTYKTLPCPSNTCQSVQGTFCSSRKHCLYSIHYVDGSQSLGDLSVETLTLGSTN 190
Query: 127 NF---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHT 183
F V GCG N E G+VGLGR +SL +Q+ G KFSYCLVP
Sbjct: 191 GSPVQFPGTVIGCGRYNAIGIEEKNSGIVGLGRGPMSLITQLSPSTGG-KFSYCLVP--G 247
Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
S+ +SK+ FGN + VSG G VST L SK +YF+TLE SVG N + + S
Sbjct: 248 LSTASSKLNFGNAAVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGR--NRIE----FGSP 301
Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK-TP-SMAG 301
G+ KGN+ ID+G T LP Y++LE V + L +DP LCYK TP +
Sbjct: 302 GSGGKGNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQRVRDPNQVLGLCYKVTPDKLDA 361
Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
P++TAHF GA V L +TF+ + V CFA QP + +FGN AQ +L +GYD
Sbjct: 362 SVPVITAHFS-GADVTLNAINTFV-QVADDVVCFAFQPTETG-AVFGNLAQQNLLVGYDL 418
Query: 362 DSQMVSFKPTDCTKQ 376
VSFK TDCTKQ
Sbjct: 419 QMNTVSFKHTDCTKQ 433
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 147/367 (40%), Positives = 218/367 (59%), Gaps = 24/367 (6%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+QS++ +GEY+M SIGTPP+ D GI DTGSDL W QCLPC++CY+Q++PI+NP S
Sbjct: 81 LQSSIGPGSGEYLMSVSIGTPPV-DYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKS 139
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
+S+ + C ++ CH +D C Q +C+Y+Y Y D + +KG L E+IT G+S+
Sbjct: 140 TSFSHVPCNTQTCHAVDDGHCGVQGVCDYSYTYGDRTYSKGDLGFEKITIGSSSV---KS 196
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLG-ANKFSYCLVPFHTDSSITSKM 191
V GCGH ++G F G++GLG +LSL SQ+ G + +FSYCL + ++ K+
Sbjct: 197 VIGCGHASSGGFGFAS-GVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHAN--GKI 253
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
FG + VSG GVVST L+SK TYY++TLE IS+GN + + +GN+
Sbjct: 254 NFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNERHMA----------FAKQGNV 303
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA---PILTA 308
ID+G T+LPK+ Y+ + + +K +DP LC+ A + P++TA
Sbjct: 304 IIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDGINAAASLGIPVITA 363
Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID--GDVGIFGNFAQSDLFIGYDFDSQMV 366
HF GGA V L+ +TF + V C ++ + GI GN AQ++ IGYD +++ +
Sbjct: 364 HFSGGANVNLLPINTF-RKVADNVNCLTLKAASPTTEFGIIGNLAQANFLIGYDLEAKRL 422
Query: 367 SFKPTDC 373
SFKPT C
Sbjct: 423 SFKPTVC 429
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 150/365 (41%), Positives = 211/365 (57%), Gaps = 32/365 (8%)
Query: 14 QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
QS V++ GEY+M +SIGTPP ++G VDTGSDL+W+QC PC QCY Q+ PI++P+ SS
Sbjct: 78 QSTVNSDKGEYLMSYSIGTPPF-KVFGFVDTGSDLVWLQCEPCKQCYPQITPIFDPSLSS 136
Query: 74 SYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF---FD 130
SY+ + C S+ CH + T SC +G L+ E +T ++ + F
Sbjct: 137 SYQNIPCLSDTCHSMRTTSCD----------------VRGYLSVETLTLDSTTGYSVSFP 180
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
+ GCG+ NTG F+ G+VGLG +SL SQ+ + +G KFSYCL P+ +S TSK
Sbjct: 181 KTMIGCGYRNTGTFHGPSSGIVGLGSGPMSLPSQLGTSIGG-KFSYCLGPWLPNS--TSK 237
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
+ FG+ + V G G ++T +V K+ ++ Y++TLE SVGN KLI + + ++GN
Sbjct: 238 LNFGDAAIVYGDGAMTTPIVKKDAQSGYYLTLEAFSVGN-----KLIEFGGPTYGGNEGN 292
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHF 310
+ ID+G T LP D Y R E V I L +DP +LCY AP++TAHF
Sbjct: 293 ILIDSGTTFTFLPYDVYYRFESAVAEYINLEHVEDPNGTFKLCYNVAYHGFEAPLITAHF 352
Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
GA + L + STFI +G+ C A P IFGN AQ +L +GY+ V+FKP
Sbjct: 353 K-GADIKLYYISTFIKVS-DGIACLAFIP--SQTAIFGNVAQQNLLVGYNLVQNTVTFKP 408
Query: 371 TDCTK 375
DCTK
Sbjct: 409 VDCTK 413
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 260 bits (664), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 167/373 (44%), Positives = 225/373 (60%), Gaps = 16/373 (4%)
Query: 11 NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
N +S V + GEY+M +S+GTPP I GIVDTGSD++W+QC PC CY Q PI++P+
Sbjct: 81 NTAESTVIASQGEYLMSYSVGTPPF-QILGIVDTGSDIIWLQCQPCEDCYNQTTPIFDPS 139
Query: 71 SSSSYKELSCQSEQCHLLDT-VSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
S +YK L C S C + + SCSS C YT Y D+S ++G L+ E +T G+++
Sbjct: 140 QSKTYKTLPCSSNICQSVQSAASCSSNNDECEYTITYGDNSHSQGDLSVETLTLGSTDGS 199
Query: 129 ---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
F V GCGHNN G F G+VGLG +SL SQ+ S +G KFSYCL P + S
Sbjct: 200 SVQFPKTVIGCGHNNKGTFQREGSGIVGLGGGPVSLISQLSSSIGG-KFSYCLAPLFSQS 258
Query: 186 SITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
+ +SK+ FG+ + VSG G VST +V K +YF+TLE SVG ++ + +
Sbjct: 259 NSSSKLNFGDEAVVSGRGTVSTPIVPKNGLGFYFLTLEAFSVG---DNRIEFGSSSFESS 315
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-AP 304
+GN+ ID+G T+LP+D Y LE V +AI+L +DP +LCY+T S + P
Sbjct: 316 GGEGNIIIDSGTTLTILPEDDYLNLESAVADAIELERVEDPSKFLRLCYRTTSSDELNVP 375
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG-IFGNFAQSDLFIGYDFDS 363
++TAHF GA V L STFI EGV CFA + +G IFGN AQ +L +GYD
Sbjct: 376 VITAHFK-GADVELNPISTFIEVD-EGVVCFAFR--SSKIGPIFGNLAQQNLLVGYDLVK 431
Query: 364 QMVSFKPTDCTKQ 376
Q VSFKPTDCT++
Sbjct: 432 QTVSFKPTDCTQE 444
>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 450
Score = 258 bits (658), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 170/383 (44%), Positives = 231/383 (60%), Gaps = 21/383 (5%)
Query: 5 TYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK 64
++ N +S V + GEY+M +S+GTPP +I G+VDTGS + W+QC C CY+Q
Sbjct: 78 SFVASTNTAESTVKASQGEYLMSYSVGTPPF-EILGVVDTGSGITWMQCQRCEDCYEQTT 136
Query: 65 PIYNPASSSSYKELSCQSEQCH-LLDTVSCSSQQL-CNYTYGYADSSLTKGVLATERITF 122
PI++P+ S +YK L C S C ++ T SCSS ++ C YT Y D S ++G L+ E +T
Sbjct: 137 PIFDPSKSKTYKTLPCSSNMCQSVISTPSCSSDKIGCKYTIKYGDGSHSQGDLSVETLTL 196
Query: 123 GNSNNF---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
G++N F N V GCGHNN G F G+VGLG +SL SQ+ S +G KFSYCL
Sbjct: 197 GSTNGSSVQFPNTVIGCGHNNKGTFQGEGSGVVGLGGGPVSLISQLSSSIGG-KFSYCLA 255
Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIP 238
P + S+ +SK+ FG+ + VSG G VST LVSK + +Y++TLE SVG+ K I
Sbjct: 256 PMFSQSNSSSKLNFGDAAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGD-----KRIE 310
Query: 239 YY----NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY 294
+ +S + +GN+ ID+G TLLP++ Y+ LE V +AI+ DP LCY
Sbjct: 311 FVGGSSSSGSSNGEGNIIIDSGTTLTLLPQEDYSNLESAVADAIQANRVSDPSNFLSLCY 370
Query: 295 K-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQS 353
+ TPS P++TAHF GA V L STF+ EGV CFA + V IFGN AQ
Sbjct: 371 QTTPSGQLDVPVITAHFK-GADVELNPISTFV-QVAEGVVCFAFHSSEV-VSIFGNLAQL 427
Query: 354 DLFIGYDFDSQMVSFKPTDCTKQ 376
+L +GYD Q VSFKPTDCT++
Sbjct: 428 NLLVGYDLMEQTVSFKPTDCTQE 450
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 155/373 (41%), Positives = 218/373 (58%), Gaps = 20/373 (5%)
Query: 10 NNVVQSNVSTAN-GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
+N V+S V+ + G+Y+M +S+GTPP +YGIVDT SD++WVQC C CY P+++
Sbjct: 73 SNAVESPVTLLDDGDYLMSYSLGTPPF-PVYGIVDTASDIIWVQCQLCETCYNDTSPMFD 131
Query: 69 PASSSSYKELSCQSEQCHLLDTVSCSS--QQLCNYTYGYADSSLTKGVLATERITFGNSN 126
P+ S +YK L C S C + SCSS +++C +T Y D S ++G L E +T G+ N
Sbjct: 132 PSYSKTYKNLPCSSTTCKSVQGTSCSSDERKICEHTVNYKDGSHSQGDLIVETVTLGSYN 191
Query: 127 N---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHT 183
+ F V GC N F + +G+VGLG +SL Q+ S + + KFSYCL P
Sbjct: 192 DPFVHFPRTVIGCIRNTNVSF--DSIGIVGLGGGPVSLVPQLSSSI-SKKFSYCLAPI-- 246
Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
S +SK+ FG+ + VSG G VST +V K+ K +Y++TLE SVGN +++ +SS
Sbjct: 247 -SDRSSKLKFGDAAMVSGDGTVSTRIVFKDWKKFYYLTLEAFSVGN----NRIEFRSSSS 301
Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA 303
+ KGN+ ID+G T+LP D Y++LE V + +KL +DP LCYK+
Sbjct: 302 RSSGKGNIIIDSGTTFTVLPDDVYSKLESAVADVVKLERAEDPLKQFSLCYKSTYDKVDV 361
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
P++TAHF GA V L +TFI V C A IFGN AQ + +GYD
Sbjct: 362 PVITAHF-SGADVKLNALNTFIVAS-HRVVCLAFLSSQSG-AIFGNLAQQNFLVGYDLQR 418
Query: 364 QMVSFKPTDCTKQ 376
++VSFKPTDCTKQ
Sbjct: 419 KIVSFKPTDCTKQ 431
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 161/384 (41%), Positives = 223/384 (58%), Gaps = 25/384 (6%)
Query: 14 QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
Q+++ + GEY+M SIGTPP I I DTGSDL W+Q PC QCY Q PI++P++S+
Sbjct: 70 QTDLLPSGGEYMMNLSIGTPPF-PILAIADTGSDLTWLQSKPCDQCYPQKGPIFDPSNST 128
Query: 74 SYKELSCQSEQCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
++ +L C + C+ LD SC+ C YTY Y D S T G LA++ +T GN++ N
Sbjct: 129 TFHKLPCTTAPCNALDESARSCTDPTTCGYTYSYGDHSYTTGYLASDTVTVGNASVQIRN 188
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH-------TD 184
V FGCG N G F+E G+VGLG LS SQ+ +G KFSYCL+P +D
Sbjct: 189 VAFGCGTRNGGNFDEQGSGIVGLGGGNLSFVSQLGDTIG-KKFSYCLLPLENEISSQPSD 247
Query: 185 SSITSKMYFGNG---SEVSGGGVV--STSLVSKEDKTYYFVTLEGISVGN-----LSNSS 234
S TS++ FG+ S S GVV +T LV+KE TYY++T+E I+VG S+SS
Sbjct: 248 SPATSRIVFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRKKLLYSSSSS 307
Query: 235 KLIPYYN-SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QL 292
K Y + S ++ +GN+ ID+G T L ++FY LE + IK+ D + L
Sbjct: 308 KTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMERVNDVKNSMFSL 367
Query: 293 CYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQ 352
C+K+ P++ HF GGA V L +TF+ EG+ CF M P + DVGI+GN AQ
Sbjct: 368 CFKSGKEEVELPLMKVHFRGGADVELKPVNTFVRAE-EGLVCFTMLPTN-DVGIYGNLAQ 425
Query: 353 SDLFIGYDFDSQMVSFKPTDCTKQ 376
+ +GYD + VSF P DC+KQ
Sbjct: 426 MNFVVGYDLGKRTVSFLPADCSKQ 449
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 159/386 (41%), Positives = 212/386 (54%), Gaps = 42/386 (10%)
Query: 1 MSPATYFYPN---NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV 57
++ A +FY N QS V +GEY+M +S+GTPP +YGI DTGSD++W+QC PC
Sbjct: 61 INRANHFYKTALTNTPQSTVIPDHGEYLMTYSVGTPPF-KLYGIADTGSDIVWLQCEPCK 119
Query: 58 QCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLAT 117
+CY Q P + P+ SS+YK + C S+ C S QQ G L+
Sbjct: 120 ECYNQTTPKFKPSKSSTYKNIPCSSDLCK-------SGQQ---------------GNLSV 157
Query: 118 ERITFGNSNNF---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKF 174
+ +T +S F V GCG +NT F G+VGLG SL +Q+ S + A KF
Sbjct: 158 DTLTLESSTGHPISFPKTVIGCGTDNTVSFEGASSGIVGLGGGPASLITQLGSSIDA-KF 216
Query: 175 SYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSS 234
SYCL+P +S+ TSK+ FG+ + VSG GVVST +V K+ +Y++TLE SVGN
Sbjct: 217 SYCLLPNPVESNTTSKLNFGDTAVVSGDGVVSTPIVKKDPIVFYYLTLEAFSVGN----- 271
Query: 235 KLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY 294
K I + SS +GN+ ID+G T++P D YN LE V +KL DP LCY
Sbjct: 272 KRIEFEGSSNGGHEGNIIIDSGTTLTVIPTDVYNNLESAVLELVKLKRVNDPTRLFNLCY 331
Query: 295 KTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP----IDGD-VGIFGN 349
S PI+T HF GA V L STF+ +G+ C A I D V IFGN
Sbjct: 332 SVTSDGYDFPIITTHFK-GADVKLHPISTFV-DVADGIVCLAFATTSAFIPSDVVSIFGN 389
Query: 350 FAQSDLFIGYDFDSQMVSFKPTDCTK 375
AQ +L +GYD ++VSFKPTDC+K
Sbjct: 390 LAQQNLLVGYDLQQKIVSFKPTDCSK 415
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 147/369 (39%), Positives = 215/369 (58%), Gaps = 26/369 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+Q+ ++ +GEY+M SIGTPP+ D G+ DTGSDLMW QCLPC++CYKQ +PI++P S
Sbjct: 81 LQAPLTPGSGEYLMSVSIGTPPV-DYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDPLKS 139
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
+S+ + C S+ C +D C +Q +C+Y+Y Y D + TKG L E+IT G+S+
Sbjct: 140 TSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKITIGSSSV---KS 196
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLG-ANKFSYCLVPFHTDSSITSKM 191
V GCGH + G++GLG +LSL SQ+ G + +FSYCL + ++ K+
Sbjct: 197 VIGCGHESG-GGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHAN--GKI 253
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
FG + VSG GVVST L+SK TYY+VTLE IS+GN + + + +GN+
Sbjct: 254 NFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNERHMA----------SAKQGNV 303
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY----KTPSMAGIAPILT 307
ID+G + LPK+ Y+ + + +K +DP LC+ + +GI PI+T
Sbjct: 304 IIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGI-PIIT 362
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID--GDVGIFGNFAQSDLFIGYDFDSQM 365
A F GGA V L+ +TF V C + P + GI GN A ++ IGYD +++
Sbjct: 363 AQFSGGANVNLLPVNTF-QKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKR 421
Query: 366 VSFKPTDCT 374
+SFKPT CT
Sbjct: 422 LSFKPTVCT 430
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 149/360 (41%), Positives = 210/360 (58%), Gaps = 21/360 (5%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
GEY++ +S+GTPP +YG +DTGS+++W+QC PC C+ Q PI+NP+ SSSYK + C
Sbjct: 87 GEYLISYSVGTPPF-KVYGFMDTGSNIVWLQCQPCNTCFNQTSPIFNPSKSSSYKNIPCT 145
Query: 82 SEQCHLLDT----VSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGN---SNNFFDNVV 133
S C DT +SCS+ +C Y+ Y + ++G L+ + +T + S+ F N+V
Sbjct: 146 SSTCK--DTNDTHISCSNGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTSGSSVLFPNIV 203
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
GCGH N N G+VG+GR +SL Q+ S +KFSYCL+P+++DS+ +SK+ F
Sbjct: 204 IGCGHINVLQDNSQSSGVVGMGRGPMSLIKQVGSSSVGSKFSYCLIPYNSDSNSSSKLIF 263
Query: 194 GNGSEVSGGGVVSTSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
G VSG VVST +V + YYF+TLE SVGN I Y S A S N+
Sbjct: 264 GEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGN-----NRIEYGERSNA-STQNIL 317
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDG 312
ID+G P T+LP F ++L V +KL + P LCY T P +TAHF+
Sbjct: 318 IDSGTPLTMLPNLFLSKLVSYVAQEVKLPRIEPPDHHLSLCYNTTGKQLNVPDITAHFN- 376
Query: 313 GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
GA V L TF P +G+ CF +G + IFGN AQ++L I YD + +++SFKPTD
Sbjct: 377 GADVKLNSNGTFFPFE-DGIMCFGFISSNG-LEIFGNIAQNNLLIDYDLEKEIISFKPTD 434
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 154/366 (42%), Positives = 207/366 (56%), Gaps = 23/366 (6%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
A YVM +SIGTPP +YG+VDTGSD +W QC PC C Q PI+NP+ SS+YK +
Sbjct: 86 AGSYYVMSYSIGTPPF-QLYGVVDTGSDGIWFQCKPCKPCLNQTSPIFNPSKSSTYKNIR 144
Query: 80 CQSEQCHLLDTVSCSS--QQLCNYTYGYADSSLTKGVLATERITFGNSNN----FFDNVV 133
C S C + CSS ++ C Y Y D S ++G ++ + +T NSN+ F +V
Sbjct: 145 CSSPICKRGEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTL-NSNDGSPISFPKIV 203
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
GCGH N+ G++G GR S+ SQ+ S +G KFSYCL + ++I+SK+YF
Sbjct: 204 IGCGHKNSLTTEGLASGIIGFGRGNFSIVSQLGSSIGG-KFSYCLASLFSKANISSKLYF 262
Query: 194 GNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN----LSNSSKLIPYYNSSGAISKG 249
G+ + VSG GVVST L+ YF LE SVG+ L +SS LIP ++G
Sbjct: 263 GDMAVVSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDSS-LIP-------DNEG 314
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAH 309
N ID+G+ T LP D Y++LE V + +KL +DP LCYKT PI+TAH
Sbjct: 315 NAVIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYKTTLKKYEVPIITAH 374
Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
F GA V L +TFI E V CFA ++GN AQ + +GYD ++SFK
Sbjct: 375 FR-GADVKLNAFNTFIQMNHE-VMCFAFNSSAFPWVVYGNIAQQNFLVGYDTLKNIISFK 432
Query: 370 PTDCTK 375
PT+CTK
Sbjct: 433 PTNCTK 438
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 152/378 (40%), Positives = 216/378 (57%), Gaps = 21/378 (5%)
Query: 7 FYPNNVVQSNVSTANGE-YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP 65
F PN V VS G+ Y++ F IGTPP +YG++DT +D +W QC PC C+ P
Sbjct: 71 FPPNKVPNIVVSPFMGDGYIISFLIGTPPF-QLYGVMDTANDNIWFQCNPCKPCFNTTSP 129
Query: 66 IYNPASSSSYKELSCQSEQCHLLDTVSCSS--QQLCNYTYGYADSSLTKGVLATERITFG 123
+++P+ SS+YK + C S +C ++ CSS +++C Y++ Y + ++G L+ + +T
Sbjct: 130 MFDPSKSSTYKTIPCSSPKCKNVENTHCSSDDKKVCEYSFTYGGEAYSQGDLSIDTLTL- 188
Query: 124 NSNN----FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
NSNN F N+V GCGH N G G +GLGR LS SQ+ S +G KFSYCLV
Sbjct: 189 NSNNDTPISFKNIVIGCGHRNKGPLEGYVSGNIGLGRGPLSFISQLNSSIGG-KFSYCLV 247
Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPY 239
P ++ I+ K++FG+ S VSG G VST + + E Y TL +SVG+ +I +
Sbjct: 248 PLFSNEGISGKLHFGDKSVVSGVGTVSTPITAGE--IGYSTTLNALSVGD-----HIIKF 300
Query: 240 YNSSGAISK-GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS 298
NS+ GN ID+G T+LP++ Y+RLE V + +KL + P +LCYK
Sbjct: 301 ENSTSKNDNLGNTIIDSGTTLTILPENVYSRLESIVTSMVKLERAKSPNQQFKLCYKATL 360
Query: 299 MAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG-IFGNFAQSDLFI 357
PI+TAHF+ GA V L +TF P E V CFA + G I GN AQ + +
Sbjct: 361 KNLDVPIITAHFN-GADVHLNSLNTFYPIDHE-VVCFAFVSVGNFPGTIIGNIAQQNFLV 418
Query: 358 GYDFDSQMVSFKPTDCTK 375
G+D ++SFKPTDCTK
Sbjct: 419 GFDLQKNIISFKPTDCTK 436
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 141/352 (40%), Positives = 205/352 (58%), Gaps = 26/352 (7%)
Query: 30 IGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD 89
IGTPP+ D GI DTGSDL W QCLPC++CY+Q++PI+NP S+S+ + C ++ CH +D
Sbjct: 86 IGTPPV-DYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVD 144
Query: 90 TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEM 149
C Q +C+Y+Y Y D + +KG L E+IT G+S+ V GCGH ++G F
Sbjct: 145 DGHCGVQGVCDYSYTYGDRTYSKGDLGFEKITIGSSS---VKSVIGCGHASSGGFGFAS- 200
Query: 150 GLVGLGRTRLSLASQILSQLG-ANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTS 208
G++GLG +LSL SQ+ G + +FSYCL + ++ K+ FG + VSG GVVST
Sbjct: 201 GVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHAN--GKINFGQNAVVSGPGVVSTP 258
Query: 209 LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYN 268
L+SK TYY++TLE IS+GN + + +GN+ ID+G + LPK+ Y+
Sbjct: 259 LISKNTVTYYYITLEAISIGNERHMA----------FAKQGNVIIDSGTTLSFLPKELYD 308
Query: 269 RLEEQVRNAIKLTPYQDPRLGSQLCY----KTPSMAGIAPILTAHFDGGAKVPLIHTSTF 324
+ + +K +DP LC+ + +GI PI+TA F GGA V L+ +TF
Sbjct: 309 GVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGI-PIITAQFSGGANVNLLPVNTF 367
Query: 325 IPPPVEGVFCFAMQPID--GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
V C + P + GI GN A ++ IGYD +++ +SFKPT CT
Sbjct: 368 -QKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVCT 418
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 155/370 (41%), Positives = 216/370 (58%), Gaps = 21/370 (5%)
Query: 11 NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
N Q++++ GEY+M S+GTPP I + DTGS+L+W QC PC CY QV P+++P
Sbjct: 81 NSPQTDITPCGGEYLMNLSLGTPPS-PIMAVADTGSNLIWTQCKPCDDCYTQVDPLFDPK 139
Query: 71 SSSSYKELSCQSEQCHLLDT-VSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
+SS+YK++SC S QC L+ SCS++ + C+Y YAD S T G A + +T G+++N
Sbjct: 140 ASSTYKDVSCSSSQCTALENQASCSTEDKTCSYLVSYADGSYTMGKFAVDTLTLGSTDNR 199
Query: 129 ---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
N++ GCG NN F G+VGLG +SL Q+ + KFSYCLVP ++
Sbjct: 200 PVQLKNIIIGCGQNNAVTFRNKSSGVVGLGGGAVSLIKQLGDSIDG-KFSYCLVP---EN 255
Query: 186 SITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
TSK+ FG + VSG G VST LV K T+Y++TL+ ISVG SK + +S+
Sbjct: 256 DQTSKINFGTNAVVSGPGTVSTPLVVKSRDTFYYLTLKSISVG-----SKNMQTPDSN-- 308
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPI 305
KGNM ID+G TLLP +Y +E V + I +D R+GS LCY + I P+
Sbjct: 309 -IKGNMVIDSGTTLTLLPVKYYIEIENAVASLINADKSKDERIGSSLCYNATADLNI-PV 366
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
+T HF+ GA V L ++F E + C A GI+GN AQ + +GYD S+
Sbjct: 367 ITMHFE-GADVKLYPYNSFF-KVTEDLVCLAFGMSFYRNGIYGNVAQKNFLVGYDTASKT 424
Query: 366 VSFKPTDCTK 375
+SFKPTDC K
Sbjct: 425 MSFKPTDCAK 434
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 243 bits (621), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 146/370 (39%), Positives = 209/370 (56%), Gaps = 18/370 (4%)
Query: 14 QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
+S V + G+Y+M +S+GTPP+ YGIVDTGSD++W+QC PC QCY Q P +NP+ SS
Sbjct: 77 ESTVISYEGDYIMSYSVGTPPIKS-YGIVDTGSDIVWLQCEPCEQCYNQTTPKFNPSKSS 135
Query: 74 SYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF---FD 130
SYK +SC S+ C + SC+ ++ C Y+ Y + S ++G L+ E +T ++ F
Sbjct: 136 SYKNISCSSKLCQSVRDTSCNDKKNCEYSINYGNQSHSQGDLSLETLTLESTTGRPVSFP 195
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD----SS 186
V GCG NN G F G+VGLG SL +Q+ +G KFSYCLV S
Sbjct: 196 KTVIGCGTNNIGSFKRVSSGVVGLGGGPASLITQLGPSIGG-KFSYCLVRMSITLKNMSM 254
Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
+SK+ FG+ + VSG V+ST +V K+ +Y++T+E SVG+ K + + SS +
Sbjct: 255 GSSKLNFGDVAIVSGHNVLSTPIVKKDHSFFYYLTIEAFSVGD-----KRVEFAGSSKGV 309
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PI 305
+GN+ ID+ T +P D Y +L + + + L DP LCY S P
Sbjct: 310 EEGNIIIDSSTIVTFVPSDVYTKLNSAIVDLVTLERVDDPNQQFSLCYNVSSDEEYDFPY 369
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
+TAHF GA + L T+TF+ V CFA P +G IFG+F+Q D +GYD +
Sbjct: 370 MTAHFK-GADILLYATNTFV-EVARDVLCFAFAPSNGG-AIFGSFSQQDFMVGYDLQQKT 426
Query: 366 VSFKPTDCTK 375
VSFK DCT+
Sbjct: 427 VSFKSVDCTE 436
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 241 bits (616), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 148/364 (40%), Positives = 206/364 (56%), Gaps = 12/364 (3%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
++S + +GE++M IGTPP+ ++ I DTGSDL W QCLPC +C+ Q +PI+NP S
Sbjct: 79 IRSPIIPDSGEFLMSIFIGTPPV-NVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRS 137
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
SSY+++SC S+ C L++ C Q C+Y Y Y D S T G LA+++IT G+
Sbjct: 138 SSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGSFK--LPK 195
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGAN-KFSYCLVPFHTDSSITSK 190
V GCGH N G F G++GLG LSL SQ+ + G +FSYCL F ++++IT
Sbjct: 196 TVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGT 255
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
+ FG + VSG VVST LV + T+YF+TLE ISVG + S + GN
Sbjct: 256 ISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGK----KRFKAANGISAMTNHGN 311
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAH 309
+ ID+G TLLP+ Y + + IK DP +LCY + + PI+TAH
Sbjct: 312 IIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAH 371
Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
F GGA V L+ +TF P + V C P V IFGN AQ + +GYD ++ +SF+
Sbjct: 372 FAGGADVKLLPVNTF-APVADNVTCLTFAPAT-QVAIFGNLAQINFEVGYDLGNKRLSFE 429
Query: 370 PTDC 373
P C
Sbjct: 430 PKLC 433
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 142/358 (39%), Positives = 202/358 (56%), Gaps = 38/358 (10%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+QS + + GEY+M IGTPP+ + IVDTGSDL W QC PC CYKQV P+++P +S
Sbjct: 81 IQSRIVPSAGEYLMNLYIGTPPV-PVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNS 139
Query: 73 SSYKELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---F 128
S+Y++ SC + C L SCS ++ C + Y YAD S T G LA+E +T ++
Sbjct: 140 STYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKPVS 199
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
F FGCGH++ G+F+++ G+VGLG LSL SQ+ S + FSYCL+P TDSSI+
Sbjct: 200 FPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTING-LFSYCLLPVSTDSSIS 258
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPY--YNSSGAI 246
S++ FG VSG G VST L +PY Y+ +
Sbjct: 259 SRINFGASGRVSGYGTVSTPL--------------------------RLPYKGYSKKTEV 292
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPIL 306
+GN+ +D+G T LP++FY++LE+ V N+IK +DP LCY T + API+
Sbjct: 293 EEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTAEIN-APII 351
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
TAHF A V L +TF+ E + CF + P D+G+ GN AQ + +G+D +
Sbjct: 352 TAHFK-DANVELQPLNTFMRMQ-EDLVCFTVAPTS-DIGVLGNLAQVNFLVGFDLRKK 406
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 85/155 (54%), Gaps = 10/155 (6%)
Query: 227 VGNLSNSSKLIPY-------YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK 279
+GNL+ + L+ + ++ + +GN+ +D+G T LP +FY +LEE V ++IK
Sbjct: 389 LGNLAQVNFLVGFDLRKKRGFSKKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIK 448
Query: 280 LTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP 339
+DP S LCY T API+TAHF A V L +TF+ E + CF + P
Sbjct: 449 GKRVRDPNGISSLCYNTTVDQIDAPIITAHFK-DANVELQPWNTFLRMQ-EDLVCFTVLP 506
Query: 340 IDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
D+GI GN AQ + +G+D + VSFK DCT
Sbjct: 507 TS-DIGILGNLAQVNFLVGFDLRKKRVSFKAADCT 540
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 240 bits (612), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 136/371 (36%), Positives = 211/371 (56%), Gaps = 18/371 (4%)
Query: 11 NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
N V++ + GEY+MK S+GTPP I + DTGSD++W QC+PC CY+Q P++NP+
Sbjct: 72 NTVEAPIYNNRGEYLMKLSVGTPPF-PIIAVADTGSDIIWTQCVPCTNCYQQDLPMFNPS 130
Query: 71 SSSSYKELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-- 127
S++Y+++SC S C + SCS + C Y+ Y D+S ++G A + +T G+++
Sbjct: 131 KSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRV 190
Query: 128 -FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
F GCGH+N G F+ N G+VGLG SL Q+ S +G KFSYCL P D
Sbjct: 191 VAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGG-KFSYCLTPIGNDDG 249
Query: 187 ITSKMYFGNGSEVSGGGVVSTSL-VSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
++K+ FG+ + VSG G VST + +S + K++Y + L+ +SVG + +Y+++ +
Sbjct: 250 GSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNT------FYSTANS 303
Query: 246 I--SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA 303
I K N+ ID+G TLLP D Y+ + + N+I L DP + C++T +
Sbjct: 304 ILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQFLEYCFETTTDDYKV 363
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQSDLFIGYDFD 362
P + HF+ GA + L + I + V C A D D+ I+GN AQ + +GYD
Sbjct: 364 PFIAMHFE-GANLRLQRENVLIRVS-DNVICLAFAGAQDNDISIYGNIAQINFLVGYDVT 421
Query: 363 SQMVSFKPTDC 373
+ +SFKP +C
Sbjct: 422 NMSLSFKPMNC 432
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 136/371 (36%), Positives = 210/371 (56%), Gaps = 18/371 (4%)
Query: 11 NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
N V++ + GEY+MK S+GTPP I + DTGSD++W QC PC CY+Q P++NP+
Sbjct: 72 NTVEAPIYNNRGEYLMKLSVGTPPF-PIIAVADTGSDIIWTQCEPCTNCYQQDLPMFNPS 130
Query: 71 SSSSYKELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-- 127
S++Y+++SC S C + SCS + C Y+ Y D+S ++G A + +T G+++
Sbjct: 131 KSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRV 190
Query: 128 -FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
F GCGH+N G F+ N G+VGLG SL Q+ S +G KFSYCL P D
Sbjct: 191 VAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGG-KFSYCLTPIGNDDG 249
Query: 187 ITSKMYFGNGSEVSGGGVVSTSL-VSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
++K+ FG+ + VSG G VST + +S + K++Y + L+ +SVG + +Y+++ +
Sbjct: 250 GSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNT------FYSTANS 303
Query: 246 I--SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA 303
I K N+ ID+G TLLP D Y+ + + N+I L DP + C++T +
Sbjct: 304 ILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQFLEYCFETTTDDYKV 363
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQSDLFIGYDFD 362
P + HF+ GA + L + I + V C A D D+ I+GN AQ + +GYD
Sbjct: 364 PFIAMHFE-GANLRLQRENVLIRVS-DNVICLAFAGAQDNDISIYGNIAQINFLVGYDVT 421
Query: 363 SQMVSFKPTDC 373
+ +SFKP +C
Sbjct: 422 NMSLSFKPMNC 432
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 123/230 (53%), Positives = 160/230 (69%), Gaps = 6/230 (2%)
Query: 7 FYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI 66
F+ N +QS VS + +Y+M+ SIGTPP+ IY DTGSDL+W+QC+PC CYKQ+ P+
Sbjct: 42 FFNRNTIQSPVSANHYDYLMELSIGTPPV-KIYAQADTGSDLIWLQCIPCTNCYKQLNPM 100
Query: 67 YNPASSSSYKELSCQSEQCHLLDTVSCSSQQL-CNYTYGYADSSLTKGVLATERITFGNS 125
++ SSS++ ++C SE C L + SCS Q+ C Y Y Y D S T+GVLA E +T ++
Sbjct: 101 FDSQSSSTFSNIACGSESCSKLYSTSCSPDQINCKYNYSYVDGSETQGVLAQETLTLTST 160
Query: 126 NN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
F V+FGCGHNN G FN+ EMG++GLGR LSL SQI S LG N FS CLVPF+
Sbjct: 161 TGEPVAFKGVIFGCGHNNNGAFNDKEMGIIGLGRGPLSLVSQIGSSLGGNMFSQCLVPFN 220
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLS 231
T+ SI+S M FG GSEV G GVVST LVSK +++YFVTL GISV +++
Sbjct: 221 TNPSISSPMSFGKGSEVLGNGVVSTPLVSKTTYQSFYFVTLLGISVEDIN 270
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 158/384 (41%), Positives = 219/384 (57%), Gaps = 35/384 (9%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+QS + A+GE+ M +IGTPP+ ++ I DTGSDL WVQC PC QCYK+ PI++ S
Sbjct: 74 LQSGLIGADGEFFMSITIGTPPM-KVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKS 132
Query: 73 SSYKELSCQSEQCHLLDTVS--C-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-- 127
S+YK C S CH L + C S+ +C Y Y Y D S +KG +ATE I+ +++
Sbjct: 133 STYKSEPCDSRNCHALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSASGSP 192
Query: 128 -FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
F VFGCG+NN G F+E G++GLG LSL SQ+ S + + KFSYCL ++
Sbjct: 193 VSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSI-SKKFSYCLSHKSATTN 251
Query: 187 ITSKMYFGNGSEVSG----GGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
TS + G S S GV+ST LV KE +TYY++TLE ISVG K IPY S
Sbjct: 252 GTSVINLGTNSIPSSLSKDSGVISTPLVDKEPRTYYYLTLEAISVGK-----KKIPYTGS 306
Query: 243 S-----GAI---SKGNMFIDTGAPPTLLPKDFYNR----LEEQVRNAIKLTPYQDPRLGS 290
S G I + GN+ ID+G TLL F+++ +EE V A +++ DP+
Sbjct: 307 SYNPNDGGIFSETSGNIIIDSGTTLTLLDSGFFDKFGAAVEELVTGAKRVS---DPQGLL 363
Query: 291 QLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNF 350
C+K+ S P +T HF GA V L + F+ E + C +M P +V I+GNF
Sbjct: 364 SHCFKSGSAEIGLPEITVHFT-GADVRLSPINAFVKVS-EDMVCLSMVPTT-EVAIYGNF 420
Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
AQ D +GYD +++ VSF+ DC+
Sbjct: 421 AQMDFLVGYDLETRTVSFQRMDCS 444
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 157/384 (40%), Positives = 217/384 (56%), Gaps = 35/384 (9%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+QS + A+GE+ M +IGTPP+ ++ I DTGSDL WVQC PC QCYK+ PI++ S
Sbjct: 74 LQSGLIGADGEFFMSITIGTPPI-KVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKS 132
Query: 73 SSYKELSCQSEQCHLLDTVS--C-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-- 127
S+YK C S C L + C S +C Y Y Y D S +KG +ATE ++ +++
Sbjct: 133 STYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGSP 192
Query: 128 -FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
F VFGCG+NN G F+E G++GLG LSL SQ+ S + + KFSYCL ++
Sbjct: 193 VSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSI-SKKFSYCLSHKSATTN 251
Query: 187 ITSKMYFGNGSEVSG----GGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
TS + G S S GVVST LV KE TYY++TLE ISVG K IPY S
Sbjct: 252 GTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGK-----KKIPYTGS 306
Query: 243 S------GAISK--GNMFIDTGAPPTLLPKDFYNR----LEEQVRNAIKLTPYQDPRLGS 290
S G +S+ GN+ ID+G TLL F+++ +EE V A +++ DP+
Sbjct: 307 SYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVS---DPQGLL 363
Query: 291 QLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNF 350
C+K+ S P +T HF GA V L + F+ E + C +M P +V I+GNF
Sbjct: 364 SHCFKSGSAEIGLPEITVHFT-GADVRLSPINAFVKLS-EDMVCLSMVPTT-EVAIYGNF 420
Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
AQ D +GYD +++ VSF+ DC+
Sbjct: 421 AQMDFLVGYDLETRTVSFQHMDCS 444
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 150/381 (39%), Positives = 208/381 (54%), Gaps = 51/381 (13%)
Query: 7 FYPNNVVQSNVSTANGE-YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP 65
F PN + +S+ G YVM +SIGTPP +Y ++DTG+D +W QC PC C Q P
Sbjct: 72 FSPNKIQDVPLSSFMGAGYVMSYSIGTPPF-QLYSLIDTGNDNIWFQCKPCKPCLNQTSP 130
Query: 66 IYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNS 125
+++P+ SS+YK + C S C D G+ L + +T NS
Sbjct: 131 MFHPSKSSTYKTIPCTSPICKNAD--------------GH--------YLGVDTLTL-NS 167
Query: 126 NN----FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
NN F N+V GCGH N G G +GL R LS SQ+ S +G KFSYCLVP
Sbjct: 168 NNGTPISFKNIVIGCGHRNQGPLEGYVSGNIGLARGPLSFISQLNSSIGG-KFSYCLVPL 226
Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
+ +++SK++FG+ S VSG G VST + +++ YFV+LE SVG+ +I N
Sbjct: 227 FSKENVSSKLHFGDKSTVSGLGTVSTPI---KEENGYFVSLEAFSVGD-----HIIKLEN 278
Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
S ++GN ID+G T+LPKD Y+RLE V + +KL +DP LCY+T S
Sbjct: 279 SD---NRGNSIIDSGTTMTILPKDVYSRLESVVLDMVKLKRVKDPSQQFNLCYQTTSTTL 335
Query: 302 IAPIL--TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG----DVGIFGNFAQSDL 355
+ +L TAHF G++V L +TF P E V CFA + G + IFGN Q +
Sbjct: 336 LTKVLIITAHFS-GSEVHLNALNTFYPITDE-VICFAF--VSGGNFSSLAIFGNVVQQNF 391
Query: 356 FIGYDFDSQMVSFKPTDCTKQ 376
+G+D + + +SFKPTDCTK
Sbjct: 392 LVGFDLNKKTISFKPTDCTKH 412
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 223 bits (569), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 142/377 (37%), Positives = 210/377 (55%), Gaps = 28/377 (7%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
N + +S + GEY+M+F IG+PP+ + +VDTGS L+W+QC PC C+ Q P++ P
Sbjct: 75 NKLPESLLIPDKGEYLMRFYIGSPPV-ERLAMVDTGSSLIWLQCSPCHNCFPQETPLFEP 133
Query: 70 ASSSSYKELSCQSEQCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
SS+YK +C S+ C LL C C Y Y D S + G+L TE ++FG++
Sbjct: 134 LKSSTYKYATCDSQPCTLLQPSQRDCGKLGQCIYGIMYGDKSFSVGILGTETLSFGSTGG 193
Query: 128 F----FDNVVFGCG-HNNTGVFNENE-MGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
F N +FGCG NN ++ N+ MG+ GLG LSL SQ+ +Q+G +KFSYCL+P+
Sbjct: 194 AQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQIG-HKFSYCLLPY 252
Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYY 240
DS+ TSK+ FG+ + ++ GVVST L+ K TYYF+ LE +++G K++
Sbjct: 253 --DSTSTSKLKFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQ-----KVV--- 302
Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
S + GN+ ID+G P T L FYN ++ + + QD + C+ P+ A
Sbjct: 303 --STGQTDGNIVIDSGTPLTYLENTFYNNFVASLQETLGVKLLQDLPSPLKTCF--PNRA 358
Query: 301 GIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG-DVGIFGNFAQSDLFIG 358
+A P + F GA V L + IP + C A+ P G + +FG+ AQ D +
Sbjct: 359 NLAIPDIAFQFT-GASVALRPKNVLIPLTDSNILCLAVVPSSGIGISLFGSIAQYDFQVE 417
Query: 359 YDFDSQMVSFKPTDCTK 375
YD + + VSF PTDC K
Sbjct: 418 YDLEGKKVSFAPTDCAK 434
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 223 bits (569), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 135/359 (37%), Positives = 198/359 (55%), Gaps = 15/359 (4%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
GEY+++ S+GTPP I + DTGSD++W QC PC CY+Q P+++P+ S++YK ++C
Sbjct: 81 GEYLVEISVGTPPF-SIVAVADTGSDVIWTQCKPCSNCYQQNAPMFDPSKSTTYKNVACS 139
Query: 82 SEQC-HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCG 137
S C + D SCS C Y+ Y D S ++G LA + +T +++ F V GCG
Sbjct: 140 SPVCSYSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQSTSGRPVAFPRTVIGCG 199
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS-ITSKMYFGNG 196
H+N G FN N G+VGLGR SL +Q+ G KFSYCL+P T S+ ++K+ FG+
Sbjct: 200 HDNAGTFNANVSGIVGLGRGPASLVTQLGPATGG-KFSYCLIPIGTGSTNDSTKLNFGSN 258
Query: 197 SEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
+ VSG G VST + S KT+Y + LE +SVG+ +K +S + N+ ID+
Sbjct: 259 ANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGD----TKFNFPEGASKLGGESNIIIDS 314
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
G T LP N + ++ L QDP C+ T + P +T HF+ GA
Sbjct: 315 GTTLTYLPSALLNSFGSAISQSMSLPHAQDPSEFLDYCFATTTDDYEMPPVTMHFE-GAD 373
Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
VPL + F+ + C A D ++ I+GN AQS+ +GYD + VSF+P C
Sbjct: 374 VPLQRENLFVRLS-DDTICLAFGSFPDDNIFIYGNIAQSNFLVGYDIKNLAVSFQPAHC 431
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 141/368 (38%), Positives = 196/368 (53%), Gaps = 24/368 (6%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V+++V +GEY+M SIGTP I+DTGSDL+W QC PC QC+ Q PI+NP S
Sbjct: 84 VETSVYAGDGEYLMNLSIGTPAQ-PFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGS 142
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
SS+ L C S+ C L + +CS+ C YTYGY D S T+G + TE +TFG+ + N+
Sbjct: 143 SSFSTLPCSSQLCQALSSPTCSN-NFCQYTYGYGDGSETQGSMGTETLTFGSVS--IPNI 199
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGCG NN G N GLVG+GR LSL SQL KFSYC+ P SS S +
Sbjct: 200 TFGCGENNQGFGQGNGAGLVGMGRGPLSLP----SQLDVTKFSYCMTPI--GSSTPSNLL 253
Query: 193 FGN-GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK--- 248
G+ + V+ G +T + S + T+Y++TL G+SVG S +P S+ A++
Sbjct: 254 LGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVG-----STRLPIDPSAFALNSNNG 308
Query: 249 -GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG--IAPI 305
G + ID+G T + Y + ++ + I L G LC++TPS P
Sbjct: 309 TGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPT 368
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
HFDGG + L + FI P G+ C AM + IFGN Q ++ + YD + +
Sbjct: 369 FVMHFDGG-DLELPSENYFISPS-NGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSV 426
Query: 366 VSFKPTDC 373
VSF C
Sbjct: 427 VSFASAQC 434
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 144/369 (39%), Positives = 196/369 (53%), Gaps = 27/369 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+++ V +GEY+M +IGTP + I+DTGSDL+W QC PC QC+ Q PI+NP S
Sbjct: 85 IETPVYAGSGEYLMNVAIGTPAS-SLSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDS 143
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
SS+ L C+S+ C L + SC + C YTYGY D S T+G +ATE TF S+ N+
Sbjct: 144 SSFSTLPCESQYCQDLPSESCYND--CQYTYGYGDGSSTQGYMATETFTFETSS--VPNI 199
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGCG +N G N GL+G+G LSL SQLG +FSYC+ + S T +
Sbjct: 200 AFGCGEDNQGFGQGNGAGLIGMGWGPLSLP----SQLGVGQFSYCMTSSGSSSPSTLAL- 254
Query: 193 FGNGSEVSG--GGVVSTSLV-SKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAIS 247
GS SG G ST+L+ S + TYY++TL+GI+VG NL S +
Sbjct: 255 ---GSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDD----G 307
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI--API 305
G M ID+G T LP+D YN + + + I L+P + G C++ PS P
Sbjct: 308 TGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLSPVDESSSGLSTCFQLPSDGSTVQVPE 367
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
++ FDGG V + + P EGV C AM + IFGN Q + + YD +
Sbjct: 368 ISMQFDGG--VLNLGEENVLISPAEGVICLAMGSSSQQGISIFGNIQQQETQVLYDLQNL 425
Query: 365 MVSFKPTDC 373
VSF PT C
Sbjct: 426 AVSFVPTQC 434
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 140/367 (38%), Positives = 195/367 (53%), Gaps = 22/367 (5%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+++ V +GEY+M +IGTP I+DTGSDL+W QC PC QC+ Q PI+NP S
Sbjct: 85 IETPVYAGDGEYLMNVAIGTPDS-SFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDS 143
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
SS+ L C+S+ C L + +C++ + C YTYGY D S T+G +ATE TF S+ N+
Sbjct: 144 SSFSTLPCESQYCQDLPSETCNNNE-CQYTYGYGDGSTTQGYMATETFTFETSS--VPNI 200
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGCG +N G N GL+G+G LSL SQLG +FSYC+ + SS S +
Sbjct: 201 AFGCGEDNQGFGQGNGAGLIGMGWGPLSLP----SQLGVGQFSYCMTSY--GSSSPSTLA 254
Query: 193 FGNGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAISKG 249
G+ + G ST+L+ S + TYY++TL+GI+VG NL S + G
Sbjct: 255 LGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDD----GTG 310
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI--APILT 307
M ID+G T LP+D YN + + + I L + G C++ PS P ++
Sbjct: 311 GMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEIS 370
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG-DVGIFGNFAQSDLFIGYDFDSQMV 366
FDGG V + + P EGV C AM + IFGN Q + + YD + V
Sbjct: 371 MQFDGG--VLNLGEQNILISPAEGVICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAV 428
Query: 367 SFKPTDC 373
SF PT C
Sbjct: 429 SFVPTQC 435
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 143/383 (37%), Positives = 206/383 (53%), Gaps = 32/383 (8%)
Query: 9 PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
P + V+ + GEY+M +IGTPPL +VDTGSDL+W QC PCV C Q P +
Sbjct: 77 PITAARILVAASQGEYLMDLAIGTPPL-RYTAMVDTGSDLIWTQCAPCVLCADQPTPYFR 135
Query: 69 PASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN- 127
PA S++Y+ + C+S C L +C + +C Y Y Y D + T GVLA+E TFG +N+
Sbjct: 136 PARSATYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSS 195
Query: 128 --FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
+V FGCG+ N+G N G+VGLGR LSL +SQLG ++FSYCL F +
Sbjct: 196 KVMVSDVAFGCGNINSGQL-ANSSGMVGLGRGPLSL----VSQLGPSRFSYCLTSFLSPE 250
Query: 186 SITSKMYFG-----NGSEVSGGG--VVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLI 237
S++ FG NG+ S G V ST LV + YF++L+GIS+G K +
Sbjct: 251 P--SRLNFGVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQ-----KRL 303
Query: 238 PYYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK-LTPYQDPRLGSQLC 293
P AI+ G +FID+G T L +D Y+ + ++ + ++ L P D +G + C
Sbjct: 304 PIDPLVFAINDDGTGGVFIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETC 363
Query: 294 Y---KTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNF 350
+ PS+A P + HFDGGA + + + + G C AM GD I GN+
Sbjct: 364 FPWPPPPSVAVTVPDMELHFDGGANMTVPPENYMLIDGATGFLCLAMI-RSGDATIIGNY 422
Query: 351 AQSDLFIGYDFDSQMVSFKPTDC 373
Q ++ I YD + ++SF P C
Sbjct: 423 QQQNMHILYDIANSLLSFVPAPC 445
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 143/383 (37%), Positives = 206/383 (53%), Gaps = 32/383 (8%)
Query: 9 PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
P + V+ + GEY+M +IGTPPL +VDTGSDL+W QC PCV C Q P +
Sbjct: 77 PITAARILVAASQGEYLMDLAIGTPPL-RYTAMVDTGSDLIWTQCAPCVLCADQPTPYFR 135
Query: 69 PASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN- 127
PA S++Y+ + C+S C L +C + +C Y Y Y D + T GVLA+E TFG +N+
Sbjct: 136 PARSATYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSS 195
Query: 128 --FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
+V FGCG+ N+G N G+VGLGR LSL +SQLG ++FSYCL F +
Sbjct: 196 KVMVSDVAFGCGNINSGQL-ANSSGMVGLGRGPLSL----VSQLGPSRFSYCLTSFLSPE 250
Query: 186 SITSKMYFG-----NGSEVSGGG--VVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLI 237
S++ FG NG+ S G V ST LV + YF++L+GIS+G K +
Sbjct: 251 P--SRLNFGVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQ-----KRL 303
Query: 238 PYYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK-LTPYQDPRLGSQLC 293
P AI+ G +FID+G T L +D Y+ + ++ + ++ L P D +G + C
Sbjct: 304 PIDPLVFAINDDGTGGVFIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETC 363
Query: 294 Y---KTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNF 350
+ PS+A P + HFDGGA + + + + G C AM GD I GN+
Sbjct: 364 FPWPPPPSVAVTVPDMELHFDGGANMTVPPENYMLIDGATGFLCLAMI-RSGDATIIGNY 422
Query: 351 AQSDLFIGYDFDSQMVSFKPTDC 373
Q ++ I YD + ++SF P C
Sbjct: 423 QQQNMHILYDIANSLLSFVPAPC 445
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 218 bits (554), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 149/382 (39%), Positives = 204/382 (53%), Gaps = 33/382 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+QS + + GEY M SIGTPP I DTGSDL WVQC PC QCYKQ P+++ S
Sbjct: 74 LQSGLISNGGEYFMSISIGTPPS-KFLAIADTGSDLTWVQCKPCQQCYKQNTPLFDKKKS 132
Query: 73 SSYKELSCQSEQCHLLD--TVSC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-- 127
S+YK SC S C+ L C S+ C Y Y Y D S TKG +ATE I+ +S+
Sbjct: 133 STYKTESCDSITCNALSEHEEGCDESRNACKYRYSYGDESFTKGEVATETISIDSSSGSP 192
Query: 128 -FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
F FGCG+NN G F E G++GLG LSL SQ+ S +G KFSYCL ++
Sbjct: 193 VSFPGTAFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIG-KKFSYCLSHTSATTN 251
Query: 187 ITSKMYFGNGSEVS----GGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
TS + G S S +++T L+ K+ +TYYF+TLE I+VG +PY
Sbjct: 252 GTSVINLGTNSMTSKPSKDSAILTTPLIQKDPETYYFLTLEAITVGKTK-----LPYTGG 306
Query: 243 SG------AISKGNMFIDTGAPPTLLPKDFYNR----LEEQVRNAIKLTPYQDPRLGSQL 292
G + GN+ ID+G TLL FY+ +EE V A +++ DP+
Sbjct: 307 GGYSLNRKSKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTGAKRVS---DPQGILTH 363
Query: 293 CYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQ 352
C+K+ P +T HF GA V L ++F+ E + C +M P +V I+GN Q
Sbjct: 364 CFKSGDKEIGLPTITMHFT-GADVKLSPINSFVKLS-EDIVCLSMIPTT-EVAIYGNMVQ 420
Query: 353 SDLFIGYDFDSQMVSFKPTDCT 374
D +GYD +++ VSF+ DC+
Sbjct: 421 MDFLVGYDLETKTVSFQRMDCS 442
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 217 bits (552), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 136/366 (37%), Positives = 193/366 (52%), Gaps = 20/366 (5%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V++ V +GEY+M SIGTP I+DTGSDL+W QC PC QC+ Q PI+NP S
Sbjct: 84 VETPVYAGDGEYLMNLSIGTPAQ-PFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGS 142
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
SS+ L C S+ C L + +CS+ C YTYGY D S T+G + TE +TFG+ + N+
Sbjct: 143 SSFSTLPCSSQLCQALQSPTCSNNS-CQYTYGYGDGSETQGSMGTETLTFGSVS--IPNI 199
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGCG NN G N GLVG+GR LSL SQL KFSYC+ P SS +S +
Sbjct: 200 TFGCGENNQGFGQGNGAGLVGMGRGPLSLP----SQLDVTKFSYCMTPI--GSSTSSTLL 253
Query: 193 FGN-GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKG 249
G+ + V+ G +T + S + T+Y++TL G+SVG+ L + +++G G
Sbjct: 254 LGSLANSVTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGT---G 310
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG--IAPILT 307
+ ID+G T + Y + + + + L+ G LC++ PS P
Sbjct: 311 GIIIDSGTTLTYFADNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFV 370
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
HFDGG V + + + P G+ C AM + IFGN Q +L + YD + +VS
Sbjct: 371 MHFDGGDLV--LPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVS 428
Query: 368 FKPTDC 373
F C
Sbjct: 429 FLFAQC 434
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 136/366 (37%), Positives = 193/366 (52%), Gaps = 20/366 (5%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V++ V +GEY+M SIGTP I+DTGSDL+W QC PC QC+ Q PI+NP S
Sbjct: 84 VETPVYAGDGEYLMNLSIGTPAQ-PFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGS 142
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
SS+ L C S+ C L + +CS+ C YTYGY D S T+G + TE +TFG+ + N+
Sbjct: 143 SSFSTLPCSSQLCQALQSPTCSNNS-CQYTYGYGDGSETQGSMGTETLTFGSVS--IPNI 199
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGCG NN G N GLVG+GR LSL SQL KFSYC+ P SS +S +
Sbjct: 200 TFGCGENNQGFGQGNGAGLVGMGRGPLSLP----SQLDVTKFSYCMTPI--GSSNSSTLL 253
Query: 193 FGN-GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKG 249
G+ + V+ G +T + S + T+Y++TL G+SVG+ L + +++G G
Sbjct: 254 LGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGT---G 310
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG--IAPILT 307
+ ID+G T + Y + + + + L+ G LC++ PS P
Sbjct: 311 GIIIDSGTTLTYFVDNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFV 370
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
HFDGG V + + + P G+ C AM + IFGN Q +L + YD + +VS
Sbjct: 371 MHFDGGDLV--LPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVS 428
Query: 368 FKPTDC 373
F C
Sbjct: 429 FLSAQC 434
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 153/394 (38%), Positives = 211/394 (53%), Gaps = 33/394 (8%)
Query: 1 MSPATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCY 60
+S + F +QS + + GEY M SIGTPP ++ I DTGSDL WVQC PC QCY
Sbjct: 62 ISRSRRFTTKTDLQSGLISNGGEYFMSISIGTPPS-KVFAIADTGSDLTWVQCKPCQQCY 120
Query: 61 KQVKPIYNPASSSSYKELSCQSEQCHLLDTVS--C-SSQQLCNYTYGYADSSLTKGVLAT 117
KQ P+++ SS+YK SC S+ C L C S+ +C Y Y Y D+S TKG +AT
Sbjct: 121 KQNSPLFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVAT 180
Query: 118 ERI---TFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKF 174
E I + S+ F VFGCG+NN G F E G++GLG LSL SQ+ S +G KF
Sbjct: 181 ETISIDSSSGSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIG-KKF 239
Query: 175 SYCLVPFHTDSSITSKMYFGNGSEVSG----GGVVSTSLVSKEDKTYYFVTLEGISVGNL 230
SYCL ++ TS + G S S ++T L+ K+ +TYYF+TLE ++VG
Sbjct: 240 SYCLSHTAATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKT 299
Query: 231 SNSSKLIPY----YNSSGAISK--GNMFIDTGAPPTLLPKDFYN----RLEEQVRNAIKL 280
+PY Y +G SK GN+ ID+G TLL FY+ +EE V A ++
Sbjct: 300 K-----LPYTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRV 354
Query: 281 TPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI 340
+ DP+ C+K+ P +T HF A V L + F+ E C +M P
Sbjct: 355 S---DPQGLLTHCFKSGDKEIGLPAITMHFT-NADVKLSPINAFVKLN-EDTVCLSMIPT 409
Query: 341 DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+V I+GN Q D +GYD +++ VSF+ DC+
Sbjct: 410 T-EVAIYGNMVQMDFLVGYDLETKTVSFQRMDCS 442
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 140/377 (37%), Positives = 194/377 (51%), Gaps = 30/377 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+Q V NGE++M SIGTP L IVDTGSDL+W QC PCV+C+ Q P+++P+SS
Sbjct: 107 LQVPVHAGNGEFLMDMSIGTPALA-YAAIVDTGSDLVWTQCKPCVECFNQSTPVFDPSSS 165
Query: 73 SSYKELSCQSEQCHLLDTVSC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
S+Y L C S C L T +C S+ + C YTY Y D+S T+GVLA E T +
Sbjct: 166 STYSTLPCSSSLCSDLPTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTLAKTK--LPG 223
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
V FGCG N G GLVGLGR LSL +SQLG KFSYCL D + S +
Sbjct: 224 VAFGCGDTNEGDGFTQGAGLVGLGRGPLSL----VSQLGLGKFSYCLTSL--DDTSKSPL 277
Query: 192 YFGNGSEV-----SGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
G+ + + S + +T L+ + ++Y+VTL+ ++VG S IP S+ A
Sbjct: 278 LLGSLAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVG-----STRIPLPGSAFA 332
Query: 246 ISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI 302
+ G + +D+G T L Y L++ +KL +G LC+K P+ +G+
Sbjct: 333 VQDDGTGGVIVDSGTSITYLELQGYRPLKKAFAAQMKLPVADGSAVGLDLCFKAPA-SGV 391
Query: 303 ----APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIG 358
P L HFDGGA + L + + G C + G + I GNF Q ++
Sbjct: 392 DDVEVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMGSRG-LSIIGNFQQQNIQFV 450
Query: 359 YDFDSQMVSFKPTDCTK 375
YD D +SF P C K
Sbjct: 451 YDVDKDTLSFAPVQCAK 467
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 138/370 (37%), Positives = 197/370 (53%), Gaps = 27/370 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIY-GIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
V++ V NGE++MK +IGTP + Y I+DTGSDL+W QC PC C+ Q PI++P
Sbjct: 86 VEAPVHAGNGEFLMKLAIGTPA--ETYSAIMDTGSDLIWTQCKPCKDCFDQPTPIFDPKK 143
Query: 72 SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
SSS+ +L C S+ C L SCS C Y Y Y D S T+GVLATE FG+++
Sbjct: 144 SSSFSKLPCSSDLCAALPISSCSDG--CEYLYSYGDYSSTQGVLATETFAFGDAS--VSK 199
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
+ FGCG +N G GLVGLGR LSL +SQLG KFSYCL I+S +
Sbjct: 200 IGFGCGEDNDGSGFSQGAGLVGLGRGPLSL----ISQLGEPKFSYCLTSMDDSKGISSLL 255
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS--- 247
GSE + ++T L+ + ++Y+++LEGISVG+ L+P S+ +I
Sbjct: 256 V---GSEATMKNAITTPLIQNPSQPSFYYLSLEGISVGD-----TLLPIEKSTFSIQNDG 307
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI--API 305
G + ID+G T L + L+++ + +KL + G LC+ P A P
Sbjct: 308 SGGLIIDSGTTITYLEDSAFAALKKEFISQLKLDVDESGSTGLDLCFTLPPDASTVDVPQ 367
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
L HF+ GA + L + I GV C M G + IFGNF Q ++ + +D + +
Sbjct: 368 LVFHFE-GADLKLPAENYIIADSGLGVICLTMGSSSG-MSIFGNFQQQNIVVLHDLEKET 425
Query: 366 VSFKPTDCTK 375
+SF P C +
Sbjct: 426 ISFAPAQCNQ 435
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 141/363 (38%), Positives = 193/363 (53%), Gaps = 24/363 (6%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY+M+F IGTPP+ + + I DTGSDL+WVQC PC +C Q P+++P SS++K + C S
Sbjct: 91 EYLMRFYIGTPPV-ERFAIADTGSDLIWVQCAPCEKCVPQNAPLFDPRKSSTFKTVPCDS 149
Query: 83 EQCHLL--DTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNF--FDNVVFGCG 137
+ C LL +C + C Y Y Y D +L G+L E I FG+ NN F + FGC
Sbjct: 150 QPCTLLPPSQRACVGKSGQCYYQYIYGDHTLVSGILGFESINFGSKNNAIKFPKLTFGCT 209
Query: 138 HNNTGVFNENE--MGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
+N +E++ MGLVGLG LSL SQ+ Q+G KFSYC P ++S TSKM FGN
Sbjct: 210 FSNNDTVDESKRNMGLVGLGVGPLSLISQLGYQIG-RKFSYCFPPLSSNS--TSKMRFGN 266
Query: 196 GSEVSG-GGVVSTSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
+ V GVVST L+ K +YY++ LEG+S+GN +S + + GN+ I
Sbjct: 267 DAIVKQIKGVVSTPLIIKSIGPSYYYLNLEGVSIGNKK--------VKTSESQTDGNILI 318
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGG 313
D+G T+L + FYN+ V+ + + P L C++ P + F G
Sbjct: 319 DSGTSFTILKQSFYNKFVALVKEVYGVEAVKIPPLVYNFCFENKGKRKRFPDVVFLFT-G 377
Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
AKV + S + C P D D IFGN AQ + YD MVSF P D
Sbjct: 378 AKV-RVDASNLFEAEDNNLLCMVALPTSDEDDSIFGNHAQIGYQVEYDLQGGMVSFAPAD 436
Query: 373 CTK 375
C K
Sbjct: 437 CAK 439
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 143/378 (37%), Positives = 209/378 (55%), Gaps = 29/378 (7%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
N + QS + NGEY+M+F IGTPP+ + DTGSDL+WVQC PC C+ Q P++ P
Sbjct: 76 NKLPQSVLILHNGEYLMRFYIGTPPV-ERLATADTGSDLIWVQCSPCASCFPQSTPLFQP 134
Query: 70 ASSSSYKELSCQSEQCHLL--DTVSCSSQQLCNYTYGYADS-SLTKGVLATERITF---- 122
SS++ +C+S+ C LL + C C YTY Y D S ++G+L+TE + F
Sbjct: 135 LKSSTFMPTTCRSQPCTLLLPEQKGCGKSGECIYTYKYGDQYSFSEGLLSTETLRFDSQG 194
Query: 123 GNSNNFFDNVVFGCG-HNNTGVFNENEM-GLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
G F N FGCG +NN VF ++ G++GLG LSL SQI Q+G +KFSYCL+P
Sbjct: 195 GVQTVAFPNSFFGCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQIG-HKFSYCLLP 253
Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPY 239
+ S TSK+ FGN S ++G GVVST ++ K TYYF+ LE ++V K +P
Sbjct: 254 LGSTS--TSKLKFGNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQ-----KTVP- 305
Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM 299
+G+ + GN+ ID+G T L + FY ++ ++ + QD C+
Sbjct: 306 ---TGS-TDGNVIIDSGTLLTYLGESFYYNFAASLQESLAVELVQDVLSPLPFCFPYRDN 361
Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP--IDGDVGIFGNFAQSDLFI 357
+ P + F GA+V L + F+ C + P + G + IFG+F+Q D +
Sbjct: 362 F-VFPEIAFQFT-GARVSLKPANLFVMTEDRNTVCLMIAPSSVSG-ISIFGSFSQIDFQV 418
Query: 358 GYDFDSQMVSFKPTDCTK 375
YD + + VSF+PTDC+K
Sbjct: 419 EYDLEGKKVSFQPTDCSK 436
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 139/376 (36%), Positives = 187/376 (49%), Gaps = 26/376 (6%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+Q V NGE++M +IGTP L IVDTGSDL+W QC PCV C+KQ P+++P+SS
Sbjct: 89 LQVPVHAGNGEFLMDVAIGTPAL-SYAAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSS 147
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+Y + C S C L T +C+S C YTY Y D+S T+GVLA+E T G V
Sbjct: 148 STYATVPCSSALCSDLPTSTCTSASKCGYTYTYGDASSTQGVLASETFTLGKEKKKLPGV 207
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGCG N G GLVGLGR LSL +SQLG +KFSYCL D S +
Sbjct: 208 AFGCGDTNEGDGFTQGAGLVGLGRGPLSL----VSQLGLDKFSYCLTSLD-DGDGKSPLL 262
Query: 193 FGNGSEVSGGG-----VVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
G + V +T LV + ++Y+V+L G++VG S I S+ AI
Sbjct: 263 LGGSAAAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVG-----STRITLPASAFAI 317
Query: 247 SK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI- 302
G + +D+G T L Y L++ + L +G LC++ P+ G+
Sbjct: 318 QDDGTGGVIVDSGTSITYLELQGYRALKKAFVAQMALPTVDGSEIGLDLCFQGPA-KGVD 376
Query: 303 ---APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
P L HFDGGA + L + + G C + P G + I GNF Q + Y
Sbjct: 377 EVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVAPSRG-LSIIGNFQQQNFQFVY 435
Query: 360 DFDSQMVSFKPTDCTK 375
D +SF P C K
Sbjct: 436 DVAGDTLSFAPVQCNK 451
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 140/362 (38%), Positives = 194/362 (53%), Gaps = 20/362 (5%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
+GEY+M+F IGTPP+ + I DT SDL+WVQC PC C+ Q P++ P SS++ LSC
Sbjct: 87 HGEYLMRFYIGTPPV-ERLAIADTASDLIWVQCSPCETCFPQDTPLFEPHKSSTFANLSC 145
Query: 81 QSEQCHLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
S+ C + C LC YT Y D S TKGVL TE I FG+ F +FGCG N
Sbjct: 146 DSQPCTSSNIYYCPLVGNLCLYTNTYGDGSSTKGVLCTESIHFGSQTVTFPKTIFGCGSN 205
Query: 140 NTGV--FNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
N + + G+VGLG LSL SQ+ Q+G +KFSYCL+PF + S+I K+ FGN +
Sbjct: 206 NDFMHQISNKVTGIVGLGAGPLSLVSQLGDQIG-HKFSYCLLPFTSTSTI--KLKFGNDT 262
Query: 198 EVSGGGVVSTSL-VSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
++G GVVST L + +YYF+ L GI++G + + N GN+ ID G
Sbjct: 263 TITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQVRTTDHTN-------GNIIIDLG 315
Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAK 315
T L +FY+ +R A+ ++ +D + + P+ A I P + F GAK
Sbjct: 316 TVLTYLEVNFYHNFVTLLREALGISETKD-DIPYPFDFCFPNQANITFPKIVFQFT-GAK 373
Query: 316 VPLIHTSTFIPPPVEGVFCFAMQP--IDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
V L + F + C A+ P +FGN AQ D + YD + VSF P DC
Sbjct: 374 VFLSPKNLFFRFDDLNMICLAVLPDFYAKGFSVFGNLAQVDFQVEYDRKGKKVSFAPADC 433
Query: 374 TK 375
+K
Sbjct: 434 SK 435
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 138/378 (36%), Positives = 203/378 (53%), Gaps = 19/378 (5%)
Query: 4 ATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV 63
A+ + +++ + NGEY+M+ +IGTPP+ ++DTGSDL+W QC PC QCYKQ
Sbjct: 88 ASTLDSEDQLEAPIHAGNGEYLMELAIGTPPV-SYPAVLDTGSDLIWTQCKPCTQCYKQP 146
Query: 64 KPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG 123
PI++P SSS+ ++SC S C + + +CS C Y Y Y D S+T+GVLATE TFG
Sbjct: 147 TPIFDPKKSSSFSKVSCGSSLCSAVPSSTCSDG--CEYVYSYGDYSMTQGVLATETFTFG 204
Query: 124 NSNNFFD--NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
S N N+ FGCG +N G E GLVGLGR LSL +SQL +FSYCL P
Sbjct: 205 KSKNKVSVHNIGFGCGEDNEGDGFEQASGLVGLGRGPLSL----VSQLKEPRFSYCLTPM 260
Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNSSKLIPY 239
D + S + G+ +V V T+ + K ++Y+++LEGISVG+ S + +
Sbjct: 261 --DDTKESILLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTF 318
Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM 299
G G + ID+G T + + + L+++ + KL + G LC+ PS
Sbjct: 319 --EVGDDGNGGVIIDSGTTITYIEQKAFEALKKEFISQTKLPLDKTSSTGLDLCFSLPSG 376
Query: 300 AGIA--PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFI 357
+ P + HF GG + L + I GV C AM G + IFGN Q ++ +
Sbjct: 377 STQVEIPKIVFHFKGG-DLELPAENYMIGDSNLGVACLAMGASSG-MSIFGNVQQQNILV 434
Query: 358 GYDFDSQMVSFKPTDCTK 375
+D + + +SF PT C +
Sbjct: 435 NHDLEKETISFVPTSCDQ 452
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 137/369 (37%), Positives = 199/369 (53%), Gaps = 19/369 (5%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+++ + NGEY+++ +IGTPP+ ++DTGSDL+W QC PC +CYKQ PI++P S
Sbjct: 97 LEAPIHAGNGEYLIELAIGTPPV-SYPAVLDTGSDLIWTQCKPCTRCYKQPTPIFDPKKS 155
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD-- 130
SS+ ++SC S C L + +CS C Y Y Y D S+T+GVLATE TFG S N
Sbjct: 156 SSFSKVSCGSSLCSALPSSTCSDG--CEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVH 213
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
N+ FGCG +N G E GLVGLGR LSL SQ+ Q +FSYCL P D + S
Sbjct: 214 NIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKEQ----RFSYCLTPI--DDTKESV 267
Query: 191 MYFGNGSEVSGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
+ G+ +V V T+ + K ++Y+++LE ISVG+ S + + G
Sbjct: 268 LLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTF--EVGDDGN 325
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA--PIL 306
G + ID+G T + + Y L+++ + KL + G LC+ PS + P L
Sbjct: 326 GGVIIDSGTTITYVQQKAYEALKKEFISQTKLALDKTSSTGLDLCFSLPSGSTQVEIPKL 385
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
HF GG + L + I GV C AM G + IFGN Q ++ + +D + + +
Sbjct: 386 VFHFKGG-DLELPAENYMIGDSNLGVACLAMGASSG-MSIFGNVQQQNILVNHDLEKETI 443
Query: 367 SFKPTDCTK 375
SF PT C +
Sbjct: 444 SFVPTSCDQ 452
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 207 bits (528), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 140/376 (37%), Positives = 191/376 (50%), Gaps = 29/376 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+Q V NGE++M SIGTP L IVDTGSDL+W QC PCV C+KQ P+++P+SS
Sbjct: 84 LQVPVHAGNGEFLMDVSIGTPALA-YSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSS 142
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+Y + C S C L T C+S C YTY Y DSS T+GVLATE T S V
Sbjct: 143 STYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK--LPGV 200
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
VFGCG N G GLVGLGR LSL +SQLG +KFSYCL D + S +
Sbjct: 201 VFGCGDTNEGDGFSQGAGLVGLGRGPLSL----VSQLGLDKFSYCLTSL--DDTNNSPLL 254
Query: 193 FGNGSEV-----SGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
G+ + + + V +T L+ + ++Y+V+L+ I+VG S I +S+ A+
Sbjct: 255 LGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVG-----STRISLPSSAFAV 309
Query: 247 SK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI- 302
G + +D+G T L Y L++ + L +G LC++ P+ G+
Sbjct: 310 QDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPA-KGVD 368
Query: 303 ---APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
P L HFDGGA + L + + G C + G + I GNF Q + Y
Sbjct: 369 QVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRG-LSIIGNFQQQNFQFVY 427
Query: 360 DFDSQMVSFKPTDCTK 375
D +SF P C K
Sbjct: 428 DVGHDTLSFAPVQCNK 443
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 140/376 (37%), Positives = 191/376 (50%), Gaps = 29/376 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+Q V NGE++M SIGTP L IVDTGSDL+W QC PCV C+KQ P+++P+SS
Sbjct: 94 LQVPVHAGNGEFLMDVSIGTPALA-YSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSS 152
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+Y + C S C L T C+S C YTY Y DSS T+GVLATE T S V
Sbjct: 153 STYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK--LPGV 210
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
VFGCG N G GLVGLGR LSL +SQLG +KFSYCL D + S +
Sbjct: 211 VFGCGDTNEGDGFSQGAGLVGLGRGPLSL----VSQLGLDKFSYCLTSL--DDTNNSPLL 264
Query: 193 FGNGSEV-----SGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
G+ + + + V +T L+ + ++Y+V+L+ I+VG S I +S+ A+
Sbjct: 265 LGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVG-----STRISLPSSAFAV 319
Query: 247 SK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI- 302
G + +D+G T L Y L++ + L +G LC++ P+ G+
Sbjct: 320 QDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPA-KGVD 378
Query: 303 ---APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
P L HFDGGA + L + + G C + G + I GNF Q + Y
Sbjct: 379 QVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRG-LSIIGNFQQQNFQFVY 437
Query: 360 DFDSQMVSFKPTDCTK 375
D +SF P C K
Sbjct: 438 DVGHDTLSFAPVQCNK 453
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 139/372 (37%), Positives = 189/372 (50%), Gaps = 29/372 (7%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
V NGE++M SIGTP L IVDTGSDL+W QC PCV C+KQ P+++P+SSS+Y
Sbjct: 67 VHAGNGEFLMDVSIGTPALA-YSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYA 125
Query: 77 ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
+ C S C L T C+S C YTY Y DSS T+GVLATE T S VVFGC
Sbjct: 126 TVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK--LPGVVFGC 183
Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
G N G GLVGLGR LSL +SQLG +KFSYCL D + S + G+
Sbjct: 184 GDTNEGDGFSQGAGLVGLGRGPLSL----VSQLGLDKFSYCLTSL--DDTNNSPLLLGSL 237
Query: 197 SEV-----SGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK-- 248
+ + + V +T L+ + ++Y+V+L+ I+VG S I +S+ A+
Sbjct: 238 AGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVG-----STRISLPSSAFAVQDDG 292
Query: 249 -GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI----A 303
G + +D+G T L Y L++ + L +G LC++ P+ G+
Sbjct: 293 TGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPA-KGVDQVEV 351
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
P L HFDGGA + L + + G C + G + I GNF Q + YD
Sbjct: 352 PRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRG-LSIIGNFQQQNFQFVYDVGH 410
Query: 364 QMVSFKPTDCTK 375
+SF P C K
Sbjct: 411 DTLSFAPVQCNK 422
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 131/359 (36%), Positives = 193/359 (53%), Gaps = 38/359 (10%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY+MK IGTPP +I ++DTGS+ +W QCLPCV CY Q PI++P+ SS++KE+ C +
Sbjct: 64 EYLMKLQIGTPPF-EIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCDT 122
Query: 83 EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGHN 139
C Y Y S TKG L TE +T +++ + GCG N
Sbjct: 123 H------------DHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRN 170
Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
N+G F G+VGL R SL +Q+ + SYC TSK+ FG + V
Sbjct: 171 NSG-FKPGFAGVVGLDRGPKSLITQMGGEY-PGLMSYCFA-----GKGTSKINFGANAIV 223
Query: 200 SGGGVVSTSLVSKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
+G GVVST++ K K +Y++ L+ +SVGN + P++ KGN+ ID+G+
Sbjct: 224 AGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFH-----ALKGNIVIDSGST 278
Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPL 318
T P+ + N + + V + T + PR LCY + ++ I P++T HF GGA + L
Sbjct: 279 LTYFPESYCNLVRKAVEQVV--TAVRFPR-SDILCYYSKTI-DIFPVITMHFSGGADLVL 334
Query: 319 IHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ ++ GVFC A+ PI+ IFGN AQ++ +GYD S +VSFKPT+C+
Sbjct: 335 DKYNMYVASNTGGVFCLAIICNSPIEE--AIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 391
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 133/370 (35%), Positives = 193/370 (52%), Gaps = 27/370 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIY-GIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
V++ V NGE++M +IGTP + Y I+DTGSDL+W QC PC C+ Q PI++P
Sbjct: 86 VEAPVHAGNGEFLMNLAIGTPA--ETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEK 143
Query: 72 SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
SSS+ +L C S+ C L SCS C Y Y Y D S T+GVLATE TFG+++
Sbjct: 144 SSSFSKLPCSSDLCVALPISSCSDG--CEYRYSYGDHSSTQGVLATETFTFGDAS--VSK 199
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
+ FGCG +N G GLVGLGR LSL +SQLG KFSYCL I++ +
Sbjct: 200 IGFGCGEDNRGRAYSQGAGLVGLGRGPLSL----ISQLGVPKFSYCLTSIDDSKGISTLL 255
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS--- 247
GSE + + T L+ + ++Y+++LEGISVG+ L+P S+ +I
Sbjct: 256 V---GSEATVKSAIPTPLIQNPSRPSFYYLSLEGISVGD-----TLLPIEKSTFSIQDDG 307
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG--IAPI 305
G + ID+G T L + + L+++ + +KL +LC+ P P
Sbjct: 308 SGGLIIDSGTTITYLKDNAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVEVPQ 367
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
L HF+ G + L + I V C M G + IFGNF Q ++ + +D + +
Sbjct: 368 LVFHFE-GVDLKLPKENYIIEDSALRVICLTMGSSSG-MSIFGNFQQQNIVVLHDLEKET 425
Query: 366 VSFKPTDCTK 375
+SF P C +
Sbjct: 426 ISFAPAQCNQ 435
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 131/359 (36%), Positives = 193/359 (53%), Gaps = 38/359 (10%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY+MK IGTPP +I ++DTGS+ +W QCLPCV CY Q PI++P+ SS++KE+ C +
Sbjct: 58 EYLMKLQIGTPPF-EIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCDT 116
Query: 83 EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGHN 139
C Y Y S TKG L TE +T +++ + GCG N
Sbjct: 117 H------------DHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRN 164
Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
N+G F G+VGL R SL +Q+ + SYC TSK+ FG + V
Sbjct: 165 NSG-FKPGFAGVVGLDRGPKSLITQMGGEY-PGLMSYCFA-----GKGTSKINFGANAIV 217
Query: 200 SGGGVVSTSLVSKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
+G GVVST++ K K +Y++ L+ +SVGN + P++ KGN+ ID+G+
Sbjct: 218 AGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFH-----ALKGNIVIDSGST 272
Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPL 318
T P+ + N + + V + T + PR LCY + ++ I P++T HF GGA + L
Sbjct: 273 LTYFPESYCNLVRKAVEQVV--TAVRFPR-SDILCYYSKTI-DIFPVITMHFSGGADLVL 328
Query: 319 IHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ ++ GVFC A+ PI+ IFGN AQ++ +GYD S +VSFKPT+C+
Sbjct: 329 DKYNMYVASNTGGVFCLAIICNSPIEE--AIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 385
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 128/361 (35%), Positives = 193/361 (53%), Gaps = 37/361 (10%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
N Y+MK +GTPP +I ++DTGS++ W QCLPCV CYKQ PI++P+ SS++KE
Sbjct: 377 NSVYLMKLQVGTPPF-EIEAVIDTGSEITWTQCLPCVHCYKQNAPIFDPSKSSTFKE--- 432
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCG 137
++CH C Y Y D + TKG LAT+ +T +++ + GCG
Sbjct: 433 --KRCH---------DHSCPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAETIIGCG 481
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
NN+ F + G VGL LSL +Q+ + SYC + TSK+ FG +
Sbjct: 482 RNNSW-FRPSFEGFVGLNWGPLSLITQMGGEY-PGLMSYCFA-----GNGTSKINFGTNA 534
Query: 198 EVSGGGVVSTSL-VSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
V GGGVVST++ V+ +Y++ L+ +SVG+ + P++ +GN+ ID+G
Sbjct: 535 IVGGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFH-----ALEGNIVIDSG 589
Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKV 316
T P+ + N + + V + + P DP LCY + + I P++T HF GGA +
Sbjct: 590 TTLTYFPESYCNLVRQAVEHVVPAVPAADPTGNDLLCYYS-NTTEIFPVITMHFSGGADL 648
Query: 317 PLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
L + F+ G+FC A+ P IFGN AQ++ +GYD S +VSFKPT+C
Sbjct: 649 VLDKYNMFMESYSGGLFCLAIICNNPTQE--AIFGNRAQNNFLVGYDSSSLLVSFKPTNC 706
Query: 374 T 374
+
Sbjct: 707 S 707
Score = 177 bits (448), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 114/346 (32%), Positives = 168/346 (48%), Gaps = 59/346 (17%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY+MK IGTPP ++ ++DTGS+L+W QCLPC+ CY Q PI++P+ SS++KE C +
Sbjct: 64 EYLMKLQIGTPPF-EVEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFKETRCNT 122
Query: 83 EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGHN 139
C Y Y D S T+G LATE +T +++ + GC N
Sbjct: 123 P------------DHSCPYKLVYDDKSYTQGTLATETVTIHSTSGVPFVMPETIIGCSRN 170
Query: 140 NTGV-FNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
N+G F + G+VGL R LSL SQ+ G
Sbjct: 171 NSGSGFRPSSSGIVGLSRGSLSLISQM------------------------------GGA 200
Query: 199 VSGGGVVSTSLVSKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
G GVVST++ +K K Y++ L+ +SVG+ + P++ GN+ ID+G
Sbjct: 201 YPGDGVVSTTMFAKTAKRGQYYLNLDAVSVGDTRIETVGTPFH-----ALNGNIVIDSGT 255
Query: 258 PPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVP 317
P T P + N + + V + DP LCY + ++ I P++T HF GGA +
Sbjct: 256 PLTYFPVSYCNLVRKAVERVVTADRVVDPSRNDMLCYYSNTIE-IFPVITVHFSGGADLV 314
Query: 318 LIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFIGYD 360
L + ++ GVFC A+ P V IFGN AQ++ +GYD
Sbjct: 315 LDKYNMYMELNRGGVFCLAIICNNPT--QVAIFGNRAQNNFLVGYD 358
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 133/370 (35%), Positives = 192/370 (51%), Gaps = 27/370 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIY-GIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
V++ V NGE++M +IGTP + Y I+DTGSDL+W QC PC C+ Q PI++P
Sbjct: 86 VEAPVHAGNGEFLMNLAIGTPA--ETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEK 143
Query: 72 SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
SSS+ +L C S+ C L SCS C Y Y Y D S T+GVLATE TFG+++
Sbjct: 144 SSSFSKLPCSSDLCVALPISSCSDG--CEYRYSYGDHSSTQGVLATETFTFGDAS--VSK 199
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
+ FGCG +N G GLVGLGR LSL +SQLG KFSYCL I++ +
Sbjct: 200 IGFGCGEDNRGRAYSQGAGLVGLGRGPLSL----ISQLGVPKFSYCLTSIDDSKGISTLL 255
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS--- 247
GSE + + T L+ + ++Y+++LEGISVG+ L+P S+ +I
Sbjct: 256 V---GSEATVKSAIPTPLIQNPSRPSFYYLSLEGISVGD-----TLLPIEKSTFSIQDDG 307
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG--IAPI 305
G + ID+G T L + L+++ + +KL +LC+ P P
Sbjct: 308 SGGLIIDSGTTITYLKDSAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVDVPQ 367
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
L HF+ G + L + I V C M G + IFGNF Q ++ + +D + +
Sbjct: 368 LVFHFE-GVDLKLPKENYIIEDSALRVICLTMGSSSG-MSIFGNFQQQNIVVLHDLEKET 425
Query: 366 VSFKPTDCTK 375
+SF P C +
Sbjct: 426 ISFAPAQCNQ 435
>gi|296085499|emb|CBI29231.3| unnamed protein product [Vitis vinifera]
Length = 308
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 139/369 (37%), Positives = 186/369 (50%), Gaps = 79/369 (21%)
Query: 11 NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
N +QSNV + G Y+M S+GTPP+ + GI DTGSDL+W QCLPC CYKQV+P+++P
Sbjct: 16 NDIQSNVISGGGSYLMNISLGTPPV-SMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPK 74
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN--- 127
S +YK L G L++E T G++
Sbjct: 75 KSKTYKTL----------------------------------GYLSSETFTIGSTEGDPA 100
Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
F + FGCGH+N G FNE + GL+GLG LSL Q+ S++G +FSYCLVP +DS+
Sbjct: 101 SFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGG-QFSYCLVPLSSDSTA 159
Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
+SK+ FG + VSG G +S A
Sbjct: 160 SSKINFGKSAVVSGSGT------------------------------------SSPAAAE 183
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILT 307
+ N+ ID+G TLLP+DFY +E + I DPR LCY I P +T
Sbjct: 184 ESNIIIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCYSGVKKLEI-PTIT 242
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
AHF GA V L +TF+ E + CF+M P ++ IFGN +Q + +GYD + VS
Sbjct: 243 AHFI-GADVQLPPLNTFVQAQ-EDLVCFSMIP-SSNLAIFGNLSQMNFLVGYDLKNNKVS 299
Query: 368 FKPTDCTKQ 376
FKPTDCTKQ
Sbjct: 300 FKPTDCTKQ 308
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 134/381 (35%), Positives = 200/381 (52%), Gaps = 30/381 (7%)
Query: 11 NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
N +++ +GE++M+ SIG P + IVDTGSDL+W QC PC +C+ Q PI++P
Sbjct: 95 NNIKAPTHGGSGEFLMELSIGNPAV-KYAAIVDTGSDLIWTQCKPCTECFDQPTPIFDPE 153
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
SSSY ++ C S C+ L +C+ + C Y Y Y D S T+G+LATE TF + N+
Sbjct: 154 KSSSYSKVGCSSGLCNALPRSNCNEDKDSCEYLYTYGDYSSTRGLLATETFTFEDENS-I 212
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
+ FGCG N G GLVGLGR LSL +SQL KFSYCL DS +S
Sbjct: 213 SGIGFGCGVENEGDGFSQGSGLVGLGRGPLSL----ISQLKETKFSYCLTSIE-DSEASS 267
Query: 190 KMYFGN---------GSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPY 239
++ G+ G+ + G + SL+ D+ ++Y++ L+GI+VG +K +
Sbjct: 268 SLFIGSLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVG-----AKRLSV 322
Query: 240 YNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT 296
S+ +S+ G M ID+G T L + + L+E+ + + L G LC+K
Sbjct: 323 EKSTFELSEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKL 382
Query: 297 PSMAG--IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
P+ A P L HF GA + L + + GV C AM +G + IFGN Q +
Sbjct: 383 PNAAKNIAVPKLIFHFK-GADLELPGENYMVADSSTGVLCLAMGSSNG-MSIFGNVQQQN 440
Query: 355 LFIGYDFDSQMVSFKPTDCTK 375
+ +D + + V+F PT+C K
Sbjct: 441 FNVLHDLEKETVTFVPTECGK 461
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 133/381 (34%), Positives = 199/381 (52%), Gaps = 30/381 (7%)
Query: 11 NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
N +++ +GE++M+ SIG P + IVDTGSDL+W QC PC +C+ Q PI++P
Sbjct: 94 NNIKAPTHGGSGEFLMELSIGNPAV-KYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPE 152
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
SSSY ++ C S C+ L +C+ + C Y Y Y D S T+G+LATE TF + N+
Sbjct: 153 KSSSYSKVGCSSGLCNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENS-I 211
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
+ FGCG N G GLVGLGR LSL +SQL KFSYCL DS +S
Sbjct: 212 SGIGFGCGVENEGDGFSQGSGLVGLGRGPLSL----ISQLKETKFSYCLTSIE-DSEASS 266
Query: 190 KMYFGN---------GSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPY 239
++ G+ G+ + G + SL+ D+ ++Y++ L+GI+VG +K +
Sbjct: 267 SLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVG-----AKRLSV 321
Query: 240 YNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT 296
S+ +++ G M ID+G T L + + L+E+ + + L G LC+K
Sbjct: 322 EKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKL 381
Query: 297 PSMAG--IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
P A P + HF GA + L + + GV C AM +G + IFGN Q +
Sbjct: 382 PDAAKNIAVPKMIFHFK-GADLELPGENYMVADSSTGVLCLAMGSSNG-MSIFGNVQQQN 439
Query: 355 LFIGYDFDSQMVSFKPTDCTK 375
+ +D + + VSF PT+C K
Sbjct: 440 FNVLHDLEKETVSFVPTECGK 460
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 133/362 (36%), Positives = 197/362 (54%), Gaps = 17/362 (4%)
Query: 18 STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKE 77
S GE+++ +GTPP + I+DTGSDL W+Q PC C++Q PI++P+ SS+Y +
Sbjct: 19 SAGYGEFLVPIYLGTPPQKAVV-IIDTGSDLTWIQSEPCRACFEQADPIFDPSKSSTYNK 77
Query: 78 LSCQSEQC-HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
++C S C LL T +CS+ C Y YGY D S+T+G + E IT ++ + V FG
Sbjct: 78 IACSSSACADLLGTQTCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAG--EEVKFGA 135
Query: 137 GHNNTGVFNEN-EMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
NTG F + G++GLG+ +S+ SQ+ S LG NKFSYCLV + + S TS MYFG+
Sbjct: 136 SVYNTGTFGDTGGEGILGLGQGPVSMPSQLGSVLG-NKFSYCLVDWLSAGSETSTMYFGD 194
Query: 196 GSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGNMFI 253
+ V G V T +V D TYY++ ++GISV G+L + + + +S G+ G I
Sbjct: 195 AA-VPSGEVQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGGS---GGTII 250
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG-IAPILTAHFDG 312
D+G T L ++ +N L + ++ P G LC+ T + P +T H D
Sbjct: 251 DSGTTITYLQQEVFNALVAAYTSQVRY-PTTTSATGLDLCFNTRGTGSPVFPAMTIHLD- 308
Query: 313 GAKVPLIHTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
G + L +TFI + C A +D + IFGN Q + I YD D+ + F P
Sbjct: 309 GVHLELPTANTFISLETN-IICLAFASALDFPIAIFGNIQQQNFDIVYDLDNMRIGFAPA 367
Query: 372 DC 373
DC
Sbjct: 368 DC 369
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 201 bits (512), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 138/387 (35%), Positives = 194/387 (50%), Gaps = 34/387 (8%)
Query: 8 YPNNV---VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK 64
YP +V +S V++ G+YV S+GTP + I DTGSDL+W+QC PC C+ Q
Sbjct: 21 YPPSVSTDYESPVASGGGDYVTTISLGTPAKV-FSVIADTGSDLIWIQCKPCQACFNQKD 79
Query: 65 PIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGN 124
PI++P SSSY +SC C L SCS C+Y+YGY D S T+G L++E +T +
Sbjct: 80 PIFDPEGSSSYTTMSCGDTLCDSLPRKSCSPD--CDYSYGYGDGSGTRGTLSSETVTLTS 137
Query: 125 SNN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
+ N+ FGCGH N G FN+ GLVGLGR LS SQ L L +KFSYCLVP+
Sbjct: 138 TQGEKLAAKNIAFGCGHLNRGSFNDAS-GLVGLGRGNLSFVSQ-LGDLFGHKFSYCLVPW 195
Query: 182 HTDSSITSKMYFGNGSEVSGGG-----VVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKL 236
S TS M+FG+ S G + + + +++Y+V L+ IS+ + +
Sbjct: 196 RDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISI---AGRALR 252
Query: 237 IPYYNSSGAIS-----KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ 291
IP +G+ G M D+G TLLP Y + +R+ I G
Sbjct: 253 IP----AGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGSSAGLD 308
Query: 292 LCYKT----PSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG-VFCFAMQPIDGDVGI 346
LCY S P + HF+ GA L + FI G + C AM + D+GI
Sbjct: 309 LCYDVSGSKASYKMKIPAMVFHFE-GADYQLPVENYFIAANDAGTIVCLAMVSSNMDIGI 367
Query: 347 FGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+GN Q + + YD S + + P+ C
Sbjct: 368 YGNMMQQNFRVMYDIGSSKIGWAPSQC 394
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 131/366 (35%), Positives = 188/366 (51%), Gaps = 16/366 (4%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S ++ +GEY ++ IG+P L Y ++DTGSD+ W+QC PC CYKQ +++P +S
Sbjct: 3 VTSGLAFGSGEYFVRVGIGSPTKLQ-YLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRAS 61
Query: 73 SSYKELSCQSEQCHLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
SS++ LSC + QC LLD +C+S C Y Y D S T G LA++ +F S
Sbjct: 62 SSFRRLSCSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASD--SFSVSRGRTSP 119
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
VVFGCGH+N G+F L+GLG +LS SQL + KFSYCLV +S +
Sbjct: 120 VVFGCGHDNEGLFVGAAG-LLGLGAGKLSFP----SQLSSRKFSYCLVSRDNGVRASSAL 174
Query: 192 YFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISK 248
FG+ + + T L+ T+Y+ L GIS+G LS S +S+G +
Sbjct: 175 LFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTG---R 231
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILT 307
G + ID+G T LP Y + + R+A + P CY ++ + P ++
Sbjct: 232 GGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVS 291
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
HF+GGA V L ++ +P G FCFA D+ I GN Q + + D DS V
Sbjct: 292 FHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVG 351
Query: 368 FKPTDC 373
F P C
Sbjct: 352 FAPRQC 357
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 201 bits (510), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 137/387 (35%), Positives = 194/387 (50%), Gaps = 34/387 (8%)
Query: 8 YPNNV---VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK 64
YP +V +S V++ G+YV S+GTP + I DTGSDL+W+QC PC C+ Q
Sbjct: 21 YPPSVSTDYESPVASGGGDYVTTISLGTPAKV-FSVIADTGSDLIWIQCKPCQACFNQKD 79
Query: 65 PIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGN 124
PI++P SSSY +SC C L SCS C+Y+YGY D S T+G L++E +T +
Sbjct: 80 PIFDPEGSSSYTTMSCGDTLCDSLPRKSCSPN--CDYSYGYGDGSGTRGTLSSETVTLTS 137
Query: 125 SNN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
+ N+ FGCGH N G FN+ GLVGLGR LS SQ L L +KFSYCLVP+
Sbjct: 138 TQGEKLAAKNIAFGCGHLNRGSFNDAS-GLVGLGRGNLSFVSQ-LGDLFGHKFSYCLVPW 195
Query: 182 HTDSSITSKMYFGNGSEVSGGG-----VVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKL 236
S TS M+FG+ S G + + + +++Y+V L+ IS+ + +
Sbjct: 196 RDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISI---AGRALR 252
Query: 237 IPYYNSSGAIS-----KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ 291
IP +G+ G M D+G TLLP Y + +R+ + G
Sbjct: 253 IP----AGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPEIDGSSAGLD 308
Query: 292 LCYKT----PSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG-VFCFAMQPIDGDVGI 346
LCY S P + HF+ GA L + FI G + C AM + D+GI
Sbjct: 309 LCYDVSGSKASYKKKIPAMVFHFE-GADHQLPVENYFIAANDAGTIVCLAMVSSNMDIGI 367
Query: 347 FGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+GN Q + + YD S + + P+ C
Sbjct: 368 YGNMMQQNFRVMYDIGSSKIGWAPSQC 394
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 201 bits (510), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 141/376 (37%), Positives = 205/376 (54%), Gaps = 27/376 (7%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
NN+ +S + NGEY+M IGTPP+ + I DTGSDL+WVQC PC C+ Q P++ P
Sbjct: 78 NNLPESLLIPENGEYLMTLYIGTPPV-ERLAIADTGSDLIWVQCSPCQNCFPQDTPLFEP 136
Query: 70 ASSSSYKELSCQSEQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
SS++K +C S+ C + C C Y+Y Y D S T GV+ TE ++FG++ +
Sbjct: 137 LKSSTFKAATCDSQPCTSVPPSQRQCGKVGQCIYSYSYGDKSFTVGVVGTETLSFGSTGD 196
Query: 128 F----FDNVVFGCGHNNTGVFNENE--MGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
F + +FGCG N F+ ++ GLVGLG LSL SQ+ Q+G KFSYCL+PF
Sbjct: 197 AQTVSFPSSIFGCGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQIGY-KFSYCLLPF 255
Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYY 240
++S TSK+ FG+ + V+ GVVST L+ K ++YF+ LE +++G K++P
Sbjct: 256 SSNS--TSKLKFGSEAIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQ-----KVVP-- 306
Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
+ GN+ ID+G T L + FYN ++ + + QD + C+ M
Sbjct: 307 ---TGRTDGNIIIDSGTVLTYLEQTFYNNFVASLQEVLSVESAQDLPFPFKFCFPYRDMT 363
Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID-GDVGIFGNFAQSDLFIGY 359
P++ F GA V L + I + C A+ P + IFGN AQ D + Y
Sbjct: 364 --IPVIAFQFT-GASVALQPKNLLIKLQDRNMLCLAVVPSSLSGISIFGNVAQFDFQVVY 420
Query: 360 DFDSQMVSFKPTDCTK 375
D + + VSF PTDCTK
Sbjct: 421 DLEGKKVSFAPTDCTK 436
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 132/372 (35%), Positives = 185/372 (49%), Gaps = 33/372 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S S +GEY + +GTP D+Y ++DTGSD+ W+QC PC CY+Q P++NP SS
Sbjct: 151 VVSGASQGSGEYFSRIGVGTPAK-DMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSS 209
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+YK L+C + QC LL+T +C S + C Y Y D S T G LAT+ +TFGNS +NV
Sbjct: 210 STYKSLTCSAPQCSLLETSACRSNK-CLYQVSYGDGSFTVGELATDTVTFGNSGK-INNV 267
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH+N G+F I +Q+ A FSYCLV DS +S +
Sbjct: 268 ALGCGHDNEGLFTGAAG-----LLGLGGGVLSITNQMKATSFSYCLV--DRDSGKSSSLD 320
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP----YYNSSGAISK 248
F N ++ GG + L +K+ T+Y+V L G SVG ++P ++SG+
Sbjct: 321 F-NSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVG---GEKVVLPDAIFDVDASGS--- 373
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS------QLCYKTPSMAGI 302
G + +D G T L YN L + +KLT + + GS CY S++ +
Sbjct: 374 GGVILDCGTAVTRLQTQAYNSLRDAF---LKLT--VNLKKGSSSISLFDTCYDFSSLSTV 428
Query: 303 -APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
P + HF GG + L + IP G FCFA P + I GN Q I YD
Sbjct: 429 KVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDL 488
Query: 362 DSQMVSFKPTDC 373
++ C
Sbjct: 489 SKNVIGLSGNKC 500
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 131/366 (35%), Positives = 188/366 (51%), Gaps = 16/366 (4%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S ++ +GEY ++ IG+P L Y ++DTGSD+ W+QC PC CYKQ +++P +S
Sbjct: 3 VTSGLAFGSGEYFVRVGIGSPTKLQ-YLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRAS 61
Query: 73 SSYKELSCQSEQCHLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
SS++ LSC + QC LLD +C+S C Y Y D S T G LA++ +F S
Sbjct: 62 SSFRRLSCSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASD--SFLVSRGRTSP 119
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
VVFGCGH+N G+F L+GLG +LS SQL + KFSYCLV +S +
Sbjct: 120 VVFGCGHDNEGLFVGAAG-LLGLGAGKLSFP----SQLSSRKFSYCLVSRDNGVRASSAL 174
Query: 192 YFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISK 248
FG+ + + T L+ T+Y+ L GIS+G LS S +S+G +
Sbjct: 175 LFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTG---R 231
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILT 307
G + ID+G T LP Y + + R+A + P CY ++ + P ++
Sbjct: 232 GGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVS 291
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
HF+GGA V L ++ +P G FCFA D+ I GN Q + + D DS V
Sbjct: 292 FHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVG 351
Query: 368 FKPTDC 373
F P C
Sbjct: 352 FAPRQC 357
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 139/370 (37%), Positives = 198/370 (53%), Gaps = 21/370 (5%)
Query: 11 NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
V++ V NGE++MK +IGTP L I+DTGSDL W QC PC CY Q PIY+P+
Sbjct: 102 KAVEAPVYAGNGEFLMKMAIGTPSL-SFSAILDTGSDLTWTQCKPCTDCYPQPTPIYDPS 160
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
SS+Y ++ C S C L SCS C Y Y Y D S T+G+L+ E T ++
Sbjct: 161 QSSTYSKVPCSSSMCQALPMYSCSGAN-CEYLYSYGDQSSTQGILSYESFTL--TSQSLP 217
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
++ FGCG N G GLVG GR LSL SQ+ LG NKFSYCLV S TS
Sbjct: 218 HIAFGCGQENEGGGFSQGGGLVGFGRGPLSLISQLGQSLG-NKFSYCLVSITDSPSKTSP 276
Query: 191 MYFGNGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS-- 247
++ G + ++ V ST LV S+ T+Y+++LEGISVG +L+ + + +
Sbjct: 277 LFIGKTASLNAKTVSSTPLVQSRSRPTFYYLSLEGISVGG-----QLLDIADGTFDLQLD 331
Query: 248 -KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA--P 304
G + ID+G T L + Y+ +++ V ++I L +G LC++ S + + P
Sbjct: 332 GTGGVIIDSGTTVTYLEQSGYDVVKKAVISSINLPQVDGSNIGLDLCFEPQSGSSTSHFP 391
Query: 305 ILTAHFDGGA-KVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
+T HF+G +P +I G+ C AM P +G + IFGN Q + I YD +
Sbjct: 392 TITFHFEGADFNLP---KENYIYTDSSGIACLAMLPSNG-MSIFGNIQQQNYQILYDNER 447
Query: 364 QMVSFKPTDC 373
++SF PT C
Sbjct: 448 NVLSFAPTVC 457
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 132/363 (36%), Positives = 182/363 (50%), Gaps = 18/363 (4%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+ S S +GEY + IG P +Y ++DTGSD+ W+QC PC CY Q PI+ PASS
Sbjct: 133 IISGTSQGSGEYFSRVGIGKPSS-PVYMVLDTGSDVNWIQCAPCADCYHQADPIFEPASS 191
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
+SY LSC ++QC LD C + C Y Y D S T G TE IT G+++ DNV
Sbjct: 192 TSYSPLSCDTKQCQSLDVSECRNNT-CLYEVSYGDGSYTVGDFVTETITLGSAS--VDNV 248
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGHNN G+F L+GLG +LS SQI A+ FSYCLV +DS+ T +
Sbjct: 249 AIGCGHNNEGLFIGAAG-LLGLGGGKLSFPSQI----NASSFSYCLVDRDSDSASTLEF- 302
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGNM 251
S + + + L ++E T+Y+V + G+SV G L + + + + SG G +
Sbjct: 303 ---NSALLPHAITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESG---NGGI 356
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHF 310
ID+G T L YN L + K P CY + P +T H
Sbjct: 357 IIDSGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEVALFDTCYDLSRKTSVEVPTVTFHL 416
Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
GG +PL T+ IP +G FCFA P + I GN Q +G+D + +V F+P
Sbjct: 417 AGGKVLPLPATNYLIPVDSDGTFCFAFAPTSSALSIIGNVQQQGTRVGFDLANSLVGFEP 476
Query: 371 TDC 373
C
Sbjct: 477 RQC 479
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 198 bits (503), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 131/372 (35%), Positives = 185/372 (49%), Gaps = 33/372 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S S +GEY + +GTP ++Y ++DTGSD+ W+QC PC CY+Q P++NP SS
Sbjct: 151 VVSGASQGSGEYFSRIGVGTPAK-EMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSS 209
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+YK L+C + QC LL+T +C S + C Y Y D S T G LAT+ +TFGNS +NV
Sbjct: 210 STYKSLTCSAPQCSLLETSACRSNK-CLYQVSYGDGSFTVGELATDTVTFGNSGK-INNV 267
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH+N G+F I +Q+ A FSYCLV DS +S +
Sbjct: 268 ALGCGHDNEGLFTGAAG-----LLGLGGGVLSITNQMKATSFSYCLV--DRDSGKSSSLD 320
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP----YYNSSGAISK 248
F N ++ GG + L +K+ T+Y+V L G SVG ++P ++SG+
Sbjct: 321 F-NSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVG---GEKVVLPDAIFDVDASGS--- 373
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS------QLCYKTPSMAGI 302
G + +D G T L YN L + +KLT + + GS CY S++ +
Sbjct: 374 GGVILDCGTAVTRLQTQAYNSLRDAF---LKLT--VNLKKGSSSISLFDTCYDFSSLSTV 428
Query: 303 -APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
P + HF GG + L + IP G FCFA P + I GN Q I YD
Sbjct: 429 KVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDL 488
Query: 362 DSQMVSFKPTDC 373
++ C
Sbjct: 489 SKNVIGLSGNKC 500
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 125/359 (34%), Positives = 186/359 (51%), Gaps = 39/359 (10%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
Y+MK +GTPP +I +DTGSDL+W QC+PC CY Q PI++P++SS++KE C
Sbjct: 61 YLMKLQVGTPPF-EIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCNGN 119
Query: 84 QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGHNN 140
CH Y YAD++ +KG LATE +T +++ GCGHN+
Sbjct: 120 SCH--------------YKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNS 165
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
+ F G+VGL SL +Q+ + SYC S TSK+ FG + V+
Sbjct: 166 SW-FKPTFSGMVGLSWGPSSLITQMGGEY-PGLMSYCFA-----SQGTSKINFGTNAIVA 218
Query: 201 GGGVVSTSL-VSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPP 259
G GVVST++ ++ Y++ L+ +SVG+ + ++ +GN+ ID+G
Sbjct: 219 GDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFH-----ALEGNIIIDSGTTL 273
Query: 260 TLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLI 319
T P + N + E V + + DP LCY T ++ I P++T HF GGA + L
Sbjct: 274 TYFPVSYCNLVREAVDHYVTAVRTADPTGNDMLCYYTDTI-DIFPVITMHFSGGADLVLD 332
Query: 320 HTSTFIPPPVEGVFCFAM----QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ +I G FC A+ P D IFGN AQ++ +GYD S +VSF PT+C+
Sbjct: 333 KYNMYIETITRGTFCLAIICNNPPQD---AIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 388
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 140/370 (37%), Positives = 202/370 (54%), Gaps = 28/370 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIY-GIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
+++ V NGE++MK +IGTPP + Y I+DTGSDL+W QC PC QC+ Q PI++P
Sbjct: 86 IEAPVLPGNGEFLMKLAIGTPP--ETYSAILDTGSDLIWTQCKPCTQCFHQSTPIFDPKK 143
Query: 72 SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
SSS+ +LSC S+ C L SC++ C Y Y Y D S T+G+LA+E +TFG ++ N
Sbjct: 144 SSSFSKLSCSSQLCEALPQSSCNNG--CEYLYSYGDYSSTQGILASETLTFGKAS--VPN 199
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
V FGCG +N G GLVGLGR LSL +SQL KFSYCL D + TS +
Sbjct: 200 VAFGCGADNEGSGFSQGAGLVGLGRGPLSL----VSQLKEPKFSYCLTT--VDDTKTSTL 253
Query: 192 YFGNGSEV--SGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS- 247
G+ + V S + +T L+ S ++Y+++LEGISVG+ +P S+ ++
Sbjct: 254 LMGSLASVNASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTR-----LPIKKSTFSLQD 308
Query: 248 --KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG--IA 303
G + ID+G T L + +N + ++ I L G +C+ PS +
Sbjct: 309 DGSGGLIIDSGTTITYLEESAFNLVAKEFTAKINLPVDSSGSTGLDVCFTLPSGSTNIEV 368
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
P L HFD GA + L + I GV C AM G + IFGN Q ++ + +D +
Sbjct: 369 PKLVFHFD-GADLELPAENYMIGDSSMGVACLAMGSSSG-MSIFGNVQQQNMLVLHDLEK 426
Query: 364 QMVSFKPTDC 373
+ +SF PT C
Sbjct: 427 ETLSFLPTQC 436
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 141/373 (37%), Positives = 199/373 (53%), Gaps = 28/373 (7%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPPLLDIY-GIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
N + S V + NGE++M +IGTPP + Y I+DTGSDL+W QC PC QC+ Q PI++
Sbjct: 86 NAEINSPVLSGNGEFLMNLAIGTPP--ETYSAIMDTGSDLIWTQCKPCTQCFDQPSPIFD 143
Query: 69 PASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
P SSS+ +LSC S+ C L SCS C Y Y Y D S T+G +ATE TFG +
Sbjct: 144 PKKSSSFSKLSCSSQLCKALPQSSCSDS--CEYLYTYGDYSSTQGTMATETFTFGKVS-- 199
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
NV FGCG +N G GLVGLGR LSL +SQL KFSYCL D + T
Sbjct: 200 IPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLSL----VSQLKEAKFSYCLTSI--DDTKT 253
Query: 189 SKMYFGNGSEVSG--GGVVSTSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
S + G+ + V+G + +T L+ ++Y+++LEGISVG +P S+
Sbjct: 254 STLLMGSLASVNGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTR-----LPIKESTFQ 308
Query: 246 ISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG- 301
+ G + ID+G T L + ++ ++++ + + L G +LCY PS
Sbjct: 309 LQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLPVDNSGATGLELCYNLPSDTSE 368
Query: 302 -IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
P L HF GA + L + I GV C AM G + IFGN Q ++F+ +D
Sbjct: 369 LEVPKLVLHFT-GADLELPGENYMIADSSMGVICLAMGS-SGGMSIFGNVQQQNMFVSHD 426
Query: 361 FDSQMVSFKPTDC 373
+ + +SF PT+C
Sbjct: 427 LEKETLSFLPTNC 439
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 130/370 (35%), Positives = 184/370 (49%), Gaps = 29/370 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S VS +GEY + +GTP ++Y ++DTGSD+ W+QC PC CY+Q P++NP SS
Sbjct: 151 VVSGVSQGSGEYFSRIGVGTPAK-EMYLVLDTGSDVNWIQCEPCSDCYQQSDPVFNPTSS 209
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+YK L+C + QC LL+T +C S + C Y Y D S T G LAT+ +TFGNS D V
Sbjct: 210 STYKSLTCSAPQCSLLETSACRSNK-CLYQVSYGDGSFTVGELATDTVTFGNSGKIND-V 267
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH+N G+F A I +Q+ A FSYCLV DS +S +
Sbjct: 268 ALGCGHDNEGLFTGAAG-----LLGLGGGALSITNQMKATSFSYCLV--DRDSGKSSSLD 320
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP----YYNSSGAISK 248
F N ++ G + L +++ T+Y+V L G SVG ++P ++SG+
Sbjct: 321 F-NSVQLGSGDATAPLLRNQKIDTFYYVGLSGFSVG---GQKVMMPDAIFDVDASGS--- 373
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGI-A 303
G + +D G T L YN L + +KLT S CY S++ +
Sbjct: 374 GGVILDCGTAVTRLQTQAYNSLRDAF---LKLTTNLKKGTSSISLFDTCYDFSSLSSVKV 430
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
P + HF GG + L + IP G FCFA P + I GN Q I YD +
Sbjct: 431 PTVAFHFTGGKSLDLPAKNYLIPVDDNGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLAN 490
Query: 364 QMVSFKPTDC 373
+++ C
Sbjct: 491 KIIGLSGNKC 500
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 132/381 (34%), Positives = 199/381 (52%), Gaps = 32/381 (8%)
Query: 9 PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
P + V+ ++GEY++ +IGTPPL I+DTGSDL+W QC PC+ C Q P ++
Sbjct: 74 PITAARVLVTASSGEYLVDLAIGTPPLY-YTAIMDTGSDLIWTQCAPCLLCADQPTPYFD 132
Query: 69 PASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN- 127
S++Y+ L C+S +C L + SC +++C Y Y Y D++ T GVLA E TFG +N+
Sbjct: 133 VKKSATYRALPCRSSRCASLSSPSC-FKKMCVYQYYYGDTASTAGVLANETFTFGAANST 191
Query: 128 --FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
N+ FGCG N G N G+VG GR LSL +SQLG ++FSYCL + S
Sbjct: 192 KVRATNIAFGCGSLNAGDL-ANSSGMVGFGRGPLSL----VSQLGPSRFSYCLTSYL--S 244
Query: 186 SITSKMYFG-----NGSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIP 238
+ S++YFG + + S G V ++ +++ YF++L+ IS+G +KL+P
Sbjct: 245 ATPSRLYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLG-----TKLLP 299
Query: 239 YYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK 295
AI+ G + ID+G T L +D Y + + +AI L D +G C++
Sbjct: 300 IDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIPLPAMNDTDIGLDTCFQ 359
Query: 296 TPSMAGI---APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQ 352
P + P L HFD A + L+ + + G C M P G I GN+ Q
Sbjct: 360 WPPPPNVTVTVPDLVFHFD-SANMTLLPENYMLIASTTGYLCLVMAPT-GVGTIIGNYQQ 417
Query: 353 SDLFIGYDFDSQMVSFKPTDC 373
+L + YD + +SF P C
Sbjct: 418 QNLHLLYDIGNSFLSFVPAPC 438
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 139/375 (37%), Positives = 205/375 (54%), Gaps = 28/375 (7%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPPLLDIY-GIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
N+ + + V NGE++MK +IGTPP + Y I+DTGSDL+W QC PC QC+ Q PI++
Sbjct: 83 NSEIDAPVLPGNGEFLMKLAIGTPP--ETYSAIMDTGSDLIWTQCKPCTQCFDQPTPIFD 140
Query: 69 PASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
P SSS+ +LSC S+ C L +CS C Y YGY D S T+G+LA+E +TFG +
Sbjct: 141 PKKSSSFSKLSCSSKLCEALPQSTCSDG--CEYLYGYGDYSSTQGMLASETLTFGKVS-- 196
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
V FGCG +N G GLVGLGR LSL +SQL KFSYCL D +
Sbjct: 197 VPEVAFGCGEDNEGSGFSQGSGLVGLGRGPLSL----VSQLKEPKFSYCLT--SVDDTKA 250
Query: 189 SKMYFGNGSEV--SGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
S + G+ + V S + +T L+ + ++Y+++LEGISVG+ S +P S+ +
Sbjct: 251 STLLMGSLASVKASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTS-----LPIKKSTFS 305
Query: 246 ISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG- 301
+ + G + ID+G T L + ++ + ++ + I L G ++C+ PS +
Sbjct: 306 LQEDGSGGLIIDSGTTITYLEQSAFDLVAKEFTSQINLPVDNSGSTGLEVCFTLPSGSTD 365
Query: 302 -IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
P L HFD GA + L + I GV C AM G + IFGN Q ++ + +D
Sbjct: 366 IEVPKLVFHFD-GADLELPAENYMIADASMGVACLAMGSSSG-MSIFGNIQQQNMLVLHD 423
Query: 361 FDSQMVSFKPTDCTK 375
+ + +SF PT C +
Sbjct: 424 LEKETLSFLPTQCDE 438
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 135/390 (34%), Positives = 202/390 (51%), Gaps = 34/390 (8%)
Query: 1 MSPATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCY 60
+SPA P + V+ ++GEY++ +IGTPPL I+DTGSDL+W QC PC+ C
Sbjct: 66 VSPAPVADPITAARVLVTASSGEYLVDLAIGTPPLY-YTAIMDTGSDLIWTQCAPCLLCA 124
Query: 61 KQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERI 120
Q P ++ S++Y+ L C+S +C L + SC +++C Y Y Y D++ T GVLA E
Sbjct: 125 AQPTPYFDVKRSATYRALPCRSSRCAALSSPSC-FKKMCVYQYYYGDTASTAGVLANETF 183
Query: 121 TFGNSNN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
TFG +++ N+ FGCG N G N G+VG GR LSL +SQLG ++FSYC
Sbjct: 184 TFGAASSTKVRAANISFGCGSLNAGEL-ANSSGMVGFGRGPLSL----VSQLGPSRFSYC 238
Query: 178 LVPFHTDSSITSKMYFG-----NGSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNL 230
L + S S++YFG N + S G V ++ +++ YF++++GIS+G
Sbjct: 239 LTSYL--SPTPSRLYFGVFANLNSTNTSSGSPVQSTPFVINPALPNMYFLSVKGISLG-- 294
Query: 231 SNSSKLIPYYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPR 287
+K +P AI+ G + ID+G T L +D Y + + + I L D
Sbjct: 295 ---TKRLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLASTIPLPAMNDTD 351
Query: 288 LGSQLCYKTPSMAGI---APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDV 344
+G C++ P + P HFD GA + L + + G C AM P V
Sbjct: 352 IGLDTCFQWPPPPNVTVTVPDFVFHFD-GANMTLPPENYMLIASTTGYLCLAMAPT--SV 408
Query: 345 G-IFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
G I GN+ Q +L + YD + +SF P C
Sbjct: 409 GTIIGNYQQQNLHLLYDIANSFLSFVPAPC 438
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 130/366 (35%), Positives = 190/366 (51%), Gaps = 30/366 (8%)
Query: 26 MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQC 85
M+ SIG P + IVDTGSDL+W QC PC +C+ Q PI++P SSSY ++ C S C
Sbjct: 1 MELSIGNPAV-KYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLC 59
Query: 86 HLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVF 144
+ L +C+ + C Y Y Y D S T+G+LATE TF + N+ + FGCG N G
Sbjct: 60 NALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENS-ISGIGFGCGVENEGDG 118
Query: 145 NENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN--------- 195
GLVGLGR LSL +SQL KFSYCL DS +S ++ G+
Sbjct: 119 FSQGSGLVGLGRGPLSL----ISQLKETKFSYCLTSIE-DSEASSSLFIGSLASGIVNKT 173
Query: 196 GSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNM 251
G+ + G + SL+ D+ ++Y++ L+GI+VG +K + S+ +++ G M
Sbjct: 174 GASLDGEVTKTMSLLRNPDQPSFYYLELQGITVG-----AKRLSVEKSTFELAEDGTGGM 228
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG--IAPILTAH 309
ID+G T L + + L+E+ + + L G LC+K P A P + H
Sbjct: 229 IIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFH 288
Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
F GA + L + + GV C AM +G + IFGN Q + + +D + + VSF
Sbjct: 289 FK-GADLELPGENYMVADSSTGVLCLAMGSSNG-MSIFGNVQQQNFNVLHDLEKETVSFV 346
Query: 370 PTDCTK 375
PT+C K
Sbjct: 347 PTECGK 352
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 127/374 (33%), Positives = 174/374 (46%), Gaps = 34/374 (9%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQ-VKPIYNPASSSSYKELSCQ 81
EY+M S+GTPP + +DTGSDL+W QC PC+ C++Q P+ +PA+SS++ L C
Sbjct: 89 EYLMHVSVGTPPR-PVALTLDTGSDLVWTQCAPCLDCFEQGAAPVLDPAASSTHAALPCD 147
Query: 82 SEQCHLLDTVSCSSQQL----CNYTYGYADSSLTKGVLATERITFGNSNNF----FDNVV 133
+ C L SC + C Y Y Y D SLT G LAT+ TFG +N V
Sbjct: 148 APLCRALPFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGGDDNAGGLAARRVT 207
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP-FHTDSS------ 186
FGCGH N G+F NE G+ G GR R SL SQL FSYC F T SS
Sbjct: 208 FGCGHINKGIFQANETGIAGFGRGRWSLP----SQLNVTSFSYCFTSMFDTKSSSVVTLG 263
Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
+ G V +T L+ + + YFV L GISVG + +P
Sbjct: 264 AAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVG---GARVAVPESR---- 316
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA----G 301
+ + ID+GA T LP+D Y ++ + + + L LC+ P A
Sbjct: 317 -LRSSTIIDSGASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAALDLCFALPVAALWRRP 375
Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
P LT H DGGA L + V C + G+ + GN+ Q + + YD
Sbjct: 376 AVPALTLHLDGGADWELPRGNYVFEDYAARVLCVVLDAAAGEQVVIGNYQQQNTHVVYDL 435
Query: 362 DSQMVSFKPTDCTK 375
++ ++SF P C K
Sbjct: 436 ENDVLSFAPARCDK 449
>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 756
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 124/362 (34%), Positives = 189/362 (52%), Gaps = 41/362 (11%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
Y+MK +GTPP +I +DTGSD++W QC+PC CY Q PI++P+ SS+++E C
Sbjct: 421 YLMKLQVGTPPF-EIVAEIDTGSDIIWTQCMPCPNCYSQFAPIFDPSKSSTFREQRCNGN 479
Query: 84 QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGHNN 140
CH Y YAD + +KG+LATE +T +++ GCG +N
Sbjct: 480 SCH--------------YEIIYADKTYSKGILATETVTIPSTSGEPFVMAETKIGCGLDN 525
Query: 141 TGV----FNENEMGLVGLGRTRLSLASQI-LSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
T + F + G+VGL LSL SQ+ L G SYC TSK+ FG
Sbjct: 526 TNLQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGL--ISYCF-----SGQGTSKINFGT 578
Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
+ V+G G V+ + K+D +Y++ L+ +SV + LI + GN+FID+
Sbjct: 579 NAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVED-----NLIATLGTPFHAEDGNIFIDS 633
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ--LCYKTPSMAGIAPILTAHFDGG 313
G T P + N + E V + T + P +GS LCY + ++ I P++T HF GG
Sbjct: 634 GTTLTYFPMSYCNLVREAVEQVV--TAVKVPDMGSDNLLCYYSDTI-DIFPVITMHFSGG 690
Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGDV-GIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
A + L + ++ G+FC A+ D + +FGN AQ++ +GYD S ++SF PT+
Sbjct: 691 ADLVLDKYNMYLETITGGIFCLAIGCNDPSMPAVFGNRAQNNFLVGYDPSSNVISFSPTN 750
Query: 373 CT 374
C+
Sbjct: 751 CS 752
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 114/354 (32%), Positives = 177/354 (50%), Gaps = 41/354 (11%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
Y+MK +GTPP +I +DTGSDL+W QC+PC CY Q PI++P+ SS++ E C +
Sbjct: 82 YLMKLQVGTPPF-EIAAEIDTGSDLIWTQCMPCPDCYSQFDPIFDPSKSSTFNEQRCHGK 140
Query: 84 QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGHNN 140
CH Y Y D++ +KG+LATE +T +++ GCG +N
Sbjct: 141 SCH--------------YEIIYEDNTYSKGILATETVTIHSTSGEPFVMAETTIGCGLHN 186
Query: 141 TGV----FNENEMGLVGLGRTRLSLASQI-LSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
T + F + G+VGL SL SQ+ L G SYC TSK+ FG
Sbjct: 187 TDLDNSGFASSSSGIVGLNMGPRSLISQMDLPYPGL--ISYCF-----SGQGTSKINFGT 239
Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
+ V+G G V+ + K+D +Y++ L+ +SV + + P++ GN+ ID+
Sbjct: 240 NAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNRIETLGTPFH-----AEDGNIVIDS 294
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
G+ T P + N + + V + DP LCY + ++ I P++T HF GGA
Sbjct: 295 GSTVTYFPVSYCNLVRKAVEQVVTAVRVPDPSGNDMLCYFSETI-DIFPVITMHFSGGAD 353
Query: 316 VPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
+ L + ++ G+FC A+ P IFGN AQ++ +GYD S ++
Sbjct: 354 LVLDKYNMYMESNSGGLFCLAIICNSPTQE--AIFGNRAQNNFLVGYDSSSLLL 405
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 194 bits (494), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 143/373 (38%), Positives = 198/373 (53%), Gaps = 36/373 (9%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELS 79
GEY+M SIGTPPL I DTGSDL+W QC PC QC+ Q P+YNPASS+++ L
Sbjct: 90 GEYLMTLSIGTPPL-SYPAIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLP 148
Query: 80 CQSEQCHLLDTVSCSSQQ-----LCNYTYGYADSSLTKGVLATERITFGNS---NNFFDN 131
C S ++ + + N TYG + T GV +E TFG++
Sbjct: 149 CNSSLSMCAGVLAGKAPPPGCACMYNQTYG---TGWTAGVQGSETFTFGSAAADQARVPG 205
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
+ FGC + ++ +N GLVGLGR LSL +SQLGA +FSYCL PF D++ TS +
Sbjct: 206 IAFGCSNASSSDWN-GSAGLVGLGRGSLSL----VSQLGAGRFSYCLTPFQ-DTNSTSTL 259
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
G + ++G GV ST V+ K TYY++ L GIS+G + + + P S A
Sbjct: 260 LLGPSAALNGTGVRSTPFVASPAKAPMSTYYYLNLTGISLG--AKALSISPDAFSLKADG 317
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD--PRLGSQLCYK--TPSMAGIA 303
G + ID+G T L Y ++ V++ + L P D G LCY TP+ A A
Sbjct: 318 TGGLIIDSGTTITSLVNAAYQQVRAAVQSLVTL-PAIDGSDSTGLDLCYALPTPTSAPPA 376
Query: 304 -PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ-PIDGDVGIFGNFAQSDLFIGYDF 361
P +T HFDG A + L S I GV+C AM+ DG + FGN+ Q ++ I YD
Sbjct: 377 MPSMTLHFDG-ADMVLPADSYMISG--SGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDV 433
Query: 362 DSQMVSFKPTDCT 374
++M+SF P C+
Sbjct: 434 RNEMLSFAPAKCS 446
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 194 bits (494), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 136/374 (36%), Positives = 193/374 (51%), Gaps = 28/374 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V++ V NGE++MK +IG+PP I+DTGSDL+W QC PC QC+ Q PI++P S
Sbjct: 100 VKAPVVAGNGEFLMKLAIGSPPR-SFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQS 158
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FF 129
SS+ ++SC SE C L T +CSS C Y Y Y DSS T+GVLA E TFG+S
Sbjct: 159 SSFYKISCSSELCGALPTSTCSSDG-CEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISI 217
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
+ FGCG++N G GLVGLGR LSL SQ+ Q KF+YCL D S S
Sbjct: 218 PGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQ----KFAYCLTAI--DDSKPS 271
Query: 190 KMYFGNGSEV----SGGGVVSTSLVSKEDK-TYYFVTLEGISVG--NLSNSSKLIPYYNS 242
+ G+ + + S + +T L+ + ++Y+++L+GISVG LS ++
Sbjct: 272 SLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDD 331
Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAG 301
G + ID+G T + + L+ + + L P D G LC+ P+
Sbjct: 332 ----GSGGVIIDSGTTITYVENSAFTSLKNEFIAQMNL-PVDDSGTGGLDLCFNLPAGTN 386
Query: 302 I--APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
P LT HF GA + L + I G+ C A+ G + IFGN Q + + +
Sbjct: 387 QVEVPKLTFHFK-GADLELPGENYMIGDSKAGLLCLAIGSSRG-MSIFGNLQQQNFMVVH 444
Query: 360 DFDSQMVSFKPTDC 373
D + +SF PT C
Sbjct: 445 DLQEETLSFLPTQC 458
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 194 bits (494), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 126/368 (34%), Positives = 188/368 (51%), Gaps = 23/368 (6%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSS 74
+V + Y++ +IGTPPL + ++DTGSDL+W QC PC +C+ Q P+Y PA S++
Sbjct: 84 SVHASTATYLVDIAIGTPPL-PLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSAT 142
Query: 75 YKELSCQSEQCHLLDT--VSCSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
Y +SC+S C L + CS C Y + Y D + T GVLATE T G S+
Sbjct: 143 YANVSCRSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLG-SDTAVRG 201
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
V FGCG N G +N GLVG+GR LSL +SQLG +FSYC PF +++ S +
Sbjct: 202 VAFGCGTENLGS-TDNSSGLVGMGRGPLSL----VSQLGVTRFSYCFTPF--NATAASPL 254
Query: 192 YFGNGSEVSGGG-----VVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
+ G+ + +S V S S ++ +YY+++LEGI+VG+ + P +
Sbjct: 255 FLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGD--TLLPIDPAVFRLTPM 312
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-API 305
G + ID+G T L + + L + + ++L LG LC+ S + P
Sbjct: 313 GDGGVIIDSGTTFTALEESAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPR 372
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
L HFD GA + L S + GV C M G + + G+ Q + I YD + +
Sbjct: 373 LVLHFD-GADMELRRESYVVEDRSAGVACLGMVSARG-MSVLGSMQQQNTHILYDLERGI 430
Query: 366 VSFKPTDC 373
+SF+P C
Sbjct: 431 LSFEPAKC 438
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 131/363 (36%), Positives = 189/363 (52%), Gaps = 26/363 (7%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKEL 78
NG Y+M+ IGTP + + I DTGSDL WVQC PC +C+ Q P+Y+P +SS++ L
Sbjct: 93 NGNYLMRIYIGTPSV-ERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFTLL 151
Query: 79 SCQSEQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN-VVFG 135
C S+ C L CS C Y Y Y D+S + G L+++ I +++ + FG
Sbjct: 152 PCDSQPCTQLPYSQYVCSDYGDCIYAYTYGDNSYSYGGLSSDSIRLMLLQLHYNSKICFG 211
Query: 136 CGHNN--TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
CG N T + G+VGLG LSL SQ+ ++G +KFSYCL+PF ++S+ SK+ F
Sbjct: 212 CGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDEIG-HKFSYCLLPFSSNSN--SKLKF 268
Query: 194 GNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
G + V G GVVST L+ K D +Y++ LEGI+VG + + + GN+ I
Sbjct: 269 GEAAIVQGNGVVSTPLIIKPDLPFYYLNLEGITVGAKTVKT----------GQTDGNIII 318
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGG 313
D+G+ T L + FYN V+ + + Q C+ P + HF GG
Sbjct: 319 DSGSTLTYLEESFYNEFVSLVKETVAVEEDQYIPYPFDFCFTYKEGMSTPPDVVFHFTGG 378
Query: 314 AKV-PLIHTSTFIPPPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
V ++T I + + C + P D + IFGN Q D +GYD VSF PT
Sbjct: 379 DVVLKPMNTLVLIE---DNLICSTVVPSHFDGIAIFGNLGQIDFHVGYDIQGGKVSFAPT 435
Query: 372 DCT 374
DC+
Sbjct: 436 DCS 438
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 194 bits (493), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 135/379 (35%), Positives = 185/379 (48%), Gaps = 32/379 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP--CVQCYKQVKPIYNPA 70
V+S + T + EY+M ++GTPP + I DTGSDL+WV C +++P+
Sbjct: 89 VESKIITRSFEYLMYVNVGTPPA-QMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPS 147
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF-- 128
S++Y LSCQS C L SC + C Y Y Y D S T GVL+TE +F +
Sbjct: 148 RSTTYSLLSCQSAACQALSQASCDADSECQYQYAYGDGSRTIGVLSTETFSFAAAGGGGE 207
Query: 129 ----FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQI-LSQLGANKFSYCLVPFHT 183
V FGC + G F + GLVGLG LSL SQ+ + A +FSYCLVP +
Sbjct: 208 GQVRVPRVSFGCSTGSAGSFRSD--GLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYA 265
Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
++ +S + FG + VS G ST LV E +YY V LE ++V +S NSS
Sbjct: 266 AANSSSTLSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAGQDVASA-----NSS 320
Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA--- 300
+ +D+G T L L ++ I+L Q P QLCY +
Sbjct: 321 ------RIIVDSGTTLTFLDPALLRPLVAELERRIRLPRAQPPEQLLQLCYDVQGKSQAE 374
Query: 301 --GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD--VGIFGNFAQSDLF 356
GI P +T F GGA V L +TF EG C + P+ V I GN AQ +
Sbjct: 375 DFGI-PDVTLRFGGGASVTLRPENTFSLLE-EGTLCLVLVPVSESQPVSILGNIAQQNFH 432
Query: 357 IGYDFDSQMVSFKPTDCTK 375
+GYD D++ V+F DCT+
Sbjct: 433 VGYDLDARTVTFAAVDCTR 451
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 194 bits (493), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 124/359 (34%), Positives = 185/359 (51%), Gaps = 39/359 (10%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
Y+MK +GTPP +I +DTGSDL+W QC+PC CY Q PI++P++SS++KE C
Sbjct: 61 YLMKLQVGTPPF-EIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCNGN 119
Query: 84 QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGHNN 140
CH Y YAD++ +KG LATE +T +++ GCGHN+
Sbjct: 120 SCH--------------YKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNS 165
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
+ F G+VGL SL +Q+ + SYC S TSK+ FG + V+
Sbjct: 166 SW-FKPTFSGMVGLSWGPSSLITQMGGEY-PGLMSYCFA-----SQGTSKINFGTNAIVA 218
Query: 201 GGGVVSTSL-VSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPP 259
G GVVST++ ++ Y++ L+ +SVG+ + ++ +GN+ ID+G
Sbjct: 219 GDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFH-----ALEGNIIIDSGTTL 273
Query: 260 TLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLI 319
T P + N + E V + + DP LCY T ++ I P++T HF GGA + L
Sbjct: 274 TYFPVSYCNLVREAVDHYVTAVRTADPTGNDMLCYYTDTI-DIFPVITMHFSGGADLVLD 332
Query: 320 HTSTFIPPPVEGVFCFAM----QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ +I G FC A+ P D IFGN AQ++ +GYD S +V F PT+C+
Sbjct: 333 KYNMYIETITRGTFCLAIICNNPPQD---AIFGNRAQNNFLVGYDSSSLLVFFSPTNCS 388
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 194 bits (493), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 126/368 (34%), Positives = 188/368 (51%), Gaps = 23/368 (6%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSS 74
+V + Y++ +IGTPPL + ++DTGSDL+W QC PC +C+ Q P+Y PA S++
Sbjct: 84 SVHASTATYLVDIAIGTPPL-PLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSAT 142
Query: 75 YKELSCQSEQCHLLDT--VSCSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
Y +SC+S C L + CS C Y + Y D + T GVLATE T G S+
Sbjct: 143 YANVSCRSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLG-SDTAVRG 201
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
V FGCG N G +N GLVG+GR LSL +SQLG +FSYC PF +++ S +
Sbjct: 202 VAFGCGTENLGS-TDNSSGLVGMGRGPLSL----VSQLGVTRFSYCFTPF--NATAASPL 254
Query: 192 YFGNGSEVSGGG-----VVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
+ G+ + +S V S S ++ +YY+++LEGI+VG+ + P +
Sbjct: 255 FLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGD--TLLPIDPAVFRLTPM 312
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-API 305
G + ID+G T L + + L + + ++L LG LC+ S + P
Sbjct: 313 GDGGVIIDSGTTFTALEERAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPR 372
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
L HFD GA + L S + GV C M G + + G+ Q + I YD + +
Sbjct: 373 LVLHFD-GADMELRRESYVVEDRSAGVACLGMVSARG-MSVLGSMQQQNTHILYDLERGI 430
Query: 366 VSFKPTDC 373
+SF+P C
Sbjct: 431 LSFEPAKC 438
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 140/378 (37%), Positives = 195/378 (51%), Gaps = 45/378 (11%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSC 80
GEY+M +IGTPPL + DTGSDL+W QC PC QC++Q P+YNPASS+++ L C
Sbjct: 110 GEYLMTLAIGTPPL-PYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPC 168
Query: 81 QS--EQCHLLDTVSCSSQQ---LCNYTYGYADSSLTKGVLATERITFGNS---NNFFDNV 132
S C + + N TYG + T GV +E TFG+S V
Sbjct: 169 NSSLSMCAGALAGAAPPPGCACMYNQTYG---TGWTAGVQGSETFTFGSSAADQARVPGV 225
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGC + ++ +N GLVGLGR LSL +SQLGA +FSYCL PF D++ TS +
Sbjct: 226 AFGCSNASSSDWN-GSAGLVGLGRGSLSL----VSQLGAGRFSYCLTPFQ-DTNSTSTLL 279
Query: 193 FGNGSEVSGGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS- 247
G + ++G GV ST V+ + TYY++ L GIS+G +K +P S GA S
Sbjct: 280 LGPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLG-----AKALPI--SPGAFSL 332
Query: 248 ----KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD--PRLGSQLCYKTPSMA- 300
G + ID+G T L Y ++ V++ + P D G LC+ P+
Sbjct: 333 KPDGTGGLIIDSGTTITSLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLDLCFALPAPTS 392
Query: 301 ---GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ-PIDGDVGIFGNFAQSDLF 356
+ P +T HFD GA + L S I GV+C AM+ DG + FGN+ Q ++
Sbjct: 393 APPAVLPSMTLHFD-GADMVLPADSYMISG--SGVWCLAMRNQTDGAMSTFGNYQQQNMH 449
Query: 357 IGYDFDSQMVSFKPTDCT 374
I YD + +SF P C+
Sbjct: 450 ILYDVREETLSFAPAKCS 467
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 140/362 (38%), Positives = 197/362 (54%), Gaps = 27/362 (7%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
+GEY+++ +IGTP L + I+DTGSDL+W +C PC C IY+P+SSS+Y ++
Sbjct: 38 GSGEYLIQMAIGTPA-LSLSAIMDTGSDLVWTKCNPCTDC--STSSIYDPSSSSTYSKVL 94
Query: 80 CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
CQS C SC++ C Y Y Y D S T G+L+ E TF S+ N+ FGCGH+
Sbjct: 95 CQSSLCQPPSIFSCNNDGDCEYVYPYGDRSSTSGILSDE--TFSISSQSLPNITFGCGHD 152
Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
N G + GLVG GR LSL SQ+ +G NKFSYCLV TDSS TS ++ GN + +
Sbjct: 153 NQGF--DKVGGLVGFGRGSLSLVSQLGPSMG-NKFSYCLVS-RTDSSKTSPLFIGNTASL 208
Query: 200 SGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPY----YNSSGAISKGNMFIDT 255
V ST LV +Y+++LEGISVG S IP S G+ G + ID+
Sbjct: 209 EATTVGSTPLVQSSSTNHYYLSLEGISVG---GQSLAIPTGTFDIQSDGS---GGLIIDS 262
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGA 314
G T L + Y+ ++E + ++I L P D +L LC+ + P +T HF GA
Sbjct: 263 GTTLTFLQQTAYDAVKEAMVSSINL-PQADGQL--DLCFNQQGSSNPGFPSMTFHFK-GA 318
Query: 315 KVPLIHTSTFIPPPVEGVFCFAMQPID---GDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
+ + P + C AM P + G++ IFGN Q + I YD ++ ++SF PT
Sbjct: 319 DYDVPKENYLFPDSTSDIVCLAMMPTNSNLGNMAIFGNVQQQNYQILYDNENNVLSFAPT 378
Query: 372 DC 373
C
Sbjct: 379 AC 380
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 194 bits (492), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 136/374 (36%), Positives = 193/374 (51%), Gaps = 28/374 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V++ V NGE++MK +IG+PP I+DTGSDL+W QC PC QC+ Q PI++P S
Sbjct: 355 VKAPVVAGNGEFLMKLAIGSPPR-SFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQS 413
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FF 129
SS+ ++SC SE C L T +CSS C Y Y Y DSS T+GVLA E TFG+S
Sbjct: 414 SSFYKISCSSELCGALPTSTCSSDG-CEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISI 472
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
+ FGCG++N G GLVGLGR LSL SQ+ Q KF+YCL D S S
Sbjct: 473 PGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQ----KFAYCLTAI--DDSKPS 526
Query: 190 KMYFGNGSEV----SGGGVVSTSLVSKEDK-TYYFVTLEGISVG--NLSNSSKLIPYYNS 242
+ G+ + + S + +T L+ + ++Y+++L+GISVG LS ++
Sbjct: 527 SLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDD 586
Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAG 301
G + ID+G T + + L+ + + L P D G LC+ P+
Sbjct: 587 ----GSGGVIIDSGTTITYVENSAFTSLKNEFIAQMNL-PVDDSGTGGLDLCFNLPAGTN 641
Query: 302 I--APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
P LT HF GA + L + I G+ C A+ G + IFGN Q + + +
Sbjct: 642 QVEVPKLTFHFK-GADLELPGENYMIGDSKAGLLCLAIGSSRG-MSIFGNLQQQNFMVVH 699
Query: 360 DFDSQMVSFKPTDC 373
D + +SF PT C
Sbjct: 700 DLQEETLSFLPTQC 713
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 193 bits (491), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 125/362 (34%), Positives = 184/362 (50%), Gaps = 37/362 (10%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
N Y+MK +GTPP +I I+DTGS++ W QCLPCV CY+Q PI++P+ SS++KE C
Sbjct: 62 NSVYLMKLQVGTPPF-EIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTFKEKRC 120
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCG 137
C Y Y D + T G LATE IT +++ + GCG
Sbjct: 121 DGHS--------------CPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETIIGCG 166
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
HNN+ F + G+VGL SL +Q+ + SYC TSK+ FG +
Sbjct: 167 HNNSW-FKPSFSGMVGLNWGPSSLITQMGGEY-PGLMSYCF-----SGQGTSKINFGANA 219
Query: 198 EVSGGGVVSTSLVSKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
V+G GVVST++ K +Y++ L+ +SVGN I ++ +GN+ ID+G
Sbjct: 220 IVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGN-----TRIETMGTTFHALEGNIVIDSG 274
Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKV 316
T P + N + + V + + DP LCY + ++ I P++T HF GG +
Sbjct: 275 TTLTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYNSDTI-DIFPVITMHFSGGVDL 333
Query: 317 PLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
L + ++ GVFC A+ P IFGN AQ++ +GYD S +VSF PT+C
Sbjct: 334 VLDKYNMYMESNNGGVFCLAIICNSPTQE--AIFGNRAQNNFLVGYDSSSLLVSFSPTNC 391
Query: 374 TK 375
+
Sbjct: 392 SA 393
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 129/374 (34%), Positives = 175/374 (46%), Gaps = 25/374 (6%)
Query: 7 FYPNNV---VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV 63
F P ++ + S S +GEY + IG PP Y I+DTGSD+ WVQC PC CY+Q
Sbjct: 129 FKPEDLQSPIISGTSQGSGEYFSRVGIGKPPS-QAYLILDTGSDVNWVQCAPCADCYQQA 187
Query: 64 KPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG 123
PI+ PASS+S+ LSC + QC LD C + C Y Y D S T G TE IT G
Sbjct: 188 DPIFEPASSASFSTLSCNTRQCRSLDVSECRNDT-CLYEVSYGDGSYTVGDFVTETITLG 246
Query: 124 NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHT 183
++ DNV GCGHNN G+F + SQ+ A FSYCLV
Sbjct: 247 SAP--VDNVAIGCGHNNEGLFVGAAG-----LLGLGGGSLSFPSQINATSFSYCLV--DR 297
Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
DS S + F S + V + L + T+Y+V L G+SVG +L+ S+
Sbjct: 298 DSESASTLEF--NSTLPPNAVSAPLLRNHHLDTFYYVGLTGLSVGG-----ELVSIPESA 350
Query: 244 GAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
I + G + +D+G T L D YN L + + P + CY S
Sbjct: 351 FQIDESGNGGVIVDSGTAITRLQTDVYNSLRDAFVKRTRDLPSTNGIALFDTCYDLSSKG 410
Query: 301 GI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
+ P ++ HF G ++PL + +P EG FCFA P + I GN Q + Y
Sbjct: 411 NVEVPTVSFHFPDGKELPLPAKNYLVPLDSEGTFCFAFAPTASSLSIIGNVQQQGTRVVY 470
Query: 360 DFDSQMVSFKPTDC 373
D + +V F P C
Sbjct: 471 DLVNHLVGFVPNKC 484
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 133/377 (35%), Positives = 191/377 (50%), Gaps = 31/377 (8%)
Query: 12 VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
+Q V NGE++M SIGTP + I+DTGSDL+W QC PCV+C+ Q P+++P+S
Sbjct: 90 ALQVPVHAGNGEFLMDMSIGTPAVA-YAAIIDTGSDLVWTQCKPCVECFNQSTPVFDPSS 148
Query: 72 SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
SS+Y L C S C L + C+S + C YTY Y DSS T+GVLA E T + +
Sbjct: 149 SSTYAALPCSSTLCSDLPSSKCTSAK-CGYTYTYGDSSSTQGVLAAETFTLAKTK--LPD 205
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
V FGCG N G GLVGLGR LSL +SQLG NKFSYCL D + S +
Sbjct: 206 VAFGCGDTNEGDGFTQGAGLVGLGRGPLSL----VSQLGLNKFSYCLTSL--DDTSKSPL 259
Query: 192 YFGNGSEV-----SGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
G+ + + + V +T L+ + ++Y+V L+G++VG S I +S+ A
Sbjct: 260 LLGSLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVG-----STHITLPSSAFA 314
Query: 246 ISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI 302
+ G + +D+G T L Y L++ +KL +G C++ P+ +G+
Sbjct: 315 VQDDGTGGVIVDSGTSITYLELQGYRALKKAFAAQMKLPAADGSGIGLDTCFEAPA-SGV 373
Query: 303 ----APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIG 358
P L H D GA + L + + G C + G + I GNF Q ++
Sbjct: 374 DQVEVPKLVFHLD-GADLDLPAENYMVLDSGSGALCLTVMGSRG-LSIIGNFQQQNIQFV 431
Query: 359 YDFDSQMVSFKPTDCTK 375
YD +SF P C K
Sbjct: 432 YDVGENTLSFAPVQCAK 448
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 133/359 (37%), Positives = 181/359 (50%), Gaps = 29/359 (8%)
Query: 30 IGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD 89
IGTP L IVDTGSDL+W QC PCV C+KQ P+++P+SSS+Y + C S C L
Sbjct: 173 IGTPALA-YSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLP 231
Query: 90 TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEM 149
T C+S C YTY Y DSS T+GVLATE T S VVFGCG N G
Sbjct: 232 TSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK--LPGVVFGCGDTNEGDGFSQGA 289
Query: 150 GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV-----SGGGV 204
GLVGLGR LSL +SQLG +KFSYCL D + S + G+ + + + V
Sbjct: 290 GLVGLGRGPLSL----VSQLGLDKFSYCLTSL--DDTNNSPLLLGSLAGISEASAAASSV 343
Query: 205 VSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNMFIDTGAPPT 260
+T L+ + ++Y+V+L+ I+VG S I +S+ A+ G + +D+G T
Sbjct: 344 QTTPLIKNPSQPSFYYVSLKAITVG-----STRISLPSSAFAVQDDGTGGVIVDSGTSIT 398
Query: 261 LLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI----APILTAHFDGGAKV 316
L Y L++ + L +G LC++ P+ G+ P L HFDGGA +
Sbjct: 399 YLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAK-GVDQVEVPRLVFHFDGGADL 457
Query: 317 PLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
L + + G C + G + I GNF Q + YD +SF P C K
Sbjct: 458 DLPAENYMVLDGGSGALCLTVMGSRG-LSIIGNFQQQNFQFVYDVGHDTLSFAPVQCNK 515
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 127/368 (34%), Positives = 183/368 (49%), Gaps = 28/368 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+ S S +GEY + IG PP +Y ++DTGSD+ WVQC PC +CY+Q PI+ P SS
Sbjct: 140 IVSGASQGSGEYFSRVGIGRPPS-PVYMVLDTGSDVSWVQCAPCAECYEQTDPIFEPTSS 198
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
+S+ LSC++EQC LD C + C Y Y D S T G TE +T G+++ N+
Sbjct: 199 ASFTSLSCETEQCKSLDVSECRNGT-CLYEVSYGDGSYTVGDFVTETVTLGSTS--LGNI 255
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGHNN G+F L + SQL A+ FSYCLV +DS TS +
Sbjct: 256 AIGCGHNNEGLFIGAAGLL-----GLGGGSLSFPSQLNASSFSYCLVDRDSDS--TSTLD 308
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---G 249
F S ++ V + + T++++ L G+SVG ++P +S +S+ G
Sbjct: 309 F--NSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGG-----AVLPIPETSFQMSEDGNG 361
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGI-API 305
+ +D+G T L YN L + +K T G L CY S + + P
Sbjct: 362 GIIVDSGTAVTRLQTTVYNVLRDAF---VKSTHDLQTARGVALFDTCYDLSSKSRVEVPT 418
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
++ HF G ++PL + IP EG FCFA P D + I GN Q +G+D + +
Sbjct: 419 VSFHFANGNELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSL 478
Query: 366 VSFKPTDC 373
V F P C
Sbjct: 479 VGFSPNKC 486
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 132/381 (34%), Positives = 189/381 (49%), Gaps = 32/381 (8%)
Query: 5 TYFYPNNV---VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK 61
T F P ++ V S S +GEY + +GTP ++Y ++DTGSD+ W+QCLPC +CY+
Sbjct: 142 TRFQPEDLTTPVVSGTSQGSGEYFSRIGVGTPAK-EMYVVLDTGSDVNWIQCLPCSECYQ 200
Query: 62 QVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERIT 121
Q PI++P SSS++K L+C +C LD +C S + C Y Y D S T G AT+ +T
Sbjct: 201 QSDPIFDPTSSSTFKSLTCSDPKCASLDVSACRSNK-CLYQVSYGDGSFTVGNYATDTVT 259
Query: 122 FGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
FG S D V GCGH+N G+F A + +Q+ A FSYCLV
Sbjct: 260 FGESGKVND-VALGCGHDNEGLFTGAAG-----LLGLGGGALSMTNQIKAKSFSYCLV-- 311
Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPY 239
DS+ +S + F N ++ G + L + + T+Y+V L G SVG +S S L
Sbjct: 312 DRDSAKSSSLDF-NSVQIGAGDATAPLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFE- 369
Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS------QLC 293
++SGA G + +D G T L YN L + +KLT D + G+ C
Sbjct: 370 VDASGA---GGVILDCGTAVTRLQTQAYNSLRDAF---VKLT--TDFKKGTSPISLFDTC 421
Query: 294 YKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQ 352
Y S++ + P +T HF GG + L + IP G FCFA P + I GN Q
Sbjct: 422 YDFSSLSTVKVPTVTFHFTGGKSLNLPAKNYLIPIDDAGTFCFAFAPTSSSLSIIGNVQQ 481
Query: 353 SDLFIGYDFDSQMVSFKPTDC 373
I YD + ++ C
Sbjct: 482 QGTRITYDLANNLIGLSANKC 502
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 191 bits (485), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 138/371 (37%), Positives = 191/371 (51%), Gaps = 22/371 (5%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
+G Y M+ +G+PP IVDTGSDL+W+QC PC QCY Q PIY+P++SS++ + SC
Sbjct: 1 SGAYTMEIELGSPPK-KFNAIVDTGSDLVWIQCKPCSQCYSQSDPIYDPSASSTFAKTSC 59
Query: 81 QSEQCHLLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITF---GNSNNFFDNVVFGC 136
+ C L CSS + C Y Y Y DSS T+G A E +T G S+ F N FGC
Sbjct: 60 STSSCQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGC 119
Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
G N+G F G+VGLG+ ++SL++Q+ S + NKFSYCLV F DSS TS + FG+
Sbjct: 120 GRLNSGSFG-GAAGIVGLGQGKISLSTQLGSAIN-NKFSYCLVDFDDDSSKTSPLIFGS- 176
Query: 197 SEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVG--NLSNSSKLIPYYNSSGA-------- 245
S +G G +ST ++ + TYYFV LEGISVG LS +++ I + +
Sbjct: 177 SASTGSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRAL 236
Query: 246 -ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA- 303
++ G D+G TLL Y++++ +++ L G LCY
Sbjct: 237 EVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSGFDLCYDVSKSKNFKF 296
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFD 362
P LT F G P I E V C AM +GI GN Q + + YD
Sbjct: 297 PALTLAFKGTKFSPPQKNYFVIVDTAETVACLAMGGSGSLGLGIIGNLMQQNYHVVYDRG 356
Query: 363 SQMVSFKPTDC 373
+ +S P C
Sbjct: 357 TSTISMSPAQC 367
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 191 bits (485), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 131/365 (35%), Positives = 177/365 (48%), Gaps = 17/365 (4%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S + +GEY + +G P D ++DTGSD+ W+QC PC CY+Q PIYNPA S
Sbjct: 134 VVSGMDQGSGEYFSRIGVGAPRR-DQLMVLDTGSDVTWIQCEPCSDCYQQSDPIYNPALS 192
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
SSYK + CQ+ C LD CS C Y Y D S T+G ATE +T G + NV
Sbjct: 193 SSYKLVGCQANLCQQLDVSGCSRNGSCLYQVSYGDGSYTQGNFATETLTLGGAP--LQNV 250
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH+N G+F L+GLG LS SQ+ + G FSYCLV DS +S +
Sbjct: 251 AIGCGHDNEGLFVGAAG-LLGLGGGSLSFPSQLTDENG-KIFSYCLV--DRDSESSSTLQ 306
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS---KG 249
FG + V G V++ L + T+Y+V+L GISVG K++ +S I G
Sbjct: 307 FGRAA-VPNGAVLAPMLKNSRLDTFYYVSLSGISVGG-----KMLSISDSVFGIDASGNG 360
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTA 308
+ +D+G T L Y+ L + R K P D CY S + P +
Sbjct: 361 GVIVDSGTAVTRLQTAAYDSLRDAFRAGTKNLPSTDGVSLFDTCYDLSSKESVDVPTVVF 420
Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
HF GG + L + +P G FCFA P + I GN Q + + +D + V F
Sbjct: 421 HFSGGGSMSLPAKNYLVPVDSMGTFCFAFAPTSSSLSIVGNIQQQGIRVSFDRANNQVGF 480
Query: 369 KPTDC 373
C
Sbjct: 481 AVNKC 485
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 191 bits (485), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 135/383 (35%), Positives = 185/383 (48%), Gaps = 42/383 (10%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP----IYN 68
V+S + T + EY+M ++GTPP + I DTGSDL+WV C ++
Sbjct: 92 VESKIITRSFEYLMYVNVGTPPT-QLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQ 150
Query: 69 PASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GN 124
P SS+Y +LSCQS C L SC + C Y Y Y D S T GVL+TE +F G
Sbjct: 151 PTRSSTYSQLSCQSNACQALSQASCDADSECQYQYSYGDGSRTIGVLSTETFSFVDGGGK 210
Query: 125 SNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGA-----NKFSYCLV 179
V FGC + G F + GLVGLG SL +SQLGA K SYCL+
Sbjct: 211 GQVRVPRVNFGCSTASAGTFRSD--GLVGLGAGAFSL----VSQLGATTHIDRKLSYCLI 264
Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPY 239
P + D++ +S + FG+ + VS G ST LV + +YY V LE ++VG + +
Sbjct: 265 PSY-DANSSSTLNFGSRAVVSEPGAASTPLVPSDVDSYYTVALESVAVGG-----QEVAT 318
Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM 299
++S + +D+G T L L ++ IKL Q P QLCY
Sbjct: 319 HDS-------RIIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQPPEQLLQLCYDVQGK 371
Query: 300 A-----GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD--VGIFGNFAQ 352
+ GI P +T F GGA V L +TF EG C + P+ V I GN AQ
Sbjct: 372 SETDNFGI-PDVTLRFGGGAAVTLRPENTF-SLLQEGTLCLVLVPVSESQPVSILGNIAQ 429
Query: 353 SDLFIGYDFDSQMVSFKPTDCTK 375
+ +GYD D++ V+F DC +
Sbjct: 430 QNFHVGYDLDARTVTFAAADCAR 452
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 191 bits (484), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 136/362 (37%), Positives = 178/362 (49%), Gaps = 15/362 (4%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S S +GEY ++ IG P Y ++DTGSD+ W+QC PC CY+QV PI++PASS
Sbjct: 149 VTSGTSQGSGEYFLRVGIGRPSKT-FYMVIDTGSDVNWLQCKPCDDCYQQVDPIFDPASS 207
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
SS+ L CQ+ QC LD +C + C Y Y D S T G ATE ++FGNS + D V
Sbjct: 208 SSFSRLGCQTPQCRNLDVFACRNDS-CLYQVSYGDGSYTVGDFATETVSFGNSGS-VDKV 265
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH+N G+F GL+GLG LSL SQI A+ FSYCLV + DS +S +
Sbjct: 266 AIGCGHDNEGLF-VGAAGLIGLGGGPLSLTSQI----KASSFSYCLV--NRDSVDSSTLE 318
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
F N ++ S SK D T+Y+V + G+SVG + P KG +
Sbjct: 319 F-NSAKPSDSVTAPIFKNSKVD-TFYYVGITGMSVGG--EKLAIPPSIFEVDGSGKGGII 374
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFD 311
+D G T L YN L + K P CY S + P + FD
Sbjct: 375 VDCGTAVTRLQTQAYNALRDTFVKLTKDLPSTSGFALFDTCYNLSSRTSVRVPTVAFLFD 434
Query: 312 GGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
GG +PL ++ IP G FC A P + I GN Q + YD + VSF
Sbjct: 435 GGKSLPLPPSNYLIPVDSAGTFCLAFAPTTASLSIIGNVQQQGTRVTYDLANSQVSFSSR 494
Query: 372 DC 373
C
Sbjct: 495 KC 496
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 190 bits (483), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 126/368 (34%), Positives = 182/368 (49%), Gaps = 28/368 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+ S S +GEY + IG PP +Y ++DTGSD+ WVQC PC +CY+Q P + P SS
Sbjct: 140 IVSGASQGSGEYFSRVGIGRPPS-PVYMVLDTGSDVSWVQCAPCAECYEQTDPXFEPTSS 198
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
+S+ LSC++EQC LD C + C Y Y D S T G TE +T G+++ N+
Sbjct: 199 ASFTSLSCETEQCKSLDVSECRNGT-CLYEVSYGDGSYTVGDFVTETVTLGSTS--LGNI 255
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGHNN G+F L + SQL A+ FSYCLV +DS TS +
Sbjct: 256 AIGCGHNNEGLFIGAAGLL-----GLGGGSLSFPSQLNASSFSYCLVDRDSDS--TSTLD 308
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---G 249
F S ++ V + + T++++ L G+SVG ++P +S +S+ G
Sbjct: 309 F--NSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGG-----AVLPIPETSFQMSEDGNG 361
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGI-API 305
+ +D+G T L YN L + +K T G L CY S + + P
Sbjct: 362 GIIVDSGTAVTRLQTTVYNVLRDAF---VKSTHDLQTARGVALFDTCYDLSSKSRVEVPT 418
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
++ HF G ++PL + IP EG FCFA P D + I GN Q +G+D + +
Sbjct: 419 VSFHFANGNELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSL 478
Query: 366 VSFKPTDC 373
V F P C
Sbjct: 479 VGFSPNKC 486
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 190 bits (483), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 138/376 (36%), Positives = 192/376 (51%), Gaps = 40/376 (10%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSC 80
GEY+M +IGTPPL + DTGSDL+W QC PC QC++Q P+YNPASS+++ L C
Sbjct: 112 GEYLMTLAIGTPPL-PYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPC 170
Query: 81 QS--EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNS---NNFFDNVVFG 135
S C + Y + T GV +E TFG+S V FG
Sbjct: 171 NSSLSMCAGALAGAAPPPGCACMYYQTYGTGWTAGVQGSETFTFGSSAADQARVPGVAFG 230
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
C + ++ +N GLVGLGR LSL +SQLGA +FSYCL PF D++ TS + G
Sbjct: 231 CSNASSSDWN-GSAGLVGLGRGSLSL----VSQLGAGRFSYCLTPFQ-DTNSTSTLLLGP 284
Query: 196 GSEVSGGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS---- 247
+ ++G GV ST V+ + TYY++ L GIS+G +K +P S GA S
Sbjct: 285 SAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLG-----AKALPI--SPGAFSLKPD 337
Query: 248 -KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLT-PYQD--PRLGSQLCYKTPSMA--- 300
G + ID+G T L Y ++ V++ + T P D G LC+ P+
Sbjct: 338 GTGGLIIDSGTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCFALPAPTSAP 397
Query: 301 -GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ-PIDGDVGIFGNFAQSDLFIG 358
+ P +T HFD GA + L S I GV+C AM+ DG + FGN+ Q ++ I
Sbjct: 398 PAVLPSMTLHFD-GADMVLPADSYMISG--SGVWCLAMRNQTDGAMSTFGNYQQQNMHIL 454
Query: 359 YDFDSQMVSFKPTDCT 374
YD + +SF P C+
Sbjct: 455 YDVREETLSFAPAKCS 470
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 122/363 (33%), Positives = 172/363 (47%), Gaps = 21/363 (5%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
S S +GEY + IG+PP +Y +VDTGSD+ WVQC PC CY+Q PI+ P+ SSS
Sbjct: 146 SGASQGSGEYFSRVGIGSPPK-HVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSS 204
Query: 75 YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
Y L+C++ QC LD C + C Y Y D S T G ATE IT S + +NV
Sbjct: 205 YAPLTCETHQCKSLDVSECRNDS-CLYEVSYGDGSYTVGDFATETITLDGSAS-LNNVAI 262
Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
GCGH+N G+F L + SQ+ A+ FSYCLV TDS+ T +
Sbjct: 263 GCGHDNEGLFVGAAGLL-----GLGGGSLSFPSQINASSFSYCLVNRDTDSASTLEF--- 314
Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNM 251
S + V + L + + T+Y++ + GI VG +++ SS + + G +
Sbjct: 315 -NSPIPSHSVTAPLLRNNQLDTFYYLGMTGIGVGG-----QMLSIPRSSFEVDESGNGGI 368
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHF 310
+D+G T L D YN L + + P CY S + + P ++ HF
Sbjct: 369 IVDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFHF 428
Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
G + L + IP G FCFA P + I GN Q + YD + +V F P
Sbjct: 429 PDGKYLALPAKNYLIPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYDLSNSLVGFSP 488
Query: 371 TDC 373
C
Sbjct: 489 NGC 491
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 135/376 (35%), Positives = 196/376 (52%), Gaps = 36/376 (9%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
GEY+M +IGTPPL I DTGSDL+W QC PC QC++Q P+YNP+SS+++ L C
Sbjct: 90 GEYLMALAIGTPPL-PYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPC 148
Query: 81 QSEQ--CHLLDTVSCSSQQ---LCNY--TYGYADSSLTKGVLATERITFGNS---NNFFD 130
S C + ++ C Y TYG +S+ +G +E TFG++ +
Sbjct: 149 NSSLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSVFQG---SETFTFGSTPAGHARVP 205
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
+ FGC ++G + GLVGLGR RLSL +SQLG KFSYCL P+ D++ TS
Sbjct: 206 GIAFGCSTASSGFNASSASGLVGLGRGRLSL----VSQLGVPKFSYCLTPYQ-DTNSTST 260
Query: 191 MYFGNGSEVSG-GGVVSTSLVSKED----KTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
+ G + ++G GV ST V+ T+Y++ L GIS+G + S + P S A
Sbjct: 261 LLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALS--IPPDAFSLNA 318
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD--PRLGSQLCYKTPSMAGIA 303
G + ID+G TLL Y ++ V + + L P D G LC+ PS
Sbjct: 319 DGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTL-PTTDGSADTGLDLCFMLPSSTSAP 377
Query: 304 PI---LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ-PIDGDVGIFGNFAQSDLFIGY 359
P +T HF+G V + +++ G++C AMQ DG+V I GN+ Q ++ I Y
Sbjct: 378 PAMPSMTLHFNGADMV--LPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILY 435
Query: 360 DFDSQMVSFKPTDCTK 375
D + +SF P C+
Sbjct: 436 DIGQETLSFAPAKCSA 451
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 132/379 (34%), Positives = 192/379 (50%), Gaps = 34/379 (8%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSS 74
+V + Y++ F+IGTPPL + ++DTGSDL+W QC PC +C+ Q P+Y PA S +
Sbjct: 92 SVHASTATYLVDFAIGTPPLA-LSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVT 150
Query: 75 YKELSCQSEQCHLLDTVSCSSQQL------------CNYTYGYADSSLTKGVLATERITF 122
Y +SC S C L ++ SS+ C Y Y Y D S T GVLATE TF
Sbjct: 151 YANVSCGSRLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTF 210
Query: 123 GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
G D + FGCG +N G +N GLVG+GR LSL +SQLG KFSYC PF+
Sbjct: 211 GAGTTVHD-LAFGCGTDNLG-GTDNSSGLVGMGRGPLSL----VSQLGVTKFSYCFTPFN 264
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLV----SKEDKTYYFVTLEGISVGNLSNSSKLIP 238
D++ +S ++ G+ + +S ST V +YY+++LEGI+VG+ + P
Sbjct: 265 -DTTTSSPLFLGSSASLS-PAAKSTPFVPSPSGPRRSSYYYLSLEGITVGD--TLLPIDP 320
Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS 298
A +G + ID+G T L + + L V + L LG +C+ P
Sbjct: 321 AVFRLTASGRGGLIIDSGTTFTALEERAFVVLARAVAARVALPLASGAHLGLSVCFAAPQ 380
Query: 299 MAGI----APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
G P L HFD GA + L +S + V GV C + G + + G+ Q +
Sbjct: 381 GRGPEAVDVPRLVLHFD-GADMELPRSSAVVEDRVAGVACLGIVSARG-MSVLGSMQQQN 438
Query: 355 LFIGYDFDSQMVSFKPTDC 373
+ + YD ++SF+P +C
Sbjct: 439 MHVRYDVGRDVLSFEPANC 457
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 188 bits (477), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 135/376 (35%), Positives = 196/376 (52%), Gaps = 36/376 (9%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
GEY+M +IGTPPL I DTGSDL+W QC PC QC++Q P+YNP+SS+++ L C
Sbjct: 30 GEYLMALAIGTPPL-PYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPC 88
Query: 81 QSEQ--CHLLDTVSCSSQQ---LCNY--TYGYADSSLTKGVLATERITFGNS---NNFFD 130
S C + ++ C Y TYG +S+ +G +E TFG++ +
Sbjct: 89 NSSLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSVFQG---SETFTFGSTPAGHARVP 145
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
+ FGC ++G + GLVGLGR RLSL +SQLG KFSYCL P+ D++ TS
Sbjct: 146 GIAFGCSTASSGFNASSASGLVGLGRGRLSL----VSQLGVPKFSYCLTPYQ-DTNSTST 200
Query: 191 MYFGNGSEVSG-GGVVSTSLVSKED----KTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
+ G + ++G GV ST V+ T+Y++ L GIS+G + S + P S A
Sbjct: 201 LLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALS--IPPDAFSLNA 258
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD--PRLGSQLCYKTPSMAGIA 303
G + ID+G TLL Y ++ V + + L P D G LC+ PS
Sbjct: 259 DGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTL-PTTDGSADTGLDLCFMLPSSTSAP 317
Query: 304 PI---LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ-PIDGDVGIFGNFAQSDLFIGY 359
P +T HF+G V + +++ G++C AMQ DG+V I GN+ Q ++ I Y
Sbjct: 318 PAMPSMTLHFNGADMV--LPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILY 375
Query: 360 DFDSQMVSFKPTDCTK 375
D + +SF P C+
Sbjct: 376 DIGQETLSFAPAKCSA 391
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 187 bits (475), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 136/380 (35%), Positives = 197/380 (51%), Gaps = 44/380 (11%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
GEY+M +IGTPPL I DTGSDL+W QC PC QC++Q P+YNP+SS+++ L C
Sbjct: 88 GEYLMALAIGTPPL-PYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPC 146
Query: 81 QSEQ--CHLLDTVSCSSQQ---LCNY--TYGYADSSLTKGVLATERITFGNS---NNFFD 130
S C + ++ C Y TYG +S+ +G +E TFG++ +
Sbjct: 147 NSSLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSVFQG---SETFTFGSTPAGQSRVP 203
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
+ FGC ++G + GLVGLGR RLSL +SQLG KFSYCL P+ D++ TS
Sbjct: 204 GIAFGCSTASSGFNASSASGLVGLGRGRLSL----VSQLGVPKFSYCLTPYQ-DTNSTST 258
Query: 191 MYFGNGSEVSG-GGVVSTSLVSKED----KTYYFVTLEGISVGNLSNSSKLIP----YYN 241
+ G + ++G GV ST V+ T+Y++ L GIS+G + S IP N
Sbjct: 259 LLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALS---IPPDAFLLN 315
Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD--PRLGSQLCYKTPSM 299
+ G G + ID+G TLL Y ++ V + + L P D G LC+ PS
Sbjct: 316 ADG---TGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTL-PTTDGSAATGLDLCFMLPSS 371
Query: 300 AGIAPI---LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ-PIDGDVGIFGNFAQSDL 355
P +T HF+G V + +++ G++C AMQ DG+V I GN+ Q ++
Sbjct: 372 TSAPPAMPSMTLHFNGADMV--LPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNM 429
Query: 356 FIGYDFDSQMVSFKPTDCTK 375
I YD + +SF P C+
Sbjct: 430 HILYDIGQETLSFAPAKCSA 449
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 135/368 (36%), Positives = 191/368 (51%), Gaps = 21/368 (5%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
+ + ++ V++ NGEY++ S G PP IVDTGSDL WVQCLPC CY+ + ++P
Sbjct: 76 DQLFETPVASGNGEYLIDISYGNPPQKST-AIVDTGSDLNWVQCLPCKSCYETLSAKFDP 134
Query: 70 ASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
+ S+SYK L C S C L SC++ C Y Y Y D S T G L+T+ +T G
Sbjct: 135 SKSASYKTLGCGSNFCQDLPFQSCAAS--CQYDYMYGDGSSTSGALSTDDVTIGTGK--I 190
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
NV FGCG++N G F LVGLG+ LSL SQ L KFSYCLVP S+ TS
Sbjct: 191 PNVAFGCGNSNLGTFAGAGG-LVGLGKGPLSLVSQ-LGGTATKKFSYCLVPL--GSTKTS 246
Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS---GAI 246
+Y G+ S ++GG + L + T+Y+ L+GISV K + Y ++ A
Sbjct: 247 PLYIGD-STLAGGVAYTPMLTNNNYPTFYYAELQGISV-----EGKAVNYPANTFDIAAT 300
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG-IAPI 305
+G + +D+G T L D +N + ++ A+ G + C+ T +A P
Sbjct: 301 GRGGLILDSGTTLTYLDVDAFNPMVAALKAALPYPEADGSFYGLEYCFSTAGVANPTYPT 360
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
+ HF+ GA V L +TFI EG C AM G IFGN Q + I +D ++
Sbjct: 361 VVFHFN-GADVALAPDNTFIALDFEGTTCLAMASSTG-FSIFGNIQQLNHVIVHDLVNKR 418
Query: 366 VSFKPTDC 373
+ FK +C
Sbjct: 419 IGFKSANC 426
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 120/375 (32%), Positives = 192/375 (51%), Gaps = 25/375 (6%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S + +GEY+++ S+G+PP + Y +VD+GSD+MWVQC PC++CY Q P+++PA+S
Sbjct: 160 VVSGLDEGSGEYLVRVSVGSPPT-EQYLVVDSGSDVMWVQCKPCLECYVQADPLFDPATS 218
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQL--CNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
+++ +SC S C +L T +C +L C Y YAD S TKG LA E +T G + +
Sbjct: 219 ATFSGVSCGSAICRILPTSACGDGELGGCEYEVSYADGSYTKGALALETLTLGGTA--VE 276
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
VV GCGH N G+F GL+GLG +SL Q+ ++G FSYCL S +
Sbjct: 277 GVVIGCGHRNRGLF-VGAAGLMGLGWGPMSLVGQLGGEVG-GAFSYCLASRGGYGSGAAD 334
Query: 191 -----MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
+ G V G V + + ++Y+V L GI VG+ + +P
Sbjct: 335 DDAGWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGD-----ERLPLQAGLFQ 389
Query: 246 ISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSM 299
+++ G++ +DTG T LP++ Y L + A+ + + S + CY
Sbjct: 390 LTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCYDLSGY 449
Query: 300 AGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIG 358
A + P ++ FDG A++ L + + + G++C A P + I GN Q+ + I
Sbjct: 450 ASVRVPTVSFCFDGDARLILAARNVLLEVDM-GIYCLAFAPSSSGLSIMGNTQQAGIQIT 508
Query: 359 YDFDSQMVSFKPTDC 373
D + + F P +C
Sbjct: 509 VDSANGYIGFGPANC 523
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 129/366 (35%), Positives = 178/366 (48%), Gaps = 19/366 (5%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S V +GEY + IG+P +Y ++DTGSD+ WVQC PC CY+Q P+++P+ S
Sbjct: 155 VVSGVGQGSGEYFSRVGIGSP-ARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLS 213
Query: 73 SSYKELSCQSEQCHLLDTVSC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
+SY +SC S++C LDT +C ++ C Y Y D S T G ATE +T G+S N
Sbjct: 214 ASYAAVSCDSQRCRDLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDSTP-VGN 272
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
V GCGH+N G+F L+ LG LS SQI A+ FSYCLV DS S +
Sbjct: 273 VAIGCGHDNEGLFVGAAG-LLALGGGPLSFPSQI----SASTFSYCLV--DRDSPAASTL 325
Query: 192 YFGNGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISK 248
FG+G+ + G V+ LV S T+Y+V L GISVG LS + +SG+
Sbjct: 326 QFGDGA--AEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGS--- 380
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILT 307
G + +D+G T L Y L + P CY + P ++
Sbjct: 381 GGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSLFDTCYDLSDRTSVEVPAVS 440
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
F+GG + L + IP G +C A P + V I GN Q + +D V
Sbjct: 441 LRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTARGAVG 500
Query: 368 FKPTDC 373
F P C
Sbjct: 501 FTPNKC 506
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 125/376 (33%), Positives = 181/376 (48%), Gaps = 43/376 (11%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY+++ ++GTP + +DTGSDL+W QC PC C+ Q P+ +PA+SS+Y L C +
Sbjct: 83 EYLVRLAVGTP-RRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAASSTYAALPCGA 141
Query: 83 EQCHLLDTVSCSSQQL-----CNYTYGYADSSLTKGVLATERITFGNSNN-----FFDNV 132
+C L SC + L C Y Y Y D SLT G +AT+R TFG+S +
Sbjct: 142 ARCRALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTRRL 201
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP-FHTDSSIT--- 188
FGCGH N GVF NE G+ G GR R SL SQL FSYC F + SS+
Sbjct: 202 TFGCGHLNKGVFQSNETGIAGFGRGRWSLP----SQLNVTSFSYCFTSMFESKSSLVTLG 257
Query: 189 ---SKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKL-IPYYNSS 243
+ +Y S G V +T ++ + + YF++L+GISVG ++L +P
Sbjct: 258 GSPAALY----SHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGK----TRLPVPETKFR 309
Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA--- 300
I ID+GA T LP++ Y ++ + + L P LC+ P A
Sbjct: 310 STI------IDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDLCFALPVTALWR 363
Query: 301 -GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
P LT H + GA L ++ V C + G+ + GNF Q + + Y
Sbjct: 364 RPAVPSLTLHLE-GADWELPRSNYVFEDLGARVMCIVLDAAPGEQTVIGNFQQQNTHVVY 422
Query: 360 DFDSQMVSFKPTDCTK 375
D ++ +SF P C +
Sbjct: 423 DLENDRLSFAPARCDR 438
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 121/361 (33%), Positives = 187/361 (51%), Gaps = 36/361 (9%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
Y+M+ +GTPP +I +DTGSDL+W QC+PC CY Q PI++P+ SS++KE C
Sbjct: 61 YLMRLQLGTPPF-EIVAEIDTGSDLIWTQCMPCPNCYTQFAPIFDPSKSSTFKEKRCHGN 119
Query: 84 QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGHNN 140
C Y YAD S + G+LATE +T +++ GCG NN
Sbjct: 120 SCP--------------YEIIYADESYSTGILATETVTIQSTSGEPFVMAETSIGCGLNN 165
Query: 141 TGV----FNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
+ + + + G+VGL SL SQ+ + SYC S TSK+ FG
Sbjct: 166 SNLMTPGYAASSSGIVGLNMGPSSLISQMDLPI-PGLISYCF-----SSQGTSKINFGTN 219
Query: 197 SEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
+ V+G G V+ + K+D+ +Y++ L+ +SVG+ + P++ GN+FID+G
Sbjct: 220 AVVAGDGTVAADMFIKKDQPFYYLNLDAVSVGDKRIETLGTPFH-----AQDGNIFIDSG 274
Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQ-DPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
T LP + N + E V ++ DP + LCY +M I P++T HF GGA
Sbjct: 275 TTYTYLPTSYCNLVREAVAASVVAANQVPDPSSENLLCYNWDTME-IFPVITLHFAGGAD 333
Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDV-GIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ L + ++ G FC A+ +D + IFGN A ++L +GYD + ++SF PT+C+
Sbjct: 334 LVLDKYNMYVETITGGTFCLAIGCVDPSMPAIFGNRAHNNLLVGYDSSTLVISFSPTNCS 393
Query: 375 K 375
Sbjct: 394 A 394
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 128/374 (34%), Positives = 184/374 (49%), Gaps = 37/374 (9%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ +IGTPP + ++DTGSDL+W QC PC C Q P++ PA+SSSY + C
Sbjct: 102 EYLIDLAIGTPPQ-PVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPAASSSYVPMRCSG 160
Query: 83 EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV--FGCGHNN 140
+ C+ + SC C Y Y Y D + T GV ATER TF +S+ +V FGCG N
Sbjct: 161 QLCNDILHHSCQRPDTCTYRYNYGDGTTTLGVYATERFTFASSSGEKLSVPLGFGCGTMN 220
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN----- 195
G N N G+VG GR LSL +SQL +FSYCL P+ S+ S + FG+
Sbjct: 221 VGSLN-NGSGIVGFGRDPLSL----VSQLSIRRFSYCLTPY--TSTRKSTLMFGSLSDGV 273
Query: 196 --GSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
G + + G V +T L+ S+++ T+Y+V G++VG L + G +
Sbjct: 274 FEGDDAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDG--SGGVI 331
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKL--TPYQDPRLGSQLCYKTPSMAGI-------- 302
+D+G TL P + R ++L T P G +C+ TP AG
Sbjct: 332 VDSGTALTLFPAAVLTEVLRAFRAQLRLPFTSSSSPDDG--VCFATPMAAGGRRASAATV 389
Query: 303 --APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG-IFGNFAQSDLFIGY 359
P + HF GA + L + + P G C + GD G GNF Q D+ + Y
Sbjct: 390 VSVPRMAFHFQ-GADLELPRRNYVLDDPRRGSLCILLAD-SGDSGATIGNFVQQDMRVLY 447
Query: 360 DFDSQMVSFKPTDC 373
D +++ +SF P C
Sbjct: 448 DLEAETLSFAPAQC 461
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 124/379 (32%), Positives = 178/379 (46%), Gaps = 43/379 (11%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ ++GTPP + +DTGSDL+W QC PC C+ Q P+ +PA+SS+Y L C +
Sbjct: 91 EYLVHLAVGTPPR-PVALTLDTGSDLVWTQCAPCRDCFHQGLPLLDPAASSTYAALPCGA 149
Query: 83 EQCHLLDTVSC---------SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN-- 131
+C L SC + + C Y Y Y D S+T G +AT+R TFG N D+
Sbjct: 150 PRCRALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGDSRL 209
Query: 132 ----VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP-FHTDSS 186
+ FGCGH N GVF NE G+ G GR R SL SQL FSYC F + SS
Sbjct: 210 PTRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLP----SQLNVTTFSYCFTSMFESKSS 265
Query: 187 I-------TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPY 239
+ + + + + + +SG + L + + YF++L+GISVG + +P
Sbjct: 266 LVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVG---KTRLAVPE 322
Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPS 298
I ID+GA T LP+ Y ++ + + L P + LC+ P
Sbjct: 323 AKLRSTI------IDSGASITTLPEAVYEAVKAEFAAQVGLPPTGVVEGSALDLCFALPV 376
Query: 299 MAGI----APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
A P LT H D GA L + V C + GD + GNF Q +
Sbjct: 377 TALWRRPPVPSLTLHLD-GADWELPRGNYVFEDLAARVMCVVLDAAPGDQTVIGNFQQQN 435
Query: 355 LFIGYDFDSQMVSFKPTDC 373
+ YD ++ +SF P C
Sbjct: 436 THVVYDLENDWLSFAPARC 454
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 184 bits (468), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 124/363 (34%), Positives = 189/363 (52%), Gaps = 23/363 (6%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY+M+ +IG PP+ + DTGSDL W QC PC C+ Q P+Y+P++SS++ L C S
Sbjct: 70 EYLMELAIGKPPV-PFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPLPCSS 128
Query: 83 EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN--FFDNVVFGCGHNN 140
C + + +C+ LC Y Y Y D + + G+L TE +T G S+ V FGCG +N
Sbjct: 129 ATCLPIWSRNCTPSSLCRYRYAYGDGAYSAGILGTETLTLGPSSAPVSVGGVAFGCGTDN 188
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
G + N G VGLGR LSL L+QLG KFSYCL F +S++ S G +E++
Sbjct: 189 GGD-SLNSTGTVGLGRGTLSL----LAQLGVGKFSYCLTDFF-NSALDSPFLLGTLAELA 242
Query: 201 GG--GVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS---KGNMFID 254
G V ST L+ S ++ + YFV+L+GIS+G++ +P N + + G M +D
Sbjct: 243 PGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVR-----LPIPNGTFDLRGDGTGGMIVD 297
Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS-MAGIAPILTAHFDGG 313
+G T+L + + + +V + P L + C+ P+ P L HF GG
Sbjct: 298 SGTTFTILAESGFREVVGRVARVLGQPPVNASSLDAP-CFPAPAGEPPYMPDLVLHFAGG 356
Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
A + L + + FC + + + GNF Q ++ + +D +SF PTD
Sbjct: 357 ADMRLYRDNYMSYNEEDSSFCLNIAGTTPESTSVLGNFQQQNIQMLFDTTVGQLSFLPTD 416
Query: 373 CTK 375
C+K
Sbjct: 417 CSK 419
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 184 bits (467), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 127/368 (34%), Positives = 193/368 (52%), Gaps = 28/368 (7%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY+M+ +IGTPP+ + DTGSDL W QC PC C+ Q P+Y+P++SS++ + C S
Sbjct: 76 EYLMELAIGTPPV-PFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSS 134
Query: 83 EQC-HLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNS----NNFFDNVVFGC 136
C +L + +CS+ LC Y Y Y+D + + G+L TE +T G+S +V FGC
Sbjct: 135 ATCLPVLRSRNCSTPSSLCRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVSVSDVAFGC 194
Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
G +N G + N G VGLGR LSL L+QLG KFSYCL F +S++ S G
Sbjct: 195 GTDNGGD-SLNSTGTVGLGRGTLSL----LAQLGVGKFSYCLTDFF-NSTLDSPFLLGTL 248
Query: 197 SEVS--GGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS---GAISKGN 250
+E++ G V ST L+ S + + Y V+L+GI++G++ +P N + A S G
Sbjct: 249 AELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVR-----LPIPNKTFDLHANSTGG 303
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS---MAGIAPILT 307
M +D+G ++LP+ + + + V + P L S C+ P+ P L
Sbjct: 304 MVVDSGTTFSILPESGFRVVVDHVAQVLGQPPVNASSLDSP-CFPAPAGERQLPFMPDLV 362
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
HF GGA + L + + FC + + GNF Q ++ + +D +S
Sbjct: 363 LHFAGGADMRLHRDNYMSYNQEDSSFCLNIVGTTSTWSMLGNFQQQNIQMLFDMTVGQLS 422
Query: 368 FKPTDCTK 375
F PTDC+K
Sbjct: 423 FLPTDCSK 430
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 184 bits (467), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 142/393 (36%), Positives = 193/393 (49%), Gaps = 48/393 (12%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQ--VKPIYNPA 70
VQ+ + G Y M S+GTPPL D IVDTGS+L+W QC PC +C+ + P+ PA
Sbjct: 80 VQAQLENGAGAYNMNISLGTPPL-DFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPA 138
Query: 71 SSSSYKELSCQSEQCHLLDTVS----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
SS++ L C C L T S C++ C Y Y Y S T G LATE +T G+
Sbjct: 139 RSSTFSRLPCNGSFCQYLPTSSRPRTCNATAACAYNYTYG-SGYTAGYLATETLTVGDGT 197
Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
F V FGC N GV +N G+VGLGR LSL +SQL +FSYCL D
Sbjct: 198 --FPKVAFGCSTEN-GV--DNSSGIVGLGRGPLSL----VSQLAVGRFSYCLRSDMADGG 248
Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSK----EDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
S + FG+ ++++ G VV ++ + K + T+Y+V L GI+V S +P S
Sbjct: 249 -ASPILFGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAV-----DSTELPVTGS 302
Query: 243 SGAISK----GNMFIDTGAPPTLLPKDFYNRLEE----QVRNAIKLTPYQDPRLGSQLCY 294
+ ++ G +D+G T L KD Y +++ Q+ N + TP LCY
Sbjct: 303 TFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCY 362
Query: 295 KTPSMAG-----IAPILTAHFDGGAK--VPLIHTSTFIPPPVEG---VFCFAMQPIDGD- 343
K PS G P L F GGAK VP+ + + +G V C + P D
Sbjct: 363 K-PSAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDL 421
Query: 344 -VGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
+ I GN Q D+ + YD D M SF P DC K
Sbjct: 422 PISIIGNLMQMDMHLLYDIDGGMFSFAPADCAK 454
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 133/370 (35%), Positives = 183/370 (49%), Gaps = 27/370 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S V +GEY + IG+P ++Y ++DTGSD+ WVQC PC CY+Q P+++P+ S
Sbjct: 158 VVSGVGQGSGEYFSRVGIGSP-ARELYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLS 216
Query: 73 SSYKELSCQSEQCHLLDTVSC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
+SY +SC S +C LDT +C ++ C Y Y D S T G ATE +T G+S N
Sbjct: 217 ASYAAVSCDSPRCRDLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDSTP-VTN 275
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
V GCGH+N G+F L+ LG LS SQI A+ FSYCLV DS S +
Sbjct: 276 VAIGCGHDNEGLFVGAAG-LLALGGGPLSFPSQI----SASTFSYCLV--DRDSPAASTL 328
Query: 192 YFG-NGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAIS 247
FG +G+E V+ LV S T+Y+V L GISVG LS S +SG+
Sbjct: 329 QFGADGAEAD---TVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSGS-- 383
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGI-A 303
G + +D+G T L Y L + ++ TP G L CY +
Sbjct: 384 -GGVIVDSGTAVTRLQSSAYAALRDAF---VRGTPSLPRTSGVSLFDTCYDLSDRTSVEV 439
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
P ++ F+GG + L + IP G +C A P + V I GN Q + +D
Sbjct: 440 PAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAK 499
Query: 364 QMVSFKPTDC 373
+V F P C
Sbjct: 500 GVVGFTPNKC 509
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 119/378 (31%), Positives = 174/378 (46%), Gaps = 17/378 (4%)
Query: 3 PATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQ 62
P +F + V S + +GEY ++ IG+PP + Y +VD+GSD++WVQC PC++CY Q
Sbjct: 104 PTDFFGSESKVVSGLDEGSGEYFVRVGIGSPPT-EQYLVVDSGSDVIWVQCKPCLECYAQ 162
Query: 63 VKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF 122
P+++PASS+++ +SC S C L T C C Y Y D S TKG LA E +T
Sbjct: 163 ADPLFDPASSATFSAVSCGSAICRTLRTSGCGDSGGCEYEVSYGDGSYTKGTLALETLTL 222
Query: 123 GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
G + + V GCGH N G+F GL+GLG +SL Q L FSYCL
Sbjct: 223 GGTA--VEGVAIGCGHRNRGLF-VGAAGLLGLGWGPMSLVGQ-LGGAAGGAFSYCLASRG 278
Query: 183 TDSS----ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKL 236
S + G V G V + + + ++Y+V + GI VG+ L L
Sbjct: 279 GSGSGAADAAGSLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGL 338
Query: 237 IPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT 296
G G + +DTG T LP++ Y L + A+ P CY
Sbjct: 339 FQLTEDGG----GGVVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSLLDTCYDL 394
Query: 297 PSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDL 355
+ P ++ +FDG A + L + + G++C A P + I GN Q +
Sbjct: 395 SGYTSVRVPTVSFYFDGAATLTLPARNLLLEVD-GGIYCLAFAPSSSGLSILGNIQQEGI 453
Query: 356 FIGYDFDSQMVSFKPTDC 373
I D + + F P C
Sbjct: 454 QITVDSANGYIGFGPATC 471
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 131/366 (35%), Positives = 185/366 (50%), Gaps = 23/366 (6%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S S +GEY + +G P Y ++DTGSD+ W+QC PC CY+Q PI+ PA+S
Sbjct: 148 VSSGTSQGSGEYFTRVGVGNPAK-SYYMVLDTGSDINWIQCQPCSDCYQQSDPIFTPAAS 206
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
SSY L+C S+QC+ L SC + Q C Y Y D S T G TE ++FG S +++
Sbjct: 207 SSYSPLTCDSQQCNSLQMSSCRNGQ-CRYQVNYGDGSFTFGDFVTETMSFGGSGT-VNSI 264
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH+N G+F GL+GLG LSL SQL A FSYCLV + DS+ +S +
Sbjct: 265 ALGCGHDNEGLF-VGAAGLLGLGGGPLSLT----SQLKATSFSYCLV--NRDSAASSTLD 317
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGNM 251
F N + V G V++ L S + T+Y+V L G+SV G L + + + SG G +
Sbjct: 318 F-NSAPV-GDSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSG---DGGV 372
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGI-APILT 307
+D G T L + YN L + + ++ + G L CY + + P ++
Sbjct: 373 IVDCGTAITRLQSEAYNSLRDSF---VSMSRHLRSTSGVALFDTCYDLSGQSSVKVPTVS 429
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
HFDGG L + IP G +CFA P + I GN Q + +D + V
Sbjct: 430 FHFDGGKSWDLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVSFDLANNRVG 489
Query: 368 FKPTDC 373
F C
Sbjct: 490 FSTNKC 495
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 128/370 (34%), Positives = 191/370 (51%), Gaps = 30/370 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S S +GEY + +G P Y ++DTGSD+ W+QC PC CY+Q PI++P +S
Sbjct: 150 VTSGTSQGSGEYFTRVGVGNPAR-QFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTAS 208
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+Y ++CQS+QC L+ SC S Q C Y Y D S T G ATE ++FGNS + NV
Sbjct: 209 STYAPVTCQSQQCSSLEMSSCRSGQ-CLYQVNYGDGSYTFGDFATESVSFGNSGS-VKNV 266
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH+N G+F GL+GLG LSL +QL A FSYCLV + DS+ +S +
Sbjct: 267 ALGCGHDNEGLF-VGAAGLLGLGGGPLSLT----NQLKATSFSYCLV--NRDSAGSSTLD 319
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---G 249
F N +++ V + + +++ T+Y+V L G+SVG +++ S+ + + G
Sbjct: 320 F-NSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGG-----QMVSIPESTFRLDESGNG 373
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL-----CYKTPSMAGI-A 303
+ +D G T L YN L + +++T Q+ +L S + CY A +
Sbjct: 374 GIIVDCGTAITRLQTQAYNPLRDAF---VRMT--QNLKLTSAVALFDTCYDLSGQASVRV 428
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
P ++ HF G L + IP G +CFA P + I GN Q + +D +
Sbjct: 429 PTVSFHFADGKSWNLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLAN 488
Query: 364 QMVSFKPTDC 373
+ F P C
Sbjct: 489 NRMGFSPNKC 498
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 124/366 (33%), Positives = 181/366 (49%), Gaps = 17/366 (4%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+ S ++ +GEY + +GTPP Y ++DTGSD+MW+QCLPC +CY Q P++NPA+S
Sbjct: 142 IISGLAQGSGEYFTRLGVGTPPRY-TYMVLDTGSDIMWIQCLPCAKCYGQTDPLFNPAAS 200
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+Y+++ C + C LD C +++ C Y Y D S T G +TE +TF V
Sbjct: 201 STYRKVPCATPLCKKLDISGCRNKRYCEYQVSYGDGSFTVGDFSTETLTF--RGQVIRRV 258
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH+N G+F L+GLGR LS SQ +Q + +FSYCLV + S S +
Sbjct: 259 ALGCGHDNEGLFIGAAG-LLGLGRGSLSFPSQTGAQF-SKRFSYCLVD-RSASGTASSLI 315
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
FG + + + + L + + T+Y+V L GISVG +S + A G +
Sbjct: 316 FGKAA-IPKSAIFTPLLSNPKLDTFYYVELVGISVGGRRLTSIPASVFRMD-ATGNGGVI 373
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGI-APILT 307
ID+G T L Y+ + R+A ++ G CY + + P L
Sbjct: 374 IDSGTSVTRLVDSAYSTM----RDAFRVGTGNLKSAGGFSLFDTCYDLSGLKTVKVPTLV 429
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
HF GGA + L T+ IP FCFA G + I GN Q + +D + V
Sbjct: 430 FHFQGGAHISLPATNYLIPVDSSATFCFAFAGNTGGLSIIGNIQQQGYRVVFDSLANRVG 489
Query: 368 FKPTDC 373
FK C
Sbjct: 490 FKAGSC 495
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 128/370 (34%), Positives = 191/370 (51%), Gaps = 30/370 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S S +GEY + +G P Y ++DTGSD+ W+QC PC CY+Q PI++P +S
Sbjct: 9 VTSGTSQGSGEYFTRVGVGNPAR-QFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTAS 67
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+Y ++CQS+QC L+ SC S Q C Y Y D S T G ATE ++FGNS + NV
Sbjct: 68 STYAPVTCQSQQCSSLEMSSCRSGQ-CLYQVNYGDGSYTFGDFATESVSFGNSGS-VKNV 125
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH+N G+F GL+GLG LSL +QL A FSYCLV + DS+ +S +
Sbjct: 126 ALGCGHDNEGLF-VGAAGLLGLGGGPLSLT----NQLKATSFSYCLV--NRDSAGSSTLD 178
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---G 249
F N +++ V + + +++ T+Y+V L G+SVG +++ S+ + + G
Sbjct: 179 F-NSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGG-----QMVSIPESTFRLDESGNG 232
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL-----CYKTPSMAGI-A 303
+ +D G T L YN L + +++T Q+ +L S + CY A +
Sbjct: 233 GIIVDCGTAITRLQTQAYNPLRDAF---VRMT--QNLKLTSAVALFDTCYDLSGQASVRV 287
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
P ++ HF G L + IP G +CFA P + I GN Q + +D +
Sbjct: 288 PTVSFHFADGKSWNLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLAN 347
Query: 364 QMVSFKPTDC 373
+ F P C
Sbjct: 348 NRMGFSPNKC 357
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 134/377 (35%), Positives = 193/377 (51%), Gaps = 38/377 (10%)
Query: 18 STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYK 76
+T GE++M +IGTPPL I DTGSDL+W QC PC QC++Q P+YNP+SS+++
Sbjct: 79 TTVPGEFLMTLAIGTPPL-PFLAIADTGSDLIWTQCAPCSRQCFQQPTPLYNPSSSTTFS 137
Query: 77 ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF----FDNV 132
L C S +C + N TYG + + +G TE TFG+S +
Sbjct: 138 ALPCNSSLGLCAPACAC----MYNMTYGSGWTYVFQG---TETFTFGSSTPADQVRVPGI 190
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGC + ++G + GLVGLGR LSL +SQLGA KFSYCL P+ D++ TS +
Sbjct: 191 AFGCSNASSGFNASSASGLVGLGRGSLSL----VSQLGAPKFSYCLTPYQ-DTNSTSTLL 245
Query: 193 FGNGSEVSGGGVV-STSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
G + ++ GVV ST V+ YY++ L GIS+G + + + P S A G +
Sbjct: 246 LGPSASLNDTGVVSSTPFVASPSSIYYYLNLTGISLG--TTALPIPPNAFSLKADGTGGL 303
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD--PRLGSQLCYKTPSMAGI---APIL 306
ID+G T+L Y ++ V + + L P D G LC++ PS P +
Sbjct: 304 IIDSGTTITMLGNTAYQQVRAAVLSLVTL-PTTDGSAATGLDLCFELPSSTSAPPSMPSM 362
Query: 307 TAHFDGGAKVPLIHTSTFI-----PPPVEGVFCFAMQ-PIDGD---VGIFGNFAQSDLFI 357
T HFDG V + ++ P ++C AMQ D D V I GN+ Q ++ I
Sbjct: 363 TLHFDGADMV--LPADNYMMSLSDPDSDSSLWCLAMQNQTDTDGVVVSILGNYQQQNMHI 420
Query: 358 GYDFDSQMVSFKPTDCT 374
YD + +SF P C+
Sbjct: 421 LYDVGKETLSFAPAKCS 437
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 131/378 (34%), Positives = 192/378 (50%), Gaps = 39/378 (10%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
++ +GEY K +GTP + ++DTGSD++W+QC PC +CY+Q +++P S SY
Sbjct: 133 LAQGSGEYFTKIGVGTPATPALM-VLDTGSDVVWLQCAPCRRCYEQSGQVFDPRRSRSYN 191
Query: 77 ELSCQSEQCHLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
+ C + C LD+ C + C Y Y D S+T G ATE +TF V G
Sbjct: 192 AVGCAAPLCRRLDSGGCDLRRSACLYQVAYGDGSVTAGDFATETLTFAGGAR-VARVALG 250
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK---MY 192
CGH+N G+F L+GLGR LS +QI + G FSYCLV + ++ S+ +
Sbjct: 251 CGHDNEGLFVAAAG-LLGLGRGSLSFPTQISRRYG-RSFSYCLVDRTSSANTASRSSTVT 308
Query: 193 FGNGSEVSGGGVVSTSLV----SKEDKTYYFVTLEGISVG-----NLSNSS-KLIPYYNS 242
FG+G+ G V++S + +T+Y+V L GISVG ++NS +L P S
Sbjct: 309 FGSGAV---GSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRLDP---S 362
Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQL---CYKT 296
SG +G + +D+G T L + Y+ L + R A ++L+P G L CY
Sbjct: 363 SG---RGGVIVDSGTSVTRLARPAYSALRDAFRGAAAGLRLSPG-----GFSLFDTCYDL 414
Query: 297 PSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDL 355
+ P ++ HF GGA+ L + IP +G FCFA DG V I GN Q
Sbjct: 415 SGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGF 474
Query: 356 FIGYDFDSQMVSFKPTDC 373
+ +D D Q V+F P C
Sbjct: 475 RVVFDGDGQRVAFTPKGC 492
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 126/365 (34%), Positives = 189/365 (51%), Gaps = 25/365 (6%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY+M+ +IGTPP+ + DTGSDL W QC PC C+ Q P+Y+P++SS++ + C S
Sbjct: 65 EYLMELAIGTPPV-PFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSS 123
Query: 83 EQC-HLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNS----NNFFDNVVFGC 136
C + +CS+ C Y Y Y+D + + G+L TE +T G+S +V FGC
Sbjct: 124 ATCLPTWRSRNCSNPSSPCRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGSVAFGC 183
Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
G +N G + N G VGLGR LSL L+QLG KFSYCL F +S++ S + G
Sbjct: 184 GTDNGGD-SLNSTGTVGLGRGTLSL----LAQLGVGKFSYCLTDFF-NSTMDSPFFLGTL 237
Query: 197 SEVS-GGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS---GAISKGN 250
+E++ G G V ++ L S + + YFV L+GIS+G++ +P N + A G
Sbjct: 238 AELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVR-----LPIPNGTFDLRADGNGG 292
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHF 310
M +D+G T+L K + + ++V + P L S C+ +P P L HF
Sbjct: 293 MMVDSGTTFTILAKSGFREVVDRVAQLLGQPPVNASSLDSP-CFPSPDGEPFMPDLVLHF 351
Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
GGA + L + + FC + GNF Q ++ + +D +SF P
Sbjct: 352 AGGADMRLHRDNYMSYNEDDSSFCLNIVGSPSTWSRLGNFQQQNIQMLFDMTVGQLSFLP 411
Query: 371 TDCTK 375
TDC+K
Sbjct: 412 TDCSK 416
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 134/383 (34%), Positives = 193/383 (50%), Gaps = 39/383 (10%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSS 73
+ +S GEY+M +IGTPP+ I DTGSDL+W QC PC QC++Q P+YNP+SS+
Sbjct: 77 TQISPTAGEYLMTLAIGTPPV-SYQAIADTGSDLIWTQCAPCSSQCFQQPTPLYNPSSST 135
Query: 74 SYKELSCQSEQCHLLDTVSCSSQQ-----LCNYTYGYADSSLTKGVLATERITFGNSN-- 126
++ L C S ++ ++ + N TYG +S+ +G +E TFG+S
Sbjct: 136 TFAVLPCNSSLSMCAAALAGTTPPPGCTCMYNMTYGSGWTSVYQG---SETFTFGSSTPA 192
Query: 127 --NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
+ FGC + + G + GLVGLGR LSL +SQLG KFSYCL P+ D
Sbjct: 193 NQTGVPGIAFGCSNASGGFNTSSASGLVGLGRGSLSL----VSQLGVPKFSYCLTPYQ-D 247
Query: 185 SSITSKMYFGNGSEVSG-GGVVSTSLVSKED----KTYYFVTLEGISVGNLSNSSKLIPY 239
++ TS + G + ++ GGV ST V+ TYY++ L GIS+G + S IP
Sbjct: 248 TNSTSTLLLGPSASLNDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALS---IPT 304
Query: 240 YN-SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD---PRLGSQLCYK 295
S A G ID+G TLL Y ++ V + + L P D G LC++
Sbjct: 305 TALSLKADGTGGFIIDSGTTITLLGNTAYQQVRAAVVSLVTL-PTTDGGSAATGLDLCFE 363
Query: 296 TPSMAGIAPI---LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ-PIDGDVGIFGNFA 351
PS P +T HFDG V + + ++C AMQ DG V I GN+
Sbjct: 364 LPSSTSAPPTMPSMTLHFDGADMVLPADSYMMLD---SNLWCLAMQNQTDGGVSILGNYQ 420
Query: 352 QSDLFIGYDFDSQMVSFKPTDCT 374
Q ++ I YD + ++F P C+
Sbjct: 421 QQNMHILYDVGQETLTFAPAKCS 443
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 136/384 (35%), Positives = 190/384 (49%), Gaps = 35/384 (9%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+Q V NGE++M S+GTP L IVDTGSDL+W QC PCV+C+ Q P+++PA+S
Sbjct: 105 LQVPVHAGNGEFLMDLSVGTPAL-PYAAIVDTGSDLVWTQCKPCVECFNQTTPVFDPAAS 163
Query: 73 SSYKELSCQSEQCHLL-------DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNS 125
S+Y L C S C L + S S+ C YTY Y D+S T+GVLATE TF +
Sbjct: 164 STYAALPCSSALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATE--TFTLA 221
Query: 126 NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
V FGCG N G GLVGLGR LSL +SQLG ++FSYCL D+
Sbjct: 222 RQKVPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSL----VSQLGIDRFSYCLTSLD-DA 276
Query: 186 SITSKMYFGNGSEVSGGGVV----STSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYY 240
+ S + G+ + +S +T LV + ++Y+V+L G++VG S +
Sbjct: 277 AGRSPLLLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVG-----STRLALP 331
Query: 241 NSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP 297
+S+ AI G + +D+G T L Y L + + L +G LC++ P
Sbjct: 332 SSAFAIQDDGTGGVIVDSGTSITYLELRAYRALRKAFVAHMSLPTVDASEIGLDLCFQGP 391
Query: 298 SMA------GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFA 351
+ A P L HFDGGA + L + + G C + G + I GNF
Sbjct: 392 AGAVDQDVQVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMASRG-LSIIGNFQ 450
Query: 352 QSDLFIGYDFDSQMVSFKPTDCTK 375
Q + YD +SF P +C K
Sbjct: 451 QQNFQFVYDVAGDTLSFAPAECNK 474
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 127/373 (34%), Positives = 183/373 (49%), Gaps = 21/373 (5%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S ++ +GEY K +GTP + ++DTGSD++W+QC PC +CY Q +++P S
Sbjct: 131 VVSGLAQGSGEYFTKIGVGTPATPALM-VLDTGSDVVWLQCAPCRRCYDQSGQVFDPRRS 189
Query: 73 SSYKELSCQSEQCHLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
SY + C + C LD+ C ++ C Y Y D S+T G ATE +TF
Sbjct: 190 RSYGAVGCSAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAGGAR-VAR 248
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD---SSIT 188
+ GCGH+N G+F L+GLGR LS +QI + G FSYCLV + +S +
Sbjct: 249 IALGCGHDNEGLFVAAAG-LLGLGRGSLSFPAQISRRYG-RSFSYCLVDRTSSANPASHS 306
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
S + FG+G+ S T +V +T+Y+V L GISVG S +
Sbjct: 307 STVTFGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRLDPSSG 366
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQL---CYKTPSMAG 301
+G + +D+G T L + Y+ L + R A ++L+P G L CY
Sbjct: 367 RGGVIVDSGTSVTRLARPAYSALRDAFRAAAAGLRLSPG-----GFSLFDTCYDLSGRKV 421
Query: 302 I-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
+ P ++ HF GGA+ L + IP +G FCFA DG V I GN Q + +D
Sbjct: 422 VKVPTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFD 481
Query: 361 FDSQMVSFKPTDC 373
D Q V F P C
Sbjct: 482 GDGQRVGFVPKGC 494
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 134/381 (35%), Positives = 193/381 (50%), Gaps = 38/381 (9%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
+ + + EY+M+ +IGTPP+ + DTGSDL W QC PC C+ Q PIY+ A SSS
Sbjct: 84 ARLRSGQAEYLMELAIGTPPV-PFVALADTGSDLTWTQCQPCKLCFPQDTPIYDTAVSSS 142
Query: 75 YKELSCQSEQC-HLLDTVSC-SSQQLCNYTYGYADSSLTKGVLATERITF-GNSNNFFDN 131
+ + C S C + + +C +S C Y Y Y D + + GVL TE +TF G
Sbjct: 143 FSPVPCASATCLPIWSSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPGAPGVSVGG 202
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
+ FGCG +N G+ + N G VGLGR LSL ++QLG KFSYCL F ++S+ S +
Sbjct: 203 IAFGCGVDNGGL-SYNSTGTVGLGRGSLSL----VAQLGVGKFSYCLTDFF-NTSLGSPV 256
Query: 192 YFGNGSEVS----GGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
FG +E++ G V ST LV S T+Y+V+LEGIS+G+ +P N + +
Sbjct: 257 LFGALAELAAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGD-----ARLPIPNGTFDL 311
Query: 247 S---KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY-------KT 296
G M +D+G T L + + + + V ++ L S C+ +
Sbjct: 312 RDDGSGGMIVDSGTTFTFLVESAFRVVVDHVAGVLRQPVVNASSLDSP-CFPAATGEQQL 370
Query: 297 PSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF--AMQPIDGDVGIFGNFAQSD 354
P+M P + HF GGA + L + E FC A P DV I GNF Q +
Sbjct: 371 PAM----PDMVLHFAGGADMRLHRDNYMSFNQEESSFCLNIAGSP-SADVSILGNFQQQN 425
Query: 355 LFIGYDFDSQMVSFKPTDCTK 375
+ + +D +SF PTDC K
Sbjct: 426 IQMLFDITVGQLSFMPTDCGK 446
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 181 bits (459), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 141/393 (35%), Positives = 192/393 (48%), Gaps = 48/393 (12%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQ--VKPIYNPA 70
VQ+ + G Y M S+GTPPL D IVDTGS+L+W QC PC +C+ + P+ PA
Sbjct: 80 VQAQLENGAGAYNMNISLGTPPL-DFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPA 138
Query: 71 SSSSYKELSCQSEQCHLLDTVS----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
SS++ L C C L T S C++ C Y Y Y S T G LATE +T G+
Sbjct: 139 RSSTFSRLPCNGSFCQYLPTSSRPRTCNATAACAYNYTYG-SGYTAGYLATETLTVGDGT 197
Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
F V FGC N GV +N G+VGLGR LSL +SQL +FSYCL D
Sbjct: 198 --FPKVAFGCSTEN-GV--DNSSGIVGLGRGPLSL----VSQLAVGRFSYCLRSDMADGG 248
Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSK----EDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
S + FG+ ++++ VV ++ + K + T+Y+V L GI+V S +P S
Sbjct: 249 -ASPILFGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAV-----DSTELPVTGS 302
Query: 243 SGAISK----GNMFIDTGAPPTLLPKDFYNRLEE----QVRNAIKLTPYQDPRLGSQLCY 294
+ ++ G +D+G T L KD Y +++ Q+ N + TP LCY
Sbjct: 303 TFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCY 362
Query: 295 KTPSMAG-----IAPILTAHFDGGAK--VPLIHTSTFIPPPVEG---VFCFAMQPIDGD- 343
K PS G P L F GGAK VP+ + + +G V C + P D
Sbjct: 363 K-PSAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDL 421
Query: 344 -VGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
+ I GN Q D+ + YD D M SF P DC K
Sbjct: 422 PISIIGNLMQMDMHLLYDIDGGMFSFAPADCAK 454
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 127/382 (33%), Positives = 182/382 (47%), Gaps = 35/382 (9%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S ++ +GEY K +GTP + ++DTGSD++WVQC PC +CY+Q P+++P S
Sbjct: 118 VVSGLAQGSGEYFTKIGVGTPATQALM-VLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRS 176
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
SSY + C + C LD+ C ++ C Y Y D S+T G TE +TF
Sbjct: 177 SSYGAVGCGAALCRRLDSGGCDLRRGACMYQVAYGDGSVTAGDFVTETLTFAGGAR-VAR 235
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD------- 184
V GCGH+N G+F L+GLGR LS +QI + G FSYCLV +
Sbjct: 236 VALGCGHDNEGLFVAAAG-LLGLGRGGLSFPTQISRRYG-RSFSYCLVDRTSSGAGAAPG 293
Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
S +S + FG GS + + + + +T+Y+V L GISVG
Sbjct: 294 SHRSSTVSFGAGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDP 353
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA----IKLTP---------YQDPRLGSQ 291
+ +G + +D+G T L + Y+ L + R A ++L+P Y LG +
Sbjct: 354 STGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYD---LGGR 410
Query: 292 LCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFA 351
K P+ ++ HF GGA+ L + IP G FCFA DG V I GN
Sbjct: 411 RVVKVPT-------VSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQ 463
Query: 352 QSDLFIGYDFDSQMVSFKPTDC 373
Q + +D D Q V F P C
Sbjct: 464 QQGFRVVFDGDGQRVGFAPKGC 485
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 181 bits (458), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 133/364 (36%), Positives = 188/364 (51%), Gaps = 30/364 (8%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
+GEY+M+FS+GTP + + I DTGSDL W+QC PC CY Q P+++P SS+Y ++ C
Sbjct: 85 HGEYLMRFSLGTPSV-ERLAIFDTGSDLSWLQCTPCKTCYPQEAPLFDPTQSSTYVDVPC 143
Query: 81 QSEQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITF-----GNSNNFFDNVV 133
+S+ C L + C S + C Y + Y S T G L + I+F G F V
Sbjct: 144 ESQPCTLFPQNQRECGSSKQCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKSV 203
Query: 134 FGCG--HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
FGC N T + G VGLG LSLASQ+ Q+G +KFSYC+VPF + S T K+
Sbjct: 204 FGCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIG-HKFSYCMVPFSSTS--TGKL 260
Query: 192 YFGNGSEVSGGGVVSTS-LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
F GS VVST +++ +YY + LEGI+VG K++ +G I GN
Sbjct: 261 KF--GSMAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQ----KKVL-----TGQIG-GN 308
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHF 310
+ ID+ T L + Y V+ AI + +D + C + P+ P HF
Sbjct: 309 IIIDSVPILTHLEQGIYTDFISSVKEAINVEVAEDAPTPFEYCVRNPTNLNF-PEFVFHF 367
Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
GA V L + FI + C + P G + IFGN+AQ + + YD + VSF P
Sbjct: 368 T-GADVVLGPKNMFIALD-NNLVCMTVVPSKG-ISIFGNWAQVNFQVEYDLGEKKVSFAP 424
Query: 371 TDCT 374
T+C+
Sbjct: 425 TNCS 428
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 129/373 (34%), Positives = 183/373 (49%), Gaps = 37/373 (9%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ +IGTPP + +DTGSDL+W QC PCV C+ Q P ++ + SS+ L C+S
Sbjct: 34 EYLVHLAIGTPPQ-PVQLTLDTGSDLIWTQCKPCVSCFDQPLPYFDTSRSSTNALLPCES 92
Query: 83 EQCHLLDTVSC-----SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
QC L TV+ + Q C Y Y D+S+T G+LA ++ TF + V FGCG
Sbjct: 93 TQCKLDPTVTVCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKFTF-VAGTSLPGVTFGCG 151
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL------VPFHTDSSITSKM 191
NNTGVFN NE G+ G GR LSL SQL FS+C +P + + +
Sbjct: 152 LNNTGVFNSNETGIAGFGRGPLSLP----SQLKVGNFSHCFTTITGAIPSTVLLDLPADL 207
Query: 192 YFGNGSEVSGGGVVSTSLV----SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
F NG G V +T L+ ++ + T Y+++L+GI+VG S +P S+ A++
Sbjct: 208 -FSNGQ----GAVQTTPLIQYAKNEANPTLYYLSLKGITVG-----STRLPVPESAFALT 257
Query: 248 KGN--MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-AP 304
G ID+G T LP Y + ++ IKL G C+ PS A P
Sbjct: 258 NGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVP 317
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEG--VFCFAMQPIDGDVGIFGNFAQSDLFIGYDFD 362
L HF+G F P G + C A+ D + I GNF Q ++ + YD
Sbjct: 318 KLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGD-ETTIIGNFQQQNMHVLYDLQ 376
Query: 363 SQMVSFKPTDCTK 375
+ M+SF C K
Sbjct: 377 NNMLSFVAAQCDK 389
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 134/367 (36%), Positives = 184/367 (50%), Gaps = 25/367 (6%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
VS +GEYV++ S+GTPP IVDTGSDL WVQC PC +C++Q P++ P +SSSY
Sbjct: 1 VSAGSGEYVLQISLGTPPQ-QFSAIVDTGSDLCWVQCAPCARCFEQPDPLFIPLASSSYS 59
Query: 77 ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
SC C L +CS + C Y+Y Y D S T+G A E +T S + FGC
Sbjct: 60 NASCTDSLCDALPRPTCSMRNTCTYSYSYGDGSNTRGDFAFETVTLNGST--LARIGFGC 117
Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
GHN G F + GL+GLG+ LSL SQ+ S + FSYCLV T + S + FGN
Sbjct: 118 GHNQEGTFAGAD-GLIGLGQGPLSLPSQLNSSF-THIFSYCLVDQSTTGTF-SPITFGNA 174
Query: 197 SEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNMF 252
+E S T L+ ED +YY+V +E ISVGN + +P S+ I G +
Sbjct: 175 AENSRASF--TPLLQNEDNPSYYYVGVESISVGN-----RRVPTPPSAFRIDANGVGGVI 227
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPR-LGSQLCYKTPSMAGIA---PILTA 308
+D+G T + + ++R I P DP G LCY S++ + P +T
Sbjct: 228 LDSGTTITYWRLAAFIPILAELRRQISY-PEADPTPYGLNLCYDISSVSASSLTLPSMTV 286
Query: 309 HFDG-GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
H ++P+ + + E V C AM D I GN Q + I D + V
Sbjct: 287 HLTNVDFEIPVSNLWVLVDNFGETV-CTAMSTSD-QFSIIGNVQQQNNLIVTDVANSRVG 344
Query: 368 FKPTDCT 374
F TDC+
Sbjct: 345 FLATDCS 351
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 127/374 (33%), Positives = 183/374 (48%), Gaps = 22/374 (5%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S ++ +GEY K +GTP + ++DTGSD++W+QC PC +CY Q +++P +S
Sbjct: 136 VVSGLAQGSGEYFTKIGVGTP-VTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRAS 194
Query: 73 SSYKELSCQSEQCHLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
SY + C + C LD+ C ++ C Y Y D S+T G ATE +TF S
Sbjct: 195 HSYGAVDCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFA-SGARVPR 253
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV----PFHTDSSI 187
V GCGH+N G+F L+GLGR LS SQI + G FSYCLV + +S
Sbjct: 254 VALGCGHDNEGLFVAAAG-LLGLGRGSLSFPSQISRRFG-RSFSYCLVDRTSSSASATSR 311
Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
+S + FG+G+ T +V +T+Y+V L GISVG + +
Sbjct: 312 SSTVTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPST 371
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQL---CYKTPSMA 300
+G + +D+G T L + Y L + R A ++L+P G L CY +
Sbjct: 372 GRGGVIVDSGTSVTRLARPAYAALRDAFRAAAAGLRLSPG-----GFSLFDTCYDLSGLK 426
Query: 301 GI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
+ P ++ HF GGA+ L + IP G FCFA DG V I GN Q + +
Sbjct: 427 VVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVF 486
Query: 360 DFDSQMVSFKPTDC 373
D D Q + F P C
Sbjct: 487 DGDGQRLGFVPKGC 500
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 136/365 (37%), Positives = 180/365 (49%), Gaps = 29/365 (7%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
V++ NGEY++ S G+PP IVDTGSDL+W QCLPC C I++P SS+Y
Sbjct: 73 VASGNGEYLIDISFGSPPQ-KASVIVDTGSDLIWTQCLPCETCNAAASVIFDPVKSSTYD 131
Query: 77 ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
+SC S C L SC++ C Y Y Y D S T G L+TE +T NV FGC
Sbjct: 132 TVSCASNFCSSLPFQSCTTS--CKYDYMYGDGSSTSGALSTETVT--VGTGTIPNVAFGC 187
Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
GH N G F G+VGLG+ LSL SQ S + + KFSYCLVP S+ TS M G+
Sbjct: 188 GHTNLGSF-AGAAGIVGLGQGPLSLISQA-SSITSKKFSYCLVPL--GSTKTSPMLIGD- 242
Query: 197 SEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPY---YNSSGAISKGNMF 252
+ GGV T+L++ + T+Y+ L GISV S K + Y S A +G
Sbjct: 243 -SAAAGGVAYTALLTNTANPTFYYADLTGISV-----SGKAVTYPVGTFSIDASGQGGFI 296
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA----PILTA 308
+D+G T L +N L ++ + G C+ T AG+A P +T
Sbjct: 297 LDSGTTLTYLETGAFNALVAALKAEVPFPEADGSLYGLDYCFST---AGVANPTYPTMTF 353
Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
HF GA L + F+ G C AM G I GN Q + I +D +Q V F
Sbjct: 354 HFK-GADYELPPENVFVALDTGGSICLAMAASTG-FSIMGNIQQQNHLIVHDLVNQRVGF 411
Query: 369 KPTDC 373
K +C
Sbjct: 412 KEANC 416
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 115/365 (31%), Positives = 173/365 (47%), Gaps = 20/365 (5%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S + +GEY ++ IG+PP + Y +VD+GSD++WVQC PC++CY Q P+++PA+S
Sbjct: 116 VVSGLDEGSGEYFVRVGIGSPPT-EQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPATS 174
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
+++ + C S C L T C C+Y Y D S TKG LA E +T G + + V
Sbjct: 175 ATFSAVPCGSAVCRTLRTSGCGDSGGCDYEVSYGDGSYTKGALALETLTLGGTA--VEGV 232
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH N G+F GL+GLG +SL Q L FSYCL S +
Sbjct: 233 AIGCGHRNRGLF-VGAAGLLGLGWGPMSLVGQ-LGGAAGGAFSYCLA-----SRGAGSLV 285
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---G 249
G V G V + + + ++Y+V L GI VG+ + +P +++ G
Sbjct: 286 LGRSEAVPEGAVWVPLVRNPQAPSFYYVGLSGIGVGD-----ERLPLQEDLFQLTEDGAG 340
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTA 308
+ +DTG T LP++ Y L + A+ P CY + P ++
Sbjct: 341 GVVMDTGTAVTRLPQEAYAALRDAFVAAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSF 400
Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
+FDG A + L + + G++C A P I GN Q + I D + + F
Sbjct: 401 YFDGAATLTLPARNLLLEVD-GGIYCLAFAPSSSGPSILGNIQQEGIQITVDSANGYIGF 459
Query: 369 KPTDC 373
PT C
Sbjct: 460 GPTTC 464
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 120/362 (33%), Positives = 176/362 (48%), Gaps = 10/362 (2%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S + +GEY + IGTP + Y ++DTGSD++W+QC PC +CY Q PI+NP+SS
Sbjct: 143 VVSGMEQGSGEYFTRIGIGTP-TREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSS 201
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+ + C S C LD C C Y Y D S T G ATE +TFG ++ NV
Sbjct: 202 VSFSTVGCDSAVCSQLDANDCHGGG-CLYEVSYGDGSYTVGSYATETLTFGTTS--IQNV 258
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH+N G+F L + LS +Q+ +Q G FSYCLV ++SS T +
Sbjct: 259 AIGCGHDNVGLFVGAAGLLGLGAGS-LSFPAQLGTQTG-RAFSYCLVDRDSESSGT--LE 314
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
FG S V G + + + + T+Y++++ ISVG + S + +G +
Sbjct: 315 FGPES-VPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGII 373
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFD 311
ID+G T L Y+ L + + P D CY ++ ++ P + HF
Sbjct: 374 IDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVGFHFS 433
Query: 312 GGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
GA L + IP G FCFA P D ++ I GN Q + + +D + +V F
Sbjct: 434 NGAGFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAID 493
Query: 372 DC 373
C
Sbjct: 494 QC 495
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 129/365 (35%), Positives = 180/365 (49%), Gaps = 22/365 (6%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S S +GEY ++ IG PP Y ++DTGSD+ W+QC PC +CY+Q PI++P SS
Sbjct: 138 VVSGTSQGSGEYFLRVGIGKPPS-QAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPVSS 196
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
+SY + C + QC LD C + C Y Y D S T G ATE +T G + +NV
Sbjct: 197 NSYSPIRCDAPQCKSLDLSECRNGT-CLYEVSYGDGSYTVGEFATETVTLGTAA--VENV 253
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGHNN G+F GL+GLG +LS +Q+ A FSYCLV + DS S +
Sbjct: 254 AIGCGHNNEGLF-VGAAGLLGLGGGKLSFPAQV----NATSFSYCLV--NRDSDAVSTLE 306
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS---SGAISKG 249
F S + V + + E T+Y++ L+GISVG + +P S AI G
Sbjct: 307 F--NSPLPRNVVTAPLRRNPELDTFYYLGLKGISVGG-----EALPIPESIFEVDAIGGG 359
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTA 308
+ ID+G T L + Y+ L + K P + CY S + P ++
Sbjct: 360 GIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVQVPTVSF 419
Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
HF G ++PL + IP G FCFA P + I GN Q +G+D + +V F
Sbjct: 420 HFPEGRELPLPARNYLIPVDSVGTFCFAFAPTTSSLSIMGNVQQQGTRVGFDIANSLVGF 479
Query: 369 KPTDC 373
C
Sbjct: 480 SADSC 484
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 124/365 (33%), Positives = 171/365 (46%), Gaps = 25/365 (6%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
S S +GEY + +G P Y ++DTGSD+ W+QC PC CY+Q PI++P SSSS
Sbjct: 146 SGTSQGSGEYFSRVGVGQPAK-PFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSS 204
Query: 75 YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
+ L C+S+QC L+T C + + C Y Y D S T G TE +TFGNS ++V
Sbjct: 205 FASLPCESQQCQALETSGCRASK-CLYQVSYGDGSFTVGEFVTETLTFGNS-GMINDVAV 262
Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
GCGH+N G+F + + SQ+ A+ FSYCLV D +S
Sbjct: 263 GCGHDNEGLFVGSAG-----LLGLGGGPLSLTSQMKASSFSYCLV----DRDSSSSSDLE 313
Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKGNMF 252
S V + L S + T+Y+V L G+SVG LS L +S G +
Sbjct: 314 FNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDS----GYGGII 369
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGIA-PILTA 308
+D+G T L YN L + + TPY G L CY S + + P ++
Sbjct: 370 VDSGTAITRLQTQAYNTLRDAF---VSRTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSF 426
Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
F GG + L + IP G FCFA P + I GN Q + YD + +V F
Sbjct: 427 EFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSVVGF 486
Query: 369 KPTDC 373
P C
Sbjct: 487 SPHKC 491
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 132/376 (35%), Positives = 188/376 (50%), Gaps = 31/376 (8%)
Query: 14 QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
Q+ + G Y M S+GTP LL + DTGSDL+W QC PC +C++Q P + PASSS
Sbjct: 76 QALLENGVGGYNMNISVGTP-LLTFPVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSS 134
Query: 74 SYKELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
++ +L C S C L +++ + C Y Y Y S T G LATE + G+++ F +V
Sbjct: 135 TFSKLPCTSSFCQFLPNSIRTCNATGCVYNYKYG-SGYTAGYLATETLKVGDAS--FPSV 191
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGC N GV N G+ GLGR LSL + QLG +FSYCL ++ S +
Sbjct: 192 AFGCSTEN-GVGNSTS-GIAGLGRGALSL----IPQLGVGRFSYCLR--SGSAAGASPIL 243
Query: 193 FGNGSEVSGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK-- 248
FG+ + ++ G V ST V+ +YY+V L GI+VG +P S+ ++
Sbjct: 244 FGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETD-----LPVTTSTFGFTQNG 298
Query: 249 --GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK-TPSMAGIA-P 304
G +D+G T L KD Y +++ + + G LC+K T GIA P
Sbjct: 299 LGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTANVTTVNGTRGLDLCFKSTGGGGGIAVP 358
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD--VGIFGNFAQSDLFIGY 359
L FDGGA+ + + +G V C M P GD + + GN Q D+ + Y
Sbjct: 359 SLVLRFDGGAEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLY 418
Query: 360 DFDSQMVSFKPTDCTK 375
D D + SF P DC K
Sbjct: 419 DLDGGIFSFSPADCAK 434
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 124/403 (30%), Positives = 193/403 (47%), Gaps = 56/403 (13%)
Query: 12 VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
V ++ V +A GEY++K +GTP +DT SDL+W QC PCV+CYKQ+ P++NP +
Sbjct: 76 VAEAPVLSAGGEYLVKLGLGTPQHC-FTAAIDTASDLIWTQCQPCVKCYKQLDPVFNPVA 134
Query: 72 SSSYKELSCQSEQCHLLDTVSCSS------QQLCNYTYGYADSSLTKGVLATERITFGNS 125
S+SY + C S+ C LDT C+ + C YTY Y ++ T+G+LA +R+ G
Sbjct: 135 STSYAVVPCNSDTCDELDTHRCARDGDSDDEDACQYTYSYGGNATTRGILAVDRLAIG-- 192
Query: 126 NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
++ F VVFGC ++ G G+VGLGR LSL +SQL +F YCL P + S
Sbjct: 193 DDVFRGVVFGCSSSSVGGPPPQVSGVVGLGRGALSL----VSQLSVRRFMYCLPPPVSRS 248
Query: 186 SITSKMYFGNGS-----EVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
+ ++ G + S VV S S+ +YY++ L+GIS+G+ + S +
Sbjct: 249 A--GRLVLGADAAATVRNASERVVVPMSTGSRY-PSYYYLNLDGISIGDRAMSFR---SR 302
Query: 241 NSSGAISKGN--------------------------MFIDTGAPPTLLPKDFYNRLEEQV 274
N A + G M ID + T L + Y + + +
Sbjct: 303 NRMNATTPGTAAGAPASPVSGSGDGDGSGTGPDAYGMIIDIASTITFLEESLYEEMVDDL 362
Query: 275 RNAIKLTPYQDPRLGSQLCYKTPS---MAGI-APILTAHFDGGAKVPLIHTSTFIPPPVE 330
I+L LG LC+ P M+ + AP ++ F+ G + L F+
Sbjct: 363 EEEIRLPRGSGSDLGLDLCFILPEGVPMSRVYAPPVSLAFE-GVWLRLDKEQMFVEDRAS 421
Query: 331 GVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
G+ C + DG V I GN+ Q ++ + Y+ ++F T C
Sbjct: 422 GMMCLMVGKTDG-VSILGNYQQQNMQVMYNLRRGRITFIKTAC 463
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 122/367 (33%), Positives = 174/367 (47%), Gaps = 22/367 (5%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
Q +S G YV+ +GTP D+ + DTGSDL WVQC PC CY+Q P+++PA S
Sbjct: 135 AQRGISLGTGNYVVSMGLGTP-ARDMTVVFDTGSDLSWVQCTPCSDCYEQKDPLFDPARS 193
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+Y + C S +C LD+ SCS + C Y Y D S T G LA + +T S +
Sbjct: 194 STYSAVPCASPECQGLDSRSCSRDKKCRYEVVYGDQSQTDGALARDTLTLTQS-DVLPGF 252
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
VFGCG +TG+F + GLVGLGR ++SL+SQ S+ GA FSYCL SS ++ Y
Sbjct: 253 VFGCGEQDTGLFGRAD-GLVGLGREKVSLSSQAASKYGAG-FSYCL-----PSSPSAAGY 305
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGNM 251
G + + ++Y+V L G+ V G S ++ ++++G +
Sbjct: 306 LSLGGPAPANARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIV--FSAAGTV----- 358
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTA 308
ID+G T LP Y L ++ Y+ S L CY + P +
Sbjct: 359 -IDSGTVITRLPPRVYAALRSAFARSMGRYGYKRAPALSILDTCYDFTGHTTVRIPSVAL 417
Query: 309 HFDGGAKVPLIHTST-FIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
F GGA V L + ++ + FA D GI GN Q L + YD Q +
Sbjct: 418 VFAGGAAVGLDFSGVLYVAKVSQACLAFAPNGDGADAGIIGNTQQKTLAVVYDVARQKIG 477
Query: 368 FKPTDCT 374
F C+
Sbjct: 478 FGANGCS 484
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 177 bits (450), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 124/365 (33%), Positives = 171/365 (46%), Gaps = 25/365 (6%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
S S +GEY + +G P Y ++DTGSD+ W+QC PC CY+Q PI++P SSSS
Sbjct: 146 SGTSQGSGEYFSRVGVGQPAK-PFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSS 204
Query: 75 YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
+ L C+S+QC L+T C + + C Y Y D S T G E +TFGNS +NV
Sbjct: 205 FASLPCESQQCQALETSGCRASK-CLYQVSYGDGSFTVGEFVIETLTFGNS-GMINNVAV 262
Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
GCGH+N G+F + + + SQ+ A+ FSYCLV D +S
Sbjct: 263 GCGHDNEGLFVGSAG-----LLGLGGGSLSLTSQMKASSFSYCLV----DRDSSSSSDLE 313
Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKGNMF 252
S V + L S + T+Y+V L G+SVG LS L +S G +
Sbjct: 314 FNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDS----GYGGII 369
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGIA-PILTA 308
+D+G T L YN L + + TPY G L CY S + + P ++
Sbjct: 370 VDSGTAITRLQTQAYNTLRDAF---VSRTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSF 426
Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
F GG + L + IP G FCFA P + I GN Q + YD + +V F
Sbjct: 427 EFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSVVGF 486
Query: 369 KPTDC 373
P C
Sbjct: 487 SPHKC 491
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 177 bits (450), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 124/362 (34%), Positives = 179/362 (49%), Gaps = 10/362 (2%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S ++ +GEY + +GTP + + Y ++DTGSD++W+QC PC +CY QV PI+NP+ S
Sbjct: 186 VVSGMAQGSGEYFTRIGVGTP-MREQYMVLDTGSDVVWIQCEPCSKCYSQVDPIFNPSLS 244
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
+S+ L C S C LD +C C Y Y D S T G ATE +TFG ++ NV
Sbjct: 245 ASFSTLGCNSAVCSYLDAYNCHGGG-CLYKVSYGDGSYTIGSFATEMLTFGTTS--VRNV 301
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH+N G+F L+GLG LS SQ+ +Q G FSYCLV ++SS T +
Sbjct: 302 AIGCGHDNAGLFVGAAG-LLGLGAGLLSFPSQLGTQTG-RAFSYCLVDRFSESSGT--LE 357
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
FG S V G +++ L + T+Y+V L ISVG S + +G
Sbjct: 358 FGPES-VPLGSILTPLLTNPSLPTFYYVPLISISVGGALLDSVPPDVFRIDETSGRGGFI 416
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFD 311
+D+G T L Y+ + + + P + CY + + P + HF
Sbjct: 417 VDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSIFDTCYDLSGLPLVNVPTVVFHFS 476
Query: 312 GGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
GA + L + IP G FCFA P D+ I GN Q + + +D + +V F
Sbjct: 477 NGASLILPAKNYMIPMDFMGTFCFAFAPATSDLSIMGNIQQQGIRVSFDTANSLVGFALR 536
Query: 372 DC 373
C
Sbjct: 537 QC 538
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 177 bits (449), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 128/365 (35%), Positives = 180/365 (49%), Gaps = 22/365 (6%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S S +GEY ++ IG PP Y ++DTGSD+ W+QC PC +CY+Q PI++P SS
Sbjct: 138 VVSGTSQGSGEYFLRVGIGKPPS-QAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPISS 196
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
+SY + C QC LD C + C Y Y D S T G ATE +T G++ +NV
Sbjct: 197 NSYSPIRCDEPQCKSLDLSECRNGT-CLYEVSYGDGSYTVGEFATETVTLGSAA--VENV 253
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGHNN G+F GL+GLG +LS +Q+ A FSYCLV + DS S +
Sbjct: 254 AIGCGHNNEGLF-VGAAGLLGLGGGKLSFPAQV----NATSFSYCLV--NRDSDAVSTLE 306
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS---GAISKG 249
F S + + + + E T+Y++ L+GISVG + +P SS AI G
Sbjct: 307 F--NSPLPRNAATAPLMRNPELDTFYYLGLKGISVGG-----EALPIPESSFEVDAIGGG 359
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTA 308
+ ID+G T L + Y+ L + K P + CY S + P ++
Sbjct: 360 GIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVEIPTVSF 419
Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
F G ++PL + IP G FCFA P + I GN Q +G+D + +V F
Sbjct: 420 RFPEGRELPLPARNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVGFDIANSLVGF 479
Query: 369 KPTDC 373
C
Sbjct: 480 SVDSC 484
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 177 bits (449), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 137/389 (35%), Positives = 189/389 (48%), Gaps = 54/389 (13%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--------QCYKQVKPIYNPASSS 73
GEY+M SIGTPPL I DTGSDL+W QC PC QC+KQ +YNP+SS+
Sbjct: 85 GEYIMTLSIGTPPL-SYRAIADTGSDLIWTQCAPCGDTVTDTDNQCFKQSGCLYNPSSST 143
Query: 74 SYKELSCQS--EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---- 127
++ L C S C + S C Y Y + T GV + E TFG+S+
Sbjct: 144 TFGVLPCNSPLSMCAAMAGPSPPPGCACMYNQTYG-TGWTAGVQSVETFTFGSSSTPPAV 202
Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
N+ FGC + ++ +N GLVGLGR +SL +SQLGA FSYCL PF D++
Sbjct: 203 RVPNIAFGCSNASSNDWN-GSAGLVGLGRGSMSL----VSQLGAGAFSYCLTPFQ-DANS 256
Query: 188 TSKMYFGNGSEVS---GGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIPYY 240
TS + G + + G V ST V+ K TYY++ L GISVG + + P
Sbjct: 257 TSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGISVGE--TALAIPPDA 314
Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRN----AIKLTPYQDPRLGSQLCY-- 294
S A G + ID+G T L Y ++ VR+ + L D G LC+
Sbjct: 315 FSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPLAHGPDHSTGLDLCFAL 374
Query: 295 KTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVE-------GVFCFAMQ-PIDGDVGI 346
K + P +T HF+GGA + L PVE GV+C AM+ G + +
Sbjct: 375 KASTPPPAMPSMTLHFEGGADMVL---------PVENYMILGSGVWCLAMRNQTVGAMSM 425
Query: 347 FGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
GN+ Q ++ + YD + +SF P C+
Sbjct: 426 VGNYQQQNIHVLYDVRKETLSFAPAVCSS 454
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 177 bits (449), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 134/383 (34%), Positives = 190/383 (49%), Gaps = 37/383 (9%)
Query: 14 QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
Q+ + + G Y M SIGTPP+ + DTGS L+W QC PC +C + P + PASSS
Sbjct: 80 QTLLDNSAGAYNMNLSIGTPPV-TFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSS 138
Query: 74 SYKELSCQSEQCHLLDT--VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
++ +L C S C L + ++C++ C Y Y Y T G LATE + G ++ F
Sbjct: 139 TFSKLPCASSLCQFLTSPYLTCNATG-CVYYYPYG-MGFTAGYLATETLHVGGAS--FPG 194
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI-TSK 190
V FGC N GV N + G+VGLGR+ LSL SQ+ G +FSYCL +D+ S
Sbjct: 195 VAFGCSTEN-GVGNSSS-GIVGLGRSPLSLVSQV----GVGRFSYCL---RSDADAGDSP 245
Query: 191 MYFGNGSEVSGGGVVSTSLVSKED---KTYYFVTLEGISVG--NLSNSSKLIPYYNSSGA 245
+ FG+ ++V+GG V ST L+ + +YY+V L GI+VG +L +S + +GA
Sbjct: 246 ILFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGA 305
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEE----QVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
G +D+G T L K+ Y ++ Q+ A T R G LC+ + G
Sbjct: 306 GLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGG 365
Query: 302 IA----PILTAHFDGGAKVPLIHTSTFIPPPVE-----GVFCFAMQPIDG--DVGIFGNF 350
+ P L F GGA+ + S V+ V C + P + I GN
Sbjct: 366 GSGVPVPTLVLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPASEKLSISIIGNV 425
Query: 351 AQSDLFIGYDFDSQMVSFKPTDC 373
Q DL + YD D M SF P DC
Sbjct: 426 MQMDLHVLYDLDGGMFSFAPADC 448
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 177 bits (448), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 119/356 (33%), Positives = 172/356 (48%), Gaps = 12/356 (3%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
+GEY + IGTP + Y ++DTGSD++W+QC PC +CY Q PI+NP+SS S+ +
Sbjct: 4 GSGEYFTRIGIGTP-TREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVG 62
Query: 80 CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
C S C LD C C Y Y D S T G ATE +TFG ++ NV GCGH+
Sbjct: 63 CDSAVCSQLDANDCHGGG-CLYEVSYGDGSYTVGSYATETLTFGTTS--IQNVAIGCGHD 119
Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
N G+F L + LS +Q+ +Q G FSYCLV ++SS T + G E
Sbjct: 120 NVGLFVGAAGLLGLGAGS-LSFPAQLGTQTG-RAFSYCLVDRDSESSGTLEF----GPES 173
Query: 200 SGGGVVSTSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
G + T LV+ T+Y++++ ISVG + S + +G + ID+G
Sbjct: 174 VPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTA 233
Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVP 317
T L Y+ L + + P D CY ++ ++ P + HF GA
Sbjct: 234 VTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVGFHFSNGAGFI 293
Query: 318 LIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
L + IP G FCFA P D ++ I GN Q + + +D + +V F C
Sbjct: 294 LPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 349
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 130/373 (34%), Positives = 186/373 (49%), Gaps = 38/373 (10%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSC 80
GEY+M +IGTPPL I DTGSDL+W QC PC QC+KQ YNP+SS+++ L C
Sbjct: 86 GEYIMTLAIGTPPL-SYPAIADTGSDLIWTQCAPCGSQCFKQAGQPYNPSSSTTFGVLPC 144
Query: 81 QS--EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNS---NNFFDNVVFG 135
S C L S C Y Y + T G+ + E TFG++ + FG
Sbjct: 145 NSSVSMCAALAGPSPPPGCSCMYNQTYG-TGWTAGIQSVETFTFGSTPADQTRVPGIAFG 203
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
C + ++ +N GLVGLGR +SL +SQLGA FSYCL PF D++ TS + G
Sbjct: 204 CSNASSDDWN-GSAGLVGLGRGSMSL----VSQLGAGMFSYCLTPFQ-DANSTSTLLLGP 257
Query: 196 GSEVSGGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
+ ++G GV++T V+ K TYY++ L GIS+G + S + P + G +
Sbjct: 258 SAALNGTGVLTTPFVASPSKAPMSTYYYLNLTGISIGTTALS--IPPNAFALRTDGTGGL 315
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD--PRLGSQLCYKT-------PSMAGI 302
ID+G T L Y ++ + + + L P D G LC+ PSM
Sbjct: 316 IIDSGTTITSLVDAAYQQVRAAIESLVTL-PVADGSDSTGLDLCFALTSETSTPPSM--- 371
Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ-PIDGDVGIFGNFAQSDLFIGYDF 361
P +T HFDG V + + GV+C AM+ G + FGN+ Q ++ + YD
Sbjct: 372 -PSMTFHFDGADMVLPVDNYMILG---SGVWCLAMRNQTVGAMSTFGNYQQQNVHLLYDI 427
Query: 362 DSQMVSFKPTDCT 374
+ +SF P C+
Sbjct: 428 HEETLSFAPAKCS 440
>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 455
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 135/383 (35%), Positives = 196/383 (51%), Gaps = 35/383 (9%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V +++ + +G Y+MK IGTPP +I+ +DTGS+++W+ C+ C C+ Q I+NP +S
Sbjct: 87 VHASIFSGDGNYLMKLLIGTPPT-EIHAAIDTGSNVIWIPCINCKDCFNQSSSIFNPLAS 145
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADS-SLTKGVLATERITFGNSNNF--- 128
S+Y++ C S QC + SC S +C Y+ + G +A + +T +S+
Sbjct: 146 STYQDAPCDSYQCETTSS-SCQSDNVCLYSCDEKHQLNCPNGRIAVDTMTLTSSDGRPFP 204
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
F CG++ F +G++GLGR LSL S+ L L KFSYCL ++ S
Sbjct: 205 LPYSDFVCGNSIYKTF--AGVGVIGLGRGALSLTSK-LYHLSDGKFSYCLADYY--SKQP 259
Query: 189 SKMYFGNGSEVSGGG--VVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
SK+ FG S +S VVST+L Y+VTLEGISVG + + Y + A
Sbjct: 260 SKINFGLQSFISDDDLEVVSTTLGHHRHSGNYYVTLEGISVG---EKRQDLYYVDDPFAP 316
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPI- 305
GNM ID+G TLLPKDFY+ L V AI P P S+ + + ++P
Sbjct: 317 PVGNMLIDSGTMFTLLPKDFYDYLWSTVSYAIPENPQNHPH-NSRFPFSMDNTLKLSPCF 375
Query: 306 ----------LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQ 352
+T HF A V L ++FI E V CFA QP G ++G++ Q
Sbjct: 376 WYYPELKFPKITIHF-TDADVELSDDNSFI-RVAEDVVCFAFAATQP--GQSTVYGSWQQ 431
Query: 353 SDLFIGYDFDSQMVSFKPTDCTK 375
+ +GYD VSFK TDC+K
Sbjct: 432 MNFILGYDLKRGTVSFKRTDCSK 454
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 126/379 (33%), Positives = 190/379 (50%), Gaps = 41/379 (10%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ +IGTPP + I+DTGSDL W QC PCV C++Q P +NP+ S ++ L C
Sbjct: 110 EYLVHMAIGTPPQ-PVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDL 168
Query: 83 EQCHLLDTVSCSSQQ----LCNYTYGYADSSLTKGVLATERITFGNSNNFFD-----NVV 133
C L SC Q +C Y Y YAD S+T G L ++ +F ++++ ++
Sbjct: 169 RICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLT 228
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
FGCG N G+F NE G+ G R LS+ +QL + FSYC S S ++
Sbjct: 229 FGCGLFNNGIFVSNETGIAGFSRGALSMP----AQLKVDNFSYCFTAI--TGSEPSPVFL 282
Query: 194 GNG----SEVSGGG---VVSTSLV---SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
G S+ +GGG V ST+L+ S + K YY ++L+G++VG + +P S
Sbjct: 283 GVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYY-ISLKGVTVG-----TTRLPIPESV 336
Query: 244 GAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
A+ + G +D+G T+LP+ YN + + KLT + SQLC+ P A
Sbjct: 337 FALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGA 396
Query: 301 G-IAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGDVGIFGNFAQSDLF 356
P L HF+ GA + L + G + C A+ + D+ + GNF Q ++
Sbjct: 397 KPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLAINAGE-DLSVIGNFQQQNMH 454
Query: 357 IGYDFDSQMVSFKPTDCTK 375
+ YD + M+SF P C K
Sbjct: 455 VLYDLANDMLSFVPARCNK 473
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 126/379 (33%), Positives = 190/379 (50%), Gaps = 41/379 (10%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ +IGTPP + I+DTGSDL W QC PCV C++Q P +NP+ S ++ L C
Sbjct: 110 EYLVHMAIGTPPQ-PVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDL 168
Query: 83 EQCHLLDTVSCSSQQ----LCNYTYGYADSSLTKGVLATERITFGNSNNFFD-----NVV 133
C L SC Q +C Y Y YAD S+T G L ++ +F ++++ ++
Sbjct: 169 RICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLT 228
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
FGCG N G+F NE G+ G R LS+ +QL + FSYC S S ++
Sbjct: 229 FGCGLFNNGIFVSNETGIAGFSRGALSMP----AQLKVDNFSYCFTAI--TGSEPSPVFL 282
Query: 194 GNG----SEVSGGG---VVSTSLV---SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
G S+ +GGG V ST+L+ S + K YY ++L+G++VG + +P S
Sbjct: 283 GVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYY-ISLKGVTVG-----TTRLPIPESV 336
Query: 244 GAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
A+ + G +D+G T+LP+ YN + + KLT + SQLC+ P A
Sbjct: 337 FALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGA 396
Query: 301 G-IAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGDVGIFGNFAQSDLF 356
P L HF+ GA + L + G + C A+ + D+ + GNF Q ++
Sbjct: 397 KPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLAINAGE-DLSVIGNFQQQNMH 454
Query: 357 IGYDFDSQMVSFKPTDCTK 375
+ YD + M+SF P C K
Sbjct: 455 VLYDLANDMLSFVPARCNK 473
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 122/403 (30%), Positives = 178/403 (44%), Gaps = 52/403 (12%)
Query: 13 VQSNVSTANG-------EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQ-VK 64
V++ V TA EY++ S+GTPP + +DTGSDL+W QC PC+ C+ Q
Sbjct: 76 VRARVRTAGAGGGIVTNEYLVHLSVGTPPR-PVALTLDTGSDLVWTQCAPCLNCFDQGAI 134
Query: 65 PIYNPASSSSYKELSCQSEQCHLLDTVSCSS------QQLCNYTYGYADSSLTKGVLATE 118
P+ +PA+SS++ + C + C L SC ++ C Y Y Y D S+T G LA++
Sbjct: 135 PVLDPAASSTHAAVRCDAPVCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASD 194
Query: 119 RITFGNSNNF------FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGAN 172
R TFG +N + FGCGH N G+F NE G+ G GR R SL SQLG
Sbjct: 195 RFTFGPGDNADGGGVSERRLTFGCGHFNKGIFQANETGIAGFGRGRWSLP----SQLGVT 250
Query: 173 KFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLS 231
FSYC +S + G V ST L+ + + YF++L+ I+VG
Sbjct: 251 SFSYCFTSMFESTSSLVTLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVG--- 307
Query: 232 NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ 291
+ IP + + + ID+GA T LP+D Y ++ + + L
Sbjct: 308 --ATRIPIPERRQRLREASAIIDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSALD 365
Query: 292 LCYKTPSMAG------------------IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVF 333
LC+ PS A P L H GGA L + V
Sbjct: 366 LCFALPSAAAPKSAFGWRWRGRGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVM 425
Query: 334 CFAMQPIDG---DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
C + G + GN+ Q + + YD ++ ++SF P C
Sbjct: 426 CLVLDAATGGGDQTVVIGNYQQQNTHVVYDLENDVLSFAPARC 468
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 126/379 (33%), Positives = 190/379 (50%), Gaps = 41/379 (10%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ +IGTPP + I+DTGSDL W QC PCV C++Q P +NP+ S ++ L C
Sbjct: 84 EYLVHMAIGTPPQ-PVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDL 142
Query: 83 EQCHLLDTVSCSSQQ----LCNYTYGYADSSLTKGVLATERITFGNSNNFFD-----NVV 133
C L SC Q +C Y Y YAD S+T G L ++ +F ++++ ++
Sbjct: 143 RICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLT 202
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
FGCG N G+F NE G+ G R LS+ +QL + FSYC S S ++
Sbjct: 203 FGCGLFNNGIFVSNETGIAGFSRGALSMP----AQLKVDNFSYCFTAI--TGSEPSPVFL 256
Query: 194 GNG----SEVSGGG---VVSTSLV---SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
G S+ +GGG V ST+L+ S + K YY ++L+G++VG + +P S
Sbjct: 257 GVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYY-ISLKGVTVG-----TTRLPIPESV 310
Query: 244 GAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
A+ + G +D+G T+LP+ YN + + KLT + SQLC+ P A
Sbjct: 311 FALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGA 370
Query: 301 G-IAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGDVGIFGNFAQSDLF 356
P L HF+ GA + L + G + C A+ + D+ + GNF Q ++
Sbjct: 371 KPDVPALVLHFE-GATLDLPRENYMFEIEEAGGIRLTCLAINAGE-DLSVIGNFQQQNMH 428
Query: 357 IGYDFDSQMVSFKPTDCTK 375
+ YD + M+SF P C K
Sbjct: 429 VLYDLANDMLSFVPARCNK 447
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 128/368 (34%), Positives = 175/368 (47%), Gaps = 24/368 (6%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+Q G Y++ GTP + I+DTGSD+ W+QC PC CY QV PI+ P S
Sbjct: 127 LQPGSKVGTGNYIVTAGFGTPAKNSLL-IIDTGSDVTWIQCKPCSDCYSQVDPIFEPQQS 185
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
SSYK LSC S C L T++ C Y Y D S ++G + E +T G+ + F +
Sbjct: 186 SSYKHLSCLSSACTELTTMNHCRLGGCVYEINYGDGSRSQGDFSQETLTLGSDS--FPSF 243
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGCGH NTG+F + GL+GLGRT LS SQ S+ G +FSYCL F + +S T
Sbjct: 244 AFGCGHTNTGLF-KGSAGLLGLGRTALSFPSQTKSKYGG-QFSYCLPDFVSSTS-TGSFS 300
Query: 193 FGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
G GS + V LVS + ++YFV L GISVG S IP + +G
Sbjct: 301 VGQGSIPATATFV--PLVSNSNYPSFYFVGLNGISVGGERLS---IP----PAVLGRGGT 351
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHF 310
+D+G T L Y+ L+ R+ + P P CY S + + P +T HF
Sbjct: 352 IVDSGTVITRLVPQAYDALKTSFRSKTRNLPSAKPFSILDTCYDLSSYSQVRIPTITFHF 411
Query: 311 DGGAKVPLIHTSTFIPPPVEG-----VFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
A V + +G F A Q I + I GNF Q + + +D +
Sbjct: 412 QNNADVAVSAVGILFTIQSDGSQVCLAFASASQSISTN--IIGNFQQQRMRVAFDTGAGR 469
Query: 366 VSFKPTDC 373
+ F P C
Sbjct: 470 IGFAPGSC 477
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 123/362 (33%), Positives = 176/362 (48%), Gaps = 21/362 (5%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ +IGTPP + +DTGSDL+W QC PC C+ Q P Y+ + SS++ SC S
Sbjct: 90 EYLLHLAIGTPPQ-PVQLTLDTGSDLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDS 148
Query: 83 EQCHLLDTVS-CSSQ--QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
QC L +V+ C +Q Q C ++Y Y D S T G L E ++F + VVFGCG N
Sbjct: 149 TQCKLDPSVTMCVNQTVQTCAFSYSYGDKSATIGFLDVETVSF-VAGASVPGVVFGCGLN 207
Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
NTG+F NE G+ G GR LSL SQL FS+C T
Sbjct: 208 NTGIFRSNETGIAGFGRGPLSLP----SQLKVGNFSHCFTAVSGRKPSTVLFDLPADLYK 263
Query: 200 SGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN--MFIDT 255
+G G V T+ + K T+Y+++L+GI+VG S +P S+ A+ G ID+
Sbjct: 264 NGRGTVQTTPLIKNPAHPTFYYLSLKGITVG-----STRLPVPESAFALKNGTGGTIIDS 318
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM--AGIAPILTAHFDGG 313
G T LP Y + ++ +KL G LC+ P + A P L HF+ G
Sbjct: 319 GTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFE-G 377
Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
A + L + G + I+G++ I GNF Q ++ + YD + +SF C
Sbjct: 378 ATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 437
Query: 374 TK 375
K
Sbjct: 438 DK 439
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 120/348 (34%), Positives = 180/348 (51%), Gaps = 31/348 (8%)
Query: 42 VDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNY 101
+DTGSDL+W QC PC+ C Q P ++ S++Y+ L C+S +C L + SC +++C Y
Sbjct: 1 MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSC-FKKMCVY 59
Query: 102 TYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTR 158
Y Y D++ T GVLA E TFG +N+ N+ FGCG N G N G+VG GR
Sbjct: 60 QYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDL-ANSSGMVGFGRGP 118
Query: 159 LSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG-----NGSEVSGGGVVSTS--LVS 211
LSL +SQLG ++FSYCL + S+ S++YFG + + S G V ++ +++
Sbjct: 119 LSL----VSQLGPSRFSYCLTSYL--SATPSRLYFGVYANLSSTNTSSGSPVQSTPFVIN 172
Query: 212 KEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNMFIDTGAPPTLLPKDFYN 268
YF++L+ IS+G +KL+P AI+ G + ID+G T L +D Y
Sbjct: 173 PALPNMYFLSLKAISLG-----TKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYE 227
Query: 269 RLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI---APILTAHFDGGAKVPLIHTSTFI 325
+ + +AI L D +G C++ P + P L HFD A + L+ + +
Sbjct: 228 AVRRGLVSAIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFD-SANMTLLPENYML 286
Query: 326 PPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
G C M P G I GN+ Q +L + YD + +SF P C
Sbjct: 287 IASTTGYLCLVMAPT-GVGTIIGNYQQQNLHLLYDIGNSFLSFVPAPC 333
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 123/378 (32%), Positives = 179/378 (47%), Gaps = 25/378 (6%)
Query: 7 FYPNNVVQS---NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV 63
+ P ++V V +GEY ++ +G+PP D Y +VD+GSD++WVQC PC QCY Q
Sbjct: 110 YLPEDLVSEVVPGVDDGSGEYFVRVGVGSPPT-DQYLVVDSGSDVIWVQCRPCEQCYAQT 168
Query: 64 KPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQL---CNYTYGYADSSLTKGVLATERI 120
P+++PA+SSS+ +SC S C L C C+Y+ Y D S TKG LA E +
Sbjct: 169 DPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETL 228
Query: 121 TFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
T G + V GCGH N+G+F GL+GLG +SL Q+ G FSYCL
Sbjct: 229 TLGGTA--VQGVAIGCGHRNSGLF-VGAAGLLGLGWGAMSLVGQLGGAAG-GVFSYCLAS 284
Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
+ + G V G V + + + ++Y+V L GI VG + +P
Sbjct: 285 RGAGGA--GSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGG-----ERLPLQ 337
Query: 241 NSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP 297
+S +++ G + +DTG T LP++ Y L A+ P CY
Sbjct: 338 DSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLS 397
Query: 298 SMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEG-VFCFAMQPIDGDVGIFGNFAQSDL 355
A + P ++ +FD GA + L + + V G VFC A P + I GN Q +
Sbjct: 398 GYASVRVPTVSFYFDQGAVLTLPARNLLV--EVGGAVFCLAFAPSSSGISILGNIQQEGI 455
Query: 356 FIGYDFDSQMVSFKPTDC 373
I D + V F P C
Sbjct: 456 QITVDSANGYVGFGPNTC 473
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 127/363 (34%), Positives = 180/363 (49%), Gaps = 12/363 (3%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S ++ +GEY + +GTPP +Y ++DTGSD++W+QC PC +CY Q P+++P S
Sbjct: 136 VTSGLAQGSGEYFTRLGVGTPPKY-VYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKS 194
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+ +SC+S C LD+ C+S+Q C Y Y D S T G +TE +TF + V
Sbjct: 195 GSFSSISCRSPLCLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLTFRGTR--VPKV 252
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH+N G+F L+GLGR RLS +Q + G KFSYCLV + SS S +
Sbjct: 253 ALGCGHDNEGLFVGAAG-LLGLGRGRLSFPTQTGLRFG-RKFSYCLVD-RSASSKPSSVV 309
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
FG S VS V + + + + T+Y++ L GISVG + + A G +
Sbjct: 310 FGQ-SAVSRTAVFTPLITNPKLDTFYYLELTGISVGGARVAGITASLFKLDTA-GNGGVI 367
Query: 253 IDTGAPPTLLPKDFYNRLEEQVR-NAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHF 310
ID+G T L + Y L + R A L D L C+ + P + HF
Sbjct: 368 IDSGTSVTRLTRRAYVSLRDAFRAGAADLKRAPDYSL-FDTCFDLSGKTEVKVPTVVMHF 426
Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
GA V L T+ IP GVFCFA + I GN Q + +D + + F
Sbjct: 427 R-GADVSLPATNYLIPVDTNGVFCFAFAGTMSGLSIIGNIQQQGFRVVFDVAASRIGFAA 485
Query: 371 TDC 373
C
Sbjct: 486 RGC 488
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 129/377 (34%), Positives = 186/377 (49%), Gaps = 32/377 (8%)
Query: 14 QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
Q+ + G Y M S+GTP LL + DTGSDL+W QC PC +C++Q P + PASSS
Sbjct: 76 QALLENGVGGYNMNISVGTP-LLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSS 134
Query: 74 SYKELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
++ +L C S C L +++ + C Y Y Y S T G LATE + G+++ F +V
Sbjct: 135 TFSKLPCTSSFCQFLPNSIRTCNATGCVYNYKYG-SGYTAGYLATETLKVGDAS--FPSV 191
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGC N GV N G+ GLGR LSL + QLG +FSYCL ++ S +
Sbjct: 192 AFGCSTEN-GVGNSTS-GIAGLGRGALSL----IPQLGVGRFSYCLR--SGSAAGASPIL 243
Query: 193 FGNGSEVSGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK-- 248
FG+ + ++ G V ST V+ +YY+V L GI+VG +P S+ ++
Sbjct: 244 FGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETD-----LPVTTSTFGFTQNG 298
Query: 249 --GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG---IA 303
G +D+G T L KD Y +++ + + G LC+K+ G
Sbjct: 299 LGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAV 358
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD--VGIFGNFAQSDLFIG 358
P L FDGGA+ + + +G V C M P GD + + GN Q D+ +
Sbjct: 359 PSLVLRFDGGAEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLL 418
Query: 359 YDFDSQMVSFKPTDCTK 375
YD D + SF P DC K
Sbjct: 419 YDLDGGIFSFAPADCAK 435
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 126/375 (33%), Positives = 175/375 (46%), Gaps = 28/375 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPAS 71
QS + G Y++ +GTP D+ I DTGSDL W QC PCV+ CY Q +PI++P++
Sbjct: 143 AQSGLPLGTGNYIVNVGLGTPKK-DLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPST 201
Query: 72 SSSYKELSCQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
S +Y +SC S C L + + CSS C Y Y DSS T G A +++T N
Sbjct: 202 SKTYSNISCTSAACSSLKSATGNSPGCSSSN-CVYGIQYGDSSFTIGFFAKDKLTL-TQN 259
Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
+ FD +FGCG NN G+F + GL+GLGR LS+ Q + G FSYCL T
Sbjct: 260 DVFDGFMFGCGQNNKGLFGKTA-GLIGLGRDPLSIVQQTAQKFG-KYFSYCL---PTSRG 314
Query: 187 ITSKMYFGNGSEVSGG-----GVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
+ FGNG+ V G+ T S + YYF+ + GISVG + S + + N
Sbjct: 315 SNGHLTFGNGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQN 374
Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
+ ID+G T LP Y L+ + + P CY +
Sbjct: 375 A-------GTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSLLDTCYDLSNYTS 427
Query: 302 IA-PILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGY 359
I+ P ++ +F+G A V L I V FA D +GIFGN Q L + Y
Sbjct: 428 ISIPKISFNFNGNANVELDPNGILITNGASQVCLAFAGNGDDDSIGIFGNIQQQTLEVVY 487
Query: 360 DFDSQMVSFKPTDCT 374
D + F C+
Sbjct: 488 DVAGGQLGFGYKGCS 502
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 133/377 (35%), Positives = 187/377 (49%), Gaps = 45/377 (11%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
+ GEY+M IG+PP ++DTGSDL+W QC PC+ C +Q P + PA S+SY L
Sbjct: 84 SEGEYLMDVGIGSPPRY-FSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLP 142
Query: 80 CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN--FFDNVVFGCG 137
C S C+ L + C Q C Y Y DS+ + GVLA E TFG ++ V FGCG
Sbjct: 143 CSSAMCNALYSPLC-FQNACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSFGCG 201
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG--- 194
+ N G N G+VG GR LSL +SQLG+ +FSYCL F S TS++YFG
Sbjct: 202 NMNAGTL-FNGSGMVGFGRGALSL----VSQLGSPRFSYCLTSFM--SPATSRLYFGAYA 254
Query: 195 --NGSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK-- 248
N + S G V ++ +V+ T YF+ + GISV + L+P S AI++
Sbjct: 255 TLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISV-----AGDLLPIDPSVFAINETD 309
Query: 249 --GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS------QLCYKTP--- 297
G + ID+G T L + Y ++ + L PR + C+K P
Sbjct: 310 GTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGL-----PRANATPSDTFDTCFKWPPPP 364
Query: 298 -SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLF 356
M + P + HFD GA + L + + G C AM P D D I G+F +
Sbjct: 365 RRMVTL-PEMVLHFD-GADMELPLENYMVMDGGTGNLCLAMLPSD-DGSIIGSFQHQNFH 421
Query: 357 IGYDFDSQMVSFKPTDC 373
+ YD ++ ++SF P C
Sbjct: 422 MLYDLENSLLSFVPAPC 438
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 125/374 (33%), Positives = 179/374 (47%), Gaps = 22/374 (5%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S ++ +GEY K +GTP + ++DTGSD++W+QC PC +CY Q P+++P S
Sbjct: 129 VVSGLAQGSGEYFTKIGVGTPSTPALM-VLDTGSDVVWLQCAPCRRCYDQSGPVFDPRRS 187
Query: 73 SSYKELSCQSEQCHLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
SSY + C + C LD+ C ++ C Y Y D S+T G ATE +TF
Sbjct: 188 SSYGAVDCAAPLCRRLDSGGCDLRRRACLYQVAYGDGSVTAGDFATETLTFAGGAR-VAR 246
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
V GCGH+N G+F L+GLGR LS +QI + G FSYCLV + SS +
Sbjct: 247 VALGCGHDNEGLFVAAAG-LLGLGRGSLSFPTQISRRYG-KSFSYCLVDRTSSSSSGAAS 304
Query: 192 YFGNGSEVSGGGVVSTS-----LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
+ + G S + + + +T+Y+V L GISVG +
Sbjct: 305 RSRSSTVTFGPPSASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPST 364
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQL---CYKTPSMA 300
+G + +D+G T L + Y+ L + R A ++L+P G L CY
Sbjct: 365 GRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGLRLSPG-----GFSLFDTCYDLGGRK 419
Query: 301 GI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
+ P ++ HF GGA+ L + IP G FCFA DG V I GN Q + +
Sbjct: 420 VVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVF 479
Query: 360 DFDSQMVSFKPTDC 373
D D Q V F P C
Sbjct: 480 DGDGQRVGFAPKGC 493
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 133/377 (35%), Positives = 187/377 (49%), Gaps = 45/377 (11%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
+ GEY+M IG+PP ++DTGSDL+W QC PC+ C +Q P + PA S+SY L
Sbjct: 81 SEGEYLMDVGIGSPPRY-FSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLP 139
Query: 80 CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN--FFDNVVFGCG 137
C S C+ L + C Q C Y Y DS+ + GVLA E TFG ++ V FGCG
Sbjct: 140 CSSAMCNALYSPLC-FQNACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSFGCG 198
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG--- 194
+ N G N G+VG GR LSL +SQLG+ +FSYCL F S TS++YFG
Sbjct: 199 NMNAGTL-FNGSGMVGFGRGALSL----VSQLGSPRFSYCLTSFM--SPATSRLYFGAYA 251
Query: 195 --NGSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK-- 248
N + S G V ++ +V+ T YF+ + GISV + L+P S AI++
Sbjct: 252 TLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISV-----AGDLLPIDPSVFAINETD 306
Query: 249 --GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS------QLCYKTP--- 297
G + ID+G T L + Y ++ + L PR + C+K P
Sbjct: 307 GTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGL-----PRANATPSDTFDTCFKWPPPP 361
Query: 298 -SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLF 356
M + P + HFD GA + L + + G C AM P D D I G+F +
Sbjct: 362 RRMVTL-PEMVLHFD-GADMELPLENYMVMDGGTGNLCLAMLPSD-DGSIIGSFQHQNFH 418
Query: 357 IGYDFDSQMVSFKPTDC 373
+ YD ++ ++SF P C
Sbjct: 419 MLYDLENSLLSFVPAPC 435
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 124/366 (33%), Positives = 166/366 (45%), Gaps = 20/366 (5%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S + +GEY ++ +G+PP + Y ++D+GSD++WVQC PC QCY Q P++NPA S
Sbjct: 123 VVSGMEQGSGEYFVRIGVGSPPR-NQYVVIDSGSDIIWVQCEPCTQCYHQSDPVFNPADS 181
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
SSY +SC S C +D C + C Y Y D S TKG LA E +TFG + NV
Sbjct: 182 SSYAGVSCASTVCSHVDNAGCHEGR-CRYEVSYGDGSYTKGTLALETLTFGRT--LIRNV 238
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH+N G+F GL+GLG +S Q+ Q G FSYCLV SS +
Sbjct: 239 AIGCGHHNQGMF-VGAAGLLGLGSGPMSFVGQLGGQAGGT-FSYCLVSRGIQSS--GLLQ 294
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
FG + G V + YY G S+ + + G G +
Sbjct: 295 FGREAVPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELG---DGGVV 351
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGI-APILT 307
+DTG T LP Y E R+A PR CY + P ++
Sbjct: 352 MDTGTAVTRLPTAAY----EAFRDAFIAQTTNLPRASGVSIFDTCYDLFGFVSVRVPTVS 407
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
+F GG + L + IP G FCFA P + I GN Q + I D + V
Sbjct: 408 FYFSGGPILTLPARNFLIPVDDVGSFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGFVG 467
Query: 368 FKPTDC 373
F P C
Sbjct: 468 FGPNVC 473
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 174 bits (441), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 122/378 (32%), Positives = 178/378 (47%), Gaps = 25/378 (6%)
Query: 7 FYPNNVVQS---NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV 63
+ P ++V V +GEY ++ +G+PP D Y +VD+GSD++WVQC PC QCY Q
Sbjct: 110 YLPEDLVSEVVPGVDDGSGEYFVRVGVGSPPT-DQYLVVDSGSDVIWVQCRPCEQCYAQT 168
Query: 64 KPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQL---CNYTYGYADSSLTKGVLATERI 120
P+++PA+SSS+ +SC S C L C C+Y+ Y D S TKG LA E +
Sbjct: 169 DPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETL 228
Query: 121 TFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
T G + V GCGH N+G+F GL+GLG +SL Q+ G FSYCL
Sbjct: 229 TLGGTA--VQGVAIGCGHRNSGLF-VGAAGLLGLGWGAMSLIGQLGGAAG-GVFSYCLAS 284
Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
+ + G V G V + + + ++Y+V L GI VG + +P
Sbjct: 285 RGAGGA--GSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGG-----ERLPLQ 337
Query: 241 NSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP 297
+ +++ G + +DTG T LP++ Y L A+ P CY
Sbjct: 338 DGLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLS 397
Query: 298 SMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEG-VFCFAMQPIDGDVGIFGNFAQSDL 355
A + P ++ +FD GA + L + + V G VFC A P + I GN Q +
Sbjct: 398 GYASVRVPTVSFYFDQGAVLTLPARNLLV--EVGGAVFCLAFAPSSSGISILGNIQQEGI 455
Query: 356 FIGYDFDSQMVSFKPTDC 373
I D + V F P C
Sbjct: 456 QITVDSANGYVGFGPNTC 473
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 174 bits (440), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 123/362 (33%), Positives = 175/362 (48%), Gaps = 21/362 (5%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ +IGTPP + +DTGS L+W QC PC C+ Q P Y+ + SS++ SC S
Sbjct: 90 EYLLHLAIGTPPQ-PVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDS 148
Query: 83 EQCHLLDTVS-CSSQ--QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
QC L +V+ C +Q Q C Y+Y Y D S T G L E ++F + VVFGCG N
Sbjct: 149 TQCKLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSF-VAGASVPGVVFGCGLN 207
Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
NTG+F NE G+ G GR LSL SQL FS+C T
Sbjct: 208 NTGIFRSNETGIAGFGRGPLSLP----SQLKVGNFSHCFTAVSGRKPSTVLFDLPADLYK 263
Query: 200 SGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN--MFIDT 255
+G G V T+ + K T+Y+++L+GI+VG S +P S+ A+ G ID+
Sbjct: 264 NGRGTVQTTPLIKNPAHPTFYYLSLKGITVG-----STRLPVPESAFALKNGTGGTIIDS 318
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM--AGIAPILTAHFDGG 313
G T LP Y + ++ +KL G LC+ P + A P L HF+ G
Sbjct: 319 GTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFE-G 377
Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
A + L + G + I+G++ I GNF Q ++ + YD + +SF C
Sbjct: 378 ATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 437
Query: 374 TK 375
K
Sbjct: 438 DK 439
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 174 bits (440), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 127/380 (33%), Positives = 181/380 (47%), Gaps = 38/380 (10%)
Query: 7 FYPNNVVQS---NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV 63
+ P ++V V +GEY ++ +G+PP D Y +VD+GSD++WVQC PC QCY Q
Sbjct: 110 YLPEDLVSEVVPGVDDGSGEYFVRVGVGSPPT-DQYLVVDSGSDVIWVQCRPCEQCYAQT 168
Query: 64 KPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQL---CNYTYGYADSSLTKGVLATERI 120
P+++PA+SSS+ +SC S C L C C+Y+ Y D S TKG LA E +
Sbjct: 169 DPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETL 228
Query: 121 TFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
T G + V GCGH N+G+F GL+GLG +SL Q+ G FSYCL
Sbjct: 229 TLGGTA--VQGVAIGCGHRNSGLF-VGAAGLLGLGWGAMSLVGQLGGAAG-GVFSYCL-- 282
Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDK--TYYFVTLEGISVGNLSNSSKLIP 238
S+ G GS V G T V + + ++Y+V L GI VG + +P
Sbjct: 283 -------ASRGAGGAGSLVLG----RTEAVPRGRRASSFYYVGLTGIGVGG-----ERLP 326
Query: 239 YYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK 295
+S +++ G + +DTG T LP++ Y L A+ P CY
Sbjct: 327 LQDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYD 386
Query: 296 TPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEG-VFCFAMQPIDGDVGIFGNFAQS 353
A + P ++ +FD GA + L + + V G VFC A P + I GN Q
Sbjct: 387 LSGYASVRVPTVSFYFDQGAVLTLPARNLLV--EVGGAVFCLAFAPSSSGISILGNIQQE 444
Query: 354 DLFIGYDFDSQMVSFKPTDC 373
+ I D + V F P C
Sbjct: 445 GIQITVDSANGYVGFGPNTC 464
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 174 bits (440), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 125/370 (33%), Positives = 174/370 (47%), Gaps = 20/370 (5%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
S ++ GEY +GTP D+Y +VDTGSD+ W+QC PC CYKQ ++NP+SSSS
Sbjct: 7 SGLAFGTGEYFAVVGVGTP-RRDMYLVVDTGSDITWLQCAPCTNCYKQKDALFNPSSSSS 65
Query: 75 YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERI----TFGNSNNFFD 130
+K L C S C LD + C S + C Y Y D S T G L T+ + FG
Sbjct: 66 FKVLDCSSSLCLNLDVMGCLSNK-CLYQADYGDGSFTMGELVTDNVVLDDAFGPGQVVLT 124
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
N+ GCGH+N G F G++GLGR LS + L N FSYCL +D + S
Sbjct: 125 NIPLGCGHDNEGTFG-TAAGILGLGRGPLSFPNN-LDASTRNIFSYCLPDRESDPNHKST 182
Query: 191 MYFGNGS---EVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGA 245
+ FG+ + +G L + TYY+V + GISVG L+N + +S G
Sbjct: 183 LVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLDSHG- 241
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA-IKLTPYQDPRLGSQLCYKTPSMAGIA- 303
G D+G T L Y + + R A + LT D ++ CY M I+
Sbjct: 242 --NGGTIFDSGTTITRLEARAYTAVRDAFRAATMHLTSAADFKI-FDTCYDFTGMNSISV 298
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
P +T HF G + L ++ +P +FCFA G + GN Q + YD
Sbjct: 299 PTVTFHFQGDVDMRLPPSNYIVPVSNNNIFCFAFAASMGP-SVIGNVQQQSFRVIYDNVH 357
Query: 364 QMVSFKPTDC 373
+ + P C
Sbjct: 358 KQIGLLPDQC 367
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 174 bits (440), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 123/362 (33%), Positives = 175/362 (48%), Gaps = 21/362 (5%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ +IGTPP + +DTGS L+W QC PC C+ Q P Y+ + SS++ SC S
Sbjct: 34 EYLLHLAIGTPPQ-PVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDS 92
Query: 83 EQCHLLDTVS-CSSQ--QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
QC L +V+ C +Q Q C Y+Y Y D S T G L E ++F + VVFGCG N
Sbjct: 93 TQCKLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSF-VAGASVPGVVFGCGLN 151
Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
NTG+F NE G+ G GR LSL SQL FS+C T
Sbjct: 152 NTGIFRSNETGIAGFGRGPLSLP----SQLKVGNFSHCFTAVSGRKPSTVLFDLPADLYK 207
Query: 200 SGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN--MFIDT 255
+G G V T+ + K T+Y+++L+GI+VG S +P S+ A+ G ID+
Sbjct: 208 NGRGTVQTTPLIKNPAHPTFYYLSLKGITVG-----STRLPVPESAFALKNGTGGTIIDS 262
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM--AGIAPILTAHFDGG 313
G T LP Y + ++ +KL G LC+ P + A P L HF+ G
Sbjct: 263 GTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFE-G 321
Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
A + L + G + I+G++ I GNF Q ++ + YD + +SF C
Sbjct: 322 ATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKC 381
Query: 374 TK 375
K
Sbjct: 382 DK 383
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 174 bits (440), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 121/372 (32%), Positives = 181/372 (48%), Gaps = 22/372 (5%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S + A+GEY +GTPP + ++DTGSD++W+QC PCV CY+Q+ P+Y+P S
Sbjct: 88 VISGLPFASGEYFASVGVGTPPTPALL-VIDTGSDVVWLQCKPCVHCYRQLSPLYDPRGS 146
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+Y + C QC T ++ C Y Y D+S T G LAT+R+ F N + NV
Sbjct: 147 STYAQTPCSPPQCRNPQTCDGTTGG-CGYRIVYGDASSTSGNLATDRLVFSNDTS-VGNV 204
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH+N G+F + GL+G+ R S A+Q+ G F+YCL S +S +
Sbjct: 205 TLGCGHDNEGLFG-SAAGLLGVARGNNSFATQVADSYG-RYFAYCLGDRTRSGSSSSYLV 262
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG-----NLSNSS-KLIPYYNSSGAI 246
FG + V + + + Y+V + G SVG SN+S L P A
Sbjct: 263 FGRTAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDP------AT 316
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQV-RNAIKLTPYQDPRLGS--QLCYKTPSMA-GI 302
+G + +D+G T +D Y L + A K+ + R S CY +A
Sbjct: 317 GRGGVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVFDACYDLRGVAVAD 376
Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYDF 361
AP + HF GGA V L + +P CFA++ D + + GN Q + +D
Sbjct: 377 APGVVLHFAGGADVALPPENYLVPEESGRYHCFALEAAGHDGLSVIGNVLQQRFRVVFDV 436
Query: 362 DSQMVSFKPTDC 373
+++ V F+P C
Sbjct: 437 ENERVGFEPNGC 448
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 174 bits (440), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 120/368 (32%), Positives = 168/368 (45%), Gaps = 19/368 (5%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S V +GEY + IG+P +Y ++DTGSD+ W+QC PC CY Q P+++PA S
Sbjct: 185 VVSGVGQGSGEYFSRIGIGSP-ARQLYMVLDTGSDVTWLQCAPCADCYAQSDPLFDPALS 243
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQL-----CNYTYGYADSSLTKGVLATERITF-GNSN 126
SSY + C S C LD +C + C Y Y D S T G ATE +T G+ +
Sbjct: 244 SSYATVPCDSPHCRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTLGGDGS 303
Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
+V GCGH+N G+F L+ LG LS SQI A +FSYCLV DS
Sbjct: 304 AAVHDVAIGCGHDNEGLFVGAAG-LLALGGGPLSFPSQI----SATEFSYCLV--DRDSP 356
Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
S + FG + V + + S T+Y+V L GISVG S + P +
Sbjct: 357 SASTLQFGASDSST---VTAPLMRSPRSNTFYYVALNGISVGG-ETLSDIPPAAFAMDEQ 412
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-API 305
G + +D+G T L Y+ L + + P CY + + P
Sbjct: 413 GSGGVIVDSGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSLFDTCYDLAGRSSVQVPA 472
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
++ F+GG ++ L + IP G +C A G V I GN Q + + +D
Sbjct: 473 VSLRFEGGGELKLPAKNYLIPVDGAGTYCLAFAATGGAVSIVGNVQQQGIRVSFDTAKNT 532
Query: 366 VSFKPTDC 373
V F P C
Sbjct: 533 VGFSPNKC 540
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 126/375 (33%), Positives = 175/375 (46%), Gaps = 28/375 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPAS 71
QS + G Y++ +GTP D+ I DTGSDL W QC PCV+ CY Q +PI++P++
Sbjct: 143 AQSGLPLGTGNYIVNVGLGTPKK-DLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSA 201
Query: 72 SSSYKELSCQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
S +Y +SC S C L + + CSS C Y Y DSS T G A + +T N
Sbjct: 202 SKTYSNISCTSTACSGLKSATGNSPGCSSSN-CVYGIQYGDSSFTVGFFAKDTLTL-TQN 259
Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
+ FD +FGCG NN G+F + GL+GLGR LS+ Q + G FSYCL T
Sbjct: 260 DVFDGFMFGCGQNNRGLFGKTA-GLIGLGRDPLSIVQQTAQKFG-KYFSYCL---PTSRG 314
Query: 187 ITSKMYFGNG-----SEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
+ FGNG S+ G+ T S + T+YF+ + GISVG + S + + N
Sbjct: 315 SNGHLTFGNGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQN 374
Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
+ ID+G T LP Y L+ + + P CY +
Sbjct: 375 A-------GTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALSLLDTCYDLSNYTS 427
Query: 302 IA-PILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGY 359
I+ P ++ +F+G A V L I V FA D +GIFGN Q L + Y
Sbjct: 428 ISIPKISFNFNGNANVDLEPNGILITNGASQVCLAFAGNGDDDTIGIFGNIQQQTLEVVY 487
Query: 360 DFDSQMVSFKPTDCT 374
D + F C+
Sbjct: 488 DVAGGQLGFGYKGCS 502
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 127/373 (34%), Positives = 177/373 (47%), Gaps = 36/373 (9%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ +IGTPP + +DTGSDL+W QC PC C+ Q P ++P++SS+ SC S
Sbjct: 34 EYLVHLAIGTPPQ-PVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDS 92
Query: 83 EQCHLLDTVSCSS-----QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
C L SC S Q C YTY Y D S+T G L ++ TF + V FGCG
Sbjct: 93 TLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCG 152
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL------VPFHTDSSITSKM 191
N GVF NE G+ G GR LSL SQL FS+C +P + + +
Sbjct: 153 LFNNGVFKSNETGIAGFGRGPLSLP----SQLKVGNFSHCFTTITGAIPSTVLLDLPADL 208
Query: 192 YFGNGSEVSGGGVVSTSLV----SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
F NG G V +T L+ ++ + T Y+++L+GI+VG S +P S+ A++
Sbjct: 209 -FSNGQ----GAVQTTPLIQYAKNEANPTLYYLSLKGITVG-----STRLPVPESAFALT 258
Query: 248 KGN--MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-AP 304
G ID+G T LP Y + ++ IKL G C+ PS A P
Sbjct: 259 NGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVP 318
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEG--VFCFAMQPIDGDVGIFGNFAQSDLFIGYDFD 362
L HF+G F P G + C A+ D + I GNF Q ++ + YD
Sbjct: 319 KLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGD-ETTIIGNFQQQNMHVLYDLQ 377
Query: 363 SQMVSFKPTDCTK 375
+ M+SF C K
Sbjct: 378 NNMLSFVAAQCDK 390
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 119/363 (32%), Positives = 171/363 (47%), Gaps = 16/363 (4%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S V +GEY + +G+P +Y ++DTGSD+ WVQC PC CY+Q P+++P+ S
Sbjct: 152 VVSGVGLGSGEYFSRVGVGSP-ARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLS 210
Query: 73 SSYKELSCQSEQCHLLDTVSC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
+SY ++C + +CH LD +C +S C Y Y D S T G ATE +T G+S +
Sbjct: 211 TSYASVACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDSAP-VSS 269
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
V GCGH+N G+F L+ LG LS SQI A FSYCLV DS +S +
Sbjct: 270 VAIGCGHDNEGLFVGAAG-LLALGGGPLSFPSQI----SATTFSYCLV--DRDSPSSSTL 322
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
FG+ ++ V + + S T+Y+V L GISVG S + P + G +
Sbjct: 323 QFGDAADAE---VTAPLIRSPRTSTFYYVGLSGISVGGQILS--IPPSAFAMDGTGAGGV 377
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHF 310
+D+G T L Y L + + P CY + P ++ F
Sbjct: 378 IVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRF 437
Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
GG ++ L + IP G +C A P + V I GN Q + +D V F
Sbjct: 438 AGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTS 497
Query: 371 TDC 373
C
Sbjct: 498 NKC 500
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 131/376 (34%), Positives = 190/376 (50%), Gaps = 36/376 (9%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
V + GEY+M IGTPP I+DTGSDL+W QC PC+ C Q P ++PA S SY
Sbjct: 82 VLASEGEYLMSMGIGTPPRY-YSAILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSYA 140
Query: 77 ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD--NVVF 134
+L C S C+ L C + +C Y Y Y DS+ T GVL+ E TFG ++ + F
Sbjct: 141 KLPCNSPMCNALYYPLC-YRNVCVYQYFYGDSANTAGVLSNETFTFGTNDTRVTVPRIAF 199
Query: 135 GCGHNNTG-VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
GCG+ N G +FN + G+VG GR LSL +SQLG+ +FSYCL F S + S++YF
Sbjct: 200 GCGNLNAGSLFNGS--GMVGFGRGPLSL----VSQLGSPRFSYCLTSFM--SPVPSRLYF 251
Query: 194 G-----NGSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
G N + S G V ++ +V+ T Y++ + GISVG +L+P S AI
Sbjct: 252 GAYATLNSTSASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGG-----ELLPIDPSVFAI 306
Query: 247 SK----GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CY---KTP 297
+ G + ID+G+ T L + Y+ + + + + L L L C+ P
Sbjct: 307 NDADGTGGVIIDSGSTITYLARAAYDMVHQAFADQVGLPLTNATSLADVLDTCFVWPPPP 366
Query: 298 SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFI 357
P L HF+ GA + L + + G C A+ D D I G+F + +
Sbjct: 367 RKIVTMPELAFHFE-GANMELPLENYMLIDGDTGNLCLAIAASD-DGSIIGSFQHQNFHV 424
Query: 358 GYDFDSQMVSFKPTDC 373
YD ++ ++SF P C
Sbjct: 425 LYDNENSLLSFTPATC 440
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 131/344 (38%), Positives = 178/344 (51%), Gaps = 42/344 (12%)
Query: 18 STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKE 77
S G+Y+M+FSIG PPLL I+ VDTGSDLMWV+C PC C P+Y+PA S S +
Sbjct: 81 SQKGGKYIMQFSIGEPPLL-IWAEVDTGSDLMWVKCSPCNGCNPPPSPLYDPARSRSSGK 139
Query: 78 LSCQSEQCHLLDTVSCSSQQ------LC--NYTYGYADSSLTKGVLATERITFGNSNNFF 129
L C S+ C L S Q LC +Y YG++ T+GVL TE TFG+
Sbjct: 140 LPCSSQLCQALGRGRIISDQCSDDPPLCGYHYAYGHSGDHSTQGVLGTETFTFGD-GYVA 198
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
+NV FG G GLVGLGR LSL +SQLGA +F+YCL D ++ S
Sbjct: 199 NNVSFGRSDTIDGSQFGGTAGLVGLGRGHLSL----VSQLGAGRFAYCLA---ADPNVYS 251
Query: 190 KMYFGN--GSEVSGGGVVSTSLVS--KEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
+ FG+ + S G V ST LV+ K D+ T+Y+V L+GISVG +P + +
Sbjct: 252 TILFGSLAALDTSAGDVSSTPLVTNPKPDRDTHYYVNLQGISVGG-----SRLPIKDGTF 306
Query: 245 AIS---KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
AI+ G +F D+GA T L Y + + + + I+ Y G C+ +
Sbjct: 307 AINSDGSGGVFFDSGAIDTSLKDAAYQVVRQAITSEIQRLGYD---AGDDTCFVAANQQA 363
Query: 302 IA--PILTAHFDGGAKVPL-----IHTSTFIPPPVEGVFCFAMQ 338
+A P L HFD GA + L + TST P E + C A++
Sbjct: 364 VAQMPPLVLHFDDGADMSLNGRNYLKTST--KGPSEVLVCMAIK 405
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 122/384 (31%), Positives = 189/384 (49%), Gaps = 30/384 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V + ++GEY++ F+IGTP + +DTGSDL+W QC PC C+ Q P+++P+ S
Sbjct: 76 VTATAVPSSGEYLIHFNIGTPRPQRVALTMDTGSDLVWTQCTPCPVCFDQPFPLFDPSVS 135
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQL----CNYTYGYADSSLTKGVLATERITFGNSNN- 127
S+++ ++C C +S S+ L C Y Y D S+T G + + TF + N
Sbjct: 136 STFRAVACPDPICRPSSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFTFMSPNGE 195
Query: 128 -----FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPF- 181
+ FGCG NTGVF NE G+ G GR LSL SQL +FSYCL
Sbjct: 196 GAPPVAVSGLAFGCGDYNTGVFASNESGIAGFGRGPLSLP----SQLRVGRFSYCLTSHD 251
Query: 182 HTDSSITSKMYFG---NGSEV-SGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKL 236
T+S+ TS ++ G NG S G ST ++ S T+Y+++LEGI+VG
Sbjct: 252 ETESNKTSAVFLGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKTR----- 306
Query: 237 IPYYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD-PRLGSQL 292
+P +S A+ K G ID+G T P + +L+ + + L Y + +G+ L
Sbjct: 307 LPVDSSVFALKKDGSGGTVIDSGTGVTTFPAAVFEQLKNEFVAQLPLPRYDNTSEVGNLL 366
Query: 293 CYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVE-GVFCFAMQPIDGDVGIFGNFA 351
C++ P P+ F + + +IP + GV C + + D+ + GNF
Sbjct: 367 CFQRPKGGKQVPVPKLIFHLASADMDLPRENYIPEDTDSGVMCLMINGAEVDMVLIGNFQ 426
Query: 352 QSDLFIGYDFDSQMVSFKPTDCTK 375
Q ++ I YD ++ + F C K
Sbjct: 427 QQNMHIVYDVENSKLLFASAQCDK 450
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 123/374 (32%), Positives = 182/374 (48%), Gaps = 40/374 (10%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ ++GTPP + ++DTGSDL+W QC PC C Q PI++P +SSSY+ + C
Sbjct: 103 EYLVDLAVGTPPQ-PVSALLDTGSDLIWTQCAPCASCLPQPDPIFSPGASSSYEPMRCAG 161
Query: 83 EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF------GNSNNFFDNVVFGC 136
E C+ + SC C Y Y Y D + T+GV ATER TF G + + FGC
Sbjct: 162 ELCNDILHHSCQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPLGFGC 221
Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN- 195
G N G N N G+VG GR LSL +SQL +FSYCL P+ S S + FG+
Sbjct: 222 GTMNKGSLN-NGSGIVGFGRAPLSL----VSQLAIRRFSYCLTPYA--SGRKSTLLFGSL 274
Query: 196 --GSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVG----NLSNSSKLIPYYNSSGAIS 247
G + V T+ L S+++ T+Y+V G++VG + S+ + S GAI
Sbjct: 275 RGGVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDGSGGAI- 333
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ--LCY-----KTPSMA 300
+D+G TL P + R+ ++L + G +C+ + P A
Sbjct: 334 -----VDSGTALTLFPAPVLAEVVRAFRSQLRLPFAANGSSGPDDGVCFAAAASRVPRPA 388
Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG-IFGNFAQSDLFIGY 359
+ P + H GA + L + + +G C + GD G GNF Q D+ + Y
Sbjct: 389 -VVPRMVFHLQ-GADLDLPRRNYVLDDQRKGNLCLLLAD-SGDSGTTIGNFVQQDMRVLY 445
Query: 360 DFDSQMVSFKPTDC 373
D ++ +SF P C
Sbjct: 446 DLEADTLSFAPAQC 459
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 121/371 (32%), Positives = 178/371 (47%), Gaps = 34/371 (9%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EYV+ ++GTPP I ++DTGSDL+W QC C C +Q P+++P SSSY+ + C
Sbjct: 97 EYVLDLAVGTPPQ-PITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAG 155
Query: 83 EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV--FGCGHNN 140
+ C + SC C Y Y Y D + T G ATER TF +S+ +V FGCG N
Sbjct: 156 QLCGDILHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGCGTMN 215
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV- 199
G N N G+VG GR LSL +SQL +FSYCL P+ SS S + FG+ ++V
Sbjct: 216 VGSLN-NASGIVGFGRDPLSL----VSQLSIRRFSYCLTPYA--SSRKSTLQFGSLADVG 268
Query: 200 ---SGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS---KGNM 251
G V T+ L S ++ T+Y+V G++VG ++ + S+ A+ G +
Sbjct: 269 LYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVG-----ARRLRIPASAFALRPDGSGGV 323
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY-------KTPSMAG--I 302
ID+G TL P + R+ ++L +C+ MA
Sbjct: 324 IIDSGTALTLFPAAVLAEVVRAFRSQLRLPFANGSSPDDGVCFAAPAVAAGGGRMARQVA 383
Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFD 362
P + HF GA + L + + G C + D GNF Q D+ + YD +
Sbjct: 384 VPRMVFHFQ-GADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLE 442
Query: 363 SQMVSFKPTDC 373
+ +SF P +C
Sbjct: 443 RETLSFAPVEC 453
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 124/366 (33%), Positives = 173/366 (47%), Gaps = 22/366 (6%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S V +GEY + +G P +Y ++DTGSD+ W+QC PC CY Q P+Y+P+ S
Sbjct: 152 VVSGVGQGSGEYFSRVGVGRP-ARQLYMVLDTGSDVTWLQCQPCADCYAQSDPVYDPSVS 210
Query: 73 SSYKELSCQSEQCHLLDTVSC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
+SY + C S +C LD +C +S C Y Y D S T G ATE +T G+S N
Sbjct: 211 TSYATVGCDSPRCRDLDAAACRNSTGSCLYEVAYGDGSYTVGDFATETLTLGDSAP-VSN 269
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
V GCGH+N G+F L+ LG LS SQI A FSYCLV DS +S +
Sbjct: 270 VAIGCGHDNEGLFVGAAG-LLALGGGPLSFPSQI----SATTFSYCLV--DRDSPSSSTL 322
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS---K 248
FG+ + + V + + S T+Y+V L GISVG + S IP +S+ A+
Sbjct: 323 QFGDSEQPA---VTAPLIRSPRTNTFYYVALSGISVGGEALS---IP--SSAFAMDDAGS 374
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILT 307
G + +D+G T L Y L E + P CY + + P +
Sbjct: 375 GGVIVDSGTAVTRLQSGAYGALREAFVQGTQSLPRASGVSLFDTCYDLAGRSSVQVPAVA 434
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
F+GG ++ L + IP G +C A G V I GN Q + + +D V
Sbjct: 435 LWFEGGGELKLPAKNYLIPVDAAGTYCLAFAGTSGPVSIIGNVQQQGVRVSFDTAKNTVG 494
Query: 368 FKPTDC 373
F C
Sbjct: 495 FTADKC 500
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 117/363 (32%), Positives = 170/363 (46%), Gaps = 16/363 (4%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S V +GEY + +G+P +Y ++DTGSD+ WVQC PC CY+Q P+++P+ S
Sbjct: 156 VVSGVGLGSGEYFSRVGVGSP-ARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLS 214
Query: 73 SSYKELSCQSEQCHLLDTVSC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
+SY ++C + +CH LD +C +S C Y Y D S T G ATE +T G+S +
Sbjct: 215 TSYASVACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDSAP-VSS 273
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
V GCGH+N G+F L G LS SQI A FSYCLV DS +S +
Sbjct: 274 VAIGCGHDNEGLFVGAAGLLALGGGP-LSFPSQI----SATTFSYCLV--DRDSPSSSTL 326
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
FG+ ++ V + + S T+Y+V L G+SVG S + P + + G +
Sbjct: 327 QFGDAADAE---VTAPLIRSPRTSTFYYVGLSGLSVGGQILS--IPPSAFAMDSTGAGGV 381
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHF 310
+D+G T L Y L + + P CY + P ++ F
Sbjct: 382 IVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRF 441
Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
GG ++ L + IP G +C A P + V I GN Q + +D V F
Sbjct: 442 AGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTT 501
Query: 371 TDC 373
C
Sbjct: 502 NKC 504
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 134/381 (35%), Positives = 189/381 (49%), Gaps = 26/381 (6%)
Query: 2 SPATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK 61
+P T + ++VV S +S +GEY + +GTP +Y ++DTGSD++W+QC PC +CY
Sbjct: 121 APRTGGFSSSVV-SGLSQGSGEYFTRLGVGTPARY-VYMVLDTGSDIVWLQCAPCRRCYS 178
Query: 62 QVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERI 120
Q PI++P S +Y + C S C LD+ C++++ C Y Y D S T G +TE +
Sbjct: 179 QSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETL 238
Query: 121 TFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
TF N V GCGH+N G+F L+GLG+ +LS Q + KFSYCLV
Sbjct: 239 TF--RRNRVKGVALGCGHDNEGLFVGAAG-LLGLGKGKLSFPGQTGHRFN-QKFSYCLVD 294
Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
+ SS S + FGN + VS + L + + T+Y+V L GISVG +P
Sbjct: 295 -RSASSKPSSVVFGNAA-VSRIARFTPLLSNPKLDTFYYVELLGISVGGTR-----VPGV 347
Query: 241 NSS----GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLC 293
+S I G + ID+G T L + Y + + R A+K P D L C
Sbjct: 348 AASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKALKRAP--DFSL-FDTC 404
Query: 294 YKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQ 352
+ +M + P + HF GA V L T+ IP G FCFA G + I GN Q
Sbjct: 405 FDLSNMNEVKVPTVVLHFR-GADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQ 463
Query: 353 SDLFIGYDFDSQMVSFKPTDC 373
+ YD S V F P C
Sbjct: 464 QGFRVVYDLASSRVGFAPGGC 484
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 110/302 (36%), Positives = 167/302 (55%), Gaps = 27/302 (8%)
Query: 9 PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
P + V+ ++GEY++ +IGTPPL I+DTGSDL+W QC PC+ C Q P ++
Sbjct: 74 PITAARVLVTASSGEYLVDLAIGTPPLY-YTAIMDTGSDLIWTQCAPCLLCADQPTPYFD 132
Query: 69 PASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN- 127
S++Y+ L C+S +C L + SC +++C Y Y Y D++ T GVLA E TFG +N+
Sbjct: 133 VKKSATYRALPCRSSRCASLSSPSC-FKKMCVYQYYYGDTASTAGVLANETFTFGAANST 191
Query: 128 --FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
N+ FGCG N G N G+VG GR LSL +SQLG ++FSYCL + S
Sbjct: 192 KVRATNIAFGCGSLNAGDL-ANSSGMVGFGRGPLSL----VSQLGPSRFSYCLTSYL--S 244
Query: 186 SITSKMYFG-----NGSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIP 238
+ S++YFG + + S G V ++ +++ YF++L+ IS+G +KL+P
Sbjct: 245 ATPSRLYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLG-----TKLLP 299
Query: 239 YYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK 295
AI+ G + ID+G T L +D Y + + +AI LT D +G C++
Sbjct: 300 IDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIPLTAMNDTDIGLDTCFQ 359
Query: 296 TP 297
P
Sbjct: 360 WP 361
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 120/369 (32%), Positives = 178/369 (48%), Gaps = 30/369 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S + +GEY + +G P Y ++DTGSD+ W+QC PC CY+Q PI++P +S
Sbjct: 146 VSSGTAQGSGEYFSRVGVGQPSK-PFYMVLDTGSDVNWLQCKPCSDCYQQSDPIFDPTAS 204
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
SSY L+C ++QC L+ +C + + C Y Y D S T G TE ++FG + + V
Sbjct: 205 SSYNPLTCDAQQCQDLEMSACRNGK-CLYQVSYGDGSFTVGEYVTETVSFGAGS--VNRV 261
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH+N G+F + L + SQ+ A FSYCLV DS +S +
Sbjct: 262 AIGCGHDNEGLFVGSAGLL-----GLGGGPLSLTSQIKATSFSYCLV--DRDSGKSSTLE 314
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---G 249
F S G VV+ L +++ T+Y+V L G+SVG +++ + A+ + G
Sbjct: 315 F--NSPRPGDSVVAPLLKNQKVNTFYYVELTGVSVGG-----EIVTVPPETFAVDQSGAG 367
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKL-TPYQDPRLGSQL---CYKTPSMAGI-AP 304
+ +D+G T L YN VR+A K T P G L CY S+ + P
Sbjct: 368 GVIVDSGTAITRLRTQAYN----SVRDAFKRKTSNLRPAEGVALFDTCYDLSSLQSVRVP 423
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
++ HF G L + IP G +CFA P + I GN Q + +D +
Sbjct: 424 TVSFHFSGDRAWALPAKNYLIPVDGAGTYCFAFAPTTSSMSIIGNVQQQGTRVSFDLANS 483
Query: 365 MVSFKPTDC 373
+V F P C
Sbjct: 484 LVGFSPNKC 492
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 130/371 (35%), Positives = 186/371 (50%), Gaps = 28/371 (7%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
V ++GEY+M+ IGTP I+DTGSDL+W QC PC+ C Q P ++PA+SS+Y+
Sbjct: 85 VLASDGEYLMEMGIGTPARF-YSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPANSSTYR 143
Query: 77 ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN--FFDNVVF 134
L C + C+ L C Q+ C Y Y Y DS+ T GVLA E TFG ++ + F
Sbjct: 144 SLGCSAPACNALYYPLC-YQKTCVYQYFYGDSASTAGVLANETFTFGTNDTRVTLPRISF 202
Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
GCG+ N G N G+VG GR LSL +SQLG+ +FSYCL F S + S++YFG
Sbjct: 203 GCGNLNAGSL-ANGSGMVGFGRGSLSL----VSQLGSPRFSYCLTSFL--SPVRSRLYFG 255
Query: 195 NGSEV---SGGGVVSTS-LVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAISK 248
+ + + V ST +++ T YF+ + GISVG L ++ ++ G
Sbjct: 256 AYATLNSTNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDG---T 312
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLT-PYQDPRLGSQL--CYKT---PSMAGI 302
G ID+G T L + Y + E + T P D S L C++ P +
Sbjct: 313 GGTIIDSGTTITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVT 372
Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFD 362
P L HFD GA L + + P G C AM D I G++ + + YD +
Sbjct: 373 LPQLVLHFD-GADWELPLQNYMLVDPSTGGLCLAMA-TSSDGSIIGSYQHQNFNVLYDLE 430
Query: 363 SQMVSFKPTDC 373
+ ++SF P C
Sbjct: 431 NSLLSFVPAPC 441
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 119/338 (35%), Positives = 164/338 (48%), Gaps = 18/338 (5%)
Query: 41 IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSC-SSQQLC 99
++DTGSD+ WVQC PC CY+Q P+++P+ S+SY +SC S++C LDT +C ++ C
Sbjct: 2 VLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGAC 61
Query: 100 NYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRL 159
Y Y D S T G ATE +T G+S NV GCGH+N G+F L+ LG L
Sbjct: 62 LYEVAYGDGSYTVGDFATETLTLGDSTP-VGNVAIGCGHDNEGLFVGAAG-LLALGGGPL 119
Query: 160 SLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLV-SKEDKTYY 218
S SQI A+ FSYCLV DS S + FG+G+ + G V+ LV S T+Y
Sbjct: 120 SFPSQI----SASTFSYCLV--DRDSPAASTLQFGDGAAEA--GTVTAPLVRSPRTSTFY 171
Query: 219 FVTLEGISVGN--LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRN 276
+V L GISVG LS + +SG+ G + +D+G T L Y L +
Sbjct: 172 YVALSGISVGGQPLSIPASAFAMDATSGS---GGVIVDSGTAVTRLQSAAYAALRDAFVQ 228
Query: 277 AIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF 335
P CY + P ++ F+GG + L + IP G +C
Sbjct: 229 GAPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCL 288
Query: 336 AMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
A P + V I GN Q + +D V F P C
Sbjct: 289 AFAPTNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 326
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 125/365 (34%), Positives = 173/365 (47%), Gaps = 16/365 (4%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S ++ +GEY + +GTPP +Y ++DTGSD++W+QC PC CY Q P++NP S
Sbjct: 31 VISGLAQGSGEYFTRIGVGTPPKY-VYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKS 89
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+ ++ C++ C L++ C+ +Q C Y Y D S T G TE +TF + + V
Sbjct: 90 GSFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTK--VEQV 147
Query: 133 VFGCGHNNTGVF--NENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
GCGH+N G+F +GL G + S A + +Q KFSYCLV + SS S
Sbjct: 148 ALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQ----KFSYCLVD-RSASSKPSS 202
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
+ FGN S VS + L + T+Y+V L GISVG S ++ G
Sbjct: 203 VVFGN-SAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLD-RTGNGG 260
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVR-NAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTA 308
+ ID G T L K Y L + R A L + L CY + P +
Sbjct: 261 VIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSL-FDTCYDLSGKTTVKVPTVVL 319
Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
HF GA V L ++ IP G FCFA + I GN Q + YD S V F
Sbjct: 320 HFR-GADVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGF 378
Query: 369 KPTDC 373
P C
Sbjct: 379 SPRGC 383
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 119/369 (32%), Positives = 175/369 (47%), Gaps = 26/369 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S + +GEY ++ +G+PP Y ++D+GSD++WVQC PC QCY Q P+++PA S
Sbjct: 129 VISGMEQGSGEYFVRIGVGSPPRSQ-YMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADS 187
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
+S+ +SC S C L+ C + + C Y Y D S TKG LA E +TFG + +V
Sbjct: 188 ASFTGVSCSSSVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTFGRT--MVRSV 244
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH N G+F L G + +S Q+ Q G FSYCLV TDSS +
Sbjct: 245 AIGCGHRNRGMFVGAAGLLGLGGGS-MSFVGQLGGQTGG-AFSYCLVSRGTDSS--GSLV 300
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---G 249
FG + +G V + + ++Y++ L G+ VG + +P +++ G
Sbjct: 301 FGREALPAGAAWVPL-VRNPRAPSFYYIGLAGLGVGGIR-----VPISEEVFRLTELGDG 354
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGI-AP 304
+ +DTG T LP Y + R+A PR CY + P
Sbjct: 355 GVVMDTGTAVTRLPTLAY----QAFRDAFLAQTANLPRATGVAIFDTCYDLLGFVSVRVP 410
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
++ +F GG + L + IP G FCFA P + I GN Q + I +D +
Sbjct: 411 TVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANG 470
Query: 365 MVSFKPTDC 373
V F P C
Sbjct: 471 YVGFGPNIC 479
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 125/377 (33%), Positives = 181/377 (48%), Gaps = 22/377 (5%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
N V S + +GEY ++ +GTP ++ +VDTGSDL W+QC PC CYKQ PI++P
Sbjct: 115 NGPVTSGLLYGSGEYFVRLGVGTPAR-SLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDP 173
Query: 70 ASSSSYKELSCQSEQCHLLDTVSCSSQQ----LCNYTYGYADSSLTKGVLATERITFGNS 125
+SSS++ + C S C L+ SCS + C+Y Y D S + G +++ T G
Sbjct: 174 RNSSSFQRIPCLSPLCKALEIHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTG 233
Query: 126 NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQIL----SQLGANKFSYCLVPF 181
+ +V FGCG +N G+ GL+GLG +LS SQI + AN FSYCLV
Sbjct: 234 SKAM-SVAFGCGFDNEGL-FAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDR 291
Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
+ +S + + +S L + + T+Y+ + G+SVG +P
Sbjct: 292 SNPMTRSSSSLIFGAAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQ-----LPISL 346
Query: 242 SSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTP 297
S +S+ G + ID+G T P Y + + RNA P PR CY
Sbjct: 347 KSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTNLP-SAPRYSLFDTCYNFS 405
Query: 298 SMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLF 356
A + P L HF+ GA + L T+ IP G FC A P ++GI GN Q
Sbjct: 406 GKASVDVPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFR 465
Query: 357 IGYDFDSQMVSFKPTDC 373
IG+D ++F P C
Sbjct: 466 IGFDLQKSHLAFAPQQC 482
>gi|255566002|ref|XP_002523989.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536716|gb|EEF38357.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 137/383 (35%), Positives = 195/383 (50%), Gaps = 45/383 (11%)
Query: 14 QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
QS ++ + G Y++K S+GTPP +I + D DL W+ C C C K + P+ SS
Sbjct: 87 QSELNFSKGNYLIKISVGTPPA-EILALADITGDLTWLPCKTCQDCTKDGFTFF-PSESS 144
Query: 74 SYKELSCQSEQCHLLDTVSCSSQQLCNYTYG----YADSSLTKGVLATERITFGNSNN-- 127
+Y +C+S QC + + C ++ +C Y G S KG++A + I+F +S+
Sbjct: 145 TYTSAACESYQCQITNGAVCQTK-MCIYLCGPLPQQRSSCTNKGLVAMDTISFHSSSGQA 203
Query: 128 -FFDNVVFGCGHNNTGVFNENEMG--LVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
+ N F CG T + N + +G +VGLGR S+ SQ + L FS CLVP+ +
Sbjct: 204 LSYPNTNFICG---TFIDNWHYIGAGIVGLGRGLFSMTSQ-MKHLINGTFSQCLVPYSSK 259
Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNS 242
S SK+ FG VSG GVVST + + YF+ LE +SVG ++N+ P
Sbjct: 260 QS--SKINFGLKGVVSGEGVVSTPIADDGESGAYFLFLEAMSVGGNRVANNFYSAP---- 313
Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTP--YQDPRLGSQLCYKTPSMA 300
K N++ID T LP DFY +E +VR AI LTP Y + R S LCYK+ S
Sbjct: 314 -----KSNIYIDWRTTFTSLPHDFYENVEAEVRKAINLTPINYNNERKLS-LCYKSESDH 367
Query: 301 GI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDV--------GIFGNFA 351
AP +T HF A V L +TF+ V CFA +DG ++G++
Sbjct: 368 DFDAPPITMHFT-NADVQLSPLNTFVRMDWN-VVCFAF--LDGTFNATKRITHAVYGSWQ 423
Query: 352 QSDLFIGYDFDSQMVSFKPTDCT 374
Q + +GYD S VSFK DCT
Sbjct: 424 QMNFIVGYDLKSSTVSFKQADCT 446
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 121/371 (32%), Positives = 178/371 (47%), Gaps = 34/371 (9%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EYV+ ++GTPP I ++DTGSDL+W QC C C +Q P+++P SSSY+ + C
Sbjct: 97 EYVLDLAVGTPPQ-PITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAG 155
Query: 83 EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV--FGCGHNN 140
+ C + SC C Y Y Y D + T G ATER TF +S+ +V FGCG N
Sbjct: 156 QLCGDILHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGCGTMN 215
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV- 199
G N N G+VG GR LSL +SQL +FSYCL P+ SS S + FG+ ++V
Sbjct: 216 VGSLN-NASGIVGFGRDPLSL----VSQLSIRRFSYCLTPYA--SSRKSTLQFGSLADVG 268
Query: 200 ---SGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS---KGNM 251
G V T+ L S ++ T+Y+V G++VG ++ + S+ A+ G +
Sbjct: 269 LYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVG-----ARRLRIPASAFALRPDGSGGV 323
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY-------KTPSMAG--I 302
ID+G TL P + R+ ++L +C+ MA
Sbjct: 324 IIDSGTALTLFPVAVLAEVVRAFRSQLRLPFANGSSPDDGVCFAAPAVAAGGGRMARQVA 383
Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFD 362
P + HF GA + L + + G C + D GNF Q D+ + YD +
Sbjct: 384 VPRMVFHFQ-GADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLE 442
Query: 363 SQMVSFKPTDC 373
+ +SF P +C
Sbjct: 443 RETLSFAPVEC 453
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 118/364 (32%), Positives = 173/364 (47%), Gaps = 24/364 (6%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
S + +GEY + IG P ++Y ++DTGSD+ W+QC PC CY Q +PI+ P+SSSS
Sbjct: 139 SGTTQGSGEYFTRVGIGKPAR-EVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSS 197
Query: 75 YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
Y+ LSC + QC+ L+ C + C Y Y D S T G ATE +T G++ NV
Sbjct: 198 YEPLSCDTPQCNALEVSECRNAT-CLYEVSYGDGSYTVGDFATETLTIGST--LVQNVAV 254
Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
GCGH+N G+F + SQL FSYCLV +DS+ T
Sbjct: 255 GCGHSNEGLFVGAAG-----LLGLGGGLLALPSQLNTTSFSYCLVDRDSDSASTVDF--- 306
Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNM 251
G+ +S VV+ L + + T+Y++ L GISVG +L+ SS + + G +
Sbjct: 307 -GTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGG-----ELLQIPQSSFEMDESGSGGI 360
Query: 252 FIDTGAPPTLLPKDFYNRLEEQ-VRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAH 309
ID+G T L + YN L + V+ + L + CY + + P + H
Sbjct: 361 IIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAM-FDTCYNLSAKTTVEVPTVAFH 419
Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
F GG + L + IP G FC A P + I GN Q + +D + ++ F
Sbjct: 420 FPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFS 479
Query: 370 PTDC 373
C
Sbjct: 480 SNKC 483
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 127/367 (34%), Positives = 177/367 (48%), Gaps = 19/367 (5%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S +S +GEY + +GTP +Y ++DTGSD++W+QC PC +CY Q PI++P S
Sbjct: 131 VVSGLSQGSGEYFTRLGVGTPARY-VYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKS 189
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
+Y + C S C LD+ C++++ C Y Y D S T G +TE +TF N
Sbjct: 190 KTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTF--RRNRVKG 247
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
V GCGH+N G+F L+GLG+ +LS Q + KFSYCLV + SS S +
Sbjct: 248 VALGCGHDNEGLFVGAAG-LLGLGKGKLSFPGQTGHRFN-QKFSYCLVD-RSASSKPSSV 304
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS----GAIS 247
FGN + VS + L + + T+Y+V L GISVG +P +S I
Sbjct: 305 VFGNAA-VSRIARFTPLLSNPKLDTFYYVGLLGISVGGTR-----VPGVTASLFKLDQIG 358
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APIL 306
G + ID+G T L + Y + + R K C+ +M + P +
Sbjct: 359 NGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTV 418
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
HF GA V L T+ IP G FCFA G + I GN Q + YD S V
Sbjct: 419 VLHFR-GADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRV 477
Query: 367 SFKPTDC 373
F P C
Sbjct: 478 GFAPGGC 484
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 171 bits (433), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 125/365 (34%), Positives = 173/365 (47%), Gaps = 16/365 (4%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S ++ +GEY + +GTPP +Y ++DTGSD++W+QC PC CY Q P++NP S
Sbjct: 118 VISGLAQGSGEYFTRIGVGTPPKY-VYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKS 176
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+ ++ C++ C L++ C+ +Q C Y Y D S T G TE +TF + + V
Sbjct: 177 GSFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTK--VEQV 234
Query: 133 VFGCGHNNTGVF--NENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
GCGH+N G+F +GL G + S A + +Q KFSYCLV + SS S
Sbjct: 235 ALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQ----KFSYCLVD-RSASSKPSS 289
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
+ FGN S VS + L + T+Y+V L GISVG S ++ G
Sbjct: 290 VVFGN-SAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLD-RTGNGG 347
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVR-NAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTA 308
+ ID G T L K Y L + R A L + L CY + P +
Sbjct: 348 VIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSL-FDTCYDLSGKTTVKVPTVVL 406
Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
HF GA V L ++ IP G FCFA + I GN Q + YD S V F
Sbjct: 407 HFR-GADVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGF 465
Query: 369 KPTDC 373
P C
Sbjct: 466 SPRGC 470
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 171 bits (433), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 127/364 (34%), Positives = 177/364 (48%), Gaps = 11/364 (3%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S +S +GEY + +GTPP +Y ++DTGSD++W+QC PC +CY Q PI+NP S
Sbjct: 99 VVSGLSQGSGEYFTRLGVGTPPRY-LYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPYKS 157
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
S+ + C S C LD+ CS+++ C Y Y D S T G ATE +TF N
Sbjct: 158 KSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTF--RGNKIAK 215
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
V GCGH+N G+F L+GLGR RLS SQ + +KFSYCLV + SS S M
Sbjct: 216 VALGCGHHNEGLFVGAAG-LLGLGRGRLSFPSQTGIRFN-HKFSYCLVD-RSASSKPSSM 272
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
FG+ + +S + + + + T+Y+V L GISVG + + P + G +
Sbjct: 273 VFGDAA-ISRLARFTPLIRNPKLDTFYYVGLIGISVGGV-RVRGVSPSLFKLDSAGNGGV 330
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHF 310
ID+G T L + Y L + R + CY + + P + HF
Sbjct: 331 IIDSGTSVTRLTRPAYTALRDAFRVGARHLKRGPEFSLFDTCYDLSGQSSVKVPTVVLHF 390
Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
GA + L T+ IP G FCFA + I GN Q + YD + F P
Sbjct: 391 R-GADMALPATNYLIPVDENGSFCFAFAGTISGLSIIGNIQQQGFRVVYDLAGSRIGFAP 449
Query: 371 TDCT 374
CT
Sbjct: 450 RGCT 453
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 170 bits (431), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 122/368 (33%), Positives = 176/368 (47%), Gaps = 24/368 (6%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S ++ +GEY ++ +G+PP Y ++D+GSD++WVQC PC QCY Q P+++PA S
Sbjct: 32 VVSGMNQGSGEYFVRIGLGSPPRSQ-YMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADS 90
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
+S+ +SC S C ++ C+S + C Y Y D S TKG LA E +TFG + NV
Sbjct: 91 ASFMGVSCSSAVCDRVENAGCNSGR-CRYEVSYGDGSYTKGTLALETLTFGRT--VVRNV 147
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH+N G+F L G + +S Q+ Q G N FSYCLV T+ + +
Sbjct: 148 AIGCGHSNRGMFVGAAGLLGLGGGS-MSFMGQLSGQTG-NAFSYCLVSRGTN----TNGF 201
Query: 193 FGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLS-NSSKLIPYYNSSGAISKGN 250
GSE G LV ++Y++ L G+ VG+ S+ + N G+ G
Sbjct: 202 LEFGSEAMPVGAAWIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDVFQLNELGS---GG 258
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGI-API 305
+ +DTG T P Y E RNA PR CY + P
Sbjct: 259 VVMDTGTAVTRFPTVAY----EAFRNAFIEQTQNLPRASGVSIFDTCYNLFGFLSVRVPT 314
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
++ +F GG + + + IP G FCFA P + I GN Q + I D ++
Sbjct: 315 VSFYFSGGPILTIPANNFLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVDEANEF 374
Query: 366 VSFKPTDC 373
V F P C
Sbjct: 375 VGFGPNIC 382
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 170 bits (431), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 132/374 (35%), Positives = 186/374 (49%), Gaps = 33/374 (8%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
V ++GEY+M+ IGTP I+DTGSDL+W QC PC+ C Q P ++PA S++Y+
Sbjct: 83 VLASDGEYLMEMGIGTPTRY-YSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYR 141
Query: 77 ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF--FDNVVF 134
L C S C+ L C Q++C Y Y Y DS+ T GVLA E TFG + + F
Sbjct: 142 SLGCASPACNALYYPLC-YQKVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISF 200
Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
GCG+ N G+ N G+VG GR LSL +SQLG+ +FSYCL F S + S++YFG
Sbjct: 201 GCGNLNAGLL-ANGSGMVGFGRGSLSL----VSQLGSPRFSYCLTSFL--SPVPSRLYFG 253
Query: 195 -----NGSEVSGGGVVSTS-LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
N + S V ST +V+ T YF+ + GISVG L+P + AI+
Sbjct: 254 VYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGG-----YLLPIDPAVFAIND 308
Query: 249 ----GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYK---TPSM 299
G ID+G T L + Y+ + + I L P + S L C++ P
Sbjct: 309 TDGTGGTIIDSGTTITYLAEPAYDAVRAAFASQITL-PLLNVTDASVLDTCFQWPPPPRQ 367
Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
+ P L HFDG + + P G C AM D I G++ + + Y
Sbjct: 368 SVTLPQLVLHFDGADWELPLQNYMLVDPSTGGGLCLAMA-SSSDGSIIGSYQHQNFNVLY 426
Query: 360 DFDSQMVSFKPTDC 373
D ++ ++SF P C
Sbjct: 427 DLENSLMSFVPAPC 440
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 120/363 (33%), Positives = 181/363 (49%), Gaps = 25/363 (6%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
+ TAN Y++ +GTP D+ + DTGSDL WVQC PC CYKQ P+++P+ S++Y
Sbjct: 182 RLGTAN--YIVSVGLGTP-RRDLLVVFDTGSDLSWVQCKPCNNCYKQHDPLFDPSQSTTY 238
Query: 76 KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
+ C +++C LD+ +CSS + C Y Y D S T G LA + +T G S++ VFG
Sbjct: 239 SAVPCGAQEC--LDSGTCSSGK-CRYEVVYGDMSQTDGNLARDTLTLGPSSDQLQGFVFG 295
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
CG ++TG+F + GL GLGR R+SLASQ ++ GA FSYCL SS ++ Y
Sbjct: 296 CGDDDTGLFGRAD-GLFGLGRDRVSLASQAAARYGAG-FSYCL-----PSSWRAEGYLSL 348
Query: 196 GSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIP-YYNSSGAISKGNMFI 253
GS + T++V++ D ++Y++ L GI V + ++ P + + G + I
Sbjct: 349 GSAAAPPHAQFTAMVTRSDTPSFYYLDLVGIKVAG--RTVRVAPAVFKAPGTV------I 400
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDG 312
D+G T LP Y+ L ++ CY + P + FDG
Sbjct: 401 DSGTVITRLPSRAYSALRSSFAGFMRRYKRAPALSILDTCYDFTGRTKVQIPSVALLFDG 460
Query: 313 GAKVPL-IHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
GA + L ++ + FA D VGI GN Q + YD +Q + F
Sbjct: 461 GATLNLGFGGVLYVANRSQACLAFASNGDDTSVGILGNMQQKTFAVVYDLANQKIGFGAK 520
Query: 372 DCT 374
C+
Sbjct: 521 GCS 523
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 125/377 (33%), Positives = 181/377 (48%), Gaps = 22/377 (5%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
N V S + +GEY ++ +GTP ++ +VDTGSDL W+QC PC CYKQ PI++P
Sbjct: 40 NGPVTSGLLYGSGEYFVRLGLGTPAR-SLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDP 98
Query: 70 ASSSSYKELSCQSEQCHLLDTVSCSSQQ----LCNYTYGYADSSLTKGVLATERITFGNS 125
+SSS++ + C S C L+ SCS + C+Y Y D S + G +++ T G
Sbjct: 99 RNSSSFQRIPCLSPLCKALEVHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTG 158
Query: 126 NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQIL----SQLGANKFSYCLVPF 181
+ +V FGCG +N G+ GL+GLG +LS SQI + AN FSYCLV
Sbjct: 159 SKAM-SVAFGCGFDNEGL-FAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDR 216
Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
+ +S + + +S L + + T+Y+ + G+SVG +P
Sbjct: 217 SNPMTRSSSSLIFGVAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQ-----LPISL 271
Query: 242 SSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTP 297
S +S+ G + ID+G T P Y + + RNA P PR CY
Sbjct: 272 KSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATINLP-SAPRYSLFDTCYNFS 330
Query: 298 SMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLF 356
A + P L HF+ GA + L T+ IP G FC A P ++GI GN Q
Sbjct: 331 GKASVDVPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFR 390
Query: 357 IGYDFDSQMVSFKPTDC 373
IG+D ++F P C
Sbjct: 391 IGFDLQKSHLAFAPQQC 407
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 122/367 (33%), Positives = 175/367 (47%), Gaps = 16/367 (4%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+ S +S +GEY + IG P Y +DTGSD+ W+QC PC CY QV PIY+P++S
Sbjct: 1 ISSGLSLGSGEYFARMGIGNP-QRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNS 59
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG-NSNNFFDN 131
SSY+ + C S C LD +C C+Y Y DSS + G L E G NS+ N
Sbjct: 60 SSYRRVYCGSALCQALDYSACQGMG-CSYRVVYGDSSASSGDLGIESFYLGPNSSTAMRN 118
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD-SSITSK 190
+ FGCGH+N+G+F L+G+G LS SQI + +G FSYCLV ++ S +S
Sbjct: 119 IAFGCGHSNSGLFRGEAG-LLGMGGGTLSFFSQIAASIGP-AFSYCLVDRYSQLQSRSSP 176
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
+ FG + + + L + T+Y+ L GISVG + P + G
Sbjct: 177 LIFGR-TAIPFAARFTPLLKNPRINTFYYAVLTGISVGG--TPLPIPPAQFALTGNGTGG 233
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGIA-PIL 306
+D+G T + Y L + R A + P P G L C+ + + P L
Sbjct: 234 AILDSGTSVTRVVPPAYAVLRDAYRAASRNLP---PAPGVYLLDTCFNFQGLPTVQIPSL 290
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
HFD G + L + IP G FC A P + + GN Q IG+D ++
Sbjct: 291 VLHFDNGVDMVLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQRSLI 350
Query: 367 SFKPTDC 373
+ P +C
Sbjct: 351 AIAPREC 357
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 122/378 (32%), Positives = 183/378 (48%), Gaps = 49/378 (12%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EYV+ +IGTPP + ++DTGSDL+W QC PC C Q P++ P S+SY+ + C
Sbjct: 101 EYVVDLAIGTPPQ-PVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPGESASYEPMRCAG 159
Query: 83 EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV-----FGCG 137
+ C + C C Y Y Y D ++T GV ATER TF +S D ++ FGCG
Sbjct: 160 QLCSDILHHGCEMPDTCTYRYNYGDGTMTMGVYATERFTFTSSGG--DRLMTVPLGFGCG 217
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
N G N N G+VG GR LSL +SQL +FSYCL + S S + FG+
Sbjct: 218 SMNVGSLN-NGSGIVGFGRNPLSL----VSQLSIRRFSYCLTSY--GSGRKSTLLFGS-- 268
Query: 198 EVSGG------GVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS-- 247
+SGG G V T+ L S ++ T+Y+V L G++VG ++ + S+ A+
Sbjct: 269 -LSGGVYGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVG-----ARRLRIPESAFALRPD 322
Query: 248 -KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYKTPSMAGIA 303
G + +D+G TLLP + R ++L P+ +P G +C+ P+ +
Sbjct: 323 GSGGVIVDSGTALTLLPGAVLAEVVRAFRQQLRL-PFANGGNPEDG--VCFLVPAAWRRS 379
Query: 304 --------PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDL 355
P + HF A + L + + +G C + D GN Q D+
Sbjct: 380 SSTSQVPVPRMVFHFQ-DADLDLPRRNYVLDDHRKGRLCLLLADSGDDGSTIGNLVQQDM 438
Query: 356 FIGYDFDSQMVSFKPTDC 373
+ YD +++ +SF P C
Sbjct: 439 RVLYDLEAETLSFAPAQC 456
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 119/345 (34%), Positives = 170/345 (49%), Gaps = 24/345 (6%)
Query: 41 IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDT------VSCS 94
IVDTGSDL WVQC PC +CY Q P++NP++S SY+ + C S C L + V S
Sbjct: 149 IVDTGSDLSWVQCQPCKRCYNQQDPVFNPSTSPSYRTVLCSSPTCQSLQSATGNLGVCGS 208
Query: 95 SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGL 154
+ CNY Y D S T+G L TE + GNS +N +FGCG NN G+F GLVGL
Sbjct: 209 NPPSCNYVVNYGDGSYTRGELGTEHLDLGNSTA-VNNFIFGCGRNNQGLFG-GASGLVGL 266
Query: 155 GRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKE 213
GR+ LSL SQ + G FSYCL T++S S + GN S +S T ++
Sbjct: 267 GRSSLSLISQTSAMFGG-VFSYCLPITETEAS-GSLVMGGNSSVYKNTTPISYTRMIPNP 324
Query: 214 DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQ 273
+YF+ L GI+VG+++ + P + G M ID+G T LP Y L+++
Sbjct: 325 QLPFYFLNLTGITVGSVAVQA---PSFGKDG------MMIDSGTVITRLPPSIYQALKDE 375
Query: 274 VRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGV 332
P + C+ + P + HF+G A++ + T F +
Sbjct: 376 FVKQFSGFPSAPAFMILDTCFNLSGYQEVEIPNIKMHFEGNAELNVDVTGVFYFVKTDAS 435
Query: 333 -FCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
C A+ + + +VGI GN+ Q + + YD M+ F CT
Sbjct: 436 QVCLAIASLSYENEVGIIGNYQQKNQRVIYDTKGSMLGFAAEACT 480
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 124/378 (32%), Positives = 176/378 (46%), Gaps = 47/378 (12%)
Query: 7 FYPNNVVQS---NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV 63
+ P ++V V +GEY ++ +G+PP D Y +VD+GSD++WVQC PC QCY Q
Sbjct: 110 YLPEDLVSEVVPGVDDGSGEYFVRVGVGSPPT-DQYLVVDSGSDVIWVQCRPCEQCYAQT 168
Query: 64 KPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQL---CNYTYGYADSSLTKGVLATERI 120
P+++PA+SSS+ +SC S C L C C+Y+ Y D S TKG LA E +
Sbjct: 169 DPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETL 228
Query: 121 TFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
T G + V GCGH N+G+F GL+GLG +SL Q+ G FSYCL
Sbjct: 229 TLGGTA--VQGVAIGCGHRNSGLF-VGAAGLLGLGWGAMSLVGQLGGAAG-GVFSYCL-- 282
Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
S+ G GS S ++Y+V L GI VG + +P
Sbjct: 283 -------ASRGAGGAGSLAS---------------SFYYVGLTGIGVGG-----ERLPLQ 315
Query: 241 NSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP 297
+S +++ G + +DTG T LP++ Y L A+ P CY
Sbjct: 316 DSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLS 375
Query: 298 SMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEG-VFCFAMQPIDGDVGIFGNFAQSDL 355
A + P ++ +FD GA + L + + V G VFC A P + I GN Q +
Sbjct: 376 GYASVRVPTVSFYFDQGAVLTLPARNLLV--EVGGAVFCLAFAPSSSGISILGNIQQEGI 433
Query: 356 FIGYDFDSQMVSFKPTDC 373
I D + V F P C
Sbjct: 434 QITVDSANGYVGFGPNTC 451
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 121/375 (32%), Positives = 179/375 (47%), Gaps = 41/375 (10%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EYV+ +IGTPP + ++DTGSDL+W QC PC C Q P++ P S+SY+ + C
Sbjct: 95 EYVVDLAIGTPPQ-PVSALLDTGSDLIWTQCAPCASCLSQPDPLFAPGQSASYEPMRCAG 153
Query: 83 EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV-----FGCG 137
C + SC C Y Y Y D ++T GV ATER TF +S FGCG
Sbjct: 154 TLCSDILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLGFGCG 213
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
N G N N G+VG GR LSL +SQL +FSYCL + S S + FG+ S
Sbjct: 214 SVNVGSLN-NGSGIVGFGRNPLSL----VSQLSIRRFSYCLTSYA--SRRQSTLLFGSLS 266
Query: 198 E-VSG---GGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS---KG 249
+ V G G V +T L+ S ++ T+Y+V G++VG ++ + S+ A+ G
Sbjct: 267 DGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVG-----ARRLRIPESAFALRPDGSG 321
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYKTPSMAGIA--- 303
+ +D+G TLLP + R ++L P+ +P G +C+ P+ +
Sbjct: 322 GVIVDSGTALTLLPAAVLAEVVRAFRQQLRL-PFANGGNPEDG--VCFLVPAAWRRSSST 378
Query: 304 -----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIG 358
P + HF GA + L + + G C + D GN Q D+ +
Sbjct: 379 SQMPVPRMVLHFQ-GADLDLPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVL 437
Query: 359 YDFDSQMVSFKPTDC 373
YD +++ +S P C
Sbjct: 438 YDLEAETLSIAPARC 452
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 120/366 (32%), Positives = 174/366 (47%), Gaps = 28/366 (7%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
S + +GEY + IG P ++Y ++DTGSD+ W+QC PC CY Q +PI+ P+SSSS
Sbjct: 142 SGTTQGSGEYFTRVGIGNPAR-EVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSS 200
Query: 75 YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
Y+ LSC + QC+ L+ C + C Y Y D S T G ATE +T G++ NV
Sbjct: 201 YEPLSCDTPQCNALEVSECRNAT-CLYEVSYGDGSYTVGDFATETLTIGST--LVQNVAV 257
Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
GCGH+N G+F L + SQL FSYCLV +DS+ T +
Sbjct: 258 GCGHSNEGLFVGAAGLL-----GLGGGLLALPSQLNTTSFSYCLVDRDSDSASTVEF--- 309
Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNM 251
G+ + VV+ L + + T+Y++ L GISVG +L+ SS + + G +
Sbjct: 310 -GTSLPPDAVVAPLLRNHQLDTFYYLGLTGISVGG-----ELLQIPQSSFEMDESGSGGI 363
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGI-APILT 307
ID+G T L YN L + +K T + G + CY + I P +
Sbjct: 364 IIDSGTAVTRLQTGIYNSLRDSF---LKGTSDLEKAAGVAMFDTCYNLSAKTTIEVPTVA 420
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
HF GG + L + IP G FC A P + I GN Q + +D + ++
Sbjct: 421 FHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIG 480
Query: 368 FKPTDC 373
F C
Sbjct: 481 FSSNKC 486
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 131/387 (33%), Positives = 188/387 (48%), Gaps = 38/387 (9%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYN 68
+ ++++ G Y M S+GTPPL I+DTGSDL W QC PC C+ Q P+Y+
Sbjct: 82 HGLLEALAENGAGAYHMILSVGTPPLA-FPAIIDTGSDLTWTQCAPCTTACFAQPTPLYD 140
Query: 69 PASSSSYKELSCQSEQCHLLDTV--SCSSQQLCNYTYGYADSSLTKGVLATERITF---- 122
PA SS++ +L C S C L + +C++ C Y Y YA T G LA + +
Sbjct: 141 PARSSTFSKLPCASPLCQALPSAFRACNATG-CVYDYRYA-VGFTAGYLAADTLAIGDGD 198
Query: 123 --GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
G++++ F V FGC N G + G+VGLGR+ LSL LSQ+G +FSYCL
Sbjct: 199 GDGDASSSFAGVAFGCSTANGGDM-DGASGIVGLGRSALSL----LSQIGVGRFSYCL-- 251
Query: 181 FHTDSSI-TSKMYFGNGSEVSGGGVVSTSLV-----SKEDKTYYFVTLEGISVGNLSNSS 234
+D+ S + FG + V+G V ST+L+ ++ YY+V L GI+VG S
Sbjct: 252 -RSDADAGASPILFGALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVG-----S 305
Query: 235 KLIPYYNSS---GAISKGNMFIDTGAPPTLLPKDFYNRLEEQV--RNAIKLTPYQDPRLG 289
+P +S+ A G + +D+G T L + Y L + + A LT +
Sbjct: 306 TDLPVTSSTFGFTAAGAGGVIVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFD 365
Query: 290 SQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG-VFCFAMQPIDGDVGIFG 348
LC++ + P L F GGA+ + S F G V C + P G V + G
Sbjct: 366 FDLCFEAGAADTPVPRLVFRFAGGAEYAVPRQSYFDAVDEGGRVACLLVLPTRG-VSVIG 424
Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDCTK 375
N Q DL + YD D SF P DC
Sbjct: 425 NVMQMDLHVLYDLDGATFSFAPADCAS 451
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 122/367 (33%), Positives = 176/367 (47%), Gaps = 16/367 (4%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S +S +GEY + IG+P Y +DTGSD+ W+QC PC CY QV PIY+P++S
Sbjct: 34 VSSGLSLGSGEYFARMGIGSPQR-SYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNS 92
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG-NSNNFFDN 131
SSY+ + C S C LD +C C+Y Y DSS + G L E G NS+ N
Sbjct: 93 SSYRRVYCGSALCQALDYSACQGMG-CSYRVVYGDSSASSGDLGIESFYLGPNSSTAMRN 151
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD-SSITSK 190
+ FGCGH+N+G+F L+G+G LS SQI + +G FSYCLV ++ S +S
Sbjct: 152 IAFGCGHSNSGLFRGEAG-LLGMGGGTLSFFSQIAASIGP-AFSYCLVDRYSQLQSRSSP 209
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
+ FG + + + L + T+Y+ L GISVG + + P + G
Sbjct: 210 LIFGR-TAIPFAARFTPLLKNPRIDTFYYAILTGISVGG--TALPIPPAQFALTGNGTGG 266
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGIA-PIL 306
+D+G T + Y L + R A + P P G L C+ + + P L
Sbjct: 267 AILDSGTSVTRVVPAAYAVLRDAYRAASRNLP---PAPGVYLLDTCFNFQGLPTVQIPSL 323
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
HFD + L + IP G FC A P + + GN Q IG+D ++
Sbjct: 324 VLHFDNDVDMVLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQRSLI 383
Query: 367 SFKPTDC 373
+ P +C
Sbjct: 384 AIAPREC 390
>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 118/381 (30%), Positives = 175/381 (45%), Gaps = 51/381 (13%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC--------LPCVQCYKQVKP--IYNPASS 72
EY+M +IGTPP + I DTGSDL+W+ C L + P ++P+ S
Sbjct: 99 EYLMAVNIGTPPT-RMVAIADTGSDLIWLNCSYGGDGPGLAAARDADAQPPGVQFDPSKS 157
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNS------- 125
++++ + C S C L SC + C Y+Y Y D S T GVL+TE TF ++
Sbjct: 158 TTFRLVDCDSVACSELPEASCGADSKCRYSYSYGDGSHTSGVLSTETFTFADAPGARGDG 217
Query: 126 -NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGAN-----KFSYCLV 179
NV FGC G + + ++SQLGA+ +FSYCLV
Sbjct: 218 TTTRVANVNFGCSTTFVGSSVGDGL------VGLGGGDLSLVSQLGADTSLGRRFSYCLV 271
Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPY 239
P+ +S S + FG + V+ G V+T L+ + K YY V L + VGN
Sbjct: 272 PYSVKAS--SALNFGPRAAVTDPGAVTTPLIPSQVKAYYIVELRSVKVGN---------- 319
Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY----- 294
+ A + + +D+G T LP+ + L +++ IKL P Q P LC+
Sbjct: 320 -KTFEAPDRSPLIVDSGTTLTFLPEALVDPLVKELTGRIKLPPAQSPERLLPLCFDVSGV 378
Query: 295 KTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD--VGIFGNFAQ 352
+ +A + P +T GGA V L +TF+ EG C A+ + I GN AQ
Sbjct: 379 REGQVAAMIPDVTVGLGGGAAVTLKAENTFVEVQ-EGTLCLAVSAMSEQFPASIIGNIAQ 437
Query: 353 SDLFIGYDFDSQMVSFKPTDC 373
++ +GYD D V+F P C
Sbjct: 438 QNMHVGYDLDKGTVTFAPAAC 458
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 131/374 (35%), Positives = 189/374 (50%), Gaps = 38/374 (10%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
+ + SIGTPP I+DTGSDL+W QC + KP+Y+PA SSS+ C
Sbjct: 88 HHTLTVSIGTPPQPRTL-ILDTGSDLIWTQCKLFDTRQHREKPLYDPAKSSSFAAAPCDG 146
Query: 83 EQCHL--LDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
C +T +CS + C YTY Y S+ TKG LA+E TFG ++ FGCG
Sbjct: 147 RLCETGSFNTKNCSRNK-CIYTYNYG-SATTKGELASETFTFGEHRRVSVSLDFGCGKLT 204
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
+G G++G+ RLSL +SQL +FSYCL PF D + TS ++FG +++S
Sbjct: 205 SGSL-PGASGILGISPDRLSL----VSQLQIPRFSYCLTPF-LDRNTTSHIFFGAMADLS 258
Query: 201 G----GGVVSTSLVSKEDKT--YYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNM 251
G + +TSLV+ D + YY+V L GISVG +K + SS AI + G
Sbjct: 259 KYRTTGPIQTTSLVTNPDGSNYYYYVPLIGISVG-----TKRLNVPVSSFAIGRDGSGGT 313
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG--SQLCYKTPSMAGIA------ 303
F+D+G +LP L+E + A+KL G +LC++ P G A
Sbjct: 314 FVDSGDTTGMLPSVVMEALKEAMVEAVKLPVVNATDHGYEYELCFQLPRNGGGAVETAVQ 373
Query: 304 -PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG-IFGNFAQSDLFIGYDF 361
P L HFDGGA + L+ +++ G C + G G I GN+ Q ++ + +D
Sbjct: 374 VPPLVYHFDGGAAM-LLRRDSYMVEVSAGRMCLVIS--SGARGAIIGNYQQQNMHVLFDV 430
Query: 362 DSQMVSFKPTDCTK 375
++ SF PT C +
Sbjct: 431 ENHEFSFAPTQCNQ 444
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 132/374 (35%), Positives = 185/374 (49%), Gaps = 33/374 (8%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
V ++GEY+M+ IGTP I+DTGSDL+W QC PC+ C Q P ++PA S++Y+
Sbjct: 83 VLASDGEYLMEMGIGTPTRY-YSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYR 141
Query: 77 ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF--FDNVVF 134
L C S C+ L C Q++C Y Y Y DS+ T GVLA E TFG + + F
Sbjct: 142 SLGCASPACNALYYPLC-YQKVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISF 200
Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
GCG+ N G N G+VG GR LSL +SQLG+ +FSYCL F S + S++YFG
Sbjct: 201 GCGNLNAGSL-ANGSGMVGFGRGSLSL----VSQLGSPRFSYCLTSFL--SPVPSRLYFG 253
Query: 195 -----NGSEVSGGGVVSTS-LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
N + S V ST +V+ T YF+ + GISVG L+P + AI+
Sbjct: 254 VYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGG-----YLLPIDPAVFAIND 308
Query: 249 ----GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYK---TPSM 299
G ID+G T L + Y+ + + I L P + S L C++ P
Sbjct: 309 TDGTGGTIIDSGTTITYLAEPAYDAVRAAFASQITL-PLLNVTDASVLDTCFQWPPPPRQ 367
Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
+ P L HFDG + + P G C AM D I G++ + + Y
Sbjct: 368 SVTLPQLVLHFDGADWELPLQNYMLVDPSTGGGLCLAMA-SSSDGSIIGSYQHQNFNVLY 426
Query: 360 DFDSQMVSFKPTDC 373
D ++ ++SF P C
Sbjct: 427 DLENSLMSFVPAPC 440
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 125/387 (32%), Positives = 187/387 (48%), Gaps = 26/387 (6%)
Query: 12 VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
++S S GEY + +GTPP ++ I+DTGSDL W+QC PC C++Q Y P
Sbjct: 159 TLESGASLGTGEYFLDMFVGTPPK-HVWLILDTGSDLSWIQCDPCYDCFEQNGSHYYPKD 217
Query: 72 SSSYKELSCQSEQCHLLDTVS----CSSQ-QLCNYTYGYADSSLTKGVLATE----RITF 122
SS+Y+ +SC +C L+ + C ++ Q C Y Y YAD S T G A+E +T+
Sbjct: 218 SSTYRNISCYDPRCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTW 277
Query: 123 GNSNNFFDNVV---FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
N F VV FGCGH N G F GL+GLGR +S SQI S G + FSYCL
Sbjct: 278 PNGKEKFKQVVDVMFGCGHWNKGFFY-GASGLLGLGRGPISFPSQIQSIYG-HSFSYCLT 335
Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKE---DKTYYFVTLEGISV-GNLSNSS 234
+++S++SK+ FG E+ ++ T+L++ E D+T+Y++ ++ I V G + + S
Sbjct: 336 DLFSNTSVSSKLIFGEDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLDIS 395
Query: 235 KLIPYYNSSGAISKGNM--FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL 292
+ +++S GA + ID+G+ T P Y+ ++E IKL
Sbjct: 396 EQTWHWSSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQIAADDFVMSP 455
Query: 293 CYKTPS--MAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ--PIDGDVGIFG 348
CY M P HF G + F + V C A+ P + I G
Sbjct: 456 CYNVSGAMMQVELPDFGIHFADGGVWNFPAENYFYQYEPDEVICLAIMKTPNHSHLTIIG 515
Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDCTK 375
N Q + I YD + + P C +
Sbjct: 516 NLLQQNFHILYDVKRSRLGYSPRRCAE 542
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 130/373 (34%), Positives = 186/373 (49%), Gaps = 30/373 (8%)
Query: 14 QSNVSTANGEYVMKFSIGTPPL-LDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
QS V NGEY+M ++G+PP D+ IVDTGSDL WVQCLPC CY+Q P ++P+ S
Sbjct: 29 QSPVKAGNGEYLMTLTLGSPPQSFDV--IVDTGSDLNWVQCLPCRVCYQQPGPKFDPSKS 86
Query: 73 SSYKELSCQSEQCHL--LDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGN--SNNF 128
S+++ +C C++ L +C++ +C Y Y Y D S T G LA E I+ N
Sbjct: 87 RSFRKAACTDNLCNVSALPLKACAA-NVCQYQYTYGDQSNTNGDLAFETISLNNGAGTQS 145
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
N FGCG N G F GLVGLG+ LSL SQ LS ANKFSYCLV ++ S+
Sbjct: 146 VPNFAFGCGTQNLGTF-AGAAGLVGLGQGPLSLNSQ-LSHTFANKFSYCLVSLNSLSA-- 201
Query: 189 SKMYFGNGSEVSGGGVVSTSL-VSKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGA 245
S + F GS + + TS+ V+ TYY+V L I VG L+ + + S+G
Sbjct: 202 SPLTF--GSIAAAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLAPSVFAIDQSTG- 258
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-- 303
+G ID+G T+L Y+ + + + G LC+ ++AG++
Sbjct: 259 --RGGTIIDSGTTITMLTLPAYSAVLRAYESFVNYPRLDGSAYGLDLCF---NIAGVSNP 313
Query: 304 --PILTAHFDGGAKVPLIHTSTFIPPPVEG-VFCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
P + F GA + + F+ C AM G I GN Q + + YD
Sbjct: 314 SVPDMVFKFQ-GADFQMRGENLFVLVDTSATTLCLAMGGSQG-FSIIGNIQQQNHLVVYD 371
Query: 361 FDSQMVSFKPTDC 373
+++ + F DC
Sbjct: 372 LEAKKIGFATADC 384
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 125/368 (33%), Positives = 174/368 (47%), Gaps = 21/368 (5%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S +S +GEY + +GTP +Y ++DTGSD++W+QC PC +CY Q PI++P S
Sbjct: 131 VVSGLSQGSGEYFTRLGVGTPARY-VYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKS 189
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
+Y + C S C LD+ C++++ C Y Y D S T G +TE +TF N
Sbjct: 190 KTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTF--RRNRVKG 247
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
V GCGH+N G+F L +LS Q + KFSYCLV + SS S +
Sbjct: 248 VALGCGHDNEGLFVGAAGLLGLGK-GKLSFPGQTGHRFN-QKFSYCLVD-RSASSKPSSV 304
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS----GAIS 247
FGN + VS + L + + T+Y+V L GISVG +P +S I
Sbjct: 305 VFGNAA-VSRIARFTPLLSNPKLDTFYYVGLLGISVGGTR-----VPGVTASLFKLDQIG 358
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGI-API 305
G + ID+G T L + Y + + R K T + P C+ +M + P
Sbjct: 359 NGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAK-TLKRAPNFSLFDTCFDLSNMNEVKVPT 417
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
+ HF A V L T+ IP G FCFA G + I GN Q + YD S
Sbjct: 418 VVLHFR-RADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSR 476
Query: 366 VSFKPTDC 373
V F P C
Sbjct: 477 VGFAPGGC 484
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 115/363 (31%), Positives = 171/363 (47%), Gaps = 14/363 (3%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S ++ +GEY ++ +G+PP + Y ++D+GSD++WVQC PC QCY Q P+++PA S
Sbjct: 131 VVSGMNQGSGEYFIRIGVGSPPR-EQYVVIDSGSDIVWVQCQPCTQCYHQTDPVFDPADS 189
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
+S+ + C S C ++ C + C Y Y D S TKG LA E +TFG + NV
Sbjct: 190 ASFMGVPCSSSVCERIENAGCHAGG-CRYEVMYGDGSYTKGTLALETLTFGRT--VVRNV 246
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH N G+F L G + +SL Q+ Q G FSYCLV TDS+ +
Sbjct: 247 AIGCGHRNRGMFVGAAGLLGLGGGS-MSLVGQLGGQTGG-AFSYCLVSRGTDSA--GSLE 302
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNS-SKLIPYYNSSGAISKGNM 251
FG G+ G + + + ++Y++ L G+ VG + S+ + N G G +
Sbjct: 303 FGRGAMPVGAAWIPL-IRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMG---NGGV 358
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHF 310
+DTG T +P Y + P CY + P ++ +F
Sbjct: 359 VMDTGTAVTRIPTVAYVAFRDAFIGQTGNLPRASGVSIFDTCYNLNGFVSVRVPTVSFYF 418
Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
GG + L + IP G FCFA + I GN Q + I +D + V F P
Sbjct: 419 AGGPILTLPARNFLIPVDDVGTFCFAFAASPSGLSIIGNIQQEGIQISFDGANGFVGFGP 478
Query: 371 TDC 373
C
Sbjct: 479 NVC 481
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 120/396 (30%), Positives = 183/396 (46%), Gaps = 46/396 (11%)
Query: 12 VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
V ++ + A GEY++K IGTPP +DT SDL+W QC PC CY QV P++NP
Sbjct: 77 VAETPIMPAGGEYLVKLGIGTPPY-KFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRV 135
Query: 72 SSSYKELSCQSEQCHLLDTVSC--SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
SS+Y L C S+ C LD C + C YTY Y+ ++ T+G LA +++ G + F
Sbjct: 136 SSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVIG--EDAF 193
Query: 130 DNVVFGCGHNNT-GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
V FGC ++T G G+VGLGR LSL +SQL +F+YCL P S I
Sbjct: 194 RGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSL----VSQLSVRRFAYCLPP--PASRIP 247
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKED---KTYYFVTLEGISVGNLSNSSKLIPYYN---- 241
K+ G ++ + ++ + D +YY++ L+G+ +G+ + S
Sbjct: 248 GKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRAMSLPPTTTTTATAT 307
Query: 242 ------------SSGAISKGN-----MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ 284
++ A++ G+ M ID + T L Y+ L + I+L
Sbjct: 308 ATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEVEIRLPRGT 367
Query: 285 DPRLGSQLCYKTPSMAGIA------PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ 338
LG LC+ P G+A P + FD G + L F G+ C +
Sbjct: 368 GSSLGLDLCFILPD--GVAFDRVYVPAVALAFD-GRWLRLDKARLFAEDRESGMMCLMVG 424
Query: 339 PID-GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ G V I GNF Q ++ + Y+ V+F + C
Sbjct: 425 RAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 460
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 117/366 (31%), Positives = 170/366 (46%), Gaps = 20/366 (5%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S + +GEY ++ +G+PP Y ++D+GSD++WVQC PC QCY Q P+++PA S
Sbjct: 32 VVSGMDQGSGEYFVRIGVGSPPRSQ-YMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADS 90
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
+S+ +SC S C +D C+S + C Y Y D S TKG LA E +T G + NV
Sbjct: 91 ASFMGVSCSSAVCDQVDNAGCNSGR-CRYEVSYGDGSSTKGTLALETLTLGRT--VVQNV 147
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH N G+F L G + +S Q LS+ N FSYCLV T+ S +
Sbjct: 148 AIGCGHMNQGMFVGAAGLLGLGGGS-MSFVGQ-LSRERGNAFSYCLVSRVTN----SNGF 201
Query: 193 FGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK--- 248
GSE G L+ +YY++ L G+ VG++ +P +++
Sbjct: 202 LEFGSEAMPVGAAWIPLIRNPHSPSYYYIGLSGLGVGDMK-----VPISEDIFELTELGN 256
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILT 307
G + +DTG T P Y + + P CY + P ++
Sbjct: 257 GGVVMDTGTAVTRFPTVAYEAFRDAFIDQTGNLPRASGVSIFDTCYNLFGFLSVRVPTVS 316
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
+F GG + L + IP G FCFA P + I GN Q + I D ++ V
Sbjct: 317 FYFSGGPILTLPANNFLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVDGANEFVG 376
Query: 368 FKPTDC 373
F P C
Sbjct: 377 FGPNVC 382
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 120/396 (30%), Positives = 183/396 (46%), Gaps = 46/396 (11%)
Query: 12 VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
V ++ + A GEY++K IGTPP +DT SDL+W QC PC CY QV P++NP
Sbjct: 77 VAETPIMPAGGEYLVKLGIGTPPY-KFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRV 135
Query: 72 SSSYKELSCQSEQCHLLDTVSC--SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
SS+Y L C S+ C LD C + C YTY Y+ ++ T+G LA +++ G + F
Sbjct: 136 SSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVIG--EDAF 193
Query: 130 DNVVFGCGHNNT-GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
V FGC ++T G G+VGLGR LSL +SQL +F+YCL P S I
Sbjct: 194 RGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSL----VSQLSVRRFAYCLPP--PASRIP 247
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKED---KTYYFVTLEGISVGNLSNSSKLIPYYN---- 241
K+ G ++ + ++ + D +YY++ L+G+ +G+ + S
Sbjct: 248 GKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRTMSLPPTTTTTATAT 307
Query: 242 ------------SSGAISKGN-----MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ 284
++ A++ G+ M ID + T L Y+ L + I+L
Sbjct: 308 ATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEVEIRLPRGT 367
Query: 285 DPRLGSQLCYKTPSMAGIA------PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ 338
LG LC+ P G+A P + FD G + L F G+ C +
Sbjct: 368 GSSLGLDLCFILPD--GVAFDRVYVPAVALAFD-GRWLRLDKARLFAEDRESGMMCLMVG 424
Query: 339 PID-GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ G V I GNF Q ++ + Y+ V+F + C
Sbjct: 425 RAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 460
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 130/394 (32%), Positives = 194/394 (49%), Gaps = 44/394 (11%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
+N + + + EY+M+ +IGTPP+ + DTGSDL W QC PC C+ Q PIY+
Sbjct: 81 SNAGPARLRSGQAEYLMELAIGTPPV-PFVALADTGSDLTWTQCKPCKLCFPQDTPIYDT 139
Query: 70 ASSSSYKELSCQSEQCHLL----DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNS 125
A+S+S+ + C S C + + ++ C Y Y Y D + + GVL TE +TF S
Sbjct: 140 AASASFSPVPCASATCLPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGS 199
Query: 126 NN-------FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL 178
+ V FGCG +N G+ + N G VGLGR LSL ++QLG KFSYCL
Sbjct: 200 SPGAPGPGVSVGGVAFGCGVDNGGL-SYNSTGTVGLGRGSLSL----VAQLGVGKFSYCL 254
Query: 179 VPFHTDSSITSKMYFGNGSE------VSGGGVVSTSLVSKE-DKTYYFVTLEGISVGNLS 231
F ++S+ S + FG+ +E + G V ST LV + + Y+V+LEGIS+G+
Sbjct: 255 TDFF-NTSLGSPVLFGSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGD-- 311
Query: 232 NSSKLIPYYNSSGAIS---KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRL 288
+P N + + G M +D+G T+L + + + V + L
Sbjct: 312 ---ARLPIPNGTFDLRDDGSGGMIVDSGTIFTVLVESAFRVVVNHVAGVLNQPVVNASSL 368
Query: 289 GSQLCYKTPSMAGI-----APILTAHFDGGAKVPLIHTSTFIPPPVE-GVFCFAMQPIDG 342
S C+ P+ AG P + HF GGA + L H ++ E FC +
Sbjct: 369 DSP-CF--PATAGEQQLPDMPDMLLHFAGGADMRL-HRDNYMSFNQESSSFCLNIAGAPS 424
Query: 343 DVG-IFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
G I GNF Q ++ + +D +SF PTDC+K
Sbjct: 425 AYGSILGNFQQQNIQMLFDITVGQLSFVPTDCSK 458
>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 131/383 (34%), Positives = 201/383 (52%), Gaps = 35/383 (9%)
Query: 10 NNVVQSNVSTA-------NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQ 62
N+++ ++++ A NG+++MK SIG PP ++ V TGSDL+W+ CL C
Sbjct: 77 NDLISNSITAAEFPSILDNGDFLMKISIGIPPT-ELLVNVATGSDLVWIPCLSFKPCTHN 135
Query: 63 VK-PIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYG-YADSSLTKGVLATERI 120
++P SS+YK + C S +C + + +C C Y+ S G LA + +
Sbjct: 136 CDLRFFDPMESSTYKNVPCDSYRCQITNAATCQFSD-CFYSCDPRHQDSCPDGDLAMDTL 194
Query: 121 TFGNSNN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
T ++ N F CG+ G + +G++GLG LSL ++I S L KFS+C
Sbjct: 195 TLNSTTGKSFMLPNTGFICGNRIGGDYPG--VGILGLGHGSLSLLNRI-SHLIDGKFSHC 251
Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLI 237
+VP+ ++ TSK+ FG+ + VSG + ST L Y ++ GISVGN S S+ I
Sbjct: 252 IVPYSSNQ--TSKLSFGDKAVVSGSAMFSTRLDMTGGPYSYTLSFYGISVGNKSISAGGI 309
Query: 238 --PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTP-YQDPRLGSQLCY 294
YY + + G MF T P+ FY++LE VR AI+ P Y DP +LCY
Sbjct: 310 GSDYYMNGLGMDSGTMF-------TYFPEYFYSQLEYDVRYAIQQEPLYPDPTRRLRLCY 362
Query: 295 K-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDV-GIFGNFAQ 352
+ +P + P +T HF+GG+ V L +++FI E + C A + +FG + Q
Sbjct: 363 RYSPDFS--PPTITMHFEGGS-VELSSSNSFIRM-TEDIVCLAFATSSSEQDAVFGYWQQ 418
Query: 353 SDLFIGYDFDSQMVSFKPTDCTK 375
++L IGYD D+ +SF TDCTK
Sbjct: 419 TNLLIGYDLDAGFLSFLKTDCTK 441
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 124/380 (32%), Positives = 189/380 (49%), Gaps = 30/380 (7%)
Query: 7 FYPNNVVQSNVSTANGE-YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP 65
F + + + V+ G+ +++ FS+G PP+ + GI DTGSDL+WVQC PC C++Q P
Sbjct: 73 FITDEIQANMVADDRGQAFLVNFSVGRPPVPQLVGI-DTGSDLLWVQCRPCADCFRQSTP 131
Query: 66 IYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNS 125
I++P+ SS+Y +LS S C + C Y YAD S + G LATE I F S
Sbjct: 132 IFDPSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETS 191
Query: 126 NN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
+ +VVFGCGH+N G F+ + G++GL S I+S+LG+ +FSYC+
Sbjct: 192 DQGTVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQS----IVSRLGS-RFSYCIGDLF 246
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
+++ G+G ++ G + +Y+VTLEGISVG + P
Sbjct: 247 DPHYTHNQLVLGDGVKMEGSSTPFHTF-----NGFYYVTLEGISVG--ETRLDINPEVFQ 299
Query: 243 SGAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTPYQDPRLGSQLCYK--- 295
+G + +D+G T L KD + N ++ VR + Y+ + LCYK
Sbjct: 300 RTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYR--TIPGWLCYKGRV 357
Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFA-MQPIDGDVG-IFGNFAQS 353
+ G P L HF GA + L S F+ + VFC A ++ ++G + G AQ
Sbjct: 358 NEDLRGF-PELAFHFAEGADLVLDANSLFVQKN-QDVFCLAVLESNLKNIGSVIGIMAQQ 415
Query: 354 DLFIGYDFDSQMVSFKPTDC 373
+ YD + V F+ TDC
Sbjct: 416 HYNVAYDLIGKRVYFQRTDC 435
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 125/375 (33%), Positives = 189/375 (50%), Gaps = 31/375 (8%)
Query: 13 VQSN-VSTANGE-YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
+Q+N V+ G+ +++ FS+G PP+ + GI DTGSDL+WVQC PC C++Q PI++P+
Sbjct: 46 IQANMVADDRGQAFLVNFSVGRPPVPQLVGI-DTGSDLLWVQCRPCADCFRQSTPIFDPS 104
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN--- 127
SS+Y +LS S C + C Y YAD S + G LATE I F S+
Sbjct: 105 KSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTV 164
Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
+VVFGCGH+N G F+ + G++GL S I+S+LG+ +FSYC+
Sbjct: 165 TVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQS----IVSRLGS-RFSYCIGDLFDPHYT 219
Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
+++ G+G ++ G + +Y+VTLEGISVG + P
Sbjct: 220 HNQLVLGDGVKMEGSSTPFHTF-----NGFYYVTLEGISVG--ETRLDINPEVFQRTESG 272
Query: 248 KGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTPYQDPRLGSQLCYK---TPSMA 300
+G + +D+G T L KD + N ++ VR + Y+ + LCYK +
Sbjct: 273 QGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYR--TIPGWLCYKGRVNEDLR 330
Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID-GDVG-IFGNFAQSDLFIG 358
G P L HF GA + L S F+ + VFC A+ + ++G + G AQ +
Sbjct: 331 GF-PELAFHFAEGADLVLDANSLFVQKN-QDVFCLAVLESNLKNIGSVIGIMAQQHYNVA 388
Query: 359 YDFDSQMVSFKPTDC 373
YD + V F+ TDC
Sbjct: 389 YDLIGKRVYFQRTDC 403
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 124/380 (32%), Positives = 189/380 (49%), Gaps = 30/380 (7%)
Query: 7 FYPNNVVQSNVSTANGE-YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP 65
F + + + V+ G+ +++ FS+G PP+ + GI DTGSDL+WVQC PC C++Q P
Sbjct: 41 FITDEIQANMVADDRGQAFLVNFSVGRPPVPQLVGI-DTGSDLLWVQCRPCADCFRQSTP 99
Query: 66 IYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNS 125
I++P+ SS+Y +LS S C + C Y YAD S + G LATE I F S
Sbjct: 100 IFDPSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETS 159
Query: 126 NN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
+ +VVFGCGH+N G F+ + G++GL S I+S+LG+ +FSYC+
Sbjct: 160 DQGTVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQS----IVSRLGS-RFSYCIGDLF 214
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
+++ G+G ++ G + +Y+VTLEGISVG + P
Sbjct: 215 DPHYTHNQLVLGDGVKMEGSSTPFHTF-----NGFYYVTLEGISVG--ETRLDINPEVFQ 267
Query: 243 SGAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTPYQDPRLGSQLCYK--- 295
+G + +D+G T L KD + N ++ VR + Y+ + LCYK
Sbjct: 268 RTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYR--TIPGWLCYKGRV 325
Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID-GDVG-IFGNFAQS 353
+ G P L HF GA + L S F+ + VFC A+ + ++G + G AQ
Sbjct: 326 NEDLRGF-PELAFHFAEGADLVLDANSLFVQKN-QDVFCLAVLESNLKNIGSVIGIMAQQ 383
Query: 354 DLFIGYDFDSQMVSFKPTDC 373
+ YD + V F+ TDC
Sbjct: 384 HYNVAYDLIGKRVYFQRTDC 403
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 116/346 (33%), Positives = 165/346 (47%), Gaps = 18/346 (5%)
Query: 41 IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQ-LC 99
++DTGSD++WVQC PC +CY+Q P+++P SSSY + C + C LD+ C ++ C
Sbjct: 2 VLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLRRGAC 61
Query: 100 NYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRL 159
Y Y D S+T G TE +TF V GCGH+N G+F L+GLGR L
Sbjct: 62 MYQVAYGDGSVTAGDFVTETLTFAGGAR-VARVALGCGHDNEGLFVAAAG-LLGLGRGGL 119
Query: 160 SLASQILSQLGANKFSYCLVPFHTD-------SSITSKMYFGNGSEVSGGGVVSTSLVSK 212
S +QI + G FSYCLV + S +S + FG GS + + + +
Sbjct: 120 SFPTQISRRYG-RSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSASFTPMVRNP 178
Query: 213 EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEE 272
+T+Y+V L GISVG + +G + +D+G T L + Y+ L +
Sbjct: 179 RMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRD 238
Query: 273 QVRNA----IKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPP 327
R A ++L+P CY + P ++ HF GGA+ L + IP
Sbjct: 239 AFRAAAAGGLRLSPGGFSLF--DTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPV 296
Query: 328 PVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
G FCFA DG V I GN Q + +D D Q V F P C
Sbjct: 297 DSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 125/367 (34%), Positives = 176/367 (47%), Gaps = 19/367 (5%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S ++ +GEY + +GTPP +Y ++DTGSD++W+QC PC +CY Q P+++P S
Sbjct: 115 VISGLAQGSGEYFTRIGVGTPPRY-VYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKS 173
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
S+ ++C+S CH LD+ C++Q Q C Y Y D S T G +TE +TF +
Sbjct: 174 RSFASIACRSPLCHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRTR--VAR 231
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
V GCGH+N G+F L+GLGR RLS SQ + +KFSYCLV + SS S M
Sbjct: 232 VALGCGHDNEGLFVGAAG-LLGLGRGRLSFPSQTGRRFN-HKFSYCLVD-RSASSKPSSM 288
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS----GAIS 247
FG+ S VS + + + + T+Y+V L GISVG +P +S
Sbjct: 289 VFGD-SAVSRTARFTPLVSNPKLDTFYYVELLGISVGGTR-----VPGITASLFKLDQTG 342
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APIL 306
G + ID+G T L + Y + R C+ + P +
Sbjct: 343 NGGVIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQFSLFDTCFDLSGKTEVKVPTV 402
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
HF GA V L ++ IP G FC A G + I GN Q + YD V
Sbjct: 403 VLHFR-GADVSLPASNYLIPVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVVYDLAGSRV 461
Query: 367 SFKPTDC 373
F P C
Sbjct: 462 GFAPHGC 468
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 122/364 (33%), Positives = 168/364 (46%), Gaps = 22/364 (6%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ +IGTPP + +DTGSDL+W QC PC C+ Q P ++P++SS+ SC S
Sbjct: 81 EYLVHLAIGTPPQ-PVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDS 139
Query: 83 EQCHLLDTVSCSS-----QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
C L SC S Q C YTY Y D S+T G L ++ TF + V FGCG
Sbjct: 140 TLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCG 199
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
N GVF NE G+ G GR LSL SQL FS+C + T +
Sbjct: 200 LFNNGVFKSNETGIAGFGRGPLSLP----SQLKVGNFSHCFTAVNGLKPSTVLLDLPADL 255
Query: 198 EVSGGGVV-STSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN--MFI 253
SG G V ST L+ + T+Y+++L+GI+VG S +P S A+ G I
Sbjct: 256 YKSGRGAVQSTPLIQNPANPTFYYLSLKGITVG-----STRLPVPESEFALKNGTGGTII 310
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG-IAPILTAHFDG 312
D+G T LP Y + + +KL C P A P L HF+G
Sbjct: 311 DSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEG 370
Query: 313 GA-KVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
+P + + + C A+ G+V GNF Q ++ + YD + +SF P
Sbjct: 371 ATMDLPRENYVFEVEDAGSSILCLAIIE-GGEVTTIGNFQQQNMHVLYDLQNSKLSFVPA 429
Query: 372 DCTK 375
C K
Sbjct: 430 QCDK 433
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 167 bits (424), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 129/366 (35%), Positives = 185/366 (50%), Gaps = 17/366 (4%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S ++ +GEY + +GTP +Y ++DTGSD++W+QC PC++CY Q P+++P S
Sbjct: 134 VISGLAQGSGEYFTRLGVGTPARY-VYMVLDTGSDIVWIQCAPCIKCYSQTDPVFDPTKS 192
Query: 73 SSYKELSCQSEQCHLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
S+ + C S C LD CS+ +Q+C Y Y D S T G +TE +TF +
Sbjct: 193 RSFANIPCGSPLCRRLDYPGCSTKKQICLYQVSYGDGSFTVGEFSTETLTFRGTR--VGR 250
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
VV GCGH+N G+F L+GLGR RLS SQI + + KFSYCL + SS S +
Sbjct: 251 VVLGCGHDNEGLFVGAAG-LLGLGRGRLSFPSQIGRRFNS-KFSYCLGD-RSASSRPSSI 307
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAISKG 249
FG+ S +S + L + + T+Y+V L GISVG +S S + +S+G G
Sbjct: 308 VFGD-SAISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTG---NG 363
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQ-VRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILT 307
+ ID+G T L + Y L + + A L + L C+ + P +
Sbjct: 364 GVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAPEFSL-FDTCFDLSGKTEVKVPTVV 422
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
HF GA VPL ++ IP G FCFA + I GN Q + YD + V
Sbjct: 423 LHFR-GADVPLPASNYLIPVDNSGSFCFAFAGTASGLSIIGNIQQQGFRVVYDLATSRVG 481
Query: 368 FKPTDC 373
F P C
Sbjct: 482 FAPRGC 487
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 167 bits (423), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 122/369 (33%), Positives = 177/369 (47%), Gaps = 18/369 (4%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S +S +GEY ++ S+GTPP +Y ++DTGSD++W+QC PCV CY Q I++P S
Sbjct: 47 VVSGLSLGSGEYFIRISVGTPPRR-MYLVMDTGSDILWLQCAPCVNCYHQSDAIFDPYKS 105
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN----F 128
S+Y L C + QC LD +C + + C Y Y D S T G T+ ++ +++
Sbjct: 106 STYSTLGCSTRQCLNLDIGTCQANK-CLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVV 164
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
+ + GCGH+N G F GL+GLG+ LS +Q+ Q G +FSYCL TDS+
Sbjct: 165 LNKIPLGCGHDNEGYF-VGAAGLLGLGKGPLSFPNQVDPQNGG-RFSYCLTDRETDSTEG 222
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
S + FG + G + + T+Y++ + GISVG + + S +
Sbjct: 223 SSLVFGEAAVPPAGARFTPQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQLDS--LGN 280
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGI-AP 304
G + ID+G T L Y L + R T P G L CY +A + P
Sbjct: 281 GGVIIDSGTSVTRLQNAAYASLRDAFRAG---TSDLAPTAGFSLFDTCYDLSGLASVDVP 337
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
+T HF GG + L ++ IP FC A G I GN Q + YD
Sbjct: 338 TVTLHFQGGTDLKLPASNYLIPVDNSNTFCLAFAGTTGP-SIIGNIQQQGFRVIYDNLHN 396
Query: 365 MVSFKPTDC 373
V F P+ C
Sbjct: 397 QVGFVPSQC 405
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 167 bits (423), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 115/384 (29%), Positives = 185/384 (48%), Gaps = 34/384 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S V +GEY ++G PP + ++DTGSDL+W+QC+PC CY+QV P+Y+P SS
Sbjct: 77 VMSGVPFDSGEYFAVINVGDPPTRALV-VIDTGSDLIWLQCVPCRHCYRQVTPLYDPRSS 135
Query: 73 SSYKELSCQSEQCH-LLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
S+++ + C S +C +L C ++ C Y Y D S + G LAT+R+ F + +
Sbjct: 136 STHRRIPCASPRCRDVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDRLVFPDDTHVH- 194
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL-VPFHTDSSITS 189
NV GCGH+N G+ E+ GL+G+GR +LS +Q+ G + FSYCL + +S
Sbjct: 195 NVTLGCGHDNVGLL-ESAAGLLGVGRGQLSFPTQLAPAYG-HVFSYCLGDRLSRAQNGSS 252
Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS-- 247
+ FG E + YY V + G SVG ++ + N+S A++
Sbjct: 253 YLVFGRTPEPPSTAFTPLRTNPRRPSLYY-VDMVGFSVGG----ERVTGFSNASLALNPA 307
Query: 248 --KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL-----CYK----- 295
+G + +D+G + +D Y + + + +L ++ CY
Sbjct: 308 TGRGGIVVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMR-KLATKFSVFDACYDLRGNG 366
Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG-----VFCFAMQPIDGDVGIFGNF 350
P+ A P + HF GGA + L + I PV+G FC +Q D + + GN
Sbjct: 367 APAAAVRVPSIVLHFAGGADMALPQANYLI--PVQGGDRRTYFCLGLQAADDGLNVLGNV 424
Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
Q + +D + + F P C+
Sbjct: 425 QQQGFGLVFDVERGRIGFTPNGCS 448
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 167 bits (422), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 123/368 (33%), Positives = 173/368 (47%), Gaps = 23/368 (6%)
Query: 14 QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASS 72
+S + G YV+ +GTP D+ I DTGSDL W QC PC + CY Q +PI+NP+ S
Sbjct: 128 KSGSTIGTGNYVVTVGLGTPKR-DLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPSKS 186
Query: 73 SSYKELSCQSEQCHLL-----DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
+SY +SC S C L ++ SCS+ C Y Y D S + G A +++ S +
Sbjct: 187 TSYTNISCSSPTCDELKSGTGNSPSCSAST-CVYGIQYGDQSYSVGFFAQDKLAL-TSTD 244
Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
F+N +FGCG NN G+F GL+GLGR LSL SQ + G FSYCL + SS
Sbjct: 245 VFNNFLFGCGQNNRGLF-VGVAGLIGLGRNALSLVSQTAQKYG-KLFSYCL---PSTSSS 299
Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
T + FG+G S + SLV+ + ++YF+ L ISVG S+ S+ S
Sbjct: 300 TGYLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLST-------SASVFS 352
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APIL 306
ID+G + LP Y+ L + + P P CY + P +
Sbjct: 353 TAGTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASILDTCYDFSQYDTVDVPKI 412
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
+F GA++ L + F + V FA D+ I GN Q + YD
Sbjct: 413 NLYFSDGAEMDLDPSGIFYILNISQVCLAFAGNSDATDIAILGNVQQKTFDVVYDVAGGR 472
Query: 366 VSFKPTDC 373
+ F P C
Sbjct: 473 IGFAPGGC 480
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 167 bits (422), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 123/367 (33%), Positives = 174/367 (47%), Gaps = 14/367 (3%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S +S +GEY ++ S+GTPP +Y ++DTGSD++W+QC PCV CY Q +++P S
Sbjct: 26 VISGLSLGSGEYFIRVSVGTPPR-GMYLVMDTGSDILWLQCAPCVSCYHQCDEVFDPYKS 84
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERI----TFGNSNNF 128
S+Y L C S QC LD C + C Y Y D S + G AT+ + T G
Sbjct: 85 STYSTLGCNSRQCLNLDVGGCVGNK-CLYQVDYGDGSFSTGEFATDAVSLNSTSGGGQVV 143
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
+ + GCGH+N G F GL+GLG+ LS +QI S+ G +FSYCL TDS+
Sbjct: 144 LNKIPLGCGHDNEGYF-VGAAGLLGLGKGPLSFPNQINSENGG-RFSYCLTGRDTDSTER 201
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN-SSGAIS 247
S + FG+ + G + + T+Y++ + GISVG S IP ++
Sbjct: 202 SSLIFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVG---GSILTIPTSAFQLDSLG 258
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APIL 306
G + ID+G T L Y L E R CY ++ + P +
Sbjct: 259 NGGVIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSLFDTCYNLSDLSSVDVPTV 318
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
T HF GGA + L ++ +P FC A G I GN Q + YD V
Sbjct: 319 TLHFQGGADLKLPASNYLVPVDNSSTFCLAFAGTTGP-SIIGNIQQQGFRVIYDNLHNQV 377
Query: 367 SFKPTDC 373
F P+ C
Sbjct: 378 GFVPSQC 384
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 121/363 (33%), Positives = 169/363 (46%), Gaps = 20/363 (5%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ +IGTPP + +DTGSDL+W QC PC C+ Q P ++P++SS+ SC S
Sbjct: 81 EYLVHLAIGTPPQ-PVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDS 139
Query: 83 EQCHLLDTVSCSS-----QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
C L SC S Q C YTY Y D S+T G L ++ TF + V FGCG
Sbjct: 140 TLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCG 199
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
N GVF NE G+ G GR LSL SQL FS+C + T +
Sbjct: 200 LFNNGVFKSNETGIAGFGRGPLSLP----SQLKVGNFSHCFTAVNGLKPSTVLLDLPADL 255
Query: 198 EVSGGGVV-STSLVSK-EDKTYYFVTLEGISVGNLSNSSKL-IPYYNSSGAISKGNMFID 254
SG G V ST L+ + T+Y+++L+GI+VG S++L +P + G ID
Sbjct: 256 YKSGRGAVQSTPLIQNPANPTFYYLSLKGITVG----STRLPVPESEFTLKNGTGGTIID 311
Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG-IAPILTAHFDGG 313
+G T LP Y + + +KL C P A P L HF+G
Sbjct: 312 SGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGA 371
Query: 314 A-KVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
+P + + + C A+ G+V GNF Q ++ + YD + +SF P
Sbjct: 372 TMDLPRENYVFEVEDAGSSILCLAIIE-GGEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQ 430
Query: 373 CTK 375
C K
Sbjct: 431 CDK 433
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 129/384 (33%), Positives = 191/384 (49%), Gaps = 28/384 (7%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
++ V+S GEY M +G PP I+DTGSDL W+QC PC C+ Q P+++P
Sbjct: 73 DSTVESGAELGAGEYFMDVFVGNPPR-HFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDP 131
Query: 70 ASSSSYKELSCQSEQCHLL------DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG 123
+ S+S+K + C + C L+ D S +S + C Y Y Y DSS T G LA E ++
Sbjct: 132 SQSTSFKIIPCNAAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVS 191
Query: 124 NSNN----FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
S++ ++V GCGH+N G+ + GL+GLG+ LS SQ+ S FSYCLV
Sbjct: 192 LSDHPSSLEIRDMVIGCGHSNKGL-FQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLV 250
Query: 180 PFHTDSSITSKMYFGNGSEVSG--GGVVSTSLVSKED--KTYYFVTLEGISVGNLSNSSK 235
+ S++S + FG G +S + T V + +T+Y++ ++GI + +
Sbjct: 251 DRTNNLSVSSAISFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQ-----E 305
Query: 236 LIPYYNSSGAIS---KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL 292
L+P AI+ G ID+G T L +D Y +E I P DP +
Sbjct: 306 LLPIPAERFAIATNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARISY-PRADPFDILGI 364
Query: 293 CYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPP-PVEGVFCFAMQPIDGDVGIFGNF 350
CY A + P L+ F GA++ L + FI P P E C A+ P DG + I GNF
Sbjct: 365 CYNATGRAAVPFPALSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDG-MSIIGNF 423
Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
Q ++ YD + F TDC+
Sbjct: 424 QQQNIHFLYDVQHARLGFANTDCS 447
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 112/383 (29%), Positives = 186/383 (48%), Gaps = 33/383 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S + +GEY +GTP + ++DTGSDL+W+QC PC +CY Q +++P S
Sbjct: 75 VFSGIPFESGEYFALVGVGTPSTKAML-VIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRS 133
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQL----CNYTYGYADSSLTKGVLATERITFGNSNNF 128
S+Y+ + C S QC L C S C Y Y D S + G LAT+++ F N + +
Sbjct: 134 STYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFAN-DTY 192
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
+NV GCG +N G+F ++ GL+G+GR ++S+++Q+ G + F YCL + S+ +
Sbjct: 193 VNNVTLGCGRDNEGLF-DSAAGLLGVGRGKISISTQVAPAYG-SVFEYCLGDRTSRSTRS 250
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAI- 246
S + FG E T+L+S + + Y+V + G SVG ++ + N+S A+
Sbjct: 251 SYLVFGRTPEPP--STAFTALLSNPRRPSLYYVDMAGFSVGG----ERVTGFSNASLALD 304
Query: 247 ---SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSM- 299
+G + +D+G + +D Y L + + + + CY
Sbjct: 305 TATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRP 364
Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG--------VFCFAMQPIDGDVGIFGNFA 351
A AP++ HF GGA + L + F+ PV+G C + D + + GN
Sbjct: 365 AASAPLIVLHFAGGADMALPPENYFL--PVDGGRRRAASYRRCLGFEAADDGLSVIGNVQ 422
Query: 352 QSDLFIGYDFDSQMVSFKPTDCT 374
Q + +D + + + F P CT
Sbjct: 423 QQGFRVVFDVEKERIGFAPKGCT 445
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 125/366 (34%), Positives = 175/366 (47%), Gaps = 18/366 (4%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S + +GEY + +GTP + Y ++DTGSD+ W+QC PC +CY Q PI+NP+ S
Sbjct: 146 VVSGMEQGSGEYFTRIGVGTP-TREQYMVLDTGSDVAWIQCEPCRECYSQADPIFNPSYS 204
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
+S+ + C S C LD C S C Y Y D S + G ATE +TFG ++ NV
Sbjct: 205 ASFSTVGCDSAVCSQLDAYDCHSGG-CLYEASYGDGSYSTGSFATETLTFGTTS--VANV 261
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH N G+F L+GLG LS +QI +Q G + FSYCLV +DSS +
Sbjct: 262 AIGCGHKNVGLFIGAAG-LLGLGAGALSFPNQIGTQTG-HTFSYCLVDRESDSS--GPLQ 317
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
FG S V G + + + T+Y++++ ISVG S + G
Sbjct: 318 FGPKS-VPVGSIFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEVFRIDETSGHGGFI 376
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGIA-PILT 307
ID+G T L Y+ VR+A Q PR + CY + ++ P +
Sbjct: 377 IDSGTVVTRLVTSAYD----AVRDAFVAGTGQLPRTDAVSIFDTCYDLSGLQFVSVPTVG 432
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
HF GA + L + IP G FCFA P V I GN Q + + +D + +V
Sbjct: 433 FHFSNGASLILPAKNYLIPMDTVGTFCFAFAPAASSVSIMGNTQQQHIRVSFDSANSLVG 492
Query: 368 FKPTDC 373
F C
Sbjct: 493 FAFDQC 498
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 125/394 (31%), Positives = 179/394 (45%), Gaps = 66/394 (16%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC-----------LPCVQCYK 61
V S V + + EY+M ++G+PP + I DTGSDL+WV+C P Q
Sbjct: 90 VVSKVVSRSFEYLMTVNLGSPPR-SMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQ--- 145
Query: 62 QVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERIT 121
++P+ SS+Y +SCQ++ C L +C C Y Y Y D S T GVL+TE T
Sbjct: 146 -----FDPSRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFT 200
Query: 122 FGNSNN-------FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLG---- 170
F + + V FGC G F + + A +++QLG
Sbjct: 201 FDDGGSGRSPRQVRVGGVKFGCSTATAGSFPADGL------VGLGGGAVSLVTQLGGATS 254
Query: 171 -ANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN 229
+FSYCLVP ++S S + FG ++V+ G ST LV+ + TYY V L+ + VGN
Sbjct: 255 LGRRFSYCLVPHSVNAS--SALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGN 312
Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
+ +S + + +D+G T L + +++ I L P Q P
Sbjct: 313 KTVASA-----------ASSRIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGL 361
Query: 290 SQLCYKTPSMAGIA----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM------QP 339
QLCY A P LT F GGA V L + F+ EG C A+ QP
Sbjct: 362 LQLCYNVAGREVEAGESIPDLTLEFGGGAAVALKPENAFVAVQ-EGTLCLAIVATTEQQP 420
Query: 340 IDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
V I GN AQ ++ +GYD D+ V+F DC
Sbjct: 421 ----VSILGNLAQQNIHVGYDLDAGTVTFAGADC 450
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 119/364 (32%), Positives = 182/364 (50%), Gaps = 20/364 (5%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCY--KQVKPIYNPASSSSYKE 77
GEY+M+ SIGTPP L I ++DTGSDL+W++C C C + I+ +SSSYK+
Sbjct: 1 GEGEYMMELSIGTPPQL-IPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKK 59
Query: 78 LSCQSEQCHLLDT--VSCSSQQLCNYTYGYADSSLTKGVLATERITFG------NSNNFF 129
L C S C + + + ++ C Y Y Y D S T G + ++RI+F + +FF
Sbjct: 60 LPCNSTHCSGMSSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFF 119
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
D +FGCG G +N + GL+GLG+ SL Q+ +LG KFSYCLV + + S S
Sbjct: 120 DGFLFGCGRKLKGDWNFTQ-GLIGLGQKSHSLIQQLGDKLGY-KFSYCLVSYDSPPSAKS 177
Query: 190 KMYFGNGSEVSGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNS--SKLIPYYNSSGA 245
++ G+ + + G VVST ++ + D+T Y+V L+ I+VG + K + S G
Sbjct: 178 FLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGHNTSVGP 237
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-P 304
ID+G TLL Y + + + + L P G LC+ + P
Sbjct: 238 FLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVIL-PTLGNSAGLDLCFNSSGDTSYGFP 296
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
+T +F ++ L + F + V C +M GD+ I GN Q + I YD +
Sbjct: 297 SVTFYFANQVQLVLPFENIFQVTSRD-VVCLSMDSSGGDLSIIGNMQQQNFHILYDLVAS 355
Query: 365 MVSF 368
+SF
Sbjct: 356 QISF 359
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 122/369 (33%), Positives = 182/369 (49%), Gaps = 24/369 (6%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
++S +S +GEY + +GTPP + + DTGSD++W+QCLPC CY Q P++NP+ S
Sbjct: 70 LRSGLSDGSGEYFVSLGVGTPPR-TVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFS 128
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+++ ++C S C L C Q C Y Y D S T G +TE ++FG +N ++V
Sbjct: 129 STFQSITCGSSLCQQLLIRGCRRNQ-CLYQVSYGDGSFTVGEFSTETLSFG--SNAVNSV 185
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGHNN G+F L+GLG+ LS SQ+ QL + FSYCL + S+ +
Sbjct: 186 AIGCGHNNQGLFTGAAG-LLGLGKGLLSFPSQV-GQLYGSVFSYCLPTRESTGSV--PLI 241
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN-- 250
FGN V+ +T L + + T+Y+V + GI VG S S IP + S S GN
Sbjct: 242 FGN-QAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVS---IPAGSLSLDSSTGNGG 297
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-----QLCYKTPSMAGIA-P 304
+ +D+G T L YN + + R + D ++ S CY + I P
Sbjct: 298 VILDSGTAVTRLVTSAYNPMRDAFRAGMP----SDAKMTSGFSLFDTCYDLSGRSSIMLP 353
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
++ F+GGA + L + +P G +C A P + I GN Q + +D
Sbjct: 354 AVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGN 413
Query: 365 MVSFKPTDC 373
V C
Sbjct: 414 RVGIGANQC 422
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 133/379 (35%), Positives = 187/379 (49%), Gaps = 31/379 (8%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
+ VV S +S +GEY M+ +GTP ++Y ++DTGSD++W+QC PC CY Q P++NP
Sbjct: 122 SGVVISGLSQGSGEYFMRLGVGTPAT-NMYMVLDTGSDVVWLQCSPCKVCYNQSDPVFNP 180
Query: 70 ASSSSYKELSCQSEQCHLLDTVS-CSSQQ--LCNYTYGYADSSLTKGVLATERITFGNSN 126
A S ++ + C S C LD S C S++ C Y Y D S T G +TE +TF +
Sbjct: 181 AKSKTFATVPCGSRLCRRLDDSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLTFHGAR 240
Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP---FHT 183
D+V GCGH+N G+F L+GLGR LS SQ ++ KFSYCLV +
Sbjct: 241 --VDHVALGCGHDNEGLFVGAAG-LLGLGRGGLSFPSQTKNRYNG-KFSYCLVDRTSSGS 296
Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
S S + FGNG+ V V + L + + T+Y++ L GISVG +P + S
Sbjct: 297 SSKPPSTIVFGNGA-VPKTAVFTPLLTNPKLDTFYYLQLLGISVGG-----SRVPGVSES 350
Query: 244 ----GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYK 295
A G + ID+G T L + Y L R+A +L + R S C+
Sbjct: 351 QFKLDATGNGGVIIDSGTSVTRLTQSAYVAL----RDAFRLGATRLKRAPSYSLFDTCFD 406
Query: 296 TPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
M + P + HF GG +V L ++ IP +G FCFA G + I GN Q
Sbjct: 407 LSGMTTVKVPTVVFHFTGG-EVSLPASNYLIPVNNQGRFCFAFAGTMGSLSIIGNIQQQG 465
Query: 355 LFIGYDFDSQMVSFKPTDC 373
+ YD V F C
Sbjct: 466 FRVAYDLVGSRVGFLSRAC 484
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 125/369 (33%), Positives = 178/369 (48%), Gaps = 26/369 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQ-CYKQVKPIYNPA 70
S + G YV+ +GTP Y +V DTGSD WVQC PCV CY+Q + +++PA
Sbjct: 169 ASSGRALGTGNYVVTVGLGTP--ASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPA 226
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
SS+Y +SC + C LDT CS C Y Y D S + G A + +T +S +
Sbjct: 227 RSSTYANISCAAPACSDLDTRGCSGGN-CLYGVQYGDGSYSIGFFAMDTLTL-SSYDAVK 284
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
FGCG N G+F E GL+GLGR + SL Q + G F++CL SS T
Sbjct: 285 GFRFGCGERNEGLFGE-AAGLLGLGRGKTSLPVQTYDKYG-GVFAHCLP---ARSSGTGY 339
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP--YYNSSGAISK 248
+ FG GS + G ++T +++ T+Y+V + GI VG S IP + ++G I
Sbjct: 340 LDFGPGSPAAAGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLS---IPQSVFTTAGTI-- 394
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PI 305
+D+G T LP Y+ L +A+ Y+ S L CY M+ +A P
Sbjct: 395 ----VDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPT 450
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
++ F GGA++ + + V V FA GDVGI GN + YD +
Sbjct: 451 VSLLFQGGARLDVDASGIMYAASVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKK 510
Query: 365 MVSFKPTDC 373
+V F P C
Sbjct: 511 VVGFSPGAC 519
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 128/384 (33%), Positives = 191/384 (49%), Gaps = 28/384 (7%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
++ V+S GEY M +G PP + I+DTGSDL W+QC PC C+ Q P+++P
Sbjct: 157 DSTVESGAELGAGEYFMDVFVGNPPRHFLL-IIDTGSDLTWLQCKPCKACFDQSGPVFDP 215
Query: 70 ASSSSYKELSCQSEQCHLL------DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG 123
+ S+S+K + C + C L+ D S +S + C Y Y Y DSS T G LA E ++
Sbjct: 216 SQSTSFKIIPCNAAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVS 275
Query: 124 NSNN----FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
S++ ++V GCGH+N G+ + GL+GLG+ LS SQ+ S FSYCLV
Sbjct: 276 LSDHPSSLEIRDMVIGCGHSNKGL-FQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLV 334
Query: 180 PFHTDSSITSKMYFGNGSEVSG--GGVVSTSLVSKED--KTYYFVTLEGISVGNLSNSSK 235
+ S++S + FG G +S + T V + +T+Y++ ++GI + +
Sbjct: 335 DRTNNLSVSSAISFGAGFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGIKI-----DQE 389
Query: 236 LIPYYNSSGAIS---KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL 292
L+P AI+ G ID+G T L +D Y +E I P DP +
Sbjct: 390 LLPIPAERFAIAPNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARISY-PRADPFDILGI 448
Query: 293 CYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPP-PVEGVFCFAMQPIDGDVGIFGNF 350
CY + P L+ F GA++ L + FI P P E C A+ P DG + I GNF
Sbjct: 449 CYNATGRTAVPFPTLSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDG-MSIIGNF 507
Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
Q ++ YD + F TDC+
Sbjct: 508 QQQNIHFLYDVQHARLGFANTDCS 531
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 121/369 (32%), Positives = 174/369 (47%), Gaps = 29/369 (7%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
+GEY+ K ++GTP + + + DT SDL W+QC PC +CY Q P+++P S+SY+E+S
Sbjct: 135 SGEYIAKIAVGTPGVEALLAL-DTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYREMSF 193
Query: 81 QSEQCHLLDTVSC--SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
+ C L + + C YT GY D S T G E +TF + GCGH
Sbjct: 194 NAADCQALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFIEETLTFAGGVR-LPRISIGCGH 252
Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD-SSITSKMYFGNGS 197
+N G+F G++GLGR +S +QI FSYCLV F + S++S + FG G+
Sbjct: 253 DNKGLFGAPAAGILGLGRGLMSFPNQIDHN---GTFSYCLVDFLSGPGSLSSTLTFGAGA 309
Query: 198 EVSGGGVVST-SLVSKEDKTYYFVTLEGISVGNL------SNSSKLIPYYNSSGAISKGN 250
+ V T ++++ T+Y+V L GISVG + +L PY +G
Sbjct: 310 VDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDLQLDPY------TGRGG 363
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRN-AIKL--TPYQDPRLGSQLCYKTPSMAGI--API 305
+ +D+G T L + Y + R A+ L P CY T G+ P
Sbjct: 364 VIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDTCY-TVGGRGMKKVPT 422
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQSDLFIGYDFDSQ 364
++ HF G +V L + IP G CFA D V I GN Q I YD +
Sbjct: 423 VSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHSVSIIGNIQQQGFRIVYDIGGR 482
Query: 365 MVSFKPTDC 373
V F P C
Sbjct: 483 -VGFAPNSC 490
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 128/383 (33%), Positives = 188/383 (49%), Gaps = 26/383 (6%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
++S S GEY + +GTPP ++ I+DTGSDL W+QC PC C++Q P YNP S
Sbjct: 159 LESGASLGTGEYFIDMFVGTPPK-HVWLILDTGSDLSWIQCDPCYDCFEQNGPHYNPNES 217
Query: 73 SSYKELSCQSEQCHLLDT----VSCSSQ-QLCNYTYGYADSSLTKGVLATE----RITFG 123
SSY+ +SC +C L+ + C ++ Q C Y Y YAD S T G A E +T+
Sbjct: 218 SSYRNISCYDPRCQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWP 277
Query: 124 NSNNFFDNVV---FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
N F +VV FGCGH N G F+ GL+GLGR LS SQ+ S G + FSYCL
Sbjct: 278 NGKEKFKHVVDVMFGCGHWNKGFFH-GAGGLLGLGRGPLSFPSQLQSIYG-HSFSYCLTD 335
Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKE---DKTYYFVTLEGISV-GNLSNSSK 235
+++S++SK+ FG E+ ++ T L++ E D T+Y++ ++ I V G + + +
Sbjct: 336 LFSNTSVSSKLIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPE 395
Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK 295
+++S G G ID+G+ T P Y+ ++E IKL CY
Sbjct: 396 KTWHWSSEGV---GGTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQIAADDFIMSPCYN 452
Query: 296 TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM--QPIDGDVGIFGNFAQ 352
+M P HF GA + F + V C A+ P + I GN Q
Sbjct: 453 VSGAMQVELPDYGIHFADGAVWNFPAENYFYQYEPDEVICLAILKTPNHSHLTIIGNLLQ 512
Query: 353 SDLFIGYDFDSQMVSFKPTDCTK 375
+ I YD + + P C +
Sbjct: 513 QNFHILYDVKRSRLGYSPRRCAE 535
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 111/383 (28%), Positives = 185/383 (48%), Gaps = 33/383 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S + +GEY +GTP + ++DTGSDL+W+QC PC +CY Q +++P S
Sbjct: 75 VFSGIPFESGEYFALVGVGTPSTKAML-VIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRS 133
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQL----CNYTYGYADSSLTKGVLATERITFGNSNNF 128
S+Y+ + C S QC L C S C Y Y D S + G LAT+++ F N + +
Sbjct: 134 STYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGELATDKLAFAN-DTY 192
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
+NV GCG +N G+F ++ GL+G+ R ++S+++Q+ G + F YCL + S+ +
Sbjct: 193 VNNVTLGCGRDNEGLF-DSAAGLLGVARGKISISTQVAPAYG-SVFEYCLGDRTSRSTRS 250
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAI- 246
S + FG E T+L+S + + Y+V + G SVG ++ + N+S A+
Sbjct: 251 SYLVFGRTPEPP--STAFTALLSNPRRPSLYYVDMAGFSVGG----ERVTGFSNASLALD 304
Query: 247 ---SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSM- 299
+G + +D+G + +D Y L + + + + CY
Sbjct: 305 TATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRP 364
Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG--------VFCFAMQPIDGDVGIFGNFA 351
A AP++ HF GGA + L + F+ PV+G C + D + + GN
Sbjct: 365 AASAPLIVLHFAGGADMALPPENYFL--PVDGGRRRAASYRRCLGFEAADDGLSVIGNVQ 422
Query: 352 QSDLFIGYDFDSQMVSFKPTDCT 374
Q + +D + + + F P CT
Sbjct: 423 QQGFRVVFDVEKERIGFAPKGCT 445
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 121/369 (32%), Positives = 182/369 (49%), Gaps = 24/369 (6%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
++S +S +GEY + +GTPP + + DTGSD++W+QCLPC CY Q P++NP+ S
Sbjct: 70 LRSGLSDGSGEYFVSLGVGTPPR-TVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFS 128
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+++ ++C S C L C Q C Y Y D S T G +TE ++FG +N ++V
Sbjct: 129 STFQSITCGSSLCQQLLIRGCRRNQ-CLYQVSYGDGSFTVGEFSTETLSFG--SNAVNSV 185
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGHNN G+F L+GLG+ LS SQ+ QL + FSYCL + S+ +
Sbjct: 186 AIGCGHNNQGLFTGAAG-LLGLGKGLLSFPSQV-GQLYGSVFSYCLPTRESTGSV--PLI 241
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN-- 250
FGN V+ +T L + + T+Y+V + GI VG S + IP + S S GN
Sbjct: 242 FGN-QAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVN---IPAGSLSLDSSTGNGG 297
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-----QLCYKTPSMAGIA-P 304
+ +D+G T L YN + + R + D ++ S CY + I P
Sbjct: 298 VILDSGTAVTRLVTSAYNPMRDAFRAGMP----SDAKMTSGFSLFDTCYDLSGRSSIMLP 353
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
++ F+GGA + L + +P G +C A P + I GN Q + +D
Sbjct: 354 AVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGN 413
Query: 365 MVSFKPTDC 373
V C
Sbjct: 414 RVGIGANQC 422
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 164 bits (415), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 118/360 (32%), Positives = 171/360 (47%), Gaps = 12/360 (3%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
S ++ +G+Y + +GTP +Y + DTGSD+ W+QC PC +CY+Q PI+NP+ SSS
Sbjct: 5 SGIAGGSGDYFARIGVGTP-ARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSS 63
Query: 75 YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
+K L+C S C L CS + C Y Y D S T G +TE ++FG + +V
Sbjct: 64 FKPLACASSICGKLKIKGCSRKNKCMYQVSYGDGSFTVGDFSTETLSFG--EHAVRSVAM 121
Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
GCG NN G+F+ L+GLGR LS SQ + A+ FSYCL +S+I + + FG
Sbjct: 122 GCGRNNQGLFHGAAG-LLGLGRGPLSFPSQTGTSY-ASVFSYCLP--RRESAIAASLVFG 177
Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFID 254
S V + L ++ TYY+V L I V + + P + G+ G + +D
Sbjct: 178 P-SAVPEKARFTKLLPNRRLDTYYYVGLARIRVAG--SPVNIPPDAFAMGSRGTGGVIVD 234
Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM-AGIAPILTAHFDGG 313
+G + L Y L + R+ + L CY SM P + FDGG
Sbjct: 235 SGTAISRLTTPAYTALRDAFRSLVTFPSAPGISL-FDTCYDLSSMKTATLPAVVLDFDGG 293
Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
A +PL + EG +C A P + I GN Q I D + + P C
Sbjct: 294 ASMPLPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 353
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 164 bits (415), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 117/372 (31%), Positives = 179/372 (48%), Gaps = 27/372 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+ V Y++ +GTP D+ + DTGSDL WVQC PC CY+Q P+++P+ S
Sbjct: 127 ARRGVPLGTANYIVSVGLGTPKR-DLLVVFDTGSDLSWVQCKPCDGCYQQHDPLFDPSQS 185
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG-----NSNN 127
++Y + C +++C LD+ SCSS + C Y Y D S T G LA + +T G +S++
Sbjct: 186 TTYSAVPCGAQECRRLDSGSCSSGK-CRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSD 244
Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
VFGCG ++TG+F + + GL GLGR R+SLASQ ++ GA FSYCL SS
Sbjct: 245 QLQEFVFGCGDDDTGLFGKAD-GLFGLGRDRVSLASQAAAKYGAG-FSYCL-----PSSS 297
Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP-YYNSSGAI 246
T++ Y GS + + + ++Y++ L GI V + ++ P + + G +
Sbjct: 298 TAEGYLSLGSAAPPNARFTAMVTRSDTPSFYYLNLVGIKVAG--RTVRVSPAVFRTPGTV 355
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA- 303
ID+G T LP Y L ++ Y+ S L CY +
Sbjct: 356 ------IDSGTVITRLPSRAYAALRSSFAGLMRRYSYKRAPALSILDTCYDFTGRNKVQI 409
Query: 304 PILTAHFDGGAKVPL-IHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFD 362
P + FDGGA + L ++ + FA D + I GN Q + YD
Sbjct: 410 PSVALLFDGGATLNLGFGEVLYVANKSQACLAFASNGDDTSIAILGNMQQKTFAVVYDVA 469
Query: 363 SQMVSFKPTDCT 374
+Q + F C+
Sbjct: 470 NQKIGFGAKGCS 481
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 164 bits (415), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 118/360 (32%), Positives = 171/360 (47%), Gaps = 12/360 (3%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
S ++ +G+Y + +GTP +Y + DTGSD+ W+QC PC +CY+Q PI+NP+ SSS
Sbjct: 72 SGIAGGSGDYFARIGVGTP-ARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSS 130
Query: 75 YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
+K L+C S C L CS + C Y Y D S T G +TE ++FG + +V
Sbjct: 131 FKPLACASSICGKLKIKGCSRKNECMYQVSYGDGSFTVGDFSTETLSFG--EHAVRSVAM 188
Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
GCG NN G+F+ L+GLGR LS SQ + A+ FSYCL +S+I + + FG
Sbjct: 189 GCGRNNQGLFHGAAG-LLGLGRGPLSFPSQTGTSY-ASVFSYCLP--RRESAIAASLVFG 244
Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFID 254
S V + L ++ TYY+V L I V + + P + G+ G + +D
Sbjct: 245 P-SAVPEKARFTKLLPNRRLDTYYYVGLARIRVAG--SPVNIPPDAFAMGSRGTGGVIVD 301
Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM-AGIAPILTAHFDGG 313
+G + L Y L + R+ + L CY SM P + FDGG
Sbjct: 302 SGTAISRLTTPAYTALRDAFRSLVTFPSAPGISL-FDTCYDLSSMKTATLPAVVLDFDGG 360
Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
A +PL + EG +C A P + I GN Q I D + + P C
Sbjct: 361 ASMPLPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 420
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 164 bits (414), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 117/364 (32%), Positives = 181/364 (49%), Gaps = 20/364 (5%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCY--KQVKPIYNPASSSSYKE 77
GEY+M+ SIGTPP L I ++DTGSDL+W++C C C + I+ +SSSYK+
Sbjct: 1 GEGEYMMELSIGTPPQL-IPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKK 59
Query: 78 LSCQSEQCHLLDT--VSCSSQQLCNYTYGYADSSLTKGVLATERITFG------NSNNFF 129
L C S C + + + ++ C Y Y Y D S T G + ++RI+F + +FF
Sbjct: 60 LPCNSTHCSGMSSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFF 119
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
D +FGC G +N + GL+GLG+ SL Q+ +LG KFSYCLV + + S S
Sbjct: 120 DGFLFGCARKLKGDWNFTQ-GLIGLGQKSHSLIQQLGDKLGY-KFSYCLVSYDSPPSAKS 177
Query: 190 KMYFGNGSEVSGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNS--SKLIPYYNSSGA 245
++ G+ + + G VVST ++ + D+T Y+V L+ I++G + K + S G
Sbjct: 178 FLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYDKESGHNTSVGP 237
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-P 304
ID+G TLL Y + + + + L P G LC+ + P
Sbjct: 238 FLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVIL-PTLGNSAGLDLCFNSSGDTSYGFP 296
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
+T +F ++ L + F + V C +M GD+ I GN Q + I YD +
Sbjct: 297 SVTFYFANQVQLVLPFENIFQVTSRD-VVCLSMDSSGGDLSIIGNMQQQNFHILYDLVAS 355
Query: 365 MVSF 368
+SF
Sbjct: 356 QISF 359
>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
Length = 453
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 118/379 (31%), Positives = 174/379 (45%), Gaps = 42/379 (11%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
GEY++K IGTP +DT SDL+W+QC PCV CY+Q+ PI+NP SSSY + C
Sbjct: 86 GEYLVKLGIGTPQHY-FSAAIDTASDLVWLQCQPCVSCYRQLDPIFNPRLSSSYAVVPCS 144
Query: 82 SEQCHLLDTVSCSS--QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
S+ C LD C Q C Y Y Y+ +++T G LA +++ G N F VV GC +
Sbjct: 145 SDTCSQLDGHRCDEDDDQACRYNYKYSGNAVTNGTLAIDKLAVG--GNVFHAVVLGCSDS 202
Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
+ G GLVGL R LSL LSQL +F YCL P S K+ G G+
Sbjct: 203 SVGGPPPQASGLVGLARGPLSL----LSQLSVRRFMYCLPPPM--SRTPGKLVLGAGAGA 256
Query: 200 SGGGVVSTSLV-----SKEDKTYYFVTLEGISVGNLSNSSKLIP-------------YYN 241
VS + S +YY++ +G++VG+ + + P +
Sbjct: 257 DAVRNVSDRVTVTMSSSTRYPSYYYLNFDGLAVGDQTPGTIRRPTSPPATGGGVGGGGGD 316
Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP--RLGSQLCYKTPSM 299
+ M +D + + L Y+ L + + I+L P P RLG LC+ P
Sbjct: 317 GGSGANAYGMIVDVASTISFLEASLYDELADDLEEEIRL-PRATPSTRLGLDLCFILPEG 375
Query: 300 AGI----APILTAHFDGGAKVPLIHTSTFIPPPVEG-VFCFAMQPIDGDVGIFGNFAQSD 354
GI P ++ FD G + L F+ +G + C + G V I GN+ Q +
Sbjct: 376 VGIDRVYVPTVSMSFD-GRWLELERDRLFLE---DGRMMCLMIGRTSG-VSILGNYQQQN 430
Query: 355 LFIGYDFDSQMVSFKPTDC 373
+ + Y+ ++F C
Sbjct: 431 MHVLYNLRRGKITFAKASC 449
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 125/368 (33%), Positives = 181/368 (49%), Gaps = 21/368 (5%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S ++ +GEY + +GTP ++ ++DTGSD++W+QC PC +CY Q P++NP S
Sbjct: 136 VTSGLAQGSGEYFTRLGVGTPARY-VFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKS 194
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
S+ + C S C LD+ CS+++ +C Y Y D S T G +TE +TF +
Sbjct: 195 RSFANIPCGSPLCRRLDSPGCSTKKHICLYQVSYGDGSFTYGEFSTETLTFRGTR--VGR 252
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
V GCGH+N G+F L+GLGR RLS SQI + + KFSYCLV + SS S M
Sbjct: 253 VALGCGHDNEGLFIGAAG-LLGLGRGRLSFPSQIGRRF-SRKFSYCLVD-RSASSKPSYM 309
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS----GAIS 247
FG+ S +S + + + + T+Y+V L G+SVG +P +S +
Sbjct: 310 VFGD-SAISRTARFTPLVSNPKLDTFYYVELLGVSVGGTR-----VPGITASLFKLDSTG 363
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVR-NAIKLTPYQDPRLGSQLCYKTPSMAGI-API 305
G + ID+G T L + Y L + R A L + L C+ + P
Sbjct: 364 NGGVIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEFSL-FDTCFDLSGKTEVKVPT 422
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
+ HF GA V L ++ IP G FCFA + I GN Q + YD +
Sbjct: 423 VVLHFR-GADVSLPASNYLIPVDNSGSFCFAFAGTMSGLSIVGNIQQQGFRVVYDLAASR 481
Query: 366 VSFKPTDC 373
V F P C
Sbjct: 482 VGFAPRGC 489
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 113/368 (30%), Positives = 172/368 (46%), Gaps = 17/368 (4%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S +S +GEY ++ +G+PP + Y +VD+GSD++W+QC PC +CY+Q P+++PA+S
Sbjct: 122 VVSGISEGSGEYFVRVGVGSPPT-EQYLVVDSGSDVIWIQCRPCAECYQQADPLFDPAAS 180
Query: 73 SSYKELSCQSEQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
+S+ + C S C L + C+ C Y Y D S T+GVLA E +TFG+S
Sbjct: 181 ASFTAVPCDSGVCRTLPGGSSGCADSGACRYQVSYGDGSYTQGVLAMETLTFGDSTP-VQ 239
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
V GCGH N G+F GL+GLG +SL Q L FSYCL D+ S
Sbjct: 240 GVAIGCGHRNRGLF-VGAAGLLGLGWGPMSLVGQ-LGGAAGGAFSYCLASRGADAGAGS- 296
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK-- 248
+ FG + G V L + + ++Y+V L L + +P + +++
Sbjct: 297 LVFGRDDAMPVGAVWVPLLRNAQQPSFYYVGLT-----GLGVGGERLPLQDGLFDLTEDG 351
Query: 249 -GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG-SQLCYKTPSMAGI-API 305
G + +DTG T LP D Y L + + I + P + CY A + P
Sbjct: 352 GGGVVMDTGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGVSLLDTCYDLSGYASVRVPT 411
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
+ +F + + GV+C A + I GN Q + I D +
Sbjct: 412 VALYFGRDGAALTLPARNLLVEMGGGVYCLAFAASASGLSILGNIQQQGIQITVDSANGY 471
Query: 366 VSFKPTDC 373
V F P+ C
Sbjct: 472 VGFGPSTC 479
>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
Length = 453
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 127/382 (33%), Positives = 184/382 (48%), Gaps = 49/382 (12%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSC 80
GEY+M +IGTPP I DTGSDL+W QC PC +C+KQ P+YNP+SS +++ L C
Sbjct: 90 GEYIMTLAIGTPPQ-SYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPC 148
Query: 81 QSE--QCHLLDTVSCSSQQ---LCNYTYGYADSSLTKGVLATERITFGNS---NNFFDNV 132
S C ++ ++ C Y Y + T G+ +E TFG+S +
Sbjct: 149 SSALNLCAAEARLAGATPPPGCACRYNQTYG-TGWTSGLQGSETFTFGSSPADQVRVPGI 207
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGC + ++ +N + + ++SQL A FSYCL PF D+ S +
Sbjct: 208 AFGCSNASSDDWNGSAGLV-----GLGRGGLSLVSQLAAGMFSYCLTPFQ-DTKSKSTLL 261
Query: 193 FG---NGSEVSGGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
G + ++G GV ST V K TYY++ L GISVG + + + P + A
Sbjct: 262 LGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVG--AAALPIPPGAFALRA 319
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPR--LGSQLCYKTPSMA--- 300
G + ID+G T L Y R+ VR+ +KL P D G LC+ PS +
Sbjct: 320 DGTGGLIIDSGTTITSLVDAAYKRVRAAVRSLVKL-PVTDGSNATGLDLCFALPSSSAPP 378
Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVE-------GVFCFAMQP-IDGDVGIFGNFAQ 352
P +T HF GGA + L PVE G++C AM+ DG++ GN+ Q
Sbjct: 379 ATLPSMTLHFGGGADMVL---------PVENYMILDGGMWCLAMRSQTDGELSTLGNYQQ 429
Query: 353 SDLFIGYDFDSQMVSFKPTDCT 374
+L I YD + +SF P C+
Sbjct: 430 QNLHILYDVQKETLSFAPAKCS 451
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 119/359 (33%), Positives = 173/359 (48%), Gaps = 26/359 (7%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSC 80
G YV + +GTP I +VDTGS L W+QC PC V C++Q P+++P +SSSY +SC
Sbjct: 135 GNYVTRMGLGTPAKPYIM-VVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSC 193
Query: 81 QSEQCHLLDT-----VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
+ QC+ L T +CSS +C Y Y DSS + G L+ + ++FG +N N +G
Sbjct: 194 STPQCNDLSTATLNPAACSSSDVCIYQASYGDSSFSVGYLSKDTVSFG--SNSVPNFYYG 251
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
CG +N G+F + GL+GL R +LSL Q+ LG + FSYCL + ++ Y N
Sbjct: 252 CGQDNEGLFGRSA-GLMGLARNKLSLLYQLAPTLGYS-FSYCLPSSSSSGYLSIGSY--N 307
Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
+ S +VS++L D + YF+ L G++V P SS S ID+
Sbjct: 308 PGQYSYTPMVSSTL----DDSLYFIKLSGMTVAG-------KPLAVSSSEYSSLPTIIDS 356
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
G T LP Y+ L + V A+K T D C+ + + P ++ F GGA
Sbjct: 357 GTVITRLPTTVYDALSKAVAGAMKGTKRADAYSILDTCFVGQASSLRVPAVSMAFSGGAA 416
Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ L + + C A P I GN Q + YD S + F CT
Sbjct: 417 LKLSAQNLLVDVD-SSTTCLAFAPAR-SAAIIGNTQQQTFSVVYDVKSNRIGFAAGGCT 473
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 115/360 (31%), Positives = 179/360 (49%), Gaps = 28/360 (7%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
++ SIG PP+ + ++DTGSDL W+QCLPC +CY Q P ++P+ SS+Y+ SC+S
Sbjct: 88 FLANISIGDPPVPQLL-LIDTGSDLTWIQCLPC-KCYPQTIPFFHPSRSSTYRNASCESA 145
Query: 84 QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF---DNVVFGCGHNN 140
+ C Y Y D S T+G+LA E++TF S+ N+VFGCG +N
Sbjct: 146 PHAMPQIFRDEKTGNCRYHLRYRDFSNTRGILAKEKLTFQTSDEGLISKPNIVFGCGQDN 205
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
+G + G++GLG S+ ++ +KFSYC + + + GNG+ +
Sbjct: 206 SGFTQYS--GVLGLGPGTFSIVTRNF----GSKFSYCFGSLIDPTYPHNFLILGNGARIE 259
Query: 201 GGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
G T L +D+ Y++ L+ IS+G L + Y SKG IDTG
Sbjct: 260 GD---PTPLQIFQDR--YYLDLQAISLGEKLLDIEPGIFQRYR-----SKGGTVIDTGCS 309
Query: 259 PTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQLCYKTPSMAGIA--PILTAHFDGGA 314
PT+L ++ Y L E++ + L +D + CY+ + P++T HF GGA
Sbjct: 310 PTILAREAYETLSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLYGFPVVTFHFAGGA 369
Query: 315 KVPLIHTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
++ L S F+ FC AM D+ + G AQ + +GY+ + V F+ TDC
Sbjct: 370 ELALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 429
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis sativus]
Length = 476
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 118/362 (32%), Positives = 162/362 (44%), Gaps = 12/362 (3%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S +GEY ++ +G+PP Y ++D+GSD++WVQC PC +CY+Q P+++PA S
Sbjct: 126 VVSGTEQGSGEYFVRIGVGSPPRSQ-YVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGS 184
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
++Y +SC S C LD C+ + C Y Y D S T+G LA E +TFG N+
Sbjct: 185 ATYAGISCDSSVCDRLDNAGCNDGR-CRYEVSYGDGSYTRGTLALETLTFGRV--LIRNI 241
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH N G+F L+GLG +S Q+ Q G FSYCLV T+S T +
Sbjct: 242 AIGCGHMNRGMFIGAAG-LLGLGGGAMSFVGQLGGQTGG-AFSYCLVSRGTES--TGTLE 297
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
FG G+ G V + YY G + I G G +
Sbjct: 298 FGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLG---YGGVV 354
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFD 311
+DTG T LP Y + P D CY + P ++ +F
Sbjct: 355 MDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFS 414
Query: 312 GGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
GG + L + IP EG FCFA + I GN Q + I D + V F PT
Sbjct: 415 GGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPT 474
Query: 372 DC 373
C
Sbjct: 475 IC 476
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 129/372 (34%), Positives = 173/372 (46%), Gaps = 28/372 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+QS + G Y++ GTP + I+DTGSDL W+QC PC CY QV I+ P S
Sbjct: 126 LQSGTTVGTGNYIVTAGFGTPAKNSLL-IIDTGSDLTWIQCKPCADCYSQVDAIFEPKQS 184
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQL----CNYTYGYADSSLTKGVLATERITFGNSNNF 128
SSYK L C S C L T + C Y Y D S ++G + E +T G+ +
Sbjct: 185 SSYKTLPCLSATCTELITSESNPTPCLLGGCVYEINYGDGSSSQGDFSQETLTLGSDS-- 242
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
F N FGCGH NTG+F + GL+GLG+ LS SQ S+ G +F+YCL P S+ T
Sbjct: 243 FQNFAFGCGHTNTGLF-KGSSGLLGLGQNSLSFPSQSKSKYGG-QFAYCL-PDFGSSTST 299
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
G GS + V T LVS T+YFV L GISVG S IP +
Sbjct: 300 GSFSVGKGSIPA--SAVFTPLVSNFMYPTFYFVGLNGISVGGDRLS---IP----PAVLG 350
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PIL 306
+G+ +D+G T L YN L+ R+ + P P CY + + P +
Sbjct: 351 RGSTIVDSGTVITRLLPQAYNALKTSFRSKTRDLPSAKPFSILDTCYDLSRHSQVRIPTI 410
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEG-----VFCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
T HF A V + +P G F A Q +DG I GNF Q + + +D
Sbjct: 411 TFHFQNNADVAVSDVGILVPVQNGGSQVCLAFASASQ-MDG-FNIIGNFQQQRMRVAFDT 468
Query: 362 DSQMVSFKPTDC 373
+ + F C
Sbjct: 469 GAGRIGFASGSC 480
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 130/382 (34%), Positives = 182/382 (47%), Gaps = 41/382 (10%)
Query: 20 ANG----EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
ANG EY++ +IGTPP + I+DTGSDL+W QC PC C+ + +P++SS++
Sbjct: 407 ANGVPDTEYLVHLAIGTPPQ-PVQLILDTGSDLVWTQCRPCPVCFSRALGPLDPSNSSTF 465
Query: 76 KELSCQSEQCHLLDTVSCSSQ----QLCNYTYGYADSSLTKGVLATERITF----GNSNN 127
L C S C L SC Q C Y Y YAD S+T G L E TF G
Sbjct: 466 DVLPCSSPVCDNLTWSSCGKHNWGNQTCVYVYAYADGSITTGHLDAETFTFAAADGTGQA 525
Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
++ FGCG N G+F NE G+ G GR LSL SQL + FS+C S
Sbjct: 526 TVPDLAFGCGLFNNGIFTSNETGIAGFGRGALSLP----SQLKVDNFSHCFTAI--TGSE 579
Query: 188 TSKMYFGNGSEV---SGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
S + G + + + G V ST LV Y+++L+GI+VG S +P S+
Sbjct: 580 PSSVLLGLPANLYSDADGAVQSTPLVQNFSSLRAYYLSLKGITVG-----STRLPIPEST 634
Query: 244 GAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG--SQLC--YKT 296
A+ + G ID+G T LP+D Y + + ++L P + S+LC +
Sbjct: 635 FALKQDGTGGTIIDSGTGMTTLPQDAYKLVHDAFTAQVRL-PVDNATSSSLSRLCFSFSV 693
Query: 297 PSMAG-IAPILTAHFDGGAKVPLIHTSTFIPPPVEG--VFCFAMQPIDGDVGIFGNFAQS 353
P A P L HF+ GA + L + G V C A+ D D+ I GN+ Q
Sbjct: 694 PRRAKPDVPKLVLHFE-GATLDLPRENYMFEFEDAGGSVTCLAINAGD-DLTIIGNYQQQ 751
Query: 354 DLFIGYDFDSQMVSFKPTDCTK 375
+L + YD M+SF P C +
Sbjct: 752 NLHVLYDLVRNMLSFVPAQCNR 773
>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 453
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 127/382 (33%), Positives = 183/382 (47%), Gaps = 49/382 (12%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSC 80
GEY+M +IGTPP I DTGSDL+W QC PC +C+KQ P+YNP+SS +++ L C
Sbjct: 90 GEYIMTLAIGTPPQ-SYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPC 148
Query: 81 QSE--QCHLLDTVSCSSQQ---LCNYTYGYADSSLTKGVLATERITFGNS---NNFFDNV 132
S C ++ ++ C Y Y + T G+ +E TFG+S +
Sbjct: 149 SSALNLCAAEARLAGATPPPGCACRYNQTYG-TGWTSGLQGSETFTFGSSPADQVRVPGI 207
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGC + ++ +N + + ++SQL A FSYCL PF D+ S +
Sbjct: 208 AFGCSNASSDDWNGSAGLV-----GLGRGGLSLVSQLAAGMFSYCLTPFQ-DTKSKSTLL 261
Query: 193 FG---NGSEVSGGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
G + ++G GV ST V K TYY++ L GISVG + + P + A
Sbjct: 262 LGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVG--PAALPIPPGAFALRA 319
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPR--LGSQLCYKTPSMA--- 300
G + ID+G T L Y R+ VR+ +KL P D G LC+ PS +
Sbjct: 320 DGTGGLIIDSGTTITSLVDAAYKRVRAAVRSLVKL-PVTDGSNATGLDLCFALPSSSAPP 378
Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVE-------GVFCFAMQP-IDGDVGIFGNFAQ 352
P +T HF GGA + L PVE G++C AM+ DG++ GN+ Q
Sbjct: 379 ATLPSMTLHFGGGADMVL---------PVENYMILDGGMWCLAMRSQTDGELSTLGNYQQ 429
Query: 353 SDLFIGYDFDSQMVSFKPTDCT 374
+L I YD + +SF P C+
Sbjct: 430 QNLHILYDVQKETLSFAPAKCS 451
>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
Length = 458
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 127/382 (33%), Positives = 183/382 (47%), Gaps = 49/382 (12%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSC 80
GEY+M +IGTPP I DTGSDL+W QC PC +C+KQ P+YNP+SS +++ L C
Sbjct: 95 GEYIMTLAIGTPPQ-SYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPC 153
Query: 81 QSE--QCHLLDTVSCSSQQ---LCNYTYGYADSSLTKGVLATERITFGNS---NNFFDNV 132
S C ++ ++ C Y Y + T G+ +E TFG+S +
Sbjct: 154 SSALNLCAAEARLAGATPPPGCACRYNQTYG-TGWTSGLQGSETFTFGSSPADQVRVPGI 212
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGC + ++ +N + + ++SQL A FSYCL PF D+ S +
Sbjct: 213 AFGCSNASSDDWNGSAGLV-----GLGRGGLSLVSQLAAGMFSYCLTPFQ-DTKSKSTLL 266
Query: 193 FG---NGSEVSGGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
G + ++G GV ST V K TYY++ L GISVG + + P + A
Sbjct: 267 LGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVG--PAALPIPPGAFALRA 324
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPR--LGSQLCYKTPSMA--- 300
G + ID+G T L Y R+ VR+ +KL P D G LC+ PS +
Sbjct: 325 DGTGGLIIDSGTTITSLVDAAYKRVRAAVRSLVKL-PVTDGSNATGLDLCFALPSSSAPP 383
Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVE-------GVFCFAMQP-IDGDVGIFGNFAQ 352
P +T HF GGA + L PVE G++C AM+ DG++ GN+ Q
Sbjct: 384 ATLPSMTLHFGGGADMVL---------PVENYMILDGGMWCLAMRSQTDGELSTLGNYQQ 434
Query: 353 SDLFIGYDFDSQMVSFKPTDCT 374
+L I YD + +SF P C+
Sbjct: 435 QNLHILYDVQKETLSFAPAKCS 456
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 122/365 (33%), Positives = 174/365 (47%), Gaps = 22/365 (6%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQ-CYKQVKPIYNPASS 72
S + G YV+ +GTP Y +V DTGSD WVQC PCV CY+Q + +++PA S
Sbjct: 170 SGRALGTGNYVVTVGLGTP--ASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARS 227
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+Y +SC + C LDT CS C Y Y D S + G A + +T +S +
Sbjct: 228 STYANVSCAAPACFDLDTRGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTL-SSYDAVKGF 285
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGCG N G+F E GL+GLGR + SL Q + G F++CL SS T +
Sbjct: 286 RFGCGERNEGLFGE-AAGLLGLGRGKTSLPVQTYDKYG-GVFAHCLP---ARSSGTGYLD 340
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
FG GS + G ++T +++ T+Y+V + GI VG +L+ S +
Sbjct: 341 FGPGSPAAAGARLTTPMLTDNGPTFYYVGMTGIRVGG-----QLLSIPQS--VFATAGTI 393
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAH 309
+D+G T LP Y+ L +A+ Y+ S L CY M+ +A P ++
Sbjct: 394 VDSGTVITRLPPPAYSSLRSAFVSAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLL 453
Query: 310 FDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
F GGA + + + V V FA GDVGI GN + YD ++V F
Sbjct: 454 FQGGAILDVDASGIMYAASVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 513
Query: 369 KPTDC 373
P C
Sbjct: 514 SPGAC 518
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 118/374 (31%), Positives = 181/374 (48%), Gaps = 31/374 (8%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
+GEY+ K ++GTP + + + DT SDL W+QC PC +CY Q P+++P S+SY E++
Sbjct: 131 SGEYMAKIAVGTPAVQALLAL-DTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNY 189
Query: 81 QSEQCHLLDTVSC--SSQQLCNYTYGYAD----SSLTKGVLATERITF-GNSNNFFDNVV 133
+ C L + + C YT Y D +S + G L E +TF G + ++
Sbjct: 190 DAPDCQALGRSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGGVRQAYLSI- 248
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTD-SSITSKM 191
GCGH+N G+F G++GLGR ++S+ QI + LG N FSYCLV F + S +S +
Sbjct: 249 -GCGHDNKGLFGAPAAGILGLGRGQISIPHQI-AFLGYNASFSYCLVDFISGPGSPSSTL 306
Query: 192 YFGNGS-EVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNL------SNSSKLIPYYNSSG 244
FG G+ + S + +++++ T+Y+V L G+SVG + +L PY
Sbjct: 307 TFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPY----- 361
Query: 245 AISKGNMFIDTGAPPTLLPKDFY---NRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
+G + +D+G T L + Y ++ P CY AG
Sbjct: 362 -TGRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGLFDTCYTVGGRAG 420
Query: 302 I-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQSDLFIGY 359
+ P ++ HF GG +V L + IP G CFA D V + GN Q + Y
Sbjct: 421 VKVPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAGTGDRSVSVIGNILQQGFRVVY 480
Query: 360 DFDSQMVSFKPTDC 373
D Q V F P +C
Sbjct: 481 DLAGQRVGFAPNNC 494
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 114/366 (31%), Positives = 171/366 (46%), Gaps = 20/366 (5%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S + +GEY ++ +G+PP + Y ++D+GSD++WVQC PC +CY+Q P+++PA S
Sbjct: 132 VISGMEAGSGEYFVRIGVGSPPR-NQYMVIDSGSDIVWVQCKPCSRCYQQSDPVFDPADS 190
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
SS+ +SC S+ C L+ C++ + C Y Y D S TKG LA E +T G +V
Sbjct: 191 SSFAGVSCGSDVCDRLENTGCNAGR-CRYEVSYGDGSYTKGTLALETLTVGQV--MIRDV 247
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH N G+F L G + +S Q+ Q G FSYCLV T S T +
Sbjct: 248 AIGCGHTNQGMFIGAAGLLGLGGGS-MSFIGQLGGQTGG-AFSYCLVSRGTGS--TGALE 303
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNS----SKLIPYYNSSGAISK 248
FG G+ G +S + + ++Y++ L GI VG + S + + Y ++G +
Sbjct: 304 FGRGALPVGATWISL-IRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTNGVV-- 360
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILT 307
+DTG T P Y + P CY + P ++
Sbjct: 361 ----MDTGTAVTRFPTAAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVPTVS 416
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
+F G + L + IP G FC A P + I GN Q + I +D + V
Sbjct: 417 FYFSDGPVLTLPARNFLIPVDGGGTFCLAFAPSPSGLSIIGNIQQEGIQISFDGANGFVG 476
Query: 368 FKPTDC 373
F P C
Sbjct: 477 FGPNIC 482
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 124/372 (33%), Positives = 174/372 (46%), Gaps = 30/372 (8%)
Query: 12 VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ---CYKQVKPIYN 68
VV + EY+ + +G P L Y + DTGSD+ W+QC PC CYKQ PI++
Sbjct: 136 VVSGQSKGSGAEYLAQIGVGQPVKL-FYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFD 194
Query: 69 PASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
P SSSSY LSC S+QC LLD +C+S C Y Y D S T G LATE ++FGNSN+
Sbjct: 195 PKSSSSYSPLSCNSQQCKLLDKANCNSDT-CIYQVHYGDGSFTTGELATETLSFGNSNS- 252
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
N+ GCGH+N G+F A + SQL A+ FSYCLV +DSS T
Sbjct: 253 IPNLPIGCGHDNEGLFAGGAG-----LIGLGGGAISLSSQLKASSFSYCLVNLDSDSSST 307
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
+ S + + S + + +Y +V + GISVG K +P + I +
Sbjct: 308 LEF----NSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGG-----KTLPISPTRFEIDE 358
Query: 249 ---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGI 302
G + +D+G + LP D Y L E +KLT P G + CY + +
Sbjct: 359 SGLGGIIVDSGTIISRLPSDVYESLREAF---VKLTSSLSPAPGISVFDTCYNFSGQSNV 415
Query: 303 -APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
P + G + L + I G +C A + I G+F Q + + YD
Sbjct: 416 EVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDL 475
Query: 362 DSQMVSFKPTDC 373
+ +V F C
Sbjct: 476 TNSLVGFSTNKC 487
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 161 bits (408), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 119/372 (31%), Positives = 170/372 (45%), Gaps = 22/372 (5%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
S + +GEY + +GTP + ++DTGSD++W+QC PC CY Q +++P S S
Sbjct: 119 SGLPQGSGEYFAQVGVGTPATTALM-VLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRS 177
Query: 75 YKELSCQSEQCHLLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
Y + C + C LD+ C ++ C Y Y D S+T G A+E +TF V
Sbjct: 178 YAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGAR-VQRVA 236
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV----PFHTDSSITS 189
GCGH+N G+F L RLS SQI G FSYCLV S+ +S
Sbjct: 237 IGCGHDNEGLFIAASGLLGLGR-GRLSFPSQIARSFG-RSFSYCLVDRTSSVRPSSTRSS 294
Query: 190 KMYFGNGSEVSGGGVVSTSL-VSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
+ FG G+ + G T + + T+Y+V L G SVG + +
Sbjct: 295 TVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGR 354
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQL---CYKTPSMAGI 302
G + +D+G T L + Y + + R A ++++P G L CY +
Sbjct: 355 GGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPG-----GFSLFDTCYNLSGRRVV 409
Query: 303 -APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
P ++ H GGA V L + IP G FCFAM DG V I GN Q + +D
Sbjct: 410 KVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDG 469
Query: 362 DSQMVSFKPTDC 373
D+Q V F P C
Sbjct: 470 DAQRVGFVPKSC 481
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 161 bits (408), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 135/374 (36%), Positives = 185/374 (49%), Gaps = 36/374 (9%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPAS 71
VQS S +G+Y + +GTP + I DTGSDL W QC PC + CYKQ +P +P
Sbjct: 122 VQSGASIGSGDYAVTVGLGTPKK-EFTLIFDTGSDLTWTQCEPCAKTCYKQKEPRLDPTK 180
Query: 72 SSSYKELSCQSEQCHLLDT---VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
S+SYK +SC S C LLDT SCSS C Y Y D S + G ATE +T +S+N
Sbjct: 181 STSYKNISCSSAFCKLLDTEGGESCSSPT-CLYQVQYGDGSYSIGFFATETLTL-SSSNV 238
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
F N +FGCG N+G+F GL+GLGRT+LSL SQ +Q FSYCL +S +
Sbjct: 239 FKNFLFGCGQQNSGLF-RGAAGLLGLGRTKLSLPSQT-AQKYKKLFSYCL-----PASSS 291
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKT--YYFVTLEGISVG--NLSNSSKLIPYYNSSG 244
SK Y G +VS V + +S++ K+ +Y + + +SVG LS + + +++SG
Sbjct: 292 SKGYLSFGGQVS--KTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASI---FSTSG 346
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA- 303
+ ID+G T LP Y+ L + + P D CY I
Sbjct: 347 TV------IDSGTVITRLPSTAYSALSSAFQKLMTDYPSTDGYSIFDTCYDFSKNETIKI 400
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGV----FCFAMQPIDGDVGIFGNFAQSDLFIGY 359
P + F GG ++ + + I PV G+ FA D IFGN Q + Y
Sbjct: 401 PKVGVSFKGGVEMDIDVSG--ILYPVNGLKKVCLAFAGNGDDVKAAIFGNTQQKTYQVVY 458
Query: 360 DFDSQMVSFKPTDC 373
D V F P+ C
Sbjct: 459 DDAKGRVGFAPSGC 472
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 119/372 (31%), Positives = 170/372 (45%), Gaps = 22/372 (5%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
S + +GEY + +GTP + ++DTGSD++W+QC PC CY Q +++P S S
Sbjct: 113 SGLPQGSGEYFAQVGVGTPATTALM-VLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRS 171
Query: 75 YKELSCQSEQCHLLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
Y + C + C LD+ C ++ C Y Y D S+T G A+E +TF V
Sbjct: 172 YAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGAR-VQRVA 230
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV----PFHTDSSITS 189
GCGH+N G+F L RLS SQI G FSYCLV S+ +S
Sbjct: 231 IGCGHDNEGLFIAASGLLGLGR-GRLSFPSQIARSFG-RSFSYCLVDRTSSVRPSSTRSS 288
Query: 190 KMYFGNGSEVSGGGVVSTSL-VSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
+ FG G+ + G T + + T+Y+V L G SVG + +
Sbjct: 289 TVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGR 348
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQL---CYKTPSMAGI 302
G + +D+G T L + Y + + R A ++++P G L CY +
Sbjct: 349 GGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPG-----GFSLFDTCYNLSGRRVV 403
Query: 303 -APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
P ++ H GGA V L + IP G FCFAM DG V I GN Q + +D
Sbjct: 404 KVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDG 463
Query: 362 DSQMVSFKPTDC 373
D+Q V F P C
Sbjct: 464 DAQRVGFVPKSC 475
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 167/366 (45%), Gaps = 20/366 (5%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S + +GEY ++ +G+PP + Y ++D+GSD++WVQC PC QCY Q P++NPA S
Sbjct: 125 VVSGMEQGSGEYFVRIGVGSPPR-NQYVVMDSGSDIIWVQCEPCTQCYHQSDPVFNPADS 183
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
SS+ +SC S C +D +C + C Y Y D S TKG LA E ITFG + NV
Sbjct: 184 SSFSGVSCASTVCSHVDNAACHEGR-CRYEVSYGDGSYTKGTLALETITFGRT--LIRNV 240
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH+N G+F L+GLG +S Q+ Q G FSYCLV +SS +
Sbjct: 241 AIGCGHHNQGMFVGAAG-LLGLGGGPMSFVGQLGGQTGG-AFSYCLVSRGIESS--GLLE 296
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
FG + G V + YY G + S+ + + G G +
Sbjct: 297 FGREAMPVGAAWVPLIHNPRAQSFYYIGLSGLGVGGLRVSISEDVFKLSELG---DGGVV 353
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGI-APILT 307
+DTG T LP Y E R+ PR CY + P ++
Sbjct: 354 MDTGTAVTRLPTVAY----EAFRDGFIAQTTNLPRASGVSIFDTCYDLFGFVSVRVPTVS 409
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
+F GG + L + IP G FCFA P + I GN Q + I D + V
Sbjct: 410 FYFSGGPILTLPARNFLIPVDDVGTFCFAFAPSSSGLSIIGNIQQEGIQISVDGANGFVG 469
Query: 368 FKPTDC 373
F P C
Sbjct: 470 FGPNVC 475
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 160 bits (405), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 115/356 (32%), Positives = 171/356 (48%), Gaps = 19/356 (5%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
Y +GTP D+ +DTGSD W+QC PC CY+Q + +++P+ SS+Y +++C S
Sbjct: 134 YFTSLRLGTP-ATDLLVELDTGSDQSWIQCKPCPDCYEQHEALFDPSKSSTYSDITCSSR 192
Query: 84 QCHLLDTV---SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
+C L + +CSS + C Y YAD S T G LA + +T + + VFGCGHNN
Sbjct: 193 ECQELGSSHKHNCSSDKKCPYEITYADDSYTVGNLARDTLTL-SPTDAVPGFVFGCGHNN 251
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
G F E + GL+GLGR + SL+SQ+ ++ GA FSYCL + S T + F + +
Sbjct: 252 AGSFGEID-GLLGLGRGKASLSSQVAARYGAG-FSYCLP---SSPSATGYLSFSGAAAAA 306
Query: 201 GGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPT 260
T +V+ + ++Y++ L GI+V + K+ P ++ A ID+G +
Sbjct: 307 PTNAQFTEMVAGQHPSFYYLNLTGITVAG--RAIKVPPSVFATAA----GTIIDSGTAFS 360
Query: 261 LLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLI 319
LP Y L VR+A+ CY + P + F GA V L
Sbjct: 361 CLPPSAYAALRSSVRSAMGRYKRAPSSTIFDTCYDLTGHETVRIPSVALVFADGATVHLH 420
Query: 320 HTSTFIPPPVEGVFCFAM--QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ C A P D +G+ GN Q L + YD D+Q V F C
Sbjct: 421 PSGVLYTWSNVSQTCLAFLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGFGANGC 476
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 160 bits (405), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 121/382 (31%), Positives = 179/382 (46%), Gaps = 35/382 (9%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S + +G+Y + F +GTPP IVD+GSDL+WVQC PC+QCY Q P+Y P++S
Sbjct: 54 VVSGSTLGSGQYFVDFFLGTPPQ-KFSLIVDSGSDLLWVQCAPCLQCYAQDTPLYAPSNS 112
Query: 73 SSYKELSCQSEQCHLLDTVS---CSSQQ--LCNYTYGYADSSLTKGVLATERITFGNSNN 127
S++ + C S +C L+ C C Y Y YAD+SL+KGV A E T +
Sbjct: 113 STFNPVPCLSPECLLIPATEGFPCDFHYPGACAYEYRYADTSLSKGVFAYESATVDDVR- 171
Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
D V FGCG +N G F G++GLG+ LS SQ+ G NKF+YCLV + +S+
Sbjct: 172 -IDKVAFGCGRDNQGSF-AAAGGVLGLGQGPLSFGSQVGYAYG-NKFAYCLVNYLDPTSV 228
Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVGN----LSNSSKLIPYYNS 242
+S + FG+ + + T +VS + T Y+V +E + VG +S+S+ + + +
Sbjct: 229 SSWLIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLGN 288
Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT-----P 297
G+I + PP ++ ++ VR P G LC P
Sbjct: 289 GGSIFDSGTTVTYWLPPAY--RNILAAFDKNVR-----YPRAASVQGLDLCVDVTGVDQP 341
Query: 298 SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIF---GNFAQSD 354
S +L GG V + V C AM + VG F GN Q +
Sbjct: 342 SFPSFTIVL-----GGGAVFQPQQGNYFVDVAPNVQCLAMAGLPSSVGGFNTIGNLLQQN 396
Query: 355 LFIGYDFDSQMVSFKPTDCTKQ 376
+ YD + + F P C+
Sbjct: 397 FLVQYDREENRIGFAPAKCSSH 418
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 118/372 (31%), Positives = 170/372 (45%), Gaps = 22/372 (5%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
S + +GEY + +GTP + ++DTGSD++W+QC PC CY Q +++P S S
Sbjct: 113 SGLPQGSGEYFAQVGVGTPATTALM-VLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRS 171
Query: 75 YKELSCQSEQCHLLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
Y + C + C LD+ C ++ C Y Y D S+T G A+E +TF V
Sbjct: 172 YAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGAR-VQRVA 230
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV----PFHTDSSITS 189
GCGH+N G+F L RLS +QI G FSYCLV S+ +S
Sbjct: 231 IGCGHDNEGLFIAASGLLGLGR-GRLSFPTQIARSFG-RSFSYCLVDRTSSVRPSSTRSS 288
Query: 190 KMYFGNGSEVSGGGVVSTSL-VSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
+ FG G+ + G T + + T+Y+V L G SVG + +
Sbjct: 289 TVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGR 348
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQL---CYKTPSMAGI 302
G + +D+G T L + Y + + R A ++++P G L CY +
Sbjct: 349 GGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPG-----GFSLFDTCYNLSGRRVV 403
Query: 303 -APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
P ++ H GGA V L + IP G FCFAM DG V I GN Q + +D
Sbjct: 404 KVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDG 463
Query: 362 DSQMVSFKPTDC 373
D+Q V F P C
Sbjct: 464 DAQRVGFVPKSC 475
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 124/372 (33%), Positives = 174/372 (46%), Gaps = 30/372 (8%)
Query: 12 VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ---CYKQVKPIYN 68
VV + EY+ + +G P L Y + DTGSD+ W+QC PC CYKQ PI++
Sbjct: 136 VVSGQSKGSGAEYLAQIGVGQPVKL-FYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFD 194
Query: 69 PASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
P SSSSY LSC S+QC LLD +C+S C Y Y D S T G LATE ++FGNSN+
Sbjct: 195 PKSSSSYSPLSCNSQQCKLLDKANCNSDT-CIYQVHYGDGSFTTGELATETLSFGNSNS- 252
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
N+ GCGH+N G+F A + SQL A+ FSYCLV +DSS T
Sbjct: 253 IPNLPIGCGHDNEGLFAGGAG-----LIGLGGGAISLSSQLKASSFSYCLVNLDSDSSST 307
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
+ S + + S + + +Y +V + GISVG K +P + I +
Sbjct: 308 LEF----NSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGG-----KTLPISPTRFEIDE 358
Query: 249 ---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGI 302
G + +D+G + LP D Y L E +KLT P G + CY + +
Sbjct: 359 SGLGGIIVDSGTIISRLPSDVYESLREAF---VKLTSSLSPAPGISVFDTCYNFSGQSNV 415
Query: 303 -APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
P + G + L + I G +C A + I G+F Q + + YD
Sbjct: 416 EVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDL 475
Query: 362 DSQMVSFKPTDC 373
+ +V F C
Sbjct: 476 TNSIVGFSTNKC 487
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 118/381 (30%), Positives = 177/381 (46%), Gaps = 44/381 (11%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
GEY++K GTP +DT SDL+W+QC PCV CY+Q+ P++NP SSSY + C
Sbjct: 90 GEYLVKLGTGTPQHF-FSAAIDTASDLVWMQCQPCVSCYRQLDPVFNPKLSSSYAVVPCT 148
Query: 82 SEQCHLLDTVSCSSQQ--LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
S+ C LD C C YTY Y+ +TKG LA +++ G + F VVFGC +
Sbjct: 149 SDTCAQLDGHRCHEDDDGACQYTYKYSGHGVTKGTLAIDKLAIG--GDVFHAVVFGCSDS 206
Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
+ G GLVGLGR LSL +SQL ++F YCL P + +S K+ G G++
Sbjct: 207 SVGGPAAQASGLVGLGRGPLSL----VSQLSVHRFMYCLPPPMSRTS--GKLVLGAGADA 260
Query: 200 ---SGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG------- 249
V T S +YY++ L+G++VG+ + + SG G
Sbjct: 261 VRNMSDRVTVTMSSSTRYPSYYYLNLDGLAVGDQTPGTTRNATSPPSGGAGGGGGGGGGG 320
Query: 250 ----------NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP--RLGSQLCYKTP 297
M +D + + L Y+ L + + I+L P P RLG LC+ P
Sbjct: 321 IVGAGGANAYGMIVDVASTISFLETSLYDELADDLEEEIRL-PRATPSLRLGLDLCFILP 379
Query: 298 SMAGI----APILTAHFDGGAKVPLIHTSTFIPPPVEG-VFCFAMQPIDGDVGIFGNFAQ 352
G+ P ++ FD G + L F+ +G + C + G V I GNF
Sbjct: 380 EGVGMDRVYVPTVSLSFD-GRWLELDRDRLFV---TDGRMMCLMIGRTSG-VSILGNFQL 434
Query: 353 SDLFIGYDFDSQMVSFKPTDC 373
++ + ++ ++F C
Sbjct: 435 QNMRVLFNLRRGKITFAKASC 455
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 106/298 (35%), Positives = 149/298 (50%), Gaps = 37/298 (12%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
A EY++ ++GTPP + +DTGSDL+W QC PC C+ Q P+ +PA+SS+Y L
Sbjct: 82 ATNEYLVHLAVGTPPR-PVALTLDTGSDLVWTQCAPCRDCFDQGIPLLDPAASSTYAALP 140
Query: 80 CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN--------FFDN 131
C + +C L SC + C Y Y Y D S+T G +AT+R TFG++
Sbjct: 141 CGAPRCRALPFTSCGGRS-CVYVYHYGDKSVTVGKIATDRFTFGDNGRRNGDGSLPATRR 199
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP-FHTDSSIT-- 188
+ FGCGH N GVF NE G+ G GR R SL SQL A FSYC F + SSI
Sbjct: 200 LTFGCGHFNKGVFQSNETGIAGFGRGRWSLP----SQLNATSFSYCFTSMFDSKSSIVTL 255
Query: 189 ----SKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKL-IPYYNS 242
+ +Y S G V +T L + + YF++L+GISVG ++L +P
Sbjct: 256 GGAPAALY----SHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGK----TRLPVPETKF 307
Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
I ID+GA T LP++ Y ++ + + L P +C+ P A
Sbjct: 308 RSTI------IDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDVCFALPVSA 359
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 123/380 (32%), Positives = 182/380 (47%), Gaps = 31/380 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S + +G+Y + F +GTPP IVD+GSDL+WVQC PC QCY Q P+Y P++S
Sbjct: 53 VVSGSTLGSGQYFVDFFLGTPPQ-KFSLIVDSGSDLLWVQCSPCRQCYAQDSPLYVPSNS 111
Query: 73 SSYKELSCQSEQCHLLDTVS---CSSQQ--LCNYTYGYADSSLTKGVLATERITFGNSNN 127
S++ + C S C L+ C + C Y Y YAD+S +KGV A E T
Sbjct: 112 STFSPVPCLSSDCLLIPATEGFPCDFRYPGACAYEYLYADTSSSKGVFAYESATVDGVR- 170
Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
D V FGCG +N G F G++GLG+ LS SQ+ G NKF+YCLV + +S+
Sbjct: 171 -IDKVAFGCGSDNQGSF-AAAGGVLGLGQGPLSFGSQVGYAYG-NKFAYCLVNYLDPTSV 227
Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
+S + FG+ + + T +VS + T Y+V +E ++VG K +P +S+ I
Sbjct: 228 SSSLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGG-----KSLPISDSAWEI 282
Query: 247 S---KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA 303
G D+G T Y+ + + + P + G LC + + G+
Sbjct: 283 DLLGNGGSIFDSGTTLTYWFPSAYSHILAAFDSGVHY-PRAESVQGLDLCVE---LTGVD 338
Query: 304 ----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIF---GNFAQSDLF 356
P T FD GA + F+ V C AM + +G F GN Q + F
Sbjct: 339 QPSFPSFTIEFDDGAVFQPEAENYFV-DVAPNVRCLAMAGLASPLGGFNTIGNLLQQNFF 397
Query: 357 IGYDFDSQMVSFKPTDCTKQ 376
+ YD + ++ F P C+
Sbjct: 398 VQYDREENLIGFAPAKCSSH 417
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 130/379 (34%), Positives = 184/379 (48%), Gaps = 31/379 (8%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
+ V S +S +GEY M+ +GTP ++Y ++DTGSD++W+QC PC CY Q I++P
Sbjct: 124 SGAVISGLSQGSGEYFMRLGVGTPAT-NVYMVLDTGSDVVWLQCSPCKACYNQSDVIFDP 182
Query: 70 ASSSSYKELSCQSEQCHLLDTVS-CSSQQ--LCNYTYGYADSSLTKGVLATERITFGNSN 126
S ++ + C S C LD S C +++ C Y Y D S T+G +TE +TF +
Sbjct: 183 KKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGAR 242
Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP---FHT 183
D+V GCGH+N G+F L+GLGR LS SQ S+ KFSYCLV +
Sbjct: 243 --VDHVPLGCGHDNEGLFVGAAG-LLGLGRGGLSFPSQTKSRYNG-KFSYCLVDRTSSGS 298
Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
S S + FGN + V V + L + + T+Y++ L GISVG +P + S
Sbjct: 299 SSKPPSTIVFGNDA-VPKTSVFTPLLTNPKLDTFYYLQLLGISVGG-----SRVPGVSES 352
Query: 244 ----GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYK 295
A G + ID+G T L + Y L R+A +L + R S C+
Sbjct: 353 QFKLDATGNGGVIIDSGTSVTRLTQSAYVAL----RDAFRLGATKLKRAPSYSLFDTCFD 408
Query: 296 TPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
M + P + HF GG +V L ++ IP EG FCFA G + I GN Q
Sbjct: 409 LSGMTTVKVPTVVFHF-GGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQG 467
Query: 355 LFIGYDFDSQMVSFKPTDC 373
+ YD V F C
Sbjct: 468 FRVAYDLVGSRVGFLSRAC 486
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 129/379 (34%), Positives = 184/379 (48%), Gaps = 31/379 (8%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
+ V S +S +GEY M+ +GTP ++Y ++DTGSD++W+QC PC CY Q I++P
Sbjct: 121 SGAVISGLSQGSGEYFMRLGVGTPAT-NVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDP 179
Query: 70 ASSSSYKELSCQSEQCHLLDTVS-CSSQQ--LCNYTYGYADSSLTKGVLATERITFGNSN 126
S ++ + C S C LD S C +++ C Y Y D S T+G +TE +TF +
Sbjct: 180 KKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGAR 239
Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP---FHT 183
D+V GCGH+N G+F L+GLGR LS SQ ++ KFSYCLV +
Sbjct: 240 --VDHVPLGCGHDNEGLFVGAAG-LLGLGRGGLSFPSQTKNRYNG-KFSYCLVDRTSSGS 295
Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
S S + FGN + V V + L + + T+Y++ L GISVG +P + S
Sbjct: 296 SSKPPSTIVFGNAA-VPKTSVFTPLLTNPKLDTFYYLQLLGISVGG-----SRVPGVSES 349
Query: 244 ----GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYK 295
A G + ID+G T L + Y L R+A +L + R S C+
Sbjct: 350 QFKLDATGNGGVIIDSGTSVTRLTQPAYVAL----RDAFRLGATKLKRAPSYSLFDTCFD 405
Query: 296 TPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
M + P + HF GG +V L ++ IP EG FCFA G + I GN Q
Sbjct: 406 LSGMTTVKVPTVVFHF-GGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQG 464
Query: 355 LFIGYDFDSQMVSFKPTDC 373
+ YD V F C
Sbjct: 465 FRVAYDLVGSRVGFLSRAC 483
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 131/369 (35%), Positives = 174/369 (47%), Gaps = 42/369 (11%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSY 75
+ TAN YV+ GTP I DTGS++ W+QC PCV CY Q +P+++P SS+Y
Sbjct: 11 IGTAN--YVITVGFGTPKKNQTV-IFDTGSNVNWIQCKPCVVSCYPQQEPLFDPTLSSTY 67
Query: 76 KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
+ +SC S C L + CS C Y Y D S T G LATE T + N F+N +FG
Sbjct: 68 RNISCTSAACTGLSSRGCSGST-CVYGVTYGDGSSTVGFLATETFTLA-AGNVFNNFIFG 125
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
CG NN G+F GL+GLGR+ SL SQ+ + LG N FSYCL + SS T + GN
Sbjct: 126 CGQNNQGLF-TGAAGLIGLGRSPYSLNSQLATSLG-NIFSYCL---PSTSSATGYLNIGN 180
Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAISKGNMFI 253
G + L + T YF+ L GISVG L+ SS + + S G I I
Sbjct: 181 PLRTPG---YTAMLTNSRAPTLYFIDLIGISVGGTRLALSSTV---FQSVGTI------I 228
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHF 310
D+G T LP Y L R A +T Y S L CY + P + H+
Sbjct: 229 DSGTVITRLPPTAYGALRTAFRAA--MTQYTRAAAASILDTCYDFSRTTTVTFPTIKLHY 286
Query: 311 DG------GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
G GA V + +S+ + FA +GI GN Q + + YD +
Sbjct: 287 TGLDVTIPGAGVFYVISSSQV------CLAFAGNSDSTQIGIIGNVQQRTMEVTYDNALK 340
Query: 365 MVSFKPTDC 373
+ F C
Sbjct: 341 RIGFAAGAC 349
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 120/359 (33%), Positives = 170/359 (47%), Gaps = 24/359 (6%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSSSYKEL 78
G YV+ +GTP Y +V DTGSD WVQC PCV CY+Q + +++PA SS+Y +
Sbjct: 176 TGNYVVTVGLGTP--ASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANV 233
Query: 79 SCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
SC + C LDT CS C Y Y D S + G A + +T +S + FGCG
Sbjct: 234 SCAAPACSDLDTRGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTL-SSYDAVKGFRFGCGE 291
Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
N G+F E GL+GLGR + SL Q + G F++CL S+ T + FG GS
Sbjct: 292 RNEGLFGE-AAGLLGLGRGKTSLPVQTYDKYG-GVFAHCLP---ARSTGTGYLDFGAGSP 346
Query: 199 VSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
+ + +T ++ T+Y+V L GI VG +L+ Y + +D+G
Sbjct: 347 AA--RLTTTPMLVDNGPTFYYVGLTGIRVGG-----RLL--YIPQSVFATAGTIVDSGTV 397
Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHFDGGAK 315
T LP Y+ L A+ Y+ S L CY M+ +A P ++ F GGA+
Sbjct: 398 ITRLPPAAYSSLRSAFAAAMSARGYKKAPAVSLLDTCYDFAGMSQVAIPTVSLLFQGGAR 457
Query: 316 VPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ + + V FA GDVGI GN + YD ++VSF P C
Sbjct: 458 LDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVSFSPGAC 516
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 121/360 (33%), Positives = 177/360 (49%), Gaps = 28/360 (7%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSC 80
G YV + +GTP + +VDTGS L W+QC PC V C++Q P++NP SSS+Y + C
Sbjct: 120 GNYVTRMGLGTPATQYVM-VVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGC 178
Query: 81 QSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
++QC L+ +CSS +C Y Y DSS + G L+ + ++FG+++ N +G
Sbjct: 179 SAQQCSDLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS--LPNFYYG 236
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
CG +N G+F + GL+GL R +LSL Q+ LG + F+YCL + SS + N
Sbjct: 237 CGQDNEGLFGRSA-GLIGLARNKLSLLYQLAPSLGYS-FTYCLP--SSSSSGYLSLGSYN 292
Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGNMFID 254
+ S +VS+SL D + YF+ L G++V GN P SS A S ID
Sbjct: 293 PGQYSYTPMVSSSL----DDSLYFIKLSGMTVAGN--------PLSVSSSAYSSLPTIID 340
Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGA 314
+G T LP Y+ L + V A+K T C+K + AP +T F GGA
Sbjct: 341 SGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSILDTCFKGQASRVSAPAVTMSFAGGA 400
Query: 315 KVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ L + + + C A P I GN Q + YD S + F C+
Sbjct: 401 ALKLSAQNLLVDVD-DSTTCLAFAPAR-SAAIIGNTQQQTFSVVYDVKSSRIGFAAGGCS 458
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 114/345 (33%), Positives = 160/345 (46%), Gaps = 25/345 (7%)
Query: 41 IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDT------VSCS 94
IVDTGSDL WVQC PC +CY Q P++NP+ S SY+ + C S C L V S
Sbjct: 80 IVDTGSDLSWVQCQPCNRCYNQQDPVFNPSKSPSYRTVLCNSLTCRSLQLATGNSGVCGS 139
Query: 95 SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGL 154
+ CNY Y D S T G + E + GN+ +N +FGCG N G+F GLVGL
Sbjct: 140 NPPTCNYVVNYGDGSYTSGEVGMEHLNLGNTT--VNNFIFGCGRKNQGLFG-GASGLVGL 196
Query: 155 GRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKE 213
GRT LSL SQI G FSYCL ++S S + GN S +S T ++
Sbjct: 197 GRTDLSLISQISPMFGG-VFSYCLPTTEAEAS-GSLVMGGNSSVYKNTTPISYTRMIHNP 254
Query: 214 DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQ 273
+YF+ L GI+VG + + + K M ID+G + LP Y L+ +
Sbjct: 255 LLPFYFLNLTGITVGGVEVQAP---------SFGKDRMIIDSGTVISRLPPSIYQALKAE 305
Query: 274 VRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGV 332
P + C+ + P + +F+G A++ + T F +
Sbjct: 306 FVKQFSGYPSAPSFMILDSCFNLSGYQEVKIPDIKMYFEGSAELNVDVTGVFYSVKTDAS 365
Query: 333 -FCFAMQ--PIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
C A+ P + +VGI GN+ Q + I YD M+ F C+
Sbjct: 366 QVCLAIASLPYEDEVGIIGNYQQKNQRIIYDTKGSMLGFAEEACS 410
>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 424
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 126/367 (34%), Positives = 187/367 (50%), Gaps = 33/367 (8%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPI---YNPASSSSYKE 77
GEY+M F+IG P + G +DT + L+WVQC C QC + + + + + S +Y+
Sbjct: 73 GEYLMSFNIGNPSS-QVMGFLDTSNGLIWVQCSNCNSQCEPEKRGLTTKFLSSKSFTYEM 131
Query: 78 LSCQSEQCHLLDTV-SC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV--- 132
C S C+ L +C SS + C Y Y D+ T G+L+++ F S+ +V
Sbjct: 132 EPCGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFDTSDGMLVDVGFL 191
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGC ++ G VGL +T LSL +SQLG KFSYCLVPF+ S TSKMY
Sbjct: 192 NFGCSEAPLTGDEQSYTGNVGLNQTPLSL----ISQLGIKKFSYCLVPFNNLGS-TSKMY 246
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI--SKGN 250
FG+ SGG T L+ YY V + GIS+GN P+++ + +
Sbjct: 247 FGSLPVTSGG---QTPLLYPNSDAYY-VKVLGISIGNDE------PHFDGVFDVYEVRDG 296
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ--DPRLGSQLCYKTPSMAGIA--PIL 306
IDTG + L D ++ L + +K P + DP+ +LC++ + + P +
Sbjct: 297 WIIDTGITYSSLETDAFDSLLAKFL-TLKDFPQRKDDPKERFELCFELQNANDLESFPDV 355
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
T HFDG A + L STF+ +G+FC A+ V I GNF + +GYD ++Q++
Sbjct: 356 TVHFDG-ADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVI 414
Query: 367 SFKPTDC 373
SF P DC
Sbjct: 415 SFAPVDC 421
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 119/367 (32%), Positives = 165/367 (44%), Gaps = 24/367 (6%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
Q +S G YV+ +GTP Y ++ DTGSDL WVQC PC CY+Q P+++P+
Sbjct: 138 AQRGISLGTGNYVVSVGLGTP--AKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSL 195
Query: 72 SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
SS+Y ++C + +C LD CSS C Y Y D S T G L + +T S+
Sbjct: 196 SSTYAAVACGAPECQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASDT-LPG 254
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
VFGCG N G+F + + GL GLGR ++SL SQ G F+YCL SS + +
Sbjct: 255 FVFGCGDQNAGLFGQVD-GLFGLGREKVSLPSQGAPSYGPG-FTYCL-----PSSSSGRG 307
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
Y G T+L ++Y++ L GI VG + IP + + G
Sbjct: 308 YLSLGGAPPANAQF-TALADGATPSFYYIDLVGIKVG---GRAIRIPATAFA---AAGGT 360
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYK-TPSMAGIAPILTA 308
ID+G T LP Y L A + Y+ S L CY T P +
Sbjct: 361 VIDSGTVITRLPPRAYAPLRAAF--ARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVEL 418
Query: 309 HFDGGAKVPLIHTST-FIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
F GGA V L T ++ + FA D + I GN Q + YD +Q +
Sbjct: 419 AFAGGATVSLDFTGVLYVSKVSQACLAFAPNADDSSIAILGNTQQKTFAVAYDVANQRIG 478
Query: 368 FKPTDCT 374
F C+
Sbjct: 479 FGAKGCS 485
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 119/362 (32%), Positives = 171/362 (47%), Gaps = 30/362 (8%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSC 80
G YV + +GTP I +VDTGS L W+QC PC V C++Q P+++P +SSSY +SC
Sbjct: 115 GNYVTRMGLGTPAKPYIM-VVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSC 173
Query: 81 QSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
S QC L T + CS +C Y Y DSS + G L+ + ++FG N N +G
Sbjct: 174 SSPQCDGLSTATLNPAVCSPSNVCIYQASYGDSSFSVGYLSKDTVSFG--ANSVPNFYYG 231
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
CG +N G+F + GL+GL R +LSL Q+ LG + FSYCL S +S Y
Sbjct: 232 CGQDNEGLFGRSA-GLMGLARNKLSLLYQLAPTLGYS-FSYCL------PSTSSSGYLSI 283
Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
GS GG + + + D + YF++L G++V P SS + ID+
Sbjct: 284 GSYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGK-------PLAVSSSEYTSLPTIIDS 336
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYK-TPSMAGIAPILTAHFDGG 313
G T LP Y L + V A+K + + C++ S P ++ F GG
Sbjct: 337 GTVITRLPTSVYTALSKAVAAAMKGSTKRAAAYSILDTCFEGQASKLRAVPAVSMAFSGG 396
Query: 314 AKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
A + L + + V+G C A P I GN Q + YD S + F
Sbjct: 397 ATLKLSAGNLLV--DVDGATTCLAFAPAR-SAAIIGNTQQQTFSVVYDVKSNRIGFAAAG 453
Query: 373 CT 374
C+
Sbjct: 454 CS 455
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 157 bits (397), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 119/381 (31%), Positives = 181/381 (47%), Gaps = 33/381 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V+S V +GEY++ +GTPP I+DTGSDL W+QC PC+ C++Q PI++PA+S
Sbjct: 138 VESGVPVGSGEYLVDVYLGTPPR-RFRMIMDTGSDLNWLQCAPCLDCFEQSGPIFDPAAS 196
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCN--------YTYGYADSSLTKGVLATERITFG- 123
SY+ ++C ++C L+ + S+ + C Y Y Y D S T G LA E T
Sbjct: 197 ISYRNVTCGDDRCRLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNL 256
Query: 124 --NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
+ D V FGCGH N G+F+ L LS ASQ+ G + FSYCLV
Sbjct: 257 TQSGTRRVDGVAFGCGHRNRGLFHGAAGLLGLGR-GPLSFASQLRGVYGGHAFSYCLV-- 313
Query: 182 HTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPY 239
S+ SK+ FG+ + ++ T+ D T+Y++ L+ I VG + +
Sbjct: 314 EHGSAAGSKIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNI----- 368
Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKT 296
SS +S G ID+G + P+ Y + + + +++P LG + CY
Sbjct: 369 --SSDTLSAGGTIIDSGTTLSYFPEPAYQAIRQAFID--RMSPSYPLILGFPVLSPCYNV 424
Query: 297 PSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM--QPIDGDVGIFGNFAQS 353
+ P L+ F GA + FI EG+ C A+ P G + I GN+ Q
Sbjct: 425 SGAEKVEVPELSLVFADGAAWEFPAENYFIRLEPEGIMCLAVLGTPRSG-MSIIGNYQQQ 483
Query: 354 DLFIGYDFDSQMVSFKPTDCT 374
+ + YD + + F P C
Sbjct: 484 NFHVLYDLEHNRLGFAPRRCA 504
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 157 bits (397), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 120/377 (31%), Positives = 176/377 (46%), Gaps = 48/377 (12%)
Query: 14 QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
Q+ + G Y M S+GTP LL + DTGSDL+W QC PC +C++Q P + PASSS
Sbjct: 76 QALLENGVGGYNMNISVGTP-LLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSS 134
Query: 74 SYKELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
++ +L C S C L +++ + C Y Y Y S T G LATE + G+++ F +V
Sbjct: 135 TFSKLPCTSSFCQFLPNSIRTCNATGCVYNYKYG-SGYTAGYLATETLKVGDAS--FPSV 191
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGC N GLG+ LG +FSYCL ++ S +
Sbjct: 192 AFGCSTEN------------GLGQL----------DLGVGRFSYCLR--SGSAAGASPIL 227
Query: 193 FGNGSEVSGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK-- 248
FG+ + ++ G V ST V+ +YY+V L GI+VG +P S+ ++
Sbjct: 228 FGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETD-----LPVTTSTFGFTQNG 282
Query: 249 --GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG---IA 303
G +D+G T L KD Y +++ + + G LC+K+ G
Sbjct: 283 LGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAV 342
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD--VGIFGNFAQSDLFIG 358
P L FDGGA+ + + +G V C M P GD + + GN Q D+ +
Sbjct: 343 PSLVLRFDGGAEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLL 402
Query: 359 YDFDSQMVSFKPTDCTK 375
YD D + SF P DC K
Sbjct: 403 YDLDGGIFSFAPADCAK 419
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 157 bits (396), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 113/360 (31%), Positives = 177/360 (49%), Gaps = 28/360 (7%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
++ SIG PP+ + ++DTGSDL W+ CLPC +CY Q P ++P+ SS+Y+ SC S
Sbjct: 78 FLANISIGNPPVPQLL-LIDTGSDLTWIHCLPC-KCYPQTIPFFHPSRSSTYRNASCVSA 135
Query: 84 QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF---DNVVFGCGHNN 140
+ C Y Y D S T+G+LA E++TF S++ N+VFGCG +N
Sbjct: 136 PHAMPQIFRDEKTGNCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNIVFGCGQDN 195
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
+G + G++GLG S+ ++ +KFSYC + + + GNG+++
Sbjct: 196 SGFTKYS--GVLGLGPGTFSIVTRNF----GSKFSYCFGSLTNPTYPHNILILGNGAKIE 249
Query: 201 GGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
G T L +D+ Y++ L+ IS G L Y S+G IDTG
Sbjct: 250 GD---PTPLQIFQDR--YYLDLQAISFGEKLLDIEPGTFQRYR-----SQGGTVIDTGCS 299
Query: 259 PTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQLCYKTPSMAGIA--PILTAHFDGGA 314
PT+L ++ Y L E++ + L +D + CY+ + P++T HF GGA
Sbjct: 300 PTILAREAYETLSEEIDFLLGEVLRRVKDWDQYTTPCYEGNLKLDLYGFPVVTFHFAGGA 359
Query: 315 KVPLIHTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
++ L S F+ FC AM D+ + G AQ + +GY+ + V F+ TDC
Sbjct: 360 ELALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 419
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 129/372 (34%), Positives = 182/372 (48%), Gaps = 25/372 (6%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
V+ A GEY+ +GTP + IVDTGSDL WVQC PC +CY Q ++ P +S+S+
Sbjct: 6 VAAARGEYLATVRLGTPERV-FSVIVDTGSDLTWVQCSPCGKCYSQNDALFLPNTSTSFT 64
Query: 77 ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN---NFFDNVV 133
+L+C S C+ L C +Q C Y Y Y D SLT G + IT N N
Sbjct: 65 KLACGSALCNGLPFPMC-NQTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQVPNFA 123
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
FGCGH+N G F + G++GLG+ LS SQ+ S KFSYCLV + + TS + F
Sbjct: 124 FGCGHDNEGSFAGAD-GILGLGQGPLSFHSQLKSVYNG-KFSYCLVDWLAPPTQTSPLLF 181
Query: 194 GNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVG-NLSNSSKLIPYYNSSGAISKGNM 251
G+ + V +++ TYY+V L GISVG NL N S + +S G G +
Sbjct: 182 GDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGG--AGTI 239
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-------AP 304
F D+G T L + Y ++V A+ + R + ++G P
Sbjct: 240 F-DSGTTVTQLAEAAY----KEVLAAMNASTMAYSRKIDDISRLDLCLSGFPKDQLPTVP 294
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
+T HF+GG V L ++ FI +CFAM DV I G+ Q + + YD +
Sbjct: 295 AMTFHFEGGDMV-LPPSNYFIYLESSQSYCFAMTS-SPDVNIIGSVQQQNFQVYYDTAGR 352
Query: 365 MVSFKPTDCTKQ 376
+ F P DC +
Sbjct: 353 KLGFVPKDCVGR 364
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 120/361 (33%), Positives = 172/361 (47%), Gaps = 15/361 (4%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
+S +GEY + +GTPP +Y ++DTGSD++W+QC PC +CY Q I++P+ S S+
Sbjct: 123 LSQGSGEYFTRLGVGTPPKY-LYMVLDTGSDVVWLQCKPCTKCYSQTDQIFDPSKSKSFA 181
Query: 77 ELSCQSEQCHLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
+ C S C LD+ CS LC Y Y D S T G +TE +TF + V G
Sbjct: 182 GIPCYSPLCRRLDSPGCSLKNNLCQYQVSYGDGSFTFGDFSTETLTFRRAA--VPRVAIG 239
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
CGH+N G+F L+GLGR LS +Q ++ NKFSYCL T S+ S + FG+
Sbjct: 240 CGHDNEGLFVGAAG-LLGLGRGGLSFPTQTGTRFN-NKFSYCLTD-RTASAKPSSIVFGD 296
Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY--NSSGAISKGNMFI 253
S VS + + + + T+Y+V L GISVG ++ +S+G G + I
Sbjct: 297 -SAVSRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTG---NGGVII 352
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDG 312
D+G T L + Y L + R CY ++ + P + HF
Sbjct: 353 DSGTSVTRLTRPAYVSLRDAFRVGASHLKRAPEFSLFDTCYDLSGLSEVKVPTVVLHFR- 411
Query: 313 GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
GA V L + +P G FCFA + I GN Q + +D V F P
Sbjct: 412 GADVSLPAANYLVPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVVFDLAGSRVGFAPRG 471
Query: 373 C 373
C
Sbjct: 472 C 472
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 120/360 (33%), Positives = 170/360 (47%), Gaps = 25/360 (6%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKE 77
G YV+ +GTP Y +V DTGSD WVQC PCV CY+Q + +++PASSS+Y
Sbjct: 176 GTGNYVVTVGLGTP--ASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYAN 233
Query: 78 LSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
+SC + C LD CS C Y Y D S + G A + +T +S + FGCG
Sbjct: 234 VSCAAPACSDLDVSGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTL-SSYDAVKGFRFGCG 291
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
N G+F E GL+GLGR + SL Q + G F++CL P S+ T + FG GS
Sbjct: 292 ERNDGLFGE-AAGLLGLGRGKTSLPVQTYGKYG-GVFAHCLPP---RSTGTGYLDFGAGS 346
Query: 198 EVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
+ +T +++ T+Y+V + GI VG +L+P S + +D+G
Sbjct: 347 PPA---TTTTPMLTGNGPTFYYVGMTGIRVGG-----RLLPIAPS--VFAAAGTIVDSGT 396
Query: 258 PPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHFDGGA 314
T LP Y+ L A+ Y+ S L CY M+ +A P ++ F GGA
Sbjct: 397 VITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGA 456
Query: 315 KVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ + + V FA GDVGI GN + YD ++V F P C
Sbjct: 457 ALDVDASGIMYTVSASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 516
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 119/367 (32%), Positives = 165/367 (44%), Gaps = 24/367 (6%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
Q +S G YV+ +GTP Y ++ DTGSDL WVQC PC CY+Q P+++P+
Sbjct: 138 AQRGISLGTGNYVVSVGLGTP--AKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSL 195
Query: 72 SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
SS+Y ++C + +C LD CSS C Y Y D S T G L + +T S+
Sbjct: 196 SSTYAAVACGAPECQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASDT-LPG 254
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
VFGCG N G+F + + GL GLGR ++SL SQ G F+YCL SS + +
Sbjct: 255 FVFGCGDQNAGLFGQVD-GLFGLGREKVSLPSQGAPSYGPG-FTYCL-----PSSSSGRG 307
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
Y G T+L ++Y++ L GI VG + IP + + G
Sbjct: 308 YLSLGGAPPANAQF-TALADGATPSFYYIDLVGIKVG---GRAIRIPATAFA---AAGGT 360
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYK-TPSMAGIAPILTA 308
ID+G T LP Y L A + Y+ S L CY T P +
Sbjct: 361 VIDSGTVITRLPPRAYAPLRAAF--ARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVEL 418
Query: 309 HFDGGAKVPLIHTST-FIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
F GGA V L T ++ + FA D + I GN Q + YD +Q +
Sbjct: 419 AFAGGATVSLDFTGVLYVSKVSQACLAFAPNADDSSIAILGNTQQKTFAVTYDVANQRIG 478
Query: 368 FKPTDCT 374
F C+
Sbjct: 479 FGAKGCS 485
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 123/380 (32%), Positives = 192/380 (50%), Gaps = 43/380 (11%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
+ + + EY+M+ +IGTPP+ I + DTGSDL W QC PC C+ Q PIY+ +SSS
Sbjct: 74 ARLRSGQAEYLMELAIGTPPVPFI-ALADTGSDLTWTQCKPCKLCFGQDTPIYDTTTSSS 132
Query: 75 YKELSCQSEQCHLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
+ L C S C + + CS+ C Y Y Y D + + I+ G +
Sbjct: 133 FSPLPCSSATCLPIWSSRCSTPSATCRYRYAYDDGAYSP---ECAGISVG-------GIA 182
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
FGCG +N G+ + N G VGLGR LSL ++QLG KFSYCL F ++S++S ++F
Sbjct: 183 FGCGVDNGGL-SYNSTGTVGLGRGSLSL----VAQLGVGKFSYCLTDFF-NTSLSSPVFF 236
Query: 194 GNGSEVSGGG-------VVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
G+ +E++ V ST LV S + + Y+V+LEGIS+G+ +P N +
Sbjct: 237 GSLAELAASSASADAAVVQSTPLVQSPYNPSRYYVSLEGISLGD-----ARLPIPNGTFD 291
Query: 246 IS----KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
++ G M +D+G T+L + + + + V + P + + C+ P+ AG
Sbjct: 292 LNDDDGSGGMIVDSGTIFTILVETGFRVVVDHVAGVLG-QPVVNASSLDRPCFPAPA-AG 349
Query: 302 I-----APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG-IFGNFAQSDL 355
+ P + HF GGA + L + E FC + + G + GNF Q ++
Sbjct: 350 VQELPDMPDMVLHFAGGADMRLHRDNYMSFNEEESSFCLNIVGTESASGSVLGNFQQQNI 409
Query: 356 FIGYDFDSQMVSFKPTDCTK 375
+ +D +SF PTDC+K
Sbjct: 410 QMLFDITVGQLSFMPTDCSK 429
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 122/368 (33%), Positives = 176/368 (47%), Gaps = 23/368 (6%)
Query: 14 QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASS 72
+S + +G YV+ +G+P D+ I DTGSDL W QC PCV CY+Q + I++P++S
Sbjct: 137 KSASTLGSGNYVVTVGLGSPKR-DLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTS 195
Query: 73 SSYKELSCQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
SY +SC S C L++ + CSS C Y Y D S + G A E+++ S +
Sbjct: 196 LSYSNVSCDSPSCEKLESATGNSPGCSSST-CLYGIRYGDGSYSIGFFAREKLSL-TSTD 253
Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
F+N FGCG NN G+F GL+GL R LSL SQ + G FSYCL + SS
Sbjct: 254 VFNNFQFGCGQNNRGLFG-GTAGLLGLARNPLSLVSQTAQKYG-KVFSYCL---PSSSSS 308
Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
T + FG+G S + S V+ + ++YF+ + GISVG + +P S S
Sbjct: 309 TGYLSFGSGDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGE-----RKLPIPKS--VFS 361
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APIL 306
ID+G + LP Y+ +++ R + P CY + P +
Sbjct: 362 TAGTIIDSGTVISRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDLSKYKTVKVPKI 421
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
+F GGA++ L V V FA D +V I GN Q + + YD
Sbjct: 422 ILYFSGGAEMDLAPEGIIYVLKVSQVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGR 481
Query: 366 VSFKPTDC 373
V F P+ C
Sbjct: 482 VGFAPSGC 489
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 113/370 (30%), Positives = 164/370 (44%), Gaps = 47/370 (12%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S + +GEY ++ +G+PP Y ++D+GSD++WVQC PC QCY Q P+++PA S
Sbjct: 190 VISGMEQGSGEYFVRIGVGSPPRSQ-YMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADS 248
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
+S+ +SC S C L+ C + + C Y Y D S TKG LA E +TFG + +V
Sbjct: 249 ASFTGVSCSSSVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTFGRT--MVRSV 305
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH N G+F L G + +S Q+ Q G FSYCLV
Sbjct: 306 AIGCGHRNRGMFVGAAGLLGLGGGS-MSFVGQLGGQTGG-AFSYCLV------------- 350
Query: 193 FGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK--- 248
LV ++Y++ L G+ VG + +P +++
Sbjct: 351 ----------SAAWVPLVRNPRAPSFYYIGLAGLGVGGIR-----VPISEEVFRLTELGD 395
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGI-A 303
G + +DTG T LP Y + R+A PR CY +
Sbjct: 396 GGVVMDTGTAVTRLPTLAY----QAFRDAFLAQTANLPRATGVAIFDTCYDLLGFVSVRV 451
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
P ++ +F GG + L + IP G FCFA P + I GN Q + I +D +
Sbjct: 452 PTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGAN 511
Query: 364 QMVSFKPTDC 373
V F P C
Sbjct: 512 GYVGFGPNIC 521
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 118/361 (32%), Positives = 165/361 (45%), Gaps = 30/361 (8%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
G YV + +GTP +VDTGS L W+QC PCV C++QV P+Y+P +SS+Y + C
Sbjct: 132 GNYVTELGLGTP-ATSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLYDPRASSTYATVPC 190
Query: 81 QSEQCHLLDTV-----SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
+ QC L +CS + +C Y Y DSS + G L+ + ++FG+ + + N +G
Sbjct: 191 SASQCDELQAATLNPSACSVRNVCIYQASYGDSSFSVGYLSRDTVSFGSGS--YPNFYYG 248
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
CG +N G+F + GL+GL R +LSL Q+ LG FSYCL + S Y
Sbjct: 249 CGQDNEGLFGRSA-GLIGLARNKLSLLYQLAPSLG-YSFSYCL------PTPASTGYLSI 300
Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
G SG + S D + YFVTL G+SVG P S S ID+
Sbjct: 301 GPYTSGHYSYTPMASSSLDASLYFVTLSGMSVGG-------SPLAVSPAEYSSLPTIIDS 353
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIAPILTAHFDGG 313
G T LP Y L + V A + Q S L C++ + P + F GG
Sbjct: 354 GTVITRLPTAVYTALSKAV--AAAMVGVQSAPAFSILDTCFQGQASQLRVPAVAMAFAGG 411
Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
A + L + I + C A P D I GN Q + YD + F C
Sbjct: 412 ATLKLATQNVLIDVD-DSTTCLAFAPTDSTT-IIGNTQQQTFSVVYDVAQSRIGFAAGGC 469
Query: 374 T 374
+
Sbjct: 470 S 470
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 115/408 (28%), Positives = 190/408 (46%), Gaps = 52/408 (12%)
Query: 1 MSPATYFYPNNVVQSNVSTANGE-----------YVMKFSIGTPPLLDIYGIVDTGSDLM 49
+S A + Y N + + ++N + +++ FS+G PP+ + I+DTGS L+
Sbjct: 62 ISSARFKYLQNSIDKELGSSNFQVDVEQAIKTSLFLVNFSVGQPPVPQL-TIMDTGSSLL 120
Query: 50 WVQCLPCVQCY--KQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYAD 107
W+QC PC C + P++NPA SS++ E SC C C S C Y Y
Sbjct: 121 WIQCQPCKHCSSDHMIHPVFNPALSSTFVECSCDDRFCRYAPNGHCGSSNKCVYEQVYIS 180
Query: 108 SSLTKGVLATERITFGNSNN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQ 164
+ +KGVLA ER+TF N + FGCG+ N + G++GLG SLA Q
Sbjct: 181 GTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGYENGEQLESHFTGILGLGAKPTSLAVQ 240
Query: 165 ILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEG 224
+ +KFSYC+ + +++ G +++ G T + + + + Y++ LEG
Sbjct: 241 L-----GSKFSYCIGDLANKNYGYNQLVLGEDADILGD---PTPIEFETENSIYYMNLEG 292
Query: 225 ISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ 284
ISVG+ + + + + + + +D+G T L Y L ++++ +
Sbjct: 293 ISVGDTQLNIEPVVFKRRG---PRTGVILDSGTLYTWLADIAYRELYNEIKSIL------ 343
Query: 285 DPRL-----GSQLCYK---TPSMAGIAPILTAHFDGGAKVPLIHTSTFIP---PPVEGVF 333
DP+L LCY + + G P++T HF GGA++ + TS F P P VF
Sbjct: 344 DPKLERFWFRDFLCYHGRVSEELIGF-PVVTFHFAGGAELAMEATSMFYPLSEPNTFNVF 402
Query: 334 CFAMQPIDGDVGIFGNF------AQSDLFIGYDFDSQMVSFKPTDCTK 375
C +++P G + F AQ IGYD + + + DC +
Sbjct: 403 CMSVKPTKEHGGEYKEFTAIGLMAQQYYNIGYDLKEKNIYLQRIDCVQ 450
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 126/370 (34%), Positives = 174/370 (47%), Gaps = 28/370 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ---CYKQVKPIYNP 69
V S S GEY + +G P + + + DTGSD+ W+QC PC CYKQ+ PI++P
Sbjct: 173 VTSGASQGAGEYFARIGVGQP-VQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDP 231
Query: 70 ASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
SSSSY LSC SEQCHLLD +C + C Y Y D S T G LATE +F +SN+
Sbjct: 232 KSSSSYSPLSCDSEQCHLLDEAACDANS-CIYEVEYGDGSFTVGELATETFSFRHSNS-I 289
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
N+ GCGH+N G+F + A + SQL A FSYCLV ++SS T
Sbjct: 290 PNLPIGCGHDNEGLFVGADG-----LIGLGGGAISLSSQLEATSFSYCLVDLDSESSSTL 344
Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDK--TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
N + S TS + K D+ T+ +V + G+SVG K +P +SS I
Sbjct: 345 DF---NADQPSDS---LTSPLVKNDRFPTFRYVKVIGMSVGG-----KPLPISSSSFEID 393
Query: 248 K---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-A 303
+ G + +D+G T +P D Y+ L + K P CY S + +
Sbjct: 394 ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEV 453
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
P + G + L + I G FC A P + I GN Q + + YD +
Sbjct: 454 PTIAFILPGENSLQLPAKNCLIQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLAN 513
Query: 364 QMVSFKPTDC 373
+V F C
Sbjct: 514 SLVGFSTDKC 523
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 126/380 (33%), Positives = 182/380 (47%), Gaps = 23/380 (6%)
Query: 12 VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
++S VS +GEY M +GTPP I+DTGSDL W+QC+PC C++Q P Y+P
Sbjct: 183 TLESGVSLGSGEYFMDVFVGTPPK-HFSLILDTGSDLNWIQCVPCYACFEQNGPYYDPKD 241
Query: 72 SSSYKELSCQSEQCHLLDTVS----CSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSN 126
SSS+K ++C +C L+ + C + Q C Y Y Y DSS T G A E T +
Sbjct: 242 SSSFKNITCHDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTT 301
Query: 127 -------NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
+NV+FGCGH N G+F+ L+GLGR LS A+Q+ S G + FSYCLV
Sbjct: 302 PEGKPELKIVENVMFGCGHWNRGLFHGAAG-LLGLGRGPLSFATQLQSLYG-HSFSYCLV 359
Query: 180 PFHTDSSITSKMYFGNGSE-VSGGGVVSTSLVSKEDK---TYYFVTLEGISVGNLSNSSK 235
+++SS++SK+ FG E +S + TS V ++ T+Y+V ++ I VG K
Sbjct: 360 DRNSNSSVSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGG--EVLK 417
Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK 295
+ A G ID+G T + Y ++E IK P + + CY
Sbjct: 418 IPEETWHLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETFPPLKPCYN 477
Query: 296 TPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQS 353
+ + P F GA + FI E V C A+ + I GN+ Q
Sbjct: 478 VSGVEKMELPEFAILFADGAMWDFPVENYFIQIEPEDVVCLAILGTPRSALSIIGNYQQQ 537
Query: 354 DLFIGYDFDSQMVSFKPTDC 373
+ I YD + + P C
Sbjct: 538 NFHILYDLKKSRLGYAPMKC 557
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 104/350 (29%), Positives = 167/350 (47%), Gaps = 40/350 (11%)
Query: 38 IYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSS-Q 96
++ ++DTGSD+ W+QC PC QCYKQ ++ PA S++YK L C S C L + S S
Sbjct: 1 MFLLIDTGSDITWIQCDPCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSCLN 60
Query: 97 QLCNYTYGYADSSLTKGVLATERITFGNSNNFF---DNVVFGCGHNNTGVFNENEMGLVG 153
CNY Y D S T+G A E +T + + N FGCGH N G+FN GL+G
Sbjct: 61 SSCNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKGLFN-GAAGLMG 119
Query: 154 LGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKE 213
LG++ + +Q G FSYCL P + + + ++FG + + + + S
Sbjct: 120 LGKSSIGFPAQTSVAFG-KVFSYCL-PSVSSTIPSGILHFGEAAMLDYDVRFTPLVDSSS 177
Query: 214 DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQ 273
+ YFV++ GI+VG+ +L+P + +D+G + + Y RL +
Sbjct: 178 GPSQYFVSMTGINVGD-----ELLPI--------SATVMVDSGTVISRFEQSAYERLRDA 224
Query: 274 -------VRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPL--IHTST 323
++ A+ + P+ C++ ++ I P++T HF A++ L +H
Sbjct: 225 FTQILPGLQTAVSVAPFDT-------CFRVSTVDDINIPLITLHFRDDAELRLSPVH--- 274
Query: 324 FIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ P +GV CFA P + GNF Q +L YD + +C
Sbjct: 275 ILYPVDDGVMCFAFAPSSSGRSVLGNFQQQNLRFVYDIPKSRLGISAFEC 324
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 118/354 (33%), Positives = 160/354 (45%), Gaps = 26/354 (7%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ +G+P ++D+GSD+ WVQC PC+QC+ QV P+++P+ SS+Y SC S
Sbjct: 130 EYLITVRLGSPAKTQTV-LIDSGSDVSWVQCKPCLQCHSQVDPLFDPSLSSTYSPFSCSS 188
Query: 83 EQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
C L D CSS C Y YAD S T G +++ + G +N N FGC H
Sbjct: 189 AACAQLGQDGNGCSSSSQCQYIVRYADGSSTTGTYSSDTLALG--SNTISNFQFGCSHVE 246
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
+G FN+ GL+GLG SLASQ G FSYCL P + S + G G+
Sbjct: 247 SG-FNDLTDGLMGLGGGAPSLASQTAGTFG-TAFSYCLPPTPSSSGF---LTLGAGTS-- 299
Query: 201 GGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPT 260
G V + L S T+Y V LE I VG S IP ++ M +D+G T
Sbjct: 300 -GFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLS---IPT-----SVFSAGMVMDSGTIIT 350
Query: 261 LLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLI 319
LP+ Y+ L + +K PR C+ + + P + F GGA V L
Sbjct: 351 RLPRTAYSALSSAFKAGMKQYRPAPPRSIMDTCFDFSGQSSVRLPSVALVFSGGAVVNLD 410
Query: 320 HTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ FA D GI GN Q + YD V FK C
Sbjct: 411 ANGIIL----GNCLAFAANSDDSSPGIVGNVQQRTFEVLYDVGGGAVGFKAGAC 460
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 121/372 (32%), Positives = 173/372 (46%), Gaps = 27/372 (7%)
Query: 9 PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCV-QCYKQVKPI 66
P+ S + + G YV+ +GTP Y +V DTGSD WVQC PCV +CYKQ +P+
Sbjct: 148 PSLPATSGRAVSTGNYVVTVGLGTP--ASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEPL 205
Query: 67 YNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
++PA SS+Y +SC C LDT C+ C Y Y D S T G A + +T ++
Sbjct: 206 FDPAKSSTYANVSCTDSACADLDTNGCTGGH-CLYAVQYGDGSYTVGFFAQDTLTI--AH 262
Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
+ FGCG N G+F + GL+GLGR + SL Q ++ G F+YCL T
Sbjct: 263 DAIKGFRFGCGEKNNGLFGKT-AGLMGLGRGKTSLTVQAYNKYG-GAFAYCLPALTTG-- 318
Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
T + FG GS +G T +++ + +T+Y+V + GI VG + +P S
Sbjct: 319 -TGYLDFGPGS--AGNNARLTPMLTDKGQTFYYVGMTGIRVGG-----QQVPVAES--VF 368
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA- 303
S +D+G T LP Y L + Y+ S L CY ++ +
Sbjct: 369 STAGTLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVEL 428
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFC--FAMQPIDGDVGIFGNFAQSDLFIGYDF 361
P ++ F GGA + + S + E C FA D V I GN Q + YD
Sbjct: 429 PTVSLVFQGGACLD-VDVSGIVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDL 487
Query: 362 DSQMVSFKPTDC 373
+ V F P C
Sbjct: 488 GKKTVGFAPGSC 499
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 125/359 (34%), Positives = 172/359 (47%), Gaps = 27/359 (7%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELS 79
+G YV+ GTP + DTGSD+ W+QC PC V+CY Q +P+++P+ SS+Y+ +S
Sbjct: 13 SGNYVITVGFGTPTRTQTV-VFDTGSDVNWLQCKPCAVRCYAQQEPLFDPSLSSTYRNVS 71
Query: 80 CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
C C L T CSS C Y Y D S T G LA + + F N +FGCG N
Sbjct: 72 CTEPACVGLSTRGCSSST-CLYGVFYGDGSSTIGFLAMDTFMLTPAQK-FKNFIFGCGQN 129
Query: 140 NTGVFNENEMGLVGLGRTRL-SLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
NTG+F + GLVGLGR+ SL SQ+ LG N FSYCL + SS T + GN
Sbjct: 130 NTGLF-QGTAGLVGLGRSSTYSLNSQVAPSLG-NVFSYCL---PSTSSATGYLNIGNPQN 184
Query: 199 VSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAISKGNMFIDTG 256
G + L T YF+ L GISVG LS SS + + S G I ID+G
Sbjct: 185 TPG---YTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTV---FQSVGTI------IDSG 232
Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK-TPSMAGIAPILTAHFDG-GA 314
T LP Y+ L+ VR A+ CY + + + + P++ HF G
Sbjct: 233 TVITRLPPTAYSALKTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIVLHFAGLDV 292
Query: 315 KVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
++P F+ + FA +GI GN Q + + YD + + + F C
Sbjct: 293 RIPATGVF-FVFNSSQVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRIGFSAGAC 350
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 130/363 (35%), Positives = 165/363 (45%), Gaps = 40/363 (11%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPASSSSYKELSC 80
+YV+ S+GTP + VDTGSD+ WVQC PC CY Q P+++P SSSY + C
Sbjct: 141 QYVVTVSLGTPAVAQTL-EVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPC 199
Query: 81 QSEQCHLLDTVS--CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
+ C L S CS Q C Y Y D S T GV +++ +T SN +FGCGH
Sbjct: 200 AAASCSQLALYSNGCSGGQ-CGYVVSYGDGSTTTGVYSSDTLTLTGSNA-LKGFLFGCGH 257
Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
G+F + GL+GLGR SL SQ S G FSYCL P S Y G
Sbjct: 258 AQQGLFAGVD-GLLGLGRQGQSLVSQASSTYG-GVFSYCLPPTQ-----NSVGYISLGGP 310
Query: 199 VSGGGVVSTSLVSKE-DKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKGNMFIDT 255
S G +T L++ D TYY V L GISVG LS + + +SGA+ +DT
Sbjct: 311 SSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVF----ASGAV------VDT 360
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGIA-PILTAHF 310
G T LP Y+ L R A + PY P + CY + P ++ F
Sbjct: 361 GTVVTRLPPTAYSALRSAFRAA--MAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAF 418
Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
GGA + L TS + G FA D I GN Q + FD V F P
Sbjct: 419 GGGAAMDL-GTSGIL---TSGCLAFAPTGGDSQASILGNVQQRSFEV--RFDGSTVGFMP 472
Query: 371 TDC 373
C
Sbjct: 473 ASC 475
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 122/365 (33%), Positives = 172/365 (47%), Gaps = 22/365 (6%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQ-CYKQVKPIYNPASS 72
S + G YV+ +GTP Y +V DTGSD WVQC PCV CYKQ + +++PA S
Sbjct: 173 SGRALGTGNYVVTIGLGTP--ASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARS 230
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+Y +SC + C L T CS C Y+ Y D S + G A + +T +S +
Sbjct: 231 STYANVSCAAPACSDLYTRGCSGGH-CLYSVQYGDGSYSIGFFAMDTLTL-SSYDAVKGF 288
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGCG N G+F E GL+GLGR + SL Q + G F++CL SS T +
Sbjct: 289 RFGCGERNEGLFGE-AAGLLGLGRGKTSLPVQTYDKYG-GVFAHCLP---ARSSGTGYLD 343
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
FG GS + G +T +++ T+Y+V + GI VG +L+ S S
Sbjct: 344 FGPGSPAAVGARQTTPMLTDNGPTFYYVGMTGIRVGG-----QLLSIPQS--VFSTAGTI 396
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAH 309
+D+G T LP Y+ L +A+ Y+ S L CY M+ +A P ++
Sbjct: 397 VDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPALSLLDTCYDFTGMSEVAIPKVSLL 456
Query: 310 FDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
F GGA + + + + V FA D DVGI GN + YD + V F
Sbjct: 457 FQGGAYLDVNASGIMYAASLSQVCLGFAANEDDDDVGIVGNTQLKTFGVVYDIGKKTVGF 516
Query: 369 KPTDC 373
P C
Sbjct: 517 SPGAC 521
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 116/360 (32%), Positives = 168/360 (46%), Gaps = 13/360 (3%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
++ +GEY + +GTP +Y ++DTGSD++W+QC PC +CY Q +++P S +Y
Sbjct: 111 LAQGSGEYFTRIGVGTPARY-VYMVLDTGSDVVWLQCAPCRKCYTQTDHVFDPTKSRTYA 169
Query: 77 ELSCQSEQCHLLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
+ C + C LD+ CS++ ++C Y Y D S T G +TE +TF N V G
Sbjct: 170 GIPCGAPLCRRLDSPGCSNKNKVCQYQVSYGDGSFTFGDFSTETLTF--RRNRVTRVALG 227
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
CGH+N G+F L RLS Q + +KFSYCLV + S+ S + FG+
Sbjct: 228 CGHDNEGLFTGAAGLLGLGR-GRLSFPVQTGRRFN-HKFSYCLVD-RSASAKPSSVIFGD 284
Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
S VS + + + + T+Y++ L GISVG + A G + ID+
Sbjct: 285 -SAVSRTAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAA-GNGGVIIDS 342
Query: 256 GAPPTLLPKDFYNRLEEQVR-NAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGG 313
G T L + Y L + R A L + L C+ + + P + HF G
Sbjct: 343 GTSVTRLTRPAYIALRDAFRIGASHLKRAPEFSL-FDTCFDLSGLTEVKVPTVVLHFR-G 400
Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
A V L T+ IP G FCFA + I GN Q I YD V F P C
Sbjct: 401 ADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRISYDLTGSRVGFAPRGC 460
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 113/359 (31%), Positives = 167/359 (46%), Gaps = 25/359 (6%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
G YV + +GTP +VDTGS L W+QC PCV C++QV P+++P +SS+Y + C
Sbjct: 132 GNYVTQLGLGTPST-SYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYASVRC 190
Query: 81 QSEQCHLLDTV-----SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
+ QC L +CS+ +C Y Y DSS + G L+T+ ++FG++ + + +G
Sbjct: 191 SASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGSLSTDTVSFGSTR--YPSFYYG 248
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
CG +N G+F + GL+GL R +LSL Q+ LG + FSYCL P + S +
Sbjct: 249 CGQDNEGLFGRSA-GLIGLARNKLSLLYQLAPSLGYS-FSYCL-PTAASTGYLSIGPYNT 305
Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
G S + S+SL D + YF+TL G+SVG P S S ID+
Sbjct: 306 GHYYSYTPMASSSL----DASLYFITLSGMSVGG-------SPLAVSPSEYSSLPTIIDS 354
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
G T LP + L + V A+ C++ + P + F GGA
Sbjct: 355 GTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQLRVPTVAMAFAGGAS 414
Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ L + I + C A P D I GN Q + YD + F C+
Sbjct: 415 MKLTTRNVLIDVD-DSTTCLAFAPTD-STAIIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 121/379 (31%), Positives = 178/379 (46%), Gaps = 29/379 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC-LPC--VQCYKQVKPIYNP 69
V + V A +Y+ ++ IG PP ++DTGS+L+W QC C C KQ P YN
Sbjct: 73 VSAPVHLATRQYIAEYLIGDPPQ-RAAALIDTGSNLIWTQCGTTCGLKACAKQDLPYYNL 131
Query: 70 ASSSSYKELSC--QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
+ SS++ + C ++ C C C + Y S+ G L TE TF +
Sbjct: 132 SRSSTFAAVPCADSAKLCAANGVHLCGLDGSCTFAASYGAGSVF-GSLGTEAFTFQSGAA 190
Query: 128 FFDNVVFGC---GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
+ FGC G N GL+GLGR RLSL +SQ GA KFSYCL P+ +
Sbjct: 191 ---KLGFGCVSLTRITKGALN-GASGLIGLGRGRLSL----VSQTGATKFSYCLTPYLRN 242
Query: 185 SSITSKMYFGNGSEVSGGGVVSTSL---VSKED---KTYYFVTLEGISVG--NLSNSSKL 236
+S ++ G + +SGGG TS+ S ED T+Y++ L GISVG L S
Sbjct: 243 HGASSHLFVGASASLSGGGGAVTSIPFVKSPEDYPYSTFYYLPLVGISVGETKLPIPSAA 302
Query: 237 IPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPR-LGSQLCYK 295
+ G + IDTG+P T L + Y+ L ++V + + Q P G LC
Sbjct: 303 FELRRVAAGYWSGGVIIDTGSPVTSLAEAAYSALSDEVARQLNRSLVQPPADTGLDLCVA 362
Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDL 355
+ + P+L HF GGA + + ++ P + C ++ G + GNF Q D+
Sbjct: 363 RQDVDKVVPVLVFHFGGGADMA-VSAGSYWGPVDKSTACMLIEE-GGYETVIGNFQQQDV 420
Query: 356 FIGYDFDSQMVSFKPTDCT 374
+ YD +SF+ DC+
Sbjct: 421 HLLYDIGKGELSFQTADCS 439
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 154 bits (390), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 126/365 (34%), Positives = 168/365 (46%), Gaps = 28/365 (7%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSSSYKEL 78
+G Y++ +GTP D+ I DTGSDL W QC PCV+ CY Q +PI+NP+ S+SY +
Sbjct: 128 GSGNYIVTVGLGTPKN-DLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNV 186
Query: 79 SCQSEQCHLLDTV-----SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
SC S C L + SCS+ C Y Y D S + G LA E+ T NS + FD V
Sbjct: 187 SCSSAACGSLSSATGNAGSCSASN-CIYGIQYGDQSFSVGFLAKEKFTLTNS-DVFDGVY 244
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSITSKMY 192
FGCG NN G+F GL+GLGR +LS SQ + NK FSYCL + +S T +
Sbjct: 245 FGCGENNQGLFT-GVAGLLGLGRDKLSFPSQTATAY--NKIFSYCL---PSSASYTGHLT 298
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKL-IPYYNSSGAISKGNM 251
FG+ S ++ + ++Y + + I+VG KL IP S S
Sbjct: 299 FGSAGISRSVKFTPISTIT-DGTSFYGLNIVAITVGG----QKLPIP----STVFSTPGA 349
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHF 310
ID+G T LP Y L + + P C+ + P + F
Sbjct: 350 LIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSF 409
Query: 311 DGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
GGA V L F + V FA D + IFGN Q L + YD V F
Sbjct: 410 SGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFA 469
Query: 370 PTDCT 374
P C+
Sbjct: 470 PNGCS 474
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 154 bits (390), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 131/363 (36%), Positives = 168/363 (46%), Gaps = 40/363 (11%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPASSSSYKELSC 80
+YV+ S+GTP + VDTGSD+ WVQC PC CY Q P+++P SSSY + C
Sbjct: 130 QYVVTVSLGTPAVAQTL-EVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPC 188
Query: 81 QSEQCHLLDTVS--CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
+ C L S CS Q C Y Y D S T GV +++ +T SN +FGCGH
Sbjct: 189 AAASCSQLALYSNGCSGGQ-CGYVVSYGDGSTTTGVYSSDTLTLTGSNA-LKGFLFGCGH 246
Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
G+F + GL+GLGR SL SQ S G FSYCL P T +S+ Y G
Sbjct: 247 AQQGLFAGVD-GLLGLGRQGQSLVSQASSTYG-GVFSYCLPP--TQNSVG---YISLGGP 299
Query: 199 VSGGGVVSTSLVSKE-DKTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKGNMFIDT 255
S G +T L++ D TYY V L GISVG LS + + +SGA+ +DT
Sbjct: 300 SSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVF----ASGAV------VDT 349
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGIA-PILTAHF 310
G T LP Y+ L R A + PY P + CY + P ++ F
Sbjct: 350 GTVVTRLPPTAYSALRSAFRAA--MAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAF 407
Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
GGA + L TS + G FA D I GN Q + FD V F P
Sbjct: 408 GGGAAMDL-GTSGIL---TSGCLAFAPTGGDSQASILGNVQQRSFEV--RFDGSTVGFMP 461
Query: 371 TDC 373
C
Sbjct: 462 ASC 464
>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 486
Score = 154 bits (390), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 120/395 (30%), Positives = 179/395 (45%), Gaps = 48/395 (12%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP---IYNP 69
V + V + EY+M +GTPP+ + I DTGSDL+WV+C P + P
Sbjct: 99 VVAEVVSRQFEYLMAIEVGTPPV-RVLAIADTGSDLVWVKCKGKDNDNNSTAPPSVYFVP 157
Query: 70 ASSSSYKELSCQSEQCHLLDTV-SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
++SS+Y + C ++ C L + SCS C Y Y Y D S G L+TE TF +
Sbjct: 158 SASSTYGRVGCDTKACRALSSAASCSPDGSCEYLYSYGDGSRASGQLSTETFTFSTIADS 217
Query: 129 --------------------FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQI--L 166
+ FGC TG F + + +G +SLASQ+
Sbjct: 218 SKTNSHGNNNNNSSSHGQVEIAKLDFGCSTTTTGTFRADGLVGLGG--GPVSLASQLGAT 275
Query: 167 SQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGIS 226
+ LG KFSYCL P+ +++ +S + FG+ + VS G ST L++ E +TYY + L+ I+
Sbjct: 276 TSLG-RKFSYCLAPY-ANTNASSALNFGSRAVVSEPGAASTPLITGEVETYYTIALDSIN 333
Query: 227 VGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP 286
V + ++ ++ +D+G T L L + + IKL + P
Sbjct: 334 VAGTKRPT----------TAAQAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKLPRAESP 383
Query: 287 RLGSQLCYKTPSMAGI----APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG 342
LCY + G P +T GG +V L +TF+ EGV C A+
Sbjct: 384 EKILDLCYDISGVRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVVQ-EGVLCLALVATSE 442
Query: 343 --DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
V I GN AQ +L +GYD + V+F DC K
Sbjct: 443 RQSVSILGNIAQQNLHVGYDLEKGTVTFAAADCAK 477
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 119/367 (32%), Positives = 170/367 (46%), Gaps = 31/367 (8%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
++T ++++ +G PP Y I D +D W+QC PC++CY Q I++P+ SSSY
Sbjct: 180 ITTGTSNFLVQIGVGGPPQ-KFYMIFDLQTDFTWLQCQPCIKCYDQPDSIFDPSQSSSYT 238
Query: 77 ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
LSC+++ C+LL SCS C Y Y D + T+GVL E ++F S+ + D V GC
Sbjct: 239 LLSCETKHCNLLPNSSCSDDGYCRYNITYKDGTNTEGVLINETVSF-ESSGWVDRVSLGC 297
Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
+ N G F ++ G GLGR LS S+I A+ SYCLV D +S + F
Sbjct: 298 SNKNQGPFVGSD-GTFGLGRGSLSFPSRI----NASSMSYCLVE-SKDGYSSSTLEF--N 349
Query: 197 SEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG----NLSNSSKLI-PYYNSSGAISKGNM 251
S G V + L + + + Y+V L+GI VG ++ NS+ I PY N G M
Sbjct: 350 SPPCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGN-------GGM 402
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL----CYKTPSMAGIA-PIL 306
+ + + T+L D YN VR+A RL + L CY S + PIL
Sbjct: 403 IVSSSSLITMLENDTYNV----VRDAFVAKTQHLERLKAFLQFDTCYNLSSNNTVELPIL 458
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
+ G L S G FCFA P G I G Q + +D + V
Sbjct: 459 EFEVNDGKSWLLPKESYLYAVDKNGTFCFAFAPSKGSFSILGTLQQYGTRVTFDLVNSFV 518
Query: 367 SFKPTDC 373
C
Sbjct: 519 YLHTLCC 525
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 121/372 (32%), Positives = 172/372 (46%), Gaps = 27/372 (7%)
Query: 9 PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCV-QCYKQVKPI 66
P+ S + + G YV+ +GTP Y +V DTGSD WVQC PCV +CYKQ P+
Sbjct: 148 PSLPATSGRAVSTGNYVVTVGLGTP--ASKYTVVFDTGSDTTWVQCRPCVVKCYKQKGPL 205
Query: 67 YNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
++PA SS+Y +SC C LDT C+ C Y Y D S T G A + +T ++
Sbjct: 206 FDPAKSSTYANVSCTDSACADLDTNGCTGGH-CLYAVQYGDGSYTVGFFAQDTLTI--AH 262
Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
+ FGCG N G+F + GL+GLGR + SL Q ++ G F+YCL T
Sbjct: 263 DAIKGFRFGCGEKNNGLFGKTA-GLMGLGRGKTSLTVQAYNKYG-GAFAYCLPALTTG-- 318
Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
T + FG GS +G T +++ + +T+Y+V + GI VG + +P S
Sbjct: 319 -TGYLDFGPGS--AGNNARLTPMLTDKGQTFYYVGMTGIRVGG-----QQVPVAES--VF 368
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA- 303
S +D+G T LP Y L + Y+ S L CY ++ +
Sbjct: 369 STAGTLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVEL 428
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFC--FAMQPIDGDVGIFGNFAQSDLFIGYDF 361
P ++ F GGA + + S + E C FA D V I GN Q + YD
Sbjct: 429 PTVSLVFQGGACLD-VDVSGIVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDL 487
Query: 362 DSQMVSFKPTDC 373
+ V F P C
Sbjct: 488 GKKTVGFAPGSC 499
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 154 bits (389), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 113/359 (31%), Positives = 168/359 (46%), Gaps = 25/359 (6%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
G YV + +GTP +VDTGS L W+QC PCV C++QV P+++P +SS+Y + C
Sbjct: 132 GNYVTQLGLGTPST-SYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYTSVRC 190
Query: 81 QSEQCHLLDTV-----SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
+ QC L +CS+ +C Y Y DSS + G L+T+ ++FG+++ + + +G
Sbjct: 191 SASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGYLSTDTVSFGSTS--YPSFYYG 248
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
CG +N G+F + GL+GL R +LSL Q+ LG + FSYCL P + S +
Sbjct: 249 CGQDNEGLFGRSA-GLIGLARNKLSLLYQLAPSLGYS-FSYCL-PTAASTGYLSIGPYNT 305
Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
G S + S+SL D + YF+TL G+SVG P S S ID+
Sbjct: 306 GHYYSYTPMASSSL----DASLYFITLSGMSVGG-------SPLAVSPSEYSSLPTIIDS 354
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
G T LP + L + V A+ C++ + P + F GGA
Sbjct: 355 GTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQLRVPTVVMAFAGGAS 414
Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ L + I + C A P D I GN Q + YD + F C+
Sbjct: 415 MKLTTRNVLIDVD-DSTTCLAFAPTD-STAIIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 154 bits (389), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 118/347 (34%), Positives = 162/347 (46%), Gaps = 37/347 (10%)
Query: 41 IVDTGSDLMWVQCLPCVQ---CYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQ 97
++DTGSD+ W+QCLPC CY+Q+ PI++P SSSY +SC SEQC LLD C+
Sbjct: 13 VLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQLLDEAGCNVNS 72
Query: 98 LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRT 157
C Y Y D S T G LATE +TF +SN+ N+ GCGH+N G+F +
Sbjct: 73 -CIYKVEYGDGSFTIGELATETLTFVHSNS-IPNISIGCGHDNEGLFVGADG-----LIG 125
Query: 158 RLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTY 217
A I SQL A+ FSYCLV DS S + F S SL+S K
Sbjct: 126 LGGGAISISSQLKASSFSYCLV--DIDSPSFSTLDFNTDPP-------SDSLISPLVKND 176
Query: 218 YFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEE-- 272
F + + V +S K +P +S I + G + +D+G T LP D Y L E
Sbjct: 177 RFPSFRYVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEVLREAF 236
Query: 273 -----QVRNAIKLTPYQDP-RLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIP 326
+ A +++P+ L SQ + P++A I P G + L + I
Sbjct: 237 LGLTTNLPPAPEISPFDTCYDLSSQSNVEVPTIAFILP-------GENSLQLPAKNCLIQ 289
Query: 327 PPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
G FC A + I GNF Q + + YD + +V F C
Sbjct: 290 VDSAGTFCLAFVSATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 154 bits (389), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 126/365 (34%), Positives = 168/365 (46%), Gaps = 28/365 (7%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSSSYKEL 78
+G Y++ +GTP D+ I DTGSDL W QC PCV+ CY Q +PI+NP+ S+SY +
Sbjct: 100 GSGNYIVTVGLGTPKN-DLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNV 158
Query: 79 SCQSEQCHLLDTV-----SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
SC S C L + SCS+ C Y Y D S + G LA E+ T NS + FD V
Sbjct: 159 SCSSAACGSLSSATGNAGSCSASN-CIYGIQYGDQSFSVGFLAKEKFTLTNS-DVFDGVY 216
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSITSKMY 192
FGCG NN G+F GL+GLGR +LS SQ + NK FSYCL + +S T +
Sbjct: 217 FGCGENNQGLFT-GVAGLLGLGRDKLSFPSQTATAY--NKIFSYCL---PSSASYTGHLT 270
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKL-IPYYNSSGAISKGNM 251
FG+ S ++ + ++Y + + I+VG KL IP S S
Sbjct: 271 FGSAGISRSVKFTPISTIT-DGTSFYGLNIVAITVGG----QKLPIP----STVFSTPGA 321
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHF 310
ID+G T LP Y L + + P C+ + P + F
Sbjct: 322 LIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSF 381
Query: 311 DGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
GGA V L F + V FA D + IFGN Q L + YD V F
Sbjct: 382 SGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFA 441
Query: 370 PTDCT 374
P C+
Sbjct: 442 PNGCS 446
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 154 bits (389), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 110/385 (28%), Positives = 178/385 (46%), Gaps = 34/385 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S V +GEY +G PP + ++DTGSDL+W+QCLPC +CY+QV P+Y+P +S
Sbjct: 81 VMSGVPFDSGEYFAVIGVGDPPTHALV-VIDTGSDLIWLQCLPCRRCYRQVTPLYDPRNS 139
Query: 73 SSYKELSCQSEQCH-LLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
+++ + C S QC +L C ++ C Y Y D S + G LAT+ + +
Sbjct: 140 KTHRRIPCASPQCRGVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDTLVLPDDTRVH- 198
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
NV GCGH+N G+ + GL+G GR +LS +Q+ G + FSYCL + + +S
Sbjct: 199 NVTLGCGHDNEGLL-ASAAGLLGAGRGQLSFPTQLAPAYG-HVFSYCLGDRMSRARNSSS 256
Query: 191 -MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS-- 247
+ FG E+ + YY V + G SVG ++ + N+S A++
Sbjct: 257 YLVFGRTPELPSTAFTPLRTNPRRPSLYY-VDMVGFSVGG----ERVAGFSNASLALNPA 311
Query: 248 --KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL-----CYKT---- 296
+G + +D+G + +D Y + + + + RL ++ CY
Sbjct: 312 TGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMR--RLRNKFSVFDTCYDVHGNG 369
Query: 297 PSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG-----VFCFAMQPIDGDVGIFGNFA 351
P P + HF A + L + I PV G FC +Q D + + GN
Sbjct: 370 PGTGVRVPSIVLHFAAAADMALPQANYLI--PVVGGDRRTYFCLGLQAADDGLNVLGNVQ 427
Query: 352 QSDLFIGYDFDSQMVSFKPTDCTKQ 376
Q + +D + + F P C+ +
Sbjct: 428 QQGFGVVFDVERGRIGFTPNGCSGE 452
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 125/370 (33%), Positives = 172/370 (46%), Gaps = 28/370 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ---CYKQVKPIYNP 69
V S S GEY + +G P + + + DTGSD+ W+QC PC CYKQ+ PI++P
Sbjct: 173 VTSGASQGAGEYFARIGVGQP-VQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDP 231
Query: 70 ASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
SSSSY LSC SEQCHLLD +C + C Y Y D S T G LATE +F +SN+
Sbjct: 232 KSSSSYSPLSCDSEQCHLLDEAACDANS-CIYEVEYGDGSFTVGELATETFSFRHSNS-I 289
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
N+ GCGH+N G+F A + SQL A FSYCLV ++SS T
Sbjct: 290 PNLPIGCGHDNEGLFVGAAG-----LIGLGGGAISLSSQLEATSFSYCLVDLDSESSSTL 344
Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDK--TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
N + S TS + K D+ T+ +V + G+SVG K +P +SS I
Sbjct: 345 DF---NADQPSDS---LTSPLVKNDRFPTFRYVKVIGMSVGG-----KPLPISSSSFEID 393
Query: 248 K---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-A 303
+ G + +D+G T +P D Y+ L + K P CY S + +
Sbjct: 394 ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEV 453
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
P + G + L + G FC A P + I GN Q + + YD +
Sbjct: 454 PTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLAN 513
Query: 364 QMVSFKPTDC 373
+V F C
Sbjct: 514 SLVGFSTDKC 523
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 119/360 (33%), Positives = 169/360 (46%), Gaps = 25/360 (6%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKE 77
G YV+ +GTP Y +V DTGSD WVQC PCV CY+Q + +++PASSS+Y
Sbjct: 179 GTGNYVVTVGLGTPA--SRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYAN 236
Query: 78 LSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
+SC + C LD CS C Y Y D S + G A + +T +S + FGCG
Sbjct: 237 VSCAAPACSDLDVSGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTL-SSYDAVKGFRFGCG 294
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
N G+F E GL+GLGR + SL Q + G F++CL S+ T + FG GS
Sbjct: 295 ERNDGLFGE-AAGLLGLGRGKTSLPVQTYGKYG-GVFAHCLP---ARSTGTGYLDFGAGS 349
Query: 198 EVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
+ +T +++ T+Y+V + GI VG +L+P S + +D+G
Sbjct: 350 PPA---TTTTPMLTGNGPTFYYVGMTGIRVGG-----RLLPIAPS--VFAAAGTIVDSGT 399
Query: 258 PPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHFDGGA 314
T LP Y+ L A+ Y+ S L CY M+ +A P ++ F GGA
Sbjct: 400 VITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGA 459
Query: 315 KVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ + + V FA GDVGI GN + YD ++V F P C
Sbjct: 460 ALDVDASGIMYTVSASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 154 bits (388), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 125/371 (33%), Positives = 181/371 (48%), Gaps = 33/371 (8%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
GEY+ +GTP + IVDTGSDL WVQC PC CY Q ++ P +S+S+ +L+C
Sbjct: 1 GEYLATVRLGTPERV-FSVIVDTGSDLTWVQCSPCGTCYSQNDSLFIPNTSTSFTKLACG 59
Query: 82 SEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN---NFFDNVVFGCGH 138
+E C+ L C +Q C Y Y Y D SL+ G + IT N N FGCGH
Sbjct: 60 TELCNGLPYPMC-NQTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFGCGH 118
Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
+N G F + G++GLG+ LS SQ L + KFSYCLV + + TS + FG+ +
Sbjct: 119 DNEGSFAGAD-GILGLGQGPLSFPSQ-LKTVFNGKFSYCLVDWLAPPTQTSPLLFGDAAV 176
Query: 199 VSGGGVVSTSLVSKED-KTYYFVTLEGISVG----NLSNSSKLIPYYNSSGAISKGNMFI 253
+ GV SL++ TYY+V L GISVG N+S+++ I +G I
Sbjct: 177 PTFPGVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTI------F 230
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ-DPRLGSQLCYKTPSMAGIA-------PI 305
D+G T L + + + + + P + D G LC + G A P
Sbjct: 231 DSGTTVTQLAGEVHQEVLAAMNASTMDYPRKSDDSSGLDLC-----LGGFAEGQLPTVPS 285
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
+T HF+GG + L ++ FI +CF+M DV I G+ Q + + YD +
Sbjct: 286 MTFHFEGG-DMELPPSNYFIFLESSQSYCFSMVS-SPDVTIIGSIQQQNFQVYYDTVGRK 343
Query: 366 VSFKPTDCTKQ 376
+ F P C +
Sbjct: 344 IGFVPKSCVGR 354
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 154 bits (388), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 114/363 (31%), Positives = 169/363 (46%), Gaps = 19/363 (5%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
++ +GEY + +GTP +Y ++DTGSD++W+QC PC +CY Q P+++P S +Y
Sbjct: 122 LAQGSGEYFTRIGVGTPARY-VYMVLDTGSDVVWLQCAPCRKCYTQADPVFDPTKSRTYA 180
Query: 77 ELSCQSEQCHLLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
+ C + C LD+ C+++ ++C Y Y D S T G +TE +TF + V G
Sbjct: 181 GIPCGAPLCRRLDSPGCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRTR--VTRVALG 238
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
CGH+N G+F L RLS Q + KFSYCLV + S+ S + FG+
Sbjct: 239 CGHDNEGLFIGAAGLLGLGR-GRLSFPVQTGRRFN-QKFSYCLVD-RSASAKPSSVVFGD 295
Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
S VS + + + + T+Y++ L GISVG S L A G + ID+
Sbjct: 296 -SAVSRTARFTPLIKNPKLDTFYYLELLGISVGG-SPVRGLSASLFRLDAAGNGGVIIDS 353
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGI-APILTAHF 310
G T L + Y L R+A ++ R C+ + + P + HF
Sbjct: 354 GTSVTRLTRPAYIAL----RDAFRVGASHLKRAAEFSLFDTCFDLSGLTEVKVPTVVLHF 409
Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
GA V L T+ IP G FCFA + I GN Q + +D V F P
Sbjct: 410 R-GADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVSFDLAGSRVGFAP 468
Query: 371 TDC 373
C
Sbjct: 469 RGC 471
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 154 bits (388), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 119/359 (33%), Positives = 169/359 (47%), Gaps = 25/359 (6%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKEL 78
G YV+ +GTP Y +V DTGSD WVQC PCV CY+Q + +++PASSS+Y +
Sbjct: 176 TGNYVVTVGLGTPA--SRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANV 233
Query: 79 SCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
SC + C LD CS C Y Y D S + G A + +T +S + FGCG
Sbjct: 234 SCAAPACSDLDVSGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTL-SSYDAVKGFRFGCGE 291
Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
N G+F E GL+GLGR + SL Q + G F++CL S+ T + FG GS
Sbjct: 292 RNDGLFGE-AAGLLGLGRGKTSLPVQTYGKYG-GVFAHCLP---ARSTGTGYLDFGAGSP 346
Query: 199 VSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
+ +T +++ T+Y+V + GI VG +L+P S + +D+G
Sbjct: 347 PA---TTTTPMLTGNGPTFYYVGMTGIRVGG-----RLLPIAPS--VFAAAGTIVDSGTV 396
Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHFDGGAK 315
T LP Y+ L A+ Y+ S L CY M+ +A P ++ F GGA
Sbjct: 397 ITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAA 456
Query: 316 VPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ + + V FA GDVGI GN + YD ++V F P C
Sbjct: 457 LDVDASGIMYTVSASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 515
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 119/372 (31%), Positives = 178/372 (47%), Gaps = 37/372 (9%)
Query: 18 STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKE 77
ST +++ FS+G P + I+DTGS+++WV+C PC +C +Q P+ +P+ SS+Y
Sbjct: 93 STYEPLFLVNFSMGQPATPQL-AIMDTGSNILWVRCAPCKRCTQQNGPLLDPSKSSTYAS 151
Query: 78 LSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN---NFFDNVVF 134
L C + CH + C+ C Y YA + GVLATE++ F +S+ N +VVF
Sbjct: 152 LPCTNTMCHYAPSAYCNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSDEGVNAVPSVVF 211
Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
GC H N + G+ GLG+ S +++ +KFSYCL +++ FG
Sbjct: 212 GCSHENGDYKDRRFTGVFGLGKGITSFVTRM-----GSKFSYCLGNIADPHYGYNQLVFG 266
Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN---M 251
+ G ST L K +Y+VTLEGISVG +S+ KGN
Sbjct: 267 EKANFEG---YSTPL--KVVNGHYYVTLEGISVGEKRLD------IDSTAFSMKGNEKSA 315
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQLCYK-TPSMAGIA-PILT 307
ID+G T L + + L+ +VR + L P+ GS CYK T S I P++T
Sbjct: 316 LIDSGTALTWLAESAFRALDNEVRQLLDGVLMPFWR---GSFACYKGTVSQDLIGFPVVT 372
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG------DVGIFGNFAQSDLFIGYDF 361
HF GGA + L S F + + C A++ + G AQ + YD
Sbjct: 373 FHFSGGADLDLDTESMFYQATPD-ILCIAVRQASAYGNDFKSFSVIGLMAQQYYNMAYDL 431
Query: 362 DSQMVSFKPTDC 373
+S + F+ DC
Sbjct: 432 NSNKLFFQRIDC 443
>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
Length = 501
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 118/375 (31%), Positives = 173/375 (46%), Gaps = 23/375 (6%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S ++ +GEY K +GTP + ++DTGSD++W+QC PC +CY Q +++P +S
Sbjct: 136 VVSGLAQGSGEYFTKIGVGTP-VTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRAS 194
Query: 73 SSYKELSCQSEQCHLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
SY + C + C LD+ C ++ C Y Y D S+T G ATE +TF S
Sbjct: 195 HSYGAVDCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFA-SGARVPR 253
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV----PFHTDSSI 187
V GCGH+N G+F L+GLGR LS SQI + G FSYCLV + +S
Sbjct: 254 VALGCGHDNEGLFVAAAG-LLGLGRGSLSFPSQISRRFG-RSFSYCLVDRTSSSASATSR 311
Query: 188 TSKMYFGNGSEVS-GGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
+S + FG+G+ + G V+ +D G + + +
Sbjct: 312 SSTVTFGSGARGALGRRVLHPDGEEPQDGDVLLRAAHGHQRRRRARPGRGRVRPPPDPST 371
Query: 247 SKGNMFIDTGAP-PTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQL---CYKTPSM 299
+G + +D+G P P + R A ++L+P G L CY +
Sbjct: 372 GRGGVIVDSGRPSPAWARAGRTPPCATRSRAAAAGLRLSPG-----GFSLFDTCYDLSGL 426
Query: 300 AGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIG 358
+ P ++ HF GGA+ L + IP G FCFA DG V I GN Q +
Sbjct: 427 KVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVV 486
Query: 359 YDFDSQMVSFKPTDC 373
+D D Q + F P C
Sbjct: 487 FDGDGQRLGFVPKGC 501
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 125/380 (32%), Positives = 184/380 (48%), Gaps = 24/380 (6%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
++S VS +GEY M IGTPP I+DTGSDL W+QC+PC+ C++Q P Y+P S
Sbjct: 181 LESGVSLGSGEYFMDVFIGTPPK-HYSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKES 239
Query: 73 SSYKELSCQSEQCHLLDTVS----CSSQ-QLCNYTYGYADSSLTKGVLATERITFG---- 123
SS++ ++C +C L+ + C + Q C Y Y Y DSS T G A E T
Sbjct: 240 SSFENITCHDPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTP 299
Query: 124 ---NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
+ +NV+FGCGH N G+F+ GL+GLGR LS ASQ+ S G + FSYCLV
Sbjct: 300 NGKSEQKHVENVMFGCGHWNRGLFH-GAAGLLGLGRGPLSFASQLQSIYG-HSFSYCLVD 357
Query: 181 FHTDSSITSKMYFGNGSE-VSGGGVVSTSLVSKEDK---TYYFVTLEGISV-GNLSNSSK 235
++D+S++SK+ FG E +S + TS V E+ T+Y+V ++ I V G + +
Sbjct: 358 RNSDTSVSSKLIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKIPE 417
Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK 295
+ + G G ID+G T + Y ++E IK + + CY
Sbjct: 418 ETWHLSKEGG---GGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVEGFPPLKPCYN 474
Query: 296 TPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
+ + P F GA + FI + V + + I GN+ Q +
Sbjct: 475 VSGIEKMELPDFGILFSDGAMWDFPVENYFIQIEPDLVCLAILGTPKSALSIIGNYQQQN 534
Query: 355 LFIGYDFDSQMVSFKPTDCT 374
I YD + + P CT
Sbjct: 535 FHILYDMKKSRLGYAPMKCT 554
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 113/350 (32%), Positives = 166/350 (47%), Gaps = 33/350 (9%)
Query: 41 IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSS----- 95
IVDTGSDL WVQC PC CY Q P+Y+P+ SSSYK + C S C L + +S
Sbjct: 152 IVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATGNSGPCGG 211
Query: 96 -----QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMG 150
+ C Y Y D S T+G LA+E I G++ +N+VFGCG NN G+F G
Sbjct: 212 FNGVVKTTCEYVVSYGDGSYTRGDLASESIVLGDTK--LENLVFGCGRNNKGLFG-GASG 268
Query: 151 LVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV--SGGGVVSTS 208
L+GLGR+ +SL SQ L FSYCL +S T + FGN V + V T
Sbjct: 269 LMGLGRSSVSLVSQTLKTFNG-VFSYCLPSLEDGASGT--LSFGNDFSVYKNSTSVFYTP 325
Query: 209 LVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFY 267
LV + +++Y + L G S+G + + + +G + ID+G T LP Y
Sbjct: 326 LVQNPQLRSFYILNLTGASIGGVELKTL---------SFGRG-ILIDSGTVITRLPPSIY 375
Query: 268 NRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTF-I 325
++ + P C+ S I+ P + F+G A++ + T F
Sbjct: 376 KAVKTEFLKQFSGFPSAPGYSILDTCFNLTSYEDISIPTIKMIFEGNAELEVDVTGVFYF 435
Query: 326 PPPVEGVFCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
P + C A+ + + +VGI GN+ Q + + YD + + +C
Sbjct: 436 VKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIAGENC 485
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 128/387 (33%), Positives = 192/387 (49%), Gaps = 37/387 (9%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
++S VS +GEY M IGTPP I+DTGSDL W+QC+PC C+ Q P Y+P S
Sbjct: 181 LESGVSLGSGEYFMDVFIGTPPR-HFSLILDTGSDLNWIQCVPCYDCFVQNGPYYDPKES 239
Query: 73 SSYKELSCQSEQCHLLDTVS----CSSQ-QLCNYTYGYADSSLTKGVLATERITF----- 122
SS+K + C +CHL+ + C ++ Q C Y Y Y DSS T G A E T
Sbjct: 240 SSFKNIGCHDPRCHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSP 299
Query: 123 -GNSN-NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
G S +NV+FGCGH N G+F+ L+GLGR LS +SQ+ S G + FSYCLV
Sbjct: 300 AGKSEFKRVENVMFGCGHWNRGLFHGAAG-LLGLGRGPLSFSSQLQSLYG-HSFSYCLVD 357
Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKEDK---TYYFVTLEGISV-GNLSNSSK 235
++D++++SK+ FG ++ V+ TSLV+ ++ T+Y+V ++ I V G + +
Sbjct: 358 RNSDTNVSSKLIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVLKIPE 417
Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEE----QVRN--AIKLTPYQDPRLG 289
+ + GA G +D+G + + Y +++ +V+ IK P DP
Sbjct: 418 ETWHLSPEGA---GGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIKDFPILDP--- 471
Query: 290 SQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIF 347
CY + + P F+ GA + FI E + C A+ + I
Sbjct: 472 ---CYNVSGVEKMELPEFRILFEDGAVWNFPVENYFIKLEPEEIVCLAILGTPRSALSII 528
Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDCT 374
GN+ Q + I YD + + P C
Sbjct: 529 GNYQQQNFHILYDTKKSRLGYAPMKCA 555
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 119/361 (32%), Positives = 165/361 (45%), Gaps = 26/361 (7%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSCQS 82
YV+ +GTP D+ + DTGSDL W QC PC CYKQ I++P+ SSSY ++C S
Sbjct: 46 YVVVVGLGTPKR-DLSLVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYTNITCTS 104
Query: 83 EQCHLLDT------VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
C L + S S+ C Y Y D+S + G L+ ER+T + + D+ +FGC
Sbjct: 105 SLCTQLTSDGIKSECSSSTDASCIYDAKYGDNSTSVGFLSQERLTI-TATDIVDDFLFGC 163
Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSITSKMYFGN 195
G +N G+FN GL+GLGR +S+ Q S NK FSYCL SS + FG
Sbjct: 164 GQDNEGLFN-GSAGLMGLGRHPISIVQQTSSNY--NKIFSYCL---PATSSSLGHLTFG- 216
Query: 196 GSEVSGGGVVSTSLVS-KEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFID 254
S + ++ T L + D ++Y + + ISVG +P +SS S G ID
Sbjct: 217 ASAATNASLIYTPLSTISGDNSFYGLDIVSISVGGTK-----LPAVSSS-TFSAGGSIID 270
Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGG 313
+G T L Y L R ++ P + CY I+ P + F GG
Sbjct: 271 SGTVITRLAPTVYAALRSAFRRXMEKYPVANEAGLLDTCYDLSGYKEISVPRIDFEFSGG 330
Query: 314 AKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
V L H + V FA D D+ +FGN Q L + YD + F
Sbjct: 331 VTVELXHRGILXVESEQQVCLAFAANGSDNDITVFGNVQQKTLEVVYDVKGGRIGFGAAG 390
Query: 373 C 373
C
Sbjct: 391 C 391
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 112/373 (30%), Positives = 175/373 (46%), Gaps = 43/373 (11%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCY--KQVKPIYNPASSSSYKELSCQ 81
+ + FS+G PP+ + I+DTGS L+W+QC PC C + P++NPA SS++ E SC
Sbjct: 68 FFVNFSVGQPPVPQ-FTIMDTGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTFVECSCD 126
Query: 82 SEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGH 138
C CSS + C Y Y + +KGVLA ER+TF N + FGCGH
Sbjct: 127 DRFCRYAPNGHCSSNK-CVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGH 185
Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
N G++GLG SLA Q+ +KFSYC+ + +++ G ++
Sbjct: 186 ENGEQLESEFTGILGLGAKPTSLAVQL-----GSKFSYCIGDLANKNYGYNQLVLGEDAD 240
Query: 199 VSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
+ G T + + + Y++ LEGISVG+ + + + + S+ + +DTG
Sbjct: 241 ILGD---PTPIEFETENGIYYMNLEGISVGDKQLNIEPVVFKRRG---SRTGVILDTGTL 294
Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRL-----GSQLCYK---TPSMAGIAPILTAHF 310
T L Y L ++++ + DP+L LCY + G P++T HF
Sbjct: 295 YTWLADIAYRELYNEIKSIL------DPKLERFWFRDFLCYHGRVNEELIGF-PVVTFHF 347
Query: 311 DGGAKVPLIHTSTFIP----PPVEGVFCFAMQPIDGDVGIFGNF------AQSDLFIGYD 360
GGA++ + TS F P VFC +++P G + +F AQ I YD
Sbjct: 348 AGGAELAMEATSMFYPMTESDTYHNVFCMSVRPTTEHGGEYKDFTAIGLMAQQYYNIAYD 407
Query: 361 FDSQMVSFKPTDC 373
+ + + DC
Sbjct: 408 LKERNIYLQRIDC 420
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 124/365 (33%), Positives = 168/365 (46%), Gaps = 28/365 (7%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSSSYKEL 78
+G Y++ +GTP D+ I DTGSDL W QC PCV+ CY Q +PI+NP+ S+SY +
Sbjct: 129 GSGNYIVTVGLGTPKN-DLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNV 187
Query: 79 SCQSEQCHLLDTV-----SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
SC S C L + SCS+ C Y Y D S + G LA ++ T S++ FD V
Sbjct: 188 SCSSAACGSLSSATGNAGSCSASN-CIYGIQYGDQSFSVGFLAKDKFTL-TSSDVFDGVY 245
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSITSKMY 192
FGCG NN G+F GL+GLGR +LS SQ + NK FSYCL + +S T +
Sbjct: 246 FGCGENNQGLFT-GVAGLLGLGRDKLSFPSQTATAY--NKIFSYCL---PSSASYTGHLT 299
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKL-IPYYNSSGAISKGNM 251
FG+ S ++ + ++Y + + I+VG KL IP S S
Sbjct: 300 FGSAGISRSVKFTPISTIT-DGTSFYGLNIVAITVGG----QKLPIP----STVFSTPGA 350
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHF 310
ID+G T LP Y L + + P C+ + P + F
Sbjct: 351 LIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSF 410
Query: 311 DGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
GGA V L F + V FA D + IFGN Q L + YD V F
Sbjct: 411 SGGAVVELGSKGIFYAFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFA 470
Query: 370 PTDCT 374
P C+
Sbjct: 471 PNGCS 475
>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 456
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 116/369 (31%), Positives = 167/369 (45%), Gaps = 38/369 (10%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S +GEY ++ IG+P + Y ++D+GSD++W+QC PC QCY Q PI+NPA+S
Sbjct: 118 VVSGTEEGSGEYFVRIGIGSPAIYQ-YMVIDSGSDIVWIQCEPCDQCYNQTDPIFNPATS 176
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
+S+ ++C S C+ LD + C Y Y D S TKG LA E IT G + +
Sbjct: 177 ASFIGVACSSNVCNQLDDDVACRKGRCGYQVAYGDGSYTKGTLALETITIGRT--VIQDT 234
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH N G+F GL+GLG +S Q+ +Q G F YCLV
Sbjct: 235 AIGCGHWNEGMF-VGAAGLLGLGGGPMSFVGQLGAQTGG-AFGYCLV------------- 279
Query: 193 FGNGSEVSGGGVVSTSLVSKE-DKTYYFVTLEGISVGNLSN--SSKLIPYYNSSGAISKG 249
S G + L+ ++Y+V+L G++VG + S ++ + I G
Sbjct: 280 ----SRAMPVGAMWVPLIHNPFYPSFYYVSLSGLAVGGIRVPISEQIFQLTD----IGTG 331
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGI-AP 304
+ +DTG T LP YN R+A PR CY + P
Sbjct: 332 GVVMDTGTAITRLPTVAYNAF----RDAFIAQTTNLPRAPGVSIFDTCYDLNGFVTVRVP 387
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
++ +F GG + + IP G FCFA P + I GN Q + + D +
Sbjct: 388 TVSFYFSGGQILTFPARNFLIPADDVGTFCFAFAPSPSGLSIIGNIQQEGIQVSIDGTNG 447
Query: 365 MVSFKPTDC 373
V F P C
Sbjct: 448 FVGFGPNVC 456
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 115/363 (31%), Positives = 167/363 (46%), Gaps = 42/363 (11%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSS 74
+V + Y++ +IGTPPL + ++DTGSDL+W QC PC +C+ Q P+Y PA S++
Sbjct: 84 SVHASTATYLVDIAIGTPPL-PLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSAT 142
Query: 75 YKELSCQSEQCHLLDT--VSCSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
Y +SC+S C L + CS C Y + Y D + T GVLATE T G S+
Sbjct: 143 YANVSCRSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLG-SDTAVRG 201
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
V FGCG N G +N GLVG+GR LSL +SQLG + P + + +
Sbjct: 202 VAFGCGTENLGS-TDNSSGLVGMGRGPLSL----VSQLGVTR------PRRSCRARAAAR 250
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
G + S LEGI+VG+ + P + G +
Sbjct: 251 GGGAPTTTS--------------------PLEGITVGD--TLLPIDPAVFRLTPMGDGGV 288
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHF 310
ID+G T L + + L + + ++L LG LC+ S + P L HF
Sbjct: 289 IIDSGTTFTALEERAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHF 348
Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
DG A + L S + GV C M G + + G+ Q + I YD + ++SF+P
Sbjct: 349 DG-ADMELRRESYVVEDRSAGVACLGMVSARG-MSVLGSMQQQNTHILYDLERGILSFEP 406
Query: 371 TDC 373
C
Sbjct: 407 AKC 409
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 120/381 (31%), Positives = 186/381 (48%), Gaps = 28/381 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V+S V+ +GEY+++ +GTPP I+DTGSDL W+QC PC+ C+ Q P+++P +S
Sbjct: 139 VESGVAVGSGEYLVEVYVGTPPR-RFQMIMDTGSDLNWLQCAPCLDCFDQRGPVFDPMAS 197
Query: 73 SSYKELSCQSEQCHLLD------TVSCSSQQLCNYTYGYADSSLTKGVLATERITF---G 123
+SY+ ++C +C L+ T S C Y Y Y D S T G LA E T
Sbjct: 198 TSYRNVTCGDTRCGLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTA 257
Query: 124 NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHT 183
+S+ D VV GCGH N G+F+ L+GLGR LS ASQ+ + G + FSYCLV
Sbjct: 258 SSSRRVDGVVLGCGHRNRGLFHGAAG-LLGLGRGPLSFASQLRAVYG-HAFSYCLV--DH 313
Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLV--SKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
S++ SK+ FG+ + + ++ + S + T+Y+V L+GI VG +++ +
Sbjct: 314 GSAVGSKIVFGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGG-----EMLDIPS 368
Query: 242 SSGAISK----GNMFIDTGAPPTLLPKDFYNRLEEQ-VRNAIKLTPYQDPRLGSQLCYKT 296
++ +SK G ID+G + P+ Y + + V K P CY
Sbjct: 369 NTWGVSKEDGSGGTIIDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPLIADFPVLSPCYNV 428
Query: 297 PSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFA-MQPIDGDVGIFGNFAQSD 354
+ + P + F GA + FI EG+ C A + + I GN+ Q +
Sbjct: 429 SGVERVEVPEFSLLFADGAVWDFPAENYFIRLDTEGIMCLAVLGTPRSAMSIIGNYQQQN 488
Query: 355 LFIGYDFDSQMVSFKPTDCTK 375
+ YD + F P C +
Sbjct: 489 FHVLYDLHHNRLGFAPRRCAE 509
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 130/366 (35%), Positives = 164/366 (44%), Gaps = 33/366 (9%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSS 73
N+ T N YV+ S+GTP + VDTGSDL WVQC PC CY Q P+++PA SS
Sbjct: 134 NIGTLN--YVVTVSLGTPGVAQTL-EVDTGSDLSWVQCTPCAAPACYSQKDPLFDPAQSS 190
Query: 74 SYKELSCQSEQCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
SY + C C L SCS+ Q C Y Y D S T GV +++ +T + N+
Sbjct: 191 SYAAVPCGGPVCGGLGIYASSCSAAQ-CGYVVSYGDGSKTTGVYSSDTLTL-SPNDAVRG 248
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
FGCGH +G F N+ GL+GLGR SL Q G FSYCL T S T +
Sbjct: 249 FFFGCGHAQSG-FTGND-GLLGLGREEASLVEQTAGTYG-GVFSYCL---PTRPSTTGYL 302
Query: 192 YFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
G S + G +T L+S + TYY V L GISVG S +P ++ G
Sbjct: 303 TLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLS---VP-----SSVFAGG 354
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY-QDPRLGS-QLCYKTPSMAGIA-PILT 307
+DTG T LP Y L R+ + Y P G CY + P +
Sbjct: 355 TVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPSAPATGILDTCYNFSGYGTVTLPNVA 414
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
F GGA V L G FA DG + I GN Q + D V
Sbjct: 415 LTFSGGATVTLGADGIL----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVG 468
Query: 368 FKPTDC 373
FKP+ C
Sbjct: 469 FKPSSC 474
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 116/372 (31%), Positives = 169/372 (45%), Gaps = 35/372 (9%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ F IGTP + VDTGSD++W QC PC C+ Q P ++ ++S + + C
Sbjct: 91 EYLIHFGIGTPRPQQVALEVDTGSDVVWTQCRPCFDCFTQPLPRFDTSASDTVHGVLCTD 150
Query: 83 EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF---GNSNNFFDNVVFGCGHN 139
C L +C C Y Y D+S+T G LA + TF G ++VFGCG
Sbjct: 151 PICRALRPHACFLGG-CTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVFGCGQY 209
Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
NTG F+ NE G+ G GR LSL QLG + FSYC F T S F G+
Sbjct: 210 NTGNFHSNETGIAGFGRGPLSLP----RQLGVSSFSYC---FTTIFESKSTPVFLGGAPA 262
Query: 200 SG------GGVVSTSLVSKEDKTYYFVTLEGISVGN----LSNSSKLIPYYNSSGAISKG 249
G G ++ST + + YY+++L+GI+VG + S+ ++ S G I
Sbjct: 263 DGLRAHATGPILSTPFLPNHPE-YYYLSLKGITVGKTRLAVPESAFVVKADGSGGTI--- 318
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKL--TPYQDPRLGSQLCYKTPSMAGIA---- 303
ID+G T P+ + L E + L T Y D + C+ T S+ +
Sbjct: 319 ---IDSGTAITAFPRAVFRSLWEAFVAQVPLPHTSYNDTGEPTLQCFSTESVPDASKVPV 375
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
P +T H + GA L + P C + D D + GNF Q ++ I +D
Sbjct: 376 PKMTLHLE-GADWELPRENYMAEYPDSDQLCVVVLAGDDDRTMIGNFQQQNMHIVHDLAG 434
Query: 364 QMVSFKPTDCTK 375
+ +P C K
Sbjct: 435 NKLVIEPAQCDK 446
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 127/374 (33%), Positives = 176/374 (47%), Gaps = 28/374 (7%)
Query: 9 PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQ-CYKQVKPI 66
P+ S + G YV+ +GTP Y +V DTGSD WVQC PCV CYKQ + +
Sbjct: 146 PSLPASSGSALGTGNYVVTIGLGTP--AGRYTVVFDTGSDTTWVQCEPCVVVCYKQQEKL 203
Query: 67 YNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
++PA SS+Y +SC + C L CS C Y Y D S + G A + +T +S
Sbjct: 204 FDPARSSTYANISCAAPACSDLYIKGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTL-SSY 261
Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
+ FGCG N G++ E GL+GLGR + SL Q + G F++C F SS
Sbjct: 262 DAIKGFRFGCGERNEGLYGE-AAGLLGLGRGKTSLPVQAYDKYG-GVFAHC---FPARSS 316
Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP--YYNSSG 244
T + FG GS + ++T ++ T+Y+V L GI VG S IP + +SG
Sbjct: 317 GTGYLDFGPGSLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLS---IPQSVFTTSG 373
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGI 302
I +D+G T LP Y+ L +A+ Y+ S L CY M+ +
Sbjct: 374 TI------VDSGTVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYDFTGMSEV 427
Query: 303 A-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFC--FAMQPIDGDVGIFGNFAQSDLFIGY 359
A P ++ F GGA + +H S I C FA D DVGI GN + Y
Sbjct: 428 AIPTVSLLFQGGASLD-VHASGIIYAASVSQACLGFAGNKEDDDVGIVGNTQLKTFGVVY 486
Query: 360 DFDSQMVSFKPTDC 373
D ++V F P C
Sbjct: 487 DIGKKVVGFCPGAC 500
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 117/365 (32%), Positives = 172/365 (47%), Gaps = 22/365 (6%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQ-CYKQVKPIYNPASS 72
S + G YV+ +GTP Y +V DTGSD WVQC PCV CY+Q + +++PA S
Sbjct: 171 SGRALGTGNYVVTVGLGTP--ASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARS 228
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+Y +SC + C L+ CS C Y Y D S + G A + +T +S +
Sbjct: 229 STYANVSCAAPACSDLNIHGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTL-SSYDAVKGF 286
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGCG N G+F E GL+GLGR + SL Q + G F++CL S+ T +
Sbjct: 287 RFGCGERNEGLFGE-AAGLLGLGRGKTSLPVQTYDKYG-GVFAHCLP---ARSTGTGYLD 341
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
FG GS + ++T ++++ T+Y+V + GI VG +L+ S +
Sbjct: 342 FGAGSLAAARARLTTPMLTENGPTFYYVGMTGIRVGG-----QLLSIPQS--VFATAGTI 394
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAH 309
+D+G T LP Y+ L A+ Y+ S L CY M+ +A P ++
Sbjct: 395 VDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLL 454
Query: 310 FDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
F GGA++ + + V FA GDVGI GN + YD ++V F
Sbjct: 455 FQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 514
Query: 369 KPTDC 373
P C
Sbjct: 515 YPGAC 519
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 118/352 (33%), Positives = 173/352 (49%), Gaps = 28/352 (7%)
Query: 30 IGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSCQSEQCH-- 86
+GTP + +VDTGS L W+QC PC V C++Q P++NP SSS+Y + C ++QC
Sbjct: 3 LGTPATQYVM-VVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSDL 61
Query: 87 ---LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGV 143
L+ +CSS +C Y Y DSS + G L+ + ++FG+++ N +GCG +N G+
Sbjct: 62 PSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS--LPNFYYGCGQDNEGL 119
Query: 144 FNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGG 203
F + GL+GL R +LSL Q+ LG + F+YCL + SS + N + S
Sbjct: 120 FGRSA-GLIGLARNKLSLLYQLAPSLGYS-FTYCLP--SSSSSGYLSLGSYNPGQYSYTP 175
Query: 204 VVSTSLVSKEDKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLL 262
+VS+SL D + YF+ L G++V GN P SS A S ID+G T L
Sbjct: 176 MVSSSL----DDSLYFIKLSGMTVAGN--------PLSVSSSAYSSLPTIIDSGTVITRL 223
Query: 263 PKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTS 322
P Y+ L + V A+K T C+K + AP +T F GGA + L +
Sbjct: 224 PTSVYSALSKAVAAAMKGTSRASAYSILDTCFKGQASRVSAPAVTMSFAGGAALKLSAQN 283
Query: 323 TFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ + C A P I GN Q + YD S + F C+
Sbjct: 284 LLVDVD-DSTTCLAFAPAR-SAAIIGNTQQQTFSVVYDVKSSRIGFAAGGCS 333
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 126/386 (32%), Positives = 189/386 (48%), Gaps = 37/386 (9%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
++S VS +GEY + +GTPP I+DTGSDL W+QC+PC +C++Q P Y+P S
Sbjct: 170 LESGVSLGSGEYFIDVFVGTPPK-HFSLILDTGSDLNWIQCVPCYECFEQNGPHYDPGQS 228
Query: 73 SSYKELSCQSEQCHLLDTVS----CSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSN- 126
SSY+ + C +CHL+ + C ++ Q C Y Y Y DSS T G A E T +
Sbjct: 229 SSYRNIGCHDSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMS 288
Query: 127 ------NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
+NV+FGCGH N G+F+ L+GLGR LS +SQ+ S G + FSYCLV
Sbjct: 289 SGKPELRRVENVMFGCGHWNRGLFHGAAG-LLGLGRGPLSFSSQLQSLYG-HSFSYCLVD 346
Query: 181 FHTDSSITSKMYFGNGSE-VSGGGVVSTSLVSKEDK---TYYFVTLEGISVG----NLSN 232
++D++++SK+ FG + +S + T+LV+ ++ T+Y+V ++ I VG N+
Sbjct: 347 RNSDANVSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPE 406
Query: 233 SSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL 292
I S G I ID+G + + Y ++E +K P +
Sbjct: 407 EKWQIATDGSGGTI------IDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLEP 460
Query: 293 CYKTPSMAGIA----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIF 347
CY ++ G+ P F GA + FI V C A+ + I
Sbjct: 461 CY---NVTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPSALSII 517
Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDC 373
GN+ Q + I YD + F PT C
Sbjct: 518 GNYQQQNFHILYDTKKSRLGFAPTKC 543
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 151 bits (381), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 117/380 (30%), Positives = 182/380 (47%), Gaps = 31/380 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPA 70
V + V A +Y+ ++ +G PP ++DTGS L+W QC C++ C +Q P +N +
Sbjct: 75 VSAPVHWATRQYIAEYMVGDPPQ-RAEALIDTGSSLIWTQCTACLRKVCVRQDLPYFNAS 133
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
SS S+ + CQ + C C+ C + Y + G L T+ TF +
Sbjct: 134 SSGSFAPVPCQDKACAGNYLHFCALDGTCTFRVTYGAGGII-GFLGTDAFTFQSGGA--- 189
Query: 131 NVVFGCGHNNTGVFNE---NEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
+ FGC + GL+GLGR RLSLASQ GA +FSYCL P+ ++
Sbjct: 190 TLAFGCVSFTRFAAPDVLHGASGLIGLGRGRLSLASQT----GAKRFSYCLTPYFHNNGA 245
Query: 188 TSKMYFGNGSEVSGGG--VVSTSLV-SKED---KTYYFVTLEGISVG--NLSNSSKLIPY 239
+S ++ G + +SGGG V+S + V S +D T+Y++ L GI+VG L+ S
Sbjct: 246 SSHLFVGAAASLSGGGGAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKLAIPSTAFDL 305
Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYN----RLEEQVRNAIKLTPYQDPRLGSQLCYK 295
+G + ID+G+P T L +D Y L Q+ ++ P +D G LC
Sbjct: 306 QEVEEGFWEGGVIIDSGSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDG-GMALCVA 364
Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDV-GIFGNFAQSD 354
+ + P L HF GGA + L + P + C A+ + G + I GNF Q +
Sbjct: 365 RGDLDRVVPTLVLHFSGGADMAL-PPENYWAPLEKSTACMAI--VRGYLQSIIGNFQQQN 421
Query: 355 LFIGYDFDSQMVSFKPTDCT 374
+ I +D +SF+ DC+
Sbjct: 422 MHILFDVGGGRLSFQNADCS 441
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 151 bits (381), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 118/370 (31%), Positives = 172/370 (46%), Gaps = 24/370 (6%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPAS 71
+ S S G Y+ + +GTP + +VD+GS L W+QC PC V C+ Q P+Y+P +
Sbjct: 97 LASGASVGVGNYITRLGLGTPTTTYVM-VVDSGSSLTWLQCAPCAVSCHPQAGPLYDPRA 155
Query: 72 SSSYKELSCQSEQCHLLDTV-----SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
SS+Y + C + QC L SCS +C Y Y D S + G L+ + ++ +S
Sbjct: 156 SSTYAAVPCSAPQCAELQAATLNPSSCSGSGVCQYQASYGDGSFSFGYLSKDTVSLSSSG 215
Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
+ F +GCG +N G+F GL+GL R +LSL SQ+ +G N F+YCL P +S
Sbjct: 216 S-FPGFYYGCGQDNVGLFGR-AAGLIGLARNKLSLLSQLAPSVG-NSFAYCL-PTSAAAS 271
Query: 187 ITSKMYFGNGSEVSGGGVVS-TSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
+ FG+ S+ G S TS+VS D + YFV+L G+SV S L + G
Sbjct: 272 -AGYLSFGSNSDNKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAG----SPLAVPSSEYG 326
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAP 304
++ ID+G T LP Y L + V A+ + Q C+K P
Sbjct: 327 SLPT---IIDSGTVITRLPTPVYTALSKAVGAALAAPSAPAYSI-LQTCFKGQVAKLPVP 382
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
+ F GGA + L + + E C A P D I GN Q + YD
Sbjct: 383 AVNMAFAGGATLRLTPGNVLV-DVNETTTCLAFAPTD-STAIIGNTQQQTFSVVYDVKGS 440
Query: 365 MVSFKPTDCT 374
+ F C+
Sbjct: 441 RIGFAAGGCS 450
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 150 bits (380), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 124/366 (33%), Positives = 162/366 (44%), Gaps = 34/366 (9%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSS 73
++ T+N YV+ S+GTP + VDTGSDL WVQC PC CY+Q P+++PA SS
Sbjct: 131 DIGTSN--YVVTASLGTPGMAQTL-EVDTGSDLSWVQCKPCAAPSCYRQKDPLFDPAQSS 187
Query: 74 SYKELSCQSEQCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
SY + C C L +CS+ Q C Y Y D S T GV +++ +T +N
Sbjct: 188 SYAAVPCGRSACAGLGIYASACSAAQ-CGYVVSYGDGSNTTGVYSSDTLTLA-ANATVQG 245
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
+FGCGH +G GL+G GR + SL Q G FSYCL T SS T +
Sbjct: 246 FLFGCGHAQSGGLFTGIDGLLGFGREQPSLVQQTAGAYG-GVFSYCL---PTKSSTTGYL 301
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
G S V+ G + L S TYY V L GISVG P + A + G +
Sbjct: 302 TLGGPSGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQ-------PLSVPASAFAAGTV 354
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPI----LT 307
+DTG T LP Y L R+ + P P CY S AG + +
Sbjct: 355 -VDTGTVITRLPPAAYAALRSAFRSGMASYPSAPPIGILDTCY---SFAGYGTVNLTSVA 410
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
F GA + L G FA DG + I GN Q + D V
Sbjct: 411 LTFSSGATMTLGADGIM----SFGCLAFASSGSDGSMAILGNVQQRSFEV--RIDGSSVG 464
Query: 368 FKPTDC 373
F+P+ C
Sbjct: 465 FRPSSC 470
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 150 bits (379), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 117/392 (29%), Positives = 180/392 (45%), Gaps = 45/392 (11%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-YKQVKPIYNPAS 71
V S S+ +G+Y + IGTPP + + DTGSDL+WV+C PC C ++ +
Sbjct: 75 VISGASSGSGQYFVSLRIGTPPQTLLL-VADTGSDLIWVKCSPCRNCSHRSPGSAFFARH 133
Query: 72 SSSYKELSCQSEQCHLLDTVS---CSSQQL---CNYTYGYADSSLTKGVLATERITFGNS 125
S++Y + C S QC L+ C+ +L C Y Y YADSS T G + E +T S
Sbjct: 134 STTYSAIHCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTS 193
Query: 126 N---NFFDNVVFGCGHNN-----TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
+ + FGCG TG E G++GLGR +S +SQ+ + G+ KFSYC
Sbjct: 194 TGKVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGS-KFSYC 252
Query: 178 LVPFHTDSSITSKMYFGNGSE--VSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNS 233
L+ + TS + G VS G++S + L++ T+Y++ ++G+ V N
Sbjct: 253 LMDYTLSPPPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYV----NG 308
Query: 234 SKLI--PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ 291
KL P S + G ID+G T + + Y + + + +KL +P G
Sbjct: 309 VKLPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPGFD 368
Query: 292 LCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPV-------EGVFCFAMQPI--D 341
LC + A P ++ + GG S F PPP + + C A+QP+ D
Sbjct: 369 LCMNVSGVTRPALPRMSFNLAGG--------SVFSPPPRNYFIETGDQIKCLAVQPVSQD 420
Query: 342 GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
G + GN Q + +D D + F C
Sbjct: 421 GGFSVLGNLMQQGFLLEFDRDKSRLGFTRRGC 452
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 111/365 (30%), Positives = 159/365 (43%), Gaps = 18/365 (4%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S + +GEY ++ +G+PP D Y ++D+GSD++WVQC PC CYKQ P+++PA S
Sbjct: 121 VVSGMDQGSGEYFVRIGVGSPPR-DQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKS 179
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
SY +SC S C ++ C S C Y Y D S TKG LA E +TF + NV
Sbjct: 180 GSYTGVSCGSSVCDRIENSGCHSGG-CRYEVMYGDGSYTKGTLALETLTF--AKTVVRNV 236
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH N G+F L G + +S Q+ Q G F YCLV TDS T +
Sbjct: 237 AMGCGHRNRGMFIGAAGLLGIGGGS-MSFVGQLSGQTGG-AFGYCLVSRGTDS--TGSLV 292
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---G 249
FG + G V + YY L IP + +++ G
Sbjct: 293 FGREALPVGASWVPLVRNPRAPSFYYVGLKG------LGVGGVRIPLPDGVFDLTETGDG 346
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTA 308
+ +DTG T LP Y + ++ P CY + P ++
Sbjct: 347 GVVMDTGTAVTRLPTGAYAAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSF 406
Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
+F G + L + +P G +CFA + I GN Q + + +D + V F
Sbjct: 407 YFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGF 466
Query: 369 KPTDC 373
P C
Sbjct: 467 GPNVC 471
>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
Length = 417
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 99/313 (31%), Positives = 152/313 (48%), Gaps = 36/313 (11%)
Query: 12 VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
V ++ + A GEY++K IGTPP +DT SDL+W QC PC CY QV P++NP
Sbjct: 77 VAETPIMPAGGEYLVKLGIGTPPY-KFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRV 135
Query: 72 SSSYKELSCQSEQCHLLDTVSC--SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
SS+Y L C S+ C LD C + C YTY Y+ ++ T+G LA +++ G + F
Sbjct: 136 SSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVIG--EDAF 193
Query: 130 DNVVFGCGHNNTGVFNENEM-GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
V FGC ++TG + G+VGLGR LSL +SQL +F+YCL P S I
Sbjct: 194 RGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSL----VSQLSVRRFAYCLPP--PASRIP 247
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKED---KTYYFVTLEGISVGNLSNS------------ 233
K+ G ++ + ++ + D +YY++ L+G+ +G+ + S
Sbjct: 248 GKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRTMSLPPTTTTTATAT 307
Query: 234 ----SKLIPYYNSSGAISKGN-----MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ 284
+ ++ A++ G+ M ID + T L Y+ L + I+L
Sbjct: 308 ATATAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEVEIRLPRGT 367
Query: 285 DPRLGSQLCYKTP 297
LG LC+ P
Sbjct: 368 GSSLGLDLCFILP 380
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 120/359 (33%), Positives = 168/359 (46%), Gaps = 22/359 (6%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSSSYKEL 78
G YV+ +GTP Y +V DTGSD WVQC PCV CY+Q + +++PA SS+ +
Sbjct: 183 TGNYVVTIGLGTP--AGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDANI 240
Query: 79 SCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
SC + C L T CS C Y Y D S + G A + +T +S + FGCG
Sbjct: 241 SCAAPACSDLYTKGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTL-SSYDAIKGFRFGCGE 298
Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
N G+F E GL+GLGR + SL Q + G F++C F SS T + FG GS
Sbjct: 299 RNEGLFGE-AAGLLGLGRGKTSLPVQAYDKYG-GVFAHC---FPARSSGTGYLDFGPGSS 353
Query: 199 VSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
+ ++T ++ T+Y+V L GI VG KL+ S + +D+G
Sbjct: 354 PAVSTKLTTPMLVDNGLTFYYVGLTGIRVGG-----KLLSIPPS--VFTTAGTIVDSGTV 406
Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHFDGGAK 315
T LP Y+ L +AI Y+ S L CY M+ +A P ++ F GGA
Sbjct: 407 ITRLPPAAYSSLRSAFASAIAARGYKKAPALSLLDTCYDFTGMSQVAIPTVSLLFQGGAS 466
Query: 316 VPLIHTSTFIPPPV-EGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ + + V + FA D DVGI GN + YD ++V F P C
Sbjct: 467 LDVDASGIIYAASVSQACLGFAANEEDDDVGIVGNTQLKTFGVVYDIGKKVVGFSPGAC 525
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 126/393 (32%), Positives = 194/393 (49%), Gaps = 46/393 (11%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V+S V+ + EY+M +GTPP I+DTGSDL W+QC PC+ C++Q P+++PA+S
Sbjct: 135 VESGVAVGSAEYLMDVYVGTPPRR-FQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAAS 193
Query: 73 SSYKELSCQSEQC-HLLDTVSCSS-------QQLCNYTYGYADSSLTKGVLATERITFG- 123
SSY+ L+C +C H+ + + + C Y Y Y D S + G LA E T
Sbjct: 194 SSYRNLTCGDPRCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVNL 253
Query: 124 ---NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
+++ D VVFGCGH N G+F+ L+GLGR LS ASQ+ + G + FSYCLV
Sbjct: 254 TAPGASSRVDGVVFGCGHRNRGLFHGAAG-LLGLGRGPLSFASQLRAVYGGHTFSYCLVD 312
Query: 181 FHTDSSITSKMYFGNGSEVSGGG-----VVSTSLVSKEDKTYYFVTLEGISVG----NLS 231
+D + SK+ FG ++ + + S T+Y+V L G+ VG N+S
Sbjct: 313 HGSD--VASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRLTGVLVGGELLNIS 370
Query: 232 NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV--RNAIKLTPYQD-PRL 288
+ + +++S S G + ID+G + + Y + R + P D P L
Sbjct: 371 SDT-----WDASEGGSGGTI-IDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPVPDFPVL 424
Query: 289 GSQLCYKTPSMAGI----APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM--QPIDG 342
CY +++G+ P L+ F GA + FI +G+ C A+ P G
Sbjct: 425 SP--CY---NVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTG 479
Query: 343 DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
+ I GNF Q + + YD + + F P C +
Sbjct: 480 -MSIIGNFQQQNFHVAYDLHNNRLGFAPRRCAE 511
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 128/382 (33%), Positives = 193/382 (50%), Gaps = 29/382 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
++S V+ +GEY M IGTPP I+DTGSDL W+QC+PC C++Q P Y+P S
Sbjct: 79 LESGVTLGSGEYFMDVFIGTPPK-HYSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKES 137
Query: 73 SSYKELSCQSEQCHLLDT----VSCSSQ-QLCNYTYGYADSSLTKGVLATERITF----- 122
SS++ + C +CHL+ + + C ++ Q C Y Y Y DSS T G ATE T
Sbjct: 138 SSFRNIGCHDPRCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSP 197
Query: 123 -GNSN-NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
G S +NV+FGCGH N G+F+ GL+GLGR LS +SQ+ S G + FSYCLV
Sbjct: 198 TGKSEFKRVENVMFGCGHWNRGLFH-GASGLLGLGRGPLSFSSQLQSLYG-HSFSYCLVD 255
Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKEDK---TYYFVTLEGISV-GNLSNSSK 235
++D++++SK+ FG ++ ++ T+LV ++ T+Y+V ++ I V G + N +
Sbjct: 256 RNSDTNVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNIPE 315
Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTP-YQD-PRLGSQLC 293
S G G +D+G + + Y +++ +K P QD P L C
Sbjct: 316 STWNMTSDGV---GGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIVQDFPILDP--C 370
Query: 294 YKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFA 351
Y + I P F GA + FI E V C A+ + I GN+
Sbjct: 371 YNVSGVEKIDLPDFGILFADGAVWNFPVENYFIRLDPEEVVCLAILGTPRSALSIIGNYQ 430
Query: 352 QSDLFIGYDFDSQMVSFKPTDC 373
Q + + YD + + P +C
Sbjct: 431 QQNFHVLYDTKKSRLGYAPMNC 452
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 128/386 (33%), Positives = 199/386 (51%), Gaps = 35/386 (9%)
Query: 12 VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
++S ++ +GEY M +G+PP I+DTGSDL W+QCLPC C++Q Y+P +
Sbjct: 158 TLESGMTLGSGEYFMDVLVGSPPK-HFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDPKA 216
Query: 72 SSSYKELSCQSEQCHLLDT----VSCSSQ-QLCNYTYGYADSSLTKGVLATERITF---- 122
S+SYK ++C ++C+L+ + + C S Q C Y Y Y DSS T G A E T
Sbjct: 217 SASYKNITCNDQRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTT 276
Query: 123 -GNSNNFF--DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
G S+ + +N++FGCGH N G+F+ L+GLGR LS +SQ+ S G + FSYCLV
Sbjct: 277 NGGSSELYNVENMMFGCGHWNRGLFHGAAG-LLGLGRGPLSFSSQLQSLYG-HSFSYCLV 334
Query: 180 PFHTDSSITSKMYFGNGSE-VSGGGVVSTSLVSKEDK---TYYFVTLEGISV-GNLSNSS 234
++D++++SK+ FG + +S + TS V+ ++ T+Y+V ++ I V G + N
Sbjct: 335 DRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIP 394
Query: 235 KLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTPYQD-PRLG 289
+ +S GA G ID+G + + Y N++ E+ + K Y+D P L
Sbjct: 395 EETWNISSDGA---GGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKG--KYPVYRDFPILD 449
Query: 290 SQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIF 347
C+ + + P L F GA ++FI E + C AM I
Sbjct: 450 P--CFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLN-EDLVCLAMLGTPKSAFSII 506
Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDC 373
GN+ Q + I YD + + PT C
Sbjct: 507 GNYQQQNFHILYDTKRSRLGYAPTKC 532
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 116/356 (32%), Positives = 172/356 (48%), Gaps = 29/356 (8%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
+Y++ IGTP ++ I DTGS L+W QC PC CY +V P+++P S+S+K L C S
Sbjct: 131 DYIVNVGIGTPKK-EMPLIFDTGSGLIWTQCKPCKACYPKV-PVFDPTKSASFKGLPCSS 188
Query: 83 EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG 142
+ C + CSS + C Y Y D+S + G LATE I+F + F N++ GC +G
Sbjct: 189 KLCQSIRQ-GCSSPK-CTYLTAYVDNSSSTGTLATETISFSHLKYDFKNILIGCSDQVSG 246
Query: 143 VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGG 202
+ E G++GL R+ +SLASQ + + FSYC +P S+ G GG
Sbjct: 247 E-SLGESGIMGLNRSPISLASQT-ANIYDKLFSYC-IPSTPGST---------GHLTFGG 294
Query: 203 GVVSTSLVSKEDKTY----YFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
V + S KT Y + + GISVG KL+ + K ID+GA
Sbjct: 295 KVPNDVRFSPVSKTAPSSDYDIKMTGISVGG----RKLL----IDASAFKIASTIDSGAV 346
Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVP 317
T LP Y+ L R +K P D CY + + +A P ++ F+GG ++
Sbjct: 347 LTRLPPKAYSALRSVFREMMKGYPLLDQDDFLDTCYDFSNYSTVAIPSISVFFEGGVEMD 406
Query: 318 LIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ + P V+C A +D +V IFGNF Q + +D + + F P C
Sbjct: 407 IDVSGIMWQVPGSKVYCLAFAELDDEVSIFGNFQQKTYTVVFDGAKERIGFAPGGC 462
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 119/387 (30%), Positives = 188/387 (48%), Gaps = 39/387 (10%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V+S V+ +GEY++ +GTPP I+DTGSDL W+QC PC+ C++Q P+++PA+S
Sbjct: 138 VESGVAVGSGEYLIDVYVGTPPR-RFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAAS 196
Query: 73 SSYKELSCQSEQCHLL----DTVSCS--SQQLCNYTYGYADSSLTKGVLATERITFG--- 123
SSY+ ++C ++C L+ +C ++ C Y Y Y D S T G LA E T
Sbjct: 197 SSYRNVTCGDQRCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLTA 256
Query: 124 -NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
++ D VVFGCGH N G+F+ L LS ASQ+ + G + FSYCLV
Sbjct: 257 PGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGR-GPLSFASQLRAVYG-HTFSYCLVEHG 314
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSL---VSKEDKTYYFVTLEGISVG----NLSNSSK 235
+D+ SK+ FG V + + S T+Y+V L+G+ VG N+S+ +
Sbjct: 315 SDAG--SKVVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTW 372
Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI-KLTPYQDPRLGSQLCY 294
+ G G ID+G + + Y + + + + +L P CY
Sbjct: 373 DV------GKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRLYPLIPDFPVLNPCY 426
Query: 295 KTPSMAGI----APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ--PIDGDVGIFG 348
+++G+ P L+ F GA + F+ +G+ C A++ P G + I G
Sbjct: 427 ---NVSGVERPEVPELSLLFADGAVWDFPAENYFVRLDPDGIMCLAVRGTPRTG-MSIIG 482
Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDCTK 375
NF Q + + YD + + F P C +
Sbjct: 483 NFQQQNFHVVYDLQNNRLGFAPRRCAE 509
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 114/377 (30%), Positives = 170/377 (45%), Gaps = 43/377 (11%)
Query: 24 YVMKFSIGTP---PLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
YV S+G P ++ IVDTGSDL WVQC PC CY Q P+++PA S++Y + C
Sbjct: 144 YVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRC 203
Query: 81 QSEQCHLLDTV--------SCSS----QQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
+ C D++ SC S + C Y Y D S ++GVLAT+ + G ++
Sbjct: 204 NASACA--DSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGAS-- 259
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
VFGCG +N G+F GL+GLGRT LSL SQ S+ G FSYCL P T +
Sbjct: 260 LGGFVFGCGLSNRGLFG-GTAGLMGLGRTELSLVSQTASRYG-GVFSYCL-PAATSGDAS 316
Query: 189 SKMYFGNGSEVSGG-----GVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNS 242
+ G G + + V T +++ + +YF+ + G +VG + +++
Sbjct: 317 GSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQ------- 369
Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMA 300
+ N+ ID+G T L Y + + Y S L CY
Sbjct: 370 --GLGASNVLIDSGTVITRLAPSVYRAVRAEFMRQFGAAGYPAAPGFSILDTCYDLTGHD 427
Query: 301 GI-APILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPI--DGDVGIFGNFAQSDLF 356
+ P+LT +GGA V + +G C AM + + + I GN+ Q +
Sbjct: 428 EVKVPLLTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYEDETPIIGNYQQKNKR 487
Query: 357 IGYDFDSQMVSFKPTDC 373
+ YD + F DC
Sbjct: 488 VVYDTLGSRLGFADEDC 504
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 116/374 (31%), Positives = 173/374 (46%), Gaps = 29/374 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+ S V Y++ IG ++ IVDTGSDL WVQC PC CY Q P++NP+ S
Sbjct: 56 LSSGVRLQTLNYIVTVEIGGR---NMTVIVDTGSDLTWVQCQPCRLCYNQQDPLFNPSGS 112
Query: 73 SSYKELSCQSEQCHLLD------TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
SY+ + C S C L V S+ CNY Y D S T+G L E++ G ++
Sbjct: 113 PSYQTILCNSSTCQSLQYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLNLGTTH 172
Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
N +FGCG NN G+F GL+GLG++ LSL SQ S + FSYCL D+S
Sbjct: 173 --VSNFIFGCGRNNKGLFG-GASGLMGLGKSDLSLVSQT-SAIFEGVFSYCLPTTAADAS 228
Query: 187 ITSKMYFGNGSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
S + GN S +S + + + + T+YF+ L GIS+G ++ + P Y SG
Sbjct: 229 -GSLILGGNSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQA---PNYRQSG 284
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-A 303
+ ID+G T LP Y L+ + P P C+ +
Sbjct: 285 ------ILIDSGTVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNGYDEVDI 338
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPI--DGDVGIFGNFAQSDLFIGYD 360
P + F+G A++ + T F + C A+ + D ++ I GN+ Q + + Y+
Sbjct: 339 PTIRMQFEGNAELTVDVTGIFYFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYN 398
Query: 361 FDSQMVSFKPTDCT 374
+ F C+
Sbjct: 399 TKESKLGFAAEACS 412
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 117/368 (31%), Positives = 167/368 (45%), Gaps = 28/368 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V+S + G YVM S+GTP I DTGSDL+WVQ PC C I++P S
Sbjct: 44 VESPLHPDGGGYVMDISVGTPGK-RFRAIADTGSDLVWVQSEPCTGCSGGT--IFDPRQS 100
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FF 129
S+++E+ C S+ C L C+Y+Y Y S T+G A + I+ G +++ F
Sbjct: 101 STFREMDCSSQLCAELPGSCEPGSSTCSYSYEYG-SGETEGEFARDTISLGTTSDGSQKF 159
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
+ GCG N+G + GLVGLG+ +SL SQ+ + + +KFSYCLV ++ S +S
Sbjct: 160 PSFAVGCGMVNSGFDGVD--GLVGLGQGPVSLTSQLSAAI-DSKFSYCLVDINSQSE-SS 215
Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDK--TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
+ FG + + G G+ ST + D TYY +T+ GI+V + S
Sbjct: 216 PLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMG-------------S 262
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PIL 306
G ID+G T +P Y R+ ++ + + L +G LCY S P L
Sbjct: 263 PGTTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPAL 322
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG-DVGIFGNFAQSDLFIGYDFDSQM 365
T G P + C AM G V I GN Q I YD S
Sbjct: 323 TIRLAGATMTPPSSNYFLVVDDSGDTVCLAMGSASGLPVSIIGNVMQQGYHILYDRGSSE 382
Query: 366 VSFKPTDC 373
+SF C
Sbjct: 383 LSFVQAKC 390
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 110/365 (30%), Positives = 159/365 (43%), Gaps = 18/365 (4%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+ S + +GEY ++ +G+PP D Y ++D+GSD++WVQC PC CYKQ P+++PA S
Sbjct: 120 IVSGMDQGSGEYFVRIGVGSPPR-DQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKS 178
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
SY +SC S C ++ C S C Y Y D S TKG LA E +TF + NV
Sbjct: 179 GSYTGVSCGSSVCDRIENSGCHSGG-CRYEVMYGDGSYTKGTLALETLTF--AKTVVRNV 235
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH N G+F L G + +S Q+ Q G F YCLV TDS T +
Sbjct: 236 AMGCGHRNRGMFIGAAGLLGIGGGS-MSFVGQLSGQTGG-AFGYCLVSRGTDS--TGSLV 291
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---G 249
FG + G V + YY L IP + +++ G
Sbjct: 292 FGREALPVGASWVPLVRNPRAPSFYYVGLKG------LGVGGVRIPLPDGVFDLTETGDG 345
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTA 308
+ +DTG T LP Y + ++ P CY + P ++
Sbjct: 346 GVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSF 405
Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
+F G + L + +P G +CFA + I GN Q + + +D + V F
Sbjct: 406 YFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGF 465
Query: 369 KPTDC 373
P C
Sbjct: 466 GPNVC 470
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 106/301 (35%), Positives = 141/301 (46%), Gaps = 20/301 (6%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ +IGTPP + +DTGSDL+W QC PC C+ Q P ++P++SS+ SC S
Sbjct: 81 EYLVHLAIGTPPQ-PVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDS 139
Query: 83 EQCHLLDTVSCSS-----QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
C L SC S Q C YTY Y D S+T G L ++ TF + V FGCG
Sbjct: 140 TLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCG 199
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
N GVF NE G+ G GR LSL SQL FS+C + T +
Sbjct: 200 LFNNGVFKSNETGIAGFGRGPLSLP----SQLKVGNFSHCFTAVNGLKPSTVLLDLPADL 255
Query: 198 EVSGGGVV-STSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN--MFI 253
SG G V ST L+ + T+Y+++L+GI+VG S +P S A+ G I
Sbjct: 256 YKSGRGAVQSTPLIQNPANPTFYYLSLKGITVG-----STRLPVPESEFALKNGTGGTII 310
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG-IAPILTAHFDG 312
D+G T LP Y + + +KL C P A P L HF+G
Sbjct: 311 DSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEG 370
Query: 313 G 313
Sbjct: 371 A 371
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 115/380 (30%), Positives = 176/380 (46%), Gaps = 37/380 (9%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
+G+Y+ K ++GTP + + + DT SDL W+QC PC +CY Q P+++P S+SY E++
Sbjct: 138 SGDYIAKIAVGTPAVEALLAL-DTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNY 196
Query: 81 QSEQCHLLDTVSC--SSQQLCNYTYGYAD------SSLTKGVLATERITF-GNSNNFFDN 131
+ C L + + C YT Y D +S + G L E +TF G + +
Sbjct: 197 DAPDCQALGRSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTFAGGVRQAYLS 256
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTD-SSITS 189
+ GCGH+N G+F G++GL R ++S+ QI + LG N FSYCLV F + S +S
Sbjct: 257 I--GCGHDNKGLFGAPAAGILGLSRGQISIPHQI-AFLGYNASFSYCLVDFISGPGSPSS 313
Query: 190 KMYFGNGS-EVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNL------SNSSKLIPYYNS 242
+ FG G+ + S + +++++ T+Y+V L G+SVG + +L PY
Sbjct: 314 TLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPYTGH 373
Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYN---RLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM 299
G I +D+G T L + Y + P CY
Sbjct: 374 GGVI------LDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGLFDTCYTVGGR 427
Query: 300 AGI-----APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQS 353
AG+ P ++ HF GG ++ L + I G CFA D V + GN Q
Sbjct: 428 AGLRHCVKVPAVSMHFAGGVELSLQPKNYLITVDSRGTVCFAFAGTGDRSVSVIGNILQQ 487
Query: 354 DLFIGYDFDSQMVSFKPTDC 373
+ YD Q V F P C
Sbjct: 488 GFRVVYDIGGQRVGFAPNSC 507
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 118/387 (30%), Positives = 176/387 (45%), Gaps = 38/387 (9%)
Query: 15 SNVSTANGEYVMKFSIGTP----PLLDIYGIVDTGSDLMWVQCLP---CVQCYKQVKPIY 67
S S +G+Y ++ +GTP PL IVDTGSDL W+QC P P Y
Sbjct: 50 SGSSIGSGQYFVELRVGTPAKKFPL-----IVDTGSDLTWIQCNPPNTTANSSSPPAPWY 104
Query: 68 NPASSSSYKELSCQSEQCHLL-----DTVSCSSQQLCNYTYGYADSSLTKGVLATERITF 122
+ +SSSSY+E+ C ++C L + S +S C+YTYGY+D S T G+LA E I+
Sbjct: 105 DKSSSSSYREIPCTDDECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISM 164
Query: 123 ----------GNSNNF---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQL 169
GN NV GC + G G++GLG+ +SLA+Q
Sbjct: 165 KSRKRSGKRAGNHKTRRIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTA 224
Query: 170 GANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVG 228
FSYCLV + S+ +S + G + T +V +++Y+V + G++V
Sbjct: 225 LGGIFSYCLVDYLRGSNASSFLVMG---RTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVD 281
Query: 229 NLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRL 288
+ G +KG +F D+G + L + Y+++ + +I L Q+
Sbjct: 282 GKPVDGIASSDWGIDGDGNKGTIF-DSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPE 340
Query: 289 GSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG--I 346
G +LCY M P L F GGA + L + ++ E V C A+Q + G I
Sbjct: 341 GFELCYNVTRMEKGMPKLGVEFQGGAVMELPW-NNYMVLVAENVQCVALQKVTTTNGSNI 399
Query: 347 FGNFAQSDLFIGYDFDSQMVSFKPTDC 373
GN Q D I YD + FK + C
Sbjct: 400 LGNLLQQDHHIEYDLAKARIGFKWSPC 426
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 116/374 (31%), Positives = 173/374 (46%), Gaps = 36/374 (9%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
+ V + G Y M+FSIGTPP + + DTGSDL+W +C Y+P
Sbjct: 86 TDTVPLRMDGGGGAYDMEFSIGTPPQ-KLTALADTGSDLIWTKCDAGGGAAWGGSSSYHP 144
Query: 70 ASSSSYKELSCQSEQCHLLDTVS----CSSQQLCNYTYGYA---DSSLTKGVLATERITF 122
+SS++ L C C L + S + C+Y Y Y D T+G L +E T
Sbjct: 145 NASSTFTRLPCSDRLCAALRSYSLARCAAGGAECDYKYAYGLGDDPDFTQGFLGSETFTL 204
Query: 123 GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
G + V FGC G + E GLVGLGR LSL +SQL A F YCL
Sbjct: 205 GG--DAVPGVGFGCTTALEGDYGEGA-GLVGLGRGPLSL----VSQLDAGTFMYCLT--- 254
Query: 183 TDSSITSKMYFGNGSEV--SGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
D+S S + FG + + +G GV ST L++ T+Y V L I++G+ + +
Sbjct: 255 ADASKASPLLFGALATMTGAGAGVQSTGLLAS--TTFYAVNLRSITIGSATTAGV----- 307
Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQ-VRNAIKLTPYQDPRLGSQLCYKTPSM 299
G + D+G T L + Y + + LTP + R G + CY+ P
Sbjct: 308 -----GGPGGVVFDSGTTLTYLAEPAYTEAKAAFLSQTTSLTPVEG-RYGFEACYEKPDS 361
Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
A + P + HFDGGA + L + ++ +GV C+ +Q + I GN Q + + +
Sbjct: 362 ARLIPAMVLHFDGGADMAL-PVANYVVEVDDGVVCWVVQR-SPSLSIIGNIMQMNYLVLH 419
Query: 360 DFDSQMVSFKPTDC 373
D ++SF+P +C
Sbjct: 420 DVRKSVLSFQPANC 433
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 123/367 (33%), Positives = 166/367 (45%), Gaps = 31/367 (8%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELSC 80
EYV+ IGTP + ++DTGSDL WVQC PC +CY Q P+++P+SSSSY + C
Sbjct: 90 EYVVTLGIGTPAVQQTV-LIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPC 148
Query: 81 QSEQCHLLDT---------VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
S+ C L VS + LC Y Y + + T GV +TE +T +
Sbjct: 149 DSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLTL-KPGVVVAD 207
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
FGCG + G + + + GL+GLG SL SQ SQ G FSYCL P + +
Sbjct: 208 FGFGCGDHQHGPYEKFD-GLLGLGGAPESLVSQTSSQFG-GPFSYCLPPTSGGAGFLTLG 265
Query: 192 YFGNGSEVSGGGVVSTSLVSK--EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
N S + +S + + + T+Y VTL GISVG P A S G
Sbjct: 266 APPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGG-------APLAIPPSAFSSG 318
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGI-APIL 306
M ID+G T LP Y L R+A+ P G L CY A + P +
Sbjct: 319 -MVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYDFTGHANVTVPTI 377
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
+ F GGA + L + + V+G FA D +GI GN Q + YD V
Sbjct: 378 SLTFSGGATIDLAAPAGVL---VDGCLAFAGAGTDNAIGIIGNVNQRTFEVLYDSGKGTV 434
Query: 367 SFKPTDC 373
F+ C
Sbjct: 435 GFRAGAC 441
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 118/359 (32%), Positives = 176/359 (49%), Gaps = 25/359 (6%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSC 80
G YV + +GTP + +VDTGS L W+QC PC V C++Q P++NP SSSSY +SC
Sbjct: 119 GNYVTRMGLGTPAKSYVM-VVDTGSSLTWLQCSPCLVSCHRQSGPVFNPRSSSSYASVSC 177
Query: 81 QSEQCHLLDTV-----SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
+ QC L T +CS+ +C Y Y DSS + G L+ + ++FG+++ N +G
Sbjct: 178 SAPQCDALTTATLNPSTCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS--VPNFYYG 235
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
CG +N G+F ++ GL+GL R +LSL Q+ +G + FSYCL P + SS + N
Sbjct: 236 CGQDNEGLFGQSA-GLIGLARNKLSLLYQLAPSMGYS-FSYCL-PTSSSSSGYLSIGSYN 292
Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
+ S + +SL D + YF+ + GI+V P S+ A S ID+
Sbjct: 293 PGQYSYTPMAKSSL----DDSLYFIKMTGITVAGK-------PLSVSASAYSSLPTIIDS 341
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
G T LP D Y+ L + V A+K TP C++ + P ++ F GGA
Sbjct: 342 GTVITRLPTDVYSALSKAVAGAMKGTPRASAFSILDTCFQGQASRLRVPQVSMAFAGGAA 401
Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ L T+ + C A P I GN Q + YD + + F C+
Sbjct: 402 LKLKATNLLVDVD-SATTCLAFAPAR-SAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 458
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 125/369 (33%), Positives = 175/369 (47%), Gaps = 33/369 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPAS 71
+S ++ +G Y++ IGTP D+ + DTGSDL W QC PC+ CY Q +P +NP+S
Sbjct: 121 AKSGITLGSGNYIVTIGIGTPKH-DLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSS 179
Query: 72 SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
SS+Y+ +SC S C D SCS+ C Y+ GY D S T+G LA E+ T NS + ++
Sbjct: 180 SSTYQNVSCSSPMCE--DAESCSASN-CVYSIGYGDKSFTQGFLAKEKFTLTNS-DVLED 235
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
V FGCG NN G+F+ L A + N FSYCL F ++S T +
Sbjct: 236 VYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTY--NNIFSYCLPSFTSNS--TGHL 291
Query: 192 YFGNG--SEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP-YYNSSGAISK 248
FG+ SE V T + S Y + + GISVG+ + P +++ GAI
Sbjct: 292 TFGSAGISE----SVKFTPISSFPSAFNYGIDIIGISVGD--KELAITPNSFSTEGAI-- 343
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGIA-P 304
ID+G T LP Y L + K++ Y+ G L CY + + P
Sbjct: 344 ----IDSGTVFTRLPTKVYAELRSVFKE--KMSSYKSTS-GYGLFDTCYDFTGLDTVTYP 396
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
+ F GG V L + +P + V C A D IFGN Q+ L + YD
Sbjct: 397 TIAFSFAGGTVVELDGSGISLPIKISQV-CLAFAGNDDLPAIFGNVQQTTLDVVYDVAGG 455
Query: 365 MVSFKPTDC 373
V F P C
Sbjct: 456 RVGFAPNGC 464
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 126/376 (33%), Positives = 169/376 (44%), Gaps = 70/376 (18%)
Query: 14 QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
Q+ + + G Y M SIGTPP+ + DTGS L+W QC PC +C + P + PASSS
Sbjct: 80 QTLLDNSAGAYNMNLSIGTPPV-TFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSS 138
Query: 74 SYKELSCQSEQCHLLDT--VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
++ +L C S C L + +C++ C Y Y Y T G LATE + G ++ F
Sbjct: 139 TFSKLPCASSLCQFLTSPYRTCNATG-CVYYYPYG-MGFTAGYLATETLHVGGAS--FPG 194
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
V FGC N GV N + G+VGLGR+ LSL SQ+ G +FSYCL + D+ S +
Sbjct: 195 VTFGCSTEN-GVGNSSS-GIVGLGRSPLSLVSQV----GVARFSYCLRS-NADAG-DSPI 246
Query: 192 YFGNGSEVSGGGVVSTSLVSKED---KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
FG+ ++V+GG V ST L+ + +YY+V L GI+VG
Sbjct: 247 LFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVG-------------------- 286
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK----TPSMAGIAP 304
T LP N LT R G LC+ P
Sbjct: 287 ----------ATDLPMAMAN-----------LTTVNGTRFGFDLCFDATAAGGGGGVPVP 325
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVE-----GVFCFAMQPIDG--DVGIFGNFAQSDLFI 357
L F GGA+ + S F V+ V C + P + I GN Q DL +
Sbjct: 326 TLVLRFAGGAEYAVRRRSYFGVVEVDSQGRAAVECLLVLPASEKLSISIIGNVMQMDLHV 385
Query: 358 GYDFDSQMVSFKPTDC 373
YD D M SF P DC
Sbjct: 386 LYDLDGGMFSFAPADC 401
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 147 bits (372), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 117/368 (31%), Positives = 166/368 (45%), Gaps = 28/368 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V+S + G YVM S+GTP I DTGSDL+WVQ PC C I++P S
Sbjct: 44 VESPLHPDGGGYVMDISVGTPGK-RFRAIADTGSDLVWVQSEPCTGCSGGT--IFDPRQS 100
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNS---NNFF 129
S+++E+ C S+ C L C+Y+Y Y S T+G A + I+ G + + F
Sbjct: 101 STFREMDCSSQLCTELPGSCEPGSSACSYSYEYG-SGETEGEFARDTISLGTTSGGSQKF 159
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
+ GCG N+G + GLVGLG+ +SL SQ+ + + +KFSYCLV ++ S +S
Sbjct: 160 PSFAVGCGMVNSGF--DGVDGLVGLGQGPVSLTSQLSAAI-DSKFSYCLVDINSQSE-SS 215
Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDK--TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
+ FG + + G G+ ST + D TYY +T+ GI+V + S
Sbjct: 216 PLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMG-------------S 262
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PIL 306
G ID+G T +P Y R+ ++ + + L +G LCY S P L
Sbjct: 263 PGTTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPAL 322
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG-DVGIFGNFAQSDLFIGYDFDSQM 365
T G P + C AM G V I GN Q I YD S
Sbjct: 323 TIRLAGATMTPPSSNYFLVVDDSGDTVCLAMGSAGGLPVSIIGNVMQQGYHILYDRGSSE 382
Query: 366 VSFKPTDC 373
+SF C
Sbjct: 383 LSFVQAKC 390
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 147 bits (372), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 123/367 (33%), Positives = 166/367 (45%), Gaps = 31/367 (8%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELSC 80
EYV+ IGTP + ++DTGSDL WVQC PC +CY Q P+++P+SSSSY + C
Sbjct: 170 EYVVTLGIGTPAVQQTV-LIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPC 228
Query: 81 QSEQCHLLDT---------VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
S+ C L VS + LC Y Y + + T GV +TE +T +
Sbjct: 229 DSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLTL-KPGVVVAD 287
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
FGCG + G + + + GL+GLG SL SQ SQ G FSYCL P + +
Sbjct: 288 FGFGCGDHQHGPYEKFD-GLLGLGGAPESLVSQTSSQFG-GPFSYCLPPTSGGAGFLTLG 345
Query: 192 YFGNGSEVSGGGVVSTSLVSK--EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
N S + +S + + + T+Y VTL GISVG P A S G
Sbjct: 346 APPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGG-------APLAIPPSAFSSG 398
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGI-APIL 306
M ID+G T LP Y L R+A+ P G L CY A + P +
Sbjct: 399 -MVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYDFTGHANVTVPTI 457
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
+ F GGA + L + + V+G FA D +GI GN Q + YD V
Sbjct: 458 SLTFSGGATIDLAAPAGVL---VDGCLAFAGAGTDNAIGIIGNVNQRTFEVLYDSGKGTV 514
Query: 367 SFKPTDC 373
F+ C
Sbjct: 515 GFRAGAC 521
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 147 bits (372), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 109/375 (29%), Positives = 176/375 (46%), Gaps = 36/375 (9%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
NV +++ SIG+PP+ + + DT SDL+W+QC PC+ CY Q PI++P+ S ++
Sbjct: 77 NVPIIPQAFLVNISIGSPPVTQLLHM-DTASDLLWLQCRPCINCYAQSLPIFDPSRSYTH 135
Query: 76 KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG-----NSNNFFD 130
+ SC++ Q + + + C Y+ Y D + +KG+LA E + F +S+
Sbjct: 136 RNESCRTSQYSMPSLRFNAKTRSCEYSMRYMDGTGSKGILAKEMLMFNTIYDESSSAALH 195
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
+VVFGCGH+N G G++GLG SL + KFSYC S +
Sbjct: 196 DVVFGCGHDNYGE-PLVGTGILGLGYGEFSLVHRF-----GTKFSYCFGSLDDPSYPHNV 249
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP----YYNSSGAI 246
+ G+ G ++ + + +Y+VT+E ISV + ++P +N +
Sbjct: 250 LVLGD----DGANILGDTTPLEIYNGFYYVTIEAISVDGI-----ILPIDPWVFNRNHQT 300
Query: 247 SKGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI 302
G IDTG T L ++ Y N++E+ + CY +
Sbjct: 301 GLGGTIIDTGNSLTSLVEEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVECYNGNLERDL 360
Query: 303 A----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIG 358
PI+T HF GA++ L S F+ VFC A+ P G++ G AQ IG
Sbjct: 361 VESGFPIVTFHFSDGAELSLDVKSVFMKLS-PNVFCLAVTP--GNMNSIGATAQQSYNIG 417
Query: 359 YDFDSQMVSFKPTDC 373
YD +++ +SF+ DC
Sbjct: 418 YDLEAKKISFERIDC 432
>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 114/359 (31%), Positives = 178/359 (49%), Gaps = 36/359 (10%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV-KPIYNPASSSSYKELSCQS 82
+++ FS+G PP+ + I+DTGS L+W+QC PC C +Q+ P+++P+ SS+Y LSC++
Sbjct: 102 FLVNFSMGQPPVPQL-AIMDTGSSLLWIQCAPCKSCSQQIIGPMFDPSISSTYDSLSCKN 160
Query: 83 EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN---NFFDNVVFGCGHN 139
C + C S C Y Y + + GV+ATE++ FG+S+ N +NV+FGC H
Sbjct: 161 IICRYAPSGECDSSSQCVYNQTYVEGLPSVGVIATEQLIFGSSDEGRNAVNNVLFGCSHR 220
Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
N + G+ GLG S + +++Q+G+ KFSYC+ +++ G +
Sbjct: 221 NGNYKDRRFTGVFGLG----SGITSVVNQMGS-KFSYCIGNIADPDYSYNQLVLSEGVNM 275
Query: 200 SGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK-GNMFIDTGAP 258
G ST L + +Y V LEGISVG ++L+ ++ K + ID+G
Sbjct: 276 EG---YSTPLDVVDG--HYQVILEGISVGE----TRLVIDPSAFKRTEKQRRVIIDSGTA 326
Query: 259 PTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQLCYKTPSMAGIA--PILTAHFDGGA 314
PT L ++ Y LE +VRN + LTP+ S LCYK + P +T HF GA
Sbjct: 327 PTWLAENEYRALEREVRNLLDRFLTPFMRE---SFLCYKGKVGQDLVGFPAVTFHFAEGA 383
Query: 315 KVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
L+ + V G D + G AQ + YD + + F+ DC
Sbjct: 384 D--LVVDTEMRQASVYG-------KDFKDFSVIGLMAQQYYNVAYDLNKHKLFFQRIDC 433
>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 438
Score = 147 bits (371), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 113/360 (31%), Positives = 176/360 (48%), Gaps = 39/360 (10%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
N EY+M + TPP+ + + DTGS L+W++C ++ + PASSS Y L C
Sbjct: 73 NFEYLMALDVSTPPV-RMLALADTGSSLVWLKC--------KLPAAHTPASSS-YARLPC 122
Query: 81 QSEQCHLL-DTVSC----SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
+ C L D SC S +C Y Y +AD S T G + + TF +F G
Sbjct: 123 DAFACKALGDAASCRATGSGNNICVYRYAFADGSCTAGPVTVDAFTFSTRLDF------G 176
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLG-ANKFSYCLVPFHTDSSITSKMYFG 194
C G+ ++ GLVGL +SL SQ+ ++ A+KFSYCLVP+ + +++S + FG
Sbjct: 177 CATRTEGLSVPDD-GLVGLANGPISLVSQLSAKTPFAHKFSYCLVPYSSSETVSSSLNFG 235
Query: 195 NGSEVSGG-GVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
+ + VS G +T LV+ +K++Y + L+ I V + K +P ++ + +
Sbjct: 236 SHAIVSSSPGAATTPLVAGRNKSFYTIALDSIKV-----AGKPVPLQTTT-----TKLIV 285
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY----KTPSMAGIA-PILTA 308
D+G T LPK + L + AIKL + P +CY + P G + P +T
Sbjct: 286 DSGTMLTYLPKAVLDPLVAALTAAIKLPRVKSPETLYAVCYDVRRRAPEDVGKSIPDVTL 345
Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
GG +V L +TF+ C A+ I GN AQ +L +G+D + + VSF
Sbjct: 346 VLGGGGEVRLPWGNTFVVENKGTTVCLALVESHLPEFILGNVAQQNLHVGFDLERRTVSF 405
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 147 bits (371), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 114/362 (31%), Positives = 168/362 (46%), Gaps = 30/362 (8%)
Query: 23 EYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
E+V+ GTP Y ++ DTGSD+ W+QCLPC CYKQ PI++P S++Y + C
Sbjct: 134 EFVVTVGFGTP--AQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSVVPC 191
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
QC D CS+ C Y Y D S + GVL+ E ++ S FGCG N
Sbjct: 192 GHPQCAAADGSKCSNGT-CLYKVEYGDGSSSAGVLSHETLSL-TSTRALPGFAFGCGQTN 249
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
G F + + GL+GLGR +LSL+SQ + G FSYCL +D++ + G + S
Sbjct: 250 LGDFGDVD-GLIGLGRGQLSLSSQAAASFGGT-FSYCL---PSDNTTHGYLTIGPTTPAS 304
Query: 201 GGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPP 259
V T++V K+D ++YFV L I +G ++P + + F+D+G
Sbjct: 305 NDDVQYTAMVQKQDYPSFYFVELVSIDIGGY-----ILPVPPT--LFTDDGTFLDSGTIL 357
Query: 260 TLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAK 315
T LP + Y L ++ + + K P DP CY + I P ++ F G+
Sbjct: 358 TYLPPEAYTALRDRFKFTMTQYKPAPAYDPF---DTCYDFTGQSAIFIPAVSFKFSDGSV 414
Query: 316 VPLIHTSTFIPP----PVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
L I P P G F +P I GN Q + + YD ++ + F
Sbjct: 415 FDLSFFGILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASA 474
Query: 372 DC 373
C
Sbjct: 475 SC 476
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 117/372 (31%), Positives = 173/372 (46%), Gaps = 33/372 (8%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
N+ T N Y++ +G ++ I+DTGSDL WVQC PC+ CY Q P++NP++SSSY
Sbjct: 127 NLETLN--YIVTIGLGNQ---NMTVIIDTGSDLTWVQCDPCMSCYSQQGPVFNPSNSSSY 181
Query: 76 KELSCQSEQCHLL-----DTVSCSSQQ--LCNYTYGYADSSLTKGVLATERITFGNSNNF 128
L C S C L +T +C S CN+T Y D S T G L E ++FG +
Sbjct: 182 NSLLCNSSTCQNLQFTTGNTEACESNNPSSCNHTVSYGDGSFTDGELGVEHLSFGGIS-- 239
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
N VFGCG NN G+F G++GLGR+ LS+ SQ + G FSYCL TDS +
Sbjct: 240 VSNFVFGCGRNNKGLFG-GVSGIMGLGRSNLSMISQTNTTFGG-VFSYCLPT--TDSGAS 295
Query: 189 SKMYFGNGSEVSGG--GVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
+ GN S + + TS+VS + +Y + L GI VG ++ +
Sbjct: 296 GSLVIGNESSLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGGVAIQDT---------S 346
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-P 304
G + ID+G T L YN L+ + P C+ + ++ P
Sbjct: 347 FGNGGILIDSGTVITRLAPSLYNALKAEFLKQFSGYPIAPALSILDTCFNLTGIEEVSIP 406
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI--DGDVGIFGNFAQSDLFIGYDFD 362
L+ HF+ + + P C A+ + + D+ I GN+ Q + + YD
Sbjct: 407 TLSMHFENNVDLNVDAVGILYMPKDGSQVCLALASLSDENDMAIIGNYQQRNQRVIYDAK 466
Query: 363 SQMVSFKPTDCT 374
+ F DC+
Sbjct: 467 QSKIGFAREDCS 478
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 121/388 (31%), Positives = 189/388 (48%), Gaps = 40/388 (10%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V+S V+ +GEY++ +GTPP I+DTGSDL W+QC PC+ C++Q P+++PA+S
Sbjct: 141 VESGVAVGSGEYLVDLYVGTPPR-RFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAAS 199
Query: 73 SSYKELSCQSEQCHLL----DTVSCSSQQL--CNYTYGYADSSLTKGVLATERITFG--- 123
SY+ ++C +C L+ +C C Y Y Y D S T G LA E T
Sbjct: 200 LSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTA 259
Query: 124 -NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
++ D+VVFGCGH+N G+F+ L+GLGR LS ASQ+ + G + FSYCLV
Sbjct: 260 PGASRRVDDVVFGCGHSNRGLFHGAAG-LLGLGRGALSFASQLRAVYG-HAFSYCLV--D 315
Query: 183 TDSSITSKMYFGNGSEVSGGGVVS----TSLVSKEDKTYYFVTLEGISVG----NLSNSS 234
SS+ SK+ FG+ + G ++ + T+Y+V L+G+ VG N+S S+
Sbjct: 316 HGSSVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPST 375
Query: 235 KLIPYYNSSGAISKGNMFIDTGAPPT--LLPKDFYNRLEEQ---VRNAIKLTPYQDPRLG 289
+ S G I + A P ++ + F R+++ V + L+P
Sbjct: 376 WDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSP------- 428
Query: 290 SQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFA-MQPIDGDVGIF 347
CY + + P + F GA + F+ +G+ C A + + I
Sbjct: 429 ---CYNVSGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSII 485
Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
GNF Q + + YD + + F P C +
Sbjct: 486 GNFQQQNFHVLYDLQNNRLGFAPRRCAE 513
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 117/387 (30%), Positives = 175/387 (45%), Gaps = 38/387 (9%)
Query: 15 SNVSTANGEYVMKFSIGTP----PLLDIYGIVDTGSDLMWVQCLP---CVQCYKQVKPIY 67
S S +G+Y ++ +GTP PL I+DTGSDL W+QC P P Y
Sbjct: 18 SGSSIGSGQYFVELRVGTPAKKFPL-----IIDTGSDLTWIQCNPPNTTANSSSPPAPWY 72
Query: 68 NPASSSSYKELSCQSEQCHLL-----DTVSCSSQQLCNYTYGYADSSLTKGVLATERITF 122
+ +SSSSY+E+ C ++C L + S S C+YTYGY+D S T G+LA E I+
Sbjct: 73 DKSSSSSYREIPCTDDECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISM 132
Query: 123 ----------GNSNNF---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQL 169
GN NV GC + G G++GLG+ +SLA+Q
Sbjct: 133 KSRKRSGKRAGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTA 192
Query: 170 GANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVG 228
FSYCLV + S+ +S + G + T +V +++Y+V + G++V
Sbjct: 193 LGGIFSYCLVDYLRGSNASSFLVMG---RTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVD 249
Query: 229 NLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRL 288
+ G +KG +F D+G + L + Y+++ + +I L Q+
Sbjct: 250 GKPVDGIASSDWGIDGDGNKGTIF-DSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPE 308
Query: 289 GSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG--I 346
G +LCY M P L F GGA + L + ++ E V C A+Q + G I
Sbjct: 309 GFELCYNVTRMEKGMPKLGVEFQGGAVMEL-PWNNYMVLVAENVQCVALQKVTTTNGSNI 367
Query: 347 FGNFAQSDLFIGYDFDSQMVSFKPTDC 373
GN Q D I YD + FK + C
Sbjct: 368 LGNLLQQDHHIEYDLAKARIGFKWSPC 394
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 121/388 (31%), Positives = 189/388 (48%), Gaps = 40/388 (10%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V+S V+ +GEY++ +GTPP I+DTGSDL W+QC PC+ C++Q P+++PA+S
Sbjct: 141 VESGVAVGSGEYLVDLYVGTPPR-RFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPATS 199
Query: 73 SSYKELSCQSEQCHLL----DTVSCSSQQL--CNYTYGYADSSLTKGVLATERITFG--- 123
SY+ ++C +C L+ +C C Y Y Y D S T G LA E T
Sbjct: 200 LSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTA 259
Query: 124 -NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
++ D+VVFGCGH+N G+F+ L+GLGR LS ASQ+ + G + FSYCLV
Sbjct: 260 PGASRRVDDVVFGCGHSNRGLFHGAAG-LLGLGRGALSFASQLRAVYG-HAFSYCLV--D 315
Query: 183 TDSSITSKMYFGNGSEVSGGGVVS----TSLVSKEDKTYYFVTLEGISVG----NLSNSS 234
SS+ SK+ FG+ + G ++ + T+Y+V L+G+ VG N+S S+
Sbjct: 316 HGSSVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPST 375
Query: 235 KLIPYYNSSGAISKGNMFIDTGAPPT--LLPKDFYNRLEEQ---VRNAIKLTPYQDPRLG 289
+ S G I + A P ++ + F R+++ V + L+P
Sbjct: 376 WDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSP------- 428
Query: 290 SQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFA-MQPIDGDVGIF 347
CY + + P + F GA + F+ +G+ C A + + I
Sbjct: 429 ---CYNVSGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSII 485
Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
GNF Q + + YD + + F P C +
Sbjct: 486 GNFQQQNFHVLYDLQNNRLGFAPRRCAE 513
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 120/366 (32%), Positives = 166/366 (45%), Gaps = 25/366 (6%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSY 75
+ +AN Y + +GTP D+ + DTGSDL W QC PC CYKQ I++P+ SSSY
Sbjct: 131 IGSAN--YFVVVGLGTPKR-DLSLVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSY 187
Query: 76 KELSCQSEQCHLLDTVSC-----SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
++C S C L + SS C Y Y D S + G L+ ER+T + + D
Sbjct: 188 INITCTSSLCTQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGFLSQERLTI-TATDIVD 246
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
+ +FGCG +N G+F+ GL+GLGR +S Q S + FSYCL + SS
Sbjct: 247 DFLFGCGQDNEGLFS-GSAGLIGLGRHPISFVQQT-SSIYNKIFSYCL---PSTSSSLGH 301
Query: 191 MYFGNGSEVSGGGVVSTSLVS-KEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
+ FG S + + T L + D T+Y + + GISVG +P +SS S G
Sbjct: 302 LTFG-ASAATNANLKYTPLSTISGDNTFYGLDIVGISVGGTK-----LPAVSSS-TFSAG 354
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTA 308
ID+G T L Y L R ++ P + CY I+ P +
Sbjct: 355 GSIIDSGTVITRLAPTAYAALRSAFRQGMEKYPVANEDGLFDTCYDFSGYKEISVPKIDF 414
Query: 309 HFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
F GG V L I + V FA D D+ IFGN Q L + YD + +
Sbjct: 415 EFAGGVTVELPLVGILIGRSAQQVCLAFAANGNDNDITIFGNVQQKTLEVVYDVEGGRIG 474
Query: 368 FKPTDC 373
F C
Sbjct: 475 FGAAGC 480
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 123/393 (31%), Positives = 190/393 (48%), Gaps = 46/393 (11%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V+S V+ +GEY+M +GTPP I+DTGSDL W+QC PC+ C+ QV P+++PA+S
Sbjct: 140 VESGVAVGSGEYLMDVYVGTPPR-RFRMIMDTGSDLNWLQCAPCLDCFDQVGPVFDPAAS 198
Query: 73 SSYKELSCQSEQCHLL----DTVSCSS--QQLCNYTYGYADSSLTKGVLATERITFG--- 123
SSY+ ++C ++C L+ +C + C Y Y Y D S T G LA E T
Sbjct: 199 SSYRNVTCGDQRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTA 258
Query: 124 -NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
++ D+VVFGCGH N G+F+ L LS ASQ+ + G + FSYCLV
Sbjct: 259 PGASRRVDDVVFGCGHWNRGLFHGAAGLLGLGR-GPLSFASQLRAVYG-HTFSYCLVDHG 316
Query: 183 TDSSITSKMYFGNGSEVSGGGV------VSTSLVSKEDKTYYFVTLEGISVG----NLSN 232
+D + SK+ FG ++ + + S T+Y+V L+G+ VG N+S+
Sbjct: 317 SD--VASKVVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISS 374
Query: 233 SSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEE----QVRNAIKLTPYQDPRL 288
+ + G G ID+G + + Y + + ++ + L P P L
Sbjct: 375 DT----WGVGEGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIP-DFPVL 429
Query: 289 GSQLCYKTPSMAGI----APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM--QPIDG 342
CY +++G+ P L+ F GA + FI +G+ C A+ P G
Sbjct: 430 SP--CY---NVSGVDRPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTG 484
Query: 343 DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
+ I GNF Q + + YD + + F P C +
Sbjct: 485 -MSIIGNFQQQNFHVVYDLKNNRLGFAPRRCAE 516
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 110/372 (29%), Positives = 176/372 (47%), Gaps = 40/372 (10%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
NV +++ SIG+PP+ + + DT SDL+W+QCLPC+ CY Q PI++P+ S ++
Sbjct: 77 NVPIIPQAFLVNISIGSPPITQLLHM-DTASDLLWIQCLPCINCYAQSLPIFDPSRSYTH 135
Query: 76 KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG-----NSNNFFD 130
+ +C++ Q + ++ + C Y+ Y D + +KG+LA E + F +S+
Sbjct: 136 RNETCRTSQYSMPSLKFNANTRSCEYSMRYVDDTGSKGILAREMLLFNTIYDESSSAALH 195
Query: 131 NVVFGCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
+VVFGCGH+N G E + G++GLG SL + KFSYC S
Sbjct: 196 DVVFGCGHDNYG---EPLVGTGILGLGYGEFSLVHRF-----GKKFSYCFGSLDDPSYPH 247
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP----YYNSSG 244
+ + G+ G ++ + + +Y+VT+E ISV + ++P +N +
Sbjct: 248 NVLVLGD----DGANILGDTTPLEIHNGFYYVTIEAISVDGI-----ILPIDPRVFNRNH 298
Query: 245 AISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
G IDTG T L ++ Y NR+E+ + CY
Sbjct: 299 QTGLGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECYNGNFER 358
Query: 301 GIA----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLF 356
+ PI+T HF GA++ L S F+ VFC A+ P G++ G AQ
Sbjct: 359 DLVESGFPIVTFHFSEGAELSLDVKSLFMKLS-PNVFCLAVTP--GNLNSIGATAQQSYN 415
Query: 357 IGYDFDSQMVSF 368
IGYD ++ VSF
Sbjct: 416 IGYDLEAMEVSF 427
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 109/347 (31%), Positives = 163/347 (46%), Gaps = 26/347 (7%)
Query: 41 IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCN 100
IVDT S+L WVQC PC C+ Q +P+++P+SS SY + C S C L + S Q C+
Sbjct: 127 IVDTASELTWVQCEPCDACHDQQEPLFDPSSSPSYAAVPCNSSSCDALRVATGMSGQACD 186
Query: 101 -------YTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVG 153
YT Y D S ++GVLA +R++ + VFGCG +N G F GL+G
Sbjct: 187 DQPAACSYTLSYRDGSYSRGVLAHDRLSLAGED--IQGFVFGCGTSNQGPFGGTS-GLMG 243
Query: 154 LGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV--SGGGVVSTSLVS 211
LGR++LSL SQ + Q G FSYCL P + SS + G+ + V + +V T++VS
Sbjct: 244 LGRSQLSLISQTMDQFG-GVFSYCLPPKESGSS--GSLVLGDDASVYRNSTPIVYTAMVS 300
Query: 212 KE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRL 270
+Y L GI+VG S P +++ G G +D+G T L Y +
Sbjct: 301 DPLQGPFYLANLTGITVGGEDVQS---PGFSAGGG---GKAIVDSGTIITSLVPSVYAAV 354
Query: 271 EEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTST-FIPPP 328
+ + + P P C+ + + P L FDGGA+V + ++
Sbjct: 355 RAEFVSQLAEYPQAAPFSILDTCFDLTGLREVQVPSLKLVFDGGAEVEVDSKGVLYVVTG 414
Query: 329 VEGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
C A+ + D I GN+ Q +L + +D + F C
Sbjct: 415 DASQVCLALASLKSEYDTPIIGNYQQKNLRVIFDTVGSQIGFAQETC 461
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 121/380 (31%), Positives = 175/380 (46%), Gaps = 28/380 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S T +GEY+ K ++GTP + + + DTGSD+ W+QC PC +CY Q P+++P S
Sbjct: 123 VVSRAPTTSGEYMAKIAVGTPAVEALLAM-DTGSDITWLQCQPCRRCYPQSGPVFDPRHS 181
Query: 73 SSYKELSCQSEQCHLLDTVSC--SSQQLCNYTYGYA-DSSLTKGVLATERITFGNSNNFF 129
+SY+E+ + C L + + C Y GY D S T G E +TF
Sbjct: 182 TSYREMGYDAPDCQALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFAGGVQ-V 240
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGAN--KFSYCLVPFHTDS-- 185
++ GCGH+N G+F G++GLGR ++S SQI + LG N FSYCL F S
Sbjct: 241 PHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQI-AALGYNVTSFSYCLADFFLSSPG 299
Query: 186 -SITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNL------SNSSKLI 237
S++S + G+G+ T V + T+Y+V L G+SVG + + KL
Sbjct: 300 RSVSSTLTIGDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRVPGVTEDDLKLD 359
Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQLCY 294
PY +G + +D+G T L + Y + R A + P CY
Sbjct: 360 PY------TGRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPSGFFDTCY 413
Query: 295 KTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQS 353
A P ++ HF GG ++ L + IP G CFA D V I GN Q
Sbjct: 414 TMGGRAMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDRSVSIIGNIQQQ 473
Query: 354 DLFIGYDFDSQMVSFKPTDC 373
+ Y+ V F P C
Sbjct: 474 GFRVVYNIGGGRVGFAPNSC 493
>gi|224143825|ref|XP_002325088.1| predicted protein [Populus trichocarpa]
gi|222866522|gb|EEF03653.1| predicted protein [Populus trichocarpa]
Length = 241
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 115/292 (39%), Positives = 153/292 (52%), Gaps = 58/292 (19%)
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
SSS+Y ++ S +CHLLDTVS + + SS +KG +R++
Sbjct: 7 SSSTYTTINRHSNKCHLLDTVSLPRKTI-------TLSSSSKG----QRVSV-------P 48
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
++VFGCGHNNTG FNE+EMG VG G SL S+I S G KF++CLVPFH+ +I+SK
Sbjct: 49 DIVFGCGHNNTG-FNEHEMGSVGRGGRPSSLTSRIGSYSGNIKFTHCLVPFHSTLNISSK 107
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
MY G+GSE+ G GVVST LV K+++ YY+V LEGISV K + Y+SSG ISK
Sbjct: 108 MYSGDGSEIIGKGVVSTPLVRKKNRAYYYVALEGISV-----RGKFLT-YSSSGTISK-- 159
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHF 310
+ + GA +V ++P QD C+ P+ A F
Sbjct: 160 VHFEGGA---------------RVPTTTFISPKQD-----VFCFAMTITDAFMPV--ACF 197
Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFD 362
G L+ F P +AM GIFGNFAQS+ IG+D D
Sbjct: 198 ARGLGSFLL----FRPRT-----SYAMSESMLAGGIFGNFAQSNFRIGFDLD 240
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 128/386 (33%), Positives = 197/386 (51%), Gaps = 35/386 (9%)
Query: 12 VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
++S ++ +GEY M +G+PP I+DTGSDL W+QCLPC C++Q Y+P +
Sbjct: 143 TLESGMTLGSGEYFMDVLVGSPPK-HFSLILDTGSDLNWIQCLPCHDCFQQNGAFYDPKA 201
Query: 72 SSSYKELSCQSEQCHLLDTVS----CSS-QQLCNYTYGYADSSLTKGVLATERITF---- 122
S+SYK ++C +C+L+ C S Q C Y Y Y DSS T G A E T
Sbjct: 202 SASYKNITCNDPRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTT 261
Query: 123 -GNSNNFF--DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
G S+ + +N++FGCGH N G+F+ L+GLGR LS +SQ+ S G + FSYCLV
Sbjct: 262 SGGSSELYNVENMMFGCGHWNRGLFHGAAG-LLGLGRGPLSFSSQLQSLYG-HSFSYCLV 319
Query: 180 PFHTDSSITSKMYFGNGSE-VSGGGVVSTSLVSKEDK---TYYFVTLEGISV-GNLSNSS 234
++D++++SK+ FG + +S + TS V++++ T+Y+V ++ I V G + N
Sbjct: 320 DRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIP 379
Query: 235 KLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTPYQD-PRLG 289
+ +S GA G ID+G + + Y N++ E+ + K Y+D P L
Sbjct: 380 EETWNISSDGA---GGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKG--KYPVYRDFPILD 434
Query: 290 SQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIF 347
C+ + I P L F GA ++FI E + C A+ I
Sbjct: 435 P--CFNVSGIDSIQLPELGIAFADGAVWNFPTENSFIWLN-EDLVCLAILGTPKSAFSII 491
Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDC 373
GN+ Q + I YD + + PT C
Sbjct: 492 GNYQQQNFHILYDTKRSRLGYAPTKC 517
>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 457
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 118/371 (31%), Positives = 175/371 (47%), Gaps = 30/371 (8%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP--CVQCYKQVKPIYNPASS 72
S +S + YVMKFSIG+P + D Y I D+GS L+W+QC C CY+Q P++NP+ S
Sbjct: 92 SRMSYTDKAYVMKFSIGSPAV-DTYAIPDSGSSLVWLQCGTPYCRNCYRQKIPLFNPSKS 150
Query: 73 SSYKELSCQSEQCHLL---DTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
+Y + C + +C + + C Q+C Y Y D S T+GV++T+ TF +
Sbjct: 151 VTYMKRLCNTAECRVALGDEYWRCKKPNQICKYHEDYLDDSYTEGVISTDIFTFPEHISG 210
Query: 129 FDN----VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
F N ++FGCG+NN+ + GLVGL + SL + Q+ ++FSYC V T+
Sbjct: 211 FGNYTLRIIFGCGYNNSDPQHFYPPGLVGLTNNKASL----VGQMDVDQFSYC-VSIDTE 265
Query: 185 SSITSKM--YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSS--KLIPYY 240
++ M FG + +SG ST LV D Y F ++GI V + Y
Sbjct: 266 QNLKGSMEIRFGLAASISGH---STQLVPNSDGWYIFKNVDGIYVNEFEVEGYPAWVFKY 322
Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD-PRLGSQLCYKTPSM 299
G +G + +DTG T L + L + + I + P +D G +LCY +
Sbjct: 323 TEGG---QGGLTMDTGTTYTELHNSVMDPLIKLLEEHITIVPEKDYSNSGFELCYFSDDF 379
Query: 300 AGIA-PILTAHF-DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFI 357
G P + F D +T P C AM +G + I G D+ I
Sbjct: 380 LGATLPDIELRFTDNKDTYFSFNTRNAWTPNGRSQMCLAMFRTNG-MSIIGMHQLRDIKI 438
Query: 358 GYDFDSQMVSF 368
GYD +VSF
Sbjct: 439 GYDLHHNIVSF 449
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 124/385 (32%), Positives = 175/385 (45%), Gaps = 42/385 (10%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC----LPCVQCYKQVKPIYN 68
V + V A +Y+ + IG+PP ++DTGSDL+W QC LP C KQ P YN
Sbjct: 75 VSAQVHRATRQYIASYLIGSPPQ-RTEALIDTGSDLIWTQCATTCLP-KSCAKQGLPYYN 132
Query: 69 PASSSSYKELSCQSEQ--CHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
+ SS++ + C + C C C + Y + G L TE F +
Sbjct: 133 LSQSSTFVPVPCADKAGFCAANGVHLCGLDGSCTFIASYGAGRVI-GSLGTESFAFESGT 191
Query: 127 NFFDNVVFGC---GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHT 183
++ FGC +G N+ GL+GLGR RLSL SQI GA +FSYCL P+
Sbjct: 192 T---SLAFGCVSLTRITSGALNDAS-GLIGLGRGRLSLVSQI----GATRFSYCLTPYFH 243
Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKED---KTYYFVTLEGISVGNLSNSSKLIPYY 240
S +S ++ G + + GGG + S +D T+Y++ LEGI+VG +P
Sbjct: 244 SSGASSHLFVGASASLGGGGASMPFVKSPKDYPYSTFYYLPLEGITVGKTR-----LPAV 298
Query: 241 NSS--------GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLG 289
NS+ G + IDTG+P T L Y L+E+V + L P + G
Sbjct: 299 NSTTFQLRQLFKGYWAGGVIIDTGSPLTQLASHAYEALKEEVAAQLGNGSLVPAPEDS-G 357
Query: 290 SQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGN 349
+LC + P L HF GGA + + S + PV+ M G I GN
Sbjct: 358 LELCVAREGFQKVVPALVFHFGGGADMAVPAASYW--APVDKAAACMMILEGGYDSIIGN 415
Query: 350 FAQSDLFIGYDFDSQMVSFKPTDCT 374
F Q D+ + YD SF+ DCT
Sbjct: 416 FQQQDMHLLYDLRRGRFSFQTADCT 440
>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
Length = 418
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 123/373 (32%), Positives = 186/373 (49%), Gaps = 45/373 (12%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
+ + G Y M FS+GTPP + + DTGSDL+W +C C +C + Y P SSS+
Sbjct: 73 QMDSGGGAYDMTFSMGTPPQ-TLSALADTGSDLIWAKCGACKRCAPRGSASYYPTKSSSF 131
Query: 76 KELSCQSEQCHLLDTVSCSS-------QQLCNYTYGYADSS----LTKGVLATERITFGN 124
+L C S C L++ S ++ +C+Y Y Y SS T+G + +E T G
Sbjct: 132 SKLPCSSALCRTLESQSLATCGGTRARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTLG- 190
Query: 125 SNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
++ + FGC + + GLVGLGR +LSL Q+ ++GA FSYCL +D
Sbjct: 191 -SDAVQGIGFGC-TTMSEGGYGSGSGLVGLGRGKLSLVRQL--KVGA--FSYCLT---SD 241
Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
S +S + FG G+ ++G GV ST LV+ + T+Y V L+ IS+G + G
Sbjct: 242 PSTSSPLLFGAGA-LTGPGVQSTPLVNLKTSTFYTVNLDSISIGAA----------KTPG 290
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLE----EQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
G +F D+G T L + Y E Q N ++ P D G ++C++T S
Sbjct: 291 TGRHGIIF-DSGTTLTFLAEPAYTLAEAGLLSQTTNLTRV-PGTD---GYEVCFQT-SGG 344
Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
+ P + HFDGG + T + + V C+ +Q ++ I GN Q D I YD
Sbjct: 345 AVFPSMVLHFDGGDMA--LKTENYFGAVNDSVSCWLVQKSPSEMSIVGNIMQMDYHIRYD 402
Query: 361 FDSQMVSFKPTDC 373
D ++SF+PT+C
Sbjct: 403 LDKSVLSFQPTNC 415
>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
Length = 446
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 112/374 (29%), Positives = 175/374 (46%), Gaps = 28/374 (7%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPASSSS 74
V A +YV ++ IG PP ++DTGSDL+W QC C++ C +Q P YN ++SS+
Sbjct: 83 VRWATLQYVAEYLIGDPPQ-RAEALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASST 141
Query: 75 YKELSCQSEQCHLLDTVS--CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
+ + C + C D + C C+ GY + + G L TE F + +
Sbjct: 142 FAPVPCAARICAANDDIIHFCDLAAGCSVIAGYG-AGVVAGTLGTEAFAFQSGTA---EL 197
Query: 133 VFGCGHNNTGVFN--ENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
FGC V GL+GLGR RLSL +SQ GA KFSYCL P+ ++ T
Sbjct: 198 AFGCVTFTRIVQGALHGASGLIGLGRGRLSL----VSQTGATKFSYCLTPYFHNNGATGH 253
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDK--TYYFVTLEGISVG--NLSNSSKLIPYYNSSGAI 246
++ G + + G G V T+ K K +Y++ L G++VG L + + + +
Sbjct: 254 LFVGASASLGGHGDVMTTQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGL 313
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCYKTPSMAGIA 303
G + ID+G+P T L D Y+ L ++ N + P D G+ LC + +
Sbjct: 314 FSGGVIIDSGSPFTSLVHDAYDALASELAARLNGSLVAPPPDADDGA-LCVARRDVGRVV 372
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG---DVGIFGNFAQSDLFIGYD 360
P + HF GGA + + S + PV+ G + GN+ Q ++ + YD
Sbjct: 373 PAVVFHFRGGADMAVPAESYW--APVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYD 430
Query: 361 FDSQMVSFKPTDCT 374
+ SF+P DC+
Sbjct: 431 LANGDFSFQPADCS 444
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 120/396 (30%), Positives = 173/396 (43%), Gaps = 52/396 (13%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-YKQVKPIYNPAS 71
V S ++ +G+Y + IG PP + I DTGSDL+WV+C C C + ++ P
Sbjct: 73 VVSGAASGSGQYFVDLRIGQPPQ-SLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRH 131
Query: 72 SSSYKELSCQSEQCHLLDTVS----CSSQQL---CNYTYGYADSSLTKGVLATERITFGN 124
SS++ C C L+ C+ ++ C+Y YGYAD SLT G+ A E +
Sbjct: 132 SSTFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKT 191
Query: 125 SNN---FFDNVVFGCGHNNTG------VFNENEMGLVGLGRTRLSLASQILSQLGANKFS 175
S+ +V FGCG +G FN G++GLGR +S ASQ+ + G NKFS
Sbjct: 192 SSGKEARLKSVAFGCGFRISGQSVSGTSFN-GANGVMGLGRGPISFASQLGRRFG-NKFS 249
Query: 176 YCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSK 235
YCL+ + TS + GNG + + L + T+Y+V L+ + V N +K
Sbjct: 250 YCLMDYTLSPPPTSYLIIGNGGDGISKLFFTPLLTNPLSPTFYYVKLKSVFV----NGAK 305
Query: 236 L-----IPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS 290
L I + SG G +D+G L + Y + VR +KL G
Sbjct: 306 LRIDPSIWEIDDSG---NGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIADALTPGF 362
Query: 291 QLCYKTPSMA---GIAPILTAHFDGGAKVPLIHTSTFIPPPV-------EGVFCFAMQPI 340
LC + I P L F GGA F+PPP E + C A+Q +
Sbjct: 363 DLCVNVSGVTKPEKILPRLKFEFSGGA--------VFVPPPRNYFIETEEQIQCLAIQSV 414
Query: 341 DGDVG--IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
D VG + GN Q +D D + F C
Sbjct: 415 DPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 450
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 117/370 (31%), Positives = 176/370 (47%), Gaps = 32/370 (8%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
A+ + + +GTPP I+D GSDL+W QC KQ++P+++ A SSS+ L
Sbjct: 103 AHQGHSLTVGVGTPPQPSKV-ILDLGSDLLWTQCSLVGPTAKQLEPVFDAARSSSFSVLP 161
Query: 80 CQSEQCHL--LDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
C S+ C +C+ ++ C Y Y + T GVLATE TFG + N+ FGCG
Sbjct: 162 CDSKLCEAGTFTNKTCTDRK-CAYENDYGIMTAT-GVLATETFTFGAHHGVSANLTFGCG 219
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
G E G++GL LS+ L QL KFSYCL PF TS + FG +
Sbjct: 220 KLANGTIAEAS-GILGLSPGPLSM----LKQLAITKFSYCLTPFADRK--TSPVMFGAMA 272
Query: 198 EV----SGGGVVSTSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS---KG 249
++ + G V + L+ + YY+V + G+SVG SK + + AI G
Sbjct: 273 DLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGMSVG-----SKRLDVPQETLAIKPDGTG 327
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP---SMAGI-API 305
+D+ L + + L++ V IKL +C++ P SM G+ P
Sbjct: 328 GTVLDSATTLAYLVEPAFTELKKAVMEGIKLPVANRSVDDYPVCFELPRGMSMEGVQVPP 387
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ--PIDGDVGIFGNFAQSDLFIGYDFDS 363
L HFDG A++ L + F P G+ C A+ P +G + GN Q ++ + YD +
Sbjct: 388 LVLHFDGDAEMSLPRDNYF-QEPSPGMMCLAVMQAPFEGAPNVIGNVQQQNMHVLYDVGN 446
Query: 364 QMVSFKPTDC 373
+ S+ PT C
Sbjct: 447 RKFSYAPTKC 456
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 118/373 (31%), Positives = 177/373 (47%), Gaps = 33/373 (8%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
N+ T N Y++ +G+ ++ I+DTGSDL WVQC PC+ CY Q PI+ P++SSSY
Sbjct: 59 NLQTLN--YIVTMGLGSK---NMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSY 113
Query: 76 KELSCQSEQCHLL-----DTVSCSSQQ--LCNYTYGYADSSLTKGVLATERITFGNSNNF 128
+ +SC S C L +T +C S CNY Y D S T G L E ++FG +
Sbjct: 114 QSVSCNSSTCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSFGGVS-- 171
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
+ VFGCG NN G+F GL+GLGR+ LSL SQ + G FSYCL T++ +
Sbjct: 172 VSDFVFGCGRNNKGLFG-GVSGLMGLGRSYLSLVSQTNATFGG-VFSYCLPT--TEAGSS 227
Query: 189 SKMYFGNGSEV--SGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
+ GN S V + + T ++S + +Y + L GI VG ++ + L +
Sbjct: 228 GSLVMGNESSVFKNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALKAPL--------S 279
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-P 304
G + ID+G T LP Y L+ + P C+ ++ P
Sbjct: 280 FGNGGILIDSGTVITRLPSSVYKALKAEFLKKFTGFPSAPGFSILDTCFNLTGYDEVSIP 339
Query: 305 ILTAHFDGGAKVPLIHTSTF-IPPPVEGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDF 361
++ F+G A++ + T TF + C A+ + D I GN+ Q + + YD
Sbjct: 340 TISLRFEGNAQLNVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDT 399
Query: 362 DSQMVSFKPTDCT 374
V F C+
Sbjct: 400 KQSKVGFAEEPCS 412
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 117/369 (31%), Positives = 174/369 (47%), Gaps = 19/369 (5%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+ S + ++ Y++K GTPP Y ++DTGS++ W+ C PC C + +P + P+ S
Sbjct: 113 LASGQAISSSNYIIKLGFGTPPQ-SFYTVLDTGSNIAWIPCNPCSGCSSKQQP-FEPSKS 170
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
S+Y L+C S+QC LL + S + C+ T Y D S +L++E ++ G+ +N
Sbjct: 171 STYNYLTCASQQCQLLRVCTKSDNSVNCSLTQRYGDQSEVDEILSSETLSVGSQQ--VEN 228
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
VFGC + G+ LVG GR LS SQ + L + FSYCL P S+ T +
Sbjct: 229 FVFGCSNAARGLIQRTP-SLVGFGRNPLSFVSQT-ATLYDSTFSYCL-PSLFSSAFTGSL 285
Query: 192 YFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
G +S G+ T L+S ++Y+V L GISVG S IP S S G
Sbjct: 286 LLGK-EALSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVS---IPAGTLSLDESTGR 341
Query: 251 -MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAH 309
ID+G T L + YN + + R+ + P CY PS P++T H
Sbjct: 342 GTIIDSGTVITRLVEPAYNAMRDSFRSQLSNLTMASPTDLFDTCYNRPSGDVEFPLITLH 401
Query: 310 FDGGAKVPLIHTSTFIPPPVEG-VFC--FAMQPIDGD--VGIFGNFAQSDLFIGYDFDSQ 364
FD + L + P +G V C F + P GD + FGN+ Q L I +D
Sbjct: 402 FDDNLDLTLPLDNILYPGNDDGSVLCLAFGLPPGGGDDVLSTFGNYQQQKLRIVHDVAES 461
Query: 365 MVSFKPTDC 373
+ +C
Sbjct: 462 RLGIASENC 470
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 144 bits (363), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 124/387 (32%), Positives = 190/387 (49%), Gaps = 38/387 (9%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
++S VS +GEY M +GTPP I+DTGSDL W+QC+PC+ C++Q P Y+P S
Sbjct: 186 LESGVSLGSGEYFMDVFVGTPPK-HFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDS 244
Query: 73 SSYKELSCQSEQCHLLDTVS----CSSQ-QLCNYTYGYADSSLTKGVLATERITF----- 122
SS++ +SC +C L+ C ++ Q C Y Y Y D S T G A E T
Sbjct: 245 SSFRNISCHDPRCQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTP 304
Query: 123 -GNSN-NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
G S +NV+FGCGH N G+F+ L+GLG+ LS ASQ+ S G + FSYCLV
Sbjct: 305 NGTSELKHVENVMFGCGHWNRGLFHGAAG-LLGLGKGPLSFASQMQSLYGQS-FSYCLVD 362
Query: 181 FHTDSSITSKMYFGNGSE-VSGGGVVSTSLVSKED---KTYYFVTLEGISVGN-LSNSSK 235
++++S++SK+ FG E +S + TS +D T+Y+V ++ + V + + +
Sbjct: 363 RNSNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPE 422
Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK 295
+ +S GA G ID+G T + Y ++E IK + + CY
Sbjct: 423 ETWHLSSEGA---GGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLPPLKPCY- 478
Query: 296 TPSMAGIAPILTAHF------DGGAKVPLIHTSTFIPPPVEGVFCFAM--QPIDGDVGIF 347
+++GI + F + P+ + +I P V C A+ P + I
Sbjct: 479 --NVSGIEKMELPDFGILFADEAVWNFPVENYFIWIDPE---VVCLAILGNPRSA-LSII 532
Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDCT 374
GN+ Q + I YD + + P C
Sbjct: 533 GNYQQQNFHILYDMKKSRLGYAPMKCA 559
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 116/355 (32%), Positives = 163/355 (45%), Gaps = 34/355 (9%)
Query: 41 IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD---------TV 91
IVDT S+L WVQC PC C+ Q P+++P+SS SY + C S C L
Sbjct: 167 IVDTASELTWVQCAPCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALQLATGGTSGGAA 226
Query: 92 SCSSQQ----LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNEN 147
+C Q C+YT Y D S ++GVLA +R++ + D VFGCG +N G
Sbjct: 227 ACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSL--AGEVIDGFVFGCGTSNQGPPFGG 284
Query: 148 EMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV--SGGGVV 205
GL+GLGR++LSL SQ + Q G FSYCL +DSS + G+ S V + +V
Sbjct: 285 TSGLMGLGRSQLSLVSQTMDQFGG-VFSYCLPLKESDSS--GSLVIGDDSSVYRNSTPIV 341
Query: 206 STSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPK 264
S+VS +YFV L GI+VG + + S G ID+G T L
Sbjct: 342 YASMVSDPLQGPFYFVNLTGITVGG-----QEVESSGFSSGGGGGKAIIDSGTVITSLVP 396
Query: 265 DFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGI-APILTAHFDGGAKVPLIHTS 322
YN ++ + + P Q P C+ + + P L FDGG +V +
Sbjct: 397 SIYNAVKAEFLSQFAEYP-QAPGFSILDTCFNMTGLREVQVPSLKLVFDGGVEVEVDSGG 455
Query: 323 T--FIPPPVEGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
F+ V C AM P+ + I GN+ Q +L + +D V F C
Sbjct: 456 VLYFVSSDSSQV-CLAMAPLKSEYETNIIGNYQQKNLRVIFDTSGSQVGFAQETC 509
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 125/384 (32%), Positives = 187/384 (48%), Gaps = 32/384 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
++S VS +GEY M +GTPP I+DTGSDL W+QC+PC+ C++Q P Y+P S
Sbjct: 184 LESGVSLGSGEYFMDVFVGTPPK-HFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDS 242
Query: 73 SSYKELSCQSEQCHLLDTVS----CSSQ-QLCNYTYGYADSSLTKGVLATERITF----- 122
SS++ +SC +C L+ + C ++ Q C Y Y Y D S T G A E T
Sbjct: 243 SSFRNISCHDPRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTP 302
Query: 123 -GNSN-NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
G S +NV+FGCGH N G+F+ L+GLG+ LS ASQ+ S G + FSYCLV
Sbjct: 303 NGKSELKHVENVMFGCGHWNRGLFHGAAG-LLGLGKGPLSFASQMQSLYGQS-FSYCLVD 360
Query: 181 FHTDSSITSKMYFGNGSE-VSGGGVVSTSLVSKED---KTYYFVTLEGISVGN-LSNSSK 235
++++S++SK+ FG E +S + TS +D T+Y+V + + V + + +
Sbjct: 361 RNSNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPE 420
Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK 295
+ +S GA G ID+G T + Y ++E IK + + CY
Sbjct: 421 ETWHLSSEGA---GGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVEGLPPLKPCYN 477
Query: 296 TPSMAGIA-PILTAHFDGGA--KVPLIHTSTFIPPPVEGVFCFAM--QPIDGDVGIFGNF 350
+ + P F GA P+ + I P V C A+ P + I GN+
Sbjct: 478 VSGIEKMELPDFGILFADGAVWNFPVENYFIQIDP---DVVCLAILGNPRSA-LSIIGNY 533
Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
Q + I YD + + P C
Sbjct: 534 QQQNFHILYDMKKSRLGYAPMKCA 557
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 121/363 (33%), Positives = 179/363 (49%), Gaps = 34/363 (9%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
+ + G Y M FSIGTPP ++ + DTGSDL+W +C C +C Q P Y P SSS+
Sbjct: 74 QLDSGGGAYDMTFSIGTPPQ-ELSALADTGSDLIWAKCGACTRCVPQGSPSYYPNKSSSF 132
Query: 76 KELSCQSEQCHLLDTVSCSSQQL-CNYTYGYADSS----LTKGVLATERITFGNSNNFFD 130
+L C C L + CS+ C+Y Y Y +S T+G L +E T G ++
Sbjct: 133 SKLPCSGSLCSDLPSSQCSAGGAECDYKYSYGLASDPHHYTQGYLGSETFTLG--SDAVP 190
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
+ FGC + + GLVGLGR LSL +SQL FSYCL +D++ TS
Sbjct: 191 GIGFGC-TTMSEGGYGSGSGLVGLGRGPLSL----VSQLNVGAFSYCLT---SDAAKTSP 242
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
+ FG+G+ ++G GV ST L+ + YY V LE IS+G + ++G S G
Sbjct: 243 LLFGSGA-LTGAGVQSTPLL-RTSTYYYTVNLESISIGAAT----------TAGTGSSGI 290
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHF 310
+F D+G L + Y +E V + R G ++C++T + P + HF
Sbjct: 291 IF-DSGTTVAFLAEPAYTLAKEAVLSQTTNLTMASGRDGYEVCFQTS--GAVFPSMVLHF 347
Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
DGG + T + + V C+ +Q + I GN Q + I YD + M+SF+P
Sbjct: 348 DGGDMD--LPTENYFGAVDDSVSCWIVQK-SPSLSIVGNIMQMNYHIRYDVEKSMLSFQP 404
Query: 371 TDC 373
+C
Sbjct: 405 ANC 407
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 144 bits (363), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 115/381 (30%), Positives = 171/381 (44%), Gaps = 58/381 (15%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSY 75
+S +G Y +K +G+PP I+DTGS L W+QC PCV C+ QV P++ P++S++Y
Sbjct: 113 LSIGSGNYYLKLGLGSPPKYYTM-ILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTY 171
Query: 76 KELSCQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
+ L C S +C LL + C++ +C YT Y D+S + G L+ + +T S
Sbjct: 172 RPLYCSSSECSLLKAATLNDPLCTASGVCVYTASYGDASYSMGYLSRDLLTLTPSQT-LP 230
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
+ +GCG +N G+F + G+VGL R +LS+ +Q+ + G FSYCL P T
Sbjct: 231 SFTYGCGQDNEGLFGK-AAGIVGLARDKLSMLAQLSPKYG-YAFSYCL-PTSTS------ 281
Query: 191 MYFGNGSEVSGGGVVSTSLVS------------KEDKTYYFVTLEGISVGNLSNSSKLIP 238
SGGG +S +S ++ + YF+ L I+V
Sbjct: 282 ---------SGGGFLSIGKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAG--------- 323
Query: 239 YYNSSGAISKGNM---FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCY 294
G + G ID+G T LP Y L E + Q P C+
Sbjct: 324 --RPVGVAAAGYQVPTIIDSGTVVTRLPISIYAALREAFVKIMSRRYEQAPAYSILDTCF 381
Query: 295 K--TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQ 352
K SM+G AP + F GGA + L + I +G+ C A + I GN Q
Sbjct: 382 KGSLKSMSG-APEIRMIFQGGADLSLRAPNILIEAD-KGIACLAFAS-SNQIAIIGNHQQ 438
Query: 353 SDLFIGYDFDSQMVSFKPTDC 373
I YD + + F P C
Sbjct: 439 QTYNIAYDVSASKIGFAPGGC 459
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 120/396 (30%), Positives = 172/396 (43%), Gaps = 52/396 (13%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-YKQVKPIYNPAS 71
V S S+ +G+Y + IG PP + I DTGSDL+WV+C C C + ++ P
Sbjct: 72 VVSGASSGSGQYFVDLRIGQPPQ-SLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRH 130
Query: 72 SSSYKELSCQSEQCHLL----DTVSCSSQQL---CNYTYGYADSSLTKGVLATERITFGN 124
SS++ C C L+ C+ ++ C Y YGYAD SLT G+ A E +
Sbjct: 131 SSTFSPAHCYDPVCRLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKT 190
Query: 125 SNNF---FDNVVFGCGHNNTG------VFNENEMGLVGLGRTRLSLASQILSQLGANKFS 175
S+ +V FGCG +G FN G++GLGR +S ASQ+ + G NKFS
Sbjct: 191 SSGKEAKLKSVAFGCGFRISGQSVSGTSFN-GANGVMGLGRGPISFASQLGRRFG-NKFS 248
Query: 176 YCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSK 235
YCL+ + TS + G+G + + L + T+Y+V L+ + V N +K
Sbjct: 249 YCLMDYTLSPPPTSYLIIGDGGDAVSKLFFTPLLTNPLSPTFYYVKLKSVFV----NGAK 304
Query: 236 L-----IPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS 290
L I + SG G +D+G L Y + V+ IKL + G
Sbjct: 305 LRIDPSIWEIDDSG---NGGTVMDSGTTLAFLADPAYRLVIAAVKQRIKLPNADELTPGF 361
Query: 291 QLCYKTPSMA---GIAPILTAHFDGGAKVPLIHTSTFIPPPV-------EGVFCFAMQPI 340
LC + I P L F GGA F+PPP E + C A+Q +
Sbjct: 362 DLCVNVSGVTKPEKILPRLKFEFSGGA--------VFVPPPRNYFIETEEQIQCLAIQSV 413
Query: 341 DGDVG--IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
D VG + GN Q +D D + F C
Sbjct: 414 DPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 449
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 113/384 (29%), Positives = 178/384 (46%), Gaps = 35/384 (9%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSS 74
V A +Y+ ++ IG PP I+DTGS+L+W QC C C+ Q Y+P+ S +
Sbjct: 64 VHWAESQYIAEYLIGDPPQ-QAEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYDPSRSRT 122
Query: 75 YKELSCQSEQCHLLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFG-NSNNFFDNV 132
+ ++C C L C+ + C Y + + GVL TE TF S N ++
Sbjct: 123 ARPVACNDTACALGSETRCARDNKACAVLTAYG-AGVIGGVLGTEAFTFQPQSENV--SL 179
Query: 133 VFGC--GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
FGC T + G++GLGR LSL +SQLG NKFSYCL P+ + S+ TS+
Sbjct: 180 AFGCIAATRLTPGSLDGASGIIGLGRGNLSL----VSQLGDNKFSYCLTPYFSQSTNTSR 235
Query: 191 MYFGNGSEVSGGGVVSTSL--VSKED----KTYYFVTLEGISVGN--LSNSSKLIPYYNS 242
++ G + +S GG +TS+ + D T+Y++ L GI+VG+ L+
Sbjct: 236 LFVGASAGLSSGGAPATSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAFDLRQV 295
Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI--KLTPYQDPRLGSQLCYKTP--S 298
+ + G + ID+G+P T L Y L +++ + + P G LC
Sbjct: 296 ATGLWAGTL-IDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEGLDLCAAVAHGD 354
Query: 299 MAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG--------DVGIFGNF 350
+ + P L HF G + + P + C + G + I GN+
Sbjct: 355 VGKLVPPLVLHFGSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPMNETTIIGNY 414
Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
Q D+ + YD + M+SF+P DC+
Sbjct: 415 MQQDMHLLYDLEKGMLSFQPADCS 438
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 120/381 (31%), Positives = 177/381 (46%), Gaps = 43/381 (11%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSC 80
EYV+ IGTPP + + DTGSDL WVQCLPC CY Q +P+++P+ SS+Y ++ C
Sbjct: 121 EYVVTIGIGTPPR-NFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSSTYVDVPC 179
Query: 81 QSEQCHL--LDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF---FDNVVFG 135
+ +CH+ + C + C Y+ Y D S T G LA E T + VVFG
Sbjct: 180 SAPECHIGGVQQTRCGATS-CEYSVKYGDESETHGSLAEETFTLSPPSPLAPAATGVVFG 238
Query: 136 CGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQL--GANKFSYCLVPFHTDSSITSK 190
C H VFN+ M GL+GLGR S+ SQ + G FSYCL P S T
Sbjct: 239 CSHEYISVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVFSYCLPP---RGSSTGY 295
Query: 191 MYFGNGSEVSG---GGVVSTSLVS--KEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
+ G G+ + T L++ + ++ Y V L G+SV + ++ IP + A
Sbjct: 296 LTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSV---NGAAVDIP----ASA 348
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCYKTPSMAGI 302
S G + ID+G T +P Y L ++ R + K+ P +L CY +
Sbjct: 349 FSLGAV-IDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKL-LDTCYDVTGQDVV 406
Query: 303 -APILTAHFDGGAKVPLIHTSTFIPPPVEG-------VFCFAMQPID-GDVGIFGNFAQS 353
AP + F GGA++ + + + P E + C A P + + I GN Q
Sbjct: 407 TAPRVALEFGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFLPTNSAGLVIVGNMQQR 466
Query: 354 DLFIGYDFDSQMVSFKPTDCT 374
+ +D D + F P C+
Sbjct: 467 AYNVVFDVDGGRIGFGPNGCS 487
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 119/383 (31%), Positives = 175/383 (45%), Gaps = 38/383 (9%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
A EY + +GTP +++ I+DTGSD+ W+QC+PC C ++P +NP SSS+ +L
Sbjct: 134 AGLEYYVPLQLGTP-AVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLP 192
Query: 80 CQSEQC-HLLDTVS--CS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD----- 130
C S C ++ V CS S + C ++ Y D SL+ G+LA E I GN+ NF D
Sbjct: 193 CASSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIA-GNTPNFGDGEPVK 251
Query: 131 --NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
N+ GC + GL+G+ R +S SQ+ S+ A KFS+C + +
Sbjct: 252 LSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRY-ARKFSHCFPDKIAHLNSS 310
Query: 189 SKMYFGNGSEVSG----GGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
++FG +S +V V YY+V L GISV + S+L P + +
Sbjct: 311 GLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISV----DESRL-PLSHKNF 365
Query: 245 AISK----GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS-- 298
I K G ID+G T L K + + + D G CY S
Sbjct: 366 DIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGT 425
Query: 299 ---MAGIAPILTAHFDGGAKVPLIHTSTFIP---PPVEGVFCFAMQPIDGDV--GIFGNF 350
+ I P +T HF GG V L S IP + C A Q + GD+ I GN+
Sbjct: 426 AALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQ-MSGDIPFNIIGNY 484
Query: 351 AQSDLFIGYDFDSQMVSFKPTDC 373
Q +L++ YD + + P C
Sbjct: 485 QQQNLWVEYDLEKLRLGIAPAQC 507
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 114/382 (29%), Positives = 167/382 (43%), Gaps = 32/382 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPA 70
+ +S G YV+ +GTP D+ + DTGSDL WVQC PC CY Q P++ P+
Sbjct: 74 AERGISVGTGNYVVSVGLGTP-ARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPS 132
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSS---QQLCNYTYGYADSSLTKGVLATERITFG---- 123
SSS++ + C +C SCSS C Y Y D S T G L + +T G
Sbjct: 133 SSSTFSAVRCGEPECPRARQ-SCSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPS 191
Query: 124 -----NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL 178
N++N VFGCG NNTG+F + + GL GLGR ++SL+SQ + G FSYCL
Sbjct: 192 TNASENNSNKLPGFVFGCGENNTGLFGKAD-GLFGLGRGKVSLSSQAAGKYG-EGFSYCL 249
Query: 179 VPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP 238
+ S+ + G + + L ++Y+V L GI V + P
Sbjct: 250 P--SSSSNAHGYLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRP 307
Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD-PRLGS-QLCYKT 296
A+ + +D+G T L Y+ L +A+ Y+ PRL CY
Sbjct: 308 ------ALWPAGLIVDSGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSILDTCYDF 361
Query: 297 PSMAGIA---PILTAHFDGGAKVPLIHTST-FIPPPVEGVFCFAMQPIDGDVGIFGNFAQ 352
+ A P + F GGA + + + ++ + FA GI GN Q
Sbjct: 362 TAHANATVSIPAVALVFAGGATISVDFSGVLYVAKVAQACLAFAPNGNGRSAGILGNTQQ 421
Query: 353 SDLFIGYDFDSQMVSFKPTDCT 374
+ + YD Q + F C+
Sbjct: 422 RTVAVVYDVGRQKIGFAAKGCS 443
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 117/365 (32%), Positives = 172/365 (47%), Gaps = 22/365 (6%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQ-CYKQVKPIYNPASS 72
S + G YV+ +GTP + Y +V DTGSD WVQC PCV CY+Q + +++PA S
Sbjct: 171 SGRALGTGNYVVTVGLGTP--VSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARS 228
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+Y +SC + C L+ CS C Y Y D S + G A + +T +S +
Sbjct: 229 STYANVSCAAPACSDLNIHGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTL-SSYDAVKGF 286
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGCG N G+F E GL+GLGR + SL Q + G F++CL S+ T +
Sbjct: 287 RFGCGERNEGLFGE-AAGLLGLGRGKTSLPVQTYDKYG-GVFAHCLP---ARSTGTGYLD 341
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
FG GS + ++T +++ T+Y+V + GI VG +L+ S +
Sbjct: 342 FGAGSLAAASARLTTPMLTDNGPTFYYVGMTGIRVGG-----QLLSIPQS--VFATAGTI 394
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAH 309
+D+G T LP Y+ L A+ Y+ S L CY M+ +A P ++
Sbjct: 395 VDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLL 454
Query: 310 FDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
F GGA++ + + V FA GDVGI GN + YD ++V F
Sbjct: 455 FQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 514
Query: 369 KPTDC 373
P C
Sbjct: 515 YPGAC 519
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 114/349 (32%), Positives = 159/349 (45%), Gaps = 24/349 (6%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSSSYKELS 79
+G Y + +GTP D+ I DTGSDL W QC PC + CYKQ I++P+ S+SY ++
Sbjct: 143 SGNYFVVVGLGTPKR-DLSLIFDTGSDLTWTQCEPCARSCYKQQDVIFDPSKSTSYSNIT 201
Query: 80 CQSEQCHLLDTVS-----CS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
C S C L T + CS S + C Y Y DSS + G + ER+T + + DN +
Sbjct: 202 CTSALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSVGYFSRERLTV-TATDVVDNFL 260
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
FGCG NN G+F GL+GLGR +S Q ++ FSYCL + SS T + F
Sbjct: 261 FGCGQNNQGLFG-GSAGLIGLGRHPISFVQQTAAKY-RKIFSYCL---PSTSSSTGHLSF 315
Query: 194 GNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
G + S +S+ ++Y + + I+VG + +P SS S G I
Sbjct: 316 GPAATGRYLKYTPFSTISR-GSSFYGLDITAIAVGGVK-----LPV--SSSTFSTGGAII 367
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDG 312
D+G T LP Y L R + P CY + P + F G
Sbjct: 368 DSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSILDTCYDLSGYKVFSIPTIEFSFAG 427
Query: 313 GAKVPLIHTST-FIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
G V L F+ + FA D DV I+GN Q + + YD
Sbjct: 428 GVTVKLPPQGILFVASTKQVCLAFAANGDDSDVTIYGNVQQRTIEVVYD 476
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 112/370 (30%), Positives = 166/370 (44%), Gaps = 34/370 (9%)
Query: 24 YVMKFSIGT----PPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
YV ++G P ++ IVDTGSDL WVQC PC CY Q P+++PA S++Y +
Sbjct: 185 YVTTIALGGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVR 244
Query: 80 CQSEQCHL-LDTV-----SC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
C + C L SC + C Y Y D S ++GVLAT+ + G ++ D
Sbjct: 245 CNASACAASLKAATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVALGGAS--LDGF 302
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
VFGCG +N G+F GL+GLGRT LSL SQ + G FSYCL P T + +
Sbjct: 303 VFGCGLSNRGLFG-GTAGLMGLGRTELSLVSQTALRYG-GVFSYCL-PATTSGDASGSLS 359
Query: 193 FGN--GSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
G S + V T +++ + +YF+ + G +VG + +++ +
Sbjct: 360 LGGDASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQ---------GLGAS 410
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGI-APIL 306
N+ ID+G T L Y + + Y S L CY + P+L
Sbjct: 411 NVLIDSGTVITRLAPSVYRGVRAEFTRQFAAAGYPTAPGFSILDTCYDLTGHDEVKVPLL 470
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDS 363
T +GGA+V + +G C AM + + I GN+ Q + + YD
Sbjct: 471 TLRLEGGAEVTVDAAGMLFVVRKDGSQVCLAMASLSYEDQTPIIGNYQQKNKRVVYDTVG 530
Query: 364 QMVSFKPTDC 373
+ F DC
Sbjct: 531 SRLGFADEDC 540
>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
Length = 396
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 106/363 (29%), Positives = 170/363 (46%), Gaps = 32/363 (8%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP-CVQCYKQVKPIYNPASSSSYKELSCQS 82
YV+ +IGTPP + I+D G +L+W QC C +C+KQ P+++ +SS+++ C +
Sbjct: 51 YVVNLTIGTPPQ-PVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEPCGA 109
Query: 83 EQCHLLDTVSCSSQQLCNYTYGYADS-SLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
C + T SC+ Y + S T G + T+ + G + + FGC +
Sbjct: 110 AVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGTAAT--ARLAFGCAVASE 167
Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
G VGLGRT LSLA+Q + A FSYCL P D+ +S ++ G ++++G
Sbjct: 168 MDTMWGSSGSVGLGRTNLSLAAQ----MNATAFSYCLAP--PDTGKSSALFLGASAKLAG 221
Query: 202 GG--VVSTSLVSKEDKTY------YFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
G +T V + Y + LE I GN ++ +P S + +
Sbjct: 222 AGKGAGTTPFVKTSTPPHSGLSRSYLLRLEAIRAGN---ATIAMPQ-------SGNTIMV 271
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGG 313
T P T L Y L + V +A+ P P LC+ S +G AP L F GG
Sbjct: 272 STATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKASASGGAPDLVLAFQGG 331
Query: 314 AKVPLIHTSTFIPPPVEGVFCFAM--QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
A++ + S+++ C A+ P G V I G+ Q ++ + +D D + +SF+P
Sbjct: 332 AEM-TVPVSSYLFDAGNDTACVAILGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPA 390
Query: 372 DCT 374
DC+
Sbjct: 391 DCS 393
>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
Length = 394
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 105/365 (28%), Positives = 173/365 (47%), Gaps = 36/365 (9%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
YV F+IGTPP ++D +L+W QC C +C++Q P+++P +S++Y+ C +
Sbjct: 51 YVANFTIGTPPQ-PASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTP 109
Query: 84 QCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
C + D+ +CS +C Y ++ T G + T+ G + ++ FGC +
Sbjct: 110 LCESIPSDSRNCSG-NVCAY-QASTNAGDTGGKVGTDTFAVGTAKA---SLAFGCVVASD 164
Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
G+VGLGRT SL ++Q G FSYCL P D+ S ++ G+ ++++G
Sbjct: 165 IDTMGGPSGIVGLGRTPWSL----VTQTGVAAFSYCLAPH--DAGKNSALFLGSSAKLAG 218
Query: 202 GG-VVSTSLVS-----KEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
GG ST V+ + YY V LEG+ G+ +IP S + +DT
Sbjct: 219 GGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGD-----AMIPLPPSGSTV-----LLDT 268
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
+P + L Y +++ V A+ P P LC+ +G AP L F GGA
Sbjct: 269 FSPISFLVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFTFRGGAA 328
Query: 316 VPLIHTSTFIPPPVEGVFCFAMQP-----IDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
+ + S ++ G C AM ++ + G+ Q ++ +D D + +SF+P
Sbjct: 329 M-TVAASNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEP 387
Query: 371 TDCTK 375
DCTK
Sbjct: 388 ADCTK 392
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 109/363 (30%), Positives = 178/363 (49%), Gaps = 28/363 (7%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
+++ SIG+PP+ + +VDTGS L+WVQCLPC+ C++Q ++P S S+K L C
Sbjct: 104 FLVNLSIGSPPVTQLV-VVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFP 162
Query: 84 QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGHNN 140
+ ++ C+ Y Y ++G+LA E + F + N+ FGCGH N
Sbjct: 163 GYNYINGYKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGKIKKSNITFGCGHMN 222
Query: 141 TGVFNENEM-GLVGLGR-TRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
N++ G+ GLG +++A+Q+ NKFSYC+ + + + G GS
Sbjct: 223 IKTNNDDAYNGVFGLGAYPHITMATQL-----GNKFSYCIGDINNPLYTHNHLVLGQGSY 277
Query: 199 VSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
+ G ST L + +Y+VTL+ ISVG S + K+ P + G + ID+G
Sbjct: 278 IEGD---STPL--QIHFGHYYVTLQSISVG--SKTLKIDPNAFKISSDGSGGVLIDSGMT 330
Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDP--RLGSQLCYK---TPSMAGIAPILTAHFDGG 313
T L + L +++ + +K + P R LC+K + + G P +T HF GG
Sbjct: 331 YTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGLCFKGVVSRDLVGF-PAVTFHFAGG 389
Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGD---VGIFGNFAQSDLFIGYDFDSQMVSFKP 370
A + L S F + FC A+ P + + + + G AQ + +G+D + V F+
Sbjct: 390 ADLVLESGSLFRQHGGD-RFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRR 448
Query: 371 TDC 373
DC
Sbjct: 449 IDC 451
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 123/367 (33%), Positives = 166/367 (45%), Gaps = 34/367 (9%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELSC 80
EYV+ IGTP + I ++DTGSDL WVQC PC +CY Q P+++P+SSSSY + C
Sbjct: 117 EYVVTLGIGTPAVQQIV-LIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPC 175
Query: 81 QSEQCHLLDTVS----CSS--QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
S+ C L + C+S LC Y Y + + T GV +TE +T + F
Sbjct: 176 DSDACRKLAAGAYGHGCTSGAAALCEYGIEYGNRATTTGVYSTETLTL-KPGVVVADFGF 234
Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
GCG + G + + + GL+GLG SL SQ SQ G FSYCL P + + G
Sbjct: 235 GCGDHQHGPYEKFD-GLLGLGGAPESLVSQTSSQFG-GPFSYCLPPTSGGAGF---LALG 289
Query: 195 NGSEVSGGGVVSTSLVSKEDK-----TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
+ S + L + + T+Y VTL GISVG P A S G
Sbjct: 290 APNSSSSSTAAAGFLFTPMRRIPSVPTFYVVTLTGISVGG-------APLAVPPSAFSSG 342
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGI-APIL 306
M ID+G T LP Y L R+A+ P G+ L CY + P +
Sbjct: 343 -MVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGAVLDTCYDFTGHTNVTVPTI 401
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
F GGA + L + + V+G FA D +GI GN Q + YD V
Sbjct: 402 ALTFSGGATIDLATPAGVL---VDGCLAFAGAGTDDTIGIIGNVNQRTFEVLYDSGKGTV 458
Query: 367 SFKPTDC 373
F+ C
Sbjct: 459 GFRAGAC 465
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 113/370 (30%), Positives = 162/370 (43%), Gaps = 19/370 (5%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
VS +GEY+++ IG+PPL + + + DTGSD++WVQC PC CY Q P+++PA+S+S+
Sbjct: 116 VSHGSGEYLVRVGIGSPPL-EQHLVADTGSDVIWVQCSPCSDCYAQGDPLFDPANSASFS 174
Query: 77 ELSCQSEQCHL----LDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
+ C S C + C Y Y D S T GVLA E +T + V
Sbjct: 175 PVPCNSGVCRAAARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTL-DGGTEVQGV 233
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH N G+F E GL+GLG +SL Q L FSYCL +++ S
Sbjct: 234 AMGCGHENRGLFAE-AAGLLGLGWGPMSLVGQ-LGGAAGGAFSYCLAGYYSGEGSGSGSL 291
Query: 193 FGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
+ + G V LV D ++Y+V + G+ V +L G G +
Sbjct: 292 VLGREDAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVAG--ERLQLQDGLFDLGDDGGGGV 349
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGI-APILTAH 309
+DTG T LP + Y L A + + P + CY A + P + +
Sbjct: 350 VMDTGTAVTRLPAEAYAALRGAFAGAFEEGAPRAPGVSLFDTCYDLSGYASVRVPTVALY 409
Query: 310 FDG------GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
F G A + L + +P G +C A + I GN Q + I D S
Sbjct: 410 FGGGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAAVASGPSILGNIQQQGIEITVDSAS 469
Query: 364 QMVSFKPTDC 373
V F P C
Sbjct: 470 GYVGFGPATC 479
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 119/362 (32%), Positives = 171/362 (47%), Gaps = 33/362 (9%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSC 80
EYV+ GTP + + ++DTGSD+ WVQC PC +CY Q P+++P+ SS+Y ++C
Sbjct: 130 EYVVTLGFGTPSVPQVL-LMDTGSDVSWVQCTPCNSTKCYPQKDPLFDPSKSSTYAPIAC 188
Query: 81 QSEQCHLLDTV---SCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
++ C L C+S C Y+ YAD S ++GV + E +T D FGC
Sbjct: 189 NTDACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGVYSNETLTLAPGITVED-FHFGC 247
Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
G + G ++ + GL+GLG +SL Q S G FSYCL ++++ + G+
Sbjct: 248 GRDQRGPSDKYD-GLLGLGGAPVSLVVQTSSVYGG-AFSYCLPALNSEAGF---LVLGSP 302
Query: 197 SEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
+ V T + T+Y VT+ GISVG P + A +G M ID+
Sbjct: 303 PSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGK-------PLHIPQSAF-RGGMIIDS 354
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGA 314
G T LP+ YN LE +R A+K P P CY + I P + F GGA
Sbjct: 355 GTVDTELPETAYNALEAALRKALKAYPLV-PSDDFDTCYNFTGYSNITVPRVAFTFSGGA 413
Query: 315 KVPLIHTSTFIPPPVEGVFCFAMQ---PIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
+ L +P + C A Q P DG +GI GN Q L + YD V F+
Sbjct: 414 TIDLD-----VPNGILVNDCLAFQESGPDDG-LGIIGNVNQRTLEVLYDAGRGNVGFRAG 467
Query: 372 DC 373
C
Sbjct: 468 AC 469
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 110/356 (30%), Positives = 154/356 (43%), Gaps = 24/356 (6%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EYV+ IG+P + + DTGSD+ WVQC PC QC+ +V +++P++SS+Y SC S
Sbjct: 130 EYVITVGIGSPAVTQTMSM-DTGSDVSWVQCKPCSQCHSEVDSLFDPSASSTYSPFSCSS 188
Query: 83 EQCHLLDTVS----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
C L CSS Q C Y Y D S T G +++ +T G +N FGC
Sbjct: 189 AACVQLSQSQQGNGCSSSQ-CQYIVSYVDGSSTTGTYSSDTLTLG--SNAIKGFQFGCSQ 245
Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
+ +G F++ GL+GLG SL SQ G FSYCL P + S + G+
Sbjct: 246 SESGGFSDQTDGLMGLGGDAQSLVSQTAGTFG-KAFSYCLPP-----TPGSSGFLTLGAA 299
Query: 199 VSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
G V + L S + TYY V LE I VG N ++ +D+G
Sbjct: 300 SRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQ--------LNIPTSVFSAGSVMDSGTV 351
Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVP 317
T LP Y+ L + +K P P C+ + ++ P + F GGA V
Sbjct: 352 ITRLPPTAYSALSSAFKAGMKKYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVN 411
Query: 318 LIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
L + FA D +G GN Q + YD V F+ C
Sbjct: 412 LDFNGIMLELD-NWCLAFAANSDDSSLGFIGNVQQRTFEVLYDVGGGAVGFRAGAC 466
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 107/362 (29%), Positives = 167/362 (46%), Gaps = 26/362 (7%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
Y++ +G+ ++ IVDTGSDL WVQC PC CY Q P++ P++S SY+ + C S
Sbjct: 122 YIVTMGLGSQ---NMSVIVDTGSDLTWVQCEPCRSCYNQNGPLFKPSTSPSYQPILCNST 178
Query: 84 QCHLLDTVSC----SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
C L+ +C S+ C+Y Y D S T G L E++ FG + N VFGCG N
Sbjct: 179 TCQSLELGACGSDPSTSATCDYVVNYGDGSYTSGELGIEKLGFGGIS--VSNFVFGCGRN 236
Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
N G+F GL+GLGR+ LS+ SQ + G FSYCL P + + + GN S V
Sbjct: 237 NKGLFG-GASGLMGLGRSELSMISQTNATFGG-VFSYCL-PSTDQAGASGSLVMGNQSGV 293
Query: 200 SGGG---VVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
+ L + + +Y + L GI VG +S + + + G + +D+G
Sbjct: 294 FKNVTPIAYTRMLPNLQLSNFYILNLTGIDVGGVS-------LHVQASSFGNGGVILDSG 346
Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAK 315
+ L Y L+ + P C+ + P ++ +F+G A+
Sbjct: 347 TVISRLAPSVYKALKAKFLEQFSGFPSAPGFSILDTCFNLTGYDQVNIPTISMYFEGNAE 406
Query: 316 VPLIHTSTF-IPPPVEGVFCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
+ + T F + C A+ + + ++GI GN+ Q + + YD V F
Sbjct: 407 LNVDATGIFYLVKEDASRVCLALASLSDEYEMGIIGNYQQRNQRVLYDAKLSQVGFAKEP 466
Query: 373 CT 374
CT
Sbjct: 467 CT 468
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 123/369 (33%), Positives = 173/369 (46%), Gaps = 33/369 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPAS 71
+S ++ +G Y++ IGTP D+ + DTGSDL W QC PC+ CY Q +P +NP+S
Sbjct: 121 AKSGITLGSGNYIVTIGIGTPKH-DLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSS 179
Query: 72 SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
SS+Y+ +SC S C D SCS+ C Y+ Y D S T+G LA E+ T NS + ++
Sbjct: 180 SSTYQNVSCSSPMCE--DAESCSASN-CVYSIVYGDKSFTQGFLAKEKFTLTNS-DVLED 235
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
V FGCG NN G+F+ L A + N FSYCL F ++S T +
Sbjct: 236 VYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTY--NNIFSYCLPSFTSNS--TGHL 291
Query: 192 YFGNG--SEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP-YYNSSGAISK 248
FG+ SE V T + S Y + + GISVG+ + P +++ GAI
Sbjct: 292 TFGSAGISE----SVKFTPISSFPSAFNYGIDIIGISVGD--KELAITPNSFSTEGAI-- 343
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGIA-P 304
ID+G T LP Y L + K++ Y+ G L CY + + P
Sbjct: 344 ----IDSGTVFTRLPTKVYAELRSVFKE--KMSSYKSTS-GYGLFDTCYDFTGLDTVTYP 396
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
+ F G V L + +P + V C A D IFGN Q+ L + YD
Sbjct: 397 TIAFSFAGSTVVELDGSGISLPIKISQV-CLAFAGNDDLPAIFGNVQQTTLDVVYDVAGG 455
Query: 365 MVSFKPTDC 373
V F P C
Sbjct: 456 RVGFAPNGC 464
>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 104/365 (28%), Positives = 173/365 (47%), Gaps = 36/365 (9%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
YV F+IGTPP ++D +L+W QC C +C++Q P+++P +S++Y+ C +
Sbjct: 51 YVANFTIGTPPQ-PASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTP 109
Query: 84 QCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
C + D+ +CS +C Y ++ T G + T+ G + ++ FGC +
Sbjct: 110 LCESIPSDSRNCSG-NVCAY-QASTNAGDTGGKVGTDTFAVGTAKA---SLAFGCVVASD 164
Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
G+VGLGRT SL ++Q G FSYCL P D+ S ++ G+ ++++G
Sbjct: 165 IDTMGGPSGIVGLGRTPWSL----VTQTGVAAFSYCLAPH--DAGRNSALFLGSSAKLAG 218
Query: 202 GG-VVSTSLVS-----KEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
GG ST V+ + YY V LEG+ G+ +IP S + +DT
Sbjct: 219 GGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGD-----AMIPLPPSGSTV-----LLDT 268
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
+P + L Y +++ V A+ P P LC+ +G AP L F GGA
Sbjct: 269 FSPISFLVDGAYQAVKKAVTAAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFTFRGGAA 328
Query: 316 VPLIHTSTFIPPPVEGVFCFAMQP-----IDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
+ + + ++ G C AM ++ + G+ Q ++ +D D + +SF+P
Sbjct: 329 M-TVPATNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEP 387
Query: 371 TDCTK 375
DCTK
Sbjct: 388 ADCTK 392
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 126/378 (33%), Positives = 171/378 (45%), Gaps = 38/378 (10%)
Query: 11 NVVQSNVSTAN--GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIY 67
N +++ V T + G Y + +GTP D + DTGSDL W QC PC C+ Q +
Sbjct: 117 NEMKTRVPTTHFGGGYAVTVGLGTPKK-DFSLLFDTGSDLTWTQCEPCSGGCFPQNDEKF 175
Query: 68 NPASSSSYKELSCQSEQCHLLDTVS---CSSQQLCNYTYGYADSSLTKGVLATERITFGN 124
+P S+SYK LSC SE C + S CSS C Y Y + T G LATE +T
Sbjct: 176 DPTKSTSYKNLSCSSEPCKSIGKESAQGCSSSNSCLYGVKYG-TGYTVGFLATETLTITP 234
Query: 125 SNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
S + F+N V GCG N G F+ GL+GLGR+ ++L SQ S N FSYCL
Sbjct: 235 S-DVFENFVIGCGERNGGRFS-GTAGLLGLGRSPVALPSQTSSTY-KNLFSYCL---PAS 288
Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP----YY 240
SS T + FG G + TS + + Y + + GISVG + +P +
Sbjct: 289 SSSTGHLSFGGGVSQAAKFTPITSKIPE----LYGLDVSGISVGG-----RKLPIDPSVF 339
Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
++G I ID+G T LP ++ L + + G Q CY A
Sbjct: 340 RTAGTI------IDSGTTLTYLPSTAHSALSSAFQEMMTNYTLTKGTSGLQPCYDFSKHA 393
Query: 301 G---IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP--IDGDVGIFGNFAQSDL 355
P ++ F+GG +V + + FI C A + D DV IFGN Q
Sbjct: 394 NDNITIPQISIFFEGGVEVDIDDSGIFIAANGLEEVCLAFKDNGNDTDVAIFGNVQQKTY 453
Query: 356 FIGYDFDSQMVSFKPTDC 373
+ YD MV F P C
Sbjct: 454 EVVYDVAKGMVGFAPGGC 471
>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
Length = 443
Score = 142 bits (357), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 85/209 (40%), Positives = 118/209 (56%), Gaps = 14/209 (6%)
Query: 30 IGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD 89
+G P L +YGI DTGS+L+W+QCLPC CY Q PI++PA S +Y+ +S S C+ +
Sbjct: 63 LGVPSTL-VYGIADTGSELIWLQCLPCTHCYNQTPPIFDPAESYTYETVSSDSPICNAVR 121
Query: 90 TVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV---VFGCGHNNTGVFN 145
+SC + C Y + Y D + TKG L+T+ F + V FGC H+
Sbjct: 122 RISCREGDKSCCYQHTYGDGTTTKGTLSTDVFAFEDPTRTIVEVGYLTFGCSHDTKARLK 181
Query: 146 ENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVV 205
++ G+VGL R SL +SQL KFSYC+V D S+MYFG+ + + GG
Sbjct: 182 GHQAGVVGLNRHPNSL----VSQLKVKKFSYCMV-IPDDHGSGSRMYFGSRAVILGG--- 233
Query: 206 STSLVSKEDKTYYFVTLEGISVGNLSNSS 234
T L+ K D ++YFVTL+GISVG S
Sbjct: 234 KTPLL-KGDYSHYFVTLKGISVGEEKGRS 261
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 58/112 (51%), Gaps = 5/112 (4%)
Query: 57 VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCS-SQQLCNYTYGYAD-SSLTKGV 114
QC+ Q PI++P+ SS+Y + + C+ +C ++ C Y Y S+ T+G
Sbjct: 332 AQCFNQTPPIFDPSKSSTYSTVPWDAPTCYQAGGYACHIDEEDCCYRISYGSGSTSTEGT 391
Query: 115 LATERITF-GNSNNFFD--NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLAS 163
++ + F N N D ++VFGC TG F E+G+VGL + LSL S
Sbjct: 392 ISIDAFAFEDNRQNMVDVXHLVFGCSDYTTGTFKGYEVGIVGLNQDSLSLVS 443
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 121/373 (32%), Positives = 177/373 (47%), Gaps = 36/373 (9%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPAS 71
+ +++ G YV+ +GTP D DTGSDL W QC PC+ C+ Q +P ++P +
Sbjct: 129 IPASIVPTGGAYVVTVGLGTPKK-DFTLSFDTGSDLTWTQCEPCLGGCFPQNQPKFDPTT 187
Query: 72 SSSYKELSCQSEQCHLLDTVSCSSQ----QLCNYTYGYADSSLTKGVLATERITFGNSNN 127
S+SYK +SC SE C L+ + +Q C Y Y S T G LATE + S++
Sbjct: 188 STSYKNVSCSSEFCKLIAEGNYPAQDCISNTCLYGIQYG-SGYTIGFLATETLAIA-SSD 245
Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
F N +FGC + G FN GL+GLGR+ ++L SQ ++ N FSYCL +S
Sbjct: 246 VFKNFLFGCSEESRGTFN-GTTGLLGLGRSPIALPSQTTNKY-KNLFSYCL-----PASP 298
Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
+S + G EVS ++ +S + K Y + GISV + +P +G+IS
Sbjct: 299 SSTGHLSFGVEVSQAA--KSTPISPKLKQLYGLNTVGISV-----RGRELPI---NGSIS 348
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG---IAP 304
+ ID+G T LP Y+ L R + + Q CY ++ P
Sbjct: 349 R--TIIDSGTTFTFLPSPTYSALGSAFREMMANYTLTNGTSSFQPCYDFSNIGNGTLTIP 406
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGV----FCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
++ F+GG +V + + I PV G+ FA D D IFGN+ Q + YD
Sbjct: 407 GISIFFEGGVEVEIDVSGIMI--PVNGLKEVCLAFADTGSDSDFAIFGNYQQKTYEVIYD 464
Query: 361 FDSQMVSFKPTDC 373
MV F P C
Sbjct: 465 VAKGMVGFAPKGC 477
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 119/362 (32%), Positives = 153/362 (42%), Gaps = 38/362 (10%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELSC 80
+YV+ S+GTP + VDTGSD+ WVQC PC C Q +++PA SS+Y + C
Sbjct: 142 QYVVTVSLGTPGVSQTV-EVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPC 200
Query: 81 QSEQCHLLDT--VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
++ C L CS Q C Y Y D S T GV ++ + N +FGCGH
Sbjct: 201 GADACSELRIYEAGCSGSQ-CGYVVSYGDGSNTTGVYGSDTLALAPGNT-VGTFLFGCGH 258
Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
G+F + GL+ LGR +SL SQ G FSYCL S ++ Y G
Sbjct: 259 AQAGMFAGID-GLLALGRQSMSLKSQAAGAYG-GVFSYCL-----PSKQSAAGYLTLGGP 311
Query: 199 VSGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
S G +T L++ T+Y V L GISVG + + G +DTG
Sbjct: 312 TSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAF--------AGGTVVDTGT 363
Query: 258 PPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGIA--PILTAHFD 311
T LP Y L R AI PY P + CY S G+ P + F
Sbjct: 364 VITRLPPTAYAALRSAFRGAIA--PYGYPSAPANGILDTCYDF-SRYGVVTLPTVALTFS 420
Query: 312 GGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
GGA + L G FA DGD I GN Q + FD V F P
Sbjct: 421 GGATLALEAPGIL----SSGCLAFAPNGGDGDAAILGNVQQRSFAV--RFDGSTVGFMPG 474
Query: 372 DC 373
C
Sbjct: 475 AC 476
>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 396
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 105/363 (28%), Positives = 171/363 (47%), Gaps = 32/363 (8%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP-CVQCYKQVKPIYNPASSSSYKELSCQS 82
YV+ +IGTPP + I+D G +L+W QC C +C+KQ P+++ +SS+++ C +
Sbjct: 51 YVVNLTIGTPPQ-PVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEPCGA 109
Query: 83 EQCHLLDTVSCSSQQLCNYTYGYADS-SLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
C + T SC+ Y + S T G + T+ + G + + FGC +
Sbjct: 110 AVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGTAAT--ARLAFGCAVASE 167
Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
G VGLGRT LSLA+Q + A FSYCL P D+ +S ++ G ++++G
Sbjct: 168 MDTMWGSSGSVGLGRTNLSLAAQ----MNATAFSYCLAP--PDTGKSSALFLGASAKLAG 221
Query: 202 GG--------VVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
G V +++ + Y + LE I GN ++ +P S + +
Sbjct: 222 AGKGAGTTPFVKTSTPPNSGLSRSYLLRLEAIRAGN---ATIAMPQ-------SGNTITV 271
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGG 313
T P T L Y L + V +A+ P P LC+ S +G AP L F GG
Sbjct: 272 STATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKASASGGAPDLVLAFQGG 331
Query: 314 AKVPLIHTSTFIPPPVEGVFCFAM--QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
A++ + S+++ C A+ P G V I G+ Q ++ + +D D + +SF+P
Sbjct: 332 AEM-TVPVSSYLFDAGNDTACVAILGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPA 390
Query: 372 DCT 374
DC+
Sbjct: 391 DCS 393
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 115/349 (32%), Positives = 160/349 (45%), Gaps = 25/349 (7%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSSSYKELS 79
+G Y + +GTP D+ I DTGSDL W QC PC + CYKQ I++P+ S+SY ++
Sbjct: 142 SGNYFVVVGLGTPKR-DLSLIFDTGSDLTWTQCEPCARSCYKQQDAIFDPSKSTSYSNIT 200
Query: 80 CQSEQCHLLDTVS-----CS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
C S C L T + CS S + C Y Y DSS + G + ER++ + + DN +
Sbjct: 201 CTSTLCTQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYFSRERLSV-TATDIVDNFL 259
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
FGCG NN G+F GL+GLGR +S Q + + FSYCL SS T ++ F
Sbjct: 260 FGCGQNNQGLFG-GSAGLIGLGRHPISFVQQT-AAVYRKIFSYCL---PATSSSTGRLSF 314
Query: 194 GNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
G + S S +S+ ++Y + + GISVG +P SS S G I
Sbjct: 315 GT-TTTSYVKYTPFSTISR-GSSFYGLDITGISVGGAK-----LPV--SSSTFSTGGAII 365
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDG 312
D+G T LP Y L R + P CY + P + F G
Sbjct: 366 DSGTVITRLPPTAYTALRSAFRQGMSKYPSAGELSILDTCYDLSGYEVFSIPKIDFSFAG 425
Query: 313 GAKVPLIHTST-FIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
G V L ++ + FA D DV I+GN Q + + YD
Sbjct: 426 GVTVQLPPQGILYVASAKQVCLAFAANGDDSDVTIYGNVQQKTIEVVYD 474
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 116/366 (31%), Positives = 166/366 (45%), Gaps = 27/366 (7%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSS 73
V+ G YV+ +GTP + + +V DTGSD WVQC PCV CY+Q +P+++P S+
Sbjct: 88 GVALGTGNYVVPVRLGTP--AERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSA 145
Query: 74 SYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
+Y +SC S C L CS C Y Y D S T G A + +T + + N
Sbjct: 146 TYANISCSSSYCSDLYVSGCSGGH-CLYGIQYGDGSYTIGFYAQDTLTL--AYDTIKNFR 202
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
FGCG N G+F GL+GLGR + SL Q + G F+YCL S+ T +
Sbjct: 203 FGCGEKNRGLFGR-AAGLLGLGRGKTSLPVQAYDKYG-GVFAYCL---PATSAGTGFLDL 257
Query: 194 GNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
G G+ + + T ++ T+Y+V + GI VG ++P S S +
Sbjct: 258 GPGAPAANARL--TPMLVDRGPTFYYVGMTGIKVGG-----HVLPIPGS--VFSTAGTLV 308
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAG--IA-PILTA 308
D+G T LP Y L A++ Y S L CY G IA P ++
Sbjct: 309 DSGTVITRLPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSL 368
Query: 309 HFDGGAKVPLIHTST-FIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
F GGA + + + ++ + FA D DV I GN Q + YD ++V
Sbjct: 369 VFQGGACLDVDASGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVG 428
Query: 368 FKPTDC 373
F P C
Sbjct: 429 FAPGAC 434
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 119/372 (31%), Positives = 175/372 (47%), Gaps = 33/372 (8%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
N+ T N Y++ +G+ ++ I+DTGSDL WVQC PC+ CY Q PI+ P++SSSY
Sbjct: 59 NLQTLN--YIVTMGLGST---NMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSY 113
Query: 76 KELSCQSEQCHLL-----DTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
+ +SC S C L +T +C S CNY Y D S T G L E+++FG +
Sbjct: 114 QSVSCNSSTCQSLQFATGNTGACGSNPSTCNYVVNYGDGSYTNGELGVEQLSFGGVS--V 171
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
+ VFGCG NN G+F GL+GLGR+ LSL SQ + G FSYCL T+S +
Sbjct: 172 SDFVFGCGRNNKGLFG-GVSGLMGLGRSYLSLVSQTNATFGG-VFSYCLPT--TESGASG 227
Query: 190 KMYFGNGSEVSGGGVVST---SLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
+ GN S V T L + + +Y + L GI V ++ +P + + G +
Sbjct: 228 SLVMGNESSVFKNVTPITYTRMLPNPQLSNFYILNLTGIDVDGVALQ---VPSFGNGGVL 284
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PI 305
ID+G T LP Y L+ P C+ ++ P
Sbjct: 285 ------IDSGTVITRLPSSVYKALKALFLKQFTGFPSAPGFSILDTCFNLTGYDEVSIPT 338
Query: 306 LTAHFDGGAKVPLIHTSTF-IPPPVEGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDFD 362
++ HF+G A++ + T TF + C A+ + D I GN+ Q + + YD
Sbjct: 339 ISMHFEGNAELKVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTK 398
Query: 363 SQMVSFKPTDCT 374
V F C+
Sbjct: 399 QSKVGFAEESCS 410
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 119/377 (31%), Positives = 174/377 (46%), Gaps = 31/377 (8%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
S + +G+Y + FS+GTP + IVDTGSDL +VQC PC CY+Q P+Y P++SS+
Sbjct: 25 SGTTLGSGQYFVDFSLGTPEQ-KFHLIVDTGSDLAFVQCAPCDLCYEQDGPLYQPSNSST 83
Query: 75 YKELSCQSEQCHLLDT---VSCSS-------QQLCNYTYGYADSSLTKGVLATERITFGN 124
+ + C S +C L+ CSS Q C+Y Y Y D+S T GV A E T G
Sbjct: 84 FTPVPCDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYETATVGG 143
Query: 125 SNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
++V FGCG+ N G F + G++GLG+ LS SQ NKF+YCL + +
Sbjct: 144 IR--VNHVAFGCGNRNQGSF-VSAGGVLGLGQGALSFTSQAGYAF-ENKFAYCLTSYLSP 199
Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
+S+ S + FG+ + + T LVS + + Y+V + I G + LIP +S+
Sbjct: 200 TSVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFG---GETLLIP--DSA 254
Query: 244 GAIS---KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM- 299
I G D+G T Y R+ ++ G LC +
Sbjct: 255 WKIDSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLPLCVNVSGID 314
Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIP--PPVEGVFCFAMQPIDGD-VGIFGNFAQSDLF 356
I P T FD GA + FI P ++ C AM D + GN Q +
Sbjct: 315 HPIYPSFTIEFDQGATYRPNQGNYFIEVSPNID---CLAMLESSSDGFNVIGNIIQQNYL 371
Query: 357 IGYDFDSQMVSFKPTDC 373
+ YD + + F +C
Sbjct: 372 VQYDREEHRIGFAHANC 388
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 141 bits (355), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 118/383 (30%), Positives = 164/383 (42%), Gaps = 25/383 (6%)
Query: 4 ATYFYPNNVVQSNVSTANGEYVMKFSIGTP----PLLDIYGIVDTGSDLMWVQCLPCVQC 59
AT P N + +GEY+ K ++GTP + D GSD+ W+QC+PC +C
Sbjct: 105 ATPADPENGTVVTGAPTSGEYIAKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPCFRC 164
Query: 60 YKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQL--CNYTYGYADSSLTKGVLAT 117
Y Q P+YN SSS ++ C + C L + Q L C Y Y D S + G
Sbjct: 165 YHQPGPVYNRLKSSSASDVGCYAPACRALGSSGGCVQFLNECQYKVEYGDGSSSAGDFGV 224
Query: 118 ERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
E +TF V GCG +N G+F G++GLGR LS SQI + G FSYC
Sbjct: 225 ETLTFPPGVR-VPGVAIGCGSDNQGLFPAPAAGILGLGRGSLSFPSQIAGRYG-RSFSYC 282
Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTS----LVSKEDKTYYFVTLEGISVGNLSNS 233
L T +S + FG+G+ + S L + T+Y+V L GISVG +
Sbjct: 283 LAGQGTGGR-SSTLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVR 341
Query: 234 SKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVR-NAIKLTPYQDPRLGSQL 292
+ G + +D+G T L Y + R A+K + P G
Sbjct: 342 GVTESDLRLDPSTGHGGVIVDSGTAVTRLSGPAYAAFRDAFRVAAVKELGWPSP--GGPF 399
Query: 293 -----CYKT--PSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPV-EGVFCFAMQPI-DGD 343
CY + + P ++ HF GG +V L + IP +G CFA D
Sbjct: 400 AFFDTCYSSVRGRVMKKVPAVSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAFAGSGDRG 459
Query: 344 VGIFGNFAQSDLFIGYDFDSQMV 366
V I GN + YD D Q V
Sbjct: 460 VSIIGNIQLQGFRVVYDVDGQRV 482
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 116/366 (31%), Positives = 166/366 (45%), Gaps = 27/366 (7%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSS 73
V+ G YV+ +GTP + + +V DTGSD WVQC PCV CY+Q +P+++P S+
Sbjct: 153 GVALGTGNYVVPVRLGTP--AERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSA 210
Query: 74 SYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
+Y +SC S C L CS C Y Y D S T G A + +T + + N
Sbjct: 211 TYANISCSSSYCSDLYVSGCSGGH-CLYGIQYGDGSYTIGFYAQDTLTL--AYDTIKNFR 267
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
FGCG N G+F GL+GLGR + SL Q + G F+YCL S+ T +
Sbjct: 268 FGCGEKNRGLFGR-AAGLLGLGRGKTSLPVQAYDKYG-GVFAYCL---PATSAGTGFLDL 322
Query: 194 GNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
G G+ + + T ++ T+Y+V + GI VG ++P S S +
Sbjct: 323 GPGAPAANARL--TPMLVDRGPTFYYVGMTGIKVGG-----HVLPIPGS--VFSTAGTLV 373
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAG--IA-PILTA 308
D+G T LP Y L A++ Y S L CY G IA P ++
Sbjct: 374 DSGTVITRLPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSL 433
Query: 309 HFDGGAKVPLIHTST-FIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
F GGA + + + ++ + FA D DV I GN Q + YD ++V
Sbjct: 434 VFQGGACLDVDASGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVG 493
Query: 368 FKPTDC 373
F P C
Sbjct: 494 FAPGAC 499
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 114/350 (32%), Positives = 165/350 (47%), Gaps = 33/350 (9%)
Query: 41 IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSS----- 95
IVDTGSDL WVQC PC CY Q P+Y+P+ SSSYK + C S C L + +S
Sbjct: 149 IVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGG 208
Query: 96 -----QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMG 150
+ C Y Y D S T+G LA+E I G++ +N VFGCG NN G+F +
Sbjct: 209 NNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTK--LENFVFGCGRNNKGLFGGSSG- 265
Query: 151 LVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV--SGGGVVSTS 208
L+GLGR+ +SL SQ L FSYCL +S + FGN S V + V T
Sbjct: 266 LMGLGRSSVSLVSQTLKTFNG-VFSYCLPSLEDGAS--GSLSFGNDSSVYTNSTSVSYTP 322
Query: 209 LVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFY 267
LV + +++Y + L G S+G + S + +G + ID+G T LP Y
Sbjct: 323 LVQNPQLRSFYILNLTGASIGGVELKSS---------SFGRG-ILIDSGTVITRLPPSIY 372
Query: 268 NRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTF-I 325
++ + P C+ S I+ PI+ F G A++ + T F
Sbjct: 373 KAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYF 432
Query: 326 PPPVEGVFCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
P + C A+ + + +VGI GN+ Q + + YD + + +C
Sbjct: 433 VKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 482
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 114/350 (32%), Positives = 165/350 (47%), Gaps = 33/350 (9%)
Query: 41 IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSS----- 95
IVDTGSDL WVQC PC CY Q P+Y+P+ SSSYK + C S C L + +S
Sbjct: 101 IVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGG 160
Query: 96 -----QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMG 150
+ C Y Y D S T+G LA+E I G++ +N VFGCG NN G+F +
Sbjct: 161 NNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTK--LENFVFGCGRNNKGLFGGSSG- 217
Query: 151 LVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV--SGGGVVSTS 208
L+GLGR+ +SL SQ L FSYCL +S + FGN S V + V T
Sbjct: 218 LMGLGRSSVSLVSQTLKTFNG-VFSYCLPSLEDGAS--GSLSFGNDSSVYTNSTSVSYTP 274
Query: 209 LVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFY 267
LV + +++Y + L G S+G + S + +G + ID+G T LP Y
Sbjct: 275 LVQNPQLRSFYILNLTGASIGGVELKSS---------SFGRG-ILIDSGTVITRLPPSIY 324
Query: 268 NRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTF-I 325
++ + P C+ S I+ PI+ F G A++ + T F
Sbjct: 325 KAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYF 384
Query: 326 PPPVEGVFCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
P + C A+ + + +VGI GN+ Q + + YD + + +C
Sbjct: 385 VKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 434
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 140 bits (354), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 114/350 (32%), Positives = 165/350 (47%), Gaps = 33/350 (9%)
Query: 41 IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSS----- 95
IVDTGSDL WVQC PC CY Q P+Y+P+ SSSYK + C S C L + +S
Sbjct: 149 IVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGG 208
Query: 96 -----QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMG 150
+ C Y Y D S T+G LA+E I G++ +N VFGCG NN G+F +
Sbjct: 209 NNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTK--LENFVFGCGRNNKGLFGGSSG- 265
Query: 151 LVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV--SGGGVVSTS 208
L+GLGR+ +SL SQ L FSYCL +S + FGN S V + V T
Sbjct: 266 LMGLGRSSVSLVSQTLKTFNG-VFSYCLPSLEDGAS--GSLSFGNDSSVYTNSTSVSYTP 322
Query: 209 LVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFY 267
LV + +++Y + L G S+G + S + +G + ID+G T LP Y
Sbjct: 323 LVQNPQLRSFYILNLTGASIGGVELKSS---------SFGRG-ILIDSGTVITRLPPSIY 372
Query: 268 NRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTF-I 325
++ + P C+ S I+ PI+ F G A++ + T F
Sbjct: 373 KAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYF 432
Query: 326 PPPVEGVFCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
P + C A+ + + +VGI GN+ Q + + YD + + +C
Sbjct: 433 VKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDSTQERLGIVGENC 482
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 140 bits (354), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 118/383 (30%), Positives = 174/383 (45%), Gaps = 38/383 (9%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
A EY + +GTP +++ I+DTGSD+ W+QC+PC C ++P +NP SSS+ +L
Sbjct: 135 AGLEYYVPLQVGTP-AVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLP 193
Query: 80 CQSEQC-HLLDTVS--CS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD----- 130
C S C ++ V CS S + C ++ Y D SL+ G+LA E I GN+ NF D
Sbjct: 194 CASSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIA-GNTPNFGDGEPVK 252
Query: 131 --NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
N+ GC + GL+G+ R +S SQ+ S+ A KFS+C + +
Sbjct: 253 LSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRY-ARKFSHCFPDKIAHLNSS 311
Query: 189 SKMYFGNGSEVSG----GGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
++FG +S +V V YY+V L GISV + S+L P + +
Sbjct: 312 GLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISV----DESRL-PLSHKNF 366
Query: 245 AISK----GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS-- 298
I K G ID+G T L K + + + D G CY S
Sbjct: 367 DIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGT 426
Query: 299 ---MAGIAPILTAHFDGGAKVPLIHTSTFIP---PPVEGVFCFAMQPIDGDV--GIFGNF 350
+ I P +T HF GG V L S IP + C A + GD+ I GN+
Sbjct: 427 AALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFL-MSGDIPFNIIGNY 485
Query: 351 AQSDLFIGYDFDSQMVSFKPTDC 373
Q +L++ YD + + P C
Sbjct: 486 QQQNLWVEYDLEKLRLGIAPAQC 508
>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 394
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 104/365 (28%), Positives = 172/365 (47%), Gaps = 36/365 (9%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
YV F+IGTPP ++D +L+W QC C +C++Q P+++P +S++Y+ C +
Sbjct: 51 YVANFTIGTPPQ-PASAVIDLAGELVWTQCKQCGRCFEQGTPLFDPTASNTYRAEPCGTP 109
Query: 84 QCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
C + D +CS +C Y ++ T G + T+ G + ++ FGC +
Sbjct: 110 LCESIPSDVRNCSG-NVCAYE-ASTNAGDTGGKVGTDTFAVGTAKA---SLAFGCVVASD 164
Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
G+VGLGRT SL ++Q G FSYCL P D+ S ++ G+ ++++G
Sbjct: 165 IDTMGGPSGIVGLGRTPWSL----VTQTGVAAFSYCLAPH--DAGKNSALFLGSSAKLAG 218
Query: 202 GG-VVSTSLVS-----KEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
GG ST V+ + YY V LEG+ G+ +IP S + +DT
Sbjct: 219 GGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGD-----AMIPLPPSGSTV-----LLDT 268
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
+P + L Y +++ V A+ P P LC+ +G AP L F GGA
Sbjct: 269 FSPISFLVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFTFRGGAA 328
Query: 316 VPLIHTSTFIPPPVEGVFCFAMQP-----IDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
+ + + ++ G C AM ++ + G+ Q ++ +D D + +SF+P
Sbjct: 329 M-TVPATNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEP 387
Query: 371 TDCTK 375
DCTK
Sbjct: 388 ADCTK 392
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 114/363 (31%), Positives = 165/363 (45%), Gaps = 52/363 (14%)
Query: 41 IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQL-- 98
IVDT S+L WVQC PC C+ Q P+++P+SS SY + C S C L QQL
Sbjct: 157 IVDTASELTWVQCAPCESCHDQQGPLFDPSSSPSYAAVPCDSPSCDALQ------QQLAT 210
Query: 99 ----------------CNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG 142
C+Y Y D S ++GVLA +R++ + D VFGCG +N G
Sbjct: 211 GAGAGAPPCDAGRPAACSYALSYRDGSYSRGVLAHDRLSL--AGEVIDGFVFGCGTSNQG 268
Query: 143 VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV--S 200
GL+GLGR++LSL SQ + Q G FSYCL P +S + + G+ +
Sbjct: 269 PPFGGTSGLMGLGRSQLSLVSQTVDQFG-GVFSYCL-PLSRESDASGSLVLGDDPSAYRN 326
Query: 201 GGGVVSTSLVSKEDKT----YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
VV TS+VS D +Y V L GI+VG S+G ++ +D+G
Sbjct: 327 STPVVYTSMVSNSDPLLQGPFYLVNLTGITVGGQE--------VESTGFSARA--IVDSG 376
Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGI-APILTAHFDGGA 314
T L YN + + + + P Q P C+ + + P LT FDGGA
Sbjct: 377 TVITSLVPSVYNAVRAEFMSQLAEYP-QAPGFSILDTCFNMTGLKEVQVPSLTLVFDGGA 435
Query: 315 KVPLIHTST--FIPPPVEGVFCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
+V + F+ V C A+ + + + I GN+ Q +L + +D + V F
Sbjct: 436 EVEVDSGGVLYFVSSDSSQV-CLAVASLKSEDETSIIGNYQQKNLRVVFDTSASQVGFAQ 494
Query: 371 TDC 373
C
Sbjct: 495 ETC 497
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 118/375 (31%), Positives = 173/375 (46%), Gaps = 38/375 (10%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCL----PCVQCYKQVKPIYNPASSSSYKELS 79
+ + IGTPP IVDTGSDL+W QC V P+Y+P SS++ L
Sbjct: 91 HSLTVGIGTPPQPRKL-IVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESSTFAFLP 149
Query: 80 CQSEQCH--LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
C C +C+S+ C Y Y S+ GVLA+E TFG + FGCG
Sbjct: 150 CSDRLCQEGQFSFKNCTSKNRCVYEDVYG-SAAAVGVLASETFTFGARRAVSLRLGFGCG 208
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
+ G G++GL LSL ++QL +FSYCL PF TS + FG +
Sbjct: 209 ALSAGSLI-GATGILGLSPESLSL----ITQLKIQRFSYCLTPFADKK--TSPLLFGAMA 261
Query: 198 EVSGGG----VVSTSLVSKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---G 249
++S + +T++VS KT YY+V L GIS+G+ K + +S A+ G
Sbjct: 262 DLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGH-----KRLAVPAASLAMRPDGGG 316
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA------ 303
+D+G+ L + + ++E V + ++L +LC+ P A
Sbjct: 317 GTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQ 376
Query: 304 -PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM-QPIDGD-VGIFGNFAQSDLFIGYD 360
P L HFDGGA + L + F P G+ C A+ + DG V I GN Q ++ + +D
Sbjct: 377 VPPLVLHFDGGAAMVLPRDNYF-QEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFD 435
Query: 361 FDSQMVSFKPTDCTK 375
SF PT C +
Sbjct: 436 VQHHKFSFAPTQCDQ 450
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 118/360 (32%), Positives = 151/360 (41%), Gaps = 34/360 (9%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELSC 80
+YV+ S+GTP + VDTGSD+ WVQC PC C Q +++PA SS+Y + C
Sbjct: 142 QYVVTVSLGTPGVSQTV-EVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPC 200
Query: 81 QSEQCHLLDT--VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
++ C L CS Q C Y Y D S T GV ++ + N +FGCGH
Sbjct: 201 GADACSELRIYEAGCSGSQ-CGYVVSYGDGSNTTGVYGSDTLALAPGNT-VGTFLFGCGH 258
Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
G+F + GL+ LGR +SL SQ G FSYCL S ++ Y G
Sbjct: 259 AQAGMFAGID-GLLALGRQSMSLKSQAAGAYG-GVFSYCL-----PSKQSAAGYLTLGGP 311
Query: 199 VSGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
S G +T L++ T+Y V L GISVG + + G +DTG
Sbjct: 312 SSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAF--------AGGTVVDTGT 363
Query: 258 PPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA--PILTAHFDGG 313
T LP Y L R AI Y L CY S G+ P + F GG
Sbjct: 364 VITRLPPTAYAALRSAFRGAIAPCGYPSAPANGILDTCYDF-SRYGVVTLPTVALTFSGG 422
Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
A + L G FA DGD I GN Q + FD V F P C
Sbjct: 423 ATLALEAPGIL----SSGCLAFAPNGGDGDAAILGNVQQRSFAV--RFDGSTVGFMPGAC 476
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 127/403 (31%), Positives = 191/403 (47%), Gaps = 58/403 (14%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V+S V+ +GEY+M +GTPP I+DTGSDL W+QC PC+ C++Q P+++PA+S
Sbjct: 140 VESGVAVGSGEYLMDVYVGTPPR-RFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAAS 198
Query: 73 SSYKELSCQSEQC-HL----------LDTVSCSSQQLCNYTYGYADSSLTKGVLATERIT 121
SSY+ ++C +C H+ T + C Y Y Y D S T G LA E T
Sbjct: 199 SSYRNVTCGDHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFT 258
Query: 122 FG----NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
++ D VVFGCGH N G+F+ L+GLGR LS ASQ+ + G + FSYC
Sbjct: 259 VNLTAPGASRRVDGVVFGCGHRNRGLFHGAAG-LLGLGRGPLSFASQLRAVYG-HTFSYC 316
Query: 178 LVPFHTDSSITSKMYFGNGSEVSG---------GGVVSTSLVSKEDKTYYFVTLEGISVG 228
LV +D + SK+ FG + S S T+Y+V L+G+ VG
Sbjct: 317 LVDHGSD--VGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVG 374
Query: 229 ----NLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLE----EQVRNAIKL 280
N+S+ + + G G ID+G + + Y + +++ + L
Sbjct: 375 GELLNISSDTWDV------GKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPL 428
Query: 281 TPYQDPRLGSQLCYKTPSMAGI----APILTAHFDGGAKVPLIHTSTFIPPPVEG--VFC 334
P + P L CY +++G+ P L+ F GA + FI +G + C
Sbjct: 429 VP-EFPVLSP--CY---NVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGGSIMC 482
Query: 335 FAM--QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
A+ P G + I GNF Q + + YD + + F P C +
Sbjct: 483 LAVLGTPRTG-MSIIGNFQQQNFHVVYDLQNNRLGFAPRRCAE 524
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 113/359 (31%), Positives = 161/359 (44%), Gaps = 18/359 (5%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKEL 78
+ +Y + +GTP D+ I DTGS L W QC PC CYKQ PI++P+ SSSY +
Sbjct: 136 GSADYYVVVGLGTPKR-DLSLIFDTGSYLTWTQCEPCAGSCYKQQDPIFDPSKSSSYTNI 194
Query: 79 SCQSEQCHLLDTVSCSS--QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
C S C + CSS C Y Y D+S+++G L+ ER+T + + + +FGC
Sbjct: 195 KCTSSLCTQFRSAGCSSSTDASCIYDVKYGDNSISRGFLSQERLTI-TATDIVHDFLFGC 253
Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
G +N G+F GL+GL R +S Q S + FSYCL T SS+ + +
Sbjct: 254 GQDNEGLF-RGTAGLMGLSRHPISFVQQT-SSIYNKIFSYCLPS--TPSSLGHLTFGASA 309
Query: 197 SEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
+ + S +S E+ ++Y + + GISVG +P +SS S G ID+G
Sbjct: 310 ATNANLKYTPFSTISGEN-SFYGLDIVGISVGGTK-----LPAVSSS-TFSAGGSIIDSG 362
Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAK 315
T LP Y L R + P CY I+ P + F GG K
Sbjct: 363 TVITRLPPTAYAALRSAFRQFMMKYPVAYGTRLLDTCYDFSGYKEISVPRIDFEFAGGVK 422
Query: 316 VPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
V L + + FA D+ IFGN Q L + YD + + F C
Sbjct: 423 VELPLVGILYGESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVEGGRIGFGAAGC 481
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 132/374 (35%), Positives = 177/374 (47%), Gaps = 33/374 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPAS 71
VQS S G+YV+ +GTP + I DTGSD+ W QC PCV+ CYKQ +P NP++
Sbjct: 108 VQSGASIGAGDYVVTVGLGTPKK-EFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPST 166
Query: 72 SSSYKELSCQSEQCHLLD-----TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
S+SYK +SC S C L+ + SCSS C Y Y D S + G ATE +T +S+
Sbjct: 167 STSYKNISCSSALCKLVASGKKFSQSCSSST-CLYQVQYGDGSYSIGFFATETLTL-SSS 224
Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
N F N +FGCG N GL+GLGRT+L+L SQ ++ FSYCL +S
Sbjct: 225 NVFKNFLFGCGQQNN-GLFGGAAGLLGLGRTKLALPSQT-AKTYKKLFSYCL-----PAS 277
Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
+SK Y G +VS V T L + D T +Y + + G+SVG S A
Sbjct: 278 SSSKGYLSLGGQVS-KSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSI-------DESA 329
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-P 304
S G + ID+G T L Y+ L +N + P CY + P
Sbjct: 330 FSAGTV-IDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIP 388
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGV----FCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
+ F GG ++ + + I PV G+ FA D D IFGN Q + YD
Sbjct: 389 KVGVTFKGGVEMDIDVSG--ILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYD 446
Query: 361 FDSQMVSFKPTDCT 374
V F P C+
Sbjct: 447 GAKGRVGFAPGGCS 460
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 132/374 (35%), Positives = 177/374 (47%), Gaps = 33/374 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPAS 71
VQS S G+YV+ +GTP + I DTGSD+ W QC PCV+ CYKQ +P NP++
Sbjct: 60 VQSGASIGAGDYVVTVGLGTPKK-EFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPST 118
Query: 72 SSSYKELSCQSEQCHLLD-----TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
S+SYK +SC S C L+ + SCSS C Y Y D S + G ATE +T +S+
Sbjct: 119 STSYKNISCSSALCKLVASGKKFSQSCSS-STCLYQVQYGDGSYSIGFFATETLTL-SSS 176
Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
N F N +FGCG N GL+GLGRT+L+L SQ ++ FSYCL +S
Sbjct: 177 NVFKNFLFGCGQQNN-GLFGGAAGLLGLGRTKLALPSQT-AKTYKKLFSYCL-----PAS 229
Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
+SK Y G +VS V T L + D T +Y + + G+SVG S A
Sbjct: 230 SSSKGYLSLGGQVS-KSVKFTPLSADFDSTPFYGLDITGLSVGGRQLSI-------DESA 281
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-P 304
S G + ID+G T L Y+ L +N + P CY + P
Sbjct: 282 FSAGTV-IDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIP 340
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGV----FCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
+ F GG ++ + + I PV G+ FA D D IFGN Q + YD
Sbjct: 341 KVGVTFKGGVEMDIDVSG--ILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYD 398
Query: 361 FDSQMVSFKPTDCT 374
V F P C+
Sbjct: 399 GAKGRVGFAPGGCS 412
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 121/364 (33%), Positives = 162/364 (44%), Gaps = 30/364 (8%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSC 80
EYV+ IGTP + ++DTGSDL WVQC PC CY Q P+Y+P +SS+Y + C
Sbjct: 126 EYVVTLGIGTPAVQQTV-LIDTGSDLSWVQCKPCNSSSCYPQKDPLYDPTASSTYAPVPC 184
Query: 81 QSEQCHLL-------DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
S+ C L + S LC Y Y + T GV +TE +T + D
Sbjct: 185 DSKACKDLVPDAYDHGCTNSSGTSLCQYGIEYGNRDTTVGVYSTETLTLSPQVSVKD-FG 243
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
FGCG G F+ + L G SL SQ G FSYCL P ++ + +
Sbjct: 244 FGCGLVQQGTFDLFDGLLGLGGAPE-SLVSQTAETYGG-AFSYCLPPGNSTTGFLALGAP 301
Query: 194 GNGSEVSGGGVVSTSLVS-KEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
N ++ + G + T L S E T+Y V L G+SVG + P + G M
Sbjct: 302 TNNNDTA--GFLFTPLHSLPEQATFYLVNLTGVSVGG--KPLDIPP------TVLSGGMI 351
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGI-APILTAH 309
ID+G T LP Y+ L R A+ P P L CY +A + P +
Sbjct: 352 IDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCYNFTGIANVTVPTVALT 411
Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
FDGGA + L S + ++ FA DGDVGI GN Q + YD V F+
Sbjct: 412 FDGGATIDLDVPSGVL---IQDCLAFAGGASDGDVGIIGNVNQRTFEVLYDSGRGHVGFR 468
Query: 370 PTDC 373
P C
Sbjct: 469 PGAC 472
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 118/384 (30%), Positives = 174/384 (45%), Gaps = 39/384 (10%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPA 70
+ +S G YV+ +GTP D+ + DTGSDL WVQC PC CYKQ P++ P+
Sbjct: 143 AERGISVGTGNYVVSVGLGTP-ARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPS 201
Query: 71 SSSSYKELSCQSEQCHLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFG------ 123
SS++ + C + +C + S C Y Y D S T+G L + +T G
Sbjct: 202 DSSTFSAVRCGARECRARQSCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPAN 261
Query: 124 ---NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
++N VFGCG NNTG+F + + GL GLGR ++SL+SQ + G FSYCL
Sbjct: 262 ASAENDNKLPGFVFGCGENNTGLFGQAD-GLFGLGRGKVSLSSQAAGKFG-EGFSYCL-- 317
Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIP 238
SS ++ Y G+ V T ++++ ++Y+V L GI V + + I
Sbjct: 318 --PSSSSSAPGYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRV-----AGRAIR 370
Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD-PRLGS-QLCYKT 296
+ A+ + +D+G T L Y L +A+ Y+ PRL CY
Sbjct: 371 VSSPRVALP---LIVDSGTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSILDTCYDF 427
Query: 297 PSMAGIA---PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD---VGIFGNF 350
+ A P + F GGA + + + V C A P +GD GI GN
Sbjct: 428 TAHANATVSIPAVALVFAGGATISVDFSGVLYVAKVAQA-CLAFAP-NGDGRSAGILGNT 485
Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
Q L + YD Q + F C+
Sbjct: 486 QQRTLAVVYDVARQKIGFAAKGCS 509
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 110/376 (29%), Positives = 179/376 (47%), Gaps = 41/376 (10%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
+++ SIG+PP+ + +VDTGS L+WVQCLPC+ C++Q ++P S S+K L C
Sbjct: 104 FLVNLSIGSPPVTQLV-VVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFP 162
Query: 84 QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG--NSNNFFD----------- 130
+ ++ C+ Y Y ++G+LA E + F + F
Sbjct: 163 GYNYINGYKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGRVFQYNAISTQISKI 222
Query: 131 ---NVVFGCGHNNTGVFNENEM-GLVGLGRT-RLSLASQILSQLGANKFSYCLVPFHTDS 185
N+ FGCGH N N++ G+ GLG +++A+Q+ NKFSYC+ +
Sbjct: 223 KKSNITFGCGHMNIKTNNDDAYNGVFGLGAYPHITMATQL-----GNKFSYCIGDINNPL 277
Query: 186 SITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
+ + G GS + G ST L + +Y+VTL+ ISVG S + K+ P +
Sbjct: 278 YTHNHLVLGQGSYIEGD---STPL--QIHFGHYYVTLQSISVG--SKTLKIDPNAFKISS 330
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP--RLGSQLCYK---TPSMA 300
G + ID+G T L + L +++ + +K + P R LC+K + +
Sbjct: 331 DGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGLCFKGVVSRDLV 390
Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD---VGIFGNFAQSDLFI 357
G P +T HF GGA + L S F + FC A+ P + + + + G AQ + +
Sbjct: 391 GF-PAVTFHFAGGADLVLESGSLFRQHGGDR-FCLAILPSNSELLNLSVIGILAQQNYNV 448
Query: 358 GYDFDSQMVSFKPTDC 373
G+D + V F+ DC
Sbjct: 449 GFDLEQMKVFFRRIDC 464
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 115/365 (31%), Positives = 170/365 (46%), Gaps = 22/365 (6%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQ-CYKQVKPIYNPASS 72
S + G YV+ +GTP Y +V DTGSD WVQC PCV CY+Q + +++P S
Sbjct: 169 SGRALGTGNYVVTVGLGTP--ASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPVRS 226
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+Y +SC + C L+ CS C Y Y D S + G A + +T +S +
Sbjct: 227 STYANVSCAAPACSDLNIHGCSGGH-CLYGVQYGDGSYSIGFFAMDTLTL-SSYDAVKGF 284
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGCG N G+F E GL+GLGR + SL Q + G F++CL S+ T +
Sbjct: 285 RFGCGERNEGLFGE-AAGLLGLGRGKTSLPVQTYDKYG-GVFAHCLP---ARSTGTGYLD 339
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
FG GS + ++T +++ T+Y++ + GI VG +L+ S +
Sbjct: 340 FGAGSPAAASARLTTPMLTDNGPTFYYIGMTGIRVGG-----QLLSIPQS--VFATAGTI 392
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAH 309
+D+G T LP Y+ L A+ Y+ S L CY M+ +A P ++
Sbjct: 393 VDSGTVITRLPPPAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLL 452
Query: 310 FDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
F GGA++ + + V FA GDVGI GN + YD ++V F
Sbjct: 453 FQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 512
Query: 369 KPTDC 373
P C
Sbjct: 513 YPGVC 517
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 118/385 (30%), Positives = 179/385 (46%), Gaps = 28/385 (7%)
Query: 12 VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
++S VS +GEY + IG+PP I+DTGSDL W+QC+PC C++Q P Y+P
Sbjct: 184 TLESGVSLGSGEYFIDVFIGSPPK-HFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKD 242
Query: 72 SSSYKELSCQSEQCHLLDT----VSCSSQ-QLCNYTYGYADSSLTKGVLATERITF---- 122
S S++ ++C +C L+ + C + Q C Y Y Y DSS T G A E T
Sbjct: 243 SISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTS 302
Query: 123 ---GNSN-NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL 178
G S +NV+FGCGH N G+F+ L LS +SQ+ S G + FSYCL
Sbjct: 303 STTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGR-GPLSFSSQLQSLYG-HSFSYCL 360
Query: 179 VPFHTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKEDK---TYYFVTLEGISVGNLSNSS 234
V +D+S++SK+ FG ++ ++ TSL++ ++ T+Y++ ++ I VG
Sbjct: 361 VDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVG----GE 416
Query: 235 KL-IPYYNSS-GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL 292
KL IP N + A G ID+G + Y ++E +K +
Sbjct: 417 KLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHP 476
Query: 293 CYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNF 350
CY + P F GA + FI + C AM + I GN+
Sbjct: 477 CYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNY 536
Query: 351 AQSDLFIGYDFDSQMVSFKPTDCTK 375
Q + I YD + + + P C +
Sbjct: 537 QQQNFHILYDTKNSRLGYAPMRCAE 561
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 118/384 (30%), Positives = 179/384 (46%), Gaps = 28/384 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
++S VS +GEY + IG+PP I+DTGSDL W+QC+PC C++Q P Y+P S
Sbjct: 185 LESGVSLGSGEYFIDVFIGSPPK-HFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDS 243
Query: 73 SSYKELSCQSEQCHLLDTVS----CSSQ-QLCNYTYGYADSSLTKGVLATERITF----- 122
S++ ++C +C L+ + C + Q C Y Y Y DSS T G A E T
Sbjct: 244 ISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSS 303
Query: 123 --GNSN-NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
G S +NV+FGCGH N G+F+ L LS +SQ+ S G + FSYCLV
Sbjct: 304 TTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGR-GPLSFSSQLQSLYG-HSFSYCLV 361
Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKEDK---TYYFVTLEGISVGNLSNSSK 235
+D+S++SK+ FG ++ ++ TSL++ ++ T+Y++ ++ I VG K
Sbjct: 362 DRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGG----EK 417
Query: 236 L-IPYYNSS-GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLC 293
L IP N + A G ID+G + Y ++E +K + C
Sbjct: 418 LQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPC 477
Query: 294 YKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFA 351
Y + P F GA + FI + C AM + I GN+
Sbjct: 478 YNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQ 537
Query: 352 QSDLFIGYDFDSQMVSFKPTDCTK 375
Q + I YD + + + P C +
Sbjct: 538 QQNFHILYDTKNSRLGYAPMRCAE 561
>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
gi|219886805|gb|ACL53777.1| unknown [Zea mays]
gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
Length = 440
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 115/384 (29%), Positives = 176/384 (45%), Gaps = 47/384 (12%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSCQ 81
+Y+ ++ IG PP I+DTGS+L+W QC C C++Q P Y+P+ S + + + C
Sbjct: 70 QYIAEYLIGDPPQ-RAEAIIDTGSNLIWTQCSRCRPTCFRQNLPYYDPSRSRAARAVGCN 128
Query: 82 SEQCHLLDTVSC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC---G 137
C L C S + C GY ++ G LATE +TF + ++VFGC
Sbjct: 129 DAACALGSETQCLSDNKTCAVVTGYGAGNI-AGTLATENLTFQSETV---SLVFGCIVVT 184
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
+ G N G++GLGR +LSL SQ LG +FSYCL P+ D+ S M G +
Sbjct: 185 KLSPGSLN-GASGIIGLGRGKLSLPSQ----LGDTRFSYCLTPYFEDTIEPSHMVVGASA 239
Query: 198 EVSGGGVVSTSLVS-------KED--KTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAI 246
+ G ST + + +D T+Y++ L GI+ G L+ S + +
Sbjct: 240 GLINGSASSTPVTTVPFVRSPSDDPFSTFYYLPLTGITAGKVKLAVPSAAFDLRQVAPGM 299
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS---QLCYKTPSMAGIA 303
G FID+GAP T L Y L ++ + Q P G+ LC +
Sbjct: 300 WTGT-FIDSGAPLTSLVDVAYQALRAELARQLGAALVQ-PLAGTTGFDLCVALKDAERLV 357
Query: 304 PILTAHFDGGAKVPLIHTSTFIPP-----PVE-GVFCFAM-QPID------GDVGIFGNF 350
P L HF GG+ T +PP PV+ C + +D + + GN+
Sbjct: 358 PPLVLHFGGGSGT---GTDLVVPPANYWAPVDSATACMVVFSSVDRKSLPMNETTVIGNY 414
Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
Q ++ + YD ++SF+P DC+
Sbjct: 415 MQQNMHVLYDLAGGVLSFQPADCS 438
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 132/374 (35%), Positives = 177/374 (47%), Gaps = 33/374 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPAS 71
VQS S G+YV+ +GTP + I DTGSD+ W QC PCV+ CYKQ +P NP++
Sbjct: 120 VQSGASIGAGDYVVTVGLGTPKK-EFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPST 178
Query: 72 SSSYKELSCQSEQCHLLD-----TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
S+SYK +SC S C L+ + SCSS C Y Y D S + G ATE +T +S+
Sbjct: 179 STSYKNISCSSALCKLVASGKKFSQSCSSST-CLYQVQYGDGSYSIGFFATETLTL-SSS 236
Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
N F N +FGCG N GL+GLGRT+L+L SQ ++ FSYCL +S
Sbjct: 237 NVFKNFLFGCGQQNN-GLFGGAAGLLGLGRTKLALPSQT-AKTYKKLFSYCL-----PAS 289
Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
+SK Y G +VS V T L + D T +Y + + G+SVG S A
Sbjct: 290 SSSKGYLSLGGQVS-KSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSI-------DESA 341
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-P 304
S G + ID+G T L Y+ L +N + P CY + P
Sbjct: 342 FSAGTV-IDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIP 400
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGV----FCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
+ F GG ++ + + I PV G+ FA D D IFGN Q + YD
Sbjct: 401 KVGVTFKGGVEMDIDVSG--ILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYD 458
Query: 361 FDSQMVSFKPTDCT 374
V F P C+
Sbjct: 459 GAKGRVGFAPGGCS 472
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 115/367 (31%), Positives = 167/367 (45%), Gaps = 30/367 (8%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSY 75
S A G YV + +GTP + +VDTGS L W+QC PC V C++Q P+++P +S +Y
Sbjct: 124 ASVAVGNYVTRLGLGTPATSYVM-VVDTGSSLTWLQCSPCSVSCHRQAGPVFDPRASGTY 182
Query: 76 KELSCQSEQCHLLDTV-----SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
+ C S +C L +CS +C Y Y DSS + G L+ + ++FG+ + F
Sbjct: 183 AAVQCSSSECGELQAATLNPSACSVSNVCIYQASYGDSSYSVGYLSKDTVSFGSGS--FP 240
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
+GCG +N G+F + GL+GL + +LSL Q+ LG FSYCL +S +
Sbjct: 241 GFYYGCGQDNEGLFGRSA-GLIGLAKNKLSLLYQLAPSLG-YAFSYCL-----PTSSAAA 293
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP--YYNSSGAISK 248
Y GS G + S D + YFVTL GISV + + +P Y S I
Sbjct: 294 GYLSIGSYNPGQYSYTPMASSSLDASLYFVTLSGISV---AGAPLAVPPSEYRSLPTI-- 348
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGIAPILT 307
ID+G T LP + Y L V A+ + P C++ + P +
Sbjct: 349 ----IDSGTVITRLPPNVYTALSRAVAAAMASAAPRAPTYSILDTCFRGSAAGLRVPRVD 404
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
F GGA + L + I + C A P G I GN Q + YD +
Sbjct: 405 MAFAGGATLALSPGNVLIDVD-DSTTCLAFAPT-GGTAIIGNTQQQTFSVVYDVAQSRIG 462
Query: 368 FKPTDCT 374
F C+
Sbjct: 463 FAAGGCS 469
>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 418
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 110/379 (29%), Positives = 179/379 (47%), Gaps = 61/379 (16%)
Query: 25 VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQ 84
V F+IGTPP I+D +L+W QC C +C+KQ P++ P +SS+++ C ++
Sbjct: 68 VANFTIGTPPQ-PASAIIDVAGELVWTQCSMCSRCFKQDLPLFVPNASSTFRPEPCGTDA 126
Query: 85 CHLLDTVSCSSQQLCNYTYGYADSSL---TKGVLATERITFGNSNNFFDNVVFGC----G 137
C + T +CSS +C Y G +S L T G++AT+ G + ++ FGC G
Sbjct: 127 CKSIPTSNCSS-NMCTY-EGTINSKLGGHTLGIVATDTFAIGTATA---SLGFGCVVASG 181
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
+ G GL+GLGR S ++SQ+ KFSYCL P DS S++ G+ +
Sbjct: 182 IDTMG----GPSGLIGLGRA----PSSLVSQMNITKFSYCLTPH--DSGKNSRLLLGSSA 231
Query: 198 EVSGGGVVSTSLVSK-----EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
+++GGG +T+ K + YY + L+GI G+ + + L P N+ +
Sbjct: 232 KLAGGGNSTTTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIA--LPPSGNT--------VL 281
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFD 311
+ T AP + L Y L+++V A+ P P LC+ ++ AP L F
Sbjct: 282 VQTLAPMSFLVDSAYQALKKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTFQ 341
Query: 312 GGAKVPLIHTSTFIPPPV--------EGVFCFAM--------QPIDGDVGIFGNFAQSDL 355
GA + +PPP +G C A+ +D ++ I G+ Q +
Sbjct: 342 QGA------AALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENT 395
Query: 356 FIGYDFDSQMVSFKPTDCT 374
D + + +SF+P DC+
Sbjct: 396 HFLLDLEKKTLSFEPADCS 414
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 117/371 (31%), Positives = 175/371 (47%), Gaps = 32/371 (8%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYG-IVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSS 74
+S +G Y +K +GTPP Y I+DTGS L W+QC PC V C+ Q P+Y+P+ S +
Sbjct: 118 LSIGSGNYYVKLGLGTPP--KYYAMILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKT 175
Query: 75 YKELSCQSEQCHLL------DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
YK+LSC S +C L D + + C YT Y D+S + G L+ + +T +S
Sbjct: 176 YKKLSCASVECSRLKAATLNDPLCETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQT- 234
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
+GCG +N G+F G++GL R +LS+ +Q+ ++ G + FSYCL ++ SS
Sbjct: 235 LPQFTYGCGQDNQGLFGR-AAGIIGLARDKLSMLAQLSTKYG-HAFSYCLPTANSGSSGG 292
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
+ G+ S S + L ++ + YF+ L I+V + + A+ +
Sbjct: 293 GFLSIGSISPTSYK--FTPMLTDSKNPSLYFLRLTAITVSGRP--------LDLAAAMYR 342
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA--P 304
ID+G T LP Y L Q I T Y S L C+K S+ I+ P
Sbjct: 343 VPTLIDSGTVITRLPMSMYAAL-RQAFVKIMSTKYAKAPAYSILDTCFKG-SLKSISAVP 400
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDFD 362
+ F GGA + L S I +G+ C A G + I GN Q I YD
Sbjct: 401 EIKMIFQGGADLTLRAPSILIEAD-KGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVS 459
Query: 363 SQMVSFKPTDC 373
+ + F P C
Sbjct: 460 TSRIGFAPGSC 470
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 128/396 (32%), Positives = 199/396 (50%), Gaps = 53/396 (13%)
Query: 12 VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
++S ++ +GEY M +GTPP I+DTGSDL W+QCLPC C+ Q + Y+P +
Sbjct: 150 TLESGMTLGSGEYFMDVLVGTPPK-HFSLILDTGSDLNWLQCLPCYDCFHQNEAFYDPKT 208
Query: 72 SSSYKELSCQSEQCHLLDT----VSCSSQ-QLCNYTYGYADSSLTKGVLATERITF---- 122
S+S+K ++C +C L+ + V C S Q C Y Y Y D S T G A E T
Sbjct: 209 SASFKNITCNDPRCSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTT 268
Query: 123 --GNSNNF-FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
G S+ + +N++FGCGH N G+F+ L+GLGR LS +SQ+ S G + FSYCLV
Sbjct: 269 TEGRSSEYKVENMMFGCGHWNRGLFSGASG-LLGLGRGPLSFSSQLQSLYG-HSFSYCLV 326
Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKED---KTYYFVTLEGISVGNLSNSSK 235
++D++++SK+ FG ++ ++ TS V+ ++ +T+Y++ ++ I VG +
Sbjct: 327 DRNSDTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVG---GEAL 383
Query: 236 LIPY--YNSS--GAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRN---AIKLTPYQ 284
IP +N S GA G ID+G + + Y N+ E+++ + P
Sbjct: 384 DIPEETWNISPDGA---GGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVFRDFPVL 440
Query: 285 DPRLGSQLCYKTPSMAGIA------PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ 338
DP C+ +++GI P L F GA ++FI E + C A+
Sbjct: 441 DP------CF---NVSGIEENNIHLPELGIAFADGAVWNFPAENSFIWLS-EDLVCLAIL 490
Query: 339 PI-DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
I GN+ Q + I YD + F PT C
Sbjct: 491 GTPKSTFSIIGNYQQQNFHILYDTKMSRLGFTPTKC 526
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 112/374 (29%), Positives = 155/374 (41%), Gaps = 55/374 (14%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S ++ +GEY +GTPP + ++DTGSD++W+QC PC QCY Q +++P S
Sbjct: 131 VVSGLAQGSGEYFASVGVGTPPTPALL-VLDTGSDVVWLQCAPCRQCYAQSGRVFDPRRS 189
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQL----CNYTYGYADSSLTKGVLATERITFGNSNNF 128
SY + C + C LD C Y Y D S+T G LATE + F
Sbjct: 190 RSYAAVRCGAPPCRGLDAGGGGGCDRRRGTCLYQVAYGDGSVTAGDLATETLWFARGAR- 248
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS-- 186
V GCGH+N G+F L RLSL +Q + G +FSYC D
Sbjct: 249 VPRVAVGCGHDNEGLFVAAAGLLGLGR-GRLSLPTQTARRYG-RRFSYCFQGSDLDHRTI 306
Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
I + G+ V G G S +L P +
Sbjct: 307 IRTVHQHVGGARVRGVG---------------------------ERSLRLDP------ST 333
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQL---CYKTPSMA 300
+G + +D+G T L + Y + E R A ++L P G L CY
Sbjct: 334 GRGGVILDSGTSVTRLARPVYVAVREAFRAAAGGLRLAPG-----GFSLFDTCYDLRGRR 388
Query: 301 GI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
+ P ++ H GGA+V L + IP G FC A+ DG V I GN Q + +
Sbjct: 389 VVKVPTVSVHLAGGAEVALPPENYLIPVDTRGTFCLALAGTDGGVSIVGNIQQQGFRVVF 448
Query: 360 DFDSQMVSFKPTDC 373
D D Q V+ P C
Sbjct: 449 DGDRQRVALVPKSC 462
>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
Length = 419
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 112/384 (29%), Positives = 177/384 (46%), Gaps = 62/384 (16%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPASSSSYKELSCQ 81
YV F+IGTPP + GIVD +L+W QC C C+KQ P+++P++S++Y+ C
Sbjct: 62 YVANFTIGTPPQA-VSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCG 120
Query: 82 SEQCHLLDTVSCSSQQLCNYTYGYADSSL---TKGVLATERITFGNSNNFFDNVVFGCGH 138
S C + T +CS C GY S+ T G+ +T+ I GN+ + FGC
Sbjct: 121 SPLCKSIPTRNCSGDGEC----GYEAPSMFGDTFGIASTDAIAIGNAEG---RLAFGCVV 173
Query: 139 NNTGVFN---ENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
+ G + + G VGLGRT SL + Q FSYCL P S ++ G
Sbjct: 174 ASDGSIDGAMDGPSGFVGLGRTPWSL----VGQSNVTAFSYCLAPHGPGKK--SALFLGA 227
Query: 196 GSEVSGGGVVS--TSLVSKE--------DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
++++G G + T L+ + YY V LEGI G+++ ++ SSG
Sbjct: 228 SAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAA------SSGG 281
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPI 305
+ + ++T P + LP Y LE+ V A+ +P LC++ +++G+ P
Sbjct: 282 GAITILQLETFRPLSYLPDAAYQALEKVVTAALGSPSMANPPEPFDLCFQNAAVSGV-PD 340
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVE---------GVFCFA------MQPIDGDVGIFGNF 350
L F GGA T PP + G C + + D V I G+
Sbjct: 341 LVFTFQGGA--------TLTAPPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSL 392
Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
Q ++ +D + + +SF+P DC+
Sbjct: 393 LQENVHFLFDLEKETLSFEPADCS 416
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 113/369 (30%), Positives = 167/369 (45%), Gaps = 33/369 (8%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQ 81
YV ++G ++ IVDTGSDL WVQC PC CY Q P+++PA+S ++ + C
Sbjct: 180 YVTTIALGGGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDPLFDPAASPTFAAVPCG 239
Query: 82 SEQC--HLLDTV----SCS-----SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
S C L D SC+ S+Q C Y Y D S ++GVLA + + G + D
Sbjct: 240 SPACAASLKDATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLAQDTLGLGTTTK-LD 298
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
VFGCG +N G+F GL+GLGRT LSL SQ ++ G FSYCL P T S T
Sbjct: 299 GFVFGCGLSNRGLFG-GTAGLMGLGRTDLSLVSQTAARFG-GVFSYCL-PATTTS--TGS 353
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
+ G G S + T +++ + +YF+ N++ ++ ++ G
Sbjct: 354 LSLGPGPSSSFPNMAYTRMIADPTQPPFYFI--------NITGAAVGGGAALTAPGFGAG 405
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTA 308
N+ +D+G T L Y + + + P CY + P+LT
Sbjct: 406 NVLVDSGTVITRLAPSVYKAVRAEFARRFEY-PAAPGFSILDACYDLTGRDEVNVPLLTL 464
Query: 309 HFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQ--PIDGDVGIFGNFAQSDLFIGYDFDSQM 365
+GGA+V + +G C AM P + I GN+ Q + + YD
Sbjct: 465 TLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIGNYQQRNKRVVYDTVGSR 524
Query: 366 VSFKPTDCT 374
+ F DCT
Sbjct: 525 LGFADEDCT 533
>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
Length = 493
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 112/367 (30%), Positives = 156/367 (42%), Gaps = 28/367 (7%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSCQ 81
EYV+ +G+PP ++DTGSD+ WV+C PC QC QV P+++P+ SS+Y SC
Sbjct: 139 EYVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCWQQCRPQVDPLFDPSLSSTYSPFSCS 198
Query: 82 SEQCHLL----DTVSCSSQQLCNYTYGYADSSL-TKGVLATERITFGNSNN--FFDNVVF 134
S C L + CSS C Y Y D S+ T G +++ + G+++N F
Sbjct: 199 SAACAQLFQEGNANGCSSSGQCQYIAMYGDGSVGTTGTYSSDTLALGSNSNTVVVSKFRF 258
Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
GC H TG+ + G + SL SQ G FSYCL P + S + G
Sbjct: 259 GCSHAETGITGLTAGLMGLGGGAQ-SLVSQTAGTFGTTAFSYCLPPTPSSSGF---LTLG 314
Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFID 254
S G V + L S + +Y V LE I VG S IP + M +D
Sbjct: 315 AAGTSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLS---IPT-----TVFSAGMIMD 366
Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS---QLCYKTPSMAGIA-PILTAHF 310
+G T LP Y+ L + +K P G C+ + ++ P + F
Sbjct: 367 SGTVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSAGGGFLDTCFDMSGQSSVSMPTVALVF 426
Query: 311 D--GGAKVPLIHTSTFIPPPVEGVFCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMV 366
GGA V L + + +FC A DG GI GN Q + YD V
Sbjct: 427 SGAGGAVVNLDASGILLQMETSSIFCLAFVATSDDGSTGIIGNVQQRTFQVLYDVAGGAV 486
Query: 367 SFKPTDC 373
FK C
Sbjct: 487 GFKAGAC 493
>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
Length = 431
Score = 137 bits (346), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 113/357 (31%), Positives = 171/357 (47%), Gaps = 26/357 (7%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
Y + IGTPP L I DT SDL W QC KQV+P+++PA SSS+ ++C S+
Sbjct: 91 YTVTIGIGTPPQLHTL-IADTASDLTWTQCNLFNDTAKQVEPLFDPAKSSSFAFVTCSSK 149
Query: 84 QCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHNN 140
C + T CS++ C Y Y Y S GVLA E T ++N + FGCG
Sbjct: 150 LCTEDNPGTKRCSNKT-CRYVYPYV-SVEAAGVLAYESFTLSDNNQHICMSFGFGCGALT 207
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
G G++G+ LS+ +SQL KFSYCL P+ TD +S ++FG +++
Sbjct: 208 DGNL-LGASGILGMSPAILSM----VSQLAIPKFSYCLTPY-TDRK-SSPLFFGAWADL- 259
Query: 201 GGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPT 260
G +T + K YY+V L G+S+G ++ + ++ A+ +G +D G
Sbjct: 260 -GRYKTTGPIQKSLTFYYYVPLVGLSLG-----TRRLDVPAATFALKQGGTVVDLGCTVG 313
Query: 261 LLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA----PILTAHFDGGAKV 316
L + + L+E V + + L ++C+ PS + P L +FDGGA +
Sbjct: 314 QLAEPAFTALKEAVLHTLNLPLTNRTVKDYKVCFALPSGVAMGAVQTPPLVLYFDGGADM 373
Query: 317 PLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
L + F P G+ C A+ P G + I GN Q + + +D F PT C
Sbjct: 374 VLPRDNYF-QEPTAGLMCLALVP-GGGMSIIGNVQQQNFHLLFDVHDSKFLFAPTIC 428
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 111/385 (28%), Positives = 169/385 (43%), Gaps = 46/385 (11%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ IGTP + +DTGSDL+W QC C C+ Q P++ + S ++ + C
Sbjct: 93 EYLIHLGIGTPRPQRVVLHLDTGSDLVWTQCA-CTVCFDQPVPVFRASVSHTFSRVPCSD 151
Query: 83 EQCH---LLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITF-----GNSNNFFDNVV 133
C L C+++ + C Y YGY D S+T G +A + TF ++ N+
Sbjct: 152 PLCGHAVYLPLSGCAARDRSCFYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPNIR 211
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
FGCG N G+F N+ G+ G G LSL SQL +FSYC +S ++ +
Sbjct: 212 FGCGMMNYGLFTPNQSGIAGFGTGPLSLP----SQLKVRRFSYCFTAME-ESRVSPVILG 266
Query: 194 GNGSEVSG---GGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
G + G + ST + +YF++L G++VG +P+ S+
Sbjct: 267 GEPENIEAHATGPIQSTPFAPGPAGAPVGSQPFYFLSLRGVTVGETR-----LPFNASTF 321
Query: 245 AIS---KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKL---TPYQDPRLGSQLCYKTPS 298
A+ G FID+G T P+ + L E + L Y DP + LC+ P+
Sbjct: 322 ALKGDGSGGTFIDSGTAITFFPQAVFRSLREAFVAQVPLPVAKGYTDP--DNLLCFSVPA 379
Query: 299 --MAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-----FCFAMQPIDGDVG-IFGNF 350
A P L H + GA L + + +G C + G I GNF
Sbjct: 380 KKKAPAVPKLILHLE-GADWELPRENYVLDNDDDGSGAGRKLCVVILSAGNSNGTIIGNF 438
Query: 351 AQSDLFIGYDFDSQMVSFKPTDCTK 375
Q ++ I YD +S + F P C K
Sbjct: 439 QQQNMHIVYDLESNKMVFAPARCDK 463
>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
Length = 453
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 122/392 (31%), Positives = 187/392 (47%), Gaps = 45/392 (11%)
Query: 9 PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
P Q+ + +G+Y M F IGTP + G DTGSDL+W +C C +C + P Y
Sbjct: 77 PGESAQTPLKKGSGDYAMSFGIGTP-ATGLSGEADTGSDLIWTKCGACARCSPRGSPSYY 135
Query: 69 PASSSSYKELSCQSEQCHLLDTVSCSSQQL-------CNYTYGYADSS----LTKGVLAT 117
P SSSS ++C C L CS+ C+Y Y Y ++ T+G+L T
Sbjct: 136 PTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMT 195
Query: 118 ERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
E TFG+ F + FGC + G F GLVGLGR +LSL ++QL F Y
Sbjct: 196 ETFTFGDDAAAFPGIAFGCTLRSEGGFGTGS-GLVGLGRGKLSL----VTQLNVEAFGYR 250
Query: 178 LVPFHTDSSITSKMYFGNGSEVSGG---GVVSTSLVSK---EDKTYYFVTLEGISVGN-- 229
L +D S S + FG+ ++V+GG +ST L++ +D +Y+V L GISVG
Sbjct: 251 L---SSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKL 307
Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
+ S + S+GA G + D+G T+LP Y + +++ + + +Q P
Sbjct: 308 VQIPSGTFSFDRSTGA---GGVIFDSGTTLTMLPDPAYTLVRDELLSQMG---FQKPPPA 361
Query: 290 SQ----LCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPV----EGVFCFAMQPID 341
+ +C+ S P + HFDGGA + L T ++P E C+++
Sbjct: 362 ANDDDLICFTGGSSTTTFPSMVLHFDGGADMDL-STENYLPQMQGQNGETARCWSVVKSS 420
Query: 342 GDVGIFGNFAQSDLFIGYDF--DSQMVSFKPT 371
+ I GN Q D + +D +++M+ PT
Sbjct: 421 QALTIIGNIMQMDFHVVFDLSGNARMLFQPPT 452
>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
Length = 453
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 122/392 (31%), Positives = 187/392 (47%), Gaps = 45/392 (11%)
Query: 9 PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
P Q+ + +G+Y M F IGTP + G DTGSDL+W +C C +C + P Y
Sbjct: 77 PGESAQTPLKKGSGDYAMSFGIGTP-ATGLSGEADTGSDLIWTKCGACARCSPRGSPSYY 135
Query: 69 PASSSSYKELSCQSEQCHLLDTVSCSSQQL-------CNYTYGYADSS----LTKGVLAT 117
P SSSS ++C C L CS+ C+Y Y Y ++ T+G+L T
Sbjct: 136 PTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMT 195
Query: 118 ERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
E TFG+ F + FGC + G F GLVGLGR +LSL ++QL F Y
Sbjct: 196 ETFTFGDDAAAFPGIAFGCTLRSEGGFGTGS-GLVGLGRGKLSL----VTQLNVEAFGYR 250
Query: 178 LVPFHTDSSITSKMYFGNGSEVSGG---GVVSTSLVSK---EDKTYYFVTLEGISVGN-- 229
L +D S S + FG+ ++V+GG +ST L++ +D +Y+V L GISVG
Sbjct: 251 L---SSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKL 307
Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
+ S + S+GA G + D+G T+LP Y + +++ + + +Q P
Sbjct: 308 VQIPSGTFSFDRSTGA---GGVIFDSGTTLTMLPDPAYTLVRDELLSQMG---FQKPPPA 361
Query: 290 SQ----LCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPV----EGVFCFAMQPID 341
+ +C+ S P + HFDGGA + L T ++P E C+++
Sbjct: 362 ANDDDLICFTGGSSTTTFPSMVLHFDGGADMDL-STENYLPQMQGQNGETARCWSVVKSS 420
Query: 342 GDVGIFGNFAQSDLFIGYDF--DSQMVSFKPT 371
+ I GN Q D + +D +++M+ PT
Sbjct: 421 QALTIIGNIMQMDFHVVFDLSGNARMLFQPPT 452
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 118/366 (32%), Positives = 171/366 (46%), Gaps = 32/366 (8%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSC 80
EYV+ IGTP + ++DTGSDL WVQC PC CY Q P+++P+ SS++ + C
Sbjct: 124 EYVVTLGIGTPAVQQTV-LIDTGSDLSWVQCKPCNASDCYPQKDPLFDPSKSSTFATIPC 182
Query: 81 QSEQCHLLDTV----SCSSQQ-----LCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
S+ C L C++ C Y Y + ++T+GV +TE + G S+ +
Sbjct: 183 ASDACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETLALG-SSAVVKS 241
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
FGCG + G +++ + GL+GLG SL SQ S G FSYCL P ++ + + +
Sbjct: 242 FRFGCGSDQHGPYDKFD-GLLGLGGAPESLVSQTASVYG-GAFSYCLPPLNSGAGFLT-L 298
Query: 192 YFGNGSEVSGGGVVSTSL--VSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
N + S G V T + S + T+Y VTL GISVG + IP +KG
Sbjct: 299 GAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGKALD---IP----PAVFAKG 351
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGI-APILT 307
N+ +D+G T +P Y L R+A+ P P + CY + P +
Sbjct: 352 NI-VDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPADSALDTCYNFTGHGTVTVPKVA 410
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
F GGA V L S + VE FA DG GI GN + + YD +
Sbjct: 411 LTFVGGATVDLDVPSGVL---VEDCLAFA-DAGDGSFGIIGNVNTRTIEVLYDSGKGHLG 466
Query: 368 FKPTDC 373
F+ C
Sbjct: 467 FRAGAC 472
>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
Length = 570
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 117/390 (30%), Positives = 165/390 (42%), Gaps = 83/390 (21%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC-----------LPCVQCYK 61
V S V + + EY+M ++G+PP + I DTGSDL+WV+C P Q
Sbjct: 90 VVSKVVSRSFEYLMTVNLGSPPR-SMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQ--- 145
Query: 62 QVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERIT 121
++P+ SS+Y +SCQ++ C L +C C Y Y Y D S T GVL+TE T
Sbjct: 146 -----FDPSRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFT 200
Query: 122 FGNSNN-------FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLG---- 170
F + V FGC G F + + A +++QLG
Sbjct: 201 FDDGGAGRSPRQVRIGGVKFGCSTATAGSFPADGL------VGLGGGAVSLVTQLGGATS 254
Query: 171 -ANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN 229
+FSYCLVP ++S S + FG ++V+ G ST L VGN
Sbjct: 255 LGRRFSYCLVPHSVNAS--SALNFGALADVTEPGAASTPL-----------------VGN 295
Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
+ +S + + +D+G T L + +++ I L P Q P
Sbjct: 296 KTVASA-----------ASSRIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGL 344
Query: 290 SQLCYKTPSMAGIA----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM------QP 339
QLCY A P LT F GGA V L + F+ EG C A+ QP
Sbjct: 345 LQLCYNVAGREVEAGESIPDLTLEFGGGAAVALKPENAFVAVQ-EGTLCLAIVATTEQQP 403
Query: 340 IDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
V I GN AQ ++ +GYD D+ V K
Sbjct: 404 ----VSILGNLAQQNIHVGYDLDAGTVGNK 429
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 49/165 (29%), Positives = 71/165 (43%), Gaps = 23/165 (13%)
Query: 227 VGNLSNSSKLIPYYNSSGAI--------SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI 278
+GNL+ + + Y +G + + + +D+G T L + +++ I
Sbjct: 407 LGNLAQQNIHVGYDLDAGTVGNKTVASAASSRIIVDSGTTLTFLDPSLLGPIVDELSRRI 466
Query: 279 KLTPYQDPRLGSQLCYKTPSMAGIA----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFC 334
L P Q P QLCY A P LT F GGA V L + F+ EG C
Sbjct: 467 TLPPVQSPDGLLQLCYNVAGREVEAGESIPDLTLEFGGGAAVALKPENAFVAVQ-EGTLC 525
Query: 335 FAM------QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
A+ QP V I GN AQ ++ +GYD D+ V+F DC
Sbjct: 526 LAIVATTEQQP----VSILGNLAQQNIHVGYDLDAGTVTFAVADC 566
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 110/363 (30%), Positives = 152/363 (41%), Gaps = 22/363 (6%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASS 72
S S EYV+ +IGTP + + I DTGSD+ WVQC PC C Q +++PA S
Sbjct: 120 SGYSLGTTEYVITVTIGTPAVTQVMSI-DTGSDVSWVQCAPCAAQSCSSQKDKLFDPAMS 178
Query: 73 SSYKELSCQSEQC-HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
++Y SC S QC L D + + C Y Y D S T G ++ ++ S++ +
Sbjct: 179 ATYSAFSCGSAQCAQLGDEGNGCLKSQCQYIVKYGDGSNTAGTYGSDTLSL-TSSDAVKS 237
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
FGC H G E + GL+GLG SL SQ + G FSYCL P SS +
Sbjct: 238 FQFGCSHRAAGFVGELD-GLMGLGGDTESLVSQTAATYG-KAFSYCLPP--PSSSGGGFL 293
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
G S T +V T+Y V L+GI+V N ++ G
Sbjct: 294 TLGAAGGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGT--------MLNVPASVFSGAS 345
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHF 310
+D+G T LP Y L + +K P P C+ I P +T F
Sbjct: 346 VVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGSLDTCFDFSGFNTITVPTVTLTF 405
Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
GA + L + G F DGD GI GN Q + +D + + F+
Sbjct: 406 SRGAAMDLDISGILY----AGCLAFTATAHDGDTGILGNVQQRTFEMLFDVGGRTIGFRS 461
Query: 371 TDC 373
C
Sbjct: 462 GAC 464
>gi|449467979|ref|XP_004151699.1| PREDICTED: probable aspartic protease At2g35615-like, partial
[Cucumis sativus]
Length = 209
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 62/132 (46%), Positives = 89/132 (67%), Gaps = 4/132 (3%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+Q+ ++ +GEY+M SIGTPP+ D G+ DTGSDLMW QCLPC++CYKQ +PI++P S
Sbjct: 81 LQAPLTPGSGEYLMSVSIGTPPV-DYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDPLKS 139
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
+S+ + C S+ C +D C +Q +C+Y+Y Y D + TKG L E+IT G+S+
Sbjct: 140 TSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKITIGSSSV---KS 196
Query: 133 VFGCGHNNTGVF 144
V GCGH + G F
Sbjct: 197 VIGCGHESGGGF 208
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 119/369 (32%), Positives = 162/369 (43%), Gaps = 38/369 (10%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSC 80
+YV+ GTP + + ++DTGSDL WVQC PC CY Q P+++P++SS+Y + C
Sbjct: 121 QYVVTLGFGTPAVPQVL-LIDTGSDLSWVQCQPCNSSTCYPQKDPVFDPSASSTYAPVPC 179
Query: 81 QSEQCHLLD--------TVSCSSQQLCNYTYGYADSSLTKGVLATERITFG-NSNNFFDN 131
SE C LD T S S LC Y Y + T GV +TE +T + +N
Sbjct: 180 GSEACRDLDPDSYANGCTNSSSGASLCQYGIQYGNGDTTVGVYSTETLTLSPEAATVVNN 239
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
FGCG GVF+ + L G SL SQ G FSYCL + ++
Sbjct: 240 FSFGCGLVQKGVFDLFDGLLGLGGAPE-SLVSQTTGTYGG-AFSYCL-----PAGNSTAG 292
Query: 192 YFGNGSEVSGG----GVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
+ G+ +GG G T L E T+Y V L GISVG + +
Sbjct: 293 FLALGAPATGGNNTAGFQFTPLQVVE-TTFYLVKLTGISVGGKQ--------LDIEPTVF 343
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYK-TPSMAGIAP 304
G M ID+G T LP+ Y+ L R+A+ P P L CY T + P
Sbjct: 344 AGGMIIDSGTIVTGLPETAYSALRTAFRSAMSAYPLLPPNDDEDLDTCYDFTGNTNVTVP 403
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
+ F+GG + L S + ++G F DGD GI GN Q + YD
Sbjct: 404 TVALTFEGGVTIDLDVPSGVL---LDGCLAFVAGASDGDTGIIGNVNQRTFEVLYDSARG 460
Query: 365 MVSFKPTDC 373
V F+ C
Sbjct: 461 HVGFRAGAC 469
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 118/370 (31%), Positives = 169/370 (45%), Gaps = 23/370 (6%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYN 68
N +S +S G YV+ +GTP + DTGSD WVQC PCV CY+Q +P++
Sbjct: 151 NLPAKSGLSLNTGNYVVPIRLGTP-AARFTVVFDTGSDTTWVQCQPCVAYCYQQKEPLFT 209
Query: 69 PASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
P S++Y +SC S C LDT CS C Y Y D S T G A + +T G +
Sbjct: 210 PTKSATYANISCTSSYCSDLDTRGCSGGH-CLYAVQYGDGSYTVGFYAQDTLTLG--YDT 266
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
+ FGCG N G+F + GL+GLGR + S+ Q + + F+YC+ SS T
Sbjct: 267 VKDFRFGCGEKNRGLFGK-AAGLMGLGRGKTSVPVQAYDKY-SGVFAYCI---PATSSGT 321
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
+ FG G+ + ++ LV T+Y+V + GI VG S IP + S
Sbjct: 322 GFLDFGPGAPAAANARLTPMLVD-NGPTFYYVGMTGIKVGGHLLS---IP----ATVFSD 373
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAG-IA-P 304
+D+G T LP Y L ++ Y+ S L CY G IA P
Sbjct: 374 AGALVDSGTVITRLPPSAYEPLRSAFAKGMEGLGYKTAPAFSILDTCYDLTGYQGSIALP 433
Query: 305 ILTAHFDGGAKVPLIHTST-FIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
++ F GGA + + + ++ + FA D D+ I GN Q + YD
Sbjct: 434 AVSLVFQGGACLDVDASGILYVADVSQACLAFAANDDDTDMTIVGNTQQKTYSVLYDLGK 493
Query: 364 QMVSFKPTDC 373
++V F P C
Sbjct: 494 KVVGFAPGAC 503
>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
Length = 471
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 119/380 (31%), Positives = 183/380 (48%), Gaps = 40/380 (10%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP--CVQCYKQVKPIYNPASS 72
S +S + YVMKF+IG+PP+ + Y I DTGS+++W+QC C CYKQ P++NP S
Sbjct: 99 SRISIIDKVYVMKFNIGSPPV-ETYAIPDTGSNIVWIQCGSPICTNCYKQKIPLFNPTKS 157
Query: 73 SSYKELSCQSEQCH-----LLDTVSC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
S+Y C +C L + + C SS Q+C Y Y D S ++G ++T+ ITF
Sbjct: 158 STYAIRLCGHRECKQALWGLGEYLGCKSSVQVCRYHISYEDHSFSEGTISTDIITFPEHI 217
Query: 127 NFFDN----VVFGCGHNNTGVFNEN-----EMGLVGLGRTRLSLASQILSQLGANKFSYC 177
F N + FGCG+NN+ ++ G+VGLG SL + QL +FSYC
Sbjct: 218 AEFGNYSLRMFFGCGYNNSETPGQDPNSFTAPGVVGLGNEMASL----VGQLTLGQFSYC 273
Query: 178 L-VPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKL 236
+ P + T ++ FG + +SG ST+L + + Y F ++GI V + K
Sbjct: 274 ISTPDVQKPNGTIEIRFGLAASISGH---STALANNLEGWYIFQNVDGIYVDD--TKVKG 328
Query: 237 IPYYN---SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ-- 291
P + + G I G + +D+G T L + L +++ I+L P S
Sbjct: 329 YPEWVFQFAEGGI--GGLIMDSGTTYTELYFSALDALIGELKEQIELAPDTQDHSNSNYS 386
Query: 292 LCYKTPS-MAGIAPILTAHF--DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFG 348
LCY + + P + F + A P + +I + +C AM G + I G
Sbjct: 387 LCYNAANFLLTYVPAIELKFTDNKEAYFPFTLRNAWIDNGNDQ-YCLAMFGTSG-ISIIG 444
Query: 349 NFAQSDLFIGYDFDSQMVSF 368
+ D+ IGYD +VSF
Sbjct: 445 IYQHRDIKIGYDLKYNLVSF 464
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 121/385 (31%), Positives = 178/385 (46%), Gaps = 57/385 (14%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQC-------LPCVQCYKQVKPIYNPASSSSYK 76
+ + IGTPP IVDTGSDL+W QC +Q +P+Y P SSS+
Sbjct: 84 HSLTVGIGTPPQPRTL-IVDTGSDLIWTQCSMLSRRTRTAASASRQREPLYEPRRSSSFA 142
Query: 77 ELSCQSEQCH--LLDTVSCSSQQLCNY--TYGYADSSLTKGVLATERITFGNSNNFFDNV 132
L C C +C+ C Y YG A++ GVLA+E TFG + +
Sbjct: 143 YLPCSDRLCQEGQFSYKNCARNNRCMYDELYGSAEAG---GVLASETFTFGVNAKVSLPL 199
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGCG + G GL+GL +SL +SQL +FSYCL PF TS +
Sbjct: 200 GFGCGALSAGDL-VGASGLMGLSPGIMSL----VSQLSVPRFSYCLTPFAERK--TSPLL 252
Query: 193 FGNGSEV----SGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
FG +++ + G V +TS++ + YY+V L G+S+G + +L S G I
Sbjct: 253 FGAMADLRRYRTTGTVQTTSILRNPAMETAYYYVPLVGLSLG----TKRLDVPATSLGMI 308
Query: 247 S---KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKL-------TPYQDPRLGSQLCYKT 296
G +D+G+ + L + + +++ V A++L Y D +LC+
Sbjct: 309 KPDGSGGTIVDSGSTMSYLEETAFRAVKKAVVEAVRLPVANGTDEDYDD----YELCFAL 364
Query: 297 PSMAGIA------PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM--QPIDGDVGIFG 348
P+ G+A P L HFDGGA + L + F P G+ C A+ P V I G
Sbjct: 365 PT--GVAMEAVKTPPLVLHFDGGAAMTLPRDNYF-QEPRAGLMCLAVGTSPDGFGVSIIG 421
Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDC 373
N Q ++ + +D +Q SF PT C
Sbjct: 422 NVQQQNMHVLFDVRNQKFSFAPTKC 446
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 111/358 (31%), Positives = 162/358 (45%), Gaps = 23/358 (6%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSC 80
EY++ GTP + + ++DTGSD+ WVQC PC +CY Q P+++P+ SS+Y ++C
Sbjct: 124 EYMVTLGFGTPSVPQVL-LMDTGSDVSWVQCAPCNSTECYPQKDPLFDPSKSSTYAPIAC 182
Query: 81 QSEQCHLLD---TVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
++ C+ L C+S C Y Y D S T+GV + E ITF D FGC
Sbjct: 183 GADACNKLGDHYRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETITFAPGITVKD-FHFGC 241
Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
GH+ G ++ + GL+GLG SL Q S G FSYCL ++++ + +
Sbjct: 242 GHDQRGPSDKFD-GLLGLGGAPESLVVQTASVYGG-AFSYCLPALNSEAGFLALGVRPSA 299
Query: 197 SEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
+ + V + D T Y V + GISVG IP + +G M ID+G
Sbjct: 300 ATNTSAFVFTPMWHLPMDATSYMVNMTGISVG---GKPLDIPR-----SAFRGGMLIDSG 351
Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAK 315
T LP+ YN L +R A P CY + + P + F GGA
Sbjct: 352 TIVTELPETAYNALNAALRKAFAAYPMVASE-DFDTCYNFTGYSNVTVPRVALTFSGGAT 410
Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ L + + V+ F D +GI GN Q L + YD V F+ C
Sbjct: 411 IDLDVPNGIL---VKDCLAFRESGPDVGLGIIGNVNQRTLEVLYDAGHGKVGFRAGAC 465
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 113/361 (31%), Positives = 157/361 (43%), Gaps = 33/361 (9%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELSC 80
EYV+ S+GTP + + I DTGSD+ WVQC PC C Q +++PA S++Y SC
Sbjct: 129 EYVITVSLGTPAVTQVMSI-DTGSDVSWVQCAPCAAQSCSSQKDKLFDPAKSATYSAFSC 187
Query: 81 QSEQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
S QC L + C + C Y Y D S T G ++ + S+ N FGC H
Sbjct: 188 SSAQCAQLGGEGNGCLNSH-CQYIVKYVDHSNTTGTYGSDTLGLTTSDA-VKNFQFGCSH 245
Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
G + + GL+GLG SL SQ + G FSYCL P + SS + G
Sbjct: 246 RANGFVGQLD-GLMGLGGDTESLVSQTAATYG-KAFSYCLPP--SSSSAGGFLTLG---- 297
Query: 199 VSGGGVVS-----TSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
+ GG S T LV T+Y V L+ I+V +KL N ++ G +
Sbjct: 298 AAAGGTSSSRYSRTPLVRFNVPTFYGVFLQAITVAG----TKL----NVPASVFSGASVV 349
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDG 312
D+G T LP Y L + +K P P C+ + + P++T F
Sbjct: 350 DSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGILDTCFDFSGIKTVRVPVVTLTFSR 409
Query: 313 GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
GA + L + F G F DGD GI GN Q + +D + F+P
Sbjct: 410 GAVMDLDVSGIFY----AGCLAFTATAQDGDTGILGNVQQRTFEMLFDVGGSTLGFRPGA 465
Query: 373 C 373
C
Sbjct: 466 C 466
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 109/350 (31%), Positives = 162/350 (46%), Gaps = 37/350 (10%)
Query: 41 IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDT--------VS 92
IVDT S+L WVQC PC C+ Q P+++PASS SY L C S C L
Sbjct: 141 IVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACG 200
Query: 93 CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLV 152
Q C+YT Y D S ++GVLA ++++ + D VFGCG +N G F GL+
Sbjct: 201 GGEQPSCSYTLSYRDGSYSQGVLAHDKLSL--AGEVIDGFVFGCGTSNQGPFGGTS-GLM 257
Query: 153 GLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV--SGGGVVSTSLV 210
GLGR++LSL SQ + Q G FSYCL ++SS + G+ + V + +V T++V
Sbjct: 258 GLGRSQLSLISQTMDQFGG-VFSYCLPLKESESS--GSLVLGDDTSVYRNSTPIVYTTMV 314
Query: 211 SKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNR 269
S + +YFV L GI++G S S G + +D+G T L YN
Sbjct: 315 SDPVQGPFYFVNLTGITIGGQEVES------------SAGKVIVDSGTIITSLVPSVYNA 362
Query: 270 LEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTST--FI 325
++ + + P Q P C+ + P L F+G +V + + F+
Sbjct: 363 VKAEFLSQFAEYP-QAPGFSILDTCFNLTGFREVQIPSLKFVFEGNVEVEVDSSGVLYFV 421
Query: 326 PPPVEGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
V C A+ + + I GN+ Q +L + +D + F C
Sbjct: 422 SSDSSQV-CLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETC 470
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 118/370 (31%), Positives = 176/370 (47%), Gaps = 41/370 (11%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
++M FSIG PP+ + ++DTGS L WV C PC C +Q PI++P+ SS+Y LSC
Sbjct: 93 FLMNFSIGEPPIPQL-AVMDTGSSLTWVMCHPCSSCSQQSVPIFDPSKSSTYSNLSCS-- 149
Query: 84 QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV---VFGCGH-- 138
+C+ D V+ C Y+ Y S ++G+ A E++T + V +FGCG
Sbjct: 150 ECNKCDVVNGE----CPYSVEYVGSGSSQGIYAREQLTLETIDESIIKVPSLIFGCGRKF 205
Query: 139 --NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
++ G + G+ GLG R SL L G KFSYC+ + +++ G+
Sbjct: 206 SISSNGYPYQGINGVFGLGSGRFSL----LPSFG-KKFSYCIGNLRNTNYKFNRLVLGDK 260
Query: 197 SEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP-YYNSSGAISKGNMFIDT 255
+ + G ST+L Y+V LE IS+G + P + S + + ID+
Sbjct: 261 ANMQGD---STTL--NVINGLYYVNLEAISIG--GRKLDIDPTLFERSITDNNSGVIIDS 313
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIK---LTPYQDPRLGSQLCYK---TPSMAGIAPILTAH 309
GA T L K + L +V N ++ + QD LCY + ++G P++T H
Sbjct: 314 GADHTWLTKYGFEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSGF-PLVTFH 372
Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID--GD----VGIFGNFAQSDLFIGYDFDS 363
F GA + L TS FI E FC AM P + GD G AQ + +GYD +
Sbjct: 373 FAEGAVLDLDVTSMFI-QTTENEFCMAMLPGNYFGDDYESFSSIGMLAQQNYNVGYDLNR 431
Query: 364 QMVSFKPTDC 373
V F+ DC
Sbjct: 432 MRVYFQRIDC 441
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 109/350 (31%), Positives = 162/350 (46%), Gaps = 37/350 (10%)
Query: 41 IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDT--------VS 92
IVDT S+L WVQC PC C+ Q P+++PASS SY L C S C L
Sbjct: 140 IVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACG 199
Query: 93 CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLV 152
Q C+YT Y D S ++GVLA ++++ + D VFGCG +N G F GL+
Sbjct: 200 GGEQPSCSYTLSYRDGSYSQGVLAHDKLSL--AGEVIDGFVFGCGTSNQGPFGGTS-GLM 256
Query: 153 GLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV--SGGGVVSTSLV 210
GLGR++LSL SQ + Q G FSYCL ++SS + G+ + V + +V T++V
Sbjct: 257 GLGRSQLSLISQTMDQFGG-VFSYCLPLKESESS--GSLVLGDDTSVYRNSTPIVYTTMV 313
Query: 211 SKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNR 269
S + +YFV L GI++G S S G + +D+G T L YN
Sbjct: 314 SDPVQGPFYFVNLTGITIGGQEVES------------SAGKVIVDSGTIITSLVPSVYNA 361
Query: 270 LEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTST--FI 325
++ + + P Q P C+ + P L F+G +V + + F+
Sbjct: 362 VKAEFLSQFAEYP-QAPGFSILDTCFNLTGFREVQIPSLKFVFEGNVEVEVDSSGVLYFV 420
Query: 326 PPPVEGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
V C A+ + + I GN+ Q +L + +D + F C
Sbjct: 421 SSDSSQV-CLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETC 469
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 120/374 (32%), Positives = 171/374 (45%), Gaps = 35/374 (9%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
++ST N YV +GTP ++ +DTGSD WVQC PC CY+Q P+++P +SS+Y
Sbjct: 133 SLSTTN--YVASLRLGTP-ATELVVELDTGSDQSWVQCKPCADCYEQRDPVFDPTASSTY 189
Query: 76 KELSCQSEQCHLL------DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
+ C + +C L S + + C Y Y D S T G LA + +T S +
Sbjct: 190 SAVPCGARECQELASSSSSRNCSSDNNKNCPYEVSYDDDSHTVGDLARDTLTLSPSPSPS 249
Query: 130 --DNV---VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
D V VFGCGH+N G F E + L+GLG + SL SQ+ ++ GA FSYCL
Sbjct: 250 PADTVPGFVFGCGHSNAGTFGEVDG-LLGLGLGKASLPSQVAARYGA-AFSYCL-----P 302
Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
SS ++ Y G + T +V+ +D T Y++ L GI V + + ++G
Sbjct: 303 SSPSAAGYLSFGGAAARANAQFTEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAFATAAG 362
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMA 300
I ID+G + LP Y L R+A+ Y+ R S CY
Sbjct: 363 TI------IDSGTAFSRLPPSAYAALRSSFRSAMGRYRYK--RAPSSPIFDTCYDFTGHE 414
Query: 301 GIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
+ P + F GA V L + C A P + D+GI GN Q L + Y
Sbjct: 415 TVRIPAVELVFADGATVHLHPSGVLYTWNDVAQTCLAFVP-NHDLGILGNTQQRTLAVIY 473
Query: 360 DFDSQMVSFKPTDC 373
D SQ + F C
Sbjct: 474 DVGSQRIGFGRKGC 487
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 118/379 (31%), Positives = 177/379 (46%), Gaps = 37/379 (9%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPAS 71
++S +S +G Y +K +G+P IVDTGS W+QC PC + C+ Q P++NP++
Sbjct: 92 LKSGLSMGSGNYYVKMGLGSPTKYYTM-IVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSA 150
Query: 72 SSSYKELSCQSEQCHL-----LDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNS 125
S +YK + C S QC L+ +CS Q C Y Y DSS + G L+ + +T S
Sbjct: 151 SKTYKTVPCSSSQCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPS 210
Query: 126 NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL-VPFHTD 184
+ V+GCG +N G+F + G++GL LS+ SQ+ + G N FSYCL F T
Sbjct: 211 QT-LSSFVYGCGQDNQGLFGRTD-GIIGLANNELSMLSQLSGKYG-NAFSYCLPTSFSTP 267
Query: 185 SSITSK-MYFGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVG----NLSNSSKLIP 238
+S + G S T L+ + + YF+ LE I+V ++ SS +P
Sbjct: 268 NSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVP 327
Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG-SQLCYKTP 297
ID+G T LP Y L+ + Q P + C+K
Sbjct: 328 ------------TIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKG- 374
Query: 298 SMAGI---APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
S+AGI AP + F GGA + L ++ + G+ C AM + I GN+ Q
Sbjct: 375 SLAGISEVAPDIRIIFKGGADLQLKGHNSLVELET-GITCLAMAG-SSSIAIIGNYQQQT 432
Query: 355 LFIGYDFDSQMVSFKPTDC 373
+ + YD + V F P C
Sbjct: 433 VKVAYDVGNSRVGFAPGGC 451
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 113/380 (29%), Positives = 172/380 (45%), Gaps = 42/380 (11%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
+N+ T N YV +G + +VDT S+L WVQC PC C+ Q P+++P+SS S
Sbjct: 113 ANLRTLN--YVATVGLGAA---EATVVVDTASELTWVQCQPCESCHDQQDPLFDPSSSPS 167
Query: 75 YKELSCQSEQCHLLD------TVSCS----SQQLCNYTYGYADSSLTKGVLATERITFGN 124
Y + C S C L T C+ Q C+Y Y D S ++GVLA +++
Sbjct: 168 YAAVPCNSSSCDALRVAMAAGTSPCADDNEQQPACSYALSYRDGSYSRGVLARDKLRLAG 227
Query: 125 SNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
+ + VFGCG +N G GL+GLGR+ +SL SQ + Q G FSYCL P +
Sbjct: 228 QD--IEGFVFGCGTSNQGAPFGGTSGLMGLGRSHVSLVSQTMDQFGG-VFSYCL-PMR-E 282
Query: 185 SSITSKMYFGNGSEV--SGGGVVSTSLVSKE---DKTYYFVTLEGISVGNLSNSSKLIPY 239
S + + G+ S + +V T++VS +YF+ L GI+VG S P+
Sbjct: 283 SGSSGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQEVES---PW 339
Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPS 298
+ S G + ID+G T L YN + + + + P Q P C+
Sbjct: 340 F------SAGRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYP-QAPAFSILDTCFNLTG 392
Query: 299 MAGI-APILTAHFDGGAKVPLIHTST--FIPPPVEGVFCFAMQPIDG--DVGIFGNFAQS 353
+ + P L F+G +V + F+ V C A+ + D I GN+ Q
Sbjct: 393 LKEVQVPSLKFVFEGSVEVEVDSKGVLYFVSSDASQV-CLALASLKSEYDTSIIGNYQQK 451
Query: 354 DLFIGYDFDSQMVSFKPTDC 373
+L + +D + F C
Sbjct: 452 NLRVIFDTLGSQIGFAQETC 471
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 118/379 (31%), Positives = 177/379 (46%), Gaps = 37/379 (9%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKPIYNPAS 71
++S +S +G Y +K +G+P IVDTGS W+QC PC + C+ Q P++NP++
Sbjct: 92 LKSGLSMGSGNYYVKMGLGSPTKYYTM-IVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSA 150
Query: 72 SSSYKELSCQSEQCHL-----LDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNS 125
S +YK + C S QC L+ +CS Q C Y Y DSS + G L+ + +T S
Sbjct: 151 SKTYKTVPCSSSQCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPS 210
Query: 126 NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL-VPFHTD 184
+ V+GCG +N G+F + G++GL LS+ SQ+ + G N FSYCL F T
Sbjct: 211 QT-LSSFVYGCGQDNQGLFGRTD-GIIGLANNELSMLSQLSGKYG-NAFSYCLPTSFSTP 267
Query: 185 SSITSK-MYFGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVG----NLSNSSKLIP 238
+S + G S T L+ + + YF+ LE I+V ++ SS +P
Sbjct: 268 NSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVP 327
Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG-SQLCYKTP 297
ID+G T LP Y L+ + Q P + C+K
Sbjct: 328 ------------TIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKG- 374
Query: 298 SMAGI---APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
S+AGI AP + F GGA + L ++ + G+ C AM + I GN+ Q
Sbjct: 375 SLAGISEVAPDIRIIFKGGADLQLKGHNSLVELET-GITCLAMAG-SSSIAIIGNYQQQT 432
Query: 355 LFIGYDFDSQMVSFKPTDC 373
+ + YD + V F P C
Sbjct: 433 VKVAYDVGNSRVGFAPGGC 451
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 115/356 (32%), Positives = 163/356 (45%), Gaps = 26/356 (7%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ +G+P ++DTGSD+ WVQC PC QC+ Q P+++P+SSS+Y SC S
Sbjct: 132 EYLITVRLGSPGKSQTM-LIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCSS 190
Query: 83 EQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
C L + CSS Q C YT Y D S T G +++ + G +N FGC +
Sbjct: 191 AACAQLGQEGNGCSSSQ-CQYTVTYGDGSSTTGTYSSDTLALG--SNAVRKFQFGCSNVE 247
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
+G FN+ GL+GLG SL SQ GA FSYCL SS + + G G+
Sbjct: 248 SG-FNDQTDGLMGLGGGAQSLVSQTAGTFGA-AFSYCL---PATSSSSGFLTLGAGTS-- 300
Query: 201 GGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPT 260
G V + L S + T+Y V ++ I VG S IP ++ +D+G T
Sbjct: 301 -GFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLS---IPT-----SVFSAGTIMDSGTVLT 351
Query: 261 LLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLI 319
LP Y+ L + +K P P C+ + ++ P + F GGA V I
Sbjct: 352 RLPPTAYSALSSAFKAGMKQYPSAPPSGILDTCFDFSGQSSVSIPTVALVFSGGAVVD-I 410
Query: 320 HTSTFIPPPVEGVFC--FAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ + + C FA D +GI GN Q + YD V FK C
Sbjct: 411 ASDGIMLQTSNSILCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 466
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 111/362 (30%), Positives = 162/362 (44%), Gaps = 40/362 (11%)
Query: 30 IGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS-SSSYKELSCQSEQCHLL 88
+GTPP + ++ G++L+W P +C++Q P + P + S SC S +
Sbjct: 1 MGTPPN-PVKLKLENGNELIWNHSNPSPECFEQAFPYFEPLTFSRGLPFASCGSPKFW-- 57
Query: 89 DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENE 148
Q C YTY Y D S+T G L ++ TF + V FGCG N GVF NE
Sbjct: 58 ------PNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGLFNNGVFKSNE 111
Query: 149 MGLVGLGRTRLSLASQILSQLGANKFSYCL------VPFHTDSSITSKMYFGNGSEVSGG 202
G+ G GR LSL SQL FS+C +P + + + F NG G
Sbjct: 112 TGIAGFGRGPLSLP----SQLKVGNFSHCFTTITGAIPSTVLLDLPADL-FSNGQ----G 162
Query: 203 GVVSTSLV----SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN--MFIDTG 256
V +T L+ ++ + T Y+++L+GI+VG S +P S+ A++ G ID+G
Sbjct: 163 AVQTTPLIQYAKNEANPTLYYLSLKGITVG-----STRLPVPESAFALTNGTGGTIIDSG 217
Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAK 315
T LP Y + ++ IKL G C+ PS A P L HF+G
Sbjct: 218 TSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFEGATM 277
Query: 316 VPLIHTSTFIPPPVEG--VFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
F P G + C A+ D + I GNF Q ++ + YD + M+SF C
Sbjct: 278 DLPRENYVFEVPDDAGNSIICLAINKGD-ETTIIGNFQQQNMHVLYDLQNNMLSFVAAQC 336
Query: 374 TK 375
K
Sbjct: 337 DK 338
>gi|297737850|emb|CBI27051.3| unnamed protein product [Vitis vinifera]
Length = 256
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 81/211 (38%), Positives = 111/211 (52%), Gaps = 12/211 (5%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
S S +GEY + IG+PP +Y +VDTGSD+ WVQC PC CY+Q PI+ P+ SSS
Sbjct: 44 SGASQGSGEYFSRVGIGSPPK-HVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSS 102
Query: 75 YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
Y L+C++ QC LD C + C Y Y D S T G ATE IT S + +NV
Sbjct: 103 YAPLTCETHQCKSLDVSECRNDS-CLYEVSYGDGSYTVGDFATETITLDGSAS-LNNVAI 160
Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
GCGH+N G+F + SQ+ A+ FSYCLV TDS+ T +
Sbjct: 161 GCGHDNEGLFVGAAG-----LLGLGGGSLSFPSQINASSFSYCLVNRDTDSASTLEF--- 212
Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGI 225
S + V + L + + T+Y++ + GI
Sbjct: 213 -NSPIPSHSVTAPLLRNNQLDTFYYLGMTGI 242
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 112/364 (30%), Positives = 161/364 (44%), Gaps = 43/364 (11%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELSC 80
EYV++ S GTP + + ++DTGSD+ W+QC PC QC+ Q P+Y+P+ SS+Y + C
Sbjct: 78 EYVVRVSFGTPAVPQVV-VIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPC 136
Query: 81 QSEQCHLLDTVS----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
S+ C L + C+S + C + YAD + T G + +++T N FGC
Sbjct: 137 ASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTLA-PGAIVQNFYFGC 195
Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK---MYF 193
GH V + G++GLGR R SL ++ FSYCL S++SK +
Sbjct: 196 GHGKHAVRGLFD-GVLGLGRLRESLGARY-----GGVFSYCL------PSVSSKPGFLAL 243
Query: 194 GNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
G G SG V + T+ VTL GI+VG L P S G M +
Sbjct: 244 GAGKNPSGFVFTPMGTVPGQ-PTFSTVTLAGINVGG--KKLDLRPSAFS------GGMIV 294
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYKTPSMAG-IAPILTAH 309
D+G T L Y L R A+ +L P D CY + P +
Sbjct: 295 DSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGD----LDTCYNLTGYKNVVVPKIALT 350
Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
F GGA + L + + V G FA DG G+ GN Q + +D + F+
Sbjct: 351 FTGGATINLDVPNGIL---VNGCLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFR 407
Query: 370 PTDC 373
C
Sbjct: 408 AKAC 411
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 111/358 (31%), Positives = 166/358 (46%), Gaps = 37/358 (10%)
Query: 41 IVDTGSDLMWVQCLPCVQCYKQVK----PIYNPASSSSYKELSCQSEQCH--LLDTVSCS 94
IVDTGSDL+W QC + P+Y+P SS++ L C C +C+
Sbjct: 29 IVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGESSTFAFLPCSDRLCQEGQFSFKNCT 88
Query: 95 SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGL 154
S+ C Y Y S+ GVLA+E TFG + FGCG + G G++GL
Sbjct: 89 SKNRCVYEDVYG-SAAAVGVLASETFTFGARRAVSLRLGFGCGALSAGSLI-GATGILGL 146
Query: 155 GRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGG----VVSTSLV 210
LSL ++QL +FSYCL PF TS + FG +++S + +T++V
Sbjct: 147 SPESLSL----ITQLKIQRFSYCLTPFADKK--TSPLLFGAMADLSRHKTTRPIQTTAIV 200
Query: 211 SKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNMFIDTGAPPTLLPKDF 266
S +T YY+V L GIS+G+ K + +S A+ G +D+G+ L +
Sbjct: 201 SNPVETVYYYVPLVGISLGH-----KRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAA 255
Query: 267 YNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-------PILTAHFDGGAKVPLI 319
+ ++E V + ++L +LC+ P A P L HFDGGA + L
Sbjct: 256 FEAVKEAVMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLP 315
Query: 320 HTSTFIPPPVEGVFCFAM-QPIDGD-VGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
+ F P G+ C A+ + DG V I GN Q ++ + +D SF PT C +
Sbjct: 316 RDNYF-QEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQCDQ 372
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 112/364 (30%), Positives = 161/364 (44%), Gaps = 43/364 (11%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELSC 80
EYV++ S GTP + + ++DTGSD+ W+QC PC QC+ Q P+Y+P+ SS+Y + C
Sbjct: 112 EYVVRVSFGTPAVPQVV-VIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPC 170
Query: 81 QSEQCHLLDTVS----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
S+ C L + C+S + C + YAD + T G + +++T N FGC
Sbjct: 171 ASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTLA-PGAIVQNFYFGC 229
Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK---MYF 193
GH V + G++GLGR R SL ++ FSYCL S++SK +
Sbjct: 230 GHGKHAVRGLFD-GVLGLGRLRESLGARY-----GGVFSYCL------PSVSSKPGFLAL 277
Query: 194 GNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
G G SG V + T+ VTL GI+VG L P S G M +
Sbjct: 278 GAGKNPSGFVFTPMGTVPGQ-PTFSTVTLAGINVGG--KKLDLRPSAFS------GGMIV 328
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYKTPSMAG-IAPILTAH 309
D+G T L Y L R A+ +L P D CY + P +
Sbjct: 329 DSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGD----LDTCYNLTGYKNVVVPKIALT 384
Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
F GGA + L + + V G FA DG G+ GN Q + +D + F+
Sbjct: 385 FTGGATINLDVPNGIL---VNGCLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFR 441
Query: 370 PTDC 373
C
Sbjct: 442 AKAC 445
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 110/364 (30%), Positives = 161/364 (44%), Gaps = 48/364 (13%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ +IGTPP + +DTGSDL+W QC PC C+ Q P ++P++SS+ SC S
Sbjct: 88 EYLVHLAIGTPPQ-PVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDS 146
Query: 83 EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG 142
C G +SL + +++ TF + V FGCG N G
Sbjct: 147 TLCQ-----------------GLPVASLPR----SDKFTFVGAGASVPGVAFGCGLFNNG 185
Query: 143 VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL------VPFHTDSSITSKMYFGNG 196
VF NE G+ G GR LSL SQL FS+C +P + + + F NG
Sbjct: 186 VFKSNETGIAGFGRGPLSLP----SQLKVGNFSHCFTTITGAIPSTVLLDLPADL-FSNG 240
Query: 197 SEVSGGGVVSTSLVSK-EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN--MFI 253
G V +T L+ + T+Y+++L+GI+VG S +P S A+ G I
Sbjct: 241 Q----GAVQTTPLIQNPANPTFYYLSLKGITVG-----STRLPVPESEFALKNGTGGTII 291
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG-IAPILTAHFDG 312
D+G T LP Y + + +KL C P A P L HF+G
Sbjct: 292 DSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEG 351
Query: 313 GA-KVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
+P + + + C A+ G+V GNF Q ++ + YD + +SF P
Sbjct: 352 ATMDLPRENYVFEVEDAGSSILCLAIIE-GGEVTTIGNFQQQNMHVLYDLQNSKLSFVPA 410
Query: 372 DCTK 375
C K
Sbjct: 411 QCDK 414
>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
gi|224030351|gb|ACN34251.1| unknown [Zea mays]
Length = 342
Score = 134 bits (337), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 103/352 (29%), Positives = 160/352 (45%), Gaps = 43/352 (12%)
Query: 51 VQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQ--LCNYTYGYADS 108
+QC PCV CY+Q+ P++NP SSSY + C S+ C LD C C YTY Y+
Sbjct: 1 MQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRCHEDDDGACQYTYKYSGH 60
Query: 109 SLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ 168
+TKG LA +++ G + F VVFGC ++ G GLVGLGR LSL +SQ
Sbjct: 61 GVTKGTLAIDKLAIG--GDVFHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSL----VSQ 114
Query: 169 LGANKFSYCLVPFHTDSSITSKMYFGNGSEV---SGGGVVSTSLVSKEDKTYYFVTLEGI 225
L ++F YCL P + +S K+ G G++ V T S +YY++ L+G+
Sbjct: 115 LSVHRFMYCLPPPMSRTS--GKLVLGAGADAVRNMSDRVTVTMSSSTRYPSYYYLNLDGL 172
Query: 226 SVGNLSNSSK-----------------LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYN 268
+VG+ + + +G + M +D + + L Y+
Sbjct: 173 AVGDQTPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYD 232
Query: 269 RLEEQVRNAIKLTPYQDP--RLGSQLCYKTPSMAGI----APILTAHFDGGAKVPLIHTS 322
L + + I+L P P RLG LC+ P G+ P ++ FD G + L
Sbjct: 233 ELADDLEEEIRL-PRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSFD-GRWLELDRDR 290
Query: 323 TFIPPPVEG-VFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
F+ +G + C + G V I GNF ++ + ++ ++F C
Sbjct: 291 LFV---TDGRMMCLMIGRTSG-VSILGNFQLQNMRVLFNLRRGKITFAKASC 338
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 134 bits (336), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 114/372 (30%), Positives = 165/372 (44%), Gaps = 29/372 (7%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPAS 71
+S + +G Y++ +GTP + I DTGSDL W QC PC + CY Q P++ P+
Sbjct: 120 AKSGATIGSGNYIVSVGLGTPKKY-LSLIFDTGSDLTWTQCQPCARYCYNQKDPVFVPSQ 178
Query: 72 SSSYKELSCQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
S++Y +SC S C L++ + CS+ + C Y Y D S + G A E +T S
Sbjct: 179 STTYSNISCSSPDCSQLESGTGNQPGCSAARACIYGIQYGDQSFSVGYFAKETLTL-TST 237
Query: 127 NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
+ +N +FGCG NN G+F + GL+GLG+ ++S+ Q + G FSYCL SS
Sbjct: 238 DVIENFLFGCGQNNRGLFG-SAAGLIGLGQDKISIVKQTAQKYG-QVFSYCL---PKTSS 292
Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKED--KTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
T + FG G + + ++K +Y V + G+ VG IP SS
Sbjct: 293 STGYLTFGGGGGGG---ALKYTPITKAHGVANFYGVDIVGMKVGGTQ-----IPI--SSS 342
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGIA 303
S ID+G T LP D Y+ L+ + P + P L CY + I
Sbjct: 343 VFSTSGAIIDSGTVITRLPPDAYSALKSAFEKGMAKYP-KAPELSILDTCYDLSKYSTIQ 401
Query: 304 -PILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
P + F GG ++ L V FA V I GN Q L + YD
Sbjct: 402 IPKVGFVFKGGEELDLDGIGIMYGASTSQVCLAFAGNQDPSTVAIIGNVQQKTLQVVYDV 461
Query: 362 DSQMVSFKPTDC 373
+ F C
Sbjct: 462 GGGKIGFGYNGC 473
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 108/372 (29%), Positives = 164/372 (44%), Gaps = 36/372 (9%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
N EY++ SIG P + +DTGSD++W QC PC +C+ Q P ++ A+S++ + ++C
Sbjct: 89 NSEYLIHLSIGAPRSQPVVLTLDTGSDVVWTQCEPCAECFTQPLPRFDTAASNTVRSVAC 148
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNNFFDNVVFGC 136
C+ C C Y GY D SL+ G + TF G ++ FGC
Sbjct: 149 SDPLCNAHSEHGCFLHG-CTYVSGYGDGSLSFGHFLRDSFTFDDGKGGGKVTVPDIGFGC 207
Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
G N G F + E G+ G GR LSL SQL +FSYC S S ++ G
Sbjct: 208 GMYNAGRFLQTETGIAGFGRGPLSLP----SQLKVRQFSYCFTTRFEAKS--SPVFLGGA 261
Query: 197 SEVSG---GGVVSTSLVSK----EDKTYYFVTLEGISVGNLSNSSKL-IPYYNSSGAISK 248
++ G ++ST V D ++Y ++ +G++VG ++L +P + G+
Sbjct: 262 GDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVGK----TRLPVPEIKADGS--- 314
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEE----QVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAP 304
G FID+G T P + +L+ Q + T +D S KT +M P
Sbjct: 315 GATFIDSGTDITTFPDAVFRQLKSAFIAQAALPVNKTADEDDICFSWDGKKTAAM----P 370
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQSDLFIGYDFDS 363
L H + GA L + G C A+ D + GNF Q + I YD +
Sbjct: 371 KLVFHLE-GADWDLPRENYVTEDRESGQVCVAVSTSGQMDRTLIGNFQQQNTHIVYDLAA 429
Query: 364 QMVSFKPTDCTK 375
+ P C K
Sbjct: 430 GKLLLVPAQCDK 441
>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
Length = 419
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 109/377 (28%), Positives = 176/377 (46%), Gaps = 46/377 (12%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPASSSSYKELSC 80
YV F+IGTPP + GIVD +L+W QC C C+KQ P+++P++S++Y+ C
Sbjct: 61 HYVANFTIGTPPQA-VSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQC 119
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSL---TKGVLATERITFGNSNNFFDNVVFGCG 137
S C + T +CS C GY S+ T G+ +T+ I GN+ + FGC
Sbjct: 120 GSPLCKSIPTRNCSGDGEC----GYEAPSMFGDTFGIASTDAIAIGNAEG---RLAFGCV 172
Query: 138 HNNTGVFN---ENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
+ G + + G VGLGRT SL + Q FSYCL H S ++ G
Sbjct: 173 VASDGSIDGAMDGPSGFVGLGRTPWSL----VGQSNVTAFSYCLA-LHGPGK-KSALFLG 226
Query: 195 NGSEVSGGGVVS--TSLVSKEDKT--------YYFVTLEGISVGNLSNSSKLIPYYNSSG 244
++++G G + T L+ + YY V LEGI G+++ ++ SSG
Sbjct: 227 ASAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAA------SSG 280
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAP 304
+ + ++T P + LP Y LE+ V A+ +P LC++ +++G+ P
Sbjct: 281 GGAITVLQLETFRPLSYLPDAAYQALEKVVTAALGSPSMANPPEPFDLCFQNAAVSGV-P 339
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPP-VEGVFCFA------MQPIDGDVGIFGNFAQSDLFI 357
L F GGA + + + G C + + D V I G+ Q ++
Sbjct: 340 DLVFTFQGGATLTAQPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHF 399
Query: 358 GYDFDSQMVSFKPTDCT 374
+D + + +SF+P DC+
Sbjct: 400 LFDLEKETLSFEPADCS 416
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 120/395 (30%), Positives = 188/395 (47%), Gaps = 51/395 (12%)
Query: 12 VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
++S ++ +GEY M +GTPP I+DTGSDL W+QCLPC C+ Q Y+P +
Sbjct: 148 TLESGMTLGSGEYFMDVLVGTPPK-HFSLILDTGSDLNWLQCLPCYDCFHQNGMFYDPKT 206
Query: 72 SSSYKELSCQSEQCHLLDT----VSCSSQ-QLCNYTYGYADSSLTKGVLATERITF---- 122
S+S+K ++C +C L+ + V C S Q C Y Y Y D S T G A E T
Sbjct: 207 SASFKNITCNDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTT 266
Query: 123 ---GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
G+S N++FGCGH N G+F+ L R LS +SQ+ S G + FSYCLV
Sbjct: 267 TEGGSSEYKVGNMMFGCGHWNRGLFSGASGLLGLG-RGPLSFSSQLQSLYG-HSFSYCLV 324
Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVS-TSLVSKED---KTYYFVTLEGISVGNLSNSSK 235
+++++++SK+ FG ++ ++ TS V+ ++ +T+Y++ ++ I VG K
Sbjct: 325 DRNSNTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVG-----GK 379
Query: 236 LIPYYNSSGAIS---KGNMFIDTGAPPTLLPKDFY----NRLEEQVRN---AIKLTPYQD 285
+ + IS G ID+G + + Y N+ E+++ + P D
Sbjct: 380 ALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLD 439
Query: 286 PRLGSQLCYKTPSMAGIA------PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP 339
P C+ +++GI P L F G ++FI E + C A+
Sbjct: 440 P------CF---NVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLS-EDLVCLAILG 489
Query: 340 I-DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
I GN+ Q + I YD + F PT C
Sbjct: 490 TPKSTFSIIGNYQQQNFHILYDTKRSRLGFTPTKC 524
>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
gi|238015146|gb|ACR38608.1| unknown [Zea mays]
gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
Length = 467
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 118/359 (32%), Positives = 174/359 (48%), Gaps = 24/359 (6%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
G YV + +GTP + +VDTGS L W+QC PCV C++Q P++NP +SSSY +SC
Sbjct: 127 GNYVTRMGLGTPAKSYVM-VVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSC 185
Query: 81 QSEQCHLLDT-----VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
++QC L T SCS+ +C Y Y DSS + G L+ + ++FG+++ N +G
Sbjct: 186 SAQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS--VPNFYYG 243
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
CG +N G+F ++ GL+GL R +LSL Q+ +G + FSYCL + SS + N
Sbjct: 244 CGQDNEGLFGQSA-GLIGLARNKLSLLYQLAPSMGYS-FSYCLPTSSSSSSGYLSIGSYN 301
Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
+ S + S+SL D + YF+ + GI V P SS A S ID+
Sbjct: 302 PGQYSYTPMASSSL----DDSLYFIKMTGIKVAGK-------PLSVSSSAYSSLPTIIDS 350
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
G T LP Y+ L + V A+K TP C++ + P +T F GGA
Sbjct: 351 GTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFAGGAA 410
Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ L + + C A P I GN Q + YD + + F C+
Sbjct: 411 LKLAARNLLVDVD-SATTCLAFAPAR-SAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 467
>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
gi|194706308|gb|ACF87238.1| unknown [Zea mays]
Length = 467
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 118/359 (32%), Positives = 174/359 (48%), Gaps = 24/359 (6%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
G YV + +GTP + +VDTGS L W+QC PCV C++Q P++NP +SSSY +SC
Sbjct: 127 GNYVTRMGLGTPAKSYVM-VVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSC 185
Query: 81 QSEQCHLLDT-----VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
++QC L T SCS+ +C Y Y DSS + G L+ + ++FG+++ N +G
Sbjct: 186 SAQQCSDLTTATLSPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS--VPNFYYG 243
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
CG +N G+F ++ GL+GL R +LSL Q+ +G + FSYCL + SS + N
Sbjct: 244 CGQDNEGLFGQSA-GLIGLARNKLSLLYQLAPSMGYS-FSYCLPTSSSSSSGYLSIGSYN 301
Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
+ S + S+SL D + YF+ + GI V P SS A S ID+
Sbjct: 302 PGQYSYTPMASSSL----DDSLYFIKMTGIKVAGK-------PLSVSSSAYSSLPTIIDS 350
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
G T LP Y+ L + V A+K TP C++ + P +T F GGA
Sbjct: 351 GTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFAGGAA 410
Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ L + + C A P I GN Q + YD + + F C+
Sbjct: 411 LKLAARNLLVDVD-SATTCLAFAPAR-SAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 467
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 108/357 (30%), Positives = 159/357 (44%), Gaps = 25/357 (7%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPASSSSYKELSC 80
EYV+ +GTP + +DTGSD+ WVQC PC C+ Q +++PA SS+Y+ +SC
Sbjct: 126 EYVISVGLGTPAVTQTV-TIDTGSDVSWVQCNPCPNPPCHAQTGALFDPAKSSTYRAVSC 184
Query: 81 QSEQCHLLDTVS--CSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
+ +C L+ C + C Y Y D S T G + + +T +++ FGC
Sbjct: 185 AAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVKGFQFGCS 244
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
H +G F++ GL+GLG SL SQ + G N FSYCL P +S +S G
Sbjct: 245 HLESG-FSDQTDGLMGLGGGAQSLVSQTAAAYG-NSFSYCLPP----TSGSSGFLTLGGG 298
Query: 198 EVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
+ G V + L SK+ T+Y L+ I+VG L P ++G++ +D+G
Sbjct: 299 GGASGFVTTRMLRSKQIPTFYGARLQDIAVGG--KQLGLSPSVFAAGSV------VDSGT 350
Query: 258 PPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKV 316
T LP Y+ L + +K R C+ I+ P + F GGA +
Sbjct: 351 IITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISIPTVALVFSGGAAI 410
Query: 317 PLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
L FA DG GI GN Q + YD S + F+ C
Sbjct: 411 DLDPNGIMY----GNCLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSSTLGFRSGAC 463
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 120/365 (32%), Positives = 158/365 (43%), Gaps = 38/365 (10%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSC 80
EYV+ +GTP + + ++DTGSDL WVQC PC CY Q P+++P+ SS+Y + C
Sbjct: 119 EYVVTVGLGTPAVSQVL-LIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSRSSTYAPIPC 177
Query: 81 QSEQCHLLD--------TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
++ C L T C Y Y D S T GV + E +T D
Sbjct: 178 NTDACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGVYSNETLTMAPGVTVKD-F 236
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGCGH+ G N+ GL+GLG SL Q S G FSYCL P D + +
Sbjct: 237 HFGCGHDQDGP-NDKYDGLLGLGGAPESLVVQTSSVYGG-AFSYCL-PAANDQA----GF 289
Query: 193 FGNGSEVS-GGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
G+ V+ G V T +V +E +T+Y V + GI+VG P A S G M
Sbjct: 290 LALGAPVNDASGFVFTPMV-REQQTFYVVNMTGITVGGE-------PIDVPPSAFS-GGM 340
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHF 310
ID+G T L Y L+ R A+ P P CY + + P + F
Sbjct: 341 IIDSGTVVTELQHTAYAALQAAFRKAMAAYPLL-PNGELDTCYNFTGHSNVTVPRVALTF 399
Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
GGA V L +P + C A Q D GI GN Q L + YD V F
Sbjct: 400 SGGATVDLD-----VPDGILLDNCLAFQEAGPDNQPGILGNVNQRTLEVLYDVGHGRVGF 454
Query: 369 KPTDC 373
C
Sbjct: 455 GADAC 459
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 119/358 (33%), Positives = 161/358 (44%), Gaps = 27/358 (7%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EYV+ IG+P + + DTGSD+ WVQC PC QC+ +V +++P+SSS+Y SC S
Sbjct: 121 EYVITVGIGSPAVTQTMSM-DTGSDVSWVQCKPCSQCHSEVDSLFDPSSSSTYSPFSCSS 179
Query: 83 EQCHLLDTVS----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
C L C S Q C Y Y DSS T G +++ +T G+S + FGC
Sbjct: 180 APCAQLSQSQEGNGCMSSQ-CQYIVNYGDSSSTTGTYSSDTLTLGSSA--MTDFQFGCSQ 236
Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
+ +G FN+ GL+GLG SLASQ G FSYCL P S + G GS
Sbjct: 237 SESGGFNDQTDGLMGLGGGAQSLASQTAGTFG-TAFSYCLPPTSGSSGF---LTLGTGSS 292
Query: 199 VSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
G V + L S + TYY V LE I VG+ N ++ +D+G
Sbjct: 293 ---GFVKTPMLRSTQIPTYYVVLLESIKVGS--------QQLNLPTSVFSAGSLMDSGTI 341
Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVP 317
T LP Y+ L + ++ P P C+ + I+ P +T F GGA V
Sbjct: 342 ITRLPPTAYSALSSAFKAGMQQYPPATPSGILDTCFDFSGQSSISIPTVTLVFSGGAAVD 401
Query: 318 LIHTSTFIPPPVEGVFCFAMQP--IDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
L + + C A P D +GI GN Q + YD V FK C
Sbjct: 402 LAFDGIMLEIS-SSIRCLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 458
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 111/355 (31%), Positives = 160/355 (45%), Gaps = 24/355 (6%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ +G+P ++DTGSD+ WVQC PC QC+ Q P+++P+SSS+Y SC S
Sbjct: 127 EYLITVGLGSPATSQTM-LIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGS 185
Query: 83 EQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
C L + CSS C Y Y D S T G +++ + G+S + FGC +
Sbjct: 186 AACAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSA--VKSFQFGCSNVE 243
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
+G FN+ GL+GLG SL SQ LG FSYCL P + S + G
Sbjct: 244 SG-FNDQTDGLMGLGGGAQSLVSQTAGTLG-RAFSYCLPPTPSSSGF---LTLGAAGGSG 298
Query: 201 GGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPP 259
G V T ++ S + T+Y V L+ I VG S IP ++ +D+G
Sbjct: 299 TSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLS---IP-----ASVFSAGTVMDSGTVI 350
Query: 260 TLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPL 318
T LP Y+ L + +K P P C+ + ++ P + F GGA V L
Sbjct: 351 TRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSL 410
Query: 319 IHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ + FA D +GI GN Q + YD +V F+ C
Sbjct: 411 DASGIIL----SNCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 461
>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 465
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 118/359 (32%), Positives = 174/359 (48%), Gaps = 24/359 (6%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
G YV + +GTP + +VDTGS L W+QC PCV C++Q P++NP +SSSY +SC
Sbjct: 125 GNYVTRMGLGTPAKSYVM-VVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSC 183
Query: 81 QSEQCHLLDT-----VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
++QC L T SCS+ +C Y Y DSS + G L+ + ++FG+++ N +G
Sbjct: 184 SAQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS--VPNFYYG 241
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
CG +N G+F ++ GL+GL R +LSL Q+ +G + FSYCL + SS + N
Sbjct: 242 CGQDNEGLFGQSA-GLIGLARNKLSLLYQLAPSMGYS-FSYCLPTSSSSSSGYLSIGSYN 299
Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
+ S + S+SL D + YF+ + GI V P SS A S ID+
Sbjct: 300 PGQYSYTPMASSSL----DDSLYFIKMTGIKVAGK-------PLSVSSSAYSSLPTIIDS 348
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
G T LP Y+ L + V A+K TP C++ + P +T F GGA
Sbjct: 349 GTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFAGGAA 408
Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ L + + C A P I GN Q + YD + + F C+
Sbjct: 409 LKLAARNLLVDVD-SATTCLAFAPAR-SAAIIGNTQQQTFSVVYDVKNSKIGFAAAGCS 465
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 111/367 (30%), Positives = 162/367 (44%), Gaps = 33/367 (8%)
Query: 14 QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSS 73
+S +S G Y++ +G+P D+ I DTGSDL W +C ++P S+
Sbjct: 124 KSGMSLGTGNYIVSIGLGSPKK-DLMLIFDTGSDLTWARC--------SAAETFDPTKST 174
Query: 74 SYKELSCQSEQCHLLDTV----SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
SY +SC + C + + S + C Y Y D S + G L ER+T G S + F
Sbjct: 175 SYANVSCSTPLCSSVISATGNPSRCAASTCVYGIQYGDGSYSIGFLGKERLTIG-STDIF 233
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
+N FGCG + G+F + GL+GLGR +LS+ SQ + FSYCL SS T
Sbjct: 234 NNFYFGCGQDVDGLFGK-AAGLLGLGRDKLSVVSQTAPKYN-QLFSYCL----PSSSSTG 287
Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
+ FG+ S + +S ++Y + L GI+VG + L ++++G I
Sbjct: 288 FLSFGSSQSKS----AKFTPLSSGPSSFYNLDLTGITVGGQKLAIPL-SVFSTAGTI--- 339
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTA 308
ID+G T LP Y+ L R A+ P P CY I P +
Sbjct: 340 ---IDSGTVVTRLPPAAYSALRSAFRKAMASYPMGKPLSILDTCYDFSKYKTIKVPKIVI 396
Query: 309 HFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
F GG V + F+ ++ V FA D IFGN Q + + YD V
Sbjct: 397 SFSGGVDVDVDQAGIFVANGLKQVCLAFAGNTGARDTAIFGNTQQRNFEVVYDVSGGKVG 456
Query: 368 FKPTDCT 374
F P C+
Sbjct: 457 FAPASCS 463
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 113/398 (28%), Positives = 172/398 (43%), Gaps = 59/398 (14%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-YKQVKPIYNPASSS 73
S ST +G+Y + +GTPP + + DTGSDL+WV+C C C + + P SS
Sbjct: 79 SGASTGSGQYFVDIRLGTPPQ-SLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSS 137
Query: 74 SYKELSCQSEQCHLLDTVS---CSSQQL---CNYTYGYADSSLTKGVLATERITFGN--- 124
S+ C C LL C+ +L C + Y YAD SL+ G + E T +
Sbjct: 138 SFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSG 197
Query: 125 SNNFFDNVVFGCGHNNTGV------FNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL 178
S + FGCG +G FN G++GLGR +S +SQ+ + G NKFSYCL
Sbjct: 198 SEIHLKGLSFGCGFRISGPSVSGAQFN-GARGVMGLGRGSISFSSQLGRRFG-NKFSYCL 255
Query: 179 VPFHTDSSITSKMYFGNGSE----VSGGGVVSTSL-VSKEDKTYYFVTLEGISVGNLSNS 233
+ + TS + G G + + T L ++ T+Y++T+ I++ +
Sbjct: 256 MDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVK-- 313
Query: 234 SKLIPYYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS 290
+P + I + G +D+G T L K Y + + VR +KL + G
Sbjct: 314 ---LPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGF 370
Query: 291 QLCY------KTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPV-------EGVFCFAM 337
LC + PS+ P L GGA F PPP EGV C A+
Sbjct: 371 DLCVNASGESRRPSL----PRLRFRLGGGA--------VFAPPPRNYFLETEEGVMCLAI 418
Query: 338 QPIDGDVG--IFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ ++ G + GN Q + +D + + F C
Sbjct: 419 RAVESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 456
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 115/363 (31%), Positives = 164/363 (45%), Gaps = 31/363 (8%)
Query: 23 EYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
E+V+ GTP Y ++ DTGSD+ W+QCLPC CYKQ PI++P S++Y + C
Sbjct: 119 EFVVTVGFGTP--AQTYTLMFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSAVPC 176
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
QC CSS C Y Y D S T GVL+ E ++ S FGCG N
Sbjct: 177 GHPQCAAAGG-KCSSNGTCLYKVQYGDGSSTAGVLSHETLSL-TSARALPGFAFGCGETN 234
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
G F + + GL+GLGR +LSL+SQ + FSYCL ++T + G + S
Sbjct: 235 LGDFGDVD-GLIGLGRGQLSLSSQAAASF-GAAFSYCLPSYNTSHGY---LTIGTTTPAS 289
Query: 201 GG-GVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
G GV T+++ K+D ++YFV L I VG I ++ +D+G
Sbjct: 290 GSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPI-------LFTRDGTLLDSGTV 342
Query: 259 PTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGA 314
T LP + Y L ++ + + K P DP CY I P+++ F G+
Sbjct: 343 LTYLPPEAYTALRDRFKFTMTQYKPAPAYDPF---DTCYDFAGQNAIFMPLVSFKFSDGS 399
Query: 315 KVPLIHTSTFIPP----PVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
L I P P G F +P I GN Q + + YD ++ + F
Sbjct: 400 SFDLSPFGVLIFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFVS 459
Query: 371 TDC 373
C
Sbjct: 460 GSC 462
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 108/357 (30%), Positives = 158/357 (44%), Gaps = 25/357 (7%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPASSSSYKELSC 80
EYV+ +GTP + +DTGSD+ WVQC PC CY Q +++PA SS+Y+ +SC
Sbjct: 126 EYVISVGLGTPAVTQTV-TIDTGSDVSWVQCNPCPNPPCYAQTGALFDPAKSSTYRAVSC 184
Query: 81 QSEQCHLLDTV--SCSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
+ +C L+ C + C Y Y D S T G + + +T +++ FGC
Sbjct: 185 AAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVKGFQFGCS 244
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
H +G F++ GL+GLG SL SQ + G N FSYCL P +S +S G
Sbjct: 245 HVESG-FSDQTDGLMGLGGGAQSLVSQTAAAYG-NSFSYCLPP----TSGSSGFLTLGGG 298
Query: 198 EVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
G V + L S++ T+Y L+ I+VG L P ++G++ +D+G
Sbjct: 299 GGVSGFVTTRMLRSRQIPTFYGARLQDIAVGG--KQLGLSPSVFAAGSV------VDSGT 350
Query: 258 PPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKV 316
T LP Y+ L + +K R C+ I+ P + F GGA +
Sbjct: 351 IITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISIPTVALVFSGGAAI 410
Query: 317 PLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
L FA DG GI GN Q + YD S + F+ C
Sbjct: 411 DLDPNGIMY----GNCLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSSTLGFRSGAC 463
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 118/365 (32%), Positives = 170/365 (46%), Gaps = 25/365 (6%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSS 74
++ TAN YV+ +GTPP + DTGSD WVQC PCV CYKQ +++PA SS+
Sbjct: 157 SLGTAN--YVVPIGLGTPPSRFTV-VFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSST 213
Query: 75 YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
Y +SC C LD C++ C Y Y D S T G A + T + + F
Sbjct: 214 YANVSCADPACADLDASGCNAGH-CLYGIQYGDGSYTVGFFAKD--TLAVAQDAIKGFKF 270
Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF- 193
GCG N G+F + GL+GLGR S+ Q + G + FSYCL S+ T + F
Sbjct: 271 GCGEKNRGLFGQT-AGLLGLGRGPTSITVQAYEKYGGS-FSYCL---PASSAATGYLEFG 325
Query: 194 GNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
SG +T +++ + T+Y+V L GI VG + +++SG + +
Sbjct: 326 PLSPSSSGSNAKTTPMLTDKGPTFYYVGLTGIRVGGKQLGAIPESVFSNSGTL------V 379
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHF 310
D+G T LP Y L A+ + Y+ S L CY ++ ++ P ++ F
Sbjct: 380 DSGTVITRLPDTAYAALSSAFAAAMAASGYKKAAAYSILDTCYDFTGLSQVSLPTVSLVF 439
Query: 311 DGGAKVPLIHTSTFIPPPVEGVFC--FAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
GGA + L S + + C FA D VGI GN Q + YD ++V F
Sbjct: 440 QGGACLDL-DASGIVYAISQSQVCLGFASNGDDESVGIVGNTQQRTYGVLYDVSKKVVGF 498
Query: 369 KPTDC 373
P C
Sbjct: 499 APGAC 503
>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
gi|223975971|gb|ACN32173.1| unknown [Zea mays]
gi|224034191|gb|ACN36171.1| unknown [Zea mays]
gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
Length = 465
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 118/359 (32%), Positives = 174/359 (48%), Gaps = 24/359 (6%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
G YV + +GTP + +VDTGS L W+QC PCV C++Q P++NP +SSSY +SC
Sbjct: 125 GNYVTRMGLGTPAKSYVM-VVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSC 183
Query: 81 QSEQCHLLDT-----VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
++QC L T SCS+ +C Y Y DSS + G L+ + ++FG+++ N +G
Sbjct: 184 SAQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS--VPNFYYG 241
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
CG +N G+F ++ GL+GL R +LSL Q+ +G + FSYCL + SS + N
Sbjct: 242 CGQDNEGLFGQSA-GLIGLARNKLSLLYQLAPSMGYS-FSYCLPTSSSSSSGYLSIGSYN 299
Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
+ S + S+SL D + YF+ + GI V P SS A S ID+
Sbjct: 300 PGQYSYTPMASSSL----DDSLYFIKMTGIKVAGK-------PLSVSSSAYSSLPTIIDS 348
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAK 315
G T LP Y+ L + V A+K TP C++ + P +T F GGA
Sbjct: 349 GTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFAGGAA 408
Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ L + + C A P I GN Q + YD + + F C+
Sbjct: 409 LKLAARNLLVDVD-SATTCLAFAPAR-SAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 465
>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 488
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 111/361 (30%), Positives = 168/361 (46%), Gaps = 31/361 (8%)
Query: 18 STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKE 77
+T G YV + IGTPP + G +D SDL+W C P +NP S++ +
Sbjct: 94 ATNAGMYVFSYGIGTPPQ-QVSGALDISSDLVWTAC-------GATAP-FNPVRSTTVAD 144
Query: 78 LSCQSEQCHLLDTVSCSS-QQLCNYTYGYADSSL-TKGVLATERITFGNSNNFFDNVVFG 135
+ C + C +C + C YTY Y + T G+L TE TFG++ D VVFG
Sbjct: 145 VPCTDDACQQFAPQTCGAGASECAYTYMYGGGAANTTGLLGTEAFTFGDTR--IDGVVFG 202
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY--F 193
CG N G F+ G++GLGR LSL +SQL ++FSY P D S+ ++ + F
Sbjct: 203 CGLKNVGDFS-GVSGVIGLGRGNLSL----VSQLQVDRFSYHFAP---DDSVDTQSFILF 254
Query: 194 GNGSEVSGGGVVSTSLVSKE-DKTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAISKGN 250
G+ + +ST L++ + + + Y+V L GI V +L+ S N G+ G
Sbjct: 255 GDDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGS---GG 311
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA-GIAPILTAH 309
+F+ T+L + Y L + V + I L LG LCY S+A P +
Sbjct: 312 VFLSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGLDLCYTGESLAKAKVPSMALV 371
Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID-GDVGIFGNFAQSDLFIGYDFDSQMVSF 368
F GGA + L + F G+ C + P GD + G+ Q + YD + + F
Sbjct: 372 FAGGAVMELELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKLVF 431
Query: 369 K 369
+
Sbjct: 432 E 432
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 106/361 (29%), Positives = 165/361 (45%), Gaps = 24/361 (6%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
V + +GEY+++ GTP +Y ++DTGSD+ W+ C C C+ PI++PA SSSYK
Sbjct: 108 VRSGSGEYIIQVDFGTPKQ-SMYTLIDTGSDVAWIPCKQCQGCHS-TAPIFDPAKSSSYK 165
Query: 77 ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
+C S+ C + + +C C + Y D + G LA++ IT G + + N FGC
Sbjct: 166 PFACDSQPCQEI-SGNCGGNSKCQFEVSYGDGTQVDGTLASDAITLG--SQYLPNFSFGC 222
Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
+ + + + + G + L ++L FSYCL + S+ + + G
Sbjct: 223 AESLSEDTSPSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCL---PSSSTSSGSLVLGKE 279
Query: 197 SEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
+ VS + T+L+ T+YFVTL+ ISVGN S +P N + S G ID+
Sbjct: 280 AAVSSSSLKFTTLIKDPSIPTFYFVTLKAISVGNTRIS---VPGTNIA---SGGGTIIDS 333
Query: 256 GAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDG 312
G T L Y L + R ++++ TP +D CY S + P +T H D
Sbjct: 334 GTTITHLVPSAYTALRDAFRQQLSSLQPTPVED----MDTCYDLSSSSVDVPTITLHLDR 389
Query: 313 GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
+ L + I G+ C A D I GN Q + I +D + V F
Sbjct: 390 NVDLVLPKENILITQE-SGLACLAFSSTDSR-SIIGNVQQQNWRIVFDVPNSQVGFAQEQ 447
Query: 373 C 373
C
Sbjct: 448 C 448
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 111/355 (31%), Positives = 160/355 (45%), Gaps = 24/355 (6%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ +G+P ++DTGSD+ WVQC PC QC+ Q P+++P+SSS+Y SC S
Sbjct: 127 EYLITVGLGSPATSQTM-LIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGS 185
Query: 83 EQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
C L + CSS C Y Y D S T G +++ + G+S + FGC +
Sbjct: 186 ADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSA--VRSFQFGCSNVE 243
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
+G FN+ GL+GLG SL SQ LG FSYCL P + S + G
Sbjct: 244 SG-FNDQTDGLMGLGGGAQSLVSQTAGTLG-RAFSYCLPPTPSSSGF---LTLGAAGGSG 298
Query: 201 GGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPP 259
G V T ++ S + T+Y V L+ I VG S IP ++ +D+G
Sbjct: 299 TSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLS---IP-----ASVFSAGTVMDSGTVI 350
Query: 260 TLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPL 318
T LP Y+ L + +K P P C+ + ++ P + F GGA V L
Sbjct: 351 TRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSL 410
Query: 319 IHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ + FA D +GI GN Q + YD +V F+ C
Sbjct: 411 DASGIIL----SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 461
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 114/364 (31%), Positives = 159/364 (43%), Gaps = 31/364 (8%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSC 80
EYV+ +GTP + + ++DTGSDL WVQC PC CY Q P+++P+ SS+Y + C
Sbjct: 123 EYVVTVGLGTPSVSQVL-LIDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTYAPIPC 181
Query: 81 QSEQCHLLDT-------VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV 133
++ C L S C + Y D S T+GV + E + D
Sbjct: 182 NTDACRDLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNETLALAPGVAVKD-FR 240
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
FGCGH+ G N+ GL+GLG SL Q S G FSYCL + +
Sbjct: 241 FGCGHDQDGA-NDKYDGLLGLGGAPESLVVQTASVYGG-AFSYCLPALNNQVGFLALGGG 298
Query: 194 GNGSE--VSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
G S V+ G V T ++ +E++T+Y V + GI+VG P A S G M
Sbjct: 299 GAPSGGVVNTSGFVFTPMI-REEETFYVVNMTGITVGGE-------PIDVPPSAFS-GGM 349
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGIA-PILTAH 309
ID+G T L YN L+ R A+ P R G CY + + P +
Sbjct: 350 IIDSGTVVTELQHTAYNALQAAFRKAMAAYPLV--RNGELDTCYDFSGYSNVTLPKVALT 407
Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
F GGA + L + + ++ F D GI GN Q L + YD V F+
Sbjct: 408 FSGGATIDLDVPNGIL---LDDCLAFQESGPDDQPGILGNVNQRTLEVLYDAGRGRVGFR 464
Query: 370 PTDC 373
C
Sbjct: 465 AAVC 468
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 123/379 (32%), Positives = 161/379 (42%), Gaps = 36/379 (9%)
Query: 3 PATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV---QC 59
PA++ Y ++ T N YV+ S+GTP + VDTGSDL WVQC PC C
Sbjct: 128 PASWGY-------DIGTLN--YVVTASLGTPGVAQTM-EVDTGSDLSWVQCKPCSAAPSC 177
Query: 60 YKQVKPIYNPASSSSYKELSCQSEQCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLAT 117
Y Q P+++PA SSSY + C C L S S C Y Y D S T GV ++
Sbjct: 178 YSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSS 237
Query: 118 ERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
+ +T ++++ FGCGH +G+FN + GL+GLGR + SL Q G FSYC
Sbjct: 238 DTLTL-SASSAVQGFFFGCGHAQSGLFNGVD-GLLGLGREQPSLVEQTAGTYG-GVFSYC 294
Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLI 237
L P ++ + G S + G + L S TYY V L GISVG S
Sbjct: 295 L-PTKPSTAGYLTLGLGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPAS 353
Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYK 295
+ G +DTG T LP Y L R+ + Y L CY
Sbjct: 354 AF--------AGGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYN 405
Query: 296 TPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
+ P + F GA V L G FA DG + I GN Q
Sbjct: 406 FAGYGTVTLPNVALTFGSGATVMLGADGIL----SFGCLAFAPSGSDGGMAILGNVQQRS 461
Query: 355 LFIGYDFDSQMVSFKPTDC 373
+ D V FKP+ C
Sbjct: 462 FEV--RIDGTSVGFKPSSC 478
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 112/362 (30%), Positives = 167/362 (46%), Gaps = 53/362 (14%)
Query: 41 IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSS----- 95
IVDTGSDL WVQCLPC CY Q +P++NP++SSS+ L C S C L + SS
Sbjct: 80 IVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSN 139
Query: 96 --QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVG 153
C+Y Y D S ++G L E++T G + DN +FGCG NN G+F GL+G
Sbjct: 140 KNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTE--IDNFIFGCGRNNKGLFG-GASGLMG 196
Query: 154 LGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM------YFGNGSEVSGGGVVST 207
L R+ LSL SQ S G+ FSYCL SS + + F N S +S ++
Sbjct: 197 LARSELSLVSQTSSLFGS-VFSYCLPTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQN 255
Query: 208 SLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFY 267
+S +YF+ L GIS+G ++ + +P +S+ + +D+G T L Y
Sbjct: 256 PQMSN----FYFLNLTGISIGGVNLN---VPRLSSNEGVLS---LLDSGTVITRLSPSIY 305
Query: 268 NRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGI-APILTAHFDGGAKVPLIHTSTF 324
+ + + + Y+ S L C+ + P + F+G A++ +
Sbjct: 306 KAFKAEFEK--QFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIV------ 357
Query: 325 IPPPVEGVF----------CFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
VEGVF C A + + I GN+ Q + + Y+ V F
Sbjct: 358 ---DVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEP 414
Query: 373 CT 374
C+
Sbjct: 415 CS 416
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 114/363 (31%), Positives = 169/363 (46%), Gaps = 55/363 (15%)
Query: 41 IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSS----- 95
IVDTGSDL WVQCLPC CY Q +P++NP++SSS+ L C S C L + SS
Sbjct: 159 IVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSN 218
Query: 96 --QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVG 153
C+Y Y D S ++G L E++T G + DN +FGCG NN G+F GL+G
Sbjct: 219 KNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTE--IDNFIFGCGRNNKGLFG-GASGLMG 275
Query: 154 LGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM------YFGNGSEVSGGGVVST 207
L R+ LSL SQ S G+ FSYCL SS + + F N S +S ++
Sbjct: 276 LARSELSLVSQTSSLFGS-VFSYCLPTTGVGSSGSLTLGGADFSNFKNISPISYTRMIQN 334
Query: 208 SLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS-GAISKGNMFIDTGAPPTLLPKDF 266
+S +YF+ L GIS+G ++ + +P +S+ G +S +D+G T L
Sbjct: 335 PQMSN----FYFLNLTGISIGGVNLN---VPRLSSNEGVLS----LLDSGTVITRLSPSI 383
Query: 267 YNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGI-APILTAHFDGGAKVPLIHTST 323
Y + + + + Y+ S L C+ + P + F+G A++ +
Sbjct: 384 YKAFKAEFEK--QFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIV----- 436
Query: 324 FIPPPVEGVF----------CFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
VEGVF C A + + I GN+ Q + + Y+ V F
Sbjct: 437 ----DVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGE 492
Query: 372 DCT 374
C+
Sbjct: 493 PCS 495
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 111/355 (31%), Positives = 160/355 (45%), Gaps = 24/355 (6%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ +G+P ++DTGSD+ WVQC PC QC+ Q P+++P+SSS+Y SC S
Sbjct: 51 EYLITVGLGSPATSQTM-LIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGS 109
Query: 83 EQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
C L + CSS C Y Y D S T G +++ + G+S + FGC +
Sbjct: 110 ADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSA--VRSFQFGCSNVE 167
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
+G FN+ GL+GLG SL SQ LG FSYCL P + S + G
Sbjct: 168 SG-FNDQTDGLMGLGGGAQSLVSQTAGTLG-RAFSYCLPPTPSSSGF---LTLGAAGGSG 222
Query: 201 GGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPP 259
G V T ++ S + T+Y V L+ I VG S IP ++ +D+G
Sbjct: 223 TSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLS---IP-----ASVFSAGTVMDSGTVI 274
Query: 260 TLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPL 318
T LP Y+ L + +K P P C+ + ++ P + F GGA V L
Sbjct: 275 TRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSL 334
Query: 319 IHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ + FA D +GI GN Q + YD +V F+ C
Sbjct: 335 DASGIIL----SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 385
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 130 bits (328), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 111/355 (31%), Positives = 160/355 (45%), Gaps = 24/355 (6%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ +G+P ++DTGSD+ WVQC PC QC+ Q P+++P+SSS+Y SC S
Sbjct: 197 EYLITVGLGSPATSQTM-LIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGS 255
Query: 83 EQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
C L + CSS C Y Y D S T G +++ + G+S + FGC +
Sbjct: 256 ADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSA--VRSFQFGCSNVE 313
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
+G FN+ GL+GLG SL SQ LG FSYCL P + S + G
Sbjct: 314 SG-FNDQTDGLMGLGGGAQSLVSQTAGTLG-RAFSYCLPPTPSSSGF---LTLGAAGGSG 368
Query: 201 GGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPP 259
G V T ++ S + T+Y V L+ I VG S IP ++ +D+G
Sbjct: 369 TSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLS---IP-----ASVFSAGTVMDSGTVI 420
Query: 260 TLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPL 318
T LP Y+ L + +K P P C+ + ++ P + F GGA V L
Sbjct: 421 TRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSL 480
Query: 319 IHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ + FA D +GI GN Q + YD +V F+ C
Sbjct: 481 DASGIIL----SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 531
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 130 bits (328), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 109/380 (28%), Positives = 177/380 (46%), Gaps = 37/380 (9%)
Query: 10 NNVVQSNVS-TANGEYVM-KFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIY 67
NN ++ VS + G +M SIG PP+ + ++DTGSD++WV C PC C + ++
Sbjct: 85 NNEYKARVSPSLTGRTIMANISIGQPPIPQLV-VMDTGSDILWVMCTPCTNCDNHLGLLF 143
Query: 68 NPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN- 126
+P+ SS++ L C++ D CS +T YAD+S G+ + + F ++
Sbjct: 144 DPSMSSTFSPL-CKTP----CDFKGCSRCDPIPFTVTYADNSTASGMFGRDTVVFETTDE 198
Query: 127 --NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
+ +V+FGCGHN + G++GL SLA++I KFSYC+
Sbjct: 199 GTSRIPDVLFGCGHNIGQDTDPGHNGILGLNNGPDSLATKI-----GQKFSYCIGDLADP 253
Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
++ G G+++ G S + +Y+VT+EGISVG + P
Sbjct: 254 YYNYHQLILGEGADLEG-----YSTPFEVHNGFYYVTMEGISVGE--KRLDIAPETFEMK 306
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI-----KLTPYQDPRLGSQLCYKTPSM 299
G + IDTG+ T L + L ++VRN + + T + P + Q Y + S
Sbjct: 307 KNRTGGVIIDTGSTITFLVDSVHRLLSKEVRNLLGWSFRQTTIEKSPWM--QCFYGSISR 364
Query: 300 AGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-----DGDVGIFGNFAQS 353
+ P++T HF GA + L + +F + VFC + P+ + G AQ
Sbjct: 365 DLVGFPVVTFHFADGADLAL-DSGSFFNQLNDNVFCMTVGPVSSLNLKSKPSLIGLLAQQ 423
Query: 354 DLFIGYDFDSQMVSFKPTDC 373
+GYD +Q V F+ DC
Sbjct: 424 SYSVGYDLVNQFVYFQRIDC 443
>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
Length = 414
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 112/338 (33%), Positives = 161/338 (47%), Gaps = 35/338 (10%)
Query: 58 QCYKQVKPIYNPASSSSYKELSCQSEQCHLLDT--VSCSSQQLCNYTYGYADSSLTKGVL 115
+C + P + PASSS++ +L C S C L + ++C++ C Y Y Y T G L
Sbjct: 87 ECAARPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATG-CVYYYPYG-MGFTAGYL 144
Query: 116 ATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFS 175
ATE + G ++ F V FGC N GV N + G+VGLGR+ LSL SQ+ G +FS
Sbjct: 145 ATETLHVGGAS--FPGVAFGCSTEN-GVGNSSS-GIVGLGRSPLSLVSQV----GVGRFS 196
Query: 176 YCLVPFHTDSSI-TSKMYFGNGSEVSGGGVVSTSLVSKE--DKTYYFVTLEGISVG--NL 230
YCL +D+ S + FG+ ++V+GG L + E +YY+V L GI+VG +L
Sbjct: 197 YCL---RSDADAGDSPILFGSLAKVTGGKSSPAILENPEMPSSSYYYVNLTGITVGATDL 253
Query: 231 SNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEE----QVRNAIKLTPYQDP 286
+S + +GA G +D+G T L K+ Y ++ Q+ A T
Sbjct: 254 PVTSTTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGT 313
Query: 287 RLGSQLCYKTPSMAGIA----PILTAHFDGGAKVPLIHTSTFIPPPVE-----GVFCFAM 337
R G LC+ + G + P L F GGA+ + S V+ V C +
Sbjct: 314 RFGFDLCFDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAVECLLV 373
Query: 338 QPIDG--DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
P + I GN Q DL + YD D M SF P DC
Sbjct: 374 LPASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADC 411
>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 459
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 112/369 (30%), Positives = 174/369 (47%), Gaps = 43/369 (11%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
+++ FSIG PP+ Y ++DTGS L W+QC PC+ C++Q P+YNP+SSS+Y S+
Sbjct: 110 FLVNFSIGQPPVPQ-YAVMDTGSSLTWIQCEPCINCHQQKGPLYNPSSSSTYVS---CSD 165
Query: 84 QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN---FFDNVVFGCGHNN 140
T + + CNY+ YAD + T+G A E++ F ++ +V+FGCGHNN
Sbjct: 166 FDRTDTTFTATHGSDCNYSQTYADKTTTRGTYAREQLLFETPDDGITIMHDVIFGCGHNN 225
Query: 141 TGVFNEN--EMGLVGLGRTRLSLASQILSQLGANKFSYC-------LVPFHTDSSITSKM 191
T + G+ GLG + S I+S+LG FSYC L FH ++
Sbjct: 226 TQLPGPTGYASGVFGLGDS----GSSIISKLGFG-FSYCIGNIGDPLYGFH-------RL 273
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
GN ++ G ST LV + Y++TL GIS+G I + +
Sbjct: 274 TLGNKLKIEG---YSTPLVP---RGLYYITLVGISIGQERLDIDPIVFQRVDLNGISSRI 327
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQLCY---KTPSMAGIAPIL 306
ID+GA + +P+ YN + ++V + + L+ Y+ LCY + G P
Sbjct: 328 VIDSGATLSYIPRQAYNVVRDKVSSILSGFLSRYRYIARHLSLCYIGKLNQDLQGF-PDA 386
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD--VGIFGNFAQSDLFIGYDFDSQ 364
T H GA + + + V C A+ P + D + G AQ + YD Q
Sbjct: 387 TFHLADGADL-VFQVEGLFFQYTDNVLCLALVPTESDEETCLIGLLAQQYYNVAYDLKQQ 445
Query: 365 MVSFKPTDC 373
+ F+ +C
Sbjct: 446 KLYFQRIEC 454
>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 61/144 (42%), Positives = 88/144 (61%), Gaps = 4/144 (2%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S ++ +GEY + +GTPP +Y ++DTGSD++W+QC PC +CY Q P+++P S
Sbjct: 163 VTSGLAQGSGEYFTRLGVGTPPKY-VYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKS 221
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+ +SC+S C LD+ C+S+Q C Y Y D S T G +TE +TF + V
Sbjct: 222 GSFSSISCRSPLCLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLTFRGTR--VPKV 279
Query: 133 VFGCGHNNTGVFNENEMGLVGLGR 156
GCGH+N G+F GL+GLGR
Sbjct: 280 ALGCGHDNEGLF-VGAAGLLGLGR 302
>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
Length = 396
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 108/371 (29%), Positives = 169/371 (45%), Gaps = 42/371 (11%)
Query: 25 VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQ 84
V F+IGTPP I+D +L+W QC C +C+KQ P++ P +SS+++ C ++
Sbjct: 44 VANFTIGTPPQ-PASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDA 102
Query: 85 CHLLDTVSCSSQQLCNY---TYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
C T +CS +C Y T D T G++ TE G + ++ FGC +
Sbjct: 103 CKSTPTSNCSG-DVCTYESTTNIRLDRHTTLGIVGTETFAIGTATA---SLAFGCVVASD 158
Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
+ G +GLGRT SL ++Q+ KFSYCL P T S S+++ G+ ++++G
Sbjct: 159 IDTMDGTSGFIGLGRTPRSL----VAQMKLTKFSYCLSPRGTGKS--SRLFLGSSAKLAG 212
Query: 202 GGVVSTS---LVSKEDKT--YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
G ST+ S +D + YY ++L+ I GN + ++ A S G + + T
Sbjct: 213 GESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIAT----------AQSGGILVMHTV 262
Query: 257 APPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYKTPS--MAGIAPILTAHFD 311
+P +LL Y ++ V A+ P P LC+K + AP L F
Sbjct: 263 SPFSLLVDSAYRAFKKAVTEAVGGAAEQPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQ 322
Query: 312 GGAK--VP----LIHTSTFIPPPVEGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDFDS 363
G A VP LI + A G V + G+ Q D+ YD
Sbjct: 323 GAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKK 382
Query: 364 QMVSFKPTDCT 374
+ +SF+P DC+
Sbjct: 383 ETLSFEPADCS 393
>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
Length = 492
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 111/365 (30%), Positives = 168/365 (46%), Gaps = 35/365 (9%)
Query: 18 STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKE 77
+T G YV + IGTPP + G +D SDL+W C P +NP S++ +
Sbjct: 94 ATNAGMYVFSYGIGTPPQ-QVSGALDISSDLVWTAC-------GATAP-FNPVRSTTVAD 144
Query: 78 LSCQSEQCHLLDTVSCSS-----QQLCNYTYGYADSSL-TKGVLATERITFGNSNNFFDN 131
+ C + C +C + C YTY Y + T G+L TE TFG++ D
Sbjct: 145 VPCTDDACQQFAPQTCGAGAGAGSSECAYTYMYGGGAANTTGLLGTEAFTFGDTR--IDG 202
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
VVFGCG N G F+ G++GLGR LSL +SQL ++FSY P D S+ ++
Sbjct: 203 VVFGCGLQNVGDFS-GVSGVIGLGRGNLSL----VSQLQVDRFSYHFAP---DDSVDTQS 254
Query: 192 Y--FGNGSEVSGGGVVSTSLVSKE-DKTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAI 246
+ FG+ + +ST L++ + + + Y+V L GI V +L+ S N G+
Sbjct: 255 FILFGDDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGS- 313
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA-GIAPI 305
G +F+ T+L + Y L + V + I L LG LCY S+A P
Sbjct: 314 --GGVFLSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGLDLCYTGESLAKAKVPS 371
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID-GDVGIFGNFAQSDLFIGYDFDSQ 364
+ F GGA + L + F G+ C + P GD + G+ Q + YD +
Sbjct: 372 MALVFAGGAVMELELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGS 431
Query: 365 MVSFK 369
+ F+
Sbjct: 432 KLVFE 436
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 106/361 (29%), Positives = 164/361 (45%), Gaps = 24/361 (6%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYK 76
V + +GEY+++ GTP +Y ++DTGSD+ W+ C C C+ PI++PA SSSYK
Sbjct: 108 VRSGSGEYIIQVDFGTPKQ-SMYTLIDTGSDVAWIPCKQCQGCHS-TAPIFDPAKSSSYK 165
Query: 77 ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
+C S+ C + + +C C + Y D + G LA++ IT G + + N FGC
Sbjct: 166 PFACDSQPCQEI-SGNCGGNSKCQFEVLYGDGTQVDGTLASDAITLG--SQYLPNFSFGC 222
Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
+ + + + G + L ++L FSYCL + S+ + + G
Sbjct: 223 AESLSEDTYSSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCL---PSSSTSSGSLVLGKE 279
Query: 197 SEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
+ VS + T+L+ T+YFVTL+ ISVGN S +P N + S G ID+
Sbjct: 280 AAVSSSSLKFTTLIKDPSFPTFYFVTLKAISVGNTRIS---VPATNIA---SGGGTIIDS 333
Query: 256 GAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDG 312
G T L Y L + R ++++ TP +D CY S + P +T H D
Sbjct: 334 GTTITYLVPSAYKDLRDAFRQQLSSLQPTPVED----MDTCYDLSSSSVDVPTITLHLDR 389
Query: 313 GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
+ L + I G+ C A D I GN Q + I +D + V F
Sbjct: 390 NVDLVLPKENILITQE-SGLSCLAFSSTDSR-SIIGNVQQQNWRIVFDVPNSQVGFAQEQ 447
Query: 373 C 373
C
Sbjct: 448 C 448
>gi|356558489|ref|XP_003547539.1| PREDICTED: uncharacterized protein LOC100817234 [Glycine max]
Length = 739
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 97/246 (39%), Positives = 134/246 (54%), Gaps = 12/246 (4%)
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
F + GCG NN G F+ G+VGLG +SL S I + + K+SYCLVP +S T
Sbjct: 58 FPKIPIGCGLNNAGTFDSKCFGIVGLGGGVVSLISHIGLSIDS-KYSYCLVPLFEFNS-T 115
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS- 247
SK+ FG + V G G VST ++ T+Y++ LEG+SVG SK I + ++S +
Sbjct: 116 SKINFGENAVVEGLGTVSTPIIPGSFDTFYYLKLEGMSVG-----SKRIDFVDASTSNEL 170
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APIL 306
KGN+ ID+G T+L ++FY +LE +V I L LCYK+P I PI+
Sbjct: 171 KGNIIIDSGTTLTILLENFYTKLEAEVEAHINLERVNSTDQILSLCYKSPPNNAIEVPII 230
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
T HF G + L +TF+ + FA P+ IFGN AQ + +GYD + V
Sbjct: 231 TTHF-AGVDIVLNSLNTFV-SVFDDAMWFAFAPV-ASGSIFGNLAQMNHLVGYDLLRKTV 287
Query: 367 SFKPTD 372
SFKPTD
Sbjct: 288 SFKPTD 293
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 113/353 (32%), Positives = 164/353 (46%), Gaps = 24/353 (6%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EY++ +G+P + ++DTGSD+ WVQC PC QC+ Q +++P+SSS+Y SC S
Sbjct: 126 EYLITVGMGSPAVAQTM-LIDTGSDVSWVQCKPCSQCHSQADSLFDPSSSSTYSAFSCTS 184
Query: 83 EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG 142
C L CSS Q C YT Y D S G +++ + G+S +N FGC + +G
Sbjct: 185 AACAQLRQRGCSSSQ-CQYTVKYGDGSTGSGTYSSDTLALGSST--VENFQFGCSQSESG 241
Query: 143 -VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
+ + GL+GLG SLA+Q G FSYCL P + S + G+ SG
Sbjct: 242 NLLQDQTAGLMGLGGGAESLATQTAGTFG-KAFSYCLPP-----TPGSSGFLTLGASTSG 295
Query: 202 GGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTL 261
V + L S + +YY V L+ I VG + IP + A S G++ +D+G T
Sbjct: 296 FVVKTPMLRSTQVPSYYGVLLQAIRVGGRQLN---IP----ASAFSAGSI-MDSGTIITR 347
Query: 262 LPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIH 320
LP+ Y+ L + +K P P C+ + ++ P + F GGA V L
Sbjct: 348 LPRTAYSALSSAFKAGMKQYPPAQPMGIFDTCFDFSGQSSVSIPTVALVFSGGAVVDLAS 407
Query: 321 TSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ FA D +GI GN Q + YD V FK C
Sbjct: 408 DGIIL----GSCLAFAANSDDTSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 456
>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
Length = 332
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 109/363 (30%), Positives = 172/363 (47%), Gaps = 42/363 (11%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
G Y ++G+PP D ++DTGSDL WV+C PC ++ +S++YK L+C
Sbjct: 1 GVYYSTITLGSPPK-DFSLVMDTGSDLTWVRCDPCS---PDCSSTFDRLASNTYKALTCA 56
Query: 82 SEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN----FFDNVVFGCG 137
+ Y+YGY D S T+G L+ + + + + F VFGCG
Sbjct: 57 DD-----------------YSYGYGDGSFTQGDLSVDTLKMAGAASDELEEFPGFVFGCG 99
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT-SKMYFGNG 196
G+ + E+G++ L LS SQI + G NKFSYCL+ +S+ S M FG
Sbjct: 100 SLLKGLIS-GEVGILALSPGSLSFPSQIGEKYG-NKFSYCLLRQTAQNSLKKSPMVFGEA 157
Query: 197 S---EVSGGGVVSTSLVSK--EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
+ + G G + + E YY V L+GISVGN + N K +
Sbjct: 158 AVELKEPGSGKLQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSAFLNGQ---DKPTI 214
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHF 310
F D+G T+LP + +++ + + + + + G C++ P +G P +T HF
Sbjct: 215 F-DSGTTLTMLPPGVCDSIKQSLASMVSGAEFVAIK-GLDACFRVPPSSGQGLPDITFHF 272
Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
+GGA ++ I + + C P + +V IFGN Q D F+ +D D++ + FK
Sbjct: 273 NGGADFVTRPSNYVID--LGSLQCLIFVPTN-EVSIFGNLQQQDFFVLHDMDNRRIGFKE 329
Query: 371 TDC 373
TDC
Sbjct: 330 TDC 332
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 88/222 (39%), Positives = 117/222 (52%), Gaps = 20/222 (9%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
N T N Y++ +G D+ I+DTGSDL WVQC PC+ CY Q P++ P++SSSY
Sbjct: 139 NFQTLN--YIVTMELGGQ---DMTVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSSSY 193
Query: 76 KELSCQSEQCHLLDTV-----SCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
+ + C S C L +C S C+Y Y D S T G L E ++FG +
Sbjct: 194 QSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSFGGIS--V 251
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
N VFGCG NN G+F GL+GLGR+ LSL SQ S G FSYCL P TD+ +
Sbjct: 252 SNFVFGCGKNNKGLFG-GVSGLMGLGRSNLSLISQTNSTFGG-VFSYCLPP--TDAGASG 307
Query: 190 KMYFGNGSEVSGG--GVVSTSLV-SKEDKTYYFVTLEGISVG 228
+ GN S V + T +V + + +Y + L GI VG
Sbjct: 308 SLAMGNESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVG 349
>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
Length = 328
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 82/227 (36%), Positives = 117/227 (51%), Gaps = 28/227 (12%)
Query: 24 YVMKFSIGTP---PLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
YV S+G P ++ IVDTGSDL WVQC PC CY Q P+++PA S++Y + C
Sbjct: 92 YVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRC 151
Query: 81 QSEQCHLLDTV--------SCSS----QQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
+ C D++ SC S + C Y Y D S ++GVLAT+ + G ++
Sbjct: 152 NASACA--DSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGAS-- 207
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
VFGCG +N G+F GL+GLGRT LSL SQ S+ G FSYCL P T +
Sbjct: 208 LGGFVFGCGLSNRGLFG-GTAGLMGLGRTELSLVSQTASRYG-GVFSYCL-PAATSGDAS 264
Query: 189 SKMYFGNGSEVSGG-----GVVSTSLVSKEDK-TYYFVTLEGISVGN 229
+ G G + + V T +++ + +YF+ + G +VG
Sbjct: 265 GSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGG 311
>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
Length = 456
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 108/370 (29%), Positives = 157/370 (42%), Gaps = 37/370 (10%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMW--VQCLP-CVQCYKQVKPIYNPAS 71
S + GEY + +GTP + ++DTGSD++W V+ LP ++ +Q A+
Sbjct: 113 SGLPQGTGEYFAQVGVGTPATTALM-VLDTGSDVVWAPVRALPPLLRAVRQGS--STGAA 169
Query: 72 SSSYKELSCQSEQCHLLDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNSNNFFD 130
+ +C + C LD+ C ++ C Y Y D S+T G A+E +TF
Sbjct: 170 PAPTPRWNCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGARV-Q 228
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
V GCGH+N G+F L R RLS SQI G FSYCLV +
Sbjct: 229 RVAIGCGHDNEGLFIAASGLLGLG-RGRLSFPSQIARSFG-RSFSYCLVDRTSSRRARPS 286
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
+G ++ T+Y+V L G SVG + +G
Sbjct: 287 RRWGGTPRMA---------------TFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGG 331
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQL---CYKTPSMAGI-A 303
+ +D+G T L + Y + + R A ++++P G L CY +
Sbjct: 332 VILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPG-----GFSLFDTCYNLSGRRVVKV 386
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
P ++ H GGA V L + IP G FCFAM DG V I GN Q + +D D+
Sbjct: 387 PTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDA 446
Query: 364 QMVSFKPTDC 373
Q V F P C
Sbjct: 447 QRVGFVPKSC 456
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 119/409 (29%), Positives = 174/409 (42%), Gaps = 50/409 (12%)
Query: 5 TYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC----LPCVQCY 60
T F+ + ++S G+Y++ + GTPP ++ I DTGSDL+W+QC P C
Sbjct: 35 TSFWAESPMESGAFLGLGQYLVSMAFGTPPQ-EVLLIADTGSDLIWLQCSTTAAPPAFCP 93
Query: 61 KQV---KPIYNPASSSSYKELSCQSEQCHLLDTV-----SCSSQQL--CNYTYGYADSSL 110
K+ +P + + S++ + C + QC L+ SCS C Y Y YAD S
Sbjct: 94 KKACSRRPAFVASKSATLSVVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSS 153
Query: 111 TKGVLATERITFGNSNN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILS 167
T G LA + T N + V FGCG N G G++GLG+ +LS +Q S
Sbjct: 154 TTGFLARDTATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGS 213
Query: 168 QLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKE-DKTYYFVTLEGIS 226
L A FSYCL+ S + G T LVS T+Y+V + I
Sbjct: 214 -LFAQTFSYCLLDLEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIR 272
Query: 227 VGNLSNSSKLIPYYNSSGAI---SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY 283
VGN +++P S AI G ID+G+ T L Y L ++ L
Sbjct: 273 VGN-----RVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHL--- 324
Query: 284 QDPRL--------GSQLCYKTPSMAGIAPI------LTAHFDGGAKVPLIHTSTFIPPPV 329
PR+ G +LCY S + +AP LT F G + L T ++
Sbjct: 325 --PRIPSSATFFQGLELCYNVSSSSSLAPANGGFPRLTIDFAQGLSLEL-PTGNYLVDVA 381
Query: 330 EGVFCFAMQPIDGDVG--IFGNFAQSDLFIGYDFDSQMVSFKPTDCTKQ 376
+ V C A++P + GN Q + +D S + F T+C
Sbjct: 382 DDVKCLAIRPTLSPFAFNVLGNLMQQGYHVEFDRASARIGFARTECVAH 430
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 108/346 (31%), Positives = 162/346 (46%), Gaps = 29/346 (8%)
Query: 41 IVDTGSDLMWVQCLPC-VQCYKQVKPIYNPASSSSYKELSCQSEQCHLL------DTVSC 93
I+DTGS L W+QC PC V C+ Q P+Y+P+ S +YK+LSC S +C L D +
Sbjct: 2 ILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLCE 61
Query: 94 SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVG 153
+ C YT Y D+S + G L+ + +T +S +GCG +N G+F G++G
Sbjct: 62 TDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQT-LPQFTYGCGQDNQGLFGR-AAGIIG 119
Query: 154 LGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKE 213
L R +LS+ +Q+ ++ G + FSYCL ++ SS + G+ S S + L +
Sbjct: 120 LARDKLSMLAQLSTKYG-HAFSYCLPTANSGSSGGGFLSIGSISPTSYK--FTPMLTDSK 176
Query: 214 DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQ 273
+ + YF+ L I+V + + A+ + ID+G T LP Y L Q
Sbjct: 177 NPSLYFLRLTAITVSGRP--------LDLAAAMYRVPTLIDSGTVITRLPMSMYAAL-RQ 227
Query: 274 VRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA--PILTAHFDGGAKVPLIHTSTFIPPPV 329
I T Y S L C+K S+ I+ P + F GGA + L S I
Sbjct: 228 AFVKIMSTKYAKAPAYSILDTCFKG-SLKSISAVPEIKMIFQGGADLTLRAPSILIEAD- 285
Query: 330 EGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+G+ C A G + I GN Q I YD + + F P C
Sbjct: 286 KGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 331
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 120/372 (32%), Positives = 178/372 (47%), Gaps = 31/372 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPAS 71
QS ++ G YV+ +GTP D + DTGS + W QC PC+ CY Q + ++P
Sbjct: 124 AQSGIAIGTGNYVVTVGLGTPKE-DFTLVFDTGSGITWTQCQPCLGSCYPQKEQKFDPTK 182
Query: 72 SSSYKELSCQSEQCHLLDTVS--CS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
S+SY +SC S C+LL T CS S C Y Y D S ++G ATE +T +S++
Sbjct: 183 STSYNNVSCSSASCNLLPTSERGCSASNSTCLYQIIYGDQSYSQGFFATETLTI-SSSDV 241
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
F N +FGCG +N G+F + GL+GL + +SL SQ + +FSYCL S+ +
Sbjct: 242 FTNFLFGCGQSNNGLFGQ-AAGLLGLSSSSVSLPSQTAEKY-QKQFSYCL-----PSTPS 294
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP-YYNSSGAIS 247
S Y G +VS + +S ++Y + + GISV + + P + +SGAI
Sbjct: 295 STGYLNFGGKVS--QTAGFTPISPAFSSFYGIDIVGISVAG--SQLPIDPSIFTTSGAI- 349
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PIL 306
ID+G T LP Y L+E + P + CY + ++ P +
Sbjct: 350 -----IDSGTVITRLPPTAYKALKEAFDEKMSNYPKTNGDELLDTCYDFSNYTTVSFPKV 404
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGV----FCFAMQPIDGDVGIFGNFAQSDLFIGYDFD 362
+ F GG +V + ++ I V GV FA D + GIFGN Q + YD
Sbjct: 405 SVSFKGGVEVDI--DASGILYLVNGVKMVCLAFAANKDDSEFGIFGNHQQKTYEVVYDGA 462
Query: 363 SQMVSFKPTDCT 374
M+ F C+
Sbjct: 463 KGMIGFAAGACS 474
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 104/357 (29%), Positives = 158/357 (44%), Gaps = 36/357 (10%)
Query: 41 IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHL-LDTV-----SCS 94
IVDTGSDL WVQC PC CY Q P+++P+ S+SY + C + C L SC+
Sbjct: 179 IVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCA 238
Query: 95 S---------QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFN 145
+ + C Y+ Y D S ++GVLAT+ + G ++ D VFGCG +N G+F
Sbjct: 239 TVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS--VDGFVFGCGLSNRGLFG 296
Query: 146 ENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVV 205
GL+GLGRT LSL SQ + G FSYCL + + S G+ S V
Sbjct: 297 -GTAGLMGLGRTELSLVSQTAPRFG-GVFSYCLPAATSGDAAGSLSLGGDTSSYRNATPV 354
Query: 206 S-TSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLP 263
S T +++ + +YF+ + + + ++ + N+ +D+G T L
Sbjct: 355 SYTRMIADPAQPPFYFMNV---------TGASVGGAAVAAAGLGAANVLLDSGTVITRLA 405
Query: 264 KDFYN--RLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIH 320
Y R E + + P P CY + P+LT +GGA + +
Sbjct: 406 PSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEGGADMTVDA 465
Query: 321 TSTFIPPPVEGV-FCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+G C AM + + I GN+ Q + + YD + F DC+
Sbjct: 466 AGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 522
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 104/357 (29%), Positives = 158/357 (44%), Gaps = 36/357 (10%)
Query: 41 IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHL-LDTV-----SCS 94
IVDTGSDL WVQC PC CY Q P+++P+ S+SY + C + C L SC+
Sbjct: 180 IVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCA 239
Query: 95 S---------QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFN 145
+ + C Y+ Y D S ++GVLAT+ + G ++ D VFGCG +N G+F
Sbjct: 240 TVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS--VDGFVFGCGLSNRGLFG 297
Query: 146 ENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVV 205
GL+GLGRT LSL SQ + G FSYCL + + S G+ S V
Sbjct: 298 -GTAGLMGLGRTELSLVSQTAPRFG-GVFSYCLPAATSGDAAGSLSLGGDTSSYRNATPV 355
Query: 206 S-TSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLP 263
S T +++ + +YF+ + + + ++ + N+ +D+G T L
Sbjct: 356 SYTRMIADPAQPPFYFMNV---------TGASVGGAAVAAAGLGAANVLLDSGTVITRLA 406
Query: 264 KDFYN--RLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIH 320
Y R E + + P P CY + P+LT +GGA + +
Sbjct: 407 PSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEGGADMTVDA 466
Query: 321 TSTFIPPPVEGV-FCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+G C AM + + I GN+ Q + + YD + F DC+
Sbjct: 467 AGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 523
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 127 bits (319), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 107/367 (29%), Positives = 161/367 (43%), Gaps = 29/367 (7%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSY 75
S +G Y +K +G+P IVDTGS L W+QC PCV C+ Q P+++P++S +Y
Sbjct: 6 ASIGSGNYYVKVGLGSPARYYSM-IVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTY 64
Query: 76 KELSCQSEQCHLL------DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
K LSC S QC L + + +S +C YT Y DSS + G L+ + +T S
Sbjct: 65 KSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQT-L 123
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
V+GCG ++ G+F G++GLGR +LS+ Q+ S+ G FSYCL P S
Sbjct: 124 PGFVYGCGQDSEGLFGR-AAGILGLGRNKLSMLGQVSSKFG-YAFSYCL-PTRGGGGFLS 180
Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
G S + + + YF+ L I+VG + Y +
Sbjct: 181 ---IGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQY--------RV 229
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYK--TPSMAGIAPIL 306
ID+G T LP Y ++ + + P C+K M + P +
Sbjct: 230 PTIIDSGTVITRLPMSVYTPFQQAFVKIMSSKYARAPGFSILDTCFKGNLKDMQSV-PEV 288
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
F GGA + L + + EG+ C A +G V I GN Q + +D + +
Sbjct: 289 RLIFQGGADLNLRPVNVLLQVD-EGLTCLAFAGNNG-VAIIGNHQQQTFKVAHDISTARI 346
Query: 367 SFKPTDC 373
F C
Sbjct: 347 GFATGGC 353
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 118/385 (30%), Positives = 179/385 (46%), Gaps = 42/385 (10%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPASSSSYKELSC 80
+Y+ ++ IG PP I+DTGS+L+W QC C C+ Q Y+P+ S + K ++C
Sbjct: 83 QYIAEYLIGDPPQ-QAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSRSRTAKPVAC 141
Query: 81 QSEQCHLLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV--VFGC- 136
C L C+ + C Y ++ G L TE TFG+ + +NV FGC
Sbjct: 142 NDTACLLGSETRCARDGKACAVLTAYGAGAI-GGFLGTEVFTFGHGQSSENNVSLAFGCI 200
Query: 137 -GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
T + G++GLGR +LSL SQ LG NKFSYCL P+ +D++ TS ++ G
Sbjct: 201 TASRLTPGSLDGASGIIGLGRGKLSLPSQ----LGDNKFSYCLTPYFSDAANTSTLFVGA 256
Query: 196 GSEVSGGGVVSTS---LVSKED---KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK- 248
+ +SGGG +TS L + +D ++Y++ L GI+VG + A +K
Sbjct: 257 SAGLSGGGAPATSVPFLKNPDDDPFDSFYYLPLTGITVGTAKLDVPAAAFDLREVAPAKW 316
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAI--KLTPYQDPRLGSQLCYK--TPSMAG-IA 303
G ID+G+P T L Y L +++ + + P G LC P AG +
Sbjct: 317 GGTLIDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPAGAEGLDLCVGGVAPGDAGKLV 376
Query: 304 PILTAHFDGGAKVPLIHTSTFIPP-----PV-EGVFCFAMQPIDG--------DVGIFGN 349
P L HF +PP PV + C + G + I GN
Sbjct: 377 PPLVLHF---GSGGGGGGDVVVPPENYWGPVDDSTACMVVFSSGGPNSTLPLNETTIIGN 433
Query: 350 FAQSDLFIGYDFDSQMVSFKPTDCT 374
+ Q D+ + YD ++SF+P DC+
Sbjct: 434 YMQQDMHLLYDLGQGVLSFQPADCS 458
>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
Length = 373
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 99/366 (27%), Positives = 168/366 (45%), Gaps = 38/366 (10%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
Y+ +IGTPP I+ + +W QC PC +C+KQ P++N ++SS+Y+ C +
Sbjct: 28 YMANLTIGTPPQ-PASAIIHLAGEFVWTQCSPCRRCFKQDLPLFNRSASSTYRPEPCGTA 86
Query: 84 QCHLLDTVSCSSQQLCNYTYG--YADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
C + +CS +C+Y + D+S G+ T+ G + ++ FGC ++
Sbjct: 87 LCESVPASTCSGDGVCSYEVETMFGDTS---GIGGTDTFAIGTATA---SLAFGCAMDSN 140
Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
G+VGLGRT SL + Q+ A FSYCL P H + S + G ++++G
Sbjct: 141 IKQLLGASGVVGLGRTPWSL----VGQMNATAFSYCLAP-HGAAGKKSALLLGASAKLAG 195
Query: 202 G-GVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPP 259
G +T LV + +D + Y + LEGI G++ + P N S + +DT
Sbjct: 196 GKSAATTPLVNTSDDSSDYMIHLEGIKFGDV----IIAPPPNGS------VVLVDTIFGV 245
Query: 260 TLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA------PILTAHFDGG 313
+ L + +++ V A+ P P LC+ + A A P + F G
Sbjct: 246 SFLVDAAFQAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQGA 305
Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQP-----IDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
A + + S ++ G C AM + ++ I G Q ++ +D D + +SF
Sbjct: 306 AAL-TVPPSKYMYDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSF 364
Query: 369 KPTDCT 374
+P DC+
Sbjct: 365 EPADCS 370
>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
Length = 366
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 84/219 (38%), Positives = 121/219 (55%), Gaps = 9/219 (4%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S + +GEY + +GTP + Y ++DTGSD+ W+QC PC +CY Q PI+NP+ S
Sbjct: 146 VVSGMEQGSGEYFTRIGVGTP-TREQYMVLDTGSDVAWIQCEPCRECYSQADPIFNPSYS 204
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
+S+ + C S C LD C S C Y Y D S + G ATE +TFG ++ NV
Sbjct: 205 ASFSTVGCDSAVCSQLDAYDCHSGG-CLYEASYGDGSYSTGSFATETLTFGTTS--VANV 261
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GCGH N G+F L+GLG LS +QI +Q G + FSYCLV +DSS +
Sbjct: 262 AIGCGHKNVGLFIGAAG-LLGLGAGALSFPNQIGTQTG-HTFSYCLVDRESDSS--GPLQ 317
Query: 193 FGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLS 231
FG S V G + + + T+Y++++ IS+ ++
Sbjct: 318 FGPKS-VPVGSIFTPLEKNPHLPTFYYLSVTAISISAIA 355
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 105/341 (30%), Positives = 146/341 (42%), Gaps = 26/341 (7%)
Query: 41 IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTV---SCS- 94
+VDT SD+ WVQCLPC QC+ Q P+Y+PA SS++ + C S C L + CS
Sbjct: 172 VVDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNGCSP 231
Query: 95 SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGL 154
+ C Y Y D T G T+ +T + + FGC H G F+ G++ L
Sbjct: 232 TTDECKYIVNYGDGKATTGTYVTDTLTM-SPTIVVKDFRFGCSHAVRGSFSNQNAGILAL 290
Query: 155 GRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED 214
G R SL Q G N FSYC +P + + S G E S + + +K
Sbjct: 291 GGGRGSLLEQTADAYG-NAFSYC-IPKPSSAGFLS---LGGPVEASLKFSYTPLIKNKHA 345
Query: 215 KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV 274
T+Y V LE I V + P ++GA+ +D+GA T LP Y L
Sbjct: 346 PTFYIVHLEAIIVAG--KQLAVPPTAFATGAV------MDSGAVVTQLPPQVYAALRAAF 397
Query: 275 RNAIKL-TPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGV 332
R+A+ P P CY + P ++ F GGA + L S + +G
Sbjct: 398 RSAMAAYGPLAAPVRNLDTCYDFTRFPDVKVPKVSLVFAGGATLDLEPASIIL----DGC 453
Query: 333 FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
FA P + VG GN Q + YD V F+ C
Sbjct: 454 LAFAATPGEESVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 124/369 (33%), Positives = 165/369 (44%), Gaps = 39/369 (10%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSSSYKELS 79
+G Y + +GTP D I DTGSDL W QC PCV+ CY Q + I+NP+ S+SY +S
Sbjct: 150 SGNYFVTVGLGTPKK-DFSLIFDTGSDLTWTQCEPCVKSCYNQKEAIFNPSQSTSYANIS 208
Query: 80 CQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
C S C L + + C+S C Y Y DSS + G E+++ + + F++ F
Sbjct: 209 CGSTLCDSLASATGNIFNCASST-CVYGIQYGDSSFSIGFFGKEKLSL-TATDVFNDFYF 266
Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL------VPFHTDSSIT 188
GCG NN G+ GL+GLGR +LSL SQ +Q FSYCL F T T
Sbjct: 267 GCGQNNKGL-FGGAAGLLGLGRDKLSLVSQT-AQRYNKIFSYCLPSSSSSTGFLTFGGST 324
Query: 189 SK-MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
SK F + +SGG ++Y + L GISVG KL S S
Sbjct: 325 SKSASFTPLATISGG------------SSFYGLDLTGISVGG----RKLAI---SPSVFS 365
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PIL 306
ID+G T LP Y+ L R + P C+ + I+ P +
Sbjct: 366 TAGTIIDSGTVITRLPPAAYSALSSTFRKLMSQYPAAPALSILDTCFDFSNHDTISVPKI 425
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
F GG V + T F + V FA DV IFGN Q L + YD +
Sbjct: 426 GLFFSGGVVVDIDKTGIFYVNDLTQVCLAFAGNSDASDVAIFGNVQQKTLEVVYDGAAGR 485
Query: 366 VSFKPTDCT 374
V F P C+
Sbjct: 486 VGFAPAGCS 494
>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
Length = 405
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 105/382 (27%), Positives = 177/382 (46%), Gaps = 57/382 (14%)
Query: 19 TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKEL 78
++ G YV F+IGTPP + +VD +L+W QC PC C++Q P+++P SS+++ L
Sbjct: 52 SSQGLYVANFTIGTPPQ-PVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGL 110
Query: 79 SCQSEQCHLLDTVS--CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
C S C + S C+S +C Y + T G+ T+ G + + + FGC
Sbjct: 111 PCGSHLCESIPESSRNCTS-DVCIYE-APTKAGDTGGMAGTDTFAIGAAK---ETLGFGC 165
Query: 137 GHNNTGVFNENEM-------GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
V + + G+VGLGRT SL ++Q+ FSYCL +
Sbjct: 166 -----VVMTDKRLKTIGGPSGIVGLGRTPWSL----VTQMNVTAFSYCLA-----GKSSG 211
Query: 190 KMYFGNGSEVSGGG-------VVSTSLVSKEDKT--YYFVTLEGISVGNLSNSSKLIPYY 240
++ G ++ GG V+ TS S ++ + YY V L GI G +
Sbjct: 212 ALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKAGGAP-----LQAA 266
Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
+SSG+ + +DT + + L Y L++ + A+ + P P LC+ + ++A
Sbjct: 267 SSSGS----TVLLDTVSRASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCF-SKAVA 321
Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP-----IDGDV---GIFGNFAQ 352
G AP L FDGGA + + + ++ G C + + G++ I G+ Q
Sbjct: 322 GDAPELVFTFDGGAAL-TVPPANYLLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQ 380
Query: 353 SDLFIGYDFDSQMVSFKPTDCT 374
++ + +D + +SFKP DC+
Sbjct: 381 ENVHVLFDLKEETLSFKPADCS 402
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 117/413 (28%), Positives = 175/413 (42%), Gaps = 50/413 (12%)
Query: 1 MSPATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC----LPC 56
++ T F+ + ++S G+Y++ + GTPP ++ I DTGSDL+W+QC P
Sbjct: 30 LATTTSFWAESPMESGAFLGLGQYLVSMAFGTPPQ-EVLLIADTGSDLIWLQCSTTAAPP 88
Query: 57 VQCYKQV---KPIYNPASSSSYKELSCQSEQCHLLDT-------VSCSSQQLCNYTYGYA 106
C K+ +P + + S++ + C + QC L+ S ++ C Y Y YA
Sbjct: 89 AFCPKKACSRRPAFVASKSATLSVVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYA 148
Query: 107 DSSLTKGVLATERITFGNSNN---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLAS 163
D S T G LA + T N + V FGCG N G G++GLG+ +LS +
Sbjct: 149 DGSSTTGFLARDTATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPA 208
Query: 164 QILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKE-DKTYYFVTL 222
Q S L A FSYCL+ S + G T LVS T+Y+V +
Sbjct: 209 QSGS-LFAQTFSYCLLDLEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGV 267
Query: 223 EGISVGNLSNSSKLIPYYNSSGAI---SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK 279
I VGN +++P S AI G ID+G+ T L Y L ++
Sbjct: 268 VAIRVGN-----RVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVH 322
Query: 280 LTPYQDPRL--------GSQLCYKTPSMAGIAPI------LTAHFDGGAKVPLIHTSTFI 325
L PR+ G +LCY S + AP LT F G + L T ++
Sbjct: 323 L-----PRIPSSATFFQGLELCYNVSSSSSSAPANGGFPRLTIDFAQGLSLEL-PTGNYL 376
Query: 326 PPPVEGVFCFAMQPIDGDVG--IFGNFAQSDLFIGYDFDSQMVSFKPTDCTKQ 376
+ V C A++P + GN Q + +D S + F T+C
Sbjct: 377 VDVADDVKCLAIRPTLSPFAFNVLGNLMQQGYHVEFDRASARIGFARTECVAH 429
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 106/376 (28%), Positives = 158/376 (42%), Gaps = 68/376 (18%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC--LPCVQCYKQVKPIYNPASSSSYKELSC 80
EY++ + GTPP ++ +DTGSD+ W QC P C+ Q P+++P++SSS+ L C
Sbjct: 87 EYLVHLAAGTPPQ-EVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPC 145
Query: 81 QSEQCHLLDTVSC-----SSQQLCNYTYGYADSSLTKGVLATERITFGN-----SNNFFD 130
S C T C ++ + CNY+ Y D S+++G + E TF + S+
Sbjct: 146 SSPACET--TPPCGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAVP 203
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
+VFGCGH N GVF NE G+ G GR LSL SQL FS+C S TS
Sbjct: 204 GLVFGCGHANRGVFTSNETGIAGFGRGSLSLP----SQLKVGNFSHCFTTI--TGSKTSA 257
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
+ G + G S S + + +Y S SNS I
Sbjct: 258 VLLG----LPGVAPPSASPLGRRRGSYRCR-----STPRSSNSGTSI------------- 295
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI--APILTA 308
T LP Y + E+ +KL C+ P P +
Sbjct: 296 ---------TSLPPRTYRAVREEFAAQVKLPVVPGNATDPFTCFSAPLRGPKPDVPTMAL 346
Query: 309 HFDGGA-KVPLIHTSTFIPPPVEG--------VFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
HF+G ++P ++ V+ + C A+ I+G I GN Q ++ + Y
Sbjct: 347 HFEGATMRLP---QENYVFEVVDDDDAGNSSRIICLAV--IEGGEIILGNIQQQNMHVLY 401
Query: 360 DFDSQMVSFKPTDCTK 375
D + +SF P C +
Sbjct: 402 DLQNSKLSFVPAQCDQ 417
>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
Length = 488
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 104/327 (31%), Positives = 147/327 (44%), Gaps = 30/327 (9%)
Query: 65 PIYNPASSSSYKELSCQSEQCHLLDTVSCSS-----QQLCNYTYGYADSSLTKGVLATER 119
P ++ ++SS+ SC S C L SC + Q C YTY Y D S+T G+L ++
Sbjct: 175 PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLLEVDK 234
Query: 120 ITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
TFG + V FGCG N GVF NE G+ G GR LSL SQL FS+C
Sbjct: 235 FTFGAGASV-PGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLP----SQLKVGNFSHCFT 289
Query: 180 PFHTDSSITSKM-----YFGNGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNS 233
+ T + + NG G V ST L+ + + T Y+++L+GI+VG
Sbjct: 290 AVNGLKQSTVLLDLLADLYKNGR----GAVQSTPLIQNSANPTLYYLSLKGITVG----- 340
Query: 234 SKLIPYYNSSGAISKGN--MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ 291
S +P S+ A++ G ID+G T LP Y + ++ IKL G
Sbjct: 341 STRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPY 400
Query: 292 LCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEG--VFCFAMQPIDGDVGIFG 348
C+ PS A P L HF+G F P G + C A+ + + G
Sbjct: 401 TCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSMICLAINELGDERATIG 460
Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDCTK 375
NF Q ++ + YD + M+SF C K
Sbjct: 461 NFQQQNMHVLYDLQNNMLSFVAAQCDK 487
Score = 41.6 bits (96), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 58/144 (40%), Gaps = 11/144 (7%)
Query: 224 GISVGNLSNSSKLIPYYNSSGAISKGN--MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLT 281
GI+VG S +P S+ A++ G ID+G T LP Y + ++ IKL
Sbjct: 41 GITVG-----STRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLP 95
Query: 282 PYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEG--VFCFAMQ 338
G C+ PS A P L HF+G F P G + C A+
Sbjct: 96 VVPGNATGPYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAIN 155
Query: 339 PIDGDVGIFGNFAQSDLFIGYDFD 362
D + I GNF Q ++ FD
Sbjct: 156 KGD-ETTIIGNFQQQNMHALPYFD 178
>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
Length = 367
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 66/157 (42%), Positives = 90/157 (57%), Gaps = 6/157 (3%)
Query: 12 VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
V ++ + A GEY++K IGTPP +DT SDL+W QC PC CY QV P++NP
Sbjct: 77 VAETPIMPAGGEYLVKLGIGTPPY-KFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRV 135
Query: 72 SSSYKELSCQSEQCHLLDTVSC--SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
SS+Y L C S+ C LD C + C YTY Y+ ++ T+G LA +++ G + F
Sbjct: 136 SSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVIG--EDAF 193
Query: 130 DNVVFGCGHNNT-GVFNENEMGLVGLGRTRLSLASQI 165
V FGC ++T G G+VGLGR LSL SQ+
Sbjct: 194 RGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSLVSQL 230
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 51/214 (23%), Positives = 79/214 (36%), Gaps = 31/214 (14%)
Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
H D Y +G+ + G + LV ED G++ G ++S+ P
Sbjct: 159 HDDDESCQYTYTYSGNATTEGTLAVDKLVIGED------AFRGVAFGCSTSSTGGAPPPQ 212
Query: 242 SSGAISKGN---------------MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP 286
+SG + G M ID + T L Y+ L + I+L
Sbjct: 213 ASGVVGLGRGPLSLVSQLSVRRYGMIIDIASTITFLEASLYDELVNDLEVEIRLPRGTGS 272
Query: 287 RLGSQLCYKTPSMAGIA------PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI 340
LG LC+ P G+A P + FDG + L F G+ C +
Sbjct: 273 SLGLDLCFILPD--GVAFDRVYVPAVALAFDG-RWLRLDKARLFAEDRESGMMCLMVGRA 329
Query: 341 D-GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ G V I GNF Q ++ + Y+ V+F + C
Sbjct: 330 EAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 363
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 111/377 (29%), Positives = 172/377 (45%), Gaps = 32/377 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPAS 71
++S +S +G Y +K +GTP IVDTGS L W+QC PCV C+ QV PI+ P++
Sbjct: 102 LKSGLSIGSGNYYVKIGLGTPAKY-FSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPST 160
Query: 72 SSSYKELSCQSEQCHL-----LDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNS 125
S +YK L C S QC L+ CS+ C Y Y D+S + G L+ + +T S
Sbjct: 161 SKTYKALPCSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPS 220
Query: 126 NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL---VPFH 182
V+GCG +N G+F + G++GL ++S+ Q+ + G N FSYCL
Sbjct: 221 EAPSSGFVYGCGQDNQGLFGRSS-GIIGLANDKISMLGQLSKKYG-NAFSYCLPSSFSAP 278
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG----NLSNSSKLIP 238
SS++ + G S S + + +++ + YF+ L I+V +S SS +P
Sbjct: 279 NSSSLSGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVP 338
Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYK-T 296
ID+G T LP YN L++ + Q P C+K +
Sbjct: 339 ------------TIIDSGTVITRLPVAVYNALKKSFVLIMSKKYAQAPGFSILDTCFKGS 386
Query: 297 PSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLF 356
P + F GGA + L ++ + +G C A+ + I GN+ Q
Sbjct: 387 VKEMSTVPEIQIIFRGGAGLELKAHNSLVEIE-KGTTCLAIAASSNPISIIGNYQQQTFK 445
Query: 357 IGYDFDSQMVSFKPTDC 373
+ YD + + F P C
Sbjct: 446 VAYDVANFKIGFAPGGC 462
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 106/366 (28%), Positives = 172/366 (46%), Gaps = 41/366 (11%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC--LPCVQCYKQVKPIYNPASSSSYKELS 79
G Y M+FS+GTPP + + DTGSDL+W +C C Q P Y P +SS++ +L
Sbjct: 89 GAYDMEFSMGTPPQ-KLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTFAKLP 147
Query: 80 CQSEQCHLL--DTVS--CSSQQLCNYTYGYA----DSSLTKGVLATERITFGNSNNFFDN 131
C C LL D+V+ ++ C+Y Y Y D T+G LA E T G + +
Sbjct: 148 CSDRLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTLG--ADAVPS 205
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
V FGC T G +G R L+ ++SQL A+ F YCL +D+S S +
Sbjct: 206 VRFGC---TTASEGGYGSGSGLVGLGRGPLS--LVSQLNASTFMYCLT---SDASKASPL 257
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
FG+ + ++G V ST L++ T+Y V L IS+G+ + + G +
Sbjct: 258 LFGSLASLTGAQVQSTGLLAS--TTFYAVNLRSISIGSAT----------TPGVGEPEGV 305
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA----PILT 307
D+G T L + Y+ + + L +D G + C++ P+ ++ P +
Sbjct: 306 VFDSGTTLTYLAEPAYSEAKAAFLSQTSLDQVEDTD-GFEACFQKPANGRLSNAAVPTMV 364
Query: 308 AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
HFDG + + ++ +GV C+ +Q + I GN Q + + +D ++S
Sbjct: 365 LHFDGADMA--LPVANYVVEVEDGVVCWIVQR-SPSLSIIGNIMQVNYLVLHDVHRSVLS 421
Query: 368 FKPTDC 373
F+P +C
Sbjct: 422 FQPANC 427
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 108/383 (28%), Positives = 174/383 (45%), Gaps = 44/383 (11%)
Query: 10 NNVVQSNVS-TANGEYVM-KFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIY 67
NN ++ VS + G +M SIG PP+ + ++DTGSD++WV C PC C + ++
Sbjct: 85 NNDYKARVSPSLTGRTIMANISIGQPPIPQLV-VMDTGSDILWVMCTPCTNCDNDLGLLF 143
Query: 68 NPASSSSYKELS---CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGN 124
+P+ SS++ L C E C D + +T YAD+S G + + F
Sbjct: 144 DPSKSSTFSPLCKTPCDFEGCR-CDPIP--------FTVTYADNSTASGTFGRDTVVFET 194
Query: 125 SN---NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
++ + +V+FGCGHN + G++GL SL +++ KFSYC+
Sbjct: 195 TDEGTSRISDVLFGCGHNIGHDTDPGHNGILGLNNGPDSLVTKL-----GQKFSYCIGNL 249
Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
++ G G+++ G S + +Y+VT+EGISVG + P
Sbjct: 250 ADPYYNYHQLILGEGADLEG-----YSTPFEVYNGFYYVTMEGISVG--EKRLDIAPETF 302
Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI-----KLTPYQDPRLGSQLCYKT 296
G + IDTG+ T L + L ++VRN + + T + P + Q Y +
Sbjct: 303 EMKENRAGGVIIDTGSTITFLVDSVHKLLSKEVRNLLGWSFRQATIEKSPWM--QCFYGS 360
Query: 297 PSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP-----IDGDVGIFGNF 350
S + P++T HF GA + L + +F + VFC + P I + G
Sbjct: 361 ISRDLVGFPVVTFHFSDGADLAL-DSGSFFNQLNDNVFCMTVGPVSSLNIKSKPSLIGLL 419
Query: 351 AQSDLFIGYDFDSQMVSFKPTDC 373
AQ +GYD +Q V F+ DC
Sbjct: 420 AQQSYNVGYDLVNQFVYFQRIDC 442
>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 413
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 109/370 (29%), Positives = 168/370 (45%), Gaps = 40/370 (10%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
YV F+IGTPP IVD +L+W QC C +C+KQ P++ P +SS++K C +
Sbjct: 62 YVANFTIGTPPQ-PASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTA 120
Query: 84 QCHLLDTVSCSSQQLCNYTYGYAD-SSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG 142
C + T SCS +C+Y T G AT+ G + + FGC +
Sbjct: 121 VCESIPTRSCSG-DVCSYKGPPTQLRGNTSGFAATDTFAIGTATV---RLAFGCVVASDI 176
Query: 143 VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGG 202
+ G +GLGRT SL ++Q+ +FSYCL P +T S S+++ G+ ++++GG
Sbjct: 177 DTMDGPSGFIGLGRTPWSL----VAQMKLTRFSYCLSPRNTGKS--SRLFLGSSAKLAGG 230
Query: 203 GVVSTS---LVSKEDKT--YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
ST+ S +D + YY ++L+ I GN + ++ A S G + + T +
Sbjct: 231 ESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIAT----------AQSGGILVMHTVS 280
Query: 258 PPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYKTPS--MAGIAPILTAHFDG 312
P +LL Y ++ V A+ P P LC+K + AP L F G
Sbjct: 281 PFSLLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQG 340
Query: 313 GAK--VP----LIHTSTFIPPPVEGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDFDSQ 364
A VP LI + A G V + G+ Q D+ YD +
Sbjct: 341 AAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKE 400
Query: 365 MVSFKPTDCT 374
+SF+P DC+
Sbjct: 401 TLSFEPADCS 410
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 124 bits (311), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 106/393 (26%), Positives = 165/393 (41%), Gaps = 47/393 (11%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-PIYNPAS 71
V S ST +G+Y + +GTPP + + DTGSDL+WV+C C C + +
Sbjct: 78 VVSGASTGSGQYFVDLRLGTPPQ-KLLLVADTGSDLVWVKCSACRNCTRHTPGSAFLARH 136
Query: 72 SSSYKELSCQSEQCHLL---DTVSCSSQQL---CNYTYGYADSSLTKGVLATERITFGNS 125
S+++ C C L+ C+ +L C Y Y Y D S T G + E T S
Sbjct: 137 STTFSPNHCYDSACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKTSGFFSKETTTLNTS 196
Query: 126 NNF---FDNVVFGC-----GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
+ + FGC G + +G G++GLGR +SL+SQ+ + G NKFSYC
Sbjct: 197 SGREAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFG-NKFSYC 255
Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSL----VSKEDKTYYFVTLEGISVGNLSNS 233
L+ S TS + G+ G ++ T+Y++ +E +SV +
Sbjct: 256 LMDHDISPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIK-- 313
Query: 234 SKLIPYYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS 290
+P S A+ + G +D+G T LP+ Y ++ ++ ++L +P G
Sbjct: 314 ---LPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEPTPGF 370
Query: 291 QLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPV-------EGVFCFAMQPIDG 342
LC + P L+ G S F PPP E V C A+Q +
Sbjct: 371 DLCVNVSEIEHPRLPKLSFKLGG--------DSVFSPPPRNYFVDTDEDVKCLALQAVMT 422
Query: 343 DVG--IFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
G + GN Q + +D D + F C
Sbjct: 423 PSGFSVIGNLMQQGFLLEFDKDRTRLGFSRHGC 455
>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
Length = 315
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 99/325 (30%), Positives = 151/325 (46%), Gaps = 33/325 (10%)
Query: 72 SSSYKELSCQSEQCHLLDTVSCSSQQL----CNYTYGYADSSLTKGVLATERITFGNSNN 127
SS++K ++C C VS S+ + C Y Y D S+T G + + TF + N
Sbjct: 2 SSTFKAVACPDPICRPSSGVSVSACAMENFQCFYLCSYGDRSITAGHIFKDTFTFMSPNG 61
Query: 128 F---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
+ FGCG NTG+F NE G+ G GR SL SQL +FSYCL
Sbjct: 62 VPVAVSELAFGCGDYNTGLFVSNESGIAGFGRGPQSLP----SQLKVGRFSYCLT--LVT 115
Query: 185 SSITSKMYFGNGSEVSG------GGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLI 237
S +S + G + G G ST ++ + T+Y+++LEGI+VG +
Sbjct: 116 ESKSSVVILGTPPDPDGLRAHTTGPFQSTPIIYNPLIPTFYYLSLEGITVGKTR-----L 170
Query: 238 PYYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD-PRLGSQLC 293
P+ S A+ K G ID+G T LP+ + L+E++ L Y + P +G +LC
Sbjct: 171 PFDKSVFALKKDGSGGTVIDSGTSLTTLPEAVFELLQEELVAQFPLPRYDNTPEVGDRLC 230
Query: 294 YKTPSMAGIAPI--LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNF 350
++ P P+ L H GA + L + F+ P GV C + D + + GNF
Sbjct: 231 FRRPKGGKQVPVPKLILHL-AGADMDLPRDNYFVEEPDSGVMCLQINGAEDTTMVLIGNF 289
Query: 351 AQSDLFIGYDFDSQMVSFKPTDCTK 375
Q ++ + YD ++ + F P C K
Sbjct: 290 QQQNMHVVYDVENNKLLFAPAQCDK 314
>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
Length = 508
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 108/365 (29%), Positives = 165/365 (45%), Gaps = 29/365 (7%)
Query: 18 STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV-----KPIYNPASS 72
+T G YV+ FS+GTPP + + G++D SD +W+QC C C P + S
Sbjct: 91 ATNTGMYVLSFSVGTPPQV-VTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLS 149
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQL-CNYTYGYAD--SSLTKGVLATERITFGNSNNFF 129
S+ +E+ C + C L +CS+ C Y+Y Y ++ T G+LA + F
Sbjct: 150 STIREVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRA-- 207
Query: 130 DNVVFGCGHNNTGVFNENEM-GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
D V+FGC V E ++ G++GLGR LSL SQ+ Q+G +FSY L P +
Sbjct: 208 DGVIFGC-----AVATEGDIGGVIGLGRGELSLVSQL--QIG--RFSYYLAP-DDAVDVG 257
Query: 189 SKMYFGNGSEVSGGGVVSTSLVS-KEDKTYYFVTLEGISVGNLSNSSKLIPYYN-SSGAI 246
S + F + ++ VST LV+ + ++ Y+V L GI V IP A
Sbjct: 258 SFILFLDDAKPRTSRAVSTPLVANRASRSLYYVELAGIRV---DGEDLAIPRGTFDLQAD 314
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA-GIAPI 305
G + + P T L Y + + + + I L LG LCY + S+A P
Sbjct: 315 GSGGVVLSITIPVTFLDAGAYKVVRQAMASKIGLRAADGSELGLDLCYTSESLATAKVPS 374
Query: 306 LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQSDLFIGYDFDSQ 364
+ F GGA + L + F G+ C + P GD + G+ Q + YD
Sbjct: 375 MALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTHMIYDISGS 434
Query: 365 MVSFK 369
+ F+
Sbjct: 435 RLVFE 439
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 118/374 (31%), Positives = 173/374 (46%), Gaps = 38/374 (10%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSSSYKELSCQ 81
EYV+ IGTP + + DTGSDL WVQC PC CY+Q +P+++P+ SS+Y ++ C
Sbjct: 125 EYVVTIGIGTPAR-NFTVLFDTGSDLTWVQCKPCTDSCYQQQEPLFDPSKSSTYVDVPCG 183
Query: 82 SEQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
+ QC + ++C C Y+ Y D S+T+G LA E T S VVFGC H
Sbjct: 184 TPQCKIGGGQDLTCGGTT-CEYSVKYGDQSVTRGNLAQEAFTLSPSAPPAAGVVFGCSHE 242
Query: 140 -NTGVFN-ENEM---GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
++GV E EM GL+GLGR S+ SQ + FSYCL P +S Y
Sbjct: 243 YSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYCLPP-----RGSSAGYLT 297
Query: 195 NGSEVSGGGVVS-TSLVSKEDK--TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
G+ +S T LV+ + + Y V L GISV S +P S+ I
Sbjct: 298 IGAAAPPQSNLSFTPLVTDNSQLSSVYVVNLVGISV-----SGAALPIDASAFYI---GT 349
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAI-KLTPYQDPRLGS-QLCYKTPSMAGI-APILTA 308
ID+G T +P Y L ++ R + T + + S CY + AP +
Sbjct: 350 VIDSGTVITHMPAAAYYVLRDEFRRHMGGYTMLPEGHVESLDTCYDVTGHDVVTAPPVAL 409
Query: 309 HFDGGAKVPLIHTSTFIPPPVEG------VFCFAMQPID--GDVGIFGNFAQSDLFIGYD 360
F GGA++ + + + V+ + C A P + G V I GN Q + +D
Sbjct: 410 EFGGGARIDVDASGILLVFAVDASGQSLTLACLAFVPTNLPGFV-IIGNMQQRAYNVVFD 468
Query: 361 FDSQMVSFKPTDCT 374
+ + + F C+
Sbjct: 469 VEGRRIGFGANGCS 482
>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
Length = 405
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 105/382 (27%), Positives = 175/382 (45%), Gaps = 57/382 (14%)
Query: 19 TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKEL 78
++ G YV F+IGTPP + +VD +L+W QC PC C++Q P+++P SS+++ L
Sbjct: 52 SSQGLYVANFTIGTPPQ-PVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGL 110
Query: 79 SCQSEQCHLLDTVS--CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
C S C + S C+S +C Y + T G T+ G + + + FGC
Sbjct: 111 PCGSHLCESIPESSRNCTS-DVCIYE-APTKAGDTGGKAGTDTFAIGAAK---ETLGFGC 165
Query: 137 GHNNTGVFNENEM-------GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
V + + G+VGLGRT SL ++Q+ FSYCL +
Sbjct: 166 -----VVMTDKRLKTIGGPSGIVGLGRTPWSL----VTQMNVTAFSYCLA-----GKSSG 211
Query: 190 KMYFGNGSEVSGGG-------VVSTSLVSKEDKT--YYFVTLEGISVGNLSNSSKLIPYY 240
++ G ++ GG V+ TS S ++ + YY V L GI G +
Sbjct: 212 ALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAP-----LQAA 266
Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
+SSG+ + +DT + + L Y L++ + A+ + P P LC+ ++A
Sbjct: 267 SSSGS----TVLLDTVSRASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFPK-AVA 321
Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP-----IDGDV---GIFGNFAQ 352
G AP L FDGGA + + + ++ G C + + G++ I G+ Q
Sbjct: 322 GDAPELVFTFDGGAAL-TVPPANYLLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQ 380
Query: 353 SDLFIGYDFDSQMVSFKPTDCT 374
++ + +D + +SFKP DC+
Sbjct: 381 ENVHVLFDLKEETLSFKPADCS 402
>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
distachyon]
Length = 836
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 116/361 (32%), Positives = 158/361 (43%), Gaps = 33/361 (9%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK--QVKPIYNPASSSSYKELSC 80
+YV+ S+GTP + VDTGSD+ WVQC PC Q +++PA SSSY + C
Sbjct: 499 QYVVTVSLGTPGVAQTV-EVDTGSDVSWVQCAPCAAPACYAQKDQLFDPAKSSSYSAVPC 557
Query: 81 QSEQCHLLDTVS--CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
++ C L T C++ C Y Y D S T GV ++ +T +++ +FGCGH
Sbjct: 558 AADACSELSTYGHGCAAGSQCGYVVSYGDGSNTTGVYGSDTLTLTDADAV-TGFLFGCGH 616
Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
G+F + GL+ LGR +SL SQ G FSYCL P S +S + G
Sbjct: 617 AQAGLFAGID-GLLALGRKGMSLTSQTSGAYGGGVFSYCLPP-----SPSSTGFLTLGGP 670
Query: 199 VSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
S G +T L++ D T+Y V L GI VG S +P + G +DTG
Sbjct: 671 SSASGFATTGLLTAWDVPTFYMVMLTGIGVGGQQLSG--VP-----ASAFAGGTVVDTGT 723
Query: 258 PPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGIA-PILTAHFDG 312
T LP Y + + PY P + CY + P ++ F G
Sbjct: 724 VITRLPPTAYA--ALRAAFRAAMAPYGYPAAPATGILDTCYNFTDYGTVTLPTVSLTFSG 781
Query: 313 GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
GA + L F+ G FA DGD I GN Q + FD V F P
Sbjct: 782 GATLKL-DAPGFL---SSGCLAFATNSGDGDPAILGNVQQRSFAV--RFDGSSVGFMPHS 835
Query: 373 C 373
C
Sbjct: 836 C 836
>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 486
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 107/364 (29%), Positives = 162/364 (44%), Gaps = 27/364 (7%)
Query: 18 STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV-----KPIYNPASS 72
+T G YV+ FS+GTPP + + G++D SD +W+QC C C P + S
Sbjct: 91 ATNTGMYVLSFSVGTPPQV-VTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLS 149
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQL-CNYTYGYAD--SSLTKGVLATERITFGNSNNFF 129
S+ +E+ C + C L +CS+ C Y+Y Y ++ T G+LA + F
Sbjct: 150 STIREVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRA-- 207
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
D V+FGC G G++GLGR LS SQ+ Q+G +FSY L P + S
Sbjct: 208 DGVIFGCAVATEGDIG----GVIGLGRGELSPVSQL--QIG--RFSYYLAP-DDAVDVGS 258
Query: 190 KMYFGNGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYN-SSGAIS 247
+ F + ++ VST LV S+ ++ Y+V L GI V IP A
Sbjct: 259 FILFLDDAKPRTSRAVSTPLVASRASRSLYYVELAGIRV---DGEDLAIPRGTFDLQADG 315
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA-GIAPIL 306
G + + P T L Y + + + + I+L LG LCY + S+A P +
Sbjct: 316 SGGVVLSITIPVTFLDAGAYKVVRQAMASKIELRAADGSELGLDLCYTSESLATAKVPSM 375
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQSDLFIGYDFDSQM 365
F GGA + L + F G+ C + P GD + G+ Q + YD
Sbjct: 376 ALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTHMIYDISGSR 435
Query: 366 VSFK 369
+ F+
Sbjct: 436 LVFE 439
>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
Length = 280
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 75/165 (45%), Positives = 92/165 (55%), Gaps = 9/165 (5%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
S S +GEY + IG PP Y ++DTGSD+ WVQC PC CY+Q PI+ P +S+S
Sbjct: 123 SGTSQGSGEYFSRIGIGEPPS-QAYMVLDTGSDISWVQCAPCADCYRQADPIFEPTASAS 181
Query: 75 YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
Y LSC++ QC LD C + C Y Y D S T G TE +T G N NV
Sbjct: 182 YAPLSCEAAQCRYLDQSQCRNGN-CLYQVSYGDGSYTVGDFVTETVTIG--VNKVKNVAL 238
Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
GCGHNN G+F GL+GLG LS +QL + FSYCLV
Sbjct: 239 GCGHNNEGLF-VGAAGLIGLGGGPLSFP----AQLNSTSFSYCLV 278
>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 127/365 (34%), Positives = 177/365 (48%), Gaps = 31/365 (8%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASSSSYKELS 79
+G Y++ +GTP D+ I DTGSD+ W QC PC + CYKQ + I++P+ S+SY +S
Sbjct: 146 SGNYIVTVGLGTPKK-DLSLIFDTGSDITWTQCQPCARSCYKQKEQIFDPSQSTSYTNIS 204
Query: 80 CQSEQCHLL-----DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
C S C+ L +T C+S C Y Y DSS + G TE++T S + F+N+ F
Sbjct: 205 CSSSICNSLTSATGNTPGCASSA-CVYGIQYGDSSFSVGFFGTEKLTL-TSTDAFNNIYF 262
Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSITSKMYF 193
GCG NN GL+GLGR +LS+ SQ + NK FSYCL + SS T + F
Sbjct: 263 GCGQNNQ-GLFGGSAGLLGLGRDKLSVVSQTAQKY--NKIFSYCL---PSSSSSTGFLTF 316
Query: 194 GNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAISKGNM 251
G GS S +S ++Y + GISVG L+ S+ + ++++GAI
Sbjct: 317 G-GSASKNAKFTPLSTIS-AGPSFYGLDFTGISVGGKKLAISASV---FSTAGAI----- 366
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHF 310
ID+G T LP Y+ L RN + P CY S I+ P + F
Sbjct: 367 -IDSGTVITRLPPAAYSALRASFRNLMSKYPMTKALSILDTCYDFSSYTTISVPKIGFSF 425
Query: 311 DGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
G +V + T + V FA DV IFGN Q L + YD + V F
Sbjct: 426 SSGIEVDIDATGILYASSLSQVCLAFAGNSDATDVFIFGNVQQKTLEVFYDGSAGKVGFA 485
Query: 370 PTDCT 374
P C+
Sbjct: 486 PGGCS 490
>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 397
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 98/370 (26%), Positives = 174/370 (47%), Gaps = 50/370 (13%)
Query: 25 VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQ 84
V F+IGTPP +D +L+W QC C+ C+KQ P++ P +SS++K C ++
Sbjct: 55 VANFTIGTPPQA-ASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDV 113
Query: 85 CHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVF 144
C + T C+S +C Y T G++AT+ G + ++ FGC +
Sbjct: 114 CKSIPTPKCAS-DVCAYDGVTGLGGHTVGIVATDTFAIGTAAP--ASLGFGCVVASDIDT 170
Query: 145 NENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGV 204
G +GLGRT SL ++Q+ +FSYCL P D+ S+++ G ++++GGG
Sbjct: 171 MGGPSGFIGLGRTPWSL----VAQMKLTRFSYCLAPH--DTGKNSRLFLGASAKLAGGGA 224
Query: 205 VSTSLVSKED---KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTL 261
+ + + + YY + LE I G +++ +P ++ + + + +L
Sbjct: 225 WTPFVKTSPNDGMSQYYPIELEEIKAG---DATITMPRGRNTVLVQTAVVRV------SL 275
Query: 262 LPKDFYNRLEEQVRNAIKLTPYQDPRLGS--QLCYKTPSMAGIAPILTAHFDGGAKVPLI 319
L Y ++ V ++ P P +G+ ++C+ ++G AP L F GA + +
Sbjct: 276 LVDSVYQEFKKAVMASVGAAPTATP-VGAPFEVCFPKAGVSG-APDLVFTFQAGAALTV- 332
Query: 320 HTSTFIPPPVEGVF-------CFAMQPI--------DGDVGIFGNFAQSDLFIGYDFDSQ 364
PP +F C ++ I DG + I G+F Q ++ + +D D
Sbjct: 333 -------PPANYLFDVGNDTVCLSVMSIALLNITALDG-LNILGSFQQENVHLLFDLDKD 384
Query: 365 MVSFKPTDCT 374
M+SF+P DC+
Sbjct: 385 MLSFEPADCS 394
>gi|125606590|gb|EAZ45626.1| hypothetical protein OsJ_30294 [Oryza sativa Japonica Group]
Length = 431
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 112/400 (28%), Positives = 177/400 (44%), Gaps = 70/400 (17%)
Query: 8 YPNNVVQSNVSTANG-------------EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCL 54
YPN QSN ST G E + IGTP + ++ + DT SDL+W QC
Sbjct: 60 YPNEGDQSN-STRRGLSSTPGGVQEKHVEPHVFLGIGTPAM-NVTLVFDTTSDLLWTQCQ 117
Query: 55 PCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGV 114
PC+ C Q +Y+P + +Y L+ S Y Y Y+ S T G
Sbjct: 118 PCLSCVAQAGDMYDPNKTETYANLTSSS------------------YNYTYSKQSFTSGY 159
Query: 115 LATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKF 174
ATE GN N+ FGCG N G + + + V +L+QLG ++F
Sbjct: 160 FATETFALGNVT--VANITFGCGTRNQGYY--DNVAGVFGVGRGGRGGVSLLNQLGIDRF 215
Query: 175 SYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTS-----LVSKEDKTYYFVTLEGISVGN 229
SYC + + +S ++ G E++ + + + K+ YFV L G++VG
Sbjct: 216 SYCFS--SSGAPGSSAVFLGGSPELATNATTTPAASTPMVADPVLKSGYFVKLVGVTVG- 272
Query: 230 LSNSSKLIPYYNSSGAISKGN-MFIDTGAPPTLLPKDFYNRLEEQVRNAI--KLTPYQDP 286
+ L+ +S A G + ID+ +P T+L + Y VR A+ +L P ++
Sbjct: 273 ----ATLVDVAGASSAEGGGRALVIDSTSPVTVLDEATYG----PVRRALVAQLAPLKEA 324
Query: 287 R------LGSQLCYKTPSMAGIAP-----ILTAHFDGGAKVPLIHTSTFIPP-PVEGVFC 334
+G LC++ + G P +T HFDGGA ++ ++++ G+ C
Sbjct: 325 NANASAGVGLDLCFEL-AAGGATPTPPNVTMTLHFDGGAADLVLPPASYLAKDSAGGLIC 383
Query: 335 FAMQPIDGD-VGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
M P + V + G++A D + YD +VSF+P DC
Sbjct: 384 LTMTPSSSNGVPVLGSWALLDTLVLYDLAKNVVSFQPLDC 423
>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
Length = 397
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 107/372 (28%), Positives = 169/372 (45%), Gaps = 43/372 (11%)
Query: 25 VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQ 84
V F+IGTPP I+D +L+W QC C +C+KQ P++ P +SS+++ C ++
Sbjct: 44 VANFTIGTPPQ-PASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDA 102
Query: 85 CHLLDTVSCSSQQLCNY---TYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
C T +CS +C Y T D T G++ TE G + ++ FGC +
Sbjct: 103 CKSTPTSNCSG-DVCTYESTTNIRLDRHTTLGIVGTETFAIGTATA---SLAFGCVVASD 158
Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
+ G +GLGRT SL ++Q+ KFSYCL P T S S+++ G+ ++++G
Sbjct: 159 IDTMDGTSGFIGLGRTPRSL----VAQMKLTKFSYCLSPRGTGKS--SRLFLGSSAKLAG 212
Query: 202 GGVVSTS---LVSKEDKT--YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
G ST+ S +D + YY ++L+ I GN + ++ A S G + + T
Sbjct: 213 GESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIAT----------AQSGGILVMHTV 262
Query: 257 APPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYKTPS--MAGIAPILTAHFD 311
+P +LL Y ++ V A+ P P LC+K + AP L F
Sbjct: 263 SPFSLLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQ 322
Query: 312 GGAK---VP----LIHTSTFIPPPVEGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDFD 362
GG VP LI + A G V + G+ Q ++ YD
Sbjct: 323 GGGAALTVPPAKYLIDVGEEKDTACAAILSMARLNRTGLEGVSVLGSLQQENVHFLYDLK 382
Query: 363 SQMVSFKPTDCT 374
+ +SF+P DC+
Sbjct: 383 KETLSFEPADCS 394
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 107/363 (29%), Positives = 165/363 (45%), Gaps = 33/363 (9%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSC 80
EYV +GTP + I+DTGS L WVQC PC QCY Q P+++P +SSSY + C
Sbjct: 128 EYVATVGLGTPAVPQTL-ILDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTSSSYSPVPC 186
Query: 81 QSEQCHLL----DTVSCSS--QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
S++C L D C+S C Y Y + G +T+ +T G F
Sbjct: 187 DSQECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALTLG-PGAIVKRFHF 245
Query: 135 GCGHN-NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
GCGH+ G F+ + G++GLGR SLA Q ++ G FS+CL P + S +
Sbjct: 246 GCGHHQQRGKFDMAD-GVLGLGRLPQSLAWQASARRGGGVFSHCLPP-----TGVSTGFL 299
Query: 194 GNGSEVSGGGVVSTSLVSKEDKTYYFVTL-EGISV-GNLSNSSKLIPYYNSSGAISKGNM 251
G+ V T L++ +D+ +++ + ISV G L + IP A+ + +
Sbjct: 300 ALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLD----IP-----PAVFREGV 350
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHF 310
D+G + L + Y L R+A+ P P C+ + P ++ F
Sbjct: 351 ITDSGTVLSALQETAYTALRTAFRSAMAEYPLAPPVGHLDTCFNFTGYDNVTVPTVSLTF 410
Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
GGA V L +S + ++G F D G+ G+ +Q + + YD + V F+
Sbjct: 411 RGGATVHLDASSGVL---MDGCLAF-WSSGDEYTGLIGSVSQRTIEVLYDMPGRKVGFRT 466
Query: 371 TDC 373
C
Sbjct: 467 GAC 469
>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 491
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 107/366 (29%), Positives = 150/366 (40%), Gaps = 31/366 (8%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELS 79
G + P +L +DT D+ W+QCLPC+ QCY Q ++P SS+ +
Sbjct: 143 GAVIDGDDDDDPMILSQTMAIDTTEDVPWIQCLPCLIPQCYPQRNAFFDPRRSSTGAPVR 202
Query: 80 CQSEQCHLLDTVS--CS---SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
C S C L + CS S C Y Y+D LT G T+ +T S F N F
Sbjct: 203 CGSRACRTLGGYANGCSKPNSTGDCLYRIEYSDHRLTLGTYMTDTLTISPSTTFL-NFRF 261
Query: 135 GCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
GC H G F+ G + LG SL SQ G N FSYC VP + + S
Sbjct: 262 GCSHAVRGKFSAQASGTMSLGGGPQSLLSQTARAYG-NAFSYC-VPGPSAAGFLSIGGPV 319
Query: 195 NGSEVSGGGVVSTSLVSKE----DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
NG + G G +T+ + + + T Y V L+GI V N + G
Sbjct: 320 NGDDGGGSGAFATTPLVRSANVINPTIYVVRLQGIEVAGRR--------LNVPPVVFSGG 371
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAH 309
+D+ A T LP Y L RNA++ + P C+ ++ + P ++
Sbjct: 372 TVMDSSAVITQLPPTAYRALRLAFRNAMRAYKTRAPTGNLDTCFDFVGVSKVTVPTVSLV 431
Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD--VGIFGNFAQSDLFIGYDFDSQMVS 367
FDGGA + L S + C A P+ D +G GN Q + YD V
Sbjct: 432 FDGGAVIELGLLSVLLDS------CLAFAPMAADFALGFIGNVQQQTHEVLYDVAGGAVG 485
Query: 368 FKPTDC 373
F+ C
Sbjct: 486 FRHGAC 491
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 108/361 (29%), Positives = 158/361 (43%), Gaps = 38/361 (10%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
YV+ SIGTP + ++DTGSD+ WV C + ++P SS+Y SC S
Sbjct: 125 YVITVSIGTPAMTQAV-MIDTGSDVSWVHCHARAGAGSSL--FFDPGKSSTYTPFSCSSA 181
Query: 84 QCHLLDTV--SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN- 140
C L+ CS C YT Y D S T G ++ + NS +N FGC +
Sbjct: 182 ACTRLEGRDNGCSLNSTCQYTVRYGDGSNTTGTYGSDTLAL-NSTEKVENFQFGCSETSD 240
Query: 141 --TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
G+ + GL+GLG SL SQ + G + FSYCL P T SS + G+
Sbjct: 241 PGEGLDEDQTDGLMGLGGGAPSLVSQTAATYG-SAFSYCL-PATTRSS----GFLTLGAS 294
Query: 199 VSGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
G V+T + S+ T+YFV L+GI+VG + + P ++G+I +D+G
Sbjct: 295 TGTSGFVTTPMFRSRRAPTFYFVILQGINVGG--DPVAISPTVFAAGSI------MDSGT 346
Query: 258 PPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKV 316
T LP Y+ L R ++ P C+ ++ P + F GGA V
Sbjct: 347 IITRLPPRAYSALSAAFRAGMRRYPRARAFSILDTCFDFTGQDNVSIPAVELVFSGGAVV 406
Query: 317 PLIHTSTFIPPPVEGVF---CFAMQPIDGDVG-IFGNFAQSDLFIGYDFDSQMVSFKPTD 372
L +G+ C A P G +G I GN Q + +D ++ F+P
Sbjct: 407 DL---------DADGIMYGSCLAFAPATGGIGSIIGNVQQRTFEVLHDVGQSVLGFRPGA 457
Query: 373 C 373
C
Sbjct: 458 C 458
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 113/361 (31%), Positives = 156/361 (43%), Gaps = 36/361 (9%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EYV+ SIG+P + +DTGSD+ W++C +Y+P +SS+Y SC +
Sbjct: 130 EYVITVSIGSPAVAXTM-FIDTGSDVSWLRC---------KSRLYDPGTSSTYAPFSCSA 179
Query: 83 EQCHLLDT--VSCSSQQLCNYTYGYADSSLTKGVLATERITF-GNSNNFFDNVVFGCGHN 139
C L CSS C Y+ Y D S T G ++ +T G S FGC
Sbjct: 180 PACAQLGRRGTGCSSGSTCVYSVKYGDGSNTTGTYGSDTLTLAGTSEPLISGFQFGCSAV 239
Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
G +N GL+GLG S SQ + G + FSYCL P S + G S
Sbjct: 240 EHGFEEDNTDGLMGLGGDAQSFVSQTAATYG-SAFSYCLPPTWNSSGF---LTLGAPSSS 295
Query: 200 SGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
+ +T ++ SK+ T+Y + L GISVG + IP S S G++ +D+G
Sbjct: 296 TSAAFSTTPMLRSKQAATFYGLLLRGISVGG---KTLEIP----SSVFSAGSI-VDSGTV 347
Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQ--DPRLGSQLCYK-TPSMAG---IAPILTAHFDG 312
T LP Y L R+ + YQ PR C+ T G P + DG
Sbjct: 348 ITRLPPTAYGALSAAFRDGMARYQYQPAAPRGLLDTCFDFTGHGEGNNFTVPSVALVLDG 407
Query: 313 GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
GA V L H + + +G FA DG GI GN Q + YD + F+P
Sbjct: 408 GAVVDL-HPNGIVQ---DGCLAFAATDDDGRTGIIGNVQQRTFEVLYDVGQSVFGFRPGA 463
Query: 373 C 373
C
Sbjct: 464 C 464
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 113/378 (29%), Positives = 174/378 (46%), Gaps = 46/378 (12%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
G Y + +GTPP Y VDTGSD+ WV C+PC C + I++P S+S
Sbjct: 46 GLYYTRIYLGTPPQ-QFYVHVDTGSDVAWVNCVPCTNCKRASNVALPISIFDPEKSTSKT 104
Query: 77 ELSCQSEQCHLLDTVSCSSQQL-CNYTYGYADSSLTKGVLATERITF-----GNS--NNF 128
+SC E+C+L CS + C Y+ Y D S T G L + ++F GNS +
Sbjct: 105 SISCTDEECYLASNSKCSFNSMSCPYSTLYGDGSSTAGYLINDVLSFNQVPSGNSTATSG 164
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSI 187
+ FGCG N TG + + GLVG G+ +SL SQ+ Q + N F++CL D+
Sbjct: 165 TARLTFGCGSNQTGTWLTD--GLVGFGQAEVSLPSQLSKQNVSVNIFAHCL---QGDNKG 219
Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAI 246
+ + G+ E G+V T +V K+ ++Y V L I V G + NS G I
Sbjct: 220 SGTLVIGHIRE---PGLVYTPIVPKQ--SHYNVELLNIGVSGTNVTTPTAFDLSNSGGVI 274
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPIL 306
+D+G T L + Y++ + +VR+ ++ P C ++ G P +
Sbjct: 275 ------MDSGTTLTYLVQPAYDQFQAKVRDCMRSGVL--PVAFQFFC----TIEGYFPNV 322
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVE---GVFCFAMQPIDGDVG-----IFGNFAQSDLFIG 358
T +F GGA + L +S + +CF+ G IFG+ D +
Sbjct: 323 TLYFAGGAAMLLSPSSYLYKEMLTTGLSAYCFSWLESTSVYGYLSYTIFGDNVLKDQLVV 382
Query: 359 YDFDSQMVSFKPTDCTKQ 376
YD + + +K DCTK+
Sbjct: 383 YDNVNNRIGWKNFDCTKE 400
>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
Length = 396
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 107/370 (28%), Positives = 165/370 (44%), Gaps = 40/370 (10%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
YV F+IGTPP IVD +L+W QC C +C+KQ P++ P +SS++K C +
Sbjct: 45 YVANFTIGTPPQ-PASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTA 103
Query: 84 QCHLLDTVSCSSQQLCNYTYGYAD-SSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG 142
C + T SCS +C+Y T G AT+ G + + FGC +
Sbjct: 104 VCESIPTRSCSG-DVCSYKGPPTQLRGNTSGFAATDTFAIGTATV---RLAFGCVVASDI 159
Query: 143 VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGG 202
+ G +GLGRT SL ++Q+ +FSYCL P +T S S+++ G+ ++++G
Sbjct: 160 DTMDGPSGFIGLGRTPWSL----VAQMKLTRFSYCLSPRNTGKS--SRLFLGSSAKLAGS 213
Query: 203 GVVSTSLVSK-----EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
ST+ K + YY ++L+ I GN + ++ A S G + + T +
Sbjct: 214 ESTSTAPFIKTSPDDDGSNYYLLSLDAIRAGNTTIAT----------AQSGGILVMHTVS 263
Query: 258 PPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYKTPS--MAGIAPILTAHFDG 312
P +LL Y ++ V A+ P P LC+K + AP L F G
Sbjct: 264 PFSLLVDSAYKAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQG 323
Query: 313 GAK--VP----LIHTSTFIPPPVEGVFCFAMQPIDG--DVGIFGNFAQSDLFIGYDFDSQ 364
A VP LI + A G V + G+ Q D+ YD +
Sbjct: 324 AAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKE 383
Query: 365 MVSFKPTDCT 374
+SF+P DC+
Sbjct: 384 TLSFEPADCS 393
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 117/360 (32%), Positives = 169/360 (46%), Gaps = 27/360 (7%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSCQ 81
E+V+ G+P + DTGSDL W+QC PC CYKQ P+++PA SSSY + C
Sbjct: 111 EFVVVVGFGSPAQTSAT-MFDTGSDLSWIQCQPCSGHCYKQHDPVFDPAKSSSYAVVPCG 169
Query: 82 SEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
+ +C C+ C Y Y D S T GVLA E +TF +S+ F +FGCG N
Sbjct: 170 TTECAAAGG-ECNGTT-CVYGVEYGDGSSTTGVLARETLTFSSSSE-FTGFIFGCGETNL 226
Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
G F E + L+GLGR LSL+SQ G FSYCL ++ T+ Y G+
Sbjct: 227 GDFGEVDG-LLGLGRGSLSLSSQAAPAFG-GIFSYCLPSYN-----TTPGYLSIGATPVT 279
Query: 202 GG--VVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
G V T++V+K D ++YF+ L I++G ++P S +K +D+G
Sbjct: 280 GQIPVQYTAMVNKPDYPSFYFIELVSINIGGY-----VLPVPPSE--FTKTGTLLDSGTI 332
Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVP 317
T LP Y L ++ + ++ + P CY +GI P ++ +F GA
Sbjct: 333 LTYLPPPAYTALRDRFKFTMQGSKPAPPYDELDTCYDFTGQSGILIPGVSFNFSDGAVFN 392
Query: 318 L----IHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
L I T P G F +P D + G+ Q + YD +Q + F P C
Sbjct: 393 LNFFGIMTFPDDTKPAVGCLAFVSRPADMPFSVVGSTTQRSAEVIYDVPAQKIGFIPASC 452
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 101/361 (27%), Positives = 152/361 (42%), Gaps = 25/361 (6%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ---CYKQVKPIYNPASSSSYKELS 79
EYV+ +G+P + ++DTGSD+ WVQC PC C+ +++PA+SS+Y +
Sbjct: 134 EYVISVGLGSPAMTQRV-VIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFN 192
Query: 80 CQSEQCHLL----DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
C + C L + C ++ C Y Y D S T G +++ +T S + FG
Sbjct: 193 CSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTLSGS-DVVRGFQFG 251
Query: 136 CGHNNTGV-FNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
C H G ++ GL+GLG SL SQ ++ G FSYCL S +
Sbjct: 252 CSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYG-KSFSYCLPATPASSGFLTLGAPA 310
Query: 195 NGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
+G +T ++ SK+ TYYF LE I+VG L P ++G++ +
Sbjct: 311 SGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGG--KKLGLSPSVFAAGSL------V 362
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDG 312
D+G T LP Y L R + +P C+ + ++ P + F G
Sbjct: 363 DSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPTVALVFAG 422
Query: 313 GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
GA V L G FA D G GN Q + YD + F+
Sbjct: 423 GAVVDLDAHGIV----SGGCLAFAPTRDDKAFGTIGNVQQRTFEVLYDVGGGVFGFRAGA 478
Query: 373 C 373
C
Sbjct: 479 C 479
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 115/378 (30%), Positives = 174/378 (46%), Gaps = 34/378 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPAS 71
++S +S +G Y +K +GTP IVDTGS L W+QC PCV C+ QV PI+ P+
Sbjct: 96 LKSGLSIGSGNYYVKIGVGTPAKY-FSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSV 154
Query: 72 SSSYKELSCQSEQCHL-----LDTVSCSSQQ-LCNYTYGYADSSLTKGVLATERITFGNS 125
S +YK LSC S QC L+ CS+ C Y Y D+S + G L+ + +T S
Sbjct: 155 SKTYKALSCSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPS 214
Query: 126 NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL---VPFH 182
V+GCG +N G+F + G++GL +LS+ Q+ ++ G N FSYCL
Sbjct: 215 AAPSSGFVYGCGQDNQGLFGRSA-GIIGLANDKLSMLGQLSNKYG-NAFSYCLPSSFSAQ 272
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG----NLSNSSKLIP 238
+SS++ + G S S + + + + + YF+ L I+V +S SS +P
Sbjct: 273 PNSSVSGFLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSYNVP 332
Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYK-T 296
ID+G T LP YN L++ + Q P C+K +
Sbjct: 333 ------------TIIDSGTVITRLPVAIYNALKKSFVMIMSKKYAQAPGFSILDTCFKGS 380
Query: 297 PSMAGIAPILTAHFDGGAKVPL-IHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDL 355
P + F GGA + L +H S + +G C A+ + I GN+ Q
Sbjct: 381 VKEMSTVPEIRIIFRGGAGLELKVHNS--LVEIEKGTTCLAIAASSNPISIIGNYQQQTF 438
Query: 356 FIGYDFDSQMVSFKPTDC 373
+ YD + + F P C
Sbjct: 439 TVAYDVANSKIGFAPGGC 456
>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
Length = 367
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 98/372 (26%), Positives = 176/372 (47%), Gaps = 54/372 (14%)
Query: 25 VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQ 84
V F+IGTPP +D +L+W QC C+ C+KQ P++ P +SS++K C ++
Sbjct: 25 VANFTIGTPPQA-ASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDV 83
Query: 85 CHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVF 144
C + T C+S +C + T G++AT+ G + ++ FGC +
Sbjct: 84 CKSIPTPKCAS-DVCAFDGVTGLGGHTVGIVATDTFAIGTAAP--ASLGFGCVVASDIDT 140
Query: 145 NENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGV 204
G +GLGRT SL ++Q+ +FSYCL P D+ S+++ G ++++GGG
Sbjct: 141 MGGPSGFIGLGRTPWSL----VAQMKLTRFSYCLAPH--DTGKNSRLFLGASAKLAGGGA 194
Query: 205 VSTSLVSKED---KTYYFVTLEGISVGNLSNSSKLIPYYNSS----GAISKGNMFIDTGA 257
+ + + + YY + LE I G +++ +P ++ A+ + ++ +D+
Sbjct: 195 WTPFVKTSPNDGMSQYYPIELEEIKAG---DATITMPRGRNTVLVQTAVVRVSLLVDS-- 249
Query: 258 PPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVP 317
+ ++F + V A TP +P ++C+ ++G AP L F GA +
Sbjct: 250 ----VYQEFKKAVMASVGAAPTATPVGEPF---EVCFPKAGVSG-APDLVFTFQAGAALT 301
Query: 318 LIHTSTFIPPPVEGVF-------CFAMQPI--------DGDVGIFGNFAQSDLFIGYDFD 362
+ PP +F C ++ I DG + I G+F Q ++ + +D D
Sbjct: 302 V--------PPANYLFDVGNDTVCLSVMSIALLNITALDG-LNILGSFQQENVHLLFDLD 352
Query: 363 SQMVSFKPTDCT 374
M+SF+P DC+
Sbjct: 353 KDMLSFEPADCS 364
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 110/362 (30%), Positives = 159/362 (43%), Gaps = 27/362 (7%)
Query: 23 EYVMKFSIGTPPLLDIYGI-VDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSC 80
E+V+ G+P Y + +DTGSD+ W+QCLPC CYKQ P+++P S++Y + C
Sbjct: 160 EFVVTVGFGSP--AQNYTLSIDTGSDVSWIQCLPCSGHCYKQHDPVFDPTKSATYSAVPC 217
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
QC CS+ C Y Y D S T GVL+ E ++ ++ + FGCG N
Sbjct: 218 GHPQCAAAGG-KCSNSGTCLYKVTYGDGSSTAGVLSHETLSLSSTRD-LPGFAFGCGQTN 275
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
G F + LVGLGR LSL SQ + GA FSYCL + T + +
Sbjct: 276 LGEFGGVDG-LVGLGRGALSLPSQAAATFGAT-FSYCLPSYDTTHGYLTMGSTTPAASND 333
Query: 201 GGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPP 259
V T+++ KED + YFV + I +G ++P + ++ D+G
Sbjct: 334 DDDVQYTAMIQKEDYPSLYFVEVVSIDIGGY-----ILPVPPT--VFTRDGTLFDSGTIL 386
Query: 260 TLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYK-TPSMAGIAPILTAHFDGGAK 315
T LP + Y L ++ + + K P DP CY T A P + F GA
Sbjct: 387 TYLPPEAYASLRDRFKFTMTQYKPAPAYDPF---DTCYDFTGHNAIFMPAVAFKFSDGAV 443
Query: 316 VPLIHTSTFIPP----PVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
L + I P P G F +P I GN Q + YD ++ + F
Sbjct: 444 FDLSPVAILIYPDDTAPATGCLAFVPRPSTMPFNIIGNTQQRGTEVIYDVAAEKIGFGQF 503
Query: 372 DC 373
C
Sbjct: 504 TC 505
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 124/381 (32%), Positives = 160/381 (41%), Gaps = 40/381 (10%)
Query: 3 PATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV---QC 59
PA++ Y ++ T N YV+ S+GTP + VDTGSDL WVQC PC C
Sbjct: 36 PASWGY-------DIGTLN--YVVTASLGTPGVAQTM-EVDTGSDLSWVQCKPCAAAPSC 85
Query: 60 YKQVKPIYNPASSSSYKELSCQSEQCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLAT 117
Y Q P+++PA SSSY + C C L S S C Y Y D S T GV ++
Sbjct: 86 YSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSS 145
Query: 118 ERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
+ +T ++++ FGCGH +G+FN + GL+GLGR + SL Q G FSYC
Sbjct: 146 DTLTL-SASSAVQGFFFGCGHAQSGLFNGVD-GLLGLGREQPSLVEQTAGTYG-GVFSYC 202
Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSK 235
L T S + G G ST+ L S TYY V L GISVG S
Sbjct: 203 L---PTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS-- 257
Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--C 293
+P +G + T PPT Y L R+ + Y L C
Sbjct: 258 -VPASAFAGGTVVDTGTVVTRLPPTA-----YAALRSAFRSGMASYGYPTAPSNGILDTC 311
Query: 294 YKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQ 352
Y + P + F GA V L G FA DG + I GN Q
Sbjct: 312 YNFAGYGTVTLPNVALTFGSGATVTLGADGIL----SFGCLAFAPSGSDGGMAILGNVQQ 367
Query: 353 SDLFIGYDFDSQMVSFKPTDC 373
+ D V FKP+ C
Sbjct: 368 RSFEV--RIDGTSVGFKPSSC 386
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 119/360 (33%), Positives = 151/360 (41%), Gaps = 31/360 (8%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV---QCYKQVKPIYNPASSSSYKELSC 80
YV+ S+GTP + VDTGSDL WVQC PC CY Q P+++PA SSSY + C
Sbjct: 140 YVVTASLGTPGVAQTM-EVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPC 198
Query: 81 QSEQCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
C L S S C Y Y D S T GV +++ +T ++++ FGCGH
Sbjct: 199 GGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL-SASSAVQGFFFGCGH 257
Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
+G+FN + GL+GLGR + SL Q G FSYCL T S + G G
Sbjct: 258 AQSGLFNGVD-GLLGLGREQPSLVEQTAGTYG-GVFSYCL---PTKPSTAGYLTLGVGGP 312
Query: 199 VSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
ST+ L S TYY V L GISVG S +P +G + T
Sbjct: 313 SGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS---VPASAFAGGTVVDTGTVVTR 369
Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHFDGG 313
PPT Y L R+ + Y L CY + P + F G
Sbjct: 370 LPPTA-----YAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSG 424
Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
A V L G FA DG + I GN Q + D V FKP+ C
Sbjct: 425 ATVTLGADGIL----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKPSSC 478
>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 466
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 116/404 (28%), Positives = 174/404 (43%), Gaps = 66/404 (16%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC------------------- 53
V S++ + EY+ ++GTPP+ + DTGSDL+W++C
Sbjct: 71 VSSDLFYGDFEYLAAVNVGTPPVR-FLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSN 129
Query: 54 LPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDT-VSCSSQ-QLCNYTYGYADSSLT 111
+ +NP SSSY + C C L T SC+ C++ Y Y D +
Sbjct: 130 SSPPPPPPEAVVYFNPFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSYRDGASA 189
Query: 112 KGVLATERITFG----NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILS 167
G+LA + TFG N ++ FGC G + + G+VGLG LSLASQ+
Sbjct: 190 TGLLAADTFTFGGNINNDTTSTASIDFGCATGTAGREFQAD-GMVGLGAGPLSLASQL-- 246
Query: 168 QLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLV--SKEDKTYYFVTLEGI 225
KFS+CL + D + +S + FG + VS G +T L+ S YY I
Sbjct: 247 ---GRKFSFCLTAYDIDDA-SSILNFGARAVVSDPGAATTPLIASSSNAAAYY-----AI 297
Query: 226 SVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPK-DFYNRLEE---QVRNAIKLT 281
S+ +L + + +P G S + +DTG T L + L E +V + L
Sbjct: 298 SIDSLKVAGQPVP-----GTTSVSKVIVDTGTVLTFLDRAALLAPLTESLARVMDGAGLP 352
Query: 282 PYQDPRLGSQLCY---KTPSMAGIAP--ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFA 336
P +LCY + + G+ P L GG +V L TF+ EGV C A
Sbjct: 353 RAPPPDETLELCYDVSRVKDVDGVIPDVTLVLGGGGGGEVRLTGEGTFVLVK-EGVLCLA 411
Query: 337 -------MQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+QP+ + GN A DL +G D D++ +F +C
Sbjct: 412 VVTTSPELQPL----SVLGNVALQDLHVGIDLDARTATFATANC 451
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 124/381 (32%), Positives = 160/381 (41%), Gaps = 40/381 (10%)
Query: 3 PATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV---QC 59
PA++ Y ++ T N YV+ S+GTP + VDTGSDL WVQC PC C
Sbjct: 128 PASWGY-------DIGTLN--YVVTASLGTPGVAQTM-EVDTGSDLSWVQCKPCSAAPSC 177
Query: 60 YKQVKPIYNPASSSSYKELSCQSEQCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLAT 117
Y Q P+++PA SSSY + C C L S S C Y Y D S T GV ++
Sbjct: 178 YSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSS 237
Query: 118 ERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
+ +T ++++ FGCGH +G+FN + GL+GLGR + SL Q G FSYC
Sbjct: 238 DTLTL-SASSAVQGFFFGCGHAQSGLFNGVD-GLLGLGREQPSLVEQTAGTYG-GVFSYC 294
Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSK 235
L T S + G G ST+ L S TYY V L GISVG S
Sbjct: 295 L---PTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS-- 349
Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--C 293
+P +G + T PPT Y L R+ + Y L C
Sbjct: 350 -VPASAFAGGTVVDTGTVVTRLPPTA-----YAALRSAFRSGMASYGYPTAPSNGILDTC 403
Query: 294 YKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQ 352
Y + P + F GA V L G FA DG + I GN Q
Sbjct: 404 YNFAGYGTVTLPNVALTFGSGATVTLGADGIL----SFGCLAFAPSGSDGGMAILGNVQQ 459
Query: 353 SDLFIGYDFDSQMVSFKPTDC 373
+ D V FKP+ C
Sbjct: 460 RSFEV--RIDGTSVGFKPSSC 478
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 114/388 (29%), Positives = 177/388 (45%), Gaps = 43/388 (11%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC-LPCVQCYK---QVKPIYN 68
+ S + +Y + IGTP + DTGSDL W+ C C C K ++
Sbjct: 108 IHSGADSGQSQYFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFR 167
Query: 69 PASSSSYKELSCQSEQC--HLLDTVSCSS----QQLCNYTYGYADSSLTKGVLATERITF 122
SSS++ + C S+ C L D S + C + Y Y + GV A E +T
Sbjct: 168 ANDSSSFRTIPCSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVTV 227
Query: 123 GNSNN----FFDNVVFGCGHNNTGVFNENE---MGLVGLGRTRLSLASQILSQLGANKFS 175
G +++ FD V+ GC T FNE G++GLG + SLA + L+++ NKFS
Sbjct: 228 GLNDHKKIRLFD-VLIGC----TESFNETNGFPDGVMGLGYRKHSLALR-LAEIFGNKFS 281
Query: 176 YCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNS 233
YCLV + S+ + + FG+ E+ + T L+ +Y V + GISVG LS S
Sbjct: 282 YCLVDHLSSSNHKNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSMLSIS 341
Query: 234 SKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVR----NAIKLTPYQDPRLG 289
S + +N +G G M +D+G T+L + Y+++ + ++ K+ P + P L
Sbjct: 342 SDI---WNVTGV---GGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPEL- 394
Query: 290 SQLCYKTPSMAGIA-PILTAHFDGGA--KVPLIHTSTFIPPPVEGVFCFAMQPID-GDVG 345
+ C++ A P L HF GA K P+ ++I EG+ C + D
Sbjct: 395 NNFCFEDKGFDRAAVPRLLIHFADGAIFKPPV---KSYIIDVAEGIKCLGIIKADFPGSS 451
Query: 346 IFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
I GN Q + YD + F P+ C
Sbjct: 452 ILGNVMQQNHLWEYDLGRGKLGFGPSSC 479
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 116/358 (32%), Positives = 164/358 (45%), Gaps = 23/358 (6%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSCQ 81
E+V+ GTP I+DTGSDL W+QC PC CY+Q P ++PA SSSY + C
Sbjct: 136 EFVVVVGFGTPAQTAAI-ILDTGSDLSWIQCKPCSGHCYRQHDPDFDPAKSSSYAAVPCG 194
Query: 82 SEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
+ C + C+ C Y Y D S T GVL+ + +TF NS++ F FGCG N
Sbjct: 195 TPVCAAAGGM-CNGTT-CLYGVQYGDGSSTTGVLSRDTLTF-NSSSKFTGFTFGCGEKNI 251
Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
G F E + L+GLGR +LSL SQ G FSYCL ++T + G S
Sbjct: 252 GDFGEVDG-LLGLGRGKLSLPSQAAPSFGG-VFSYCLPSYNTTPGY---LNIGATKPTST 306
Query: 202 GGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPT 260
V T+++ K ++YF+ L I++G ++P S +K +D+G T
Sbjct: 307 VPVQYTAMIKKPQYPSFYFIELVSINIGGY-----ILPVPPS--VFTKTGTLLDSGTILT 359
Query: 261 LLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK-TPSMAGIAPILTAHFDGGAKVPLI 319
LP Y L ++ + ++ P CY T A + P ++ +F GA L
Sbjct: 360 YLPPPAYTSLRDRFKFTMQGNKPAPPYEPLDTCYDFTGQGAIVIPAVSFNFSDGAVFDLD 419
Query: 320 HTSTFIPP----PVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
I P P+ G F +P I GN Q + YD SQ + F P C
Sbjct: 420 FYGIMIFPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEVIYDVPSQKIGFIPISC 477
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 104/358 (29%), Positives = 151/358 (42%), Gaps = 30/358 (8%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELSC 80
EYV S GTP + + ++DTGSDL W+QC PC QC Q P+++P+ SS+Y + C
Sbjct: 111 EYVATVSFGTPAVPQVV-VIDTGSDLTWLQCKPCSSGQCSPQKDPLFDPSHSSTYSAVPC 169
Query: 81 QSEQCHLLDTVS----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
S +C L + CS+ Q C + Y D + T GV +++T + FGC
Sbjct: 170 ASGECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGVYGKDKLTLA-PGAIVKDFYFGC 228
Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
GH+ + + + L + SL +Q FSYCL ++ + FG G
Sbjct: 229 GHSKSSLPGLFDGLLGLGRLSE-SLGAQYGG---GGGFSYCLPAVNSKPGF---LAFGAG 281
Query: 197 SEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
SG V + T+ VTL GI+VG L P S G M +D+G
Sbjct: 282 RNPSGFVFTPMGRVPGQ-PTFSTVTLAGITVGG--KKLDLRPSAFS------GGMIVDSG 332
Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG-IAPILTAHFDGGAK 315
T+L Y L R A+K Y+ CY + P + F GGA
Sbjct: 333 TVVTVLQSTVYRALRAAFREAMKA--YRLVHGDLDTCYDLTGYKNVVVPKIALTFSGGAT 390
Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ L + + V G FA DG G+ GN Q + +D + F+ C
Sbjct: 391 INLDVPNGIL---VNGCLAFAETGKDGTAGVLGNVNQRTFEVLFDTSASKFGFRAKAC 445
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 104/378 (27%), Positives = 179/378 (47%), Gaps = 44/378 (11%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
G Y K +G+PP + Y VDTGSD++WV C PC +C + +Y+ +SS+ K
Sbjct: 75 GLYFTKIKLGSPPK-EYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKASSTSK 133
Query: 77 ELSCQSEQCH-LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNN--FF 129
+ C+ C ++ + +C +++ C+Y Y D S + G + IT GN
Sbjct: 134 NVGCEDAFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFVKDNITLDQVTGNLRTAPLA 193
Query: 130 DNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDS 185
VVFGCG N +G + E G++G G++ S+ SQ+ + + FS+CL
Sbjct: 194 QEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCL------D 247
Query: 186 SITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
++ F G EV V +T LV ++ +Y V L+G+ V L P S+
Sbjct: 248 NMNGGGIFAIG-EVESPVVKTTPLV--PNQVHYNVILKGMDVDG--EPIDLPPSLASTNG 302
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQV--RNAIKLTPYQDPRLGSQLCYKTPSMAGIA 303
G ID+G LP++ YN L E++ + +KL Q+ + C+ S A
Sbjct: 303 --DGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHMVQE----TFACFSFTSNTDKA 356
Query: 304 -PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP-----IDG-DVGIFGNFAQSDLF 356
P++ HF+ K+ ++ ++ E ++CF Q DG DV + G+ S+
Sbjct: 357 FPVVNLHFEDSLKLS-VYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKL 415
Query: 357 IGYDFDSQMVSFKPTDCT 374
+ YD +++++ + +C+
Sbjct: 416 VVYDLENEVIGWADHNCS 433
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 105/379 (27%), Positives = 183/379 (48%), Gaps = 46/379 (12%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
G Y K +G+PP + Y VDTGSD++WV C PC +C + +Y+ +SS+ K
Sbjct: 76 GLYFTKIKLGSPPK-EYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSK 134
Query: 77 ELSCQSEQCH-LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNN--FF 129
+ C+ + C ++ + +C +++ C+Y Y D S + G + IT GN
Sbjct: 135 NVGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLA 194
Query: 130 DNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK--FSYCLVPFHTD 184
VVFGCG N +G + + G++G G++ S+ SQ L+ G+ K FS+CL
Sbjct: 195 QEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQ-LAAGGSTKRIFSHCL------ 247
Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
++ F G EV V +T +V ++ +Y V L+G+ V + L P S+
Sbjct: 248 DNMNGGGIFAVG-EVESPVVKTTPIV--PNQVHYNVILKGMDVDG--DPIDLPPSLASTN 302
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQV--RNAIKLTPYQDPRLGSQLCYKTPSMAGI 302
G ID+G LP++ YN L E++ + +KL Q+ + C+ S
Sbjct: 303 G--DGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHMVQE----TFACFSFTSNTDK 356
Query: 303 A-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP-----IDG-DVGIFGNFAQSDL 355
A P++ HF+ K+ ++ ++ E ++CF Q DG DV + G+ S+
Sbjct: 357 AFPVVNLHFEDSLKLS-VYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNK 415
Query: 356 FIGYDFDSQMVSFKPTDCT 374
+ YD +++++ + +C+
Sbjct: 416 LVVYDLENEVIGWADHNCS 434
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 105/379 (27%), Positives = 183/379 (48%), Gaps = 46/379 (12%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
G Y K +G+PP + Y VDTGSD++WV C PC +C + +Y+ +SS+ K
Sbjct: 72 GLYFTKIKLGSPPK-EYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSK 130
Query: 77 ELSCQSEQCH-LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNN--FF 129
+ C+ + C ++ + +C +++ C+Y Y D S + G + IT GN
Sbjct: 131 NVGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLA 190
Query: 130 DNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK--FSYCLVPFHTD 184
VVFGCG N +G + + G++G G++ S+ SQ L+ G+ K FS+CL
Sbjct: 191 QEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQ-LAAGGSTKRIFSHCL------ 243
Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
++ F G EV V +T +V ++ +Y V L+G+ V + L P S+
Sbjct: 244 DNMNGGGIFAVG-EVESPVVKTTPIV--PNQVHYNVILKGMDVDG--DPIDLPPSLASTN 298
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQV--RNAIKLTPYQDPRLGSQLCYKTPSMAGI 302
G ID+G LP++ YN L E++ + +KL Q+ + C+ S
Sbjct: 299 G--DGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVKLHMVQE----TFACFSFTSNTDK 352
Query: 303 A-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP-----IDG-DVGIFGNFAQSDL 355
A P++ HF+ K+ ++ ++ E ++CF Q DG DV + G+ S+
Sbjct: 353 AFPVVNLHFEDSLKLS-VYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNK 411
Query: 356 FIGYDFDSQMVSFKPTDCT 374
+ YD +++++ + +C+
Sbjct: 412 LVVYDLENEVIGWADHNCS 430
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 106/390 (27%), Positives = 160/390 (41%), Gaps = 44/390 (11%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
+ EY++ SIGTP + +DTGSDL+W QC C C+ Q P ++ +S + + C
Sbjct: 97 DSEYLIHLSIGTPRPQRVALTLDTGSDLVWTQC-ACHVCFAQPFPTFDALASQTTLAVPC 155
Query: 81 QSEQCH--LLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITF----GNSNN------ 127
C C+ + C Y Y YAD S+T G + + TF GN+ +
Sbjct: 156 SDPICTSGKYPLSGCTFNDNTCFYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKAHAGV 215
Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
NV FGCG N G+F NE G+ G R +SL SQL +FS+C +
Sbjct: 216 AVPNVRFGCGQYNKGIFKSNESGIAGFSRGPMSLP----SQLKVARFSHCFTAIA--DAR 269
Query: 188 TSKMYFGN--GSEVSGG---GVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
TS ++ G G + G G V ++ + + + Y++TL+GI+VG + +
Sbjct: 270 TSPVFLGGAPGPDNLGAHATGPVQSTPFANSNGSLYYLTLKGITVGKTRLPLNALAFAGK 329
Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ--LCYKTPSMA 300
G ID+G LP Y L +KL + ++ LC++ +
Sbjct: 330 GTGSGSGGTIIDSGTGIRTLPGPMYRSLRAAFVARVKLPVANESAADAESTLCFE--AAR 387
Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV--------------FCFAMQPI-DGDVG 345
+ A KV L P E C M D D+
Sbjct: 388 SASLPPEAPAPALPKVVLHVAGADWDLPRESYVLDLLEDEDGSGSGLCLVMNSAGDSDLT 447
Query: 346 IFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
I GNF Q ++ + YD + + F P C K
Sbjct: 448 IIGNFQQQNMHVAYDLEKNKLVFVPARCDK 477
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 113/413 (27%), Positives = 178/413 (43%), Gaps = 81/413 (19%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS- 71
+ S S+ +G+Y + +G+PP + + DTGSDL WV+C C K I+ P S
Sbjct: 72 LMSGASSGSGQYFVSIRLGSPPQTLLL-VADTGSDLTWVRCSAC----KTNCSIHPPGST 126
Query: 72 -----SSSYKELSCQSEQCHLL---DTVSCSSQQL---CNYTYGYADSSLTKGVLATERI 120
S+++ C S C L+ + C+ +L C Y Y Y+D S T G + E
Sbjct: 127 FLARHSTTFSPTHCFSSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETT 186
Query: 121 TFGNSNNF---FDNVVFGCGHNNTG------VFNENEMGLVGLGRTRLSLASQILSQLGA 171
T S+ ++ FGCG + +G FN G++GLGR +S ASQ+ + G
Sbjct: 187 TLNTSSGREMKLKSIAFGCGFHASGPSLIGSSFN-GASGVMGLGRGPISFASQLGRRFG- 244
Query: 172 NKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTS------------LVSKEDKTYYF 219
FSYCL+ + TS + G+ VVST L++ E T+Y+
Sbjct: 245 RSFSYCLLDYTLSPPPTSYLMIGD--------VVSTKKDNKSMMSFTPLLINPEAPTFYY 296
Query: 220 VTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK 279
++++G+ V + + P S + G ID+G T L + Y + + +K
Sbjct: 297 ISIKGVFVDGV--KLHIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVK 354
Query: 280 LTPYQDP-----RLGSQLCYKTPSMAGIA----PILTAHFDGGAKVPLIHTSTFIPPPV- 329
L P P R G LC ++ G++ P L+ G S + PPP
Sbjct: 355 L-PSPTPGGASTRSGFDLCV---NVTGVSRPRFPRLSLELGG--------ESLYSPPPRN 402
Query: 330 ------EGVFCFAMQPIDGDVGIF---GNFAQSDLFIGYDFDSQMVSFKPTDC 373
EG+ C A+QP++ + G F GN Q + +D + F C
Sbjct: 403 YFIDISEGIKCLAIQPVEAESGRFSVIGNLMQQGFLLEFDRGKSRLGFSRRGC 455
>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 481
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 114/414 (27%), Positives = 179/414 (43%), Gaps = 73/414 (17%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC----------VQCYKQVKPIYNPASS 72
+Y+ + IG PP +VDTGSDL+W QC C C+ Q P YN + S
Sbjct: 77 QYIASYGIGDPPQ-PAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLS 135
Query: 73 SSYKELSCQSEQCHLL----DTVSC-----SSQQLCNYTYGYADSSLTKGVLATERITFG 123
+ + + C + L +T C S C Y + + GVL T+ TF
Sbjct: 136 RTARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYG-AGVALGVLGTDAFTFP 194
Query: 124 NSNNFFDNVVFGCGHNNT---GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
+S++ + FGC G N G++GLGR LSL +SQL A +FSYCL P
Sbjct: 195 SSSSV--TLAFGCVSQTRISPGALN-GASGIIGLGRGALSL----VSQLNATEFSYCLTP 247
Query: 181 FHTDSSITSKMYFGNGSEVSGGGV----------VSTSLVSKEDK-----TYYFVTLEGI 225
+ D+ S ++ G+G V+T +K K T+Y++ L G+
Sbjct: 248 YFRDTVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGL 307
Query: 226 SVGNLSNS--SKLIPYYNSSGAISKGNMFIDTGAPPTLL----PKDFYNRLEEQVRNAIK 279
+ GN + + + ++ + G ID+G+P T L + L Q+R +
Sbjct: 308 AAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGS 367
Query: 280 LTPYQDPRLGS--QLCYKT----PSMAGIA-PILTAHFD---GGAKVPLIHTSTFIPPPV 329
L P +LG +LC + S+A A P L FD GG + +I +
Sbjct: 368 LVP-PPAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVE 426
Query: 330 EGVFCFAM---------QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+C A+ P + + I GNF Q D+ + YD + ++SF+P +C+
Sbjct: 427 ASTWCMAVVSSASGNATLPTN-ETTIIGNFMQQDMRVLYDLANGLLSFQPANCS 479
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 105/369 (28%), Positives = 154/369 (41%), Gaps = 24/369 (6%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPAS 71
VQS + G Y++K ++GTP L + +DTGSD+ W QC PCV CY+Q + ++P
Sbjct: 34 VQSGIPLGAGNYLVKMALGTPKL-SLSLALDTGSDITWTQCEPCVGSCYRQAQTKFDPRK 92
Query: 72 SSSYKELSCQSEQCHLLD----TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
SSSYK +SC S C ++ C S C Y Y D S + G ATE++T S +
Sbjct: 93 SSSYKNVSCSSSSCRIITDSGGARGCVSST-CIYKVQYGDGSYSVGFFATEKLTISPS-D 150
Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
N +FGCG N G F + + + S+ N F+YCL F + S+
Sbjct: 151 VISNFLFGCGQQNAGRF--GRIAGLLGLGRGKLSLALQTSEKYNNLFTYCLPSFSSSST- 207
Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
G GG V + + + GI + LS ++P + S
Sbjct: 208 --------GHLTLGGQVPKSVKFTPLSPAFKNTPFYGIDIKGLSVGGHVLPI--DASVFS 257
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PIL 306
ID+G T L Y+ L + + +K P D CY I+ P +
Sbjct: 258 NAGAIIDSGTVITRLQPTVYSALSSKFQQLMKDYPKTDGFSILDTCYDFSGNESISVPRI 317
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP--IDGDVGIFGNFAQSDLFIGYDFDSQ 364
+ F GG +V + C A P DGD +FGN Q + +D
Sbjct: 318 SFFFKGGVEVDIKFFGILTVINAWDKVCLAFAPNDDDGDFVVFGNSQQQTYDVVHDLAKG 377
Query: 365 MVSFKPTDC 373
+ F P+ C
Sbjct: 378 RIGFAPSGC 386
>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
Length = 376
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 74/194 (38%), Positives = 104/194 (53%), Gaps = 12/194 (6%)
Query: 14 QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASS 72
+S + G YV+ +GTP D+ I DTGSDL W QC PC + CY Q +PI+NP+ S
Sbjct: 128 KSGSTIGTGNYVVTVGLGTPKR-DLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPSKS 186
Query: 73 SSYKELSCQSEQCHLL-----DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
+SY +SC S C L ++ SCS+ C Y Y D S + G A +++ S +
Sbjct: 187 TSYTNISCSSPTCDELKSGTGNSPSCSAST-CVYGIQYGDQSYSVGFFAQDKLAL-TSTD 244
Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI 187
F+N +FGCG NN G+F GL+GLGR LSL S+ A+ C D+
Sbjct: 245 VFNNFLFGCGQNNRGLF-VGVAGLIGLGRNALSLMSKYPKAAPASILDTCYDFSQYDTVD 303
Query: 188 TSK--MYFGNGSEV 199
K +YF +G+E+
Sbjct: 304 VPKINLYFSDGAEM 317
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 99/372 (26%), Positives = 165/372 (44%), Gaps = 36/372 (9%)
Query: 25 VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQ 84
++ IGTPP I+DTGS L W+QC V +++P+ SSS+ L C
Sbjct: 83 LVSLPIGTPPQTQQM-ILDTGSQLSWIQCHKKVPRKPPPSSVFDPSLSSSFSVLPCNHPL 141
Query: 85 CH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
C SC +LC+Y+Y YAD +L +G L E+ITF S + ++ GC
Sbjct: 142 CKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITFSRSQS-TPPLILGCAEE 200
Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
++ + G++G+ RLS ASQ KFSYC+ T F G
Sbjct: 201 SS-----DAKGILGMNLGRLSFASQA----KLTKFSYCVPTRQVRPGFTPTGSFYLGENP 251
Query: 200 SGGGVVSTSLVS--------KEDKTYYFVTLEGISVGNLSNSSKLIPYY-NSSGAISKGN 250
+ GG +L++ D Y V ++GI +GN + + + + SGA G
Sbjct: 252 NSGGFRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPSGA---GQ 308
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG--SQLCY--KTPSMAGIAPIL 306
ID+G+ T L + YN++ E+V + + G S +C+ + + +
Sbjct: 309 TMIDSGSEFTYLVDEAYNKVREEVVRLVGARLKKGYVYGGVSDMCFNGNAIEIGRLIGNM 368
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFIGYDFDS 363
FD G ++ ++ + GV C + + + I GNF Q ++++ +D +
Sbjct: 369 VFEFDKGVEI-VVEKERVLADVGGGVHCVGIGRSEMLGAASNIIGNFHQQNIWVEFDLAN 427
Query: 364 QMVSFKPTDCTK 375
+ V F DC++
Sbjct: 428 RRVGFGKADCSR 439
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 104/368 (28%), Positives = 163/368 (44%), Gaps = 17/368 (4%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
+S + Y++K IG+P + +Y + DTGS L W QC PC + ++Q+ PI+N +S +Y
Sbjct: 83 RISQDDTCYLVKVIIGSPGV-PLYLVPDTGSGLFWTQCEPCTRRFRQLPPIFNSTASRTY 141
Query: 76 KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
++L CQ + C V C Y YA S T GV A + + ++ FG
Sbjct: 142 RDLPCQHQFCTNNQNVFQCRDDKCVYRIAYAGGSATAGVAAQDILQSAENDRI--PFYFG 199
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLAS----QILSQLGANKFSYCLVPFH--TDSSITS 189
C +N F+ E G G L+++ Q ++ + N+FSYCL F + S TS
Sbjct: 200 CSRDNQN-FSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYCLNLFDLSSPSHATS 258
Query: 190 KMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
+ FGN S +ST VS YF+ L +SV N ++ P + G
Sbjct: 259 LLRFGNDIRKSRRKYLSTPFVSPRGMPNYFLNLIDVSVA--GNRMQIPPGTFALKPDGTG 316
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ--DPRLGSQLCYKTPSMA-GIAPIL 306
ID+G T + + Y + +N +Q + +L +CYK P +
Sbjct: 317 GTIIDSGTAVTYISQTAYFPVITAFKNYFDQHGFQRVNIQLSGYICYKQQGHTFHNYPSM 376
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID-GDVGIFGNFAQSDLFIGYDFDSQM 365
HF GA + ++ G FC A+QPI I G Q++ YD ++
Sbjct: 377 AFHFQ-GADFFVEPEYVYLTVQDRGAFCVALQPISPQQRTIIGALNQANTQFIYDAANRQ 435
Query: 366 VSFKPTDC 373
+ F P +C
Sbjct: 436 LLFTPENC 443
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 108/387 (27%), Positives = 174/387 (44%), Gaps = 47/387 (12%)
Query: 15 SNVSTANGE----YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPA 70
S+ A+G+ YV++ +G+P I +DT +D W C PC C ++ PA
Sbjct: 64 SSAPVASGQSPPSYVVRAGLGSP-AQPILLALDTSADATWAHCSPCGTCPSSGS-LFAPA 121
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSSQQ---------LCNYTYGYADSSLTKGVLATERIT 121
+S+SY L C S C +L C +Q +C +T +AD+S + LA++ +
Sbjct: 122 NSTSYAPLPCSSTMCTVLQGQPCPAQDPYDSSAPLPMCAFTKPFADASF-QASLASDWLH 180
Query: 122 FGNSNNFFDNVVFGCGHNNTG-VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
G + N FGC +G N + GL+GLGR ++L SQ+ + FSYCL
Sbjct: 181 LGK--DAIPNYAFGCVSAVSGPTANLPKQGLLGLGRGPMALLSQV-GNMYNGVFSYCLPS 237
Query: 181 FHTDSSITSKMYFGNGSEVSGG-----GVVSTSLVSKEDKT-YYFVTLEGISVGN--LSN 232
+ K Y+ +GS G GV T ++ +++ Y+V + G+SVG +
Sbjct: 238 Y--------KSYYFSGSLRLGAAGQPRGVRYTPMLKNPNRSSLYYVNVTGLSVGRAPVKV 289
Query: 233 SSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-Q 291
+ + ++GA +D+G T Y L E+ R + P LG+
Sbjct: 290 PAGSFAFDPATGA----GTVVDSGTVITRWTPPVYAALREEFRRHVA-APSGYTSLGAFD 344
Query: 292 LCYKTPSMA-GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM----QPIDGDVGI 346
C+ T +A G+AP +T H DGG + L +T I + C AM Q ++ V +
Sbjct: 345 TCFNTDEVAAGVAPAVTVHMDGGLDLALPMENTLIHSSATPLACLAMAEAPQNVNAVVNV 404
Query: 347 FGNFAQSDLFIGYDFDSQMVSFKPTDC 373
N Q +L + +D + V F C
Sbjct: 405 LANLQQQNLRVVFDVANSRVGFARESC 431
>gi|125564663|gb|EAZ10043.1| hypothetical protein OsI_32347 [Oryza sativa Indica Group]
Length = 330
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 97/359 (27%), Positives = 157/359 (43%), Gaps = 58/359 (16%)
Query: 36 LDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSS 95
+D+ + DT SDL+W QC PC+ C Q +Y+P + +Y L+
Sbjct: 1 MDVTLVFDTTSDLLWTQCQPCLSCVAQAGDMYDPNKTETYANLTSS-------------- 46
Query: 96 QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLG 155
NY Y Y+ S T G ATE GN N+ FGCG N G ++
Sbjct: 47 ----NYNYTYSKQSFTSGYFATETFALGNVT--VANITFGCGTRNQGYYDNVAG-----V 95
Query: 156 RTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTS-----LV 210
+L+QLG ++FSYC + + +S ++ G E++ + + +
Sbjct: 96 FGVGRGGVSLLNQLGIDRFSYCFS--SSGAPGSSAVFLGGSPELATNATTTPAASTPMVA 153
Query: 211 SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN-MFIDTGAPPTLLPKDFYNR 269
K+ YFV L G++VG + + +S A G + ID+ +P T+L + Y
Sbjct: 154 DPVLKSGYFVKLVGVTVG-----ATRVDVAGASSAEGGGRALVIDSTSPVTVLDEATYG- 207
Query: 270 LEEQVRNAI--KLTPYQDPR------LGSQLCYKTPSMAGIAP-----ILTAHFDGGAKV 316
VR A+ +L P ++ +G LC++ + G P +T HFDGGA
Sbjct: 208 ---PVRRALVAQLAPLKEANANASAGVGLDLCFEL-AAGGATPTPPNVTMTLHFDGGAAD 263
Query: 317 PLIHTSTFIPP-PVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
++ + ++ G+ C M P + V + G+ A D + YD +VSF+P DC
Sbjct: 264 LVLPPANYLAKDSAGGLICLTMTPSSSNGVPVLGSSALLDTLVLYDLAKNVVSFQPLDC 322
>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 71/171 (41%), Positives = 97/171 (56%), Gaps = 11/171 (6%)
Query: 14 QSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-CYKQVKPIYNPASS 72
+S + +G YV+ +G+P D+ I DTGSDL W QC PCV CY+Q + I++P++S
Sbjct: 79 KSASTLGSGNYVVTVGLGSPKR-DLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTS 137
Query: 73 SSYKELSCQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
SY +SC S C L++ + CSS C Y Y D S + G A E+++ S +
Sbjct: 138 LSYSNVSCDSPSCEKLESATGNSPGCSSST-CLYGIRYGDGSYSIGFFAREKLSL-TSTD 195
Query: 128 FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL 178
F+N FGCG NN G+F GL+GL R LSL SQ + G FSYCL
Sbjct: 196 VFNNFQFGCGQNNRGLFG-GTAGLLGLARNPLSLVSQTAQKYG-KVFSYCL 244
>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
Length = 464
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 113/378 (29%), Positives = 175/378 (46%), Gaps = 43/378 (11%)
Query: 10 NNVVQSNVSTANGE-YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
+++ Q+ VS NG Y ++G+PP D ++DTGSDL WV+C PC ++
Sbjct: 109 HDLAQTPVSFTNGGVYYSSITLGSPPK-DFSLVMDTGSDLTWVRCDPC---SPDCSSTFD 164
Query: 69 PASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF 128
+S++YK L+C + L + ++L + D+ G + E
Sbjct: 165 RLASNTYKALTCADDL--RLPVLLRLWRRLFHSGRSLRDTLKMAGAASDEL-------EE 215
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
F VFGCG G+ + E+G++ L LS SQI + G NKFSYCL+ +S+
Sbjct: 216 FPGFVFGCGSLLKGLIS-GEVGILALSPGSLSFPSQIGEKYG-NKFSYCLLRQTAQNSLK 273
Query: 189 -SKMYFGNGS-EVSGGGVVSTSLVS----KEDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
S M FG + E+ G + E YY V L+GISVGN L P
Sbjct: 274 KSPMVFGEAAVELKEPGSGKPQELQYTPIGESSIYYTVRLDGISVGN--QRLDLSPSTFL 331
Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI 302
+G K +F D+G T+LP + +++ + + + + + G C++ P +G
Sbjct: 332 NGQ-DKPTIF-DSGTTLTMLPSGVCDSIKQSLASMVSGAEFVAIK-GLDACFRVPPSSGQ 388
Query: 303 A-PILTAHFDGGAKVPLIHTSTFIPPPVEGVF------CFAMQPIDGDVGIFGNFAQSDL 355
P +T HF+GGA F+ P V C P + +V IFGN Q D
Sbjct: 389 GLPDITFHFNGGAD--------FVTRPSNYVIDLGSLQCLIFVPTN-EVSIFGNLQQQDF 439
Query: 356 FIGYDFDSQMVSFKPTDC 373
F+ +D D++ + FK TDC
Sbjct: 440 FVLHDMDNRRIGFKETDC 457
>gi|367066697|gb|AEX12632.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066699|gb|AEX12633.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066701|gb|AEX12634.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066703|gb|AEX12635.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066705|gb|AEX12636.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066707|gb|AEX12637.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066709|gb|AEX12638.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066711|gb|AEX12639.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066713|gb|AEX12640.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066715|gb|AEX12641.1| hypothetical protein 2_5918_01 [Pinus taeda]
gi|367066717|gb|AEX12642.1| hypothetical protein 2_5918_01 [Pinus taeda]
Length = 137
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 61/130 (46%), Positives = 81/130 (62%), Gaps = 4/130 (3%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
VQ+ VS NGE++M+ +IG P L I+DTGSDL W QC+PC CYKQ PIY+P+ S
Sbjct: 10 VQAPVSAGNGEFLMQLAIGKPSLA-YSAILDTGSDLTWTQCMPCSDCYKQPTPIYDPSLS 68
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+Y +SC+S C L +C S C Y Y Y D S T+G+L+ E TF S+ ++
Sbjct: 69 STYGTVSCKSSLCLALPASACISAT-CEYLYTYGDYSSTQGILSYE--TFTLSSQSIPHI 125
Query: 133 VFGCGHNNTG 142
FGCG +N G
Sbjct: 126 AFGCGQDNEG 135
>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 475
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 100/342 (29%), Positives = 147/342 (42%), Gaps = 28/342 (8%)
Query: 42 VDTGSDLMWVQCLPCV--QCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTV--SCSSQQ 97
+DT D+ W+QC PC QCY Q P+++P +SS+ + C+S C L CS++
Sbjct: 152 IDTTVDVPWIQCAPCPIPQCYPQRDPLFDPTTSSTAAAVRCRSPACRSLGPYGNGCSNRS 211
Query: 98 L---CNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGL 154
C Y Y+D T G T+ +T + N FGC H G F++ G + L
Sbjct: 212 ANAECRYLIEYSDDRATAGTYMTDTLTISGTTA-VRNFRFGCSHAVRGRFSDLTAGTMSL 270
Query: 155 GRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTS--LVSK 212
G SL +Q LG N FSYC VP +S + + G + + V +T+ + S
Sbjct: 271 GGGAQSLLAQTARSLG-NAFSYC-VP---QASASGFLSIGGPATTNSTTVFATTPLVRSA 325
Query: 213 EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEE 272
+ + Y V L+GI V + P S+GA+ +D+ A T LP Y L
Sbjct: 326 INPSLYLVRLQGIVVAG--RRLGIPPVAFSAGAV------MDSSAVITQLPPTAYRALRR 377
Query: 273 QVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEG 331
RNA++ P CY + + P ++ F GGA V L + I G
Sbjct: 378 AFRNAMRAYPRSGATGTLDTCYDFLGLTNVRVPAVSLVFGGGAVVVLDPPAVMI----GG 433
Query: 332 VFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
F D +G GN Q + YD + V F+ C
Sbjct: 434 CLAFTATSSDLALGFIGNVQQQTHEVLYDVAAGGVGFRRGAC 475
>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 114/381 (29%), Positives = 174/381 (45%), Gaps = 61/381 (16%)
Query: 12 VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
++S ++ +GEY M +G+PP I+DTGSDL W+QCLPC C++Q
Sbjct: 158 TLESGMTLGSGEYFMDVLVGSPPK-HFSLILDTGSDLNWIQCLPCYDCFQQ--------- 207
Query: 72 SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF-----GNSN 126
+ Q C Y Y Y DSS T G A E T G S+
Sbjct: 208 ----------------------NDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSS 245
Query: 127 NFF--DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
+ +N++FGCGH N G+F+ L R LS +SQ+ S G + FSYCLV ++D
Sbjct: 246 ELYNVENMMFGCGHWNRGLFHGAAGLLGLG-RGPLSFSSQLQSLYG-HSFSYCLVDRNSD 303
Query: 185 SSITSKMYFGNGSE-VSGGGVVSTSLVSKEDK---TYYFVTLEGISV-GNLSNSSKLIPY 239
++++SK+ FG + +S + TS V+ ++ T+Y+V ++ I V G + N +
Sbjct: 304 TNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWN 363
Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTPYQD-PRLGSQLCY 294
+S GA G ID+G + + Y N++ E+ + K Y+D P L C+
Sbjct: 364 ISSDGA---GGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKG--KYPVYRDFPILDP--CF 416
Query: 295 KTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQ 352
+ + P L F GA ++FI E + C AM I GN+ Q
Sbjct: 417 NVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLN-EDLVCLAMLGTPKSAFSIIGNYQQ 475
Query: 353 SDLFIGYDFDSQMVSFKPTDC 373
+ I YD + + PT C
Sbjct: 476 QNFHILYDTKRSRLGYAPTKC 496
>gi|367066719|gb|AEX12643.1| hypothetical protein 2_5918_01 [Pinus radiata]
Length = 137
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 61/130 (46%), Positives = 81/130 (62%), Gaps = 4/130 (3%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
VQ+ VS NGE++M+ +IG P L I+DTGSDL W QC+PC CYKQ PIY+P+ S
Sbjct: 10 VQAPVSAGNGEFLMQLAIGKPSLA-YSAILDTGSDLTWTQCIPCSDCYKQPTPIYDPSLS 68
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
S+Y +SC+S C L +C S C Y Y Y D S T+G+L+ E TF S+ ++
Sbjct: 69 STYGTVSCKSSLCLALPASACISAT-CEYLYTYGDYSSTQGILSYE--TFTLSSQSIPHI 125
Query: 133 VFGCGHNNTG 142
FGCG +N G
Sbjct: 126 AFGCGQDNEG 135
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 105/378 (27%), Positives = 168/378 (44%), Gaps = 48/378 (12%)
Query: 25 VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQ 84
++ IGTPP I+DTGS L W+QC V +++P+ SSS+ L C
Sbjct: 78 LVSLPIGTPPQSQQM-ILDTGSQLSWIQCHKKVPRKPPPSTVFDPSLSSSFSVLPCNHPL 136
Query: 85 CH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
C SC +LC+Y+Y YAD +L +G L E+ITF S + ++ GC +
Sbjct: 137 CKPRIPDFTLPTSCDLNRLCHYSYFYADGTLAEGNLVREKITFSTSQS-TPPLILGCAED 195
Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI--TSKMYFGNGS 197
+ ++ G++G+ RLS ASQ KFSYC+ T Y G
Sbjct: 196 AS-----DDKGILGMNLGRLSFASQA----KITKFSYCVPTRQVRPGFTPTGSFYLGENP 246
Query: 198 EVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNSSKLIPYY-NSSGAISKGN 250
+G +S S+ D + V L+GI +GN + + + + SGA G
Sbjct: 247 NSAGFQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGA---GQ 303
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG--------SQLCYKTPSM--A 300
ID+G+ T L YN++ E+V ++L PRL S +C+ +M
Sbjct: 304 SMIDSGSEFTYLVDVAYNKVREEV---VRLA---GPRLKKGYVYSGVSDMCFDGNAMEIG 357
Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFI 357
+ + FD G ++ +I + GV C + + + I GNF Q +L++
Sbjct: 358 RLIGNMVFEFDKGVEI-VIEKGRVLADVGGGVHCVGIGRSEMLGAASNIIGNFHQQNLWV 416
Query: 358 GYDFDSQMVSFKPTDCTK 375
+D ++ V F DC++
Sbjct: 417 EFDIANRRVGFGKADCSR 434
>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
Length = 467
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 99/376 (26%), Positives = 158/376 (42%), Gaps = 35/376 (9%)
Query: 24 YVMKFSIGTPP--LLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-PIYNPASSSSYKELSC 80
Y+++ IGTP + Y + DTGSDL W QC PC C P ++P+ S +++ LSC
Sbjct: 102 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 161
Query: 81 QSEQCHLLDTV--SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-----FFDNVV 133
C L V C + Y D G L ++ FG + + +V
Sbjct: 162 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 221
Query: 134 FGCGH-NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL--------VPFHTD 184
FGC H ++ G++ LG + S ++QLG ++FSYC+ +
Sbjct: 222 FGCAHVEDSKAVRGYSTGILALGIGKPSF----VTQLGVDRFSYCIPASEITDDDDDDDE 277
Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGI--SVGNLSNSSKLIPYYNS 242
S + FG+ + ++G K+D + Y V L+ + G N + +P Y +
Sbjct: 278 ERSASFLRFGSHARMTG-----KRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVA 332
Query: 243 -SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
A + M +D+G LP + L+ ++ I LT D S CY
Sbjct: 333 GEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGNMTDV 392
Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPP--VEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
A +T F GGA + L TS F E C A+ G+ I G + Q ++ +GY
Sbjct: 393 EAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAA--GNRAILGVYPQRNINVGY 450
Query: 360 DFDSQMVSFKPTDCTK 375
D + ++F C +
Sbjct: 451 DLSTMEIAFDRDQCDR 466
>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 324
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 113/342 (33%), Positives = 142/342 (41%), Gaps = 30/342 (8%)
Query: 42 VDTGSDLMWVQCLPCV---QCYKQVKPIYNPASSSSYKELSCQSEQCHLLD--TVSCSSQ 96
VDTGSDL WVQC PC CY Q P+++PA SSSY + C C L S S
Sbjct: 3 VDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSA 62
Query: 97 QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGR 156
C Y Y D S T GV +++ +T ++++ FGCGH +G+FN + GL+GLGR
Sbjct: 63 AQCGYVVSYGDGSNTTGVYSSDTLTL-SASSAVQGFFFGCGHAQSGLFNGVD-GLLGLGR 120
Query: 157 TRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTS--LVSKED 214
+ SL Q G FSYCL T S + G G ST+ L S
Sbjct: 121 EQPSLVEQTAGTYG-GVFSYCL---PTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNA 176
Query: 215 KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV 274
TYY V L GISVG S +P +G + T PPT Y L
Sbjct: 177 PTYYVVMLTGISVGGQQLS---VPASAFAGGTVVDTGTVVTRLPPTA-----YAALRSAF 228
Query: 275 RNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEG 331
R+ + Y L CY + P + F GA V L G
Sbjct: 229 RSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL----SFG 284
Query: 332 VFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
FA DG + I GN Q + D V FKP+ C
Sbjct: 285 CLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKPSSC 324
>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
Length = 488
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 99/376 (26%), Positives = 158/376 (42%), Gaps = 35/376 (9%)
Query: 24 YVMKFSIGTPP--LLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-PIYNPASSSSYKELSC 80
Y+++ IGTP + Y + DTGSDL W QC PC C P ++P+ S +++ LSC
Sbjct: 123 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 182
Query: 81 QSEQCHLLDTV--SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-----FFDNVV 133
C L V C + Y D G L ++ FG + + +V
Sbjct: 183 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 242
Query: 134 FGCGH-NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL--------VPFHTD 184
FGC H ++ G++ LG + S ++QLG ++FSYC+ +
Sbjct: 243 FGCAHVEDSKAVRGYSTGILALGIGKPSF----VTQLGVDRFSYCIPASEITDDDDDDDE 298
Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGI--SVGNLSNSSKLIPYYNS 242
S + FG+ + ++G K+D + Y V L+ + G N + +P Y +
Sbjct: 299 ERSASFLRFGSHARMTG-----KRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVA 353
Query: 243 -SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
A + M +D+G LP + L+ ++ I LT D S CY
Sbjct: 354 GEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGNMTDV 413
Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPP--VEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
A +T F GGA + L TS F E C A+ G+ I G + Q ++ +GY
Sbjct: 414 EAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAA--GNRAILGVYPQRNINVGY 471
Query: 360 DFDSQMVSFKPTDCTK 375
D + ++F C +
Sbjct: 472 DLSTMEIAFDRDQCDR 487
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 109/381 (28%), Positives = 173/381 (45%), Gaps = 46/381 (12%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK----QVK-PIYNPASSSSYK 76
G Y K +GTPP+ + +DTGSD++WV C C C + Q++ ++P SSS+
Sbjct: 76 GLYYTKVQLGTPPV-EFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSS 134
Query: 77 ELSCQSEQCH---LLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN- 131
++C ++C+ +CSSQ C+YT+ Y D S T G ++ + N F+
Sbjct: 135 MIACSDQRCNNGKQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHL---NTIFEGS 191
Query: 132 --------VVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLV 179
VVFGC + TG +++ G+ G G+ +S+ SQ+ SQ + FS+CL
Sbjct: 192 MTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCL- 250
Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLI 237
DSS + G E+ +V TSLV + +Y + L+ ISV L S +
Sbjct: 251 --KGDSSGGGILVLG---EIVEPNIVYTSLVPAQ--PHYNLNLQSISVNGQTLQIDSSVF 303
Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP 297
NS G I +D+G L ++ Y+ + AI + G+Q T
Sbjct: 304 ATSNSRGTI------VDSGTTLAYLAEEAYDPFVSAITAAIPQSVRTVVSRGNQCYLITS 357
Query: 298 SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD-VGIFGNFAQS 353
S+ + P ++ +F GGA + L I G V+C Q I G + I G+
Sbjct: 358 SVTDVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLK 417
Query: 354 DLFIGYDFDSQMVSFKPTDCT 374
D + YD Q + + DC+
Sbjct: 418 DKIVVYDLAGQRIGWANYDCS 438
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 87/276 (31%), Positives = 129/276 (46%), Gaps = 28/276 (10%)
Query: 8 YPNNV---VQSNVSTANGEYVMKFSIGTPPLLDIYG-IVDTGSDLMWVQCLPCV-QCYKQ 62
+P +V + S +G Y +K G+P Y IVDTGS L W+QC PCV C+ Q
Sbjct: 99 FPKSVSVPLNPGASIGSGNYYVKVGFGSPA--RYYSMIVDTGSSLSWLQCKPCVVYCHVQ 156
Query: 63 VKPIYNPASSSSYKELSCQSEQCHLLDTVSC------SSQQLCNYTYGYADSSLTKGVLA 116
P+++P++S +YK LSC S QC L + +S +C YT Y DSS + G L+
Sbjct: 157 ADPLFDPSASKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLS 216
Query: 117 TERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSY 176
+ +T S V+GCG ++ G+F G++GLGR +LS+ Q+ S+ G FSY
Sbjct: 217 QDLLTLAPSQT-LPGFVYGCGQDSDGLFGR-AAGILGLGRNKLSMLGQVSSKFG-YAFSY 273
Query: 177 CLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKL 236
CL P S G S + + + YF+ L I+VG +
Sbjct: 274 CL-PTRGGGGFLS---IGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRA----- 324
Query: 237 IPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEE 272
+ A + ID+G T LP Y ++
Sbjct: 325 ---LGVAAAQYRVPTIIDSGTVITRLPMSVYTPFQQ 357
>gi|222629462|gb|EEE61594.1| hypothetical protein OsJ_16002 [Oryza sativa Japonica Group]
Length = 468
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 99/378 (26%), Positives = 158/378 (41%), Gaps = 37/378 (9%)
Query: 24 YVMKFSIGTPP--LLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-PIYNPASSSSYKELSC 80
Y+++ IGTP + Y + DTGSDL W QC PC C P ++P+ S +++ LSC
Sbjct: 101 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 160
Query: 81 QSEQCHLLDTV--SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-----FFDNVV 133
C L V C + Y D G L ++ FG + + +V
Sbjct: 161 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 220
Query: 134 FGCGH-NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL----------VPFH 182
FGC H ++ G++ LG + S ++QLG ++FSYC+
Sbjct: 221 FGCAHVEDSKAVRGYSTGILALGIGKPSF----VTQLGVDRFSYCIPASEITDDDDDDDD 276
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGI--SVGNLSNSSKLIPYY 240
+ S + FG+ + ++G K+D + Y V L+ + G N + +P Y
Sbjct: 277 DEERSASFLRFGSHARMTG-----KRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVY 331
Query: 241 NS-SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM 299
+ A + M +D+G LP + L+ ++ I LT D S CY
Sbjct: 332 VAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGNMT 391
Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPP--VEGVFCFAMQPIDGDVGIFGNFAQSDLFI 357
A +T F GGA + L TS F E C A+ G+ I G + Q ++ +
Sbjct: 392 DVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAA--GNRAILGVYPQRNINV 449
Query: 358 GYDFDSQMVSFKPTDCTK 375
GYD + ++F C +
Sbjct: 450 GYDLSTMEIAFDRDQCDR 467
>gi|115460260|ref|NP_001053730.1| Os04g0595000 [Oryza sativa Japonica Group]
gi|113565301|dbj|BAF15644.1| Os04g0595000, partial [Oryza sativa Japonica Group]
Length = 471
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 99/378 (26%), Positives = 158/378 (41%), Gaps = 37/378 (9%)
Query: 24 YVMKFSIGTPP--LLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-PIYNPASSSSYKELSC 80
Y+++ IGTP + Y + DTGSDL W QC PC C P ++P+ S +++ LSC
Sbjct: 104 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 163
Query: 81 QSEQCHLLDTV--SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-----FFDNVV 133
C L V C + Y D G L ++ FG + + +V
Sbjct: 164 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 223
Query: 134 FGCGH-NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL----------VPFH 182
FGC H ++ G++ LG + S ++QLG ++FSYC+
Sbjct: 224 FGCAHVEDSKAVRGYSTGILALGIGKPSF----VTQLGVDRFSYCIPASEITDDDDDDDD 279
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGI--SVGNLSNSSKLIPYY 240
+ S + FG+ + ++G K+D + Y V L+ + G N + +P Y
Sbjct: 280 DEERSASFLRFGSHARMTG-----KRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVY 334
Query: 241 NS-SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM 299
+ A + M +D+G LP + L+ ++ I LT D S CY
Sbjct: 335 VAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGNMT 394
Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPP--VEGVFCFAMQPIDGDVGIFGNFAQSDLFI 357
A +T F GGA + L TS F E C A+ G+ I G + Q ++ +
Sbjct: 395 DVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAA--GNRAILGVYPQRNINV 452
Query: 358 GYDFDSQMVSFKPTDCTK 375
GYD + ++F C +
Sbjct: 453 GYDLSTMEIAFDRDQCDR 470
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 107/381 (28%), Positives = 174/381 (45%), Gaps = 46/381 (12%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK----QVK-PIYNPASSSSYK 76
G Y K +GTPP+ + +DTGSD++WV C C C + Q++ ++P SSS+
Sbjct: 73 GLYYTKVQLGTPPV-EFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSS 131
Query: 77 ELSCQSEQCH---LLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN- 131
++C ++C+ +CSSQ C+YT+ Y D S T G ++ + N F+
Sbjct: 132 MIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHL---NTIFEGS 188
Query: 132 --------VVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLV 179
VVFGC + TG +++ G+ G G+ +S+ SQ+ SQ + FS+CL
Sbjct: 189 VTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCL- 247
Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLI 237
DSS + G E+ +V TSLV + +Y + L+ I+V L S +
Sbjct: 248 --KGDSSGGGILVLG---EIVEPNIVYTSLVPAQP--HYNLNLQSIAVNGQTLQIDSSVF 300
Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP 297
NS G I +D+G L ++ Y+ + +I + + G+Q T
Sbjct: 301 ATSNSRGTI------VDSGTTLAYLAEEAYDPFVSAITASIPQSVHTVVSRGNQCYLITS 354
Query: 298 SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD-VGIFGNFAQS 353
S+ + P ++ +F GGA + L I G V+C Q I G + I G+
Sbjct: 355 SVTEVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLK 414
Query: 354 DLFIGYDFDSQMVSFKPTDCT 374
D + YD Q + + DC+
Sbjct: 415 DKIVVYDLAGQRIGWANYDCS 435
>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
Length = 489
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 99/378 (26%), Positives = 158/378 (41%), Gaps = 37/378 (9%)
Query: 24 YVMKFSIGTPP--LLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-PIYNPASSSSYKELSC 80
Y+++ IGTP + Y + DTGSDL W QC PC C P ++P+ S +++ LSC
Sbjct: 122 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSC 181
Query: 81 QSEQCHLLDTV--SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN-----FFDNVV 133
C L V C + Y D G L ++ FG + + +V
Sbjct: 182 FDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVA 241
Query: 134 FGCGH-NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL----------VPFH 182
FGC H ++ G++ LG + S ++QLG ++FSYC+
Sbjct: 242 FGCAHVEDSKAVRGYSTGILALGIGKPSF----VTQLGVDRFSYCIPASEITDDDDDDDD 297
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGI--SVGNLSNSSKLIPYY 240
+ S + FG+ + ++G K+D + Y V L+ + G N + +P Y
Sbjct: 298 DEERSASFLRFGSHARMTG-----KRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVY 352
Query: 241 NS-SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM 299
+ A + M +D+G LP + L+ ++ I LT D S CY
Sbjct: 353 VAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGNMT 412
Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPP--VEGVFCFAMQPIDGDVGIFGNFAQSDLFI 357
A +T F GGA + L TS F E C A+ G+ I G + Q ++ +
Sbjct: 413 DVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAA--GNRAILGVYPQRNINV 470
Query: 358 GYDFDSQMVSFKPTDCTK 375
GYD + ++F C +
Sbjct: 471 GYDLSTMEIAFDRDQCDR 488
>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 330
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 100/315 (31%), Positives = 143/315 (45%), Gaps = 31/315 (9%)
Query: 65 PIYNPASSSSYKELSCQSEQCHLLDTVSCSS-----QQLCNYTYGYADSSLTKGVLATER 119
P ++ ++SS+ SC S C L SC + Q C YTY Y D S+T G++ ++
Sbjct: 23 PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLIEVDK 82
Query: 120 ITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
TFG + V FGCG N GVF NE G+ G GR LSL SQL FS+C
Sbjct: 83 FTFGAGASV-PGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLP----SQLKVGNFSHCFT 137
Query: 180 PFHTDSSITSKM-----YFGNGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNS 233
+ T + + NG G V ST L+ + + T+Y+++L+GI+VG
Sbjct: 138 AVNGLKQSTVLLDLPADLYKNGR----GAVQSTPLIQNSANPTFYYLSLKGITVG----- 188
Query: 234 SKLIPYYNSSGAISKGN--MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ 291
S +P S+ A++ G ID+G T LP Y + ++ IKL G
Sbjct: 189 STRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPY 248
Query: 292 LCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEG--VFCFAMQPIDGDVGIFG 348
C+ PS A P L HF+G F P G + C A+ D + I G
Sbjct: 249 TCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGD-ETTIIG 307
Query: 349 NFAQSDLFIGYDFDS 363
NF Q ++ + YD +
Sbjct: 308 NFQQQNMHVLYDLQN 322
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 94/338 (27%), Positives = 143/338 (42%), Gaps = 23/338 (6%)
Query: 41 IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVS--CSSQ 96
++D+ SD+ WVQC+PC C+ QV Y+P+ S S SC S C L + C++
Sbjct: 162 VLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTALGPYANGCANN 221
Query: 97 QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGR 156
Q C Y Y D S T G + +T ++ N FGC H G F+ G++ LG
Sbjct: 222 Q-CQYLVRYPDGSSTSGAYIADLLTL-DAGNAVSGFKFGCSHAEQGSFDARAAGIMALGG 279
Query: 157 TRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKT 216
SL SQ S+ G N FSYC+ +DS G S VV+ + ++ T
Sbjct: 280 GPESLLSQTASRYG-NAFSYCIPATASDSGF---FTLGVPRRASSRYVVTPMVRFRQAAT 335
Query: 217 YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRN 276
+Y V L I+VG + P ++G++ +D+ T LP Y L R+
Sbjct: 336 FYGVLLRTITVGG--QRLGVAPAVFAAGSV------LDSRTAITRLPPTAYQALRSAFRS 387
Query: 277 AIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF 335
++ + P+ CY + I P ++ FD A +PL + F
Sbjct: 388 SMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILF----NDCLAF 443
Query: 336 AMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
D G+ G+ Q + + YD V F+ C
Sbjct: 444 TSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 481
>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
Length = 471
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 107/348 (30%), Positives = 139/348 (39%), Gaps = 26/348 (7%)
Query: 34 PLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDT- 90
P+L +DT DL W+QC PC +CY Q +++P S + + C S C L
Sbjct: 142 PILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRY 201
Query: 91 -VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEM 149
CS+ Q C Y Y D T G + +T N + N FGC H G F+ +
Sbjct: 202 GAGCSNNQ-CQYFVDYGDGRATSGTYMVDALTL-NPSTVVMNFRFGCSHAVRGNFSASTS 259
Query: 150 GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSL 209
G + LG R SL SQ + G N FSYC VP + S S +G T L
Sbjct: 260 GTMSLGGGRQSLLSQTAATFG-NAFSYC-VPDPSSSGFLSLGGPADGGGAG--RFARTPL 315
Query: 210 VSKED--KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFY 267
V T Y V L GI VG + P + GA+ ++ I T LP Y
Sbjct: 316 VRNPSIIPTLYLVRLRGIEVGG--RRLNVPPVVFAGGAVMDSSVII------TQLPPTAY 367
Query: 268 NRLEEQVRNAIKLTPY-QDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFI 325
L R+A+ P R G CY + P ++ FDGGA V L
Sbjct: 368 RALRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVM- 426
Query: 326 PPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
VEG F P D +G GN Q + YD V F+ C
Sbjct: 427 ---VEGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 471
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 109/364 (29%), Positives = 157/364 (43%), Gaps = 33/364 (9%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV---QCYKQVKPIYNPASSSSYKELS 79
E+V+ +GTP I DTGSDL WVQC PC C+ Q P+++P+ SS+Y +
Sbjct: 148 EFVVAVGLGTPAQPSAL-IFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVH 206
Query: 80 CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
C QC + C Y Y D S T GVL+ + + S+ FGCG
Sbjct: 207 CGEPQCAAAGGLCSEDNTTCLYLVHYGDGSSTTGVLSRDTLAL-TSSRALAGFPFGCGTR 265
Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
N G F + L+GLGR LSL SQ + GA FSYCL + +S T + G
Sbjct: 266 NLGDFGRVDG-LLGLGRGELSLPSQAAASFGA-VFSYCL---PSSNSTTGYLTIGATPAT 320
Query: 200 SGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
G T+++ K ++YFV L I +G ++P ++G +D+G
Sbjct: 321 DTGAAQYTAMLRKPQFPSFYFVELVSIDIGGY-----ILPV--PPAVFTRGGTLLDSGTV 373
Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG----IAPILTAHFDGGA 314
T LP Y L ++ R ++ P CY AG I P ++ F GA
Sbjct: 374 LTYLPAQAYELLRDRFRLTMERYTPAPPNDVLDACYD---FAGESEVIVPAVSFRFGDGA 430
Query: 315 --KVPLIHTSTFIPPPVEGVFCFAMQPIDGD---VGIFGNFAQSDLFIGYDFDSQMVSFK 369
++ F+ E V C A +D + I GN Q + YD ++ + F
Sbjct: 431 VFELDFFGVMIFLD---ENVGCLAFAAMDAGGLPLSIIGNTQQRSAEVIYDVAAEKIGFV 487
Query: 370 PTDC 373
P C
Sbjct: 488 PASC 491
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 102/371 (27%), Positives = 159/371 (42%), Gaps = 31/371 (8%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
YV++ +G+P + + DT +D W C PC C ++ PA+SSSY L C S
Sbjct: 78 SYVVRAGLGSPSQQLLLAL-DTSADATWAHCSPCGTCPSSS--LFAPANSSSYASLPCSS 134
Query: 83 EQCHLLDTVSCSSQQ-------------LCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
C L +C + Q C ++ +AD+S + LA++ + G +
Sbjct: 135 SWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASF-QAALASDTLRLG--KDAI 191
Query: 130 DNVVFGCGHNNTG-VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
N FGC + TG N GL+GLGR ++L SQ S L FSYCL P + +
Sbjct: 192 PNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGS-LYNGVFSYCL-PSYRSYYFS 249
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
+ G G + L + + Y+V + G+SVG+ K+ + A +
Sbjct: 250 GSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGHA--WVKVPAGSFAFDAATG 307
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSM-AGIAPIL 306
+D+G T Y L E+ R + P LG+ C+ T + AG AP +
Sbjct: 308 AGTVVDSGTVITRWTAPVYAALREEFRRQVA-APSGYTSLGAFDTCFNTDEVAAGGAPAV 366
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM----QPIDGDVGIFGNFAQSDLFIGYDFD 362
T H DGG + L +T I + C AM Q ++ V + N Q ++ + +D
Sbjct: 367 TVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVA 426
Query: 363 SQMVSFKPTDC 373
+ V F C
Sbjct: 427 NSRVGFAKESC 437
>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
Length = 487
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 107/348 (30%), Positives = 139/348 (39%), Gaps = 26/348 (7%)
Query: 34 PLLDIYGIVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDT- 90
P+L +DT DL W+QC PC +CY Q +++P S + + C S C L
Sbjct: 158 PILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRY 217
Query: 91 -VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEM 149
CS+ Q C Y Y D T G + +T N + N FGC H G F+ +
Sbjct: 218 GAGCSNNQ-CQYFVDYGDGRATSGTYMVDALTL-NPSTVVMNFRFGCSHAVRGNFSASTS 275
Query: 150 GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSL 209
G + LG R SL SQ + G N FSYC VP + S S +G T L
Sbjct: 276 GTMSLGGGRQSLLSQTAATFG-NAFSYC-VPDPSSSGFLSLGGPADGGGAG--RFARTPL 331
Query: 210 VSKED--KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFY 267
V T Y V L GI VG + P + GA+ ++ I T LP Y
Sbjct: 332 VRNPSIIPTLYLVRLRGIEVGG--RRLNVPPVVFAGGAVMDSSVII------TQLPPTAY 383
Query: 268 NRLEEQVRNAIKLTPY-QDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFI 325
L R+A+ P R G CY + P ++ FDGGA V L
Sbjct: 384 RALRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVM- 442
Query: 326 PPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
VEG F P D +G GN Q + YD V F+ C
Sbjct: 443 ---VEGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 487
>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 417
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 110/414 (26%), Positives = 172/414 (41%), Gaps = 59/414 (14%)
Query: 9 PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP--CVQC---YKQV 63
P+ + +S +Y + F++G+ P I +DTGSDL+W C P C+ C +
Sbjct: 4 PSPSRRQPISNRESDYTLSFNLGSHPSQSITLYMDTGSDLVWFPCAPFECILCEGKFNAT 63
Query: 64 KPI----------YNPASSSSYKELS----CQSEQCHL--LDTVSCSSQQLCNYTYGYAD 107
KP+ +PA S+++ +S C +C L ++T CSS + Y Y D
Sbjct: 64 KPLNITRSHRVSCQSPACSTAHSSVSSHDLCAIARCPLDNIETSDCSSATCPPFYYAYGD 123
Query: 108 SSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQI-- 165
S + R T S F N FGC H G+ G GR LSL +Q+
Sbjct: 124 GSF---IAHLHRDTLSMSQLFLKNFTFGCAHTALA----EPTGVAGFGRGLLSLPAQLAT 176
Query: 166 LSQLGANKFSYCLVPFHTDSSITSK---MYFGNGSEVSGGGV--VSTSLVSKEDKTYYF- 219
LS N+FSYCLV D K + G+ + S V V TS++ +Y++
Sbjct: 177 LSPNLGNRFSYCLVSHSFDKERVRKPSPLILGHYDDYSSERVEFVYTSMLRNPKHSYFYC 236
Query: 220 VTLEGISVGNLSN-SSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYN----RLEEQV 274
V L GISVG + + +++ + G G + +D+G T+LP YN + +V
Sbjct: 237 VGLTGISVGKRTILAPEMLRRVDRRG---DGGVVVDSGTTFTMLPASLYNSVVAEFDRRV 293
Query: 275 RNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG--- 331
K + + G CY + + P +T HF G ++ + ++G
Sbjct: 294 GRVHKRASEVEEKTGLGPCYFLEGLVEV-PTVTWHFLGNNSNVMLPRMNYFYEFLDGEDE 352
Query: 332 ----VFCFAMQPIDGDV-------GIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
V C + D I GN+ Q + YD ++Q V F C
Sbjct: 353 ARRKVGCLMLMNGGDDTELSGGPGAILGNYQQQGFEVVYDLENQRVGFAKRQCA 406
>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
Length = 434
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 113/405 (27%), Positives = 177/405 (43%), Gaps = 67/405 (16%)
Query: 24 YVMKFSIGTPP-LLDIYGIVDTGSDLMWVQC----LPCVQC--YKQ-------------- 62
Y++ ++GTPP ++ +Y +DTGSDL WV C C+ C Y+
Sbjct: 29 YLISLNLGTPPKVIQVY--MDTGSDLTWVPCGNLSFDCMDCNDYRNNKLMSTYSPSYSSS 86
Query: 63 ------VKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLC-NYTYGYADSSLTKGVL 115
V P+ + SS C C L V + + C ++ Y Y + G L
Sbjct: 87 SLRDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTL 146
Query: 116 ATERIT-FGNSNNF---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQI-LSQLG 170
+ +T G+S +F N FGC G +G+ G GR LSL SQ+ Q G
Sbjct: 147 TRDTLTTHGSSPSFTREVPNFCFGC----VGSTYREPIGIAGFGRGVLSLPSQLGFLQKG 202
Query: 171 ANKFSYCLV--PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISV 227
FS+C + F + +I+S + G+ + S + TSL+ YY++ LE I+V
Sbjct: 203 ---FSHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITV 259
Query: 228 GNLS--NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKL--TPY 283
GN + + ++S G G M ID+G T LP FY +L +++ I
Sbjct: 260 GNATAIQVPSSLREFDSHG---NGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQE 316
Query: 284 QDPRLGSQLCYKTPSMAGIA-------PILTAHFDGGAKVPLIHTSTF----IPPPVEGV 332
Q+ R G LCY+ P + P ++ HF + L + F P V
Sbjct: 317 QEARTGFDLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNSTVV 376
Query: 333 FCFAMQPID----GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
C +Q +D G G+FG+F Q ++ + YD + + + F+P DC
Sbjct: 377 KCLLLQNMDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 421
>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 417
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 113/406 (27%), Positives = 177/406 (43%), Gaps = 67/406 (16%)
Query: 24 YVMKFSIGTPP-LLDIYGIVDTGSDLMWVQC----LPCVQC--YKQ-------------- 62
Y++ ++GTPP ++ +Y +DTGSDL WV C C+ C Y+
Sbjct: 12 YLISLNLGTPPKVIQVY--MDTGSDLTWVPCGNLSFDCMDCNDYRNNKLMSTYSPSYSSS 69
Query: 63 ------VKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLC-NYTYGYADSSLTKGVL 115
V P+ + SS C C L V + + C ++ Y Y + G L
Sbjct: 70 SLRDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTL 129
Query: 116 ATERIT-FGNSNNF---FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQI-LSQLG 170
+ +T G+S +F N FGC G +G+ G GR LSL SQ+ Q G
Sbjct: 130 TRDTLTTHGSSPSFTREVPNFCFGC----VGSTYREPIGIAGFGRGVLSLPSQLGFLQKG 185
Query: 171 ANKFSYCLV--PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISV 227
FS+C + F + +I+S + G+ + S + TSL+ YY++ LE I+V
Sbjct: 186 ---FSHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITV 242
Query: 228 GNLS--NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKL--TPY 283
GN + + ++S G G M ID+G T LP FY +L +++ I
Sbjct: 243 GNATAIQVPSSLREFDSHG---NGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQE 299
Query: 284 QDPRLGSQLCYKTPSMAGIA-------PILTAHFDGGAKVPLIHTSTF----IPPPVEGV 332
Q+ R G LCY+ P + P ++ HF + L + F P V
Sbjct: 300 QEARTGFDLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNSTVV 359
Query: 333 FCFAMQPID----GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
C +Q +D G G+FG+F Q ++ + YD + + + F+P DC
Sbjct: 360 KCLLLQNMDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDCA 405
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 93/338 (27%), Positives = 143/338 (42%), Gaps = 23/338 (6%)
Query: 41 IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVS--CSSQ 96
++D+ SD+ WVQC+PC C+ QV Y+P+ S + SC S C L + C++
Sbjct: 32 VLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTALGPYANGCANN 91
Query: 97 QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGR 156
Q C Y Y D S T G + +T ++ N FGC H G F+ G++ LG
Sbjct: 92 Q-CQYLVRYPDGSSTSGAYIADLLTL-DAGNAVSGFKFGCSHAEQGSFDARAAGIMALGG 149
Query: 157 TRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKT 216
SL SQ S+ G N FSYC+ +DS G S VV+ + ++ T
Sbjct: 150 GPESLLSQTASRYG-NAFSYCIPATASDSGF---FTLGVPRRASSRYVVTPMVRFRQAAT 205
Query: 217 YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRN 276
+Y V L I+VG + P ++G++ +D+ T LP Y L R+
Sbjct: 206 FYGVLLRTITVGG--QRLGVAPAVFAAGSV------LDSRTAITRLPPTAYQALRAAFRS 257
Query: 277 AIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF 335
++ + P+ CY + I P ++ FD A +PL + F
Sbjct: 258 SMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILF----NDCLAF 313
Query: 336 AMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
D G+ G+ Q + + YD V F+ C
Sbjct: 314 TSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 351
>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 460
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 113/377 (29%), Positives = 166/377 (44%), Gaps = 41/377 (10%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
N+ +N+ +G +++ + GTP +I I+DTGS + W QC CV C + ++
Sbjct: 114 NHAHNNNLFDEDGNFLVDVAFGTP-XTEIXLILDTGSSITWTQCKACVNCLQDSNRYFDS 172
Query: 70 ASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
++SS+Y SC + TV NY Y D S + G + +T ++ F
Sbjct: 173 SASSTYSFGSC------IPSTVE------NNYNMTYGDDSTSVGNYGCDTMTL-EPSDVF 219
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSIT 188
FGCG NN G F G++GLG+ +LS SQ S+ NK FSYCL + SI
Sbjct: 220 QKFQFGCGRNNKGDFGSGVDGMLGLGQGQLSTVSQTASKF--NKVFSYCL---PEEDSIG 274
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSK----EDKTYYFVTLEGISVGNLSNSSKL-IPYYNSS 243
S + FG + + TSLV+ ++ YYFV L ISVGN +L IP S
Sbjct: 275 S-LLFGEKATSQSSSLKFTSLVNGPGTLQESGYYFVNLSDISVGN----ERLNIP----S 325
Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSM 299
+ ID+ T LP+ Y+ L+ + A+ P + R CY
Sbjct: 326 SVFASPGTIIDSRTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGR 385
Query: 300 AGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIG 358
+ P + HF GGA V L T+ C A ++ I GN Q L +
Sbjct: 386 KDVLLPEIVLHFGGGADVRLNGTNIVWGSDAS-RLCLAFAGTS-ELTIIGNRQQLSLTVL 443
Query: 359 YDFDSQMVSFKPTDCTK 375
YD + + F C+K
Sbjct: 444 YDIQGRRIGFGGNGCSK 460
>gi|356528675|ref|XP_003532925.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 342
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 106/375 (28%), Positives = 149/375 (39%), Gaps = 106/375 (28%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
N + +S + NGEY+M+ IGTPP+ + I DTGSD +WVQC PC C
Sbjct: 64 NKLPESILIPNNGEYLMRLYIGTPPVERLV-IADTGSDFIWVQCSPCQNCQ--------- 113
Query: 70 ASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNS 125
C Y YA+ S T V+ TE ++F G
Sbjct: 114 -----------------------------CVYLNIYANKSFTIEVVGTETLSFDSTGGAQ 144
Query: 126 NNFFDNVVFGCGHNNTGVFNENE--MGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHT 183
F N +FGCG NN F ++ GLVGL +LSL SQ+ +Q+G KFSY
Sbjct: 145 TVSFPNSIFGCGANNNLTFRSSDKATGLVGLVAGQLSLVSQLGAQIGY-KFSY------- 196
Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
+ FG+ + ++ GVVST L+ K YF+ LE +++G K++P
Sbjct: 197 -------LKFGSEAIITTNGVVSTPLIIKPSLPLYFLNLEVVTIGQ-----KVVP----- 239
Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA 303
+ + QD + C+ +
Sbjct: 240 -------------------------------TETLGVESVQDLPFPFKFCFPYRDNMTV- 267
Query: 304 PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD---VGIFGNFAQSDLFIGYD 360
P + F GA V L + I + A+ P + IFG AQ D + YD
Sbjct: 268 PAIAFQFT-GASVALRPKNLLIKLQDRNMLXLAVVPSASSLSVISIFGIIAQFDFQVLYD 326
Query: 361 FDSQMVSFKPTDCTK 375
D + VS PTDCTK
Sbjct: 327 LDGKKVSVAPTDCTK 341
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 101/371 (27%), Positives = 158/371 (42%), Gaps = 31/371 (8%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
YV++ +G+P + + DT +D W C PC C ++ PA+SSSY L C S
Sbjct: 80 SYVVRAGLGSPSQQLLLAL-DTSADATWAHCSPCGTCPSSS--LFAPANSSSYASLPCSS 136
Query: 83 EQCHLLDTVSCSSQQ-------------LCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
C L +C + Q C ++ +AD+S + LA++ + G +
Sbjct: 137 SWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASF-QAALASDTLRLG--KDAI 193
Query: 130 DNVVFGCGHNNTG-VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
N FGC + TG N GL+GLGR ++L SQ S L FSYCL P + +
Sbjct: 194 PNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGS-LYNGVFSYCL-PSYRSYYFS 251
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
+ G G + L + + Y+V + G+SVG K+ + A +
Sbjct: 252 GSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGRA--WVKVPAGSFAFDAATG 309
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSM-AGIAPIL 306
+D+G T Y L E+ R + P LG+ C+ T + AG AP +
Sbjct: 310 AGTVVDSGTVITRWTAPVYAALREEFRRQVA-APSGYTSLGAFDTCFNTDEVAAGGAPAV 368
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM----QPIDGDVGIFGNFAQSDLFIGYDFD 362
T H DGG + L +T I + C AM Q ++ V + N Q ++ + +D
Sbjct: 369 TVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVA 428
Query: 363 SQMVSFKPTDC 373
+ + F C
Sbjct: 429 NSRIGFAKESC 439
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 97/348 (27%), Positives = 146/348 (41%), Gaps = 25/348 (7%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ---CYKQVKPIYNPASSSSYKELS 79
EYV+ +G+P + ++DTGSD+ WVQC PC C+ +++PA+SS+Y +
Sbjct: 107 EYVISVGLGSPAVTQRV-VIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFN 165
Query: 80 CQSEQCHLL----DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
C + C L + C ++ C Y Y D S T G +++ +T S + FG
Sbjct: 166 CSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTLSGS-DVVRGFQFG 224
Query: 136 CGHNNTGV-FNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
C H G ++ GL+GLG S SQ ++ G F YCL S +
Sbjct: 225 CSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYG-KSFFYCLPATPASSGFLTLGAPA 283
Query: 195 NGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
+G +T ++ SK+ TYYF LE I+VG L P ++G++ +
Sbjct: 284 SGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGG--KKLGLSPSVFAAGSL------V 335
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDG 312
D+G T LP Y L R + +P C+ + ++ P + F G
Sbjct: 336 DSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPTVALVFAG 395
Query: 313 GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
GA V L G FA D G GN Q + YD
Sbjct: 396 GAVVDLDAHGIV----SGGCLAFAPTRDDKAFGTIGNVQQRTFEVLYD 439
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 100/375 (26%), Positives = 165/375 (44%), Gaps = 25/375 (6%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
S + +Y + +GTP +VDTGS+L WV C + K + ++ S S
Sbjct: 97 SGIDYGTAQYFTEIRVGTPAK-KFRVVVDTGSELTWVNCRYRARG-KDNRRVFRADESKS 154
Query: 75 YKELSCQSEQC-----HLLDTVSCSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNF 128
+K + C ++ C +L +C + C+Y Y YAD S +GV A E IT G +N
Sbjct: 155 FKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGR 214
Query: 129 FDNV---VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
+ + GC + TG + G++GL + S S S GA KFSYCLV ++
Sbjct: 215 MARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGA-KFSYCLVDHLSNK 273
Query: 186 SITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNSS 243
++++ + FG+ +T L +Y + + GIS+G L S++
Sbjct: 274 NVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWD----- 328
Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQV-RNAIKLTPYQDPRLGSQLCYKTPSMAGI 302
A S G +D+G TLL Y ++ + R ++L + + + C+ S +
Sbjct: 329 -ATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNV 387
Query: 303 A--PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG-DVGIFGNFAQSDLFIGY 359
+ P LT H GGA+ H +++ GV C + GN Q + +
Sbjct: 388 SKLPQLTFHLKGGARFE-PHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEF 446
Query: 360 DFDSQMVSFKPTDCT 374
D + +SF P+ CT
Sbjct: 447 DLMASTLSFAPSACT 461
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 108/364 (29%), Positives = 157/364 (43%), Gaps = 33/364 (9%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV---QCYKQVKPIYNPASSSSYKELS 79
E+V+ +GTP I DTGSDL WVQC PC C+ Q P+++P+ SS+Y +
Sbjct: 143 EFVVAVGLGTPAQPSAL-IFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVH 201
Query: 80 CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
C QC + C Y Y D S T GVL+ + + S+ FGCG
Sbjct: 202 CGEPQCAAAGDLCSEDNTTCLYLVRYGDGSSTTGVLSRDTLAL-TSSRALTGFPFGCGTR 260
Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
N G F + L+GLGR LSL SQ + GA FSYCL + +S T + G
Sbjct: 261 NLGDFGRVDG-LLGLGRGELSLPSQAAASFGA-VFSYCL---PSSNSTTGYLTIGATPAT 315
Query: 200 SGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
G T+++ K ++YFV L I +G ++P ++G +D+G
Sbjct: 316 DTGAAQYTAMLRKPQFPSFYFVELVSIDIGGY-----VLPV--PPAVFTRGGTLLDSGTV 368
Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG----IAPILTAHFDGGA 314
T LP Y L ++ R ++ P CY AG + P ++ F GA
Sbjct: 369 LTYLPAQAYALLRDRFRLTMERYTPAPPNDVLDACYD---FAGESEVVVPAVSFRFGDGA 425
Query: 315 --KVPLIHTSTFIPPPVEGVFCFAMQPIDG---DVGIFGNFAQSDLFIGYDFDSQMVSFK 369
++ F+ E V C A +D + I GN Q + YD ++ + F
Sbjct: 426 VFELDFFGVMIFLD---ENVGCLAFAAMDTGGLPLSIIGNTQQRSAEVIYDVAAEKIGFV 482
Query: 370 PTDC 373
P C
Sbjct: 483 PASC 486
>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
Length = 375
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 101/350 (28%), Positives = 151/350 (43%), Gaps = 50/350 (14%)
Query: 41 IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCN 100
IVDTGSDL+W QC + P S ++ + C
Sbjct: 56 IVDTGSDLIWTQCKLSSSTAAAARHGSPPLSRTAPARTGAFTRTC--------------- 100
Query: 101 YTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLS 160
S+ GVLA+E TFG + FGCG + G G++GL LS
Sbjct: 101 -----TASAAAVGVLASETFTFGARRAVSLRLGFGCGALSAGSLI-GATGILGLSPESLS 154
Query: 161 LASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGG----VVSTSLVSKEDKT 216
L ++QL +FSYCL PF TS + FG +++S + +T++VS +T
Sbjct: 155 L----ITQLKIQRFSYCLTPFADKK--TSPLLFGAMADLSRHKTTRPIQTTAIVSNPVET 208
Query: 217 -YYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNMFIDTGAPPTLLPKDFYNRLEE 272
YY+V L GIS+G+ K + +S A+ G +D+G+ L + + ++E
Sbjct: 209 VYYYVPLVGISLGH-----KRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKE 263
Query: 273 QVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-------PILTAHFDGGAKVPLIHTSTFI 325
V + ++L +LC+ P A P L HFDGGA + L + F
Sbjct: 264 AVMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYF- 322
Query: 326 PPPVEGVFCFAM-QPIDGD-VGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
P G+ C A+ + DG V I GN Q ++ + +D SF PT C
Sbjct: 323 QEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 372
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 106/379 (27%), Positives = 164/379 (43%), Gaps = 47/379 (12%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI----YNPASSSSYKE 77
G Y K +GTP D + VDTGSD++WV C C++C ++ + Y+ +SS+ K
Sbjct: 83 GLYFAKIGLGTPSR-DFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDADASSTAKS 141
Query: 78 LSCQSEQCHLLDTVS-CSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNNFFDN- 131
+SC C ++ S C S C Y Y D S T G L + + GN N
Sbjct: 142 VSCSDNFCSYVNQRSECHSGSTCQYVILYGDGSSTNGYLVRDVVHLDLVTGNRQTGSTNG 201
Query: 132 -VVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSS 186
++FGCG +G E++ G++G G++ S SQ+ SQ + F++CL +
Sbjct: 202 TIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCL------DN 255
Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSG 244
F G EV V +T ++SK +Y V L I VGN L SS + G
Sbjct: 256 NNGGGIFAIG-EVVSPKVKTTPMLSKS--AHYSVNLNAIEVGNSVLQLSSDAFDSGDDKG 312
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQDPRLGSQLCYKTPSMAG 301
I ID+G LP YN L Q+ + L QD S C+
Sbjct: 313 VI------IDSGTTLVYLPDAVYNPLMNQILASHQELNLHTVQD----SFTCFHYIDRLD 362
Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ------PIDGDVGIFGNFAQSDL 355
P +T FD + ++ ++ E +CF Q + I G+ A S+
Sbjct: 363 RFPTVTFQFDKSVSLA-VYPQEYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNK 421
Query: 356 FIGYDFDSQMVSFKPTDCT 374
+ YD ++Q++ + +C+
Sbjct: 422 LVVYDIENQVIGWTNHNCS 440
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 100/375 (26%), Positives = 165/375 (44%), Gaps = 25/375 (6%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS 74
S + +Y + +GTP +VDTGS+L WV C + K + ++ S S
Sbjct: 75 SGIDYGTAQYFTEIRVGTPAK-KFRVVVDTGSELTWVNCRYRARG-KDNRRVFRADESKS 132
Query: 75 YKELSCQSEQC-----HLLDTVSCSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNF 128
+K + C ++ C +L +C + C+Y Y YAD S +GV A E IT G +N
Sbjct: 133 FKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGR 192
Query: 129 FDNV---VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
+ + GC + TG + G++GL + S S S GA KFSYCLV ++
Sbjct: 193 MARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGA-KFSYCLVDHLSNK 251
Query: 186 SITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNSS 243
++++ + FG+ +T L +Y + + GIS+G L S++
Sbjct: 252 NVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWD----- 306
Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQV-RNAIKLTPYQDPRLGSQLCYKTPSMAGI 302
A S G +D+G TLL Y ++ + R ++L + + + C+ S +
Sbjct: 307 -ATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNV 365
Query: 303 A--PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG-DVGIFGNFAQSDLFIGY 359
+ P LT H GGA+ H +++ GV C + GN Q + +
Sbjct: 366 SKLPQLTFHLKGGARFE-PHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEF 424
Query: 360 DFDSQMVSFKPTDCT 374
D + +SF P+ CT
Sbjct: 425 DLMASTLSFAPSACT 439
>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 101/344 (29%), Positives = 146/344 (42%), Gaps = 30/344 (8%)
Query: 41 IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD--TVSCSSQ 96
I+D+GSD+ WVQC PC C++Q P+++PA S++Y + C S C L CS+
Sbjct: 171 IIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCSAN 230
Query: 97 QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG-VFNENEMGLVGLG 155
C + Y D S G + + +T G + FGC H + G F+ + G + LG
Sbjct: 231 AQCQFGINYGDGSTATGTYSFDDLTLG-PYDVIRGFRFGCAHADRGSAFDYDVAGSLALG 289
Query: 156 RTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG--GGVVSTSLVSKE 213
SL Q ++ G FSYCL P T SS+ + G E + VST L+S
Sbjct: 290 GGSQSLVQQTATRYG-RVFSYCLPP--TASSL-GFLVLGVPPERAQLIPSFVSTPLLSSS 345
Query: 214 -DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEE 272
T+Y V L I V + +P A+ + ID+ + LP Y L
Sbjct: 346 MAPTFYRVLLRAIIV---AGRPLAVP-----PAVFSASSVIDSSTIISRLPPTAYQALRA 397
Query: 273 QVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEG 331
R+A+ + P CY + I P + FDGGA V L +
Sbjct: 398 AFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL------ 451
Query: 332 VFCFAMQPIDGDV--GIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
C A P D G GN Q L + YD ++ + F+ C
Sbjct: 452 GSCLAFAPTASDRMPGFIGNVQQKTLEVVYDVPAKAMRFRTAAC 495
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 101/383 (26%), Positives = 170/383 (44%), Gaps = 50/383 (13%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
GEY +G+P I IVDTGS+L W+QCLPC C V IY+ A S+SY+ ++C
Sbjct: 98 GEYYTSIKLGSPGQEAIL-IVDTGSELTWLQCLPCKVCAPSVDTIYDAARSASYRPVTCN 156
Query: 82 SEQCHLLDTVS------CSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNNFFDN 131
+ Q L S C+ C + Y D S + G L+T+ + G +
Sbjct: 157 NSQ--LCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQD 214
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
FGC + + G++GL +++L Q+ + G KFS+C + + T +
Sbjct: 215 FAFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGW-KFSHCFPDRSSHLNSTGVV 273
Query: 192 YFGNGSEVSGGGVVSTSLV---SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
+FGN +E+ V TS+ S+ + +Y V L+G+S+ NS +L+ + G++
Sbjct: 274 FFGN-AELPHEQVQYTSVALTNSELQRKFYHVALKGVSI----NSHELV--FLPRGSV-- 324
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ------LCYKTPS---- 298
+ +D+G+ + + F+++L E +K P L C+K +
Sbjct: 325 --VILDSGSSFSSFVRPFHSQLREAF---LKHRPPSLKHLEGDSFGDLGTCFKVSNDDID 379
Query: 299 -MAGIAPILTAHFDGGAKVPLIHTSTFIPPPV---EGVFCFAMQPIDGD---VGIFGNFA 351
+ P L+ F+ G + + +P CFA + DG V + GN+
Sbjct: 380 ELHRTLPSLSLVFEDGVTIGIPSIGVLLPVARFQNHVKMCFAFE--DGGPNPVNVIGNYQ 437
Query: 352 QSDLFIGYDFDSQMVSFKPTDCT 374
Q +L++ YD V F C
Sbjct: 438 QQNLWVEYDIQRSRVGFARASCV 460
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 106/405 (26%), Positives = 168/405 (41%), Gaps = 57/405 (14%)
Query: 13 VQSNVSTANGEYVMKFSIGTP--PLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP----- 65
+ S T G+Y ++F +GTP P L + DTGSDL WV+C + P
Sbjct: 86 LTSGAYTGIGQYFVRFRVGTPAQPFLLV---ADTGSDLTWVKCRRPASANSSLSPADSGP 142
Query: 66 ----IYNPASSSSYKELSCQSEQCHL---LDTVSCSSQ-QLCNYTYGYADSSLTKGVLAT 117
+ P S ++ +SC S+ C +C + C Y Y Y D S +G + T
Sbjct: 143 GPGRAFRPEDSRTWAPISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGT 202
Query: 118 ERITFGNSNN-----FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGAN 172
E T S +V GC + TG E G++ LG + +S AS S+ G
Sbjct: 203 ESATIALSGREERKAKLKGLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFG-G 261
Query: 173 KFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTS-------------LVSKEDKTYYF 219
+FSYCLV + + TS + FG VS +S L+ + + +Y
Sbjct: 262 RFSYCLVDHLSPRNATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYD 321
Query: 220 VTLEGISV-GNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI 278
V+L+ ISV G + + + G + +D+G T+L K Y + + +
Sbjct: 322 VSLKAISVAGEFLKIPRAVWDVEAGGGV-----ILDSGTSLTVLAKPAYRAVVAALSKGL 376
Query: 279 KLTPY--QDPRLGSQLCYKTPSMAG-----IAPILTAHFDGGAKVPLIHTSTFIPPPVEG 331
P DP + CY S +G P + HF G A++ S ++ G
Sbjct: 377 AGLPRVTMDP---FEYCYNWTSPSGKDADVAVPKMAVHFAGAARLEPPGKS-YVIDAAPG 432
Query: 332 VFCFAMQ--PIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
V C +Q P G + + GN Q + +D ++ + F+ + CT
Sbjct: 433 VKCIGLQEGPWPG-ISVIGNILQQEHLWEFDIKNRRLKFQRSRCT 476
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 108/385 (28%), Positives = 168/385 (43%), Gaps = 52/385 (13%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
N + ++G+PP + ++DTGS+L W+ C + + ++NP SSSSY + C
Sbjct: 37 NVTLTVSLTVGSPPQ-QVTMVLDTGSELSWLHC----KKSPNLTSVFNPLSSSSYSPIPC 91
Query: 81 QSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
S C L + V+C ++LC+ YAD+S +G LA++ G+S +FG
Sbjct: 92 SSPVCRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSSA--LPGTLFG 149
Query: 136 C---GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
C G ++ + GL+G+ R LS ++QLG KFSYC+ DSS +
Sbjct: 150 CMDSGFSSNSEEDAKTTGLMGMNRGSLSF----VTQLGLPKFSYCIS--GRDSS--GVLL 201
Query: 193 FGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA- 245
FG+ G + T LV D+ Y V L+GI VGN K++P S A
Sbjct: 202 FGDSHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGN-----KILPLPKSIFAP 256
Query: 246 --ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCYKTP 297
G +D+G T L Y L + K L P DP Q LCY+ P
Sbjct: 257 DHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYRVP 316
Query: 298 SMAGIA--PILTAHFDGGAKV----PLIHTSTFIPPPVEGVFCFAMQPIDG---DVGIFG 348
+ + P ++ F G V L++ + E V+C D + + G
Sbjct: 317 AGGKLPELPAVSLMFRGAEMVVGGEVLLYKVPGMMKGKEWVYCLTFGNSDLLGIEAFVIG 376
Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDC 373
+ Q ++++ +D V F T C
Sbjct: 377 HHHQQNVWMEFDLVKSRVGFVETRC 401
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 109/386 (28%), Positives = 171/386 (44%), Gaps = 71/386 (18%)
Query: 22 GEYVMKFSIGTPPLLDIYGI-VDTGSDLMWVQCLPCVQC--YKQVK-PI--YNPASSSSY 75
G Y + +GTPP Y + VDTGSDL+WV C PC+ C + +K PI Y+ +S+S
Sbjct: 34 GLYFTQVQLGTPP--RTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASS 91
Query: 76 KELSCQSEQCHLLDTVS---CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
++ C C L+ +S C+ Q C Y++ Y D S T G L + + + N V
Sbjct: 92 SKVPCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHY--MVNATATV 149
Query: 133 VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSIT 188
+FGCG +G + +E G++G G + LS SQ+ Q N F++CL
Sbjct: 150 IFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCL---------- 199
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTY---------YFVTLEGISV--GNLSNSSKLI 237
+G E GGG++ V + D Y Y V L+ ISV NL+ KL
Sbjct: 200 ------DGGE-RGGGILVLGNVIEPDIQYTPLVPYMYHYNVVLQSISVNNANLTIDPKLF 252
Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP 297
S + +G +F D+G LP + Y + V + D RL S+ YK
Sbjct: 253 -----SNDVMQGTIF-DSGTTLAYLPDEAYQAFTQAVSLVVAPFLLCDTRL-SRFIYK-- 303
Query: 298 SMAGIAPILTAHFDGGAKVP-----LIHTSTFIPPPVEGVFCFAMQPI-----DGDVGIF 347
+ P + +F+G + LI ++ P ++C Q + + IF
Sbjct: 304 ----LFPNVVLYFEGASMTLTPAEYLIRQASAANAP---IWCMGWQSMGSAESELQYTIF 356
Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDC 373
G+ + + YD + + ++P DC
Sbjct: 357 GDLVLKNKLVVYDLERGRIGWRPFDC 382
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 106/385 (27%), Positives = 173/385 (44%), Gaps = 69/385 (17%)
Query: 22 GEYVMKFSIGTPPLLDIYGI-VDTGSDLMWVQCLPCVQC--YKQVK-PI--YNPASSSSY 75
G Y + +GTPP Y + VDTGSDL+WV C PC+ C + +K PI Y+ +S+S
Sbjct: 34 GLYFTQVQLGTPP--RTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASS 91
Query: 76 KELSCQSEQCHLLDTVS---CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
++ C C L+ +S C+ Q C Y++ Y D S T G L + + + N V
Sbjct: 92 SKVPCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHY--MVNATATV 149
Query: 133 VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSIT 188
+FGCG +G + +E G++G G + LS SQ+ Q N F++CL
Sbjct: 150 IFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCL---------- 199
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDK--------TYYFVTLEGISV--GNLSNSSKLIP 238
+G E GG +V +++ + + ++Y V L+ ISV NL+ KL
Sbjct: 200 ------DGGERGGGILVLGNVIEPDIQYTPLVPYMSHYNVVLQSISVNNANLTIDPKLF- 252
Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS 298
S + +G +F D+G LP + Y + V + D RL S+ YK
Sbjct: 253 ----SNDVMQGTIF-DSGTTLAYLPDEAYQAFTQAVSLVVAPFLLCDTRL-SRFIYK--- 303
Query: 299 MAGIAPILTAHFDGGAKVP-----LIHTSTFIPPPVEGVFCFAMQPI-----DGDVGIFG 348
+ P + +F+G + LI ++ P ++C Q + + IFG
Sbjct: 304 ---LFPNVVLYFEGASMTLTPAEYLIRQASAANAP---IWCMGWQSMGSAESELQYTIFG 357
Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDC 373
+ + + YD + + ++P DC
Sbjct: 358 DLVLKNKLVVYDLERGRIGWRPFDC 382
>gi|14532550|gb|AAK64003.1| AT3g61820/F15G16_210 [Arabidopsis thaliana]
Length = 362
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 66/171 (38%), Positives = 94/171 (54%), Gaps = 8/171 (4%)
Query: 12 VVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
V S +S +GEY M+ +GTP ++Y ++DTGSD++W+QC PC CY Q I++P
Sbjct: 123 AVISGLSQGSGEYFMRLGVGTPAT-NVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKK 181
Query: 72 SSSYKELSCQSEQCHLLDTVS-CSSQQ--LCNYTYGYADSSLTKGVLATERITFGNSNNF 128
S ++ + C S C LD S C +++ C Y Y D S T+G +TE +TF +
Sbjct: 182 SKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGAR-- 239
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
D+V GCGH+N G+F L LS SQ ++ KFSYCLV
Sbjct: 240 VDHVPLGCGHDNEGLFVGAAGLLGLGR-GGLSFPSQTKNRYNG-KFSYCLV 288
>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 66/169 (39%), Positives = 95/169 (56%), Gaps = 11/169 (6%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSY 75
S +G Y +K G+P IVDTGS L W+QC PCV C+ Q P+++P++S +Y
Sbjct: 111 ASIGSGNYYVKVGFGSPARYYSM-IVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTY 169
Query: 76 KELSCQSEQCHLL------DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
K LSC S QC L + + +S +C YT Y DSS + G L+ + +T S
Sbjct: 170 KSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQT-L 228
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL 178
V+GCG ++ G+F G++GLGR +LS+ Q+ S+ G FSYCL
Sbjct: 229 PGFVYGCGQDSDGLFGR-AAGILGLGRNKLSMLGQVSSKFGY-AFSYCL 275
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 105/356 (29%), Positives = 155/356 (43%), Gaps = 32/356 (8%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
EYV+ IG+P + ++DTGSD+ WV+C +++P+ S++Y SC S
Sbjct: 128 EYVITVGIGSPAVTQTM-MIDTGSDVSWVRC-----NSTDGLTLFDPSKSTTYAPFSCSS 181
Query: 83 EQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
C L + CS+ C Y Y D S T G +++ + S+ D FGC H+
Sbjct: 182 AACAQLGNNGDGCSNSG-CQYRVQYGDGSNTTGTYSSDTLALSASDTVTD-FHFGCSHHE 239
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
E GL+GLG SL SQ + G FSYCL P + S + FG + S
Sbjct: 240 EDFDGEKIDGLMGLGGDAQSLVSQTAATYG-KSFSYCLPPTNRTSGF---LTFGAPNGTS 295
Query: 201 GGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPT 260
GG V + L + T Y V L+ ISVG P +S G++ +D+G T
Sbjct: 296 GGFVTTPMLRWPKAPTLYGVLLQDISVGG-------TPLGIQPSVLSNGSV-MDSGTVIT 347
Query: 261 LLPKDFYNRLEEQVRNAI-KLTPYQDPRLGS-QLCYKTPSMAGIA-PILTAHFDGGAKVP 317
LP+ Y+ L R+++ +L + LG CY + ++ P ++ DGGA V
Sbjct: 348 WLPRRAYSALSSAFRSSMTRLRHQRAAPLGILDTCYDFTGLVNVSIPAVSLVLDGGAVVD 407
Query: 318 LIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
L I C A GD I GN Q + +D + F+ C
Sbjct: 408 LDGNGIMIQD------CLAFAATSGD-SIIGNVQQRTFEVLHDVGQGVFGFRSGAC 456
>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
Length = 483
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 110/402 (27%), Positives = 170/402 (42%), Gaps = 62/402 (15%)
Query: 24 YVMKFSIGTPP-LLDIYGIVDTGSDLMWVQC----LPCVQC--YKQVK------------ 64
Y++ SIGTPP ++ +Y +DTGSDL W C C++C Y+ +
Sbjct: 80 YLISLSIGTPPQVIQVY--MDTGSDLTWAPCGNISFDCIECDNYRNNRMMASFSPSHSSS 137
Query: 65 --------PIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLC---NYTYG---YADSSL 110
P SS C C L V + C YTYG +L
Sbjct: 138 SHRDSCTSPFCIDVHSSDNPLDPCTMAGCSLSTLVKATCSWPCPPFAYTYGAGGVVTGTL 197
Query: 111 TKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLG 170
T+ L G + FGC ++ +G+ G GR LSL SQ+
Sbjct: 198 TRDTLRVHGRNLGVTQEI-PRFCFGCVASSY----REPIGIAGFGRGALSLPSQL--GFL 250
Query: 171 ANKFSYCLVPFH--TDSSITSKMYFGNGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISV 227
FS+C + F + +I+S + G+ + S + T ++ S YY+V LE I+V
Sbjct: 251 RKGFSHCFLAFKYANNPNISSPLIIGDIALTSKDDMQFTPMLKSPMYPNYYYVGLEAITV 310
Query: 228 GNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP- 286
GN+S ++++ ++ G M +D+G T LP+ FY+++ +++ I D
Sbjct: 311 GNVS-ATEVPSSLREFDSLGNGGMLVDSGTTYTHLPEPFYSQVLSVLQSIINYPRATDME 369
Query: 287 -RLGSQLCYKTPSM------AGIAPILTAHFDGGAKVPLIHTSTF----IPPPVEGVFCF 335
R G LCYK P + P +T HF A + L S F P V C
Sbjct: 370 MRTGFDLCYKVPCQNNSILTGDLLPSITFHFLNNASLVLSRGSHFYAMSAPSNSTVVKCL 429
Query: 336 AMQPID----GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
Q +D G G+ G+F Q D+ + YD + + + F+P DC
Sbjct: 430 LFQSMDDGDYGPAGVLGSFQQQDVEVVYDMEKERIGFRPMDC 471
>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 114/379 (30%), Positives = 173/379 (45%), Gaps = 30/379 (7%)
Query: 9 PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
P + V + ++ SIG PP ++Y ++DTGSDL W+QC PC CYKQ PIYN
Sbjct: 91 PADFVPPPLIRDKSAFLANLSIGNPPT-NVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYN 149
Query: 69 PASSSSYKELSCQSEQCHLLDTV-SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
S SY E+ C C L CS C Y YAD S T G+L+ E++ F + +
Sbjct: 150 RTKSDSYTEMLCNEPPCLSLGREGQCSDSGSCLYQTSYADGSRTSGLLSYEKVAFTSHYS 209
Query: 128 FFDN---VVFGCGHNNTG-VFNENEMGLVGLGRTRLSLASQILSQLG--ANKFSYCLVPF 181
D V FGCG N V + + G++GLG +SL SQ LS +G + F+YC
Sbjct: 210 DEDKTAQVGFGCGLQNLNFVTSSRDGGVLGLGPGLVSLVSQ-LSAIGKVSKSFAYCFGNL 268
Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
++ + + FG+ + ++G T +V E +Y+V L GI +G + N
Sbjct: 269 -SNPNAGGFLVFGDATYLNGD---MTPMVIAE---FYYVNLLGIGLGVEEPRLDI----N 317
Query: 242 SSGAISK----GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY--K 295
SS K G + ID+G+ ++ P + Y + V + +K P S C+ K
Sbjct: 318 SSSFERKPDGSGGVIIDSGSTLSIFPPEVYEVVRNAVVDKLKKGYNISPLTSSPDCFEGK 377
Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDL 355
+ P L + + + S F+ E +FC +G + I G AQ
Sbjct: 378 IGRDLPLFPTLVLYLESTGILN-DRWSIFLQRYDE-LFCLGFTSGEG-LSIIGTLAQQSY 434
Query: 356 FIGYDFDSQMVSFKPT-DC 373
GY+ + +S + DC
Sbjct: 435 KFGYNLELSTLSIESNPDC 453
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 110/404 (27%), Positives = 176/404 (43%), Gaps = 65/404 (16%)
Query: 9 PNNVVQSNVSTANGEYVMKFS--------IGTPPLLDIYGIVDTGSDLMWVQCLPCVQCY 60
PNN Q+ + N ++ K+S IGTPP ++DTGS L W+QC +
Sbjct: 53 PNNP-QNKTPSYNYKFSFKYSMALIINLPIGTPPQTQPM-VLDTGSQLSWIQC------H 104
Query: 61 KQVKPI--YNPASSSSYKELSCQSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKG 113
K+ P ++P+ SS++ L C C SC +LC+Y+Y YAD + +G
Sbjct: 105 KKQPPTASFDPSLSSTFSILPCTHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEG 164
Query: 114 VLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK 173
L E+ TF S + ++ GC +T + G++G+ RLS A Q S++ K
Sbjct: 165 NLVREKFTFSRSVS-TPPLILGCATEST-----DPRGILGMNLGRLSFAKQ--SKI--TK 214
Query: 174 FSYCLVPFHTDSSI--TSKMYFGNGSEVSGGGVVSTSLVSKE-----DKTYYFVTLEGIS 226
FSYC+ P T T Y GN G V S++ D Y + + GI
Sbjct: 215 FSYCVPPRQTRPGFTPTGSFYLGNNPSSKGFKYVGMMTSSRQRMPNFDPLAYTIPMVGIR 274
Query: 227 V-GNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD 285
+ G N S + ++ G+ G ID+G+ T L + Y+++ QV A+
Sbjct: 275 IAGKKLNISPAVFRADAGGS---GQTMIDSGSEFTYLVSEAYDKVRAQVVRAV------G 325
Query: 286 PRLG--------SQLCYKTPSMAGIAPI---LTAHFDGGAKVPLIHTSTFIPPPVEGVFC 334
PRL + +C+ + I + + F+ G +V +I + GV C
Sbjct: 326 PRLKKGYVYGGVADMCFDSVKAVEIGRLIGEMVFEFERGVEV-VIPKERVLADVGGGVHC 384
Query: 335 FAMQPID---GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
+ D I GNF Q +L++ +D + V F DC++
Sbjct: 385 VGIGSSDKLGAASNIIGNFHQQNLWVEFDLVRRRVGFGKADCSR 428
>gi|222631382|gb|EEE63514.1| hypothetical protein OsJ_18330 [Oryza sativa Japonica Group]
Length = 464
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 109/396 (27%), Positives = 171/396 (43%), Gaps = 72/396 (18%)
Query: 41 IVDTGSDLMWVQCLPC----------VQCYKQVKPIYNPASSSSYKELSCQSEQCHLL-- 88
+VDTGSDL+W QC C C+ Q P YN + S + + + C + L
Sbjct: 77 VVDTGSDLVWTQCSTCRLPAVAAAGGGGCFPQNLPYYNFSLSRTARAVPCDDDDGALCGV 136
Query: 89 --DTVSC-----SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
+T C S C Y + + GVL T+ TF +S++ + FGC
Sbjct: 137 APETAGCARGGGSGDDACVVAASYG-AGVALGVLGTDAFTFPSSSSV--TLAFGCVSQTR 193
Query: 142 ---GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
G N G++GLGR LSL +SQL A +FSYCL P+ D+ S ++ G+G
Sbjct: 194 ISPGALN-GASGIIGLGRGALSL----VSQLNATEFSYCLTPYFRDTVSPSHLFVGDGEL 248
Query: 199 VSGGGV----------VSTSLVSKEDK-----TYYFVTLEGISVGNLSNS--SKLIPYYN 241
V+T +K K T+Y++ L G++ GN + + +
Sbjct: 249 AGLRAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGLAAGNATVALPAGAFDLRE 308
Query: 242 SSGAISKGNMFIDTGAPPTLL----PKDFYNRLEEQVRNAIKLTPYQDPRLGS--QLCYK 295
++ + G ID+G+P T L + L Q+R + L P +LG +LC +
Sbjct: 309 AAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVP-PPAKLGGALELCVE 367
Query: 296 T----PSMAGIA-PILTAHFD---GGAKVPLIHTSTFIPPPVEGVFCFAM---------Q 338
S+A A P L FD GG + +I + +C A+
Sbjct: 368 AGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVEASTWCMAVVSSASGNATL 427
Query: 339 PIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
P + + I GNF Q D+ + YD + ++SF+P +C+
Sbjct: 428 PTN-ETTIIGNFMQQDMRVLYDLANGLLSFQPANCS 462
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 112/380 (29%), Positives = 171/380 (45%), Gaps = 32/380 (8%)
Query: 9 PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYN 68
P + V + ++ SIG PP ++Y ++DTGSDL W+QC PC CYKQ PIYN
Sbjct: 78 PADFVPPPLIRDKSAFLANLSIGNPPT-NVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYN 136
Query: 69 PASSSSYKELSCQSEQC-HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
S SY E+ C C L CS C Y YAD + T G+L+ E++ F + +
Sbjct: 137 RTKSDSYTEMLCNEPPCVSLGREGQCSDSGSCLYQTAYADGARTSGLLSYEKVAFTSHYS 196
Query: 128 FFDN---VVFGCGHNNTGVFNEN-EMGLVGLGRTRLSLASQILSQLG--ANKFSYCLVPF 181
D V FGCG N N + G++GLG +SL SQ LS +G + F+YC
Sbjct: 197 DEDKTAQVGFGCGLQNLNFITSNRDGGVLGLGPGLVSLVSQ-LSAIGKVSKSFAYCFGNI 255
Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG----NLS-NSSKL 236
++ + + FG+ + ++G T +V E +Y+V L GI +G L NSS
Sbjct: 256 -SNPNAGGFLVFGDATYLNGD---MTPMVIAE---FYYVNLLGIGLGVGEPRLDINSSSF 308
Query: 237 IPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY-- 294
+ SG + ID+G+ ++ P + Y + V + +K P S C+
Sbjct: 309 ERKPDGSGGV-----IIDSGSTLSVFPPEVYEVVRNAVVDKLKKGYNISPLTSSPDCFEG 363
Query: 295 KTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
K + P L + + L + + +FC +G + I G AQ
Sbjct: 364 KIERDLPLFPTLVLYLESTGI--LNDRWSIFLQRYDELFCLGFTSGEG-LSIIGTLAQQS 420
Query: 355 LFIGYDFDSQMVSFKPT-DC 373
GY+ + +S + DC
Sbjct: 421 YKFGYNLELSTLSIESNPDC 440
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 104/382 (27%), Positives = 160/382 (41%), Gaps = 46/382 (12%)
Query: 26 MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQC 85
M+ IGTPP ++ +VDT S+L WVQ C C P +NP SSS+ C S C
Sbjct: 1 MQTKIGTPPR-EVLLLVDTASELTWVQGTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVC 59
Query: 86 HLLDTVSCSSQQLCNYTYG-------YADSSLTKGVLATERI---TFGNSNNFFDNVVFG 135
L Q CN + G Y D S GV+A E ++ + + +V+FG
Sbjct: 60 --LGRSKLGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFG 117
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLG---ANKFSYCLVPFHTDSSITSKMY 192
C + + G +GL R S +QI S+ +++FSYC + + +
Sbjct: 118 CASKDLQRPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSSGVII 177
Query: 193 FGNGSEVSGGGVVSTSLVSKEDK-------TYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
FG+ SG +S E + +Y+V L+GISVG +L+ S+
Sbjct: 178 FGD----SGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGG-----ELLHIPRSAFK 228
Query: 246 ISK---GNMFIDTGAPPTLLPKDFYNRLEEQV-RNAIKLTPYQDPRLGSQLCYKTPSMAG 301
I + G + D+G + L + + L E R + L +LCY +
Sbjct: 229 IDRLGNGGTYFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAAGDA 288
Query: 302 ---IAPILTAHFDGGAKVPLIHTSTFIP----PPVEGV---FCFAMQPIDGDVGIFGNFA 351
AP++T HF + L S ++P P V + F A G V + GN+
Sbjct: 289 RLPTAPLVTLHFKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQ 348
Query: 352 QSDLFIGYDFDSQMVSFKPTDC 373
Q D I +D + + F P +C
Sbjct: 349 QQDYLIEHDLERSRIGFAPANC 370
>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
Length = 447
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 55/145 (37%), Positives = 82/145 (56%), Gaps = 7/145 (4%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
V S + +GEY +GTP + ++DTGSDL+W+QC PC +CY Q +++P S
Sbjct: 75 VFSGIPFESGEYFALVGVGTPSTKAML-VIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRS 133
Query: 73 SSYKELSCQSEQCHLLDTVSCSSQQL----CNYTYGYADSSLTKGVLATERITFGNSNNF 128
S+Y+ + C S QC L C S C Y Y D S + G LAT+++ F N + +
Sbjct: 134 STYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFAN-DTY 192
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVG 153
+NV GCG +N G+F ++ GL+G
Sbjct: 193 VNNVTLGCGRDNEGLF-DSAAGLLG 216
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 112/367 (30%), Positives = 162/367 (44%), Gaps = 41/367 (11%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
+G +++ + GTPP I+DTGS + W QC PCV+C K + ++P++S +Y SC
Sbjct: 159 DGNFLVDVAFGTPPQ-KFTLILDTGSSITWTQCKPCVRCLKASRRHFDPSASLTYSLGSC 217
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
+ TV + Y Y D S + G + +T +S + F FGCG NN
Sbjct: 218 ------IPSTVGNT------YNMTYGDKSTSVGNYGCDTMTLEHS-DVFPKFQFGCGRNN 264
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
G F G++GLG+ +LS SQ S+ FSYCL + SI S + FG +
Sbjct: 265 EGDFGSGADGMLGLGQGQLSTVSQTASKF-KKVFSYCL---PEEDSIGS-LLFGEKATSQ 319
Query: 201 GGGVVSTSLVSK------EDKTYYFVTLEGISVGNLSNSSKL-IPYYNSSGAISKGNMFI 253
+ TSLV+ E+ YYFV L ISVGN +L IP S + I
Sbjct: 320 SSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGN----KRLNIP----SSVFASPGTII 371
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSMAGI-APILTA 308
D+G T LP+ Y+ L+ + A+ P + R CY + P +
Sbjct: 372 DSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVL 431
Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
HF GA V L + I C A + ++ I GN Q L + YD + F
Sbjct: 432 HFGEGADVRL-NGKRVIWGNDASRLCLAFAG-NSELTIIGNRQQVSLTVLYDIQGGRIGF 489
Query: 369 KPTDCTK 375
C+K
Sbjct: 490 GGNGCSK 496
>gi|358345193|ref|XP_003636666.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502601|gb|AES83804.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 161
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 61/126 (48%), Positives = 80/126 (63%), Gaps = 5/126 (3%)
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ-DPRLGSQLCYKTPSMAGIAPILTAH 309
M ID+G P ++LP+DF++RL EQVR + L P DP LG QLCY+TP+ P L AH
Sbjct: 1 MLIDSGTPISILPEDFFHRLLEQVRKKVALEPMPFDPSLGYQLCYRTPTNLK-GPTLVAH 59
Query: 310 FDGGAKVPLIHTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
F+G A V L T FIP G+FCFA + G +G++ QS+ IG+D + Q+VSF
Sbjct: 60 FEG-ADVLLTPTQIFIPVQY-GIFCFAFTSSFSNEYGTYGSYVQSNYLIGFDLEKQVVSF 117
Query: 369 KPTDCT 374
K TDCT
Sbjct: 118 KATDCT 123
>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
gi|194689376|gb|ACF78772.1| unknown [Zea mays]
gi|224031455|gb|ACN34803.1| unknown [Zea mays]
gi|238011528|gb|ACR36799.1| unknown [Zea mays]
gi|238015454|gb|ACR38762.1| unknown [Zea mays]
Length = 304
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 94/320 (29%), Positives = 141/320 (44%), Gaps = 40/320 (12%)
Query: 78 LSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVV---- 133
+ C C + SC C Y Y Y D ++T GV ATER TF +S
Sbjct: 1 MRCAGTLCSDILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPL 60
Query: 134 -FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
FGCG N G N N G+VG GR LSL +SQL +FSYCL + S S +
Sbjct: 61 GFGCGSVNVGSLN-NGSGIVGFGRNPLSL----VSQLSIRRFSYCLTSYA--SRRQSTLL 113
Query: 193 FGNGSEVSGG---GVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
FG+ S+ G G V T+ L S ++ T+Y+V G++VG ++ + S+ A+
Sbjct: 114 FGSLSDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVG-----ARRLRIPESAFALR 168
Query: 248 ---KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYKTPSMAG 301
G + +D+G TLLP + R ++L P+ +P G +C+ P+
Sbjct: 169 PDGSGGVIVDSGTALTLLPAAVLAEVVRAFRQQLRL-PFANGGNPEDG--VCFLVPAAWR 225
Query: 302 IA--------PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQS 353
+ P + HF GA + L + + G C + D GN Q
Sbjct: 226 RSSSTSQMPVPRMVLHFQ-GADLDLPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQ 284
Query: 354 DLFIGYDFDSQMVSFKPTDC 373
D+ + YD +++ +S P C
Sbjct: 285 DMRVLYDLEAETLSIAPARC 304
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 98/379 (25%), Positives = 164/379 (43%), Gaps = 33/379 (8%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
+ S + G+Y +K +GTP + + + DTGSDL WV+C + ++ P +S
Sbjct: 105 MSSGAYSGTGQYFVKLRVGTP-VQEFTLVADTGSDLTWVKCAGASPPGR----VFRPKTS 159
Query: 73 SSYKELSCQSEQCHL---LDTVSCSS-QQLCNYTYGYADSSL-TKGVLATERITF---GN 124
S+ + C S+ C L +CSS C Y Y Y + S +G++ TE T G
Sbjct: 160 RSWAPIPCSSDTCKLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGG 219
Query: 125 SNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
+VV GC ++ G + G++ LG ++S A+Q ++ G + FSYCLV
Sbjct: 220 KVAQLKDVVLGCSSSHDGQSFRSADGVLSLGNAKISFATQAAARFGGS-FSYCLVDHLAP 278
Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
+ T + FG G +V T L + +Y V ++ I V + K +
Sbjct: 279 RNATGYLAFGPG-QVPRTPATQTKLFLDPEMPFYGVKVDAIHV-----AGKALDIPAEVW 332
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD-PRLGSQLCY----KTPSM 299
G + +D+G T+L Y + + + P P + CY + P
Sbjct: 333 DAKSGGVILDSGNTLTVLAAPAYKAVVAALSKHLDGVPKVSFPPF--EHCYNWTARRPGA 390
Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD---VGIFGNFAQSDLF 356
I P L F G A++ S ++ GV C +Q +G+ + + GN Q +
Sbjct: 391 PEIIPKLAVQFAGSARLEPPAKS-YVIDVKPGVKCIGVQ--EGEWPGLSVIGNIMQQEHL 447
Query: 357 IGYDFDSQMVSFKPTDCTK 375
+D + V FK ++CT+
Sbjct: 448 WEFDLKNMQVRFKQSNCTR 466
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 104/389 (26%), Positives = 169/389 (43%), Gaps = 41/389 (10%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV-----KPIY 67
+ S T G+Y ++ +GTP + + DTGSDL WV+C + ++
Sbjct: 93 LTSGAYTGTGQYFVRLRVGTPAQPFVL-VADTGSDLTWVKCSSPSSSSSSPAASPPQRVF 151
Query: 68 NPASSSSYKELSCQSEQCHL---LDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFG 123
PA S S+ L C S+ C +CSS C+Y Y Y D+S +GV+ + T
Sbjct: 152 RPAGSKSWSPLPCDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVGLDSATVS 211
Query: 124 NSNN------FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
S N VV GC + G ++ G++ LG + +S AS+ S+ G +FSYC
Sbjct: 212 LSGNDGTRKAKLQEVVLGCTTSYDGQSFKSSDGVLSLGNSNISFASRAASRFG-GRFSYC 270
Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVV--STSLVSKED---KTYYFVTLEGISVG--NL 230
LV + TS + FGNG G T LV ED + +YFV+++ ++V L
Sbjct: 271 LVDHLAPRNATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERL 330
Query: 231 SNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY--QDPRL 288
+ + + GAI +D+G T+L Y+ + + + P DP
Sbjct: 331 EILPDVWDFRKNGGAI------LDSGTSLTILATPAYDAVVKAISKQFAGVPRVNMDP-- 382
Query: 289 GSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG---DVG 345
+ CY ++ P + F G A + S ++ GV C + ++G V
Sbjct: 383 -FEYCYNWTGVSAEIPRMELRFAGAATLAPPGKS-YVIDTAPGVKCIGV--VEGAWPGVS 438
Query: 346 IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ GN Q + +D ++ + FK + C
Sbjct: 439 VIGNILQQEHLWEFDLANRWLRFKQSRCA 467
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 102/384 (26%), Positives = 169/384 (44%), Gaps = 52/384 (13%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
GEY +G+P I IVDTGS+L W++CLPC C V IY+ A S SYK ++C
Sbjct: 98 GEYYTSIKLGSPGQEAIL-IVDTGSELTWLKCLPCKVCAPSVDTIYDAARSVSYKPVTCN 156
Query: 82 SEQCHLLDTVS------CSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNNFFDN 131
+ Q L S C+ C + Y D S + G L+T+ + G +
Sbjct: 157 NSQ--LCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQD 214
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
FGC + + G++GL +++L Q+ + G KFS+C + + T +
Sbjct: 215 FAFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGW-KFSHCFPDRSSHLNSTGVV 273
Query: 192 YFGNGSEVSGGGVVSTSLV---SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
+FGN +E+ V TS+ S+ + +Y V L+G+S+ NS +L+ + +
Sbjct: 274 FFGN-AELPHEQVQYTSVALTNSELQRKFYHVALKGVSI----NSHELV-------LLPR 321
Query: 249 GNMFI-DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ------LCYKTPS--- 298
G++ I D+G+ + + F+++L E +K P L C+K +
Sbjct: 322 GSVVILDSGSSFSSFVRPFHSQLREAF---LKHRPPSLKHLEGDSFGDLGTCFKVSNDDI 378
Query: 299 --MAGIAPILTAHFDGGAKVPLIHTSTFIPPPV---EGVFCFAMQPIDGD---VGIFGNF 350
+ P L+ F+ G + + +P CFA + DG V + GN+
Sbjct: 379 DELHRTLPSLSLVFEDGVTIGIPSIGVLLPVARYQNHVKMCFAFE--DGGPNPVNVIGNY 436
Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
Q +L++ YD V F C
Sbjct: 437 QQQNLWVEYDIQRSRVGFARASCV 460
>gi|242091057|ref|XP_002441361.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
gi|241946646|gb|EES19791.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
Length = 439
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 116/435 (26%), Positives = 186/435 (42%), Gaps = 82/435 (18%)
Query: 11 NVVQSNVSTANGEYVMKFSIGTPP-LLDIYGIVDTGSDLMWVQC-----LPCVQCYKQVK 64
++++ + +G Y++ ++GTPP + +Y +DTGSDL WV C C+ C VK
Sbjct: 13 DIIEPVTAYTDG-YLLSLNLGTPPQVFQVY--LDTGSDLTWVPCGSSSSYQCLDCGSSVK 69
Query: 65 PI--YNPASSSSYKELSCQSEQC-------HLLDTVSCSSQQLCNYT------------Y 103
P + P+ S+S C S C + D + + + +T Y
Sbjct: 70 PTPTFLPSESTSNTRDLCGSRFCVDVHSSDNRFDPCAAAGCAIPAFTGGQCPRPCPPFSY 129
Query: 104 GYADSSLTKGVLATERITFGNSNN-----------FFDNVVFGCGHNNTGVFNENEMGLV 152
Y +L G L+ + +T S + F FGC G +G+
Sbjct: 130 TYGGGALVLGSLSRDSVTLHGSTHGSGAGAGPLPVAFPGFGFGC----VGSSIREPLGIA 185
Query: 153 GLGRTRLSLASQILSQLGANKFSYCLVPFH--TDSSITSKMYFGN----GSEVSGGGVVS 206
G GR LSL SQ L LG FS+C + F + + TS + G+ + GG V +
Sbjct: 186 GFGRGALSLPSQ-LGFLG-KGFSHCFLGFRFARNPNFTSPLVMGDLALSSASTDGGFVFT 243
Query: 207 TSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN--MFIDTGAPPTLLPK 264
L S +Y+V LEG+ +G+ S + + SG ++GN + +DTG T LP
Sbjct: 244 PMLTSATYPNFYYVGLEGVVLGDDDGGSAMAAPPSLSGIDAQGNGGVLVDTGTTYTQLPD 303
Query: 265 DFYNRLEEQVRNAIKLTPYQ-----DPRLGSQLCYKTP-SMAGIA----PILTAHFDGGA 314
FY + + +A PY+ + R G LC+K P + A A P +T H GGA
Sbjct: 304 PFYASVLASLISAAP--PYERSRDLEARTGFDLCFKVPCARAPCADDELPPITLHLAGGA 361
Query: 315 KVPLIHTSTFIPPPVEG----VFCFAMQPIDGD-----------VGIFGNFAQSDLFIGY 359
++ L S++ P V C Q ++ + + G+F ++ + Y
Sbjct: 362 RLALPKLSSYYPVTAIRDSVVVKCLLFQRMEMEDDGDGTSGGGPAAVLGSFQMQNVEVVY 421
Query: 360 DFDSQMVSFKPTDCT 374
D + V F+P DC
Sbjct: 422 DLAAGRVGFRPRDCA 436
>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
Length = 486
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 107/402 (26%), Positives = 175/402 (43%), Gaps = 61/402 (15%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQC----LPCVQC--YKQVK------------- 64
Y++ +IGTPP + I ++DTGSDL WV C C++C Y+ K
Sbjct: 82 YLISLNIGTPPQV-IQVLMDTGSDLTWVPCGNLSFDCMECDDYRNNKLMATFSPSYSSSS 140
Query: 65 -------PIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLC-NYTYGYADSSLTKGVLA 116
P SS +C C L V + + C ++ Y Y + G+L
Sbjct: 141 YRASCASPFCIDIHSSDNPLDTCTVAGCSLSTLVKATCSRPCPSFAYTYGAGGVVTGILT 200
Query: 117 TERITFGNSN----NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQI-LSQLGA 171
+ + S+ FGC G +G+ G GR LS+ SQ+ Q G
Sbjct: 201 RDTLRVNGSSPGVAKEIPKFCFGC----VGSAYREPIGIAGFGRGTLSMVSQLGFLQKG- 255
Query: 172 NKFSYCLVPFH--TDSSITSKMYFGNGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVG 228
FS+C + F + +I+S + G+ + S + T ++ S +Y+V LE I+VG
Sbjct: 256 --FSHCFLAFKYANNPNISSPLVVGDIALTSKDDMQFTPMLNSPMYPNFYYVGLEAITVG 313
Query: 229 NLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDP 286
N+S ++++ ++ G M ID+G T LP+ FY+++ +++ I +
Sbjct: 314 NVS-ATEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQVLSILQSTINYPRDTGMEM 372
Query: 287 RLGSQLCYKTP-------SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG----VFCF 335
+ G LCYK P + + P +T HF + L + F P G V C
Sbjct: 373 QTGFDLCYKVPRPNNNTLTSDDLLPSITFHFLNNVSLVLPQGNHFYPVSAPGNPAVVKCL 432
Query: 336 AMQPID----GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
Q D G G+FG+F Q ++ + YD + + + F+P DC
Sbjct: 433 MFQSTDDGDDGPAGVFGSFQQQNVEVVYDLEKERIGFQPMDC 474
>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
Length = 409
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 94/343 (27%), Positives = 147/343 (42%), Gaps = 27/343 (7%)
Query: 41 IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD--TVSCSSQ 96
I+D+GSD+ WVQC PC + C+ Q P+++PA+S++Y + C S C L C +
Sbjct: 84 IIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLGPYRRGCLAN 143
Query: 97 QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG-VFNENEMGLVGLG 155
C + YA+ + G +++ +T G + +FGC H + G F+ + G + LG
Sbjct: 144 SQCQFGITYANGATATGTYSSDDLTLG-PYDVVRGFLFGCAHADQGSTFSYDVAGTLALG 202
Query: 156 RTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG--GGVVSTSLVSKE 213
S Q SQ + FSYC+ P +S + FG + + VST L+S
Sbjct: 203 GGSQSFVQQTASQY-SRVFSYCVPP---STSSFGFIMFGVPPQRAALVPTFVSTPLLSSS 258
Query: 214 --DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLE 271
T+Y V L I V + +P + + + ID+ + +P Y L
Sbjct: 259 TMSPTFYRVLLRSIIVAG-----RPLPVPPT---VFSASSVIDSATVISRIPPTAYQALR 310
Query: 272 EQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVE 330
R+A+ + P CY + I P + FDGGA V L + +
Sbjct: 311 AAFRSAMTMYRPAPPVSILDTCYDFSGVRSITLPSIALVFDGGATVNLDAAGILL----Q 366
Query: 331 GVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
G FA D G GN Q L + YD + + F+ C
Sbjct: 367 GCLAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 409
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 110/408 (26%), Positives = 170/408 (41%), Gaps = 60/408 (14%)
Query: 13 VQSNVSTANGEYVMKFSIGTP--PLLDIYGIVDTGSDLMWVQCLPCVQCYKQV------- 63
+ S T G+Y ++F +GTP P L + DTGSDL WV+C P
Sbjct: 84 LTSAAYTGIGQYFVRFRVGTPAQPFLLV---ADTGSDLTWVKCRPAKAAAASTNSSSSAS 140
Query: 64 ----KPIYNPASSSSYKELSCQSEQCHLLDTVSCSS----QQLCNYTYGYADSSLTKGVL 115
+ + P S ++ + C S+ C S S+ C Y Y Y D S +G +
Sbjct: 141 ASSPRRAFRPEKSKTWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTV 200
Query: 116 ATERITFG-----------NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQ 164
TE T +V GC + TG E G++ LG + +S AS
Sbjct: 201 GTESATIALSSSSSSSKNKVKKAKLQGLVLGCTGSYTGPSFEASDGVLSLGYSNVSFASH 260
Query: 165 ILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS-------GGGVVSTSLV-SKEDKT 216
S+ G +FSYCLV + + TS + FG S +S G G T LV +
Sbjct: 261 AASRFG-GRFSYCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRP 319
Query: 217 YYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVR 275
+Y V+++ ISV G L + + + G G + +D+G T+L K Y + +
Sbjct: 320 FYDVSIKAISVDGELLKIPRDV--WEVDGG---GGVIVDSGTSLTVLAKPAYRAVVAALG 374
Query: 276 NAIKLTPY--QDPRLGSQLCYK--TPSMAGIA---PILTAHFDGGAKVPLIHTSTFIPPP 328
+ P DP + CY +PS P L HF G A++ + +++
Sbjct: 375 KKLARFPRVAMDP---FEYCYNWTSPSRKDEGDDLPKLAVHFAGSARLE-PPSKSYVIDA 430
Query: 329 VEGVFCFAMQ--PIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
GV C +Q P G + + GN Q + +D ++ + FK + CT
Sbjct: 431 APGVKCIGVQEGPWPG-ISVIGNILQQEHLWEFDLKNRRLRFKRSRCT 477
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 105/378 (27%), Positives = 163/378 (43%), Gaps = 54/378 (14%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
+NG Y + IGTPP + IVDTGS + +V C C QC K P + P SSSYK L
Sbjct: 76 SNGYYTTRLWIGTPPQ-EFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSSSYKALK 134
Query: 80 CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGH 138
C + C+ D +LC Y YA+ S + GVL+ + I+FGN + VFGC +
Sbjct: 135 CNPD-CNCDD-----EGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLTPQRAVFGCEN 188
Query: 139 NNTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNG 196
TG +F++ G++GLGR +LS+ Q++ + + + FS C G
Sbjct: 189 VETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCY----------------GG 232
Query: 197 SEVSGGGVV-------STSLVSKED---KTYYFVTLEGISVGNLSNSSKLIP-YYNSSGA 245
EV GG +V + + S D YY + L+ + V S KL P +N
Sbjct: 233 MEVGGGAMVLGKISPPAGMVFSHSDPFRSPYYNIDLKQMHVAG--KSLKLNPKVFN---- 286
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCY-----KTP 297
K +D+G PK+ + +++ + I K DP +C+
Sbjct: 287 -GKHGTVLDSGTTYAYFPKEAFIAIKDAIIKEIPSLKRIHGPDPNY-DDVCFSGAGRDVA 344
Query: 298 SMAGIAPILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLF 356
+ P + F G K+ L F V G +C + P + G +
Sbjct: 345 EIHNFFPEIDMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTL 404
Query: 357 IGYDFDSQMVSFKPTDCT 374
+ YD ++ + F T+C+
Sbjct: 405 VTYDRENDKLGFLKTNCS 422
>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
thaliana]
gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 491
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 111/419 (26%), Positives = 179/419 (42%), Gaps = 66/419 (15%)
Query: 11 NVVQSNVSTANGEYVMKFSIGTPP-LLDIYGIVDTGSDLMWVQC----LPCVQCYK---- 61
+VV + Y++ +IGTPP + +Y +DTGSDL WV C C++CY
Sbjct: 70 DVVMEPLREVRDGYLITLNIGTPPQAVQVY--LDTGSDLTWVPCGNLSFDCIECYDLKNN 127
Query: 62 --QVKPIYNPASSSSYKELSCQSEQC---HLLD-------TVSCSSQQLC---------N 100
+ +++P SS+ SC S C H D CS L +
Sbjct: 128 DLKSPSVFSPLHSSTSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPS 187
Query: 101 YTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLS 160
+ Y Y + L G+L R FGC T + E +G+ G GR LS
Sbjct: 188 FAYTYGEGGLISGILT--RDILKARTRDVPRFSFGCV---TSTYRE-PIGIAGFGRGLLS 241
Query: 161 LASQILSQLGANKFSYCLVPFH--TDSSITSKMYFGNGS---EVSGGGVVSTSLVSKEDK 215
L SQ+ FS+C +PF + +I+S + G + ++ + L +
Sbjct: 242 LPSQL--GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPMYP 299
Query: 216 TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVR 275
Y++ LE I++G +++ + G M +D+G T LP+ FY++L ++
Sbjct: 300 NSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQ 359
Query: 276 NAIK--LTPYQDPRLGSQLCYKTP-----------SMAGIAPILTAHFDGGAKVPLIHTS 322
+ I + R G LCYK P + I P +T HF A + L +
Sbjct: 360 STITYPRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLLPQGN 419
Query: 323 TF--IPPPVEG--VFCFAMQPI-DGD---VGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+F + P +G V C Q + DGD G+FG+F Q ++ + YD + + + F+ DC
Sbjct: 420 SFYAMSAPSDGSVVQCLLFQNMEDGDYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDC 478
>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
SURVIVAL 1; Flags: Precursor
gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 453
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 109/376 (28%), Positives = 166/376 (44%), Gaps = 51/376 (13%)
Query: 34 PLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI--YNPASSSSYKELSCQSEQCH----- 86
P +I ++DTGS+L W++C P+ ++P SSSY + C S C
Sbjct: 82 PPQNISMVIDTGSELSWLRC----NRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRD 137
Query: 87 LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNE 146
L SC S +LC+ T YAD+S ++G LA E FGNS N N++FGC + +G E
Sbjct: 138 FLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTN-DSNLIFGCMGSVSGSDPE 196
Query: 147 NE---MGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGG 203
+ GL+G+ R LS +SQ+G KFSYC+ TD + G+ +
Sbjct: 197 EDTKTTGLLGMNRGSLSF----ISQMGFPKFSYCIS--GTD-DFPGFLLLGDSNFTWLTP 249
Query: 204 VVSTSLVSKE------DKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
+ T L+ D+ Y V L GI V G L K + + +GA G +D+G
Sbjct: 250 LNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGA---GQTMVDSG 306
Query: 257 APPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCYKTPSM---AGI---AP 304
T L Y L N LT Y+DP Q LCY+ + +GI P
Sbjct: 307 TQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLP 366
Query: 305 ILTAHFDGGAKV----PLIHTSTFIPPPVEGVFCFAMQPID---GDVGIFGNFAQSDLFI 357
++ F+G PL++ + + V+CF D + + G+ Q +++I
Sbjct: 367 TVSLVFEGAEIAVSGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWI 426
Query: 358 GYDFDSQMVSFKPTDC 373
+D + P +C
Sbjct: 427 EFDLQRSRIGLAPVEC 442
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 98/346 (28%), Positives = 144/346 (41%), Gaps = 36/346 (10%)
Query: 41 IVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVS--CSSQ 96
++DT SD+ WVQC PC CY Q +Y+P SSS SC S C L + C++
Sbjct: 147 VLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCTNN 206
Query: 97 QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFN--ENEMGLVGL 154
C Y Y D + T G ++ +T + + FGC H G F+ + G++ L
Sbjct: 207 NQCQYRVRYPDGTSTAGTYISDLLTITPATA-VRSFQFGCSHGVQGSFSFGSSAAGIMAL 265
Query: 155 GRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG-SEVSGGGVVSTSLVSKE 213
G SL SQ + G FS+C P T + +F G V+ V T ++
Sbjct: 266 GGGPESLVSQTAATYG-RVFSHCFPP------PTRRGFFTLGVPRVAAWRYVLTPMLKNP 318
Query: 214 --DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLE 271
T+Y V LE I+V + P ++GA +D+ T LP Y L
Sbjct: 319 AIPPTFYMVRLEAIAVAG--QRIAVPPTVFAAGAA------LDSRTAITRLPPTAYQALR 370
Query: 272 EQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA----PILTAHFDGGAKVPLIHTSTFIPP 327
+ R+ + + P+ CY MAG+ P +T FD A V L +
Sbjct: 371 QAFRDRMAMYQPAPPKGPLDTCYD---MAGVRSFALPRITLVFDKNAAVELDPSGVLF-- 425
Query: 328 PVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+G F P D GI GN L + Y+ + +V F+ C
Sbjct: 426 --QGCLAFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 469
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 104/379 (27%), Positives = 165/379 (43%), Gaps = 47/379 (12%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI----YNPASSSSYKE 77
G Y K +GTP D + VDTGSD++WV C C++C ++ + Y+ +SS+ K
Sbjct: 83 GLYFAKIGLGTPSR-DFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDVDASSTAKS 141
Query: 78 LSCQSEQCHLLDTVS-CSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNNFFDN- 131
+SC C ++ S C S C Y Y D S T G L + + GN N
Sbjct: 142 VSCSDNFCSYVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNG 201
Query: 132 -VVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSS 186
++FGCG +G E++ G++G G++ S SQ+ SQ + F++CL +
Sbjct: 202 TIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCL------DN 255
Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYYNSSG 244
F G EV V +T ++SK +Y V L I VGN L SS + G
Sbjct: 256 NNGGGIFAIG-EVVSPKVKTTPMLSKS--AHYSVNLNAIEVGNSVLELSSNAFDSGDDKG 312
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQLCYKTPSMAG 301
I ID+G LP YN L ++ + + L Q+ S C+
Sbjct: 313 VI------IDSGTTLVYLPDAVYNPLLNEILASHPELTLHTVQE----SFTCFHYTDKLD 362
Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ------PIDGDVGIFGNFAQSDL 355
P +T FD + ++ ++ E +CF Q + I G+ A S+
Sbjct: 363 RFPTVTFQFDKSVSLA-VYPREYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNK 421
Query: 356 FIGYDFDSQMVSFKPTDCT 374
+ YD ++Q++ + +C+
Sbjct: 422 LVVYDIENQVIGWTNHNCS 440
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 98/346 (28%), Positives = 144/346 (41%), Gaps = 36/346 (10%)
Query: 41 IVDTGSDLMWVQCLPCVQ--CYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVS--CSSQ 96
++DT SD+ WVQC PC CY Q +Y+P SSS SC S C L + C++
Sbjct: 172 VLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCTNN 231
Query: 97 QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFN--ENEMGLVGL 154
C Y Y D + T G ++ +T + + FGC H G F+ + G++ L
Sbjct: 232 NQCQYRVRYPDGTSTAGTYISDLLTITPATA-VRSFQFGCSHGVQGSFSFGSSAAGIMAL 290
Query: 155 GRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG-SEVSGGGVVSTSLVSKE 213
G SL SQ + G FS+C P T + +F G V+ V T ++
Sbjct: 291 GGGPESLVSQTAATYG-RVFSHCFPP------PTRRGFFTLGVPRVAAWRYVLTPMLKNP 343
Query: 214 --DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLE 271
T+Y V LE I+V + P ++GA +D+ T LP Y L
Sbjct: 344 AIPPTFYMVRLEAIAVAG--QRIAVPPTVFAAGAA------LDSRTAITRLPPTAYQALR 395
Query: 272 EQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA----PILTAHFDGGAKVPLIHTSTFIPP 327
+ R+ + + P+ CY MAG+ P +T FD A V L +
Sbjct: 396 QAFRDRMAMYQPAPPKGPLDTCYD---MAGVRSFALPRITLVFDKNAAVELDPSGVLF-- 450
Query: 328 PVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+G F P D GI GN L + Y+ + +V F+ C
Sbjct: 451 --QGCLAFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 494
>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
Japonica Group]
Length = 377
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 92/316 (29%), Positives = 146/316 (46%), Gaps = 48/316 (15%)
Query: 19 TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKEL 78
++ G YV F+IGTPP + +VD +L+W QC PC C++Q P+++P SS+++ L
Sbjct: 52 SSQGLYVANFTIGTPPQ-PVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGL 110
Query: 79 SCQSEQCHLLDTVS--CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
C S C + S C+S +C Y + T G T+ G + + + FGC
Sbjct: 111 PCGSHLCESIPESSRNCTS-DVCIYE-APTKAGDTGGKAGTDTFAIGAAK---ETLGFGC 165
Query: 137 GHNNTGVFNENEM-------GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
V + + G+VGLGRT SL ++Q+ FSYCL +
Sbjct: 166 -----VVMTDKRLKTIGGPSGIVGLGRTPWSL----VTQMNVTAFSYCLA-----GKSSG 211
Query: 190 KMYFGNGSEVSGGG-------VVSTSLVSKEDKT--YYFVTLEGISVGNLSNSSKLIPYY 240
++ G ++ GG V+ TS S ++ + YY V L GI G +
Sbjct: 212 ALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAP-----LQAA 266
Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
+SSG+ + +DT + + L Y L++ + A+ + P P LC+ ++A
Sbjct: 267 SSSGS----TVLLDTVSRASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFPK-AVA 321
Query: 301 GIAPILTAHFDGGAKV 316
G AP L FDGGA +
Sbjct: 322 GDAPELVFTFDGGAAL 337
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 102/379 (26%), Positives = 171/379 (45%), Gaps = 44/379 (11%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK------PIYNPASSSSY 75
G Y K +GTPP+ Y VDTGSD+ W+ C PC C + + Y+P+ SS+
Sbjct: 35 GLYYTKIYLGTPPV-GYYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYDPSRSSTD 93
Query: 76 KELSCQSEQCHLL---DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD-- 130
LSC+ C + VSC+S C Y+ Y D S T+G + +TF +N
Sbjct: 94 GALSCRDSNCGAALGSNEVSCTSAGYCAYSTTYGDGSSTQGYFIQDVMTFQEIHNNTQVN 153
Query: 131 ---NVVFGCGHNNTG--VFNENEM-GLVGLGRTRLSLASQILSQLG--ANKFSYCLVPFH 182
+V FGCG +G + + + GL+G G+ +S+ SQ L+ +G N+F++CL
Sbjct: 154 GTASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQ-LASMGKVGNRFAHCL---Q 209
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
D+ + G+ SE + + T +VS+ +Y V ++ I+V + ++ P
Sbjct: 210 GDNQGGGTIVIGSVSEPN---ISYTPIVSRN---HYAVGMQNIAVNGRNVTT---PASFD 260
Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI 302
+ + S G + +D+G L Y + V + + + + QL + S+
Sbjct: 261 TTSTSAGGVIMDSGTTLAYLVDPAYTQFVNAV-STFESSMFSSHSQCLQLAW--CSLQAD 317
Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPV---EGVFCFAMQPIDGDVG-----IFGNFAQSD 354
P + FD GA + L + P+ + +C Q G I G+ D
Sbjct: 318 FPTVKLFFDAGAVMNLTPRNYLYSQPLQNGQAAYCMGWQKSTTKAGYLSYSILGDIVLKD 377
Query: 355 LFIGYDFDSQMVSFKPTDC 373
+ YD D+++V +K DC
Sbjct: 378 HLVVYDNDNRVVGWKSFDC 396
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 104/378 (27%), Positives = 162/378 (42%), Gaps = 54/378 (14%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
+NG Y + IGTPP + IVDTGS + +V C C QC K P + P S+SY+ L
Sbjct: 72 SNGYYTTRLWIGTPPQ-EFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALK 130
Query: 80 CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGH 138
C + C+ D +LC Y YA+ S + GVL+ + I+FGN + VFGC +
Sbjct: 131 CNPD-CNCDD-----EGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCEN 184
Query: 139 NNTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNG 196
TG +F++ G++GLGR +LS+ Q++ + + + FS C G
Sbjct: 185 EETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCY----------------GG 228
Query: 197 SEVSGGGVVSTSL-------VSKED---KTYYFVTLEGISVGNLSNSSKLIP-YYNSSGA 245
EV GG +V + S D YY + L+ + V S KL P +N
Sbjct: 229 MEVGGGAMVLGKISPPPGMVFSHSDPFRSPYYNIDLKQMHVAG--KSLKLNPKVFN---- 282
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCY-----KTP 297
K +D+G PK+ + +++ V I K DP +C+
Sbjct: 283 -GKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNY-DDVCFSGAGRDVA 340
Query: 298 SMAGIAPILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLF 356
+ P + F G K+ L F V G +C + P + G +
Sbjct: 341 EIHNFFPEIAMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTL 400
Query: 357 IGYDFDSQMVSFKPTDCT 374
+ YD ++ + F T+C+
Sbjct: 401 VTYDRENDKLGFLKTNCS 418
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 101/393 (25%), Positives = 171/393 (43%), Gaps = 50/393 (12%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP-----IY 67
+ S T G+Y ++F +GTP + + DTGSDL WV+C P ++
Sbjct: 99 LTSGAYTGTGQYFVQFRVGTPAQPFVL-VADTGSDLTWVKCRGRRASSPDASPLASPRVF 157
Query: 68 NPASSSSYKELSCQSEQCHL---LDTVSCSSQQL----CNYTYGYADSSLTKGVLATERI 120
PA+S S+ + C S+ C +CS+ C Y Y Y D S +GV+ T+
Sbjct: 158 RPANSKSWAPIPCSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSARGVVGTDAA 217
Query: 121 TFGNSNNFFDN------VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKF 174
T S + D VV GC + G ++ G++ LG + +S AS+ ++ G +F
Sbjct: 218 TIALSGSGSDRKAKLQEVVLGCTTSYDGQSFQSSDGVLSLGNSNISFASRAAARFG-GRF 276
Query: 175 SYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVGN--L 230
SYCLV + TS + FG V S + L+ + +Y VT++ +SV L
Sbjct: 277 SYCLVDHLAPRNATSYLTFG---PVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSVAGKAL 333
Query: 231 SNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY--QDPRL 288
+ +++ + GAI +D+G T+L Y + + + P DP
Sbjct: 334 NIPAEVWDVKKNGGAI------LDSGTSLTILATPAYKAVVAALSKQLARVPRVTMDP-- 385
Query: 289 GSQLCY------KTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP-ID 341
+ CY + P++ P L F G A++ T +++ GV C +Q +
Sbjct: 386 -FEYCYNWTATRRPPAV----PRLEVRFAGSARL-RPPTKSYVIDAAPGVKCIGLQEGVW 439
Query: 342 GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
V + GN Q + +D ++ + F+ + C
Sbjct: 440 PGVSVIGNILQQEHLWEFDLANRWLRFQESRCA 472
>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
Length = 334
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 100/336 (29%), Positives = 157/336 (46%), Gaps = 44/336 (13%)
Query: 65 PIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQL-------CNYTYGYADSS----LTKG 113
P+ P SSSS ++C C L CS+ C+Y Y Y ++ T+G
Sbjct: 13 PLLYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEG 72
Query: 114 VLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK 173
+L TE TFG+ F + FGC + G F GLVGLGR +LSL ++QL
Sbjct: 73 ILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGS-GLVGLGRGKLSL----VTQLNVEA 127
Query: 174 FSYCLVPFHTDSSITSKMYFGNGSEVSGG---GVVSTSLVSK---EDKTYYFVTLEGISV 227
F Y L +D S S + FG+ ++V+GG +ST L++ +D +Y+V L GISV
Sbjct: 128 FGYRL---SSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISV 184
Query: 228 GN--LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD 285
G + S + S+GA G + D+G T+LP Y + +++ + + +Q
Sbjct: 185 GGKLVQIPSGTFSFDRSTGA---GGVIFDSGTTLTMLPDPAYTLVRDELLSQMG---FQK 238
Query: 286 PRLGSQ----LCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPV----EGVFCFAM 337
P + +C+ S P + HFDGGA + L T ++P E C+++
Sbjct: 239 PPPAANDDDLICFTGGSSTTTFPSMVLHFDGGADMDL-STENYLPQMQGQNGETARCWSV 297
Query: 338 QPIDGDVGIFGNFAQSDLFIGYDF--DSQMVSFKPT 371
+ I GN Q D + +D +++M+ PT
Sbjct: 298 VKSSQALTIIGNIMQMDFHVVFDLSGNARMLFQPPT 333
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 107/378 (28%), Positives = 167/378 (44%), Gaps = 40/378 (10%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK---PI--YNPASSSSYK 76
G Y + +GTPP D Y +DTGSD++WV C C C P+ ++P SS +
Sbjct: 50 GLYYTRLQLGTPPR-DFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTAS 108
Query: 77 ELSCQSEQCHL----LDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNS--N 126
+SC ++C L D+V + LC Y + Y D S T G ++ + F G S N
Sbjct: 109 LISCSDQRCSLGLQSSDSVCSAQNNLCGYNFQYGDGSGTSGYYVSDLLHFDTVLGGSVMN 168
Query: 127 NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFH 182
N +VFGC TG +++ G+ G G+ +S+ SQ+ SQ + FS+CL
Sbjct: 169 NSSAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCL---K 225
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
D S + G E+ +V T LV + +Y + ++ ISV N L +
Sbjct: 226 GDDSGGGILVLG---EIVEPNIVYTPLVPSQ--PHYNLNMQSISV----NGQTLAIDPSV 276
Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRL--GSQLCYKTPSMA 300
G S ID+G L + Y+ + + + +P P L G+ + S+
Sbjct: 277 FGTSSSQGTIIDSGTTLAYLAEAAYDPFISAITSIV--SPSVRPYLSKGNHCYLISSSIN 334
Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD-VGIFGNFAQSDLF 356
I P ++ +F GGA + LI I G ++C Q I G + I G+ D
Sbjct: 335 DIFPQVSLNFAGGASMILIPQDYLIQQSSIGGAALWCIGFQKIQGQGITILGDLVLKDKI 394
Query: 357 IGYDFDSQMVSFKPTDCT 374
YD +Q + + DC+
Sbjct: 395 FVYDIANQRIGWANYDCS 412
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 104/378 (27%), Positives = 162/378 (42%), Gaps = 54/378 (14%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
+NG Y + IGTPP + IVDTGS + +V C C QC K P + P S+SY+ L
Sbjct: 72 SNGYYTTRLWIGTPPQ-EFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALK 130
Query: 80 CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGH 138
C + C+ D +LC Y YA+ S + GVL+ + I+FGN + VFGC +
Sbjct: 131 CNPD-CNCDD-----EGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCEN 184
Query: 139 NNTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNG 196
TG +F++ G++GLGR +LS+ Q++ + + + FS C G
Sbjct: 185 EETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCY----------------GG 228
Query: 197 SEVSGGGVVSTSL-------VSKED---KTYYFVTLEGISVGNLSNSSKLIP-YYNSSGA 245
EV GG +V + S D YY + L+ + V S KL P +N
Sbjct: 229 MEVGGGAMVLGKISPPPGMVFSHSDPFRSPYYNIDLKQMHVAG--KSLKLNPKVFN---- 282
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCY-----KTP 297
K +D+G PK+ + +++ V I K DP +C+
Sbjct: 283 -GKHGTVLDSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNY-DDVCFSGAGRDVA 340
Query: 298 SMAGIAPILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLF 356
+ P + F G K+ L F V G +C + P + G +
Sbjct: 341 EIHNFFPEIAMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTL 400
Query: 357 IGYDFDSQMVSFKPTDCT 374
+ YD ++ + F T+C+
Sbjct: 401 VTYDRENDKLGFLKTNCS 418
>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 109/376 (28%), Positives = 163/376 (43%), Gaps = 51/376 (13%)
Query: 34 PLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI--YNPASSSSYKELSCQSEQCH----- 86
P +I ++DTGS+L W++C P+ ++P SSSY + C S C
Sbjct: 82 PPQNISMVIDTGSELSWLRC----NRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRD 137
Query: 87 LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNE 146
L SC S +LC+ T YAD+S ++G LA E FGNS N N++FGC + +G E
Sbjct: 138 FLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTN-DSNLIFGCMGSVSGSDPE 196
Query: 147 NE---MGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGG 203
+ GL+G+ R LS +SQ+G KFSYC+ TD + G+ +
Sbjct: 197 EDTKTTGLLGMNRGSLSF----ISQMGFPKFSYCIS--GTD-DFPGFLLLGDSNFTWLTP 249
Query: 204 VVSTSLVSKE------DKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
+ T L+ D+ Y V L GI V G L K + + +GA G +D+G
Sbjct: 250 LNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLLPDHTGA---GQTMVDSG 306
Query: 257 APPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCYKTPSM---AGI---AP 304
T L Y L N LT Y+DP Q LCY+ GI P
Sbjct: 307 TQFTFLLGPVYTALRSDFLNQTNGILTVYEDPEFVFQGTMDLCYRISPFRIRTGILHRLP 366
Query: 305 ILTAHFDGGAKV----PLIHTSTFIPPPVEGVFCFAMQPID---GDVGIFGNFAQSDLFI 357
++ F+G PL++ + + V+CF D + + G+ Q +++I
Sbjct: 367 TVSLVFEGAEIAVSGQPLLYRVPHLTAGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWI 426
Query: 358 GYDFDSQMVSFKPTDC 373
+D + P C
Sbjct: 427 EFDLQRSRIGLAPVQC 442
>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
lyrata]
Length = 293
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 71/155 (45%), Positives = 90/155 (58%), Gaps = 8/155 (5%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKPIYNPASSSSYKELSCQS 82
Y++ IGTP DI + DTGSDL W QC PC+ CY Q +P +NP+SSSSY +SC S
Sbjct: 134 YIVTIGIGTPKH-DISLMFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSSYHNVSCSS 192
Query: 83 EQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG 142
C + SCS+ C Y GY D S+T G LA E+ T NS + D++ FGCG NN G
Sbjct: 193 PMCG--NPESCSASN-CLYGIGYGDGSVTVGFLAKEKFTLTNS-DVLDDIYFGCGENNKG 248
Query: 143 VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
VF G++GLG + S Q + N FSYC
Sbjct: 249 VF-IGSAGILGLGPGKFSFPLQTTTTYN-NIFSYC 281
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 102/383 (26%), Positives = 170/383 (44%), Gaps = 42/383 (10%)
Query: 8 YPNNVVQSNVSTANGEYVM-KFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI 66
Y N+ S + G ++ SIG P + + ++DTGSD++W+ C PC C + +
Sbjct: 84 YNNDYTASVSPSLTGRTILVNLSIGQPSIPQLV-VMDTGSDILWIMCNPCTNCDNHLGLL 142
Query: 67 YNPASSSSYKELS---CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG 123
++P+ SS++ L C + C D + +T Y D+S G + + F
Sbjct: 143 FDPSMSSTFSPLCKTPCGFKGCK-CDPIP--------FTISYVDNSSASGTFGRDILVFE 193
Query: 124 NSN---NFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVP 180
++ + +V+ GCGHN + G++GL SLA+QI KFSYC+
Sbjct: 194 TTDEGTSQISDVIIGCGHNIGFNSDPGYNGILGLNNGPNSLATQI-----GRKFSYCIGN 248
Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
+++ G G+++ G S + +Y+VT+EGISVG L +
Sbjct: 249 LADPYYNYNQLRLGEGADLEG-----YSTPFEVYHGFYYVTMEGISVGEKRLDIALETFE 303
Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD--PRLGSQLCYK--- 295
G + +D+G T L + L +VRN +K + Q +LCY
Sbjct: 304 MKRNG--TGGVILDSGTTITYLVDSAHKLLYNEVRNLLKWSFRQVIFENAPWKLCYYGII 361
Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP---IDGDV--GIFGNF 350
+ + G P++T HF GA + L S F + +FC + P ++ + + G
Sbjct: 362 SRDLVGF-PVVTFHFVDGADLALDTGSFF--SQRDDIFCMTVSPASILNTTISPSVIGLL 418
Query: 351 AQSDLFIGYDFDSQMVSFKPTDC 373
AQ +GYD +Q V F+ DC
Sbjct: 419 AQQSYNVGYDLVNQFVYFQRIDC 441
>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 440
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 100/379 (26%), Positives = 170/379 (44%), Gaps = 49/379 (12%)
Query: 25 VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI-YNPASSSSYKELSCQSE 83
++ IGTPP ++DTGS L W+QC K ++P+ SSS+ L C
Sbjct: 81 IVSLPIGTPPQTQQM-VLDTGSQLSWIQCHKKSVPKKPPPTTSFDPSLSSSFSVLPCNHP 139
Query: 84 QCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGH 138
C +C +LC+Y+Y YAD + +G L E+ITF +S + ++ GC
Sbjct: 140 LCKPRIPDFTLPTTCDQNRLCHYSYFYADGTYAEGSLVREKITFSSSQS-TPPLILGCAE 198
Query: 139 NNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSE 198
+T +E G++G+ R S ASQ +KFSYC+ + ++S F G+
Sbjct: 199 AST-----DEKGILGMNLGRRSFASQA----KISKFSYCVPTRQARAGLSSTGSFYLGNN 249
Query: 199 VSGGGVVSTSLVS--------KEDKTYYFVTLEGISVGNLS-NSSKLIPYYNSSGAISKG 249
+ G +L++ D Y + ++GI +GN N S + + SGA G
Sbjct: 250 PNSGRFQYINLLTFTPSQRSPNLDPLAYTIPMQGIRMGNARLNISATLFRPDPSGA---G 306
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG--------SQLCYKTPSM-- 299
ID+G+ T L + YN++ E+V ++L P+L S +C+ M
Sbjct: 307 QTIIDSGSEFTYLVDEAYNKVREEV---VRLV---GPKLKKGYVYGGVSDMCFDGNPMEI 360
Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLF 356
+ + F+ G ++ +I + GV C + + + I GNF Q +L+
Sbjct: 361 GRLIGNMVFEFEKGVEI-VIDKWRVLADVGGGVHCIGIGRSEMLGAASNIIGNFHQQNLW 419
Query: 357 IGYDFDSQMVSFKPTDCTK 375
+ YD ++ + DC++
Sbjct: 420 VEYDLANRRIGLGKADCSR 438
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 102/373 (27%), Positives = 161/373 (43%), Gaps = 43/373 (11%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
NG Y + IGTPP IVDTGS + +V C C QC + P ++P SSS+YK + C
Sbjct: 80 NGYYTTRLWIGTPPQ-QFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKC 138
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
+D + S C Y YA+ S + GVL + I+FGN + VFGC +
Sbjct: 139 N------IDCICDSDGVQCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENM 192
Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSITSKMYFGNGS 197
TG +F++ G++GLG LSL Q++ + N FS C M G G+
Sbjct: 193 ETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCY----------GGMDIGGGA 242
Query: 198 EVSGGGVVSTSLV----SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS-KGNMF 252
V GG + ++ YY V L+ I V + K +P SSG +
Sbjct: 243 MVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHV-----AGKKLPL--SSGIFDGRYGAV 295
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCY-----KTPSMAGIAP 304
+D+G LP + ++ ++ + + I K DP +C+ ++ P
Sbjct: 296 LDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNF-KDICFSGAGSDAAELSNKFP 354
Query: 305 ILTAHFDGGAKVPLIHTSTFIP-PPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYDFD 362
+ F+ G K+ L + F V G +C + D + G + + YD
Sbjct: 355 TVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRA 414
Query: 363 SQMVSFKPTDCTK 375
+ + F T+C++
Sbjct: 415 NSKIGFWKTNCSE 427
>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 252
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 75/178 (42%), Positives = 99/178 (55%), Gaps = 16/178 (8%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
N+ T N Y++ +G+ ++ I+DT SDL WVQC PC+ CY Q PI+ P++SSSY
Sbjct: 59 NLQTLN--YIVTMGLGSK---NMTVIIDTRSDLTWVQCEPCMSCYNQQGPIFKPSTSSSY 113
Query: 76 KELSCQSEQCHLL-----DTVSCSSQQ--LCNYTYGYADSSLTKGVLATERITFGNSNNF 128
+ +SC S C L +T +C S CNY Y D S T G L E ++FG +
Sbjct: 114 QSVSCNSSTCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGDLGVEALSFGGVS-- 171
Query: 129 FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
+ VFGCG NN G+F GL+GLGR+ LSL SQ + G FSYCL SS
Sbjct: 172 VSDFVFGCGRNNKGLFG-GVSGLMGLGRSYLSLVSQTNATFGG-VFSYCLPTTEAGSS 227
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 102/373 (27%), Positives = 161/373 (43%), Gaps = 43/373 (11%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
NG Y + IGTPP IVDTGS + +V C C QC + P ++P SSS+YK + C
Sbjct: 80 NGYYTTRLWIGTPPQ-QFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKC 138
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
+D + S C Y YA+ S + GVL + I+FGN + VFGC +
Sbjct: 139 N------IDCICDSDGVQCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENM 192
Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSITSKMYFGNGS 197
TG +F++ G++GLG LSL Q++ + N FS C M G G+
Sbjct: 193 ETGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCY----------GGMDIGGGA 242
Query: 198 EVSGGGVVSTSLV----SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS-KGNMF 252
V GG + ++ YY V L+ I V + K +P SSG +
Sbjct: 243 MVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHV-----AGKKLPL--SSGIFDGRYGAV 295
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCY-----KTPSMAGIAP 304
+D+G LP + ++ ++ + + I K DP +C+ ++ P
Sbjct: 296 LDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNF-KDICFSGAGSDAAELSNKFP 354
Query: 305 ILTAHFDGGAKVPLIHTSTFIP-PPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYDFD 362
+ F+ G K+ L + F V G +C + D + G + + YD
Sbjct: 355 TVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRA 414
Query: 363 SQMVSFKPTDCTK 375
+ + F T+C++
Sbjct: 415 NSKIGFWKTNCSE 427
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 112/391 (28%), Positives = 171/391 (43%), Gaps = 55/391 (14%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQC-LP-----CVQC-YKQVKP----IYNPA 70
G Y + FS+GTPP + ++DTGS L+W C +P C C + V P IY
Sbjct: 72 GGYSVIFSLGTPPQ-KVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARN 130
Query: 71 SSSSYKELSCQSEQCHLL--DTVSCSSQQLCNY---TYGYADSSLTKGVLATERITFGNS 125
SS+ + L C+S +C+ + ++CS+ + C Y YG T G L ++ +
Sbjct: 131 KSSTVQSLPCRSPKCNWVFGSDLNCSTTKRCPYYGLEYGLGS---TTGQLVSDVLGLSKL 187
Query: 126 NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS 185
N D +FGC + V N G+ G GR LAS I +QLG KFSYCLV D
Sbjct: 188 NRIPD-FLFGC----SLVSNRQPEGIAGFGR---GLAS-IPAQLGLTKFSYCLVSHRFDD 238
Query: 186 SITSK---MYFGNGSEVSGGGVVSTSLVSKED-----KTYYFVTLEGISVGNLSNSSKLI 237
+ S ++ G + V+ + +K YY+++L I VG +
Sbjct: 239 TPQSGDLVLHRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGG--KDVPIP 296
Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRL-----GSQL 292
P Y G M +D+G+ T + + ++ + ++ +T Y+ + G
Sbjct: 297 PRYLVPSKEGDGGMIVDSGSTFTFMERIIFDPVARELEK--HMTKYKRAKEIEDSSGLGP 354
Query: 293 CYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM-----QP--IDGDV 344
CY + + P LT F GGA + L T F +GV C + +P G
Sbjct: 355 CYNITGQSEVDVPKLTFSFKGGANMDLPLTDYF-SLVTDGVVCMTVLTDPDEPGSTTGPA 413
Query: 345 GIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
I GN+ Q + +I YD Q FKP C +
Sbjct: 414 IILGNYQQQNFYIEYDLKKQRFGFKPQQCDR 444
>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 103/362 (28%), Positives = 160/362 (44%), Gaps = 41/362 (11%)
Query: 39 YGIVDTGSDLMWVQCLPCVQ----CYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCS 94
Y +DTG++L W+QC C C+ P Y + S SYK +SC Q + C
Sbjct: 102 YFQIDTGNELSWIQCEGCQNKGNMCFPHKDPPYTSSQSKSYKPVSC--NQHSFCEPNQC- 158
Query: 95 SQQLCNYTYGYADSSLTKGVLATERITF---GNSNNFFDNVVFGCGHNNTG-----VFNE 146
+ LC Y Y S T G LA E TF + ++ FGC ++ + ++
Sbjct: 159 KEGLCAYNVTYGPGSYTSGNLANETFTFYSNHGKHTALKSISFGCSTDSRNMIYAFLLDK 218
Query: 147 NEM-GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVV 205
N + G++G+G S +Q L + KFSYC+ +T ++ + FG V +
Sbjct: 219 NPVSGVLGMGWGPRSFLAQ-LGSISHGKFSYCITANNTHNTY---LRFGK-HVVKSKNLQ 273
Query: 206 STSLVSKEDKTYYFVTLEGISVG----NLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTL 261
+T ++ + Y V L GISV N++ + + S G I ID G TL
Sbjct: 274 TTKIMQVKPSAAYHVNLLGISVNGVKLNITKTDLAVRKDGSRGCI------IDAGTLATL 327
Query: 262 LPKDFYNRLEEQVRNAI----KLTPYQDPRLGSQLCYKTPSMAGIA--PILTAHFDGGAK 315
L K ++ L + N + L + +L LCY+ S AG P++T H + A
Sbjct: 328 LVKPIFDTLHTALSNHLSSNQNLKRWVIHKLHKDLCYEQLSDAGRKNLPVVTFHLE-NAD 386
Query: 316 VPLIHTSTFIPPPVEG--VFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ + + F+ EG VFC +M D I G + Q YD ++++SF P DC
Sbjct: 387 LEVKPEAIFLFREFEGKNVFCLSMLSDDSKT-IIGAYQQMKQKFVYDTKARVLSFGPEDC 445
Query: 374 TK 375
K
Sbjct: 446 EK 447
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 110/381 (28%), Positives = 162/381 (42%), Gaps = 43/381 (11%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
N+ +N+ +G +++ + GTPP I+DTGS + W QC CV C K ++
Sbjct: 113 NHAHNNNLFDEDGNFLVDVAFGTPPQ-KFKLILDTGSSITWTQCKACVHCLKDSHRHFDS 171
Query: 70 ASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
+SS+Y SC + TV + Y Y D S + G + +T S + F
Sbjct: 172 LASSTYSFGSC------IPSTVGNT------YNMTYGDKSTSVGNYGCDTMTLEPS-DVF 218
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
FGCG NN G F G++GLG+ +LS SQ S+ FSYCL ++SI S
Sbjct: 219 QKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTASKF-KKVFSYCLPE---ENSIGS 274
Query: 190 KMYFGNGSEVSGGGVVSTSLVSK------EDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
+ FG + + TSLV+ E+ YYFV L ISVGN + IP S
Sbjct: 275 -LLFGEKATSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLN---IP----S 326
Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS----QLCYKTPSM 299
+ ID+G T LP+ Y+ L+ + A+ P + R CY
Sbjct: 327 SVFASPGTIIDSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKENDMLDTCYNLSGR 386
Query: 300 AGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM-----QPIDGDVGIFGNFAQS 353
+ P HF GA V L + + C A ++ ++ I GN Q
Sbjct: 387 KDVLLPEXVLHFGDGADVRL-NGKRVVWGNDASRLCLAFAGNSKSTMNPELTIIGNRQQV 445
Query: 354 DLFIGYDFDSQMVSFKPTDCT 374
L + YD + + F C+
Sbjct: 446 SLTVLYDIRGRRIGFGGNGCS 466
>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
Length = 494
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 105/410 (25%), Positives = 166/410 (40%), Gaps = 61/410 (14%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ-------------- 58
+ S T G+Y ++F +GTP + I DTGSDL WV+C
Sbjct: 99 LSSGAYTGTGQYFVRFRVGTPAQPFVL-IADTGSDLTWVKCRGAASPSHATATASPAAAP 157
Query: 59 -CYKQVKPIYNPASSSSYKELSCQSEQCHL---LDTVSCSSQ-QLCNYTYGYADSSLTKG 113
++ P S ++ + C SE C +CSS C+Y Y Y D+S +G
Sbjct: 158 SPAVAPPRVFRPGDSKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRYNDNSAARG 217
Query: 114 VLATERITFG-----------NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLA 162
V+ T+ T + VV GC + G E G++ LG + +S A
Sbjct: 218 VVGTDSATVALSGGRGGGGGGDRKAKLQGVVLGCTTAHAGQGFEASDGVLSLGYSNISFA 277
Query: 163 SQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGG-----GVVSTSLVSKEDKTY 217
S+ S+ G +FSYCLV + TS + FG G + + G + L+ + +
Sbjct: 278 SRAASRFG-GRFSYCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDARVRPF 336
Query: 218 YFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYN----RLEEQ 273
Y V ++ +SV ++ IP S G ID+G T+L Y L EQ
Sbjct: 337 YAVAVDSVSVDGVALD---IP-AEVWDVGSNGGTIIDSGTSLTVLATPAYKAVVAALSEQ 392
Query: 274 VRNAIKLTPYQDPRLGSQLCYKTPSMAG-----IAPILTAHFDGGAKVPLIHTSTFIPPP 328
+ ++ DP CY + P L F G A++ S ++
Sbjct: 393 LAGLPRVA--MDP---FDYCYNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKS-YVIDA 446
Query: 329 VEGVFCFAMQPIDG---DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
GV C +Q +G V + GN Q + +D +++ + F+ T CT+
Sbjct: 447 APGVKCIGVQ--EGAWPGVSVIGNILQQEHLWEFDLNNRWLRFRQTSCTQ 494
>gi|125572774|gb|EAZ14289.1| hypothetical protein OsJ_04213 [Oryza sativa Japonica Group]
Length = 492
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 104/347 (29%), Positives = 155/347 (44%), Gaps = 35/347 (10%)
Query: 18 STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSS--- 74
+T G YV+ FS+GTPP + + G++D SD +W+QC C C PA++S+
Sbjct: 91 ATNTGMYVLSFSVGTPPQV-VTGVLDITSDFVWMQCSACATCGADA-----PAATSAPPF 144
Query: 75 YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
Y LS + T C +Y YG ++ T G+LA + F D V+F
Sbjct: 145 YAFLSFHDTRAPT--TPPCG----YSYVYGGGAANTTAGLLAVDAFAFATVRA--DGVIF 196
Query: 135 GCGHNNTGVFNENEM-GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
GC V E ++ G++GLGR LS SQ+ Q+G +FSY L P + S + F
Sbjct: 197 GC-----AVATEGDIGGVIGLGRGELSPVSQL--QIG--RFSYYLAP-DDAVDVGSFILF 246
Query: 194 GNGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYN-SSGAISKGNM 251
+ ++ VST LV S+ ++ Y+V L GI V IP A G +
Sbjct: 247 LDDAKPRTSRAVSTPLVASRASRSLYYVELAGIRV---DGEDLAIPRGTFDLQADGSGGV 303
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA-GIAPILTAHF 310
+ P T L Y + + + + I+L LG LCY + S+A P + F
Sbjct: 304 VLSITIPVTFLDAGAYKVVRQAMASKIELRAADGSELGLDLCYTSESLATAKVPSMALVF 363
Query: 311 DGGAKVPLIHTSTFIPPPVEGVFCFAMQPI-DGDVGIFGNFAQSDLF 356
GGA + L + F G+ C + P GD + G+ Q L
Sbjct: 364 AGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVSLL 410
>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
Length = 492
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 108/407 (26%), Positives = 164/407 (40%), Gaps = 65/407 (15%)
Query: 23 EYVMKFSIGTPPLLDIYGI-VDTGSDLMWVQCLP--CVQCYKQV---------------- 63
+Y + S+G P + +DTGSDL+W C P C+ C +
Sbjct: 87 DYTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPIDS 146
Query: 64 ------KPIYNPASSSSYKELSCQSEQCHL--LDTVSCSSQQLCNYTYGYADSSLTKGVL 115
P+ + A SS+ C + +C L ++T SC+S Y Y D SL L
Sbjct: 147 RRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVAN-L 205
Query: 116 ATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFS 175
R+ S +N F C H +G+ G GR LSL +Q+ L + +FS
Sbjct: 206 RRGRVGLAASMAV-ENFTFACAHTALA----EPVGVAGFGRGPLSLPAQLAPSL-SGRFS 259
Query: 176 YCLVP--FHTDSSI-TSKMYFGNGSEVSGGGVVSTSLV------SKEDKTYYFVTLEGIS 226
YCLV F D I +S + G ++ + G T V + + +Y V LE +S
Sbjct: 260 YCLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVS 319
Query: 227 VGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY--- 283
VG ++ P G M +D+G T+LP D + R+ ++ A+ +
Sbjct: 320 VGGKRIQAQ--PELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRA 377
Query: 284 --QDPRLGSQLCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVE---GVFCFAM 337
+ + G CY +PS + P+ HF G A V L + F+ E V C +
Sbjct: 378 EGAEAQTGLAPCYHYSPSDRAVPPV-ALHFRGNATVALPRRNYFMGFKSEEGRSVGCLML 436
Query: 338 QPIDGD----------VGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ G+ G GNF Q + YD D+ V F CT
Sbjct: 437 MNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCT 483
>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza sativa Japonica Group]
Length = 372
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 101/395 (25%), Positives = 168/395 (42%), Gaps = 46/395 (11%)
Query: 1 MSPATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQC 59
+ PA ++ V + S +Y M S+GTPP+ ++ I DTGS L WVQC C ++C
Sbjct: 2 IQPANIPADSSTVIGDDSMRKNKYFMGISLGTPPVFNLVTI-DTGSTLSWVQCKNCQIKC 60
Query: 60 YKQVKP---IYNPASSSSYKELSCQSEQC---HLLDTVS---CSSQQLCNYTYGYADSSL 110
Y Q I+NP +SS+Y ++ C +E C H+ V C Y+ Y
Sbjct: 61 YDQAAKAGQIFNPYNSSTYSKVGCSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEY 120
Query: 111 TKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLG 170
+ G L +R+T SN DN +FGCG +N ++N G++G G S +Q+ Q
Sbjct: 121 SVGYLGKDRLTLA-SNRSIDNFIFGCGEDN--LYNGVNAGIIGFGTKSYSFFNQVCQQTD 177
Query: 171 ANKFSYCLVPFH-TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN 229
FSYC H + S+T G ++ T L+ + K Y + + V
Sbjct: 178 YTAFSYCFPRDHENEGSLTI------GPYARDINLMWTKLIYYDHKPAYAIQQLDMMVNG 231
Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
+ ++ PY ISK + +D+G T + ++ L++ + ++ Y
Sbjct: 232 I--RLEIDPYI----YISKMTI-VDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWDE 284
Query: 290 SQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVF--------CFAMQPID 341
++C+ + S +A+++ V + + + PVE F C P D
Sbjct: 285 RRICFISNSG-------SANWNDFPTVEMKLIRSTLKLPVENAFYESSNNVICSTFLPDD 337
Query: 342 GDVG---IFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
V + GN A + +D + FK C
Sbjct: 338 AGVRGVQMLGNRAVRSFKLVFDIQAMNFGFKARAC 372
>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
Length = 519
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 108/407 (26%), Positives = 164/407 (40%), Gaps = 65/407 (15%)
Query: 23 EYVMKFSIGTPPLLDIYGI-VDTGSDLMWVQCLP--CVQCYKQV---------------- 63
+Y + S+G P + +DTGSDL+W C P C+ C +
Sbjct: 87 DYTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPIDS 146
Query: 64 ------KPIYNPASSSSYKELSCQSEQCHL--LDTVSCSSQQLCNYTYGYADSSLTKGVL 115
P+ + A SS+ C + +C L ++T SC+S Y Y D SL L
Sbjct: 147 RRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVAN-L 205
Query: 116 ATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFS 175
R+ S +N F C H +G+ G GR LSL +Q+ L + +FS
Sbjct: 206 RRGRVGLAASMAV-ENFTFACAHTALA----EPVGVAGFGRGPLSLPAQLAPSL-SGRFS 259
Query: 176 YCLVP--FHTDSSI-TSKMYFGNGSEVSGGGVVSTSLV------SKEDKTYYFVTLEGIS 226
YCLV F D I +S + G ++ + G T V + + +Y V LE +S
Sbjct: 260 YCLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVS 319
Query: 227 VGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY--- 283
VG ++ P G M +D+G T+LP D + R+ ++ A+ +
Sbjct: 320 VGGKRIQAQ--PELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRA 377
Query: 284 --QDPRLGSQLCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVE---GVFCFAM 337
+ + G CY +PS + P+ HF G A V L + F+ E V C +
Sbjct: 378 EGAEAQTGLAPCYHYSPSDRAVPPV-ALHFRGNATVALPRRNYFMGFKSEEGRSVGCLML 436
Query: 338 QPIDGD----------VGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ G+ G GNF Q + YD D+ V F CT
Sbjct: 437 MNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCT 483
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 105/390 (26%), Positives = 169/390 (43%), Gaps = 58/390 (14%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
+ M+ IG+ ++ I+DTGS+ + VQC + +P+++PA+S SY+++ C S+
Sbjct: 100 FSMQLGIGSLQK-NLSAIIDTGSEAVLVQC------GSRSRPVFDPAASQSYRQVPCISQ 152
Query: 84 QCHLLD--TVSCSSQ------QLCNYTYGYADSSLTKGVLATERITFGNSNNF------F 129
C + T + SSQ C Y+ Y DS + G + + + F NS N F
Sbjct: 153 LCLAVQQQTSNGSSQPCVNSSATCTYSLSYGDSRNSTGDFSQD-VIFLNSTNSSGQAVQF 211
Query: 130 DNVVFGCGHNNTG-VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT 188
+V FGC H+ G + + +G+VG R LSL SQ+ +LG +KFSYC T
Sbjct: 212 RDVAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRAT 271
Query: 189 SKMYFGNGSEVSGGGVVSTSL----VSKEDKTYYFVTLEGISVGNLS-----NSSKLIPY 239
++ G+ S +S V T L V+ Y+V L ISV + ++ KL P
Sbjct: 272 GVIFLGD-SGLSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPS 330
Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPR------LGSQLC 293
G + +D+G T + D Y RNA + R G C
Sbjct: 331 TGDGGTV------LDSGTTFTRVVDDAYTAF----RNAFAASNRSGLRKKVGAAAGFDDC 380
Query: 294 YKTPSMAGI--APILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPID----GDV 344
Y + + + P + ++ L F+P G C A+ G +
Sbjct: 381 YNISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKI 440
Query: 345 GIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ GN+ QS+ + YD + V F+ DC+
Sbjct: 441 NVLGNYQQSNYLVEYDNERSRVGFERADCS 470
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 113/380 (29%), Positives = 167/380 (43%), Gaps = 44/380 (11%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK----QVK-PIYNPASSSSYK 76
G Y K +GTPP D Y VDTGSD++WV C C C + Q++ ++P SS +
Sbjct: 79 GLYYTKLRLGTPPR-DFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTAS 137
Query: 77 ELSCQSEQCHLLDTVS---CSSQ-QLCNYTYGYADSSLTKGVLATERITF----GNS--N 126
+SC ++C S CS Q LC YT+ Y D S T G ++ + F G+S
Sbjct: 138 PISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVP 197
Query: 127 NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFH 182
N VVFGC + TG +++ G+ G G+ +S+ SQ+ SQ + FS+CL
Sbjct: 198 NSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL---- 253
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSK---EDKTYYFVTLEGISVGNLSNSSKLIPY 239
K G G + G +V ++V + +Y V L ISV + + +P
Sbjct: 254 -------KGENGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISV-----NGQALPI 301
Query: 240 YNSSGAISKGN-MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS 298
S + S G IDTG L + Y E + NA+ + G+Q T S
Sbjct: 302 NPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQCYVITTS 361
Query: 299 MAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD-VGIFGNFAQSD 354
+ I P ++ +F GGA + L I G V+C Q I + I G+ D
Sbjct: 362 VGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKD 421
Query: 355 LFIGYDFDSQMVSFKPTDCT 374
YD Q + + DC+
Sbjct: 422 KIFVYDLVGQRIGWANYDCS 441
>gi|357476865|ref|XP_003608718.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355509773|gb|AES90915.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 482
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 116/412 (28%), Positives = 173/412 (41%), Gaps = 76/412 (18%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP--CVQCYKQVKPIYNPASSSSYKE--- 77
+Y + F++G P I +DTGSDL+W C P C+ C + K +P+ ++
Sbjct: 74 DYTLSFNLG-PHSQPITLYMDTGSDLVWFPCTPFNCILCELKPKLTSDPSPPTNISHSTP 132
Query: 78 LSCQSEQCHL--------------------LDTVSCSSQQLCNYTYGYADSSLTKGVLAT 117
+SC S C + ++T C S + Y Y D SL + +
Sbjct: 133 ISCNSHACSVAHSSTPSSDLCTMAHCPLDSIETKDCGSFHCPPFYYAYGDGSL---IASL 189
Query: 118 ERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILS---QLGANKF 174
R T S N FGC H F+E G+ G GR LSL +Q+ + QLG N+F
Sbjct: 190 YRDTLSLSTLQLTNFTFGCAHT---TFSE-PTGVAGFGRGLLSLPAQLATHSPQLG-NRF 244
Query: 175 SYCLVPFHTDSSITSK---MYFG--------NGSEVSGGGVVSTSLVSKEDKTYYF-VTL 222
SYCLV S K + G NG EV V TS++ +Y++ V L
Sbjct: 245 SYCLVSHSFRSERIRKPSPLILGRYNDEKQSNGDEVV--EFVYTSMLENPKHSYFYTVGL 302
Query: 223 EGISVGNLS-NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEE----QVRNA 277
+GISVG + + K++ N G G + +D+G T+LP+ FYN + E + R +
Sbjct: 303 KGISVGKKTVPAPKILRRVNKKG---DGGVVVDSGTTFTMLPEKFYNSVVEGFDRRARKS 359
Query: 278 IKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDG-GAKVPLIHTSTFIP--------PP 328
+ P + + G CY + A I P +T F G + V L + F
Sbjct: 360 NRRAPEIEQKTGLSPCYYL-NTAAIVPAVTLRFVGMNSSVVLPRKNYFYEFMDGGDGVRR 418
Query: 329 VEGVFCFAM-------QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
E V C + G G+ GN+ Q + YD + + V F C
Sbjct: 419 KERVGCLMFMNGGDEAEMSGGPGGVLGNYQQQGFEVEYDLEKKRVGFARRKC 470
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 113/380 (29%), Positives = 167/380 (43%), Gaps = 44/380 (11%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK----QVK-PIYNPASSSSYK 76
G Y K +GTPP D Y VDTGSD++WV C C C + Q++ ++P SS +
Sbjct: 79 GLYYTKLRLGTPPR-DFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTAS 137
Query: 77 ELSCQSEQCHLLDTVS---CSSQ-QLCNYTYGYADSSLTKGVLATERITF----GNS--N 126
+SC ++C S CS Q LC YT+ Y D S T G ++ + F G+S
Sbjct: 138 PISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVP 197
Query: 127 NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFH 182
N VVFGC + TG +++ G+ G G+ +S+ SQ+ SQ + FS+CL
Sbjct: 198 NSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL---- 253
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSK---EDKTYYFVTLEGISVGNLSNSSKLIPY 239
K G G + G +V ++V + +Y V L ISV + + +P
Sbjct: 254 -------KGENGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISV-----NGQALPI 301
Query: 240 YNSSGAISKGN-MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS 298
S + S G IDTG L + Y E + NA+ + G+Q T S
Sbjct: 302 NPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQCYVITTS 361
Query: 299 MAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD-VGIFGNFAQSD 354
+ I P ++ +F GGA + L I G V+C Q I + I G+ D
Sbjct: 362 VGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKD 421
Query: 355 LFIGYDFDSQMVSFKPTDCT 374
YD Q + + DC+
Sbjct: 422 KIFVYDLVGQRIGWANYDCS 441
>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium distachyon]
Length = 506
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 99/345 (28%), Positives = 131/345 (37%), Gaps = 27/345 (7%)
Query: 41 IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDT-----VSC 93
+VDT SD+ WVQC PC QCY Q +Y+P S C S QC L
Sbjct: 177 VVDTASDVPWVQCAPCPQPQCYAQSDVLYDPTKSILSAPFPCSSPQCRSLGRYANGCTGA 236
Query: 94 SSQQLCNYTYGYADSSLTKGVLATERITF-GNSNNFFDNVVFGCGHN--NTGVFNENEMG 150
+ C Y Y D S T G ++ +T + FGC H G FN G
Sbjct: 237 GNTGTCQYRVLYPDGSGTSGTYVSDLLTLNADPKGAVSKFQFGCSHALLRPGSFNNKTAG 296
Query: 151 LVGLGRTRLSLASQILSQLG-ANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSL 209
+ LGR SL+SQ N FSYCL P + S G + V+ L
Sbjct: 297 FMALGRGAQSLSSQTKGTFSKGNVFSYCLPPTGSHKGFLS---LGVPQHAASRYAVTPML 353
Query: 210 VSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNR 269
SK Y V L GI V + + +P A+ N +D+ T LP Y
Sbjct: 354 KSKMAPMIYMVRLIGIDV-----AGQRLPV---PPAVFAANAAMDSRTIITRLPPTAYMA 405
Query: 270 LEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPP 328
L R ++ P+ CY + + P +T FD A V L + +
Sbjct: 406 LRAAFRAQMRAYRAVAPKGQLDTCYDFTGVPMVRLPKVTLVFDRNAAVELDPSGVML--- 462
Query: 329 VEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ FA D GI GN Q L + Y+ D V F+ C
Sbjct: 463 -DSCLAFAPNANDFMPGIIGNVQQQTLEVLYNVDGASVGFRRAAC 506
>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 94/358 (26%), Positives = 149/358 (41%), Gaps = 40/358 (11%)
Query: 42 VDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCS------S 95
+D G L W+QCLPC C Q+ P+++P S ++ + +TV C +
Sbjct: 115 LDMGGGLSWMQCLPCRHCLLQMSPVFDPTKSPTFSNIPAH-------NTVWCRPPYQPLA 167
Query: 96 QQLCNYTYGYADSSLTKGVLATERITFGNSNNFF---DNVVFGCGHNNTGVFNENEM-GL 151
C + Y D++ G LA + +F N+ F +VFGC H N+ + G+
Sbjct: 168 NGACGFDIAYRDNTHASGYLARDTFSFPAGNDDFVPLSAIVFGCAHQTEHFKNQRAVAGI 227
Query: 152 VGL-----GRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVS 206
+GL G+ + Q+L G +FSYC PF S+ S + FG+ V
Sbjct: 228 LGLGMGPAGKPPTAFTKQVLPAHGG-RFSYC--PFVPGMSMYSYLRFGSDIPSHPPPNVH 284
Query: 207 TS----LVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAISKGNMFIDTGAPPT 260
L + YFV L G+SVG LS + + N+ GA G +D G T
Sbjct: 285 RQSTPVLAPAHNSEAYFVKLAGVSVGANRLSGVTPAMFRRNAHGA---GGCVVDIGTRMT 341
Query: 261 LLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS-MAGIAPILTAHFDGGAKVPLI 319
Y ++ VR ++ + C + P+ + P +T HF+ GA + ++
Sbjct: 342 AFIHSAYVHIDHAVRQHLQRRGAHIVVVRGNTCVQQPAPHHDVLPSMTLHFENGAWLRVM 401
Query: 320 HTSTFIPPPVEGVF--CFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ--MVSFKPTDC 373
F+P V G CF D+ + G Q + +D ++SF P DC
Sbjct: 402 PEHVFMPFVVGGHHYQCFGFVS-STDLTVIGARQQVNHRFIFDLHDTIPIMSFNPEDC 458
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 103/383 (26%), Positives = 165/383 (43%), Gaps = 50/383 (13%)
Query: 26 MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQC 85
M+ IG+ ++ I+DTGS+ + VQC + +P+++PA+S SY+++ C S+ C
Sbjct: 1 MQLGIGSLQK-NLSAIIDTGSEAVLVQC------GSRSRPVFDPAASQSYRQVPCISQLC 53
Query: 86 HLLD--TVSCSSQ------QLCNYTYGYADSSLTKGVLATERITFGNSNNF-----FDNV 132
+ T + SSQ C Y+ Y DS + G + + I ++N+ F +V
Sbjct: 54 LAVQQQTSNGSSQPCVNSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRDV 113
Query: 133 VFGCGHNNTG-VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
FGC H+ G + + +G+VG R LSL SQ+ +LG +KFSYC T +
Sbjct: 114 AFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVI 173
Query: 192 YFGNGSEVSGGGVVSTSL----VSKEDKTYYFVTLEGISVGNLS-----NSSKLIPYYNS 242
+ G+ S +S V T L V+ Y+V L ISV + ++ KL P
Sbjct: 174 FLGD-SGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGD 232
Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEE--QVRNAIKLTPYQDPRLGSQLCYKT---P 297
G + +D+G T + D Y N L G CY
Sbjct: 233 GGTV------LDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGS 286
Query: 298 SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPID----GDVGIFGNF 350
S+ G+ P + ++ L F+P G C A+ G + + GN+
Sbjct: 287 SLPGV-PEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNY 345
Query: 351 AQSDLFIGYDFDSQMVSFKPTDC 373
QS+ + YD + V F+ DC
Sbjct: 346 QQSNYLVEYDNERSRVGFERADC 368
>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 442
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 112/384 (29%), Positives = 170/384 (44%), Gaps = 46/384 (11%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
N + ++GTPP ++ ++DTGS+L W+ C P +NP SSSY +SC
Sbjct: 63 NVSLTISITVGTPPQ-NMSMVIDTGSELSWLHCNTNTTATIPY-PFFNPNISSSYTPISC 120
Query: 81 QSEQCHLLDT-----VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
S C SC S LC+ T YAD+S ++G LA++ TFG ++F +VFG
Sbjct: 121 SSPTCTTRTRDFPIPASCDSNNLCHATLSYADASSSEGNLASD--TFGFGSSFNPGIVFG 178
Query: 136 CGHNNTGVFNE---NEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
C +++ +E N GL+G+ LSL +SQL KFSYC+ + S + +
Sbjct: 179 CMNSSYSTNSESDSNTTGLMGMNLGSLSL----VSQLKIPKFSYCI----SGSDFSGILL 230
Query: 193 FGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGN-LSNSSKLIPYYNSSGA 245
G + GG + T LV D++ Y V LEGI + + L N S + + +GA
Sbjct: 231 LGESNFSWGGSLNYTPLVQISTPLPYFDRSAYTVRLEGIKISDKLLNISGNLFVPDHTGA 290
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDP----RLGSQLCYKTP-- 297
G D G + L YN L ++ N L DP ++ LCY+ P
Sbjct: 291 ---GQTMFDLGTQFSYLLGPVYNALRDEFLNQTNGTLRALDDPNFVFQIAMDLCYRVPVN 347
Query: 298 -SMAGIAPILTAHFDGGA-KVPLIHTSTFIPPPVEG---VFCFAMQPID---GDVGIFGN 349
S P ++ F+G +V +P V G V+CF D + I G+
Sbjct: 348 QSELPELPSVSLVFEGAEMRVFGDQLLYRVPGFVWGNDSVYCFTFGNSDLLGVEAFIIGH 407
Query: 350 FAQSDLFIGYDFDSQMVSFKPTDC 373
Q +++ +D V C
Sbjct: 408 HHQQSMWMEFDLVEHRVGLAHARC 431
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 105/386 (27%), Positives = 166/386 (43%), Gaps = 53/386 (13%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
N + ++GTPP ++ ++DTGS+L W++C + + ++P SSSY + C
Sbjct: 82 NVSLTVSLTVGTPPQ-NVSMVLDTGSELSWLRC----NKTQTFQTTFDPNRSSSYSPVPC 136
Query: 81 QSEQC-----HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
S C SC S QLC+ YAD+S ++G LA++ GNS+ +FG
Sbjct: 137 SSLTCTDRTRDFPIPASCDSNQLCHAILSYADASSSEGNLASDTFYIGNSD--MPGTIFG 194
Query: 136 CGHNNTGVFNENE---MGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
C ++ E + GL+G+ R LS +SQ+ KFSYC+ +DS + +
Sbjct: 195 CMDSSFSTNTEEDSKNTGLMGMNRGSLSF----VSQMDFPKFSYCI----SDSDFSGVLL 246
Query: 193 FGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNSSKLIPYYNS---S 243
G+ + + T L+ D+ Y V LEGI V SSKL+P S
Sbjct: 247 LGDANFSWLMPLNYTPLIQISTPLPYFDRVAYTVQLEGIKV-----SSKLLPLPKSVFVP 301
Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRL----GSQLCYKTP 297
G +D+G T L Y+ L + N L +DP G LCY+ P
Sbjct: 302 DHTGAGQTMVDSGTQFTFLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQGGMDLCYRVP 361
Query: 298 ---SMAGIAPILTAHFDGGA-KVPLIHTSTFIPPPVEG---VFCFAMQPID---GDVGIF 347
+ P ++ F G KV +P V G V+CF D + +
Sbjct: 362 LSQTSLPWLPTVSLMFRGAEMKVSGDRLLYRVPGEVRGSDSVYCFTFGNSDLLAVEAYVI 421
Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDC 373
G+ Q ++++ +D + + F C
Sbjct: 422 GHHHQQNVWMEFDLEKSRIGFAQVQC 447
>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
Length = 353
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 161/373 (43%), Gaps = 46/373 (12%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKP---IYNPASSSSYKEL 78
+Y M S+GTPP+ ++ I DTGS L WVQC C ++CY Q I+NP +SS+Y ++
Sbjct: 5 KYFMGISLGTPPVFNLVTI-DTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKV 63
Query: 79 SCQSEQCH-----LLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV 132
C +E C+ L C + C Y+ Y + G L +R+T SN DN
Sbjct: 64 GCSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLA-SNRSIDNF 122
Query: 133 VFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH-TDSSITSKM 191
+FGCG +N ++N G++G G S +Q+ Q FSYC H + S+T
Sbjct: 123 IFGCGEDN--LYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENEGSLTI-- 178
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
G ++ T L+ + K Y + + V + ++ PY ISK +
Sbjct: 179 ----GPYARDINLMWTKLIYYDHKPAYAIQQLDMMVNGI--RLEIDPYI----YISKMTI 228
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFD 311
+D+G T + ++ L++ + ++ Y ++C+ I+ +A+++
Sbjct: 229 -VDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWDERRICF-------ISNSGSANWN 280
Query: 312 GGAKVPLIHTSTFIPPPVEGVF--------CFAMQPIDGDVG---IFGNFAQSDLFIGYD 360
V + + + PVE F C P D V + GN A + +D
Sbjct: 281 DFPTVEMKLIRSTLKLPVENAFYESSNNVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFD 340
Query: 361 FDSQMVSFKPTDC 373
+ FK C
Sbjct: 341 IQAMNFGFKARAC 353
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus communis]
Length = 641
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 97/375 (25%), Positives = 160/375 (42%), Gaps = 45/375 (12%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
+NG Y + IGTPP + IVDTGS + +V C C QC K P + P SSS+YK +
Sbjct: 84 SNGYYTTRLFIGTPPQ-EFALIVDTGSTVTYVPCSTCEQCGKHQDPRFQPESSSTYKPMQ 142
Query: 80 CQSEQCHLLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCG 137
C + +C + + C Y YA+ S + G+LA + ++FGN + +FGC
Sbjct: 143 CNP-------SCNCDDEGKQCTYERRYAEMSSSSGLLAEDVLSFGNESELTPQRAIFGCE 195
Query: 138 HNNTG-VFNENEMGLVGLGRTRLSLASQ-ILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
TG +F++ G++GLGR LS+ Q ++ ++ N FS C
Sbjct: 196 TVETGELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLCY----------------G 239
Query: 196 GSEVSGGGVVSTSLVSKEDKTY-----YFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
G +V GG +V ++ D + Y I + L + K + N K
Sbjct: 240 GMDVVGGAMVLGNIPPPPDMVFAHSDPYRSAYYNIELKELHVAGKRLK-LNPRVFDGKHG 298
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY---QDPRLGSQLCY-----KTPSMAGI 302
+D+G LP++ + ++ + IK DP + +C+ ++ I
Sbjct: 299 TVLDSGTTYAYLPEEAFVAFKDAIIKEIKFLKQIHGPDPSY-NDICFSGAGRDVSQLSKI 357
Query: 303 APILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYD 360
P + F G K+ L F V G +C + Q + G + + YD
Sbjct: 358 FPEVNMVFGNGQKLSLSPENYLFRHTKVSGAYCLGIFQNGKDPTTLLGGIVVRNTLVTYD 417
Query: 361 FDSQMVSFKPTDCTK 375
D+ + F T+C++
Sbjct: 418 RDNDKIGFWKTNCSE 432
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 97/377 (25%), Positives = 162/377 (42%), Gaps = 46/377 (12%)
Query: 25 VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQ 84
++ IGTPP + ++DTGS L W+QC ++P+ SS++ L C
Sbjct: 98 IVDLPIGTPPQVQPM-VLDTGSQLSWIQCHKKAPAKPPPTASFDPSLSSTFSTLPCTHPV 156
Query: 85 CH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
C SC +LC+Y+Y YAD + +G L E+ TF S F ++ GC
Sbjct: 157 CKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSRS-LFTPPLILGCATE 215
Query: 140 NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS---SITSKMYFGNG 196
+T + G++G+ R RLS ASQ S++ KFSYC VP + T Y G+
Sbjct: 216 ST-----DPRGILGMNRGRLSFASQ--SKI--TKFSYC-VPTRVTRPGYTPTGSFYLGHN 265
Query: 197 SEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
+ + ++ D Y V L+GI +G + P + A G
Sbjct: 266 PNSNTFRYIEMLTFARSQRMPNLDPLAYTVALQGIRIG--GRKLNISPAVFRADAGGSGQ 323
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG--------SQLCYKTPSMAGI 302
+D+G+ T L + Y+++ +V A+ PR+ + +C+ ++
Sbjct: 324 TMLDSGSEFTYLVNEAYDKVRAEVVRAV------GPRMKKGYVYGGVADMCFDGNAIEIG 377
Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPVE-GVFCFAMQPID---GDVGIFGNFAQSDLFIG 358
I F+ V ++ + VE GV C + D I GNF Q +L++
Sbjct: 378 RLIGDMVFEFEKGVQIVVPKERVLATVEGGVHCIGIANSDKLGAASNIIGNFHQQNLWVE 437
Query: 359 YDFDSQMVSFKPTDCTK 375
+D ++ + F DC++
Sbjct: 438 FDLVNRRMGFGTADCSR 454
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 113/380 (29%), Positives = 167/380 (43%), Gaps = 44/380 (11%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK----QVK-PIYNPASSSSYK 76
G Y K +G+PP D Y VDTGSD++WV C C C + Q++ ++P SS +
Sbjct: 79 GLYYTKIRLGSPPR-DFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTAT 137
Query: 77 ELSCQSEQCHLLDTVS---CSSQ-QLCNYTYGYADSSLTKGVLATERITF----GNS--N 126
+SC ++C S CS Q LC YT+ Y D S T G ++ + F G+S
Sbjct: 138 PVSCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVP 197
Query: 127 NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFH 182
N VVFGC + TG +++ G+ G G+ +S+ SQ+ SQ L FS+CL
Sbjct: 198 NSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCL---- 253
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSK---EDKTYYFVTLEGISVGNLSNSSKLIPY 239
K G G + G +V ++V + +Y V L ISV + + +P
Sbjct: 254 -------KGENGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISV-----NGQALPI 301
Query: 240 YNSSGAISKGN-MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS 298
S + S G IDTG L + Y E + NA+ + G+Q S
Sbjct: 302 NPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQCYVIATS 361
Query: 299 MAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD-VGIFGNFAQSD 354
+A I P ++ +F GGA + L I G V+C Q I + I G+ D
Sbjct: 362 VADIFPPVSLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKD 421
Query: 355 LFIGYDFDSQMVSFKPTDCT 374
YD Q + + DC+
Sbjct: 422 KIFVYDLVGQRIGWANYDCS 441
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 106/384 (27%), Positives = 163/384 (42%), Gaps = 49/384 (12%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
N + ++G+PP + ++DTGS+L W+ C + + +++P SSSY + C
Sbjct: 60 NVSLTVSLTVGSPPQ-TVTMVLDTGSELSWLHC----KKAPNLHSVFDPLRSSSYSPIPC 114
Query: 81 QSEQCHLLDT-----VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
S C VSC ++LC+ YAD+S +G LA++ GNS +FG
Sbjct: 115 TSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNSA--IPATIFG 172
Query: 136 C---GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
C G ++ + GL+G+ R LS ++Q+G KFSYC+ DSS +
Sbjct: 173 CMDSGFSSNSDEDSKTTGLIGMNRGSLSF----VTQMGLQKFSYCIS--GQDSS--GILL 224
Query: 193 FGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGN-LSNSSKLIPYYNSSGA 245
FG S + T LV D+ Y V LEGI V N + K + + +GA
Sbjct: 225 FGESSFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGA 284
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCYKTPSM 299
G +D+G T L Y L+ + K L +DP Q LCY+ P
Sbjct: 285 ---GQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLT 341
Query: 300 AGIAPIL---TAHFDGGA-KVPLIHTSTFIPPPVEG---VFCFAM---QPIDGDVGIFGN 349
P L T F G V +P + G V+CF + + + I G+
Sbjct: 342 RRTLPPLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGH 401
Query: 350 FAQSDLFIGYDFDSQMVSFKPTDC 373
Q ++++ +D V F C
Sbjct: 402 HHQQNVWMEFDLAKSRVGFAEVRC 425
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 105/373 (28%), Positives = 162/373 (43%), Gaps = 42/373 (11%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
NG Y + IGTPP + IVD+GS + +V C C QC K P + P SS+Y+ + C
Sbjct: 90 NGYYTTRLWIGTPPQM-FALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEMSSTYQPVKC 148
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
+ C+ D ++ C Y YA+ S +KGVL + I+FGN + VFGC
Sbjct: 149 NMD-CNCDD-----DREQCVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAVFGCETV 202
Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
TG ++++ G++GLG+ LSL Q++ + L +N F C M G GS
Sbjct: 203 ETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCY----------GGMDVGGGS 252
Query: 198 EVSGGGVVSTSLV---SKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
+ GG + +V S D++ YY + L GI V S + GA+ +
Sbjct: 253 MILGGFDYPSDMVFTDSDPDRSPYYNIDLTGIRVAGKQLSLHSRVFDGEHGAV------L 306
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYKTPS------MAGIAP 304
D+G LP + EE V + K DP C++ + ++ I P
Sbjct: 307 DSGTTYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPNF-KDTCFQVAASNYVSELSKIFP 365
Query: 305 ILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYDFD 362
+ F G L F V G +C + P D + G + + YD +
Sbjct: 366 SVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTLVVYDRE 425
Query: 363 SQMVSFKPTDCTK 375
+ V F T+C++
Sbjct: 426 NSKVGFWRTNCSE 438
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 106/384 (27%), Positives = 163/384 (42%), Gaps = 49/384 (12%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
N + ++G+PP + ++DTGS+L W+ C + + +++P SSSY + C
Sbjct: 53 NVSLTVSLTVGSPPQ-TVTMVLDTGSELSWLHC----KKAPNLHSVFDPLRSSSYSPIPC 107
Query: 81 QSEQCHLLDT-----VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
S C VSC ++LC+ YAD+S +G LA++ GNS +FG
Sbjct: 108 TSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNSA--IPATIFG 165
Query: 136 C---GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
C G ++ + GL+G+ R LS ++Q+G KFSYC+ DSS +
Sbjct: 166 CMDSGFSSNSDEDSKTTGLIGMNRGSLSF----VTQMGLQKFSYCIS--GQDSS--GILL 217
Query: 193 FGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGN-LSNSSKLIPYYNSSGA 245
FG S + T LV D+ Y V LEGI V N + K + + +GA
Sbjct: 218 FGESSFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGA 277
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCYKTPSM 299
G +D+G T L Y L+ + K L +DP Q LCY+ P
Sbjct: 278 ---GQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLT 334
Query: 300 AGIAPIL---TAHFDGGA-KVPLIHTSTFIPPPVEG---VFCFAM---QPIDGDVGIFGN 349
P L T F G V +P + G V+CF + + + I G+
Sbjct: 335 RRTLPPLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGH 394
Query: 350 FAQSDLFIGYDFDSQMVSFKPTDC 373
Q ++++ +D V F C
Sbjct: 395 HHQQNVWMEFDLAKSRVGFAEVRC 418
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 104/371 (28%), Positives = 156/371 (42%), Gaps = 32/371 (8%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-PIYNPASSSSYKELSCQ 81
YV + +GTPP + I D +D WV C C+ C P ++P SS+Y+ + C
Sbjct: 99 SYVARARLGTPPQTLLVAI-DPSNDAAWVPCSACLGCAPGASSPSFDPTQSSTYRPVRCG 157
Query: 82 SEQCHLLD--TVSCSS--QQLCNYTYGYADSSLTKGVLATERITFGNSNNFF---DNVVF 134
+ QC + T SC + C + YA S+L VL + ++ +SN D+ F
Sbjct: 158 APQCAQVPPATPSCPAGPGASCAFNLSYASSTL-HAVLGQDALSLSDSNGAAVPDDHYTF 216
Query: 135 GCGHNNTGVFNE-NEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
GC TG GLVG GR LS SQ + G + FSYCL P + S+ + +
Sbjct: 217 GCLRVVTGSGGSVPPQGLVGFGRGPLSFLSQTKATYG-SIFSYCL-PSYKSSNFSGTLRL 274
Query: 194 GNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSS----GAISK 248
G + + +T L+S + + Y+V + G+ V + K +P S+ A +
Sbjct: 275 GPAGQPR--RIKTTPLLSNPHRPSLYYVAMVGVRV-----NGKAVPIPASALALDAATGR 327
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTA 308
G +D G T L Y L R + P G CY + P +
Sbjct: 328 GGTIVDAGTMFTRLSPPAYAALRNAFRRGVS-APAAPALGGFDTCYYVNGTKSV-PAVAF 385
Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ--PIDG---DVGIFGNFAQSDLFIGYDFDS 363
F GGA+V L + I GV C AM P DG + + + Q + + +D +
Sbjct: 386 VFAGGARVTLPEENVVISSTSGGVACLAMAAGPSDGVNAGLNVLASMQQQNHRVVFDVGN 445
Query: 364 QMVSFKPTDCT 374
V F CT
Sbjct: 446 GRVGFSRELCT 456
>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 492
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 116/433 (26%), Positives = 166/433 (38%), Gaps = 73/433 (16%)
Query: 5 TYFYPNNVVQSNVS---TANGEYVMKFSIGTPPLLDIYGI---VDTGSDLMWVQCLP--C 56
T+ P++ +S +Y + S+G PL + +DTGSDL+W C P C
Sbjct: 61 THHLPSSRRHRQLSLPLAPGSDYTLSLSVG--PLSTANPVSLFLDTGSDLVWFPCAPFTC 118
Query: 57 VQCYKQVKPIYN------------------------PASSSSYKELSCQSEQCHL--LDT 90
+ C + P N A SS+ C + +C L ++T
Sbjct: 119 MLCEGKPTPPGNNNSSNPLPPPTDSRRIPCASPFCSAAHSSAPPADLCAAARCPLDDIET 178
Query: 91 VSCSSQQLCN-YTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEM 149
SC++ C Y Y D SL L R+ S +N F C H G +
Sbjct: 179 GSCAASHACPPLYYAYGDGSLV-ARLRRGRVGIAASVAV-ENFTFACAHTALG----EPV 232
Query: 150 GLVGLGRTRLSLASQILSQLGANKFSYCLVP--FHTDSSIT-SKMYFGNG---SEVSGGG 203
G+ G GR LSL +Q+ + +FSYCLV F D I S + G S G
Sbjct: 233 GVAGFGRGPLSLPAQLAPAALSGRFSYCLVAHSFRADRPIRPSPLILGRSPGEDPASETG 292
Query: 204 VVSTSLVSKEDKTYYF-VTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLL 262
+V T L+ Y++ V LE +SVG ++ P G G M +D+G T+L
Sbjct: 293 IVYTPLLHNPKHPYFYSVALEAVSVGGTRIPAR--PELGRVGRAGDGGMVVDSGTTFTML 350
Query: 263 PKDFYNRLEEQVRNAIKLTPYQDP-----RLGSQLCY--------KTPSMAGIAPILTAH 309
P + Y R+ E+ A+ ++ + G CY A P L H
Sbjct: 351 PNETYARVAEEFGRAMAAARFERAEAAEDQTGLAPCYYYDHDASAAEEGSARAVPPLAMH 410
Query: 310 FDGGAKVPLIHTSTFI---PPPVEGVFCFAM-----QPIDGDVGIFGNFAQSDLFIGYDF 361
F G A V L + F+ V C + G G GNF Q + YD
Sbjct: 411 FRGEATVVLPRRNYFMGFRSEERRRVGCLMLMNGGEDDGGGPAGTLGNFQQQGFEVVYDV 470
Query: 362 DSQMVSFKPTDCT 374
D+ V F CT
Sbjct: 471 DAGRVGFARRRCT 483
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 98/392 (25%), Positives = 161/392 (41%), Gaps = 46/392 (11%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV----KPIYN 68
+ S T G+Y ++F +GTP + + DTGSDL WV+C ++
Sbjct: 90 LSSGAYTGTGQYFVRFRVGTPAQPFVL-VADTGSDLTWVKCRGAGAAAGTGAGSPARVFR 148
Query: 69 PASSSSYKELSCQSEQCHL---LDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFG- 123
A+S S+ ++C S+ C +CSS C Y Y Y D S +GV+ T+ T
Sbjct: 149 TAASKSWAPIACSSDTCTSYVPFSLANCSSPASPCAYDYRYRDGSAARGVVGTDSATIAL 208
Query: 124 -------------NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLG 170
VV GC G ++ G++ LG + +S AS+ ++ G
Sbjct: 209 SSGSGRGGGDSSGGRRAKLQGVVLGCAATYDGQSFQSSDGVLSLGNSNISFASRAAARFG 268
Query: 171 ANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN- 229
+FSYCLV + TS + FG G+ + + L+ + +Y VT++ + V
Sbjct: 269 -GRFSYCLVDHLAPRNATSYLTFGPGA--TAPAAQTPLLLDRRMTPFYAVTVDAVYVAGE 325
Query: 230 -LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY--QDP 286
L + + + GAI +D+G T+L Y + + + P DP
Sbjct: 326 ALDIPADVWDVDRNGGAI------LDSGTSLTILATPAYRAVVTALSKHLAGLPRVTMDP 379
Query: 287 RLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD-- 343
+ CY + P + HF G A++ S ++ GV C +Q +G
Sbjct: 380 ---FEYCYNWTDAGALEIPKMEVHFAGSARLEPPAKS-YVIDAAPGVKCIGVQ--EGSWP 433
Query: 344 -VGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
V + GN Q + +D + + FK T C
Sbjct: 434 GVSVIGNILQQEHLWEFDLRDRWLRFKHTRCA 465
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 110/393 (27%), Positives = 166/393 (42%), Gaps = 53/393 (13%)
Query: 2 SPATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK 61
+PA P + S A GEY + +G+P + +VDTGS+ W+ C
Sbjct: 94 TPAEVEMP---MHSGRDDALGEYFAEVKVGSPGQ-RFWLVVDTGSEFTWLNC-------- 141
Query: 62 QVKPIYNPASSSSYKELSCQSEQC-----HLLDTVSCSS-QQLCNYTYGYADSSLTKGVL 115
S S++ ++C S +C L C C Y YAD S KG
Sbjct: 142 ----------SKSFEAVTCASRKCKVDLSELFSLSVCPKPSDPCLYDISYADGSSAKGFF 191
Query: 116 ATERITFGNSN---NFFDNVVFGCGHN--NTGVFNENEMGLVGLGRTRLSLASQILSQLG 170
T+ IT G +N +N+ GC + N FNE G++GLG + S + ++ G
Sbjct: 192 GTDSITVGLTNGKQGKLNNLTIGCTKSMLNGVNFNEETGGILGLGFAKDSFIDKAANKYG 251
Query: 171 ANKFSYCLVPFHTDSSITSKMYFGNGSEVS-GGGVVSTSLVSKEDKTYYFVTLEGISVGN 229
A KFSYCLV + S++S + G G + T L+ +Y V + GIS+G
Sbjct: 252 A-KFSYCLVDHLSHRSVSSNLTIGGHHNAKLLGEIRRTELILF--PPFYGVNVVGISIGG 308
Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPT--LLP--KDFYNRLEEQVRNAIKLTPYQD 285
K+ P A +G ID+G T LLP + + L + + ++T
Sbjct: 309 --QMLKIPPQVWDFNA--EGGTLIDSGTTLTSLLLPAYEAVFEALTKSLTKVKRVTGEDF 364
Query: 286 PRLGSQLCYKTPSM-AGIAPILTAHFDGGAKV-PLIHTSTFIPPPVEGVFCFAMQPIDGD 343
L + C+ + P L HF GGA+ P + + P+ V C + PIDG
Sbjct: 365 DAL--EFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAPL--VKCIGIVPIDGI 420
Query: 344 VG--IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
G + GN Q + +D + V F P+ CT
Sbjct: 421 GGASVIGNIMQQNHLWEFDLSTNTVGFAPSTCT 453
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 101/382 (26%), Positives = 164/382 (42%), Gaps = 45/382 (11%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-----YKQVKPIYNPASSSSYK 76
G Y + +GTPP Y +DTGSD++WV C PC C ++P SS+
Sbjct: 39 GLYYTRIELGTPPR-PFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTAS 97
Query: 77 ELSCQSEQCHLLDTVS---CSSQQLCNYTYGYADSSLTKGVLATERITFGN------SNN 127
LSC +C + +S C++ + C Y++ Y D S T G ++ + +NN
Sbjct: 98 PLSCIDSKCVSSNQISESVCTTDRYCGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYVTNN 157
Query: 128 FFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHT 183
+ FGC +N +G + + G+ G G+ LS+ SQ+ SQ L FS+CL
Sbjct: 158 ASAKITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCLEGADP 217
Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYN 241
I + G E++ G+V T +V + +Y + L+GI+V LS ++ N
Sbjct: 218 GGGI---LVLG---EITEPGMVYTPIVPSQ--PHYNLNLQGIAVNGQQLSIDPQVFATTN 269
Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT-PSMA 300
+ G I ID G L ++ Y + A+ + Q L C+ T S+
Sbjct: 270 TRGTI------IDCGTTLAYLAEEAYEPFVNTIIAAVSQST-QPFMLKGNPCFLTVHSID 322
Query: 301 GIAPILTAHFDGGAK--VPLIHTSTFIPPPVEGVFCFAMQPI------DGDVGIFGNFAQ 352
I P +T +F+G P + + P V+C Q + I G+
Sbjct: 323 EIFPSVTLYFEGAPMDLKPKDYLIQQLSPDSSPVWCIGWQKSGQQATDSSKMTILGDLVL 382
Query: 353 SDLFIGYDFDSQMVSFKPTDCT 374
D YD ++Q + + DC+
Sbjct: 383 KDKVFVYDLENQRIGWTSFDCS 404
>gi|222637182|gb|EEE67314.1| hypothetical protein OsJ_24556 [Oryza sativa Japonica Group]
Length = 304
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 104/371 (28%), Positives = 165/371 (44%), Gaps = 89/371 (23%)
Query: 26 MKFSIGTPPLL--DIYGIVDTGSDLMWVQCLPCVQCYKQVKP-----IYNPASSSSYKEL 78
M+ ++GTPP+ ++GI SDL WV+C PC C P +Y+ A+SSS+ L
Sbjct: 1 MELAVGTPPVTVQALFGI----SDLCWVECTPCSGCNNNAAPPAGARLYDRANSSSFSPL 56
Query: 79 SCQSEQCHLLDTVSCSSQQLCNYTYGYA----DSSLTKGVLATERITFG-NSNNFFDNVV 133
+ C Y Y Y D + KG+L TE I FG N +
Sbjct: 57 ----------------ADTECGYRYVYGATDTDRNYVKGILGTETIKFGSNDAATVQSFT 100
Query: 134 FGCGHN--NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
FGC + +F+ N G+VGLGR++LSL + QLG ++FSYCL ++ ++ S +
Sbjct: 101 FGCTNTVYRNDLFDGNT-GVVGLGRSKLSL----VGQLGLDRFSYCLA---SNPNVASPV 152
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKL-IPYYNSSGAISKGN 250
FG+ + + G GV ST L+ D Y+V L GISV + ++L IP N + +S+
Sbjct: 153 LFGSTASMDGNGVSSTPLL--PDDANYYVNLLGISV----DGTRLAIP--NDTARMSRTY 204
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHF 310
++ L +++ +N + + P +T HF
Sbjct: 205 EAVNGSGLLCFL-------VDDASKNVVTV-----------------------PTMTMHF 234
Query: 311 DGGAKVPLIHTSTFIPPPVEG------VFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
D G + L+ + F + V C + I GN+ Q D + Y+ +
Sbjct: 235 D-GMDMELLFGNYFAYTGKQSGGGGGDVLCLMIGKSSTGSRI-GNYLQMDFHVLYELKNS 292
Query: 365 MVSFKPTDCTK 375
++S +P DC K
Sbjct: 293 VLSVQPADCGK 303
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 98/374 (26%), Positives = 167/374 (44%), Gaps = 45/374 (12%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
NG Y + IGTPP + IVD+GS + +V C C QC P + P SSSY + C
Sbjct: 85 NGYYTTRLYIGTPPQ-EFALIVDSGSTVTYVPCSSCEQCGNHQDPRFQPDLSSSYSPVKC 143
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
+D S ++ C Y YA+ S + GVL + ++FG + + +FGC ++
Sbjct: 144 N------VDCTCDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKPQHAIFGCENS 197
Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
TG +F+++ G++GLGR +LS+ Q++ + + ++ FS C M G G+
Sbjct: 198 ETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCY----------GGMDIGGGA 247
Query: 198 EVSGGGVVSTSLV-SKED---KTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKGNM 251
V GG + ++ S D YY + L+ I V L S++ +N SK
Sbjct: 248 MVLGGMLAPPDMIFSNSDPLRSPYYNIELKEIHVAGKALRVESRI---FN-----SKHGT 299
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCY-----KTPSMAGIA 303
+D+G LP+ + +E V +++K DP +C+ + +
Sbjct: 300 VLDSGTTYAYLPEQAFVAFKEAVTSKVHSLKKIRGPDPSY-KDICFAGAGRNVSKLHEVF 358
Query: 304 PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDF 361
P + F G K+ L F V+G +C + Q + G + + YD
Sbjct: 359 PDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDR 418
Query: 362 DSQMVSFKPTDCTK 375
++ + F T+C++
Sbjct: 419 HNEKIGFWKTNCSE 432
>gi|125575541|gb|EAZ16825.1| hypothetical protein OsJ_32297 [Oryza sativa Japonica Group]
Length = 416
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 102/378 (26%), Positives = 164/378 (43%), Gaps = 77/378 (20%)
Query: 25 VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQ 84
V F+IGTPP I+D P P +SS+++ C ++
Sbjct: 68 VANFTIGTPPQ-PASAIIDVAGP----------------APCSFPNASSTFRPEPCGTDA 110
Query: 85 CHLLDTVSCSSQQLCNYTYGYADSSL---TKGVLATERITFGNSNNFFDNVVFGC----G 137
C + T +CSS +C Y G +S L T G++AT+ G + ++ FGC G
Sbjct: 111 CKSIPTSNCSSN-MCTYE-GTINSKLGGHTLGIVATDTFAIGTATA---SLGFGCVVASG 165
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
+ G GL+GLGR S ++SQ+ KFSYCL P DS S++ G+ +
Sbjct: 166 IDTMG----GPSGLIGLGRA----PSSLVSQMNITKFSYCLTPH--DSGKNSRLLLGSSA 215
Query: 198 EVSGGGVVSTSLVSK-----EDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
+++GGG +T+ K + YY + L+GI G+ + + L P N+ +
Sbjct: 216 KLAGGGNSTTTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIA--LPPSGNT--------VL 265
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFD 311
+ T AP + L Y L+++V A+ P P LC+ ++ AP L F
Sbjct: 266 VQTLAPMSFLVDSAYQALKKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTFQ 325
Query: 312 GGAKVPLIHTSTFIPPPV--------EGVFCFAM--------QPIDGDVGIFGNFAQSDL 355
GA + +PPP +G C A+ +D ++ I G+ Q +
Sbjct: 326 QGAA------ALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENT 379
Query: 356 FIGYDFDSQMVSFKPTDC 373
D + + +SF+P DC
Sbjct: 380 HFLLDLEKKTLSFEPADC 397
>gi|413938616|gb|AFW73167.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 452
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 112/358 (31%), Positives = 137/358 (38%), Gaps = 53/358 (14%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV---QCYKQVKPIYNPASSSSYKELSC 80
YV+ S+GTP + VDTGSDL WVQC PC CY Q P+++PA SSSY + C
Sbjct: 140 YVVTASLGTPGVAQTM-EVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPC 198
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
C L + A G FF FGCGH
Sbjct: 199 GGPVCAGLGIYA---------------------ASACSAAQCGAVQGFF----FGCGHAQ 233
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
+G+FN + GL+GLGR + SL Q G FSYCL T S + G G
Sbjct: 234 SGLFNGVD-GLLGLGREQPSLVEQTAGTYG-GVFSYCL---PTKPSTAGYLTLGVGGPSG 288
Query: 201 GGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
ST+ L S TYY V L GISVG S +P +G + T P
Sbjct: 289 AAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS---VPASAFAGGTVVDTGTVVTRLP 345
Query: 259 PTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHFDGGAK 315
PT Y L R+ + Y L CY + P + F GA
Sbjct: 346 PTA-----YAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGAT 400
Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
V L G FA DG + I GN Q + D V FKP+ C
Sbjct: 401 VTLGADGIL----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKPSSC 452
>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 485
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 110/434 (25%), Positives = 164/434 (37%), Gaps = 73/434 (16%)
Query: 4 ATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP--CVQCYK 61
AT F+ + S + +Y + F++G+ P I +DTGSDL+W C P C+ C
Sbjct: 53 ATRFHHRHRQISLPLSPGSDYTLSFNLGSHPPQPISLYMDTGSDLVWFPCAPFECILCEG 112
Query: 62 QVKPI----YNPASSSSYKELSCQSEQC--------------------HLLDTVSCSSQQ 97
+ +P + +S +SC+S C L++T CSS
Sbjct: 113 KYDTAATGGLSPPNITSSASVSCKSPACSAAHTSLSSSDLCAMARCPLELIETSDCSSFS 172
Query: 98 LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRT 157
+ Y Y D SL + +S N FGC H G +G+ G GR
Sbjct: 173 CPPFYYAYGDGSLVARLYRDSLSMPASSPLVLHNFTFGCAHTALG----EPVGVAGFGRG 228
Query: 158 RLSLASQILS---QLGANKFSYCLVPFHTDSSITSK---MYFGNGS----------EVSG 201
LSL +Q+ S LG N+FSYCLV D+ + + G S G
Sbjct: 229 VLSLPAQLASFSPHLG-NQFSYCLVSHSFDADRVRRPSPLILGRYSLDDEKKKRVGHDRG 287
Query: 202 GGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK---GNMFIDTGAP 258
V + L + + +Y V LEGI+VGN + IP + + G M +D+G
Sbjct: 288 EFVYTAMLDNPKHPYFYCVGLEGITVGN-----RKIPVPEILKRVDRRGNGGMVVDSGTT 342
Query: 259 PTLLPKDFYNRLEEQVRNAI----KLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGA 314
T+LP Y L + + + K + R G CY + A P + HF G +
Sbjct: 343 FTMLPAGLYESLVTEFNHRMGRVYKRATQIEERTGLGPCYYSDDSAAKVPAVALHFVGNS 402
Query: 315 KVPLIHTSTFIP--------PPVEGVFCFAMQ------PIDGDVGIFGNFAQSDLFIGYD 360
V L + + V C + G GN+ Q + YD
Sbjct: 403 TVILPRNNYYYEFFDGRDGQKKKRKVGCLMLMNGGDEAESGGPAATLGNYQQQGFEVVYD 462
Query: 361 FDSQMVSFKPTDCT 374
+ V F C
Sbjct: 463 LEKHRVGFARRKCA 476
>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
Length = 720
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 97/326 (29%), Positives = 137/326 (42%), Gaps = 30/326 (9%)
Query: 41 IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD--TVSCSSQ 96
I+D+GSD+ WVQC PC C++Q P+++PA S++Y + C S C L CS+
Sbjct: 171 IIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCSAN 230
Query: 97 QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG-VFNENEMGLVGLG 155
C + Y D S G + + +T G + FGC H + G F+ + G + LG
Sbjct: 231 AQCQFGINYGDGSTATGTYSFDDLTLG-PYDVIRGFRFGCAHADRGSAFDYDVAGSLALG 289
Query: 156 RTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG--GGVVSTSLVSKE 213
SL Q ++ G FSYCL P T SS+ + G E + VST L+S
Sbjct: 290 GGSQSLVQQTATRYG-RVFSYCLPP--TASSL-GFLVLGVPPERAQLIPSFVSTPLLSSS 345
Query: 214 -DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEE 272
T+Y V L I V + +P A+ + ID+ + LP Y L
Sbjct: 346 MAPTFYRVLLRAIIV---AGRPLAVP-----PAVFSASSVIDSSTIISRLPPTAYQALRA 397
Query: 273 QVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEG 331
R+A+ + P CY + I P + FDGGA V L +
Sbjct: 398 AFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL------ 451
Query: 332 VFCFAMQPIDGDV--GIFGNFAQSDL 355
C A P D G GN Q L
Sbjct: 452 GSCLAFAPTASDRMPGFIGNVQQKTL 477
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 62/247 (25%), Positives = 91/247 (36%), Gaps = 25/247 (10%)
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
+G G TG ++ +++ L R L + +Q G FSYC+ P S +S +
Sbjct: 492 YGDGSTATGTYSFDDLTLGPYDVDRQGLPLRTATQYG-RVFSYCIPP-----SPSSLGFI 545
Query: 194 GNGSEVSGGGVV----STSLVSKED--KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
G +V ST L+S T+Y V L I V + P S+ ++
Sbjct: 546 TLGVPPQRAALVPTFVSTPLLSSSSMPPTFYRVLLRAIIVAG--RPLPVPPTVFSTSSVI 603
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PIL 306
I + LP Y L R A+ + P CY + I P +
Sbjct: 604 ASTTVI------SRLPPTAYQALRAAFRRAMTMYRTAPPVSILDTCYDFTGVRSITLPSI 657
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
FDGGA V L + +G FA D G GN Q L + YD + +
Sbjct: 658 ALVFDGGATVNLDAAGILL----QGCLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAI 713
Query: 367 SFKPTDC 373
F+ C
Sbjct: 714 RFRSAAC 720
>gi|226495677|ref|NP_001146995.1| pepsin A precursor [Zea mays]
gi|195606284|gb|ACG24972.1| pepsin A [Zea mays]
Length = 504
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 112/425 (26%), Positives = 158/425 (37%), Gaps = 91/425 (21%)
Query: 23 EYVMKFSIG-TPPLLDIYGIVDTGSDLMWVQCLP--CVQCYKQ----------------- 62
+Y + S+G + +DTGSDL+W C P C+ C +
Sbjct: 89 DYTLSLSVGPASAAAPVSLFLDTGSDLVWFPCAPFTCMLCEGKPTPGRSGPLPPPPDSRR 148
Query: 63 ---VKPIYNPASSSSYKELSCQSEQCHL--LDTVSCSSQQLCN-YTYGYADSSLTKGVLA 116
P+ + A +S+ C + +C L ++T SC + C Y Y D SL L
Sbjct: 149 IPCASPLCSAAHASAPPSDLCAAARCPLEDIETGSCGASHACPPLYYAYGDGSLVAH-LR 207
Query: 117 TERITFGNSNNF-----FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGA 171
R+ G DN F C H G +G+ G GR LSL Q+ QL +
Sbjct: 208 RGRVALGAGARASVAVAVDNFTFACAHTALG----EPVGVAGFGRGPLSLPGQLSPQL-S 262
Query: 172 NKFSYCLVP--FHTDSSIT-SKMYFGNGSEVSGG------GVVSTSLVSKEDKTYYF-VT 221
+FSYCLV F D I S + G + + G V T L+ Y++ V
Sbjct: 263 GRFSYCLVSHSFRADRLIRPSPLILGRSPDDADAAAAETDGFVYTPLLHNPKHPYFYSVA 322
Query: 222 LEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLE---------- 271
LE +SVG ++ P G M +D+G T+LP + Y R+
Sbjct: 323 LEAVSVGAARIQAR--PELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAA 380
Query: 272 -----EQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIP 326
E+ LTP CY+ + P L HF G A V L + F+
Sbjct: 381 GFARAERAEEQTGLTP----------CYRYAASDRGVPPLALHFRGNATVALPRRNYFMG 430
Query: 327 PPVEG---------VFCFAM--------QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFK 369
E V C + + DG G GNF Q + YD D+ V F
Sbjct: 431 FKSEDAGAGTRKDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFA 490
Query: 370 PTDCT 374
CT
Sbjct: 491 RRRCT 495
>gi|383165471|gb|AFG65613.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
Length = 136
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/137 (42%), Positives = 76/137 (55%), Gaps = 4/137 (2%)
Query: 61 KQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERI 120
KQ PIY+PA SS+Y ++SC+S C+ L C S C Y Y Y D S+T G+L+ E +
Sbjct: 1 KQPTPIYDPARSSTYSKVSCKSLLCNALPDFECKSAAGCEYQYTYGDFSITVGILSYETL 60
Query: 121 TF---GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
T + N FGCG NN G + G+VGLGR LSL SQ+ + + KFSYC
Sbjct: 61 TLTSKSGAEQLIPNFAFGCGQNNEGNGFDQGAGIVGLGRGPLSLISQLSASM-PKKFSYC 119
Query: 178 LVPFHTDSSITSKMYFG 194
L+ S TS + FG
Sbjct: 120 LMTIDDSQSKTSPLMFG 136
>gi|222613193|gb|EEE51325.1| hypothetical protein OsJ_32293 [Oryza sativa Japonica Group]
Length = 371
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 155/342 (45%), Gaps = 48/342 (14%)
Query: 53 CLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTK 112
C C+ C+KQ P++ P +SS++K C ++ C + T C+S +C Y T
Sbjct: 55 CSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPTPKCAS-DVCAYDGVTGLGGHTV 113
Query: 113 GVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGAN 172
G++AT+ G + G T G +GLGRT SL ++Q+
Sbjct: 114 GIVATDTFAIGTAAPARPPAS-GASWRATSTPWAGPSGFIGLGRTPWSL----VAQMKLT 168
Query: 173 KFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED---KTYYFVTLEGISVGN 229
+FSYCL P D+ S+++ G ++++GGG + + + + YY + LE I G
Sbjct: 169 RFSYCLAPH--DTGKNSRLFLGASAKLAGGGAWTPFVKTSPNDGMSQYYPIELEEIKAG- 225
Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
+++ +P ++ + + + +LL Y ++ V ++ P P +G
Sbjct: 226 --DATITMPRGRNTVLVQTAVVRV------SLLVDSVYQEFKKAVMASVGAAPTATP-VG 276
Query: 290 S--QLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVF-------CFAMQPI 340
+ ++C+ ++G AP L F GA + + PP +F C ++ I
Sbjct: 277 APFEVCFPKAGVSG-APDLVFTFQAGAALTV--------PPANYLFDVGNDTVCLSVMSI 327
Query: 341 --------DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
DG + I G+F Q ++ + +D D M+SF+P DC+
Sbjct: 328 ALLNITALDG-LNILGSFQQENVHLLFDLDKDMLSFEPADCS 368
>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
Length = 346
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 96/370 (25%), Positives = 159/370 (42%), Gaps = 46/370 (12%)
Query: 26 MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKP---IYNPASSSSYKELSCQ 81
M S+GTPP+ ++ I DTGS L WVQC C ++CY Q I+NP +SS+Y ++ C
Sbjct: 1 MGISLGTPPVFNLVTI-DTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGCS 59
Query: 82 SEQCH-----LLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
+E C+ L C + C Y+ Y + G L +R+T SN DN +FG
Sbjct: 60 TEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLA-SNRSIDNFIFG 118
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH-TDSSITSKMYFG 194
CG +N ++N G++G G S +Q+ Q FSYC H + S+T
Sbjct: 119 CGEDN--LYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENEGSLTI----- 171
Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFID 254
G ++ T L+ + K Y + + V + ++ PY ISK + +D
Sbjct: 172 -GPYARDINLMWTKLIYYDHKPAYAIQQLDMMVNGI--RLEIDPYI----YISKMTI-VD 223
Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGA 314
+G T + ++ L++ + ++ Y ++C+ + S +A+++
Sbjct: 224 SGTADTYILSPVFDALDKAMTKEMQAKGYTRGWDERRICFISNSG-------SANWNDFP 276
Query: 315 KVPLIHTSTFIPPPVEGVF--------CFAMQPIDGDVG---IFGNFAQSDLFIGYDFDS 363
V + + + PVE F C P D V + GN A + +D +
Sbjct: 277 TVEMKLIRSTLKLPVENAFYESSNNVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQA 336
Query: 364 QMVSFKPTDC 373
FK C
Sbjct: 337 MNFGFKARAC 346
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 102/384 (26%), Positives = 168/384 (43%), Gaps = 65/384 (16%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
NG Y + IGTPP + IVDTGS + +V C C QC + P + P SSS+Y+ + C
Sbjct: 81 NGYYTTRLWIGTPPQM-FALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKC 139
Query: 81 QSEQCHLLDTVSC---SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGC 136
T+ C S + C Y YA+ S + GVL + I+FGN + VFGC
Sbjct: 140 ---------TIDCNCDSDRMQCVYERQYAEMSTSSGVLGEDLISFGNQSELAPQRAVFGC 190
Query: 137 GHNNTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFG 194
+ TG +++++ G++GLGR LS+ Q++ + + ++ FS C +G
Sbjct: 191 ENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLC---------------YG 235
Query: 195 NGSEVSGGGVVSTSLVSKEDKT----------YYFVTLEGISVGNLSNSSKLIPYYNSSG 244
G +V GG +V + D YY + L+ I V + K +P N++
Sbjct: 236 -GMDVGGGAMVLGGISPPSDMAFAYSDPVRSPYYNIDLKEIHV-----AGKRLP-LNANV 288
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQDPRLGSQLCYKTPSMAG 301
K +D+G LP+ + ++ + ++K DP + +C+ S AG
Sbjct: 289 FDGKHGTVLDSGTTYAYLPEAAFLAFKDAIVKELQSLKKISGPDPNY-NDICF---SGAG 344
Query: 302 IA--------PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFA 351
I P++ F+ G K L F V G +C + Q + + G
Sbjct: 345 IDVSQLSKSFPVVDMVFENGQKYTLSPENYMFRHSKVRGAYCLGVFQNGNDQTTLLGGII 404
Query: 352 QSDLFIGYDFDSQMVSFKPTDCTK 375
+ + YD + + F T+C +
Sbjct: 405 VRNTLVVYDREQTKIGFWKTNCAE 428
>gi|194699094|gb|ACF83631.1| unknown [Zea mays]
gi|413938606|gb|AFW73157.1| hypothetical protein ZEAMMB73_333672 [Zea mays]
Length = 452
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 117/379 (30%), Positives = 146/379 (38%), Gaps = 62/379 (16%)
Query: 3 PATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV---QC 59
PA++ Y ++ T N YV+ S+GTP + VDTGSDL WVQC PC C
Sbjct: 128 PASWGY-------DIGTLN--YVVTASLGTPGVAQTM-EVDTGSDLSWVQCKPCSAAPSC 177
Query: 60 YKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATER 119
Y Q P+++PA SSSY + C C L + A
Sbjct: 178 YSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYA---------------------ASACSA 216
Query: 120 ITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
G FF FGCGH +G+FN + GL+GLGR + SL Q G FSYCL
Sbjct: 217 AQCGAVQGFF----FGCGHAQSGLFNGVD-GLLGLGREQPSLVEQTAGTYG-GVFSYCL- 269
Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTS--LVSKEDKTYYFVTLEGISVGNLSNSSKLI 237
T S + G G ST+ L S TYY V L GISVG S +
Sbjct: 270 --PTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS---V 324
Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYK 295
P +G + T PPT Y L R+ + Y L CY
Sbjct: 325 PASAFAGGTVVDTGTVVTRLPPTA-----YAALRSAFRSGMASYGYPTAPSNGILDTCYN 379
Query: 296 TPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSD 354
+ P + F GA V L G FA DG + I GN Q
Sbjct: 380 FAGYGTVTLPNVALTFGSGATVTLGADGIL----SFGCLAFAPSGSDGGMAILGNVQQRS 435
Query: 355 LFIGYDFDSQMVSFKPTDC 373
+ D V FKP+ C
Sbjct: 436 FEV--RIDGTSVGFKPSSC 452
>gi|361068027|gb|AEW08325.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165459|gb|AFG65601.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165460|gb|AFG65602.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165461|gb|AFG65603.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165462|gb|AFG65604.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165463|gb|AFG65605.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165465|gb|AFG65607.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165466|gb|AFG65608.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165467|gb|AFG65609.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165468|gb|AFG65610.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165469|gb|AFG65611.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165472|gb|AFG65614.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165473|gb|AFG65615.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165474|gb|AFG65616.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165475|gb|AFG65617.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165476|gb|AFG65618.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
Length = 136
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 58/137 (42%), Positives = 76/137 (55%), Gaps = 4/137 (2%)
Query: 61 KQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERI 120
KQ PIY+PA SS+Y ++SC+S C+ L C S C Y Y Y D S+T G+L+ E +
Sbjct: 1 KQPTPIYDPARSSTYSKVSCKSLLCNALPDFECKSTAGCEYQYTYGDFSITVGILSYETL 60
Query: 121 TF---GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
T + N FGCG NN G + G+VGLGR LSL SQ+ + + KFSYC
Sbjct: 61 TLTSKSGAEQLIPNFAFGCGQNNEGNGFDQGAGIVGLGRGPLSLISQLSASM-PKKFSYC 119
Query: 178 LVPFHTDSSITSKMYFG 194
L+ S TS + FG
Sbjct: 120 LMTIDDSQSKTSPLMFG 136
>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 481
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 107/407 (26%), Positives = 169/407 (41%), Gaps = 67/407 (16%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP--CVQCYKQVKPIYNPASSSSYKELSC 80
+Y + F++G+ P I +DTGSDL+W C P C+ C + + + +SC
Sbjct: 74 DYTLSFNLGSNPPQLITLYMDTGSDLVWFPCSPFECILCEGKPQTTKPANITKQTHSVSC 133
Query: 81 QS------------------EQCHL--LDTVSCSSQQLCNYTYGYADSSLTKGVLATERI 120
QS +C L ++T CSS + Y Y D S + +
Sbjct: 134 QSPACSAAHASMSSSNLCAISRCPLDYIETSDCSSFSCPPFYYAYGDGSFVANLY---QQ 190
Query: 121 TFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQI--LSQLGANKFSYCL 178
T S+ N FGC H G+ G GR LSL +Q+ LS N+FSYCL
Sbjct: 191 TLSLSSLHLQNFTFGCAHTALA----EPTGVAGFGRGILSLPAQLSTLSPHLGNRFSYCL 246
Query: 179 VPFHTDSSITSK---MYFGNGSE-VSGGG------VVSTSLVSKEDKTYYF-VTLEGISV 227
V D + + G ++ ++G G V TS++S YY+ V L GISV
Sbjct: 247 VSHSFDGDRLRRPSPLILGRHNDTITGAGDGESVEFVYTSMLSNPKHPYYYCVGLAGISV 306
Query: 228 GNLS-NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTP 282
G + + +++ + G G M +D+G T+LP+ FY N +++V K
Sbjct: 307 GKRTVPAPEILKRVDEKG---NGGMVVDSGTTFTMLPESFYNAVVNEFDKRVNRFHKRAS 363
Query: 283 YQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---------VF 333
+ + G CY ++ I P+L HF G ++ + ++G V
Sbjct: 364 EIETKTGLGPCYYLNGLSQI-PVLKLHFVGNNSDVVLPRKNYFYEFMDGGDGIRRKGKVG 422
Query: 334 CFAMQ------PIDGDVG-IFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
C + +DG G GN+ Q + YD + + V F +C
Sbjct: 423 CMMLMNGEDETELDGGPGATLGNYQQQGFEVVYDLEKERVGFAKKEC 469
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 106/375 (28%), Positives = 165/375 (44%), Gaps = 46/375 (12%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
NG Y + IGTPP + IVD+GS + +V C C QC K P + P SS+Y+ + C
Sbjct: 91 NGYYTTRLWIGTPPQM-FALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPELSSTYQPVKC 149
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
+ C+ D ++ C Y YA+ S +KGVL + I+FGN + VFGC
Sbjct: 150 NMD-CNCDD-----DKEQCVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAVFGCETV 203
Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
TG ++++ G++GLG+ LSL Q++ + L +N F C M G GS
Sbjct: 204 ETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCY----------GGMDVGGGS 253
Query: 198 EVSGGGVVSTSLV---SKEDKT-YYFVTLEGISVG--NLSNSSKLIPYYNSSGAISKGNM 251
+ GG + ++ S D++ YY + L GI V LS +S++ + GA+
Sbjct: 254 MILGGFDYPSDMIFTDSDPDRSPYYNIDLTGIRVAGKKLSLNSRV--FDGEHGAV----- 306
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCY------KTPSMAGI 302
+D+G LP + EE V + K DP C+ ++ I
Sbjct: 307 -LDSGTTYAYLPDAAFAAFEEAVMREVSPLKQIDGPDPNF-KDTCFLVAASNDVSELSKI 364
Query: 303 APILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYD 360
P + F G L F V G +C + P D + G + + YD
Sbjct: 365 FPSVEMIFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTLVVYD 424
Query: 361 FDSQMVSFKPTDCTK 375
++ V F T+C++
Sbjct: 425 RENSKVGFWRTNCSE 439
>gi|308081797|ref|NP_001182920.1| uncharacterized protein LOC100501208 [Zea mays]
gi|238008190|gb|ACR35130.1| unknown [Zea mays]
gi|413922182|gb|AFW62114.1| hypothetical protein ZEAMMB73_927324 [Zea mays]
Length = 269
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 86/273 (31%), Positives = 128/273 (46%), Gaps = 23/273 (8%)
Query: 113 GVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGAN 172
GVLATE TFG NF N+ FGCG G G++G+ LS +L QL
Sbjct: 5 GVLATETFTFGAHQNFSANLTFGCGKLTNGTI-AGASGIMGVSPGPLS----VLKQLSIT 59
Query: 173 KFSYCLVPFHTDSSITSKMYFGNGSEV----SGGGVVSTSLVSKE-DKTYYFVTLEGISV 227
KFSYCL PF TD TS + FG +++ + G V + L+ + YY+V + GIS+
Sbjct: 60 KFSYCLTPF-TDHK-TSPVMFGAMADLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGISI 117
Query: 228 GNLS-NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP 286
G+ + + I G G +D+ L + + L++ V +KL
Sbjct: 118 GSKRLDVPEAILALRPDGT---GGTVLDSATTLAYLVEPAFKELKKAVMEGMKLPAANRS 174
Query: 287 RLGSQLCYKTP---SMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ--PI 340
+C++ P SM G+ P L HF G A++ L S F P G+ C A+ P
Sbjct: 175 IDDYPVCFELPRGMSMEGVQVPPLVLHFAGDAEMSLPRDSYF-QEPSPGMMCLAVMQAPF 233
Query: 341 DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+G + GN Q ++ + YD ++ S+ PT C
Sbjct: 234 EGAPNVIGNVQQQNMHVLYDLGNRKFSYAPTKC 266
>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
Length = 629
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 97/326 (29%), Positives = 137/326 (42%), Gaps = 30/326 (9%)
Query: 41 IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD--TVSCSSQ 96
I+D+GSD+ WVQC PC C++Q P+++PA S++Y + C S C L CS+
Sbjct: 80 IIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCSAN 139
Query: 97 QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG-VFNENEMGLVGLG 155
C + Y D S G + + +T G + FGC H + G F+ + G + LG
Sbjct: 140 AQCQFGINYGDGSTATGTYSFDDLTLG-PYDVIRGFRFGCAHADRGSAFDYDVAGSLALG 198
Query: 156 RTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG--GGVVSTSLVSKE 213
SL Q ++ G FSYCL P T SS+ + G E + VST L+S
Sbjct: 199 GGSQSLVQQTATRYG-RVFSYCLPP--TASSL-GFLVLGVPPERAQLIPSFVSTPLLSSS 254
Query: 214 -DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEE 272
T+Y V L I V + +P A+ + ID+ + LP Y L
Sbjct: 255 MAPTFYRVLLRAIIV---AGRPLAVP-----PAVFSASSVIDSSTIISRLPPTAYQALRA 306
Query: 273 QVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEG 331
R+A+ + P CY + I P + FDGGA V L +
Sbjct: 307 AFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL------ 360
Query: 332 VFCFAMQPIDGDV--GIFGNFAQSDL 355
C A P D G GN Q L
Sbjct: 361 GSCLAFAPTASDRMPGFIGNVQQKTL 386
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 62/247 (25%), Positives = 91/247 (36%), Gaps = 25/247 (10%)
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
+G G TG ++ +++ L R L + +Q G FSYC+ P S +S +
Sbjct: 401 YGDGSTATGTYSFDDLTLGPYDVDRQGLPLRTATQYG-RVFSYCIPP-----SPSSLGFI 454
Query: 194 GNGSEVSGGGVV----STSLVSKED--KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
G +V ST L+S T+Y V L I V + P S+ ++
Sbjct: 455 TLGVPPQRAALVPTFVSTPLLSSSSMPPTFYRVLLRAIIVAG--RPLPVPPTVFSTSSVI 512
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PIL 306
I + LP Y L R A+ + P CY + I P +
Sbjct: 513 ASTTVI------SRLPPTAYQALRAAFRRAMTMYRTAPPVSILDTCYDFTGVRSITLPSI 566
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
FDGGA V L + +G FA D G GN Q L + YD + +
Sbjct: 567 ALVFDGGATVNLDAAGILL----QGCLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAI 622
Query: 367 SFKPTDC 373
F+ C
Sbjct: 623 RFRSAAC 629
>gi|357117301|ref|XP_003560410.1| PREDICTED: uncharacterized protein LOC100833752 [Brachypodium
distachyon]
Length = 473
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 100/381 (26%), Positives = 156/381 (40%), Gaps = 46/381 (12%)
Query: 24 YVMKFSIGTPPLLDIYGI-VDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
Y + +GT + Y + +D + W+QC PC C Q+ P+++PA S +++ +S
Sbjct: 101 YAVAVGVGTEHGYENYELEMDMAAGFSWMQCAPCHPCLPQLNPVFDPAKSPTFRPVSGH- 159
Query: 83 EQCHLLDTVSCS------SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN---VV 133
+ V C C + Y + + G LA + +F +N F + +V
Sbjct: 160 ------NAVLCRPPYHPLQDGRCGFGIAYRNGASAAGYLARDTFSFPTGDNNFQHLPGIV 213
Query: 134 FGCGHNNTGVFNEN-------EMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
FGC N F+ + MG+ G+ Q+ G +FSYC P ++
Sbjct: 214 FGCA-NRIARFDTHGALAGVLGMGMGAEGKPLTGFMRQLYHN-GGGRFSYC--PIVPGTT 269
Query: 187 ITSKMYFGNG-SEVSGGGVVSTSLVSKEDKT---YYFVTLEGISVGNLSNSSKLIPYYNS 242
S + FGN GV S+ T Y+V L GISVG L + P
Sbjct: 270 AYSFLRFGNDIPSQPPAGVHRQSMAVLAPTTTSEAYYVKLAGISVGAL-RVPGVTPEMFE 328
Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTP---YQDPRLGSQLC-YKTPS 298
+G ID G T + + Y +E VR ++ Q P G LC ++TP+
Sbjct: 329 RDQHGRGGCAIDIGTKMTAIVQTAYAHVEAAVRGHLQRNRARFVQSP--GHHLCVHRTPA 386
Query: 299 MAGIAPILTAHFDGGA--KVPLIHTSTFIPPPVEG--VFCFAMQPIDGDVGIFGNFAQSD 354
+ P +T HF GG +V H + P G C + P D ++ + G Q D
Sbjct: 387 IEERLPSMTLHFVGGPWLRVKPQHLFLVVGSPTGGGEYLCLGLVP-DAEMTVIGAMQQID 445
Query: 355 LFIGYDFDSQ--MVSFKPTDC 373
+D + +VSF P DC
Sbjct: 446 TRFIFDLHNNIPIVSFNPEDC 466
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 108/397 (27%), Positives = 178/397 (44%), Gaps = 68/397 (17%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPAS 71
+ T G Y + IGTP Y VDTGSD++WV C+ C C ++ +Y+P +
Sbjct: 82 IPTDTGLYFTQIGIGTPSK-GYYVQVDTGSDILWVNCISCDSCPRKSGLGIDLTLYDPTA 140
Query: 72 SSSYKELSCQSEQCHLLDT----VSCSSQQLCNYTYGYADSSLTKGVLATERITF----- 122
S+S K ++C E C SC++ C Y+ Y D S T G + + +
Sbjct: 141 SASSKTVTCGQEFCATATNGGVPPSCAANSPCQYSITYGDGSSTTGFFVADFLQYDQVSG 200
Query: 123 -GNSNNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYC 177
G +N +V FGCG G + + G++G G+ S+ SQ+ S K FS+C
Sbjct: 201 DGQTNLANASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHC 260
Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVST-SLVSKEDKT--------YYFVTLEGISVG 228
L V+GGG+ + ++V + KT +Y V L+ I VG
Sbjct: 261 L------------------DTVNGGGIFAIGNVVQPKVKTTPLVPGMPHYNVVLKTIDVG 302
Query: 229 NLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQD 285
++ +L G S+G + ID+G LP+ Y + V + L QD
Sbjct: 303 G--STLQLPTNIFDIGGGSRGTI-IDSGTTLAYLPEVVYKAVLSAVFSNHPDVTLKNVQD 359
Query: 286 PRLGSQLCYK-TPSMAGIAPILTAHFDGGAKVPL-IHTSTFIPPPVEGVFCF-----AMQ 338
LC++ + S+ P +T HFDG +PL ++ ++ E V+C +Q
Sbjct: 360 -----FLCFQYSGSVDNGFPEVTFHFDG--DLPLVVYPHDYLFQNTEDVYCVGFQSGGVQ 412
Query: 339 PIDG-DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
DG D+ + G+ A S+ + YD ++Q++ + +C+
Sbjct: 413 SKDGKDMVLLGDLALSNKLVVYDLENQVIGWTNYNCS 449
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 94/374 (25%), Positives = 163/374 (43%), Gaps = 43/374 (11%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS 79
+NG Y + IGTPP + IVDTGS + +V C C QC K P + P SS+Y+ +
Sbjct: 73 SNGYYTTRLFIGTPPQ-EFALIVDTGSTVTYVPCSSCEQCGKHQDPRFQPDLSSTYRPVK 131
Query: 80 CQSEQCHLLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCG 137
C + +C + + C Y YA+ S + GV+A + ++FGN + VFGC
Sbjct: 132 CNP-------SCNCDDEGKQCTYERRYAEMSSSSGVIAEDVVSFGNESELKPQRAVFGCE 184
Query: 138 HNNTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGN 195
+ TG ++++ G++GLGR RLS+ Q++ + + + FS C M G
Sbjct: 185 NVETGDLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCY----------GGMDVGG 234
Query: 196 GSEVSGGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
G+ V G ++V YY + L+ + V K + G +
Sbjct: 235 GAMVLGQISPPPNMVFSHSNPYRSPYYNIELKELHVAGKPLKLKPKVFDEKHGTV----- 289
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCY-----KTPSMAGIA 303
+D+G P+ ++ L++ + I K P DP +C+ + ++ +
Sbjct: 290 -LDSGTTYAYFPEAAFHALKDAIMKEIRHLKQIPGPDPNY-HDICFSGAGREVSHLSKVF 347
Query: 304 PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGDV-GIFGNFAQSDLFIGYDF 361
P + F G K+ L F V G +C + D+ + G + + YD
Sbjct: 348 PEVNMVFGSGQKLSLSPENYLFRHTKVSGAYCLGIFQNGNDLTTLLGGIVVRNTLVTYDR 407
Query: 362 DSQMVSFKPTDCTK 375
++ + F T+C++
Sbjct: 408 ENDKIGFWKTNCSE 421
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 99/374 (26%), Positives = 166/374 (44%), Gaps = 45/374 (12%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
NG Y + IGTPP + IVD+GS + +V C C QC P + P SSSY + C
Sbjct: 86 NGYYTTRLYIGTPPQ-EFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKC 144
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
+D S ++ C Y YA+ S + GVL + ++FG + VFGC ++
Sbjct: 145 N------VDCTCDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKPQRAVFGCENS 198
Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
TG +F+++ G++GLGR +LS+ Q++ + + ++ FS C M G G+
Sbjct: 199 ETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCY----------GGMDIGGGA 248
Query: 198 EVSGGGVVSTSLV-SKED---KTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKGNM 251
V GG + +V S D YY + L+ I V L S++ +N SK
Sbjct: 249 MVLGGVPAPSDMVFSHSDPLRSPYYNIELKEIHVAGKALRVDSRV---FN-----SKHGT 300
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCY-----KTPSMAGIA 303
+D+G LP+ + ++ V +++K DP +C+ + +
Sbjct: 301 VLDSGTTYAYLPEQAFVAFKDAVTSKVHSLKKIRGPDPNY-KDICFAGAGRNVSKLHEVF 359
Query: 304 PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDF 361
P + F G K+ L F V+G +C + Q + G + + YD
Sbjct: 360 PDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDR 419
Query: 362 DSQMVSFKPTDCTK 375
++ + F T+C++
Sbjct: 420 HNEKIGFWKTNCSE 433
>gi|414586111|tpg|DAA36682.1| TPA: pepsin A [Zea mays]
Length = 503
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 112/424 (26%), Positives = 156/424 (36%), Gaps = 90/424 (21%)
Query: 23 EYVMKFSIG-TPPLLDIYGIVDTGSDLMWVQCLP--CVQCYKQ----------------- 62
+Y + S+G + +DTGSDL+W C P C+ C +
Sbjct: 89 DYTLSLSVGPASAAAPVSLFLDTGSDLVWFPCAPFTCMLCEGKPTPGRLGPLPPPPDSRR 148
Query: 63 ---VKPIYNPASSSSYKELSCQSEQCHL--LDTVSCSSQQLCN-YTYGYADSSLTKGVLA 116
P+ + A +S+ C +C L ++T SC + C Y Y D SL L
Sbjct: 149 IPCASPLCSAAHASAPPSDLCAVARCPLEDIETGSCGASHACPPLYYAYGDGSLVAH-LR 207
Query: 117 TERITFGNSNNF-----FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGA 171
R+ G DN F C H G +G+ G GR LSL Q+ QL +
Sbjct: 208 RGRVALGAGARASVAVAVDNFTFACAHTALG----EPVGVAGFGRGPLSLPGQLSPQL-S 262
Query: 172 NKFSYCLVP--FHTDSSIT-SKMYFGNG-----SEVSGGGVVSTSLVSKEDKTYYF-VTL 222
+FSYCLV F D I S + G + G V T L+ Y++ V L
Sbjct: 263 GRFSYCLVSHSFRADRLIRPSPLILGRSPDDAAAAAETDGFVYTPLLHNPKHPYFYSVAL 322
Query: 223 EGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLE----------- 271
E +SVG ++ P G M +D+G T+LP + Y R+
Sbjct: 323 EAVSVGAARIQAR--PELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAG 380
Query: 272 ----EQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPP 327
E+ LTP CY+ + P L HF G A V L + F+
Sbjct: 381 FARAERAEEQTGLTP----------CYRYAASDRGVPPLALHFRGNATVALPRRNYFMGF 430
Query: 328 PVEG---------VFCFAM--------QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
E V C + + DG G GNF Q + YD D+ V F
Sbjct: 431 KSEDAGAGTRKDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFAR 490
Query: 371 TDCT 374
CT
Sbjct: 491 RRCT 494
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 101/374 (27%), Positives = 161/374 (43%), Gaps = 45/374 (12%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
NG Y + IGTPP + IVD+GS + +V C C QC P + P SS+Y + C
Sbjct: 82 NGYYTTRLYIGTPPQ-EFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKC 140
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
+ D S + C Y YA+ S + GVL + ++FG + VFGC ++
Sbjct: 141 SA------DCTCDSDKSQCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQRAVFGCENS 194
Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
TG +F+++ G++GLGR +LS+ Q++ + + + FS C M G G+
Sbjct: 195 ETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCY----------GGMDIGGGA 244
Query: 198 EVSGGGVVSTSLV-SKEDKT---YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
V G +V S+ D YY + L+ I V + +L P SK +
Sbjct: 245 MVLGAMPAPPDMVFSRSDPVRSPYYNIELKEIHVAG--KALRLDPRIFD----SKHGTVL 298
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA---------- 303
D+G LP+ + ++ V + K+ P + R G YK AG
Sbjct: 299 DSGTTYAYLPEQAFVAFKDAVTS--KVRPLKKIR-GPDPNYKDICFAGAGRNVSQLSQAF 355
Query: 304 PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDF 361
P + F G K+ L F VEG +C + Q + G + + YD
Sbjct: 356 PDVDMVFGDGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDR 415
Query: 362 DSQMVSFKPTDCTK 375
++ + F T+C++
Sbjct: 416 HNEKIGFWKTNCSE 429
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 101/381 (26%), Positives = 171/381 (44%), Gaps = 42/381 (11%)
Query: 19 TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
T G Y + +GTPP Y VDTGSD++WV C+ C +C ++ Y+P +SS
Sbjct: 79 TDTGLYFTEIKLGTPPK-RYYVQVDTGSDILWVNCISCEKCPRKSGLGLDLTFYDPKASS 137
Query: 74 SYKELSCQSEQCHLL---DTVSCSSQQLCNYTYGYADSSLTKGVLATERITF------GN 124
S +SC C C++ C Y+ Y D S T G T+ + F G
Sbjct: 138 SGSTVSCDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFVTDALQFDQVTGDGQ 197
Query: 125 SNNFFDNVVFGCGHNNTGVF---NENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVP 180
+ V FGCG G N+ G++G G+ S+ SQ+ + K F++CL
Sbjct: 198 TQPGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCL-- 255
Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
+I F G+ V V +T LV+ D +Y V L+ I VG ++ +P +
Sbjct: 256 ----DTIKGGGIFAIGNVVQ-PKVKTTPLVA--DMPHYNVNLKSIDVG---GTTLQLPAH 305
Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP-SM 299
KG + ID+G T LP+ + + + N + + + + +C++ P S+
Sbjct: 306 VFETGERKGTI-IDSGTTLTYLPELVFKEVMAAIFNKHQDIVFHNVQ--DFMCFQYPGSV 362
Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-DVGIFGNFAQS 353
P +T HF+ + + F P + ++C A+Q DG D+ + G+ S
Sbjct: 363 DDGFPTITFHFEDDLALHVYPHEYFFPNGND-MYCVGFQNGALQSKDGKDIVLMGDLVLS 421
Query: 354 DLFIGYDFDSQMVSFKPTDCT 374
+ + YD ++Q++ + +C+
Sbjct: 422 NKLVIYDLENQVIGWTDYNCS 442
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 108/383 (28%), Positives = 168/383 (43%), Gaps = 61/383 (15%)
Query: 28 FSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCH- 86
+IGTPP +I ++DTGS+L W++C + I+NP +S +Y ++ C S+ C
Sbjct: 71 LTIGTPPQ-NITMVLDTGSELSWLRC----KKEPNFTSIFNPLASKTYTKIPCSSQTCKT 125
Query: 87 ----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG 142
L V+C +LC++ YAD+S +G LA E FG+ VFGC + +
Sbjct: 126 RTSDLTLPVTCDPAKLCHFIISYADASSVEGHLAFETFRFGSLTR--PATVFGCMDSGSS 183
Query: 143 VFNENEM---GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEV 199
E + GL+G+ R LS ++Q+G KFSYC+ + T + G
Sbjct: 184 SNTEEDAKTTGLMGMNRGSLSF----VNQMGFRKFSYCISGLDS----TGFLLLGEARYS 235
Query: 200 SGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNSSKLIPYYNS------SGAIS 247
+ T LV D+ Y V LEGI V N K++P S +GA
Sbjct: 236 WLKPLNYTPLVQISTPLPYFDRVAYSVQLEGIKVNN-----KVLPLPKSVFVPDHTGA-- 288
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQ--VRNAIKLTPYQDPRLGSQ----LCY---KTPS 298
G +D+G T L Y+ L ++ ++ A L +P+ Q LCY T S
Sbjct: 289 -GQTMVDSGTQFTFLLGPVYSALRKEFLLQTAGVLRVLNEPQYVFQGAMDLCYLIDSTSS 347
Query: 299 MAGIAPILTAHFDGGA-KVPLIHTSTFIPPPVEG---VFCFAMQPIDGDVGI----FGNF 350
P++ F G V +P V G V+CF D ++GI G+
Sbjct: 348 TLPNLPVVKLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSD-ELGISSFLIGHH 406
Query: 351 AQSDLFIGYDFDSQMVSFKPTDC 373
Q ++++ YD ++ + F C
Sbjct: 407 QQQNVWMEYDLENSRIGFAELRC 429
>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 434
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 104/380 (27%), Positives = 173/380 (45%), Gaps = 55/380 (14%)
Query: 25 VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI-YNPASSSSYKELSCQSE 83
++ IGTPP ++DTGS L W+QC + + P ++P SSS+ L C
Sbjct: 79 IVSLPIGTPPQTQQM-VLDTGSQLSWIQC----KVPPKTPPTAFDPLLSSSFSVLPCNHS 133
Query: 84 QC------HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
C + L T SC +LC+Y+Y YAD + +G L E+ TF +S ++ GC
Sbjct: 134 LCKPRVPDYTLPT-SCDQNRLCHYSYFYADGTYAEGNLVREKFTFSSSQT-TPPLILGCA 191
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD--SSITSKMYFGN 195
+++ + G++G+ RLS +S L+++ +KFSYC+ P + SS T Y G
Sbjct: 192 TDSS-----DTQGILGMNLGRLSFSS--LAKI--SKFSYCVPPRRSQSGSSPTGSFYLGP 242
Query: 196 GSEVSGGGVVS------TSLVSKEDKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISK 248
+G V+ + + D Y + + GI + G N S + SGA
Sbjct: 243 NPSSAGFKYVNLMTYRQSQRMPNLDPLAYTLPMLGIRINGKKLNISTSAFRADPSGA--- 299
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS--------QLCYKTPSMA 300
G ID+G T L + Y++++E++ +KL P+L +C+ +M
Sbjct: 300 GQTLIDSGTWFTFLVDEAYSKVKEEI---VKLA---GPKLKKGYVYGGSLDMCFDGDAMV 353
Query: 301 GIAPI--LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID---GDVGIFGNFAQSDL 355
I + F+ G ++ ++ + GV C + D I GNF Q DL
Sbjct: 354 IGRMIGNMAFEFENGVEI-VVEREKMLADVGGGVQCLGIGRSDLLGVASNIIGNFHQQDL 412
Query: 356 FIGYDFDSQMVSFKPTDCTK 375
++ +D + V F TDC++
Sbjct: 413 WVEFDLVGRRVGFGRTDCSR 432
>gi|449439393|ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 570
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 108/386 (27%), Positives = 168/386 (43%), Gaps = 56/386 (14%)
Query: 21 NGEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSS--S 74
+G Y +G PP LDI DTGSDL WVQC PC C K P+Y P + S
Sbjct: 196 DGLYYTYIMVGEPPRPYFLDI----DTGSDLTWVQCDAPCSSCGKGRSPLYKPRRENVVS 251
Query: 75 YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD--NV 132
+K+ C Q + D C++ Q CNY YAD S + GVL + T SN N
Sbjct: 252 FKDSLCMEVQRNY-DGDQCAACQQCNYEVQYADQSSSLGVLVKDEFTLRFSNGSLTKLNA 310
Query: 133 VFGCGHNNTGVF---NENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSIT 188
+FGC ++ G+ G++GL R ++SL SQ+ S+ + N +CL D +
Sbjct: 311 IFGCAYDQQGLLLNTLSKTDGILGLSRAKVSLPSQLASRGIINNVVGHCLT---GDPAGG 367
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
++ G+ V G+ +++ +Y + I G+ IP + S+
Sbjct: 368 GYLFLGD-DFVPQWGMAWVAMLDSPSIDFYQTKVVRIDYGS-------IPLSLDTWGSSR 419
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY----QDPRLGSQLCYKTP-SMAGIA 303
+ D+G+ T K+ Y +L V N +++ + QD +C+KT S+ +
Sbjct: 420 EQVVFDSGSSYTYFTKEAYYQL---VANLEEVSAFGLILQDS--SDTICWKTEQSIRSVK 474
Query: 304 PI------LTAHFDGGAKVPLIHTSTFIPPP------VEGVFCFAM----QPIDGDVGIF 347
+ LT F G++ L+ T I P EG C + Q DG I
Sbjct: 475 DVKHFFKPLTLQF--GSRFWLVSTKLVILPENYLLINKEGNVCLGILDGSQVHDGSTIIL 532
Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDC 373
G+ A + YD +Q + + +DC
Sbjct: 533 GDNALRGKLVVYDNVNQRIGWTSSDC 558
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 99/384 (25%), Positives = 164/384 (42%), Gaps = 43/384 (11%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSS 74
+ G Y + +G P I VDTGSD++WV C PC C ++ +Y+P SS+
Sbjct: 25 SGGLYFTQVGLGNPVKHYIVQ-VDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESST 83
Query: 75 YKELSCQSEQC---HLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFG--NSN-- 126
+SC C CS + C Y + Y D S ++G + + + +SN
Sbjct: 84 TSLVSCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGL 143
Query: 127 -NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVPF 181
N V+FGC TG + ++ G++G G+ LS+ +Q+ +Q + FS+CL
Sbjct: 144 ANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL--- 200
Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
+ + G +E G+ T LV D +Y V L GISV NS++L
Sbjct: 201 EGEKRGGGILVIGGIAEP---GMTYTPLV--PDSVHYNVVLRGISV----NSNRLPIDAE 251
Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
+ + + +D+G P YN + +R A TP + + +Q + ++
Sbjct: 252 DFSSTNDTGVIMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLVSGRLSD 311
Query: 302 IAPILTAHFDGGAKV----PLIHTSTFIPPPVEGVFCFAMQ-------PIDG-DVGIFGN 349
+ P +T +F+GGA + P V+C Q P DG + I G+
Sbjct: 312 LFPNVTLNFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGD 371
Query: 350 FAQSDLFIGYDFDSQMVSFKPTDC 373
D + YD D+ + + +C
Sbjct: 372 IVLKDKLVVYDLDNSRIGWMSYNC 395
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 95/374 (25%), Positives = 165/374 (44%), Gaps = 45/374 (12%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
NG Y + IGTPP + IVDTGS + +V C C QC + P + P SSS+Y+ + C
Sbjct: 109 NGYYTTRLWIGTPPQM-FALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKC 167
Query: 81 QSEQCHLLDTVSCS---SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGC 136
T+ C+ + C Y YA+ S + GVL + I+FGN + VFGC
Sbjct: 168 ---------TIDCNCDGDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGC 218
Query: 137 GHNNTG-VFNENEMGLVGLGRTRLSLASQIL-SQLGANKFSYCLVPFHTDSSITSKMYFG 194
+ TG +++++ G++GLGR LS+ Q++ ++ ++ FS C M G
Sbjct: 219 ENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCY----------GGMDVG 268
Query: 195 NGSEVSGGGVVSTSLV---SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
G+ V GG + + S D++ Y+ I + + + K +P N++ K
Sbjct: 269 GGAMVLGGISPPSDMTFAYSDPDRSPYY----NIDLKEMHVAGKRLP-LNANVFDGKHGT 323
Query: 252 FIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQDPRLGSQLCY-----KTPSMAGIA 303
+D+G LP+ + ++ + ++K DP + +C+ ++
Sbjct: 324 VLDSGTTYAYLPEAAFLAFKDAIVKELQSLKQISGPDPNY-NDICFSGAGNDVSQLSKSF 382
Query: 304 PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDF 361
P++ F G K L F V G +C + Q + + G + + YD
Sbjct: 383 PVVDMVFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGNDQTTLLGGIIVRNTLVMYDR 442
Query: 362 DSQMVSFKPTDCTK 375
+ + F T+C +
Sbjct: 443 EQTKIGFWKTNCAE 456
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 110/384 (28%), Positives = 173/384 (45%), Gaps = 49/384 (12%)
Query: 19 TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
T G Y + IGTP Y VDTGSD++WV C+ C C ++ +Y+P+ SS
Sbjct: 76 TETGLYFTQIGIGTPAK-SYYVQVDTGSDILWVNCVFCDTCPRKSGLGIELTLYDPSGSS 134
Query: 74 SYKELSCQSEQC---HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNSN 126
S ++C + C H SC C Y+ Y D S T G T+ + + GNS
Sbjct: 135 SGTGVTCGQDFCVATHGGVIPSCVPAAPCQYSISYGDGSSTTGFFVTDFLQYNQVSGNSQ 194
Query: 127 NFFDN--VVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVP 180
N + FGCG G + G++G G++ S+ SQ+ + K F++CL
Sbjct: 195 TTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRKVFAHCL-- 252
Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIP 238
+I F G +V V +T LV +Y V LE I VG L + +
Sbjct: 253 ----DTINGGGIFAIG-DVVQPKVSTTPLV--PGMPHYNVNLEAIDVGGVKLQLPTNIFD 305
Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK-TP 297
S G I ID+G LP YN + +V P ++ + C++ +
Sbjct: 306 IGESKGTI------IDSGTTLAYLPGVVYNAIMSKVFAQYGDMPLKNDQ--DFQCFRYSG 357
Query: 298 SMAGIAPILTAHFDGGAKVPL-IHTSTFIPPPVEGVFCF-----AMQPIDG-DVGIFGNF 350
S+ PI+T HF+GG +PL IH ++ E ++C +Q DG D+ + G+
Sbjct: 358 SVDDGFPIITFHFEGG--LPLNIHPHDYLFQNGE-LYCMGFQTGGLQTKDGKDMVLLGDL 414
Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
A S+ + YD ++Q++ + +C+
Sbjct: 415 AFSNRLVLYDLENQVIGWTDYNCS 438
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 94/364 (25%), Positives = 158/364 (43%), Gaps = 46/364 (12%)
Query: 42 VDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYKELSCQSEQC---HLLDTVSC 93
VDTGSD++WV C PC C ++ +Y+P SS+ +SC C C
Sbjct: 19 VDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVSCSDPLCVRGRRFAEAQC 78
Query: 94 S-SQQLCNYTYGYADSSLTKGVLATERITFG--NSN---NFFDNVVFGCGHNNTGVFNEN 147
S + C Y + Y D S ++G + + + +SN N V+FGC TG + +
Sbjct: 79 SQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTSQVLFGCSIRQTGDLSTS 138
Query: 148 EM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSITSKMYFGNGSEVSGGG 203
+ G++G G+ LS+ +Q+ +Q + FS+CL + + G +E G
Sbjct: 139 QQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL---EGEKRGGGILVIGGIAE---PG 192
Query: 204 VVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLP 263
+ T LV D +Y V L GISV NS++L + + + +D+G P
Sbjct: 193 MTYTPLV--PDSVHYNVVLRGISV----NSNRLPIDAEDFSSTNDTGVIMDSGTTLAYFP 246
Query: 264 KDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTST 323
YN + +R A TP + + +Q + ++ + P +T +F+GGA +
Sbjct: 247 SGAYNVFVQAIREATSATPVRVQGMDTQCFLVSGRLSDLFPNVTLNFEGGAME--LQPDN 304
Query: 324 FI------PPPVEGVFCFAMQ-------PIDG-DVGIFGNFAQSDLFIGYDFDSQMVSFK 369
++ P V+C Q P DG + I G+ D + YD D+ + +
Sbjct: 305 YLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKLVVYDLDNSRIGWM 364
Query: 370 PTDC 373
+C
Sbjct: 365 SYNC 368
>gi|226492334|ref|NP_001147965.1| pepsin A precursor [Zea mays]
gi|195614874|gb|ACG29267.1| pepsin A [Zea mays]
Length = 538
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 101/393 (25%), Positives = 159/393 (40%), Gaps = 50/393 (12%)
Query: 22 GEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQCYK------------------- 61
G Y++ GTP L Y +V DT +DL W+ C + K
Sbjct: 125 GMYLVSVRFGTPAL--PYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAAKE 182
Query: 62 -QVKPIYNPASSSSYKELSCQSEQCHLLDTVSC---SSQQLCNYTYGYADSSLTKGVLAT 117
+ K Y PA SSS++ + C ++C LL +C S + C+Y D +LT G+
Sbjct: 183 ARRKNWYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQDGTLTMGIYGK 242
Query: 118 ERITFGNSNNFFDN---VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKF 174
E+ T S+ ++ GC G + G++ LG +S A + G +F
Sbjct: 243 EKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVHAAKRFG-QRF 301
Query: 175 SYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNS 233
S+CL+ ++ +S + FG V G G + T +V D K Y + GI VG
Sbjct: 302 SFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGG---- 357
Query: 234 SKL-IP--YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS 290
+L IP +++ + G + +DT T L + Y + + + P G
Sbjct: 358 ERLDIPQEIWDAEKVVG-GGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYELDGF 416
Query: 291 QLCYKTPSMAG---------IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI- 340
+ CY+ + AG P LT GGA++ S +P V GV C A + +
Sbjct: 417 EYCYRW-TFAGDGVDLTHNVTVPRLTVEMAGGARLEPEAKSVVMPEVVPGVACLAFRKLP 475
Query: 341 DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
G GI GN + D + F+ C
Sbjct: 476 RGGPGILGNVLMQEYIWEIDHGKGKMRFRKDKC 508
>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
Length = 419
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 103/409 (25%), Positives = 175/409 (42%), Gaps = 66/409 (16%)
Query: 24 YVMKFSIGTPP-LLDIYGIVDTGSDLMWVQC----LPCVQCYK------QVKPIYNPASS 72
Y++ +IGTPP + +Y +DTGSDL WV C C+ C + I++P S
Sbjct: 11 YLITLNIGTPPQAVQVY--MDTGSDLTWVPCGNLSFDCIDCNDLKSNNLKSSSIFSPLHS 68
Query: 73 SSYKELSCQSEQCHLLDT----------VSCSSQQLC---------NYTYGYADSSLTKG 113
SS SC S C + + CS L ++ Y Y + L G
Sbjct: 69 SSSFRASCASSFCAEIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTYGEGGLVSG 128
Query: 114 VLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK 173
+L + + + FGC T ++E +G+ G GR LSL SQ+
Sbjct: 129 ILTRDILKARTRD--VPRFSFGCV---TSTYHE-PIGIAGFGRGLLSLPSQL--GFLEKG 180
Query: 174 FSYCLVPFH--TDSSITSKMYFGNGS---EVSGGGVVSTSLVSKEDKTYYFVTLEGISVG 228
FS+C +PF + +I+S + G + ++ + L + Y++ LE I++G
Sbjct: 181 FSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPVYPNSYYIGLESITIG 240
Query: 229 NLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKL--TPYQDP 286
+++ + G M +D+G T LP FY++L +++ I +
Sbjct: 241 TNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQLLTILQSTITYPRATETES 300
Query: 287 RLGSQLCYKTP-----------SMAGIAPILTAHFDGGAKVPLIHTSTF--IPPPVEG-- 331
R G LCYK P + + P +T +F A + L ++F + P +G
Sbjct: 301 RTGFDLCYKVPCPNNNLTSLENDVMMVFPSITFNFLNNATLLLPQGNSFYAMSAPSDGSV 360
Query: 332 VFCFAMQPID----GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTKQ 376
V C Q ++ G G+FG+F Q ++ + YD + + + F+ DC +
Sbjct: 361 VQCLLFQNMEDGNYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDCVLE 409
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 105/385 (27%), Positives = 170/385 (44%), Gaps = 50/385 (12%)
Query: 19 TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
T G Y + +GTPP Y VDTGSD++WV C+ C QC + +Y+P +SS
Sbjct: 83 TDTGLYYTEVRLGTPPKR-FYVQVDTGSDILWVNCITCDQCPHKSGLGLDLTLYDPKASS 141
Query: 74 SYKELSCQSEQCHLLDTV-----SCSSQQLCNYTYGYADSSLTKGVLATERITF------ 122
+ + C C DT CS+ C Y+ Y D S T G + + F
Sbjct: 142 TGSTVMCDQGFC--ADTFGGRLPKCSANVPCEYSVTYGDGSSTVGSFVNDALQFDQVTGD 199
Query: 123 GNSNNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCL 178
G + +V+FGCG G + G++G G S+ SQ+ + K F++CL
Sbjct: 200 GQTQPANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCL 259
Query: 179 VPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKL 236
+I F G +V V +T LV+ DK +Y V L+ I VG L + +
Sbjct: 260 ------DTIKGGGIFAIG-DVVQPKVKTTPLVA--DKPHYNVNLKTIDVGGTTLELPADI 310
Query: 237 IPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK- 295
G I ID+G T LP+ + ++ V N + + D + LC++
Sbjct: 311 FKPGEKRGTI------IDSGTTLTYLPELVFKKVMLAVFNKHQDITFHDVQ--DFLCFEY 362
Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-DVGIFGN 349
+ S+ P LT HF+ + + F P + V+C A+Q DG D+ + G+
Sbjct: 363 SGSVDDGFPTLTFHFEDDLALHVYPHEYFFPNGND-VYCVGFQNGALQSKDGKDIVLMGD 421
Query: 350 FAQSDLFIGYDFDSQMVSFKPTDCT 374
S+ + YD +++++ + +C+
Sbjct: 422 LVLSNKLVVYDLENRVIGWTDYNCS 446
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 96/379 (25%), Positives = 165/379 (43%), Gaps = 36/379 (9%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK---------QVKPIYNPASS 72
G+Y + F +GTP + DTGSDL W+ C + + K +++ S
Sbjct: 81 GQYFVAFKVGTPSQ-KFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLS 139
Query: 73 SSYKELSCQSEQC--HLLDTVSCSS----QQLCNYTYGYADSSLTKGVLATERITF---G 123
SS+K + C ++ C L+D S ++ C Y Y Y+D S G A E +T
Sbjct: 140 SSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKE 199
Query: 124 NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHT 183
NV+ GC + G + G++GLG ++ S A + + G KFSYCLV +
Sbjct: 200 GRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGG-KFSYCLVDHLS 258
Query: 184 DSSITSKMYFG--NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP--Y 239
++++ + FG E + T LV ++Y V + GIS+G + IP
Sbjct: 259 HKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIG---GAMLKIPSEV 315
Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPS 298
++ GA G +D+G+ T L + Y + +R ++ + +G + C+ +
Sbjct: 316 WDVKGA---GGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTG 372
Query: 299 M-AGIAPILTAHFDGGAKV-PLIHTSTFIPPPVEGVFCFAMQPIDG-DVGIFGNFAQSDL 355
+ P L HF GA+ P + +++ +GV C + + GN Q +
Sbjct: 373 FEESLVPRLVFHFADGAEFEPPV--KSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNH 430
Query: 356 FIGYDFDSQMVSFKPTDCT 374
+D + + F P+ CT
Sbjct: 431 LWEFDLGLKKLGFAPSSCT 449
>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
Length = 538
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 101/393 (25%), Positives = 159/393 (40%), Gaps = 50/393 (12%)
Query: 22 GEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQCYK------------------- 61
G Y++ GTP L Y +V DT +DL W+ C + K
Sbjct: 125 GMYLVSVRFGTPAL--PYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAAKE 182
Query: 62 -QVKPIYNPASSSSYKELSCQSEQCHLLDTVSC---SSQQLCNYTYGYADSSLTKGVLAT 117
+ K Y PA SSS++ + C ++C LL +C S + C+Y D +LT G+
Sbjct: 183 ARRKNWYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQDGTLTMGIYGK 242
Query: 118 ERITFGNSNNFFDN---VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKF 174
E+ T S+ ++ GC G + G++ LG +S A + G +F
Sbjct: 243 EKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVHAAKRFG-QRF 301
Query: 175 SYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNS 233
S+CL+ ++ +S + FG V G G + T +V D K Y + GI VG
Sbjct: 302 SFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGG---- 357
Query: 234 SKL-IP--YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS 290
+L IP +++ + G + +DT T L + Y + + + P G
Sbjct: 358 ERLDIPQEIWDAEKVVG-GGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYELDGF 416
Query: 291 QLCYKTPSMAG---------IAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPI- 340
+ CY+ + AG P LT GGA++ S +P V GV C A + +
Sbjct: 417 EYCYRW-TFAGDGVDLAHNVTVPRLTVEMAGGARLEPEAKSVVMPEVVPGVACLAFRKLP 475
Query: 341 DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
G GI GN + D + F+ C
Sbjct: 476 RGGPGILGNVLMQEYIWEIDHGKGKMRFRKDKC 508
>gi|242076594|ref|XP_002448233.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
gi|241939416|gb|EES12561.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
Length = 508
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 115/427 (26%), Positives = 157/427 (36%), Gaps = 93/427 (21%)
Query: 23 EYVMKFSIG-TPPLLDIYGIVDTGSDLMWVQCLP--CVQCYKQ----------------- 62
+Y + S+G + +DTGSDL+W C P C+ C +
Sbjct: 93 DYTLSLSVGPASAAAPVSLFLDTGSDLVWFPCAPFTCMLCEGKPTPSGGHSSSAPLPLPP 152
Query: 63 ---------VKPIYNPASSSSYKELSCQSEQCHLLD--TVSC--SSQQLCNYTYGYADSS 109
P+ + A +S+ C + C L D T SC +S Y Y D S
Sbjct: 153 PPDSRRVPCASPLCSAAHASAPPSDLCAAAGCPLEDIETGSCRGASHACPPLYYAYGDGS 212
Query: 110 LTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQL 169
L L R+ G S DN F C H G +G+ G GR LSL Q+ QL
Sbjct: 213 LVA-HLRRGRVGLGASVA-VDNFTFACAHTALG----EPVGVAGFGRGPLSLPGQLAPQL 266
Query: 170 GANKFSYCLV--PFHTDSSIT-SKMYFGNGSEVSG--GGVVSTSLVSKEDKTYYF-VTLE 223
+ +FSYCLV F D I S + G + + GG V T L+ Y++ V LE
Sbjct: 267 -SGRFSYCLVSHSFRADRLIRPSPLILGRSPDAAAETGGFVYTPLLHNPKHPYFYSVALE 325
Query: 224 GISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRL------------- 270
+SVG ++ P G M +D+G T+LP + Y R+
Sbjct: 326 AVSVGATRIQAR--PELARVDRAGNGGMVVDSGTTFTMLPNETYARVAEAFARAMAAAGF 383
Query: 271 --EEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPP 328
E+ LTP CY + P L HF G A V L + F+
Sbjct: 384 ARAERAEEQTGLTP----------CYHYAASDRGVPPLALHFRGNATVALPRRNYFMGFK 433
Query: 329 VE----------GVFCFAMQ----------PIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
E V C + DG G GNF Q + YD D+ V F
Sbjct: 434 SEEEAGGAGRKDDVGCLMLMNGGDVSGEDGGDDGPAGTLGNFQQQGFEVVYDVDAGRVGF 493
Query: 369 KPTDCTK 375
CT+
Sbjct: 494 ARRRCTE 500
>gi|383165464|gb|AFG65606.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
gi|383165470|gb|AFG65612.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
Length = 136
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 57/137 (41%), Positives = 75/137 (54%), Gaps = 4/137 (2%)
Query: 61 KQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERI 120
KQ PIY+PA SS+Y ++SC+S C+ L C S C Y Y Y D S+T G+L+ E +
Sbjct: 1 KQPTPIYDPARSSTYSKVSCKSLLCNALPDFECKSTAGCEYQYTYGDFSITVGILSYETL 60
Query: 121 TF---GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
T + FGCG NN G + G+VGLGR LSL SQ+ + + KFSYC
Sbjct: 61 TLTSKSGAEQLIPKFAFGCGQNNEGNGFDQGAGIVGLGRGPLSLISQLSASM-PKKFSYC 119
Query: 178 LVPFHTDSSITSKMYFG 194
L+ S TS + FG
Sbjct: 120 LMTIDDSQSKTSPLMFG 136
>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
gi|194703964|gb|ACF86066.1| unknown [Zea mays]
gi|219886221|gb|ACL53485.1| unknown [Zea mays]
gi|219886359|gb|ACL53554.1| unknown [Zea mays]
gi|223950085|gb|ACN29126.1| unknown [Zea mays]
gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 431
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 96/335 (28%), Positives = 143/335 (42%), Gaps = 42/335 (12%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
YV++ +GTP + + +DT +D W C PC C + + PASSSSY L C S
Sbjct: 78 SYVVRAGLGTP-VQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCAS 134
Query: 83 EQCHLLDTVSCSSQQ-------LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
+ C L + C + Q C ++ +AD+S + L ++ + G + FG
Sbjct: 135 DWCPLFEGQPCPANQDASAPLPACAFSKPFADTSF-QASLGSDTLRLG--KDAIAGYAFG 191
Query: 136 C-GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
C G N + GL+GLGR +SL SQ S+ FSYCL + + YF
Sbjct: 192 CVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYN-GVFSYCLPSYRS-------YYFS 243
Query: 195 NGSEVSGGG----VVSTSLVSKEDK-TYYFVTLEGISVGNL-----SNSSKLIPYYNSSG 244
+ G V T L++ + + Y+V + G+SVG + S P +
Sbjct: 244 GSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGT 303
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSM-AGI 302
I G + AP Y L E+ R + P LG+ C+ T + AG
Sbjct: 304 VIDSGTVITRWTAP-------VYAALREEFRRQVA-APSGYTSLGAFDTCFNTDEVAAGG 355
Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM 337
AP +T H DGG + L +T I + C AM
Sbjct: 356 APPVTLHMDGGVDLTLPMENTLIHSSATPLACLAM 390
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 108/422 (25%), Positives = 169/422 (40%), Gaps = 76/422 (18%)
Query: 13 VQSNVSTANGEYVMKFSIGTP--PLLDIYGIVDTGSDLMWVQC----------------L 54
+ S T G+Y ++F +GTP P L + DTGSDL WV+C L
Sbjct: 76 LSSGAYTGTGQYFVRFRVGTPAQPFLLV---ADTGSDLTWVKCHRAAAAASASPRNASSL 132
Query: 55 PCVQCYKQVKPIYNPASSSSYKELSCQSEQCHL---LDTVSCSS-QQLCNYTYGYADSSL 110
P + + P S ++ + C S C +C++ C Y Y Y D S
Sbjct: 133 P-APAPASPRRTFRPDKSRTWAPIPCSSATCRESLPFSLAACATPANPCAYDYRYKDGSA 191
Query: 111 TKGVLATERITFGNSNNF-----FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQI 165
+G + + T S VV GC + G G++ LG + +S AS+
Sbjct: 192 ARGTVGVDSATIALSGRAARKAKLRGVVLGCTTSYNGQSFLASDGVLSLGYSNISFASRA 251
Query: 166 LSQLGANKFSYCLVPFHTDSSITSKMYFGN----GSEVSGGGVVS--------------- 206
S+ G +FSYCLV + TS + FG S G+ S
Sbjct: 252 ASRFG-GRFSYCLVDHLAPRNATSYLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAP 310
Query: 207 ----TSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK-GNMFIDTGAPPT 260
T LV + +Y VT++G+SV +L+ + + + G +D+G T
Sbjct: 311 GARQTPLVLDHRTRPFYAVTVKGVSVAG-----ELLKIPRAVWDVEQGGGAILDSGTSLT 365
Query: 261 LLPKDFYNRLEEQVRNAIKLTPY--QDPRLGSQLCYK--TPSMAGIA---PILTAHFDGG 313
+L K Y + + + P DP CY +PS + +A P+L HF G
Sbjct: 366 MLAKPAYRAVVAALSKRLAGLPRVTMDP---FDYCYNWTSPSGSDVAAPLPMLAVHFAGS 422
Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQ--PIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
A++ S ++ GV C +Q P G + + GN Q + YD ++ + FK +
Sbjct: 423 ARLEPPAKS-YVIDAAPGVKCIGLQEGPWPG-LSVIGNILQQEHLWEYDLKNRRLRFKRS 480
Query: 372 DC 373
C
Sbjct: 481 RC 482
>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
gi|194690728|gb|ACF79448.1| unknown [Zea mays]
Length = 431
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 96/335 (28%), Positives = 143/335 (42%), Gaps = 42/335 (12%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
YV++ +GTP + + +DT +D W C PC C + + PASSSSY L C S
Sbjct: 78 SYVVRAGLGTP-VQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCAS 134
Query: 83 EQCHLLDTVSCSSQQ-------LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
+ C L + C + Q C ++ +AD+S + L ++ + G + FG
Sbjct: 135 DWCPLFEGQPCPANQDASAPLPACAFSKPFADTSF-QASLGSDTLRLG--KDAIAGYAFG 191
Query: 136 C-GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
C G N + GL+GLGR +SL SQ S+ FSYCL + + YF
Sbjct: 192 CVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYN-GVFSYCLPSYRS-------YYFS 243
Query: 195 NGSEVSGGG----VVSTSLVSKEDK-TYYFVTLEGISVGNL-----SNSSKLIPYYNSSG 244
+ G V T L++ + + Y+V + G+SVG + S P +
Sbjct: 244 GSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGT 303
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSM-AGI 302
I G + AP Y L E+ R + P LG+ C+ T + AG
Sbjct: 304 VIDSGTVITRWTAP-------VYAALREEFRRQVA-APSGYTSLGAFDTCFNTDEVAAGG 355
Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM 337
AP +T H DGG + L +T I + C AM
Sbjct: 356 APPVTLHMDGGVDLTLPMENTLIHSSATPLACLAM 390
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 96/379 (25%), Positives = 165/379 (43%), Gaps = 36/379 (9%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK---------QVKPIYNPASS 72
G+Y + F +GTP + DTGSDL W+ C + + K +++ S
Sbjct: 10 GQYSVAFKVGTPSQ-KFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLS 68
Query: 73 SSYKELSCQSEQC--HLLDTVSCSS----QQLCNYTYGYADSSLTKGVLATERITF---G 123
SS+K + C ++ C L+D S ++ C Y Y Y+D S G A E +T
Sbjct: 69 SSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKE 128
Query: 124 NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHT 183
NV+ GC + G + G++GLG ++ S A + + G KFSYCLV +
Sbjct: 129 GRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGG-KFSYCLVDHLS 187
Query: 184 DSSITSKMYFG--NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP--Y 239
++++ + FG E + T LV ++Y V + GIS+G + IP
Sbjct: 188 HKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIG---GAMLKIPSEV 244
Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPS 298
++ GA G +D+G+ T L + Y + +R ++ + +G + C+ +
Sbjct: 245 WDVKGA---GGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTG 301
Query: 299 M-AGIAPILTAHFDGGAKV-PLIHTSTFIPPPVEGVFCFAMQPIDG-DVGIFGNFAQSDL 355
+ P L HF GA+ P + +++ +GV C + + GN Q +
Sbjct: 302 FEESLVPRLVFHFADGAEFEPPV--KSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNH 359
Query: 356 FIGYDFDSQMVSFKPTDCT 374
+D + + F P+ CT
Sbjct: 360 LWEFDLGLKKLGFAPSSCT 378
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 96/379 (25%), Positives = 165/379 (43%), Gaps = 36/379 (9%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK---------QVKPIYNPASS 72
G+Y + F +GTP + DTGSDL W+ C + + K +++ S
Sbjct: 81 GQYSVAFKVGTPSQ-KFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLS 139
Query: 73 SSYKELSCQSEQC--HLLDTVSCSS----QQLCNYTYGYADSSLTKGVLATERITF---G 123
SS+K + C ++ C L+D S ++ C Y Y Y+D S G A E +T
Sbjct: 140 SSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKE 199
Query: 124 NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHT 183
NV+ GC + G + G++GLG ++ S A + + G KFSYCLV +
Sbjct: 200 GRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGG-KFSYCLVDHLS 258
Query: 184 DSSITSKMYFG--NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP--Y 239
++++ + FG E + T LV ++Y V + GIS+G + IP
Sbjct: 259 HKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIG---GAMLKIPSEV 315
Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPS 298
++ GA G +D+G+ T L + Y + +R ++ + +G + C+ +
Sbjct: 316 WDVKGA---GGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTG 372
Query: 299 M-AGIAPILTAHFDGGAKV-PLIHTSTFIPPPVEGVFCFAMQPIDG-DVGIFGNFAQSDL 355
+ P L HF GA+ P + +++ +GV C + + GN Q +
Sbjct: 373 FEESLVPRLVFHFADGAEFEPPV--KSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNH 430
Query: 356 FIGYDFDSQMVSFKPTDCT 374
+D + + F P+ CT
Sbjct: 431 LWEFDLGLKKLGFAPSSCT 449
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 101/378 (26%), Positives = 165/378 (43%), Gaps = 25/378 (6%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK--PIYNPASS 72
S + +Y + +GTP +VDTGS+L WV C + +VK ++ S
Sbjct: 79 SGIDYGTAQYFTEVRVGTPAK-KFRVVVDTGSELTWVNCRYRGRGKGKVKNRRVFRAEES 137
Query: 73 SSYKELSCQSEQCH--LLDTVSCSS----QQLCNYTYGYADSSLTKGVLATERITFGNSN 126
S+K + C ++ C L++ S S+ C+Y Y YAD S +GV A E IT G +N
Sbjct: 138 KSFKTVGCFTQTCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTN 197
Query: 127 N---FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHT 183
++ GC + +G + G++GL + S S S GA K SYCLV +
Sbjct: 198 GRKARLRGLLVGCSSSFSGQSFQGADGVLGLAFSDFSFTSTATSLFGA-KLSYCLVDHLS 256
Query: 184 DSSITSKMYFGNGSEVSGGGVV---STSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
+ +I++ + FG S + +T L +Y + + GIS+G+ L
Sbjct: 257 NKNISNYLIFGYSSSSTSTKTAPGRTTPLDLTLIPPFYAINIIGISIGD----DMLDIPT 312
Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV-RNAIKLTPYQDPRLGSQLCYKTPS- 298
A + G +D+G TLL + Y + + R ++L + + + C+ + S
Sbjct: 313 QVWDATTGGGTILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTSG 372
Query: 299 -MAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFA-MQPIDGDVGIFGNFAQSDLF 356
P LT H GGA+ H +++ GV C M + GN Q +
Sbjct: 373 FNESKLPQLTFHLKGGARFE-PHRKSYLVDAAPGVKCLGFMSAGTPATNVVGNIMQQNYL 431
Query: 357 IGYDFDSQMVSFKPTDCT 374
+D + +SF P+ CT
Sbjct: 432 WEFDLMASTLSFAPSTCT 449
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 102/387 (26%), Positives = 167/387 (43%), Gaps = 41/387 (10%)
Query: 6 YFYPNNVVQ-SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK 64
Y +PN ++ + +NG Y + IGTPP + IVDTGS + +V C C C K
Sbjct: 69 YHHPNARMRLYDDLLSNGYYTTRLWIGTPPQ-EFALIVDTGSTVTYVPCSDCEHCGKHQD 127
Query: 65 PIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGN 124
P + P SS+Y + C + D V+C Y YA+ S + GVL + I+FGN
Sbjct: 128 PRFQPDESSTYHPVKCNMDCNCDHDGVNCV------YERRYAEMSSSSGVLGEDIISFGN 181
Query: 125 SNNFF-DNVVFGCGHNNTG-VFNENEMGLVGLGRTRLSLASQILSQLGAN-KFSYCLVPF 181
+ VFGC + TG ++++ G++GLGR +LS+ Q++ + N FS C
Sbjct: 182 QSEVVPQRAVFGCENVETGDLYSQRADGIMGLGRGQLSIVDQLVDKNVINDSFSLCY--- 238
Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLV-SKED---KTYYFVTLEGISVGNLSNSSKLI 237
M+ G G+ V GG +V S+ D YY + L+ I V KL
Sbjct: 239 -------GGMHVGGGAMVLGGIPPPPDMVFSRSDPYRSPYYNIELKEIHVAG--KPLKLS 289
Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQDPRLGSQLCY 294
P S K +D+G LP++ + + + + +K DP + +C+
Sbjct: 290 P----STFDRKHGTVLDSGTTYAYLPEEAFVAFRDAIIKKSHNLKQIHGPDPNY-NDICF 344
Query: 295 K-----TPSMAGIAPILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGDVGIFG 348
++ P + F G K+ L F V G +C + + G
Sbjct: 345 SGAGRDVSQLSKAFPEVDMVFSNGQKLSLTPENYLFQHTKVHGAYCLGIFRNGDSTTLLG 404
Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDCTK 375
+ + YD +++ + F T+C++
Sbjct: 405 GIIVRNTLVTYDRENEKIGFWKTNCSE 431
>gi|226500708|ref|NP_001149229.1| aspartic proteinase nepenthesin-2 [Zea mays]
gi|195625632|gb|ACG34646.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 104/402 (25%), Positives = 164/402 (40%), Gaps = 60/402 (14%)
Query: 8 YPNNVVQSNVSTANGEYVMKFSIGTP-PLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI 66
Y + + VS + Y++ F +G P +I +VDTGSD+ W C
Sbjct: 94 YSDGRHEGRVSIPDASYIITFYLGNQRPEDNISAVVDTGSDIFWTTEKEC---------- 143
Query: 67 YNPASSSSYKELSCQSEQCHLLDTVSCSSQQL---------CNYT--YGYADSSLTKGVL 115
+ S + L C S +C + C +L C Y YG + T GV+
Sbjct: 144 ---SRSKTRSMLPCCSPKCEQRASCGCGRSELKAEAEKETKCTYAIIYGGNANDSTAGVM 200
Query: 116 ATERITFGN-------SNNFFDNVVFGCGHNNTGVFNENEM-GLVGLGRTRLSLASQILS 167
+++T S+ F V GC + T F + + G+ GLGR+ A+ +
Sbjct: 201 YEDKLTIVAVASKAVPSSQSFKEVAIGCSTSATLKFKDPSIKGVFGLGRS----ATSLPR 256
Query: 168 QLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLV-------SKEDKTYYFV 220
QL +KFSYCL + + + S + +++ G V + V + + KT YFV
Sbjct: 257 QLNFSKFSYCLSSYQ-EPDLPSYLLLTAAPDMATGAVGGGAAVATTALQPNSDYKTLYFV 315
Query: 221 TLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKL 280
L+ IS+G + + S GNMF+DTGA T L + +L ++ +K
Sbjct: 316 HLQNISIGGTR--------FPAVSTKSGGNMFVDTGASFTRLEGTVFAKLVTELDRIMKE 367
Query: 281 TPY---QDPRLGSQLCYKTPSMAGIA----PILTAHFDGGAKVPLIHTSTFIPPPVEGVF 333
Y Q R Q+CY PS A P + HF A + L S +
Sbjct: 368 RKYVKEQPGRNNGQICYSPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKTTSKLCL 427
Query: 334 CFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
I G + + GNF + + D ++ +SF DC+K
Sbjct: 428 AIYKSNIKGGISVLGNFQMQNTHMLLDTGNEKLSFVRADCSK 469
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 102/375 (27%), Positives = 165/375 (44%), Gaps = 47/375 (12%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
NG Y + IGTPP + IVDTGS + +V C C QC + P + P SS+Y+ + C
Sbjct: 78 NGYYTTRLWIGTPPQM-FALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPDLSSTYQPVKC 136
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
LD + + C Y YA+ S + GVL + ++FGN + VFGC +
Sbjct: 137 T------LDCNCDNDRMQCVYERQYAEMSTSSGVLGEDVVSFGNQSELAPQRAVFGCENV 190
Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
TG +++++ G++GLGR LS+ Q++ + + ++ FS C M G G+
Sbjct: 191 ETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCY----------GGMDVGGGA 240
Query: 198 EVSGGGVVSTSLV-SKEDKT---YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
V GG + +V ++ D YY + L+ I V + K +P N S K +
Sbjct: 241 MVLGGISPPSDMVFAQSDPVRSPYYNIDLKEIHV-----AGKRLP-LNPSVFDGKHGSVL 294
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIK---LTPYQDPRLGSQLCYKTPSMAGIA------- 303
D+G LP++ + +E + ++ DP + LC+ S AGI
Sbjct: 295 DSGTTYAYLPEEAFLAFKEAIVKELQSFSQISGPDPNY-NDLCF---SGAGIDVSQLSKT 350
Query: 304 -PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYD 360
P++ F G K L F V G +C + Q + G + + YD
Sbjct: 351 FPVVDMIFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYD 410
Query: 361 FDSQMVSFKPTDCTK 375
+ + F T+C +
Sbjct: 411 REQTKIGFWKTNCAE 425
>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
Length = 416
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 113/407 (27%), Positives = 171/407 (42%), Gaps = 71/407 (17%)
Query: 24 YVMKFSIGTPP-LLDIYGIVDTGSDLMWVQC----LPCVQC--YKQVK------------ 64
Y++ +IGTPP ++ +Y +DTGSDL WV C C+ C Y+ K
Sbjct: 12 YLISLNIGTPPQVIQVY--MDTGSDLTWVPCGNLSFDCMDCDDYRNSKLMSAFSPSHSSS 69
Query: 65 --------PIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLC-NYTYGYADSSLTKGVL 115
P SS C C L + + + C ++ Y Y + G L
Sbjct: 70 SYRDSCASPYCTDIHSSDNSFDPCTVAGCSLSTLIKATCARPCPSFAYTYGAGGVVTGTL 129
Query: 116 A--TERITFGNSNNFFD--NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGA 171
T R+ G + D FGC G +G+ G R LS SQ+ L
Sbjct: 130 TRDTLRVHEGPARVTKDIPKFCFGC----VGSTYHEPIGIAGFVRGTLSFPSQL--GLLK 183
Query: 172 NKFSYCLVPFH--TDSSITSKMYFGNGSEVSGGGVVSTSLV-SKEDKTYYFVTLEGISVG 228
FS+C + F + +I+S + G+ + S + T ++ S YY++ LE I+VG
Sbjct: 184 KGFSHCFLAFKYANNPNISSPLVIGDTALSSKDNMQFTPMLKSPMYPNYYYIGLEAITVG 243
Query: 229 NLSNSSKLIPY----YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ 284
N+S ++ +P ++S G G M ID+G T LP+ FY++L + AI P
Sbjct: 244 NVSATT--VPLNLREFDSQG---NGGMLIDSGTTYTHLPEPFYSQLLS-IFKAIITYPRA 297
Query: 285 ---DPRLGSQLCYKTP-------SMAGIAPILTAHFDGGAKVPLIHTSTF----IPPPVE 330
+ R G LCYK P + P +T HF L + F P
Sbjct: 298 TEVEMRAGFDLCYKVPCPNNRLTDDDNLFPSITFHFLNNVSFVLPQGNHFYAMSAPSNST 357
Query: 331 GVFCFAMQPID----GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
V C Q + G G+FG+F Q ++ I YD + + + F+P DC
Sbjct: 358 VVKCLLFQSMADSDYGPAGVFGSFQQQNVQIVYDLEKERIGFQPMDC 404
>gi|356563324|ref|XP_003549914.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 480
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 115/413 (27%), Positives = 171/413 (41%), Gaps = 73/413 (17%)
Query: 23 EYVMKFSIGTPPLLD-IYGIVDTGSDLMWVQCLP--CVQCYKQ-----VKPIYN------ 68
+Y + F++G I +DTGSDL+W C P C+ C + P N
Sbjct: 69 DYTLSFNLGPQAQAQPITLYMDTGSDLVWFPCAPFKCILCEGKPNEPNASPPTNITQSVA 128
Query: 69 -----PASSSSYKELS----CQSEQCHL--LDTVSCSSQQLCNYTYGYADSSLTKGVLAT 117
PA S+++ C + +C L ++T C++ + + Y Y D SL +
Sbjct: 129 VSCKSPACSAAHNLAPPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSL---IARL 185
Query: 118 ERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILS---QLGANKF 174
R T S+ F N FGC H G+ G GR LSL +Q+ + QLG N+F
Sbjct: 186 YRDTLSLSSLFLRNFTFGCAHTTLA----EPTGVAGFGRGLLSLPAQLATLSPQLG-NRF 240
Query: 175 SYCLVPFHTDSSITSK-------MYFGNGSEVSGGGV---VSTSLVSKEDKTYYF-VTLE 223
SYCLV DS K Y E GGGV V TS++ Y++ V+L
Sbjct: 241 SYCLVSHSFDSERVRKPSPLILGRYEEKEKEKIGGGVAEFVYTSMLENPKHPYFYTVSLI 300
Query: 224 GISVGNLS-NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYN----RLEEQVRNAI 278
GI+VG + + +++ N+ G G + +D+G T+LP FYN + +V
Sbjct: 301 GIAVGKRTIPAPEMLRRVNNRG---DGGVVVDSGTTFTMLPAGFYNSVVDEFDRRVGRDN 357
Query: 279 KLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGG--AKVPLIHTSTFI--------PPP 328
K + + G CY S+A + P LT F GG + V L + F
Sbjct: 358 KRARKIEEKTGLAPCYYLNSVADV-PALTLRFAGGKNSSVVLPRKNYFYEFSDGSDGAKG 416
Query: 329 VEGVFCFAMQ------PIDGDVG-IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
V C + + G G GN+ Q + YD + + V F C
Sbjct: 417 KRKVGCLMLMNGGDEADLSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQCA 469
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 102/377 (27%), Positives = 168/377 (44%), Gaps = 39/377 (10%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK---PI--YNPASSSSYK 76
G Y + +G+PP + Y +DTGSD++WV C C C + P+ ++P SSS+
Sbjct: 66 GLYFTRVLLGSPPK-EFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTAS 124
Query: 77 ELSCQSEQCHL---LDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITF----GNS-NN 127
+SC ++C L CSSQ C YT+ Y D S T G ++ + F G+S N
Sbjct: 125 LISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTN 184
Query: 128 FFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHT 183
++VFGC + TG +++ G+ G G+ +S+ SQ+ SQ + FS+CL
Sbjct: 185 SSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGG 244
Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYN 241
I E+ +V + LV + +Y + L+ ISV +L+ ++
Sbjct: 245 GGGILVLG------EIVEEDIVYSPLVPSQ--PHYNLNLQSISVNGKSLAIDPEVFATST 296
Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
+ G I +D+G L ++ Y+ + A+ + G+Q T S+ G
Sbjct: 297 NRGTI------VDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQCYLITSSVKG 350
Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD-VGIFGNFAQSDLFI 357
I P ++ +F GG + L + G V+C Q I G + I G+ D
Sbjct: 351 IFPTVSLNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIF 410
Query: 358 GYDFDSQMVSFKPTDCT 374
YD Q + + DC+
Sbjct: 411 VYDLAGQRIGWANYDCS 427
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 107/390 (27%), Positives = 162/390 (41%), Gaps = 58/390 (14%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
N + ++GTPP ++ ++DTGS+L W+ C Q +NP SSSY + C
Sbjct: 70 NISLTVSLTVGTPPQ-NVTMVIDTGSELSWLHC-NTSQNSSSSSSTFNPVWSSSYSPIPC 127
Query: 81 QSEQC-----HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
S C SC S Q C+ T YAD+S ++G LAT+ G+S NVVFG
Sbjct: 128 SSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSG--IPNVVFG 185
Query: 136 CGHNNTGVFNENE------MGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
C + +F+ N GL+G+ R LS +SQ+G KFSYC+ ++ +
Sbjct: 186 CMDS---IFSSNSEEDSKNTGLMGMNRGSLSF----VSQMGFPKFSYCI----SEYDFSG 234
Query: 190 KMYFGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNSSKLIPYYNS- 242
+ G+ + + T L+ D+ Y V LEGI V + KL+P S
Sbjct: 235 LLLLGDANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAH-----KLLPIPESV 289
Query: 243 --SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRN--AIKLTPYQDPRLGSQ----LCY 294
G +D+G T L Y L + N A L Y+D Q LCY
Sbjct: 290 FEPDHTGAGQTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCY 349
Query: 295 KTPSMAGIAPIL---TAHFDGGAKVPLIHTSTFIPPPVE-----GVFCFAMQPID---GD 343
+ P+ P L T F GA++ + P E + CF D +
Sbjct: 350 RVPTNQTRLPPLPSVTLVFR-GAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVE 408
Query: 344 VGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ G+ Q ++++ +D + C
Sbjct: 409 AFVIGHLHQQNVWMEFDLKKSRIGLAEIRC 438
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 102/377 (27%), Positives = 168/377 (44%), Gaps = 39/377 (10%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK---PI--YNPASSSSYK 76
G Y + +G+PP + Y +DTGSD++WV C C C + P+ ++P SSS+
Sbjct: 81 GLYFTRVLLGSPPK-EFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTAS 139
Query: 77 ELSCQSEQCHL---LDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITF----GNS-NN 127
+SC ++C L CSSQ C YT+ Y D S T G ++ + F G+S N
Sbjct: 140 LISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTN 199
Query: 128 FFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHT 183
++VFGC + TG +++ G+ G G+ +S+ SQ+ SQ + FS+CL
Sbjct: 200 SSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGG 259
Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYN 241
I E+ +V + LV + +Y + L+ ISV +L+ ++
Sbjct: 260 GGGILVLG------EIVEEDIVYSPLVPSQ--PHYNLNLQSISVNGKSLAIDPEVFATST 311
Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
+ G I +D+G L ++ Y+ + A+ + G+Q T S+ G
Sbjct: 312 NRGTI------VDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQCYLITSSVKG 365
Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD-VGIFGNFAQSDLFI 357
I P ++ +F GG + L + G V+C Q I G + I G+ D
Sbjct: 366 IFPTVSLNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIF 425
Query: 358 GYDFDSQMVSFKPTDCT 374
YD Q + + DC+
Sbjct: 426 VYDLAGQRIGWANYDCS 442
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 102/383 (26%), Positives = 163/383 (42%), Gaps = 46/383 (12%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI-----YNPA 70
N+ G Y IGTP + Y +DTGS WV + C QC + + Y+P
Sbjct: 75 NIPYGTGLYYTDIGIGTPAV-KYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPR 133
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF------GN 124
SS S KE+ C C C+ C Y GYAD LT G+L T+ + + G
Sbjct: 134 SSVSSKEVKCDDTIC--TSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQ 191
Query: 125 SNNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK--FSYCLV 179
+ +V FGCG +G N + + G++G G + + SQ L+ G K FS+CL
Sbjct: 192 TQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQ-LAAAGKTKKIFSHCL- 249
Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPY 239
S F G V V T+ + K ++ Y+ V L+ I N++ ++ +P
Sbjct: 250 -----DSTNGGGIFAIGEVVEPK--VKTTPIVKNNEVYHLVNLKSI---NVAGTTLQLP- 298
Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM 299
N G FID+G+ LP+ Y+ L V + D +G+ ++
Sbjct: 299 ANIFGTTKTKGTFIDSGSTLVYLPEIIYSELILAV-----FAKHPDITMGAMYNFQCFHF 353
Query: 300 AGIA----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP--IDG--DVGIFGNFA 351
G P +T HF+ + ++ ++ +CF Q I G D+ I G+
Sbjct: 354 LGSVDDKFPKITFHFENDLTLD-VYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMV 412
Query: 352 QSDLFIGYDFDSQMVSFKPTDCT 374
S+ + YD + Q + + +C+
Sbjct: 413 ISNKVVVYDMEKQAIGWTEHNCS 435
>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 103/407 (25%), Positives = 168/407 (41%), Gaps = 59/407 (14%)
Query: 13 VQSNVSTANGEYVMKFSIGTP--PLLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKP---I 66
+ S T G+Y ++F +GTP P L + DTGSDL WV+C P +
Sbjct: 83 LTSGAYTGIGQYFVRFRVGTPAQPFLLV---ADTGSDLTWVKCRRPAANSSESGSGSGRA 139
Query: 67 YNPASSSSYKELSCQSEQCHL---LDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITF 122
+ P S ++ +SC S+ C +C + C Y Y Y D S +G + TE T
Sbjct: 140 FRPEDSRTWAPISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATI 199
Query: 123 GNSNN-------FFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFS 175
S +V GC + TG E G++ LG + +S AS S+ A +FS
Sbjct: 200 ALSGRGREERKAKLKGLVLGCTSSYTGPSFEVSDGVLSLGYSDVSFASHAASRF-AGRFS 258
Query: 176 YCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTS---------------------LVSKED 214
YCLV + + TS + FG V+ S+ L+ +
Sbjct: 259 YCLVDHLSPRNATSYLTFGPNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRM 318
Query: 215 KTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQ 273
+ +Y V ++ +SV G + + ++ G + +D+G T+L K Y +
Sbjct: 319 RPFYDVAVKAVSVAGQFLKIPRAVWDVDAGGGV-----ILDSGTSLTVLAKPAYRAVVAA 373
Query: 274 VRNAIKLTPY--QDPRLGSQLCYKTPSMAG--IAPILTAHFDGGAKVPLIHTSTFIPPPV 329
+ + P DP + CY S +G P + HF G A++ S ++
Sbjct: 374 LSEGLAGLPRVTMDP---FEYCYNWTSPSGDVTLPKMAVHFAGAARLEPPGKS-YVIDAA 429
Query: 330 EGVFCFAMQ--PIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
GV C +Q P G + + GN Q + +D ++ + F+ + CT
Sbjct: 430 PGVKCIGLQEGPWPG-ISVIGNILQQEHLWEFDIKNRRLKFQRSRCT 475
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 169/381 (44%), Gaps = 42/381 (11%)
Query: 19 TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
T G Y + +GTPP Y VDTGSD++WV C+ C QC + +Y+P +SS
Sbjct: 81 TDTGLYYTEIKLGTPPK-HYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLYDPKASS 139
Query: 74 SYKELSCQSEQCHLL---DTVSCSSQQLCNYTYGYADSSLTKGVLATERITF------GN 124
+ + C C C + C Y+ Y D S T G T+ + F G
Sbjct: 140 TGSMVMCDQAFCAATFGGKLPKCGANVPCEYSVTYGDGSSTIGSFVTDALQFDQVTRDGQ 199
Query: 125 SNNFFDNVVFGCGHNNTGVF---NENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVP 180
+ +V+FGCG G N+ G++G G S+ SQ+ + K F++CL
Sbjct: 200 TQPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCL-- 257
Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
+I F G +V V +T LV+ DK +Y V L+ I VG ++ +P +
Sbjct: 258 ----DTIKGGGIFSIG-DVVQPKVKTTPLVA--DKPHYNVNLKTIDVG---GTTLQLPAH 307
Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP-SM 299
KG + ID+G T LP+ + + V N + + D + LC++ P S+
Sbjct: 308 IFEPGEKKGTI-IDSGTTLTYLPELVFKEVMLAVFNKHQDITFHDVQ--GFLCFQYPGSV 364
Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-DVGIFGNFAQS 353
P +T HF+ + + F + V+C A Q DG D+ + G+ S
Sbjct: 365 DDGFPTITFHFEDDLALHVYPHEYFFANGND-VYCVGFQNGASQSKDGKDIVLMGDLVLS 423
Query: 354 DLFIGYDFDSQMVSFKPTDCT 374
+ + YD +++++ + +C+
Sbjct: 424 NKLVIYDLENRVIGWTDYNCS 444
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 101/398 (25%), Positives = 173/398 (43%), Gaps = 58/398 (14%)
Query: 11 NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----P 65
N+ + + T G Y K +G+PP D Y VDTGSD++WV C+ C +C ++
Sbjct: 57 NLGGNGLPTETGLYFTKLGLGSPPK-DYYVQVDTGSDILWVNCVKCSRCPRKSDLGIDLT 115
Query: 66 IYNPASSSSYKELSCQSEQCHLL---DTVSCSSQQLCNYTYGYADSSLTKGVLATERITF 122
+Y+P S + + +SC E C C S+ C Y+ Y D S T G + +T+
Sbjct: 116 LYDPKGSETSELISCDQEFCSATYDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTY 175
Query: 123 GNSNNFF------DNVVFGCGHNNTGVFN----ENEMGLVGLGRTRLSLASQILSQLGAN 172
+ N+ +++FGCG +G + E G++G G++ S+ SQ+ +
Sbjct: 176 NHVNDNLRTAPQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVK 235
Query: 173 K-FSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN-- 229
K FS+CL +I F G EV V +T LV + +Y V L+ I V
Sbjct: 236 KIFSHCL------DNIRGGGIFAIG-EVVEPKVSTTPLVPR--MAHYNVVLKSIEVDTDI 286
Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
L S + N G I ID+G LP Y+ L +V + PRL
Sbjct: 287 LQLPSDIFDSGNGKGTI------IDSGTTLAYLPAIVYDELIPKVMA-------RQPRLK 333
Query: 290 SQL------CYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG 342
L C++ T ++ P++ HF+ + ++ ++ +G++C Q
Sbjct: 334 LYLVEQQFSCFQYTGNVDRGFPVVKLHFEDSLSLT-VYPHDYLFQFKDGIWCIGWQKSVA 392
Query: 343 ------DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
D+ + G+ S+ + YD ++ + + +C+
Sbjct: 393 QTKNGKDMTLLGDLVLSNKLVIYDLENMAIGWTDYNCS 430
>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 431
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 96/335 (28%), Positives = 142/335 (42%), Gaps = 42/335 (12%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
YV++ +GTP + + +DT +D W C PC C + + PASSSSY L C S
Sbjct: 78 SYVVRAGLGTP-VQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCAS 134
Query: 83 EQCHLLDTVSCSSQQ-------LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
+ C L + C + Q C ++ +AD+S + L ++ + G + FG
Sbjct: 135 DWCPLFEGQPCPANQDASAPLPACAFSKPFADTSF-QASLGSDTLRLG--KDAIAGYAFG 191
Query: 136 C-GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
C G N + GL+GLGR +SL SQ S FSYCL + + YF
Sbjct: 192 CVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYN-GVFSYCLPSYRS-------YYFS 243
Query: 195 NGSEVSGGG----VVSTSLVSKEDK-TYYFVTLEGISVGNL-----SNSSKLIPYYNSSG 244
+ G V T L++ + + Y+V + G+SVG + S P +
Sbjct: 244 GSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGT 303
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSM-AGI 302
I G + AP Y L E+ R + P LG+ C+ T + AG
Sbjct: 304 VIDSGTVITRWTAP-------VYAALREEFRRQVA-APSGYTSLGAFDTCFNTDEVAAGG 355
Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM 337
AP +T H DGG + L +T I + C AM
Sbjct: 356 APPVTLHMDGGVDLTLPMENTLIHSSATPLACLAM 390
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 107/386 (27%), Positives = 168/386 (43%), Gaps = 58/386 (15%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
N + ++G+PP + ++DTGS+L W+ C + + ++NP SSSSY + C
Sbjct: 997 NVTLTVSLTVGSPPQ-QVTMVLDTGSELSWLHC----KKSPNLTSVFNPLSSSSYSPIPC 1051
Query: 81 QSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
S C L + V+C ++LC+ YAD+S +G LA++ G+S +FG
Sbjct: 1052 SSPICRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSSA--LPGTLFG 1109
Query: 136 C---GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
C G ++ + GL+G+ R LS ++QLG KFSYC+ DSS +
Sbjct: 1110 CMDSGFSSNSEEDAKTTGLMGMNRGSLSF----VTQLGLPKFSYCIS--GRDSS--GVLL 1161
Query: 193 FGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA- 245
FG+ G + T LV D+ Y V L+GI VGN K++P S A
Sbjct: 1162 FGDLHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGN-----KILPLPKSIFAP 1216
Query: 246 --ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCY--- 294
G +D+G T L Y L + K L P DP Q LCY
Sbjct: 1217 DHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYSVA 1276
Query: 295 ---KTPSMAGIAPILT-AHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG---DVGIF 347
K P++ ++ + A G +V L + E V+C D + +
Sbjct: 1277 AGGKLPTLPSVSLMFRGAEMVVGGEVLLYRVPEMMKGN-EWVYCLTFGNSDLLGIEAFVI 1335
Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDC 373
G+ Q ++++ +D +V+F C
Sbjct: 1336 GHHHQQNVWMEFD----LVAFAADLC 1357
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/321 (28%), Positives = 152/321 (47%), Gaps = 42/321 (13%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK----QVK-PIYNPASSSSYK 76
G Y K +GTPP+ + +DTGSD++WV C C C + Q++ ++P SSS+
Sbjct: 23 GLYYTKVQLGTPPV-EFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSS 81
Query: 77 ELSCQSEQCH---LLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN- 131
++C ++C+ +CSSQ C+YT+ Y D S T G ++ + N F+
Sbjct: 82 MIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHL---NTIFEGS 138
Query: 132 --------VVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLV 179
VVFGC + TG +++ G+ G G+ +S+ SQ+ SQ + FS+CL
Sbjct: 139 VTTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCL- 197
Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLI 237
DSS + G E+ +V TSLV + +Y + L+ I+V L S +
Sbjct: 198 --KGDSSGGGILVLG---EIVEPNIVYTSLVPAQ--PHYNLNLQSIAVNGQTLQIDSSVF 250
Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP 297
NS G I +D+G L ++ Y+ + +I + + G+Q T
Sbjct: 251 ATSNSRGTI------VDSGTTLAYLAEEAYDPFVSAITASIPQSVHTAVSRGNQCYLITS 304
Query: 298 SMAGIAPILTAHFDGGAKVPL 318
S+ + P ++ +F GGA + L
Sbjct: 305 SVTEVFPQVSLNFAGGASMIL 325
>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
Length = 499
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 108/409 (26%), Positives = 163/409 (39%), Gaps = 70/409 (17%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP--CVQCYKQVKP-IYNPASSSSYKELS 79
+Y + FSI + L +Y +DTGSD++W C P C+ C + +P P + S +S
Sbjct: 93 DYTLTFSINSQ-TLSVY--MDTGSDIVWFPCSPFECILCEGKFEPGTLTPLNVSKSSLIS 149
Query: 80 CQSEQCHL--------------------LDTVSCSSQQLCNYTYGYADSSLTKGVLATER 119
C+S C ++T CS+ ++ Y Y D SL +
Sbjct: 150 CKSRACSTAHNSPSTSDLCAIAKCPLDEIETSDCSNYHCPSFYYAYGDGSLIAKLHKHNL 209
Query: 120 ITFGNSNNFF--DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQI--LSQLGANKFS 175
I SN F + FGC H+ G +G+ G G LSL +Q+ LS N+FS
Sbjct: 210 IMPSTSNKPFSLKDFTFGCAHSALG----EPIGVAGFGFGSLSLPAQLANLSPDLGNQFS 265
Query: 176 YCLVPFHTDSSIT---SKMYFGNGSEVSGGGV---VSTSLVSKEDKTYYF-VTLEGISVG 228
YCLV DS+ S + G E + V T ++ Y++ V++E ISVG
Sbjct: 266 YCLVSHSFDSTKLHHPSPLILGKVKERDFDEITQFVYTPMLDNPKHPYFYSVSMEAISVG 325
Query: 229 NLSNSSKLIPYYNSSGAISK---GNMFIDTGAPPTLLPKDFYN----RLEEQVRNAIKLT 281
S + N+ I + G + +D+G T+LP FYN L+ +V K
Sbjct: 326 -----SSRVRAPNALIRIDRDGNGGVVVDSGTTYTMLPTGFYNSVATELDRRVGRVFKRA 380
Query: 282 PYQDPRLGSQLCY-----KTPSMAGIAPILTAHFDGGAKVPLIHTSTFI-------PPPV 329
+ + G CY + + P L HF G V L + F
Sbjct: 381 SETESKTGLSPCYYLEGNGVERLGLVVPRLAFHFGGNYSVVLPRRNYFYEFLDGEDEKKG 440
Query: 330 EGVFCFAM-----QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
V C + + G GN+ Q + YD + + V F P C
Sbjct: 441 RKVGCLMLMDGGDESEGGPGATLGNYQQQGFQVVYDLEERRVGFAPRKC 489
>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 578
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 95/384 (24%), Positives = 164/384 (42%), Gaps = 42/384 (10%)
Query: 21 NGEYVMKFSIGTPPLLDIYGI-VDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSSYK-- 76
+G Y + +G P Y + +DTGSDL W+QC PC C K +Y P + +
Sbjct: 195 DGLYYTRILVGKPEDGQYYHLDIDTGSDLTWIQCDAPCTSCAKGANQLYKPRKDNLVRSS 254
Query: 77 ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATER--ITFGNSNNFFDNVVF 134
E C Q + L T C S C+Y YAD S + GVL ++ + N + ++VF
Sbjct: 255 EPFCVEVQRNQL-TEHCESCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLAESDIVF 313
Query: 135 GCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSK 190
GCG++ G+ + G++GL R ++SL SQ+ S+ + +N +CL S + +
Sbjct: 314 GCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLA-----SDLNGE 368
Query: 191 MYFGNGSE-VSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
Y GS+ V G+ ++ Y + + +S GN ++ +G + G
Sbjct: 369 GYIFMGSDLVPSHGMTWVPMLHHPHLEVYQMQVTKMSYGN-----AMLSLDGENGRV--G 421
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGIAPILTA 308
+ DTG+ T P Y++L ++ L +D + +C++ + + I+ +
Sbjct: 422 KVLFDTGSSYTYFPNQAYSQLVTSLQEVSDLELTRDDSDEALPICWRAKTNSPISSLSDV 481
Query: 309 H-------FDGGAKVPLIHTSTFIPPPV------EGVFCFAM----QPIDGDVGIFGNFA 351
G+K +I I P +G C + DG I G+ +
Sbjct: 482 KKFFRPITLQIGSKWLIISKKLLIQPEDYLIISNKGNVCLGILDGSNVHDGSTIIIGDIS 541
Query: 352 QSDLFIGYDFDSQMVSFKPTDCTK 375
I YD Q + + +DC +
Sbjct: 542 MRGRLIVYDNVKQRIGWMKSDCVR 565
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 99/390 (25%), Positives = 169/390 (43%), Gaps = 57/390 (14%)
Query: 19 TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
T G Y K +G+P D Y VDTGSD++WV C+ C +C ++ +Y+P S
Sbjct: 64 TVTGLYFTKIGLGSPSK-DYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSK 122
Query: 74 SYKELSCQSEQC---HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNSN 126
+ + +SC+ C + + C ++ C Y+ Y D S T G + +TF GN +
Sbjct: 123 TSEFVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNPH 182
Query: 127 NFFDN--VVFGCGHNNTGVF----NENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLV 179
N ++FGCG +G F E G++G G+ S+ SQ+ + K FS+CL
Sbjct: 183 TATQNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL- 241
Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLI 237
D+++ ++ + EV V +T LV + +Y V L+ I V L S
Sbjct: 242 ----DTNVGGGIF--SIGEVVEPKVKTTPLV--PNMAHYNVILKNIEVDGDILQLPSDTF 293
Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL----- 292
N G + ID+G LP+ Y++L +V + PRL L
Sbjct: 294 DSENGKGTV------IDSGTTLAYLPRIVYDQLMSKVLA-------KQPRLKVYLVEEQY 340
Query: 293 -CYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG------DV 344
C++ T ++ PI+ HF+ + + + +C Q D+
Sbjct: 341 SCFQYTGNVDSGFPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDM 400
Query: 345 GIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ G+F S+ + YD ++ + + +C+
Sbjct: 401 TLLGDFVLSNKLVVYDLENMTIGWTDYNCS 430
>gi|115475303|ref|NP_001061248.1| Os08g0207800 [Oryza sativa Japonica Group]
gi|45735815|dbj|BAD12851.1| unknown protein [Oryza sativa Japonica Group]
gi|113623217|dbj|BAF23162.1| Os08g0207800 [Oryza sativa Japonica Group]
gi|125602549|gb|EAZ41874.1| hypothetical protein OsJ_26419 [Oryza sativa Japonica Group]
Length = 449
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 96/369 (26%), Positives = 155/369 (42%), Gaps = 48/369 (13%)
Query: 39 YGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTV------- 91
Y ++DTGS L+W QC C C+ P Y + S +++E+SC + + +
Sbjct: 96 YLLIDTGSSLVWTQCDECPHCHIGDVPPYGRSQSRTFQEVSCGDDDDNDKEEAIASYCPA 155
Query: 92 ------------SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF-----FDNVVF 134
C + L N T +G ++ + F + F F +VF
Sbjct: 156 KPPGYITLCVNGRCMFKALYNLT---GQGETVQGYMSMDTFHFIDDRRFDYQAKF-RMVF 211
Query: 135 GCGHNNTGVFNENE--MGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT--SK 190
GC H V + G++GLG S L Q G KFSYC+ P S S
Sbjct: 212 GCAHQENIVLTAVKECTGILGLGMGDASF----LRQTGITKFSYCVPPRMPGYSYRRHSW 267
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
+ FG+ +++SG V LV + K Y +T + L + +I Y + + +
Sbjct: 268 LRFGSHAQISGKKV---PLVMRWGKYYLPLTAITYTYNELMSPVPIIAYKSQEDYL---H 321
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD-PRLGSQLCYKTPSMAGIAPI-LTA 308
M +DTG LP ++ L +++ IK + + CYK +M + I +T
Sbjct: 322 MMVDTGTSLLSLPTSLHDDLIKEMEAIIKSENIMEGATRWPKHCYKR-TMDEVKDITVTL 380
Query: 309 HFDGGAKVPLIHTSTFIPPPVEG--VFCFAMQPI-DGDVGIFGNFAQSDLFIGYDFDSQM 365
FDGG + L ++ FI C A+ + D I G FAQ+++ +GYD S+
Sbjct: 381 SFDGGLDIELFTSALFIKTETTKGPAVCLAVNRVDDSSKAILGMFAQTNINVGYDLLSRE 440
Query: 366 VSFKPTDCT 374
++ P C
Sbjct: 441 IAMDPIRCA 449
>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 430
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 97/384 (25%), Positives = 163/384 (42%), Gaps = 61/384 (15%)
Query: 25 VMKFSIGTPPLLDIYGIVDTGSDLMWVQC----LPCVQCYKQVKPIYNPASSSSYKELSC 80
++ IGTPP ++DTGS L W+QC LP + K ++P+ SSS+ L C
Sbjct: 73 IISLPIGTPPQAQQM-VLDTGSQLSWIQCHRKKLP-----PKPKTSFDPSLSSSFSTLPC 126
Query: 81 QSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
C SC S +LC+Y+Y YAD + +G L E+ITF N+ ++ G
Sbjct: 127 SHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTE-ITPPLILG 185
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT--SKMYF 193
C ++ ++ G++G+ R RLS +SQ +KFSYC+ P T Y
Sbjct: 186 CATESS-----DDRGILGMNRGRLSF----VSQAKISKFSYCIPPKSNRPGFTPTGSFYL 236
Query: 194 GNGSEVSGGGVVS------TSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
G+ G VS + + D Y V + GI G + N SG++
Sbjct: 237 GDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFG--------LKKLNISGSVF 288
Query: 248 K------GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG--SQLCYK---- 295
+ G +D+G+ T L Y+++ ++ + + G + +C+
Sbjct: 289 RPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDGNVA 348
Query: 296 -TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFA 351
P + G L F G ++ L+ + G+ C + + I GN
Sbjct: 349 MIPRLIGD---LVFVFTRGVEI-LVPKERVLVNVGGGIHCVGIGRSSMLGAASNIIGNVH 404
Query: 352 QSDLFIGYDFDSQMVSFKPTDCTK 375
Q +L++ +D ++ V F DC++
Sbjct: 405 QQNLWVEFDVTNRRVGFAKADCSR 428
>gi|326517745|dbj|BAK03791.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 556
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 94/387 (24%), Positives = 166/387 (42%), Gaps = 44/387 (11%)
Query: 9 PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVKP-- 65
P +++Q N N ++M +GTPP+ ++ VDTG+ L +VQC PC ++C+KQ
Sbjct: 192 PIDLIQ-NGDINNFLFLMPIKLGTPPVWNLVA-VDTGATLSFVQCEPCTLRCHKQTDAGE 249
Query: 66 IYNPASSSSYKELSCQSEQC-------HLLDTVSCSSQQLCNYTYGY-ADSSLTKGVLAT 117
I++P+ S S+ + C +C HL + C Y+ + SS + G L
Sbjct: 250 IFDPSKSESFSRVGCSENKCRTVQRALHLQSKACMEKEDSCLYSMTFGGTSSYSVGKLVR 309
Query: 118 ERITFGN--SNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFS 175
+R+ G F + +FGC + +++ E GLVG S Q+ + FS
Sbjct: 310 DRLAIGKYAKGYSFPDFLFGCSLDTE--YHQYEAGLVGFADEPFSFFEQVAPLVNYKAFS 367
Query: 176 YCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSK 235
YC F +D T + G+ + V+ T L ++ Y + L+ + L N
Sbjct: 368 YC---FPSDRRKTGYLSIGDYTRVNS---TYTPLFLARQQSRYALKLDEV----LVNGMA 417
Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP--RLGSQLC 293
L+ + M +D+G+ T+L D + +L+ + A++ Y R +C
Sbjct: 418 LV--------TTPSEMIVDSGSRWTILLSDTFTQLDAAITEAMRPLGYNRNYYRGSDYIC 469
Query: 294 YKTPSMAGIA-----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ--PIDGDVGI 346
++ + P++ FD G K+ L S+F G+ + M+ + V +
Sbjct: 470 FEDAHFQQFSDWAALPVVELKFDMGVKMVLQPQSSFHFNNDYGLCTYFMRDASLGSGVQL 529
Query: 347 FGNFAQSDLFIGYDFDSQMVSFKPTDC 373
GN + I +D F+ DC
Sbjct: 530 LGNTMTRSVGITFDIQGGQFGFRKGDC 556
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 107/386 (27%), Positives = 166/386 (43%), Gaps = 53/386 (13%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
N + + GTP L +I ++DTGS+L W+ C + I+NP +S +Y ++ C
Sbjct: 64 NVTLTVSLTAGTP-LQNITMVLDTGSELSWLHC----KKEPNFNSIFNPLASKTYTKIPC 118
Query: 81 QSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
S C L VSC +LC++ YAD+S +G LA E G+ VFG
Sbjct: 119 SSPTCETRTRDLPLPVSCDPAKLCHFIISYADASSVEGNLAFETFRVGSVTG--PATVFG 176
Query: 136 C---GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
C G ++ + GL+G+ R LS ++Q+G KFSYC+ +D + +
Sbjct: 177 CMDSGFSSNSEEDAKTTGLMGMNRGSLSF----VNQMGFRKFSYCI----SDRDSSGVLL 228
Query: 193 FGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNS---SKLIPYYNSS 243
G S + T LV D+ Y V LEGI V + S S +P + +
Sbjct: 229 LGEASFSWLKPLNYTPLVEMSTPLPYFDRVAYSVQLEGIRVSDKVLSLPKSVFVP--DHT 286
Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCYKT- 296
GA G +D+G T L Y+ L+++ K L +PR Q LCY
Sbjct: 287 GA---GQTMVDSGTQFTFLLGPVYSALKQEFLLQTKGVLRVLNEPRYVFQGAMDLCYLIE 343
Query: 297 PSMAGI--APILTAHFDGGA-KVPLIHTSTFIPPPVEG---VFCFAMQPIDG---DVGIF 347
P+ A + P++ F G V +P V G V+CF D + +
Sbjct: 344 PTRAALPNLPVVNLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDSLGIESFVI 403
Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDC 373
G+ Q ++++ YD + + F C
Sbjct: 404 GHHQQQNVWMEYDLEKSRIGFAEVRC 429
>gi|449455475|ref|XP_004145478.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449518962|ref|XP_004166504.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 449
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 123/435 (28%), Positives = 181/435 (41%), Gaps = 80/435 (18%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPP-LLDIYGIVDTGSDLMWVQC----LPCVQCYKQVK 64
+NV++ +G Y+M SIGTPP ++ +Y +DTGSDL WV C C C +
Sbjct: 8 DNVIEPLREIRDG-YLMSLSIGTPPQVVQVY--MDTGSDLTWVPCGNLSFDCQDCEEYQN 64
Query: 65 PIYNPA--------SSSSYKELS-----------------CQSEQCHLLDTVSCSSQQLC 99
I P SS+S ++ C C L V + + C
Sbjct: 65 NISGPRLAAFLPTHSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPC 124
Query: 100 ---NYTYGYA---DSSLTKGVLATE--RITFGNSNNFFDNVVFGCGHNNTGVFNENEMGL 151
YTYG + SLT+ VL T N+N FGC G +G+
Sbjct: 125 PSFAYTYGASGVVTGSLTRDVLFTHGNYNNNNNNNKQIPRFCFGC----VGATYREPIGI 180
Query: 152 VGLGRTRLSLASQI-LSQLGANKFSYCLVPFH--TDSSITSKMYFGNGSEVSGGGVVSTS 208
G GR LSL Q+ S G FS+C +PF + + +S + GN + S + +
Sbjct: 181 AGFGRGLLSLPFQLGFSHKG---FSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFT 237
Query: 209 --LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN--MFIDTGAPPTLLPK 264
L S YY++ LE I++GN N+ + + +KGN M ID+G T LP+
Sbjct: 238 PLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPE 297
Query: 265 DFYNRLEEQVRNAIKLTPYQDPRL--GSQLCYKTPSMAGIA--------PILTAHFDGGA 314
Y++L + I + L G LCYK P + P +T HF
Sbjct: 298 PLYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNV 357
Query: 315 KVPLIHTSTF--IPPPVEG--VFCFAMQPIDGD-----------VGIFGNFAQSDLFIGY 359
V L + F + P+ V C Q +DG GIFG+F Q ++ + Y
Sbjct: 358 SVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVY 417
Query: 360 DFDSQMVSFKPTDCT 374
D + + + F+P DC
Sbjct: 418 DLEKERLGFQPMDCV 432
>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 110/397 (27%), Positives = 173/397 (43%), Gaps = 64/397 (16%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP---CVQC-YKQVK----PIYNPASSS 73
G Y + + GTPP + ++DTGS L+W C C +C + +K P + P SS
Sbjct: 81 GGYSISLNFGTPPQTTKF-VMDTGSSLVWFPCTSRYLCSECNFPNIKKTGIPTFLPKLSS 139
Query: 74 SYKELSCQSEQCHLL------------DTVSCSSQQLCN-YTYGYADSSLTKGVLATERI 120
S K + C++ +C ++ D+ + + Q C Y Y S T G+L +E +
Sbjct: 140 SSKLIGCKNPRCSMIFGPEIQSKCQECDSTAQNCTQTCPPYVIQYGSGS-TAGLLLSETL 198
Query: 121 TFGNSNNFFDNVVFGCGHNNTGVFN-ENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
F N D +V GC +F+ + G+ G GR+ SL SQ LG KFSYCLV
Sbjct: 199 DFPNKKTIPDFLV-GCS-----IFSIKQPEGIAGFGRSPESLPSQ----LGLKKFSYCLV 248
Query: 180 PFHTDSSITSK---MYFGNGSEVSGGGVVSTSLVSKEDKT----YYFVTLEGISVGNLSN 232
D + TS + G+GS V+ +S + K T YY+V L I +G +
Sbjct: 249 SHAFDDTPTSSDLVLDTGSGSGVTKTAGLSHTPFLKNPTTAFRDYYYVLLRNIVIG---D 305
Query: 233 SSKLIPY-YNSSGAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTPYQDPR 287
+ +PY + G G +D+G T + Y E+Q+ + T Q+
Sbjct: 306 THVKVPYKFLVPGTDGNGGTIVDSGTTFTFMENPVYELVAKEFEKQMAHYTVATEIQN-L 364
Query: 288 LGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG- 345
G + CY ++ P L F GGAK+ L S + GV C + + +V
Sbjct: 365 TGLRPCYNISGEKSLSVPDLIFQFKGGAKMAL-PLSNYFSIVDSGVICLTI--VSDNVAG 421
Query: 346 ---------IFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
I GN+ Q + ++ +D +++ FK C
Sbjct: 422 PGLGGGPAIILGNYQQRNFYVEFDLENEKFGFKQQSC 458
>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Brachypodium distachyon]
Length = 429
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 96/405 (23%), Positives = 170/405 (41%), Gaps = 59/405 (14%)
Query: 4 ATYFYPNNV------VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC- 56
++ +P NV V N G++ M S+GTPP+ ++ VDTGS L WV C C
Sbjct: 49 SSLIHPTNVPAEPSPVVGNHEIHEGKFFMDISLGTPPVANLV-TVDTGSTLSWVVCQRCQ 107
Query: 57 VQCYK---QVKPIYNPASSSSYKELSCQSEQC-----HLLDTVSCSSQ-QLCNYTYGYAD 107
+ C+ + +++P S++Y+ + C S C L+ C + C Y+ Y
Sbjct: 108 ISCHTTAPEAGSVFDPDKSTTYELVGCSSRDCADVQRSLVAPFGCIEETDTCLYSLRYGS 167
Query: 108 S---SLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQ 164
+ G L T+++T +S++ D +FGC +++ F E G++G G S +Q
Sbjct: 168 GPSGQYSAGRLGTDKLTLASSSSIIDGFIFGCSGDDS--FKGYESGVIGFGGANFSFFNQ 225
Query: 165 ILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSK-EDKTYYFVTLE 223
+ Q FSYC HT S + +V T+L+ D++ Y +L+
Sbjct: 226 VARQTNYRAFSYCFPGDHTAEGFLSIGAYPKDE------LVYTNLIPHFGDRSVY--SLQ 277
Query: 224 GISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY 283
I + N ++ +K M +D+G T L ++ + + +A++ +
Sbjct: 278 QIDMMVDGNRLQV-----DQSEYTKRMMVVDSGTVDTFLLGPVFDAFSKAMASAMQAKGF 332
Query: 284 QDPRLGSQLCYK----TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPP--------PVEG 331
+G++ C++ +G P + F I T+ +PP P
Sbjct: 333 LSDTVGTETCFRPNGGDSVDSGDLPTVEMRF--------IGTTLKLPPENVFHDLLPSHD 384
Query: 332 VFCFAMQP-IDG--DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
C A +P + G +V I GN A + YD + F+ C
Sbjct: 385 KICLAFKPDVAGVRNVQILGNKATXSFRVVYDLQAMYFGFQAGAC 429
>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 402
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 82/288 (28%), Positives = 127/288 (44%), Gaps = 22/288 (7%)
Query: 94 SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVG 153
S+ +CNY Y D S T+G L E++ FG + +FGCG NN G+F GL+G
Sbjct: 128 SAAPICNYAINYGDGSFTRGELGHEKLKFGTI--LVKDFIFGCGRNNKGLFG-GVSGLMG 184
Query: 154 LGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKE 213
LGR+ LSL SQ G FSYCL P S + GN S +S + + +
Sbjct: 185 LGRSDLSLISQTSGIFGG-VFSYCL-PSTERKGSGSLILGGNSSVYRNSSPISYAKMIEN 242
Query: 214 DKTY--YFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLE 271
+ Y YF+ L GIS+G ++ + ++ + +D+G T LP Y L+
Sbjct: 243 PQLYNFYFINLTGISIGGVALQAP---------SVGPSRILVDSGTVITRLPPTIYKALK 293
Query: 272 EQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTST--FIPPP 328
+ P C+ + + P + HF+G A++ + T F+
Sbjct: 294 AEFLKQFTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSD 353
Query: 329 VEGVFCFAMQPID--GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
V C A+ ++ +V I GN+ Q +L + YD V F C+
Sbjct: 354 ASQV-CLALASLEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETCS 400
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 104/379 (27%), Positives = 160/379 (42%), Gaps = 44/379 (11%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
N + +IG+PP ++ ++DTGS+L W+ C + + +NP SSSY C
Sbjct: 56 NVTLTISLTIGSPPQ-NVTMVLDTGSELSWLHC----KKLPNLNSTFNPLLSSSYTPTPC 110
Query: 81 QSEQC-----HLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
S C L SC + +LC+ YAD+S +G LA E TF + +F
Sbjct: 111 NSSVCMTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAE--TFSLAGAAQPGTLF 168
Query: 135 GCGHNN--TGVFNENE--MGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
GC + T NE+ GL+G+ R LSL +Q++ KFSYC+ +
Sbjct: 169 GCMDSAGYTSDINEDAKTTGLMGMNRGSLSLVTQMV----LPKFSYCI----SGEDAFGV 220
Query: 191 MYFGNGSEVSG-----GGVVSTSLVSKEDKTYYFVTLEGISVG-NLSNSSKLIPYYNSSG 244
+ G+G V +T+ D+ Y V LEGI V L K + + +G
Sbjct: 221 LLLGDGPSAPSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTG 280
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRL----GSQLCYKTPS 298
A G +D+G T L YN L+++ K LT +DP LCY P+
Sbjct: 281 A---GQTMVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPA 337
Query: 299 MAGIAPILTAHFDGGA-KVPLIHTSTFIPPPVEGVFCFAMQPIDG---DVGIFGNFAQSD 354
P +T F G +V + + V+CF D + + G+ Q +
Sbjct: 338 SLAAVPAVTLVFSGAEMRVSGERLLYRVSKGRDWVYCFTFGNSDLLGIEAYVIGHHHQQN 397
Query: 355 LFIGYDFDSQMVSFKPTDC 373
+++ +D V F T C
Sbjct: 398 VWMEFDLVKSRVGFTETTC 416
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 96/373 (25%), Positives = 161/373 (43%), Gaps = 43/373 (11%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
NG Y + IGTP + IVD+GS + +V C C QC P + P SS+Y + C
Sbjct: 88 NGYYTTRLYIGTPSQ-EFALIVDSGSTVTYVPCATCEQCGNHQDPRFQPDLSSTYSPVKC 146
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
+D + + C Y YA+ S + GVL + ++FG + VFGC +
Sbjct: 147 N------VDCTCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKPQRAVFGCENT 200
Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
TG +F+++ G++GLGR +LS+ Q++ + + ++ FS C M G G+
Sbjct: 201 ETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCY----------GGMDVGGGT 250
Query: 198 EVSGGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIP-YYNSSGAISKGNMF 252
V GG +V YY + L+ I V + +L P +N SK
Sbjct: 251 MVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVA--GKALRLDPKIFN-----SKHGTV 303
Query: 253 IDTGAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCY-----KTPSMAGIAP 304
+D+G LP+ + ++ V N++K DP +C+ ++ + P
Sbjct: 304 LDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNY-KDICFAGAGRNVSQLSEVFP 362
Query: 305 ILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFD 362
+ F G K+ L F VEG +C + Q + G + + YD
Sbjct: 363 DVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRH 422
Query: 363 SQMVSFKPTDCTK 375
++ + F T+C++
Sbjct: 423 NEKIGFWKTNCSE 435
>gi|356513737|ref|XP_003525567.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Glycine
max]
Length = 455
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 111/410 (27%), Positives = 174/410 (42%), Gaps = 70/410 (17%)
Query: 23 EYVMKFSIGTPPLLD-IYGIVDTGSDLMWVQCLP--CVQCYKQ--VKPIYN--------- 68
+Y + F++G I +DTGSDL+W C P C+ C + P N
Sbjct: 47 DYTLSFNLGPRAQAQPITLYMDTGSDLVWFPCAPFKCILCEGKPNASPPVNTTRSVAVSC 106
Query: 69 --PASSSSYKELS----CQSEQCHL--LDTVSCSSQQLCNYTYGYADSSLTKGVLATERI 120
PA S+++ S C + +C L ++T C++ + + Y Y D SL + R
Sbjct: 107 KSPACSAAHNLASPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSL---IARLYRD 163
Query: 121 TFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILS---QLGANKFSYC 177
T S+ F N FGC + G+ G GR LSL +Q+ + QLG N+FSYC
Sbjct: 164 TLSLSSLFLRNFTFGCAYTTLA----EPTGVAGFGRGLLSLPAQLATLSPQLG-NRFSYC 218
Query: 178 LVPFHTDSSITSK---MYFGNGSEVS-----GGGV---VSTSLVSKEDKTYYF-VTLEGI 225
LV DS K + G E GGGV V T ++ Y++ V L GI
Sbjct: 219 LVSHSFDSERVRKPSPLILGRYEEEEEEEKVGGGVAEFVYTPMLENPKHPYFYTVGLIGI 278
Query: 226 SVG-NLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ 284
SVG + + +++ N+ G G + +D+G T+LP FYN + ++ + +
Sbjct: 279 SVGKRIVPAPEMLRRVNNRG---DGGVVVDSGTTFTMLPAGFYNSVVDEFDRGVGRVNER 335
Query: 285 ----DPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG--------- 331
+ + G CY S+A + P+LT F GG ++ + ++G
Sbjct: 336 ARKIEEKTGLAPCYYLNSVAEV-PVLTLRFAGGNSSVVLPRKNYFYEFLDGRDAAKGKRR 394
Query: 332 VFCFAMQ------PIDGDVG-IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
V C + + G G GN+ Q + YD + + V F C
Sbjct: 395 VGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQCA 444
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 93/348 (26%), Positives = 133/348 (38%), Gaps = 30/348 (8%)
Query: 41 IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVS--CS-S 95
++DT SD+ WVQC PC C+ Q +Y+P+ SSS C S C L + C+ +
Sbjct: 159 VIDTASDVPWVQCAPCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRNLGPYANGCTPA 218
Query: 96 QQLCNYTYGYADSSLTKGVLATERITFGNSN--NFFDNVVFGCGHN--NTGVFNENEMGL 151
C Y Y D S + G ++ +T + + FGC H G F+ G+
Sbjct: 219 GDQCQYRVQYPDGSASAGTYISDVLTLNPAKPASAISEFRFGCSHALLQPGSFSNKTSGI 278
Query: 152 VGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVS 211
+ LGR SL +Q + G + FSYCL P S G + V+ L S
Sbjct: 279 MALGRGAQSLPTQTKATYG-DVFSYCLPPTPVHSGF---FILGVPRVAASRYAVTPMLRS 334
Query: 212 KEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLE 271
K Y V L I V + P ++GA+ + T LP Y L
Sbjct: 335 KAAPMLYLVRLIAIEVAG--KRLPVPPAVFAAGAVMDSRTIV------TRLPPTAYMALR 386
Query: 272 EQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA------PILTAHFDGGAKVPLIHTSTFI 325
++ P+ CY A P +T FDG + S +
Sbjct: 387 AAFVAEMRAYRAAAPKEHLDTCYDFSGAAPGGGGGVKLPKITLVFDGPNGAVELDPSGVL 446
Query: 326 PPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
++G FA D GI GN Q L + Y+ D V F+ C
Sbjct: 447 ---LDGCLAFAPNTDDQMTGIIGNVQQQALEVLYNVDGATVGFRRGAC 491
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 104/381 (27%), Positives = 171/381 (44%), Gaps = 42/381 (11%)
Query: 19 TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
T G Y + IGTPP + VDTGSD++WV C+ C +C ++ +Y+P SS
Sbjct: 78 TDTGLYYTEIEIGTPPK-QYHVQVDTGSDILWVNCISCNKCPRKSDLGIDLRLYDPKGSS 136
Query: 74 SYKELSCQSEQCHLL---DTVSCSSQQLCNYTYGYADSSLTKGVLATERITF------GN 124
S +SC + C C+ C Y+ Y D S T G ++ + + G
Sbjct: 137 SGSTVSCDQKFCAATYGGKLPGCAKNIPCEYSVMYGDGSSTTGYFVSDSLQYNQVSGDGQ 196
Query: 125 SNNFFDNVVFGCGHN---NTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVP 180
+ + +V+FGCG + G N+ G++G G++ S+ SQ+ + K FS+CL
Sbjct: 197 TRHANASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSHCL-- 254
Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
+I F G +V V ST LV D +Y V LE I+VG ++ +P +
Sbjct: 255 ----DTIKGGGIFAIG-DVVQPKVKSTPLV--PDMPHYNVNLESINVG---GTTLQLPSH 304
Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT-PSM 299
KG + ID+G T LP+ Y + V T + + LC + S+
Sbjct: 305 MFETGEKKGTI-IDSGTTLTYLPELVYKDVLAAVFAKHPDTTFHSVQ--DFLCIQYFQSV 361
Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-DVGIFGNFAQS 353
P +T HF+ + + F + ++CF +Q DG D+ + G+ S
Sbjct: 362 DDGFPKITFHFEDDLGLNVYPHDYFFQNG-DNLYCFGFQNGGLQSKDGKDMVLLGDLVLS 420
Query: 354 DLFIGYDFDSQMVSFKPTDCT 374
+ + YD ++Q+V + +C+
Sbjct: 421 NKVVVYDLENQVVGWTDYNCS 441
>gi|125575538|gb|EAZ16822.1| hypothetical protein OsJ_32294 [Oryza sativa Japonica Group]
Length = 392
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 66/209 (31%), Positives = 107/209 (51%), Gaps = 20/209 (9%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
YV F+IGTPP ++D +L+W QC C +C++Q P+++P +S++Y+ C +
Sbjct: 51 YVANFTIGTPPQ-PASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTP 109
Query: 84 QCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
C + D+ +CS +C Y ++ T G + T+ G + ++ FGC +
Sbjct: 110 LCESIPSDSRNCSG-NVCAY-QASTNAGDTGGKVGTDTFAVGTAKA---SLAFGCVVASD 164
Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
G+VGLGRT SL ++Q G FSYCL P D+ S ++ G+ ++++G
Sbjct: 165 IDTMGGPSGIVGLGRTPWSL----VTQTGVAAFSYCLAPH--DAGRNSALFLGSSAKLAG 218
Query: 202 GG-VVSTSLVS-----KEDKTYYFVTLEG 224
GG ST V+ + YY V LEG
Sbjct: 219 GGKAASTPFVNISGNGNDLSNYYKVQLEG 247
>gi|296082464|emb|CBI21469.3| unnamed protein product [Vitis vinifera]
Length = 530
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 101/371 (27%), Positives = 169/371 (45%), Gaps = 54/371 (14%)
Query: 30 IGTPPLLDIYGIVDTGSDLMWVQCLPCVQC----------YKQVKPIYNPASSSSYKELS 79
IGTP + + + D GSDL+W+ C C+QC + Y+P+ SS+ K LS
Sbjct: 106 IGTPNISFLVAL-DAGSDLLWIPC-DCIQCAPLSASYYGSLDRDLNQYSPSGSSTSKHLS 163
Query: 80 CQSEQCHLLDTVSCSS-QQLCNYTYGY-ADSSLTKGVLA------TERITFGNSNNFFDN 131
C + C + +C S +QLC YT Y ++++ + G+L T I ++++
Sbjct: 164 CSHQLCE--SSPNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDDASNSSVRAP 221
Query: 132 VVFGCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQLG--ANKFSYCLVPFHTDSSI 187
V+ GCG TG + + GL+GLG +S+ S LS+ G N FS C F+ D S
Sbjct: 222 VIIGCGMRQTGGYLDGVAPDGLMGLGLGEISVPS-FLSKAGLVKNSFSLC---FNDDDS- 276
Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTY--YFVTLEGISVGNLSNSSKLIPYYNSSGA 245
+++FG+ G T+L D Y Y V +E +G+ S
Sbjct: 277 -GRIFFGD----QGLATQQTTLFLPSDGKYETYIVGVEACCIGS------------SCIK 319
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-P 304
+ +D+GA T LP + Y + ++ + T + + CYK+ S + P
Sbjct: 320 QTSFRALVDSGASFTFLPDESYRNVVDEFDKQVNATRFSFEGYPWEYCYKSSSKELLKNP 379
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGV--FCFAMQPIDGDVGIFGNFAQSDLFIGYDFD 362
+ F ++H F+ +GV FC A+QP DGD+GI G + + +D +
Sbjct: 380 SVILKFALNNSF-VVHNPVFVVHGYQGVVGFCLAIQPADGDIGILGQNFMTGYRMVFDRE 438
Query: 363 SQMVSFKPTDC 373
+ + + ++C
Sbjct: 439 NLKLGWSRSNC 449
>gi|225438629|ref|XP_002281243.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 511
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 102/371 (27%), Positives = 170/371 (45%), Gaps = 54/371 (14%)
Query: 30 IGTPPLLDIYGIVDTGSDLMWVQCLPCVQC----------YKQVKPIYNPASSSSYKELS 79
IGTP + + + D GSDL+W+ C C+QC + Y+P+ SS+ K LS
Sbjct: 87 IGTPNISFLVAL-DAGSDLLWIPC-DCIQCAPLSASYYGSLDRDLNQYSPSGSSTSKHLS 144
Query: 80 CQSEQCHLLDTVSCSS-QQLCNYTYGY-ADSSLTKGVLA------TERITFGNSNNFFDN 131
C + C + +C S +QLC YT Y ++++ + G+L T I ++++
Sbjct: 145 CSHQLCE--SSPNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDDASNSSVRAP 202
Query: 132 VVFGCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQLG--ANKFSYCLVPFHTDSSI 187
V+ GCG TG + + GL+GLG +S+ S LS+ G N FS C F+ D S
Sbjct: 203 VIIGCGMRQTGGYLDGVAPDGLMGLGLGEISVPS-FLSKAGLVKNSFSLC---FNDDDS- 257
Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTY--YFVTLEGISVGNLSNSSKLIPYYNSSGA 245
+++FG+ G T+L D Y Y V +E +G S+ K +
Sbjct: 258 -GRIFFGD----QGLATQQTTLFLPSDGKYETYIVGVEACCIG--SSCIKQTSF------ 304
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-P 304
+D+GA T LP + Y + ++ + T + + CYK+ S + P
Sbjct: 305 ----RALVDSGASFTFLPDESYRNVVDEFDKQVNATRFSFEGYPWEYCYKSSSKELLKNP 360
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGV--FCFAMQPIDGDVGIFGNFAQSDLFIGYDFD 362
+ F ++H F+ +GV FC A+QP DGD+GI G + + +D +
Sbjct: 361 SVILKFALNNSF-VVHNPVFVVHGYQGVVGFCLAIQPADGDIGILGQNFMTGYRMVFDRE 419
Query: 363 SQMVSFKPTDC 373
+ + + ++C
Sbjct: 420 NLKLGWSRSNC 430
>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
Length = 430
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 102/394 (25%), Positives = 164/394 (41%), Gaps = 81/394 (20%)
Query: 25 VMKFSIGTPPLLDIYGIVDTGSDLMWVQC----LPCVQCYKQVKPIYNPASSSSYKELSC 80
++ IGTPP ++DTGS L W+QC LP + K ++P+ SSS+ L C
Sbjct: 73 IISLPIGTPPQAQQM-VLDTGSQLSWIQCHRKKLP-----PKPKTSFDPSLSSSFSTLPC 126
Query: 81 QSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
C SC S +LC+Y+Y YAD + +G L E+ITF N+ ++ G
Sbjct: 127 SHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNT-EITPPLILG 185
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI--TSKMYF 193
C ++ ++ G++G+ R RLS +SQ +KFSYC+ P T Y
Sbjct: 186 CATESS-----DDRGILGMNRGRLSF----VSQAKISKFSYCIPPKSNRPGFTPTGSFYL 236
Query: 194 GNGSEVSGGGVVS------TSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
G+ G VS + + D Y V + GI G + N SG++
Sbjct: 237 GDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFG--------LKKLNISGSVF 288
Query: 248 K------GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
+ G +D+G+ T L Y+++ ++ R+G +L K G
Sbjct: 289 RPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMT----------RVGRRL-KKGYVYGG 337
Query: 302 IAPILTAHFDGG-AKVPLI----------HTSTFIPPPV------EGVFCFAM---QPID 341
A + FDG A +P + F+P G+ C + +
Sbjct: 338 TADMC---FDGNVAMIPRLIGDLVFVFTRGVEIFVPKERVLVNVGGGIHCVGIGRSSMLG 394
Query: 342 GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
I GN Q +L++ +D ++ V F DC++
Sbjct: 395 AASNIIGNVHQQNLWVEFDVTNRRVGFAKADCSR 428
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 99/373 (26%), Positives = 160/373 (42%), Gaps = 45/373 (12%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
G Y + IGTPP IVDTGS L +V C C QC K P + P SS+Y+ L C
Sbjct: 90 GYYTTRIWIGTPPQ-TFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCS 148
Query: 82 SEQCHLLDTVSCSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
E +C S+ + C Y YA+ S + GVL + ++FG + VFGC +
Sbjct: 149 ME-------CTCDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTVFGCENV 201
Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
TG ++++ G++GLGR LS+ Q++ + + N FS C M G G+
Sbjct: 202 ETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCY----------GGMDVGGGA 251
Query: 198 EVSGG-----GVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
V GG G+V T YY + L+ I + + K +P N K
Sbjct: 252 MVLGGISPPAGMVFTH-SDPARSAYYNIDLKEIHI-----AGKQLP-INPMVFDGKYGTI 304
Query: 253 IDTGAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCYK-----TPSMAGIAP 304
+D+G LP+ + ++ + N++KL D R + +C+ ++ P
Sbjct: 305 LDSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPD-RNYNDICFSGVGSDVSQLSKTFP 363
Query: 305 ILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFD 362
+ F G ++ L F G +C + Q + + G + + YD +
Sbjct: 364 AVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDRE 423
Query: 363 SQMVSFKPTDCTK 375
+ F T+C++
Sbjct: 424 HLKIGFWKTNCSE 436
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 99/373 (26%), Positives = 160/373 (42%), Gaps = 45/373 (12%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
G Y + IGTPP IVDTGS L +V C C QC K P + P SS+Y+ L C
Sbjct: 90 GYYTTRIWIGTPPQ-TFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCS 148
Query: 82 SEQCHLLDTVSCSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
E +C S+ + C Y YA+ S + GVL + ++FG + VFGC +
Sbjct: 149 ME-------CTCDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTVFGCENV 201
Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
TG ++++ G++GLGR LS+ Q++ + + N FS C M G G+
Sbjct: 202 ETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCY----------GGMDVGGGA 251
Query: 198 EVSGG-----GVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
V GG G+V T YY + L+ I + + K +P N K
Sbjct: 252 MVLGGISPPAGMVFTH-SDPARSAYYNIDLKEIHI-----AGKQLP-INPMVFDGKYGTI 304
Query: 253 IDTGAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCYK-----TPSMAGIAP 304
+D+G LP+ + ++ + N++KL D R + +C+ ++ P
Sbjct: 305 LDSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPD-RNYNDICFSGVGSDVSQLSKTFP 363
Query: 305 ILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFD 362
+ F G ++ L F G +C + Q + + G + + YD +
Sbjct: 364 AVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDRE 423
Query: 363 SQMVSFKPTDCTK 375
+ F T+C++
Sbjct: 424 HLKIGFWKTNCSE 436
>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
Length = 480
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 101/401 (25%), Positives = 160/401 (39%), Gaps = 61/401 (15%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQV-KPIYNPAS 71
+ S T G+Y ++F +GTP + + DTGSDL WV+C + ++ A+
Sbjct: 101 LSSGAYTGTGQYFVRFRVGTPAQPFVL-VADTGSDLTWVKCSGAGDGTGDAPRRVFRAAA 159
Query: 72 SSSYKELSCQSEQCHL---LDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITF----- 122
S S+ ++C S+ C +CSS C Y Y Y D S +GV+ T+ T
Sbjct: 160 SRSWAPIACSSDTCTSYVPFSLANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALSGS 219
Query: 123 -----GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYC 177
G VV GC + G ++ G++ LG + +S AS+ ++ G +FSYC
Sbjct: 220 ESRDGGGRRAKLQGVVLGCTASYDGQSFQSSDGVLSLGNSNISFASRAAARFG-GRFSYC 278
Query: 178 LVPFHTDSSITSKMYFG-NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKL 236
LV + TS + FG G E GG S+S S +T + ++
Sbjct: 279 LVDHLAPRNATSYLTFGPPGPE--GGAAASSSSSSAAARTPLLL------------DRRM 324
Query: 237 IPYYNSSGAISK------------------GNMFIDTGAPPTLLPKDFYNRLEEQVRNAI 278
P+Y + G +D+G T+L Y + + +
Sbjct: 325 SPFYAVAVDAVHVAGEALDIPADVWDVARGGGAILDSGTSLTVLATPAYRAVVAALSERL 384
Query: 279 KLTPY--QDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFA 336
P DP + CY + A P L F G A++ +++ GV C
Sbjct: 385 AGLPRVSMDP---FEYCYNWTAAALEIPGLEVRFAGSARL-QPPAKSYVVDAAPGVKCIG 440
Query: 337 MQPIDG---DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+Q +G V + GN Q D +D + + FK T C
Sbjct: 441 VQ--EGAWPGVSVIGNILQQDHLWEFDLRDRWLRFKHTRCA 479
>gi|413923876|gb|AFW63808.1| hypothetical protein ZEAMMB73_793799 [Zea mays]
Length = 415
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 104/390 (26%), Positives = 159/390 (40%), Gaps = 57/390 (14%)
Query: 18 STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKE 77
S++ G +F + P +I +VDTGS++ W C + S +
Sbjct: 49 SSSGGGCHYRFELTHRPKDNISAVVDTGSNIFWTTEKEC-------------SRSKTRSM 95
Query: 78 LSCQSEQCHLLDTVSCSSQQL---------CNYT--YGYADSSLTKGVLATERITFGN-- 124
L C S +C + C +L C Y YG + T GVL +++T
Sbjct: 96 LPCCSPKCEQRASCGCRRSELKAEAEKETKCTYAIKYGGNANDSTAGVLYEDKLTIVAVA 155
Query: 125 -----SNNFFDNVVFGCGHNNTGVFNENEM-GLVGLGRTRLSLASQILSQLGANKFSYCL 178
+ F+ V GC + T F + + G+ GLGR+ A+ + QL +KFSYCL
Sbjct: 156 SKAVPGSQSFEEVAIGCSTSATLKFKDPSIKGVFGLGRS----ATSLPRQLNFSKFSYCL 211
Query: 179 VPFHTDSS-----ITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSN 232
+ +T+ G+ V +T+L D KT YFV L+GIS+G
Sbjct: 212 SSYQKPDLPSYLLLTAAPDMATGAVGGAAAVATTALQPNSDYKTRYFVDLQGISIGG--- 268
Query: 233 SSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY---QDPRLG 289
++L SG GNMF+DTG T L + +L ++ +K Y Q R
Sbjct: 269 -TRLPAVSTKSG----GNMFVDTGTSFTRLEGTVFAKLVTELDRIMKERKYVKEQPGRNN 323
Query: 290 SQLCYKTPSMAGIA----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG 345
Q+CY PS A P + HF A + L S + I G +
Sbjct: 324 GQICYSPPSTAADESSKLPDMVLHFADSANMVLPWDSYLWKTTSKLCLAIDKSNIKGGIS 383
Query: 346 IFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
+ GNF + + D ++ +SF DC+K
Sbjct: 384 VLGNFQMQNTHMLLDTGNEKLSFVRADCSK 413
>gi|125602787|gb|EAZ42112.1| hypothetical protein OsJ_26672 [Oryza sativa Japonica Group]
Length = 477
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 52/130 (40%), Positives = 72/130 (55%), Gaps = 18/130 (13%)
Query: 41 IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHL-LDTV-----SCS 94
IVDTGSDL WVQC PC CY Q P+++P+ S+SY + C + C L SC+
Sbjct: 179 IVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCA 238
Query: 95 S---------QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFN 145
+ + C Y+ Y D S ++GVLAT+ + G ++ D VFGCG +N G+F
Sbjct: 239 TVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS--VDGFVFGCGLSNRGLFG 296
Query: 146 ENEMGLVGLG 155
GL+GLG
Sbjct: 297 -GTAGLMGLG 305
>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
Length = 469
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 112/365 (30%), Positives = 161/365 (44%), Gaps = 30/365 (8%)
Query: 25 VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP---IYNPASSSSYKELSCQ 81
V+ ++GTP + G+VD S +W QC PC + P + P S+++ L C
Sbjct: 89 VINITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSPLPCS 148
Query: 82 SEQCHLLDTVSCSSQQL---------CN-YTYGYADSSL-TKGVLATERITFGNSNNFFD 130
S+ C + +C C+ Y+ Y S+ T G LAT+ TFG +
Sbjct: 149 SDMCLPVLRETCGRAGAAANATAGARCDSYSLTYGGSAANTSGYLATDTFTFGATA--VP 206
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSY-CLVPFHTDS-SIT 188
VVFGC + G F G++G+GR LSL SQ+ Q G KFSY L P TD S
Sbjct: 207 GVVFGCSDASYGDF-AGASGVIGIGRGNLSLISQL--QFG--KFSYQLLAPEATDDGSAD 261
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYN-SSGAI 246
S + FG+ + ST L+S +Y+V L G+ V N IP A
Sbjct: 262 SVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTGVRVDG--NRLDAIPAGTFDLRAN 319
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ-DPRLGSQLCYKTPSMAGI-AP 304
G + + + P T L + Y+ + V + I L L LCY SMA + P
Sbjct: 320 GTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGLPAVNGSAALELDLCYNASSMAKVKVP 379
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
LT FDGGA + L + F G+ C M P G + G Q+ + YD D+
Sbjct: 380 KLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGG-SVLGTLLQTGTNMIYDVDAG 438
Query: 365 MVSFK 369
++F+
Sbjct: 439 RLTFE 443
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 111/383 (28%), Positives = 172/383 (44%), Gaps = 49/383 (12%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK----QVKPIY-NPASSSSYK 76
G Y K +GTPP ++Y +DTGSD++WV C C C + Q++ Y +P SSS+
Sbjct: 75 GLYYTKVKLGTPPR-ELYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPGSSSTSS 133
Query: 77 ELSCQSEQCH---LLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGN------SN 126
+SC +C SCS + C YT+ Y D S T G ++ + F + +
Sbjct: 134 LISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLMHFASIFEGTLTT 193
Query: 127 NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFH 182
N +VVFGC TG ++E G+ G G+ +S+ SQ+ SQ + FS+CL
Sbjct: 194 NSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCL---K 250
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYY 240
D+S + G E+ +V + LV + +Y + L+ ISV + + +
Sbjct: 251 GDNSGGGVLVLG---EIVEPNIVYSPLVPSQ--PHYNLNLQSISVNGQIVRIAPSVFATS 305
Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY--KTPS 298
N+ G I +D+G L ++ YN + I + G+Q CY T S
Sbjct: 306 NNRGTI------VDSGTTLAYLAEEAYNPFVIAIAAVIPQSVRSVLSRGNQ-CYLITTSS 358
Query: 299 MAGIAPILTAHFDGGAKVPL-----IHTSTFIPPPVEG-VFCFAMQPIDGD-VGIFGNFA 351
I P ++ +F GGA + L + FI EG V+C Q I G + I G+
Sbjct: 359 NVDIFPQVSLNFAGGASLVLRPQDYLMQQNFIG---EGSVWCIGFQKISGQSITILGDLV 415
Query: 352 QSDLFIGYDFDSQMVSFKPTDCT 374
D YD Q + + DC+
Sbjct: 416 LKDKIFVYDLAGQRIGWANYDCS 438
>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
Length = 486
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 94/350 (26%), Positives = 139/350 (39%), Gaps = 46/350 (13%)
Query: 41 IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVS--CSSQ 96
++DT D+ W++C+PC QC Y+P SS+Y C S C L + C +
Sbjct: 166 VLDTAGDVPWMRCVPCTFAQCAD-----YDPTRSSTYSAFPCNSSACKQLGRYANGCDAN 220
Query: 97 QLCNYTYGYA-DSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLG 155
C Y A DS T G +++ +T NS + + FGC N G F G++ LG
Sbjct: 221 GQCQYMVVTAGDSFTTSGTYSSDVLTI-NSGDRVEGFRFGCSQNEQGSFENQADGIMALG 279
Query: 156 RTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED- 214
R SL +Q S G + FSYCL P T+K +F G + T+ + KE
Sbjct: 280 RGVQSLMAQTSSTYG-DAFSYCLPPTE-----TTKGFFQIGVPIGASYRFVTTPMLKERG 333
Query: 215 ------KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYN 268
T Y L I+V + +L N + +D+ T LP Y
Sbjct: 334 GASAAAATLYRALLLAITV----DGKEL----NVPAEVFAAGTVMDSRTIITRLPVTAYG 385
Query: 269 RLEEQVRNAIKLTPYQDPRLGSQLCY-----KTPSMAGIAPILTAHFDGGAKVPLIHTST 323
L RN ++ P+ CY + P + IA + FDG A V + +
Sbjct: 386 ALRAAFRNRMRYR-VAPPQEELDTCYDLTGVRYPRLPRIALV----FDGNAVVEMDRSGI 440
Query: 324 FIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ G FA D I GN Q + + +D + F+ C
Sbjct: 441 LL----NGCLAFASNDDDSSPSILGNVQQQTIQVLHDVGGGRIGFRSAAC 486
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 99/376 (26%), Positives = 159/376 (42%), Gaps = 44/376 (11%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI-----YNPA 70
N+ G Y IGTP + Y +DTGS WV + C QC + + Y+P
Sbjct: 51 NIPYGTGLYYTDIGIGTPAV-KYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPR 109
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF------GN 124
SS S KE+ C C C+ C Y GYAD LT G+L T+ + + G
Sbjct: 110 SSVSSKEVKCDDTIC--TSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQ 167
Query: 125 SNNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVP 180
+ +V FGCG +G N + + G++G G + + SQ+ + K FS+CL
Sbjct: 168 TQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL-- 225
Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
S F G V V T+ + K ++ Y+ V L+ I N++ ++ +P
Sbjct: 226 ----DSTNGGGIFAIGEVVEPK--VKTTPIVKNNEVYHLVNLKSI---NVAGTTLQLP-A 275
Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
N G FID+G+ LP+ Y+ L V + D +G+ ++
Sbjct: 276 NIFGTTKTKGTFIDSGSTLVYLPEIIYSELILAV-----FAKHPDITMGAMYNFQCFHFL 330
Query: 301 GIA----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP--IDG--DVGIFGNFAQ 352
G P +T HF+ + ++ ++ +CF Q I G D+ I G+
Sbjct: 331 GSVDDKFPKITFHFENDLTLD-VYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVI 389
Query: 353 SDLFIGYDFDSQMVSF 368
S+ + YD + Q + +
Sbjct: 390 SNKVVVYDMEKQAIGW 405
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 89/387 (22%), Positives = 170/387 (43%), Gaps = 65/387 (16%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
G Y K +G+PP + + VDTGSD++WV C PC +C + +++ +SS+ K
Sbjct: 72 GLYFTKIKLGSPPK-EYHVQVDTGSDILWVNCKPCPECPSKTNLNFHLSLFDVNASSTSK 130
Query: 77 ELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG------NSNNFF 129
++ C + C + + SC C+Y YAD S ++G +++T +
Sbjct: 131 KVGCDDDFCSFISQSDSCQPAVGCSYHIVYADESTSEGNFIRDKLTLEQVTGDLQTGPLG 190
Query: 130 DNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDS 185
VVFGCG + +G +++ G++G G++ S+ SQ+ + A + FS+CL
Sbjct: 191 QEVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL------- 243
Query: 186 SITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGNLSNSSKL 236
V GGG+ + +V ++ +Y V L G+ V + L
Sbjct: 244 -----------DNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDG--TALDL 290
Query: 237 IPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV--RNAIKLTPYQDPRLGSQLCY 294
P + G +D+G PK Y+ L E + R +KL +D + C+
Sbjct: 291 PP-----SIMRNGGTIVDSGTTLAYFPKVLYDSLIETILARQPVKLHIVED----TFQCF 341
Query: 295 KTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP------IDGDVGIF 347
+A P ++ F+ K+ ++ ++ + ++CF Q +V +
Sbjct: 342 SFSENVDVAFPPVSFEFEDSVKLT-VYPHDYLFTLEKELYCFGWQAGGLTTGERTEVILL 400
Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDCT 374
G+ S+ + YD +++++ + +C+
Sbjct: 401 GDLVLSNKLVVYDLENEVIGWADHNCS 427
>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
Length = 431
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 99/384 (25%), Positives = 162/384 (42%), Gaps = 44/384 (11%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI-----YNPA 70
N+ G Y IGTP + Y +DTGS WV + C QC + + Y+P
Sbjct: 51 NIPYGTGLYYTDIGIGTPAV-KYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPR 109
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF------GN 124
SS S KE+ C C C+ C Y GYAD LT G+L T+ + + G
Sbjct: 110 SSVSSKEVKCDDTIC--TSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQ 167
Query: 125 SNNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVP 180
+ +V FGCG +G N + + G++G G + + SQ+ + K FS+CL
Sbjct: 168 TQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL-- 225
Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
S F G V V T+ + K ++ Y+ V L+ I N++ ++ +P
Sbjct: 226 ----DSTNGGGIFAIGEVVEPK--VKTTPIVKNNEVYHLVNLKSI---NVAGTTLQLP-A 275
Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
N G FID+G+ LP+ Y+ L V + D +G+ ++
Sbjct: 276 NIFGTTKTKGTFIDSGSTLVYLPEIIYSELILAV-----FAKHPDITMGAMYNFQCFHFL 330
Query: 301 GIA----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP--IDG--DVGIFGNFAQ 352
G P +T HF+ + ++ ++ +CF Q I G D+ I G+
Sbjct: 331 GSVDDKFPKITFHFENDLTLD-VYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVI 389
Query: 353 SDLFIGYDFDSQMVSFKPTDCTKQ 376
S+ + YD + Q + + + ++
Sbjct: 390 SNKVVVYDMEKQAIGWTEHNSVEE 413
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 99/376 (26%), Positives = 159/376 (42%), Gaps = 44/376 (11%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI-----YNPA 70
N+ G Y IGTP + Y +DTGS WV + C QC + + Y+P
Sbjct: 75 NIPYGTGLYYTDIGIGTPAV-KYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPR 133
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF------GN 124
SS S KE+ C C C+ C Y GYAD LT G+L T+ + + G
Sbjct: 134 SSVSSKEVKCDDTIC--TSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQ 191
Query: 125 SNNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVP 180
+ +V FGCG +G N + + G++G G + + SQ+ + K FS+CL
Sbjct: 192 TQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL-- 249
Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYY 240
S F G V V T+ + K ++ Y+ V L+ I N++ ++ +P
Sbjct: 250 ----DSTNGGGIFAIGEVVEPK--VKTTPIVKNNEVYHLVNLKSI---NVAGTTLQLP-A 299
Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
N G FID+G+ LP+ Y+ L V + D +G+ ++
Sbjct: 300 NIFGTTKTKGTFIDSGSTLVYLPEIIYSELILAV-----FAKHPDITMGAMYNFQCFHFL 354
Query: 301 GIA----PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP--IDG--DVGIFGNFAQ 352
G P +T HF+ + ++ ++ +CF Q I G D+ I G+
Sbjct: 355 GSVDDKFPKITFHFENDLTLD-VYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVI 413
Query: 353 SDLFIGYDFDSQMVSF 368
S+ + YD + Q + +
Sbjct: 414 SNKVVVYDMEKQAIGW 429
>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
Length = 469
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 112/365 (30%), Positives = 161/365 (44%), Gaps = 30/365 (8%)
Query: 25 VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP---IYNPASSSSYKELSCQ 81
V+ ++GTP + G+VD S +W QC PC + P + P S+++ L C
Sbjct: 89 VINITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSPLPCS 148
Query: 82 SEQCHLLDTVSCSSQQL---------CN-YTYGYADSSL-TKGVLATERITFGNSNNFFD 130
S+ C + +C C+ Y+ Y S+ T G LAT+ TFG +
Sbjct: 149 SDMCLPVLRETCGRAGAAANATAGARCDSYSLTYGGSAANTSGYLATDTFTFGATA--VP 206
Query: 131 NVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSY-CLVPFHTDS-SIT 188
VVFGC + G F G++G+GR LSL SQ+ Q G KFSY L P TD S
Sbjct: 207 GVVFGCSDASYGDF-AGASGVIGIGRGNLSLISQL--QFG--KFSYQLLAPEATDDGSAD 261
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYN-SSGAI 246
S + FG+ + ST L+S +Y+V L G+ V N IP A
Sbjct: 262 SVIRFGDDAVPKTKRGQSTPLLSSTLYPDFYYVNLTGVRVDG--NRLDAIPAGTFDLRAN 319
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ-DPRLGSQLCYKTPSMAGI-AP 304
G + + + P T L + Y+ + V + I L L LCY SMA + P
Sbjct: 320 GTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGLPAVNGSAALELDLCYNASSMAKVKVP 379
Query: 305 ILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
LT FDGGA + L + F G+ C M P G + G Q+ + YD D+
Sbjct: 380 KLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGG-SVLGTLLQTGTNMIYDVDAG 438
Query: 365 MVSFK 369
++F+
Sbjct: 439 RLTFE 443
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 99/382 (25%), Positives = 160/382 (41%), Gaps = 39/382 (10%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP---IYNP 69
+ S G+Y +K +GTP + + DTGS+L WV+C P ++ P
Sbjct: 80 MSSGAYAGTGQYFVKVLVGTP-AQEFTLVADTGSELTWVKCA------GGASPPGLVFRP 132
Query: 70 ASSSSYKELSCQSEQCHL---LDTVSCSSQ-QLCNYTYGYADSSLTK-GVLATERITF-- 122
+S S+ + C S+ C L +CSS C+Y Y Y + S GV+ T+ T
Sbjct: 133 EASKSWAPVPCSSDTCKLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIAL 192
Query: 123 -GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
G +VV GC + G ++ G++ LG ++S AS+ ++ G + FSYCLV
Sbjct: 193 PGGKVAQLQDVVLGCSSTHDGQSFKSVDGVLSLGNAKISFASRAAARFGGS-FSYCLVDH 251
Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
+ T + FG G +V T L +Y V ++ + V + +
Sbjct: 252 LAPRNATGYLAFGPG-QVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVWDP 310
Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQD-PRLGSQLCYK----T 296
SG + +D+G T+L Y + + + P D P + CY
Sbjct: 311 KSGGV-----ILDSGTTLTVLATPAYKAVVAALTKLLAGVPKVDFPPF--EHCYNWTAPR 363
Query: 297 PSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD---VGIFGNFAQS 353
P I P L F G A++ S ++ GV C +Q +G+ V + GN Q
Sbjct: 364 PGAPEI-PKLAVQFTGCARLEPPAKS-YVIDVKPGVKCIGLQ--EGEWPGVSVIGNIMQQ 419
Query: 354 DLFIGYDFDSQMVSFKPTDCTK 375
+ +D + V F P+ CT+
Sbjct: 420 EHLWEFDLKNMEVRFMPSTCTR 441
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 99/363 (27%), Positives = 153/363 (42%), Gaps = 30/363 (8%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
Y+ + +GTP + I D +D WV C C C P ++P SS+Y+ + C S
Sbjct: 82 NYIARAGLGTPAQTLLVAI-DPSNDAAWVPCSACAGCAAS-SPSFSPTQSSTYRTVPCGS 139
Query: 83 EQCHLLDTVSCSS--QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
QC + + SC + C + YA S+ + VL + + NN + FGC
Sbjct: 140 PQCAQVPSPSCPAGVGSSCGFNLTYAASTF-QAVLGQDSLAL--ENNVVVSYTFGCLRVV 196
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
+G + GL+G GR LS SQ G + FSYCL P + S+ + + G +
Sbjct: 197 SG-NSVPPQGLIGFGRGPLSFLSQTKDTYG-SVFSYCL-PNYRSSNFSGTLKLGPIGQPK 253
Query: 201 GGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGA---ISKGNMFIDTG 256
+ +T L+ + + Y+V + GI VG SK++ S+ A ++ ID G
Sbjct: 254 --RIKTTPLLYNPHRPSLYYVNMIGIRVG-----SKVVQVPQSALAFNPVTGSGTIIDAG 306
Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKV 316
T L Y + + R ++ TP P G CY + P +T F G V
Sbjct: 307 TMFTRLAAPVYAAVRDAFRGRVR-TPVAPPLGGFDTCYN---VTVSVPTVTFMFAGAVAV 362
Query: 317 PLIHTSTFIPPPVEGVFCFAMQ--PIDG---DVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
L + I GV C AM P DG + + + Q + + +D + V F
Sbjct: 363 TLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRE 422
Query: 372 DCT 374
CT
Sbjct: 423 LCT 425
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 99/363 (27%), Positives = 153/363 (42%), Gaps = 30/363 (8%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
Y+ + +GTP + I D +D WV C C C P ++P SS+Y+ + C S
Sbjct: 101 NYIARAGLGTPAQTLLVAI-DPSNDAAWVPCSACAGCAAS-SPSFSPTQSSTYRTVPCGS 158
Query: 83 EQCHLLDTVSCSS--QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
QC + + SC + C + YA S+ + VL + + NN + FGC
Sbjct: 159 PQCAQVPSPSCPAGVGSSCGFNLTYAASTF-QAVLGQDSLAL--ENNVVVSYTFGCLRVV 215
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
+G + GL+G GR LS SQ G + FSYCL P + S+ + + G +
Sbjct: 216 SG-NSVPPQGLIGFGRGPLSFLSQTKDTYG-SVFSYCL-PNYRSSNFSGTLKLGPIGQPK 272
Query: 201 GGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGA---ISKGNMFIDTG 256
+ +T L+ + + Y+V + GI VG SK++ S+ A ++ ID G
Sbjct: 273 --RIKTTPLLYNPHRPSLYYVNMIGIRVG-----SKVVQVPQSALAFNPVTGSGTIIDAG 325
Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKV 316
T L Y + + R ++ TP P G CY + P +T F G V
Sbjct: 326 TMFTRLAAPVYAAVRDAFRGRVR-TPVAPPLGGFDTCYN---VTVSVPTVTFMFAGAVAV 381
Query: 317 PLIHTSTFIPPPVEGVFCFAMQ--PIDG---DVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
L + I GV C AM P DG + + + Q + + +D + V F
Sbjct: 382 TLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRE 441
Query: 372 DCT 374
CT
Sbjct: 442 LCT 444
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 92/335 (27%), Positives = 147/335 (43%), Gaps = 44/335 (13%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
NG Y + IGTPP + IVD+GS + +V C C QC P + P SSSY + C
Sbjct: 86 NGYYTTRLYIGTPPQ-EFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKC 144
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF-FDNVVFGCGHN 139
+D S ++ C Y YA+ S + GVL + ++FG + VFGC ++
Sbjct: 145 N------VDCTCDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKAQRAVFGCENS 198
Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSITSKMYFGNGS 197
TG +F+++ G++GLGR +LS+ Q++ + N FS C M G G+
Sbjct: 199 ETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVINDSFSLCY----------GGMDIGGGA 248
Query: 198 EVSGGGVVSTSLV-SKED---KTYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKGNM 251
V GG + +V S+ D YY + L+ I V L S++ SK
Sbjct: 249 MVLGGVPTPSDMVFSRSDPLRSPYYNIELKEIHVAGKALRVDSRIFD--------SKHGT 300
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCY-----KTPSMAGIA 303
+D+G LP+ + ++ V +++K DP +C+ + +
Sbjct: 301 VLDSGTTYAYLPEQAFMAFKDAVTSKVHSLKKIRGPDPSY-KDICFAGARRNVSKLHEVF 359
Query: 304 PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM 337
P + F G K+ L F V+G +C +
Sbjct: 360 PDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGV 394
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 98/384 (25%), Positives = 178/384 (46%), Gaps = 51/384 (13%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSS 74
A G Y K IGTP D Y VDTGSD+MWV C+ C +C K+ +Y+ S +
Sbjct: 94 AVGLYYAKIGIGTPAR-DYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLT 152
Query: 75 YKELSCQSEQCHLLDT---VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD- 130
K +SC + C+ ++ C + C+YT YAD S + G + + + + +
Sbjct: 153 GKLVSCDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLET 212
Query: 131 -----NVVFGCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFH 182
+V+FGC +G + E G++G G++ S+ SQ+ S K F++CL
Sbjct: 213 TSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCL---- 268
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYY 240
+ F G V V +T LV ++T+Y V ++ + VG L+ + +
Sbjct: 269 --DGLNGGGIFAIGHIVQ-PKVNTTPLV--PNQTHYNVNMKAVEVGGYFLNLPTDVFDVG 323
Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQDPRLGSQLCYK-T 296
+ G I ID+G LP+ Y++L ++ ++ +K+ D C++ +
Sbjct: 324 DKKGTI------IDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQF----TCFQYS 373
Query: 297 PSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPID-GDVGIFGNF 350
S+ P +T HF+ + +H ++ +G++C MQ D ++ + G+
Sbjct: 374 ESLDDGFPAVTFHFENSLYLK-VHPHEYL-FSYDGLWCIGWQNSGMQSRDRRNITLLGDL 431
Query: 351 AQSDLFIGYDFDSQMVSFKPTDCT 374
A S+ + YD ++Q++ + +C+
Sbjct: 432 ALSNKLVLYDLENQVIGWTEYNCS 455
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 93/374 (24%), Positives = 156/374 (41%), Gaps = 45/374 (12%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
NG Y + IGTPP + IVD+GS + +V C C QC P + P SS+Y + C
Sbjct: 85 NGYYTTRLHIGTPPQ-EFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKC 143
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
+D S + C Y YA+ S + GVL + ++FG + VFGC ++
Sbjct: 144 N------VDCTCDSDKNQCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQRAVFGCENS 197
Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
TG +F+++ G++GLGR +LS+ Q++ + + + FS C M G G+
Sbjct: 198 ETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCY----------GGMDIGGGA 247
Query: 198 EVSGGGVVSTSLVSKEDKT----YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
V G ++ YY + L+ + V + + G + +
Sbjct: 248 MVLGAMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTV------L 301
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG----------IA 303
D+G LP+ + ++ V + + P + R G YK AG +
Sbjct: 302 DSGTTYAYLPEQAFVAFKDAVSSQVH--PLKKIR-GPDSNYKDICFAGAGRNVSQLSEVF 358
Query: 304 PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDF 361
P + F G K+ L F VEG +C + Q + G + + YD
Sbjct: 359 PKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDR 418
Query: 362 DSQMVSFKPTDCTK 375
++ + F T+C++
Sbjct: 419 HNEKIGFWKTNCSE 432
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 110/379 (29%), Positives = 164/379 (43%), Gaps = 42/379 (11%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-----YKQVKPIYNPASSSSYK 76
G Y + +G PP D Y +DTGSD++WV C C C + ++P SS++
Sbjct: 81 GLYYTRVQLGNPPK-DFYVQIDTGSDVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTAS 139
Query: 77 ELSCQSEQCHL----LDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG------NSN 126
+SC + C L D+ C Y + Y D S T G + I ++
Sbjct: 140 LVSCSDQICALGVQSSDSACFGQSNQCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVTS 199
Query: 127 NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFH 182
N +VVFGC + TG +++ G+ G G+ LS+ SQ+ S+ A K FS+CL
Sbjct: 200 NSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCL---K 256
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYY 240
D S + G E+ VV T LV + +Y + L+ ISV L S +
Sbjct: 257 GDDSGGGILVLG---EIVEPNVVYTPLVPSQ--PHYNLNLQSISVNGQVLPISPAVFATS 311
Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT-PSM 299
+S G I ID+G L ++ YN V N + + Q L CY T S+
Sbjct: 312 SSQGTI------IDSGTTLAYLAEEAYNAFVVAVTNIVSQST-QSVVLKGNRCYVTSSSV 364
Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD-VGIFGNFAQSDL 355
+ I P ++ +F GGA + L I G V+C Q I G + I G+ D
Sbjct: 365 SDIFPQVSLNFAGGASLVLGAQDYLIQQNSVGGTTVWCIGFQKIPGQGITILGDLVLKDK 424
Query: 356 FIGYDFDSQMVSFKPTDCT 374
YD +Q + + DC+
Sbjct: 425 IFIYDLANQRIGWTNYDCS 443
>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 373
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 91/180 (50%), Gaps = 14/180 (7%)
Query: 9 PNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQVK--- 64
P++ V + S ++ M S+GTP + ++ I DTGS + WVQC C V CY Q +
Sbjct: 8 PDSAVIGDDSIRKNQFFMGISLGTPAVFNLVTI-DTGSTISWVQCQYCIVHCYTQDQRAG 66
Query: 65 PIYNPASSSSYKELSCQSEQCHLLDTVS------CSSQQLCNYTYGYADSSLTKGVLATE 118
P +N +SSS+Y+ + C ++ CH + + C Y+ YA + G L+ +
Sbjct: 67 PTFNTSSSSTYRRVGCSAQVCHDMHVSQNIPSGCVEEEDSCIYSLRYASGEYSAGYLSQD 126
Query: 119 RITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL 178
R+T NS + +FGCG +N +N + G++G G S +QI + FSYC
Sbjct: 127 RLTLANSYS-IQKFIFGCGSDNR--YNGHSAGIIGFGNKSYSFFNQIAQLTNYSAFSYCF 183
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 93/374 (24%), Positives = 156/374 (41%), Gaps = 45/374 (12%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
NG Y + IGTPP + IVD+GS + +V C C QC P + P SS+Y + C
Sbjct: 85 NGYYTTRLHIGTPPQ-EFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKC 143
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
+D S + C Y YA+ S + GVL + ++FG + VFGC ++
Sbjct: 144 N------VDCTCDSDKNQCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQRAVFGCENS 197
Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
TG +F+++ G++GLGR +LS+ Q++ + + + FS C M G G+
Sbjct: 198 ETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCY----------GGMDIGGGA 247
Query: 198 EVSGGGVVSTSLVSKEDKT----YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
V G ++ YY + L+ + V + + G + +
Sbjct: 248 MVLGAMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTV------L 301
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG----------IA 303
D+G LP+ + ++ V + + P + R G YK AG +
Sbjct: 302 DSGTTYAYLPEQAFVAFKDAVSSQVH--PLKKIR-GPDPNYKDICFAGAGRNVSQLSEVF 358
Query: 304 PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDF 361
P + F G K+ L F VEG +C + Q + G + + YD
Sbjct: 359 PKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDR 418
Query: 362 DSQMVSFKPTDCTK 375
++ + F T+C++
Sbjct: 419 HNEKIGFWKTNCSE 432
>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
Length = 461
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 108/440 (24%), Positives = 165/440 (37%), Gaps = 99/440 (22%)
Query: 13 VQSNVSTANGEYVMKFSIGTP--PLLDIYGIVDTGSDLMWVQCLPCVQC----------- 59
+ S T G+Y ++F +GTP P L + DTGSDL WV+C
Sbjct: 44 LSSGAYTGTGQYFVRFRVGTPARPFLLV---ADTGSDLTWVKCRRHAAPAPAPAPAPGYN 100
Query: 60 YKQVKP-----------------IYNPASSSSYKELSCQSEQCHL---LDTVSCSSQ-QL 98
Y P ++ P S ++ + C S+ C +C +
Sbjct: 101 YGYGAPASNDSSSVSAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSP 160
Query: 99 CNYTYGYADSSLTKGVLATERITFGNSNNF---------FDNVVFGCGHNNTGVFNENEM 149
C Y Y Y D S +G + T+ T S VV GC + TG
Sbjct: 161 CAYEYRYKDGSAARGTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFLASD 220
Query: 150 GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG--------------- 194
G++ LG + +S AS+ ++ G +FSYCLV + TS + FG
Sbjct: 221 GVLSLGYSNVSFASRAAARFG-GRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRTAC 279
Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGNMFI 253
GS + G + L+ + +Y V + G+SV G L +L+ G +
Sbjct: 280 AGSAAAPGARQTPLLLDHRMRPFYAVAVNGVSVDGELLRIPRLVWDVQKG-----GGAIL 334
Query: 254 DTGAPPTLLPKDFYNRLEEQVRNAIKLTPY--QDPRLGSQLCYKTPS------MAGIAPI 305
D+G T+L Y + + + P DP CY S +A P
Sbjct: 335 DSGTSLTVLVSPAYRAVVAALGKKLVGLPRVAMDP---FDYCYNWTSPLTGEDLAVAVPA 391
Query: 306 LTAHFDGGAKVPLIHTSTFIPPP-------VEGVFCFAMQPIDGD---VGIFGNFAQSDL 355
L HF G A++ PPP GV C +Q +GD V + GN Q +
Sbjct: 392 LAVHFAGSARLQ--------PPPKSYVIDAAPGVKCIGLQ--EGDWPGVSVIGNILQQEH 441
Query: 356 FIGYDFDSQMVSFKPTDCTK 375
+D ++ + FK + C +
Sbjct: 442 LWEFDLKNRRLRFKRSRCMQ 461
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 103/380 (27%), Positives = 164/380 (43%), Gaps = 46/380 (12%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
N + ++G+PP ++ ++DTGS+L W+ C + + +NP SSSY C
Sbjct: 57 NVTLTVSLTVGSPPQ-NVTMVLDTGSELSWLHC----KKLPNLNSTFNPLLSSSYTPTPC 111
Query: 81 QSEQC-----HLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
S C L SC + +LC+ YAD+S +G LA E TF + +F
Sbjct: 112 NSSICTTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAE--TFSLAGAAQPGTLF 169
Query: 135 GCGHNN--TGVFNENE--MGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSK 190
GC + T NE+ GL+G+ R LSL ++Q+ KFSYC+ +
Sbjct: 170 GCMDSAGYTSDINEDSKTTGLMGMNRGSLSL----VTQMSLPKFSYCI----SGEDALGV 221
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYF------VTLEGISVG-NLSNSSKLIPYYNSS 243
+ G+G++ + + T LV+ + YF V LEGI V L K + + +
Sbjct: 222 LLLGDGTD-APSPLQYTPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDHT 280
Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRL----GSQLCYKTP 297
GA G +D+G T L Y+ L+++ K LT +DP LCY P
Sbjct: 281 GA---GQTMVDSGTQFTFLLGSVYSSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAP 337
Query: 298 SMAGIAPILTAHFDGGA-KVPLIHTSTFIPPPVEGVFCFAMQPIDG---DVGIFGNFAQS 353
+ P +T F G +V + + V+CF D + + G+ Q
Sbjct: 338 ASFAAVPAVTLVFSGAEMRVSGERLLYRVSKGSDWVYCFTFGNSDLLGIEAYVIGHHHQQ 397
Query: 354 DLFIGYDFDSQMVSFKPTDC 373
++++ +D V F T C
Sbjct: 398 NVWMEFDLLKSRVGFTQTTC 417
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 104/357 (29%), Positives = 165/357 (46%), Gaps = 46/357 (12%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
G YV++ +GTP L ++ ++DT D WV C C C P ++P +SS+Y L C
Sbjct: 97 GNYVVRVKLGTPGQL-MFMVLDTSRDAAWVPCADCAGCSS---PTFSPNTSSTYASLQCS 152
Query: 82 SEQCHLLDTVSC----SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
QC + +SC ++ N TYG DSS + + + + G + + + FGC
Sbjct: 153 VPQCTQVRGLSCPTTGTAACFFNQTYG-GDSSFSAML---SQDSLGLAVDTLPSYSFGCV 208
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
+ +G GL+GLGR +SL SQ S L + FSYC F K Y+ +GS
Sbjct: 209 NAVSGS-TLPPQGLLGLGRGPMSLLSQSGS-LYSGVFSYCFPSF--------KSYYFSGS 258
Query: 198 EVSG-----GGVVSTSLVSKEDK-TYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKG 249
G + +T L+ + T Y+V L G+SVG + + +L+ + ++GA
Sbjct: 259 LRLGPLGQPKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTGA---- 314
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGIAPILTA 308
ID+G T + Y + ++ R +K P+ +G+ C+ + IAP +T
Sbjct: 315 GTIIDSGTVITRFVEPVYAAIRDEFRKQVK-GPFAT--IGAFDTCFAA-TNEDIAPPVTF 370
Query: 309 HFDG-GAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDV----GIFGNFAQSDLFIGYD 360
HF G K+PL +T I + C AM +V + N Q +L I +D
Sbjct: 371 HFTGMDLKLPL--ENTLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFD 425
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 100/391 (25%), Positives = 176/391 (45%), Gaps = 46/391 (11%)
Query: 19 TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
T G Y + +GTPP Y VDTGSD++WV C+ C +C ++ Y+P +SS
Sbjct: 82 TDTGLYFTEIKLGTPPK-RYYVQVDTGSDILWVNCISCSKCPRKSGLGLDLTFYDPKASS 140
Query: 74 SYKELSCQSEQCHLL---DTVSCSSQQLCNYTYGYADSSLTKGVLATERITF------GN 124
S +SC C C++ C Y+ Y D S T G T+ + F G
Sbjct: 141 SGSTVSCDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFITDALQFDQVTGDGQ 200
Query: 125 SNNFFDNVVFGCGHNNTG-VFNENEM--GLVGLGRTRLSLASQILSQLGANK-FSYCLVP 180
+ + FGCG G + N N+ G++G G+ S+ SQ+ + A K F++CL
Sbjct: 201 TQPGNATITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCLDT 260
Query: 181 ------FHTDSSITSKMYFGNGSEVSGGGVVSTSL----VSKEDKTYYFVTLEGISVGNL 230
F + + K YF G+++ L + + +Y V L+ I VG
Sbjct: 261 IKGGGIFAIGNVVQPKCYF---VFFFAHGLLNIPLFLLVMILLSRPHYNVNLKSIDVG-- 315
Query: 231 SNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS 290
++ +P + KG + ID+G T LP+ + ++ + V + + + + L
Sbjct: 316 -GTTLQLPAHVFETGEKKGTI-IDSGTTLTYLPELVFKQVMDVVFSKHRDIAFHN--LQD 371
Query: 291 QLCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-D 343
LC++ + S+ P +T HF+ + + F P + ++C A+Q DG D
Sbjct: 372 FLCFQYSGSVDDGFPTITFHFEDDLALHVYPHEYFFPNGND-IYCVGFQNGALQSKDGKD 430
Query: 344 VGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ + G+ S+ + YD ++Q++ + +C+
Sbjct: 431 IVLMGDLVLSNKLVVYDLENQVIGWTDYNCS 461
>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 95/373 (25%), Positives = 161/373 (43%), Gaps = 39/373 (10%)
Query: 25 VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI--YNPASSSSYKELSCQS 82
++ IGTP ++DTGS L W+QC P P ++P+ SSS+ +L C
Sbjct: 81 ILSLPIGTPSQSQEL-VLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSH 139
Query: 83 EQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
C SC S +LC+Y+Y YAD + +G L E+ TF NS ++ GC
Sbjct: 140 PLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQT-TPPLILGCA 198
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS---SITSKMYFG 194
+T +E G++G+ RLS +SQ +KFSYC +P ++ + T Y G
Sbjct: 199 KEST-----DEKGILGMNLGRLSF----ISQAKISKFSYC-IPTRSNRPGLASTGSFYLG 248
Query: 195 NGSEVSGGGVVS------TSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
+ G VS + + D Y V L+GI +G + + +G
Sbjct: 249 DNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGG--S 306
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS--QLCYKTPSMAGIAPI- 305
G +D+G+ T L Y++++E++ + + GS +C+ I +
Sbjct: 307 GQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLI 366
Query: 306 --LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFIGYD 360
L F G ++ L+ + + G+ C + + I GN Q +L++ +D
Sbjct: 367 GDLVFEFGRGVEI-LVEKQSLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFD 425
Query: 361 FDSQMVSFKPTDC 373
++ V F +C
Sbjct: 426 VTNRRVGFSKAEC 438
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 98/383 (25%), Positives = 177/383 (46%), Gaps = 51/383 (13%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSS 74
A G Y K IGTP D Y VDTGSD+MWV C+ C +C K+ +Y+ S +
Sbjct: 94 AVGLYYAKIGIGTPAR-DYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLT 152
Query: 75 YKELSCQSEQCHLLDT---VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD- 130
K +SC + C+ ++ C + C+YT YAD S + G + + + + +
Sbjct: 153 GKLVSCDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLET 212
Query: 131 -----NVVFGCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFH 182
+V+FGC +G + E G++G G++ S+ SQ+ S K F++CL
Sbjct: 213 TSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCL---- 268
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYY 240
+ F G V V +T LV ++T+Y V ++ + VG L+ + +
Sbjct: 269 --DGLNGGGIFAIGHIVQ-PKVNTTPLV--PNQTHYNVNMKAVEVGGYFLNLPTDVFDVG 323
Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQDPRLGSQLCYK-T 296
+ G I ID+G LP+ Y++L ++ ++ +K+ D C++ +
Sbjct: 324 DKKGTI------IDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQF----TCFQYS 373
Query: 297 PSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPID-GDVGIFGNF 350
S+ P +T HF+ + +H ++ +G++C MQ D ++ + G+
Sbjct: 374 ESLDDGFPAVTFHFENSLYLK-VHPHEYL-FSYDGLWCIGWQNSGMQSRDRRNITLLGDL 431
Query: 351 AQSDLFIGYDFDSQMVSFKPTDC 373
A S+ + YD ++Q++ + +C
Sbjct: 432 ALSNKLVLYDLENQVIGWTEYNC 454
>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 414
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 100/344 (29%), Positives = 143/344 (41%), Gaps = 43/344 (12%)
Query: 48 LMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYAD 107
+ W QC PCV+C K ++P++S +Y SC + TV + Y Y D
Sbjct: 98 ITWTQCKPCVRCLKDSHRHFDPSASLTYSLGSC------IPSTVGNT------YNMTYGD 145
Query: 108 SSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILS 167
S + G + +T ++ F FGCG NN G F G++GLG+ +LS SQ S
Sbjct: 146 KSTSVGNYGCDTMTL-EPSDVFPKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTAS 204
Query: 168 QLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSK------EDKTYYFVT 221
+ FSYCL + SI S + FG S + TSLV+ E+ YYFV
Sbjct: 205 KF-KKVFSYCL---PEEDSIGS-LLFGE-KATSQSSLKFTSLVNGPGTSGLEESGYYFVK 258
Query: 222 LEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLT 281
L ISVGN + +P S + ID+G T LP+ Y+ L + A+
Sbjct: 259 LLDISVGNKRLN---VP----SSVFASPGTIIDSGTVITCLPQRAYSALTAAFKKAMAKY 311
Query: 282 PYQDPRLGS----QLCYKTPSMAGI-APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFA 336
P + R CY + P + HF GA V L + I C A
Sbjct: 312 PLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRL-NGKRVIWGNDASRLCLA 370
Query: 337 M-----QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
++ ++ I GN Q L + YD + F C+K
Sbjct: 371 FAGNSKSTMNSELTIIGNRQQVSLTVLYDIQGGRIGFGGNGCSK 414
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 99/320 (30%), Positives = 147/320 (45%), Gaps = 40/320 (12%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK----QVK-PIYNPASSSSYK 76
G Y K +GTPP D Y VDTGSD++WV C C C + Q++ ++P SS +
Sbjct: 79 GLYYTKLRLGTPPR-DFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTAS 137
Query: 77 ELSCQSEQCHLLDTVS---CSSQ-QLCNYTYGYADSSLTKGVLATERITF----GNS--N 126
+SC ++C S CS Q LC YT+ Y D S T G ++ + F G+S
Sbjct: 138 PISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVP 197
Query: 127 NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFH 182
N VVFGC + TG +++ G+ G G+ +S+ SQ+ SQ + FS+CL
Sbjct: 198 NSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL---- 253
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSK---EDKTYYFVTLEGISVGNLSNSSKLIPY 239
K G G + G +V ++V + +Y V L ISV + + +P
Sbjct: 254 -------KGENGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISV-----NGQALPI 301
Query: 240 YNSSGAISKGN-MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS 298
S + S G IDTG L + Y E + NA+ + G+Q T S
Sbjct: 302 NPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQCYVITTS 361
Query: 299 MAGIAPILTAHFDGGAKVPL 318
+ I P ++ +F GGA + L
Sbjct: 362 VGDIFPPVSLNFAGGASMFL 381
>gi|168051774|ref|XP_001778328.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670305|gb|EDQ56876.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 165
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 56/148 (37%), Positives = 76/148 (51%), Gaps = 17/148 (11%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASS 72
++S S +GEY + I TPP I I+DTGSDL WVQC PC+ CY Q ++NP SS
Sbjct: 1 MESGASLGSGEYFIDIFIDTPPR-HILVIIDTGSDLTWVQCTPCLHCYLQKGLVFNPHSS 59
Query: 73 SSYKELSCQSEQCHLLD-----TVSCSSQQLCNYTYGYADSSLTKGVLATERITF----- 122
SY ++C + ++ + + Q C+Y Y Y DSS T ATE T
Sbjct: 60 ESYDPVACGEPKRAFVESSNNRSTCVTDSQGCSYFYWYGDSSNTTSDFATETFTVNKTIK 119
Query: 123 -----GNSNNF-FDNVVFGCGHNNTGVF 144
G + ++FGCGHNN G+F
Sbjct: 120 NDEGGGEDDTLQISKIMFGCGHNNQGLF 147
>gi|125532795|gb|EAY79360.1| hypothetical protein OsI_34488 [Oryza sativa Indica Group]
Length = 342
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 92/364 (25%), Positives = 157/364 (43%), Gaps = 65/364 (17%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
Y+ +IGTPP I+ + +W QC PC +C+KQ P++N Y+
Sbjct: 28 YMANLTIGTPPQ-PASAIIHLAGEFVWTQCSPCRRCFKQDLPLFN-----RYE------- 74
Query: 84 QCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGV 143
++T+ + D+S G+ T+ G + ++ FGC ++
Sbjct: 75 ----VETM-------------FGDTS---GIGGTDTFAIGTATA---SLAFGCAMDSNIK 111
Query: 144 FNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGG- 202
G+VGLGRT SL + Q+ A FSYCL P H + S + G ++++GG
Sbjct: 112 QLLGASGVVGLGRTPWSL----VGQMNATAFSYCLAP-HGAAGKKSALLLGASAKLAGGK 166
Query: 203 GVVSTSLV-SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTL 261
+T LV + +D + Y + LEGI G++ + P N S + +DT +
Sbjct: 167 SAATTPLVNTSDDSSDYMIHLEGIKFGDV----IIEPPPNGS------VVLVDTIFGVSF 216
Query: 262 LPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA------PILTAHFDGGAK 315
L ++ +++ V A+ P P LC+ + A A P + F G A
Sbjct: 217 LVDAAFHAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQGAAA 276
Query: 316 VPLIHTSTFIPPPVEGVFCFAMQP-----IDGDVGIFGNFAQSDLFIGYDFDSQMVSFKP 370
+ + S ++ G C AM + ++ I G Q ++ +D D + +SF+P
Sbjct: 277 L-TVPPSKYMYDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEP 335
Query: 371 TDCT 374
DC+
Sbjct: 336 ADCS 339
>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 111/412 (26%), Positives = 176/412 (42%), Gaps = 83/412 (20%)
Query: 18 STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP---CVQCYK-------QVKPIY 67
S + G Y + S GTPP + I+DTGSD++W C C C +++P +
Sbjct: 61 SHSYGGYSVSLSFGTPPQTLSF-IMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQP-F 118
Query: 68 NPASSSSYKELSCQSEQCHLLDTVSCSSQQLCN-----------YTYGYADSSLTKGVLA 116
P SSS K L C++ +C + + + Q C+ Y Y S T GV
Sbjct: 119 IPKESSSSKLLGCKNPKCSWIHHSNINCDQDCSIKSCLNQTCPPYMIFYG-SGTTGGVAL 177
Query: 117 TERITFGNSNNFFDNVVFGCGHNNTGVFNENE-MGLVGLGRTRLSLASQILSQLGANKFS 175
+E + + + N + GC VF+ ++ G+ G GR SL SQ LG KFS
Sbjct: 178 SETLHLHSLSK--PNFLVGCS-----VFSSHQPAGIAGFGRGLSSLPSQ----LGLGKFS 226
Query: 176 YCLVP--FHTDSSITSKMYFGN---GSEVSGGGVVSTSLVSK---EDKT----YYFVTLE 223
YCL+ F D+ +S + S+ +V T V ++K+ YY++ L
Sbjct: 227 YCLLSHRFDDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLR 286
Query: 224 GISVGNLSNSSKLIPY-YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEE----QVRNAI 278
I+VG +PY Y S G G + ID+G T + ++ + L + Q+++
Sbjct: 287 RITVG---GHHVKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYR 343
Query: 279 KLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM 337
++ +D +G + C+ ++ P L +F GGA V L PVE F F
Sbjct: 344 RVKEIEDA-IGLRPCFNVSDAKTVSFPELRLYFKGGADVAL---------PVENYFAFVG 393
Query: 338 QPI-------DGDVG---------IFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ DG G I GNF + ++ YD ++ + FK C
Sbjct: 394 GEVACLTVVTDGVAGPERVGGPGMILGNFQMQNFYVEYDLRNERLGFKQEKC 445
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 97/367 (26%), Positives = 166/367 (45%), Gaps = 35/367 (9%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
G YV++ +GTPP L ++ ++DT +D +W+ C C C +N SSS+Y +SC
Sbjct: 102 GNYVVRAKLGTPPQL-MFMVLDTSNDAVWLPCSGCSGC-SNASTSFNTNSSSTYSTVSCS 159
Query: 82 SEQCHLLDTVSCSSQ----QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
+ QC ++C S +C++ Y S L + +T + + N FGC
Sbjct: 160 TAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTL--APDVIPNFSFGCI 217
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
++ +G + GL+GLGR +SL SQ S L + FSYCL F + YF
Sbjct: 218 NSASG-NSLPPQGLMGLGRGPMSLVSQTTS-LYSGVFSYCLPSFRS-------FYFSGSL 268
Query: 198 EVSGGG----VVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
++ G + T L+ + + Y+V L G+SVG++ + P Y + A S
Sbjct: 269 KLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSV--QVPVDPVYLTFDANSGAGTI 326
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGIAPILTAHFD 311
ID+G T + Y + ++ R + ++ + LG+ C+ + +AP +T H
Sbjct: 327 IDSGTVITRFAQPVYEAIRDEFRKQVNVSSFST--LGAFDTCFSADN-ENVAPKITLHMT 383
Query: 312 G-GAKVPLIHTSTFIPPPVEGVFCFAM----QPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
K+P+ +T I + C +M Q + + + N Q +L I +D + +
Sbjct: 384 SLDLKLPM--ENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRI 441
Query: 367 SFKPTDC 373
P C
Sbjct: 442 GIAPEPC 448
>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 445
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 92/381 (24%), Positives = 160/381 (41%), Gaps = 57/381 (14%)
Query: 25 VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI--YNPASSSSYKELSCQS 82
V+ IGTPP ++DTGS L W+QC + + P ++P+ SSS+ L C
Sbjct: 89 VVTLPIGTPPQPQQM-VLDTGSQLSWIQC------HNKTPPTASFDPSLSSSFYVLPCTH 141
Query: 83 EQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
C +C +LC+Y+Y YAD + +G L E++ F S ++ GC
Sbjct: 142 PLCKPRVPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLAFSPSQT-TPPLILGCS 200
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCL---VPFHTDSSITSKMYFG 194
+ + G++G+ RLS Q KFSYC+ P + ++ T Y G
Sbjct: 201 SE-----SRDARGILGMNLGRLSFPFQA----KVTKFSYCVPTRQPANNNNFPTGSFYLG 251
Query: 195 NGSEVSGGGVVS------TSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
N + VS + + D Y V ++GI +G + P A
Sbjct: 252 NNPNSARFRYVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGG--RKLNIPPSVFRPNAGGS 309
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG--------SQLCYKTPSMA 300
G +D+G+ T L Y+R+ E++ + PR+ + +C+ +M
Sbjct: 310 GQTMVDSGSEFTFLVDVAYDRVREEIIRVL------GPRVKKGYVYGGVADMCFDGNAME 363
Query: 301 GIAPIL---TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSD 354
I +L F+ G ++ ++ + GV C + + + I GNF Q +
Sbjct: 364 -IGRLLGDVAFEFEKGVEI-VVPKERVLADVGGGVHCVGIGRSERLGAASNIIGNFHQQN 421
Query: 355 LFIGYDFDSQMVSFKPTDCTK 375
L++ +D ++ + F DC++
Sbjct: 422 LWVEFDLANRRIGFGVADCSR 442
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 91/387 (23%), Positives = 170/387 (43%), Gaps = 65/387 (16%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
G Y K +G+PP + + VDTGSD++W+ C PC +C + +++ +SS+ K
Sbjct: 72 GLYFTKIKLGSPPK-EYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSK 130
Query: 77 ELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKG-----VLATERITFG-NSNNFF 129
++ C + C + + SC C+Y YAD S + G +L E++T +
Sbjct: 131 KVGCDDDFCSFISQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLG 190
Query: 130 DNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDS 185
VVFGCG + +G + G++G G++ S+ SQ+ + A + FS+CL
Sbjct: 191 QEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL------- 243
Query: 186 SITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGNLSNSSKL 236
V GGG+ + +V ++ +Y V L G+ V +S
Sbjct: 244 -----------DNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDV---DGTSLD 289
Query: 237 IPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV--RNAIKLTPYQDPRLGSQLCY 294
+P + G +D+G PK Y+ L E + R +KL ++ + C+
Sbjct: 290 LP----RSIVRNGGTIVDSGTTLAYFPKVLYDSLIETILARQPVKLHIVEE----TFQCF 341
Query: 295 KTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP------IDGDVGIF 347
+ A P ++ F+ K+ ++ ++ E ++CF Q +V +
Sbjct: 342 SFSTNVDEAFPPVSFEFEDSVKLT-VYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILL 400
Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDCT 374
G+ S+ + YD D++++ + +C+
Sbjct: 401 GDLVLSNKLVVYDLDNEVIGWADHNCS 427
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 103/390 (26%), Positives = 174/390 (44%), Gaps = 52/390 (13%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNP 69
S ++T G Y + IGTP Y VDTGSD++WV C+ C C ++ +Y+P
Sbjct: 81 SGLATETGLYFTRIGIGTPAKR-YYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDP 139
Query: 70 ASSSSYKELSCQSEQC---HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF---- 122
S S + ++C + C + SC+S C Y+ Y D S T G T+ + +
Sbjct: 140 RGSQSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVS 199
Query: 123 --GNSNNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSY 176
G + +V FGCG G + + G++G G++ S+ SQ+ + K F++
Sbjct: 200 GDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAH 259
Query: 177 CLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSS 234
CL ++ F G+ V V +T LVS D +Y V L+GI VG L +
Sbjct: 260 CL------DTVNGGGIFAIGNVVQ-PKVKTTPLVS--DMPHYNVILKGIDVGGTALGLPT 310
Query: 235 KLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQDPRLGSQ 291
+ NS G I ID+G +P+ Y L V I + QD
Sbjct: 311 NIFDSGNSKGTI------IDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS---- 360
Query: 292 LCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-DV 344
C++ + S+ P +T HF+G + ++ ++ + ++C +Q DG D+
Sbjct: 361 -CFQYSGSVDDGFPEVTFHFEGDVSL-IVSPHDYLFQNGKNLYCMGFQNGGVQTKDGKDM 418
Query: 345 GIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ G+ S+ + YD ++Q + + +C+
Sbjct: 419 VLLGDLVLSNKLVLYDLENQAIGWADYNCS 448
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 98/375 (26%), Positives = 159/375 (42%), Gaps = 47/375 (12%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
NG Y + IG+PP + IVDTGS + +V C CVQC P + P SS+Y+ + C
Sbjct: 86 NGYYTTRLWIGSPPQ-EFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKC 144
Query: 81 QSEQCHLLDTVSCSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGH 138
++ +C + C Y YA+ S + GVLA + ++FG + VFGC
Sbjct: 145 NAD-------CNCDENGVQCTYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAVFGCET 197
Query: 139 NNTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNG 196
+G ++ + G++GLGR LS+ Q++ + + +N FS C M G G
Sbjct: 198 MESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCY----------GGMDVGGG 247
Query: 197 SEVSGGGVVSTSLV-SKEDKT---YYFVTLEGISVGNLSNSSKLIP--YYNSSGAISKGN 250
+ V GG +V S D + YY + L+ I V KL P + GAI
Sbjct: 248 AMVLGGISSPPGMVFSHSDPSRSPYYNIELKEIHVA--GKPLKLNPRTFDGKYGAI---- 301
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYK-----TPSMAGI 302
+D+G P+ Y ++ + I K DP +C+ + +
Sbjct: 302 --LDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNF-KDICFSGAGRDVTELPKV 358
Query: 303 APILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYD 360
P + F G K+ L F V G +C + D + G + + Y+
Sbjct: 359 FPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYN 418
Query: 361 FDSQMVSFKPTDCTK 375
++ + F T+C++
Sbjct: 419 RENSTIGFWKTNCSE 433
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 101/375 (26%), Positives = 160/375 (42%), Gaps = 40/375 (10%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK---PI--YNPASSSSYKEL 78
Y + +G+PP D Y +DTGSD++WV C C C P+ ++P SS + +
Sbjct: 90 YYTRLQLGSPPR-DFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLI 148
Query: 79 SCQSEQCHL----LDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNS--NNF 128
SC ++C L D+V + C YT+ Y D S T G ++ + F G S N
Sbjct: 149 SCSDQRCSLGLQSSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSVMKNS 208
Query: 129 FDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTD 184
+VFGC TG + + G+ G G+ +S+ SQ+ SQ + FS+CL D
Sbjct: 209 SAPIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCL---KGD 265
Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
S + G E+ +V T LV + +Y + L+ I V N L +
Sbjct: 266 DSGGGILVLG---EIVEPNIVYTPLVPSQ--PHYNLNLQSIYV----NGQTLAIDPSVFA 316
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRL--GSQLCYKTPSMAGI 302
S ID+G L + Y+ + + + +P P L G+Q + S+ +
Sbjct: 317 TSSNQGTIIDSGTTLAYLTEAAYDPFISAITSTV--SPSVSPYLSKGNQCYLTSSSINDV 374
Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPV---EGVFCFAMQPIDG-DVGIFGNFAQSDLFIG 358
P ++ +F GG + LI I ++C Q I G ++ I G+ D
Sbjct: 375 FPQVSLNFAGGTSMILIPQDYLIQQSSINGAALWCVGFQKIQGQEITILGDLVLKDKIFV 434
Query: 359 YDFDSQMVSFKPTDC 373
YD Q + + DC
Sbjct: 435 YDIAGQRIGWANYDC 449
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 103/389 (26%), Positives = 162/389 (41%), Gaps = 59/389 (15%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
N ++ ++GTPP ++ ++DTGS+L W+ C + ++P S+SY+ + C
Sbjct: 28 NVSLIVSLTVGTPPQ-NVSMVIDTGSELSWLHCNKTLS----YPTTFDPTRSTSYQTIPC 82
Query: 81 QSEQC-----HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
S C SC S LC+ T YAD+S + G LA++ G+S+ +VFG
Sbjct: 83 SSPTCTNRTQDFPIPASCDSNNLCHATLSYADASSSDGNLASDVFHIGSSD--ISGLVFG 140
Query: 136 CGHNNTGVFNEN------EMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
C + VF+ N GL+G+ R LS +SQLG KFSYC+ + + +
Sbjct: 141 CMDS---VFSSNSDEDSKSTGLMGMNRGSLSF----VSQLGFPKFSYCI----SGTDFSG 189
Query: 190 KMYFGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
+ G + + T L+ D+ Y V LEGI V + KL+P S+
Sbjct: 190 LLLLGESNLTWSVPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVLD-----KLLPIPKST 244
Query: 244 ---GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCY 294
G +D+G T L YN L N L +DP Q LCY
Sbjct: 245 FEPDHTGAGQTMVDSGTQFTFLLGPVYNALRSAFLNQTSSVLRVLEDPDFVFQGAMDLCY 304
Query: 295 KTPSMAGIAPIL---TAHFDGGA-KVPLIHTSTFIPPPVEG---VFCFAMQPID---GDV 344
P + P+L T F G V +P + G V C + D +
Sbjct: 305 LVPLSQRVLPLLPTVTLVFRGAEMTVSGDRVLYRVPGELRGNDSVHCLSFGNSDLLGVEA 364
Query: 345 GIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ G+ Q ++++ +D + + C
Sbjct: 365 YVIGHHHQQNVWMEFDLEKSRIGLAQVRC 393
>gi|223950045|gb|ACN29106.1| unknown [Zea mays]
Length = 392
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 101/374 (27%), Positives = 152/374 (40%), Gaps = 57/374 (15%)
Query: 34 PLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSC 93
P +I +VDTGS++ W C + S + L C S +C + C
Sbjct: 42 PKDNISAVVDTGSNIFWTTEKEC-------------SRSKTRSMLPCCSPKCEQRASCGC 88
Query: 94 SSQQL---------CNYT--YGYADSSLTKGVLATERITFGN-------SNNFFDNVVFG 135
+L C Y YG + T GVL +++T + F+ V G
Sbjct: 89 RRSELKAEAEKETKCTYAIKYGGNANDSTAGVLYEDKLTIVAVASKAVPGSQSFEEVAIG 148
Query: 136 CGHNNTGVFNENEM-GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS-----ITS 189
C + T F + + G+ GLGR+ A+ + QL +KFSYCL + +T+
Sbjct: 149 CSTSATLKFKDPSIKGVFGLGRS----ATSLPRQLNFSKFSYCLSSYQKPDLPSYLLLTA 204
Query: 190 KMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
G+ V +T+L D KT YFV L+GIS+G ++L SG
Sbjct: 205 APDMATGAVGGAAAVATTALQPNSDYKTRYFVDLQGISIGG----TRLPAVSTKSG---- 256
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY---QDPRLGSQLCYKTPSMAGIA-- 303
GNMF+DTG T L + +L ++ +K Y Q R Q+CY PS A
Sbjct: 257 GNMFVDTGTSFTRLEGTVFAKLVTELDRIMKERKYVKEQPGRNNGQICYSPPSTAADESS 316
Query: 304 --PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDF 361
P + HF A + L S + I G + + GNF + + D
Sbjct: 317 KLPDMVLHFADSANMVLPWDSYLWKTTSKLCLAIDKSNIKGGISVLGNFQMQNTHMLLDT 376
Query: 362 DSQMVSFKPTDCTK 375
++ +SF DC+K
Sbjct: 377 GNEKLSFVRADCSK 390
>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 102/382 (26%), Positives = 166/382 (43%), Gaps = 51/382 (13%)
Query: 25 VMKFSIGTPPL-LDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS---- 79
V+ IGTPP D+ ++DTGS L W+QC + K++ P+ P ++S LS
Sbjct: 67 VVSLPIGTPPQPTDL--VLDTGSQLSWIQCHD-KKIKKRLPPLPKPKTTSFDPSLSSSFS 123
Query: 80 --------CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
C+ SC +LC+Y+Y YAD +L +G L E+ TF S +
Sbjct: 124 LLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLS-TPP 182
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
V+ GC +T EN G++G+ R RLS +SQ +KFSYC VP T S+ T
Sbjct: 183 VILGCAQAST----ENR-GILGMNRGRLSF----ISQAKISKFSYC-VPSRTGSNPTGLF 232
Query: 192 YFGNGSEVSGGGVVSTSLVSKE-------DKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
Y G+ S V T L E D Y + ++ I + + P
Sbjct: 233 YLGDNPNSSKFKYV-TMLTFPESQSSPNLDPLAYTLPMKAIKIAG--KRLNVPPAAFKPD 289
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI-----KLTPYQDPRLGSQLCYKTPSM 299
A G ID+G+ T L + Y +++E+V + K Y D + +C+
Sbjct: 290 AGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADV---ADMCFDAGVT 346
Query: 300 AGIAPI---LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQS 353
A + ++ FD G ++ + + +GV C + + + I G Q
Sbjct: 347 AEVGRRIGGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQ 406
Query: 354 DLFIGYDFDSQMVSFKPTDCTK 375
++++ YD ++ V F +C++
Sbjct: 407 NMWVEYDLANKRVGFGGAECSR 428
>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 467
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 114/402 (28%), Positives = 164/402 (40%), Gaps = 73/402 (18%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP---CVQC-YKQVKP---IYNPASSSS 74
G Y + S GTPP + I+DTGSDL+W C C C + P I+ P SSSS
Sbjct: 88 GAYSIPLSFGTPPQ-TLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSS 146
Query: 75 YKELSCQSEQCHLLDTVSCSSQ------------QLCNYTYGYADSSLTKGVLATERITF 122
K L C + +C + S+ Q+C + S +T G++ +E +
Sbjct: 147 SKVLGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSGITGGIMLSETLDL 206
Query: 123 GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
N + GC +T G+ G GR SL SQ LG KFSYCL+
Sbjct: 207 PGKG--VPNFIVGCSVLST----SQPAGISGFGRGPPSLPSQ----LGLKKFSYCLLSRR 256
Query: 183 TDSSITSKMYFGNGSEVSG---GGVVSTSLVSKED-------KTYYFVTLEGISVGNLSN 232
D + S +G SG G+ T V YY++ L I+VG
Sbjct: 257 YDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVG---G 313
Query: 233 SSKLIPY-YNSSGAISKGNMFIDTGAPPTLLPKDFYN----RLEEQVRNAIKLTPYQDPR 287
IPY Y GA G ID+G T + + + E+QV++ K +
Sbjct: 314 KHVKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQS--KRATEVEGI 371
Query: 288 LGSQLCY-----KTPSMAGIAPILTAHFDGGA--KVPLIHTSTFIPPPVEGVFCFAMQPI 340
G + C+ TPS P LT F GGA ++PL + F+ + V C +
Sbjct: 372 TGLRPCFNISGLNTPSF----PELTLKFRGGAEMELPLANYVAFL--GGDDVVCLTIV-T 424
Query: 341 DGDVG---------IFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
DG G I GNF Q + ++ YD ++ + F+ C
Sbjct: 425 DGAAGKEFSGGPAIILGNFQQQNFYVEYDLRNERLGFRQQSC 466
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 97/367 (26%), Positives = 166/367 (45%), Gaps = 35/367 (9%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
G YV++ +GTPP L ++ ++DT +D +W+ C C C +N SSS+Y +SC
Sbjct: 28 GNYVVRAKLGTPPQL-MFMVLDTSNDAVWLPCSGCSGC-SNASTSFNTNSSSTYSTVSCS 85
Query: 82 SEQCHLLDTVSCSSQ----QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
+ QC ++C S +C++ Y S L + +T + + N FGC
Sbjct: 86 TAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTL--APDVIPNFSFGCI 143
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
++ +G + GL+GLGR +SL SQ S L + FSYCL F + YF
Sbjct: 144 NSASG-NSLPPQGLMGLGRGPMSLVSQTTS-LYSGVFSYCLPSFRS-------FYFSGSL 194
Query: 198 EVSGGG----VVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
++ G + T L+ + + Y+V L G+SVG++ + P Y + A S
Sbjct: 195 KLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSV--QVPVDPVYLTFDANSGAGTI 252
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGIAPILTAHFD 311
ID+G T + Y + ++ R + ++ + LG+ C+ + +AP +T H
Sbjct: 253 IDSGTVITRFAQPVYEAIRDEFRKQVNVSSFST--LGAFDTCFSADN-ENVAPKITLHMT 309
Query: 312 G-GAKVPLIHTSTFIPPPVEGVFCFAM----QPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
K+P+ +T I + C +M Q + + + N Q +L I +D + +
Sbjct: 310 SLDLKLPM--ENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRI 367
Query: 367 SFKPTDC 373
P C
Sbjct: 368 GIAPEPC 374
>gi|326513976|dbj|BAJ92138.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 342
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 80/257 (31%), Positives = 124/257 (48%), Gaps = 30/257 (11%)
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYF 193
FGCG + G GL+GL +SL +SQL +FSYCL PF TS M F
Sbjct: 96 FGCGALSAGSL-VGASGLMGLSPGTMSL----ISQLSVPRFSYCLTPFAERK--TSPMLF 148
Query: 194 GNGSEV----SGGGVVSTSLVSKE--DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
G +++ + G + +T+++ D YY+V L G+S+G +K + +S AI+
Sbjct: 149 GAMADLRKYNTTGPIQTTAILRNPAMDTFYYYVPLVGLSLG-----TKRLRVPAASLAIN 203
Query: 248 K---GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA- 303
G +D+G+ L ++ +++ V A+KL + +LC+ PS +A
Sbjct: 204 PDGTGGTIVDSGSTMAHLAGKAFDAVKKAVLEAVKLPVFNGTVEDYELCFAVPSGVAMAA 263
Query: 304 ---PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG----IFGNFAQSDLF 356
P L HFDGGA + L + F P G+ C A+ D+G I GN Q ++
Sbjct: 264 VKTPPLVLHFDGGAAMALPRDNYFQEP-RAGLMCLAVARSPEDLGAPISIIGNVQQQNMH 322
Query: 357 IGYDFDSQMVSFKPTDC 373
+ +D +Q SF PT C
Sbjct: 323 VLFDVHNQKFSFAPTKC 339
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 99/349 (28%), Positives = 146/349 (41%), Gaps = 40/349 (11%)
Query: 41 IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD------TVS 92
++DT SD+ WVQC PC QCY Q +Y+P+ S S + +C S C L + S
Sbjct: 185 LLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQLGPYANGCSSS 244
Query: 93 CSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENE-MGL 151
+S C Y Y D S T G L ++++ ++ FGC H G F+ ++ G+
Sbjct: 245 SNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTSQ-VPKFEFGCSHAARGSFSRSKTAGI 303
Query: 152 VGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVS 211
+ LGR SL SQ ++ G FSYC P + + K +F G + + +
Sbjct: 304 MALGRGVQSLVSQTSTKYG-QVFSYCFPP-----TASHKGFFVLGVPRRSSSRYAVTPML 357
Query: 212 KEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLE 271
K Y V LE I+V + P ++GA +D+ T LP Y L
Sbjct: 358 KT-PMLYQVRLEAIAVAG--QRLDVPPTVFAAGAA------LDSRTVITRLPPTAYQALR 408
Query: 272 EQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHFDG-GAKVPLIHTSTFIPP 327
R+ K++ Y+ QL CY ++ I P ++ FD GA V L P
Sbjct: 409 SAFRD--KMSMYRPAAANGQLDTCYDFTGVSSIMLPTISLVFDRTGAGVQL------DPS 460
Query: 328 PVEGVFCFAMQPIDGD---VGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
V C A GD GI G + + Y+ V F+ C
Sbjct: 461 GVLFGSCLAFASTAGDDRATGIIGFLQLQTIEVLYNVAGGSVGFRRGAC 509
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 108/379 (28%), Positives = 169/379 (44%), Gaps = 41/379 (10%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK----QVKPIY-NPASSSSYK 76
G Y K +GTPP + Y +DTGSD++WV C C C + Q++ Y +P SSS+
Sbjct: 75 GLYYTKVKLGTPPR-EFYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPRSSSTSS 133
Query: 77 ELSCQSEQCH---LLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGN------SN 126
+SC +C SCSSQ C YT+ Y D S T G ++ + F +
Sbjct: 134 LISCSDRRCRSGVQTSDASCSSQNNQCTYTFQYGDGSGTSGYYVSDLMHFAGIFEGTLTT 193
Query: 127 NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQI-LSQLGANKFSYCLVPFH 182
N +VVFGC TG ++E G+ G G+ +S+ SQ+ L + FS+CL
Sbjct: 194 NSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHCL---K 250
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
D+S + G E+ +V + LV + + +Y + L+ ISV + +++P +
Sbjct: 251 GDNSGGGVLVLG---EIVEPNIVYSPLV--QSQPHYNLNLQSISV-----NGQIVPIAPA 300
Query: 243 SGAISKGN-MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY--KTPSM 299
A S +D+G L ++ YN + + + G+Q CY T S
Sbjct: 301 VFATSNNRGTIVDSGTTLAYLAEEAYNPFVNAITALVPQSVRSVLSRGNQ-CYLITTSSN 359
Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGD-VGIFGNFAQSDL 355
I P ++ +F GGA + L + G V+C Q I G + I G+ D
Sbjct: 360 VDIFPQVSLNFAGGASLVLRPQDYLMQQNYIGEGSVWCIGFQRIPGQSITILGDLVLKDK 419
Query: 356 FIGYDFDSQMVSFKPTDCT 374
YD Q + + DC+
Sbjct: 420 IFVYDLAGQRIGWANYDCS 438
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 98/375 (26%), Positives = 159/375 (42%), Gaps = 47/375 (12%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
NG Y + IG+PP + IVDTGS + +V C CVQC P + P SS+Y+ + C
Sbjct: 86 NGYYTTRLWIGSPPQ-EFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKC 144
Query: 81 QSEQCHLLDTVSCSSQQL-CNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGH 138
++ +C + C Y YA+ S + GVLA + ++FG + VFGC
Sbjct: 145 NAD-------CNCDENGVQCTYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAVFGCET 197
Query: 139 NNTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNG 196
+G ++ + G++GLGR LS+ Q++ + + +N FS C M G G
Sbjct: 198 MESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCY----------GGMDVGGG 247
Query: 197 SEVSGGGVVSTSLV-SKEDKT---YYFVTLEGISVGNLSNSSKLIP--YYNSSGAISKGN 250
+ V GG +V S D + YY + L+ I V KL P + GAI
Sbjct: 248 AMVLGGISSPPGMVFSHSDPSRSPYYNIELKEIHVA--GKPLKLNPRTFDGKYGAI---- 301
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAI---KLTPYQDPRLGSQLCYK-----TPSMAGI 302
+D+G P+ Y ++ + I K DP +C+ + +
Sbjct: 302 --LDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNF-KDICFSGAGRDVTELPKV 358
Query: 303 APILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYD 360
P + F G K+ L F V G +C + D + G + + Y+
Sbjct: 359 FPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYN 418
Query: 361 FDSQMVSFKPTDCTK 375
++ + F T+C++
Sbjct: 419 RENSTIGFWKTNCSE 433
>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
Length = 534
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 96/392 (24%), Positives = 156/392 (39%), Gaps = 48/392 (12%)
Query: 22 GEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQCYKQV----------------- 63
G Y++ IGTP L Y +V DT +DL W+ C + K
Sbjct: 123 GMYLVSVRIGTPAL--PYNLVLDTATDLTWINCRLRRRKGKHYGRQSTGQTMSMGGEGAK 180
Query: 64 ---KPIYNPASSSSYKELSCQSEQCHLLDTVSC---SSQQLCNYTYGYADSSLTKGVLAT 117
K Y PA SSS++ + C ++C +L +C S + C+Y D ++T G+
Sbjct: 181 EASKNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSYFQKTQDGTVTIGIYGK 240
Query: 118 ERITFGNSNNFFDN---VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKF 174
E+ T S+ ++ GC G + G++ LG +S A + G +F
Sbjct: 241 EKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVHAAKRFG-QRF 299
Query: 175 SYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNS 233
S+CL+ ++ +S + FG V G G + T ++ D K Y + G+ VG
Sbjct: 300 SFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAQVTGVLVGG---- 355
Query: 234 SKL-IP--YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS 290
+L IP +++ + G + +DT T L + Y + + + P G
Sbjct: 356 ERLDIPDEVWDAERFVG-GGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVYELEGF 414
Query: 291 QLCYK--------TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP-ID 341
+ CYK P+ P T GGA++ S +P GV C A + +
Sbjct: 415 EYCYKWTFTGDGVDPAHNVTIPSFTVEMAGGARLEPEAKSVVMPEVEPGVACLAFRKLLR 474
Query: 342 GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
G GI GN + D + F+ C
Sbjct: 475 GGPGILGNVFMQEYIWEIDHGDGKIRFRKDKC 506
>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 389
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 100/384 (26%), Positives = 163/384 (42%), Gaps = 50/384 (13%)
Query: 7 FYPNNVVQSNVSTANGE----YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQ 62
FY + VV +S+ + + ++ + G+P + DTGS L W QC PC CY Q
Sbjct: 37 FYDSKVVSLPLSSPHSQRGLAFMAEIHFGSPQKKQFLHM-DTGSSLTWTQCFPCSDCYAQ 95
Query: 63 -VKPIYNPASSSSYKELSCQ-----SEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLA 116
+ P Y PA+S +Y++ C+ S D ++ ++C Y Y D + KG LA
Sbjct: 96 KIYPKYRPAASITYRDAMCEDSHPKSNPHFAFDPLT----RICTYQQHYLDETNIKGTLA 151
Query: 117 TERITFGNSNNFFDN---VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK 173
E IT + F V FGC + G + G++GLG + S I+ + G+ K
Sbjct: 152 QEMITVDTHDGGFKRVHGVYFGCNTLSDGSYFTG-TGILGLGVGKYS----IIGEFGS-K 205
Query: 174 FSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNS 233
FS+CL ++ + + G+G+ V G V + E T + LE I VG
Sbjct: 206 FSFCLGEI-SEPKASHNLILGDGANVQGHPTV---INITEGHTIF--QLESIIVGEEITL 259
Query: 234 SKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ-DPRLGSQL 292
+ +F+DTG+ + L + Y + + + I P +P L
Sbjct: 260 DDPV------------QVFVDTGSTLSHLSTNLYYKFVDAFDDLIGSRPLSYEP----TL 303
Query: 293 CYKTPSMAGIAPI-LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG--IFGN 349
CYK ++ + + + FD GA++ + + FI + C A+Q I G
Sbjct: 304 CYKADTIERLEKMDVGFKFDVGAELSVNIHNIFIQQGPPEIRCLAIQNNKESFSHVIIGV 363
Query: 350 FAQSDLFIGYDFDSQMVSFKPTDC 373
A +GYD ++ DC
Sbjct: 364 IAMQGYNVGYDLSAKTAYINKQDC 387
>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 442
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 95/372 (25%), Positives = 153/372 (41%), Gaps = 36/372 (9%)
Query: 25 VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKE---LSCQ 81
V+ IGTPP L ++DTGS L W+QC K+ P + S L C
Sbjct: 83 VVTLPIGTPPQLQQM-VLDTGSQLSWIQCHNKKTPQKKQPPTTSSFDPSLSSSFFVLPCN 141
Query: 82 SEQC--HLLD---TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
C + D C + LC+Y+Y YAD + +G L E+I F S ++ GC
Sbjct: 142 HPLCKPRVPDFSLPTDCDANSLCHYSYFYADGTYAEGNLVREKIAFSPSQT-TPPIILGC 200
Query: 137 GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
+++ G++G+ RL SQ KFSYC VP + Y GN
Sbjct: 201 ATQ-----SDDARGILGMNLGRLGFPSQA----KITKFSYC-VPTKQAQPASGSFYLGNN 250
Query: 197 SEVSGGGVVS------TSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
S V+ + + D Y + L+GIS+G + P A G
Sbjct: 251 PASSSFRYVNLLTFGQSQRMPNLDPLAYTLPLQGISIG--GKKLNIPPSVFKPNAGGSGQ 308
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG--SQLCYKTPS--MAGIAPIL 306
ID+G+ T L + YN + E++ + + G + +C+ + + + +
Sbjct: 309 TMIDSGSEFTYLVDEAYNVIREELVKKVGPKIKKGYMYGGVADICFDGDAIEIGRLVGDM 368
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFIGYDFDS 363
F+ G ++ +I + GV C M + + I GNF Q +L++ +D +
Sbjct: 369 VFEFEKGVQI-VIPKERVLATVDGGVHCLGMGRSERLGAGGNIIGNFHQQNLWVEFDLAN 427
Query: 364 QMVSFKPTDCTK 375
+ V F DC+K
Sbjct: 428 RRVGFGEADCSK 439
>gi|357128280|ref|XP_003565802.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 530
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 101/421 (23%), Positives = 166/421 (39%), Gaps = 66/421 (15%)
Query: 13 VQSNVSTAN-GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCL----------------- 54
VQS + N G Y++ IGTPP+ ++DT +DL W+ C
Sbjct: 95 VQSGMGVVNVGMYLVTVRIGTPPVA-FSMVLDTANDLTWLNCRLRRRKGKHHGRPSSTAT 153
Query: 55 ---------PCVQCYKQVKPIYNPASSSSYKELSC-QSEQCHLLDTVSCSS---QQLCNY 101
P + K Y P+ SSS++ C Q + C +C S + C+Y
Sbjct: 154 TTTMSAAMEPEMDAPVVKKTWYRPSLSSSWRRYRCSQKDACGSFPHNTCRSPNHNESCSY 213
Query: 102 TYGYADSSLTKGVLATERITF---------GNSNNFFDNVVFGCGHNNTGVFNENEMGLV 152
Y D ++T+G+ E T G + +V GC G + G++
Sbjct: 214 EQMYEDGTVTRGIYGRETATVPVSVSGAGEGQTAVLLPGLVLGCSTFEAGATVDAHDGVL 273
Query: 153 GLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSK 212
LG +S + ++ G +FS+CL+ + S + FG ++GG + T+LV
Sbjct: 274 TLGNHAVSFGTVAAARFGG-RFSFCLLHTMSGRDTFSYLTFGPNPALNGGAMEETNLVYS 332
Query: 213 EDKTYYFVTLEGISVGNLSNSSKL--IPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRL 270
D F G++ G + +L IP A+ G + +DTG T L + +
Sbjct: 333 PDGEPAFGA--GVT-GVFVDGERLAGIPPEVWDPAVLGGALNLDTGTSLTGLVEPAF--- 386
Query: 271 EEQVRNAI--KLTPYQDPRL-GSQLCYKTPSMAGIA------------PILTAHFDGGAK 315
E VR A+ +L Q + G +CYK AG P + F+GGA+
Sbjct: 387 -EAVRAAVDRRLGHLQKEDVAGFDICYKWAFGAGAGDEGVDPAHNVTVPKVAFEFEGGAR 445
Query: 316 VPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
+ + +P V GV C + + + GN + +D + + F+ CT
Sbjct: 446 LEPVARGIVLPEVVPGVACLGFRRREVGPSVLGNVHMQEHVWEFDHMAGKLRFRKDKCTN 505
Query: 376 Q 376
Sbjct: 506 H 506
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 99/370 (26%), Positives = 153/370 (41%), Gaps = 42/370 (11%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
G Y + IGTPP + IVDTGS + +V C C C P ++PA SSSYK L C
Sbjct: 32 KGYYTSRVKIGTPPH-EFSLIVDTGSTVTYVPCSSCTHCGNHQDPRFSPALSSSYKPLEC 90
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNF-FDNVVFGCGHN 139
SE T C + Y YA+ S + GVL + I F NS++ +VFGC
Sbjct: 91 GSE----CSTGFCDGSR--KYQRQYAEKSTSSGVLGKDVIGFSNSSDLGGQRLVFGCETA 144
Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQLG-ANKFSYCLVPFHTDSSITSKMYFGNGS 197
TG ++++ G++GLGR LS+ Q++ + + FS C M G G+
Sbjct: 145 ETGDLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCY----------GGMDEGGGA 194
Query: 198 EVSGGGVVSTSLV----SKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFI 253
+ GG +V YY + L+GI VG K + G + +
Sbjct: 195 MILGGFQPPKDMVFTASDPHRSPYYNLMLKGIRVGGSPLRLKPEVFDGKYGTV------L 248
Query: 254 DTGAPPTLLP----KDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY-----KTPSMAGIAP 304
D+G P + F + ++EQV ++K P D + +CY +++ P
Sbjct: 249 DSGTTYAYFPGAAFQAFKSAVKEQV-GSLKEVPGPDEKF-KDICYAGAGTNVSNLSQFFP 306
Query: 305 ILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDS 363
+ F G V L F + G +C + + G ++ + Y+
Sbjct: 307 SVDFVFGDGQSVTLSPENYLFRHTKISGAYCLGVFENGDPTTLLGGIIVRNMLVTYNRGK 366
Query: 364 QMVSFKPTDC 373
+ F T C
Sbjct: 367 ASIGFLKTKC 376
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 67/215 (31%), Positives = 105/215 (48%), Gaps = 24/215 (11%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
NG Y + IGTPP IVDTGS + +V C C QC + P + P SS+Y+ +SC
Sbjct: 87 NGYYTTRIWIGTPPQT-FALIVDTGSTVTYVPCSTCEQCGRHQDPKFEPELSSTYQPVSC 145
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
+D + ++ C Y YA+ S + GVL + I+FGN + +FGC +
Sbjct: 146 N------IDCTCDNERKQCVYERQYAEMSSSSGVLGEDIISFGNQSELVPQRAIFGCENQ 199
Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
TG ++++ G++GLGR LS+ Q++ + + ++ FS C M G G+
Sbjct: 200 ETGDLYSQRADGIMGLGRGDLSIVDQLVEKGVISDSFSLCY----------GGMDIGGGA 249
Query: 198 EVSGGGVVSTSLVSKED----KTYYFVTLEGISVG 228
+ GG + +V E YY + L+ I V
Sbjct: 250 MILGGISPPSGMVFAESDPVRSQYYNIDLKAIHVA 284
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 101/378 (26%), Positives = 167/378 (44%), Gaps = 44/378 (11%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYKEL 78
Y K IGTPP + VDTGSD++WV C+ C +C + +Y+P SSS +
Sbjct: 87 YYTKIEIGTPPK-PFHVQVDTGSDILWVNCVSCDKCPTKSGLGIDLALYDPKGSSSGSAV 145
Query: 79 SCQSEQCHLL-----DTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNNFF 129
SC ++ C C++ + C Y Y D S T G ++ + + GN+
Sbjct: 146 SCDNKFCAATYGSGEKLPGCTAGKPCEYRAEYGDGSSTAGSFVSDSLQYNQLSGNAQTRH 205
Query: 130 --DNVVFGCGHNNTGVF---NENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHT 183
NV+FGCG G N+ G++G G++ S SQ+ S K FS+CL
Sbjct: 206 AKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCL----- 260
Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
+I F G EV V ST L+ + ++Y V L+ I V N+ +L P+ +
Sbjct: 261 -DTIKGGGIFAIG-EVVQPKVKSTPLL--PNMSHYNVNLQSIDVAG--NALQLPPHIFET 314
Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK-TPSMAGI 302
K ID+G T LP+ Y + V + ++ + LC++ + S+
Sbjct: 315 S--EKRGTIIDSGTTLTYLPELVYKDILAAVFQKHQDITFR--TIQGFLCFEYSESVDDG 370
Query: 303 APILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-DVGIFGNFAQSDLF 356
P +T HF+ + + F + ++C QP D D+ + G+ S+
Sbjct: 371 FPKITFHFEDDLGLNVYPHDYFFQNG-DNLYCLGFQNGGFQPKDAKDMVLLGDLVLSNKV 429
Query: 357 IGYDFDSQMVSFKPTDCT 374
+ YD + Q++ + +C+
Sbjct: 430 VVYDLEKQVIGWTDYNCS 447
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 96/373 (25%), Positives = 160/373 (42%), Gaps = 43/373 (11%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
NG Y + IGTPP IVDTGS + +V C C C + P + P S +Y+ + C
Sbjct: 86 NGYYTTRLWIGTPPQ-RFALIVDTGSTVTYVPCSTCEHCGRHQDPKFQPDLSETYQPVKC 144
Query: 81 QSEQCHLLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGH 138
+ +C C Y YA+ S + GVL + ++FGN + VFGC +
Sbjct: 145 TPD-------CNCDGDTNQCMYDRQYAEMSSSSGVLGEDVVSFGNLSELAPQRAVFGCEN 197
Query: 139 NNTG-VFNENEMGLVGLGRTRLSLASQIL-SQLGANKFSYCLVPFHTDSSITSKMYFGNG 196
+ TG ++++ G++GLGR LS+ Q++ ++ ++ FS C M G G
Sbjct: 198 DETGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCY----------GGMDVGGG 247
Query: 197 SEVSGGGVVSTSLV---SKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
+ + GG +V S D++ YY + L+ + V KL N K
Sbjct: 248 AMILGGISPPEDMVFTHSDPDRSPYYNINLKEMHVA----GKKL--QLNPKVFDGKHGTV 301
Query: 253 IDTGAPPTLLPKD---FYNRLEEQVRNAIKLTPYQDPRLGSQLCY-----KTPSMAGIAP 304
+D+G LP+ + R + RN++K DP +C+ +A P
Sbjct: 302 LDSGTTYAYLPETAFLAFKRAIMKERNSLKQINGPDPNY-KDICFTGAGIDVSQLAKSFP 360
Query: 305 ILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYDFD 362
++ F+ G K+ L F V G +C + D + G + + YD +
Sbjct: 361 VVDMVFENGHKLSLSPENYLFRHSKVRGAYCLGVFSNGRDPTTLLGGIFVRNTLVMYDRE 420
Query: 363 SQMVSFKPTDCTK 375
+ + F T+C++
Sbjct: 421 NSKIGFWKTNCSE 433
>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
Length = 389
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 80/277 (28%), Positives = 124/277 (44%), Gaps = 22/277 (7%)
Query: 91 VSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMG 150
V S+ +CNY Y D S T+G L E++ FG + +FGCG NN G+F G
Sbjct: 68 VCGSAAPICNYAINYGDGSFTRGELGHEKLKFGTI--LVKDFIFGCGRNNKGLFG-GVSG 124
Query: 151 LVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLV 210
L+GLGR+ LSL SQ G FSYCL P S + GN S +S + +
Sbjct: 125 LMGLGRSDLSLISQTSGIFGG-VFSYCL-PSTERKGSGSLILGGNSSVYRNSSPISYAKM 182
Query: 211 SKEDKTY--YFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYN 268
+ + Y YF+ L GIS+G ++ + ++ + +D+G T LP Y
Sbjct: 183 IENPQLYNFYFINLTGISIGGVALQAP---------SVGPSRILVDSGTVITRLPPTIYK 233
Query: 269 RLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTST--FI 325
L+ + P C+ + + P + HF+G A++ + T F+
Sbjct: 234 ALKAEFLKQFTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFV 293
Query: 326 PPPVEGVFCFAMQPID--GDVGIFGNFAQSDLFIGYD 360
V C A+ ++ +V I GN+ Q +L + YD
Sbjct: 294 KSDASQV-CLALASLEYQDEVAILGNYQQKNLRVIYD 329
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 99/360 (27%), Positives = 158/360 (43%), Gaps = 28/360 (7%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
YV+ +GTP I I DTGS WV C C C+ + S++ K +SC +
Sbjct: 82 YVISVGLGTPAKTQIVEI-DTGSSTSWVFC-ECDGCHTNPRTFLQSRSTTCAK-VSCGTS 138
Query: 84 QCHLLDTV-SCSSQQ---LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
C L + C + C + Y D S + G+L + +TF + + FGC +
Sbjct: 139 MCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK-IPSFTFGCNLD 197
Query: 140 NTGVFNE--NEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM--YFGN 195
+ G NE N GL+G+G +S+ Q S + FSYCL ++ SK YF
Sbjct: 198 SFGA-NEFGNVDGLLGMGAGPMSVLKQ--SSPRFDGFSYCLPLQKSERGFFSKTTGYFSL 254
Query: 196 GSEVSGGGVVSTSLVSKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFID 254
G + V T +V++ T +FV L ISV + +L S S+ + D
Sbjct: 255 GKVATRTDVRYTKMVARRKNTELFFVDLAAISV----DGERL---GLSPSIFSRKGVVFD 307
Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM-AGIAPILTAHFDGG 313
+G+ + +P + L +++R + L + CY S+ G P ++ HFD G
Sbjct: 308 SGSELSYIPDRALSVLSQRIRELL-LRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDG 366
Query: 314 AKVPLIHTSTFIPPPV--EGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
A+ L F+ V + V+C A P + V I G+ Q+ + YD Q++ P+
Sbjct: 367 ARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIGSLMQTSKEVVYDLKRQLIGIGPS 425
>gi|110738505|dbj|BAF01178.1| hypothetical protein [Arabidopsis thaliana]
Length = 284
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 68/209 (32%), Positives = 104/209 (49%), Gaps = 23/209 (11%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
NG Y + IGTPP + IVD+GS + +V C C QC K P + P SS+Y+ + C
Sbjct: 90 NGYYTTRLWIGTPPQM-FALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEMSSTYQPVKC 148
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
+ C+ D ++ C Y YA+ S +KGVL + I+FGN + VFGC
Sbjct: 149 NMD-CNCDD-----DREQCVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAVFGCETV 202
Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGS 197
TG ++++ G++GLG+ LSL Q++ + L +N F C M G GS
Sbjct: 203 ETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCY----------GGMDVGGGS 252
Query: 198 EVSGGGVVSTSLV---SKEDKTYYFVTLE 223
+ GG + +V S D+++ T+
Sbjct: 253 MILGGFDYPSDMVFTDSDPDRSFGMATVH 281
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 103/397 (25%), Positives = 170/397 (42%), Gaps = 79/397 (19%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
G Y K IGTP Y VDTGSD+MWV C+ C QC ++ +YN S S K
Sbjct: 78 GLYYAKIGIGTPAK-SYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGK 136
Query: 77 ELSCQSEQCHLLD---TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGN------SNN 127
+SC + C+ + C + C Y Y D S T G + + + + +
Sbjct: 137 LVSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQT 196
Query: 128 FFDNVVFGCGHNNTGVF---NENEM-GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFH 182
+V+FGCG +G NE + G++G G+ S+ SQ+ S K F++CL
Sbjct: 197 ANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCL---- 252
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGN--LS 231
+G +GGG+ + V + ++ +Y V + + VG L+
Sbjct: 253 ------------DGR--NGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLN 298
Query: 232 NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ 291
+ L + GAI ID+G LP+ Y L +++ + Q+P L
Sbjct: 299 IPADLFQPGDRKGAI------IDSGTTLAYLPEIIYEPLVKKITS-------QEPALKVH 345
Query: 292 LC---YKTPSMAGIA----PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCF-----AMQ 338
+ YK +G P +T HF+ + + H F P EG++C AMQ
Sbjct: 346 IVDKDYKCFQYSGRVDEGFPNVTFHFENSVFLRVYPHDYLF---PYEGMWCIGWQNSAMQ 402
Query: 339 PID-GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
D ++ + G+ S+ + YD ++Q++ + +C+
Sbjct: 403 SRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNCS 439
>gi|297744129|emb|CBI37099.3| unnamed protein product [Vitis vinifera]
Length = 299
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 54/140 (38%), Positives = 74/140 (52%), Gaps = 27/140 (19%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIY-GIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
V++ V NGE++M +IGTP + Y I+DTGSDL+W QC PC C+ Q PI++P
Sbjct: 86 VEAPVHAGNGEFLMNLAIGTPA--ETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEK 143
Query: 72 SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
SSS+ +L C S+ H S T+GVLATE TFG+++
Sbjct: 144 SSSFSKLPCSSDLYH----------------------SSTQGVLATETFTFGDAS--VSK 179
Query: 132 VVFGCGHNNTGVFNENEMGL 151
+ FGCG +N G GL
Sbjct: 180 IGFGCGEDNRGRAYSQGAGL 199
>gi|255576176|ref|XP_002528982.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531572|gb|EEF33401.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 542
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 102/381 (26%), Positives = 164/381 (43%), Gaps = 68/381 (17%)
Query: 30 IGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-------YKQVKP---IYNPASSSSYKELS 79
IGTP + + + D GSDL+WV C C+QC Y + Y+P+ SS+ K LS
Sbjct: 119 IGTPHVSFLVAL-DAGSDLLWVPC-DCLQCAPLSASYYSSLDRDLNEYSPSHSSTSKHLS 176
Query: 80 CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN-------- 131
C + C L + S +Q C Y+ Y + + L E I SN DN
Sbjct: 177 CSHQLCELGPNCN-SPKQPCPYSMDYYTENTSSSGLLVEDILHLASNG--DNALSYSVRA 233
Query: 132 -VVFGCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQLG--ANKFSYCLVPFHTDSS 186
VV GCG +G + + GL+GLG +S+ S L++ G N FS C D
Sbjct: 234 PVVIGCGMKQSGGYLDGVAPDGLMGLGLAEISVPS-FLAKAGLIRNSFSMCF-----DED 287
Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKE-DKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
+ +++FG+ + ST ++ + + T Y V +EG VG+ S
Sbjct: 288 DSGRIFFGDQGPTTQQ---STPFLTLDGNYTTYVVGVEGFCVGS------------SCLK 332
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT--------P 297
+ +DTG T LP Y R+ E+ + T + CYK+ P
Sbjct: 333 QTSFRALVDTGTSFTFLPNGVYERITEEFDRQVNATISSFNGYPWKYCYKSSSNHLTKVP 392
Query: 298 SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV--FCFAMQPIDGDVGIFGNFAQSDL 355
S+ I P+ + +IH F+ ++G+ FC A+QP +GD+G G +
Sbjct: 393 SVKLIFPLNNSF--------VIHNPVFMIYGIQGITGFCLAIQPTEGDIGTIGQNFMAGY 444
Query: 356 FIGYDFDSQMVSFKPTDCTKQ 376
+ +D ++ + + + C +
Sbjct: 445 RVVFDRENMKLGWSHSSCEDR 465
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 102/397 (25%), Positives = 168/397 (42%), Gaps = 79/397 (19%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
G Y K IGTP Y VDTGSD+MWV C+ C QC ++ +YN S S K
Sbjct: 78 GLYYAKIGIGTPAK-SYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGK 136
Query: 77 ELSCQSEQCHLLD---TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGN------SNN 127
+SC + C+ + C + C Y Y D S T G + + + + +
Sbjct: 137 LVSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQT 196
Query: 128 FFDNVVFGCGHNNTGVF---NENEM-GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFH 182
+V+FGCG +G NE + G++G G+ S+ SQ+ S K F++CL
Sbjct: 197 ANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCL---- 252
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGN--LS 231
+GGG+ + V + ++ +Y V + + VG L+
Sbjct: 253 --------------DGRNGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLT 298
Query: 232 NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ 291
+ L + GAI ID+G LP+ Y L +++ + Q+P L
Sbjct: 299 IPADLFQPGDRKGAI------IDSGTTLAYLPEIIYEPLVKKITS-------QEPALKVH 345
Query: 292 LC---YKTPSMAGIA----PILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCF-----AMQ 338
+ YK +G P +T HF+ + + H F P EG++C AMQ
Sbjct: 346 IVDKDYKCFQYSGRVDEGFPNVTFHFENSVFLRVYPHDYLF---PHEGMWCIGWQNSAMQ 402
Query: 339 PID-GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
D ++ + G+ S+ + YD ++Q++ + +C+
Sbjct: 403 SRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNCS 439
>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
Length = 389
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 100/372 (26%), Positives = 160/372 (43%), Gaps = 38/372 (10%)
Query: 26 MKFSIGTPPL-LDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQ 84
M S+GTPP L+ VD+G WV C ++ P S+S+ +L C S
Sbjct: 1 MDLSLGTPPQPLNFTLAVDSG--FSWVACSSSCAINCTTASLFQPGLSTSHTKLPCGSPS 58
Query: 85 CHLLDTV--SCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN--FFDNVVFGCGHNN 140
C V SC C+Y Y + + G L ++ T + N N+ GCG ++
Sbjct: 59 CSAFSAVSTSCGPSSSCSYNTSYGTNFSSAGDLVSDIATMDSVRNRKVAANLSLGCGRDS 118
Query: 141 TGVFN-ENEMGLVGLGRTRLSLASQILSQLG-ANKFSYCLVPFHTDSSITSKMYFGN--- 195
G+ + G VG + +S Q LS LG +KF YCL P T K+ GN
Sbjct: 119 GGLLELLDTSGFVGFDKGNVSFMGQ-LSALGYRSKFIYCL-PSDT---FRGKLVIGNYKL 173
Query: 196 -GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKL-IPYYNSSGAISKGNMFI 253
+ +S + + + + YF+ L IS+ N ++ I + S+G G I
Sbjct: 174 RNASISSSMAYTPMITNPQAAELYFINLSTISIDK--NKFQVPIQGFLSNGT---GGTVI 228
Query: 254 DTGAPPTLLPKDFYNRLEEQVR----NAIKLTPYQDPRLGSQLCYKTPSMAGIAP--ILT 307
DT + L DFY +L + ++ N ++++ LG +LCY + + P LT
Sbjct: 229 DTTTFLSYLTSDFYTQLVQAIKNYTTNLVEVSSSVADALGVELCYNISANSDFPPPATLT 288
Query: 308 AHFDGGAKVPLIHTSTFI---PPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFIGYDF 361
HF GGA V + ++ F+ V C A+ + + ++ + G + Q DL + YD
Sbjct: 289 YHFLGGAGVEV--STWFLLDDSDSVNNTICMAIGRSESVGPNLNVIGTYQQLDLTVEYDL 346
Query: 362 DSQMVSFKPTDC 373
+ F C
Sbjct: 347 EQMRYGFGAQGC 358
>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
Length = 537
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 96/396 (24%), Positives = 156/396 (39%), Gaps = 52/396 (13%)
Query: 22 GEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQCLPCVQCYKQV----------------- 63
G Y++ IGTP L Y +V DT +DL W+ C + K
Sbjct: 122 GMYLVSVRIGTPAL--PYNLVLDTATDLTWINCRLRRRKGKHYGRQSMGQTMSVGGEGAT 179
Query: 64 -------KPIYNPASSSSYKELSCQSEQCHLLDTVSC---SSQQLCNYTYGYADSSLTKG 113
K Y PA SSS++ + C ++C +L +C S + C+Y D ++T G
Sbjct: 180 AAKKEASKNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSYFQKTQDGTVTIG 239
Query: 114 VLATERITFGNSNNFFDN---VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLG 170
+ E+ T S+ ++ GC G + G++ LG +S A + G
Sbjct: 240 IYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVHAAKRFG 299
Query: 171 ANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGN 229
+FS+CL+ ++ +S + FG V G G + T ++ D K Y + G+ VG
Sbjct: 300 -QRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAKVTGVLVGG 358
Query: 230 LSNSSKL-IP--YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP 286
+L IP +++ + G + +DT T L + Y + + + P
Sbjct: 359 ----ERLDIPDEVWDAERFVG-GGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVYE 413
Query: 287 RLGSQLCYK--------TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ 338
G + CYK P+ P T GGA++ S +P GV C A +
Sbjct: 414 LEGFEYCYKWTFTGDGVXPAHNVTIPSFTVEMAGGARLEPEAKSVVMPEVEPGVACLAFR 473
Query: 339 P-IDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ G GI GN + D + F+ C
Sbjct: 474 KLLRGGPGILGNVFMQEYIWEIDHGDGKIRFRKDKC 509
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 106/391 (27%), Positives = 168/391 (42%), Gaps = 65/391 (16%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK----QVK-PIYNPASSSSYK 76
G Y K +GTPP + +DTGSD++WV C C C K Q++ ++P SSS
Sbjct: 82 GLYYTKVKLGTPPR-EFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSAS 140
Query: 77 ELSCQSEQCH--LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG---------NS 125
+SC +C+ CS LC+Y++ Y D S T G ++ ++F NS
Sbjct: 141 LVSCSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAINS 200
Query: 126 NNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPF 181
+ F VFGC + TG G+ GLG+ LS+ SQ+ Q L FS+CL
Sbjct: 201 SAPF---VFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCL--- 254
Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTY---------YFVTLEGISVGNLSN 232
+ SGGG++ + + D Y Y V L+ I+V
Sbjct: 255 --------------KGDKSGGGIMVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAV----- 295
Query: 233 SSKLIPYYNSSGAISKGN-MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP-RLGS 290
+ +++P S I+ G+ IDTG LP + Y+ + + NA+ + Y P S
Sbjct: 296 NGQILPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAIANAV--SQYGRPITYES 353
Query: 291 QLCYK-TPSMAGIAPILTAHFDGGAKV---PLIHTSTFIPPPVEGVFCFAMQPID-GDVG 345
C++ T + P ++ F GGA + P + F ++C Q + +
Sbjct: 354 YQCFEITAGDVDVFPEVSLSFAGGASMVLRPHAYLQIF-SSSGSSIWCIGFQRMSHRRIT 412
Query: 346 IFGNFAQSDLFIGYDFDSQMVSFKPTDCTKQ 376
I G+ D + YD Q + + DC+ +
Sbjct: 413 ILGDLVLKDKVVVYDLVRQRIGWAEYDCSLE 443
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 99/360 (27%), Positives = 158/360 (43%), Gaps = 28/360 (7%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
YV+ +GTP I I DTGS WV C C C+ + S++ K +SC +
Sbjct: 82 YVISVGLGTPAKTQIVEI-DTGSSTSWVFC-ECDGCHTNPRTFLQSRSTTCAK-VSCGTS 138
Query: 84 QCHLLDTV-SCSSQQ---LCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHN 139
C L + C + C + Y D S + G+L + +TF + FGC +
Sbjct: 139 MCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQK-IPGFSFGCNMD 197
Query: 140 NTGVFNE--NEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM--YFGN 195
+ G NE N GL+G+G +S+ Q S + FSYCL ++ SK YF
Sbjct: 198 SFGA-NEFGNVDGLLGMGAGPMSVLKQ--SSPTFDCFSYCLPLQKSERGFFSKTTGYFSL 254
Query: 196 GSEVSGGGVVSTSLVSKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFID 254
G + V T +V+++ T +FV L ISV + +L S S+ + D
Sbjct: 255 GKVATRTDVRYTKMVARKKNTELFFVDLTAISV----DGERL---GLSPSVFSRKGVVFD 307
Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSM-AGIAPILTAHFDGG 313
+G+ + +P + L +++R + L + CY S+ G P ++ HFD G
Sbjct: 308 SGSELSYIPDRALSVLSQRIRELL-LKRGAAEEESERNCYDMRSVDEGDMPAISLHFDDG 366
Query: 314 AKVPLIHTSTFIPPPV--EGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
A+ L F+ V + V+C A P + V I G+ Q+ + YD Q++ P+
Sbjct: 367 ARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIGSLMQTSKEVVYDLKRQLIGIGPS 425
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 102/390 (26%), Positives = 173/390 (44%), Gaps = 52/390 (13%)
Query: 15 SNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNP 69
S ++T G Y + IGTP Y VDTGSD++WV C+ C C ++ +Y+P
Sbjct: 81 SGLATETGLYFTRIGIGTPAKR-YYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDP 139
Query: 70 ASSSSYKELSCQSEQC---HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF---- 122
S S + ++C + C + SC+S C Y+ Y D S T G T+ + +
Sbjct: 140 RGSQSGELVTCDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVS 199
Query: 123 --GNSNNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSY 176
G + +V FGCG G + + G++G G++ S+ SQ+ + K F++
Sbjct: 200 GDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAH 259
Query: 177 CLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSS 234
CL ++ F G+ V V +T LV D +Y V L+GI VG L +
Sbjct: 260 CL------DTVNGGGIFAIGNVVQ-PKVKTTPLV--PDMPHYNVILKGIDVGGTALGLPT 310
Query: 235 KLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQDPRLGSQ 291
+ NS G I ID+G +P+ Y L V I + QD
Sbjct: 311 NIFDSGNSKGTI------IDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS---- 360
Query: 292 LCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-DV 344
C++ + S+ P +T HF+G + ++ ++ + ++C +Q DG D+
Sbjct: 361 -CFQYSGSVDDGFPEVTFHFEGDVSL-IVSPHDYLFQNGKNLYCMGFQNGGVQTKDGKDM 418
Query: 345 GIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ G+ S+ + YD ++Q + + +C+
Sbjct: 419 VLLGDLVLSNKLVLYDLENQAIGWADYNCS 448
>gi|297838267|ref|XP_002887015.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297332856|gb|EFH63274.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 324
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 72/247 (29%), Positives = 112/247 (45%), Gaps = 33/247 (13%)
Query: 25 VMKFSIGTPPLLDIYGIVDTGSDLMWVQC----LPCVQCYKQVKPIYNPASSSSYKELSC 80
++ IGTPP ++DTGS L W+QC LP + K ++P+ SSS+ L C
Sbjct: 75 IISLPIGTPPQAQQM-VLDTGSQLSWIQCHRKKLP-----PKPKTSFDPSLSSSFSTLPC 128
Query: 81 QSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
C SC S +LC+Y+Y YAD + +G L E+ITF N+ ++ G
Sbjct: 129 SHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNT-EITPPLILG 187
Query: 136 CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGN 195
C ++ ++ G++G+ R RLS +SQ KFSYC+ P T F
Sbjct: 188 CATESS-----DDRGILGMNRGRLSF----VSQAKITKFSYCIPPKSNRPGFTPTGSFYL 238
Query: 196 GSEVSGGGVVSTSLVSKEDKTYYFVTLE--------GISVGNLSNSSKLIPYYNSSGAIS 247
G + G SL++ ++ V E GI + SS L N G +
Sbjct: 239 GDNPNSKGFKYVSLLTFPERVEILVPKERVLVNVGDGIHCVGIGRSSMLGAASNIIGNVH 298
Query: 248 KGNMFID 254
+ N++++
Sbjct: 299 QQNLWVE 305
>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 444
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 95/375 (25%), Positives = 159/375 (42%), Gaps = 39/375 (10%)
Query: 25 VMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI--YNPASSSSYKELSCQS 82
++ IGTP ++DTGS L W+QC P P ++P+ SSS+ +L C
Sbjct: 82 ILSLPIGTPSQSQEL-VLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSH 140
Query: 83 EQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
C SC S +LC+Y+Y YAD + +G L E+ TF NS ++ GC
Sbjct: 141 PLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQT-TPPLILGCA 199
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDS---SITSKMYFG 194
+T V G++G+ RLS +SQ +KFSYC +P ++ + T Y G
Sbjct: 200 KESTDV-----KGILGMNLGRLSF----ISQAKISKFSYC-IPTRSNRPGLASTGSFYLG 249
Query: 195 NGSEVSGGGVVS------TSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
G VS + + D Y V L GI +G + + +G
Sbjct: 250 ENPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDAGG--S 307
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS--QLCYKTPSMAGIAPI- 305
G +D+G+ T L Y++++E++ + + GS +C+ I +
Sbjct: 308 GQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHQMVIGRLI 367
Query: 306 --LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQSDLFIGYD 360
L F G ++ L+ + G+ C + + I GN Q +L++ +D
Sbjct: 368 GDLVFEFGRGVEI-LVEKQRLLVNVGGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFD 426
Query: 361 FDSQMVSFKPTDCTK 375
++ V F +C++
Sbjct: 427 VANRRVGFSKAECSR 441
>gi|296082634|emb|CBI21639.3| unnamed protein product [Vitis vinifera]
Length = 278
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 51/131 (38%), Positives = 72/131 (54%), Gaps = 27/131 (20%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIY-GIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS 71
V++ V NGE++MK +IGTP + Y I+DTGSDL+W QC PC C+ Q PI++P
Sbjct: 79 VEAPVHAGNGEFLMKLAIGTPA--ETYSAIMDTGSDLIWTQCKPCKDCFDQPTPIFDPKK 136
Query: 72 SSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
SSS+ +L C S+ + S T+GVLATE FG+++
Sbjct: 137 SSSFSKLPCSSDLYY----------------------SSTQGVLATETFAFGDAS--VSK 172
Query: 132 VVFGCGHNNTG 142
+ FGCG +N G
Sbjct: 173 IGFGCGEDNDG 183
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 90/368 (24%), Positives = 157/368 (42%), Gaps = 33/368 (8%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
NG Y + IGTPP IVDTGS + +V C C C P + P +S +Y+ + C
Sbjct: 90 NGYYTTRLWIGTPPQ-RFALIVDTGSTVTYVPCSTCKHCGSHQDPKFRPEASETYQPVKC 148
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
+ QC+ D ++ C Y YA+ S + GVL + ++FGN + +FGC ++
Sbjct: 149 -TWQCNCDD-----DRKQCTYERRYAEMSTSSGVLGEDVVSFGNQSELSPQRAIFGCEND 202
Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILS-QLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
TG ++N+ G++GLGR LS+ Q++ ++ ++ FS C + M G S
Sbjct: 203 ETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDAFSLC---YGGMGVGGGAMVLGGIS 259
Query: 198 EVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
+ + V YY + L+ I V +L + N K +D+G
Sbjct: 260 PPADMVFTHSDPVR---SPYYNIDLKEIHVAG----KRL--HLNPKVFDGKHGTVLDSGT 310
Query: 258 PPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCY-----KTPSMAGIAPILTAH 309
LP+ + + + +++K DP + +C+ ++ P++
Sbjct: 311 TYAYLPESAFLAFKHAIMKETHSLKRISGPDPHY-NDICFSGAEINVSQLSKSFPVVEMV 369
Query: 310 FDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYDFDSQMVS 367
F G K+ L F V G +C + D + G + + YD + +
Sbjct: 370 FGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHSKIG 429
Query: 368 FKPTDCTK 375
F T+C++
Sbjct: 430 FWKTNCSE 437
>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
Length = 519
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 105/443 (23%), Positives = 167/443 (37%), Gaps = 99/443 (22%)
Query: 13 VQSNVSTANGEYVMKFSIGTP--PLLDIYGIVDTGSDLMWVQCL-----PCVQCYKQVKP 65
+ S T G+Y ++F +GTP P L + DTGSDL WV+C Y P
Sbjct: 96 LSSGAYTGTGQYFVRFRVGTPARPFLLV---ADTGSDLTWVKCHRHDHDAPAPGYGYAAP 152
Query: 66 ----------------------IYNPASSSSYKELSCQSEQCHL---LDTVSCSSQ-QLC 99
++ P S ++ + C S+ C +C + C
Sbjct: 153 ASNDSSTSSLSAAAASSSSHARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPC 212
Query: 100 NYTYGYADSSLTKGVLATERITFGNSNN---------FFDNVVFGCGHNNTGVFNENEMG 150
Y Y Y D S +G + T+ T S VV GC + TG G
Sbjct: 213 AYDYRYKDGSAARGTVGTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSFLASDG 272
Query: 151 LVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTS-- 208
++ LG + +S AS+ ++ G +FSYCLV + TS + FG VS T+
Sbjct: 273 VLSLGYSNISFASRAAARFG-GRFSYCLVDHLAPRNATSYLTFGPNPAVSSSPPSKTACA 331
Query: 209 --------------------LVSKEDKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAIS 247
L+ + +Y VT+ GISV G L +L+ + + G
Sbjct: 332 GGGSPAAAPPGPGGARQTPLLLDHRMRPFYAVTVNGISVDGELLRIPRLV-WDVAKG--- 387
Query: 248 KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTP------------YQDPRLGSQLCYK 295
G +D+G T+L Y + + + P + P G L
Sbjct: 388 -GGAILDSGTSLTVLVSPAYRAVVAALNKKLAGLPRVTMDPFDYCYNWTSPSTGEDLTVA 446
Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGD---VGIFGNFAQ 352
P +A HF G A++ +++ GV C +Q +G+ V + GN Q
Sbjct: 447 MPELA-------VHFAGSARL-QPPAKSYVIDAAPGVKCIGLQ--EGEWPGVSVIGNILQ 496
Query: 353 SDLFIGYDFDSQMVSFKPTDCTK 375
+ +D ++ + FK + CT+
Sbjct: 497 QEHLWEFDLKNRRLRFKRSRCTQ 519
>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
Length = 468
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 117/410 (28%), Positives = 169/410 (41%), Gaps = 74/410 (18%)
Query: 18 STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP---CVQC------YKQVKPIYN 68
S + G Y M S+GTP + I+DTGS L+W C C C ++ P +
Sbjct: 78 SRSYGGYSMSLSLGTPSQ-TVKLIMDTGSSLVWFPCTSRYVCASCNFPNTDITKI-PKFM 135
Query: 69 PASSSSYKELSCQSEQCHLLDTVSCSSQ-QLCN------------YTYGYADSSLTKGVL 115
P SSS K + C++ +C + S S+ CN Y Y S T G+L
Sbjct: 136 PRLSSSSKLIGCKNPKCAWVFGSSVQSKCHNCNPQAQNCTQACPPYIIQYGLGS-TAGLL 194
Query: 116 ATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFS 175
+E I F N + + GC +T G+ G GR++ SL Q LG KFS
Sbjct: 195 LSETINFPNKT--ISDFLAGCSLLST----RQPEGIAGFGRSQESLPLQ----LGLKKFS 244
Query: 176 YCLVPFH-TDSSITSKMYFGNGSEVSGGGVVSTS-------LVSKED---KTYYFVTLEG 224
YCLV DS ++S + G S S L S+ + + YY+V L
Sbjct: 245 YCLVSRRFDDSPVSSDLILDMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRK 304
Query: 225 ISVGNLSNSSKLIPY-YNSSGAISKGNMFIDTGAPPTLLPKDFYNRL----EEQVRNAIK 279
I VG + +PY + G+ G +D+G+ T + + L E+Q+ N
Sbjct: 305 IIVGK---THVKVPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMANYTV 361
Query: 280 LTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAK--VPLIHTSTFIPPPVEGVFCFA 336
T Q G + C+ + P LT F GGAK +PL + F+ GV C
Sbjct: 362 ATNVQK-LTGLRPCFDISGEKSVVIPDLTFQFKGGAKMQLPLSNYFAFVD---MGVVCLT 417
Query: 337 M-----QPIDGDVG--------IFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ + GD G I GNF Q + +I YD ++ FK C
Sbjct: 418 IVSDNAAALGGDGGVRSSGPAIILGNFQQQNFYIEYDLENDRFGFKEQSC 467
>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 430
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 101/382 (26%), Positives = 165/382 (43%), Gaps = 51/382 (13%)
Query: 25 VMKFSIGTPPL-LDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELS---- 79
V+ IGTPP D+ ++DTGS L W+QC + K++ P+ P ++S LS
Sbjct: 67 VVSLPIGTPPQPTDL--VLDTGSQLSWIQCHD-KKVKKRLPPLPKPKTASFDPSLSSSFS 123
Query: 80 --------CQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN 131
C+ SC +LC+Y+Y YAD +L +G L E+ TF S +
Sbjct: 124 LLPCNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLS-TPP 182
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
V+ GC +T EN G++G+ RLS +SQ +KFSYC VP T S+ T
Sbjct: 183 VILGCAQAST----ENR-GILGMNHGRLSF----ISQAKISKFSYC-VPSRTGSNPTGLF 232
Query: 192 YFGNGSEVSGGGVVSTSLVSKE-------DKTYYFVTLEGISVGNLSNSSKLIPYYNSSG 244
Y G+ S V T L E D Y + ++ I + + P
Sbjct: 233 YLGDNPNSSKFKYV-TMLTFPESQSSPNLDPLAYTLPMKAIKIAG--KRLNIPPAAFKPD 289
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI-----KLTPYQDPRLGSQLCYKTPSM 299
A G ID+G+ T L + Y +++E+V + K Y D + +C+
Sbjct: 290 AGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADV---ADMCFDAGVT 346
Query: 300 AGIAPI---LTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM---QPIDGDVGIFGNFAQS 353
A + ++ FD G ++ + + +GV C + + + I G Q
Sbjct: 347 AEVGRRIGGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQ 406
Query: 354 DLFIGYDFDSQMVSFKPTDCTK 375
++++ YD ++ V F +C++
Sbjct: 407 NMWVEYDLANKRVGFGGAECSR 428
>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
Length = 442
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 107/386 (27%), Positives = 174/386 (45%), Gaps = 60/386 (15%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
N + ++G PP +I ++DTGS+L W+ C + + ++NP SSS+Y + C
Sbjct: 62 NVTLTVTLAVGDPPQ-NISMVLDTGSELSWLHC----KKSPNLGSVFNPVSSSTYSPVPC 116
Query: 81 QSEQCH-----LLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
S C L SC + LC+ YAD++ +G LA E G+ +F
Sbjct: 117 SSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTR--PGTLF 174
Query: 135 GC---GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
GC G ++ + GL+G+ R LS ++QLG +KFSYC+ +DSS+ +
Sbjct: 175 GCMDSGLSSNSEEDAKSTGLMGMNRGSLSF----VNQLGFSKFSYCIS--GSDSSVF--L 226
Query: 192 YFGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGN-LSNSSKLIPYYNSSG 244
G+ S G + T LV + D+ Y V LEGI VG+ + + K + + +G
Sbjct: 227 LLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTG 286
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEE----QVRNAIKLTPYQDPRLGSQ----LCYKT 296
A G +D+G T L Y L+ Q ++ ++L DP Q LCYK
Sbjct: 287 A---GQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLV--DDPDFVFQGTMDLCYKV 341
Query: 297 -----PSMAGIAPILTAHFDG------GAKVPLIHTSTFIPPPVEGVFCFAMQPIDG--- 342
P+ +G+ P+++ F G G K+ L + E V+CF D
Sbjct: 342 GSTTRPNFSGL-PMVSLMFRGAEMSVSGQKL-LYRVNGAGSEGKEEVYCFTFGNSDLLGI 399
Query: 343 DVGIFGNFAQSDLFIGYDFDSQMVSF 368
+ + G+ Q ++++ +D V F
Sbjct: 400 EAFVIGHHHQQNVWMEFDLAKSRVGF 425
>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
Length = 393
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 89/347 (25%), Positives = 127/347 (36%), Gaps = 80/347 (23%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQ---CYKQVKPIYNPASSSSYKELS 79
EYV+ +G+P + ++DTGSD+ WVQC PC C+ +++PA+SS+Y +
Sbjct: 105 EYVISVGLGSPAVTQRV-VIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFN 163
Query: 80 CQSEQCHLL----DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
C + C L + C ++ C Y Y D S T G FG
Sbjct: 164 CSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGT----------------GFQFG 207
Query: 136 CGHNNTGV-FNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
C H G ++ GL+GLG SL SQ
Sbjct: 208 CSHAELGAGMDDKTDGLIGLGGDAQSLVSQ------------------------------ 237
Query: 195 NGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFID 254
T+ SK+ TYYF LE I+VG L P ++G++ +D
Sbjct: 238 ------------TAARSKKVPTYYFAALEDIAVGG--KKLGLSPSVFAAGSL------VD 277
Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGG 313
+G T LP Y L R + +P C+ + ++ P + F GG
Sbjct: 278 SGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPTVALVFAGG 337
Query: 314 AKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYD 360
A V L G FA D G GN Q + YD
Sbjct: 338 AVVDLDAHGIV----SGGCLAFAPTRDDKAFGTIGNVQQRTFEVLYD 380
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 106/391 (27%), Positives = 168/391 (42%), Gaps = 65/391 (16%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK----QVK-PIYNPASSSSYK 76
G Y K +GTPP + +DTGSD++WV C C C K Q++ ++P SSS
Sbjct: 82 GLYYTKVKLGTPPR-EFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSAS 140
Query: 77 ELSCQSEQCH--LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG---------NS 125
+SC +C+ CS LC+Y++ Y D S T G ++ ++F NS
Sbjct: 141 LVSCSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINS 200
Query: 126 NNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPF 181
+ F VFGC + +G G+ GLG+ LS+ SQ+ Q L FS+CL
Sbjct: 201 SAPF---VFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCL--- 254
Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTY---------YFVTLEGISVGNLSN 232
+ SGGG++ + + D Y Y V L+ I+V
Sbjct: 255 --------------KGDKSGGGIMVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAV----- 295
Query: 233 SSKLIPYYNSSGAISKGN-MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP-RLGS 290
+ +++P S I+ G+ IDTG LP + Y+ + V NA+ + Y P S
Sbjct: 296 NGQILPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAVANAV--SQYGRPITYES 353
Query: 291 QLCYK-TPSMAGIAPILTAHFDGGAKV---PLIHTSTFIPPPVEGVFCFAMQPID-GDVG 345
C++ T + P ++ F GGA + P + F ++C Q + +
Sbjct: 354 YQCFEITAGDVDVFPQVSLSFAGGASMVLGPRAYLQIF-SSSGSSIWCIGFQRMSHRRIT 412
Query: 346 IFGNFAQSDLFIGYDFDSQMVSFKPTDCTKQ 376
I G+ D + YD Q + + DC+ +
Sbjct: 413 ILGDLVLKDKVVVYDLVRQRIGWAEYDCSLE 443
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 104/389 (26%), Positives = 169/389 (43%), Gaps = 59/389 (15%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
N + ++G+PP ++ ++DTGS+L W+ C + + + ++NP SS +Y ++ C
Sbjct: 66 NVSLTVSLTVGSPPQ-NVTMVLDTGSELSWLHC----KKTQFLNSVFNPLSSKTYSKVPC 120
Query: 81 QSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
S C L VSC + +LC+ YAD++ +G LA E G+ +FG
Sbjct: 121 LSPTCKTRTRDLTIPVSCDATKLCHVIVSYADATSIEGNLAFETFRLGSLTK--PATIFG 178
Query: 136 C---GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
C G ++ + GL+G+ R LS ++Q+G KFSYC+ F DS+ +
Sbjct: 179 CMDSGFSSNSEEDSKTTGLIGMNRGSLSF----VNQMGYPKFSYCISGF--DSA--GVLL 230
Query: 193 FGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNS---SKLIPYYNSS 243
GN S + T LV D+ Y V LEGI V N S S +P + +
Sbjct: 231 LGNASFPWLKPLSYTPLVQISTPLPYFDRVAYTVQLEGIKVKNKVLSLPKSVFVP--DHT 288
Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLE----EQVRNAIKLTPYQDPRLGSQ----LCY- 294
GA G +D+G T L Y L+ Q R +K+ D Q LCY
Sbjct: 289 GA---GQTMVDSGTQFTFLLGPVYTALKNEFLSQTRGILKV--LNDDNFVFQGAMDLCYL 343
Query: 295 ---KTPSMAGIAPILTAHFDGGA-KVPLIHTSTFIPPPVEG---VFCFAMQPID---GDV 344
P++ + P+++ F G V +P V G V+CF D +
Sbjct: 344 LDSSRPNLQNL-PVVSLMFQGAEMSVSGERLLYRVPGEVRGRDSVWCFTFGNSDLLGVEA 402
Query: 345 GIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ G+ Q ++++ +D + + C
Sbjct: 403 FVIGHHHQQNVWMEFDLEKSRIGLADVRC 431
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 104/399 (26%), Positives = 167/399 (41%), Gaps = 81/399 (20%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
G Y K IGTP D Y VDTG+D+MWV C+ C +C + +YN SSS K
Sbjct: 71 GLYYAKIGIGTPSK-DYYLQVDTGTDMMWVNCIQCKECPTRSNLGMDLTLYNIKESSSGK 129
Query: 77 ELSCQSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD- 130
+ C E C LL + + C Y Y D S T G + + F +
Sbjct: 130 LVPCDQELCKEINGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQVSGDLKT 189
Query: 131 -----NVVFGCGHNNTGVF---NENEM-GLVGLGRTRLSLASQILSQLGANK-FSYCLVP 180
+V+FGCG +G NE + G++G G+ S+ SQ+ S K F++CL
Sbjct: 190 ASANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCL-- 247
Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGN-- 229
NG V+GGG+ + V + D+ +Y V + I VG+
Sbjct: 248 --------------NG--VNGGGIFAIGHVVQPTVNTTPLLPDQPHYSVNMTAIQVGHTF 291
Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
L+ S+ +S G I ID+G LP Y L ++ + Q P L
Sbjct: 292 LNLSTDASEQRDSKGTI------IDSGTTLAYLPDGIYQPLVYKILS-------QQPNLK 338
Query: 290 SQ------LCYK-TPSMAGIAPILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPID 341
Q C++ + S+ P +T +F+ G + + H F+ E ++C Q
Sbjct: 339 VQTLHDEYTCFQYSGSVDDGFPNVTFYFENGLSLKVYPHDYLFLS---ENLWCIGWQNSG 395
Query: 342 G------DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
++ + G+ S+ + YD ++Q++ + +C+
Sbjct: 396 AQSRDSKNMTLLGDLVLSNKLVFYDLENQVIGWTEYNCS 434
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 171/383 (44%), Gaps = 51/383 (13%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQ----VK-PIYNPASSSSYK 76
G Y K +G PP D Y VDTGSD++WV C C +C + VK +Y+P SS+S
Sbjct: 80 GLYFAKIGLGNPPK-DYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTSAT 138
Query: 77 ELSCQSEQCHLLDT---VSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNNFF 129
+ C + C C+ C Y+ Y D S T G + + F GN
Sbjct: 139 RIYCDDDFCAATYNGVLQGCTKDLPCQYSVVYGDGSSTAGFFVKDNLQFDRVTGNLQTSS 198
Query: 130 DN--VVFGCGHNNTGVF---NENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHT 183
N V+FGCG +G +E G++G G+ S+ SQ+ + + F++CL
Sbjct: 199 ANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCL----- 253
Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLIPYYN 241
++ F G EV V +T +V ++ +Y V ++ I VG L + + +
Sbjct: 254 -DNVKGGGIFAIG-EVVSPKVNTTPMVP--NQPHYNVVMKEIEVGGNVLELPTDIFDTGD 309
Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQDPRLGSQLCYK-TP 297
G I ID+G LP+ Y + ++ + +KL ++ C++ T
Sbjct: 310 RRGTI------IDSGTTLAYLPEVVYESMMTKIVSEQPGLKLHTVEEQF----TCFQYTG 359
Query: 298 SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-DVGIFGNFA 351
++ P++ HF+G + ++ ++ E V+CF MQ DG D+ + G+
Sbjct: 360 NVNEGFPVVKFHFNGSLSLT-VNPHDYLFQIHEEVWCFGWQNSGMQSKDGRDMTLLGDLV 418
Query: 352 QSDLFIGYDFDSQMVSFKPTDCT 374
S+ + YD ++Q + + +C+
Sbjct: 419 LSNKLVLYDLENQAIGWTDYNCS 441
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 88/381 (23%), Positives = 164/381 (43%), Gaps = 65/381 (17%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
G Y K +G+PP + + VDTGSD++W+ C PC +C + +++ +SS+ K
Sbjct: 72 GLYFTKIKLGSPPK-EYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSK 130
Query: 77 ELSCQSEQCHLL-DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFG------NSNNFF 129
++ C + C + + SC C+Y YAD S + G + +T +
Sbjct: 131 KVGCDDDFCSFISQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLG 190
Query: 130 DNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDS 185
VVFGCG + +G + G++G G++ S+ SQ+ + A + FS+CL
Sbjct: 191 QEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL------- 243
Query: 186 SITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGNLSNSSKL 236
V GGG+ + +V ++ +Y V L G+ V +S
Sbjct: 244 -----------DNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDV---DGTSLD 289
Query: 237 IPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV--RNAIKLTPYQDPRLGSQLCY 294
+P + G +D+G PK Y+ L E + R +KL ++ + C+
Sbjct: 290 LP----RSIVRNGGTIVDSGTTLAYFPKVLYDSLIETILARQPVKLHIVEE----TFQCF 341
Query: 295 KTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP------IDGDVGIF 347
+ A P ++ F+ K+ ++ ++ E ++CF Q +V +
Sbjct: 342 SFSTNVDEAFPPVSFEFEDSVKLT-VYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILL 400
Query: 348 GNFAQSDLFIGYDFDSQMVSF 368
G+ S+ + YD D++++ +
Sbjct: 401 GDLVLSNKLVVYDLDNEVIGW 421
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 98/396 (24%), Positives = 163/396 (41%), Gaps = 71/396 (17%)
Query: 19 TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
++ G Y K +G+P + Y VDTGSD++WV C C C K+ +Y+P S
Sbjct: 67 SSTGLYYTKVGLGSPAK-EFYVQVDTGSDILWVNCAGCTACPKKSGLGMDLTLYDPNGSK 125
Query: 74 SYKELSCQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITF----GN 124
+ + C C DT S C C Y+ Y D S T G + +TF GN
Sbjct: 126 TSNAVPCGDGFC--TDTYSGPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTFDEVSGN 183
Query: 125 SNNFFDN--VVFGCGHNNTGVFNENE----MGLVGLGRTRLSLASQILSQLGANK-FSYC 177
+ DN V+FGCG +G + N G++G G+ S+ SQ+ + + FS+C
Sbjct: 184 LHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHC 243
Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED---------KTYYFVTLEGISVG 228
L H GGG+ S V + +Y V L+ + V
Sbjct: 244 LDSHH------------------GGGIFSIGQVMEPKFNTTPLVPRMAHYNVILKDMDV- 284
Query: 229 NLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQD 285
L+P Y +G + ID+G LP YN+L +V + +KL +D
Sbjct: 285 --DGEPILLPLYLFDSGSGRGTI-IDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVED 341
Query: 286 PRLGSQLCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID--- 341
C+ + + P++ HF+G + +H ++ E ++C Q
Sbjct: 342 ----QFTCFHYSDKLDEGFPVVKFHFEGLSLT--VHPHDYLFLYKEDIYCIGWQKSSTQT 395
Query: 342 ---GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
D+ + G+ S+ + YD ++ ++ + +C+
Sbjct: 396 KEGRDLILIGDLVLSNKLVVYDLENMVIGWTNFNCS 431
>gi|125579874|gb|EAZ21020.1| hypothetical protein OsJ_36669 [Oryza sativa Japonica Group]
Length = 382
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 71/235 (30%), Positives = 113/235 (48%), Gaps = 17/235 (7%)
Query: 150 GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSL 209
GL+GLGR RLSL +SQ GA KFSYCL P+ ++ T ++ G + + G G V T+
Sbjct: 153 GLMGLGRGRLSL----VSQTGATKFSYCLTPYFHNNGATGHLFVGASASLGGHGDVMTTQ 208
Query: 210 VSKEDK--TYYFVTLEGISVG--NLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKD 265
K K +Y++ L G++VG L + + + + G + ID+G+P T L D
Sbjct: 209 FVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSGGVIIDSGSPFTSLVHD 268
Query: 266 FYNRLEEQVR---NAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTS 322
Y+ L ++ N + P D G+ LC + + P + HF GGA + + S
Sbjct: 269 AYDALASELAARLNGSLVAPPPDADDGA-LCVARRDVGRVVPAVVFHFRGGADMAVPAES 327
Query: 323 TFIPPPVEGVFCFAMQPIDG---DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ PV+ G + GN+ Q ++ + YD + SF+P DC+
Sbjct: 328 YWA--PVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDFSFQPADCS 380
Score = 41.6 bits (96), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 29/50 (58%), Gaps = 2/50 (4%)
Query: 17 VSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV-QCYKQVKP 65
V A +YV ++ IG PP ++DTGSDL+W QC C+ Q + Q P
Sbjct: 83 VRWATLQYVAEYLIGDPPQ-RAEALIDTGSDLVWTQCSTCLRQGFSQAGP 131
>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 513
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 99/367 (26%), Positives = 162/367 (44%), Gaps = 48/367 (13%)
Query: 29 SIGTPPLLDIYGI-VDTGSDLMWVQCLPCVQCYKQVKP---------IYNPASSSSYKEL 78
++GTP D + + +DTGSDL W+ C C C +++K IY+P +SS+ ++
Sbjct: 109 TVGTPS--DWFMVALDTGSDLFWLPC-DCTNCVRELKAPGGSSLDLNIYSPNASSTSTKV 165
Query: 79 SCQSEQCHLLDTVSCSSQQLCNYTYGY-ADSSLTKGVLATERITF----GNSNNFFDNVV 133
C S C D + S + C Y Y ++ + + GVL + + +S V
Sbjct: 166 PCNSTLCTRGDRCA-SPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVT 224
Query: 134 FGCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSK 190
FGCG TGVF++ GL GLG +S+ S + + + AN FS C F D + +
Sbjct: 225 FGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMC---FGNDGA--GR 279
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
+ FG+ V T L ++ Y +T+ ISVG ++G +
Sbjct: 280 ISFGDKGSVDQR---ETPLNIRQPHPTYNITVTKISVGG------------NTGDLEFDA 324
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ--DPRLGSQLCYK-TPSMAGIA-PIL 306
+F D+G T L Y + E + YQ D L + CY +P+ P +
Sbjct: 325 VF-DSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAV 383
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
GG+ P+ H IP V+C A+ I+ D+ I G + + +D + ++
Sbjct: 384 NLTMKGGSSYPVYHPLVVIPMKDTDVYCLAIMKIE-DISIIGQNFMTGYRVVFDREKLIL 442
Query: 367 SFKPTDC 373
+K +DC
Sbjct: 443 GWKESDC 449
>gi|242089103|ref|XP_002440384.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
gi|241945669|gb|EES18814.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
Length = 555
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 99/412 (24%), Positives = 164/412 (39%), Gaps = 57/412 (13%)
Query: 13 VQSNVSTAN-GEYVMKFSIGTPPLLDIYGIV-DTGSDLMWVQC----------------- 53
++S ++TA+ G Y++ GTP L Y +V DT +DL W+ C
Sbjct: 128 MRSALNTAHVGMYLVSVRFGTPAL--PYNLVLDTANDLTWINCRLRRRKGKHYGRQSSKT 185
Query: 54 ---------LPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQ---QLCNY 101
+ + + K Y PA SSS++ + C +QC L +C S + C+Y
Sbjct: 186 MSVGGDDDVVAALAKKEARKNWYRPAKSSSWRRIRCSEQQCAHLPYNTCQSPSKLESCSY 245
Query: 102 TYGYADSSLTKGVLATERITFGNSNNFFDN---VVFGCGHNNTGVFNENEMGLVGLGRTR 158
D ++T G+ E+ T S+ +V GC G + G++ LG
Sbjct: 246 YQKTQDGTVTIGIYGNEKATVTVSDGRMAKLPGLVLGCSVLEAGASVDAHDGVLSLGNGH 305
Query: 159 LSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKED-KTY 217
+S A + + G +FS+CL+ ++ +S + FG V G G + T ++ D K
Sbjct: 306 MSFAIHAVLRFGG-RFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETEILYNVDVKAA 364
Query: 218 YFVTLEGISVGNLSNSSKL-IP--YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV 274
Y + + VG +L IP +N + G + +DT T L + Y L +
Sbjct: 365 YGPRVTAVLVGG----ERLDIPDDVWNIDKGLGSG-VILDTSTSVTSLVPEAYEPLVAAL 419
Query: 275 RNAIKLTPYQDPRLGSQLCYK--------TPSMAGIAPILTAHFDGGAKVPLIHTSTFIP 326
+ P ++ G + CY+ P+ P +T GGA++ S +P
Sbjct: 420 DRHLAHLP-RESFAGFEYCYRWTFTGDGVDPAHNVTIPKVTVEMTGGARLEPEAKSVVMP 478
Query: 327 PPVEGVFCFAMQ--PIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTKQ 376
GV C A + P G I GN + D F+ C +
Sbjct: 479 EVGHGVACLAFRKLPWGGGPCIIGNVLMQEYIWEIDHSKATFRFRKDKCNTR 530
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 94/370 (25%), Positives = 157/370 (42%), Gaps = 37/370 (10%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
NG Y + IGTPP IVDTGS + +V C C QC + P + P SS+Y+ + C
Sbjct: 10 NGYYTTRLWIGTPPQ-RFALIVDTGSSVTYVPCSSCEQCGRHQDPKFQPDLSSTYQSVKC 68
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHN 139
+D +Q C Y YA+ S + GVL + I+FGN + VFGC +
Sbjct: 69 N------IDCNCDDEKQQCVYERQYAEMSTSSGVLGEDIISFGNLSALAPQRAVFGCENM 122
Query: 140 NTG-VFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSITSKMYFGNGS 197
TG +++++ G++G+GR LS+ ++ + N FS C + M G
Sbjct: 123 ETGDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLC---YGGMGIGGGAMVLG--- 176
Query: 198 EVSGGGVVSTSLVSKEDKT---YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFID 254
G S + S+ D YY + L+ I V + K +P N + K +D
Sbjct: 177 ---GISPPSNMVFSQSDPVRSPYYNIDLKEIHV-----AGKPLP-LNPTVFDGKHGTILD 227
Query: 255 TGAPPTLLPKDFYNRLEEQVRNAIK-LTPYQDPRLG-SQLCY-----KTPSMAGIAPILT 307
+G LP+ + ++ + + L P + P + +C+ ++ P +
Sbjct: 228 SGTTYAYLPEAAFVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAGSDISQLSSSFPAVE 287
Query: 308 AHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFDSQM 365
F G K+ L F V G +C + Q + G + + YD ++
Sbjct: 288 MVFGNGQKLLLSPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDRENSK 347
Query: 366 VSFKPTDCTK 375
+ F T+C++
Sbjct: 348 IGFWKTNCSE 357
>gi|125552953|gb|EAY98662.1| hypothetical protein OsI_20585 [Oryza sativa Indica Group]
Length = 429
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 114/419 (27%), Positives = 177/419 (42%), Gaps = 71/419 (16%)
Query: 17 VSTANGEYVMKFSIGTPP-LLDIYGIVDTGSDLMWVQC-----LPCVQC------YKQVK 64
V+T Y++ ++G PP + +Y +DTGSDL WV C C++C K +
Sbjct: 18 VTTYTDGYLLSLNLGMPPQVFQVY--LDTGSDLTWVPCGTNSSYQCLECGNEHSTSKPIP 75
Query: 65 PIYNPASSSSYKELSCQSEQC---HLLD-------TVSCS----SQQLCN-----YTYGY 105
SSS+ KEL C S C H D V C+ LC ++Y Y
Sbjct: 76 SFSPSQSSSNMKEL-CGSRFCVDIHSSDNSHDPCAAVGCAIPSFMSGLCTRPCPPFSYTY 134
Query: 106 ADSSLTKGVLATERITFGNS----NNFFD--NVVFGCGHNNTGVFNENEMGLVGLGRTRL 159
+L G LA + +T S D FGC G +G+ G G+ L
Sbjct: 135 GGGALVLGSLAKDIVTLHGSIFGIAILLDVPGFCFGC----VGSSIREPIGIAGFGKGIL 190
Query: 160 SLASQILSQLGANKFSYCLVPFH--TDSSITSKMYFGNGSEVSGGGVVSTSLV-SKEDKT 216
SL SQ+ FS+C + F + + TS + G+ + + + T ++ S +
Sbjct: 191 SLPSQL--GFLDKGFSHCFLGFRFARNPNFTSSLIMGDLALSAKDDFLFTPMLKSITNPN 248
Query: 217 YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRN 276
+Y++ LEG+S+G+ + P +S + G M +DTG T LP FY + + +
Sbjct: 249 FYYIGLEGVSIGD-GAAIAAPPSLSSIDSEGNGGMIVDTGTTYTHLPDPFYTAILSSLAS 307
Query: 277 AIKLTPYQD--PRLGSQLCYK-----TPSMAGIAPILTAHFDGGAKVPLIHTSTF--IPP 327
I D R G LC+K TP P++ HF G K+ L S + +
Sbjct: 308 VILYERSYDLEMRTGFDLCFKIPCTHTPCTQDELPLINFHFLGDVKLTLPKDSCYYAVTA 367
Query: 328 PVEGVF--CFAMQPIDG--DVG--------IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
P V C Q +D DVG + G+F ++ + YD ++ + F+P DC
Sbjct: 368 PKNSVVVKCLLFQRMDDEDDVGGANNGPGAVLGSFQMQNVEVVYDMEAGRIGFQPKDCA 426
>gi|125554529|gb|EAZ00135.1| hypothetical protein OsI_22138 [Oryza sativa Indica Group]
Length = 472
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 103/382 (26%), Positives = 162/382 (42%), Gaps = 55/382 (14%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELS 79
++M S+G PP++++ I DTGS L WVQC PC V C+ Q PI++P S + + +
Sbjct: 114 FLMAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVR 172
Query: 80 CQSEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDN 131
C S +C L +C ++ C Y+ Y + + + G + T+ + G+S F +
Sbjct: 173 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMD 229
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSIT 188
++FGC + ++E E G+ G G + S Q+ L FSYCL TD +
Sbjct: 230 LMFGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL---PTDETKP 284
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
M G + G TSL ++ Y +T+E + ++N +L+ S
Sbjct: 285 GYMILGRYDRAAMDGGY-TSLFRSINRPTYSLTMEML----IANGQRLV--------TSS 331
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK---------- 295
M +D+GA T L + L++ + A+ Y R S +CY
Sbjct: 332 SEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNG 391
Query: 296 --TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFA 351
TP S P+L F GGA + L + F P G+ FA P I GN
Sbjct: 392 TITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRV 450
Query: 352 QSDLFIGYDFDSQMVSFKPTDC 373
+D + FK C
Sbjct: 451 TRSFGTTFDIQGKQFGFKYAAC 472
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 97/394 (24%), Positives = 167/394 (42%), Gaps = 50/394 (12%)
Query: 11 NVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----P 65
N+ + + T G Y K +G+PP D Y VDTGSD++WV C+ C +C ++
Sbjct: 57 NLGGNGLPTETGLYFTKLGLGSPPR-DYYVQVDTGSDILWVNCVECSRCPRKSDLGIDLT 115
Query: 66 IYNPASSSSYKELSCQSEQCHLL---DTVSCSSQQLCNYTYGYADSSLTKGVLATERITF 122
+Y+P S + +SC + C C S+ C Y+ Y D S T G + +T+
Sbjct: 116 LYDPKGSETSDVVSCDQDFCSATFDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTY 175
Query: 123 GNSNNFF------DNVVFGCGHNNTGVF----NENEMGLVGLGRTRLSLASQILSQLGAN 172
N +++FGCG +G E G++G G+ S+ SQ+ +
Sbjct: 176 NRINGNLRTSPQNSSIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVK 235
Query: 173 K-FSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN-- 229
K FS+CL ++ F G EV V +T LV + +Y V L+ I V
Sbjct: 236 KIFSHCL------DNVRGGGIFAIG-EVVEPKVSTTPLVPR--MAHYNVVLKSIEVDTDI 286
Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
L S + N G + ID+G LP Y+ L ++V + P L
Sbjct: 287 LQLPSDIFDSVNGKGTV------IDSGTTLAYLPDIVYDELIQKV---LARQPGLKLYLV 337
Query: 290 SQ--LCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG---- 342
Q C+ T ++ P++ HF + ++ ++ +G++C Q
Sbjct: 338 EQQFRCFLYTGNVDRGFPVVKLHFKDSLSLT-VYPHDYLFQFKDGIWCIGWQRSVAQTKN 396
Query: 343 --DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
D+ + G+ S+ + YD ++ ++ + +C+
Sbjct: 397 GKDMTLLGDLVLSNKLVIYDLENMVIGWTDYNCS 430
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 99/396 (25%), Positives = 160/396 (40%), Gaps = 57/396 (14%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPAS--------SS 73
G Y + + GTPP ++ I DTGS L+W C +C + P +PA+ SS
Sbjct: 130 GAYSVSLAFGTPPQ-NLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVPKLSS 188
Query: 74 SYKELSCQSEQCHLL---------DTVSCSSQQLCNYTYGYA---DSSLTKGVLATERIT 121
S K + C++ +C + + S++ + GY S T G+L +E +
Sbjct: 189 SVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGATAGILLSETLD 248
Query: 122 FGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
N + + GC + G+ G GR SL SQ+ +FS+CLV
Sbjct: 249 LENKR--VPDFLVGCSVMSV----HQPAGIAGFGRGPESLP----SQMRLKRFSHCLVSR 298
Query: 182 -HTDSSITSKMYFGNGSEVSGGGVVS---------TSLVSKEDKTYYFVTLEGISVGNLS 231
DS ++S + +GSE S S+ + + YY+++L I +G
Sbjct: 299 GFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIG--- 355
Query: 232 NSSKLIPY-YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPR 287
PY Y + G ID+G+ T L K + + +++ + P + +
Sbjct: 356 GKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEAQ 415
Query: 288 LGSQLCYKTPSMAGIA--PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVG 345
G + C+ P A P + F GG K+ L + EGV C M + VG
Sbjct: 416 SGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMTDEAVVG 475
Query: 346 -------IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
I G F Q ++ + YD Q + F+ CT
Sbjct: 476 GGGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKCT 511
>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
Length = 493
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 102/415 (24%), Positives = 166/415 (40%), Gaps = 74/415 (17%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP--CVQCYKQVKPIYN----PASSSSYK 76
+Y + F++ + P + +DTGSDL+W C P C+ C + + P SS+ +
Sbjct: 81 DYTLSFTLNSNPPQHVSLYLDTGSDLVWFPCKPFECILCEGKAENTTASTPPPRLSSTAR 140
Query: 77 ELSCQSEQCHL--------------------LDTVSCSSQQLCNYTYGYADSSLTKGVLA 116
+ C+S C ++T C S ++ Y Y D SL +
Sbjct: 141 SVHCKSSACSAAHSNLPTSDLCAIADCPLESIETSDCHSFSCPSFYYAYGDGSLVARLYH 200
Query: 117 TE-RITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILS---QLGAN 172
++ + N FGC H +G+ G GR LSL +Q+ S QLG N
Sbjct: 201 DSIKLPLATPSLSLHNFTFGCAHTALA----EPVGVAGFGRGVLSLPAQLASFAPQLG-N 255
Query: 173 KFSYCLV--PFHTDS-SITSKMYFGNGSE----VSGGGV--VSTSLVSKEDKTYYF-VTL 222
+FSYCLV F++D + S + G+ + V+ V V TS++ Y++ V L
Sbjct: 256 RFSYCLVSHSFNSDRLRLPSPLILGHSDDKEKRVNKDDVQFVYTSMLDNPKHPYFYCVGL 315
Query: 223 EGISVGNLSNSSKLIP---YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI- 278
EGIS+G K IP + G + +D+G T+LP YN + + N +
Sbjct: 316 EGISIGK-----KKIPAPEFLKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVG 370
Query: 279 ---KLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---- 331
+ + + G CY ++ I P L HF G ++ + ++G
Sbjct: 371 RVYERAKEVEDKTGLGPCYYYDTVVNI-PSLVLHFVGNESSVVLPKKNYFYDFLDGGDGV 429
Query: 332 -----VFCFAM-------QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
V C + + G GN+ Q + YD + + V F C
Sbjct: 430 RRKRRVGCLMLMNGGEEAELTGGPGATLGNYQQHGFEVVYDLEQRRVGFARRKCA 484
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 105/386 (27%), Positives = 172/386 (44%), Gaps = 60/386 (15%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
N + ++G PP +I ++DTGS+L W+ C + + ++NP SSS+Y + C
Sbjct: 62 NVTLTVTLAVGDPPQ-NISMVLDTGSELSWLHC----KKSPNLGSVFNPVSSSTYSPVPC 116
Query: 81 QSEQCH-----LLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
S C L SC + LC+ YAD++ +G LA E G+ +F
Sbjct: 117 SSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTR--PGTLF 174
Query: 135 GC---GHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
GC G ++ + GL+G+ R LS ++QLG +KFSYC+ + S + +
Sbjct: 175 GCMDSGLSSNSEEDAKSTGLMGMNRGSLSF----VNQLGFSKFSYCI----SGSDSSGFL 226
Query: 192 YFGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGN-LSNSSKLIPYYNSSG 244
G+ S G + T LV + D+ Y V LEGI VG+ + + K + + +G
Sbjct: 227 LLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTG 286
Query: 245 AISKGNMFIDTGAPPTLLPKDFYNRLEE----QVRNAIKLTPYQDPRLGSQ----LCYKT 296
A G +D+G T L Y L+ Q ++ ++L DP Q LCYK
Sbjct: 287 A---GQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLV--DDPDFVFQGTMDLCYKV 341
Query: 297 -----PSMAGIAPILTAHFDG------GAKVPLIHTSTFIPPPVEGVFCFAMQPIDG--- 342
P+ +G+ P+++ F G G K+ L + E V+CF D
Sbjct: 342 GSTTRPNFSGL-PMVSLMFRGAEMSVSGQKL-LYRVNGAGSEGKEEVYCFTFGNSDLLGI 399
Query: 343 DVGIFGNFAQSDLFIGYDFDSQMVSF 368
+ + G+ Q ++++ +D V F
Sbjct: 400 EAFVIGHHHQQNVWMEFDLAKSRVGF 425
>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 488
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 95/384 (24%), Positives = 158/384 (41%), Gaps = 57/384 (14%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
G Y K +G P + +DTGSD++WV C PC C +++ SSS +
Sbjct: 82 GLYFTKVKLGNPAR-EFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSAR 140
Query: 77 ELSCQSEQCHLLDTVS--CSSQ-QLCNYTYGYADSSLTKGVLATERITF----GNSN--N 127
L C C + T + C +Q C+Y++ Y D S T G T+ + F G S N
Sbjct: 141 VLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIAN 200
Query: 128 FFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHT 183
+VFGC G G+ G G+ S+ SQ+ S+ + FS+CL
Sbjct: 201 SSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCL----- 255
Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKE--------DKTYYFVTLEGISV-GNLSNSS 234
G E GG +V ++ + +Y + L+ I++ G L +
Sbjct: 256 -----------KGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFPNP 304
Query: 235 KLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY 294
+ P N+ G ID+G L ++ Y+ + + +A+ + GSQ
Sbjct: 305 TMFPISNA------GETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQCFR 358
Query: 295 KTPSMAGIAPILTAHFDGGAKVPL-----IHTSTFIPPPVEGVFCFAMQPIDGDVGIFGN 349
+ S+A I P+L +F+G A + + + + + P ++C Q + + I G+
Sbjct: 359 VSMSVADIFPVLRFNFEGIASMVVTPEEYLQFDSIVREP--ALWCIGFQKAEDGLNILGD 416
Query: 350 FAQSDLFIGYDFDSQMVSFKPTDC 373
D I YD Q + + DC
Sbjct: 417 LVLKDKIIVYDLARQRIGWANYDC 440
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 88/309 (28%), Positives = 138/309 (44%), Gaps = 40/309 (12%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
N + ++GTPP + ++DTGS+L W+ C + + + ++NP SSSY + C
Sbjct: 67 NVTLTVSLTVGTPPQ-SVTMVLDTGSELSWLHC----KKQQNINSVFNPHLSSSYTPIPC 121
Query: 81 QSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
S C L VSC S LC+ T YAD + +G LA++ TF S + ++FG
Sbjct: 122 MSPICKTRTRDFLIPVSCDSNNLCHVTVSYADFTSLEGNLASD--TFAISGSGQPGIIFG 179
Query: 136 ---CGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
G ++ + GL+G+ R LS ++Q+G KFSYC+ + + +
Sbjct: 180 SMDSGFSSNANEDSKTTGLMGMNRGSLSF----VTQMGFPKFSYCI----SGKDASGVLL 231
Query: 193 FGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLS-NSSKLIPYYNSSGA 245
FG+ + G + T LV D+ Y V L GI VG+ K I + +GA
Sbjct: 232 FGDATFKWLGPLKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTGA 291
Query: 246 ISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRL----GSQLCYKTPSM 299
G +D+G T L Y L + + LT +DP LC++
Sbjct: 292 ---GQTMVDSGTRFTFLLGSVYTALRNEFVAQTRGVLTLLEDPNFVFEGAMDLCFRV-RR 347
Query: 300 AGIAPILTA 308
G+ P + A
Sbjct: 348 GGVVPAVPA 356
>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
Length = 469
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 105/397 (26%), Positives = 170/397 (42%), Gaps = 64/397 (16%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP---CVQC-YKQVK----PIYNPASSS 73
G Y + + GTPP + ++DTGS L+W C C +C + ++ P + P SS
Sbjct: 90 GGYSISLNFGTPPQTTKF-VMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSS 148
Query: 74 SYKELSCQSEQCHLL------------DTVSCSSQQLCN-YTYGYADSSLTKGVLATERI 120
S + C++ +C L D + + Q C Y Y S T G+L +E +
Sbjct: 149 SSNLIGCKNHKCSWLFGPKVQSKCQECDPTTQNCTQSCPPYVIQYGLGS-TAGLLLSETL 207
Query: 121 TFGNSNNFFDNVVFGCGHNNTGVFN-ENEMGLVGLGRTRLSLASQILSQLGANKFSYCLV 179
F + + GC +F+ G+ G GR+ SL SQ LG KFSYCLV
Sbjct: 208 DFPHKKTI-PGFLVGCS-----LFSIRQPEGIAGFGRSPESLPSQ----LGLKKFSYCLV 257
Query: 180 PFHTDSSITSK---MYFGNGSEVSGGGVVSTSLVSKED----KTYYFVTLEGISVGNLSN 232
D + S + G+GS+ + +S + K + YY+V L I +G +
Sbjct: 258 SHAFDDTPASSDLVLDTGSGSDDTKTPGLSYTPFQKNPTAAFRDYYYVLLRNIVIG---D 314
Query: 233 SSKLIPY-YNSSGAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTPYQDPR 287
+ +PY + G+ G +D+G T + K Y E+QV + T Q+ +
Sbjct: 315 THVKVPYKFLVPGSDGNGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTVATEVQN-Q 373
Query: 288 LGSQLCYKTPSMAGIA-PILTAHFDGGAK--VPLIHTSTFIPPPVEGVFCFAMQPID--- 341
G + C+ ++ P HF GGAK +PL + +F+ GV C + +
Sbjct: 374 TGLRPCFNISGEKSVSVPEFIFHFKGGAKMALPLANYFSFVD---SGVICLTIVSDNMSG 430
Query: 342 -----GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
G I GN+ Q + + +D ++ FK +C
Sbjct: 431 SGIGGGPAIILGNYQQRNFHVEFDLKNERFGFKQQNC 467
>gi|125552158|gb|EAY97867.1| hypothetical protein OsI_19787 [Oryza sativa Indica Group]
Length = 477
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 102/368 (27%), Positives = 157/368 (42%), Gaps = 43/368 (11%)
Query: 18 STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV---QCYKQVKPIYNPASSSS 74
+T G Y++ +GTPP +YG D S +WV C CV C +Y
Sbjct: 81 ATTGGTYLITVGVGTPPQY-VYGAFDISSQFVWVPCEECVSPYSCPSDKTGVYKTLPREL 139
Query: 75 YKELSCQSEQCH-LLDTVSCSS--QQLCNYTYGYADSSLTKGV--LATERITFGNSNNFF 129
Y SC ++C ++ C + C YT Y + T+ L + T G+ N
Sbjct: 140 Y---SCGEQRCRTIVGQPDCGAPYNGPCKYTCRYGGAGGTETEGHLGLQPFTLGD-NTMP 195
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI-- 187
N++FGCG E G++GL R RLSL SQ+ QLG +FSY P + D++
Sbjct: 196 VNMIFGCGLE-----PETNFGVIGLNRGRLSLISQL--QLG--RFSYYFAPEYDDTAAGN 246
Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTY---YFVTLEGISVGNLSNSSKLIPYYNSSG 244
S + FG + T S E+ Y Y V L G+ VG SN+ ++ G
Sbjct: 247 ASFILFGEYAVPQTSNPRYTQFWSYENGAYSYLYLVGLSGMRVG--SNNLNML------G 298
Query: 245 AISKGN----MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
A S G ++ T P T L K+ Y+ L ++ + + LG LCY + +A
Sbjct: 299 AGSGGRDPLVAYLSTSVPVTFLEKNAYDLLRRELVSTVGSDTVDGSALGLDLCYTSQYLA 358
Query: 301 GIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP--IDGDVGIFGNFAQSDLFI 357
P + F GA + L + G+ C + P + G + + G+ Q+ +
Sbjct: 359 KAKFPAMALVFWDGAVMELQPRNYLYQDTATGLECLTILPTAVAGGLSLLGSLIQTGTHM 418
Query: 358 GYDFDSQM 365
Y +D Q+
Sbjct: 419 MY-YDIQI 425
>gi|115463625|ref|NP_001055412.1| Os05g0384300 [Oryza sativa Japonica Group]
gi|50511407|gb|AAT77330.1| unknown protein [Oryza sativa Japonica Group]
gi|113578963|dbj|BAF17326.1| Os05g0384300 [Oryza sativa Japonica Group]
gi|222631434|gb|EEE63566.1| hypothetical protein OsJ_18383 [Oryza sativa Japonica Group]
Length = 477
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 102/368 (27%), Positives = 157/368 (42%), Gaps = 43/368 (11%)
Query: 18 STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCV---QCYKQVKPIYNPASSSS 74
+T G Y++ +GTPP +YG D S +WV C CV C +Y
Sbjct: 81 ATTGGTYLITVGVGTPPQY-VYGAFDISSQFVWVPCEECVSPYSCPSDKTGVYKTLPREL 139
Query: 75 YKELSCQSEQCH-LLDTVSCSS--QQLCNYTYGYADSSLTKGV--LATERITFGNSNNFF 129
Y SC ++C ++ C + C YT Y + T+ L + T G+ N
Sbjct: 140 Y---SCGEQRCRTIVGQPDCGAPYNGPCKYTCRYGGAGGTETEGHLGLQPFTLGD-NTMP 195
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSI-- 187
N++FGCG E G++GL R RLSL SQ+ QLG +FSY P + D++
Sbjct: 196 VNMIFGCGLE-----PETNFGVIGLNRGRLSLISQL--QLG--RFSYYFAPEYDDTAAGN 246
Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDKTY---YFVTLEGISVGNLSNSSKLIPYYNSSG 244
S + FG + T S E+ Y Y V L G+ VG SN+ ++ G
Sbjct: 247 ASFILFGEYAVPQTSNPRYTQFWSYENGAYSYLYLVGLSGMRVG--SNNLNML------G 298
Query: 245 AISKGN----MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
A S G ++ T P T L K+ Y+ L ++ + + LG LCY + +A
Sbjct: 299 AGSGGRDPLVAYLSTSVPITFLEKNAYDLLRRELVSTVGSDTVDGSALGLDLCYTSQYLA 358
Query: 301 GIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP--IDGDVGIFGNFAQSDLFI 357
P + F GA + L + G+ C + P + G + + G+ Q+ +
Sbjct: 359 KAKFPAMALVFWDGAVMELQPRNYLYQDTATGLECLTILPTAVAGGLSLLGSLIQTGTHM 418
Query: 358 GYDFDSQM 365
Y +D Q+
Sbjct: 419 MY-YDIQI 425
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 90/369 (24%), Positives = 156/369 (42%), Gaps = 35/369 (9%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
NG Y + IGTPP IVDTGS + +V C C C P + P S +Y+ + C
Sbjct: 90 NGYYTARLWIGTPPQ-RFALIVDTGSTVTYVPCSTCRHCGSHQDPKFRPEDSETYQPVKC 148
Query: 81 QSEQCHLLDTVSC-SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGH 138
+ QC +C + ++ C Y YA+ S + G L + ++FGN +FGC +
Sbjct: 149 -TWQC------NCDNDRKQCTYERRYAEMSTSSGALGEDVVSFGNQTELSPQRAIFGCEN 201
Query: 139 NNTG-VFNENEMGLVGLGRTRLSLASQILS-QLGANKFSYCLVPFHTDSSITSKMYFGNG 196
+ TG ++N+ G++GLGR LS+ Q++ ++ ++ FS C + M G
Sbjct: 202 DETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLC---YGGMGVGGGAMVLGGI 258
Query: 197 SEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTG 256
S + + V YY + L+ I V +L + N K +D+G
Sbjct: 259 SPPADMVFTRSDPVR---SPYYNIDLKEIHVAG----KRL--HLNPKVFDGKHGTVLDSG 309
Query: 257 APPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCYK-----TPSMAGIAPILTA 308
LP+ + + + +++K DPR + +C+ ++ P++
Sbjct: 310 TTYAYLPESAFLAFKHAIMKETHSLKRISGPDPRY-NDICFSGAEIDVSQISKSFPVVEM 368
Query: 309 HFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGD-VGIFGNFAQSDLFIGYDFDSQMV 366
F G K+ L F V G +C + D + G + + YD + +
Sbjct: 369 VFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHTKI 428
Query: 367 SFKPTDCTK 375
F T+C++
Sbjct: 429 GFWKTNCSE 437
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 166/383 (43%), Gaps = 47/383 (12%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK------QVKPIYNPASSSSY 75
G Y + +G P + + +DTGSD++WV C PC C Q++ +NP SSS+
Sbjct: 89 GLYFTRVKLGNPAK-EFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLES-FNPDSSSTA 146
Query: 76 KELSCQSEQC-------HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GN 124
++C ++C + S S C YT+ Y D S T G ++ + F GN
Sbjct: 147 SRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGN 206
Query: 125 SN--NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK--FSYC 177
N ++VFGC ++ +G + + G+ G G+ +LS+ SQ L+ LG + FS+C
Sbjct: 207 EQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQ-LNSLGVSPKVFSHC 265
Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSK 235
L I + G E+ G+V T LV + +Y + LE I+V L S
Sbjct: 266 LKGSDNGGGI---LVLG---EIVEPGLVYTPLVPSQ--PHYNLNLESIAVNGQKLPIDSS 317
Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK 295
L N+ G I +D+G L Y+ + A+ + GSQ
Sbjct: 318 LFTTSNTQGTI------VDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFIT 371
Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFI-PPPVEG--VFCFAMQPIDG-DVGIFGNFA 351
+ S+ P +T +F GG + + + + V+ ++C Q G ++ I G+
Sbjct: 372 SSSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLV 431
Query: 352 QSDLFIGYDFDSQMVSFKPTDCT 374
D YD + + + DC+
Sbjct: 432 LKDKIFVYDLANMRMGWADYDCS 454
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 166/383 (43%), Gaps = 47/383 (12%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK------QVKPIYNPASSSSY 75
G Y + +G P + + +DTGSD++WV C PC C Q++ +NP SSS+
Sbjct: 87 GLYFTRVKLGNPAK-EFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLES-FNPDSSSTA 144
Query: 76 KELSCQSEQC-------HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GN 124
++C ++C + S S C YT+ Y D S T G ++ + F GN
Sbjct: 145 SRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGN 204
Query: 125 SN--NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK--FSYC 177
N ++VFGC ++ +G + + G+ G G+ +LS+ SQ L+ LG + FS+C
Sbjct: 205 EQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQ-LNSLGVSPKVFSHC 263
Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSK 235
L I + G E+ G+V T LV + +Y + LE I+V L S
Sbjct: 264 LKGSDNGGGI---LVLG---EIVEPGLVYTPLVPSQ--PHYNLNLESIAVNGQKLPIDSS 315
Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK 295
L N+ G I +D+G L Y+ + A+ + GSQ
Sbjct: 316 LFTTSNTQGTI------VDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFIT 369
Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFI-PPPVEG--VFCFAMQPIDG-DVGIFGNFA 351
+ S+ P +T +F GG + + + + V+ ++C Q G ++ I G+
Sbjct: 370 SSSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLV 429
Query: 352 QSDLFIGYDFDSQMVSFKPTDCT 374
D YD + + + DC+
Sbjct: 430 LKDKIFVYDLANMRMGWADYDCS 452
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 113/408 (27%), Positives = 168/408 (41%), Gaps = 59/408 (14%)
Query: 10 NNVVQSNVSTAN-GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP---CVQC-YKQVK 64
N+V +S +S + G Y S GTP ++ I DTGS L+W C C +C + ++
Sbjct: 66 NSVFKSPLSPHSYGAYSTPLSFGTP-QQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKID 124
Query: 65 PI----YNPASSSSYKELSCQSEQCHLLDTVSCSSQ-QLCN------------YTYGYAD 107
P + P SSS K + CQ+ +C + SQ + CN Y Y
Sbjct: 125 PTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGS 184
Query: 108 SSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILS 167
S T G+L +E + F + N V GC + G+ G GR SL SQ
Sbjct: 185 GS-TAGLLLSETLDFPDKK--IPNFVVGCSFLSI----HQPSGIAGFGRGSESLPSQ--- 234
Query: 168 QLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVST------SLVSKEDKTYYFVT 221
+G KF+YCL D S S + + V G+ T S+ + K YY++
Sbjct: 235 -MGLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLN 293
Query: 222 LEGISVGNLSNSSKLIPY-YNSSGAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRN 276
+ I VG N + +PY + G G ID+G+ T + K E+Q+ N
Sbjct: 294 IRKIIVG---NQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLAN 350
Query: 277 AIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF 335
+ T + G + C+ + P L F GGAK L + F GV C
Sbjct: 351 WTRATDVET-LTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACL 409
Query: 336 AM---QPIDGDVG------IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ Q DG G I G F Q + ++ YD +Q + F+ C+
Sbjct: 410 TVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 166/383 (43%), Gaps = 47/383 (12%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK------QVKPIYNPASSSSY 75
G Y + +G P + + +DTGSD++WV C PC C Q++ +NP SSS+
Sbjct: 3 GLYFTRVKLGNPAK-EFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLES-FNPDSSSTA 60
Query: 76 KELSCQSEQC-------HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GN 124
++C ++C + S S C YT+ Y D S T G ++ + F GN
Sbjct: 61 SRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGN 120
Query: 125 SN--NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK--FSYC 177
N ++VFGC ++ +G + + G+ G G+ +LS+ SQ L+ LG + FS+C
Sbjct: 121 EQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQ-LNSLGVSPKVFSHC 179
Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSK 235
L I + G E+ G+V T LV + +Y + LE I+V L S
Sbjct: 180 LKGSDNGGGI---LVLG---EIVEPGLVYTPLVPSQ--PHYNLNLESIAVNGQKLPIDSS 231
Query: 236 LIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK 295
L N+ G I +D+G L Y+ + A+ + GSQ
Sbjct: 232 LFTTSNTQGTI------VDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFIT 285
Query: 296 TPSMAGIAPILTAHFDGGAKVPLIHTSTFI-PPPVEG--VFCFAMQPIDG-DVGIFGNFA 351
+ S+ P +T +F GG + + + + V+ ++C Q G ++ I G+
Sbjct: 286 SSSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLV 345
Query: 352 QSDLFIGYDFDSQMVSFKPTDCT 374
D YD + + + DC+
Sbjct: 346 LKDKIFVYDLANMRMGWADYDCS 368
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 101/381 (26%), Positives = 164/381 (43%), Gaps = 47/381 (12%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
G Y + IG+PP D + VDTGSD++WV C+ C C K+ +YNP SSS+
Sbjct: 71 GLYYARIGIGSPPN-DFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTST 129
Query: 77 ELSCQSEQCHLLDTV---SCSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNNFF 129
++C C C LC Y Y D S T G + I GN
Sbjct: 130 LITCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSE 189
Query: 130 DN--VVFGCGHNNTGVF---NENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHT 183
N +VFGCG +G +E G++G G+ S+ SQ+ + K F++CL
Sbjct: 190 TNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCL----- 244
Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
SI+ F G EV + +T +V ++ +Y V L G+ VG+ + L + S
Sbjct: 245 -DSISGGGIFAIG-EVVEPKLXNTPVVP--NQAHYNVVLNGVKVGDTALDLPLGLFETS- 299
Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQLCYK-TPSM 299
K ID+G LP+ Y L E++ A +KL D C+ ++
Sbjct: 300 ---YKRGAIIDSGTTLAYLPESIYLPLMEKILGAQPDLKLRTVDD----QFTCFVFDKNV 352
Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM-----QPIDG-DVGIFGNFAQS 353
P +T F+ + + I+ ++ + V+C Q DG +V + G+
Sbjct: 353 DDGFPTVTFKFE-ESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQ 411
Query: 354 DLFIGYDFDSQMVSFKPTDCT 374
+ + Y+ ++Q + + +C+
Sbjct: 412 NKLVYYNLENQTIGWTEYNCS 432
>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
Length = 557
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 101/385 (26%), Positives = 153/385 (39%), Gaps = 52/385 (13%)
Query: 21 NGEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSSY- 75
+G+Y IG PP LD VDTGSDL W+QC PC C K P+Y PA
Sbjct: 184 DGQYYTSIFIGNPPRPYFLD----VDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVP 239
Query: 76 -KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV-- 132
++L CQ Q + C + + C+Y YAD S + GVLA + + +N + +
Sbjct: 240 PRDLLCQELQG---NQNYCETCKQCDYEIEYADQSSSMGVLARDDMHMIATNGGREKLDF 296
Query: 133 VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSIT 188
VFGC ++ G + G++GL +S SQ+ S + AN F +C+ +
Sbjct: 297 VFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCIT---REQGGG 353
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
M+ G+ V GV TS+ S D Y+ + G+ A S
Sbjct: 354 GYMFLGD-DYVPRWGVTWTSIRSGPDNLYH-TQAHHVKYGDQQ-------LRRPEQAGST 404
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT-------PSMAG 301
+ D+G+ T LP + Y L ++ A LC+K +
Sbjct: 405 VQVIFDSGSSYTYLPNEIYENLVAAIKYASPGFVQDTSDRTLPLCWKADFPVRYLEDVKQ 464
Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPVE-------GVFCFAM----QPIDGDVGIFGNF 350
L HF K L + TF P + G C + + G I G+
Sbjct: 465 FFEPLNLHF---GKKWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDV 521
Query: 351 AQSDLFIGYDFDSQMVSFKPTDCTK 375
+ + YD + + + +DCTK
Sbjct: 522 SLRGKLVVYDNQRKQIGWADSDCTK 546
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 99/381 (25%), Positives = 161/381 (42%), Gaps = 41/381 (10%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-----YKQVKPIYNPASSSSYK 76
G Y + +G+PP D Y +DTGSD++WV C C C + ++P SS++
Sbjct: 82 GLYFTRVQLGSPPK-DFYVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLTFFDPGSSTTAA 140
Query: 77 ELSCQSEQC----HLLDTVSCSSQQLCNYTYGYADSSLTKGV-------LATERITFGN- 124
+SC ++C D++ S C YT+ Y D S T G L T ++ G
Sbjct: 141 LVSCSDQRCTAGIQSSDSLCSSRTNQCGYTFQYGDGSGTSGYYVADLMHLDTLLLSSGEL 200
Query: 125 ---SNNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYC 177
+ +V F C TG +++ G+ G G+ +S+ SQ+ SQ + FS+C
Sbjct: 201 SQICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITPRVFSHC 260
Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLI 237
L D S + G E+ +V T LV + +Y + L+ ISV + +
Sbjct: 261 L---KGDDSGGGVLVLG---EIVEPNIVYTPLVPSQP--HYNLYLQSISVAG--QTLAID 310
Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTP 297
P + GA S +D+G L + Y+ + + + L G+Q T
Sbjct: 311 P--SVFGASSNQGTIVDSGTTLAYLAEGAYDPFVSAITSVVSLNARTYLSKGNQCYLVTS 368
Query: 298 SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDG-DVGIFGNFAQS 353
S+ + P ++ +F GGA + L + G V+C Q G + I G+
Sbjct: 369 SVNDVFPQVSLNFAGGASLILNPQDYLLQQNSVGGAAVWCVGFQKTPGQQITILGDLVLK 428
Query: 354 DLFIGYDFDSQMVSFKPTDCT 374
D YD +Q V + DC+
Sbjct: 429 DKIFVYDIANQRVGWTNYDCS 449
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 113/408 (27%), Positives = 168/408 (41%), Gaps = 59/408 (14%)
Query: 10 NNVVQSNVSTAN-GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP---CVQC-YKQVK 64
N+V +S +S + G Y S GTP ++ I DTGS L+W C C +C + ++
Sbjct: 66 NSVFKSPLSPHSYGAYSTPLSFGTP-QQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKID 124
Query: 65 PI----YNPASSSSYKELSCQSEQCHLLDTVSCSSQ-QLCN------------YTYGYAD 107
P + P SSS K + CQ+ +C + SQ + CN Y Y
Sbjct: 125 PTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGS 184
Query: 108 SSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILS 167
S T G+L +E + F + N V GC + G+ G GR SL SQ
Sbjct: 185 GS-TAGLLLSETLDF--PDKXIPNFVVGCSFLSI----HQPSGIAGFGRGSESLPSQ--- 234
Query: 168 QLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVST------SLVSKEDKTYYFVT 221
+G KF+YCL D S S + + V G+ T S+ + K YY++
Sbjct: 235 -MGLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLN 293
Query: 222 LEGISVGNLSNSSKLIPY-YNSSGAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRN 276
+ I VG N + +PY + G G ID+G+ T + K E+Q+ N
Sbjct: 294 IRKIIVG---NQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLAN 350
Query: 277 AIKLTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF 335
+ T + G + C+ + P L F GGAK L + F GV C
Sbjct: 351 WTRATDVET-LTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACL 409
Query: 336 AM---QPIDGDVG------IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ Q DG G I G F Q + ++ YD +Q + F+ C+
Sbjct: 410 TVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457
>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
Length = 573
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 103/383 (26%), Positives = 162/383 (42%), Gaps = 48/383 (12%)
Query: 21 NGEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSSY- 75
+G+Y +G PP LD VDTGSDL W+QC PC C K P+Y PA
Sbjct: 200 DGQYYTSIFVGNPPRPYFLD----VDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVP 255
Query: 76 -KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV-- 132
K+L CQ Q + C + + C+Y YAD S + GVLA + + +N + +
Sbjct: 256 PKDLLCQELQGN---QNYCETCKQCDYEIEYADRSSSMGVLARDDMHIITTNGGREKLDF 312
Query: 133 VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSIT 188
VFGC ++ G + G++GL +SL SQ+ +Q + +N F +C+ D +
Sbjct: 313 VFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCIT---RDPNGG 369
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
M+ G+ V G+ ST + S D ++ + + G+ S + +SG +
Sbjct: 370 GYMFLGD-DYVPRWGMTSTPIRSAPDNLFH-TEAQKVYYGDQQLSMR-----GASG--NS 420
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT-------PSMAG 301
+ D+G+ T LP + Y L ++ A LC T +
Sbjct: 421 VQVIFDSGSSYTYLPDEIYKNLIAAIKYAYPNFVQDSSDRTLPLCLATDFPVRYLEDVKQ 480
Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPV-----EGVFCFAM---QPID-GDVGIFGNFAQ 352
+ L HF G + T T +P +G C + ID G I G+ A
Sbjct: 481 LFKPLNLHF-GKRWFVMPRTFTILPDNYLIISDKGNVCLGFLNGKDIDHGSTVIVGDNAL 539
Query: 353 SDLFIGYDFDSQMVSFKPTDCTK 375
+ YD + + + +DCTK
Sbjct: 540 RGKLVVYDNQQRQIGWTNSDCTK 562
>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
Length = 574
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 103/383 (26%), Positives = 162/383 (42%), Gaps = 48/383 (12%)
Query: 21 NGEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSSY- 75
+G+Y +G PP LD VDTGSDL W+QC PC C K P+Y PA
Sbjct: 201 DGQYYTSIFVGNPPRPYFLD----VDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVP 256
Query: 76 -KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV-- 132
K+L CQ Q + C + + C+Y YAD S + GVLA + + +N + +
Sbjct: 257 PKDLLCQELQGN---QNYCETCKQCDYEIEYADRSSSMGVLARDDMHIITTNGGREKLDF 313
Query: 133 VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSIT 188
VFGC ++ G + G++GL +SL SQ+ +Q + +N F +C+ D +
Sbjct: 314 VFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCIT---RDPNGG 370
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
M+ G+ V G+ ST + S D ++ + + G+ S + +SG +
Sbjct: 371 GYMFLGD-DYVPRWGMTSTPIRSAPDNLFH-TEAQKVYYGDQQLSMR-----GASG--NS 421
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT-------PSMAG 301
+ D+G+ T LP + Y L ++ A LC T +
Sbjct: 422 VQVIFDSGSSYTYLPDEIYKNLIAAIKYAYPNFVQDSSDRTLPLCLATDFPVRYLEDVKQ 481
Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPV-----EGVFCFAM---QPID-GDVGIFGNFAQ 352
+ L HF G + T T +P +G C + ID G I G+ A
Sbjct: 482 LFKPLNLHF-GKRWFVMPRTFTILPDNYLIISDKGNVCLGFLNGKDIDHGSTVIVGDNAL 540
Query: 353 SDLFIGYDFDSQMVSFKPTDCTK 375
+ YD + + + +DCTK
Sbjct: 541 RGKLVVYDNQQRQIGWTNSDCTK 563
>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
Length = 473
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 99/387 (25%), Positives = 169/387 (43%), Gaps = 53/387 (13%)
Query: 21 NGEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSS--S 74
NG Y +G+PP LD+ DTGSDL W+QC PC C K P+Y P +
Sbjct: 98 NGLYFTHIFVGSPPRRYFLDM----DTGSDLTWIQCDAPCTSCAKGPNPLYKPKKGNLVP 153
Query: 75 YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN--V 132
K+ C Q +L T C + + C+Y YAD S + GVLA++ + +N +
Sbjct: 154 LKDSLCVEVQRNL-KTGYCETCEQCDYEIEYADHSSSMGVLASDDLHLMLANGSLTKLGI 212
Query: 133 VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSIT 188
+FGC ++ G+ + G++GL + ++SL SQ+ SQ + N +CL +D++
Sbjct: 213 MFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLT---SDATGG 269
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
M+ G+ V G+ +++ Y+ ++ +S+ S+ + G +
Sbjct: 270 GYMFLGD-DFVPYWGMAWVPMLNSHSPNYHSQIMK------ISHGSRQLSLGRQDGRTER 322
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYKTP----SMAG 301
+ DTG+ T PK+ Y L +++ Q DP L +C++ S+
Sbjct: 323 --VVFDTGSSYTYFPKEAYYALVASLKDVSDEGLIQDGSDPTL--PVCWRAKFPIRSVID 378
Query: 302 IAPI---LTAHFDGGAKVPLIHTSTFIPPP------VEGVFCFAM----QPIDGDVGIFG 348
+ LT F +K ++ T IPP +G C + DG I G
Sbjct: 379 VKQFFQPLTLQFR--SKWWIVSTKFRIPPEGYLIISNKGNVCLGILDGSNVHDGSTIILG 436
Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDCTK 375
+ + + YD +Q + + + C K
Sbjct: 437 DISLRGKLVVYDNVNQKIGWAQSTCVK 463
>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
Length = 388
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 81/271 (29%), Positives = 120/271 (44%), Gaps = 32/271 (11%)
Query: 16 NVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI-----YNPA 70
N+ G Y IGTP + Y +DTGS WV + C QC + + Y+P
Sbjct: 75 NIPYGTGLYYTDIGIGTPAV-KYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPR 133
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF------GN 124
SS S KE+ C C C+ C Y GYAD LT G+L T+ + + G
Sbjct: 134 SSVSSKEVKCDDTIC--TSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQ 191
Query: 125 SNNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK--FSYCLV 179
+ +V FGCG +G N + + G++G G + + SQ L+ G K FS+CL
Sbjct: 192 TQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQ-LAAAGKTKKIFSHCL- 249
Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPY 239
S F G V V T+ + K ++ Y+ V L+ I N++ ++ +P
Sbjct: 250 -----DSTNGGGIFAIGEVVEPK--VKTTPIVKNNEVYHLVNLKSI---NVAGTTLQLP- 298
Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRL 270
N G FID+G+ LP+ Y+ L
Sbjct: 299 ANIFGTTKTKGTFIDSGSTLVYLPEIIYSEL 329
>gi|224035171|gb|ACN36661.1| unknown [Zea mays]
Length = 378
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 101/370 (27%), Positives = 136/370 (36%), Gaps = 73/370 (19%)
Query: 54 LPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD--TVSCSSQQLCN-YTYGYADSSL 110
+PC P+ + A +S+ C +C L D T SC + C Y Y D SL
Sbjct: 24 IPCAS------PLCSAAHASAPPSDLCAVARCPLEDIETGSCGASHACPPLYYAYGDGSL 77
Query: 111 TKGVLATERITFGNSNNF-----FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQI 165
L R+ G DN F C H G +G+ G GR LSL Q+
Sbjct: 78 VAH-LRRGRVALGAGARASVAVAVDNFTFACAHTALG----EPVGVAGFGRGPLSLPGQL 132
Query: 166 LSQLGANKFSYCLVP--FHTDSSIT-SKMYFGNG-----SEVSGGGVVSTSLVSKEDKTY 217
QL + +FSYCLV F D I S + G + G V T L+ Y
Sbjct: 133 SPQL-SGRFSYCLVSHSFRADRLIRPSPLILGRSPDDAAAAAETDGFVYTPLLHNPKHPY 191
Query: 218 YF-VTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLE----- 271
++ V LE +SVG ++ P G M +D+G T+LP + Y R+
Sbjct: 192 FYSVALEAVSVGAARIQAR--PELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFAR 249
Query: 272 ----------EQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHT 321
E+ LTP CY+ + P L HF G A V L
Sbjct: 250 AMAAAGFARAERAEEQTGLTP----------CYRYAASDRGVPPLALHFRGNATVALPRR 299
Query: 322 STFIPPPVEG---------VFCFAM--------QPIDGDVGIFGNFAQSDLFIGYDFDSQ 364
+ F+ E V C + + DG G GNF Q + YD D+
Sbjct: 300 NYFMGFKSEDAGAGTRKDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAG 359
Query: 365 MVSFKPTDCT 374
V F CT
Sbjct: 360 RVGFARRRCT 369
>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
Length = 287
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 86/289 (29%), Positives = 125/289 (43%), Gaps = 19/289 (6%)
Query: 90 TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNENEM 149
T CS C Y Y D S T G A + +T +S++ FGCG N G+F E
Sbjct: 13 TRGCSGGH-CLYGVQYGDGSYTIGFFAMDTLTL-SSHDAIKGFRFGCGERNEGLFGE-AA 69
Query: 150 GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTS- 208
GL+GLGR + SL Q + G F++C F SS T + FG GS + +ST+
Sbjct: 70 GLLGLGRGKTSLPVQTYDKYG-GVFAHC---FPARSSGTGYLEFGPGSSPAVSAKLSTTP 125
Query: 209 LVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYN 268
++ T+Y+V + GI VG KL+P S + +D+G T LP Y+
Sbjct: 126 MLIDTGPTFYYVGMTGIRVGG-----KLLPIPQS--VFAAAGTIVDSGTVITRLPPAAYS 178
Query: 269 RLEEQVRNAIKLTPYQDPRLGSQL--CYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFI 325
L ++ Y+ S L CY + +A P ++ F GG + + +
Sbjct: 179 SLRSAFAASMAARGYKRAPALSLLDTCYDLTGASEVAIPTVSLLFQGGVSLDVDASGIIY 238
Query: 326 PPPV-EGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
V + FA DV I GN + YD S++V F P C
Sbjct: 239 AASVSQACLGFAGNEAADDVAIVGNTQLKTFGVVYDIASKVVGFCPGAC 287
>gi|357128791|ref|XP_003566053.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 441
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 117/436 (26%), Positives = 174/436 (39%), Gaps = 95/436 (21%)
Query: 17 VSTANGEYVMKFSIGTPP-LLDIYGIVDTGSDLMWV--------QCLPCVQCYKQVKPIY 67
++T Y++ ++GTPP + +Y +DTGSDL WV QCL C + KP
Sbjct: 18 IATYTDGYLLSLNLGTPPQVFQVY--LDTGSDLTWVPCGTNTSYQCLECGNEHSISKP-- 73
Query: 68 NPA-----SSSSYKEL-----------------SCQSEQCHLLDTVSCSSQQLCN-YTYG 104
PA S SS ++L +C + C + +S +LC + Y
Sbjct: 74 TPAFSLSQSYSSTRDLCGSRFCVDVHSSDNSHDACAAAGCSIPVFMSGLCTRLCPPFAYT 133
Query: 105 YADSSLTKGVLATERITFGNS------NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTR 158
Y +L G LA + I S F FGC G +G+ G G+ +
Sbjct: 134 YGGRALVLGSLARDTIALHGSIYGISVPIEFPGFCFGC----VGSSIREPIGIAGFGKGK 189
Query: 159 LSLASQILSQLGANKFSYCLVPFH--TDSSITSKMYFGN-GSEVSGGGVVSTSLVSKEDK 215
LSL SQ+ FS+C + F + +ITS M G+ V G + + L S
Sbjct: 190 LSLPSQL--GFLDKGFSHCFLGFWFARNPNITSPMVIGDLALSVKDGFLFTPMLKSLTYP 247
Query: 216 TYYFVTLEGISVGNLSNSSKLIPYYNS-SGAISKGN--MFIDTGAPPTLLPKDFYNRLEE 272
+Y++ LEG+++G+ + IP S SG S+GN + +DTG T L FY
Sbjct: 248 NFYYIGLEGVTIGD----NAAIPAPPSLSGIDSEGNGGVIVDTGTTYTHLSDPFY---AS 300
Query: 273 QVRNAIKLTPYQ-----DPRLGSQLCYKTPSMAGIA-----PILTAHFDGGAKVPLIHTS 322
+ + PY + R G LC K P M P +T H G + L S
Sbjct: 301 VLSSLSSTVPYNRSYELEIRTGFDLCLKVPCMHAPCNDDELPPITVHLGGDVTLALPKES 360
Query: 323 TF--IPPPVEGVF--CFAMQPIDGD--------------------VGIFGNFAQSDLFIG 358
+ + P V C Q D D + G+F ++ +
Sbjct: 361 CYYAVTAPRNSVVIKCLLFQRKDDDGVFSADNDDGEDASFSAGGPAAVLGSFQMQNVEVV 420
Query: 359 YDFDSQMVSFKPTDCT 374
YD +S V F+P DC
Sbjct: 421 YDLESGRVGFQPRDCA 436
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 96/383 (25%), Positives = 162/383 (42%), Gaps = 53/383 (13%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC----------YKQVKPIYNPA 70
NG Y + IGTP + IVD+GS + +V C C QC + P + P
Sbjct: 89 NGYYTTRLYIGTPSQ-EFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPD 147
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF- 129
SS+Y + C +D + + C Y YA+ S + GVL + ++FG +
Sbjct: 148 LSSTYSPVKCN------VDCTCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKP 201
Query: 130 DNVVFGCGHNNTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSI 187
VFGC + TG +F+++ G++GLGR +LS+ Q++ + + ++ FS C
Sbjct: 202 QRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCY--------- 252
Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIP-YYNS 242
M G G+ V GG +V YY + L+ I V + +L P +N
Sbjct: 253 -GGMDVGGGTMVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVA--GKALRLDPKIFN- 308
Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCY----- 294
SK +D+G LP+ + ++ V N++K DP +C+
Sbjct: 309 ----SKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNY-KDICFAGAGR 363
Query: 295 KTPSMAGIAPILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQ 352
++ + P + F G K+ L F VEG +C + Q + G
Sbjct: 364 NVSQLSEVFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVV 423
Query: 353 SDLFIGYDFDSQMVSFKPTDCTK 375
+ + YD ++ + F T+C++
Sbjct: 424 RNTLVTYDRHNEKIGFWKTNCSE 446
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 80/261 (30%), Positives = 129/261 (49%), Gaps = 22/261 (8%)
Query: 24 YVMKFSIGTP--PLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
Y+++ +IGTP P+L +DT +D WV C CV C V +++P+ SSS + L C
Sbjct: 91 YIVRANIGTPAQPMLVA---LDTSNDAAWVPCSGCVGCASSV--LFDPSKSSSSRNLQCD 145
Query: 82 SEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNT 141
+ QC +C++ + C + Y S++ + L + +T +N+ + FGC T
Sbjct: 146 APQCKQAPNPTCTAGKSCGFNMTYGGSTI-EASLTQDTLTL--ANDVIKSYTFGCISKAT 202
Query: 142 GVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSG 201
G + GL+GLGR LSL SQ L + FSYCL P S+ + + G +
Sbjct: 203 GT-SLPAQGLMGLGRGPLSLISQT-QNLYMSTFSYCL-PNSKSSNFSGSLRL--GPKYQP 257
Query: 202 GGVVSTSLVSKEDK-TYYFVTLEGISVGN--LSNSSKLIPYYNSSGAISKGNMFIDTGAP 258
+ +T L+ + + Y+V L GI VGN + + + + S+GA G +F D+G
Sbjct: 258 VRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGA---GTIF-DSGTV 313
Query: 259 PTLLPKDFYNRLEEQVRNAIK 279
T L + Y + + R IK
Sbjct: 314 FTRLVEPAYVAVRNEFRRRIK 334
>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 470
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 102/396 (25%), Positives = 170/396 (42%), Gaps = 53/396 (13%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP---CVQC-YKQVKPI----YNPASSS 73
G Y + ++GTPP + ++DTGS L+W C C C + + P + P +SS
Sbjct: 86 GGYSIDLNLGTPPQTSPF-VLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIPTFIPKNSS 144
Query: 74 SYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLT-------KGVLATERITFGNSN 126
+ K L C++ +C L S+ G + SLT G+ AT ++
Sbjct: 145 TAKLLGCRNPKCGYLFGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGATAGFLLLDNL 204
Query: 127 NFFDNVV--FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
NF V F G + + G+ G GR + SL SQ + +FSYCLV D
Sbjct: 205 NFPGKTVPQFLVGCSILSI--RQPSGIAGFGRGQESLPSQ----MNLKRFSYCLVSHRFD 258
Query: 185 SSITSK---MYFGNGSEVSGGGVVSTSLVSKED-----KTYYFVTLEGISVGNLSNSSKL 236
+ S + + + G+ T S + YY+VTL + VG +
Sbjct: 259 DTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRKLIVGGVDVK--- 315
Query: 237 IPY-YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--- 292
IPY + G+ G +D+G+ T + + YN + ++ + ++ + +Q
Sbjct: 316 IPYKFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSREENVEAQSGLS 375
Query: 293 -CYKTPSMAGIA-PILTAHFDGGAKV--PLIHTSTFIPPPVEGVFCF-------AMQP-I 340
C+ + I+ P T F GGAK+ PL++ +F+ V CF A QP
Sbjct: 376 PCFNISGVKTISFPEFTFQFKGGAKMSQPLLNYFSFVGD--AEVLCFTVVSDGGAGQPKT 433
Query: 341 DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTKQ 376
G I GN+ Q + ++ YD +++ F P +C ++
Sbjct: 434 AGPAIILGNYQQQNFYVEYDLENERFGFGPRNCKRK 469
>gi|25347778|pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
Length = 473
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 100/376 (26%), Positives = 163/376 (43%), Gaps = 57/376 (15%)
Query: 29 SIGTPPLLDIYGI-VDTGSDLMWVQCLPCVQCYKQVKP---------IYNPASSSSYKEL 78
++GTP D + + +DTGSDL W+ C C C +++K IY+P +SS+ ++
Sbjct: 60 TVGTPS--DWFMVALDTGSDLFWLPC-DCTNCVRELKAPGGSSLDLNIYSPNASSTSTKV 116
Query: 79 SCQSEQCHLLDTVSCSSQQLCNYTYGY-ADSSLTKGVLATERITF----GNSNNFFDNVV 133
C S C D + S + C Y Y ++ + + GVL + + +S V
Sbjct: 117 PCNSTLCTRGDRCA-SPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVT 175
Query: 134 FGCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSK 190
FGCG TGVF++ GL GLG +S+ S + + + AN FS C F D + +
Sbjct: 176 FGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMC---FGNDGA--GR 230
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
+ FG+ V T L ++ Y +T+ ISVG ++G +
Sbjct: 231 ISFGDKGSVDQR---ETPLNIRQPHPTYNITVTKISVGG------------NTGDLEFDA 275
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ--DPRLGSQLCY--KTPSMAGIA--- 303
+F D+G T L Y + E + YQ D L + CY + P +G
Sbjct: 276 VF-DSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALRLPLYSGHHHPN 334
Query: 304 ------PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFI 357
P + GG+ P+ H IP V+C A+ I+ D+ I G + +
Sbjct: 335 KDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYCLAIMKIE-DISIIGQNFMTGYRV 393
Query: 358 GYDFDSQMVSFKPTDC 373
+D + ++ +K +DC
Sbjct: 394 VFDREKLILGWKESDC 409
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 96/383 (25%), Positives = 162/383 (42%), Gaps = 53/383 (13%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC----------YKQVKPIYNPA 70
NG Y + IGTP + IVD+GS + +V C C QC + P + P
Sbjct: 88 NGYYTTRLYIGTPSQ-EFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPD 146
Query: 71 SSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF- 129
SS+Y + C +D + + C Y YA+ S + GVL + ++FG +
Sbjct: 147 LSSTYSPVKCN------VDCTCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKP 200
Query: 130 DNVVFGCGHNNTG-VFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSI 187
VFGC + TG +F+++ G++GLGR +LS+ Q++ + + ++ FS C
Sbjct: 201 QRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCY--------- 251
Query: 188 TSKMYFGNGSEVSGGGVVSTSLVSKEDK----TYYFVTLEGISVGNLSNSSKLIP-YYNS 242
M G G+ V GG +V YY + L+ I V + +L P +N
Sbjct: 252 -GGMDVGGGTMVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVA--GKALRLDPKIFN- 307
Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCY----- 294
SK +D+G LP+ + ++ V N++K DP +C+
Sbjct: 308 ----SKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNY-KDICFAGAGR 362
Query: 295 KTPSMAGIAPILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQ 352
++ + P + F G K+ L F VEG +C + Q + G
Sbjct: 363 NVSQLSEVFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVV 422
Query: 353 SDLFIGYDFDSQMVSFKPTDCTK 375
+ + YD ++ + F T+C++
Sbjct: 423 RNTLVTYDRHNEKIGFWKTNCSE 445
>gi|145324889|ref|NP_001077691.1| aspartyl protease [Arabidopsis thaliana]
gi|332194268|gb|AEE32389.1| aspartyl protease [Arabidopsis thaliana]
Length = 410
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 94/404 (23%), Positives = 170/404 (42%), Gaps = 45/404 (11%)
Query: 1 MSPATYFYPNNVVQSNVSTANGEYVMKFSIGTPPLLDIYGI-VDTGSDLMWVQC-LPCVQ 58
++P F+P+ V+ + Y + +G P Y + +DTGS+L W+QC PC
Sbjct: 10 LTPPLRFFPSVVMCIQMGML---YYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTS 66
Query: 59 CYKQVKPIYNPASSSSYK--ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLA 116
C K +Y P + + E C Q + L T C + C+Y YAD S + GVL
Sbjct: 67 CAKGANQLYKPRKDNLVRSSEAFCVEVQRNQL-TEHCENCHQCDYEIEYADHSYSMGVLT 125
Query: 117 TER--ITFGNSNNFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LG 170
++ + N + ++VFGCG++ G+ + G++GL R ++SL SQ+ S+ +
Sbjct: 126 KDKFHLKLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGII 185
Query: 171 ANKFSYCLVPFHTDSSITSKMYFGNGSE-VSGGGVVSTSLVSKEDKTYYFVTLEGISVGN 229
+N +CL S + + Y GS+ V G+ ++ Y + + +S G
Sbjct: 186 SNVVGHCLA-----SDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQ 240
Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
++ +G + G + DTG+ T P Y++L ++ L +D
Sbjct: 241 -----GMLSLDGENGRV--GKVLFDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDE 293
Query: 290 SQ-LCYKTP------SMAGIAPILT-AHFDGGAKVPLIHTSTFIPPPV------EGVFCF 335
+ +C++ S++ + G+K +I I P +G C
Sbjct: 294 TLPICWRAKTNFPFSSLSDVKKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNKGNVCL 353
Query: 336 AM----QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
+ DG I G+ + I YD + + + +DC +
Sbjct: 354 GILDGSSVHDGSTIILGDISMRGHLIVYDNVKRRIGWMKSDCVR 397
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 84/272 (30%), Positives = 128/272 (47%), Gaps = 35/272 (12%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-YKQVKPI----YNPASSSS 74
A G Y + S+GTPP Y VDTGS++ WV+C PC C + P+ ++P S++
Sbjct: 37 AMGLYYTRISLGTPPQ-QFYVDVDTGSNVAWVKCAPCTGCEHSGDVPVPMSTFDPRKSTT 95
Query: 75 YKELSCQSEQCHLLD-TVSCSSQQL-CNYTYGYADSSLTKGVLATERITFGN-------S 125
+SC +C +L+ + CS ++L C Y+ Y D S T G + TF +
Sbjct: 96 KISISCTDAECGVLNKKLQCSPERLSCPYSLLYGDGSSTAGYYLNDVFTFNQVPSDNSTA 155
Query: 126 NNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTD 184
+ +VFGCG TG ++ + GL+G G T +SL +Q+ Q + N F++CL D
Sbjct: 156 KSGTARLVFGCGGTQTGSWSVD--GLLGFGPTTVSLPNQLAQQNISVNIFAHCL---QGD 210
Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLE--GISVGNLSNSSKLIPYYNS 242
S + G E +V T +V ED +Y V L GIS N++ + Y
Sbjct: 211 VSGRGSLVIGTIREPD---LVYTPMVFGED--HYNVQLLNIGISGRNVTTPASFDLEYT- 264
Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV 274
G + ID+G T L + Y+ V
Sbjct: 265 ------GGVIIDSGTTLTYLVQPAYDEFRRGV 290
>gi|340810907|gb|AEK75380.1| S5 [Oryza sativa]
Length = 472
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 102/382 (26%), Positives = 161/382 (42%), Gaps = 55/382 (14%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELS 79
++M S+G PP++++ I DTGS L WVQC PC V C+ Q PI++P S + + +
Sbjct: 114 FLMAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVR 172
Query: 80 CQSEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDN 131
C S +C L +C ++ C Y+ Y + + + G + T+ + G+S F +
Sbjct: 173 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMD 229
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSIT 188
++FGC + ++E E G+ G G + S Q+ L FSYCL TD +
Sbjct: 230 LMFGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL---PTDETKP 284
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
M G + G T L ++ Y +T+E + ++N +L+ S
Sbjct: 285 GYMILGRYDRAAMDGGY-TPLFRSINRPTYSLTMEML----IANGQRLV--------TSS 331
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK---------- 295
M +D+GA T L + L++ + A+ Y R S +CY
Sbjct: 332 SEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNG 391
Query: 296 --TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFA 351
TP S P+L F GGA + L + F P G+ FA P I GN
Sbjct: 392 TITPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRV 450
Query: 352 QSDLFIGYDFDSQMVSFKPTDC 373
+D + FK C
Sbjct: 451 TRSFGTTFDIQGKQFGFKYAAC 472
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 101/381 (26%), Positives = 163/381 (42%), Gaps = 47/381 (12%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
G Y + IG+PP D + VDTGSD++WV C+ C C K+ +YNP SSS+
Sbjct: 71 GLYYARIGIGSPPN-DFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTST 129
Query: 77 ELSCQSEQCHLLDTV---SCSSQQLCNYTYGYADSSLTKGVLATERITF----GNSNNFF 129
++C C C LC Y Y D S T G + I GN
Sbjct: 130 LITCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSE 189
Query: 130 DN--VVFGCGHNNTGVF---NENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHT 183
N +VFGCG +G +E G++G G+ S+ SQ+ + K F++CL
Sbjct: 190 TNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCL----- 244
Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSS 243
SI+ F G EV + +T +V ++ +Y V L G+ VG+ + L + S
Sbjct: 245 -DSISGGGIFAIG-EVVEPKLKTTPVVP--NQAHYNVVLNGVKVGDTALDLPLGLFETS- 299
Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQLCYK-TPSM 299
K ID+G LP Y L E++ A +KL D C+ ++
Sbjct: 300 ---YKRGAIIDSGTTLAYLPDSIYLPLMEKILGAQPDLKLRTVDD----QFTCFVFDKNV 352
Query: 300 AGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM-----QPIDG-DVGIFGNFAQS 353
P +T F+ + + I+ ++ + V+C Q DG +V + G+
Sbjct: 353 DDGFPTVTFKFE-ESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQ 411
Query: 354 DLFIGYDFDSQMVSFKPTDCT 374
+ + Y+ ++Q + + +C+
Sbjct: 412 NKLVYYNLENQTIGWTEYNCS 432
>gi|340810987|gb|AEK75420.1| S5 [Oryza rufipogon]
gi|340810989|gb|AEK75421.1| S5 [Oryza rufipogon]
gi|340810991|gb|AEK75422.1| S5 [Oryza rufipogon]
gi|340811001|gb|AEK75427.1| S5 [Oryza rufipogon]
gi|340811019|gb|AEK75436.1| S5 [Oryza rufipogon]
gi|340811104|gb|AEK75478.1| S5 [Oryza rufipogon]
gi|340811124|gb|AEK75488.1| S5 [Oryza rufipogon]
Length = 472
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 102/382 (26%), Positives = 161/382 (42%), Gaps = 55/382 (14%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELS 79
++M S+G PP++++ I DTGS L WVQC PC V C+ Q PI++P S + + +
Sbjct: 114 FLMAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVR 172
Query: 80 CQSEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDN 131
C S +C L +C ++ C Y+ Y + + + G + T+ + G+S F +
Sbjct: 173 CSSVKCGELRYDLRLQQANCMEKENSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMD 229
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSIT 188
++FGC + ++E E G+ G G + S Q+ L FSYCL TD +
Sbjct: 230 LMFGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL---PTDETKP 284
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
M G + G T L ++ Y +T+E + ++N +L+ S
Sbjct: 285 GYMILGRYDRAAMDGGY-TPLFRSINRPTYSLTMEML----IANGQRLV--------TSS 331
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK---------- 295
M +D+GA T L + L++ + A+ Y R S +CY
Sbjct: 332 SEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNG 391
Query: 296 --TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFA 351
TP S P+L F GGA + L + F P G+ FA P I GN
Sbjct: 392 TITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRV 450
Query: 352 QSDLFIGYDFDSQMVSFKPTDC 373
+D + FK C
Sbjct: 451 TRSFGTTFDIQGKQFGFKYAAC 472
>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 686
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 100/385 (25%), Positives = 166/385 (43%), Gaps = 49/385 (12%)
Query: 21 NGEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSS--S 74
NG Y +G+PP LD+ DTGSDL W+QC PC C K P+Y P +
Sbjct: 311 NGLYFTHIFVGSPPRRYFLDM----DTGSDLTWIQCDAPCTSCAKGPNPLYKPKKGNLVP 366
Query: 75 YKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN--V 132
K+ C Q +L T C + + C+Y YAD S + GVLA++ + +N +
Sbjct: 367 LKDSLCVEVQRNL-KTGYCETCEQCDYEIEYADHSSSMGVLASDDLHLMLANGSLTKLGI 425
Query: 133 VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSIT 188
+FGC ++ G+ + G++GL + ++SL SQ+ SQ + N +CL +D++
Sbjct: 426 MFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLT---SDATGG 482
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
M+ G+ V G+ +++ Y+ ++ +S+ S+ + G +
Sbjct: 483 GYMFLGD-DFVPYWGMAWVPMLNSHSPNYHSQIMK------ISHGSRQLSLGRQDGRTER 535
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYKTPSMAGIA-- 303
+ DTG+ T PK+ Y L +++ Q DP L K P + I
Sbjct: 536 --VVFDTGSSYTYFPKEAYYALVASLKDVSDEGLIQDGSDPTLPVCWRAKFPIRSVIDVK 593
Query: 304 ---PILTAHFDGGAKVPLIHTSTFIPPP------VEGVFCFAM----QPIDGDVGIFGNF 350
LT F +K ++ T IPP +G C + DG I G+
Sbjct: 594 QFFQPLTLQFR--SKWWIVSTKFRIPPEGYLIISNKGNVCLGILDGSNVHDGSTIILGDI 651
Query: 351 AQSDLFIGYDFDSQMVSFKPTDCTK 375
+ + YD +Q + + + C K
Sbjct: 652 SLRGKLVVYDNVNQKIGWAQSTCVK 676
>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 507
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 93/330 (28%), Positives = 144/330 (43%), Gaps = 29/330 (8%)
Query: 64 KPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCN-------YTYGYADSSLTKGVLA 116
K ++ P S S++ ++C S++C + D S LC Y YAD S KG
Sbjct: 188 KGVFCPHRSKSFQAVTCASQKCKI-DLSQLFSLSLCPKPSDPCLYDISYADGSSAKGFFG 246
Query: 117 TERITFGNSN---NFFDNVVFGCGHN-NTGV-FNENEMGLVGLGRTRLSLASQILSQLGA 171
T+ IT N +N+ GC + GV FNE+ G++GLG + S + + GA
Sbjct: 247 TDTITVDLKNGKEGKLNNLTIGCTKSMENGVNFNEDTGGILGLGFAKDSFIDKAAYEYGA 306
Query: 172 NKFSYCLVPFHTDSSITSKMYFGNGSEVS-GGGVVSTSLVSKEDKTYYFVTLEGISVGNL 230
KFSYCLV + +++S + G G + T L+ +Y V + GIS+G
Sbjct: 307 -KFSYCLVDHLSHRNVSSYLTIGGHHNAKLLGEIKRTELILFP--PFYGVNVVGISIG-- 361
Query: 231 SNSSKLIPYYNSSGAISKGNMFIDTGAPPT-LLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
K+ P S+G ID+G T LL + E +++ K+ G
Sbjct: 362 GQMLKIPPQVWDFN--SQGGTLIDSGTTLTALLVPAYEPVFEALIKSLTKVKRVTGEDFG 419
Query: 290 S-QLCYKTPSM-AGIAPILTAHFDGGAKV-PLIHTSTFIPPPVEGVFCFAMQPIDGDVG- 345
+ C+ + P L HF GGA+ P + + P+ V C + PIDG G
Sbjct: 420 ALDFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAPL--VKCIGIVPIDGIGGA 477
Query: 346 -IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ GN Q + +D + + F P+ CT
Sbjct: 478 SVIGNIMQQNHLWEFDLSTNTIGFAPSICT 507
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 100/398 (25%), Positives = 171/398 (42%), Gaps = 77/398 (19%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSS 74
A G Y K IGTPP + Y VDTGSD+MWV C+ C +C + +Y+ SSS
Sbjct: 79 AVGLYYAKIGIGTPPK-NYYLQVDTGSDIMWVNCIQCKECPTRSSLGMDLTLYDIKESSS 137
Query: 75 YKELSCQSEQCHLLD---TVSCSSQQLCNYTYGYADSSLTKGVLATERITFG------NS 125
K + C E C ++ C++ C Y Y D S T G + + + +
Sbjct: 138 GKLVPCDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKT 197
Query: 126 NNFFDNVVFGCGHNNTGVF---NENEM-GLVGLGRTRLSLASQILSQLGANK-FSYCLVP 180
++ ++VFGCG +G NE + G++G G+ S+ SQ+ S K F++CL
Sbjct: 198 DSANGSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCL-- 255
Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGN-- 229
NG V+GGG+ + V + D+ +Y V + + VG+
Sbjct: 256 --------------NG--VNGGGIFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHTF 299
Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG 289
LS S+ ++S + ID+G LP+ Y L ++ + Q P L
Sbjct: 300 LSLST------DTSAQGDRKGTIIDSGTTLAYLPEGIYEPLVYKMIS-------QHPDLK 346
Query: 290 SQ------LCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQ---- 338
Q C++ + S+ P +T F+ G + ++ ++ P V +C Q
Sbjct: 347 VQTLHDEYTCFQYSESVDDGFPAVTFFFENGLSLK-VYPHDYLFPSVN-FWCIGWQNSGT 404
Query: 339 --PIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
++ + G+ S+ + YD ++Q + + +C+
Sbjct: 405 QSRDSKNMTLLGDLVLSNKLVFYDLENQAIGWAEYNCS 442
>gi|196212952|gb|ACG76112.1| S5 [Oryza sativa Indica Group]
gi|338809989|gb|AEJ08560.1| S5 [Oryza barthii]
gi|340810883|gb|AEK75368.1| S5 [Oryza sativa]
gi|340810885|gb|AEK75369.1| S5 [Oryza sativa]
gi|340810889|gb|AEK75371.1| S5 [Oryza sativa]
gi|340810895|gb|AEK75374.1| S5 [Oryza sativa]
gi|340810897|gb|AEK75375.1| S5 [Oryza sativa]
gi|340810905|gb|AEK75379.1| S5 [Oryza sativa]
gi|340810909|gb|AEK75381.1| S5 [Oryza sativa]
gi|340810911|gb|AEK75382.1| S5 [Oryza sativa]
gi|340810913|gb|AEK75383.1| S5 [Oryza sativa]
gi|340810923|gb|AEK75388.1| S5 [Oryza sativa]
gi|340810925|gb|AEK75389.1| S5 [Oryza sativa]
gi|340810929|gb|AEK75391.1| S5 [Oryza sativa]
gi|340810935|gb|AEK75394.1| S5 [Oryza sativa]
gi|340810937|gb|AEK75395.1| S5 [Oryza sativa]
gi|340810939|gb|AEK75396.1| S5 [Oryza sativa]
gi|340810941|gb|AEK75397.1| S5 [Oryza sativa]
gi|340810943|gb|AEK75398.1| S5 [Oryza sativa]
gi|340810951|gb|AEK75402.1| S5 [Oryza sativa]
gi|340810953|gb|AEK75403.1| S5 [Oryza sativa]
gi|340810963|gb|AEK75408.1| S5 [Oryza sativa]
gi|340810965|gb|AEK75409.1| S5 [Oryza sativa]
gi|340810973|gb|AEK75413.1| S5 [Oryza nivara]
gi|340811003|gb|AEK75428.1| S5 [Oryza rufipogon]
gi|340811005|gb|AEK75429.1| S5 [Oryza rufipogon]
gi|340811009|gb|AEK75431.1| S5 [Oryza rufipogon]
gi|340811023|gb|AEK75438.1| S5 [Oryza rufipogon]
gi|340811025|gb|AEK75439.1| S5 [Oryza nivara]
gi|340811031|gb|AEK75442.1| S5 [Oryza rufipogon]
gi|340811033|gb|AEK75443.1| S5 [Oryza rufipogon]
gi|340811035|gb|AEK75444.1| S5 [Oryza nivara]
gi|340811039|gb|AEK75446.1| S5 [Oryza rufipogon]
gi|340811049|gb|AEK75451.1| S5 [Oryza nivara]
gi|340811053|gb|AEK75453.1| S5 [Oryza rufipogon]
gi|340811055|gb|AEK75454.1| S5 [Oryza nivara]
gi|340811057|gb|AEK75455.1| S5 [Oryza rufipogon]
gi|340811059|gb|AEK75456.1| S5 [Oryza rufipogon]
gi|340811061|gb|AEK75457.1| S5 [Oryza rufipogon]
gi|340811065|gb|AEK75459.1| S5 [Oryza nivara]
gi|340811067|gb|AEK75460.1| S5 [Oryza nivara]
gi|340811069|gb|AEK75461.1| S5 [Oryza nivara]
gi|340811071|gb|AEK75462.1| S5 [Oryza rufipogon]
gi|340811081|gb|AEK75467.1| S5 [Oryza nivara]
gi|340811083|gb|AEK75468.1| S5 [Oryza nivara]
gi|340811087|gb|AEK75470.1| S5 [Oryza nivara]
gi|340811092|gb|AEK75472.1| S5 [Oryza nivara]
gi|340811102|gb|AEK75477.1| S5 [Oryza rufipogon]
gi|340811106|gb|AEK75479.1| S5 [Oryza rufipogon]
gi|340811108|gb|AEK75480.1| S5 [Oryza rufipogon]
gi|340811110|gb|AEK75481.1| S5 [Oryza rufipogon]
gi|340811112|gb|AEK75482.1| S5 [Oryza rufipogon]
gi|340811118|gb|AEK75485.1| S5 [Oryza nivara]
gi|340811120|gb|AEK75486.1| S5 [Oryza rufipogon]
Length = 472
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 102/382 (26%), Positives = 161/382 (42%), Gaps = 55/382 (14%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELS 79
++M S+G PP++++ I DTGS L WVQC PC V C+ Q PI++P S + + +
Sbjct: 114 FLMAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVR 172
Query: 80 CQSEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDN 131
C S +C L +C ++ C Y+ Y + + + G + T+ + G+S F +
Sbjct: 173 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMD 229
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSIT 188
++FGC + ++E E G+ G G + S Q+ L FSYCL TD +
Sbjct: 230 LMFGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL---PTDETKP 284
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
M G + G T L ++ Y +T+E + ++N +L+ S
Sbjct: 285 GYMILGRYDRAAMDGGY-TPLFRSINRPTYSLTMEML----IANGQRLV--------TSS 331
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK---------- 295
M +D+GA T L + L++ + A+ Y R S +CY
Sbjct: 332 SEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNG 391
Query: 296 --TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFA 351
TP S P+L F GGA + L + F P G+ FA P I GN
Sbjct: 392 TITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRV 450
Query: 352 QSDLFIGYDFDSQMVSFKPTDC 373
+D + FK C
Sbjct: 451 TRSFGTTFDIQGKQFGFKYAAC 472
>gi|326515330|dbj|BAK03578.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 445
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 96/385 (24%), Positives = 157/385 (40%), Gaps = 60/385 (15%)
Query: 32 TPPLLDIYGI----------------VDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSY 75
TPPL YG+ +DT S L W++C C+ +Q P+++P+ SSSY
Sbjct: 67 TPPLEYTYGVAVTIGTGRGKSTYFLVLDTASSLPWMRCAHCLPVQRQRSPVFDPSDSSSY 126
Query: 76 KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
+ L S C + V + + + G A G + T+ I GN +V FG
Sbjct: 127 RPLHPTSPLCRAPNPVLPAGDKCSFHLPGEA-----HGYVGTDTIILGNPTLPIHSVAFG 181
Query: 136 CGHNNTGVFNENEM-GLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
C + G + G +G+G+ SL QI ++G ++FSYCL+ + F
Sbjct: 182 CAQSTEGFDTKGTFAGTLGMGKLPTSLIMQIKDRVG-SRFSYCLIGLGHSPGRNGFIRF- 239
Query: 195 NGSEVSGGGVVSTSLVSKEDK--------------TYYFVTLEGISVGN--LSNSSKLIP 238
G+++ T LV K + Y+V L GIS+ + + +
Sbjct: 240 -GADIPD----PTLLVHHRIKILPTPPHLPHGVADSAYYVKLLGISLNGTPIPGIRQAMF 294
Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY---QDPRLGSQLCYK 295
S G+ G F+D G T L Y +EE V + ++ Y +DP LC++
Sbjct: 295 ERRSDGS---GGCFVDAGTQVTHLVPAAYAVVEEAVAHMVQQWGYKRVRDPNF--SLCFR 349
Query: 296 T-PSMAGIAPILTAHFDGGAKVPLIH-----TSTFIPPPVEGVFCFAM-QPIDGDVGIFG 348
P + P LT F+G A + H + F+ + + CF + + G + G
Sbjct: 350 EHPGIWSHIPKLTLDFEGPASRTVAHLEIVSRNLFLKVDNQPLVCFGVYRTSRGSPTVVG 409
Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDC 373
Q D +D + ++F C
Sbjct: 410 AMQQVDTRFIFDLHANTITFHRESC 434
>gi|218200805|gb|EEC83232.1| hypothetical protein OsI_28526 [Oryza sativa Indica Group]
Length = 450
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/355 (25%), Positives = 145/355 (40%), Gaps = 52/355 (14%)
Query: 41 IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHL-LDTV-----SCS 94
IVDTGSDL WVQC PC CY Q P+++P+ S+SY + C + C L SC+
Sbjct: 125 IVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCA 184
Query: 95 S---------QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFN 145
+ + C Y+ Y D S ++GVLAT+ + G ++ D VFGCG +N G
Sbjct: 185 TVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS--VDGFVFGCGLSNRG--- 239
Query: 146 ENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVV 205
L R + +S S G + + + D+S + N + VS
Sbjct: 240 --------LRRPGSAASSPTASPPGTSGDAAGSLSLGGDTS-----SYRNATPVS----Y 282
Query: 206 STSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKD 265
+ + +YF+ + G SVG + ++ + N+ +D+G T L
Sbjct: 283 TRMIADPAQPPFYFMNVTGASVGGAAVAAA---------GLGAANVLLDSGTVITRLAPS 333
Query: 266 FYN--RLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGI-APILTAHFDGGAKVPLIHTS 322
Y R E + + P P CY + P+LT + GA + +
Sbjct: 334 VYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEAGADMTVDAAG 393
Query: 323 TFIPPPVEGV-FCFAMQPI--DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+G C AM + + I GN+ Q + + YD + F DC+
Sbjct: 394 MLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 448
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/355 (24%), Positives = 159/355 (44%), Gaps = 46/355 (12%)
Query: 41 IVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLDTVSCS-SQQLC 99
I+DTGS + ++ C C C K ++P S++ K+L+C C+ T SC+ + C
Sbjct: 29 IIDTGSTITYIPCKDCSHCGKHTAEWFDPDKSTTAKKLACGDPLCN-CGTPSCTCNNDRC 87
Query: 100 NYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG-VFNENEMGLVGLGRTR 158
Y+ YA+ S ++G + + F +S++ +VFGC + TG ++ + G++G+G
Sbjct: 88 YYSRTYAERSSSEGWMIEDTFGFPDSDSPV-RLVFGCENGETGEIYRQMADGIMGMGNNH 146
Query: 159 LSLASQILSQ-LGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTY 217
+ SQ++ + + + FS C + I + G+ + G V T L++ Y
Sbjct: 147 NAFQSQLVQRKVIEDVFSLCFG--YPKDGI---LLLGDVTLPEGANTVYTPLLTHLHLHY 201
Query: 218 YFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV--- 274
Y V ++GI+V + + + G + +D+G T LP D + + + V
Sbjct: 202 YNVKMDGITVNGQTLAFDASVFDRGYGTV------LDSGTTFTYLPTDAFKAMAKAVGDY 255
Query: 275 --RNAIKLTPYQDPRLGSQLCYKTP-----SMAGIAPILTAHFDGGAKVPLIHTSTFIPP 327
+ ++ TP DP+ + +C+K + P F GGAK+ L P
Sbjct: 256 VEKKGLQSTPGADPQY-NDICWKGAPDQFKDLDKYFPPAEFVFGGGAKLTL--------P 306
Query: 328 PVEGVFCFAMQPIDGDVGIF---------GNFAQSDLFIGYDFDSQMVSFKPTDC 373
P+ + F +P + +GIF G + D+ + YD + V F C
Sbjct: 307 PLR--YLFLSKPAEYCLGIFDNGNSGALVGGVSVRDVVVTYDRRNSKVGFTTMAC 359
>gi|115465777|ref|NP_001056488.1| Os05g0591300 [Oryza sativa Japonica Group]
gi|113580039|dbj|BAF18402.1| Os05g0591300 [Oryza sativa Japonica Group]
Length = 453
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 83/294 (28%), Positives = 136/294 (46%), Gaps = 30/294 (10%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELS 79
+++ +GTP + + +DTGS L WVQC PC ++C+ Q V PI++P++SS+++ +
Sbjct: 53 FLIPVKLGTPAVQYLV-TMDTGSSLSWVQCRPCTIKCHVQPAKVGPIFDPSNSSTFRHVG 111
Query: 80 CQSEQCHLL------DTVSCSS-QQLCNYTYGYADS-SLTKGVLATERITFGNSNNF--- 128
C + C L + +C + +C YT Y + + G T+R+ G
Sbjct: 112 CSTSICSYLGRTLRIQSKACMEWEDICLYTMSYGGGWAYSVGKAVTDRLVLGGGETTRTT 171
Query: 129 --FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSS 186
N VFGC +T E G+ GLG + S QI L FSYCL S
Sbjct: 172 LSLANFVFGCSM-DTQYSTHKEAGIFGLGTSNYSF-EQIAPLLSYKAFSYCL-----PSD 224
Query: 187 ITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAI 246
+ Y G + SGG V TS+ + Y + + G++V ++ + + + S
Sbjct: 225 EAHQGYLSIGPDSSGG--VPTSMFPGTPRPVYSIGMTGLTV-TVNGEVRSLVSGSGSSPS 281
Query: 247 SKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLG--SQLCYKTPS 298
M +D+GA TLL + +LE+ + A++ Y +QLC+ T S
Sbjct: 282 PSSLMVVDSGAKLTLLLASTFGQLEDAIIPAMESLGYSLNTAAGQNQLCFLTES 335
>gi|259490398|ref|NP_001159203.1| uncharacterized protein LOC100304289 [Zea mays]
gi|223942623|gb|ACN25395.1| unknown [Zea mays]
Length = 378
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 99/386 (25%), Positives = 154/386 (39%), Gaps = 35/386 (9%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI--YNPA 70
+ S T G+Y ++F +GTP + + DTGSDL WV+C P + +
Sbjct: 3 LSSGAYTGTGQYFVRFRVGTPAQPFVL-VADTGSDLTWVKCRGAAGPPASDPPAREFRAS 61
Query: 71 SSSSYKELSCQSEQCHL---LDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFG--- 123
S S+ L+C S+ C +CSS C Y Y Y D S +GV+ T+ T
Sbjct: 62 ESRSWAPLACSSDTCTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSG 121
Query: 124 ----------NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK 173
VV GC G ++ G++ LG + +S AS+ ++ G +
Sbjct: 122 SGSEDGSGGGGRRAKLQGVVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAARFG-GR 180
Query: 174 FSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNS 233
FSYCLV + +S + FG G E G T LV D+ + ++
Sbjct: 181 FSYCLVDHLAPRNASSYLTFGPGPEGGGAPAARTPLV--LDRRVSPFYAVAVDAVYVAGE 238
Query: 234 SKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY--QDPRLGSQ 291
+ IP + G +D+G T+L Y + + + P DP +
Sbjct: 239 ALDIP-ADVWDVGRGGGAILDSGTSLTVLATPAYRAVVAALGGRLAALPRVAMDP---FE 294
Query: 292 LCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG---DVGIFG 348
CY + A P L F G A++ S ++ GV C +Q +G V + G
Sbjct: 295 YCYNWTAGAPEIPKLEVSFAGSARLEPPAKS-YVIDAAPGVKCIGVQ--EGAWPGVSVIG 351
Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDCT 374
N Q + +D + + FK T C
Sbjct: 352 NILQQEHLWEFDLRDRWLRFKHTRCA 377
>gi|413948408|gb|AFW81057.1| hypothetical protein ZEAMMB73_038743 [Zea mays]
Length = 469
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 99/386 (25%), Positives = 154/386 (39%), Gaps = 35/386 (9%)
Query: 13 VQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI--YNPA 70
+ S T G+Y ++F +GTP + + DTGSDL WV+C P + +
Sbjct: 94 LSSGAYTGTGQYFVRFRVGTPAQPFVL-VADTGSDLTWVKCRGAAGPPASDPPAREFRAS 152
Query: 71 SSSSYKELSCQSEQCHL---LDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFG--- 123
S S+ L+C S+ C +CSS C Y Y Y D S +GV+ T+ T
Sbjct: 153 ESRSWAPLACSSDTCTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSG 212
Query: 124 ----------NSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK 173
VV GC G ++ G++ LG + +S AS+ ++ G +
Sbjct: 213 SGSEDGSGGGGRRAKLQGVVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAARFG-GR 271
Query: 174 FSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNS 233
FSYCLV + +S + FG G E G T LV D+ + ++
Sbjct: 272 FSYCLVDHLAPRNASSYLTFGPGPEGGGAPAARTPLV--LDRRVSPFYAVAVDAVYVAGE 329
Query: 234 SKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY--QDPRLGSQ 291
+ IP + G +D+G T+L Y + + + P DP +
Sbjct: 330 ALDIP-ADVWDVGRGGGAILDSGTSLTVLATPAYRAVVAALGGRLAALPRVAMDP---FE 385
Query: 292 LCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDG---DVGIFG 348
CY + A P L F G A++ S ++ GV C +Q +G V + G
Sbjct: 386 YCYNWTAGAPEIPKLEVSFAGSARLEPPAKS-YVIDAAPGVKCIGVQ--EGAWPGVSVIG 442
Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDCT 374
N Q + +D + + FK T C
Sbjct: 443 NILQQEHLWEFDLRDRWLRFKHTRCA 468
>gi|359476199|ref|XP_003631804.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 421
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 78/225 (34%), Positives = 111/225 (49%), Gaps = 25/225 (11%)
Query: 10 NNVVQSNVSTANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNP 69
N+ +N+ +G +++ + GTPP + I+DTGS + W QC CV C + +N
Sbjct: 114 NHAHNNNLFDEDGNFLVDVAFGTPPQ-NFMLILDTGSSITWTQCKACVNCLQDSHRYFNW 172
Query: 70 ASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF 129
++SS+Y SC + TV NY Y D S + G + +T S + F
Sbjct: 173 SASSTYSSGSC------IPGTVE------NNYNMTYGDDSTSVGNYGCDTMTLEPS-DVF 219
Query: 130 DNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSIT 188
FGCG NN G F G++GLG+ +LS SQ S+ NK FSYCL + SI
Sbjct: 220 QKFQFGCGRNNKGDFGSGVDGMLGLGQGQLSTVSQTASKF--NKVFSYCLP---EEDSIG 274
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSK----EDKTYYFVTLEGISVGN 229
S + FG + + TSLV+ ++ YYFV L ISVGN
Sbjct: 275 S-LLFGEKATSQSSSLKFTSLVNGPGTLQESGYYFVNLSDISVGN 318
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 100/396 (25%), Positives = 170/396 (42%), Gaps = 73/396 (18%)
Query: 20 ANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSS 74
A G Y K IGTPP + Y VDTGSD+MWV C+ C +C + +Y+ SSS
Sbjct: 81 AVGLYYAKIGIGTPPK-NYYLQVDTGSDIMWVNCIQCKECPTRSNLGMDLTLYDIKESSS 139
Query: 75 YKELSCQSEQCHLLD---TVSCSSQQLCNYTYGYADSSLTKGVLATERITFG------NS 125
K + C E C ++ C++ C Y Y D S T G + + + +
Sbjct: 140 GKFVPCDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKT 199
Query: 126 NNFFDNVVFGCGHNNTGVF---NENEM-GLVGLGRTRLSLASQILSQLGANK-FSYCLVP 180
++ ++VFGCG +G NE + G++G G+ S+ SQ+ S K F++CL
Sbjct: 200 DSANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCL-- 257
Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGN-- 229
NG V+GGG+ + V + D+ +Y V + + VG+
Sbjct: 258 --------------NG--VNGGGIFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHAF 301
Query: 230 LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDP 286
LS S+ + G I ID+G LP+ Y L ++ + +K+ D
Sbjct: 302 LSLSTDTSTQGDRKGTI------IDSGTTLAYLPEGIYEPLVYKIISQHPDLKVRTLHD- 354
Query: 287 RLGSQLCYK-TPSMAGIAPILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQ------ 338
C++ + S+ P +T +F+ G + + H F P +C Q
Sbjct: 355 ---EYTCFQYSESVDDGFPAVTFYFENGLSLKVYPHDYLF---PSGDFWCIGWQNSGTQS 408
Query: 339 PIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
++ + G+ S+ + YD ++Q++ + +C+
Sbjct: 409 RDSKNMTLLGDLVLSNKLVFYDLENQVIGWTEYNCS 444
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 104/379 (27%), Positives = 173/379 (45%), Gaps = 44/379 (11%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYK------QVKPIYNPASSSSY 75
G Y K +G+PP + +DTGSD++WV C C C + Q+ ++ +SSS+
Sbjct: 64 GLYFTKVKLGSPPR-EFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLN-FFDSSSSSTA 121
Query: 76 KELSCQSEQC-HLLDTVS--CSSQ-QLCNYTYGYADSSLTKGVLATERITFGN--SNNFF 129
++ C C + T + CSSQ C+YT+ Y D S T G ++ + F +
Sbjct: 122 GQVRCSDPICTSAVQTTATQCSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLI 181
Query: 130 DN----VVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPF 181
DN +VFGC +G + + G+ G G+ LS+ SQ+ ++ + FS+CL
Sbjct: 182 DNSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCL--- 238
Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
D S + G E+ G+V + LV + +Y + L I+V + +L+P
Sbjct: 239 KGDGSGGGILVLG---EILEPGIVYSPLVPSQ--PHYNLNLLSIAV-----NGQLLPIDP 288
Query: 242 SSGAISKGN-MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDP--RLGSQLCYKTPS 298
++ A S +D+G L + Y+ V NAI ++P P G+Q + S
Sbjct: 289 AAFATSNSQGTIVDSGTTLAYLVAEAYDPFVSAV-NAI-VSPSVTPITSKGNQCYLVSTS 346
Query: 299 MAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEG---VFCFAMQPIDGDVGIFGNFAQSDL 355
++ + P+ + +F GGA + L IP G ++C Q + G V I G+ D
Sbjct: 347 VSQMFPLASFNFAGGASMVLKPEDYLIPFGSSGGSAMWCIGFQKVQG-VTILGDLVLKDK 405
Query: 356 FIGYDFDSQMVSFKPTDCT 374
YD Q + + DC+
Sbjct: 406 IFVYDLVRQRIGWANYDCS 424
>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 407
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 100/389 (25%), Positives = 159/389 (40%), Gaps = 56/389 (14%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
N + ++GTPP ++ ++DTGS+L W+ C +N S SY+ + C
Sbjct: 28 NISLTVSLTVGTPPQ-NVSMVIDTGSELSWLYCNKTTTTTSYPT-TFNQTRSISYRPIPC 85
Query: 81 QSEQC-----HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFG 135
S C SC S LC+ T YAD+S ++G LA++ G S+ +VFG
Sbjct: 86 SSSTCTNQTRDFSIPASCDSNSLCHATLSYADASSSEGNLASDTFHMGASD--IPGMVFG 143
Query: 136 CGHNNTGVFNENE------MGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
C + VF+ N GL+G+ R LS +SQ+G KFSYC+ + + +
Sbjct: 144 CMDS---VFSSNSDEDSKNTGLMGMNRGSLSF----VSQMGFPKFSYCI----SGTDFSG 192
Query: 190 KMYFGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLSNSSKLIPYYNS- 242
+ G + + T LV D+ Y V LEGI V S +L+P S
Sbjct: 193 MLLLGESNFTWAVPLNYTPLVQISTPLPYFDRIAYTVQLEGIKV-----SDRLLPIPKSV 247
Query: 243 --SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCY 294
G +D+G T L Y L + N L +DP Q LCY
Sbjct: 248 FEPDHTGAGQTMVDSGTQFTFLLGPAYTALRSEFLNQTTGFLRVLEDPDFVFQGAMDLCY 307
Query: 295 KTPSMAGIAPIL--TAHFDGGAKVPLIHTSTF--IPPPVEG---VFCFAMQPID---GDV 344
+ P + P L + GA++ + +P + G V C + D +
Sbjct: 308 RVPISQRVLPRLPTVSLVFNGAEMTVADERVLYRVPGEIRGNDSVHCLSFGNSDLLGVEA 367
Query: 345 GIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
+ G+ Q ++++ +D + + C
Sbjct: 368 YVIGHHHQQNVWMEFDLERSRIGLAQVRC 396
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 80/278 (28%), Positives = 136/278 (48%), Gaps = 25/278 (8%)
Query: 12 VVQSNVSTANGE-------YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK 64
V +S+V A+G Y+++ +IGTP + + DT +D W+ C CV C V
Sbjct: 69 VTKSSVPIASGRGIVQSPTYIVRANIGTPAQAMLVAL-DTSNDAAWIPCSGCVGCSSSV- 126
Query: 65 PIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGN 124
+++P+ SSS + L C++ QC SC+ + C + Y S++ + L + +T
Sbjct: 127 -LFDPSKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYGGSAI-EAYLTQDTLTL-- 182
Query: 125 SNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTD 184
+ + N FGC + +G + GL+GLGR LSL SQ L + FSYCL P
Sbjct: 183 ATDVIPNYTFGCINKASGT-SLPAQGLMGLGRGPLSLISQS-QNLYQSTFSYCL-PNSKS 239
Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGN--LSNSSKLIPYYN 241
S+ + + G ++ + +T L+ + + Y+V L GI VGN + + + +
Sbjct: 240 SNFSGSLRLGPKNQPI--RIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDP 297
Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK 279
++GA G +F D+G T L + Y + + R +K
Sbjct: 298 ATGA---GTIF-DSGTVYTRLVEPAYVAMRNEFRRRVK 331
>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
Length = 460
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 113/366 (30%), Positives = 151/366 (41%), Gaps = 46/366 (12%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
+G +++ GTP I+DTGSD W+QC C K +NP+ SSSY SC
Sbjct: 126 DGLFLVNVGFGTPQQ-KFNLIIDTGSDTTWIQCNSCSLGNCHNKKTFNPSLSSSYSNRSC 184
Query: 81 QSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
DT NYT Y D+S +KGV + +T + F FGCG +
Sbjct: 185 IPST----DT---------NYTMKYEDNSYSKGVFVCDEVTL--KPDVFPKFQFGCGDSG 229
Query: 141 TGVFNENEMGLVGLGR-TRLSLASQILSQLGANKFSYCLVPF-HTDSSITSKMYFGNGSE 198
G F G++GL + + SL SQ S+ KFSYC P HT S + FG +
Sbjct: 230 GGEFG-TASGVLGLAKGEQYSLISQTASKF-KKKFSYCFPPKEHTLGS----LLFGEKAI 283
Query: 199 VSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYYNSSGAISKGNMFIDTG 256
+ + T L++ YFV L GISV L+ SS L + S G I ID+G
Sbjct: 284 SASPSLKFTQLLNPPSGLGYFVELIGISVAKKRLNVSSSL---FASPGTI------IDSG 334
Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL---CYKTPSMAGI---APILTAHF 310
T LP Y L + + P P +L CY G P + HF
Sbjct: 335 TVITRLPTAAYEALRTAFQQEMLHCPSISPPPQEKLLDTCYNLKGCGGRNIKLPEIVLHF 394
Query: 311 DGGAKVPLIHTSTFIPPP---VEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVS 367
G V L H S + + FA + V I GN Q L + YD + +
Sbjct: 395 VGEVDVSL-HPSGILWANGDLTQACLAFARKSNPSHVTIIGNRQQVSLKVVYDIEGGRLG 453
Query: 368 FKPTDC 373
F DC
Sbjct: 454 FG-NDC 458
>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
Length = 491
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 102/387 (26%), Positives = 177/387 (45%), Gaps = 54/387 (13%)
Query: 19 TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
T G Y + IG+PP Y VDTGSD++WV C+ C C + Y+PA S
Sbjct: 79 TDTGLYYTRIEIGSPPK-GYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSG 137
Query: 74 SYKELSCQSEQCHLLDTV-----SC-SSQQLCNYTYGYADSSLTKGVLATERITF----G 123
+ + C+ E C + ++ +C S+ C + Y D S T G T+ + + G
Sbjct: 138 T--TVGCEQEFC-VANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSG 194
Query: 124 NSNNFFDN--VVFGCGHN---NTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYC 177
N N + FGCG + G N+ G++G G++ S+ SQ+ + K F++C
Sbjct: 195 NGQTTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHC 254
Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLI 237
L ++ F G+ V V +T LV + T+Y V L+GISVG ++ +
Sbjct: 255 L------DTVRGGGIFAIGNVVQ-PKVKTTPLV--PNVTHYNVNLQGISVG---GATLQL 302
Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTP---YQDPRLGSQLCY 294
P SKG + ID+G LP++ Y L V + + P YQD +C+
Sbjct: 303 PTSTFDSGDSKGTI-IDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQD-----FVCF 356
Query: 295 K-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-DVGIF 347
+ + S+ P++T F+G + ++ ++ ++C +Q DG D+ +
Sbjct: 357 QFSGSIDDGFPVITFSFEGDLTLN-VYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLL 415
Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDCT 374
G+ S+ + YD + +++ + +C+
Sbjct: 416 GDLVLSNKLVVYDLEKEVIGWTDYNCS 442
>gi|18402471|ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15 [Arabidopsis thaliana]
gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein [Arabidopsis thaliana]
gi|14532748|gb|AAK64075.1| unknown protein [Arabidopsis thaliana]
gi|332194267|gb|AEE32388.1| aspartyl protease [Arabidopsis thaliana]
Length = 583
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 91/384 (23%), Positives = 162/384 (42%), Gaps = 42/384 (10%)
Query: 21 NGEYVMKFSIGTPPLLDIYGI-VDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSSYK-- 76
+G Y + +G P Y + +DTGS+L W+QC PC C K +Y P + +
Sbjct: 200 DGLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGANQLYKPRKDNLVRSS 259
Query: 77 ELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATER--ITFGNSNNFFDNVVF 134
E C Q + L T C + C+Y YAD S + GVL ++ + N + ++VF
Sbjct: 260 EAFCVEVQRNQL-TEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLAESDIVF 318
Query: 135 GCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSK 190
GCG++ G+ + G++GL R ++SL SQ+ S+ + +N +CL S + +
Sbjct: 319 GCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLA-----SDLNGE 373
Query: 191 MYFGNGSE-VSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKG 249
Y GS+ V G+ ++ Y + + +S G ++ +G + G
Sbjct: 374 GYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQ-----GMLSLDGENGRV--G 426
Query: 250 NMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTP------SMAGI 302
+ DTG+ T P Y++L ++ L +D + +C++ S++ +
Sbjct: 427 KVLFDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDETLPICWRAKTNFPFSSLSDV 486
Query: 303 APILT-AHFDGGAKVPLIHTSTFIPPPV------EGVFCFAM----QPIDGDVGIFGNFA 351
G+K +I I P +G C + DG I G+ +
Sbjct: 487 KKFFRPITLQIGSKWLIISRKLLIQPEDYLIISNKGNVCLGILDGSSVHDGSTIILGDIS 546
Query: 352 QSDLFIGYDFDSQMVSFKPTDCTK 375
I YD + + + +DC +
Sbjct: 547 MRGHLIVYDNVKRRIGWMKSDCVR 570
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 110/393 (27%), Positives = 160/393 (40%), Gaps = 54/393 (13%)
Query: 18 STANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLP---CVQC--YKQVKPIYNPASS 72
S + G Y + S GTPP + ++DTGS +W C C C ++ P + P S
Sbjct: 71 SHSYGGYSISLSFGTPPQTLSF-VMDTGSSFVWFPCTLRYLCNNCSFTSRISP-FLPKHS 128
Query: 73 SSYKELSCQSEQCHLL----------DTVSCSSQQLCNYTYGYADSSLTKGVLATERITF 122
SS K + C++ +C + D S + Q+C S T GV +E T
Sbjct: 129 SSSKIIGCKNPKCSWIHQTDLRCTDCDNNSRNCSQICPPYLILYGSGTTGGVALSE--TL 186
Query: 123 GNSNNFFDNVVFGCGHNNTGVFNENE-MGLVGLGRTRLSLASQILSQLGANKFSYCLVPF 181
N + GC VF+ + G+ G GR SL SQ LG KFSYCL+
Sbjct: 187 HLHGLIVPNFLVGCS-----VFSSRQPAGIAGFGRGPSSLPSQ----LGLTKFSYCLLSH 237
Query: 182 HTDSSITSKMYFGNG---SEVSGGGVVSTSLVSK---EDK----TYYFVTLEGISVGNLS 231
D + S + S+ ++ T LV +DK YY+V+L IS+G S
Sbjct: 238 KFDDTQESSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRS 297
Query: 232 NSSKLIPY-YNSSGAISKGNMFIDTGAPPTLLPKDFY----NRLEEQVRNAIKLTPYQDP 286
IPY Y S G ID+G T + + + N QV+N + +
Sbjct: 298 VK---IPYKYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERAL-MVEA 353
Query: 287 RLGSQLCYKTPSMAGIA-PILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM-----QPI 340
G + C+ + P L HF GGA V L + F V CF + +
Sbjct: 354 LSGLKPCFNVSGAKELELPQLRLHFKGGADVELPLENYFAFLGSREVACFTVVTDGAEKA 413
Query: 341 DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
G I GNF + ++ YD ++ + FK C
Sbjct: 414 SGPGMILGNFQMQNFYVEYDLQNERLGFKKESC 446
>gi|340810915|gb|AEK75384.1| S5 [Oryza sativa]
gi|340810917|gb|AEK75385.1| S5 [Oryza sativa]
gi|340810919|gb|AEK75386.1| S5 [Oryza sativa]
gi|340810927|gb|AEK75390.1| S5 [Oryza sativa]
gi|340810975|gb|AEK75414.1| S5 [Oryza nivara]
gi|340810979|gb|AEK75416.1| S5 [Oryza nivara]
gi|340810995|gb|AEK75424.1| S5 [Oryza nivara]
gi|340811027|gb|AEK75440.1| S5 [Oryza nivara]
gi|340811063|gb|AEK75458.1| S5 [Oryza nivara]
Length = 357
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 159/380 (41%), Gaps = 55/380 (14%)
Query: 26 MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELSCQ 81
M S+G PP++++ I DTGS L WVQC PC V C+ Q PI++P S + + + C
Sbjct: 1 MAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59
Query: 82 SEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDNVV 133
S +C L +C ++ C Y+ Y + + + G + T+ + G+S F +++
Sbjct: 60 SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMDLM 116
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSITSK 190
FGC + ++E E G+ G G + S Q+ L FSYCL TD +
Sbjct: 117 FGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL---PTDETKPGY 171
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
M G + G T L ++ Y +T+E + ++N +L+ S
Sbjct: 172 MILGRYDRAAMDGGY-TPLFRSINRPTYSLTMEML----IANGQRLV--------TSSSE 218
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK------------ 295
M +D+GA T L + L++ + A+ Y R S +CY
Sbjct: 219 MIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTI 278
Query: 296 TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQS 353
TP S P+L F GGA + L + F P G+ FA P I GN
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRVTR 337
Query: 354 DLFIGYDFDSQMVSFKPTDC 373
+D + FK C
Sbjct: 338 SFGTTFDIQGKQFGFKYAAC 357
>gi|340810959|gb|AEK75406.1| S5 [Oryza sativa]
gi|340810971|gb|AEK75412.1| S5 [Oryza rufipogon]
Length = 357
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 159/380 (41%), Gaps = 55/380 (14%)
Query: 26 MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELSCQ 81
M S+G PP++++ I DTGS L WVQC PC V C+ Q PI++P S + + + C
Sbjct: 1 MAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59
Query: 82 SEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDNVV 133
S +C L +C ++ C Y+ Y + + + G + T+ + G+S F +++
Sbjct: 60 SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMDLM 116
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSITSK 190
FGC + ++E E G+ G G + S Q+ L FSYCL TD +
Sbjct: 117 FGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL---PTDETKPGY 171
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
M G + G T L ++ Y +T+E + ++N +L+ S
Sbjct: 172 MILGRYDRAAMDGGY-TPLFRSINRPTYSLTMEML----IANGQRLV--------TSSSE 218
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK------------ 295
M +D+GA T L + L++ + A+ Y R S +CY
Sbjct: 219 MIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTI 278
Query: 296 TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQS 353
TP S P+L F GGA + L + F P G+ FA P I GN
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRVTR 337
Query: 354 DLFIGYDFDSQMVSFKPTDC 373
+D + FK C
Sbjct: 338 SFGTTFDIQGKQFGFKYAAC 357
>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 564
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 75/267 (28%), Positives = 125/267 (46%), Gaps = 31/267 (11%)
Query: 21 NGEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSSY- 75
+G+Y +G PP LD VDTGSDL W+QC PC C K P+Y PA
Sbjct: 191 DGQYYTSIFVGNPPRPYFLD----VDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVP 246
Query: 76 -KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV-- 132
++L CQ Q D C++ + C+Y YAD S + GVLA + + +N + +
Sbjct: 247 PRDLLCQELQG---DQNYCATCKQCDYEIEYADRSSSMGVLAKDDMHMIATNGGREKLDF 303
Query: 133 VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSIT 188
VFGC ++ G + G++GL +SL SQ+ SQ + +N F +C+ + +
Sbjct: 304 VFGCAYDQQGQLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHCIT---KEPNGG 360
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
M+ G+ V G+ + D Y+ + ++ G+ + + + +G S
Sbjct: 361 GYMFLGD-DYVPRWGMTWAPIRGGPDNLYH-TEAQKVNYGD-----QQLRMHGQAG--SS 411
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVR 275
+ D+G+ T LP + Y +L ++
Sbjct: 412 IQVIFDSGSSYTYLPDEIYKKLVTAIK 438
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 96/380 (25%), Positives = 160/380 (42%), Gaps = 43/380 (11%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
G Y + +G+PP + + +DTGSD++WV C PC C +NP +SS+
Sbjct: 89 GLYFTRVKLGSPPK-EYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSS 147
Query: 77 ELSCQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITF----GNSN- 126
++ C ++C S S C YT+ Y D S T G ++ + F GN
Sbjct: 148 KIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQT 207
Query: 127 -NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK--FSYCLVP 180
N ++VFGC ++ +G + + G+ G G+ +LS+ SQ L+ LG + FS+CL
Sbjct: 208 ANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQ-LNSLGVSPKVFSHCLKG 266
Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIP 238
I + G E+ G+V T LV + +Y + LE I V L S L
Sbjct: 267 SDNGGGI---LVLG---EIVEPGLVYTPLVPSQ--PHYNLNLESIVVNGQKLPIDSSLFT 318
Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS 298
N+ G I +D+G L Y+ + A+ + G+Q + S
Sbjct: 319 TSNTQGTI------VDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQCFVTSSS 372
Query: 299 MAGIAPILTAHFDGGAKVPLIHTSTFIPPPV---EGVFCFAMQPIDG-DVGIFGNFAQSD 354
+ P ++ +F GG + + + + ++C Q G + I G+ D
Sbjct: 373 VDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKD 432
Query: 355 LFIGYDFDSQMVSFKPTDCT 374
YD + + + DC+
Sbjct: 433 KIFVYDLANMRMGWTDYDCS 452
>gi|297724243|ref|NP_001174485.1| Os05g0511050 [Oryza sativa Japonica Group]
gi|222632192|gb|EEE64324.1| hypothetical protein OsJ_19161 [Oryza sativa Japonica Group]
gi|255676482|dbj|BAH93213.1| Os05g0511050 [Oryza sativa Japonica Group]
Length = 432
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 111/422 (26%), Positives = 175/422 (41%), Gaps = 74/422 (17%)
Query: 17 VSTANGEYVMKFSIGTPP-LLDIYGIVDTGSDLMWVQC-----LPCVQC------YKQVK 64
V+T Y++ ++G PP + +Y +DTGSDL WV C C++C K +
Sbjct: 18 VTTYTDGYLLSLNLGMPPQVFQVY--LDTGSDLTWVPCGTNSSYQCLECGNEHSTSKPIP 75
Query: 65 PIYNPASSSSYKELSCQSEQC---HLLD-------TVSCS----SQQLCN-----YTYGY 105
SSS+ KEL C S C H D V C+ LC ++Y Y
Sbjct: 76 SFSPSQSSSNMKEL-CGSRFCVDIHSSDNSHDPCAAVGCAIPSFMSDLCTRPCPPFSYTY 134
Query: 106 ADSSLTKGVLATERITFGNS----NNFFD--NVVFGCGHNNTGVFNENEMGLVGLGRTRL 159
+L G LA + +T S D FGC G +G+ G G+ L
Sbjct: 135 GGGALVLGSLAKDIVTLHGSIFGIAILLDVPGFCFGC----VGSSIREPIGIAGFGKGIL 190
Query: 160 SLASQILSQLGANKFSYCLVPFH--TDSSITSKMYFGNGSEVSGGGVVSTSLV-SKEDKT 216
SL SQ+ FS+C + F + + TS + G+ + + + T ++ S +
Sbjct: 191 SLPSQL--GFLDKGFSHCFLGFRFARNPNFTSSLIMGDLALSAKDDFLFTPMLKSITNPN 248
Query: 217 YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRN 276
+Y++ LEG+S+G+ + P +S + G M +DTG T LP FY + + +
Sbjct: 249 FYYIGLEGVSIGD-GAAIAAPPSLSSIDSEGNGGMIVDTGTTYTHLPDPFYTAILSSLAS 307
Query: 277 AIKLTPYQD--PRLGSQLCYK-----TPSMAGIAPILTAHFDGGAKVPLIHTSTF--IPP 327
I D R G LC+K TP P++ HF G K+ L S + +
Sbjct: 308 VILYERSYDLEMRTGFDLCFKIPCTHTPCTQDELPLINFHFLGDVKLTLPKDSCYYAVTA 367
Query: 328 PVEGVF--CFAMQPI-------------DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
P V C Q + +G + G+F ++ + YD ++ + F+P D
Sbjct: 368 PKNSVVVKCLLFQRMDNDDDDDDVGGANNGPGAVLGSFQMQNVEVVYDMEAGRIGFQPKD 427
Query: 373 CT 374
C
Sbjct: 428 CA 429
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 96/380 (25%), Positives = 160/380 (42%), Gaps = 43/380 (11%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
G Y + +G+PP + + +DTGSD++WV C PC C +NP +SS+
Sbjct: 89 GLYFTRVKLGSPPK-EYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSS 147
Query: 77 ELSCQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITF----GNSN- 126
++ C ++C S S C YT+ Y D S T G ++ + F GN
Sbjct: 148 KIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQT 207
Query: 127 -NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK--FSYCLVP 180
N ++VFGC ++ +G + + G+ G G+ +LS+ SQ L+ LG + FS+CL
Sbjct: 208 ANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQ-LNSLGVSPKVFSHCLKG 266
Query: 181 FHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIP 238
I + G E+ G+V T LV + +Y + LE I V L S L
Sbjct: 267 SDNGGGI---LVLG---EIVEPGLVYTPLVPSQ--PHYNLNLESIVVNGQKLPIDSSLFT 318
Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS 298
N+ G I +D+G L Y+ + A+ + G+Q + S
Sbjct: 319 TSNTQGTI------VDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQCFVTSSS 372
Query: 299 MAGIAPILTAHFDGGAKVPLIHTSTFIPPPV---EGVFCFAMQPIDG-DVGIFGNFAQSD 354
+ P ++ +F GG + + + + ++C Q G + I G+ D
Sbjct: 373 VDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKD 432
Query: 355 LFIGYDFDSQMVSFKPTDCT 374
YD + + + DC+
Sbjct: 433 KIFVYDLANMRMGWTDYDCS 452
>gi|196212948|gb|ACG76110.1| S5 [Oryza sativa Japonica Group]
gi|340810887|gb|AEK75370.1| S5 [Oryza sativa]
gi|340810903|gb|AEK75378.1| S5 [Oryza sativa]
gi|340810921|gb|AEK75387.1| S5 [Oryza sativa]
gi|340810955|gb|AEK75404.1| S5 [Oryza sativa]
gi|340811079|gb|AEK75466.1| S5 [Oryza nivara]
gi|340811090|gb|AEK75471.1| S5 [Oryza rufipogon]
gi|340811116|gb|AEK75484.1| S5 [Oryza nivara]
Length = 357
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 159/380 (41%), Gaps = 55/380 (14%)
Query: 26 MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELSCQ 81
M S+G PP++++ I DTGS L WVQC PC V C+ Q PI++P S + + + C
Sbjct: 1 MAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59
Query: 82 SEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDNVV 133
S +C L +C ++ C Y+ Y + + + G + T+ + G+S F +++
Sbjct: 60 SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMDLM 116
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSITSK 190
FGC + ++E E G+ G G + S Q+ L FSYCL TD +
Sbjct: 117 FGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL---PTDETKPGY 171
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
M G + G T L ++ Y +T+E + ++N +L+ S
Sbjct: 172 MILGRYDRAAMDGGY-TPLFRSINRPTYSLTMEML----IANGQRLV--------TSSSE 218
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK------------ 295
M +D+GA T L + L++ + A+ Y R S +CY
Sbjct: 219 MIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTI 278
Query: 296 TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQS 353
TP S P+L F GGA + L + F P G+ FA P I GN
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRVTR 337
Query: 354 DLFIGYDFDSQMVSFKPTDC 373
+D + FK C
Sbjct: 338 SFGTTFDIQGKQFGFKYAAC 357
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 103/386 (26%), Positives = 172/386 (44%), Gaps = 60/386 (15%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
N + ++G+PP +I ++DTGS+L W+ C + + ++NP SSS+Y + C
Sbjct: 58 NVTLTVTLAVGSPPQ-NISMVLDTGSELSWLHC----KKSPNLGSVFNPVSSSTYSPVPC 112
Query: 81 QSEQCH-----LLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
S C L SC + C+ YAD++ +G LA + G+ +F
Sbjct: 113 SSPICRTRTRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVIGSVTR--PGTLF 170
Query: 135 GCGHNNTGVFNENE-----MGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITS 189
GC ++G+ +++E GL+G+ R LS ++QLG +KFSYC+ + S +
Sbjct: 171 GC--MDSGLSSDSEEDAKSTGLMGMNRGSLSF----VNQLGFSKFSYCI----SGSDSSG 220
Query: 190 KMYFGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGN-LSNSSKLIPYYNS 242
+ G+ S G + T LV + D+ Y V LEGI VG+ + + K + +
Sbjct: 221 ILLLGDASYSWLGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDH 280
Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCYKT 296
+GA G +D+G T L Y L+ + K L DP Q LCY+
Sbjct: 281 TGA---GQTMVDSGTQFTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQGTMDLCYRV 337
Query: 297 -----PSMAGIAPILTAHFDG------GAKVPLIHTSTFIPPPVEGVFCFAMQPIDG--- 342
P+ G+ P+++ F G G K+ L + E V+CF D
Sbjct: 338 GSSTRPNFTGL-PVISLMFRGAEMSVSGQKL-LYRVNGAGSEGKEEVYCFTFGNSDLLGI 395
Query: 343 DVGIFGNFAQSDLFIGYDFDSQMVSF 368
+ + G+ Q ++++ +D V F
Sbjct: 396 EAFVIGHHHQQNVWMEFDLAKSRVGF 421
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 102/387 (26%), Positives = 172/387 (44%), Gaps = 52/387 (13%)
Query: 19 TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
TA G Y + IG+PP Y VDTGSD++WV + C C + Y+PA S
Sbjct: 80 TATGLYYTRIEIGSPPK-GYYVQVDTGSDILWVNGISCDGCPTRSGLGIELTQYDPAGSG 138
Query: 74 SYKELSCQSEQCHLLDTVS-----C-SSQQLCNYTYGYADSSLTKGVLATERITF----G 123
+ + C+ E C S C S+ C + Y D S T G T+ + + G
Sbjct: 139 T--TVGCEQEFCVANSAASGVPPACPSAASPCQFRITYGDGSSTTGFYVTDFVQYNQVSG 196
Query: 124 NSNNFFDNV--VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYC 177
N NV FGCG G + G++G G++ S+ SQ+ + K F++C
Sbjct: 197 NGQTTPSNVSITFGCGAQLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHC 256
Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLI 237
L ++ F G+ V V +T LV + T+Y V L+GISVG ++ +
Sbjct: 257 L------DTVRGGGIFAIGNVVQPPIVKTTPLV--PNATHYNVNLQGISVG---GATLQL 305
Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQLCY 294
P SKG + ID+G LP++ Y L V + + + Y+D +C+
Sbjct: 306 PTSTFDSGDSKGTI-IDSGTTLAYLPREVYRTLLTAVFDKHPDLAVRNYED-----FICF 359
Query: 295 K-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-DVGIF 347
+ + S+ P++T F+G + ++ ++ ++C +Q DG D+ +
Sbjct: 360 QFSGSLDEEFPVITFSFEGDLTLN-VYPHDYLFQNGNDLYCMGFLDGGVQTKDGKDMVLL 418
Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDCT 374
G+ S+ + YD + Q++ + +C+
Sbjct: 419 GDLVLSNKLVVYDLEKQVIGWTDYNCS 445
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 105/393 (26%), Positives = 160/393 (40%), Gaps = 64/393 (16%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSC 80
N + ++GTPP ++ ++DTGS+L W+ C P K + P +SS++ + C
Sbjct: 82 NVSLTVSLAVGTPPQ-NVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRASSTFAAVPC 140
Query: 81 QSEQCHLLDTVS---CS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGC 136
S QC D S C + C+ + YAD S + G LAT+ G+ FGC
Sbjct: 141 ASAQCRSRDLPSPPACDGASSRCSVSLSYADGSSSDGALATDVFAVGSGPPL--RAAFGC 198
Query: 137 GHNNTGVFNEN-----EMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
+ F+ + GL+G+ R LS +SQ +FSYC+ +D +
Sbjct: 199 ---MSSAFDSSPDGVASAGLLGMNRGALSF----VSQASTRRFSYCI----SDRDDAGVL 247
Query: 192 YFGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLS---NSSKLIPYYNS 242
G+ + + T + D+ Y V L GI VG +S L P +
Sbjct: 248 LLGHSDLPTFLPLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTG 307
Query: 243 SGAISKGNMFIDTGAPPTLLPKDFYNRLE-EQVRNAIKLTP-YQDPRLGSQ----LCYKT 296
+ G +D+G T L D Y+ L+ E R A L P DP Q C++
Sbjct: 308 A-----GQTMVDSGTQFTFLLGDAYSALKAEFTRQARPLLPALDDPSFAFQEAFDTCFRV 362
Query: 297 PSMAGIAPILTAHFDG------GAKVPLIHTSTFIPPPVE-----GVFCFA-----MQPI 340
P G +P TA G GA++ + P E GV+C M PI
Sbjct: 363 PQ--GRSPP-TARLPGVTLLFNGAEMAVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPI 419
Query: 341 DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
V G+ Q ++++ YD + V P C
Sbjct: 420 MAYV--IGHHHQMNVWVEYDLERGRVGLAPVRC 450
>gi|222629275|gb|EEE61407.1| hypothetical protein OsJ_15596 [Oryza sativa Japonica Group]
Length = 466
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 99/399 (24%), Positives = 151/399 (37%), Gaps = 75/399 (18%)
Query: 23 EYVMKFSIGTPPLLDIYGI-VDTGSDLMWVQCLP--CVQCYKQV---------------- 63
+Y + S+G P + +DTGSDL+W C P C+ C +
Sbjct: 87 DYTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPIDS 146
Query: 64 ------KPIYNPASSSSYKELSCQSEQCHL--LDTVSCSSQQLCNYTYGYADSSLTKGVL 115
P+ + A SS+ C + +C L ++T SC+S Y Y D SL L
Sbjct: 147 RRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVAN-L 205
Query: 116 ATERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFS 175
R+ S +N F C H +G+ G GR LSL +Q+ L + +
Sbjct: 206 RRGRVGLAASMAV-ENFTFACAHTALA----EPVGVAGFGRGPLSLPAQLAPSLSGSTDA 260
Query: 176 YCLVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYF-VTLEGISVGNLSNSS 234
+ TD V T L+ Y++ V LE +SVG +
Sbjct: 261 AAIGASETD-------------------FVYTPLLHNPKHPYFYSVALEAVSVGGKRIQA 301
Query: 235 KLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPY-----QDPRLG 289
+ P G M +D+G T+LP D + R+ ++ A+ + + + G
Sbjct: 302 Q--PELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTG 359
Query: 290 SQLCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVE---GVFCFAMQPIDGD-- 343
CY +PS + P+ HF G A V L + F+ E V C + + G+
Sbjct: 360 LAPCYHYSPSDRAVPPV-ALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNND 418
Query: 344 --------VGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
G GNF Q + YD D+ V F CT
Sbjct: 419 DGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCT 457
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 82/280 (29%), Positives = 139/280 (49%), Gaps = 29/280 (10%)
Query: 12 VVQSNVSTANGE-------YVMKFSIGTP--PLLDIYGIVDTGSDLMWVQCLPCVQCYKQ 62
V +S+V A+G Y+++ +IGTP P+L +DT +D W+ C CV C
Sbjct: 69 VRKSSVPIASGRAIVQSPTYIVRANIGTPAQPMLVA---LDTSNDAAWIPCSGCVGCSSS 125
Query: 63 VKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF 122
V +++P+ SSS + L C++ QC SC+ + C + Y S++ + L + +T
Sbjct: 126 V--LFDPSKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYGGSTI-EAYLTQDTLTL 182
Query: 123 GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
+++ N FGC + +G + GL+GLGR LSL SQ L + FSYCL P
Sbjct: 183 --ASDVIPNYTFGCINKASGT-SLPAQGLMGLGRGPLSLISQS-QNLYQSTFSYCL-PNS 237
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGN--LSNSSKLIPY 239
S+ + + G ++ + +T L+ + + Y+V L GI VGN + + + +
Sbjct: 238 KSSNFSGSLRLGPKNQPI--RIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAF 295
Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK 279
++GA G +F D+G T L + Y + + R +K
Sbjct: 296 DPATGA---GTIF-DSGTVYTRLVEPAYVAVRNEFRRRVK 331
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 96/396 (24%), Positives = 166/396 (41%), Gaps = 77/396 (19%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
G Y K IGTP D Y VDTGSD+MWV C+ C +C + +YN S S K
Sbjct: 84 GLYYAKVGIGTPSK-DYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIKDSVSGK 142
Query: 77 ELSCQSEQCHLLD---TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD--- 130
+ C E C+ ++ C++ C Y Y D S T G + + + +
Sbjct: 143 LVPCDEEFCYEVNGGPLSGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSGDLQTTS 202
Query: 131 ---NVVFGCGHNNTGVF----NENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFH 182
+V+FGCG +G E G++G G++ S+ SQ+ + K F++CL
Sbjct: 203 SNGSVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCL---- 258
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGN--LS 231
++GGG+ + V + ++ +Y V + + VG L
Sbjct: 259 --------------DGINGGGIFAIGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGEDFLH 304
Query: 232 NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ 291
++ + GAI ID+G LP+ Y L ++ + Q P L
Sbjct: 305 LPTEEFEAGDRKGAI------IDSGTTLAYLPEIVYEPLVSKIIS-------QQPDLKVH 351
Query: 292 L------CYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQP 339
+ C++ + S+ P +T HF+ + +H ++ P EG++C MQ
Sbjct: 352 IVRDEYTCFQYSGSVDDGFPNVTFHFENSVFLK-VHPHEYL-FPFEGLWCIGWQNSGMQS 409
Query: 340 ID-GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
D ++ + G+ S+ + YD ++Q + + +C+
Sbjct: 410 RDRRNMTLLGDLVLSNKLVLYDLENQAIGWTEYNCS 445
>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
Length = 491
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 102/387 (26%), Positives = 176/387 (45%), Gaps = 54/387 (13%)
Query: 19 TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
T G Y + IG+PP Y VDTGSD++WV C+ C C + Y+PA S
Sbjct: 79 TDTGLYYTRIEIGSPPK-GYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSG 137
Query: 74 SYKELSCQSEQCHLLDTV-----SC-SSQQLCNYTYGYADSSLTKGVLATERITF----G 123
+ + C+ E C + ++ +C S+ C + Y D S T G T+ + + G
Sbjct: 138 T--TVGCEQEFC-VANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSG 194
Query: 124 NSNNFFDN--VVFGCGHN---NTGVFNENEMGLVGLGRTRLSLASQILSQLGANK-FSYC 177
N N + FGCG + G N+ G++G G++ S+ SQ+ + K F++C
Sbjct: 195 NGQTTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHC 254
Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLI 237
L ++ F G+ V V +T LV + T+Y V L+GISVG ++ +
Sbjct: 255 L------DTVRGGGIFAIGNVVQ-PKVKTTPLV--PNVTHYNVNLQGISVG---GATLQL 302
Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTP---YQDPRLGSQLCY 294
P SKG + ID+G LP++ Y L V + + P YQD +C+
Sbjct: 303 PTSTFDSGDSKGTI-IDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQD-----FVCF 356
Query: 295 K-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF-----AMQPIDG-DVGIF 347
+ + S+ P++T F G + ++ ++ ++C +Q DG D+ +
Sbjct: 357 QFSGSIDDGFPVITFSFKGDLTLN-VYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLL 415
Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDCT 374
G+ S+ + YD + +++ + +C+
Sbjct: 416 GDLVLSNKLVVYDLEKEVIGWTDYNCS 442
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 82/280 (29%), Positives = 139/280 (49%), Gaps = 29/280 (10%)
Query: 12 VVQSNVSTANGE-------YVMKFSIGTP--PLLDIYGIVDTGSDLMWVQCLPCVQCYKQ 62
V +S+V A+G Y+++ +IGTP P+L +DT +D W+ C CV C
Sbjct: 69 VRKSSVPIASGRAIVQSPTYIVRANIGTPAQPMLVA---LDTSNDAAWIPCSGCVGCSSS 125
Query: 63 VKPIYNPASSSSYKELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF 122
V +++P+ SSS + L C++ QC SC+ + C + Y S++ + L + +T
Sbjct: 126 V--LFDPSKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYGGSTI-EAYLTQDTLTL 182
Query: 123 GNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFH 182
+++ N FGC + +G + GL+GLGR LSL SQ L + FSYCL P
Sbjct: 183 --ASDVIPNYTFGCINKASGT-SLPAQGLMGLGRGPLSLISQS-QNLYQSTFSYCL-PNS 237
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGN--LSNSSKLIPY 239
S+ + + G ++ + +T L+ + + Y+V L GI VGN + + + +
Sbjct: 238 KSSNFSGSLRLGPKNQPI--RIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAF 295
Query: 240 YNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK 279
++GA G +F D+G T L + Y + + R +K
Sbjct: 296 DPATGA---GTIF-DSGTVYTRLVEPAYVAVRNEFRRRVK 331
>gi|255685714|gb|ACU28346.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
Length = 91
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 60/101 (59%), Gaps = 13/101 (12%)
Query: 26 MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQC 85
MK IGTPP +I ++DTGS+L+W QCLPC+ CY Q PI++P+ SS++KE C
Sbjct: 1 MKLQIGTPPF-EIEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFKETRCN---- 55
Query: 86 HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
+ C+Y Y D S T+G LATE +T +++
Sbjct: 56 --------TPDHSCSYKIVYDDKSYTQGTLATETVTIHSTS 88
>gi|125553836|gb|EAY99441.1| hypothetical protein OsI_21409 [Oryza sativa Indica Group]
Length = 376
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 48/145 (33%), Positives = 76/145 (52%), Gaps = 7/145 (4%)
Query: 41 IVDTGSDLMWVQCLPC--VQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD--TVSCSSQ 96
I+D+GSD+ WVQC PC + C+ Q P+++PA+S++Y + C S C L CS+
Sbjct: 164 IIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYSAVPCSSAACARLGPYRRGCSAN 223
Query: 97 QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTG-VFNENEMGLVGLG 155
C + + Y D + G +++ +T G + +FGC H + G F+ + G + LG
Sbjct: 224 VQCQFGFTYTDGATATGTYSSDDLTLG-PYDVVRGFLFGCAHADRGSTFSFDVSGTLALG 282
Query: 156 RTRLSLASQILSQLGANKFSYCLVP 180
S Q +Q G FSYC+ P
Sbjct: 283 GGAQSFVQQTATQYG-RVFSYCIPP 306
>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
Length = 557
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 96/385 (24%), Positives = 155/385 (40%), Gaps = 52/385 (13%)
Query: 21 NGEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSSY- 75
+G+Y +G PP LD VDTGSDL W+QC PC C K P+Y P
Sbjct: 184 DGQYYTSIFVGNPPRPYFLD----VDTGSDLTWIQCDAPCTNCAKGPHPLYKPTKEKIVP 239
Query: 76 -KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV-- 132
++L CQ Q + C + + C+Y YAD S + GVLA + + +N + +
Sbjct: 240 PRDLLCQELQG---NQNYCETCKQCDYEIEYADQSSSMGVLARDDMHLIATNGGREKLDF 296
Query: 133 VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSIT 188
VFGC ++ G + G++GL +SL SQ+ S + +N F +C+ +
Sbjct: 297 VFGCAYDQQGQLLSSPAKTDGILGLSNAAISLPSQLASHGIISNIFGHCIT---REQGGG 353
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
M+ G+ V G+ TS+ S D Y+ ++ + + +G +
Sbjct: 354 GYMFLGD-DYVPRWGITWTSIRSGPDNLYH------TEAHHVKYGDQQLRMREQAGNTVQ 406
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT-------PSMAG 301
+ D+G+ T LP + Y L ++ A LC+K +
Sbjct: 407 --VIFDSGSSYTYLPDEIYENLVAAIKYASPGFVQDSSDRTLPLCWKADFPVRYLEDVKQ 464
Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPVE-------GVFCFAM----QPIDGDVGIFGNF 350
L HF K L + TF P + G C + + G I G+
Sbjct: 465 FFKPLNLHF---GKKWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDV 521
Query: 351 AQSDLFIGYDFDSQMVSFKPTDCTK 375
+ + YD + + + +DCTK
Sbjct: 522 SLRGKLVVYDNQRRQIGWTNSDCTK 546
>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 491
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 96/385 (24%), Positives = 155/385 (40%), Gaps = 56/385 (14%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
G Y K +G P + +DTGSD++WV C PC C +++ SSS +
Sbjct: 82 GLYFTKVKLGNPAR-EFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSAR 140
Query: 77 ELSCQSEQCHLLDTVS--CSSQ-QLCNYTYGYADSSLTKGVLATERITF----GNSN--N 127
L C C + T + C +Q C+Y++ Y D S T G T+ + F G S N
Sbjct: 141 VLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIAN 200
Query: 128 FFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHT 183
+VFGC G G+ G G+ S+ SQ+ S+ + FS+CL
Sbjct: 201 SSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCL----- 255
Query: 184 DSSITSKMYFGNGSEVSGGGVVSTSLVSKE--------DKTYYFVTLEGISV-GNLSNSS 234
G E GG +V ++ + +Y + L+ I++ G L +
Sbjct: 256 -----------KGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFPNP 304
Query: 235 KLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCY 294
+ P N+ G ID+G L ++ Y+ + + +A+ + GSQ
Sbjct: 305 TMFPISNA------GETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQCFR 358
Query: 295 KTPSMAGIAPILTAHFDGGAKVP------LIHTSTFIPPPVEGVFCFAMQPIDGDVGIFG 348
+ S+A I P+L +F+G A + L S ++C Q + + I G
Sbjct: 359 VSMSVADIFPVLRFNFEGIASMVVTPEEYLQFDSIVSCYKFASLWCIGFQKAEDGLNILG 418
Query: 349 NFAQSDLFIGYDFDSQMVSFKPTDC 373
+ D I YD Q + + DC
Sbjct: 419 DLVLKDKIIVYDLAQQRIGWANYDC 443
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 96/394 (24%), Positives = 165/394 (41%), Gaps = 73/394 (18%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
G Y K IGTP D Y VDTGSD+MWV C+ C +C K +YN S + K
Sbjct: 76 GLYYAKIGIGTPTK-DYYVQVDTGSDIMWVNCIQCRECPKTSSLGIDLTLYNINESDTGK 134
Query: 77 ELSCQSEQCHLLD---TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD--- 130
+ C E C+ ++ C++ C Y Y D S T G + + + +
Sbjct: 135 LVPCDQEFCYEINGGQLPGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLKTTA 194
Query: 131 ---NVVFGCGHNNTGVF---NENEM-GLVGLGRTRLSLASQILSQLGANK-FSYCLVPFH 182
+V+FGCG +G NE + G++G G++ S+ SQ+ K F++CL
Sbjct: 195 ANGSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCL---- 250
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGN--LS 231
+GGG+ V + ++ +Y V + + VG+ LS
Sbjct: 251 --------------DGTNGGGIFVIGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGHEFLS 296
Query: 232 NSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRL 288
+ + + GAI ID+G LP+ Y L ++ + +K+ +D
Sbjct: 297 LPTDVFEAGDRKGAI------IDSGTTLAYLPEMVYKPLVSKIISQQPDLKVHTVRD--- 347
Query: 289 GSQLCYK-TPSMAGIAPILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQ------PI 340
C++ + S+ P +T HF+ + + H F P EG++C Q
Sbjct: 348 -EYTCFQYSDSLDDGFPNVTFHFENSVILKVYPHEYLF---PFEGLWCIGWQNSGVQSRD 403
Query: 341 DGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
++ + G+ S+ + YD ++Q + + +C+
Sbjct: 404 RRNMTLLGDLVLSNKLVLYDLENQAIGWTEYNCS 437
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 103/385 (26%), Positives = 170/385 (44%), Gaps = 50/385 (12%)
Query: 19 TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
T G Y + IGTP Y VDTGSD++WV C+ C +C ++ +Y+P SS
Sbjct: 84 TDTGLYYTEIGIGTPTK-RYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSS 142
Query: 74 SYKELSCQSEQCH-----LLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF------ 122
+ ++SC C LL C++ C Y+ Y D S T G ++ + F
Sbjct: 143 TGSKVSCDQGFCAATYGGLLP--GCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGD 200
Query: 123 GNSNNFFDNVVFGCGHNNTGVF---NENEMGLVGLGRTRLSLASQILSQLGANK--FSYC 177
G + V FGCG G N+ G++G G++ S+ SQ LS G K F++C
Sbjct: 201 GQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQ-LSAAGKVKKIFAHC 259
Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLI 237
L +I F G+ V V +T LV + +Y V L+ I VG + KL
Sbjct: 260 L------DTINGGGIFAIGNVVQ-PKVKTTPLV--PNMPHYNVNLKSIDVG--GTALKLP 308
Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK-T 296
+ +G K ID+G T LP+ Y + V K + + + LC++
Sbjct: 309 SHMFDTG--EKKGTIIDSGTTLTYLPEIVYKEIMLAVFAKHKDITFHNVQ--EFLCFQYV 364
Query: 297 PSMAGIAPILTAHFDGGAKVPL-IHTSTFIPPPVEGVFCF-----AMQPIDGD-VGIFGN 349
+ P +T HF+ +PL ++ + + ++C +Q DG + + G+
Sbjct: 365 GRVDDDFPKITFHFEN--DLPLNVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGD 422
Query: 350 FAQSDLFIGYDFDSQMVSFKPTDCT 374
S+ + YD ++Q++ + +C+
Sbjct: 423 LVLSNKLVVYDLENQVIGWTEYNCS 447
>gi|340810931|gb|AEK75392.1| S5 [Oryza sativa]
gi|340810983|gb|AEK75418.1| S5 [Oryza nivara]
gi|340810985|gb|AEK75419.1| S5 [Oryza nivara]
gi|340810997|gb|AEK75425.1| S5 [Oryza nivara]
gi|340811011|gb|AEK75432.1| S5 [Oryza nivara]
gi|340811013|gb|AEK75433.1| S5 [Oryza nivara]
gi|340811041|gb|AEK75447.1| S5 [Oryza nivara]
gi|340811043|gb|AEK75448.1| S5 [Oryza nivara]
Length = 474
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 102/382 (26%), Positives = 160/382 (41%), Gaps = 55/382 (14%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELS 79
++M S+G PP++++ I DTGS L WVQC PC V C+ Q PI++P S + + +
Sbjct: 116 FLMAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVR 174
Query: 80 CQSEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDN 131
C S +C L +C ++ C Y+ Y + + + G + T+ + G+S F +
Sbjct: 175 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMD 231
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSIT 188
++FGC + ++E E G+ G G + S Q+ L FSYCL TD +
Sbjct: 232 LMFGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL---PTDETKP 286
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
M G + G T L ++ Y +T+E + ++N +L+ S
Sbjct: 287 GYMILGRYDRAAMDGGY-TPLFRSINRPTYSLTMEML----IANGQRLV--------TSS 333
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK---------- 295
M +D+GA T L + L++ + A+ Y R S +CY
Sbjct: 334 SEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNG 393
Query: 296 --TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFA 351
TP S P L F GGA + L + F P G+ FA P I GN
Sbjct: 394 TITPFSNWSALPPLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRV 452
Query: 352 QSDLFIGYDFDSQMVSFKPTDC 373
+D + FK C
Sbjct: 453 TRSFGTTFDIQGKQFGFKYAAC 474
>gi|326520109|dbj|BAK03979.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 94/361 (26%), Positives = 155/361 (42%), Gaps = 35/361 (9%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSE 83
YV + +GTP +VDT S L WV C PC+ + P +NP +SS+YK + C S
Sbjct: 126 YVTQVQLGTPAKTHNV-LVDTASSLSWVGCEPCIN--ACLIPTFNPNASSTYKVVGCGSA 182
Query: 84 QCHLLDTVSCSSQQL------CNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
C+ + + + + + C+Y Y D SL+ GV++++ +T+G + F +FGC
Sbjct: 183 LCNAVPSATMARKSCMAPTEGCSYRQSYHDYSLSVGVVSSDTLTYGLGSQKF---IFGCC 239
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
+ GV G++G+ + SL SQ+ SYC P + + FG
Sbjct: 240 NLFRGVGGRYS-GILGMSVNKFSLFSQMTVGHRYRAMSYCF-PHPRNQGF---LQFGRYD 294
Query: 198 EVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGA 257
E + + D YFV + + V +S + SSG + F DTG
Sbjct: 295 EHKSLLRFTPLYI---DGNNYFVHVSNVMVETMSLDVQ------SSGNQTM-RCFFDTGT 344
Query: 258 PPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPS--MAG--IAPILTAHFDGG 313
P T+LP+ + L + V N ++ Y+ Q C++ + G P + F G
Sbjct: 345 PYTMLPQSLFVSLSDTVGNLVE-GYYRVGASTGQTCFQADGNWIEGDLYMPTVKIEFQNG 403
Query: 314 AKVPLIHTS-TFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
A++ L F+ P VFC A + DG + G+ + D + + +
Sbjct: 404 ARITLNSEDLMFMEEP--NVFCLAFKMNDGGDIVLGSRHLMGVHTVVDLEMMTMGLRGQG 461
Query: 373 C 373
C
Sbjct: 462 C 462
>gi|255685712|gb|ACU28345.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
Length = 91
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 59/101 (58%), Gaps = 13/101 (12%)
Query: 26 MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQC 85
MK IGTPP +I ++DTGS+L+W QCLPC+ CY Q PI++P+ SS++KE C
Sbjct: 1 MKLQIGTPPF-EIEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFKETRCN---- 55
Query: 86 HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
+ C Y Y D S T+G LATE +T +++
Sbjct: 56 --------TPDHSCXYKIVYDDKSYTQGTLATETVTIHSTS 88
>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
Length = 451
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 94/363 (25%), Positives = 147/363 (40%), Gaps = 29/363 (7%)
Query: 23 EYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQS 82
YV + +GTP + I D +D WV C P ++P SS+Y+ + C +
Sbjct: 106 SYVARARLGTPAQALLVAI-DPSNDAAWVPCA--ACAGCARAPSFDPTRSSTYRPVRCGA 162
Query: 83 EQCHLLDTVSCSS--QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNN 140
QC SC C + YA S+ + +L + + + + FGC H
Sbjct: 163 PQCSQAPAPSCPGGLGSSCAFNLSYAASTF-QALLGQDALALHDDVDAVAAYTFGCLHVV 221
Query: 141 TGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVS 200
TG + GLVG GR LS SQ G + FSYCL P + S+ + + G +
Sbjct: 222 TG-GSVPPQGLVGFGRGPLSFPSQTKDVYG-SVFSYCL-PSYKSSNFSGTLRLGPAGQPK 278
Query: 201 GGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAI---SKGNMFIDTG 256
+ +T L+S + + Y+V + GI VG + +P S+ A S +D G
Sbjct: 279 --RIKTTPLLSNPHRPSLYYVNMVGIRVGG-----RPVPVPASALAFDPTSGRGTIVDAG 331
Query: 257 APPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKV 316
T L Y + + R+ ++ P P G CY + P +T FDG V
Sbjct: 332 TMFTRLSAPVYAAVRDVFRSRVR-APVAGPLGGFDTCYN---VTISVPTVTFSFDGRVSV 387
Query: 317 PLIHTSTFIPPPVEGVFCFAMQP-----IDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPT 371
L + I G+ C AM +D + + + Q + + +D + V F
Sbjct: 388 TLPEENVVIRSSSGGIACLAMAAGPPDGVDAALNVLASMQQQNHRVLFDVANGRVGFSRE 447
Query: 372 DCT 374
CT
Sbjct: 448 LCT 450
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 100/389 (25%), Positives = 167/389 (42%), Gaps = 56/389 (14%)
Query: 19 TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
T+NG Y K +G D Y VDTGSD +WV C+ C C K+ +Y+P S
Sbjct: 71 TSNGLYYTKIGLGPK---DYYVQVDTGSDTLWVNCVGCTACPKKSGLGMDLTLYDPNLSK 127
Query: 74 SYKELSCQSEQC---HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GNSN 126
+ K + C E C + C+ C Y+ Y D S T G + +TF G+
Sbjct: 128 TSKAVPCDDEFCTSTYDGQISGCTKGMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLR 187
Query: 127 NFFDN--VVFGCGHNNTGVFNENE----MGLVGLGRTRLSLASQILSQLGANK-FSYCLV 179
DN V+FGCG +G + G++G G+ S+ SQ+ + + FS+CL
Sbjct: 188 TVPDNTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCL- 246
Query: 180 PFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGN--LSNSSKLI 237
SI+ F G EV V +T L+ + +Y V L+ I V + S ++
Sbjct: 247 -----DSISGGGIFAIG-EVVQPKVKTTPLL--QGMAHYNVVLKDIEVAGDPIQLPSDIL 298
Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQV---RNAIKLTPYQDPRLGSQLCY 294
+ G I ID+G LP Y++L E++ R+ +KL +D C+
Sbjct: 299 DSSSGRGTI------IDSGTTLAYLPVSIYDQLLEKILAQRSGMKLYLVEDQF----TCF 348
Query: 295 ---KTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQP-----IDG-DVG 345
S+ + P + F+ G + + ++ E ++C Q DG ++
Sbjct: 349 HYSDEESVDDLFPTVKFTFEEGLTLT-TYPRDYLFLFKEDMWCVGWQKSMAQTKDGKELI 407
Query: 346 IFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ G+ ++ + YD D+ + + +C+
Sbjct: 408 LLGDLVLANKLVVYDLDNMAIGWADYNCS 436
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 96/367 (26%), Positives = 163/367 (44%), Gaps = 36/367 (9%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
G YV++ +GTPP L ++ ++DT +D +W+ C C C +N SSS+Y +SC
Sbjct: 103 GNYVVRARLGTPPQL-MFMVLDTSNDAVWLPCSGCSGC-SNASTSFNTNSSSTYSTVSCS 160
Query: 82 SEQCHLLDTVSCSSQ----QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
+ QC ++C S +C++ Y S L + +T S + N FGC
Sbjct: 161 TTQCTQARGLTCPSSTPQPSICSFNQSYGGDSSFSANLVQDTLTL--SPDVIPNFSFGCI 218
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGS 197
++ +G + GL+GLGR +SL SQ S L + FSYCL F + YF
Sbjct: 219 NSASG-NSLPPQGLMGLGRGPMSLVSQTTS-LYSGVFSYCLPSFRS-------FYFSGSL 269
Query: 198 EVSGGG----VVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMF 252
++ G + T L+ + + Y+V L G+SVG++ + P Y + + S
Sbjct: 270 KLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSV--QVPVDPVYLTFDSNSGAGTI 327
Query: 253 IDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGS-QLCYKTPSMAGIAPILTAHFD 311
ID+G T + Y + ++ R + + LG+ C+ + + P +T H
Sbjct: 328 IDSGTVITRFAQPVYEAIRDEFRKQVNGS---FSTLGAFDTCFSADN-ENVTPKITLHMT 383
Query: 312 G-GAKVPLIHTSTFIPPPVEGVFCFAM----QPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
K+P+ +T I + C +M Q + + + N Q +L I +D + +
Sbjct: 384 SLDLKLPM--ENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRI 441
Query: 367 SFKPTDC 373
P C
Sbjct: 442 GIAPEPC 448
>gi|326490700|dbj|BAJ90017.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326493830|dbj|BAJ85377.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 459
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 98/357 (27%), Positives = 145/357 (40%), Gaps = 51/357 (14%)
Query: 37 DIYGIVDTGSDLMWV-QCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLL---DTVS 92
D+ G+VD D +W QC+ P+ + C S+ C L DT
Sbjct: 102 DVSGVVDVLDDFVWTTQCV--------AAPV----------RVQCASQTCRSLLANDTTD 143
Query: 93 C-----SSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCGHNNTGVFNEN 147
S C+Y YA S T G LA E + G+ F + GC N+
Sbjct: 144 ACGGNPSGDDTCSYVNVYAPGSNTTGFLANETVAVGS---FVGAAILGCSAANSTGPLVG 200
Query: 148 EMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT-SKMYFGNGS--EVSGGGV 204
E+G G R LSL +SQL +KFSY L P SS + S + G+ + + GGG
Sbjct: 201 EVGSFGFNRGALSL----VSQLSVSKFSYYLAPDEAGSSDSESVVLLGDAAVPQTRGGGR 256
Query: 205 VSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPK 264
+ L S Y+V L I V + S ++ + S G + + T P T L +
Sbjct: 257 STPLLRSTAFPDVYYVKLSAIQVDGQALSGIPAGAFDLAADGSSGGVVMGTLYPITRLQE 316
Query: 265 DFYNRLEEQVRNAIKLTPYQDPRLGS---QLCYKTPSMAGIA-PILTAHFDGG---AKVP 317
D YN + + + + I LCY S+A + P +T FDGG A +
Sbjct: 317 DAYNAVRQALVSKINAQEVNGSAFAGGVFDLCYDAQSVATLTFPKITLVFDGGNAPATLE 376
Query: 318 LIHTSTFIPPPVEGVFCFAMQPIDGDVG-----IFGNFAQSDLFIGYDFDSQMVSFK 369
L F V G+ CF M P+ VG + G+ Q+ + YD + ++ +
Sbjct: 377 LTTVHYFFKDNVTGLQCFTMLPM--PVGTPFGSVLGSMVQAGTNMIYDVGGETLTLE 431
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 104/389 (26%), Positives = 159/389 (40%), Gaps = 56/389 (14%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI--YNPASSSSYKEL 78
N + ++GTPP ++ ++DTGS+L W+ C P + + P +S ++ +
Sbjct: 63 NVSLTVSLAVGTPPQ-NVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASV 121
Query: 79 SCQSEQCHLLDTVS---CS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
C S QC D S C + + C + YAD S + G LATE T G F
Sbjct: 122 PCDSAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPL--RAAF 179
Query: 135 GCGHN--NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GC +T GL+G+ R LS +SQ +FSYC+ +D +
Sbjct: 180 GCMATAFDTSPDGVATAGLLGMNRGALSF----VSQASTRRFSYCI----SDRDDAGVLL 231
Query: 193 FGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLS---NSSKLIPYYNSS 243
G+ S++ + T L D+ Y V L GI VG +S L P + +
Sbjct: 232 LGH-SDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAP--DHT 288
Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCYKTP 297
GA G +D+G T L D Y+ L+ + K L DP Q C++ P
Sbjct: 289 GA---GQTMVDSGTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVP 345
Query: 298 ---SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVE-----GVFCFA-----MQPIDGDV 344
+ P +T F+ GA++ + P E GV+C M PI V
Sbjct: 346 QGRAPPARLPAVTLLFN-GAQMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYV 404
Query: 345 GIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
G+ Q ++++ YD + V P C
Sbjct: 405 --IGHHHQMNVWVEYDLERGRVGLAPIRC 431
>gi|222632756|gb|EEE64888.1| hypothetical protein OsJ_19747 [Oryza sativa Japonica Group]
Length = 384
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 80/276 (28%), Positives = 127/276 (46%), Gaps = 29/276 (10%)
Query: 42 VDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELSCQSEQCHLL------DTV 91
+DTGS L WVQC PC ++C+ Q V PI++P++SS+++ + C + C L +
Sbjct: 1 MDTGSSLSWVQCRPCTIKCHVQPAKVGPIFDPSNSSTFRHVGCSTSICSYLGRTLRIQSK 60
Query: 92 SCSS-QQLCNYTYGYADS-SLTKGVLATERITFGNSNNF-----FDNVVFGCGHNNTGVF 144
+C + +C YT Y + + G T+R+ G N VFGC +T
Sbjct: 61 ACMEWEDICLYTMSYGGGWAYSVGKAVTDRLVLGGGETTRTTLSLANFVFGCSM-DTQYS 119
Query: 145 NENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFGNGSEVSGGGV 204
E G+ GLG + S QI L FSYCL S + Y G + SGG
Sbjct: 120 THKEAGIFGLGTSNYSF-EQIAPLLSYKAFSYCL-----PSDEAHQGYLSIGPDSSGG-- 171
Query: 205 VSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPK 264
V TS+ + Y + + G++V ++ + + + S M +D+GA TLL
Sbjct: 172 VPTSMFPGTPRPVYSIGMTGLTV-TVNGEVRSLVSGSGSSPSPSSLMVVDSGAKLTLLLA 230
Query: 265 DFYNRLEEQVRNAIKLTPYQDPRLG--SQLCYKTPS 298
+ +LE+ + A++ Y +QLC+ T S
Sbjct: 231 STFGQLEDAIIPAMESLGYSLNTAAGQNQLCFLTES 266
>gi|300681439|emb|CBH32531.1| hypothetical protein TAA_ctg0091b.00060.1 [Triticum aestivum]
Length = 426
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 99/365 (27%), Positives = 152/365 (41%), Gaps = 40/365 (10%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQ 81
G V K S+G + G+VD +D +W QC P+ SS + E+ C
Sbjct: 74 GLVVYKISVGVAEEV-FSGVVDVATDFIWAQC-----------PV-----SSDFTEVFCF 116
Query: 82 SEQCHLL----DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVFGCG 137
S+ C L D S+ C Y Y Y T G ++ E +T + +FGC
Sbjct: 117 SQTCQLALDEEDACGNSTSFTCPYAYQYGPGISTTGYISAEEVT-AVGTHITGRALFGCS 175
Query: 138 HNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSIT-SKMYFGNG 196
+T V + E G++G R SL LSQL ++FSY ++P D + S + G+
Sbjct: 176 LAST-VPLDGESGVLGFSRGPYSL----LSQLKISRFSYFMLPDDADKPDSESVLLLGDD 230
Query: 197 SEVSGGGVVSTSLVSKED-KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDT 255
+ ST L+ E Y+V L GI V + S S ++ + G + + T
Sbjct: 231 AVPQTNSSRSTPLLRNEAYPDLYYVKLTGIKVDDKSLSGIPAGTFDLAANGCSGGVVMST 290
Query: 256 GAPPTLLPKDFYNRLEEQVRNAIK---LTPYQDPRLGSQLCYKTPSMAGIA-PILTAHFD 311
+P T L YN L + + IK + P D +LCY S+A + P +T F
Sbjct: 291 LSPITYLQPAAYNALTRALASKIKSQPVRPKADDVADLRLCYNIQSVANLTFPKITLVFH 350
Query: 312 G--GAKVPLIHTST--FIPPPVEGVFCFAMQPIDGD---VGIFGNFAQSDLFIGYDFDSQ 364
G G P+ T+ FI G+ C M P + G+ Q+ + YD
Sbjct: 351 GVDGRPAPMELTTAHYFIRENSTGLQCLTMLPTPAGSPVSSVLGSLLQTGTHMIYDLRGG 410
Query: 365 MVSFK 369
++F+
Sbjct: 411 SLTFE 415
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 99/393 (25%), Positives = 160/393 (40%), Gaps = 72/393 (18%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
G Y K IGTP D Y VDTGSD++WV C C +C + +Y+ +S++
Sbjct: 153 GLYFAKIGIGTPSK-DYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSD 211
Query: 77 ELSCQSEQCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD---- 130
+ C C L D C C Y+ Y D S T G + + + + F
Sbjct: 212 AVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPT 271
Query: 131 --NVVFGCGHNNTGVF---NENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTD 184
VVFGCG+ +G +E G++G G+ S+ SQ+ S K FS+CL
Sbjct: 272 NGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL------ 325
Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGNLSNSSK 235
V GGG+ + V + +++ +Y V ++ I VG
Sbjct: 326 ------------DNVDGGGIFAIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGG------ 367
Query: 236 LIPYYNSSGAISKGNM---FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ- 291
P S A G+ ID+G P++ Y L E++ L+ D RL +
Sbjct: 368 -DPLDVPSDAFESGDRKGTIIDSGTTLAYFPQEVYVPLIEKI-----LSQQPDLRLHTVE 421
Query: 292 ---LCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM-----QPIDG 342
C+ T ++ P +T HFD + ++ ++ E +C Q DG
Sbjct: 422 QAFTCFDYTGNVDDGFPTVTLHFDKSISLT-VYPHEYLFQVKEFEWCIGWQNSGAQTKDG 480
Query: 343 -DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
D+ + G+ S+ + YD + Q + + +C+
Sbjct: 481 KDLTLLGDLVLSNKLVVYDLEKQGIGWVEYNCS 513
>gi|42565826|ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645261|gb|AEE78782.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 528
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 93/374 (24%), Positives = 160/374 (42%), Gaps = 47/374 (12%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI----------YNPASSS 73
Y S+GTPP + + DTGSDL W+ C C + ++ I Y P +S+
Sbjct: 102 YYANVSVGTPPSSFLVAL-DTGSDLFWLPCNCGTTCIRDLEDIGVPQSVPLNLYTPNAST 160
Query: 74 SYKELSCQSEQCHLLDTVSCSS-QQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD-- 130
+ + C ++C + CSS +C Y Y++S+ TKG L + + +
Sbjct: 161 TSSSIRCSDKRC--FGSKKCSSPSSICPYQISYSNSTGTKGTLLQDVLHLATEDENLTPV 218
Query: 131 --NVVFGCGHNNTGVFNENEM--GLVGLGRTRLSLASQIL-SQLGANKFSYCLVPFHTDS 185
NV GCG TG+F N G++GLG S+ S + + + AN FS C F
Sbjct: 219 KANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFSMC---FGRVI 275
Query: 186 SITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGA 245
++ FG+ T +S T Y V + G+SV +L +++
Sbjct: 276 GNVGRISFGDRGYTD---QEETPFISVAPSTAYGVNISGVSVAGDPVDIRLFAKFDT--- 329
Query: 246 ISKGNMFIDTGAPP-TLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK-TPSMAGIA 303
G+ F P +L K F +E++ R DP L + CY +P+ I
Sbjct: 330 ---GSSFTHLREPAYGVLTKSFDELVEDRRRPV-------DPELPFEFCYDLSPNATTIQ 379
Query: 304 -PILTAHFDGGAKVPLIHTSTFIPPPVEG--VFCFA-MQPIDGDVGIFGNFAQSDLFIGY 359
P++ F GG+K+ +++ F EG ++C ++ + + + G + I +
Sbjct: 380 FPLVEMTFIGGSKI-ILNNPFFTARTQEGNVMYCLGVLKSVGLKINVIGQNFVAGYRIVF 438
Query: 360 DFDSQMVSFKPTDC 373
D + ++ +K + C
Sbjct: 439 DRERMILGWKQSLC 452
>gi|340811098|gb|AEK75475.1| S5 [Oryza nivara]
Length = 357
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 158/380 (41%), Gaps = 55/380 (14%)
Query: 26 MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELSCQ 81
M S+G PP++++ I DTGS L WVQC PC V C+ Q PI++P S + + + C
Sbjct: 1 MAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59
Query: 82 SEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDNVV 133
S +C L +C ++ C Y+ Y + + + G + T+ + G+S F +++
Sbjct: 60 SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMDLM 116
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSITSK 190
FGC + ++E E G+ G G + S Q+ L FSYCL TD +
Sbjct: 117 FGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL---PTDETKPGY 171
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
M G + G T L ++ Y +T E + ++N +L+ S
Sbjct: 172 MILGRYDRAAMDGGY-TPLFRSINRPTYSLTTEML----IANGQRLV--------TSSSE 218
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK------------ 295
M +D+GA T L + L++ + A+ Y R S +CY
Sbjct: 219 MIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTI 278
Query: 296 TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQS 353
TP S P+L F GGA + L + F P G+ FA P I GN
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRVTR 337
Query: 354 DLFIGYDFDSQMVSFKPTDC 373
+D + FK C
Sbjct: 338 SFGTTFDIQGKQFGFKYAAC 357
>gi|340810945|gb|AEK75399.1| S5 [Oryza sativa]
gi|340810957|gb|AEK75405.1| S5 [Oryza sativa]
gi|340811007|gb|AEK75430.1| S5 [Oryza nivara]
gi|340811073|gb|AEK75463.1| S5 [Oryza rufipogon]
gi|340811094|gb|AEK75473.1| S5 [Oryza rufipogon]
Length = 357
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 158/380 (41%), Gaps = 55/380 (14%)
Query: 26 MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELSCQ 81
M S+G PP++++ I DTGS L WVQC PC V C+ Q PI++P S + + + C
Sbjct: 1 MAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59
Query: 82 SEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDNVV 133
S +C L +C ++ C Y+ Y + + + G + T+ + G+S F +++
Sbjct: 60 SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMDLM 116
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSITSK 190
FGC + ++E E G+ G G + S Q+ L FSYCL TD +
Sbjct: 117 FGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL---PTDETKPGY 171
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
M G + G T L ++ Y +T E + ++N +L+ S
Sbjct: 172 MILGRYDRAAMDGGY-TPLFRSINRPTYSLTTEML----IANGQRLV--------TSSSE 218
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK------------ 295
M +D+GA T L + L++ + A+ Y R S +CY
Sbjct: 219 MIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTI 278
Query: 296 TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQS 353
TP S P+L F GGA + L + F P G+ FA P I GN
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRVTR 337
Query: 354 DLFIGYDFDSQMVSFKPTDC 373
+D + FK C
Sbjct: 338 SFGTTFDIQGKQFGFKYAAC 357
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 101/375 (26%), Positives = 160/375 (42%), Gaps = 43/375 (11%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-YKQV--KPIYNPASSSSYKE 77
G Y + IGTP + IVDTGS + +V C C C + Q P + P +SSSY+
Sbjct: 96 KGYYTSRVFIGTPAQ-EFALIVDTGSTVTYVPCSSCTHCGHHQACFDPRFKPDNSSSYQT 154
Query: 78 LSCQSEQCHLLDTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFGNSNNFFDN-VVFG 135
+SC S C T C ++ C Y YA+ S +KGVL + + FGN + + ++FG
Sbjct: 155 VSCNSPDC---ITKMCDARVHQCKYERVYAEMSSSKGVLGKDLLGFGNGSRLQPHPLLFG 211
Query: 136 CGHNNTG-VFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMYFG 194
C TG ++ ++ G++GLGR LS+ Q++ GA + S+ L D G
Sbjct: 212 CETAETGDLYLQHADGIMGLGRGPLSIVDQLVGT-GAMEDSFSLCYGGMDE--------G 262
Query: 195 NGSEVSGGGVVSTSLV-SKED---KTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
GS V G ++V +K D YY + L I V +S N + G
Sbjct: 263 GGSMVLGAIPPPPAMVFAKSDPNRSNYYNLELSEIQVQGVS--------LNVPSEVFNGR 314
Query: 251 M--FIDTGAPPTLLPKDFYNRLEE---QVRNAIKLTPYQDPRLGSQLCY-----KTPSMA 300
+ +D+G LP ++ ++ Q +++ P DP +C+ + ++
Sbjct: 315 LGTVLDSGTTYAYLPDKAFDAFKDAITQQLGSLQAVPGPDPSY-PDVCFAGAGSDSKALG 373
Query: 301 GIAPILTAHFDGGAKVPLI-HTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGY 359
P + F G KV L F V G +C + G + + Y
Sbjct: 374 KHFPPVDFVFSGNQKVFLAPENYLFKHTKVPGAYCLGFFKNQDATTLLGGIVVRNTLVTY 433
Query: 360 DFDSQMVSFKPTDCT 374
D + + F T+CT
Sbjct: 434 DRANHQIGFFKTNCT 448
>gi|255685716|gb|ACU28347.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
gi|255685726|gb|ACU28352.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
gi|255685728|gb|ACU28353.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
Length = 91
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 42/101 (41%), Positives = 59/101 (58%), Gaps = 13/101 (12%)
Query: 26 MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQC 85
MK IGTPP +I ++DTGS+L+W QCLPC+ CY Q PI++P+ SS++KE C
Sbjct: 1 MKLQIGTPPF-EIEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFKETRCN---- 55
Query: 86 HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSN 126
+ C Y Y D S T+G LATE +T +++
Sbjct: 56 --------TPDHSCPYKIVYDDKSYTQGTLATETVTIHSTS 88
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 95/378 (25%), Positives = 159/378 (42%), Gaps = 43/378 (11%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYKEL 78
Y + +G+PP + + +DTGSD++WV C PC C +NP +SS+ ++
Sbjct: 117 YFTRVKLGSPPK-EYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKI 175
Query: 79 SCQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERITF----GNSN--N 127
C ++C S S C YT+ Y D S T G ++ + F GN N
Sbjct: 176 PCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTAN 235
Query: 128 FFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK--FSYCLVPFH 182
++VFGC ++ +G + + G+ G G+ +LS+ SQ L+ LG + FS+CL
Sbjct: 236 SSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQ-LNSLGVSPKVFSHCLKGSD 294
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVG--NLSNSSKLIPYY 240
I + G E+ G+V T LV + +Y + LE I V L S L
Sbjct: 295 NGGGI---LVLG---EIVEPGLVYTPLVPSQ--PHYNLNLESIVVNGQKLPIDSSLFTTS 346
Query: 241 NSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
N+ G I +D+G L Y+ + A+ + G+Q + S+
Sbjct: 347 NTQGTI------VDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVD 400
Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPPV---EGVFCFAMQPIDG-DVGIFGNFAQSDLF 356
P ++ +F GG + + + + ++C Q G + I G+ D
Sbjct: 401 SSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKI 460
Query: 357 IGYDFDSQMVSFKPTDCT 374
YD + + + DC+
Sbjct: 461 FVYDLANMRMGWTDYDCS 478
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 95/362 (26%), Positives = 149/362 (41%), Gaps = 41/362 (11%)
Query: 30 IGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD 89
IGTPP + IVDTGS + +V C C QC P + P S +Y + C D
Sbjct: 2 IGTPPQ-EFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCNP------D 54
Query: 90 TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHNNTG-VFNEN 147
+ C Y YA+ S + G+L + ++FGN + VFGC + TG +F+++
Sbjct: 55 CTCDTENDQCTYERQYAEMSSSSGILGEDLVSFGNMSELKPQRAVFGCENAETGDLFSQH 114
Query: 148 EMGLVGLGRTRLSLASQILSQLGAN-KFSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVS 206
G++GLGR LS+ Q++ + N FS C M G G+ V G
Sbjct: 115 ADGIMGLGRGDLSIVDQLVEKGVINDSFSLCY----------GGMEVGGGAMVLGQISPP 164
Query: 207 TSLV---SKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLL 262
+ +V S D++ YY + L G+ V KL N K +D+G L
Sbjct: 165 SDMVFSHSDPDRSPYYNIELRGLHVAG----KKLD--INPQVFDGKHGTILDSGTTYAYL 218
Query: 263 PKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCY-----KTPSMAGIAPILTAHFDGGA 314
P+ + + + + +K DP + +C+ + P + P + FD G
Sbjct: 219 PEAAFLPFIQAITSELHGLKQIRGPDPNY-NDVCFSGAGSEIPELYKTFPSVDMVFDNGE 277
Query: 315 KVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
K L F V G +C + Q + G + + YD + V F T+
Sbjct: 278 KYSLSPENYLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTN 337
Query: 373 CT 374
C+
Sbjct: 338 CS 339
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 95/362 (26%), Positives = 149/362 (41%), Gaps = 41/362 (11%)
Query: 30 IGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPIYNPASSSSYKELSCQSEQCHLLD 89
IGTPP + IVDTGS + +V C C QC P + P S +Y + C D
Sbjct: 2 IGTPPQ-EFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCNP------D 54
Query: 90 TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFF-DNVVFGCGHNNTG-VFNEN 147
+ C Y YA+ S + G+L + ++FGN + VFGC + TG +F+++
Sbjct: 55 CTCDTENDQCTYERQYAEMSSSSGILGEDLVSFGNMSELKPQRAVFGCENAETGDLFSQH 114
Query: 148 EMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTDSSITSKMYFGNGSEVSGGGVVS 206
G++GLGR LS+ Q++ + N FS C M G G+ V G
Sbjct: 115 ADGIMGLGRGDLSIVDQLVEKGVINDSFSLCY----------GGMEVGGGAMVLGQISPP 164
Query: 207 TSLV---SKEDKT-YYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLL 262
+ +V S D++ YY + L G+ V KL N K +D+G L
Sbjct: 165 SDMVFSHSDPDRSPYYNIELRGLHVAG----KKLD--INPQVFDGKHGTILDSGTTYAYL 218
Query: 263 PKDFYNRLEEQVR---NAIKLTPYQDPRLGSQLCY-----KTPSMAGIAPILTAHFDGGA 314
P+ + + + + +K DP + +C+ + P + P + FD G
Sbjct: 219 PEAAFLPFIQAITSELHGLKQIRGPDPNY-NDVCFSGAGSEIPELYKTFPSVDMVFDNGE 277
Query: 315 KVPLI-HTSTFIPPPVEGVFCFAM-QPIDGDVGIFGNFAQSDLFIGYDFDSQMVSFKPTD 372
K L F V G +C + Q + G + + YD + V F T+
Sbjct: 278 KYSLSPENYLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTN 337
Query: 373 CT 374
C+
Sbjct: 338 CS 339
>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
Length = 422
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 94/378 (24%), Positives = 153/378 (40%), Gaps = 49/378 (12%)
Query: 22 GEYVMKFSIGTPPL---LDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSSYKE 77
G Y + +IG PP LDI DTGSDL WVQC PC C K + +Y P ++
Sbjct: 66 GHYSVILNIGNPPKAFDLDI----DTGSDLTWVQCDAPCKGCTKPLDKLYKPKNN----R 117
Query: 78 LSCQSEQCHLLDTVSCS-SQQLCNYTYGYADSSLTKGVLATER--ITFGNSNNFFDNVVF 134
+ C S C + +C + C+Y YAD + GVL ++ + N + + F
Sbjct: 118 VPCASSLCQAIQNNNCDIPTEQCDYEVEYADLGSSLGVLLSDYFPLRLNNGSLLQPRIAF 177
Query: 135 GCGHNNTGVFNE---NEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKM 191
GCG++ + + G++GLGR + S+ SQ L LG + V H S +T
Sbjct: 178 GCGYDQKYLGPHSPPDTAGILGLGRGKASILSQ-LRTLGITQN----VVGHCFSRVTGGF 232
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
F + G+ T ++ T Y S+ + + I +
Sbjct: 233 LFFGDHLLPPSGITWTPMLRSSSDTLY------------SSGPAELLFGGKPTGIKGLQL 280
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQL--CYKTPS-MAGIAPI--- 305
D+G+ T Y + VR + P +D L C+KT + I I
Sbjct: 281 IFDSGSSYTYFNAQVYQSILNLVRKDLSGMPLKDAPEEKALAVCWKTAKPIKSILDIKSF 340
Query: 306 ---LTAHFDGGAKVPL-IHTSTFIPPPVEGVFCFAM----QPIDGDVGIFGNFAQSDLFI 357
LT +F V L + ++ +G C + + G++ + G+ D +
Sbjct: 341 FKPLTINFIKAKNVQLQLAPEDYLIITKDGNVCLGILNGGEQGLGNLNVIGDIFMQDRVV 400
Query: 358 GYDFDSQMVSFKPTDCTK 375
YD + Q + + PT+C +
Sbjct: 401 VYDNERQQIGWFPTNCNR 418
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 99/393 (25%), Positives = 160/393 (40%), Gaps = 72/393 (18%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
G Y K IGTP D Y VDTGSD++WV C C +C + +Y+ +S++
Sbjct: 72 GLYFAKIGIGTPSK-DYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSD 130
Query: 77 ELSCQSEQCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD---- 130
+ C C L D C C Y+ Y D S T G + + + + F
Sbjct: 131 AVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPT 190
Query: 131 --NVVFGCGHNNTGVF---NENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTD 184
VVFGCG+ +G +E G++G G+ S+ SQ+ S K FS+CL
Sbjct: 191 NGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL------ 244
Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGNLSNSSK 235
V GGG+ + V + +++ +Y V ++ I VG
Sbjct: 245 ------------DNVDGGGIFAIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGG------ 286
Query: 236 LIPYYNSSGAISKGNM---FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ- 291
P S A G+ ID+G P++ Y L E++ L+ D RL +
Sbjct: 287 -DPLDVPSDAFESGDRKGTIIDSGTTLAYFPQEVYVPLIEKI-----LSQQPDLRLHTVE 340
Query: 292 ---LCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAM-----QPIDG 342
C+ T ++ P +T HFD + ++ ++ E +C Q DG
Sbjct: 341 QAFTCFDYTGNVDDGFPTVTLHFDKSISLT-VYPHEYLFQVKEFEWCIGWQNSGAQTKDG 399
Query: 343 -DVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
D+ + G+ S+ + YD + Q + + +C+
Sbjct: 400 KDLTLLGDLVLSNKLVVYDLEKQGIGWVEYNCS 432
>gi|51091919|dbj|BAD35188.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|125596474|gb|EAZ36254.1| hypothetical protein OsJ_20576 [Oryza sativa Japonica Group]
gi|196212950|gb|ACG76111.1| S5 [Oryza sativa Japonica Group]
gi|340810891|gb|AEK75372.1| S5 [Oryza sativa]
gi|340810893|gb|AEK75373.1| S5 [Oryza sativa]
gi|340810899|gb|AEK75376.1| S5 [Oryza sativa]
gi|340810901|gb|AEK75377.1| S5 [Oryza sativa]
gi|340810933|gb|AEK75393.1| S5 [Oryza sativa]
gi|340810947|gb|AEK75400.1| S5 [Oryza sativa]
gi|340810949|gb|AEK75401.1| S5 [Oryza sativa]
gi|340810967|gb|AEK75410.1| S5 [Oryza sativa]
gi|340810969|gb|AEK75411.1| S5 [Oryza sativa]
gi|340810999|gb|AEK75426.1| S5 [Oryza rufipogon]
gi|340811017|gb|AEK75435.1| S5 [Oryza rufipogon]
gi|340811029|gb|AEK75441.1| S5 [Oryza nivara]
gi|340811051|gb|AEK75452.1| S5 [Oryza nivara]
gi|340811075|gb|AEK75464.1| S5 [Oryza nivara]
gi|340811077|gb|AEK75465.1| S5 [Oryza rufipogon]
gi|340811085|gb|AEK75469.1| S5 [Oryza nivara]
gi|340811096|gb|AEK75474.1| S5 [Oryza rufipogon]
gi|340811100|gb|AEK75476.1| S5 [Oryza rufipogon]
gi|340811114|gb|AEK75483.1| S5 [Oryza nivara]
Length = 472
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 101/382 (26%), Positives = 160/382 (41%), Gaps = 55/382 (14%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELS 79
++M S+G PP++++ I DTGS L WVQC PC V C+ Q PI++P S + + +
Sbjct: 114 FLMAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVR 172
Query: 80 CQSEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDN 131
C S +C L +C ++ C Y+ Y + + + G + T+ + G+S F +
Sbjct: 173 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMD 229
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSIT 188
++FGC + ++E E G+ G G + S Q+ L SYCL TD +
Sbjct: 230 LMFGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCL---PTDETKP 284
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
M G + G T L ++ Y +T+E + ++N +L+ S
Sbjct: 285 GYMILGRYDRAAMDGGY-TPLFRSINRPTYSLTMEML----IANGQRLV--------TSS 331
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK---------- 295
M +D+GA T L + L++ + A+ Y R S +CY
Sbjct: 332 SEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNG 391
Query: 296 --TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFA 351
TP S P+L F GGA + L + F P G+ FA P I GN
Sbjct: 392 TITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRV 450
Query: 352 QSDLFIGYDFDSQMVSFKPTDC 373
+D + FK C
Sbjct: 451 TRSFGTTFDIQGKQFGFKYAVC 472
>gi|340810993|gb|AEK75423.1| S5 [Oryza rufipogon]
gi|340811015|gb|AEK75434.1| S5 [Oryza nivara]
gi|340811021|gb|AEK75437.1| S5 [Oryza nivara]
Length = 474
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 101/382 (26%), Positives = 160/382 (41%), Gaps = 55/382 (14%)
Query: 24 YVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELS 79
++M S+G PP++++ I DTGS L WVQC PC V C+ Q PI++P S + + +
Sbjct: 116 FLMAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVR 174
Query: 80 CQSEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDN 131
C S +C L +C ++ C Y+ Y + + + G + T+ + G+S F +
Sbjct: 175 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMD 231
Query: 132 VVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSIT 188
++FGC + ++E E G+ G G + S Q+ L SYCL TD +
Sbjct: 232 LMFGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCL---PTDETKP 286
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
M G + G T L ++ Y +T+E + ++N +L+ S
Sbjct: 287 GYMILGRYDRAAMDGGY-TPLFRSINRPTYSLTMEML----IANGQRLV--------TSS 333
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK---------- 295
M +D+GA T L + L++ + A+ Y R S +CY
Sbjct: 334 SEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNG 393
Query: 296 --TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFA 351
TP S P+L F GGA + L + F P G+ FA P I GN
Sbjct: 394 TITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRV 452
Query: 352 QSDLFIGYDFDSQMVSFKPTDC 373
+D + FK C
Sbjct: 453 TRSFGTTFDIQGKQFGFKYAVC 474
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 104/389 (26%), Positives = 159/389 (40%), Gaps = 56/389 (14%)
Query: 21 NGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKPI--YNPASSSSYKEL 78
N + ++GTPP ++ ++DTGS+L W+ C P + + P +S ++ +
Sbjct: 62 NVSLTVSLAVGTPPQ-NVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASV 120
Query: 79 SCQSEQCHLLDTVS---CS-SQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNVVF 134
C S QC D S C + + C + YAD S + G LATE T G F
Sbjct: 121 PCGSAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPL--RAAF 178
Query: 135 GCGHN--NTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSYCLVPFHTDSSITSKMY 192
GC +T GL+G+ R LS +SQ +FSYC+ +D +
Sbjct: 179 GCMATAFDTSPDGVATAGLLGMNRGALSF----VSQASTRRFSYCI----SDRDDAGVLL 230
Query: 193 FGNGSEVSGGGVVSTSLVSKE------DKTYYFVTLEGISVGNLS---NSSKLIPYYNSS 243
G+ S++ + T L D+ Y V L GI VG +S L P + +
Sbjct: 231 LGH-SDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAP--DHT 287
Query: 244 GAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIK--LTPYQDPRLGSQ----LCYKTP 297
GA G +D+G T L D Y+ L+ + K L DP Q C++ P
Sbjct: 288 GA---GQTMVDSGTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVP 344
Query: 298 ---SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVE-----GVFCFA-----MQPIDGDV 344
+ P +T F+ GA++ + P E GV+C M PI V
Sbjct: 345 QGRAPPARLPAVTLLFN-GAQMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYV 403
Query: 345 GIFGNFAQSDLFIGYDFDSQMVSFKPTDC 373
G+ Q ++++ YD + V P C
Sbjct: 404 --IGHHHQMNVWVEYDLERGRVGLAPIRC 430
>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 98/367 (26%), Positives = 159/367 (43%), Gaps = 48/367 (13%)
Query: 29 SIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP---------IYNPASSSSYKELS 79
++GTP + + DTGSDL W+ C C C +++K IY+P +SS+ ++
Sbjct: 109 TVGTPSDWFLVAL-DTGSDLFWLPC-DCTNCVRELKAPGGSSLDLNIYSPNASSTSTKVP 166
Query: 80 CQSEQCHLLDTVSCSSQQLCNYTYGY-ADSSLTKGVLATERITF----GNSNNFFDNVVF 134
C S C D + S + C Y Y ++ + + GVL + + +S V
Sbjct: 167 CNSTLCTRGDRCA-SPESNCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTL 225
Query: 135 GCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKM 191
GCG TGVF++ GL GLG +S+ S + + + AN FS C F D + ++
Sbjct: 226 GCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMC---FGNDGA--GRI 280
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISV-GNLSNSSKLIPYYNSSGAISKGN 250
FG+ V T L ++ Y +T+ ISV GN +G +
Sbjct: 281 SFGDKGSVDQR---ETPLNIRQPHPTYNITVTKISVEGN-------------TGDLEFDA 324
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ--DPRLGSQLCYK-TPSMAGIA-PIL 306
+F D+G T L Y + E + YQ D L + CY +P+ P +
Sbjct: 325 VF-DSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAV 383
Query: 307 TAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMV 366
GG+ P+ H IP V+C A+ I+ D+ I G + + +D + ++
Sbjct: 384 NLTMKGGSSYPVYHPLVVIPMKDTDVYCLAILKIE-DISIIGQNFMTGYRVVFDREKLIL 442
Query: 367 SFKPTDC 373
+K +DC
Sbjct: 443 GWKESDC 449
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 91/377 (24%), Positives = 159/377 (42%), Gaps = 38/377 (10%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
G Y K +GTPP + +DTGSD++WV C C C + + ++ SS+
Sbjct: 76 GLYYTKVKMGTPPK-EFNVQIDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAA 134
Query: 77 ELSCQSEQCHLL---DTVSCSSQ-QLCNYTYGYADSSLTKGVLATERITFG------NSN 126
+ C C CS + C+YT+ Y D S T G ++ + F +
Sbjct: 135 LIPCSDPICTSRVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFSLIMGQPPAV 194
Query: 127 NFFDNVVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFH 182
N +VFGC + +G + + G+ G G LS+ SQ+ S+ + FS+CL
Sbjct: 195 NSSATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCL---- 250
Query: 183 TDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNS 242
E+ +V + LV + +Y + L+ I+V + +L+P +
Sbjct: 251 --KGDGDGGGVLVLGEILEPSIVYSPLVPSQ--PHYNLNLQSIAV-----NGQLLPINPA 301
Query: 243 SGAIS--KGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMA 300
+IS +G +D G L ++ Y+ L + A+ + Q G+Q + S+
Sbjct: 302 VFSISNNRGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKGNQCYLVSTSIG 361
Query: 301 GIAPILTAHFDGGAKVPLIHTSTFIPPP-VEG--VFCFAMQPIDGDVGIFGNFAQSDLFI 357
I P ++ +F+GGA + L + ++G ++C Q I G+ D +
Sbjct: 362 DIFPSVSLNFEGGASMVLKPEQYLMHNGYLDGAEMWCIGFQKFQEGASILGDLVLKDKIV 421
Query: 358 GYDFDSQMVSFKPTDCT 374
YD Q + + DC+
Sbjct: 422 VYDIAQQRIGWANYDCS 438
>gi|217426809|gb|ACK44517.1| AT5G10080-like protein [Arabidopsis arenosa]
Length = 506
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 99/378 (26%), Positives = 163/378 (43%), Gaps = 60/378 (15%)
Query: 30 IGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-------YKQVKPI----YNPASSSSYKEL 78
IGTP + + + DTGSDL+W+ C CVQC Y + YNP+SSS+ K
Sbjct: 106 IGTPSVSFLVAL-DTGSDLLWIPC-NCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVF 163
Query: 79 SCQSEQCHLLDTVS-CSS-QQLCNYTYGYADSSLTKGVLATERI---TFGNSNNFFD--- 130
C + C D+ S C S ++ C YT Y + + L E I T+ +N +
Sbjct: 164 LCSHKLC---DSASDCESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSS 220
Query: 131 ----NVVFGCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQLG--ANKFSYCLVPFH 182
VV GCG +G + + GL+GLG +S+ S LS+ G N FS C
Sbjct: 221 SVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPS-FLSKAGLMRNSFSLCF---- 275
Query: 183 TDSSITSKMYFGN-GSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
D + ++YFG+ G + ST + E+ + Y V +E +GN
Sbjct: 276 -DEEDSGRIYFGDMGPSIQQ----STPFLQLENNSGYIVGVEACCIGN------------ 318
Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKTPSMAG 301
S + FID+G T LP++ Y ++ ++ I T + + CY++ S+
Sbjct: 319 SCLKQTSFTTFIDSGQSFTYLPEEIYRKVALEIDRHINATSKSFEGVSWEYCYES-SVEP 377
Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPVEGV--FCFAMQPIDGD-VGIFGNFAQSDLFIG 358
P + F +IH F+ +G+ FC + P + +G G +
Sbjct: 378 KVPAIKLKFSHNNTF-VIHKPLFVFQQSQGLVQFCLPISPSGQEGIGSIGQNYMRGYRMV 436
Query: 359 YDFDSQMVSFKPTDCTKQ 376
+D ++ + + + C ++
Sbjct: 437 FDRENMKLRWSASKCQEE 454
>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
Length = 515
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 96/366 (26%), Positives = 158/366 (43%), Gaps = 44/366 (12%)
Query: 29 SIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVKP---------IYNPASSSSYKELS 79
++GTP + + DTGSDL W+ C C +++K IY+P +SS+ ++
Sbjct: 109 TVGTPSDWFLVAL-DTGSDLFWLPCDCSTNCVRELKAPGGSSLDLNIYSPNASSTSSKVP 167
Query: 80 CQSEQCHLLDTVSCSSQQLCNYTYGY-ADSSLTKGVLATERITF----GNSNNFFDNVVF 134
C S C +D + S C Y Y ++ + + GVL + + NS +
Sbjct: 168 CNSTLCTRVDRCA-SPLSDCPYQIRYLSNGTSSTGVLVEDVLHLVSMEKNSKPIRARITL 226
Query: 135 GCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSITSKM 191
GCG TGVF++ GL GLG +S+ S + + + AN FS C F D + ++
Sbjct: 227 GCGLVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMC---FGDDGA--GRI 281
Query: 192 YFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGNM 251
FG+ V T L ++ Y VT+ ISVG ++G + +
Sbjct: 282 SFGDKGSVDQR---ETPLNIRQPHPTYNVTVTQISVG------------GNTGDLEFDAV 326
Query: 252 FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ-DPRLGSQLCYK-TPSMAGIA-PILTA 308
F DTG T L Y + E + YQ D L + CY +P+ P +
Sbjct: 327 F-DTGTSFTYLTDAPYTLISESFNSLALDKRYQTDSELPFEYCYAVSPNKKSFEYPDVNL 385
Query: 309 HFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPIDGDVGIFGNFAQSDLFIGYDFDSQMVSF 368
GG+ P+ H +P V+C A+ + D+ I G + + +D + ++ +
Sbjct: 386 TMKGGSSYPVYHPLIVVPIEDTVVYCLAIMKSE-DISIIGQNFMTGYRVVFDREKLILGW 444
Query: 369 KPTDCT 374
K +DC+
Sbjct: 445 KESDCS 450
>gi|357124567|ref|XP_003563970.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 395
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 95/391 (24%), Positives = 161/391 (41%), Gaps = 50/391 (12%)
Query: 14 QSNVSTANGEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNP 69
+S +Y +IG PP LDI DTGSD W+ C PC C K P+Y P
Sbjct: 6 KSTAVVPERQYYTSINIGNPPRPYFLDI----DTGSDFTWIHCDAPCTNCTKGPHPVYKP 61
Query: 70 ASSSSYKELSCQSEQCHLL--DTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNN 127
K + + C L + C + + C+Y YAD S +KGVLA + + ++
Sbjct: 62 TEG---KIVHPRDPLCEELQGNQNYCETCKQCDYEITYADRSSSKGVLARDNMQLTTADG 118
Query: 128 FFDNV--VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQIL-SQLGANKFSYCLVPF 181
NV VFGC HN G ++ G++GL +SL++Q+ S + +N F +C+
Sbjct: 119 EMKNVDFVFGCAHNQQGKLLDSPTSTDGILGLSNGAISLSTQLANSGIISNVFGHCMA-- 176
Query: 182 HTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYN 241
TD S M+ G+ V G+ + + Y V ++ ++ +
Sbjct: 177 -TDPSSGGYMFLGD-DYVPRWGMTWVPIRNGPGNVY------STEVPKVNYGAQELNLRG 228
Query: 242 SSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYK----TP 297
+G +++ + D+G+ T P + Y L + +A + C K
Sbjct: 229 QAGKLTQ--VIFDSGSSYTYFPHEIYTNLIALLEDASPGFVRDESDQTLPFCMKPNVPVR 286
Query: 298 SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPV-------EGVFCFAMQPIDG-DVG---- 345
S+ + + K + +TF P +G C + +DG ++G
Sbjct: 287 SVGDVEQLFNPLILQLRKRWFVIPTTFAISPENYLIISDKGNVCLGV--LDGTEIGHSST 344
Query: 346 -IFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
I G+ + F+ YD D + + +DCT+
Sbjct: 345 IIIGDASLRGKFVVYDNDENRIGWVQSDCTR 375
>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
gi|219888491|gb|ACL54620.1| unknown [Zea mays]
Length = 557
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 100/385 (25%), Positives = 152/385 (39%), Gaps = 52/385 (13%)
Query: 21 NGEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSSY- 75
+G+Y IG PP LD VDTGSDL W+QC PC K P+Y PA
Sbjct: 184 DGQYYTSIFIGNPPRPYFLD----VDTGSDLTWIQCDAPCTNFAKGPHPLYKPAKEKIVP 239
Query: 76 -KELSCQSEQCHLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFDNV-- 132
++L CQ Q + C + + C+Y YAD S + GVLA + + +N + +
Sbjct: 240 PRDLLCQELQG---NQNYCETCKQCDYEIEYADQSSSMGVLARDDMHMIATNGGREKLDF 296
Query: 133 VFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQ-LGANKFSYCLVPFHTDSSIT 188
VFGC ++ G + G++GL +S SQ+ S + AN F +C+ +
Sbjct: 297 VFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCIT---REQGGG 353
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISK 248
M+ G+ V GV TS+ S D Y+ + G+ A S
Sbjct: 354 GYMFLGD-DYVPRWGVTWTSIRSGPDNLYH-TQAHHVKYGDQQ-------LRRPEQAGST 404
Query: 249 GNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQLCYKT-------PSMAG 301
+ D+G+ T LP + Y L ++ A LC+K +
Sbjct: 405 VQVIFDSGSSYTYLPNEIYENLVAAIKYASPGFVQDTSDRTLPLCWKADFPVRYLEDVKQ 464
Query: 302 IAPILTAHFDGGAKVPLIHTSTFIPPPVE-------GVFCFAM----QPIDGDVGIFGNF 350
L HF K L + TF P + G C + + G I G+
Sbjct: 465 FFEPLNLHF---GKKWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDV 521
Query: 351 AQSDLFIGYDFDSQMVSFKPTDCTK 375
+ + YD + + + +DCTK
Sbjct: 522 SLRGKLVVYDNQRKQIGWADSDCTK 546
>gi|222635172|gb|EEE65304.1| hypothetical protein OsJ_20543 [Oryza sativa Japonica Group]
Length = 274
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 77/274 (28%), Positives = 110/274 (40%), Gaps = 64/274 (23%)
Query: 114 VLATERITFGNSNNF----FDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQL 169
+LAT+ TFG +N V FGCGH N G+F NE G+ G GR R SL SQL
Sbjct: 48 ILATDSFTFGGDDNAGGLAARRVTFGCGHINKGIFQANETGIAGFGRGRWSLP----SQL 103
Query: 170 GANKFSYCLVP-FHTDSS------ITSKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVT 221
FSYC F T SS + G V +T L+ + + YFV
Sbjct: 104 NVTSFSYCFTSMFDTKSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVP 163
Query: 222 LEGISVGNLSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLT 281
L GISVG + +P + + ID+GA T LP+D Y ++ + +
Sbjct: 164 LRGISVG---GARVAVPESR-----LRSSTIIDSGASITTLPEDVYEAVKAEFVS----- 210
Query: 282 PYQDPRLGSQLCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCFAMQPID 341
Q PR ++ D A+ V C +
Sbjct: 211 --QLPR--GNYVFE---------------DYAAR----------------VLCVVLDAAA 235
Query: 342 GDVGIFGNFAQSDLFIGYDFDSQMVSFKPTDCTK 375
G+ + GN+ Q + + YD ++ ++SF P C K
Sbjct: 236 GEQVVIGNYQQQNTHVVYDLENDVLSFAPARCDK 269
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 98/391 (25%), Positives = 156/391 (39%), Gaps = 69/391 (17%)
Query: 22 GEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSSSYK 76
G Y K IGTP D Y VDTGSD++WV C C +C + +Y+ +S++
Sbjct: 153 GLYFAKIGIGTPSK-DYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSD 211
Query: 77 ELSCQSEQCHLLD--TVSCSSQQLCNYTYGYADSSLTKGVLATERITFGNSNNFFD---- 130
+ C C L D C C Y+ Y D S T G + + + + F
Sbjct: 212 AVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPT 271
Query: 131 --NVVFGCGHNNTGVF---NENEMGLVGLGRTRLSLASQILSQLGANK-FSYCLVPFHTD 184
VVFGCG+ +G +E G++G G+ S+ SQ+ S K FS+CL
Sbjct: 272 NGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL------ 325
Query: 185 SSITSKMYFGNGSEVSGGGVVSTSLVSK---------EDKTYYFVTLEGISVGNLSNSSK 235
V GGG+ + V + +++ +Y V ++ I VG
Sbjct: 326 ------------DNVDGGGIFAIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGG------ 367
Query: 236 LIPYYNSSGAISKGNM---FIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRLGSQ- 291
P S A G+ ID+G P++ Y L E++ L+ D RL +
Sbjct: 368 -DPLDVPSDAFESGDRKGTIIDSGTTLAYFPQEVYVPLIEKI-----LSQQPDLRLHTVE 421
Query: 292 ---LCYK-TPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGVFCF---AMQPIDG-D 343
C+ T ++ P +T HFD + + E + Q DG D
Sbjct: 422 QAFTCFDYTGNVDDGFPTVTLHFDKSISLTVYPHEYLFQHEFEWCIGWQNSGAQTKDGKD 481
Query: 344 VGIFGNFAQSDLFIGYDFDSQMVSFKPTDCT 374
+ + G+ S+ + YD + Q + + +C+
Sbjct: 482 LTLLGDLVLSNKLVVYDLEKQGIGWVEYNCS 512
>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 102/377 (27%), Positives = 172/377 (45%), Gaps = 65/377 (17%)
Query: 30 IGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-------YKQVKP---IYNPASSSSYKELS 79
IGTP + + + D GSDL+WV C C+QC Y ++ Y+P+ SS+ K LS
Sbjct: 99 IGTPNVSFLVAL-DAGSDLLWVPC-DCMQCAPLSASYYDRLGRDLNEYSPSLSSTSKPLS 156
Query: 80 CQSEQCHLLDTVSCSSQQLCNYTYG-YADSSLTKGVLATERITF------GNSNNFFDNV 132
C + C L SS+ C Y Y++++ + G+L +R+ + ++ + +V
Sbjct: 157 CNDQLCELGSDCK-SSKDPCPYLASYYSENTSSSGLLIEDRLHLAPFSEHASRSSVWASV 215
Query: 133 VFGCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQLG--ANKFSYCLVPFHTDSSIT 188
+ GCG +G F++ GL+GLG LS+ S +L++ G N FS C D + +
Sbjct: 216 IIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPS-LLAKAGLVRNTFSICF-----DDNHS 269
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
+ FG+ V+ STS V E K Y + +EG VG +SS +
Sbjct: 270 GTILFGDQGLVTQK---STSFVPLEGKFVTYLIEVEGYLVG------------SSSLKTA 314
Query: 248 KGNMFIDTGAPPTLLPKDFYNRL----EEQV---RNAIKLTPYQDPRLGSQLCYKTPSMA 300
+D+G T LP + Y ++ ++QV R++ K +P+ + CY + S
Sbjct: 315 GFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSPW-------KYCYNSSSQE 367
Query: 301 GI-APILTAHFDGGAKVPLIHTST--FIPPPVE-GVFCFAMQPIDGDVGIFGNFAQSDLF 356
+ P +T F ++H I E VFC +QPI + GI G
Sbjct: 368 LLNIPTVTLVFAMNQSF-IVHNPVIKLISENEEFNVFCLPIQPIHEEFGIIGQNFMWGYR 426
Query: 357 IGYDFDSQMVSFKPTDC 373
+ +D ++ + + ++C
Sbjct: 427 MVFDRENLKLGWSTSNC 443
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 103/387 (26%), Positives = 171/387 (44%), Gaps = 54/387 (13%)
Query: 19 TANGEYVMKFSIGTPPLLDIYGIVDTGSDLMWVQCLPCVQCYKQVK-----PIYNPASSS 73
TA G Y + IG+P Y VDTGSD++WV C+ C C Y+PA S
Sbjct: 80 TATGLYYTQIEIGSPSK-GYYVQVDTGSDILWVNCIRCDGCPTTSGLGIELTQYDPAGSG 138
Query: 74 SYKELSCQSEQC-----HLLDTVSCSSQQLCNYTYGYADSSLTKGVLATERITF----GN 124
+ + C E C + L S+ C + Y D S T G ++ + + GN
Sbjct: 139 T--TVGCDQEFCVANSPNGLPPACPSTSSPCQFRIAYGDGSSTTGFYVSDSVQYNQVSGN 196
Query: 125 SNNFFDN--VVFGCGHNNTGVFNENEM---GLVGLGRTRLSLASQILSQLGANK-FSYCL 178
N + FGCG G + G++G G+ S+ SQ+ + K F++CL
Sbjct: 197 GQTTPSNASITFGCGAQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCL 256
Query: 179 VPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIP 238
H F G+ V V +T LV ++ T+Y V L+GISVG ++ +P
Sbjct: 257 DTVHGGG------IFAIGNVVQ-PKVKTTPLV--QNVTHYNVNLQGISVG---GATLQLP 304
Query: 239 YYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNA---IKLTPYQDPRLGSQLCYK 295
SKG + ID+G LP++ Y L V + + L YQD +C++
Sbjct: 305 SSTFDSGDSKGTI-IDSGTTLAYLPREVYRTLLTAVFDKYQDLALHNYQD-----FVCFQ 358
Query: 296 -TPSMAGIAPILTAHFDGGAKVPL-IHTSTFIPPPVEGVFCF-----AMQPIDG-DVGIF 347
+ S+ P++T F+G ++ L ++ ++ ++C +Q DG D+ +
Sbjct: 359 FSGSIDDGFPVVTFSFEG--EITLNVYPHDYLFQNENDLYCMGFLDGGVQTKDGKDMVLL 416
Query: 348 GNFAQSDLFIGYDFDSQMVSFKPTDCT 374
G+ S+ + YD + Q++ + +C+
Sbjct: 417 GDLVLSNKLVVYDLEKQVIGWADYNCS 443
>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 531
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 102/377 (27%), Positives = 172/377 (45%), Gaps = 65/377 (17%)
Query: 30 IGTPPLLDIYGIVDTGSDLMWVQCLPCVQC-------YKQVKP---IYNPASSSSYKELS 79
IGTP + + + D GSDL+WV C C+QC Y ++ Y+P+ SS+ K LS
Sbjct: 109 IGTPNVSFLVAL-DAGSDLLWVPC-DCMQCAPLSASYYDRLGRDLNEYSPSLSSTSKPLS 166
Query: 80 CQSEQCHLLDTVSCSSQQLCNYTYG-YADSSLTKGVLATERITF------GNSNNFFDNV 132
C + C L SS+ C Y Y++++ + G+L +R+ + ++ + +V
Sbjct: 167 CNDQLCELGSDCK-SSKDPCPYLASYYSENTSSSGLLIEDRLHLAPFSEHASRSSVWASV 225
Query: 133 VFGCGHNNTGVFNENEM--GLVGLGRTRLSLASQILSQLG--ANKFSYCLVPFHTDSSIT 188
+ GCG +G F++ GL+GLG LS+ S +L++ G N FS C D + +
Sbjct: 226 IIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPS-LLAKAGLVRNTFSICF-----DDNHS 279
Query: 189 SKMYFGNGSEVSGGGVVSTSLVSKEDK-TYYFVTLEGISVGNLSNSSKLIPYYNSSGAIS 247
+ FG+ V+ STS V E K Y + +EG VG +SS +
Sbjct: 280 GTILFGDQGLVTQK---STSFVPLEGKFVTYLIEVEGYLVG------------SSSLKTA 324
Query: 248 KGNMFIDTGAPPTLLPKDFYNRL----EEQV---RNAIKLTPYQDPRLGSQLCYKTPSMA 300
+D+G T LP + Y ++ ++QV R++ K +P+ + CY + S
Sbjct: 325 GFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSPW-------KYCYNSSSQE 377
Query: 301 GI-APILTAHFDGGAKVPLIHTST--FIPPPVE-GVFCFAMQPIDGDVGIFGNFAQSDLF 356
+ P +T F ++H I E VFC +QPI + GI G
Sbjct: 378 LLNIPTVTLVFAMNQSF-IVHNPVIKLISENEEFNVFCLPIQPIHEEFGIIGQNFMWGYR 436
Query: 357 IGYDFDSQMVSFKPTDC 373
+ +D ++ + + ++C
Sbjct: 437 MVFDRENLKLGWSTSNC 453
>gi|340810981|gb|AEK75417.1| S5 [Oryza rufipogon]
Length = 357
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 101/380 (26%), Positives = 158/380 (41%), Gaps = 55/380 (14%)
Query: 26 MKFSIGTPPLLDIYGIVDTGSDLMWVQCLPC-VQCYKQ---VKPIYNPASSSSYKELSCQ 81
M S+G PP++++ I DTGS L WVQC PC V C+ Q PI++P S + + + C
Sbjct: 1 MAVSLGKPPVVNLVAI-DTGSTLSWVQCQPCAVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59
Query: 82 SEQCH------LLDTVSCSSQQ-LCNYTYGYADS-SLTKGVLATERITFGNSNNFFDNVV 133
S +C L +C ++ C Y+ Y + + + G + T+ + G+S F +++
Sbjct: 60 SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS---FMDLM 116
Query: 134 FGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQ---LGANKFSYCLVPFHTDSSITSK 190
FGC + ++E E G+ G G + S Q+ L SYCL TD +
Sbjct: 117 FGCSMDVK--YSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCL---PTDETKPGY 171
Query: 191 MYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLIPYYNSSGAISKGN 250
M G + G T L ++ Y +T+E + ++N +L+ S
Sbjct: 172 MILGRYDRAAMDGGY-TPLFRSINRPTYSLTMEML----IANGQRLV--------TSSSE 218
Query: 251 MFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQ---DPRLGSQLCYK------------ 295
M +D+GA T L + L++ + A+ Y R S +CY
Sbjct: 219 MIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTI 278
Query: 296 TP-SMAGIAPILTAHFDGGAKVPLIHTSTFIPPPVEGV-FCFAMQPIDGDVGIFGNFAQS 353
TP S P+L F GGA + L + F P G+ FA P I GN
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRS-QILGNRVTR 337
Query: 354 DLFIGYDFDSQMVSFKPTDC 373
+D + FK C
Sbjct: 338 SFGTTFDIQGKQFGFKYAVC 357
>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
Length = 358
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 87/302 (28%), Positives = 131/302 (43%), Gaps = 46/302 (15%)
Query: 13 VQSNVSTANGEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYN 68
+Q NV G Y + +IG P LD VDTGSDL W+QC PC C K P+Y
Sbjct: 44 LQGNV-YPTGHYYVTMNIGNPAKPYFLD----VDTGSDLTWLQCDAPCRSCNKVPHPLYR 98
Query: 69 PASSSSYKELSCQSEQCHLLDT-----VSCSSQQLCNYTYGYADSSLTKGVLATERITFG 123
P ++S + C + C L + C S + C+Y Y DS+ ++GVL + +
Sbjct: 99 PTANS---LVPCANALCTALHSGHGSNNKCPSPKQCDYQIKYTDSASSQGVLINDNFSLP 155
Query: 124 -NSNNFFDNVVFGCGHN----NTGVFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYC 177
S+N + FGCG++ G G++GLGR +SL SQ+ Q + N +C
Sbjct: 156 MRSSNIRPGLTFGCGYDQQVGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLGHC 215
Query: 178 LVPFHTDSSITSKMYFGNGSEVSGGGVVSTSLVSKEDKTYYFVTLEGISVGNLSNSSKLI 237
L T+ G G G +V TS V+ +V + IS S S +
Sbjct: 216 L---STN---------GGGFLFFGDDIVPTSRVT-------WVPMAKISGNYYSPGSGTL 256
Query: 238 PYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAI--KLTPYQDPRLGSQLCYK 295
+ S + + D+G+ T Y + +++ + L DP L LC+K
Sbjct: 257 YFDRRSLGVKPMEVVFDSGSTYTYFTAQPYQAVVSALKSGLSKSLKQVSDPSL--PLCWK 314
Query: 296 TP 297
P
Sbjct: 315 GP 316
>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
Length = 418
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 61/173 (35%), Positives = 86/173 (49%), Gaps = 23/173 (13%)
Query: 22 GEYVMKFSIGTPP---LLDIYGIVDTGSDLMWVQC-LPCVQCYKQVKPIYNPASSSSYKE 77
G Y + +IG P LD VDTGSDL W+QC PC C K P+Y P + K
Sbjct: 55 GHYYVTMNIGDPAKPYFLD----VDTGSDLTWLQCDAPCQSCNKVPHPLYRPTKN---KL 107
Query: 78 LSCQSEQCHLLDTVS-----CSSQQLCNYTYGYADSSLTKGVLATERIT--FGNSNNFFD 130
+ C + C L + S C++QQ C+Y Y D + + GVL T+ + N +N
Sbjct: 108 VPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTDSFSLPLRNKSNVRP 167
Query: 131 NVVFGCGHN----NTGVFNENEMGLVGLGRTRLSLASQILSQ-LGANKFSYCL 178
++ FGCG++ G GL+GLGR +SL SQ+ Q + N +CL
Sbjct: 168 SLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCL 220
>gi|56784900|dbj|BAD82194.1| aspartic proteinase nepenthesin I-like [Oryza sativa Japonica
Group]
Length = 260
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 82/273 (30%), Positives = 130/273 (47%), Gaps = 33/273 (12%)
Query: 117 TERITFGNSNNFFDNVVFGCGHNNTGVFNENEMGLVGLGRTRLSLASQILSQLGANKFSY 176
TE TFG+ F + FGC + G F GLVGLGR +LSL ++QL F Y
Sbjct: 2 TETFTFGDDAAAFPGIAFGCTLRSEGGFGTGS-GLVGLGRGKLSL----VTQLNVEAFGY 56
Query: 177 CLVPFHTDSSITSKMYFGNGSEVSGG---GVVSTSLVSK---EDKTYYFVTLEGISVGN- 229
L +D S S + FG+ ++V+GG +ST L++ +D +Y+V L GISVG
Sbjct: 57 RL---SSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGK 113
Query: 230 -LSNSSKLIPYYNSSGAISKGNMFIDTGAPPTLLPKDFYNRLEEQVRNAIKLTPYQDPRL 288
+ S + S+GA G + D+G T+LP Y + +++ + + +Q P
Sbjct: 114 LVQIPSGTFSFDRSTGA---GGVIFDSGTTLTMLPDPAYTLVRDELLSQMG---FQKPPP 167
Query: 289 GSQ----LCYKTPSMAGIAPILTAHFDGGAKVPLIHTSTFIPPPV----EGVFCFAMQPI 340
+ +C+ S P + HFDGGA + L T ++P E C+++
Sbjct: 168 AANDDDLICFTGGSSTTTFPSMVLHFDGGADMDL-STENYLPQMQGQNGETARCWSVVKS 226
Query: 341 DGDVGIFGNFAQSDLFIGYDF--DSQMVSFKPT 371
+ I GN Q D + +D +++M+ PT
Sbjct: 227 SQALTIIGNIMQMDFHVVFDLSGNARMLFQPPT 259
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.135 0.409
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,284,721,262
Number of Sequences: 23463169
Number of extensions: 273047639
Number of successful extensions: 555895
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 666
Number of HSP's successfully gapped in prelim test: 1705
Number of HSP's that attempted gapping in prelim test: 548786
Number of HSP's gapped (non-prelim): 2642
length of query: 376
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 232
effective length of database: 8,980,499,031
effective search space: 2083475775192
effective search space used: 2083475775192
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)